CN109858557B - Novel semi-supervised classification method for hyperspectral image data - Google Patents

Publication number
CN109858557B
CN109858557B CN201910113156.3A CN201910113156A CN109858557B CN 109858557 B CN109858557 B CN 109858557B CN 201910113156 A CN201910113156 A CN 201910113156A CN 109858557 B CN109858557 B CN 109858557B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910113156.3A
Other languages
Chinese (zh)
Other versions
CN109858557A (en)
Inventor
阎庆
张凯琳
丁云
张晶晶
寻丽娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui University
Original Assignee
Anhui University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui University
Priority to CN201910113156.3A
Publication of CN109858557A
Application granted
Publication of CN109858557B
Legal status: Active

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a novel semi-supervised classification algorithm for hyperspectral image data. First, all hyperspectral data are acquired and taken as initial data. Second, sparse subspace clustering is used to cluster all initial data and obtain a pseudo label for every sample. Third, CNN1 is pre-trained on all initial data and their pseudo labels; for the objective function of the classification layer in CNN1, a new objective function based on metric learning is proposed. Then the final classification layer of CNN1 is removed and a fully connected layer is added to form CNN2, and CNN2 is fine-tuned on the labeled data and their known true labels. The invention is a semi-supervised deep learning mechanism that uses pseudo labels for hyperspectral image classification; two-stage training is used to expand the training information; and, to better capture the structural characteristics of hyperspectral samples, a new objective function is proposed for the pre-training stage.

Description

Novel semi-supervised classification method for hyperspectral image data
Technical Field
The invention belongs to the field of hyperspectral image classification, relates to a data semi-supervised classification technology, and particularly relates to a novel hyperspectral image data semi-supervised classification method.
Background
Over the second half of the 20th century, remote sensing changed significantly in theory, technology and application, and hyperspectral remote sensing is undoubtedly an important part of that change. Hyperspectral remote sensing uses a hyperspectral sensor or imaging spectrometer to image a target area simultaneously in tens to hundreds of contiguous, finely divided wavebands. It combines image and spectrum, acquiring spatial and spectral information at the same time to produce a hyperspectral image organized pixel by pixel. The image contains a large amount of spectral information and can capture the radiation or reflection of the target in the ultraviolet, visible, near-infrared and mid-infrared regions of the electromagnetic spectrum.
The hyperspectral remote sensing data contains abundant spatial information and spectral information. The hyperspectral image has the main characteristic that continuous spectrums of ground objects and ground surface space images can be obtained simultaneously, and spectrum dimensional information and traditional space dimensional image information are fused together. The process of classifying the hyperspectral images is a process of classifying each pixel or a relatively uniform pixel group according to the spectral characteristic information and the spatial characteristic information of the image pixels.
Disclosure of Invention
The invention provides a semi-supervised classification algorithm for hyperspectral image data that integrates spectral information and spatial information in a probabilistic framework. First, pseudo labels for all data are obtained through sparse subspace clustering and used for pre-training, and a metric learning regularization term is added in the pre-training stage to construct a new objective function. Then, fine-tuning is carried out using the labeled data and their labels. Finally, a Markov random field is applied as image post-processing to further exploit the spatial information.
The invention aims to provide a novel semi-supervised classification method for hyperspectral image data.
The purpose of the invention can be realized by the following technical scheme:
a novel semi-supervised classification method for hyperspectral image data comprises the following steps:
step one: acquiring all hyperspectral data, and taking the hyperspectral data as initial data;
step two: clustering all initial data by sparse subspace clustering, and acquiring pseudo labels of all initial data; the pseudo labels of the initial data are obtained directly from the sparse subspace clustering;
step three: pre-training CNN1 on all initial data and their pseudo labels;
for the objective function of the classification layer in CNN1, a new objective function based on metric learning is provided;
step four: removing the final classification layer of CNN1, and adding a fully connected layer to form CNN2;
step five: fine-tuning CNN2 on the labeled data and their labels;
step six: after the classification layer, exploiting the spatial information of the hyperspectral image through a Markov random field to post-process the classification result and improve the classification accuracy.
Further, the initial data comprise labeled hyperspectral data and unlabeled hyperspectral data, and CNN1 is trained using both the labeled and the unlabeled hyperspectral data.
Further, CNN1 in step three refers to an initial deep convolutional neural network.
Further, the pre-training in step three means that all initial data and their pseudo labels are fed into the neural network CNN1 for training; pre-training on a large amount of initial data gives CNN1 a suitable set of parameters and lets it learn features; because the labeled data make up only a small part of the initial data, both the unlabeled and the labeled hyperspectral data are used.
Further, CNN2 in step four refers to a new deep convolutional neural network, formed by removing the last classification layer of CNN1 and then adding a fully connected layer; the objective function of the classification layer in CNN2 is the softmax cross-entropy loss function.
Further, the fine-tuning in step five means that the data with true labels, together with those labels, are used as input and fed into CNN2 for training.
Further, in step six the Markov random field further exploits the spatial information of the image: on the assumption that adjacent pixels are more likely to belong to the same class, it encourages adjacent pixels to take the same class label, which can further improve the classification accuracy of the hyperspectral image classification task.
The invention has the beneficial effects that:
the invention is a semi-supervised deep learning mechanism that uses pseudo labels for hyperspectral image classification; two-stage training is used to expand the training information. In addition, the invention optimizes a new objective loss function that adds a metric learning regularization term to characterize hyperspectral data. Finally, the classification result is post-processed through a Markov random field, so that the spatial information of the hyperspectral image is further exploited; the spatial and spectral information of the hyperspectral data are thus extracted jointly, and the final classification result is effectively improved.
Drawings
In order to facilitate understanding for those skilled in the art, the present invention will be further described with reference to the accompanying drawings.
FIG. 1 is a flow chart of a semi-supervised classification algorithm for hyperspectral image data according to the invention;
FIG. 2 is a block diagram of a semi-supervised deep learning framework of the present invention;
FIG. 3 is a block diagram of the pre-training and fine-tuning stages of the present invention.
Detailed Description
As shown in FIGS. 1-3, a novel semi-supervised classification method for hyperspectral image data comprises the following steps:
step one: acquiring all hyperspectral data, and taking the hyperspectral data as initial data; the initial data comprise labeled hyperspectral data and unlabeled hyperspectral data;
step two: clustering all initial data by sparse subspace clustering, and acquiring pseudo labels of all initial data;
wherein the pseudo labels of the initial data are obtained directly from the sparse subspace clustering; the main idea of sparse subspace clustering is that any sample can be represented by the other sample points in infinitely many ways, and that samples from the same subspace can represent one another; among these infinitely many representations there is a sparse one in which most coefficients are 0, namely those of the irrelevant samples, and only samples from the same subspace carry non-zero representation coefficients;
step three: pre-training CNN1 on all initial data and their pseudo labels; pre-training means feeding all initial data and their pseudo labels into the neural network CNN1 for training.
Pre-training on a large amount of data gives the network CNN1 a suitable set of parameters and lets it learn features; because labeled hyperspectral data are scarce, the large amount of data used for pre-training also includes the unlabeled data;
for the objective function of the classification layer in CNN1, a new objective function based on metric learning is provided; the new objective function is denoted J in FIG. 3; CNN1 refers to the initial deep convolutional neural network;
step four: removing the final classification layer of CNN1, and adding a fully connected layer to form CNN2; CNN2 refers to a new deep convolutional neural network. The objective function of the classification layer in CNN2 is the softmax cross-entropy loss function.
Step five: fine-tuning CNN2 on the labeled data and their labels; fine-tuning means that the data with true labels, together with those labels, are used as input and fed into CNN2 for training.
Step six: after the classification step, the spatial information of the hyperspectral image is further exploited by a Markov random field.
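For step two above, the self-representation idea can be written compactly. The patent itself gives no formula, so the following is only the standard noise-free formulation commonly used in the sparse subspace clustering literature, shown as an assumption about what the clustering step computes. The symbols are introduced here for illustration: X is the matrix whose N columns are the hyperspectral samples, C is the N-by-N coefficient matrix, and W is the affinity matrix built from C.

```latex
\min_{C}\ \lVert C \rVert_{1}
\quad \text{s.t.} \quad X = XC,\ \ \operatorname{diag}(C) = 0
\qquad\Longrightarrow\qquad
W = \lvert C \rvert + \lvert C \rvert^{\top}
```

The constraint diag(C) = 0 prevents a sample from representing itself, and the l1 norm favors representations in which most coefficients are 0. Under this reading, applying spectral clustering to W would yield one cluster index per sample, and those indices would serve as the pseudo labels.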
The principle of the invention can be explained as follows. Hyperspectral image classification assigns a class to each pixel, or to each relatively homogeneous group of pixels, according to the spectral and spatial feature information of the image pixels.
First, because hyperspectral data have few labeled samples and only a small amount of labeled data is available for training, sparse subspace clustering is used to cluster all initial data and obtain pseudo labels for all data. The initial data comprise both unlabeled and labeled hyperspectral data. Sparse subspace clustering is a clustering method based on sparse representation that has been applied with good results in many fields.
Next, all the initial data and their pseudo labels are used to pre-train the deep convolutional neural network. Unlike the traditional CNN model, which minimizes only the cross-entropy loss, our proposed model is trained by optimizing a new discriminative objective function: besides minimizing the softmax cross-entropy loss, we add a metric learning regularization term. This term takes the contrastive loss from metric learning and combines it with the cross-entropy to construct the new objective function of CNN1 in the pre-training stage. In metric learning, the object being learned is generally the distance between sample feature vectors, and the aim is to reduce or bound the distance between samples of the same class while increasing the distance between samples of different classes. This matters because hyperspectral images exhibit the phenomenon that different objects can have the same spectrum and the same object can have different spectra; intra-class diversity and inter-class similarity remain two major challenges that generally degrade hyperspectral image classification performance. By adding the metric learning regularization term, samples of the same class are drawn together and samples of different classes are pushed apart.
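A minimal sketch of such a combined objective is given below. The patent names the objective J but does not spell out its form, so everything here is an assumption: the contrastive term is the standard pairwise contrastive loss, and the names `lam` (regularization weight), `margin`, and `combined_objective` are hypothetical and chosen only for illustration.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy of one sample, given raw class scores and an integer label."""
    m = max(logits)  # subtract the maximum for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def contrastive_loss(f_a, f_b, same_class, margin=1.0):
    """Standard pairwise contrastive loss on two feature vectors (assumed form)."""
    d = math.dist(f_a, f_b)  # Euclidean distance between the features
    if same_class:
        return 0.5 * d ** 2  # pull same-class samples together
    return 0.5 * max(0.0, margin - d) ** 2  # push other classes beyond the margin


def combined_objective(logits, features, labels, lam=0.1, margin=1.0):
    """Mean cross-entropy plus lam times the mean contrastive loss over all pairs."""
    n = len(labels)
    ce = sum(softmax_cross_entropy(z, y) for z, y in zip(logits, labels)) / n
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    if not pairs:
        return ce
    ml = sum(
        contrastive_loss(features[i], features[j], labels[i] == labels[j], margin)
        for i, j in pairs
    ) / len(pairs)
    return ce + lam * ml
```

In the pre-training stage the labels passed in would be the pseudo labels. Setting `lam` to 0 recovers the plain cross-entropy objective of a conventional CNN, which makes the effect of the regularization term easy to isolate.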
Then, the last layer is deleted from the pre-trained network, and a fully connected layer and a softmax classification layer are added to construct a new deep network, CNN2. At this stage, only the data with true class labels are used to fine-tune CNN2.
Finally, a Markov random field is applied after the hyperspectral image classification step to further exploit spatial information. On the assumption that neighboring pixels are more likely to belong to the same class, the Markov random field encourages adjacent pixels to take the same class label in the image classification task; this has been shown to greatly improve the classification accuracy of hyperspectral image classification tasks.
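The smoothing effect of the Markov random field can be illustrated with a small sketch. The patent does not say which energy or which inference method is used, so the choices below are assumptions: a Potts pairwise term on the 4-neighborhood, the negative log of the classifier probability as the unary term, and iterated conditional modes (ICM) as the optimizer. The names `icm_smooth` and `beta` are hypothetical.

```python
import math


def icm_smooth(probs, beta=1.0, n_iters=5):
    """Smooth a label map with a Potts-model MRF using iterated conditional modes.

    probs[r][c][k] is the classifier probability of class k at pixel (r, c).
    beta weights the penalty for disagreeing with a 4-connected neighbor.
    """
    rows, cols = len(probs), len(probs[0])
    n_classes = len(probs[0][0])
    # Start from the per-pixel most probable class.
    labels = [
        [max(range(n_classes), key=lambda k: probs[r][c][k]) for c in range(cols)]
        for r in range(rows)
    ]
    for _ in range(n_iters):
        changed = False
        for r in range(rows):
            for c in range(cols):
                neighbors = [
                    labels[rr][cc]
                    for rr, cc in ((r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1))
                    if 0 <= rr < rows and 0 <= cc < cols
                ]

                def energy(k):
                    unary = -math.log(max(probs[r][c][k], 1e-12))
                    pairwise = beta * sum(1 for n in neighbors if n != k)
                    return unary + pairwise

                best = min(range(n_classes), key=energy)
                if best != labels[r][c]:
                    labels[r][c] = best
                    changed = True
        if not changed:  # stop once a full sweep changes nothing
            break
    return labels
```

With `beta` set to 0 the result is the plain per-pixel argmax of the classifier. Larger values of `beta` make an isolated pixel more likely to adopt the label of its neighbors, which is the behavior the text describes.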
The method performs semi-supervised deep learning for hyperspectral image classification by using pseudo labels; two-stage training is used to expand the training information. In addition, by optimizing a new objective loss function, the invention explicitly imposes a metric learning regularization term on the CNN features, and by integrating a Markov random field with the CNN it makes better use of spatial information and achieves joint extraction of spatial-spectral information in hyperspectral classification.
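The two-stage arrangement can be summarized in a short, framework-free sketch. It only shows which data reach each stage and how the layer stack changes between CNN1 and CNN2; it trains nothing. The function names, the use of `None` to mark an unlabeled sample, and the layer names are all hypothetical and are not taken from the patent.

```python
def build_training_sets(samples, true_labels, pseudo_labels):
    """Split the data into a pre-training set and a fine-tuning set.

    true_labels[i] is None when sample i has no true label (an assumed convention).
    """
    if not (len(samples) == len(true_labels) == len(pseudo_labels)):
        raise ValueError("samples, true_labels and pseudo_labels must align")
    # Stage 1: every sample, labeled or not, paired with its pseudo label.
    pretrain_set = list(zip(samples, pseudo_labels))
    # Stage 2: only the samples that carry a true label.
    finetune_set = [(x, y) for x, y in zip(samples, true_labels) if y is not None]
    return pretrain_set, finetune_set


def build_cnn2(cnn1_layers, n_classes):
    """Drop the last (classification) layer of CNN1, then append a new
    fully connected layer and a softmax classification layer."""
    return cnn1_layers[:-1] + ["fc_new", f"softmax_{n_classes}"]
```

For example, `build_cnn2(["conv1", "conv2", "fc1", "softmax_pseudo"], 9)` keeps the three pre-trained feature layers and replaces only the head, so the parameters learned from pseudo labels carry over into fine-tuning.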
The foregoing is merely exemplary and illustrative of the present invention and various modifications, additions and substitutions may be made by those skilled in the art to the specific embodiments described without departing from the scope of the invention as defined in the following claims.

Claims (7)

1. A novel semi-supervised classification method for hyperspectral image data, characterized by comprising the following steps:
step one: acquiring all hyperspectral data, and taking the hyperspectral data as initial data;
step two: clustering all initial data by sparse subspace clustering, and acquiring pseudo labels of all initial data; the pseudo labels of the initial data are obtained directly from the sparse subspace clustering;
step three: pre-training CNN1 on all initial data and their pseudo labels;
for the objective function of the classification layer in CNN1, a new objective function based on metric learning is proposed;
step four: removing the final classification layer of CNN1, and adding a fully connected layer to form CNN2;
step five: fine-tuning CNN2 on the labeled data and their labels;
step six: after the classification layer, exploiting the spatial information of the hyperspectral image through a Markov random field to post-process the classification result and improve the classification accuracy.
2. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein the initial data comprise labeled hyperspectral data and unlabeled hyperspectral data, and CNN1 is trained using both the labeled and the unlabeled hyperspectral data.
3. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein CNN1 in step three refers to an initial deep convolutional neural network.
4. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein the pre-training in step three means feeding all initial data and their pseudo labels into the neural network CNN1 for training; pre-training on a large amount of initial data gives CNN1 a suitable set of parameters and lets it learn features; because the labeled data make up only a small part of the initial data, both the unlabeled and the labeled hyperspectral data are used.
5. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein CNN2 in step four refers to a new deep convolutional neural network, formed by removing the last classification layer of CNN1 and then adding a fully connected layer; the objective function of the classification layer in CNN2 is the softmax cross-entropy loss function.
6. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein the fine-tuning in step five means that the data with true labels, together with those labels, are used as input and fed into CNN2 for training.
7. The novel semi-supervised classification method for hyperspectral image data according to claim 1, wherein in step six the Markov random field further exploits the spatial information of the hyperspectral image data: on the assumption that adjacent pixels are more likely to belong to the same class, it encourages adjacent pixels to take the same class label in the image classification task, which can further improve the classification accuracy of the hyperspectral image classification task.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910113156.3A CN109858557B (en) 2019-02-13 2019-02-13 Novel semi-supervised classification method for hyperspectral image data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910113156.3A CN109858557B (en) 2019-02-13 2019-02-13 Novel semi-supervised classification method for hyperspectral image data

Publications (2)

Publication Number Publication Date
CN109858557A 2019-06-07
CN109858557B 2022-12-27

Family

ID=66897911

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910113156.3A Active CN109858557B (en) 2019-02-13 2019-02-13 Novel semi-supervised classification method for hyperspectral image data

Country Status (1)

Country Link
CN (1) CN109858557B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110309868A (en) * 2019-06-24 2019-10-08 西北工业大学 In conjunction with the hyperspectral image classification method of unsupervised learning
CN111401426B (en) * 2020-03-11 2022-04-08 西北工业大学 Small sample hyperspectral image classification method based on pseudo label learning
CN112541904B (en) * 2020-12-16 2023-03-24 西安电子科技大学 Unsupervised remote sensing image change detection method, storage medium and computing device
CN112784611A (en) * 2021-01-21 2021-05-11 阿里巴巴集团控股有限公司 Data processing method, device and computer storage medium
US20220309651A1 (en) * 2021-03-24 2022-09-29 Ping An Technology (Shenzhen) Co., Ltd. Method, device, and storage medium for semi-supervised learning for bone mineral density estimation in hip x-ray images
CN113361628B (en) * 2021-06-24 2023-04-14 海南电网有限责任公司电力科学研究院 CNN insulator aging spectrum classification method under multi-task learning
CN115546567B (en) * 2022-12-01 2023-04-28 成都考拉悠然科技有限公司 Unsupervised domain adaptive classification method, system, equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017158058A1 (en) * 2016-03-15 2017-09-21 Imra Europe Sas Method for classification of unique/rare cases by reinforcement learning in neural networks
CN107958216A (en) * 2017-11-27 2018-04-24 沈阳航空航天大学 Based on semi-supervised multi-modal deep learning sorting technique

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于卷积神经网络的半监督高光谱图像分类;李绣心等;《电子测量与仪器学报》;20181015(第10期);全文 *

Also Published As

Publication number Publication date
CN109858557A (en) 2019-06-07


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant