CN113792809A

CN113792809A - Remote sensing picture classification method based on random semi-supervised feature extraction model

Info

Publication number: CN113792809A
Application number: CN202111101521.2A
Authority: CN
Inventors: 向雪霜; 刘雪娇; 徐遥
Original assignee: China Academy of Space Technology CAST
Current assignee: China Academy of Space Technology CAST
Priority date: 2021-09-18
Filing date: 2021-09-18
Publication date: 2021-12-14
Anticipated expiration: 2041-09-18
Also published as: CN113792809B

Abstract

The invention relates to a remote sensing image classification method based on a random semi-supervised feature extraction model, comprising the following steps: a. establishing a remote sensing scene image database; b. building a random semi-supervised feature extraction model; c. building a classification network; d. establishing the optimization goal of the random semi-supervised feature extraction model; e. training the random semi-supervised feature extraction model by alternating game training; f. training the classifier; g. completing the remote sensing image classification task. The invention can effectively improve the utilization rate of massive weakly labeled remote sensing data and improve the performance of the remote sensing image classification task.

Description

Remote sensing picture classification method based on random semi-supervised feature extraction model

Technical Field

The invention relates to a remote sensing picture classification method based on a random semi-supervised feature extraction model.

Background

As the spatial resolution of remote sensing data improves, scene classification has become a research hotspot within remote sensing image classification. Remote sensing scene classification means correctly labeling a given scene image with one of a set of predefined semantic categories. In the prior art, image classification is generally performed with deep learning methods. These methods are widely applied to the classification of everyday-scene pictures, where they reach or surpass human-level performance. However, deep-learning-based remote sensing scene classification algorithms mainly rely on supervised learning, which requires a large amount of labeled data. Compared with everyday-scene pictures, remote sensing pictures have large intra-class differences, small inter-class separability and multi-scale targets. In addition, labeling remote sensing samples is costly and requires specialist knowledge, so the data are characterized by massive volume and weak labeling. As a result, the remote sensing picture classification task faces a greater challenge.

Existing deep-learning-based methods fall mainly into three categories: methods based on convolutional neural networks (CNN), methods based on variational autoencoders (VAE), and methods based on generative adversarial networks (GAN). Faced with a large number of remote sensing images, CNN-based methods need many labeled samples to train the model or to fine-tune a pre-trained convolutional neural network. However, labeling remote sensing samples is costly and requires specialist knowledge, so this approach is not ideal in practice. VAE-based methods are unsupervised generative learning methods that perform well in remote sensing scene classification, but when reconstructing a similar image from the input image they sometimes fail to learn a general feature representation of the original image, so most VAE-based methods cannot learn the optimal discriminative features of different scene classes. GAN-based methods are another class of generative learning methods; they can be used in an unsupervised or semi-supervised manner and are widely used in remote sensing image tasks.

For example, patent CN111339935A discloses an optical remote sensing image classification method based on an interpretable CNN image classification model. It focuses mainly on the interpretability of deep learning models and improves the accuracy of remote sensing image classification by proposing an interpretable CNN image classification model. As another example, patent CN108596248 discloses a remote sensing image classification model based on an improved deep convolutional neural network. Through dimension reduction, multi-channel convolution optimization, improved feature extraction capability and band-wise processing of the spatial location features of the remote sensing feature image, it reduces the computing resources consumed by the deep convolutional neural network while preserving the feature extraction effect and improving the recognition of spatial location features. Both technologies address the remote sensing classification problem with the aim of reducing computing resource consumption and improving classification performance, but both use fully supervised models and therefore need a large amount of labeled data.

There are also techniques that classify pictures using a generative adversarial network. A generative adversarial network consists of a generation network and a discrimination network, and it learns a deep feature representation of the data from unlabeled or weakly labeled training data through an adversarial training process between the two networks. In the prior art, generative adversarial networks are widely applied to typical everyday-scene picture classification tasks, where they obtain competitive results. However, they are not yet widely applied to remote sensing scene classification. Compared with typical everyday-scene pictures, remote sensing pictures have large intra-class differences, small inter-class separability and multi-scale targets, so the remote sensing picture classification task still faces a greater challenge.

Disclosure of Invention

The invention aims to provide a remote sensing picture classification method based on a random semi-supervised feature extraction model.

In order to achieve the purpose, the invention provides a remote sensing picture classification method based on a random semi-supervised feature extraction model, which comprises the following steps:

a. establishing a remote sensing scene picture database;

b. constructing a random semi-supervised feature extraction model;

c. constructing a classification network;

d. establishing an optimization target of a random semi-supervised feature extraction model;

e. training the random semi-supervised feature extraction model by alternating game training;

f. training a classifier;

g. completing the remote sensing picture classification task.

According to one aspect of the invention, in the step (a), remote sensing scene picture data are collected and labeled according to land use type; all data are then divided into a labeled data set and an unlabeled data set according to whether a category label is provided, wherein one part of the labeled data set is used as test data and does not participate in training, and the remaining part together with all unlabeled data forms the training data set; finally, data enhancement is performed on the training data set by horizontal flipping, vertical flipping and rotation by 90 degrees.

According to an aspect of the present invention, the constructing of the random semi-supervised feature extraction model in the step (b) includes constructing a random generation network G and a semi-supervised feature extraction network D;

the random generation network G comprises an input noise layer, a random layer, a resampling layer, a deconvolution layer and an output picture layer;

the semi-supervised feature extraction network D comprises an input picture layer, a convolution layer, a feature layer, a full connection layer and an output layer;

the optimization goal of the random generation network G is to find α so that:

where p_z is the predefined distribution obeyed by the input variable z; ε is the intermediate variable of the random layer and obeys the Gaussian distribution N(0, I); the output variable of the random layer obeys a Gaussian prior distribution; α is the parameter to be trained; f(·) denotes the output feature of a picture after it passes through the semi-supervised feature extraction network D; E is the mathematical expectation with respect to the indicated variable; and x is an input picture;

the semi-supervised feature extraction network D is optimized to find η so that:

where η is the parameter to be trained of the feature extraction network; y_i is the i-th component of the label y; D_i(·) is the i-th component of the network output; and K is the total number of classes of the classification task;

the random generation network G is constructed by the following steps:

b11, randomly sampling noise from a predefined distribution as an input of the random generation network G;

b12, setting the widths and depths of the random layer, the resampling layer and the deconvolution layer according to the specific task difficulty, wherein the output variables of the random layer obey Gaussian prior distribution;

b13, outputting pseudo data with the same size as the original data and serving as the input of the semi-supervised feature extraction network D;

when constructing the semi-supervised feature extraction network D:

in the game training stage, the input pictures include real labeled data, real unlabeled data and generated pseudo data, and the outputs include a real/fake logical output and a picture category output;

in the classifier training stage, the input of the semi-supervised feature extraction network D is labeled data, and the output is corresponding high-dimensional features which are used as the input of a classifier;

in the stage of realizing the remote sensing classification task, the input of the semi-supervised feature extraction network D is a remote sensing picture to be classified, and the output is corresponding high-dimensional features;

the random generation network G comprises 9 layers in total: the first layer is the input layer; the second layer is the random layer, which consists of two identical fully connected networks that learn the mean and the variance of the Gaussian prior distribution obeyed by the output variable of the layer; the third layer is the resampling layer; the next five layers are deconvolution layers with a 4 × 4 convolution kernel and a stride of 2; the last layer is the output layer, which outputs 3-channel pictures of size 256 × 256; the deconvolution layers all use ReLU activation functions, and the output layer uses a tanh activation function;

the semi-supervised feature extraction network D comprises 9 layers: the first layer is the picture input layer, whose input pictures include labeled real pictures, unlabeled real pictures and generated pictures; the next six layers are convolution layers with a 5 × 5 convolution kernel, a stride of 2 and a LeakyReLU activation function with parameter 0.2; the feature layer combines the feature information of the preceding three layers; the last layer is a fully connected layer, which outputs the category or the authenticity information of the picture.

According to an aspect of the present invention, in the step (c), a linear support vector machine is used to construct the classification network, and the regularization parameter C is 1000.

According to an aspect of the present invention, in step (d), the optimization goal of the random semi-supervised feature extraction model is a minimax game problem between the random generation network G and the semi-supervised feature extraction network D, and it is composed of a supervised learning loss and an unsupervised learning loss, as follows:

where p is the true remote sensing data distribution; p_G is the distribution of the data generated by the random generation network G; p_D(y | x, y ≤ K) is the probability that the input picture x is assigned the label y; p_D(y ≤ K | x) is the probability that the input picture x is real data; p_D(K+1 | x) is the probability that the input picture x is generated pseudo data; and J is the loss function with respect to the networks G and D.
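The three probabilities defined above can be read off a single network output if D ends in a (K+1)-way softmax whose last entry is the "generated" class. That output layout is an assumption (the text does not state it), and the function name `discriminator_probs` is hypothetical. A minimal sketch:

```python
import math

def discriminator_probs(logits):
    """Split K+1 logits into the three probabilities used in the objective.

    Assumption: the last logit corresponds to the generated (K+1) class.
    """
    m = max(logits)                                # subtract max for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    p_fake = exps[-1] / total                      # p_D(K+1 | x)
    p_real = 1.0 - p_fake                          # p_D(y <= K | x)
    real_total = sum(exps[:-1])
    p_class = [e / real_total for e in exps[:-1]]  # p_D(y | x, y <= K)
    return p_class, p_real, p_fake

K = 21
p_class, p_real, p_fake = discriminator_probs([0.0] * (K + 1))
print(round(p_fake, 4), round(p_real, 4), round(p_class[0], 4))  # 0.0455 0.9545 0.0476
```

With all logits equal, every one of the 22 outputs gets probability 1/22, so the picture is judged real with probability 21/22 and each real class has conditional probability 1/21.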

According to one aspect of the present invention, in said step (e), the training goal of the semi-supervised feature extraction network D is to maximize the probability that labeled data are classified correctly, which is realized by minimizing the cross-entropy on the labeled data, and to maximize the probability that real data are judged real and generated data are judged fake, which is realized by minimizing the corresponding real/fake losses on the unlabeled real data and on the generated data;

the training goal of the random generation network G is to minimize the probability that the generated data are judged to be fake, which is realized by minimizing its real/fake loss on the generated data, while also minimizing the distance between the generated data features and the real data features, which is realized by minimizing the feature loss;

and repeating the alternate training between the semi-supervised feature extraction network D and the random generation network G until reaching the specified training step number, and storing the trained random semi-supervised feature extraction network.
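The formulas for these losses do not appear in this text. As a hedged sketch only, the standard semi-supervised GAN losses written with the symbols defined for step (d), and with the loss names L_D1, L_D2, L_D3, L_G1 and L_G2 used in the training steps of step (e), would take the following form. The exact expressions, signs and any weighting between terms are assumptions, not the patent's own formulas.

```latex
\begin{aligned}
L_{D1} &= -\,\mathbb{E}_{z\sim p_z,\ \epsilon\sim N(0,I)}\ \log p_D\bigl(K+1 \mid G(z,\epsilon;\alpha)\bigr)
  && \text{(generated pictures judged fake)} \\
L_{D2} &= -\,\mathbb{E}_{x\sim p}\ \log p_D(y \le K \mid x)
  && \text{(unlabeled real pictures judged real)} \\
L_{D3} &= -\,\mathbb{E}_{(x,y)\sim p}\ \log p_D(y \mid x,\ y \le K)
  && \text{(cross-entropy on labeled pictures)} \\
L_{G1} &= \mathbb{E}_{z\sim p_z,\ \epsilon\sim N(0,I)}\ \log p_D\bigl(K+1 \mid G(z,\epsilon;\alpha)\bigr)
  && \text{(real/fake loss of } G\text{)} \\
L_{G2} &= \Bigl\lVert\, \mathbb{E}_{x\sim p}\, f(x) \;-\; \mathbb{E}_{z\sim p_z,\ \epsilon\sim N(0,I)}\, f\bigl(G(z,\epsilon;\alpha)\bigr) \Bigr\rVert_2^{2}
  && \text{(feature loss of } G\text{)}
\end{aligned}
```

In this form L_G1 is the negative of L_D1, which matches the description of the objective as a minimax game between G and D, and L_G2 is the usual feature-matching distance between the mean features of real and generated pictures.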

According to one aspect of the invention, the alternating game training in the step (e) is performed according to the following steps:

e1, setting the total number of training rounds N = 200, updating the random generation network G k = 2 times for each update of the semi-supervised feature extraction network D, setting the learning rate and the momentum parameter of the Adam optimization algorithm to λ = 0.0002 and β₁ = 0.5 respectively, setting the mini-batch size m = 64 and the latent space dimension d = 100, and setting K = 21 for the UC-Merced dataset;

e2, randomly initializing network parameters eta and alpha;

e3, training alternately for N rounds in the following manner:

e31: randomly sampling m latent variables {z¹, ..., z^m} ~ U^d[-1, 1] and Gaussian variables {ε¹, ..., ε^m} ~ N(0, I), and calculating the loss L_D1 of the discrimination network on the generated pictures:

e32: randomly selecting m real samples {x¹, ..., x^m} from the unlabeled training dataset, and calculating the loss L_D2 of the discrimination network on the unlabeled pictures:

e33: randomly selecting m real samples {(x¹, y¹), ..., (x^m, y^m)} from the labeled training dataset, and calculating the loss L_D3 of the discrimination network on the labeled pictures:

e34: updating the semi-supervised feature extraction network parameter η:

e35: randomly sampling m latent variables {z¹, ..., z^m} ~ U^d[-1, 1] and Gaussian variables {ε¹, ..., ε^m} ~ N(0, I), and calculating the real/fake loss L_G1 of the random generation network G on the generated pictures:

e36: randomly selecting m real samples {x¹, ..., x^m} from the unlabeled training dataset, and calculating the feature loss L_G2 of the random generation network G on the generated pictures:

e37: updating the random generation network parameter α:

e38: repeating said steps (e35) to (e37) k times, then returning to said step (e31);

e4, storing the trained random semi-supervised feature extraction network;

where U denotes the uniform distribution, I is the identity matrix, ∇_η is the gradient with respect to the network parameter η, and ∇_α is the gradient with respect to the network parameter α.

According to one aspect of the present invention, in step (f), all the labeled data in the training data set are input into the trained random semi-supervised feature extraction network to obtain corresponding high-dimensional features, and the features and the label information are input into the support vector machine network for training.

According to one aspect of the invention, in the step (g), the trained random semi-supervised feature extraction model and the support vector machine network are saved to form a classification model;

selecting remote sensing data to be classified from the test data set, and preprocessing the remote sensing data to be classified;

inputting the processed picture into a trained random semi-supervised feature extraction network to obtain corresponding high-dimensional features;

and inputting the high-dimensional features into a trained classifier, outputting the class information of the remote sensing picture, and completing the task of classifying the remote sensing picture.

According to one aspect of the invention, the predefined distribution of input variables is a uniform distribution between [ -1,1 ].

According to the concept of the invention, the remote sensing picture classification method based on the random semi-supervised feature extraction model is provided aiming at the characteristics of complex data distribution and mass weak labeling of the remote sensing picture.

According to the scheme of the invention, alternating game training between the random generation network, which has stronger expressive capability, and the semi-supervised feature extraction network yields a random semi-supervised feature extraction network that is better able to capture the high-dimensional features of remote sensing data. This effectively improves the utilization rate of massive weakly labeled remote sensing data and ultimately improves the performance of the remote sensing image classification task.

Drawings

FIG. 1 is a schematic flow chart of a remote sensing picture classification method based on a random semi-supervised feature extraction model according to an embodiment of the invention;

FIG. 2 is a schematic diagram of a randomly generated network architecture in a remote sensing picture classification method based on a random semi-supervised feature extraction model according to an embodiment of the present invention;

FIG. 3 is a schematic diagram of a semi-supervised feature extraction network architecture in a remote sensing picture classification method based on a random semi-supervised feature extraction model according to an embodiment of the present invention;

fig. 4 is a schematic diagram illustrating a confusion matrix result of a surface feature scene classification task in a remote sensing image classification method based on a random semi-supervised feature extraction model according to an embodiment of the present invention.

Detailed Description

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be derived from them without inventive effort.

The present invention is described in detail below with reference to the drawings and the specific embodiments, which are not repeated herein, but the embodiments of the present invention are not limited to the following embodiments.

Referring to fig. 1, the invention discloses a remote sensing picture classification method based on a random semi-supervised feature extraction model, relates to the technical field of remote sensing image interpretation, and belongs to the key intelligent theory and algorithm research of a space intelligent system. The method comprises the steps of firstly establishing a remote sensing scene picture database, establishing a random semi-supervised feature extraction model, establishing a classification network, establishing an optimization target of the random semi-supervised feature extraction model, training the random semi-supervised feature extraction model in a game alternating mode, training a classifier and finishing a remote sensing picture classification task.

In the embodiment, in order to show the effect and the capability of the method on the remote sensing scene classification task, the UC-Merced remote sensing data set is taken as an example to complete a scene classification task with 21 types of ground objects. The UC-Merced data set comprises 21 land use categories, numbered as in fig. 4: 1: mobile home park; 2: beach; 3: tennis court; 4: airplane; 5: dense residential; 6: harbor; 7: buildings; 8: forest; 9: intersection; 10: river; 11: sparse residential; 12: runway; 13: parking lot; 14: baseball diamond; 15: agricultural; 16: storage tanks; 17: chaparral; 18: golf course; 19: freeway; 20: medium residential; 21: overpass. Each class consists of 100 aerial images of 256 × 256 pixels. The method of the embodiment is mainly used to complete the remote sensing picture classification task; in other embodiments, the method of the present invention can also be used for other interpretation tasks based on picture features, such as target detection.

In the invention, when the remote sensing scene picture database is established, remote sensing scene picture data are collected and labeled according to land use type, and are then divided into a labeled data set and an unlabeled data set according to whether a category label is provided. The data must also be split: for the data set of this embodiment, all samples are randomly divided into training data (80%, 1680 pictures) and test data (20%, 420 pictures). Because the data set has many categories and each category contains only 100 pictures, the training data set also needs data enhancement. Specifically, three additional copies of each training picture are obtained by horizontal flipping, vertical flipping and 90-degree rotation, so that together with the original data the expanded training data set contains 6720 pictures in total. Because all pictures in this data set are labeled, the invention constructs the required unlabeled data set by removing the corresponding labels from training data.
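The augmentation arithmetic above (1680 originals plus three transformed copies each, 6720 in total) can be sketched in a few lines. This is an illustration only: pictures are represented as nested lists, the function names are hypothetical, and a clockwise rotation is assumed because the text does not give the direction.

```python
def flip_horizontal(img):
    """Mirror each row left to right."""
    return [row[::-1] for row in img]

def flip_vertical(img):
    """Reverse the order of the rows (top to bottom)."""
    return [row[:] for row in img[::-1]]

def rotate_90(img):
    """Rotate 90 degrees clockwise: reverse the rows, then transpose."""
    return [list(col) for col in zip(*img[::-1])]

def augment(dataset):
    """Return every original picture followed by its three transformed copies."""
    out = []
    for img in dataset:
        out.extend([img, flip_horizontal(img), flip_vertical(img), rotate_90(img)])
    return out

# 1680 toy 2x2 "pictures" stand in for the 1680 training pictures of the embodiment.
train = [[[i, i + 1], [i + 2, i + 3]] for i in range(1680)]
augmented = augment(train)
print(len(augmented))  # 6720
```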

In the invention, the random semi-supervised feature extraction model construction comprises a random generation network G and a semi-supervised feature extraction network D.

As shown in fig. 2, the random generation network G includes an input noise layer, a random layer, a resampling layer, a deconvolution layer, and an output picture layer. The random generation network G comprises 9 layers in total: the first layer is the input layer; the second layer is the random layer, which consists of two identical fully connected networks that learn the mean and the variance of the Gaussian prior distribution obeyed by the output variable of the layer; the third layer is the resampling layer; the next five layers are deconvolution layers with a 4 × 4 convolution kernel and a stride of 2; the last layer is the output layer, which outputs 3-channel pictures of size 256 × 256; the deconvolution layers all use ReLU activation functions, and the output layer uses a tanh activation function.

In the optimization goal of the random generation network G, p_z is the predefined distribution obeyed by the input variable z; ε is the intermediate variable of the random layer and obeys the Gaussian distribution N(0, I); the output variable of the random layer obeys a Gaussian prior distribution; α is the parameter to be trained; f(·) denotes the output feature of a picture after it passes through the semi-supervised feature extraction network D; E is the mathematical expectation with respect to the indicated variable; and x is an input picture.

When the random generation network G is constructed, random sampling noise in predefined distribution is used as input of the random generation network G, the widths and depths of a random layer, a resampling layer and a deconvolution layer are set according to specific task difficulty, output variables of the random layer obey Gaussian prior distribution, and finally pseudo data with the same size as original data are output and used as input of a semi-supervised feature extraction network D.
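The random layer, the resampling layer and the size progression of the five deconvolution layers can be sketched as follows. Several details are assumptions not given in the text: the second fully connected map is taken to output a log-variance, the deconvolutions use padding 1, the first deconvolution receives an 8 × 8 map, and the random layer width of 8 is a toy value. All function names are hypothetical.

```python
import math
import random

def random_layer(z, w_mu, w_logvar):
    """Two fully connected maps of identical shape give a mean and a log-variance."""
    mu = [sum(w * zi for w, zi in zip(row, z)) for row in w_mu]
    logvar = [sum(w * zi for w, zi in zip(row, z)) for row in w_logvar]
    return mu, logvar

def resample(mu, logvar, rng):
    """Resampling layer: h = mu + sigma * eps with eps ~ N(0, I)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]

def deconv_out(size, kernel=4, stride=2, padding=1):
    """Output side length of a transposed convolution (padding=1 is assumed)."""
    return (size - 1) * stride - 2 * padding + kernel

rng = random.Random(0)
d, width = 100, 8                                   # latent dimension d = 100; toy layer width
z = [rng.uniform(-1.0, 1.0) for _ in range(d)]      # input noise z ~ U^d[-1, 1]
w_mu = [[rng.gauss(0.0, 0.02) for _ in range(d)] for _ in range(width)]
w_logvar = [[rng.gauss(0.0, 0.02) for _ in range(d)] for _ in range(width)]
h = resample(*random_layer(z, w_mu, w_logvar), rng)

sizes = [8]                                         # assumed input size of the first deconvolution
for _ in range(5):                                  # five deconvolution layers
    sizes.append(deconv_out(sizes[-1]))
print(sizes)  # [8, 16, 32, 64, 128, 256]
```

With a 4 × 4 kernel, stride 2 and padding 1, each deconvolution exactly doubles the side length, so five layers take an 8 × 8 map to the 256 × 256 output stated in the text.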

As shown in fig. 3, the semi-supervised feature extraction network D includes an input picture layer, a convolution layer, a feature layer, a fully connected layer and an output layer. The semi-supervised feature extraction network D comprises 9 layers: the first layer is the picture input layer, whose input pictures include labeled real pictures, unlabeled real pictures and generated pictures; the next six layers are convolution layers with a 5 × 5 convolution kernel, a stride of 2 and a LeakyReLU activation function with parameter 0.2; the feature layer combines the feature information of the preceding three layers; the last layer is a fully connected layer, which outputs the category or the authenticity information of the picture.

In the optimization goal of the semi-supervised feature extraction network D, η is the parameter to be trained of the feature extraction network; y_i is the i-th component of the label y; D_i(·) is the i-th component of the network output; and K is the total number of classes of the classification task.

When a semi-supervised feature extraction network D is constructed, in a game training stage, inputting pictures including real labeled data, real unlabelled data and generated pseudo data, and outputting values including true and false logic output and picture category output; in the classifier training stage, the input of the semi-supervised feature extraction network D is labeled data, and the output is corresponding high-dimensional features which are used as the input of a classifier; in the stage of realizing the remote sensing classification task, the input of the semi-supervised feature extraction network D is a remote sensing picture to be classified, and the output is corresponding high-dimensional features.
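A short sketch of how the spatial size shrinks through the six convolution layers of D, and of one possible way a feature layer could combine the last three of them. The padding of 2, the use of global max pooling followed by concatenation, and the two-channel toy maps are assumptions; the text only says that the feature layer combines the feature information of the preceding three layers.

```python
def conv_out(size, kernel=5, stride=2, padding=2):
    """Output side length of a strided convolution (padding=2 is assumed)."""
    return (size + 2 * padding - kernel) // stride + 1

sizes = [256]                       # 256 x 256 input picture
for _ in range(6):                  # six convolution layers
    sizes.append(conv_out(sizes[-1]))
print(sizes)  # [256, 128, 64, 32, 16, 8, 4]

def global_max_pool(feature_map):
    """Reduce a list of channels (each a 2-D grid) to one value per channel."""
    return [max(max(row) for row in channel) for channel in feature_map]

def feature_layer(last_three_maps):
    """Concatenate the pooled features of the last three convolution layers."""
    features = []
    for fmap in last_three_maps:
        features.extend(global_max_pool(fmap))
    return features

# Toy feature maps: 2 channels each, with the side lengths of the last three layers.
toy_maps = [[[[float(c)] * s for _ in range(s)] for c in range(2)] for s in sizes[-3:]]
features = feature_layer(toy_maps)
print(len(features))  # 6
```

Pooling before concatenation is one way to merge maps of different spatial sizes (16, 8 and 4 here) into a single fixed-length vector that a linear classifier can consume.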

In the invention, a classification network is constructed by adopting a linear support vector machine network, and the regularization parameter C is 1000.
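For reference, the role of the regularization parameter C can be seen in the standard soft-margin objective of a linear support vector machine. The two-class form is shown, with labels y_i in {-1, +1} and φ_i = f(x_i) the feature of picture x_i produced by network D. How the 21-class problem is decomposed into such sub-problems (for example one-vs-rest) is not stated in the text and would be an assumption.

```latex
\min_{w,\,b}\;\; \frac{1}{2}\,\lVert w \rVert^{2}
\;+\; C \sum_{i=1}^{n} \max\bigl(0,\; 1 - y_i\,(w^{\top}\phi_i + b)\bigr),
\qquad C = 1000
```

A large value such as C = 1000 weights the hinge-loss term heavily relative to the margin term, so the classifier is pushed to fit the training features closely.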

In the invention, the optimization goal of the random semi-supervised feature extraction model is a minimax game problem between the random generation network G and the semi-supervised feature extraction network D, and it consists of two parts, a supervised learning loss and an unsupervised learning loss, as follows:

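In the invention, the training goal of the semi-supervised feature extraction network D is to maximize the probability that labeled data are classified correctly, which is realized by minimizing the cross-entropy on the labeled data, and at the same time to maximize the probability that real data are judged real and generated data are judged fake, which is realized by minimizing the corresponding real/fake losses on the unlabeled real data and on the generated data.

The training goal of the random generation network G is to minimize the probability that the generated data are judged to be fake, which is realized by minimizing its real/fake loss on the generated data, while also minimizing the distance between the generated data features and the real data features, which is realized by minimizing the feature loss.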

For alternating game training, the parameters are set first: the total number of training rounds is N = 200; the random generation network G is updated k = 2 times for each update of the semi-supervised feature extraction network D; the Adam optimization algorithm uses a learning rate λ = 0.0002 and a momentum parameter β₁ = 0.5; the mini-batch size is m = 64; the latent space dimension is d = 100; and K = 21 for the UC-Merced dataset. The network parameters η and α are randomly initialized. Training then alternates in the following way:

for each of the N training rounds do

Randomly sample m latent variables {z¹, ..., z^m} ~ U^d[-1, 1] and Gaussian variables {ε¹, ..., ε^m} ~ N(0, I), and calculate the loss L_D1 of the discrimination network on the generated pictures:

Randomly select m real samples {x¹, ..., x^m} from the unlabeled training dataset, and calculate the loss L_D2 of the discrimination network on the unlabeled pictures:

Randomly select m real samples {(x¹, y¹), ..., (x^m, y^m)} from the labeled training dataset, and calculate the loss L_D3 of the discrimination network on the labeled pictures:

Update the semi-supervised feature extraction network parameter η:

Randomly sample m latent variables {z¹, ..., z^m} ~ U^d[-1, 1] and Gaussian variables {ε¹, ..., ε^m} ~ N(0, I), and calculate the real/fake loss L_G1 of the random generation network G on the generated pictures:

Randomly select m real samples {x¹, ..., x^m} from the unlabeled training dataset, and calculate the feature loss L_G2 of the random generation network G on the generated pictures:

Update the random generation network parameter α:

Repeat the steps of calculating the real/fake loss L_G1 and the feature loss L_G2 and updating the random generation network parameter α k times, then return to the step of calculating the loss L_D1 on the generated pictures;

end for

and finally, storing the trained random semi-supervised feature extraction network.

Here U denotes the uniform distribution, I is the identity matrix, ∇_η is the gradient with respect to the network parameter η, and ∇_α is the gradient with respect to the network parameter α.
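The update schedule of the loop above (one update of D followed by k updates of G in each round) can be sketched as a skeleton. The update functions are placeholders that would compute the losses and apply Adam steps in a real implementation; the names `train`, `update_d` and `update_g` are hypothetical, and giving ε the same dimension d as z is an assumption.

```python
import random

def no_op(z, eps):
    """Placeholder for a real parameter update (loss computation plus Adam step)."""
    return None

def train(num_rounds=200, k=2, m=64, d=100, seed=0, update_d=no_op, update_g=no_op):
    """Alternate one D update with k G updates per round and count the updates."""
    rng = random.Random(seed)
    counts = {"D": 0, "G": 0}

    def sample_batch():
        # m latent vectors z ~ U^d[-1, 1] and m Gaussian vectors eps ~ N(0, I)
        z = [[rng.uniform(-1.0, 1.0) for _ in range(d)] for _ in range(m)]
        eps = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(m)]
        return z, eps

    for _ in range(num_rounds):
        z, eps = sample_batch()
        update_d(z, eps)            # losses L_D1, L_D2, L_D3, then update eta
        counts["D"] += 1
        for _ in range(k):
            z, eps = sample_batch()
            update_g(z, eps)        # losses L_G1, L_G2, then update alpha
            counts["G"] += 1
    return counts

# Small m and d keep the demonstration fast; the embodiment uses m = 64, d = 100.
print(train(m=4, d=5))  # {'D': 200, 'G': 400}
```

With N = 200 and k = 2, the feature extraction network receives 200 updates and the generation network 400.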

In the invention, when training a classifier, all labeled data of a training data set are input into a trained random semi-supervised feature extraction network to obtain corresponding high-dimensional features, and the features and label information are input into a support vector machine network to train the support vector machine network.

In the invention, a trained random semi-supervised feature extraction model and a support vector machine network are stored to form a classification model, the remote sensing picture to be classified is input into the classification model, and finally the class information of the remote sensing picture is output. Specifically, firstly, remote sensing data to be classified is selected from the test data set, the remote sensing data to be classified is preprocessed, and the pictures are uniformly cut into 256 × 256 pictures. And then inputting the processed picture into a trained random semi-supervised feature extraction network to obtain corresponding high-dimensional features. And finally, inputting the high-dimensional features into a trained classifier, and finally outputting the class information of the remote sensing picture, thereby completing the task of classifying the remote sensing picture.

In the present invention, the predefined distribution of input variables is a uniform distribution between [ -1,1 ].

The average classification accuracy of the method can reach more than 98%, and a confusion matrix of the classification result is shown in figure 4.

In summary, the invention combines a random generative adversarial network, which has stronger expressive capability, with a semi-supervised method, exploiting both the ability of generative adversarial networks to capture the data distribution and the advantage of semi-supervised methods in handling massive weakly labeled data. Applied to the remote sensing picture classification task, it can make effective use of massive weakly labeled remote sensing data and has stronger picture feature extraction capability. It can be used to complete the remote sensing picture classification task and can also be widely applied to other remote sensing interpretation tasks based on picture features, such as target detection and ground feature segmentation.

The above description is only one embodiment of the present invention, and is not intended to limit the present invention, and it is apparent to those skilled in the art that various modifications and variations can be made in the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A remote sensing image classification method based on a random semi-supervised feature extraction model, comprising the following steps:

a. Establish a remote sensing scene image database;

b. Build a random semi-supervised feature extraction model;

c. Build a classification network;

d. Establish the optimization objective of the random semi-supervised feature extraction model;

e. Train the random semi-supervised feature extraction model by alternating game training;

f. Train the classifier;

g. Complete the task of remote sensing image classification.

2. The method according to claim 1, characterized in that, in step (a), remote sensing scene picture data are collected and labeled by land use type; all data are then divided, according to whether they carry a class label, into a labeled data set and an unlabeled data set; a part of the labeled data set is used as test data and does not participate in training, while the remaining part together with all unlabeled data constitutes the training data set; finally, the training data set is augmented by horizontal flipping, vertical flipping, and 90-degree rotation.
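The three augmentation operations named in this claim can be sketched on a picture stored as a list of pixel rows. The function names are hypothetical, and the clockwise direction of the 90-degree rotation is an assumption, since the text does not state a direction.

```python
def flip_horizontal(image):
    """Mirror each row left to right."""
    return [row[::-1] for row in image]


def flip_vertical(image):
    """Reverse the order of the rows (top to bottom)."""
    return [list(row) for row in image[::-1]]


def rotate_90(image):
    """Rotate 90 degrees clockwise (direction is an assumption)."""
    return [list(col) for col in zip(*image[::-1])]


def augment(image):
    """Return the original picture plus its three augmented variants."""
    return [image, flip_horizontal(image), flip_vertical(image), rotate_90(image)]


sample = [[1, 2],
          [3, 4]]
variants = augment(sample)
```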

3. The method according to claim 1, wherein in the step (b), constructing a random semi-supervised feature extraction model comprises constructing a random generation network G and a semi-supervised feature extraction network D;

The random generation network G includes an input noise layer, a random layer, a resampling layer, a deconvolution layer and an output image layer;

The semi-supervised feature extraction network D includes an input image layer, a convolutional layer, a feature layer, a fully connected layer and an output layer;

The optimization objective of the random generation network G is to find α* such that:

where p_z is the predefined distribution obeyed by the input variable z; ε is the intermediate variable of the random layer, which obeys the Gaussian distribution N(0, I), and the output variable of the random layer obeys the Gaussian prior distribution; α is the parameter to be trained; f(·) denotes the output feature of a picture passed through the semi-supervised feature extraction network D; E is the mathematical expectation with respect to the indicated variable; and x is the input image;

The optimization objective of the semi-supervised feature extraction network D is to find η* such that:

where η is the parameter of the feature extraction network to be trained; y_i denotes the i-th component of the label y; D_i(·) denotes the i-th component of the network output; and K is the total number of categories of the classification task;

Building a random generation network G includes the following steps:

b11. Randomly sample noise from a predefined distribution as the input of the random generation network G;

b12. Set the width and depth of the random layer, resampling layer, and deconvolution layer according to the difficulty of the specific task, wherein the output variable of the random layer obeys the Gaussian prior distribution;

b13. Output pseudo data of the same size as the original data, and use it as the input of the semi-supervised feature extraction network D;

When building a semi-supervised feature extraction network D:

In the game training stage, the input pictures include real labeled data, real unlabeled data and generated pseudo data, and the outputs include a real/fake logical output and a picture category output;

In the classifier training stage, the input of the semi-supervised feature extraction network D is the labeled data, and the output is the corresponding high-dimensional feature, which serves as the input of the classifier;

In the stage of performing the remote sensing classification task, the input of the semi-supervised feature extraction network D is the remote sensing image to be classified, and the output is the corresponding high-dimensional feature;

The random generation network G consists of 9 layers: the first layer is the input layer; the second layer is the random layer, which consists of two identical fully connected networks used to learn the mean and the variance of the Gaussian prior distribution obeyed by the output variable of this layer; the third layer is the resampling layer; the next five layers are deconvolution layers, in which the convolution kernel size is 4×4 and the stride is 2; the last layer is the output layer, whose output is a 3-channel picture of size 256×256; the deconvolution layers use the ReLU activation function, and the output layer uses the tanh activation function;

The semi-supervised feature extraction network D consists of 9 layers: the first layer is the image input layer, whose input images include labeled real pictures, unlabeled real pictures, and generated pictures; the subsequent six layers are convolutional layers, in which the convolution kernel size is 5×5, the stride is 2, and the activation function is leakyReLU with parameter 0.2; the feature layer combines the feature information of the previous three layers; the last layer is a fully connected layer, which outputs the category or authenticity information of the picture.
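The layer settings in this claim can be checked with simple size arithmetic. The sketch below applies the usual output-size formulas for strided and transposed convolutions, together with a reparameterised draw for the resampling layer. The padding values (1 for the 4×4 deconvolutions, 2 for the 5×5 convolutions), the 8×8 starting feature map of G, and the function names are assumptions chosen so that the sizes match the stated 256×256 picture; the claim does not specify them.

```python
import math


def deconv_out(size, kernel=4, stride=2, padding=1):
    """Output width of a transposed convolution (no output padding)."""
    return (size - 1) * stride - 2 * padding + kernel


def conv_out(size, kernel=5, stride=2, padding=2):
    """Output width of a strided convolution."""
    return (size + 2 * padding - kernel) // stride + 1


def resample(mean, variance, eps):
    """Reparameterised draw: mean + sqrt(variance) * eps, element-wise."""
    return [m + math.sqrt(v) * e for m, v, e in zip(mean, variance, eps)]


# Network G: five deconvolution layers, assumed to start from an 8 x 8 map.
g_sizes = [8]
for _ in range(5):
    g_sizes.append(deconv_out(g_sizes[-1]))

# Network D: six convolution layers applied to a 256 x 256 picture.
d_sizes = [256]
for _ in range(6):
    d_sizes.append(conv_out(d_sizes[-1]))

latent = resample([0.0, 1.0], [1.0, 4.0], [0.5, -1.0])
```

Under these assumptions each deconvolution doubles the spatial size and each convolution halves it, so five deconvolutions take 8 to 256 and six convolutions take 256 to 4.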

4. The method according to claim 1, wherein in step (c), a linear support vector machine network is used to construct the classification network, and the regularization parameter C is 1000.

5. The method according to claim 1, characterized in that, in step (d), the optimization objective established for the random semi-supervised feature extraction model is a minimax game between the random generation network G and the semi-supervised feature extraction network D, and consists of two parts, a supervised learning loss and an unsupervised learning loss, as follows:

where p is the real remote sensing data distribution; p_G is the data distribution generated by the random generation network G; p_D(y|x, y≤K) is the probability that the input image x is judged to have label y; p_D(y≤K|x) is the probability that the input image x is real data; p_D(K+1|x) is the probability that the input image x is generated fake data; and J is the loss function of the networks G and D.
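For orientation, one form of this objective that is consistent with the symbols defined in this claim is given below. It is a sketch under assumptions, not the patent's own formula: it assumes the common semi-supervised generative adversarial formulation in which D has K real-class outputs plus a (K+1)-th "generated" output, and the names J_sup and J_unsup, the sign convention, and the min/max ordering are introduced here for illustration.

```latex
\begin{align}
\min_{G}\max_{D}\; J(G, D) &= J_{\mathrm{sup}} + J_{\mathrm{unsup}}, \\
J_{\mathrm{sup}} &= \mathbb{E}_{(x,y)\sim p}\,\log p_D(y \mid x,\, y \le K), \\
J_{\mathrm{unsup}} &= \mathbb{E}_{x\sim p}\,\log p_D(y \le K \mid x)
  + \mathbb{E}_{x\sim p_G}\,\log p_D(K+1 \mid x).
\end{align}
```

In this reading, D raises all three log-probabilities, while G lowers the last term, that is, the probability that its output is judged to be generated.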

6. The method according to claim 1, wherein in step (e), the training objective of the semi-supervised feature extraction network D is to maximize the probability that labeled data are classified correctly, which is achieved by minimizing the cross-entropy loss on the labeled data, and to maximize the probability that real data are judged to be real and generated data are judged to be fake, which is achieved by minimizing the two corresponding authenticity losses, one on the real data and one on the generated data;

The training objective of the random generation network G is to minimize the probability that generated data are judged to be fake, which is achieved by minimizing the authenticity loss on the generated data, and at the same time to minimize the distance between the features of the generated data and the features of the real data, which is achieved by minimizing the corresponding feature distance loss;

Repeat the alternate training between the semi-supervised feature extraction network D and the random generation network G until the specified number of training steps is reached, and save the trained random semi-supervised feature extraction network.
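The loss terms named in this claim can be illustrated numerically from a vector of K+1 network outputs. The sketch below is one possible realisation under assumptions: it treats the first K outputs as class logits and the last output as the "generated" logit, uses a softmax to obtain probabilities, and measures the feature distance as a squared L2 distance between batch-mean features. None of these choices, nor the function names, are stated in the claim.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def class_probability(logits, label):
    """p_D(y | x, y <= K): softmax restricted to the first K outputs."""
    return softmax(logits[:-1])[label]


def fake_probability(logits):
    """p_D(K+1 | x): probability mass on the last, 'generated' output."""
    return softmax(logits)[-1]


def supervised_loss(logits, label):
    """Cross-entropy on a labeled real picture (minimized by D)."""
    return -math.log(class_probability(logits, label))


def real_loss(logits):
    """Penalty when a real picture is judged to be generated (minimized by D)."""
    return -math.log(1.0 - fake_probability(logits))


def fake_loss(logits):
    """Penalty when a generated picture is judged to be real (minimized by D)."""
    return -math.log(fake_probability(logits))


def generator_loss(logits):
    """Log-probability that a generated picture is judged fake (minimized by G)."""
    return math.log(fake_probability(logits))


def feature_distance(real_feats, fake_feats):
    """Squared L2 distance between batch-mean feature vectors (minimized by G)."""
    dim = len(real_feats[0])
    mean_real = [sum(f[j] for f in real_feats) / len(real_feats) for j in range(dim)]
    mean_fake = [sum(f[j] for f in fake_feats) / len(fake_feats) for j in range(dim)]
    return sum((a - b) ** 2 for a, b in zip(mean_real, mean_fake))


logits = [0.0, 0.0, 0.0]          # K = 2 classes plus one 'generated' output
sup = supervised_loss(logits, 0)
dist = feature_distance([[1.0, 2.0], [3.0, 4.0]], [[0.0, 0.0], [2.0, 2.0]])
```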

7. The method according to claim 6, characterized in that, in step (e), the alternating game training is carried out according to the following steps:

e1. Set the total number of training rounds N = 200; each time the semi-supervised feature extraction network D is updated once, the random generation network G is updated k = 2 times; the learning rate λ and the momentum parameter β1 of the Adam optimization algorithm are λ = 0.0002 and β1 = 0.5; the mini-batch size is m = 64; the latent space dimension is d = 100; and K = 21 for the UC-Merced dataset;

e2. Randomly initialize the network parameters η and α;

e3. Alternately train N times as follows:

e31: Randomly sample m latent variables {z^1, ..., z^m} ~ U^d[-1, 1] and m Gaussian variables {ε^1, ..., ε^m} ~ N(0, I), and calculate the loss L_D1 of the discriminative network on the generated images:

e32: Randomly select m real samples {x^1, .

e33: Randomly select m real samples {(x^1, y^1), ..., (x^m, y^m)} from the labeled training data set, and calculate the loss L_D3 of the discriminative network on the labeled images:

e34: Update the parameters η of the semi-supervised feature extraction network:

e35: Randomly sample m latent variables {z^1, ..., z^m} ~ U^d[-1, 1] and m Gaussian variables {ε^1, ..., ε^m} ~ N(0, I), and calculate the authenticity loss L_G1 of the random generation network G on the generated images:

e36: Randomly select m real samples {x^1, .

e37: Update the parameters α of the random generation network:

e38: Repeat steps (e35) to (e37) k times, then return to step (e31);

e4. Save the trained random semi-supervised feature extraction network;

where U is the uniform distribution, I is the identity matrix, ∇_η is the gradient with respect to the network parameter η, and ∇_α is the gradient with respect to the network parameter α.
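The update schedule in steps e3 to e38 can be sketched as a nested loop: one update of D per round, followed by k updates of G. The callables `update_d` and `update_g` are hypothetical placeholders for the loss computation and Adam steps described above; here they only count calls so that the schedule itself can be checked.

```python
def alternate_training(n_rounds, k, update_d, update_g):
    """Run n_rounds rounds: one update of D, then k updates of G."""
    for _ in range(n_rounds):
        update_d()             # steps e31 to e34
        for _ in range(k):
            update_g()         # steps e35 to e37, repeated k times


# Hypothetical stand-ins that only count how often each network is updated.
counts = {"D": 0, "G": 0}


def fake_update_d():
    counts["D"] += 1


def fake_update_g():
    counts["G"] += 1


alternate_training(n_rounds=200, k=2, update_d=fake_update_d, update_g=fake_update_g)
```

With N = 200 and k = 2 as in step e1, D receives 200 updates and G receives 400.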

8. The method according to claim 1, characterized in that, in step (f), all labeled data in the training data set are input into the trained random semi-supervised feature extraction network to obtain the corresponding high-dimensional features, and the features and label information are input into the support vector machine network for training.

9. The method according to claim 1, characterized in that, in step (g), the trained random semi-supervised feature extraction model and the trained support vector machine network are saved to form a classification model;

Select remote sensing data to be classified from the test data set, and preprocess the remote sensing data to be classified;

Input the processed images into the trained random semi-supervised feature extraction network to obtain corresponding high-dimensional features;

Input the high-dimensional features into the trained classifier, output the category information of the remote sensing image, and complete the remote sensing image classification task.

10. The method according to any one of claims 1-9, wherein the predefined distribution of the input variable is a uniform distribution between [-1, 1].