CN112560960A

CN112560960A - Hyperspectral image classification method and device and computing equipment

Info

Publication number: CN112560960A
Application number: CN202011498866.1A
Authority: CN
Inventors: 樊硕
Original assignee: Beijing Moviebook Technology Corp ltd
Current assignee: Beijing Moviebook Technology Corp ltd
Priority date: 2020-12-16
Filing date: 2020-12-16
Publication date: 2021-03-26
Anticipated expiration: 2040-12-16
Also published as: CN112560960B

Abstract

The application discloses a hyperspectral image classification method and device and computing equipment. The method comprises the following steps: expanding the original training set by adopting a semi-supervised learning strategy to obtain a first training set D; expanding the first training set by adopting an OCSP algorithm to obtain a second training set D'; the single convolution layer neural network is trained by using FD, and the trained single convolution layer neural network is obtained, wherein the FD is { D, D' }; and classifying the hyperspectral images to be classified by using the trained single convolutional layer neural network. The device comprises: the device comprises a first data expansion module, a second data expansion module, a training module and a classification module. The computing device comprises a memory, a processor, and a computer program stored in the memory and executable by the processor, wherein the processor implements the methods described herein when executing the computer program.

Description

Hyperspectral image classification method and device and computing equipment

Technical Field

The application relates to the field of hyperspectral image classification, in particular to an expansion technology of training samples.

Background

A hyperspectral sensor can capture hundreds of spectral bands with high spectral resolution. Hyperspectral images (HSI) are applicable in many fields, such as land cover detection, environmental monitoring, medical diagnostics, and military reconnaissance. Hyperspectral classification is an important research topic, and assigns a label to each pixel in a hyperspectral image. Traditional classical algorithms such as K-nearest neighbors (KNN), Maximum Likelihood Classification (MLC), Support Vector Machines (SVM) and Artificial Neural Networks (ANN) have been successfully applied to hyperspectral classification with acceptable accuracy.

The detailed and rich spectral information contained in the hyperspectral image allows a more precise distinction between the different classes. Due to certain practical limitations, the cost of HSI high spectral resolution is its limited spatial resolution, which leads to the widespread existence of mixed pixels. More than one class is included in each blended pixel, whose spectral response is effectively a blend of the responses of the various materials present in the instantaneous field of view (IFOV) of the sensor, and therefore conventional pixel-level hard classification is not suitable for blended pixel classification.

Sub-pixel mapping is a well-known technique to solve this problem. Each hybrid pixel is divided into several sub-pixels, and different sub-pixels may belong to the same class. The sub-pixel mapping takes as input the abundance of each class within the blended pixel and predicts the spatial distribution of the sub-pixels therein. That is, the sub-pixel map outputs a hard classification map with higher spatial resolution.

On the other hand, cursing of the HSI dimension requires a large number of training samples to ensure the accuracy of the HSI supervised classification. In practice, however, the available training samples are often very limited.

Deep neural networks are capable of learning advanced functions through deep learning. The Stack Automatic Encoder (SAE), the Deep Belief Network (DBN) and the Convolutional Neural Network (CNN) can be used for the vision-based problem as a typical deep neural network architecture, and especially the CNN has unique local receptive field characteristics and plays a major role in image classification. CNN is a typical supervised model, requiring a large training data set to trigger its function, but hyperspectral images can only provide a certain number of labeled samples. Nonetheless, CNN is still widely used to provide better performance than SVM in different implementations of hyperspectral image classification. All the aforementioned algorithms depend on a training set with a balanced distribution.

In general, in most cases, traditional hyperspectral classification algorithms tend to perform better on large classes than on small classes, which means that these algorithms only focus on improving the overall accuracy, and neglect class-specific accuracy. It is a common consensus among people for hyperspectral image classification that, in order to improve overall accuracy, it contributes more to correct classification of large-size data than small-size data. In practice, however, the correct classification of small classes is more important than the classification of large classes, since they are usually foreground classes of interest. However, the number of small classes far exceeds the number of large classes. Therefore, recent research has focused on unbalanced data issues, with particular attention to small sample sets or categories.

In summary, the hyperspectral classification has the following problems:

1. hyperspectral images can only provide a certain number of marked samples and cannot meet the need of cursing CNN and HSI dimensions;

2. the accuracy of sub-pixel mapping is greatly influenced by the problem that the hyperspectral image training samples are limited.

Disclosure of Invention

It is an object of the present application to overcome the above problems or to at least partially solve or mitigate the above problems.

According to one aspect of the application, a hyperspectral image classification method is provided, and comprises the following steps:

expanding the original training set by adopting a semi-supervised learning strategy to obtain a first training set D;

expanding the first training set by adopting an OCSP algorithm to obtain a second training set D';

the single convolution layer neural network is trained by using FD, and the trained single convolution layer neural network is obtained, wherein the FD is { D, D' };

and classifying the hyperspectral images to be classified by using the trained single convolutional layer neural network.

Optionally, the expanding the original training set by using the semi-supervised learning strategy to obtain the first training set D includes:

will be assembled

Initializing to an empty set, wherein k represents the number of circulation;

the current training set

Middle unlabeled training sample X_iIs shown as

Calculating to obtain coefficients

Wherein v represents the training sample X_iDirect neighborhood pixels in four directions, up, down, left, right,

v

1, 2, 3, 4,

i

1, 2, …, n,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

The number of training samples;

according to the coefficient

Computing

Fractional abundance belonging to each class c

Using samples

To pair

Updating: if it is not

Then the sample is taken

Assigning to category labels

And add the sample to

Td is a preset threshold value; in the same way, using

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

If the number of training samples of the augmented training set, of which no pixels are surrounded by the augmented training set, meets the requirement, the augmented training set is used as a first training set D; otherwise, enter the (k +1) th cycle.

Optionally, the expanding the first training set D by using the OCSP algorithm to obtain a second training set D' includes:

generating an artificial sample set R according to the spectral range of the training samples in the first training set D;

applying gradient constraint on the artificial sample set R to filter samples in the artificial sample set R to obtain a synthesized sample R_u；

By passing

Computing a corresponding positive interaction complement space

Wherein I is an identity matrix, D^#Expressing the pseudo-inverse of D, and calculating

To obtain the projections of the sample set Ru on the orthogonal subspace of the first training set D, and to select the samples with projection values smaller than Ng in D as the training set finally obtained by OCSP, i.e. the second training set D', where Ng is a predefined parameter.

Optionally, the single convolutional layer neural network sequentially includes an input layer, a convolutional layer, a max pooling layer, a full connection layer, and an output layer.

Optionally, the convolutional layer uses tanh as an activation function, the max-pooling layer uses maxporoling as an activation function, the fully-connected layer uses tanh as an activation function, and the output layer uses softmax as an activation function.

According to the hyperspectral image classification method, the current training set is expanded by adopting a semi-supervised learning strategy of iterative cycle, and the OCSP is adopted for sample expansion, so that a sufficient number of training samples can be obtained, the hyperspectral classification precision of a convolutional neural network is improved, and the hyperspectral image classification performance of the hyperspectral image classification device is enhanced.

According to another aspect of the present application, there is provided a hyperspectral image classification apparatus including:

a first data expansion module configured to expand an original training set by using a semi-supervised learning strategy to obtain a first training set D;

the second data expansion module is configured to expand the first training set by adopting an OCSP algorithm to obtain a second training set D';

a training module configured to train the single convolutional layer neural network with the FD to obtain a trained single convolutional layer neural network, where FD is { D, D'; and

a classification module configured to classify the hyperspectral image to be classified using the trained single convolutional layer neural network.

Optionally, the first data expansion module includes:

an initialization submodule configured to aggregate

Initializing to an empty set, wherein k represents the number of circulation;

a coefficient calculation submodule configured to calculate a current training set

Middle unlabeled training sample X_iIs shown as

Calculating to obtain coefficients

Wherein, i is 1, 2, …, n,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

The number of training samples;

a fractional abundance calculation submodule configured to calculate a fractional abundance from the coefficient

Computing

Fractional abundance belonging to each class c

An update submodule configured to utilize the samples

To pair

Updating: if it is not

Then the sample is taken

Assigning to category labels

And add the sample to

Wherein Td is a predetermined threshold value, and the same manner is adopted by

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

And

a judgment sub-module configured to take the augmented training set as a first training set D if the number of training samples in the augmented training set for which no pixel is surrounded by the augmented training set meets a requirement; otherwise, enter the (k +1) th cycle.

Optionally, the second data expansion module includes:

an artificial sample set generation submodule configured to generate an artificial sample set R from the spectral ranges of the training samples in the first training set D;

a filtering submodule configured to apply a gradient constraint to the set of artificial samples R to filter the samples in the set of artificial samples R to obtain a synthetic sample R_u(ii) a And

a sample selection submodule configured to pass

Computing a corresponding positive interaction complement space

To obtain a sample set R_uThe projections of the orthogonal subspace on the first training set D, samples with projection values smaller than Ng are selected in D as the training set finally obtained by OCSP, i.e. the second training set D', where Ng is a predefined parameter.

Optionally, the single convolutional layer neural network sequentially comprises an input layer, a convolutional layer, a max pooling layer, a fully-connected layer and an output layer, wherein the convolutional layer uses tanh as an activation function, the max pooling layer uses maxporoling as an activation function, the fully-connected layer uses tanh as an activation function, and the output layer uses softmax as an activation function.

The hyperspectral image classification device adopts the iteration-cycle semi-supervised learning strategy to expand the current training set, and adopts the OCSP to expand the samples, so that a sufficient number of training samples can be obtained, the hyperspectral classification precision of a convolutional neural network is improved, and the hyperspectral image classification performance of the hyperspectral image classification device is enhanced.

According to a third aspect of the present application, there is provided a computing device comprising a memory, a processor and a computer program stored in the memory and executable by the processor, wherein the processor implements the method of the present application when executing the computer program.

The above and other objects, advantages and features of the present application will become more apparent to those skilled in the art from the following detailed description of specific embodiments thereof, taken in conjunction with the accompanying drawings.

Drawings

Some specific embodiments of the present application will be described in detail hereinafter by way of illustration and not limitation with reference to the accompanying drawings. The same reference numbers in the drawings identify the same or similar elements or components. Those skilled in the art will appreciate that the drawings are not necessarily drawn to scale. In the drawings:

FIG. 1 is a schematic flow chart of a hyperspectral image classification method according to an embodiment of the application;

FIG. 2 is a schematic flow chart of step S1 in FIG. 1;

FIG. 3 is a schematic diagram of training data set enhancement in a single cycle of a semi-supervised learning strategy according to an embodiment of the present application;

FIG. 4 is a schematic flow chart of step S2 in FIG. 1;

FIG. 5 is a schematic structural diagram of a hyperspectral image classification apparatus according to an embodiment of the application;

FIG. 6 is a schematic diagram of a first data expansion module of FIG. 5;

FIG. 7 is a schematic diagram of a second data expansion module shown in FIG. 5;

FIG. 8 is a schematic block diagram of a computing device according to one embodiment of the present application;

fig. 9 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present application.

Detailed Description

Fig. 1 is a schematic flow chart of a hyperspectral image classification method according to an embodiment of the application. The method may generally include:

step S1, expanding the original training set by adopting a semi-supervised learning strategy to obtain a first training set D;

step S2, expanding the first training set by adopting an OCSP algorithm to obtain a second training set D';

step S3, the single convolution layer neural network is trained by using FD, and the trained single convolution layer neural network is obtained, wherein the FD is { D, D' };

and step S4, classifying the hyperspectral images to be classified by using the trained single convolutional layer neural network.

The original training set has a total of C classes, C represents the class label, and C is 1, 2. Original training set

Wherein the n training samples are respectively

To

Any one of the samples

R^N×PRepresenting an HSI image with N pixels P-band in each spectral band, j-1, 2. Samples in training set and corresponding classification labelsThe label is represented as

Wherein

Is that

The classification label of (1).

As shown in fig. 2, the step S1 includes:

step S11, in the k-th iteration, the set is collected

Initializing to an empty set containing no content, k representing the number of cycles;

step S12, for the current training set

Any one of the training samples X_iI-1, 2, …, n, four of its immediately adjacent nodes being

If said X is_iIs not marked, it can be represented cooperatively as

Current m training samples

Linear combination of (2), X_iThe calculation process of (2) is as follows:

coefficient of performance

The calculation process of (2) is as follows:

wherein,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

The number of training samples in the training set,

representing in the k-th iteration loop

Current m training samples

Representing a current training set in the k iteration process;

step S13, according to the coefficient

Computing

Fractional abundance belonging to each class c

Because of small sample space, an expanded training sample set is obtained by violence selection from an original training sample set, and the closest training sample set is selected from the expanded training sample set in order to integrate information on space

K spatially adjacent training samples

The corresponding coefficients of the samples are obtained through the calculation of the formulas (1), (2) and (3)

Coefficient of performance

Is normalized to beta_i＝[β_i，1，β_i，2，…，β_i，j，…β_i，K]。β_iIn (1)

Sample(s)

Fractional abundance belonging to each class c

The calculation process of (2) is as follows:

wherein alpha is_jThe value may be a fixed value or a weighted valueTo adjust according to the training result; beta is a_i，jIs that

The regularization coefficients of (a) are,

λ_jis a sample

To the sample

The distance between the two or more of the two or more,

representing a sample

Coordinates on the X-axis;

representing a sample

Coordinates on the X-axis;

representing a sample

Coordinates on the Y-axis;

representing a sample

Coordinates on the Y-axis;

in formula (4)

Representation and sample

The number of training samples belonging to class c among the eight adjacent training samples, and thus the obtained training samples can be used

Calculating fractional abundance for each class c

Step S14 of using the sample

To pair

Updating:

if it is not

Then the sample is taken

Assigning to category labels

And mixing the sample

Is added to

Td is a preset threshold value;

in the same way, using

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

Step S15, if the number of training samples of the augmented training set is satisfied, the augmented training set is used as a first training set D; otherwise, enter the (k +1) th cycle.

Fig. 3 depicts the training set expansion process in a single cycle. Therein

A neighborhood set representing all current training samples.

And middle A, B, C and D represent four samples, respectively. From

To

Means that four sample points directly adjacent to the sample are screened out

To

The calculation result after the formula (1) is shown.

As shown in fig. 4, the step S2 includes:

step S21, based on the subclass of original training data set (i.e. the first training set D, D ═ D)₁，d₂，…d_j，…，d_p}) to generate a sample set R of artificial random screening, i.e. an artificial sample set:

R＝{d₁，d₂，…d_j，…，d_q}

where p ≠ q is allowed.d_jThe spectral range of each band in the spectrum is defined as [ db ]_min：db_max]，db_minAnd db_maxRespectively representing the frequency minima and the frequency maxima of the spectral range of said band, and thus, can be within

Randomly selecting a spectral value of the h wave band of the artificial sample, wherein h belongs to {1, 2, 3 …, mn }, and mn represents the wave band number in the hyperspectral data set;

step S22, applying gradient constraint to filter the synthetic samples in R which are seriously deviated from the actual training samples to obtain synthetic samples R_u：

Is provided with

For the average sample of the true sample band, the gradient vector is calculated as follows:

here, the

The indicative expression can be written as:

r＝[r₁，r₂，…，r_s，…，r_mn-1] (6)

wherein,

r is used to ensure that the randomly synthesized instances have the same trend of variation as the original actual training samples.

According to the above steps, instances can be further selected from R to form a new sample set R_u。

Generating a sample set R_uThe pseudo-code of (1) is as follows:

step S23, by

Computing a corresponding positive interaction complement space

From a global perspective, samples exhibiting similar spectral features are likely to belong to the same class, and from a local perspective, spatially neighboring pixels are more likely to share the same class label, and thus, the training set expansion strategy of step S2 is feasible.

In step S3, the data set FD of the input single-layer neural network is { D, D' }, the convolutional neural network is composed of an input layer, a convolutional layer, a max pooling layer, a full-link layer, and an output layer, the convolutional layer uses tanh as an activation function, the max pooling layer uses maxporoling as an activation function, the full-link layer uses tanh as an activation function, the output layer uses softmax as an activation function, and each sample pixel is used as an input to the input layer.

The size of the input layer is (n)₁，1)，n₁Representing the number of bands of the hyperspectral image. Convolutional layer pass k₁X 1 size t core and n₁The x 1 input vector is filtered. Then, the number of nodes in the convolutional layer becomes t × n₂X 1, and n₂＝n₁-k₁+1. Between the input layer and the convolutional layer, there is t × (k)₁+1) training parameters. The kernel size adopted by the maximum pooling layer is k₂X 1. Maximum pool stratification comprises t × n₃X 1 nodes, where n₃＝n₂÷k₂. The full connection layer comprises n₄A node, t x (n) between the layer and the previous layer₃+1)×n₄A training parameter. The last output layer has n₅A node, n₅Represents the number of classes, and has (n)₄+1)×n₅And (c) trainable parameters.

In a single convolutional layer neural network, the convolutional layer and the max pooling layer may serve as feature extractors for input of the hyperspectral dataset. Fully connected layers may be identified as trainable classifiers.

In this embodiment, values of different parameters in a set of neural networks are given, but sizes of images in actual tasks may be different, which causes the parameters of the neural networks to change, and the present application is not limited to the given set of parameters. The values of various parameters in the neural network are determined according to input data, and a group of optimal solutions is selected after training is finished. This set of parameters is:

n₁＝200，k₁＝28，t＝20，k₂＝5，n₄＝100，n₅＝16

fig. 5 is a schematic structural diagram of a hyperspectral image classification apparatus according to an embodiment of the application. As shown in fig. 5, the hyperspectral image classification apparatus includes:

a first data expansion module 1 configured to expand an original training set by using a semi-supervised learning strategy to obtain a first training set D;

a second data expansion module 2 configured to expand the first training set by using an OCSP algorithm to obtain a second training set D';

a training module 3 configured to train the single convolutional layer neural network with FD, so as to obtain a trained single convolutional layer neural network, where FD is { D, D'; and

a classification module 4 configured to classify the hyperspectral image to be classified using the trained single convolutional layer neural network.

As shown in fig. 6, the first data expansion module 1 includes:

an initialization submodule 11 configured to aggregate

Initializing to an empty set, wherein k represents the number of circulation;

a coefficient calculation submodule 12 configured to calculate a current training set

Middle unlabeled training sample X_iIs shown as

Calculating to obtain coefficients

Wherein, i is 1, 2, …, n,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

Middle trainingThe number of training samples;

a fractional abundance calculation submodule 13 configured to calculate a fractional abundance from the coefficient

Computing

Fractional abundance belonging to each class c

An update submodule 14 configured to utilise the samples

To pair

Updating: if it is not

Then the sample is taken

Assigning to category labels

And add the sample to

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

And

a judging submodule 15 configured to take the augmented training set as a first training set D if the number of training samples in the augmented training set for which no pixel is surrounded by the augmented training set meets a requirement; otherwise, enter the (k +1) th cycle.

As shown in fig. 7, the second data expansion module 2 includes:

an artificial sample set generating submodule 21 configured to generate an artificial sample set R from the spectral ranges of the training samples in the first training set D;

a filtering submodule 22 configured to apply a gradient constraint to the set of artificial samples R to filter the samples in the set of artificial samples R to obtain a synthetic sample R_a(ii) a And

a sample selection submodule 23 configured to pass

Computing a corresponding positive interaction complement space

The single convolutional layer neural network sequentially comprises an input layer, a convolutional layer, a maximum pooling layer, a fully-connected layer and an output layer, wherein the convolutional layer uses tanh as an activation function, the maximum pooling layer uses maxpouling as an activation function, the fully-connected layer uses tanh as an activation function, and the output layer uses softmax as an activation function.

The working principle and the effect of the hyperspectral image classification device in the embodiment of the application are the same as those of the hyperspectral image classification method in the embodiment of the application, and the description is omitted here.

Further provided is a computing device, referring to fig. 8, comprising a memory 1120, a processor 1110 and a computer program stored in said memory 1120 and executable by said processor 1110, the computer program being stored in a space 1130 for program code in the memory 1120, the computer program, when executed by the processor 1110, implementing the method steps 1131 for performing any of the methods according to the invention.

The embodiment of the application also provides a computer readable storage medium. Referring to fig. 9, the computer readable storage medium comprises a storage unit for program code provided with a program 1131' for performing the steps of the method according to the invention, which program is executed by a processor.

The embodiment of the application also provides a computer program product containing instructions. Which, when run on a computer, causes the computer to carry out the steps of the method according to the invention.

In the above embodiments, the implementation may be wholly or partially realized by software, hardware, firmware, or any combination thereof. When implemented in software, may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When loaded and executed by a computer, cause the computer to perform, in whole or in part, the procedures or functions described in accordance with the embodiments of the application. The computer may be a general purpose computer, a special purpose computer, a network of computers, or other programmable device. The computer instructions may be stored in a computer readable storage medium or transmitted from one computer readable storage medium to another, for example, from one website site, computer, server, or data center to another website site, computer, server, or data center via wired (e.g., coaxial cable, fiber optic, Digital Subscriber Line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.). The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device, such as a server, a data center, etc., that incorporates one or more of the available media. The usable medium may be a magnetic medium (e.g., floppy Disk, hard Disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

Those of skill would further appreciate that the various illustrative components and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both, and that the various illustrative components and steps have been described above generally in terms of their functionality in order to clearly illustrate this interchangeability of hardware and software. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.

It will be understood by those skilled in the art that all or part of the steps in the method for implementing the above embodiments may be implemented by a program, and the program may be stored in a computer-readable storage medium, where the storage medium is a non-transitory medium, such as a random access memory, a read only memory, a flash memory, a hard disk, a solid state disk, a magnetic tape (magnetic tape), a floppy disk (floppy disk), an optical disk (optical disk), and any combination thereof.

The above description is only for the preferred embodiment of the present application, but the scope of the present application is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present application should be covered within the scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims

1. A hyperspectral image classification method comprises the following steps:

2. The method of claim 1, wherein the expanding the original training set using the semi-supervised learning strategy to obtain the first training set D comprises:

will be assembled

Initializing to an empty set, wherein k represents the number of circulation;

the current training set

Middle unlabeled training sample X_iIs shown as

Calculating to obtain coefficients

Wherein v represents the training sample X_iDirect neighborhood pixels in four directions, up, down, left, right, v 1, 2, 3, 4, i 1, 2, …, n,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

The number of training samples;

according to the coefficient

Computing

Fractional abundance belonging to each class c

Using samples

To pair

Updating: if it is not

Then the sample is taken

Assigning to category labels

And add the sample to

Td is a preset threshold value; in the same way, using

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

3. The method of claim 1 or 2, wherein the expanding the first training set D by using the OCSP algorithm to obtain a second training set D' comprises:

By passing

Computing a corresponding positive interaction complement space

4. The method of any one of claims 1-3, wherein the single convolutional layer neural network comprises an input layer, a convolutional layer, a max-pooling layer, a fully-connected layer, and an output layer in that order.

5. The method of claim 4, wherein the convolutional layer uses tanh as an activation function, the max-pooling layer uses maxporoling as an activation function, the fully-connected layer uses tanh as an activation function, and the output layer uses softmax as an activation function.

6. A hyperspectral image classification apparatus comprising:

7. The apparatus of claim 6, wherein the first data expansion module comprises:

an initialization submodule configured to aggregate

Initializing to an empty set, wherein k represents the number of circulation;

Middle unlabeled training sample X_iIs shown as

Calculating to obtain coefficients

Wherein, i is 1, 2, …, n,

representing the set of training samples in the kth iteration cycle,

λ is the global regularization parameter, m is

The number of training samples;

Computing

Fractional abundance belonging to each class c

An update submodule configured to utilize the samples

To pair

Updating: if it is not

Then the sample is taken

Assigning to category labels

And add the sample to

Four direct neighborhood sample pairs per training sample

Updating to obtain the augmented training set in the kth cycle

And

8. The apparatus of claim 6 or 7, wherein the second data expansion module comprises:

a sample selection submodule configured to pass

Computing a corresponding positive interaction complement space

9. The apparatus of any one of claims 6-8, wherein the single convolutional layer neural network comprises an input layer, a convolutional layer, a max pooling layer, a fully-connected layer, and an output layer in this order, the convolutional layer using tanh as an activation function, the max pooling layer using maxpouling as an activation function, the fully-connected layer using tanh as an activation function, and the output layer using softmax as an activation function.

10. A computing device comprising a memory, a processor, and a computer program stored in the memory and executable by the processor, wherein the processor implements the method of any of claims 1-5 when executing the computer program.