CN113743450A - Hyperspectral image segmentation method based on non-local feature fusion

Hyperspectral image segmentation method based on non-local feature fusion

Info

Publication number
CN113743450A
CN113743450A (application CN202110818767.5A)
Authority
CN
China
Prior art keywords
output
feature map
multiplied
hyperspectral image
characteristic diagram
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110818767.5A
Other languages
Chinese (zh)
Inventor
郑建炜
刘豪
冯宇超
刘宇
李鹏飞
吴杰
许金山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202110818767.5A
Publication of CN113743450A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/25 Fusion techniques
    • G06F 18/253 Fusion techniques of extracted features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 3/00 Geometric image transformation in the plane of the image
    • G06T 3/40 Scaling the whole image or part thereof
    • G06T 3/4084 Transform-based scaling, e.g. FFT domain scaling
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 7/00 Image analysis
    • G06T 7/10 Segmentation; Edge detection

Abstract

The invention discloses a hyperspectral image segmentation method based on non-local feature fusion. A standard residual block first performs spectral dimension reduction on the hyperspectral image. Two rounds of down-sampling feature fusion follow: in each round, a down-sampling unit and a Non-local module output feature maps of the same size, the Non-local attention mechanism effectively extracts long-distance information, and the two outputs are fused by an addition operation. Over the two rounds the image is reduced to one quarter of its original size and the number of channels is quadrupled, extracting the features of the spectral image. Up-sampling is then performed twice through up-sampling units with residual connections to obtain the output feature map, making full use of spectral and spatial feature information to obtain richer pixel features and global information. Finally, a standard residual block outputs the final probability distribution map. The advantages of convolution are thus fully exploited, the segmentation network is simplified, the parameter count is reduced, and the accuracy and speed of pixel segmentation are significantly improved.

Description

Hyperspectral image segmentation method based on non-local feature fusion
Technical Field
The application belongs to the technical field of image processing, and particularly relates to a hyperspectral image segmentation method based on non-local feature fusion.
Background
Hyperspectral image classification is an important means of earth observation and is widely applied in precision agriculture, military reconnaissance, environmental monitoring and other fields. With the continuous advance of the technology, hyperspectral images, which acquire spectral information over hundreds of contiguous spectral bands of surface objects, have become increasingly capable of discriminating different objects. Meanwhile, the spatial resolution of image data acquired by earth-observation modalities, including hyperspectral imaging, keeps improving, so the traditional pixel-by-pixel processing mode is no longer suitable for such high-spatial-resolution data.
Inspired by the internal structure of the visual system and driven by the emergence of deep learning, hyperspectral image classification has made breakthrough progress. A typical method is the Convolutional Neural Network (CNN), used to extract discriminative spectral-spatial feature maps from hyperspectral images. However, two-dimensional convolution can only extract spatial information from the hyperspectral image; information among spectral bands and global information are lost, which noticeably harms hyperspectral image segmentation.
Disclosure of Invention
The application aims to provide a hyperspectral image segmentation method based on non-local feature fusion, and the accuracy of image pixel segmentation is remarkably improved.
In order to achieve the purpose, the technical scheme adopted by the application is as follows:
a hyperspectral image segmentation method based on non-local feature fusion comprises the following steps:
Step S1, acquiring a hyperspectral image to be segmented, wherein the size of the hyperspectral image to be segmented is D×H×W×N;
Step S2, reducing the dimension of the input hyperspectral image by using a standard residual block, wherein the dimension of the hyperspectral image after dimension reduction is D×H×W×C;
Step S3, respectively inputting the dimension-reduced hyperspectral image into two branches of a down-sampling unit and a Non-local module, and adding the feature maps output by the two branches;
Step S4, respectively inputting the feature map subjected to the adding operation in step S3 into two branches of a down-sampling unit and a Non-local module, and adding the feature maps output by the two branches;
Step S5, passing the feature map after the adding operation in step S4 through an up-sampling unit, and adding the output of the up-sampling unit and the output of step S3;
Step S6, passing the feature map obtained after the adding operation in step S5 through an up-sampling unit, and adding the output of the up-sampling unit and the output of step S2;
Step S7, passing the output feature map of size D×H×W×C obtained after the addition operation in step S6 through a standard residual block with the same structure as that in step S2 to obtain a probability distribution map of dimension D×H×W×N, completing the segmentation of the hyperspectral image, wherein D represents the number of image groups, H the image height, W the image width, C the number of image channels, and N the preset number of categories for hyperspectral image segmentation.
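For orientation, the overall S1 to S7 dataflow can be read as the following minimal PyTorch sketch. Everything in it is illustrative rather than taken from the patent: the class and argument names are invented, the channels-first layout (batch × channels × D × H × W) is an assumption, and the five kinds of sub-blocks are assumed to be implemented as sketched later in the detailed description.

```python
from torch import nn

class NonLocalFusionUNet(nn.Module):
    """Hedged sketch of the S1-S7 pipeline; the sub-blocks are passed in,
    so this only fixes the wiring of the two fusion stages and two skips."""

    def __init__(self, res_in, down1, nl_down1, down2, nl_down2,
                 up1, up2, res_out):
        super().__init__()
        self.res_in, self.res_out = res_in, res_out      # S2 / S7
        self.down1, self.nl_down1 = down1, nl_down1      # S3 branches
        self.down2, self.nl_down2 = down2, nl_down2      # S4 branches
        self.up1, self.up2 = up1, up2                    # S5 / S6

    def forward(self, x):                          # S1: hyperspectral input
        f0 = self.res_in(x)                        # S2: spectral reduction
        f2 = self.down1(f0) + self.nl_down1(f0)    # S3: fuse the two branches
        f3 = self.down2(f2) + self.nl_down2(f2)    # S4: fuse again
        f4 = self.up1(f3) + f2                     # S5: up-sample + skip from S3
        f5 = self.up2(f4) + f0                     # S6: up-sample + skip from S2
        return self.res_out(f5)                    # S7: probability map
```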
Several preferred alternatives are provided below, not as additional limitations on the above general solution but merely as further additions or preferences; each alternative may be combined with the general solution individually, or several alternatives may be combined with one another, in the absence of technical or logical contradiction.
Preferably, the step S2 of performing dimension reduction on the input hyperspectral image by using a standard residual block comprises:
S201, acquiring the hyperspectral image to be segmented, denoted F, with dimension D×H×W×N;
S202, passing the acquired hyperspectral image through a branch of two layers of BN + ReLU6 and 3×3 convolution to obtain a feature map F′ of size D×H×W×C;
S203, adding the feature maps F and F′ to obtain a feature map F″, and then reshaping F″ into a feature map F_0 with dimensions D×H×W×C.
Preferably, in step S3, the dimension-reduced hyperspectral image is respectively input into two branches, a down-sampling unit and a Non-local module, and the feature maps output by the two branches are added, which comprises:
S301, acquiring the dimension-reduced hyperspectral image, denoted F_0, of size D×H×W×C;
S302, passing the hyperspectral image F_0 through the down-sampling unit, i.e. through a 3×3×3 convolution branch and a branch of two layers of BN + ReLU6 and 3×3×3 convolution respectively, and adding the outputs of the two branches to obtain a feature map F_1 of size D×(H/2)×(W/2)×2C;
S303, performing hyperspectral image F0Through a Non-local module, namely, firstly, a residual block is passed through, 3D convolution and reshape operation of 1 multiplied by 1 are carried out in the residual block, and the characteristics output by the residual block and the original hyperspectral image F0Fusing to obtain a feature map F with the size of DxHxWxCoo
S304, and then, converting the feature map FooPerforming Attention operation through three branches of 1 × 1 × 1 3D convolution, respectively converting into D sizeQ×HQ×WQ×CKThe feature map QueryMatrix of D × H × W × CKThe feature map Keyvector and the size of D multiplied by H multiplied by W multiplied by CVThen splitting the channel numbers of the feature maps QueryMatrix, KeyVector and ValueMasx, and obtaining tensors Q, K and V by using Unfold operation according to dimension expansion, wherein the formula is as follows:
QueryMatrix = QueryTransform(F_oo), Q = Unfold(QueryMatrix)
KeyVector = Conv_{C_K}(F_oo), K = Unfold(KeyVector)
ValueMatrix = Conv_{C_V}(F_oo), V = Unfold(ValueMatrix)
where QueryTransform is a 1×1×1 convolution operation, Unfold(·) is a dimension expansion operation, Conv is a 1×1×1 3D convolution operation, and C_K and C_V are preset hyper-parameters related to the image channels;
S305, calculating attention matrices A and O from the tensors Q, K and V as follows:

A = Softmax(QK^T)
O = AV

where Softmax(·) is the Softmax function, K^T is the transpose of tensor K, the dimension of the attention matrix A is D_Q×H_Q×W_Q, and the dimension of the attention matrix O is (D_Q×H_Q×W_Q)×C_V, where D_Q, H_Q and W_Q are determined by the tensor Q and the number of channels C_V is determined by the matrix V;
S306, performing the Multi-Head Attention operation on the tensors Q, K and V: Q, K and V each undergo n linear transformations to obtain n groups of Q, K and V, where n is the preset n-head hyper-parameter; each group Q_i, K_i and V_i is then passed through the Attention operation to obtain the corresponding output Head_i, i = 1, 2, 3, …, n, and finally Head_1, …, Head_n are concatenated to obtain the output:

Head_i = Attention(Q W_i^Q, K W_i^K, V W_i^V)
MultiHead(Q, K, V) = Concat(Head_1, …, Head_n) W^O

where W_i^Q, W_i^K, W_i^V and W^O are trained parameter matrices, and Attention(·) is the Attention operation;
S307, reshaping the obtained MultiHead(Q, K, V) through the Fold operation into a matrix O of unchanged size, and changing the number of output channels to C_O through a 1×1×1 3D convolution to obtain an output Y:

Y = Conv_{C_O}(Fold(MultiHead(Q, K, V)))

where the dimension of Y is determined by the dimension D_Q×H_Q×W_Q×C_K of the feature map QueryMatrix with C_K = C_V = C_O set, Fold(·) is the Fold operation (the inverse of the Unfold operation), and the dimension of Y in this step is D×(H/2)×(W/2)×2C;
S308, the characteristic diagram F obtained in the step S3021And the Y obtained in the step S307 is subjected to addition operation to obtain the dimension of
Figure BDA0003171245100000041
Characteristic diagram F of2
Preferably, in step S4, the feature map after the adding operation of step S3 is respectively input into two branches, a down-sampling unit and a Non-local module, and the feature maps output by the two branches are added, which comprises:
S401, obtaining the feature map F_2 of size D×(H/2)×(W/2)×2C output by step S3;
S402, inputting the feature map F_2 of size D×(H/2)×(W/2)×2C into the two branches respectively, obtaining through the first branch, the down-sampling unit, a feature map F_21 of size D×(H/4)×(W/4)×4C, and through the second branch, the Non-local module, a feature map F_22 of size D×(H/4)×(W/4)×4C;
S403, adding the feature maps F_21 and F_22 to obtain the feature map output F_3 of size D×(H/4)×(W/4)×4C.
Preferably, in step S5, the feature map after the adding operation of step S4 is passed through an up-sampling unit, and the output of the up-sampling unit and the output of step S3 are added, which comprises:
S501, obtaining the feature map F_3 of size D×(H/4)×(W/4)×4C output by step S4;
S502, passing the feature map F_3 through the up-sampling unit, i.e. inputting it into two branches respectively, obtaining through the first branch, a Non-local module, a feature map F_31 of size D×(H/2)×(W/2)×2C, and through the second branch, a 3×3×3 transposed convolution, a feature map F_32 of size D×(H/2)×(W/2)×2C;
S503, adding the feature maps F_31 and F_32 to obtain the feature map output F_33 of dimension D×(H/2)×(W/2)×2C;
S504, adding the feature map F_33 and the feature map F_2 of dimension D×(H/2)×(W/2)×2C from step S3 to obtain a feature map F_4 of dimension D×(H/2)×(W/2)×2C.
Preferably, in step S6, the feature map obtained by the adding operation of step S5 is passed through an up-sampling unit, and the output of the up-sampling unit and the output of step S2 are added, which comprises:
S601, obtaining the feature map F_0 of dimension D×H×W×C output by step S2 and the feature map F_4 of size D×(H/2)×(W/2)×2C output by step S5;
S602, passing the feature map F_4 through the up-sampling unit, i.e. inputting it into two branches respectively, obtaining through the first branch, a Non-local module, a feature map F_41 of size D×H×W×C, and through the second branch, a 3×3×3 transposed convolution, a feature map F_42 of dimension D×H×W×C;
S603, adding the feature maps F_41 and F_42 to obtain the feature map output F_43 of dimension D×H×W×C;
S604, adding the feature map F_43 and the feature map F_0 of dimension D×H×W×C from step S2 to obtain a feature map F_5 of dimension D×H×W×C.
Preferably, in step S7, the output feature map of size D×H×W×C obtained by the addition operation of step S6 is passed through a standard residual block with the same structure as that in step S2 to obtain a probability distribution map with dimension D×H×W×N, completing the hyperspectral image segmentation, which comprises:
S701, obtaining the feature map F_5 of size D×H×W×C after the addition operation of step S6;
S702, passing the feature map F_5 through the standard residual block, i.e. through a branch of two layers of BN + ReLU6 and 3×3 convolution, to obtain a feature map F_51 of size D×H×W×C;
S703, reshaping F_51 into a probability distribution map F_out of dimension D×H×W×N.
The hyperspectral image segmentation method based on non-local feature fusion provided by the application uses a standard residual block to perform spectral dimension reduction on the hyperspectral image. Two rounds of down-sampling feature fusion follow: in each round, a Down-Sampling Block and a Non-local module output feature maps of the same size, the Non-local attention mechanism effectively extracts long-distance information, and the two outputs are fused by an addition operation; over the two rounds the image is reduced to one quarter of its original size and the number of channels is quadrupled, extracting the features of the spectral image. Up-sampling is then performed twice through Up-Sampling Blocks, with residual connections via skip-connections, to obtain the output feature map; this makes full use of spectral and spatial feature information to obtain richer pixel features and global information. Finally, a standard residual block outputs the final probability distribution map. The advantages of convolution are thus fully exploited, the segmentation network is simplified, the parameter count is reduced, and the accuracy and speed of pixel segmentation are significantly improved.
Drawings
FIG. 1 is a flow chart of a hyperspectral image segmentation method based on non-local feature fusion according to the application;
FIG. 2 is a schematic diagram of a standard residual block structure of the present application;
FIG. 3 is a schematic diagram of the structure of a downsampling unit of the present application;
FIG. 4 is a schematic diagram of the present application utilizing Non-local module processing features;
FIG. 5 is a schematic diagram of the structure of a Non-local module of the present application;
fig. 6 is a schematic structural diagram of an upsampling unit of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs. The terminology used in the description of the present application herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the application.
In one embodiment, a hyperspectral image segmentation method based on non-local feature fusion is provided. A non-local feature-fusion U-Net segmentation network extracts long-distance information from the hyperspectral image and aggregates global information through an attention mechanism, and these non-local features are fused with the down-sampled features of a CNN to achieve pixel-level segmentation of the hyperspectral image. The method fully exploits the advantages of three-dimensional convolution, feature fusion and the Non-local attention module to capture global information and obtain pixel segmentation results with high accuracy.
Specifically, as shown in fig. 1, the hyperspectral image segmentation method based on non-local feature fusion in the embodiment includes the following steps:
Step S1, acquiring the hyperspectral image to be segmented, wherein the size of the acquired hyperspectral image to be segmented is D×H×W×N.
In order to improve the data processing speed and accuracy, the acquired hyperspectral image needs to be preprocessed first, and the specific preprocessing step is shown in step S2.
Step S2, reducing the dimension of the input hyperspectral image by using a standard residual block.
In this embodiment, the input hyperspectral image is passed through a standard residual block for dimension reduction, in order to reduce parameters and improve inference speed. As shown in FIG. 2, the dimension reduction specifically comprises the following steps:
S201, acquiring the hyperspectral image to be segmented, denoted F, with dimension D×H×W×N.
S202, passing the acquired hyperspectral image through a branch formed by two layers of BN (batch normalization) + ReLU6 (activation function) and 3×3 convolution to obtain a feature map F′ of size D×H×W×C.
S203, adding the feature maps F and F′ to obtain a feature map F″, and reshaping F″ into a feature map F_0 with dimensions D×H×W×C.
Step S3, respectively inputting the dimension-reduced hyperspectral image into two branches, a Down-Sampling Block and a Non-local module, and adding the feature maps output by the two branches.
S301, obtaining the dimension-reduced hyperspectral image F_0, of size D×H×W×C.
S302, passing the hyperspectral image F_0 through the down-sampling unit, as shown in FIG. 3, i.e. through a 3×3×3 convolution branch and a branch of two layers of BN + ReLU6 and 3×3×3 convolution respectively, and adding the outputs of the two branches to obtain a feature map F_1 of size D×(H/2)×(W/2)×2C.
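A matching sketch of the S302 down-sampling unit, under the sizes reconstructed above; the stride placement (halving H and W while keeping D) and the channel doubling are assumptions drawn from the stated output size D×(H/2)×(W/2)×2C.

```python
from torch import nn

class DownSamplingUnit(nn.Module):
    """Sketch of S302: a strided 3x3x3 conv branch added to a two-layer
    BN + ReLU6 + 3x3x3 conv branch, halving H and W and doubling channels."""

    def __init__(self, in_ch: int, stride=(1, 2, 2)):
        super().__init__()
        out_ch = 2 * in_ch                 # assumed: channels double per step
        self.conv_branch = nn.Conv3d(in_ch, out_ch, 3, stride=stride, padding=1)
        self.res_branch = nn.Sequential(
            nn.BatchNorm3d(in_ch), nn.ReLU6(inplace=True),
            nn.Conv3d(in_ch, out_ch, 3, stride=stride, padding=1),
            nn.BatchNorm3d(out_ch), nn.ReLU6(inplace=True),
            nn.Conv3d(out_ch, out_ch, 3, padding=1),
        )

    def forward(self, f0):
        return self.conv_branch(f0) + self.res_branch(f0)   # F_1
```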
S303, as shown in FIG. 4, the hyperspectral image F0After passing through a Non-local Block (Non-local Block), as shown in fig. 5, a 1 × 1 × 1 3D convolution and reshape operation are performed in the residual Block once through a residual Block, and the output features of the residual Block are compared with the original hyperspectral image F0Fusing to obtain a feature map F with the size of DxHxWxCoo
S304, and then, converting the feature map FooPerforming Attention operation through three branches of 1 × 1 × 1 3D convolution, respectively converting into D sizeQ×HQ×WQ×CKThe feature map QueryMatrix of D × H × W × CKThe feature map Keyvector and the size of D multiplied by H multiplied by W multiplied by CVThe feature map Valuemtrix, then the feature maps QueryMatrix, KeyVector and ValThe channel number of the uematrix is split, tensors Q, K and V are obtained by using Unfold operation according to dimension expansion, and the formula is as follows:
QueryMatrix = QueryTransform(F_oo), Q = Unfold(QueryMatrix)
KeyVector = Conv_{C_K}(F_oo), K = Unfold(KeyVector)
ValueMatrix = Conv_{C_V}(F_oo), V = Unfold(ValueMatrix)
where QueryTransform is a 1×1×1 convolution operation, Unfold(·) is a dimension expansion operation, Conv is a 1×1×1 3D convolution operation, and C_K and C_V are preset hyper-parameters related to the image channels.
S305, calculating attention matrices a and O based on tensors Q, K and V as follows:
Figure BDA0003171245100000075
O=AV
wherein Softmax (-) is a Softmax function, KTIs the transpose of tensor K, and the dimension of the attention matrix A is DQ×HQ×WQThe dimension of the attention matrix O is (D)Q×HQ×WQ)×CVWherein D isQ、HQAnd WQNumber of channels C determined by tensor QVDetermined by the matrix V.
S306, conducting Multi-Head orientation operation on the tensors Q, K and V, obtaining n groups Q, K and V by Q, K and V through n times of linear transformation, wherein n is preset n-Head hyper-parameters, n corresponds to the manually set n-Head hyper-parameters, and then each group Qi、KiAnd ViObtaining corresponding output Head through Attention operationiI 1,2,3 … n, and finally splicing Head1…HeadnObtaining an output:
Headi=Atttention(QWi Q,KWi K,VWi V)
MultiHead(Q,K,V)=Concat(Head1,…,Headn)Wi O
in the formula (I), the compound is shown in the specification,
Figure BDA0003171245100000081
is a trained parameter matrix, Attention (. cndot.) is the Attenttion operation.
S307, the obtained Multihead (Q, K, V) is reshaped into a size-invariant (i.e. still being (D) by the Fold operationQ×HQ×WQ) ) is processed by a 3D convolution of 1 × 1 × 1 to change the number of output channels to COObtaining an output Y:
Figure BDA0003171245100000082
in the formula, the dimension of Y is the dimension D of the characteristic diagram QueryMatrixQ×HQ×WQ×CKDetermine and set CK=CV=COFold (. cndot.) is the number of passes by Fold operation, and the dimension of Y in this step is
Figure BDA0003171245100000083
S308, the characteristic diagram F obtained in the step S3021And the Y obtained in the step S307 is subjected to addition operation to obtain the dimension of
Figure BDA0003171245100000084
Characteristic diagram F of2
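Steps S303 to S307 can be read as a multi-head global attention over all flattened 3D positions. The sketch below fills in several details the patent leaves to its figures, all of which are assumptions: the query transform is taken as a strided 1×1×1 convolution so that Y lands directly at the down-sampled size, Unfold/Fold are realized as reshape/flatten between channels-first tensors and (positions × channels) matrices, and the per-head softmax is scaled by 1/sqrt(d) as in standard attention (the patent's formula shows only Softmax(QK^T)).

```python
from torch import nn
import torch.nn.functional as F  # 'F' here is torch functional, not a feature map

class NonLocalBlock3D(nn.Module):
    """Sketch of S303-S307: multi-head global attention over all 3D
    positions, with C_K = C_V = C_O as the patent sets."""

    def __init__(self, in_ch: int, out_ch: int, heads: int = 4,
                 stride=(1, 2, 2)):
        super().__init__()
        assert out_ch % heads == 0
        self.heads = heads
        # S303: residual pre-block, a 1x1x1 conv fused with the input.
        self.pre = nn.Conv3d(in_ch, in_ch, kernel_size=1)
        # S304: three 1x1x1 conv branches; the query is strided (assumed)
        # so the attention output lands at the down-sampled size.
        self.query = nn.Conv3d(in_ch, out_ch, 1, stride=stride)
        self.key = nn.Conv3d(in_ch, out_ch, 1)
        self.value = nn.Conv3d(in_ch, out_ch, 1)
        self.out = nn.Conv3d(out_ch, out_ch, 1)   # S307: 1x1x1 output conv

    def forward(self, f0):
        foo = f0 + self.pre(f0)                       # S303: F_oo
        q, k, v = self.query(foo), self.key(foo), self.value(foo)
        n, c, d, h, w = q.shape

        def unfold(t):   # S304 Unfold: split heads, flatten 3D positions
            n_, c_ = t.shape[:2]
            return t.reshape(n_, self.heads, c_ // self.heads, -1).transpose(-1, -2)

        q, k, v = unfold(q), unfold(k), unfold(v)     # (n, heads, pos, c/heads)
        scale = q.shape[-1] ** -0.5                   # assumed 1/sqrt(d) scaling
        att = F.softmax(q @ k.transpose(-1, -2) * scale, dim=-1)   # S305: A
        o = att @ v                                   # S305: O = A V
        o = o.transpose(-1, -2).reshape(n, c, d, h, w)   # S307: Fold
        return self.out(o)                            # S307: Y
```

With in_ch = C and out_ch = 2C this produces the Y of S307, which the caller adds to F_1 exactly as in S308.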
And step S4, inputting the feature map subjected to feature fusion in step S3 into two branches of a downsampling unit and a Non-local module respectively, and adding the feature maps output by the two branches.
S401, obtaining the size output by the step S3 as
Figure BDA0003171245100000085
Characteristic diagram F of2
S402, setting the size to be
Figure BDA0003171245100000086
Characteristic diagram F of2Respectively inputting two branches, and obtaining the size of the down-sampling unit by the first branch
Figure BDA0003171245100000087
Characteristic diagram F of21The second branch of the Non-local module is used to obtain the size of
Figure BDA0003171245100000088
Characteristic diagram F of22(ii) a Wherein the structures of the down-sampling unit and the Non-local module are identical to those of step S3.
S403, comparing feature maps F21And feature map F22Performing an addition operation to obtain a size of
Figure BDA0003171245100000089
Characteristic diagram output F of3
In this implementation, three-dimensional convolution extracts spectral-spatial features, which are projected into three linear spaces; the channels of the spectral image are then expanded by the multi-head attention mechanism and the Unfold operation, and a self-attention-like mechanism fuses the global information of a feature map of arbitrary dimension along the channel dimension, extracting long-distance information while reducing the parameter count and the inference time.
The Non-local attention module focuses on the global information in a given image to extract hyperspectral image information. For the three-dimensional convolution and down-sampling operations, besides obtaining information from the spectral-spatial feature map, the correlation among different channels can be exploited to improve the feature representation; adding the down-sampled feature map to the global feature map of the Non-local module then yields a feature map rich in both global and local information.
Through the non-local feature-fusion U-Net network and the Non-local attention module, this embodiment fully acquires global and long-distance information to achieve image pixel segmentation.
Step S5, passing the feature map subjected to feature fusion in step S4 through an Up-Sampling Block, and adding it to the output feature map of step S3.
S501, obtaining the feature map F_3 of size D×(H/4)×(W/4)×4C output by step S4.
S502, passing the feature map F_3 through the up-sampling unit, as shown in FIG. 6, i.e. inputting it into two branches respectively, obtaining through the first branch, a Non-local module, a feature map F_31 of size D×(H/2)×(W/2)×2C, and through the second branch, a 3×3×3 transposed convolution, a feature map F_32 of size D×(H/2)×(W/2)×2C.
S503, adding the feature maps F_31 and F_32 to obtain the feature map output F_33 of dimension D×(H/2)×(W/2)×2C.
S504, adding the feature map F_33 and the feature map F_2 of dimension D×(H/2)×(W/2)×2C from step S3 to obtain a feature map F_4 of dimension D×(H/2)×(W/2)×2C.
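The S502/S602 up-sampling unit admits a symmetric sketch: a 3×3×3 transposed-convolution branch added to a Non-local branch. Making the Non-local branch's query transform itself a transposed convolution (so its output already has the enlarged size), the stride (1, 2, 2), the output_padding values and the channel halving are all assumptions chosen to match the sizes given above.

```python
from torch import nn
import torch.nn.functional as F

class UpSamplingUnit(nn.Module):
    """Sketch of S502/S602: a 3x3x3 transposed-conv branch added to a
    Non-local branch whose query transform is also a transposed conv,
    so both branches double H and W and halve the channels (assumed)."""

    def __init__(self, in_ch: int, heads: int = 4, stride=(1, 2, 2)):
        super().__init__()
        out_ch = in_ch // 2
        assert out_ch % heads == 0
        self.heads = heads
        opad = tuple(s - 1 for s in stride)
        self.transposed = nn.ConvTranspose3d(in_ch, out_ch, 3, stride=stride,
                                             padding=1, output_padding=opad)
        self.query = nn.ConvTranspose3d(in_ch, out_ch, 1, stride=stride,
                                        output_padding=opad)
        self.key = nn.Conv3d(in_ch, out_ch, 1)
        self.value = nn.Conv3d(in_ch, out_ch, 1)
        self.out = nn.Conv3d(out_ch, out_ch, 1)

    def _attend(self, f):                      # Non-local branch (F_31 / F_41)
        q, k, v = self.query(f), self.key(f), self.value(f)
        n, c, d, h, w = q.shape
        split = lambda t: t.reshape(t.shape[0], self.heads,
                                    t.shape[1] // self.heads, -1).transpose(-1, -2)
        q, k, v = split(q), split(k), split(v)
        att = F.softmax(q @ k.transpose(-1, -2) * q.shape[-1] ** -0.5, dim=-1)
        o = (att @ v).transpose(-1, -2).reshape(n, c, d, h, w)
        return self.out(o)

    def forward(self, f):
        return self.transposed(f) + self._attend(f)   # F_31 + F_32 -> F_33
```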
Step S6, passing the feature map obtained after the adding operation in step S5 through an Up-Sampling Block, and adding the output of the Up-Sampling Block to the output of step S2.
S601, obtaining the feature graph F with the dimension of D multiplied by H multiplied by W multiplied by C output by the step S20And the size output of step S5 is
Figure BDA0003171245100000097
Characteristic diagram F of4
S602, feature map F4By means of an upsampling unit, i.e. by inputting two branches separately, a profile F of size DxHxWxC is obtained by means of the first branch of the Non-local module41After the second branch of the transposed convolution of 3x3x3, a feature map F with dimensions D × H × W × C is obtained42
S603, matching the feature map F41And feature map F42Performing an addition operation to obtain a feature map output F with dimensions D × H × W × C43
S604, converting the characteristic diagram F43And a feature map F with dimensions D × H × W × C in step S20Adding operation is carried out to obtain a feature map F with dimensions of D multiplied by H multiplied by W multiplied by C5
To restore the pixel characteristics of the original image, up-sampling is performed twice through Up-Sampling Blocks; this reduces the channel dimension, and the manually set hyper-parameters allow the specified image size to be restored.
Step S7, passing the output feature map of size D×H×W×C obtained after the addition operation in step S6 through a standard residual block with the same structure as that in step S2 to obtain a probability distribution map of dimension D×H×W×N, completing the segmentation of the hyperspectral image.
S701, obtaining a characteristic diagram F after the addition operation of the step S65And the size is D × H × W × C.
S702, obtaining a characteristic diagram F5Obtaining a feature map F with the size of D × H × W × C through standard residual block, namely through a branch of two-layer BN + ReLu6 and 3 × 3 convolution51
S703, adding F51Remodelling into a probability distribution map F with dimensions D × H × W × Nout. Wherein D represents the number of image groupsH represents the image height, W represents the image width, C represents the number of image channels, and N represents the number of categories preset for hyperspectral image segmentation.
In this embodiment, the standard residual block turns the input feature map of size D×H×W×C into the final probability distribution map of dimension D×H×W×N, where N is the number of classes of the segmentation task, so that the final class segmentation probability map is output in combination with the class information of the task. It should be noted that the adding operation in this embodiment covers both add and concat; in practice, the specific mode is chosen according to the feature fusion result, which is conventional in feature processing and not detailed here.
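Putting the sketches together, a toy shape check (all sizes are illustrative; the patent's D×H×W×C layout is mapped to PyTorch's channels-first layout with a batch axis added, and letting the final residual block map C channels to the N classes stands in for the patent's reshape):

```python
import torch  # assumes the sketches above are in scope

D, H, W, n_bands, C, n_classes = 4, 16, 16, 8, 16, 5    # illustrative only
x = torch.randn(1, n_bands, D, H, W)    # S1 input, channels-first layout

net = NonLocalFusionUNet(
    res_in=StandardResidualBlock(n_bands, C),                    # S2
    down1=DownSamplingUnit(C), nl_down1=NonLocalBlock3D(C, 2 * C),          # S3
    down2=DownSamplingUnit(2 * C), nl_down2=NonLocalBlock3D(2 * C, 4 * C),  # S4
    up1=UpSamplingUnit(4 * C), up2=UpSamplingUnit(2 * C),        # S5 / S6
    res_out=StandardResidualBlock(C, n_classes),                 # S7
)
probs = net(x).softmax(dim=1)   # probability distribution over the N classes
print(probs.shape)              # torch.Size([1, 5, 4, 16, 16])
```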
The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (7)

1. A hyperspectral image segmentation method based on non-local feature fusion is characterized by comprising the following steps:
Step S1, acquiring a hyperspectral image to be segmented, wherein the size of the hyperspectral image to be segmented is D×H×W×N;
Step S2, reducing the dimension of the input hyperspectral image by using a standard residual block, wherein the dimension of the hyperspectral image after dimension reduction is D×H×W×C;
Step S3, respectively inputting the dimension-reduced hyperspectral image into two branches of a down-sampling unit and a Non-local module, and adding the feature maps output by the two branches;
Step S4, respectively inputting the feature map subjected to the adding operation in step S3 into two branches of a down-sampling unit and a Non-local module, and adding the feature maps output by the two branches;
Step S5, passing the feature map after the adding operation in step S4 through an up-sampling unit, and adding the output of the up-sampling unit and the output of step S3;
Step S6, passing the feature map obtained after the adding operation in step S5 through an up-sampling unit, and adding the output of the up-sampling unit and the output of step S2;
Step S7, passing the output feature map of size D×H×W×C obtained after the addition operation in step S6 through a standard residual block with the same structure as that in step S2 to obtain a probability distribution map of dimension D×H×W×N, completing the segmentation of the hyperspectral image, wherein D represents the number of image groups, H the image height, W the image width, C the number of image channels, and N the preset number of categories for hyperspectral image segmentation.
2. The non-local feature fusion-based hyperspectral image segmentation method according to claim 1, wherein the step S2 of performing dimensionality reduction on the input hyperspectral image by using a standard residual block comprises:
S201, acquiring the hyperspectral image to be segmented, denoted F, with dimension D×H×W×N;
S202, passing the acquired hyperspectral image through a branch of two layers of BN + ReLU6 and 3×3 convolution to obtain a feature map F′ of size D×H×W×C;
S203, adding the feature maps F and F′ to obtain a feature map F″, and then reshaping F″ into a feature map F_0 with dimensions D×H×W×C.
3. The method for segmenting the hyperspectral image based on Non-local feature fusion according to claim 1, wherein in the step S3, the hyperspectral image after dimensionality reduction is respectively input into two branches of a downsampling unit and a Non-local module, and feature maps output by the two branches are added, and the method comprises:
S301, acquiring the dimension-reduced hyperspectral image, denoted F_0, of size D×H×W×C;
S302, passing the hyperspectral image F_0 through the down-sampling unit, i.e. through a 3×3×3 convolution branch and a branch of two layers of BN + ReLU6 and 3×3×3 convolution respectively, and adding the outputs of the two branches to obtain a feature map F_1 of size D×(H/2)×(W/2)×2C;
S303, performing hyperspectral image F0Through a Non-local module, namely, firstly, a residual block is passed through, 3D convolution and reshape operation of 1 multiplied by 1 are carried out in the residual block, and the characteristics output by the residual block and the original hyperspectral image F0Fusing to obtain a feature map F with the size of DxHxWxCoo
S304, and then, converting the feature map FooPerforming Attention operation through three branches of 1 × 1 × 1 3D convolution, respectively converting into D sizeQ×HQ×WQ×CKThe feature map QueryMatrix of D × H × W × CKThe feature map Keyvector and the size of D multiplied by H multiplied by W multiplied by CVThen splitting the channel numbers of the feature maps QueryMatrix, KeyVector and ValueMasx, and obtaining tensors Q, K and V by using Unfold operation according to dimension expansion, wherein the formula is as follows:
QueryMatrix = QueryTransform(F_oo), Q = Unfold(QueryMatrix)
KeyVector = Conv_{C_K}(F_oo), K = Unfold(KeyVector)
ValueMatrix = Conv_{C_V}(F_oo), V = Unfold(ValueMatrix)
where QueryTransform is a 1×1×1 convolution operation, Unfold(·) is a dimension expansion operation, Conv is a 1×1×1 3D convolution operation, and C_K and C_V are preset hyper-parameters related to the image channels;
S305, calculating attention matrices A and O from the tensors Q, K and V as follows:

A = Softmax(QK^T)
O = AV

where Softmax(·) is the Softmax function, K^T is the transpose of tensor K, the dimension of the attention matrix A is D_Q×H_Q×W_Q, and the dimension of the attention matrix O is (D_Q×H_Q×W_Q)×C_V, where D_Q, H_Q and W_Q are determined by the tensor Q and the number of channels C_V is determined by the matrix V;
S306, performing the Multi-Head Attention operation on the tensors Q, K and V: Q, K and V each undergo n linear transformations to obtain n groups of Q, K and V, where n is the preset n-head hyper-parameter; each group Q_i, K_i and V_i is then passed through the Attention operation to obtain the corresponding output Head_i, i = 1, 2, 3, …, n, and finally Head_1, …, Head_n are concatenated to obtain the output:

Head_i = Attention(Q W_i^Q, K W_i^K, V W_i^V)
MultiHead(Q, K, V) = Concat(Head_1, …, Head_n) W^O

where W_i^Q, W_i^K, W_i^V and W^O are trained parameter matrices, and Attention(·) is the Attention operation;
S307, the obtained MultiHead(Q, K, V) is reshaped by the Fold operation into a matrix O of unchanged size, which is then processed by a 1×1×1 3D convolution to change the number of output channels to C_O, obtaining an output Y:

Y = Conv_{C_O}(Fold(MultiHead(Q, K, V)))

where the dimension of Y is determined by the dimension D_Q×H_Q×W_Q×C_K of the feature map QueryMatrix with C_K = C_V = C_O set, Fold(·) is the Fold operation (the inverse of the Unfold operation), and the dimension of Y in this step is D×(H/2)×(W/2)×2C;
S308, the characteristic diagram F obtained in the step S3021And the Y obtained in the step S307 is subjected to addition operation to obtain the dimension of
Figure FDA0003171245090000032
Characteristic diagram F of2
4. The hyperspectral image segmentation method based on Non-local feature fusion according to claim 3, wherein in the step S4, the feature map after the adding operation in the step S3 is respectively input into two branches of a down-sampling unit and a Non-local module, and the feature map output by the two branches is added, and the method comprises the following steps:
S401, obtaining the feature map F_2 of size D×(H/2)×(W/2)×2C output by step S3;
S402, inputting the feature map F_2 of size D×(H/2)×(W/2)×2C into the two branches respectively, obtaining through the first branch, the down-sampling unit, a feature map F_21 of size D×(H/4)×(W/4)×4C, and through the second branch, the Non-local module, a feature map F_22 of size D×(H/4)×(W/4)×4C;
S403, adding the feature maps F_21 and F_22 to obtain the feature map output F_3 of size D×(H/4)×(W/4)×4C.
5. The hyperspectral image segmentation method based on non-local feature fusion according to claim 4, wherein in the step S5, the feature map after the adding operation in the step S4 passes through an upsampling unit, and the adding operation is performed on the output of the upsampling unit and the output of the step S3, and the method comprises:
S501, obtaining the feature map F_3 of size D×(H/4)×(W/4)×4C output by step S4;
S502, passing the feature map F_3 through the up-sampling unit, i.e. inputting it into two branches respectively, obtaining through the first branch, a Non-local module, a feature map F_31 of size D×(H/2)×(W/2)×2C, and through the second branch, a 3×3×3 transposed convolution, a feature map F_32 of size D×(H/2)×(W/2)×2C;
S503, adding the feature maps F_31 and F_32 to obtain the feature map output F_33 of dimension D×(H/2)×(W/2)×2C;
S504, adding the feature map F_33 and the feature map F_2 of dimension D×(H/2)×(W/2)×2C from step S3 to obtain a feature map F_4 of dimension D×(H/2)×(W/2)×2C.
6. The hyperspectral image segmentation method based on non-local feature fusion according to claim 5, wherein in the step S6, the feature map obtained by the addition operation in the step S5 passes through an upsampling unit, and the addition operation is performed on the output of the upsampling unit and the output of the step S2, and the method comprises:
S601, obtaining the feature map F_0 of dimension D×H×W×C output by step S2 and the feature map F_4 of size D×(H/2)×(W/2)×2C output by step S5;
S602, passing the feature map F_4 through the up-sampling unit, i.e. inputting it into two branches respectively, obtaining through the first branch, a Non-local module, a feature map F_41 of size D×H×W×C, and through the second branch, a 3×3×3 transposed convolution, a feature map F_42 of dimension D×H×W×C;
S603, adding the feature maps F_41 and F_42 to obtain the feature map output F_43 of dimension D×H×W×C;
S604, adding the feature map F_43 and the feature map F_0 of dimension D×H×W×C from step S2 to obtain a feature map F_5 of dimension D×H×W×C.
7. The hyperspectral image segmentation method based on non-local feature fusion of claim 1, wherein in the step S7, the step S6 is performed to obtain the output feature map with the size of D × H × W × C, and the output feature map passes through the standard residual block with the same structure as the standard residual block in the step S2 to obtain the probability distribution map with the dimension of D × H × W × N, so as to complete hyperspectral image segmentation, and the method comprises:
S701, obtaining the feature map F_5 of size D×H×W×C after the addition operation of step S6;
S702, passing the feature map F_5 through the standard residual block, i.e. through a branch of two layers of BN + ReLU6 and 3×3 convolution, to obtain a feature map F_51 of size D×H×W×C;
S703, reshaping F_51 into a probability distribution map F_out of dimension D×H×W×N.
CN202110818767.5A 2021-07-20 2021-07-20 Hyperspectral image segmentation method based on non-local feature fusion Pending CN113743450A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110818767.5A CN113743450A (en) 2021-07-20 2021-07-20 Hyperspectral image segmentation method based on non-local feature fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110818767.5A CN113743450A (en) 2021-07-20 2021-07-20 Hyperspectral image segmentation method based on non-local feature fusion

Publications (1)

Publication Number Publication Date
CN113743450A 2021-12-03

Family

ID=78728809

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110818767.5A Pending CN113743450A (en) 2021-07-20 2021-07-20 Hyperspectral image segmentation method based on non-local feature fusion

Country Status (1)

Country Link
CN (1) CN113743450A (en)


Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110874842A (en) * 2019-10-10 2020-03-10 浙江大学 Chest cavity multi-organ segmentation method based on cascade residual full convolution network
CN112541918A (en) * 2020-12-23 2021-03-23 山东师范大学 Three-dimensional medical image segmentation method based on self-attention mechanism neural network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ZHENGYANG WANG et al., "Non-Local U-Nets for Biomedical Image Segmentation", The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), vol. 34, no. 04, pages 6316-6318 *
卫佳杰, "Research on Semantic Segmentation Methods for Dual-Source Remote Sensing Images Based on U-Net and an Attention Mechanism" (in Chinese), China Masters' Theses Full-text Database, pages 1-94 *
郑欣, "Research on Recognition and Analysis Methods for Breast Tumor Tissue Microarrays Based on Microscopic Hyperspectral Imaging" (in Chinese), China Masters' Theses Full-text Database, pages 44-49 *


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination