CN114708496A - Remote sensing change detection method based on improved spatial pooling pyramid - Google Patents


Info

Publication number
CN114708496A
CN114708496A
Authority
CN
China
Prior art keywords
layer
output
convolution
level feature
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210240578.9A
Other languages
Chinese (zh)
Inventor
邵攀
高梓昂
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Three Gorges University CTGU
Original Assignee
China Three Gorges University CTGU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Three Gorges University CTGU filed Critical China Three Gorges University CTGU
Priority to CN202210240578.9A priority Critical patent/CN114708496A/en
Publication of CN114708496A publication Critical patent/CN114708496A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • G — PHYSICS
        • G06 — COMPUTING; CALCULATING OR COUNTING
            • G06F — ELECTRIC DIGITAL DATA PROCESSING
                • G06F18/00 — Pattern recognition
                    • G06F18/20 — Analysing
                        • G06F18/21 — Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
                            • G06F18/214 — Generating training patterns; Bootstrap methods, e.g. bagging or boosting
            • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
                • G06N3/00 — Computing arrangements based on biological models
                    • G06N3/02 — Neural networks
                        • G06N3/04 — Architecture, e.g. interconnection topology
                            • G06N3/045 — Combinations of networks
                            • G06N3/047 — Probabilistic or stochastic networks
                            • G06N3/048 — Activation functions
                        • G06N3/08 — Learning methods
                            • G06N3/084 — Backpropagation, e.g. using gradient descent
            • G06T — IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
                • G06T7/00 — Image analysis
                    • G06T7/20 — Analysis of motion
                        • G06T7/246 — Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
                        • G06T7/254 — Analysis of motion involving subtraction of images
                        • G06T7/277 — Analysis of motion involving stochastic approaches, e.g. using Kalman filters
                • G06T2207/00 — Indexing scheme for image analysis or image enhancement
                    • G06T2207/10 — Image acquisition modality
                        • G06T2207/10032 — Satellite or aerial image; Remote sensing
                    • G06T2207/20 — Special algorithmic details
                        • G06T2207/20081 — Training; Learning
                        • G06T2207/20084 — Artificial neural networks [ANN]
                        • G06T2207/20212 — Image combination
                            • G06T2207/20224 — Image subtraction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)

Abstract

A remote sensing change detection method based on an improved spatial pooling pyramid comprises the following steps. Step 1: preprocess the two-period remote sensing images, with X1 and X2 denoting the first- and second-period images, and generate a difference image, denoted DI; then cascade X1, X2 and DI and input the result into a residual neural network with integrated hole (atrous) convolution. Step 2: down-sample the input Xin and extract features, obtaining the high-level feature XH and the low-level feature XL. Step 3: extract features from the high-level feature XH with the improved spatial pooling pyramid and cascade the extracted features into a feature pyramid, denoted FPYR. Step 4: upsample FPYR, cascade the low-level feature XL output in step 2 to the mirrored up-sampled feature layer through a short connection, and then obtain a change probability map through a SoftMax layer. Step 5: train the network parameters by back-propagation using an improved cross-entropy loss function based on the change probability map and the true change map, and perform change detection with the trained parameters. Remote sensing change detection is carried out through the above steps.

Description

Remote sensing change detection method based on improved spatial pooling pyramid
Technical Field
The invention belongs to the technical field of remote sensing, relates to a remote sensing change detection technology, and particularly relates to a remote sensing change detection technology based on an improved spatial pooling pyramid.
Background
With the continuous development of deep learning technology, the deep learning technology has been widely applied in the field of remote sensing. Compared with the traditional machine learning technology, the remote sensing change detection based on deep learning has higher precision, especially for high-resolution remote sensing images.
However, conventional deep-learning-based change detection techniques, for example "End-to-End Change Detection for High Resolution Satellite Images Using Improved UNet++", take only the two-period images as input and cannot fully exploit the difference image space. In addition, change detection usually suffers from a serious class-imbalance problem: the changed area is generally much smaller than the unchanged area. To solve these problems and to effectively exploit the multi-scale characteristics of high-spatial-resolution remote sensing images, the invention provides a remote sensing change detection technique based on an improved spatial pooling pyramid.
Disclosure of Invention
The invention provides a high-resolution remote sensing image change detection method based on an improved spatial pooling pyramid, aiming at the problems that the existing deep learning change detection technology is easily affected by pseudo changes and the detection of the edge of a change area is incomplete.
A remote sensing change detection method based on an improved spatial pooling pyramid comprises the following steps:
step 1: preprocess the two-period remote sensing images, with X1 and X2 denoting the first- and second-period images, and generate the difference image of X1 and X2 by the difference method, denoted DI; then cascade X1, X2 and DI, denoted Xin:
Xin = X1 ⊕ X2 ⊕ DI
where ⊕ denotes the cascade operation; finally, input Xin into the residual neural network with integrated hole convolution;
step 2: use the residual neural network with integrated hole convolution to down-sample Xin and extract features, obtaining the high-level feature XH and the low-level feature XL;
And step 3: for high-level feature XHRespectively extracting features of different scales by using convolution, a position and channel attention mechanism, cavity convolution of three different cavity rates and global maximum pooling, cascading the extracted features to form a feature pyramid, and recording the feature pyramid as FPYR;
step 4: upsample FPYR, cascade the low-level feature XL output in step 2 to the mirrored up-sampled feature layer through a short connection, and then obtain the change probability map through a SoftMax layer;
step 5: train the network parameters by back-propagation using an improved cross-entropy loss function based on the change probability map and the true change map, and perform change detection with the trained network parameters;
and carrying out remote sensing change detection through the steps.
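Step 1's difference image and cascade admit a direct sketch. Assuming channel-first (C, H, W) arrays, a minimal NumPy illustration (not the patent's own code; the function name is illustrative) is:

```python
import numpy as np

def make_network_input(x1, x2):
    """Build DI = |X1 - X2| and the channel cascade X_in = X1 ⊕ X2 ⊕ DI.

    x1, x2: co-registered two-period images, shape (C, H, W).
    Returns the difference image and the cascaded network input."""
    di = np.abs(x1 - x2)
    # cascade (⊕) is concatenation along the channel axis
    x_in = np.concatenate([x1, x2, di], axis=0)
    return di, x_in
```

For two 3-channel images the cascaded input therefore has 9 channels, which is the tensor fed into the residual network.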
Step 2 specifically comprises the following steps:
step 2-1: feed the network input Xin into the convolution layer Conv to preliminarily extract features and increase the number of feature channels;
step 2-2: pass the output of Conv sequentially through a batch-normalization layer, an activation layer and a global max-pooling layer so that the network has nonlinear expression capability;
step 2-3: obtain the low-level feature XL and the high-level feature XH through the residual neural network with integrated hole convolution.
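The hole (atrous) convolution used throughout the network can be illustrated independently of any framework. The naive NumPy sketch below (illustrative only, not the patent's implementation) shows how the hole rate dilates the kernel's receptive field without adding parameters:

```python
import numpy as np

def hole_conv2d(x, kernel, rate=1):
    """Naive 2-D hole (atrous/dilated) convolution with 'valid' padding.

    x:      (H, W) input feature map
    kernel: (kh, kw) convolution kernel
    rate:   hole rate; rate=1 reduces to an ordinary convolution
    """
    kh, kw = kernel.shape
    # effective kernel extent once (rate - 1) zeros sit between taps
    eh, ew = (kh - 1) * rate + 1, (kw - 1) * rate + 1
    H, W = x.shape
    out = np.zeros((H - eh + 1, W - ew + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            # sample the input with stride `rate` under the kernel footprint
            patch = x[i:i + eh:rate, j:j + ew:rate]
            out[i, j] = np.sum(patch * kernel)
    return out
```

With a 3 × 3 kernel, rate 2 gives an effective 5 × 5 receptive field, which is why a stack of such layers can aggregate context without extra down-sampling.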
In step 3, multi-scale feature extraction is performed on the high-level feature XH, specifically:
features of different scales are extracted using convolution, position and channel attention mechanisms, hole convolutions with three different hole rates, and global max pooling;
the output of the position attention mechanism is denoted PAM(XH), the output of the channel attention mechanism CAM(XH), the output of the convolution (comprising a convolution layer, a batch-normalization layer and a ReLU activation layer) Conv1(XH), the outputs of the three hole convolutions (each comprising a convolution layer, a batch-normalization layer and a ReLU activation layer) AConv1(XH), AConv2(XH) and AConv3(XH), and the output of the global max pooling Pool(XH); these outputs are cascaded to obtain the cascaded feature pyramid FPYR, namely:
FPYR = Conv1(XH) ⊕ PAM(XH) ⊕ CAM(XH) ⊕ AConv1(XH) ⊕ AConv2(XH) ⊕ AConv3(XH) ⊕ Pool(XH)
PAM(XH) is obtained as follows: the high-level feature XH obtained in step 2 is first processed by the position attention mechanism, and the number of feature channels is then adjusted with the convolution layer ConvP;
CAM(XH) is obtained as follows: the high-level feature XH obtained in step 2 is first processed by the channel attention mechanism, and the number of feature channels is then adjusted with the convolution layer ConvC.
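At the shape level, building FPYR is plain channel concatenation of the seven branch outputs. The NumPy sketch below is a stand-in: the real branches are Conv1, PAM, CAM, AConv1–3 and global max pooling, replaced here by placeholder 1 × 1 projections purely to illustrate the bookkeeping; all names and sizes are assumptions:

```python
import numpy as np

def cascade(*feats):
    """Channel-wise cascade (⊕): concatenate (C, H, W) maps along axis 0."""
    return np.concatenate(feats, axis=0)

def branch(x, out_ch, seed):
    """Placeholder branch: a random 1x1 projection (C, H, W) -> (out_ch, H, W).
    Stands in for Conv1 / PAM / CAM / AConv1..3 / pooling in this sketch."""
    c = x.shape[0]
    w = np.random.default_rng(seed).standard_normal((out_ch, c))
    return np.einsum('oc,chw->ohw', w, x)

C, Cp, H, W = 8, 4, 16, 16
x_h = np.random.default_rng(0).standard_normal((C, H, W))   # high-level feature X_H
# seven branches, cascaded into the feature pyramid
fpyr = cascade(*[branch(x_h, Cp, s) for s in range(7)])
```

Because each branch keeps the spatial size and emits Cp channels, the cascaded pyramid has 7 · Cp channels at the same resolution as XH.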
The residual neural network with integrated hole convolution consists of several sequentially connected hole-convolution residual blocks. Denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block.
In step 5, the proposed network structure is trained using an improved cross-entropy loss function:
L = -(1/N) · Σi Σk∈{u,c} wk · yik · log(pik)
where N is the total number of pixels, u and c denote the unchanged and changed classes respectively, pik is the probability predicted by the model that pixel i belongs to class k, and yik is a Boolean variable equal to 0 or 1, computed from the class li to which pixel i belongs:
yik = 1 if li = k, otherwise yik = 0
wk is the weight of class k, k ∈ {u, c}, determined by a formula [shown only as an image in the original] involving au and ac, which denote the proportions of the unchanged class u and the changed class c respectively, and the balance coefficient β.
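The loss can be implemented directly from the definitions above. The sketch below is a NumPy version of the class-weighted cross-entropy; the `class_weights` rule is only one plausible instantiation of the wk formula (the patent gives wk only as an image), chosen so the rarer changed class receives the larger weight:

```python
import numpy as np

def weighted_ce(prob, labels, w):
    """Class-weighted cross-entropy.

    prob:   (N, 2) predicted probabilities, column 0 = unchanged (u), 1 = changed (c)
    labels: (N,) ground-truth class indices l_i in {0, 1}
    w:      (2,) class weights (w_u, w_c)
    """
    N = labels.shape[0]
    y = np.eye(2)[labels]                  # y_ik = 1 iff l_i = k, else 0
    return -np.sum(w * y * np.log(prob)) / N

def class_weights(a_u, a_c, beta=1.0):
    """ASSUMED weighting rule, not the patent's exact formula: weight each
    class by the *other* class's proportion raised to beta, then normalize,
    so the scarce changed class is up-weighted."""
    w = np.array([a_c ** beta, a_u ** beta])
    return w / w.sum()
```

With unit weights the expression reduces to ordinary cross-entropy, so the weighting is the only "improvement" this sketch models.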
The network structure based on the improved spatial pooling pyramid for remote sensing image processing comprises an improved spatial pooling pyramid module, which consists of a convolution unit, a position and channel attention mechanism unit, three hole-convolution units with different hole rates, and a global max-pooling unit. The input of the improved spatial pooling pyramid module is connected to the first output of the hole residual module; the first output of the hole residual module is the high-level feature XH and the second output is the low-level feature XL. The output of the improved spatial pooling pyramid module is connected to the input of the first up-sampling module; the low-level feature XL is cascaded with the features output by the first up-sampling module to obtain a new cascaded feature map.
The hole residual module consists of several sequentially connected hole-convolution residual blocks. Denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block.
The input of the hole residual module comprises the first-period remote sensing image X1, the second-period remote sensing image X2 and their difference image DI; the three are cascaded to form the network input Xin, Xin = X1 ⊕ X2 ⊕ DI.
The network input Xin passes sequentially through the convolution layer Conv, the batch-normalization layer, the activation layer and the global max-pooling layer; the output of the global max-pooling layer is connected to the input of the hole residual module.
The size of the feature map output by the first up-sampling module is the same as that of the low-level feature XL; the new cascaded feature map is input into a second up-sampling module after passing through at least one convolution layer, and the size of the feature map output by the second up-sampling module is the same as that of the originally input remote sensing image.
Compared with the prior art, the invention has the following technical effects:
the technical scheme provided by the invention simultaneously considers the original image space and the differential image space of the two-stage high-resolution remote sensing image; the multi-scale characteristic of the high-resolution remote sensing image is considered by improving the spatial pooling pyramid, so that the detection capability of a large change target is ensured; enhancing the detection capability of detail change information by using an attention mechanism in parallel; and a novel loss function is provided to adapt to the unbalance problem of the changed class and the unchanged class. Through the measures, the invention can obtain a better change detection result.
Drawings
The invention is further illustrated by the following examples in conjunction with the accompanying drawings:
FIG. 1: the main framework of the network proposed by the invention.
FIG. 2: schematic diagram of the residual block structure of the residual neural network with integrated hole convolution used in the invention.
FIG. 3: schematic diagram of the residual unit adopted by the invention.
FIG. 4: structure diagram of the improved spatial pooling pyramid proposed by the invention.
FIG. 5: schematic structural diagram of the position attention mechanism adopted by the invention.
FIG. 6: schematic structural diagram of the channel attention mechanism adopted by the invention.
FIG. 7: schematic diagram of the upsampling technique DUpsampling employed by the invention.
FIG. 8: example images of the experimental data used in embodiments of the invention, including the T1 and T2 period images and their change reference map.
FIG. 9: change detection results of six comparison techniques and the invention on the example images.
Detailed Description
The invention discloses a remote sensing change detection method based on an improved spatial pooling pyramid, which comprises the following steps:
step 1: preprocess the two-period remote sensing images, with X1 and X2 denoting the first- and second-period images, and generate the difference image of X1 and X2 by the difference method, denoted DI; then cascade X1, X2 and DI, denoted Xin:
Xin = X1 ⊕ X2 ⊕ DI
where ⊕ denotes the cascade operation; finally, input Xin into the residual neural network with integrated hole convolution;
step 2: use the residual neural network with integrated hole convolution to down-sample Xin and extract features, obtaining the high-level feature XH and the low-level feature XL;
step 3: for the high-level feature XH, extract features of different scales using convolution, position and channel attention mechanisms, hole convolutions with three different hole rates, and global max pooling, respectively; cascade the extracted features to form a feature pyramid, denoted FPYR;
step 4: upsample FPYR, cascade the low-level feature XL output in step 2 to the mirrored up-sampled feature layer through a short connection, and then obtain the change probability map through a SoftMax layer;
step 5: train the network parameters by back-propagation using an improved cross-entropy loss function based on the change probability map and the true change map, and perform change detection with the trained network parameters;
remote sensing change detection is carried out through the above steps.
Step 2 specifically comprises the following steps:
step 2-1: feed the network input Xin into the convolution layer Conv to preliminarily extract features and increase the number of feature channels;
step 2-2: pass the output of Conv sequentially through a batch-normalization layer, an activation layer and a global max-pooling layer so that the network has nonlinear expression capability;
step 2-3: obtain the low-level feature XL and the high-level feature XH through the residual neural network with integrated hole convolution.
In step 3, multi-scale feature extraction is performed on the high-level feature XH, specifically:
features of different scales are extracted using convolution, position and channel attention mechanisms, hole convolutions with three different hole rates, and global max pooling;
the output of the position attention mechanism is denoted PAM(XH), the output of the channel attention mechanism CAM(XH), the output of the convolution (comprising a convolution layer, a batch-normalization layer and a ReLU activation layer) Conv1(XH), the outputs of the three hole convolutions (each comprising a convolution layer, a batch-normalization layer and a ReLU activation layer) AConv1(XH), AConv2(XH) and AConv3(XH), and the output of the global max pooling Pool(XH); these outputs are cascaded to obtain the cascaded feature pyramid FPYR, namely:
FPYR = Conv1(XH) ⊕ PAM(XH) ⊕ CAM(XH) ⊕ AConv1(XH) ⊕ AConv2(XH) ⊕ AConv3(XH) ⊕ Pool(XH)
PAM(XH) is obtained as follows: the high-level feature XH obtained in step 2 is first processed by the position attention mechanism, and the number of feature channels is then adjusted with the convolution layer ConvP;
CAM(XH) is obtained as follows: the high-level feature XH obtained in step 2 is first processed by the channel attention mechanism, and the number of feature channels is then adjusted with the convolution layer ConvC.
The residual neural network with integrated hole convolution consists of several sequentially connected hole-convolution residual blocks. Denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block.
In step 5, the proposed network structure is trained using an improved cross-entropy loss function:
L = -(1/N) · Σi Σk∈{u,c} wk · yik · log(pik)
where N is the total number of pixels, u and c denote the unchanged and changed classes respectively, pik is the probability predicted by the model that pixel i belongs to class k, and yik is a Boolean variable equal to 0 or 1, computed from the class li to which pixel i belongs:
yik = 1 if li = k, otherwise yik = 0
wk is the weight of class k, k ∈ {u, c}, determined by a formula [shown only as an image in the original] involving au and ac, which denote the proportions of the unchanged class u and the changed class c respectively, and the balance coefficient β.
The invention also comprises a network structure based on the improved spatial pooling pyramid for remote sensing image processing, comprising an improved spatial pooling pyramid module, which consists of a convolution unit, a position and channel attention mechanism unit, three hole-convolution units with different hole rates, and a global max-pooling unit. The input of the improved spatial pooling pyramid module is connected to the first output of the hole residual module; the first output of the hole residual module is the high-level feature XH and the second output is the low-level feature XL. The output of the improved spatial pooling pyramid module is connected to the input of the first up-sampling module; the low-level feature XL is cascaded with the features output by the first up-sampling module to obtain a new cascaded feature map.
The hole residual module consists of several sequentially connected hole-convolution residual blocks. Denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block.
The input of the hole residual module comprises the first-period remote sensing image X1, the second-period remote sensing image X2 and their difference image DI; the three are cascaded to form the network input Xin, Xin = X1 ⊕ X2 ⊕ DI.
The network input Xin passes sequentially through the convolution layer Conv, the batch-normalization layer, the activation layer and the global max-pooling layer; the output of the global max-pooling layer is connected to the input of the hole residual module.
The size of the feature map output by the first up-sampling module is the same as that of the low-level feature XL; the new cascaded feature map is input into a second up-sampling module after passing through at least one convolution layer, and the size of the feature map output by the second up-sampling module is the same as that of the originally input remote sensing image.
To facilitate a further understanding of the present invention by those of ordinary skill in the art, the following is further illustrated:
In an embodiment, experiments are performed on the change detection dataset published in S. Ji, S. Wei, and M. Lu, "Fully Convolutional Networks for Multisource Building Extraction From an Open Aerial and Satellite Imagery Data Set," IEEE Transactions on Geoscience and Remote Sensing, vol. 57, pp. 574-586, 2019. The dataset comprises two groups of high-resolution remote sensing images; each group contains two periods of remote sensing images and their true change map. The first group is 21243 × 15354 pixels and the second group is 11265 × 15354 pixels. To facilitate network training, the two large images are divided into groups of 256 × 256-pixel patches, and groups that are completely unchanged or completely changed are removed, leaving 1863 groups of 256 × 256 patches, of which 1250 groups form the training set and 613 groups the test set;
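The patch-extraction rule just described (cut into fixed-size patches, discard all-unchanged and all-changed groups) can be sketched as follows; function and parameter names are illustrative, not from the patent:

```python
import numpy as np

def tile_pairs(img1, img2, ref, size=256):
    """Cut co-registered two-period images and the change reference map into
    size x size patches, discarding patches whose reference is completely
    unchanged (all 0) or completely changed (all 1)."""
    H, W = ref.shape[:2]
    patches = []
    for r in range(0, H - size + 1, size):
        for c in range(0, W - size + 1, size):
            ref_p = ref[r:r + size, c:c + size]
            if ref_p.min() == ref_p.max():        # all-0 or all-1 patch: skip
                continue
            patches.append((img1[r:r + size, c:c + size],
                            img2[r:r + size, c:c + size],
                            ref_p))
    return patches
```

Dropping the degenerate patches keeps the training set focused on tiles that actually contain a change boundary.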
Fig. 1 shows the main framework of the network according to the invention. The remote sensing change detection method based on an improved spatial pooling pyramid disclosed by the invention comprises the following steps:
step 1: preprocess the two-period high-resolution remote sensing images, including registration, relative radiometric correction and the like; denote the first- and second-period images by X1 and X2 respectively, and generate the difference image of X1 and X2 by the difference method, denoted DI and computed as:
DI = |X1 - X2|
then cascade X1, X2 and DI, denoted Xin:
Xin = X1 ⊕ X2 ⊕ DI
where ⊕ denotes the cascade operation; finally, Xin is fed into the proposed network.
Step 2: use the residual neural network with integrated hole convolution, AtrousResNet50, to down-sample the network input Xin and extract features, obtaining the high-level feature XH and the low-level feature XL.
First, the network input Xin is fed into the convolution layer Conv to preliminarily extract features and increase the number of feature channels; it then passes through a batch-normalization (BN) layer, an activation layer (using the ReLU function) and a global max-pooling layer, which give the network nonlinear expression capability and help avoid the degradation caused by continuous convolution; finally, the low-level feature XL and the high-level feature XH are obtained through the residual neural network with integrated hole convolution.
The residual block structure of the residual neural network with integrated hole convolution adopted in this embodiment is shown in Fig. 2. It consists of four hole-convolution residual blocks, each composed of several residual units; specific parameters such as the number of residual units and the residual unit structure can be adjusted for specific applications. In this embodiment, except for the middle layer, each residual unit consists of ordinary convolutions with a stride of 1 and a hole rate of 0; Fig. 3 shows an example of the residual unit structure.
Let the output of the n-th residual block Block be Bn(X), then the low-level feature XLFor the output of the 1 st residual block, i.e. XL=B1(Xin) High level feature XHFor the output of the 4 th residual block, i.e. XH=B4(B3(B2(B1(Xin)))). Respectively obtaining low-level features X through a down-sampling stepLAnd high level feature XHDifferent scale information can be included. In this step, the number of residual blocks, the residual unit structure, the void ratio, and the size and number of convolution kernels can be adjusted according to specific applications. The detailed parameters of the convolutional layer and residual unit and the number of residual blocks used in this embodiment are shown in table 1:
TABLE 1 convolution layer and residual Unit detailed parameters
Figure BDA0003540982740000063
And step 3: and (3) extracting features of different scales from the high-level features by respectively using 1 × 1 convolution, a space and channel attention mechanism, cavity convolution of three different cavity rates and global maximum pooling, and cascading the features to form a feature pyramid. Fig. 4 shows a specific structure of step 3 in this embodiment.
The obtained cascade pyramid characteristics can fully consider the multi-scale characteristics and the spatial context information of the high-resolution remote sensing image. Conv of 1 x 1 convolution layer1The output of (D) is recorded as Conv1(XH) The output of the spatial attention mechanism is denoted PAM (X)H) The output of the channel attention mechanism is denoted as CAM (X)H) Three void convolutional layers AConv1,AConv2,AConv3The outputs are respectively recorded as AConv1(XH),AConv2(XH),AConv3(XH) The output of the global max pooling layer is denoted Pool (X)H) Wherein X isHIs the high-level feature obtained in step 2. Cascading the outputs to obtain a cascaded characteristic pyramid FPYR, i.e.
Figure BDA0003540982740000071
The present invention enhances the ability to characterize detailed objects and object boundaries by cascading spatial attention (PAM) and channel attention features (CAM), a positional attention mechanism as shown in fig. 5 and a channel attention mechanism as shown in fig. 6. PAM (X)H) The specific acquisition method comprises the following steps: firstly, the high-level feature X obtained in the step 2HObtained by the spatial attention mechanism process shown in fig. 5 and then adjusting the number of characteristic channels by using the convolution layer ConvP. CAM (X)H) The specific acquisition method comprises the following steps: firstly, the high-level feature X obtained in the step 2HObtained by performing the channel attention mechanism process shown in fig. 6 and then adjusting the number of characteristic channels by using the convolutional layer ConvC.
Table 2 shows the use of the convolution layer Conv in step 3 of this example1、AConv1、AConv2、AConv3ConvP and ConvC convolution kernel size, number, void rate and step size. The convolutional layers comprise a BN layer and a Relu function activation layer. It should be noted that the size, number, void rate, step size, maximum pooling mode, and number of void convolution layers used in step 3 may all be adjusted according to specific applications.
TABLE 2 Specific parameters of convolution layers Conv1, AConv1, AConv2, AConv3, ConvP and ConvC

[Table image not reproduced in this text.]
Step 4: upsample the cascaded feature pyramid of step 3 using an improved upsampling operation, cascade the low-level feature XL output in step 2 to the upsampled mirror feature layer through a short connection, and finally obtain the change probability map through a SoftMax layer.
First, the cascaded feature pyramid FPYR of step 3 is fed into the convolution layer Conv2 to reduce the number of feature channels, passed through a BN layer and an activation layer (ReLU) to increase nonlinear expression capacity, and then upsampled four times using the improved upsampling technique DUpsampling; the output feature map has the same size as the low-level feature XL, and the two are cascaded as a new feature map.
The cascaded feature map is passed through two convolution layers Conv3 and Conv4 (each including a BN layer and a ReLU activation layer) to adjust the number of channels; the number of convolution kernels may be adjusted according to the specific application. Four-times upsampling is then performed with DUpsampling so that the output feature map has the same size as the input image, and finally the change probability map is obtained from the output feature map through a SoftMax layer.
The improved upsampling technique DUpsampling is an improvement on conventional linear interpolation; its structure is shown in fig. 7. In this embodiment DUpsampling replaces linear upsampling: DUpsampling generates, through convolution, the filled-in values needed to restore the feature resolution, which helps recover the detail information of the feature map, particularly for high-resolution remote sensing images with abundant detail. The detailed parameters of the convolution layers used for channel adjustment in this step are shown in table 3:
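The channel-to-space recovery behind DUpsampling can be sketched as a learned linear projection that expands each pixel's C channels into C_out·r·r values, followed by a sub-pixel rearrangement into an r-times larger map. In the sketch below the projection weight w is random and training is omitted; it illustrates the data-dependent upsampling idea under those assumptions, not the embodiment's trained operator.

```python
import numpy as np

def dupsample(x, w, r):
    """DUpsampling sketch.
    x: feature map of shape (C, H, W)
    w: learned projection of shape (C_out * r * r, C) (random here)
    r: upsampling factor.
    Each pixel's channel vector is projected, then the r*r sub-pixel grid is
    interleaved into the spatial dimensions (a pixel-shuffle rearrangement)."""
    c, h, wd = x.shape
    c_out = w.shape[0] // (r * r)
    flat = x.reshape(c, h * wd)              # (C, H*W)
    proj = w @ flat                          # (C_out*r*r, H*W)
    proj = proj.reshape(c_out, r, r, h, wd)  # split out the sub-pixel grid
    # reorder to (C_out, H, r, W, r) and merge -> (C_out, H*r, W*r)
    out = proj.transpose(0, 3, 1, 4, 2).reshape(c_out, h * r, wd * r)
    return out

x = np.random.rand(16, 4, 4)
w = np.random.rand(8 * 2 * 2, 16)   # C_out = 8, r = 2
y = dupsample(x, w, r=2)
print(y.shape)  # (8, 8, 8)
```

Unlike bilinear interpolation, the projection w is learned jointly with the network, which is what allows the recovered details to be data-dependent.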
TABLE 3 Specific parameters of convolution layers Conv2, Conv3 and Conv4

[Table image not reproduced in this text.]
Step 5: using an improved cross entropy loss function, calculate the loss between the change probability map obtained in step 4 and the true change map, iteratively train the parameters of the network through back propagation until an iteration stop condition is met, and save the parameters at stopping for producing the change detection map.
In practical applications the changed area is often much smaller than the unchanged area, so the remote sensing change detection problem is generally one in which the proportions of the changed and unchanged classes are severely imbalanced. To address this, the invention proposes a new adaptive-weight cross entropy loss function, computes the loss of the change probability map output by the network with this function, and optimizes the network parameters through back propagation.
A weight-adaptive cross entropy loss function suitable for the class imbalance problem is designed on the basis of the cross entropy loss function; the adaptive weights are computed from the proportions of the changed and unchanged classes. The proposed weight-adaptive cross entropy function is:
L = -(1/N) Σ_{i=1}^{N} Σ_{k∈{u,c}} w_k · y_ik · log(p_ik)
where N represents the total number of pixels, u and c represent the unchanged and changed classes respectively, p_ik is the probability predicted by the model that pixel i belongs to class k, and y_ik is a Boolean variable equal to 0 or 1, computed from the class l_i to which pixel i belongs:
y_ik = 1 if l_i = k, and y_ik = 0 otherwise.
w_k is the weight of class k, k ∈ {u, c}, given by the formula
w_k = a_k' + β, where k' denotes the class other than k (so w_u = a_c + β and w_c = a_u + β)
determined accordingly. Here a_u and a_c respectively represent the proportions of the unchanged class u and the changed class c; generally a_u is far greater than 1/2 and a_c is far less than 1/2. β is a balance coefficient that ensures the two class weights are not excessively unbalanced. In this embodiment, β = 1/10.
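Since the weight formula appears in the source only as an equation image, the NumPy sketch below assumes one common reading of the description: each class is weighted by the opposite class's proportion plus the balance coefficient β, so the rare changed class receives the larger weight. It illustrates the adaptive-weight idea under that assumption, not the patent's exact formula.

```python
import numpy as np

def adaptive_weighted_ce(p_change, labels, beta=0.1):
    """Adaptive-weight cross entropy sketch for 2-class change detection.
    p_change: predicted probability of the changed class per pixel, shape (N,)
    labels:   0 = unchanged (u), 1 = changed (c)
    Assumed weight rule (for illustration): w_u = a_c + beta, w_c = a_u + beta,
    where a_u and a_c are the class proportions in the batch."""
    labels = np.asarray(labels, dtype=float)
    a_c = labels.mean()      # proportion of changed pixels
    a_u = 1.0 - a_c          # proportion of unchanged pixels
    w_u = a_c + beta         # rare class proportion boosts the common class little
    w_c = a_u + beta         # common class proportion boosts the rare class a lot
    eps = 1e-12              # avoid log(0)
    loss = -(w_c * labels * np.log(p_change + eps)
             + w_u * (1.0 - labels) * np.log(1.0 - p_change + eps))
    return loss.mean()

# 90% unchanged, 10% changed: the changed class gets the larger weight.
labels = np.array([0] * 9 + [1])
p = np.where(labels == 1, 0.8, 0.1)
print(round(adaptive_weighted_ce(p, labels), 4))  # → 0.0413
```

With β = 1/10 as in the embodiment, neither weight can collapse to zero even when one class is nearly absent from a batch.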
To verify the change detection effect of the invention, it is compared with 6 state-of-the-art deep learning change detection techniques: the fully convolutional Siamese network (FC-Siam-Conv), DeepLab v3+, the dual attention network (DANet), the improved U-shaped network (UNet++), the multi-output-fusion improved U-shaped network (UNet++_MSOF), and the difference-map-based improved U-shaped network (DifUNet++). Four widely used quantitative indicators are used to evaluate the performance of the different change detection techniques: accuracy, precision, recall and the F1 value.
FIG. 9 shows the change detection result maps of the invention and the comparison techniques: respectively FC-Siam-Conv, DeepLab v3+, DANet, UNet++, UNet++_MSOF, DifUNet++, and the invention. Table 4 gives the four quantitative indices of the different change detection results.
TABLE 4 Quantitative indices of the change detection results

[Table image not reproduced in this text.]
Comparing the change detection result maps and the quantitative statistics, the change detection effect of the invention is clearly superior to that of the other state-of-the-art deep learning change detection techniques. In this embodiment the invention achieves better results both in the completeness of the detected changes and in the refinement of their edges. As can be seen from Table 4, the change detection result of the invention surpasses the other techniques in the three indices of accuracy, recall and F1 value. For example, the F1 value of the invention is 0.9046, higher than the other methods by 0.0521, 0.0329, 0.0433, 0.0407, 0.0605 and 0.0284 respectively.
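The four indicators above are standard confusion-matrix quantities for a binary change map; a minimal NumPy sketch of their computation (with illustrative toy maps, not the experiment's data) is:

```python
import numpy as np

def change_detection_metrics(pred, truth):
    """Accuracy, precision, recall and F1 for binary change maps
    (1 = changed, 0 = unchanged), computed from the confusion matrix."""
    pred, truth = np.asarray(pred), np.asarray(truth)
    tp = np.sum((pred == 1) & (truth == 1))   # changed, detected
    tn = np.sum((pred == 0) & (truth == 0))   # unchanged, kept
    fp = np.sum((pred == 1) & (truth == 0))   # false alarm
    fn = np.sum((pred == 0) & (truth == 1))   # missed change
    acc = (tp + tn) / pred.size
    prec = tp / (tp + fp) if tp + fp else 0.0
    rec = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
    return acc, prec, rec, f1

truth = np.array([1, 1, 0, 0, 0, 1])
pred  = np.array([1, 0, 0, 0, 1, 1])
acc, prec, rec, f1 = change_detection_metrics(pred, truth)
print(acc, prec, rec, f1)
```

F1 balances precision and recall, which is why it is the headline index for the strongly imbalanced change detection problem.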

Claims (10)

1. A remote sensing change detection method based on an improved spatial pooling pyramid is characterized by comprising the following steps:
step 1: preprocessing the two-period remote sensing images, denoting the first-period and second-period images as X1 and X2 respectively, and generating a difference image of X1 and X2 by a difference method, recorded as DI; then cascading X1, X2 and DI, denoted Xin:

Xin = Cat(X1, X2, DI)

wherein Cat(·) indicates the cascade operation; finally inputting Xin into the residual neural network integrating void convolution;
step 2: performing down-sampling and feature extraction on Xin using the residual neural network integrating void convolution to obtain the high-level feature XH and the low-level feature XL;
step 3: extracting features of different scales from the high-level feature XH using a convolution branch, position and channel attention mechanisms, three void convolutions of different void rates, and global max pooling respectively, and cascading the extracted features into a feature pyramid, recorded as FPYR;
step 4: upsampling FPYR, cascading the low-level feature XL output in step 2 to the upsampled mirror feature layer through a short connection, and then obtaining the change probability map through a SoftMax layer;
step 5: training the network parameters through back propagation using an improved cross entropy loss function based on the change probability map and the true change map, and performing change detection with the trained network parameters;
and carrying out remote sensing change detection through the steps.
2. The method according to claim 1, characterized in that step 2 specifically comprises the following steps:
step 2-1: feeding the network input Xin into the convolution layer Conv to preliminarily extract features and increase the number of feature channels;
step 2-2: sequentially passing the output of the convolution layer Conv through a batch normalization layer, an activation layer and a global max pooling layer, giving the network nonlinear expression capability;
step 2-3: obtaining the low-level feature XL and the high-level feature XH through the residual neural network integrating void convolution.
3. The method according to claim 1, characterized in that in step 3, multi-scale feature extraction is performed on the high-level feature XH, specifically: features of different scales are extracted using a convolution branch, position and channel attention mechanisms, void convolutions of three different void rates, and global max pooling;
the position attention mechanism output is denoted as PAM (X)H) The channel attention mechanism output is recorded as CAM (X)H) The output of the convolution (including the convolution layer, batch normalization layer and Relu function activation layer) is denoted as Conv1(XH) The outputs of the three hole convolutions (each convolution comprising a convolution layer, a batch normalization layer and a Relu function activation layer) are respectively denoted as AConv1(XH),AConv2(XH),AConv3(XH) The output of the global max pooling is denoted Pool (X)H) And cascading the outputs to obtain a cascaded characteristic pyramid FPYR, namely:
Figure FDA0003540982730000014
4. method according to claim 3, characterized in that the PAM (X)H) The specific acquisition method comprises the following steps: firstly, the high-level feature X obtained in step 2HProcessed by a position attention mechanism, thenAdjusting the number of the characteristic channels by using the convolutional layer ConvP;
the CAM (X)H) The specific acquisition method comprises the following steps: firstly, the high-level feature X obtained in the step 2HThe number of characteristic channels is adjusted by the channel attention mechanism processing and then by the convolutional layer ConvC.
5. The method according to claim 2, characterized in that the residual neural network integrating void convolution is composed of a plurality of sequentially connected void convolution residual blocks; denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block.
6. The method according to claim 1, characterized in that in step 5, the loss is calculated based on the variation probability map and the true variation map using an improved cross-entropy loss function, which is:
L = -(1/N) Σ_{i=1}^{N} Σ_{k∈{u,c}} w_k · y_ik · log(p_ik)
where N represents the total number of pixels, u and c represent the unchanged and changed classes respectively, p_ik is the probability predicted by the model that pixel i belongs to class k, and y_ik is a Boolean variable equal to 0 or 1, computed from the class l_i to which pixel i belongs:
y_ik = 1 if l_i = k, and y_ik = 0 otherwise.
w_k is the weight of class k, k ∈ {u, c}, given by the formula
w_k = a_k' + β, where k' denotes the class other than k (so w_u = a_c + β and w_c = a_u + β)
determined accordingly, where a_u and a_c respectively represent the proportions of the unchanged class u and the changed class c, and β is a balance coefficient.
7. A network structure for remote sensing image processing based on an improved spatial pooling pyramid, characterized in that it comprises an improved spatial pooling pyramid module composed of a convolution unit, position and channel attention mechanism units, three void convolution units with different void rates, and a global max pooling unit; the input of the improved spatial pooling pyramid module is connected to the first output of the void residual module, the first output of the void residual module being the high-level feature XH and the second output being the low-level feature XL; the output of the improved spatial pooling pyramid module is connected to the input of the first upsampling module; the low-level feature XL is cascaded with the features output by the first upsampling module to obtain a new cascaded feature map.
8. The network structure according to claim 7, characterized in that the void residual module is composed of a plurality of sequentially connected void convolution residual blocks; denoting the output of the n-th residual block as Bn(X), the low-level feature XL is the output of the 1st residual block and the high-level feature XH is the output of the last residual block;
the input of the void residual module comprises a first-period remote sensing image X1, a second-period remote sensing image X2 and their difference image DI; the three are cascaded to form the network input Xin:

Xin = Cat(X1, X2, DI).
9. The network structure according to claim 8, characterized in that the network input Xin passes sequentially through the convolution layer Conv, a batch normalization layer, an activation layer and a global max pooling layer, the output of the global max pooling layer being connected to the input of the void residual module.
10. The network structure according to claim 7, characterized in that the feature map output by the first upsampling module has the same size as the low-level feature XL; the new cascaded feature map passes through at least one convolution layer and is then input into a second upsampling module, and the feature map output by the second upsampling module has the same size as the originally input remote sensing image.
CN202210240578.9A 2022-03-10 2022-03-10 Remote sensing change detection method based on improved spatial pooling pyramid Pending CN114708496A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210240578.9A CN114708496A (en) 2022-03-10 2022-03-10 Remote sensing change detection method based on improved spatial pooling pyramid

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210240578.9A CN114708496A (en) 2022-03-10 2022-03-10 Remote sensing change detection method based on improved spatial pooling pyramid

Publications (1)

Publication Number Publication Date
CN114708496A true CN114708496A (en) 2022-07-05

Family

ID=82169726

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210240578.9A Pending CN114708496A (en) 2022-03-10 2022-03-10 Remote sensing change detection method based on improved spatial pooling pyramid

Country Status (1)

Country Link
CN (1) CN114708496A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115063396A (en) * 2022-07-11 2022-09-16 浙江金汇华特种耐火材料有限公司 Preparation system and preparation method of long-life refractory brick
CN116612076A (en) * 2023-04-28 2023-08-18 成都瑞贝英特信息技术有限公司 Cabin micro scratch detection method based on combined twin neural network
CN116612076B (en) * 2023-04-28 2024-01-30 成都瑞贝英特信息技术有限公司 Cabin micro scratch detection method based on combined twin neural network


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination