CN110263845B - SAR image change detection method based on a semi-supervised adversarial deep network - Google Patents

SAR image change detection method based on a semi-supervised adversarial deep network

Info

Publication number
CN110263845B
CN110263845B (application CN201910527007.1A)
Authority
CN
China
Prior art keywords: layer, network, sample, training, output
Prior art date
Legal status
Active
Application number
CN201910527007.1A
Other languages
Chinese (zh)
Other versions
CN110263845A (en)
Inventor
王英华 (Wang Yinghua)
杨振东 (Yang Zhendong)
王剑 (Wang Jian)
刘宏伟 (Liu Hongwei)
秦庆喜 (Qin Qingxi)
Current Assignee
Xidian University
Original Assignee
Xidian University
Priority date
Filing date
Publication date
Application filed by Xidian University
Priority claimed from CN201910527007.1A
Publication of CN110263845A
Application granted
Publication of CN110263845B
Legal status: Active

Classifications

    • G Physics > G06 Computing; calculating or counting > G06F Electric digital data processing > G06F18/00 Pattern recognition > G06F18/20 Analysing > G06F18/21 Design or setup of recognition systems or techniques; extraction of features in feature space; blind source separation:
        • G06F18/214 Generating training patterns; bootstrap methods, e.g. bagging or boosting
        • G06F18/2155 Generating training patterns characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling
        • G06F18/24 Classification techniques
    • G Physics > G06 Computing; calculating or counting > G06N Computing arrangements based on specific computational models > G06N3/00 Based on biological models > G06N3/02 Neural networks > G06N3/04 Architecture, e.g. interconnection topology

Abstract

The invention discloses an SAR image change detection method based on a semi-supervised adversarial deep network, which mainly addresses the high false-alarm rate and inaccurate detection regions of existing change-detection techniques when labeled data are scarce. The scheme is as follows: 1) compute the log-ratio difference map of the two images from the bi-temporal SAR image data; 2) extract training samples and test samples from the bi-temporal SAR images and the difference map; 3) construct a change-detection dual network and two discrimination networks; 4) perform supervised training with the labeled data and adversarial training and co-training with the unlabeled data, obtaining a trained detection network; 5) feed the test data into the trained change-detection network to obtain the change-detection result. By exploiting a large amount of unlabeled data, the invention extracts separable change-detection features, improves the generalization of the supervised model when labeled training samples are insufficient, and can be used for SAR image change detection.

Description

SAR image change detection method based on a semi-supervised adversarial deep network
Technical Field
The invention belongs to the technical field of radar image processing, and particularly relates to a change-detection method for SAR images that can be used for disaster monitoring, land investigation and target reconnaissance.
Background
An SAR system is little affected by weather and illumination conditions and can observe the earth around the clock in all weather, so change detection on multi-temporal SAR data is an important means of analyzing surface change and is widely applied to disaster monitoring, land investigation and target reconnaissance.
Traditional SAR change-detection methods fall into three classes. The first is unsupervised change detection, such as the method based on principal component analysis and k-means clustering proposed by Celik et al.; the SAR change-detection method based on image fusion and fuzzy clustering proposed by Gong et al.; and the change-detection method based on SIFT key-point detection and region information proposed by Yan Wang et al. The second is supervised change detection, which uses labeled data as training samples, such as the kernel-based change-detection method proposed by Camps-Valls et al. and the change-detection method based on matching pursuit by Yu Li et al. The third is semi-supervised change detection, such as the SAR image change-detection method based on a neighborhood clustering kernel proposed by Lu Jia et al. and the semi-supervised SAR image change-detection algorithm based on random fields and maximum entropy proposed by Li Nan et al.
Among these three classes, the unsupervised methods need no labeled data and are therefore the mainstream in the field of change detection, but lacking the supervision and guidance of labels, their results usually differ considerably from the real change area and produce many false alarms. Supervised methods achieve good results when a large amount of labeled data is available, but in real scenarios labeled data are expensive to acquire and are therefore usually scarce; in that case supervised methods perform poorly and the models generalize badly. Compared with both, semi-supervised change detection can learn jointly from a small amount of labeled data and a large amount of unlabeled data, extract separable features, and improve detection performance. Existing semi-supervised SAR change-detection methods, however, are generally built on traditional semi-supervised machine learning: the input features of the model usually have to be designed by hand, so all the information in the raw data cannot be exploited effectively, which causes high false-alarm rates and low detection accuracy and limits their performance.
Disclosure of Invention
Aiming at the defects of the three existing classes of SAR change-detection methods, the invention provides an SAR image change detection method based on a semi-supervised adversarial deep network, which exploits a large amount of unlabeled data to improve detection accuracy and reduce the false-alarm rate when the labeled sample set is small.
The technical scheme of the invention is as follows: first, a small number of labeled samples and a large number of unlabeled samples are extracted with a sliding window; then the labeled and unlabeled samples are used together to train a deep neural network model; after the model converges, the trained network is applied to the test data to obtain the final change-detection map. The implementation comprises the following steps:
(1) Compute the log-ratio difference map K of the two images from the bi-temporal SAR image data;
(2) Extract training samples and test samples from the bi-temporal SAR images and the difference map with a sliding window; randomly select 4% of the training samples as labeled training samples and treat the rest as unlabeled training samples;
(3) Build the training network model:
(3a) Set up the SAR change-detection dual networks Ψ1 and Ψ2.
Each network has six layers. The first four layers are shared: the first layer is a fully connected layer L1, the second an activation layer ReLU, the third a fully connected layer L2, and the fourth an activation layer ReLU. The fifth and sixth layers are not shared:
the fifth layer of the first network Ψ1 is a fully connected layer L13 and its sixth layer is a Softmax classifier layer S11;
the fifth layer of the second network Ψ2 is a fully connected layer L23 and its sixth layer is a Softmax classifier layer S21.
(3b) Set up two discrimination networks D1 and D2.
The two discrimination networks are identical, each composed of six layers: the first is a fully connected layer, the second an activation layer ReLU, the third a fully connected layer, the fourth an activation layer, the fifth a fully connected layer, and the sixth a Softmax classifier layer.
(3c) Connect the dual network with the two discrimination networks: attach the first discrimination network D1 after the first detection network Ψ1 and the second discrimination network D2 after the second detection network Ψ2, forming the training network model;
(4) Feed the training sample data into the training network model constructed in (3) and perform, in turn, supervised training with the labeled data, adversarial training with the unlabeled data, and co-training, obtaining the trained change-detection network Ψ;
(5) Feed the test sample data into the trained change-detection network Ψ to obtain the change-detection result of the SAR image.
Compared with the prior art, the invention has the following advantages:
1) By exploiting the bi-temporal SAR data, the invention can extract separable change-detection features from a large amount of unlabeled data even when the labeled sample set is small, improving change-detection performance;
2) The invention exploits the strength of deep learning on classification tasks; by combining the dual-network structure with adversarial training and co-training, the two networks promote each other during training, which ultimately improves change-detection performance.
Drawings
FIG. 1 is a general flow chart of an implementation of the present invention;
FIG. 2 is a schematic diagram of constructing an input sample in the present invention;
FIG. 3 is a graph of experimental data;
fig. 4 is a graph showing the result of the change detection of the SAR image shown in fig. 3 by the present invention and the conventional method.
Detailed Description
The embodiments and effects of the present invention will be described in detail below with reference to the attached drawings:
referring to fig. 1, the implementation steps of the present invention are as follows:
and step 1, calculating a corresponding logarithmic ratio difference map according to the two-phase SAR image.
Calculating a corresponding logarithmic ratio difference graph K by using the two-phase SAR image data:
Figure BDA0002098559190000031
wherein I is 1 For the first time phase diagram of the original SAR image, I 2 Is the second time-phase diagram of the original SAR image.
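The log-ratio map of step 1 can be sketched with numpy as follows. The function name `log_ratio_map` and the small `eps` guard against log(0) are illustrative assumptions, not part of the patent:

```python
import numpy as np

def log_ratio_map(i1: np.ndarray, i2: np.ndarray, eps: float = 1e-6) -> np.ndarray:
    """Log-ratio difference map K = |log(I2 / I1)| of two co-registered
    SAR intensity images; eps (an assumption, the patent does not say how
    zero-valued pixels are handled) avoids division by zero and log(0)."""
    return np.abs(np.log((i2 + eps) / (i1 + eps)))

# Tiny illustrative images: equal pixels give K = 0, ratios give |log ratio|.
i1 = np.array([[1.0, 4.0], [2.0, 8.0]])
i2 = np.array([[1.0, 1.0], [4.0, 2.0]])
k = log_ratio_map(i1, i2, eps=0.0)
```

Because of the absolute value, K is symmetric in the two acquisition dates: swapping I1 and I2 yields the same difference map.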
Step 2: extract training samples and test samples from the bi-temporal SAR images and the difference map with a sliding window.
Referring to fig. 2, this step is implemented as follows:
2.1) Set the sliding-window size to N×N with window centre (i, j). For training samples, slide the window over a designated region to select image blocks; for test samples, slide it over the whole image;
2.2) In the two temporal images I1 and I2, select the N×N image blocks x1 and x2 centred at (i, j), and concatenate x1 and x2 along the first dimension to obtain the first sample channel of size 2N×N;
2.3) For the same pixel (i, j) of the bi-temporal images I1 and I2, select the N×N image block k centred at (i, j) in the difference map K, compute the scalar m from this block, and extend m into a matrix of size 2N×N as the second sample channel;
2.4) When selecting training samples, slide the window over the designated region and form each training sample from the first and second sample channels; when selecting test samples, slide the window over the whole image and form each test sample in the same way;
2.5) Randomly select 4% of the training samples as labeled training samples; treat the rest as unlabeled training samples.
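The sample construction of steps 2.2) and 2.3) can be sketched as below. The helper name `extract_sample` is hypothetical, and using the block mean for the scalar m is an assumption (the corresponding formula is rendered as an image in the original):

```python
import numpy as np

def extract_sample(i1, i2, k, i, j, n):
    """Build one two-channel sample centred at (i, j):
    channel 1 stacks the N x N patches of I1 and I2 into a 2N x N block;
    channel 2 tiles a scalar statistic m of the difference-map patch
    (its mean here, an assumed choice) into a 2N x N block."""
    h = n // 2                                   # n assumed odd
    p1 = i1[i - h:i - h + n, j - h:j - h + n]    # N x N patch from I1
    p2 = i2[i - h:i - h + n, j - h:j - h + n]    # N x N patch from I2
    ch1 = np.concatenate([p1, p2], axis=0)       # first channel, 2N x N
    pk = k[i - h:i - h + n, j - h:j - h + n]     # N x N patch from K
    m = pk.mean()                                # scalar m from the K-patch
    ch2 = np.full((2 * n, n), m)                 # extended to 2N x N
    return np.stack([ch1, ch2])                  # sample of shape 2 x 2N x N

img = np.arange(100, dtype=float).reshape(10, 10)
sample = extract_sample(img, img + 1.0, np.zeros((10, 10)), i=5, j=5, n=3)
```

Sliding (i, j) over the designated region (training) or the whole image (test) then yields the full sample sets of step 2.4).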
Step 3: construct the training network model.
3.1) Set up the SAR change-detection dual networks Ψ1 and Ψ2.
Both detection networks have six layers; the first four layers are shared and the last two are not:
3.11) Parameter settings and relations of the first four (shared) layers of Ψ1 and Ψ2:
The first fully connected layer L1 has 1000 neurons; it extracts shallow features from the training and test samples and produces a 1000-dimensional output vector;
The second layer is an activation layer ReLU that applies a nonlinear mapping to the output of the preceding fully connected layer:

ReLU(x) = max(0, x),

where x is the input and ReLU(x) the output; the input and output dimensions of this layer are identical;
The third layer is a fully connected layer L2 with 1000 neurons that extracts deeper features from the shallow features output by the preceding fully connected layer; it produces a 1000-dimensional output vector;
The fourth layer is an activation layer ReLU whose function and principle match those of the second-layer ReLU;
3.12) Ψ1 fifth- and sixth-layer settings and relations:
The fifth layer of Ψ1 is a fully connected layer L31 that extracts class-discriminative features from the output of the last activation layer; its input dimension is 1000 and its output dimension is 2;
The sixth layer of Ψ1 is a Softmax classifier layer S11 that converts the 2-dimensional vector output by L31 into classification probabilities, i.e. the probabilities that the current input sample belongs to the changed and unchanged classes, and classifies the sample according to these probabilities;
3.13) Ψ2 fifth- and sixth-layer settings and relations:
The fifth layer of Ψ2 is a fully connected layer L32 that extracts class-discriminative features from the output of the last activation layer; its input dimension is 1000 and its output dimension is 2;
The sixth layer of Ψ2 is a Softmax classifier layer S21 that converts the 2-dimensional vector output by L32 into classification probabilities, i.e. the probabilities that the current input sample belongs to the changed and unchanged classes, and classifies the sample according to these probabilities.
3.2) Set up two discrimination networks D1 and D2.
The two discrimination networks are identical, each composed of six layers: the first a fully connected layer, the second an activation layer ReLU, the third a fully connected layer, the fourth an activation layer, the fifth a fully connected layer and the sixth a Softmax classifier layer. The parameter settings and relations of these layers are as follows:
The first fully connected layer has 1000 neurons; it extracts shallow discriminative features from the network input and produces a 1000-dimensional output vector;
The second layer is an activation layer ReLU that applies a nonlinear mapping to the output of the preceding fully connected layer; its input and output dimensions are identical;
The third layer is a fully connected layer with 1000 neurons that extracts deeper discriminative features from the shallow features output by the preceding fully connected layer; it produces a 1000-dimensional output vector;
The fourth layer is an activation layer ReLU;
The fifth layer is a fully connected layer with 2 neurons that reduces the preceding layer's output to a 2-dimensional vector for the subsequent classification-probability computation;
The sixth layer is a Softmax classifier layer that converts the 2-dimensional vector output by the preceding layer into two classification probabilities, i.e. the probabilities that the current input sample comes from the true distribution or was produced by the generator, and judges the sample according to these probabilities.
3.3) Connect the dual networks built in 3.1) with the two discrimination networks built in 3.2): attach the first discrimination network D1 after the sixth layer of the first detection network Ψ1 and the second discrimination network D2 after the fifth layer of the second detection network Ψ2, forming the training network model.
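A minimal numpy sketch of the detection dual network of step 3.1): a shared FC(1000) -> ReLU -> FC(1000) -> ReLU trunk with two non-shared FC(2) -> Softmax heads. The input size (N = 7), the random initialisation and all variable names are illustrative assumptions; the patent specifies only the layer types and widths. The discrimination networks of 3.2) have the same FC/ReLU/Softmax building blocks and could be sketched the same way:

```python
import numpy as np

rng = np.random.default_rng(0)

def dense(n_in, n_out):
    """Randomly initialised fully connected layer: (weights, bias)."""
    return rng.normal(0.0, np.sqrt(2.0 / n_in), (n_in, n_out)), np.zeros(n_out)

def relu(x):
    return np.maximum(0.0, x)

def softmax(x):
    e = np.exp(x - x.max())        # numerically stable softmax
    return e / e.sum()

# Shared trunk: FC(1000) -> ReLU -> FC(1000) -> ReLU.
d_in = 2 * (2 * 7) * 7             # flattened 2-channel 2N x N sample, N = 7 (illustrative)
w1, b1 = dense(d_in, 1000)
w2, b2 = dense(1000, 1000)
# Non-shared heads: FC(2) -> Softmax, one per detection network.
w31, b31 = dense(1000, 2)          # head of psi_1
w32, b32 = dense(1000, 2)          # head of psi_2

def detect(x, wh, bh):
    h = relu(relu(x @ w1 + b1) @ w2 + b2)   # shared layers
    return softmax(h @ wh + bh)             # P(changed), P(unchanged)

x = rng.normal(size=d_in)          # one flattened input sample
p1 = detect(x, w31, b31)           # psi_1 output probabilities
p2 = detect(x, w32, b32)           # psi_2 output probabilities
```

Because the trunk weights w1, w2 are shared, the two networks differ only in their heads, matching the shared/non-shared split described above.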
Step 4: feed the training sample data into the training network model constructed in step 3 and perform, in turn, supervised training with the labeled data, adversarial training with the unlabeled data, and co-training, obtaining the trained change-detection network Ψ.
4.1) Supervised training with the labeled data:
4.11) Feed the labeled data into the detection dual networks Ψ1 and Ψ2 and compute, from the labels of the input samples, the loss L_s1 of detection network Ψ1 and the loss L_s2 of detection network Ψ2:

L_s1 = -log( exp(v_l) / (exp(v_1) + exp(v_2)) )

L_s2 = -log( exp(z_l) / (exp(z_1) + exp(z_2)) )

where v_1, v_2 are the two values of the 2-dimensional vector output by the last fully connected layer of network Ψ1, z_1, z_2 are the corresponding values for network Ψ2, and l is the correct class label of the current input sample (l = 1 means the input sample is of the changed class, l = 2 of the unchanged class);
4.12) Using the computed loss L_s1 of the first detection network Ψ1 and loss L_s2 of the second detection network Ψ2, update the parameters of Ψ1 and of Ψ2 by back-propagation with a gradient-descent algorithm;
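The supervised loss of step 4.11) is a two-class softmax cross-entropy; a small numeric sketch (the function name is hypothetical, and the loss formula is reconstructed since it appears as an image in the original):

```python
import numpy as np

def supervised_loss(v, label):
    """Softmax cross-entropy -log(exp(v_l) / (exp(v_1) + exp(v_2))) for a
    2-dimensional logit vector v and correct class label l in {1, 2}
    (1 = changed, 2 = unchanged), as described in step 4.11)."""
    v = np.asarray(v, dtype=float)
    e = np.exp(v - v.max())                    # stable softmax numerators
    return float(-np.log(e[label - 1] / e.sum()))

loss_even = supervised_loss([0.0, 0.0], 1)     # uninformative logits: loss = log 2
loss_conf = supervised_loss([10.0, 0.0], 1)    # confident and correct: near-zero loss
```

The loss is minimal when the logit of the correct class dominates, which is exactly what the gradient-descent update of 4.12) drives the networks toward.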
4.2) Adversarial training with the unlabeled data:
4.2.1) Treat the first detection network Ψ1 as a generator network P1 which, together with the first discriminator network D1, forms the first generative adversarial network GAN1. Adversarial training with the unlabeled data is realized as follows:
First, feed real samples and generated samples in turn into the first discrimination network D1 and compute the first discrimination-network loss L_D1 from the labels of the input samples:

L_D1 = -log p_r(x) - log p_f(y_1), with p_r = exp(O_r1) / (exp(O_r1) + exp(O_f1)) and p_f = exp(O_f1) / (exp(O_r1) + exp(O_f1)),

where O_r1 and O_f1 are the two values of the 2-dimensional vector output by the last fully connected layer of the discrimination network, corresponding respectively to the real-sample class and the generated-sample class, x is a real sample, and y_1 is the output of network P1;
Then compute the gradient of the first discrimination-network loss and update the parameters of the first discrimination network D1 by back-propagation with a gradient-descent algorithm;
Next, let the first generator network P1 generate samples and feed them into the first discrimination network D1, and compute the first generator-network loss L_P1:

L_P1 = -log p_r(y_1),

with O_r1, O_f1 and y_1 as above;
Finally, compute the gradient of the first generator-network loss and update the parameters of the first generator network P1 by back-propagation with a gradient-descent algorithm;
4.2.2) Treat the first five layers of the second detection network Ψ2 as a generator network P2 which, together with the second discriminator network D2, forms the second generative adversarial network GAN2, and perform adversarial training:
In the first step, feed real samples and generated samples in turn into the second discriminator network D2 and compute the second discrimination-network loss L_D2 from the labels of the input samples:

L_D2 = -log p_r(x) - log p_f(y_2), with p_r = exp(O_r2) / (exp(O_r2) + exp(O_f2)) and p_f = exp(O_f2) / (exp(O_r2) + exp(O_f2)),

where O_r2 and O_f2 are the two values of the 2-dimensional vector output by the last fully connected layer of the second discrimination network, corresponding respectively to the real-sample class and the generated-sample class, x is a real sample, and y_2 is the output of network P2;
In the second step, compute the gradient of the second discrimination-network loss and update the parameters of D2 by back-propagation;
In the third step, let the second generator network P2 generate samples and feed them into the second discrimination network D2, and compute the second generator-network loss L_P2:

L_P2 = -log p_r(y_2);

In the fourth step, compute the gradient of the second generator-network loss and update the parameters of the second generator network P2 by back-propagation with a gradient-descent algorithm.
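Since the discriminators of steps 4.2.1) and 4.2.2) end in a two-class (real vs. generated) Softmax, their losses reduce to standard GAN losses. The exact loss formulas are rendered as images in the original, so the forms below are an assumed reconstruction; all function names are illustrative:

```python
import numpy as np

def two_way_probs(o):
    """Softmax over the discriminator's 2-dim output (o_real, o_fake)."""
    e = np.exp(np.asarray(o, dtype=float) - np.max(o))
    return e / e.sum()

def discriminator_loss(o_real_sample, o_generated_sample):
    """-log p_real(x) - log p_fake(y): the discriminator is rewarded for
    judging real samples real and generated samples generated."""
    return float(-np.log(two_way_probs(o_real_sample)[0])
                 - np.log(two_way_probs(o_generated_sample)[1]))

def generator_loss(o_generated_sample):
    """-log p_real(y): the generator tries to make its samples be judged real."""
    return float(-np.log(two_way_probs(o_generated_sample)[0]))

# Discriminator logits (o_real, o_fake) for one real and one generated sample:
d_loss = discriminator_loss([2.0, -2.0], [-2.0, 2.0])  # discriminator doing well: small loss
g_loss = generator_loss([-2.0, 2.0])                   # generator fooling nobody: large loss
```

Alternating gradient steps on these two losses is the update scheme of steps 4.2.1) and 4.2.2): the discriminator descends on its loss, then the generator descends on its own while the discriminator is held fixed.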
4.3) Feed the unlabeled data into both detection networks Ψ1 and Ψ2 simultaneously and perform the following co-training:
4.31) The first detection network Ψ1 extracts class features from the unlabeled data and classifies them with its Softmax classifier, yielding for each unlabeled sample a pseudo-label PL1 and a classification probability vector py. The confidence of the classification of the i-th unlabeled sample is computed as con_yi = max(py_i) and compared with a preset confidence threshold T_y; the samples whose confidence exceeds the threshold are added to the labeled samples as supervised training samples for the second detection network Ψ2;
4.32) The second detection network Ψ2 extracts pattern features from the unlabeled data and classifies them with its Softmax classifier, yielding for each unlabeled sample a pseudo-label PL2 and a classification probability vector pz. The confidence of the classification of the i-th unlabeled sample is computed as con_zi = max(pz_i) and compared with a preset confidence threshold T_z; the samples whose confidence exceeds the threshold are added to the labeled samples as supervised training samples for the first detection network Ψ1;
Iterating 4.1) supervised training with the labeled data, 4.2) adversarial training with the unlabeled data and 4.3) co-training with the unlabeled data yields the trained change-detection network Ψ.
Step 5: feed the test sample data into the trained change-detection network Ψ to obtain the change-detection result of the SAR image.
The effect of the invention can be further illustrated by the following experimental data:
First, experimental conditions
1) Experimental data
This experiment uses four sets of SAR image data, as shown in fig. 3, in which:
Figs. 3 (a1) and 3 (a2) are images acquired by the ERS-2 SAR sensor in August 2003 and May 2004, respectively, with image size 256×256, and fig. 3 (a3) is the corresponding change-detection reference map; this group is called the San Francisco data.
Figs. 3 (b1) and 3 (b2) are SAR images acquired by the Radarsat-1 sensor in July 1997 and August 1997, respectively, with image size 290×350, and fig. 3 (b3) is the corresponding change-detection reference map; this group is called the Ottawa data.
Figs. 3 (c1) and 3 (c2) are SAR images acquired by the Radarsat-2 sensor in June 2008 and June 2009, respectively, with image size 289×257, and fig. 3 (c3) is the corresponding change-detection reference map; this group is called the Yellow River Farmland I data.
Figs. 3 (d1) and 3 (d2) are SAR images acquired by the Radarsat-2 sensor in June 2008 and June 2009, respectively, with image size 291×306, and fig. 3 (d3) is the corresponding change-detection reference map; this group is called the Yellow River Farmland II data.
2) Evaluation criteria
The experimental results are evaluated with the following criteria: false-alarm rate FA, missed-detection rate MD, overall error rate OE, percentage of correct classification PCC, and Kappa coefficient KC.
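The evaluation criteria can be computed from a binary confusion matrix. The normalisations below follow common SAR change-detection usage and are an assumption, since the patent does not define them explicitly:

```python
import numpy as np

def change_detection_metrics(pred, ref):
    """FA, MD, OE, PCC and Kappa for binary change maps (1 = changed).
    FA is normalised by the unchanged reference pixels and MD by the
    changed ones; OE and PCC by all pixels (assumed conventions)."""
    pred, ref = np.asarray(pred).ravel(), np.asarray(ref).ravel()
    n = pred.size
    tp = int(np.sum((pred == 1) & (ref == 1)))   # detected changes
    fp = int(np.sum((pred == 1) & (ref == 0)))   # false alarms
    fn = int(np.sum((pred == 0) & (ref == 1)))   # missed detections
    tn = int(np.sum((pred == 0) & (ref == 0)))   # correct non-changes
    fa = fp / max(tn + fp, 1)                    # false-alarm rate
    md = fn / max(tp + fn, 1)                    # missed-detection rate
    oe = (fp + fn) / n                           # overall error rate
    pcc = (tp + tn) / n                          # percentage correct classification
    # Expected chance agreement for the Kappa coefficient:
    pe = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / n ** 2
    kc = (pcc - pe) / (1 - pe) if pe < 1 else 1.0
    return {"FA": fa, "MD": md, "OE": oe, "PCC": pcc, "KC": kc}

m = change_detection_metrics(pred=[1, 1, 0, 0, 0, 1], ref=[1, 0, 0, 0, 1, 1])
```

KC corrects PCC for chance agreement, which matters here because unchanged pixels usually dominate SAR change-detection scenes.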
Second, experimental details
Experiment one: a change-detection comparison on the data between the invention, the deep-neural-network supervised algorithm DNN (A1), the adversarial-autoencoder semi-supervised algorithm SAAE (A2), and the semi-supervised deep neural network with co-training SSDC (A3). The performance-parameter comparison is shown in Table 1.
TABLE 1. Comparison of performance parameters between the method of the invention and related models
(Table 1 is rendered as an image in the original document.)
In Table 1, the semi-supervised deep neural network SSDC (A3) experiment corresponds to the invention without its adversarial-training part, and the deep neural network DNN (A1) experiment uses the same network as the invention but only the supervised training process.
As Table 1 shows, the invention obtains the best results and also performs more stably than the other methods. The comparison with the SSDC semi-supervised experiment shows that adding adversarial training is justified and improves the classification results.
Experiment two: a change-detection comparison on the data between the invention and existing unsupervised methods: the PCA and k-means algorithm PCAKM, the Gabor-transform and two-stage-clustering method GaborTLC, the PCANet-based method, and the extreme-learning-machine-based method ELM. The performance-parameter comparison is shown in Table 2.
TABLE 2. Comparison of performance parameters between the method of the invention and existing unsupervised methods
(Table 2 is rendered as an image in the original document.)
As Table 2 shows, the invention performs better: the semi-supervised model extracts discriminative information from both labeled and unlabeled samples, and the co-training algorithm improves generalization by introducing pseudo-labeled training samples, so the invention obtains better detection results than the existing methods.
Experiment three: change-detection results on the data obtained by the method of the invention and by the existing methods used in experiments one and two; the results are shown in fig. 4, wherein:
FIG. 4 (a 1) is a graph showing the results of the A2 method on San Francisco data;
FIG. 4 (a 2) is a graph showing the results of the A1 method on San Francisco data;
FIG. 4 (A3) is a graph showing the results of the A3 method on San Francisco data;
FIG. 4 (a 4) is a graph showing the results of the detection of the present invention on San Francisco data;
FIG. 4 (a 5) is a graph of the real variation area of the San Francisco data;
FIG. 4 (a 6) is a graph showing the results of the PCAKM method on San Francisco data;
FIG. 4 (a 7) is a graph showing the detection results of GaborTLC on San Francisco data;
FIG. 4 (a 8) is a graph showing the results of the PCANet method on San Francisco data;
FIG. 4 (a 9) is a graph showing the detection results of the ELM method on San Francisco data;
FIG. 4 (b 1) is a graph showing the results of the A2 method on Ottawa data;
FIG. 4 (b 2) is a graph showing the results of the A1 method on Ottawa data;
FIG. 4 (b 3) is a graph showing the results of the A3 method on Ottawa data;
FIG. 4 (b 4) is a graph showing the detection results of the present invention on Ottawa data;
FIG. 4 (b 5) is a graph of the true change area of Ottawa data;
FIG. 4 (b 6) is a graph showing the detection result of the PCAKM method on Ottawa data;
FIG. 4 (b 7) is a graph showing the detection results of GaborTLC method on Ottawa data;
FIG. 4 (b 8) is a graph showing the detection result of the PCANet method on Ottawa data;
FIG. 4 (b 9) is a graph showing the detection result of the ELM method on Ottawa data;
FIG. 4 (c 1) is a graph showing the detection result of the A2 method on Yellow River Farmland I data;
FIG. 4 (c 2) is a graph showing the detection result of the A1 method on Yellow River Farmland I data;
FIG. 4 (c 3) is a graph showing the detection result of the A3 method on Yellow River Farmland I data;
FIG. 4 (c 4) is a graph showing the detection result of Yellow River Farmland I data according to the present invention;
FIG. 4 (c 5) is a graph of the true variation area of Yellow River Farmland I data;
FIG. 4 (c 6) is a graph showing the detection result of the PCAKM method on Yellow River Farmland I data;
FIG. 4 (c 7) is a graph showing the detection results of GaborTLC method on Yellow River Farmland I data;
FIG. 4 (c 8) is a graph showing the detection result of the PCANet method on Yellow River Farmland I data;
FIG. 4 (c 9) is a graph showing the detection results of the ELM method on Yellow River Farmland I data;
FIG. 4 (d 1) is a graph showing the detection result of the A2 method on Yellow River Farmland II data;
FIG. 4 (d 2) is a graph showing the detection result of the A1 method on Yellow River Farmland II data;
FIG. 4 (d 3) is a graph showing the detection result of the A3 method on Yellow River Farmland II data;
FIG. 4 (d 4) is a graph showing the detection result of Yellow River Farmland II data according to the present invention;
FIG. 4 (d 5) is a graph of the true variation area of Yellow River Farmland II data;
FIG. 4 (d 6) is a graph showing the detection result of the PCAKM method on Yellow River Farmland II data;
FIG. 4 (d 7) is a graph showing the detection results of GaborTLC method on Yellow River Farmland II data;
FIG. 4 (d 8) is a graph showing the detection result of the PCANet method on Yellow River Farmland II data;
FIG. 4 (d 9) is a graph showing the detection results of the ELM method on Yellow River Farmland II data.
As can be seen from FIG. 4, the detection result maps of the present invention are closest to the real change region maps and reflect the shapes of the change regions more accurately, demonstrating a better detection effect.
The above description is only one specific example of the invention and does not constitute any limitation of the invention. It will be apparent to those skilled in the art that various modifications and changes in form and detail may be made without departing from the principles and structure of the invention; such modifications and changes based on the idea of the invention still fall within the scope of the claims of the invention.

Claims (5)

1. A SAR image change detection method based on a semi-supervised adversarial deep network, characterized by comprising the following steps:
(1) calculating a logarithmic-ratio difference map K of the two images from the two temporal-phase SAR image data;
(2) extracting training samples and test samples from the two temporal-phase SAR images and the difference map by means of a sliding window, randomly selecting 4% of the training samples as labeled training samples, and taking the rest as unlabeled training samples;
(3) Building a training network model:
(3a) setting the SAR change detection dual networks Ψ1 and Ψ2:
each network consists of six layers, of which the first four are shared layers: the first layer is a fully connected layer L1, the second layer is an activation function layer ReLU, the third layer is a fully connected layer L2, and the fourth layer is an activation function layer ReLU; the fifth and sixth layers are non-shared layers, wherein:
the fifth layer of the first network Ψ1 is a fully connected layer L31, and its sixth layer is a Softmax classifier layer S11;
the fifth layer of the second network Ψ2 is a fully connected layer L32, and its sixth layer is a Softmax classifier layer S21;
(3b) setting two discrimination networks D1 and D2:
the two discrimination networks are identical, each consisting of a six-layer structure: the first layer is a fully connected layer, the second layer is an activation function layer ReLU, the third layer is a fully connected layer, the fourth layer is an activation function layer ReLU, the fifth layer is a fully connected layer, and the sixth layer is a Softmax classifier layer;
(3c) connecting the dual networks with the two discrimination networks, i.e. connecting the first discrimination network D1 after the first detection network Ψ1 and the second discrimination network D2 after the second detection network Ψ2, to form the training network model;
(4) inputting the training sample data into the training network model constructed in step (3), and iteratively performing, in sequence, supervised training on the labeled data, adversarial training on the unlabeled data, and co-training, to obtain a trained change detection network Ψ;
the supervised training on the labeled data is to feed the labeled data into the detection dual networks Ψ1 and Ψ2 for supervised training; the loss function used is the two-class cross-entropy loss:
L_Ψ1 = -log( exp(v_i) / (exp(v_1) + exp(v_2)) )
L_Ψ2 = -log( exp(z_i) / (exp(z_1) + exp(z_2)) )
wherein v_1, v_2 are the two values of the two-dimensional vector output by the last fully connected layer of network Ψ1, z_1, z_2 are the corresponding two values for network Ψ2, and i is the correct class label of the current input sample: i = 1 indicates that the input sample belongs to the changed class, and i = 2 indicates that it belongs to the unchanged class;
the adversarial training on the unlabeled data is realized as follows:
(4a) taking the first detection network Ψ1 as a generator network P1, which together with the first discriminator network D1 forms the first generative adversarial network GAN1, and performing adversarial training on the unlabeled data:
4a1) feeding the real samples and the generated samples in turn into the first discriminator network D1, and computing the first discriminator loss function L_D1 according to the labels of the input samples:
L_D1 = -log( exp(O_r1) / (exp(O_r1) + exp(O_f1)) ) for a real input sample,
L_D1 = -log( exp(O_f1) / (exp(O_r1) + exp(O_f1)) ) for a generated input sample,
wherein O_r1 and O_f1 are the two values of the two-dimensional vector output by the last fully connected layer of the first discrimination network, corresponding to the scores of being judged a real sample and a generated sample respectively, and y1 is the output of network P1;
4a2) computing the gradient of the first discriminator loss function, and updating the parameters of the first discriminator network D1 by back-propagation with the gradient descent algorithm;
4a3) feeding the samples generated by the first generator network P1 into the first discrimination network D1, and computing the first generator loss function L_P1:
L_P1 = -log( exp(O_r1) / (exp(O_r1) + exp(O_f1)) ),
evaluated on the generated sample y1, wherein O_r1 and O_f1 are as defined above;
4a4) computing the gradient of the first generator loss function, and updating the parameters of the first generator network P1 by back-propagation with the gradient descent algorithm;
(4b) taking the first five layers of the second detection network Ψ2 as a generator network P2, which together with the second discriminator network D2 forms the second generative adversarial network GAN2, and performing adversarial training:
4b1) feeding the real samples and the generated samples in turn into the second discriminator network D2, and computing the second discriminator loss function L_D2 according to the labels of the input samples:
L_D2 = -log( exp(O_r2) / (exp(O_r2) + exp(O_f2)) ) for a real input sample,
L_D2 = -log( exp(O_f2) / (exp(O_r2) + exp(O_f2)) ) for a generated input sample,
wherein O_r2 and O_f2 are the two values of the two-dimensional vector output by the last fully connected layer of the second discrimination network, corresponding to the scores of being judged a real sample and a generated sample respectively, and y2 is the output of network P2;
4b2) computing the gradient of the second discriminator loss function, and updating the parameters of the second discriminator network D2 by back-propagation of the loss;
4b3) feeding the samples generated by the second generator network P2 into the second discrimination network D2, and computing the second generator loss function L_P2:
L_P2 = -log( exp(O_r2) / (exp(O_r2) + exp(O_f2)) ),
evaluated on the generated sample y2, wherein O_r2 and O_f2 are as defined above;
4b4) computing the gradient of the second generator loss function, and updating the parameters of the second generator network P2 by back-propagation with the gradient descent algorithm;
the co-training on the unlabeled data is to feed the unlabeled data simultaneously into the detection dual networks Ψ1 and Ψ2 and perform the following co-training:
4c) the first detection network Ψ1 extracts category features from the unlabeled data and classifies them with its Softmax classifier, obtaining for each unlabeled sample a pseudo-label PL1 and a classification probability vector py; the classification confidence of the i-th unlabeled sample is computed as Cony_i = max(py_i), and Cony_i is compared with a preset confidence threshold T_y; the samples whose confidence exceeds the threshold are used as supervised training samples for the second detection network Ψ2;
4d) the second detection network Ψ2 extracts mode features from the unlabeled data and classifies them with its Softmax classifier, obtaining for each unlabeled sample a pseudo-label PL2 and a classification probability vector pz; the classification confidence of the i-th unlabeled sample is computed as Conz_i = max(pz_i), and Conz_i is compared with a preset confidence threshold T_z; the samples whose confidence exceeds the threshold are used as supervised training samples for the first detection network Ψ1;
(5) inputting the test sample data into the trained change detection network Ψ for detection, to obtain the change detection result of the SAR image.
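The confidence-based pseudo-label exchange of steps 4c) and 4d) can be sketched as follows. This is a minimal numpy sketch: the function names and the example threshold are illustrative, and numpy stands in for whatever deep-learning framework actually produces the class logits.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax over the 2-dimensional class logits."""
    e = np.exp(logits - logits.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def select_pseudo_labeled(logits, threshold):
    """Keep only the unlabeled samples whose top-class probability
    exceeds the confidence threshold (steps 4c/4d).
    Returns (kept indices, pseudo-labels for those indices)."""
    p = softmax(logits)          # classification probability vectors py / pz
    conf = p.max(axis=1)         # Cony_i / Conz_i = max over classes
    pseudo = p.argmax(axis=1)    # pseudo-label PL for each sample
    keep = np.nonzero(conf > threshold)[0]
    return keep, pseudo[keep]
```

In the co-training loop, the samples selected by one branch would then be fed, with their pseudo-labels, as supervised training data to the peer branch.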
2. The method of claim 1, wherein the logarithmic-ratio difference map is calculated in step (1) from the two temporal-phase SAR image data as follows:
K = | log( I2 / I1 ) |
wherein I1 is the first temporal-phase image of the original SAR data and I2 is the second temporal-phase image.
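The log-ratio operator of claim 2 can be sketched in numpy as follows; the stabilizing constant `eps`, which guards against log(0) on dark pixels, is an assumption of this sketch and not part of the claim.

```python
import numpy as np

def log_ratio_map(i1, i2, eps=1.0):
    """Absolute log-ratio difference map K of two co-registered
    SAR intensity images; eps (an assumption here) avoids log(0)."""
    i1 = np.asarray(i1, dtype=np.float64)
    i2 = np.asarray(i2, dtype=np.float64)
    return np.abs(np.log((i2 + eps) / (i1 + eps)))
```

Pixels where the two dates agree map to values near 0, while strong changes in either direction map to large positive values, which is what makes K a useful change indicator.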
3. The method of claim 1, wherein the training samples and the test samples are selected in (2) from the two temporal-phase SAR images and the difference map using a sliding window, as follows:
2a) setting the size of the sliding window to N×N with its centre at (i, j); for training samples, image blocks are selected by sliding the window over a designated region, and for test samples, by sliding the window over the whole image;
2b) in the two temporal-phase images I1 and I2, selecting image blocks of size N×N centred at (i, j), and splicing the two blocks together along the first dimension to obtain the first sample channel of size 2N×N;
2c) corresponding to the pixel (i, j) in the two temporal-phase images I1 and I2, selecting the image block of size N×N centred at (i, j) in the difference map K, denoted m, and extending m into a matrix of size 2N×N as the second sample channel;
2d) when selecting training samples, the sliding window slides over the designated region and a training sample is formed from the first and second sample channels; when selecting test samples, the sliding window slides over the whole image and a test sample is formed from the first and second sample channels.
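The two-channel sample construction of claim 3 can be sketched as follows. Two details are assumptions of this sketch: the window size N is taken to be odd, and "extending m to 2N×N" is implemented as simple replication along the first dimension.

```python
import numpy as np

def patch(img, i, j, n):
    """N x N block of img centred at (i, j); n is assumed odd."""
    h = n // 2
    return img[i - h:i + h + 1, j - h:j + h + 1]

def extract_sample(i1, i2, k, i, j, n):
    """One two-channel sample centred at (i, j):
    channel 1 stacks the two dates' N x N patches into 2N x N;
    channel 2 replicates the difference-map patch m to 2N x N
    (the replication scheme is an assumption)."""
    ch1 = np.concatenate([patch(i1, i, j, n), patch(i2, i, j, n)], axis=0)
    m = patch(k, i, j, n)
    ch2 = np.tile(m, (2, 1))
    return np.stack([ch1, ch2], axis=0)   # shape (2, 2N, N)
```

Sliding (i, j) over a designated region yields the training set, and sliding over every valid pixel yields the test set, exactly as steps 2a) and 2d) describe.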
4. The method of claim 1, wherein the change detection dual networks Ψ1 and Ψ2 in (3a) have the following per-layer parameter settings and relations:
the first four layers of Ψ1 and Ψ2 are shared layers, wherein:
the first fully connected layer L1 has 1000 neurons and extracts shallow features from the training and test samples, producing a 1000-dimensional output vector;
the second layer, the activation function layer ReLU, applies a nonlinear mapping to the output of the preceding fully connected layer, the nonlinear mapping being:
ReLU(x) = max(0, x)
wherein x is the input and ReLU(x) is the output; the input and output dimensions of this layer are identical;
the third layer is a fully connected layer L2 with 1000 neurons, which extracts deeper features from the shallow features output by the preceding fully connected layer and produces a 1000-dimensional output vector;
the fourth layer is an activation function layer ReLU whose function and principle are identical to those of the second-layer ReLU;
the fifth and sixth layers of Ψ1 and Ψ2 are non-shared layers, wherein:
the fifth layer of Ψ1 is a fully connected layer L31, which extracts class-discriminative features from the output of the last activation layer; its input dimension is 1000 and its output dimension is 2;
the sixth layer of Ψ1 is a Softmax classifier layer S11, which converts the 2-dimensional column vector output by L31 into classification probabilities, i.e. the probabilities that the current input sample belongs to the changed and unchanged classes, and classifies the sample according to these probability values;
the fifth layer of Ψ2 is a fully connected layer L32, which extracts class-discriminative features from the output of the last activation layer; its input dimension is 1000 and its output dimension is 2;
the sixth layer of Ψ2 is a Softmax classifier layer S21, which converts the 2-dimensional column vector output by L32 into classification probabilities, i.e. the probabilities that the current input sample belongs to the changed and unchanged classes, and classifies the sample according to these probability values.
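The branch architecture of claim 4 can be sketched as a plain forward pass. Biases, the input dimension, and the random initialization are assumptions of this sketch; in the patented method the first two weight matrices would additionally be shared between Ψ1 and Ψ2.

```python
import numpy as np

rng = np.random.default_rng(0)

def relu(x):
    return np.maximum(0.0, x)

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def init_branch(in_dim, hidden=1000):
    """Weights for one branch: L1(1000) -> ReLU -> L2(1000) -> ReLU,
    then a private 2-neuron layer (L31/L32) feeding the Softmax."""
    return {
        "w1": rng.standard_normal((in_dim, hidden)) * 0.01,
        "w2": rng.standard_normal((hidden, hidden)) * 0.01,
        "w3": rng.standard_normal((hidden, 2)) * 0.01,
    }

def forward(params, x):
    h = relu(x @ params["w1"])         # L1 + ReLU (shared)
    h = relu(h @ params["w2"])         # L2 + ReLU (shared)
    return softmax(h @ params["w3"])   # L31/L32 + Softmax classifier
```

The output is the 2-dimensional probability vector over the changed / unchanged classes that the Softmax layers S11 and S21 produce.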
5. The method of claim 1, wherein the two discrimination networks D1 and D2 in (3b) have the same structure, with per-layer parameter settings and relations as follows:
the first fully connected layer has 1000 neurons and extracts shallow discriminative features from the input of the network, producing a 1000-dimensional output vector;
the second layer, the activation function layer ReLU, applies a nonlinear mapping to the output of the preceding fully connected layer; its input and output dimensions are identical;
the third layer is a fully connected layer with 1000 neurons, which extracts deeper discriminative features from the shallow features output by the preceding fully connected layer and produces a 1000-dimensional output vector;
the fourth layer is an activation function layer ReLU;
the fifth layer is a fully connected layer with 2 neurons, which reduces the output of the preceding layer to a 2-dimensional vector for the subsequent classification probability calculation;
the sixth layer is a Softmax classifier layer, which converts the 2-dimensional column vector output by the preceding layer into a two-class probability, i.e. the probabilities that the current input sample is a true-distribution sample or a generator-generated sample, and judges the sample according to these probability values.
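The discriminator's two-class Softmax output of claim 5 leads to cross-entropy losses of the following standard GAN form. These are assumed losses consistent with the description (slot 0 = real, slot 1 = generated), not the patent's verbatim formulas.

```python
import numpy as np

def softmax(v):
    e = np.exp(v - v.max())
    return e / e.sum()

def discriminator_loss(logits_real, logits_fake):
    """Two-class cross-entropy over the discriminator's 2-dim output
    (O_r, O_f): real samples should land in the 'real' slot 0 and
    generated samples in the 'generated' slot 1."""
    return (-np.log(softmax(logits_real)[0])
            - np.log(softmax(logits_fake)[1]))

def generator_loss(logits_fake):
    """Non-saturating generator loss (an assumed standard form):
    push the discriminator's output on generated samples toward
    the 'real' slot 0."""
    return -np.log(softmax(logits_fake)[0])
```

A well-trained discriminator drives `discriminator_loss` down by separating the two slots, while the generator updates of steps 4a3)/4b3) drive `generator_loss` down by fooling it.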
CN201910527007.1A 2019-06-18 2019-06-18 SAR image change detection method based on semi-supervised countermeasure depth network Active CN110263845B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910527007.1A CN110263845B (en) 2019-06-18 2019-06-18 SAR image change detection method based on semi-supervised countermeasure depth network

Publications (2)

Publication Number Publication Date
CN110263845A CN110263845A (en) 2019-09-20
CN110263845B true CN110263845B (en) 2023-05-02

Family

ID=67919058

Country Status (1)

Country Link
CN (1) CN110263845B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111444955B (en) * 2020-03-25 2022-08-02 哈尔滨工程大学 Underwater sonar image unsupervised classification method based on class consciousness field self-adaption
CN111523422B (en) * 2020-04-15 2023-10-10 北京华捷艾米科技有限公司 Key point detection model training method, key point detection method and device
CN112257855B (en) * 2020-11-26 2022-08-16 Oppo(重庆)智能科技有限公司 Neural network training method and device, electronic equipment and storage medium
CN112285664B (en) * 2020-12-18 2021-04-06 南京信息工程大学 Method for evaluating countermeasure simulation confidence of radar-aircraft system
CN112686305A (en) * 2020-12-29 2021-04-20 深圳龙岗智能视听研究院 Semi-supervised learning method and system under assistance of self-supervised learning
CN112766381B (en) * 2021-01-22 2023-01-24 西安电子科技大学 Attribute-guided SAR image generation method under limited sample
CN112784777B (en) * 2021-01-28 2023-06-02 西安电子科技大学 Unsupervised hyperspectral image change detection method based on countermeasure learning
CN113255451B (en) * 2021-04-25 2023-04-07 西北工业大学 Method and device for detecting change of remote sensing image, electronic equipment and storage medium
CN114301637B (en) * 2021-12-11 2022-09-02 河南大学 Intrusion detection method and system for medical Internet of things
CN114821299B (en) * 2022-03-28 2024-03-19 西北工业大学 Remote sensing image change detection method
CN114821337B (en) * 2022-05-20 2024-04-16 武汉大学 Semi-supervised SAR image building area extraction method based on phase consistency pseudo tag
CN116127345B (en) * 2022-12-23 2023-11-14 北京科技大学 Converter steelmaking process mode design method based on deep clustering generation countermeasure network

Citations (6)

Publication number Priority date Publication date Assignee Title
EP3246875A2 (en) * 2016-05-18 2017-11-22 Siemens Healthcare GmbH Method and system for image registration using an intelligent artificial agent
CN107563428A (en) * 2017-08-25 2018-01-09 西安电子科技大学 Classification of Polarimetric SAR Image method based on generation confrontation network
CN107977667A (en) * 2016-10-21 2018-05-01 西安电子科技大学 SAR target discrimination methods based on semi-supervised coorinated training
CN108492298A (en) * 2018-04-13 2018-09-04 西安电子科技大学 Based on the multispectral image change detecting method for generating confrontation network
CN108564115A (en) * 2018-03-30 2018-09-21 西安电子科技大学 Semi-supervised polarization SAR terrain classification method based on full convolution GAN
CN109242889A (en) * 2018-08-27 2019-01-18 大连理工大学 SAR image change detection based on context conspicuousness detection and SAE

Non-Patent Citations (3)

Title
Han W. et al., "A semi-supervised generative framework with deep learning features for high-resolution remote sensing image scene classification," ISPRS Journal of Photogrammetry and Remote Sensing, 2018-12-31, full text *
Du Lan et al., "SAR image target detection algorithm based on convolutional neural networks," Journal of Electronics & Information Technology, 2016-12-31, full text *
Hua Wenqiang, "Research on polarimetric SAR image classification with small samples," China Doctoral Dissertations Full-text Database, No. 02, 2019-02-15, full text *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant