CN112330562A - Heterogeneous remote sensing image transformation method and system - Google Patents

Heterogeneous remote sensing image transformation method and system

Info

Publication number: CN112330562A (application CN202011236018.3A; granted as CN112330562B)
Authority: CN (China)
Other languages: Chinese (zh)
Prior art keywords: image, visible light, generator, SAR, noise reduction
Legal status: Granted; Active
Inventors: 刘瑜, 谭大宁, 李刚, 丁自然, 孙顺, 姜乔文
Assignees: Tsinghua University; Naval Aeronautical University (application filed by Tsinghua University and Naval Aeronautical University)
Priority: CN202011236018.3A
Classifications

    • G06T 5/00 Image enhancement or restoration; G06T 5/70 Denoising; Smoothing
    • G06F 18/214 Pattern recognition; generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06F 18/295 Pattern recognition; Markov models or related models, e.g. semi-Markov models, Markov random fields
    • G06T 7/40 Image analysis; analysis of texture
    • G06T 7/90 Image analysis; determination of colour characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a heterogeneous remote sensing image transformation method and system. A visible light grayscale image is set as an intermediate state between the SAR image and the visible light color image: the SAR image is converted into a visible light grayscale image by an SAR image noise reduction generator and discriminated by a first discriminator, and the visible light grayscale image is converted into a visible light color image by a visible light image generator and discriminated by a second discriminator. This reduces the distortion rate from the SAR image to the visible light color image and improves the quality and efficiency of remote sensing image transformation.

Description

Heterogeneous remote sensing image transformation method and system
Technical Field
The invention relates to the technical field of image transformation, in particular to a heterogeneous remote sensing image transformation method and system.
Background
With the continuous development of information perception technology, the processing and application of remote sensing images are gradually shifting from single-modality to multi-modality. The reason is that images acquired by a single sensor cannot meet the requirements of certain specific scenes, and while sensor performance cannot be effectively improved in the short term, combining heterogeneous images acquired by multiple types of sensors has become a feasible approach.
Taking optical and Synthetic Aperture Radar (SAR) images as an example, optical images are sensitive to weather, especially cloud cover, which severely limits their use for observing and monitoring ground targets. A possible alternative is to correlate images acquired by sensors of different modalities at the same time, using the weather-insensitive SAR image as compensation. In this respect, deep learning provides a good solution: driven by coupled time series of heterogeneous images, deep learning methods can learn the mapping from SAR images to visible light images and use it to transform known SAR images into unknown visible light images. The result carries the spectral information of the visible light image while retaining the all-day, all-weather observation advantages of the SAR image.
Heterogeneous transformation of remote sensing images can be classified as an image style transformation problem, and image style transformation methods can be further divided into feature-based methods and methods based on generative adversarial networks (GAN). The idea of feature-based methods is to use the feature extraction capability of artificial neural networks to separately extract the style and the semantic content of the original image; considering the distortion of style and content, the input-to-output transformation is restricted to be locally affine in color space, this constraint is expressed as a custom fully differentiable energy term, and joint reconstruction then generates the new style image. The other line of style transformation is based on GANs: a conditional generative adversarial network (cGAN) generates and transforms the image conditionally, so that the generated image carries the semantic information of the original image domain and the style characteristics of the target image domain.
Of these two existing approaches, the feature-based method works well for artistic style transformation, but generating a new-style photo requires differentiating the loss function many times, so applying it to heterogeneous transformation of remote sensing images entails a large parameter computation load and poor real-time performance. The cGAN-based method, by comparison, has low computational cost and higher flexibility, and its results are satisfactory between ordinary images with clear structure and symmetric information; when used for heterogeneous transformation of remote sensing images, however, it suffers severe texture loss and distortion, which affects final decision and judgment.
At present, heterogeneous remote sensing image transformation based on conditional generative adversarial networks faces the following technical difficulties. First, owing to the different principles of SAR and visible light acquisition sensors, the two image types exhibit information asymmetry; experiments with existing image transformation models such as pix2pix and Cycle-GAN show that RGB visible light images generated from SAR images suffer severe distortion such as feature loss, so the interpretability and reliability of the final transformation result are low, affecting subsequent research and decision-making. The influence of the multiplicative Gamma noise that contaminates SAR images on cross-modal transformation therefore needs to be reduced, and the model's ability to retain and recover the main features of the SAR image improved, so that the transformed visible light image carries the semantic information of the original SAR image together with certain spectral information, providing more reliable image information for geological disaster monitoring, site investigation, sea area monitoring, and the like. Second, the information of a visible light image resides in multiple channels, whereas the SAR image is a grayscale image whose information resides in one channel; channel difference and a semantic gap exist between the two, and directly applying a single generator-discriminator adversarial network to the heterogeneous transformation leads to inaccurate mapping of spectral and semantic information.
For heterogeneous image transformation based on conditional generative adversarial networks, no existing SAR-to-visible-light (RGB) transformation model solves these problems well. In terms of accuracy and real-time performance, existing methods cannot meet the increasingly complex and diverse requirements of remote sensing images, so the problem urgently needs to be solved.
Disclosure of Invention
Based on the above, the invention aims to provide a heterogeneous remote sensing image transformation method and system, which improve the quality and efficiency of remote sensing image transformation.
In order to achieve the purpose, the invention provides the following scheme:
a heterogeneous remote sensing image transformation method, comprising:
denoising a plurality of SAR images to obtain a visible light grayscale image of each SAR image;
obtaining a visible light color image of each SAR image;
taking the SAR image as the input of an SAR image noise reduction generator and the visible light grayscale image as the output of the SAR image noise reduction generator, discriminating the visible light grayscale image output by the SAR image noise reduction generator with a first gradient discriminator, and training the SAR image noise reduction generator and the first gradient discriminator;
taking the visible light grayscale image as the input of a visible light image generator and the visible light color image as the output of the visible light image generator, discriminating the visible light color image output by the visible light image generator with a second gradient discriminator, and training the visible light image generator and the second gradient discriminator;
inputting the SAR image to be transformed into the trained SAR image noise reduction generator to obtain a visible light grayscale image to be transformed, and inputting the visible light grayscale image to be transformed into the trained visible light image generator to obtain a transformed visible light color image.
Optionally, the SAR image noise reduction generator is of an improved U-Net network structure, the input layer and output layer of the SAR image noise reduction generator are both single-channel, and adjacent single-layer convolutional layers in the SAR image noise reduction generator are connected by a residual block and a skip connection.
Optionally, the visible light image generator is of an improved U-Net network structure, and the number of output-layer channels of the visible light image generator is 3.
Optionally, the first gradient discriminator and the second gradient discriminator are both Markov discriminators.
Optionally, the first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents the reconstruction loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{grad}(P)$ represents the texture retention loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $P$ represents the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images;

the second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents the reconstruction loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{grad}(Q)$ represents the texture retention loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $Q$ represents the visible light image generator, and $z^{(i)}$ represents the set of visible light color images;

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter;
stopping training the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator and the second gradient discriminator when the sum of the first training loss function and the second training loss function is less than a set threshold.
The invention also discloses a heterogeneous remote sensing image transformation system, comprising:
a visible light grayscale image acquisition module, used for denoising a plurality of SAR images to obtain a visible light grayscale image of each SAR image;
a visible light color image acquisition module, used for obtaining a visible light color image of each SAR image;
a first training module, used for taking the SAR image as the input of the SAR image noise reduction generator and the visible light grayscale image as its output, and discriminating the visible light grayscale image output by the SAR image noise reduction generator with a first gradient discriminator, to train the SAR image noise reduction generator and the first gradient discriminator;
a second training module, used for taking the visible light grayscale image as the input of the visible light image generator and the visible light color image as its output, and discriminating the visible light color image output by the visible light image generator with a second gradient discriminator, to train the visible light image generator and the second gradient discriminator;
an SAR image conversion module, used for inputting the SAR image to be transformed into the trained SAR image noise reduction generator to obtain the visible light grayscale image to be transformed, and inputting the visible light grayscale image to be transformed into the trained visible light image generator to obtain the transformed visible light color image.
Optionally, the SAR image noise reduction generator is of an improved U-Net network structure, the input layer and output layer of the SAR image noise reduction generator are both single-channel, and adjacent single-layer convolutional layers in the SAR image noise reduction generator are connected by a residual block and a skip connection.
Optionally, the visible light image generator is of an improved U-Net network structure, and the number of output-layer channels of the visible light image generator is 3.
Optionally, the first gradient discriminator and the second gradient discriminator are both Markov discriminators.
Optionally, the first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents its reconstruction loss; $\mathcal{L}_{grad}(P)$ represents its texture retention loss; $P$ represents the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images;

the second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents its reconstruction loss; $\mathcal{L}_{grad}(Q)$ represents its texture retention loss; $Q$ represents the visible light image generator, and $z^{(i)}$ represents the set of visible light color images;

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter;
stopping training the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator and the second gradient discriminator when the sum of the first training loss function and the second training loss function is less than a set threshold.
According to the specific embodiments provided by the invention, the invention discloses the following technical effects:
The invention discloses a heterogeneous remote sensing image transformation method and system in which a visible light grayscale image is set as an intermediate state between the SAR image and the visible light color image: the SAR image is converted into a visible light grayscale image by an SAR image noise reduction generator and the result is discriminated by a first discriminator, and the visible light grayscale image is converted into a visible light color image by a visible light image generator and the result is discriminated by a second discriminator. This reduces the distortion rate from the SAR image to the visible light color image and improves the quality and efficiency of remote sensing image transformation.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed in the embodiments are briefly described below. Obviously, the drawings in the following description are only some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
FIG. 1 is a schematic flow chart of a heterogeneous remote sensing image transformation method of the invention;
FIG. 2 is a schematic diagram of the heterogeneous image transformation from SAR images to visible color images of the present invention;
FIG. 3 is a schematic diagram of a network structure of an SAR image noise reduction generator according to the present invention;
FIG. 4 is a schematic diagram of a network structure of a visible light image generator according to the present invention;
FIG. 5 is a diagram of the first discriminator network structure according to the present invention;
FIG. 6 is a diagram of the second discriminator network structure according to the present invention;
FIG. 7 is a schematic structural diagram of a heterogeneous remote sensing image transformation system according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention aims to provide a heterogeneous remote sensing image transformation method and a heterogeneous remote sensing image transformation system, which improve the quality and efficiency of remote sensing image transformation.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Fig. 1 is a schematic flow chart of a heterogeneous remote sensing image transformation method of the present invention, and as shown in fig. 1, the heterogeneous remote sensing image transformation method includes:
step 101: and denoising the plurality of SAR images to obtain a visible light gray level image of each SAR image.
Step 102: and obtaining a visible light color image of each SAR image.
Step 103: and taking the SAR image as the input of an SAR image noise reduction generator, taking the visible light gray image as the output of the SAR image noise reduction generator, identifying the visible light gray image output by the SAR image noise reduction generator by using a first gradient identifier, and training the SAR image noise reduction generator and the first gradient identifier.
Step 104: and training the visible light image generator and a second gradient discriminator by taking the visible light gray scale image as the input of the visible light image generator, taking the visible light color image as the output of the visible light image generator, and discriminating the visible light color image output by the visible light image generator by using the second gradient discriminator.
Step 105: inputting the SAR image to be transformed into the trained SAR image noise reduction generator to obtain a visible light gray image to be transformed, and inputting the visible light gray image to be transformed into the trained visible light image generator to obtain a transformed visible light color image.
The SAR image noise reduction generator is of an improved U-Net network structure, its input layer and output layer are both single-channel, and adjacent single-layer convolutional layers in it are connected by a residual block and a skip connection.
The visible light image generator is of an improved U-Net network structure, and the number of its output-layer channels is 3.
The first gradient discriminator and the second gradient discriminator are both Markov discriminators.
The first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents its reconstruction loss; $\mathcal{L}_{grad}(P)$ represents its texture retention loss; $P$ represents the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images.

The second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents its reconstruction loss; $\mathcal{L}_{grad}(Q)$ represents its texture retention loss; $Q$ represents the visible light image generator, and $z^{(i)}$ represents the set of visible light color images.

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter.
Stopping training the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator and the second gradient discriminator when the sum of the first training loss function and the second training loss function is less than a set threshold.
The following is a detailed description of a heterogeneous remote sensing image transformation method of the present invention.
As shown in FIG. 2, for ease of illustration the following convention is made: let X, Y, Z denote the SAR image domain, the visible light grayscale image domain (intermediate-state visible light image domain), and the visible light color image domain respectively, and let $x^{(i)} \in X$, $y^{(i)} \in Y$, $z^{(i)} \in Z$ be the dataset samples of the corresponding image domains ($i = 1, 2, \ldots, N$, where $N$ denotes the total number of samples in the dataset). The transformation of the SAR image into a visible light grayscale image (intermediate-state visible light image) can thus be defined as a mapping $P: X \to Y$ from the source domain X to the target domain Y. Let $x^{(i)}$ be a random sample taken from the SAR image domain X with distribution $x^{(i)} \sim p_{data}(x)$; the sample that $x^{(i)}$ maps to in the visible light image domain is $y^{(i)}$. The remote sensing image transformation network provided by the invention finally realizes the mapping $Q: Y \to Z$, and the final $z^{(i)}$ is generated from $x^{(i)}$ by the deep learning network: $z^{(i)} = Q(P(x^{(i)}))$.
The remote sensing image transformation network comprises two generators and two discriminators: the SAR image noise reduction generator (abbreviated as generator $P$), the visible light image generator (abbreviated as generator $Q$), the gradient discriminator $D_1$ (first gradient discriminator), and the gradient discriminator $D_2$ (second gradient discriminator). Generator $P$ converts the SAR image into a visible light grayscale image, recovering the high-frequency details and low-frequency semantic information of the SAR image and narrowing the semantic gap from the SAR image to the visible light image. Generator $Q$ converts the visible light grayscale image into a visible light color image with spectral information, so that the recovered visible light color image has the semantic information of the original SAR image domain and the spectral details of the visible light image. Discriminator $D_1$ discriminates using the gradient details of the visible light grayscale images in dataset Y, which gives generator $P$ stronger SAR image noise reduction capability while preserving the semantic information of the original SAR image. Discriminator $D_2$, like $D_1$, discriminates using the gradient information of the visible light color image, so that the image produced by generator $Q$ better recovers spectral information while preserving detail information.
As shown in fig. 3, generator $P$ adopts an improved U-Net network structure: the input layer and output layer are single-channel, and each pair of adjacent single-layer convolutional layers is connected by a Residual Block and a Skip Connection, which can fully strengthen the generator's learning capability while extending the network depth. Meanwhile, instead of converting the multiplicative speckle model into an additive one through a logarithmic transform of the input, the same residual learning strategy as SAR-DRN (a dilated residual network for SAR image despeckling) is adopted: the output layer of generator $P$ is the Gamma noise extracted from the input image, and the clean visible light grayscale image is obtained by subtracting the output from the input.
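For concreteness, this residual-learning generator design can be sketched in PyTorch as follows. This is a minimal illustrative sketch, not the patent's exact architecture: the layer widths, depth, and module names (ResidualBlock, NoiseUNet) are assumptions; only the single-channel input/output, the residual blocks with skip connections, and the noise-subtraction output follow the description above.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Residual block inserted between adjacent convolutional layers."""
    def __init__(self, channels):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        return torch.relu(x + self.body(x))

class NoiseUNet(nn.Module):
    """Sketch of generator P: single-channel in/out improved U-Net with
    residual blocks and a skip connection, predicting the Gamma noise map."""
    def __init__(self, base=64):
        super().__init__()
        self.enc1 = nn.Sequential(nn.Conv2d(1, base, 3, padding=1),
                                  ResidualBlock(base))
        self.down = nn.Conv2d(base, base * 2, 4, stride=2, padding=1)
        self.mid = ResidualBlock(base * 2)
        self.up = nn.ConvTranspose2d(base * 2, base, 4, stride=2, padding=1)
        self.dec1 = nn.Sequential(ResidualBlock(base * 2),
                                  nn.Conv2d(base * 2, 1, 3, padding=1))

    def forward(self, x):
        e1 = self.enc1(x)
        m = self.mid(self.down(e1))
        u = self.up(m)
        noise = self.dec1(torch.cat([u, e1], dim=1))  # skip connection
        return x - noise  # residual learning: clean image = input - noise
```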
As shown in fig. 4, generator $Q$ has a structure similar to that of generator $P$, also adopting the improved U-Net network structure with a single-channel input layer; the difference is that its output layer has 3 channels (Channel = 3), and its input is the visible light grayscale image generated by generator $P$. In principle, generator $P$ learns the mapping from the Gamma-noise-contaminated SAR image to the visible light grayscale image, which is in essence image noise reduction, while generator $Q$ learns the mapping from the visible light grayscale image to the visible light color image. This progressive cross-modal transformation can therefore obtain a better transformation effect: increasing the number of network layers strengthens the model's capability to reconstruct a visible light color image from the information-poor SAR image and realizes the heterogeneous transformation of the image.
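Under the same assumptions, generator Q can be sketched by reusing the NoiseUNet backbone above, changing only what the description specifies: a 3-channel output layer and direct image prediction instead of noise subtraction (the tanh output range is an added assumption).

```python
class ColorUNet(NoiseUNet):
    """Sketch of generator Q: same improved U-Net backbone as P, but with a
    3-channel output layer and direct (non-residual) image prediction."""
    def __init__(self, base=64):
        super().__init__(base)
        self.dec1 = nn.Sequential(ResidualBlock(base * 2),
                                  nn.Conv2d(base * 2, 3, 3, padding=1))

    def forward(self, gray):
        e1 = self.enc1(gray)
        m = self.mid(self.down(e1))
        u = self.up(m)
        return torch.tanh(self.dec1(torch.cat([u, e1], dim=1)))  # RGB in [-1, 1]
```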
As shown in figs. 5-6, discriminators $D_1$ and $D_2$ are Markov discriminators, composed entirely of convolutional layers and finally outputting an n-by-n matrix. The input of the Markov discriminator is a 70x70 image patch randomly cropped from the 256x256 image, its output is the mean of the output matrix (4x4), and the final discrimination result is True/False. "True" means the picture generated by the generator has "fooled" the discriminator, the expected effect is reached, and training of the generator is complete; "False" means the generated picture has not fooled the discriminator, the generated picture is not as expected, and the generator needs further training. Since each entry of the output matrix corresponds to a receptive field in the original image, that is, to a patch of the original image, a GAN constructed this way may be called a PatchGAN. The two discriminators differ in their targets: $D_1$ discriminates the images generated by generator $P$ (True/False), while $D_2$ discriminates the images generated by generator $Q$ (True/False). The heterogeneous image transformation process involves migrating both a content part and a texture part: the content part is the similarity in content (semantics) between the generated image and the original image, and the texture part is the similarity in texture between the generated image and the target image. Theory shows that texture detail can be maintained with PatchGAN.
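A sketch of such a Markov (PatchGAN) discriminator follows, patterned on the common pix2pix layout. The channel widths and the conditioning of the discriminator on the generator's input are assumptions, and the exact layer count that yields the 4x4 output matrix is not specified here.

```python
import torch
import torch.nn as nn

class PatchDiscriminator(nn.Module):
    """Markov discriminator: all convolutional, outputs an n x n score map
    in which each entry rates one receptive-field patch of the input."""
    def __init__(self, in_channels):
        super().__init__()
        def block(cin, cout, stride):
            return nn.Sequential(
                nn.Conv2d(cin, cout, 4, stride=stride, padding=1),
                nn.LeakyReLU(0.2, inplace=True),
            )
        self.net = nn.Sequential(
            block(in_channels, 64, 2),
            block(64, 128, 2),
            block(128, 256, 2),
            block(256, 512, 1),
            nn.Conv2d(512, 1, 4, stride=1, padding=1),  # n x n patch scores
        )

    def forward(self, cond, img):
        # the discriminator sees the generator input alongside the real/fake image
        scores = self.net(torch.cat([cond, img], dim=1))
        return torch.sigmoid(scores).mean()  # mean of the output matrix

# D1 pairs the SAR image with a grayscale image (1 + 1 channels);
# D2 pairs the grayscale image with a color image (1 + 3 channels).
D1 = PatchDiscriminator(in_channels=2)
D2 = PatchDiscriminator(in_channels=4)
```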
In the training stage of the remote sensing image transformation network, generator $P$ and discriminator $D_1$ are trained first: from paired SAR and visible light grayscale images, generator $P$ learns the speckle-removal and (equal-scale) super-resolution reconstruction mapping of the SAR image, so that the generated intermediate-state visible light image deceives discriminator $D_1$, making it unable to distinguish the visible light grayscale image from the generated intermediate-state image, while discriminator $D_1$ judges whether the intermediate-state image produced by the generator is close to the visible light grayscale image corresponding to the original input SAR image. Then, with the parameters of generator $P$ and discriminator $D_1$ fixed, generator $Q$ and discriminator $D_2$ are trained: generator $Q$ learns the spectral information mapping of the visible light image from paired visible light grayscale and visible light color images, completing the colorization of the image. During the training process, generator $Q$ and discriminator $D_2$ are trained only after generator $P$ and discriminator $D_1$ have reached balance.
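This two-stage schedule can be condensed into the following sketch. The optimizers, the dataloader, and the loss weights are placeholders; grad_map stands for the gradient operation M(.) whose sketch appears in the loss-function section below; and the BCE terms stand in for the cGAN losses defined there.

```python
import torch
import torch.nn.functional as F

def train_stage(gen, disc, g_opt, d_opt, loader, alpha, beta):
    """One epoch of adversarial training for one (generator, discriminator) pair."""
    for src, tgt in loader:
        # discriminator step: real pair scored toward 1, generated pair toward 0
        fake = gen(src).detach()
        real_score = disc(src, tgt)
        fake_score = disc(src, fake)
        d_loss = (F.binary_cross_entropy(real_score, torch.ones_like(real_score))
                  + F.binary_cross_entropy(fake_score, torch.zeros_like(fake_score)))
        d_opt.zero_grad(); d_loss.backward(); d_opt.step()

        # generator step: cGAN term + weighted reconstruction and texture terms
        fake = gen(src)
        score = disc(src, fake)
        g_loss = (F.binary_cross_entropy(score, torch.ones_like(score))
                  + alpha * F.l1_loss(fake, tgt)
                  + beta * F.l1_loss(grad_map(fake), grad_map(tgt)))
        g_opt.zero_grad(); g_loss.backward(); g_opt.step()

# Stage 1: train P against D1 on (SAR, grayscale) pairs until they balance.
# Stage 2: fix P, then train Q against D2 on (grayscale, color) pairs:
#   for p in P.parameters():
#       p.requires_grad_(False)
#   train_stage(Q, D2, q_opt, d2_opt, gray_color_loader, alpha2, beta2)
```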
In the testing phase, the input SAR image first passes through generator $P$, which generates the denoised intermediate visible light grayscale image; this image is then input to generator $Q$, which generates the visible light color image, completing the transformation from the SAR image to the visible light image.
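At test time the two trained generators are simply composed; a minimal sketch:

```python
import torch

@torch.no_grad()
def sar_to_color(sar, P, Q):
    """Test phase: SAR (B,1,H,W) -> intermediate grayscale -> color (B,3,H,W)."""
    gray = P(sar)   # generator P: noise reduction, intermediate-state image
    return Q(gray)  # generator Q: colorization to the visible light color image
```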
Design of the loss functions
1) cGAN (conditional generative adversarial network) loss
On the premise that paired training data exist, the method adopts cGAN to improve generator performance. Through the generative adversarial network, generator $P$ learns the mapping from X to Y and tries to deceive discriminator $D_1$, while generator $Q$ learns the mapping from Y to Z and tries to deceive discriminator $D_2$; the network structure of each discriminator serves to distinguish forged images from real images. Thus, the cGAN losses from X to Y and from Y to Z are:

$\mathcal{L}_{cGAN}(P, D_1) = \mathbb{E}_{x,y}[\log D_1(x^{(i)}, y^{(i)})] + \mathbb{E}_{x}[\log(1 - D_1(x^{(i)}, P(x^{(i)})))]$

$\mathcal{L}_{cGAN}(Q, D_2) = \mathbb{E}_{y,z}[\log D_2(y^{(i)}, z^{(i)})] + \mathbb{E}_{y}[\log(1 - D_2(y^{(i)}, Q(y^{(i)})))]$
In the first half of training, while discriminator $D_1$ learns to maximize $\mathcal{L}_{cGAN}(P, D_1)$, generator $P$ attempts to minimize it, until the two reach the optimal state (Nash equilibrium); in the second half of training, while discriminator $D_2$ learns to maximize $\mathcal{L}_{cGAN}(Q, D_2)$, generator $Q$ attempts to minimize it, until the two reach the optimal state (Nash equilibrium). $\mathbb{E}$ denotes the mathematical expectation; for the generator, the smaller the expectation, the smaller the loss.
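The two objectives can be written directly from these formulas. In the sketch below the discriminator is assumed to return a probability in (0, 1); the generator term uses the common non-saturating form -log D(x, G(x)) in place of the literal log(1 - D(x, G(x))) term, a standard substitution rather than anything the patent specifies.

```python
import torch

def cgan_loss_D(D, x, real, fake):
    """Discriminator side: maximize log D(x, y) + log(1 - D(x, G(x)))."""
    eps = 1e-8  # numerical floor for the logarithms
    return -(torch.log(D(x, real) + eps)
             + torch.log(1.0 - D(x, fake.detach()) + eps))

def cgan_loss_G(D, x, fake):
    """Generator side: the non-saturating surrogate -log D(x, G(x))."""
    return -torch.log(D(x, fake) + 1e-8)
```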
2) Reconstruction loss
Because the invention adopts the improved U-Net structure, whose residual blocks and skip connections increase the network depth, the reconstruction loss is designed with the $L_1$ loss to minimize the distance between the reconstructed image and the original image. The reconstruction losses of the two training stages are respectively:

$\mathcal{L}_{L1}(P) = \mathbb{E}_{x,y}[\lVert y^{(i)} - P(x^{(i)}) \rVert_1]$

$\mathcal{L}_{L1}(Q) = \mathbb{E}_{y,z}[\lVert z^{(i)} - Q(y^{(i)}) \rVert_1]$
3) Texture retention loss
As shown in FIG. 2, in the optimal transformation state the output $P(x^{(i)})$ of the network should approximate the visible light image $y^{(i)}$. On the basis of the traditional reconstruction loss, the invention proposes a gradient-guided texture retention loss. Let $M(\cdot)$ denote the image gradient operation; the texture retention losses of the two training phases are:

$\mathcal{L}_{grad}(P) = \mathbb{E}_{x,y}[\lVert M(y^{(i)}) - M(P(x^{(i)})) \rVert_1]$

$\mathcal{L}_{grad}(Q) = \mathbb{E}_{y,z}[\lVert M(z^{(i)}) - M(Q(y^{(i)})) \rVert_1]$
For an image $I$, the gradient-extraction operation $M(\cdot)$ is computed as follows:

$I_x(x, y) = I(x+1, y) - I(x-1, y)$

$I_y(x, y) = I(x, y+1) - I(x, y-1)$

$\nabla I(x, y) = (I_x(x, y), I_y(x, y))$

$M(I)(x, y) = \sqrt{I_x(x, y)^2 + I_y(x, y)^2}$

In a specific implementation, the $M(\cdot)$ operation is implemented by convolution with a fixed convolution kernel, and $(x, y)$ denotes the pixel coordinates.
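A sketch of M(.) as a fixed-kernel (non-learned) convolution, following the central differences above; combining the two components into a gradient magnitude is an assumption where the original formula is not legible.

```python
import torch
import torch.nn.functional as F

# central-difference kernels: I_x(x,y) = I(x+1,y) - I(x-1,y), likewise for I_y
KX = torch.tensor([[[[0., 0., 0.], [-1., 0., 1.], [0., 0., 0.]]]])
KY = torch.tensor([[[[0., -1., 0.], [0., 0., 0.], [0., 1., 0.]]]])

def grad_map(img):
    """M(I): per-channel image gradient via fixed convolution kernels."""
    c = img.shape[1]
    kx = KX.repeat(c, 1, 1, 1)  # one kernel per channel (depthwise conv)
    ky = KY.repeat(c, 1, 1, 1)
    ix = F.conv2d(img, kx, padding=1, groups=c)
    iy = F.conv2d(img, ky, padding=1, groups=c)
    return torch.sqrt(ix ** 2 + iy ** 2 + 1e-8)  # gradient magnitude
```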
Thus, in the first half of the model training phase, generator $P$ and discriminator $D_1$ form the generative adversarial network $GAN_1$, whose total training loss is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

In the second half of the model training phase, generator $Q$ and discriminator $D_2$ form the generative adversarial network $GAN_2$, whose total training loss is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ its reconstruction loss; $\mathcal{L}_{grad}(P)$ its texture retention loss; $P$ the SAR image noise reduction generator, $x^{(i)}$ the set of SAR images, $y^{(i)}$ the set of visible light grayscale images; $\mathcal{L}_{cGAN}(Q, D_2)$ the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ its reconstruction loss; $\mathcal{L}_{grad}(Q)$ its texture retention loss; $Q$ the visible light image generator, $z^{(i)}$ the set of visible light color images; $\alpha_1$ a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter.
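Assembling the three terms per either total-loss formula, reusing cgan_loss_G and grad_map from the sketches above; the weight values alpha and beta are placeholders the patent does not fix here.

```python
import torch.nn.functional as F

def total_loss(D, gen_in, fake, target, alpha, beta):
    """L_GAN = L_cGAN + alpha * L_L1 + beta * L_grad, for either training stage."""
    adv = cgan_loss_G(D, gen_in, fake)                 # adversarial (cGAN) term
    rec = F.l1_loss(fake, target)                      # L1 reconstruction term
    tex = F.l1_loss(grad_map(fake), grad_map(target))  # texture retention term
    return adv + alpha * rec + beta * tex

# Per the stopping criterion stated in this description: end training once
# the sum of the stage-1 and stage-2 total losses falls below a set threshold.
```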
The technical effects of the heterogeneous remote sensing image transformation method of the invention include the following points:
1. The method constructs a weighted objective function with reconstruction loss, cGAN loss, and texture-preserving gradient loss. By weighting the three constraints of semantic information, style characteristics, and one-dimensional gradient features, it realizes natural transformation from the SAR image to the visible light image; it has obvious advantages in overcoming noise and distortion, strong adaptability, a wide application range, and strong robustness.
2. The method is the first to improve U-Net with a Residual Block and a Skip Connection for both the SAR-to-grayscale generator and the grayscale-to-color generator, increasing network depth, multi-scale feature extraction capability, and mapping capability; this remarkably improves the mapping from a single-channel grayscale remote sensing image to an RGB-channel remote sensing image and greatly reduces spectral mapping distortion.
3. The method optimally designs the total objective function: on the basis of the traditional method, the result generated under the one-dimensional gradient feature constraint retains the detail features of the original SAR image and improves the quality of the heterogeneous transformation.
4. The method addresses the information asymmetry between the SAR image and the visible light color image by constraining the final transformation effect through the network structure, overcoming the distortion in SAR-to-RGB visible light image transformation.
Fig. 7 is a schematic structural diagram of a heterogeneous remote sensing image transformation system according to the present invention, and as shown in fig. 7, the heterogeneous remote sensing image transformation system includes:
a visible light grayscale image obtaining module 201, configured to denoise multiple SAR images to obtain a visible light grayscale image of each SAR image;
a visible light color image obtaining module 202, configured to obtain a visible light color image of each SAR image;
the first training module 203 is configured to use the SAR image as an input of a SAR image noise reduction generator, use the visible light grayscale image as an output of the SAR image noise reduction generator, and use a first gradient discriminator to discriminate the visible light grayscale image output by the SAR image noise reduction generator, so as to train the SAR image noise reduction generator and the first gradient discriminator;
a second training module 204, configured to train the visible light image generator and the second gradient discriminator by using the visible light grayscale image as an input of the visible light image generator, using the visible light color image as an output of the visible light image generator, and discriminating the visible light color image output by the visible light image generator by using a second gradient discriminator;
the SAR image conversion module 205 is configured to input the SAR image to be converted into the trained SAR image noise reduction generator to obtain a visible light grayscale image to be converted, and input the visible light grayscale image to be converted into the trained visible light image generator to obtain a converted visible light color image.
The SAR image noise reduction generator is of an improved U-Net network structure, its input layer and output layer are both single-channel, and adjacent single-layer convolutional layers in it are connected by a residual block and a skip connection.
The visible light image generator is of an improved U-Net network structure, and the number of its output-layer channels is 3.
The first gradient discriminator and the second gradient discriminator are both Markov discriminators.
The first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents its reconstruction loss; $\mathcal{L}_{grad}(P)$ represents its texture retention loss; $P$ represents the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images.

The second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents its reconstruction loss; $\mathcal{L}_{grad}(Q)$ represents its texture retention loss; $Q$ represents the visible light image generator, and $z^{(i)}$ represents the set of visible light color images.

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter.
stopping training the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator and the second gradient discriminator when the sum of the first training loss function and the second training loss function is less than a set threshold.
The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description.
The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.

Claims (10)

1. A heterogeneous remote sensing image transformation method, characterized by comprising:
denoising a plurality of SAR images to obtain a visible light grayscale image of each SAR image;
obtaining a visible light color image of each SAR image;
taking the SAR image as the input of an SAR image noise reduction generator and the visible light grayscale image as the output of the SAR image noise reduction generator, discriminating the visible light grayscale image output by the SAR image noise reduction generator with a first gradient discriminator, and training the SAR image noise reduction generator and the first gradient discriminator;
taking the visible light grayscale image as the input of a visible light image generator and the visible light color image as the output of the visible light image generator, discriminating the visible light color image output by the visible light image generator with a second gradient discriminator, and training the visible light image generator and the second gradient discriminator;
inputting the SAR image to be transformed into the trained SAR image noise reduction generator to obtain a visible light grayscale image to be transformed, and inputting the visible light grayscale image to be transformed into the trained visible light image generator to obtain a transformed visible light color image.
2. The heterogeneous remote sensing image transformation method according to claim 1, wherein the SAR image noise reduction generator is of an improved U-Net network structure, the input layer and output layer of the SAR image noise reduction generator are both single-channel, and adjacent single-layer convolutional layers in the SAR image noise reduction generator are connected by a residual block and a skip connection.
3. The heterogeneous remote sensing image transformation method according to claim 1, wherein the visible light image generator is of an improved U-Net network structure, and the number of output layer channels of the visible light image generator is 3.
4. The heterogeneous remote sensing image transformation method of claim 1, wherein the first gradient discriminator and the second gradient discriminator are both Markov discriminators.
5. The heterogeneous remote sensing image transformation method of claim 1, wherein the first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents its reconstruction loss; $\mathcal{L}_{grad}(P)$ represents its texture retention loss; $P$ represents the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images;

the second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents its reconstruction loss; $\mathcal{L}_{grad}(Q)$ represents its texture retention loss; $Q$ represents the visible light image generator, and $z^{(i)}$ represents the set of visible light color images;

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter;
stopping training the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator and the second gradient discriminator when the sum of the first training loss function and the second training loss function is less than a set threshold.
6. A heterogeneous remote sensing image transformation system, characterized in that the system comprises:
a visible light grayscale image acquisition module, used for denoising a plurality of SAR images to obtain a visible light grayscale image of each SAR image;
a visible light color image acquisition module, used for obtaining a visible light color image of each SAR image;
a first training module, used for taking the SAR image as the input of the SAR image noise reduction generator and the visible light grayscale image as its output, and discriminating the visible light grayscale image output by the SAR image noise reduction generator with a first gradient discriminator, to train the SAR image noise reduction generator and the first gradient discriminator;
a second training module, used for taking the visible light grayscale image as the input of the visible light image generator and the visible light color image as its output, and discriminating the visible light color image output by the visible light image generator with a second gradient discriminator, to train the visible light image generator and the second gradient discriminator;
an SAR image conversion module, used for inputting the SAR image to be transformed into the trained SAR image noise reduction generator to obtain the visible light grayscale image to be transformed, and inputting the visible light grayscale image to be transformed into the trained visible light image generator to obtain the transformed visible light color image.
7. The heterogeneous remote sensing image transformation system of claim 6, wherein the SAR image noise reduction generator is of an improved U-Net network structure, the input layer and output layer of the SAR image noise reduction generator are both single-channel, and adjacent single-layer convolutional layers in the SAR image noise reduction generator are connected by a residual block and a skip connection.
8. The heterogeneous remote sensing image transformation system of claim 6, wherein the visible light image generator is of a modified U-Net network structure, and the number of output layer channels of the visible light image generator is 3.
9. The heterogeneous remote sensing image transformation system of claim 6, wherein the first gradient discriminator and the second gradient discriminator are both Markov discriminators.
10. The heterogeneous remote sensing image transformation system of claim 6, wherein the first training loss function for training the SAR image noise reduction generator and the first gradient discriminator is:

$\mathcal{L}_{GAN_1} = \mathcal{L}_{cGAN}(P, D_1) + \alpha_1 \mathcal{L}_{L1}(P) + \beta_1 \mathcal{L}_{grad}(P)$

wherein $\mathcal{L}_{cGAN}(P, D_1)$ represents the conditional generative adversarial network loss of the SAR image noise reduction generator from inputting the SAR image to outputting a visible light grayscale image; $\mathcal{L}_{L1}(P)$ represents its reconstruction loss; $\mathcal{L}_{grad}(P)$ represents its texture retention loss; $P$ denotes the SAR image noise reduction generator, $x^{(i)}$ represents the set of SAR images, and $y^{(i)}$ represents the set of visible light grayscale images;

the second training loss function for training the visible light image generator and the second gradient discriminator is:

$\mathcal{L}_{GAN_2} = \mathcal{L}_{cGAN}(Q, D_2) + \alpha_2 \mathcal{L}_{L1}(Q) + \beta_2 \mathcal{L}_{grad}(Q)$

wherein $\mathcal{L}_{cGAN}(Q, D_2)$ represents the conditional generative adversarial network loss of the visible light image generator from inputting the visible light grayscale image to outputting a visible light color image; $\mathcal{L}_{L1}(Q)$ represents its reconstruction loss; $\mathcal{L}_{grad}(Q)$ represents its texture retention loss; $Q$ denotes the visible light image generator, and $z^{(i)}$ represents the set of visible light color images;

$\alpha_1$ denotes a first weight parameter, $\beta_1$ a second weight parameter, $\alpha_2$ a third weight parameter, and $\beta_2$ a fourth weight parameter;
and training of the SAR image noise reduction generator, the first gradient discriminator, the visible light image generator, and the second gradient discriminator is stopped when the sum of the first training loss function and the second training loss function is less than a set threshold.
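Concretely, one generator's loss under claim 10 might be computed as in the sketch below. Since the claim text does not expand the texture retention term, an image-gradient L1 penalty is substituted here as a labeled assumption, and the names generator_loss and gradient_l1 are invented for illustration.

import torch
import torch.nn.functional as F

def gradient_l1(a, b):
    # Assumed texture-retention term: L1 distance between horizontal and
    # vertical image gradients (a stand-in; the patent's exact term is not
    # reproduced in this text).
    dx = lambda t: t[..., :, 1:] - t[..., :, :-1]
    dy = lambda t: t[..., 1:, :] - t[..., :-1, :]
    return F.l1_loss(dx(a), dx(b)) + F.l1_loss(dy(a), dy(b))

def generator_loss(fake, real, d_logits_fake, alpha, beta):
    # cGAN term: the generator tries to make the discriminator score "real".
    adv = F.binary_cross_entropy_with_logits(
        d_logits_fake, torch.ones_like(d_logits_fake))
    rec = F.l1_loss(fake, real)          # reconstruction term
    tex = gradient_l1(fake, real)        # texture-retention term (assumed form)
    return adv + alpha * rec + beta * tex

# Stopping rule of claim 10: halt once L1 + L2 falls below the set threshold,
# e.g.  if loss_stage1.item() + loss_stage2.item() < threshold: break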
CN202011236018.3A 2020-11-09 2020-11-09 Heterogeneous remote sensing image transformation method and system Active CN112330562B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011236018.3A CN112330562B (en) 2020-11-09 2020-11-09 Heterogeneous remote sensing image transformation method and system

Publications (2)

Publication Number Publication Date
CN112330562A (en) 2021-02-05
CN112330562B (en) 2022-11-15

Family

ID=74316926

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011236018.3A Active CN112330562B (en) 2020-11-09 2020-11-09 Heterogeneous remote sensing image transformation method and system

Country Status (1)

Country Link
CN (1) CN112330562B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105976345A (en) * 2016-04-27 2016-09-28 北京航空航天大学 Visible light remote sensing image synthesis method
CN109636742A (en) * 2018-11-23 2019-04-16 中国人民解放军空军研究院航空兵研究所 The SAR image of network and the mode conversion method of visible images are generated based on confrontation
CN110473142A (en) * 2019-05-22 2019-11-19 南京理工大学 Single image super resolution ratio reconstruction method based on deep learning
CN110751699A (en) * 2019-10-15 2020-02-04 西安电子科技大学 Color reconstruction method of optical remote sensing image based on convolutional neural network

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Cheng Ma et al., "Structure-Preserving Super Resolution with Gradient Guidance", arXiv *
Long Guo et al., "Gray Scale Image Coloring Method Based on GAN", DSIT 2020: Proceedings of the 3rd International Conference on Data Science and Information *
Phillip Isola et al., "Image-to-Image Translation with Conditional Adversarial Networks", IEEE *
Puyang Wang et al., "Generating high quality visible images from SAR", IEEE *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113012251A (en) * 2021-03-17 2021-06-22 厦门大学 SAR image automatic colorization method based on generation countermeasure network
CN113012251B (en) * 2021-03-17 2022-05-03 厦门大学 SAR image automatic colorization method based on generation countermeasure network
CN113554671A (en) * 2021-06-23 2021-10-26 西安电子科技大学 Method and device for converting SAR image into visible light image based on contour enhancement

Also Published As

Publication number Publication date
CN112330562B (en) 2022-11-15

Similar Documents

Publication Publication Date Title
CN110956094B (en) RGB-D multi-mode fusion personnel detection method based on asymmetric double-flow network
CN111539316B (en) High-resolution remote sensing image change detection method based on dual-attention twin network
Lei et al. Multiscale superpixel segmentation with deep features for change detection
Liu et al. Remote sensing image change detection based on information transmission and attention mechanism
CN109871902B (en) SAR small sample identification method based on super-resolution countermeasure generation cascade network
CN114841972A (en) Power transmission line defect identification method based on saliency map and semantic embedded feature pyramid
CN109558806A (en) The detection method and system of high score Remote Sensing Imagery Change
CN109741340B (en) Ice cover radar image ice layer refined segmentation method based on FCN-ASPP network
CN113838064B (en) Cloud removal method based on branch GAN using multi-temporal remote sensing data
CN109635726B (en) Landslide identification method based on combination of symmetric deep network and multi-scale pooling
Wang et al. The poor generalization of deep convolutional networks to aerial imagery from new geographic locations: an empirical study with solar array detection
CN114255403A (en) Optical remote sensing image data processing method and system based on deep learning
CN112330562B (en) Heterogeneous remote sensing image transformation method and system
CN113887472B (en) Remote sensing image cloud detection method based on cascade color and texture feature attention
CN117765361B (en) Method for detecting building change area in double-time-phase remote sensing image based on residual neural network
CN115049841A (en) Depth unsupervised multistep anti-domain self-adaptive high-resolution SAR image surface feature extraction method
CN109977968A (en) A kind of SAR change detecting method of deep learning classification and predicting
CN105512622A (en) Visible remote-sensing image sea-land segmentation method based on image segmentation and supervised learning
Ge et al. Coarse-to-fine foraminifera image segmentation through 3D and deep features
CN115861756A (en) Earth background small target identification method based on cascade combination network
CN114842524A (en) Face false distinguishing method based on irregular significant pixel cluster
Zhang et al. Spatial–temporal gray-level co-occurrence aware CNN for SAR image change detection
CN116168240A (en) Arbitrary-direction dense ship target detection method based on attention enhancement
CN112784777B (en) Unsupervised hyperspectral image change detection method based on countermeasure learning
CN117994240A (en) Multi-scale two-level optical remote sensing image stripe noise intelligent detection method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant