CN112446859A - Satellite-borne thermal infrared camera image cloud detection method based on deep learning - Google Patents

Satellite-borne thermal infrared camera image cloud detection method based on deep learning

Info

Publication number
CN112446859A
Authority
CN
China
Prior art keywords
thermal infrared
dice
image
convolution
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011292436.4A
Other languages
Chinese (zh)
Inventor
陈凡胜
李丽圆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Institute of Technical Physics of CAS
Original Assignee
Shanghai Institute of Technical Physics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Institute of Technical Physics of CAS filed Critical Shanghai Institute of Technical Physics of CAS
Priority to CN202011292436.4A priority Critical patent/CN112446859A/en
Publication of CN112446859A publication Critical patent/CN112446859A/en
Pending legal-status Critical Current

Classifications

    • G06T 7/0002 — Image analysis; inspection of images, e.g. flaw detection
    • G06F 18/214 — Pattern recognition; generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06N 3/045 — Neural networks; combinations of networks
    • G06N 3/084 — Learning methods; backpropagation, e.g. using gradient descent
    • G06T 5/50 — Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • G06T 2207/10032 — Satellite or aerial image; remote sensing
    • G06T 2207/10048 — Infrared image
    • G06T 2207/20021 — Dividing image into blocks, subimages or windows
    • G06T 2207/20221 — Image fusion; image merging


Abstract

The invention discloses a deep-learning-based cloud detection method for satellite-borne thermal infrared camera images, comprising the following steps: (1) converting the uncalibrated thermal infrared images' digital quantization values into top-of-atmosphere brightness temperature values, fusing all thermal infrared bands into a multi-channel image, cutting the remote sensing images, and constructing a data set; (2) taking a fully convolutional neural network combined with a residual network as the basic structure, and introducing dilated (atrous) convolution in place of the ordinary convolution and pooling operations to construct a new fusion neural network; (3) before training the model, setting a loss function with weighted terms; (4) training on the constructed data set with the back-propagation algorithm until the new model converges, yielding the deep learning model; (5) using this model to classify individual pixels as cloud or non-cloud targets. The model is lightweight, generalizes well, and suits a variety of scenes and multi-channel images.

Description

Satellite-borne thermal infrared camera image cloud detection method based on deep learning
Technical Field
The invention belongs to the technical field of remote sensing image detection and deep learning, and particularly relates to a satellite-borne thermal infrared camera image cloud detection method based on deep learning.
Background
According to estimates from the Global Energy and Water Cycle Experiment (GEWEX) cloud assessment database, the global annual average cloud coverage is about 68%. When a satellite observes the earth's surface, cloud layers block surface information, while in meteorology extreme weather can be predicted by analyzing cloud motion; remote sensing image cloud detection is therefore an important step in many remote sensing applications. More importantly, in all-day observation with thermal infrared band images, excessive cloud information occupies satellite-to-ground transmission resources, so on-orbit cloud detection for satellite-borne infrared camera images has become a trend in the remote sensing field.
Current cloud detection methods comprise traditional methods and intelligent methods. Traditional cloud detection methods struggle to distinguish cloud layers from the surface in images with complex backgrounds and rely too heavily on human experience, producing many missed detections and false detections. To address this problem, intelligent methods have been proposed for cloud detection. Deep learning networks eliminate the manual feature-parameter setting of traditional methods, weakening subjective influence and improving the robustness of the detection algorithm. Fully convolutional networks and U-type networks recast cloud detection as a pixel-level classification problem, training end-to-end on remote sensing images through an encoder-decoder structure. However, fully convolutional networks suffer from network degradation, and their pooling operations reduce resolution, lowering detection accuracy. U-type networks require many floating-point operations and demand high hardware computing capacity, which is unfavorable for on-orbit cloud detection.
In summary, to meet the requirements of all-day, accurate, real-time, on-orbit cloud detection, there is an urgent need to develop a cloud detection method for satellite-borne thermal infrared camera images.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides a satellite-borne thermal infrared camera image cloud detection method based on deep learning, and solves the problem of satellite-borne thermal infrared camera image cloud detection.
The technical problem to be solved by the invention is realized by adopting the following technical scheme:
1) converting an uncalibrated thermal infrared image digital quantization value into an atmospheric top layer brightness temperature value, fusing all thermal infrared bands into a multi-channel image, cutting a remote sensing image, and constructing a data set, wherein the method comprises the following specific steps:
1-1) converting the digital quantization value of the uncalibrated thermal infrared image into the physically meaningful top-of-atmosphere brightness temperature value, with the conversion expressed by the following equations:

TOA_RA = M_L · DN + A_L    (1)

TOA_BT = K_2 / ln(K_1 / TOA_RA + 1)    (2)

where TOA_RA is the top-of-atmosphere radiance value, DN is the digital quantization value, M_L is the multiplicative rescaling factor of the thermal infrared band, A_L is the additive rescaling factor of the thermal infrared band, TOA_BT is the top-of-atmosphere brightness temperature value, and K_1 and K_2 are the thermal conversion constants of the thermal infrared band;
1-2) fusing all thermal infrared bands into a multi-channel image using the GDAL library;
1-3) cutting the remote sensing image into n×n patches, where n is between 200 and 600;
1-4) when the data volume is small, the validation set and the test set are given equal proportions totaling 20%-40%, and the training set accounts for 60%-80%;
2) taking a fully convolutional neural network combined with a residual network as the basic structure, introducing dilated convolution to replace the ordinary convolution and pooling operations, and constructing a new fusion neural network, with the residual module and the dilated convolution expressed by the following equations:
y=F(x)+x (3)
wherein x and y represent the input and output of the residual module, and F(x) is the residual function to be learned;
(F *_l k)(p) = Σ_{s+lt=p} F(s) k(t)    (4)

where F is a discrete function, *_l denotes dilated convolution, k is a discrete filter, s is a step, l is the dilation factor, t is an argument, F(s) is the discrete function evaluated at step s, and k(t) is the discrete filter evaluated at t;
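As a concrete illustration of equations (3) and (4), the sketch below implements a one-dimensional dilated convolution and a residual connection in NumPy; the function names and the toy signal are our own illustrations, not part of the patent:

```python
import numpy as np

def dilated_conv1d(f, k, l):
    """Dilated (atrous) convolution per equation (4):
    (F *_l k)(p) = sum over all s, t with s + l*t = p of F(s) * k(t).
    With l = 1 this reduces to ordinary discrete convolution."""
    out = np.zeros(len(f) + l * (len(k) - 1))
    for t, kt in enumerate(k):
        for s, fs in enumerate(f):
            out[s + l * t] += fs * kt  # accumulate the term F(s) * k(t) at p = s + l*t
    return out

def residual_block(x, residual_fn):
    """Residual connection per equation (3): y = F(x) + x."""
    return residual_fn(x) + x

signal = np.array([1.0, 2.0, 3.0, 4.0])
kernel = np.array([1.0, -1.0])
ordinary = dilated_conv1d(signal, kernel, 1)  # filter taps adjacent
dilated = dilated_conv1d(signal, kernel, 2)   # taps spread apart by the dilation factor
```

With l = 2 the same two-tap filter covers a span of three samples, which is how stacking dilated layers grows the receptive field without pooling.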
3) before training the model, setting a loss function with weighted terms, where the loss function L is the weighted sum of the Dice loss, suited to extremely imbalanced samples, and the binary cross-entropy loss, suited to class-balanced samples, expressed by the following equations:
L = W_Bce · L_Bce + W_Dice · L_Dice    (5)

L_Bce = -Σ_i [T_i ln(P_i) + (1 - T_i) ln(1 - P_i)]    (6)

L_Dice = 1 - (2 Σ_i P_i T_i) / (Σ_i P_i + Σ_i T_i)    (7)

W_Bce + W_Dice = 1    (8)

where L is the loss function of the method, L_Bce is the binary cross-entropy loss, L_Dice is the Dice loss, P is the network prediction, T is the ground truth, i is the index variable, W_Dice, the Dice loss weight, lies in the range 0 to 0.2, and W_Bce, the binary cross-entropy loss weight, lies in the range 0.8 to 1;
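A minimal NumPy sketch of the weighted loss of equations (5)-(8); the function name, the clipping guard, and the use of a plain sum for the cross-entropy term are our own choices, not specified by the patent:

```python
import numpy as np

def weighted_loss(p, t, w_dice=0.2, w_bce=0.8):
    """Weighted sum (eq. 5) of binary cross-entropy (eq. 6) and Dice loss (eq. 7).
    p: predicted cloud probabilities; t: ground-truth mask in {0, 1};
    the weights satisfy w_bce + w_dice = 1 (eq. 8)."""
    eps = 1e-7
    p = np.clip(p, eps, 1.0 - eps)  # guard the logarithms against log(0)
    l_bce = -np.sum(t * np.log(p) + (1 - t) * np.log(1 - p))
    l_dice = 1.0 - 2.0 * np.sum(p * t) / (np.sum(p) + np.sum(t) + eps)
    return w_bce * l_bce + w_dice * l_dice
```

A perfect prediction drives both terms toward zero, while a completely wrong mask is punished mainly through the cross-entropy term.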
4) training on the constructed data set via the back-propagation algorithm until the new model converges, obtaining the deep learning model FCN_ResNet_k;
5) individual pixels are determined to be cloud or non-cloud targets using this model.
The invention has the advantages and positive effects that:
1. the invention takes the fully convolutional neural network combined with the residual network as the basic structure, effectively avoiding the network degradation caused by deeper networks. Dilated convolution is introduced to replace the ordinary convolution and pooling operations, overcoming the resolution loss caused by pooling in the original model and expanding the receptive field exponentially. A new fusion neural network, FCN_ResNet_k, is constructed for cloud detection.
2. The invention has reasonable design and less floating point operation times required by the network structure, and can improve the reasoning speed and the network efficiency. The method has the advantages of light weight, strong generalization capability, suitability for various scenes and multi-channel images and the like, and can be applied and popularized in the field of satellite-borne heat infrared camera image cloud detection.
Drawings
Fig. 1 is a diagram illustrating an exemplary remote sensing image detection process according to an embodiment of the present invention;
fig. 2 is an exemplary diagram of the remote sensing image preprocessing process according to an embodiment of the present invention, where (a) and (b) are original images of the 10 µm-11.19 µm and 11.5 µm-12.51 µm bands, respectively, (c) and (d) are the corresponding images after converting digital quantization values into top-of-atmosphere brightness temperature values, and (e) is the two-channel image obtained by fusing the two thermal infrared bands.
Detailed Description
The following takes thermal infrared band cloud detection as an example and describes embodiments of the present invention in detail with reference to the accompanying drawings:
the invention realizes the high-precision infrared band cloud detection method by constructing an improved FCN model and introducing a voting principle to vote by using three different models. The method mainly comprises the following steps:
1) converting an uncalibrated thermal infrared image digital quantization value into an atmospheric top layer brightness temperature value, fusing all thermal infrared bands into a multi-channel image, cutting a remote sensing image, and constructing a data set, wherein the method comprises the following specific steps:
1-1) converting the digital quantization value of the uncalibrated thermal infrared image into the physically meaningful top-of-atmosphere brightness temperature value, with the conversion expressed by the following equations:

TOA_RA = M_L · DN + A_L    (1)

TOA_BT = K_2 / ln(K_1 / TOA_RA + 1)    (2)

where TOA_RA is the top-of-atmosphere radiance value, DN is the digital quantization value, M_L = 3.342×10⁻⁴ is the multiplicative rescaling factor of the thermal infrared band, A_L = 0.1 is the additive rescaling factor of the thermal infrared band, TOA_BT is the top-of-atmosphere brightness temperature value, and K_1 and K_2 are the thermal conversion constants of the thermal infrared band; for the 10 µm-11.19 µm band, K_1 = 774.89 and K_2 = 1321.08 are selected, and for the 11.5 µm-12.51 µm band, K_1 = 480.89 and K_2 = 1201.14;
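For illustration, equations (1)-(2) with the 10 µm-11.19 µm band constants above can be sketched as follows; the function name and the sample DN value are illustrative, not from the patent:

```python
import math

# Radiometric constants from the embodiment for the 10 um-11.19 um band
M_L = 3.342e-4   # multiplicative rescaling factor
A_L = 0.1        # additive rescaling factor
K1 = 774.89      # thermal conversion constant K_1
K2 = 1321.08     # thermal conversion constant K_2

def dn_to_brightness_temperature(dn):
    """Convert an uncalibrated digital number to top-of-atmosphere
    brightness temperature in kelvin via equations (1) and (2)."""
    toa_ra = M_L * dn + A_L                   # eq. (1): TOA radiance
    return K2 / math.log(K1 / toa_ra + 1.0)   # eq. (2): TOA brightness temperature

bt = dn_to_brightness_temperature(25000)  # a hypothetical raw DN
```

Brighter (warmer) pixels carry larger DN values, and the conversion is monotonic, so cloud tops, which are cold, map to low brightness temperatures.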
1-2) fusing the two thermal infrared bands of 10 µm-11.19 µm and 11.5 µm-12.51 µm into a two-channel image using the GDAL library, and converting the 16-bit remote sensing images into 8-bit images;
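The band fusion and 16-bit to 8-bit conversion of step 1-2) can be sketched without GDAL (which the patent uses for the actual raster I/O); the per-channel min-max rescaling below is one plausible quantization, since the patent does not specify the exact mapping:

```python
import numpy as np

def fuse_and_quantize(band_a, band_b):
    """Stack two thermal infrared bands into one two-channel image and
    rescale each 16-bit channel to 8 bits by per-channel min-max scaling."""
    fused = np.stack([band_a, band_b], axis=-1).astype(np.float64)
    lo = fused.min(axis=(0, 1), keepdims=True)  # per-channel minimum
    hi = fused.max(axis=(0, 1), keepdims=True)  # per-channel maximum
    return np.round((fused - lo) / (hi - lo) * 255.0).astype(np.uint8)

# Tiny hypothetical 2x2 bands standing in for full 16-bit scenes
a = np.array([[0, 65535], [32768, 16384]], dtype=np.uint16)
b = np.array([[100, 200], [300, 400]], dtype=np.uint16)
two_channel = fuse_and_quantize(a, b)  # shape (2, 2, 2), dtype uint8
```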
1-3) cutting the 96 two-channel images into n×n patches, where n = 224, yielding 38400 patches of 224×224 after cutting;
1-4) since the data volume is small, the training : validation : test ratio is set to 8 : 1 : 1; the data set comprises 32000 training patches, 3200 validation patches, and 3200 test patches;
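Steps 1-3) and 1-4) can be sketched as follows; the 4480×4480 scene size is a hypothetical choice that yields the 400 patches per scene (96 × 400 = 38400) implied by the embodiment:

```python
import numpy as np

def cut_patches(image, n=224):
    """Cut an (H, W, C) image into non-overlapping n x n patches;
    any border remainder smaller than n is discarded in this sketch."""
    rows, cols = image.shape[0] // n, image.shape[1] // n
    return [image[i * n:(i + 1) * n, j * n:(j + 1) * n]
            for i in range(rows) for j in range(cols)]

scene = np.zeros((4480, 4480, 2), dtype=np.uint8)  # hypothetical two-channel scene
patches = cut_patches(scene)                        # 20 x 20 = 400 patches

# 8 : 1 : 1 random split of step 1-4)
rng = np.random.default_rng(0)
idx = rng.permutation(len(patches))
n_train, n_val = int(0.8 * len(idx)), int(0.1 * len(idx))
train_idx = idx[:n_train]
val_idx = idx[n_train:n_train + n_val]
test_idx = idx[n_train + n_val:]
```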
2) taking a fully convolutional neural network combined with a residual network as the basic structure, introducing dilated convolution to replace the ordinary convolution and pooling operations, and constructing a new model, with the residual module and the dilated convolution expressed by the following equations:
y=F(x)+x (3)
wherein x and y represent the input and output of the residual module, and F(x) is the residual function to be learned;
(F *_l k)(p) = Σ_{s+lt=p} F(s) k(t)    (4)

where F is a discrete function, *_l denotes dilated convolution, k is a discrete filter, s is a step, l is the dilation factor, t is an argument, F(s) is the discrete function evaluated at step s, and k(t) is the discrete filter evaluated at t;
3) before training the model, setting a loss function with weighted terms, where the loss function L is the weighted sum of the Dice loss, suited to extremely imbalanced samples, and the binary cross-entropy loss, suited to class-balanced samples, expressed by the following equations:
L = W_Bce · L_Bce + W_Dice · L_Dice    (5)

L_Bce = -Σ_i [T_i ln(P_i) + (1 - T_i) ln(1 - P_i)]    (6)

L_Dice = 1 - (2 Σ_i P_i T_i) / (Σ_i P_i + Σ_i T_i)    (7)

W_Bce + W_Dice = 1    (8)

where P is the network prediction, T is the ground truth, L_Dice is the Dice loss, L_Bce is the binary cross-entropy loss, i is the index variable, W_Dice, the Dice loss weight, is taken as 0.2, W_Bce, the binary cross-entropy loss weight, is taken as 0.8, and L is the loss function of the method;
4) training by using the constructed data set through a back propagation algorithm until a new model converges to obtain a deep learning model FCN _ ResNet _ k;
5) individual pixels are determined to be cloud or non-cloud targets using this model.
6) calculating the number of floating-point operations of the model to measure its computational complexity. The new fusion neural network requires 7445043712 floating-point operations and the U-type network requires 85683945472; the computational complexity of the U-type network is thus about 11.51 times that of the new fusion neural network.
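The complexity comparison in step 6) can be checked directly from the reported FLOP counts:

```python
# Forward-pass FLOP counts reported in the embodiment above
flops_fusion_net = 7_445_043_712   # new fusion neural network (FCN_ResNet_k)
flops_u_net = 85_683_945_472       # U-type network
ratio = flops_u_net / flops_fusion_net
print(f"U-type network cost is about {ratio:.2f}x the fusion network's")
```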

Claims (1)

1. A satellite-borne thermal infrared camera image cloud detection method based on deep learning is characterized by comprising the following steps:
1) converting an uncalibrated thermal infrared image digital quantization value into an atmospheric top layer brightness temperature value, fusing all thermal infrared bands into a multi-channel image, cutting a remote sensing image, and constructing a data set, wherein the method comprises the following specific steps:
1-1) converting the digital quantization value of the uncalibrated thermal infrared image into the physically meaningful top-of-atmosphere brightness temperature value, with the conversion expressed by the following equations:

TOA_RA = M_L · DN + A_L    (1)

TOA_BT = K_2 / ln(K_1 / TOA_RA + 1)    (2)

where TOA_RA is the top-of-atmosphere radiance value, DN is the digital quantization value, M_L is the multiplicative rescaling factor of the thermal infrared band, A_L is the additive rescaling factor of the thermal infrared band, TOA_BT is the top-of-atmosphere brightness temperature value, and K_1 and K_2 are the thermal conversion constants of the thermal infrared band;
1-2) fusing all thermal infrared bands into a multi-channel image by using a GDAL library in software;
1-3) cutting the remote sensing image into n×n patches, where n is between 200 and 600;
1-4) when the data volume is small, the validation set and the test set are given equal proportions totaling 20%-40%, and the training set accounts for 60%-80%;
2) taking a fully convolutional neural network combined with a residual network as the basic structure, introducing dilated convolution to replace the ordinary convolution and pooling operations, and constructing a new fusion neural network, with the residual module and the dilated convolution expressed by the following equations:
y=F(x)+x (3)
wherein x and y represent the input and output of the residual module, and F(x) is the residual function to be learned;
(F *_l k)(p) = Σ_{s+lt=p} F(s) k(t)    (4)

where F is a discrete function, *_l denotes dilated convolution, k is a discrete filter, s is a step, l is the dilation factor, t is an argument, F(s) is the discrete function evaluated at step s, and k(t) is the discrete filter evaluated at t;
3) before training the model, setting a loss function with weighted terms, where the loss function L is the weighted sum of the Dice loss, suited to extremely imbalanced samples, and the binary cross-entropy loss, suited to class-balanced samples, expressed by the following equations:
L = W_Bce · L_Bce + W_Dice · L_Dice    (5)

L_Bce = -Σ_i [T_i ln(P_i) + (1 - T_i) ln(1 - P_i)]    (6)

L_Dice = 1 - (2 Σ_i P_i T_i) / (Σ_i P_i + Σ_i T_i)    (7)

W_Bce + W_Dice = 1    (8)

where P is the network prediction, T is the ground truth, L_Dice is the Dice loss, L_Bce is the binary cross-entropy loss, i is the index variable, W_Dice, the Dice loss weight, lies in the range 0 to 0.2, W_Bce, the binary cross-entropy loss weight, lies in the range 0.8 to 1, and L is the loss function of the method;
4) training on the constructed data set via the back-propagation algorithm until the new model converges to obtain the deep learning model;
5) individual pixels are determined to be cloud or non-cloud targets using this model.
CN202011292436.4A 2020-11-18 2020-11-18 Satellite-borne thermal infrared camera image cloud detection method based on deep learning Pending CN112446859A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011292436.4A CN112446859A (en) 2020-11-18 2020-11-18 Satellite-borne thermal infrared camera image cloud detection method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011292436.4A CN112446859A (en) 2020-11-18 2020-11-18 Satellite-borne thermal infrared camera image cloud detection method based on deep learning

Publications (1)

Publication Number Publication Date
CN112446859A 2021-03-05

Family

ID=74737473

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011292436.4A Pending CN112446859A (en) 2020-11-18 2020-11-18 Satellite-borne thermal infrared camera image cloud detection method based on deep learning

Country Status (1)

Country Link
CN (1) CN112446859A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115564778A (en) * 2022-12-06 2023-01-03 深圳思谋信息科技有限公司 Defect detection method and device, electronic equipment and computer readable storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200058126A1 (en) * 2018-08-17 2020-02-20 12 Sigma Technologies Image segmentation and object detection using fully convolutional neural network
CN111274865A (en) * 2019-12-14 2020-06-12 深圳先进技术研究院 Remote sensing image cloud detection method and device based on full convolution neural network

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200058126A1 (en) * 2018-08-17 2020-02-20 12 Sigma Technologies Image segmentation and object detection using fully convolutional neural network
CN111274865A (en) * 2019-12-14 2020-06-12 深圳先进技术研究院 Remote sensing image cloud detection method and device based on full convolution neural network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Zhang Dandan: "High-Resolution Remote Sensing Image Processing and Applications Based on Deep Neural Network Technology", China Astronautic Publishing House, 31 August 2020 *
Zhang Jiaqiang et al.: "Cloud Detection Method for Landsat 8 Remote Sensing Images Based on a Deep Residual Fully Convolutional Network", Laser & Optoelectronics Progress *
Zhao Zhigang: "Regional Land Resource Research and Agricultural Planning Examples: The Case of Yuanzhou District, Yichun City", Scientific and Technical Documentation Press, 31 October 2017 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115564778A (en) * 2022-12-06 2023-01-03 深圳思谋信息科技有限公司 Defect detection method and device, electronic equipment and computer readable storage medium
CN115564778B (en) * 2022-12-06 2023-03-14 深圳思谋信息科技有限公司 Defect detection method and device, electronic equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN112507793B (en) Ultra-short term photovoltaic power prediction method
CN108960333B (en) Hyperspectral image lossless compression method based on deep learning
CN112634292A (en) Asphalt pavement crack image segmentation method based on deep convolutional neural network
CN110210378B (en) Embedded video image analysis method and device based on edge calculation
CN111582555B (en) Photovoltaic power prediction method based on foundation cloud picture image characteristics
CN110826693A (en) Three-dimensional atmospheric temperature profile inversion method and system based on DenseNet convolutional neural network
CN113239830A (en) Remote sensing image cloud detection method based on full-scale feature fusion
CN112883887B (en) Building instance automatic extraction method based on high spatial resolution optical remote sensing image
CN112232543A (en) Multi-site prediction method based on graph convolution network
CN114444757A (en) Combined prediction method for plateau mountain multi-model multi-scale new energy power station output
CN114120036A (en) Lightweight remote sensing image cloud detection method
CN115935809A (en) Photovoltaic power generation power interval prediction method based on cloud cover characteristic of foundation cloud cover
CN115578280A (en) Construction method of double-branch remote sensing image defogging network
CN116205962A (en) Monocular depth estimation method and system based on complete context information
CN115393690A (en) Light neural network air-to-ground observation multi-target identification method
Zhou et al. MSAR‐DefogNet: Lightweight cloud removal network for high resolution remote sensing images based on multi scale convolution
CN112446859A (en) Satellite-borne thermal infrared camera image cloud detection method based on deep learning
CN113705340B (en) Deep learning change detection method based on radar remote sensing data
CN114373104A (en) Three-dimensional point cloud semantic segmentation method and system based on dynamic aggregation
CN114511787A (en) Neural network-based remote sensing image ground feature information generation method and system
CN117197632A (en) Transformer-based electron microscope pollen image target detection method
CN116720156A (en) Weather element forecasting method based on graph neural network multi-mode weather data fusion
CN116523875A (en) Insulator defect detection method based on FPGA pretreatment and improved YOLOv5
CN115561834A (en) Meteorological short-term and temporary forecasting all-in-one machine based on artificial intelligence
CN113034445B (en) Multi-scale connection image defogging algorithm based on UNet3+

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication (application publication date: 20210305)