CN116384237A - Thermal infrared atmospheric parameter inversion method and device and electronic equipment - Google Patents

Thermal infrared atmospheric parameter inversion method and device and electronic equipment Download PDF

Info

Publication number
CN116384237A
CN116384237A CN202310327016.2A CN202310327016A CN116384237A CN 116384237 A CN116384237 A CN 116384237A CN 202310327016 A CN202310327016 A CN 202310327016A CN 116384237 A CN116384237 A CN 116384237A
Authority
CN
China
Prior art keywords
neural network
term memory
short
network model
memory neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310327016.2A
Other languages
Chinese (zh)
Other versions
CN116384237B (en
Inventor
赵恩宇
王玉磊
马成健
高彩霞
段四波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Maritime University
Aerospace Information Research Institute of CAS
Institute of Agricultural Resources and Regional Planning of CAAS
Original Assignee
Dalian Maritime University
Aerospace Information Research Institute of CAS
Institute of Agricultural Resources and Regional Planning of CAAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian Maritime University, Aerospace Information Research Institute of CAS, Institute of Agricultural Resources and Regional Planning of CAAS filed Critical Dalian Maritime University
Priority to CN202310327016.2A priority Critical patent/CN116384237B/en
Publication of CN116384237A publication Critical patent/CN116384237A/en
Application granted granted Critical
Publication of CN116384237B publication Critical patent/CN116384237B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01WMETEOROLOGY
    • G01W1/00Meteorology
    • G01W1/02Instruments for indicating weather conditions by measuring two or more variables, e.g. humidity, pressure, temperature, cloud cover or wind speed
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • G06N3/0442Recurrent networks, e.g. Hopfield networks characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Environmental & Geological Engineering (AREA)
  • Environmental Sciences (AREA)
  • Ecology (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Geometry (AREA)
  • Computer Hardware Design (AREA)
  • Atmospheric Sciences (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Transforming Light Signals Into Electric Signals (AREA)
  • Radiation Pyrometers (AREA)
  • Photometry And Measurement Of Optical Pulse Characteristics (AREA)
  • Image Analysis (AREA)

Abstract

The invention belongs to the technical field of remote sensing image processing, and relates to a thermal infrared atmospheric parameter inversion method, a device and electronic equipment, wherein the method comprises the following steps: extracting surface emissivity information and atmospheric information; determining atmospheric parameters and establishing a simulation data set; constructing a two-way long-short-term memory neural network model; model training to determine structural parameters of the model; weighting each channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism; training a model; iteratively updating until the output of the model converges to obtain an atmospheric parameter inversion model; and obtaining a thermal infrared atmospheric parameter inversion result. The invention effectively solves the problem that the deep neural network has weaker data mining capability, provides a two-way long-short-term memory neural network structure, weights different channel information by utilizing the channel attention module, acquires the correlation characteristics of different channel radiance data, and obtains a thermal infrared atmospheric parameter inversion result.

Description

Thermal infrared atmospheric parameter inversion method and device and electronic equipment
Technical Field
The invention belongs to the technical field of remote sensing image processing, and particularly relates to a long-term and short-term memory neural network thermal infrared atmospheric parameter inversion method and device based on an attention mechanism, and electronic equipment.
Background
Atmospheric parameter inversion has received a great deal of attention in recent years and has found wide application in a number of discipline fields. Inversion of atmospheric parameters has also attracted the interest of many scholars, for example, modeling the correlation of the atmospheric transmittance of different wavelengths, and inverting and calculating the atmospheric transmittance of radiation with adjacent wavelengths; and obtaining the transmittance by modeling the statistical relationship between the water vapor and the atmospheric transmittance in different wave bands. In practical application, the observed bright temperature is obtained by practical observation of a microwave radiometer, and the space resolution is low, so that the atmospheric parameter inversion accuracy is low, and the deep neural network has the problem of weak data mining capability.
Disclosure of Invention
In order to solve the technical problems, the invention provides a thermal infrared atmospheric parameter inversion method, a thermal infrared atmospheric parameter inversion device and electronic equipment.
In a first aspect, the invention provides a thermal infrared atmospheric parameter inversion method comprising:
extracting surface emissivity information and atmospheric information of a ground object spectrum library and an atmospheric profile library;
according to the earth surface emissivity information and the atmosphere information, determining atmosphere parameters, and establishing a simulation data set;
constructing a two-way long-short-term memory neural network model;
training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-short-term memory neural network model;
weighting each channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
performing data training on the target two-way long-short-term memory neural network model through the atmospheric parameters in the simulation data set;
the trained target two-way long-short-term memory neural network model parameters are continuously iteratively updated by adopting a back propagation algorithm until the output of the target two-way long-term memory neural network model converges, and an atmospheric parameter inversion model is obtained;
and acquiring thermal infrared hyperspectral radiance data and inputting the data as the atmospheric parameter inversion model to obtain a thermal infrared atmospheric parameter inversion result.
The invention provides a thermal infrared atmospheric parameter inversion device, which comprises an extraction unit, a simulation data set establishment unit, a first model construction unit, a first model training unit, a second model construction unit, a second model training unit, an iteration updating unit, an input unit and an output unit;
the extraction unit is used for extracting surface emissivity information and atmosphere information of the ground object spectrum library and the atmosphere profile library;
the simulation data set establishing unit is used for determining atmospheric parameters according to the earth surface emissivity information and the atmospheric information and establishing a simulation data set;
the first model construction unit is used for constructing a two-way long-short-term memory neural network model;
the first model training unit is used for training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-short-term memory neural network model;
the second model building unit is used for weighting the channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
the second model training unit is used for carrying out data training on the target two-way long-short-term memory neural network model through the atmospheric parameters in the simulation data set;
the iteration updating unit is used for adopting a back propagation algorithm to continuously and iteratively update the trained parameters of the target two-way long-short-term memory neural network model until the output of the target two-way long-term memory neural network model converges, so as to obtain an atmospheric parameter inversion model;
the input unit is used for acquiring thermal infrared hyperspectral radiance data and taking the data as the input of the atmospheric parameter inversion model;
and the output unit is used for outputting a thermal infrared atmospheric parameter inversion result.
In a third aspect, the present invention discloses an electronic device, comprising:
a processor and a memory;
the memory is used for storing computer operation instructions;
and the processor is used for executing the thermal infrared atmospheric parameter inversion method by calling the computer operation instruction.
The beneficial effects of the invention are as follows: the method not only effectively solves the problem that the deep neural network has weaker data mining capacity, but also strengthens the importance degree of the network on the correlation information between hyperspectral adjacent channel data in the inversion process. The invention provides a two-way long-short-term memory neural network structure which is used for extracting correlation characteristics of different channels; then, weighting different channel information by using a channel attention module so as to improve the proportion of the channel with higher weight in the characteristics; finally, mapping the characteristics to a sample marking space by adopting a full-connection structure, and converting the characteristics into the estimation results of the atmospheric uplink radiation, the downlink radiation and the transmissivity of each channel; the invention utilizes the radiance information of the remote sensing image, utilizes a circulating neural network module with a two-way long-short-term memory neural network to add a attention mechanism, obtains the correlation between the radiance and the atmospheric parameter, and obtains the uplink radiation, the downlink radiation and the transmittance of the atmosphere by inversion.
On the basis of the technical scheme, the invention can be improved as follows.
Further, the two-way long-short-term memory neural network model comprises two LSTM neural networks with mutually independent parameters, the hyperspectral data in the simulation data set are respectively input into the LSTM neural networks in a positive sequence and a reverse sequence to perform feature extraction, two feature extraction vectors are obtained, and the feature extraction vectors are spliced to form an output vector.
Further, weighting each channel information of the output feature vector of the two-way long-short term memory neural network model by using an attention mechanism, including:
through convolution operation, the input data with the given characteristic channel number being the first channel number is transformed to obtain a characteristic vector with the characteristic channel number being the second channel number;
compressing the feature vector in a space dimension by using a global method, and compressing the two-dimensional feature vector of each channel into a real number to serve as a feature value of each channel;
forming a Bottleneck structure through a plurality of full-connection layers to generate weight values for the characteristic values of all channels, and normalizing the weight values;
and weighting the normalized weight to the characteristic value of each channel, and carrying out weighted summation on the weight value and the characteristic vector.
Further, the full connection layer adopts three layers of full connection modules for mapping; the fully connected modules of adjacent layers are activated by using an activation function.
Further, the trained parameters of the target two-way long-short-term memory neural network model are continuously and iteratively updated by adopting a back propagation algorithm until the output of the target two-way long-term memory neural network model converges, and an atmospheric parameter inversion model is obtained, which comprises the following steps:
acquiring hyperspectral atmospheric parameter inversion data from the simulation data set as training data;
reading a plurality of atmosphere parameters corresponding to the radiance under different atmosphere parameters from the tag data, and matching the atmosphere parameters with the radiance information under the same atmosphere parameter state;
updating parameters of the target two-way long-short-term memory neural network model by adopting a back propagation algorithm, and inputting the radiance under different states to enable the root mean square error value of the target two-way long-short-term memory neural network model to be gradually attenuated to a set value;
and using the mean square error as a loss function, and selecting a network module with the minimum final loss function as the atmospheric parameter inversion model.
Further, by setting a channel attention module and setting the number of downsampling channels at equal intervals in the channel attention module, the number of downsampling channels of the channel attention module is determined according to the convergence speed of the target two-way long-short-term memory neural network model.
Drawings
FIG. 1 is a flow chart of a thermal infrared atmospheric parameter inversion method provided in embodiment 1 of the present invention;
FIG. 2 is a schematic diagram of a thermal infrared atmospheric parameter inversion apparatus according to embodiment 1 of the present invention;
fig. 3 is a schematic diagram of an electronic device according to embodiment 3 of the present invention.
Icon: 30-an electronic device; 310-a processor; 320-bus; 330-memory; 340-transceiver.
Detailed Description
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. The components of the embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Example 1
As an embodiment, as shown in fig. 1, to solve the above technical problem, the embodiment provides a thermal infrared atmospheric parameter inversion method, which includes:
extracting surface emissivity information and atmospheric information of a ground object spectrum library and an atmospheric profile library;
according to the earth surface emissivity information and the atmosphere information, determining atmosphere parameters, and establishing a simulation data set;
constructing a two-way long-short-term memory neural network model;
training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-short-term memory neural network model;
weighting each channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
performing data training on the target two-way long-short-term memory neural network model through simulating atmospheric parameters in the data set;
the back propagation algorithm is adopted to continuously and iteratively update the trained parameters of the target two-way long-short-term memory neural network model until the output of the target two-way long-term memory neural network model converges, and an atmospheric parameter inversion model is obtained;
and acquiring thermal infrared hyperspectral radiance data and inputting the data as an atmospheric parameter inversion model to obtain a thermal infrared atmospheric parameter inversion result.
In the practical application process, the earth surface emissivity information and the atmosphere information of an earth surface spectrum library and an atmosphere profile library (thermodynamic initial guess retrieval, TIGR) are extracted, and the atmosphere parameters are calculated by using MODTRA to establish a simulation data set. Hyperspectral data are obtained through the simulation data set, the radiance under different emissivity and the atmospheric parameters corresponding to the radiance are matched, and the atmospheric parameters are matched with the radiance information under the same state. 2311 different regional atmosphere profiles are contained in an original database, 945 atmosphere profiles are selected through cloud removal screening and water vapor and bottom temperature screening, different surface emissivity information is extracted through a ground feature emissivity library, the emissivity of 66 different ground features including ground feature types such as bare soil, vegetation and water is simulated through MODTRA, 74844000 simulated radiance data are formed through setting different ground features and different atmosphere profiles, adjacent 10 wave numbers are processed into a wave band through an arithmetic average value (The average value), and 24 wave bands are obtained.
Optionally, the two-way long-short-term memory neural network model comprises two LSTM neural networks with mutually independent parameters, hyperspectral data in the simulation data set are respectively input into the LSTM neural networks in a positive sequence and a reverse sequence to perform feature extraction, two feature extraction vectors are obtained, and the feature extraction vectors are spliced to form an output vector.
In the practical application process, a two-way long-short-term memory neural network model is constructed, the two-way long-term memory neural network is adopted to perform feature extraction on the simulation data set, the long-term neural network operation is used for mapping the input data set X with the size of H multiplied by W multiplied by S into the output data set Y with the size of H multiplied by W multiplied by T, and the formula is as follows:
Figure BDA0004153601480000071
Y h',w',t for outputting data;
Figure BDA0004153601480000072
is input data; c (C) m,n,s,t The weight of a certain current position; b m,n,s,t Is the bias of a certain current position; h is the batch of data; w is the number of channels; s is the data length; m is the current batch; n is the current channel number; s is the current data length; t is the time.
The specific calculation process is as follows:
f(t)=σ(Wfh t-1 +Ufx t +bf);
i(t)=σ(W i h t-1 +U i x t +b i );
a(t)=tanh(W a h t-1 +U a x t +b a );
o(t)=σ(W o h t-1 +U o x t +b o );
wherein x is t Input at time t, h t-1 The state value of the hidden layer at the time t-1 is represented; w (W) f 、W i 、W o And W is a Respectively representing forgetting gate, input gate, output gate and h in the characteristic extraction process t-1 Weight coefficient of (2); u (U) f 、U i 、U o And U a Respectively representing forgetting gate, input gate, output gate and x in the characteristic extraction process t Weight coefficient of (2); b f 、b i 、b o And b a Respectively representing the bias values in the forgetting gate, the input gate, the output gate and the feature extraction process, wherein tan h (·) represents a tangent hyperbolic function, sigma (·) represents an activation function Sigmoid, and the formula is as follows:
Figure BDA0004153601480000073
Figure BDA0004153601480000074
the result of the forget gate and input gate calculation acts on c (t-1), forming the variable state c (t) at time t, expressed as:
c(t)=c(t-1)⊙f(t)+i(t)⊙a(t);
wherein, as follows, hadamard product, f (t) is the information left by the forgetting gate at the moment t; i (t) is information of input gate update at the moment t; a (t) is a value for transition to the variable state at time t. Finally, the state h (t) of the hidden layer at the time t is obtained from the output gate o (t) and the variable state c (t) at the current time, and is expressed as:
h(t)=o(t)⊙tanh(c(t));
the bidirectional long-short term neural network module consists of 2 LSTM with mutually independent parameters, the data set is respectively input into the 2 LSTM neural networks in positive sequence and reverse sequence for feature extraction, and a new feature vector formed after 2 output vectors (namely feature extraction vectors) are spliced is obtained and is used as the final feature expression of the data. Under this concept, the bidirectional LSTM module allows feature data obtained at time t to have correlation information between the past and future at the same time.
Optionally, weighting each channel information of the output feature vector of the two-way long-short term memory neural network model by using an attention mechanism includes:
through convolution operation, the input data with the given characteristic channel number being the first channel number is transformed to obtain a characteristic vector with the characteristic channel number being the second channel number;
compressing the feature vector in the space dimension by using a global method, and compressing the two-dimensional feature vector of each channel into a real number to serve as the feature value of each channel;
a plurality of full-connection layers form a Bottleneck structure to generate weight values for the characteristic values of all channels, and the weight values are normalized;
and weighting the normalized weights to the characteristic values of the channels, and carrying out weighted summation on the weight values and the characteristic vectors.
In the practical application process, a characteristic channel number T is obtained by convolution operation for the input conversion of a given characteristic channel number T 1 Is characterized by (2); compressing the features in the space dimension by using a global pooling method, compressing the two-dimensional features of each channel into a real number, and obtaining an image area corresponding to the feature value; wherein the C-th element Z of the feature Z is calculated C The values are formulated as:
Figure BDA0004153601480000091
wherein H is the height of the image; w is the width of the image; H×W is a channelIs a two-dimensional feature of (2); i is a counter that accumulates in height; j is a counter that accumulates over the width; u (u) c The pixel value at the C channel with the height of i and the width of j;
the importance of each channel was evaluated by two fully-connected layer composition Bottleneck structures, by Bottleneck Z C Generating a weight value for each characteristic channel of the model, and normalizing the weight; finally, the normalized weights are weighted to the features of each channel, and the weights and feature vectors are weighted and summed.
Optionally, the full connection layer adopts three layers of full connection modules for mapping; the fully connected modules of adjacent layers are activated by using an activation function.
In the practical application process, the characteristics of the weighted channels are output to the full-connection layer, the extracted characteristics of the channels are mapped to the sample marking space, and through testing, when the network uses a three-layer full-connection layer structure, the regressive atmosphere parameters are stable, and the accuracy is higher.
Optionally, the method for obtaining the atmospheric parameter inversion model by adopting a back propagation algorithm to continuously and iteratively update the trained target two-way long-short-term memory neural network model parameters until the output of the target two-way long-term memory neural network model converges comprises the following steps:
acquiring hyperspectral atmospheric parameter inversion data from the simulation data set as training data;
reading a plurality of atmosphere parameters corresponding to the radiance under different atmosphere parameters from the tag data, and matching the atmosphere parameters with the radiance information under the same atmosphere parameter state;
updating parameters of the target two-way long-short-term memory neural network model by adopting a back propagation algorithm, and inputting the radiance under different states to enable the root mean square error value of the target two-way long-term memory neural network model to be gradually attenuated to a set value;
and using the mean square error as a loss function, and selecting a network module with the minimum final loss function as an atmospheric parameter inversion model.
In the practical application process, the invention uses the combined structure of the channel attention mechanism and the full-connection layer to form the channel attention module, and performs dimension lifting on the spectrum characteristics extracted by the long-term and short-term network. And finally determining the number of downsampling channels of the attention module by setting the number of downsampling channels of the attention modules at equal intervals, mapping the extracted features to a full connection layer (Linear), normalizing the features by a normalization function, and activating the features by a Sigmoid activation function. Feature mapping is performed by using three layers of fully connected modules (Linear 1, linear2 and Linear 3), feature activation is performed between Linear1 and Linear2 by using a Sigmoid activation function through testing, and feature activation is performed between Linear2 and Linear3 by using a Softmax activation function.
And mapping the characteristics extracted by the channel attention module to a sample space through a multi-layer full-connection module, wherein the output of the module is the required atmospheric parameter.
Optionally, the channel attention module is set, and the number of downsampling channels of the channel attention module is determined according to the convergence speed of the target two-way long-short-term memory neural network model in a mode that the channel attention module is set with the number of downsampling channels at equal intervals.
The method not only effectively solves the problem that the deep neural network has weaker data mining capacity, but also strengthens the importance degree of the network on the correlation information between hyperspectral adjacent channel data in the inversion process. The invention provides a two-way long-short-term memory neural network structure which is used for extracting correlation characteristics of different channels; then, weighting different channel information by using a channel attention module so as to improve the proportion of the channel with higher weight in the characteristics; and finally, mapping the characteristics to a sample marking space by adopting a full-connection structure, and converting the characteristics into the estimation results of the atmospheric uplink radiation, the downlink radiation and the transmissivity of each channel.
Example 2
Based on the same principle as the method shown in the embodiment 1 of the present invention, as shown in fig. 2, the embodiment of the present invention further provides a thermal infrared atmospheric parameter inversion device, which includes an extraction unit, a simulation data set building unit, a first model training unit, a second model building unit, a second model training unit, an iteration updating unit, an input unit and an output unit;
the extraction unit is used for extracting the earth surface emissivity information and the atmosphere information of the earth surface spectrum library and the atmosphere profile library;
the simulation data set establishing unit is used for determining atmospheric parameters according to the earth surface emissivity information and the atmospheric information and establishing a simulation data set;
the first model building unit is used for building a two-way long-short-term memory neural network model;
the first model training unit is used for training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-term memory neural network model;
the second model construction unit is used for weighting the information of each channel of the output characteristic vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
the second model training unit is used for carrying out data training on the target two-way long-short-term memory neural network model through simulating the atmospheric parameters in the data set;
the iteration updating unit is used for adopting a back propagation algorithm to continuously and iteratively update the trained target two-way long-short-term memory neural network model parameters until the output of the target two-way long-term memory neural network model converges, so as to obtain an atmospheric parameter inversion model;
the input unit is used for acquiring the thermal infrared hyperspectral radiance data and inputting an atmospheric parameter inversion model;
and the output unit is used for outputting a thermal infrared atmospheric parameter inversion result.
Optionally, the two-way long-short-term memory neural network model comprises two LSTM neural networks with mutually independent parameters, hyperspectral data in the simulation data set are respectively input into the LSTM neural networks in a positive sequence and a reverse sequence to perform feature extraction, two feature extraction vectors are obtained, and the feature extraction vectors are spliced to form an output vector.
Optionally, weighting each channel information of the output feature vector of the two-way long-short term memory neural network model by using an attention mechanism includes:
through convolution operation, the input data with the given characteristic channel number being the first channel number is transformed to obtain a characteristic vector with the characteristic channel number being the second channel number;
compressing the feature vector in the space dimension by using a global method, and compressing the two-dimensional feature vector of each channel into a real number to serve as the feature value of each channel;
a plurality of full-connection layers form a Bottleneck structure to generate weight values for the characteristic values of all channels, and the weight values are normalized;
and weighting the normalized weights to the characteristic values of the channels, and carrying out weighted summation on the weight values and the characteristic vectors.
Optionally, the full connection layer adopts three layers of full connection modules for mapping; the fully connected modules of adjacent layers are activated by using an activation function.
Optionally, the method for obtaining the atmospheric parameter inversion model by adopting a back propagation algorithm to continuously and iteratively update the trained target two-way long-short-term memory neural network model parameters until the output of the target two-way long-term memory neural network model converges comprises the following steps:
acquiring hyperspectral atmospheric parameter inversion data from the simulation data set as training data;
reading a plurality of atmosphere parameters corresponding to the radiance under different atmosphere parameters from the tag data, and matching the atmosphere parameters with the radiance information under the same atmosphere parameter state;
updating parameters of the target two-way long-short-term memory neural network model by adopting a back propagation algorithm, and inputting the radiance under different states to enable the root mean square error value of the target two-way long-term memory neural network model to be gradually attenuated to a set value;
and using the mean square error as a loss function, and selecting a network module with the minimum final loss function as an atmospheric parameter inversion model.
Optionally, the channel attention module is set, and the number of downsampling channels of the channel attention module is determined according to the convergence speed of the target two-way long-short-term memory neural network model in a mode that the channel attention module is set with the number of downsampling channels at equal intervals.
Example 3
Based on the same principle as the method shown in the embodiment of the present invention, there is also provided an electronic device in the embodiment of the present invention, as shown in fig. 3, which may include, but is not limited to: a processor and a memory; a memory for storing a computer program; a processor for executing the method according to any of the embodiments of the invention by invoking a computer program.
In an alternative embodiment, an electronic device is provided, the electronic device 30 shown in fig. 3 comprising: a processor 310 and a memory 350. Processor 310 is coupled to memory 350, such as via bus 320.
Optionally, the electronic device 30 may further comprise a transceiver 340, and the transceiver 340 may be used for data interaction between the electronic device and other electronic devices, such as transmission of data and/or reception of data, etc. It should be noted that, in practical applications, the transceiver 340 is not limited to one, and the structure of the electronic device 30 is not limited to the embodiment of the present invention.
The processor 310 may be a CPU central processing unit, a general purpose processor, a DSP data signal processor, an ASIC specific integrated circuit, an FPGA field programmable gate array or other programmable logic device, a hardware component, or any combination thereof. Processor 310 may also be a combination that performs computing functions, e.g., including one or more microprocessor combinations, a combination of a DSP and a microprocessor, etc.
Bus 320 may include a path that communicates information between the components. Bus 320 may be a PCI peripheral component interconnect standard bus or an EISA extension industry standard architecture bus, among others. The bus 320 may be classified into a control bus, a data bus, an address bus, and the like. For ease of illustration, only one thick line is shown in fig. 3, but not only one bus or one type of bus.
Memory 350 may be, but is not limited to, ROM read only memory or other type of static storage device that can store static information and instructions, RAM random access memory or other type of dynamic storage device that can store information and instructions, EEPROM electrically erasable programmable read only memory, CD-ROM read only or other optical disk storage, optical disk storage (including optical disks, laser disks, compact disks, digital versatile disks, etc.), magnetic disk storage media, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer.
The memory 350 is used to store application program codes (computer programs) for executing the inventive arrangements and is controlled to be executed by the processor 310. The processor 310 is configured to execute the application code stored in the memory 350 to implement what is shown in the foregoing method embodiments.
The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (8)

1. The thermal infrared atmospheric parameter inversion method is characterized by comprising the following steps of:
extracting surface emissivity information and atmospheric information of a ground object spectrum library and an atmospheric profile library;
according to the earth surface emissivity information and the atmosphere information, determining atmosphere parameters, and establishing a simulation data set;
constructing a two-way long-short-term memory neural network model;
training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-short-term memory neural network model;
weighting each channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
performing data training on the target two-way long-short-term memory neural network model through the atmospheric parameters in the simulation data set;
the trained target two-way long-short-term memory neural network model parameters are continuously iteratively updated by adopting a back propagation algorithm until the output of the target two-way long-term memory neural network model converges, and an atmospheric parameter inversion model is obtained;
and acquiring thermal infrared hyperspectral radiance data and inputting the data as the atmospheric parameter inversion model to obtain a thermal infrared atmospheric parameter inversion result.
2. The thermal infrared atmospheric parameter inversion method according to claim 1, wherein the two-way long-short-term memory neural network model comprises two LSTM neural networks with mutually independent parameters, the hyperspectral data in the simulation data set are respectively input into the LSTM neural networks in a positive sequence and a reverse sequence to perform feature extraction, two feature extraction vectors are obtained, and the feature extraction vectors are spliced to form an output vector.
3. The thermal infrared atmospheric parameter inversion method according to claim 1, wherein weighting each channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism comprises:
through convolution operation, the input data with the given characteristic channel number being the first channel number is transformed to obtain a characteristic vector with the characteristic channel number being the second channel number;
compressing the feature vector in a space dimension by using a global method, and compressing the two-dimensional feature vector of each channel into a real number to serve as a feature value of each channel;
forming a Bottleneck structure through a plurality of full-connection layers to generate weight values for the characteristic values of all channels, and normalizing the weight values;
and weighting the normalized weight to the characteristic value of each channel, and carrying out weighted summation on the weight value and the characteristic vector.
4. The thermal infrared atmospheric parameter inversion method according to claim 3 wherein said fully-connected layer is mapped using a three-layer fully-connected module; the fully connected modules of adjacent layers are activated by using an activation function.
5. The thermal infrared atmospheric parameter inversion method according to claim 1, wherein the step of obtaining the atmospheric parameter inversion model by using a back propagation algorithm to iteratively update the trained parameters of the target two-way long-short-term memory neural network model until the output of the target two-way long-term memory neural network model converges comprises the steps of:
acquiring hyperspectral atmospheric parameter inversion data from the simulation data set as training data;
reading a plurality of atmosphere parameters corresponding to the radiance under different atmosphere parameters from the tag data, and matching the atmosphere parameters with the radiance information under the same atmosphere parameter state;
updating parameters of the target two-way long-short-term memory neural network model by adopting a back propagation algorithm, and inputting the radiance under different states to enable the root mean square error value of the target two-way long-short-term memory neural network model to be gradually attenuated to a set value;
and using the mean square error as a loss function, and selecting a network module with the minimum final loss function as the atmospheric parameter inversion model.
6. The thermal infrared atmospheric parameter inversion method according to claim 1, wherein the number of downsampling channels of the channel attention module is determined according to the convergence speed of the target two-way long-short-term memory neural network model by setting the channel attention module and setting the number of downsampling channels at equal intervals in the channel attention module.
7. The thermal infrared atmospheric parameter inversion device is characterized by comprising an extraction unit, a simulation data set establishment unit, a first model training unit, a second model establishment unit, a second model training unit, an iteration updating unit, an input unit and an output unit;
the extraction unit is used for extracting surface emissivity information and atmosphere information of the ground object spectrum library and the atmosphere profile library;
the simulation data set establishing unit is used for determining atmospheric parameters according to the earth surface emissivity information and the atmospheric information and establishing a simulation data set;
the first model construction unit is used for constructing a two-way long-short-term memory neural network model;
the first model training unit is used for training the two-way long-short-term memory neural network model by utilizing hyperspectral data in the simulation data set, and determining structural parameters of the two-way long-short-term memory neural network model;
the second model building unit is used for weighting the channel information of the output feature vector of the two-way long-short-term memory neural network model by using an attention mechanism to obtain a target two-way long-short-term memory neural network model;
the second model training unit is used for carrying out data training on the target two-way long-short-term memory neural network model through the atmospheric parameters in the simulation data set;
the iteration updating unit is used for adopting a back propagation algorithm to continuously and iteratively update the trained parameters of the target two-way long-short-term memory neural network model until the output of the target two-way long-term memory neural network model converges, so as to obtain an atmospheric parameter inversion model;
the input unit is used for acquiring thermal infrared hyperspectral radiance data and taking the data as the input of the atmospheric parameter inversion model;
and the output unit is used for outputting a thermal infrared atmospheric parameter inversion result.
8. An electronic device, comprising:
a processor and a memory;
the memory is used for storing computer operation instructions;
the processor for performing the thermal infrared atmospheric parameter inversion method of any one of claims 1 to 6 by invoking the computer operating instructions.
CN202310327016.2A 2023-03-29 2023-03-29 Thermal infrared atmospheric parameter inversion method and device and electronic equipment Active CN116384237B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310327016.2A CN116384237B (en) 2023-03-29 2023-03-29 Thermal infrared atmospheric parameter inversion method and device and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310327016.2A CN116384237B (en) 2023-03-29 2023-03-29 Thermal infrared atmospheric parameter inversion method and device and electronic equipment

Publications (2)

Publication Number Publication Date
CN116384237A true CN116384237A (en) 2023-07-04
CN116384237B CN116384237B (en) 2024-08-23

Family

ID=86965077

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310327016.2A Active CN116384237B (en) 2023-03-29 2023-03-29 Thermal infrared atmospheric parameter inversion method and device and electronic equipment

Country Status (1)

Country Link
CN (1) CN116384237B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200202211A1 (en) * 2018-12-25 2020-06-25 Abbyy Production Llc Neural network training utilizing loss functions reflecting neighbor token dependencies
CN111735772A (en) * 2020-05-13 2020-10-02 中国科学院空天信息创新研究院 Improved high-spectrum data earth surface reflectivity inversion method of cascade neural network
CN112733394A (en) * 2020-12-21 2021-04-30 国家卫星气象中心(国家空间天气监测预警中心) Atmospheric parameter inversion method and device
CN112733725A (en) * 2021-01-12 2021-04-30 西安电子科技大学 Hyperspectral image change detection method based on multistage cyclic convolution self-coding network
CN113379146A (en) * 2021-06-24 2021-09-10 合肥工业大学智能制造技术研究院 Pollutant concentration inversion method based on multi-feature selection algorithm
CN115422703A (en) * 2022-07-19 2022-12-02 南京航空航天大学 Surface thermal infrared emissivity inversion method based on MODIS data and Transformer network

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200202211A1 (en) * 2018-12-25 2020-06-25 Abbyy Production Llc Neural network training utilizing loss functions reflecting neighbor token dependencies
CN111735772A (en) * 2020-05-13 2020-10-02 中国科学院空天信息创新研究院 Improved high-spectrum data earth surface reflectivity inversion method of cascade neural network
CN112733394A (en) * 2020-12-21 2021-04-30 国家卫星气象中心(国家空间天气监测预警中心) Atmospheric parameter inversion method and device
CN112733725A (en) * 2021-01-12 2021-04-30 西安电子科技大学 Hyperspectral image change detection method based on multistage cyclic convolution self-coding network
CN113379146A (en) * 2021-06-24 2021-09-10 合肥工业大学智能制造技术研究院 Pollutant concentration inversion method based on multi-feature selection algorithm
CN115422703A (en) * 2022-07-19 2022-12-02 南京航空航天大学 Surface thermal infrared emissivity inversion method based on MODIS data and Transformer network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
毛克彪;唐华俊;李丽英;许丽娜;: "一个从MODIS数据同时反演地表温度和发射率的神经网络算法", 遥感信息, no. 04, pages 9 - 16 *
虞浩跃;沈韬;朱艳;刘英莉;余正涛;: "基于双向长短期记忆网络的太赫兹光谱识别", 光谱学与光谱分析, no. 12, pages 91 - 96 *

Also Published As

Publication number Publication date
CN116384237B (en) 2024-08-23

Similar Documents

Publication Publication Date Title
CN109508655B (en) SAR target recognition method based on incomplete training set of twin network
Saralioglu et al. Semantic segmentation of land cover from high resolution multispectral satellite images by spectral-spatial convolutional neural network
US11720727B2 (en) Method and system for increasing the resolution of physical gridded data
US20230153615A1 (en) Neural network distillation method and apparatus
CN111814607B (en) Deep learning model suitable for small sample hyperspectral image classification
CN112183718A (en) Deep learning training method and device for computing equipment
CN107563497A (en) Computing device and method
CN111797970B (en) Method and device for training neural network
CN110472280B (en) Power amplifier behavior modeling method based on generation of antagonistic neural network
Maier et al. Exploding the myths: An introduction to artificial neural networks for prediction and forecasting
Gad et al. A robust deep learning model for missing value imputation in big NCDC dataset
CN114912673A (en) Water level prediction method based on whale optimization algorithm and long-term and short-term memory network
Mai et al. Optimization of interval type-2 fuzzy system using the PSO technique for predictive problems
Anochi et al. Two geoscience applications by optimal neural network architecture
Mukherjee et al. Identification of the types of disease for tomato plants using a modified gray wolf optimization optimized MobileNetV2 convolutional neural network architecture driven computer vision framework
Rivas-Perea et al. Statistical and neural pattern recognition methods for dust aerosol detection
JP2019096237A (en) System, method, program and storage medium storing the program for generating emulation model and generating and displaying information on uncertainty of target variable of the generated emulation model
CN114444657A (en) Image processing method, system, equipment and readable storage medium
CN116384237B (en) Thermal infrared atmospheric parameter inversion method and device and electronic equipment
CN117830701A (en) Attention mechanism-based multiscale feature fusion star map identification method and device
Ghosh et al. Deep learning enabled surrogate model of complex food processes for rapid prediction
Kuter et al. Modern applied mathematics for alternative modeling of the atmospheric effects on satellite images
CN116958709A (en) Hyperspectral classification method of lightweight tensor convolution long-short-term memory network
CN116822716A (en) Typhoon prediction method, system, equipment and medium based on space-time attention
Tao et al. Error variance estimation for individual geophysical parameter retrievals

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant