CN116612396A

CN116612396A - Ocean surface temperature sequence prediction method based on space-time double-flow non-stationary sensing

Info

Publication number: CN116612396A
Application number: CN202310607658.8A
Authority: CN
Inventors: 聂婕; 杜家豪; 温琦; 叶敏; 左子杰; 宋宁
Original assignee: Ocean University of China
Current assignee: Ocean University of China
Priority date: 2023-05-26
Filing date: 2023-05-26
Publication date: 2023-08-18

Abstract

The invention belongs to the technical field of image processing, and discloses a marine surface temperature sequence prediction method based on space-time double flow non-stationary perception.

Description

Ocean surface temperature sequence prediction method based on space-time double-flow non-stationary sensing

Technical Field

The invention belongs to the technical field of image processing, and particularly relates to a marine surface temperature sequence prediction method based on space-time double-flow non-stationary perception.

Background

Ocean surface temperature (SST) prediction methods can be divided into two major categories, numerical methods, which are mathematical models built by a series of complex physical, chemical and biological parameters to predict sea temperature changes, and data-driven methods, which generally require significant computational costs due to excessive parameters. The data driving method aims at solving the SST prediction problem by taking data as a center, and builds an SST prediction model by learning the change rule of SST from historical SST data. The data driven method does not require much knowledge of the ocean and atmosphere domain and can predict sea surface temperature with higher resolution on a smaller scale than the numerical method.

The main data driven methods include traditional statistical methods and machine learning methods. With the development of deep learning, the existing SST prediction method mostly adopts a space-time prediction model based on a cyclic neural network, captures non-stationary characteristics through simple state transition, establishes a simple spatial dependency while establishing a time dependency, however, the existing method has the following problems: first, focusing on building correlations in time series, full mining of ocean surface temperature space information is not achieved. The existing ocean surface temperature prediction model structure mostly adopts a circulating structure, and a method for performing simple feature extraction on space information and then performing time feature aggregation. However, the spatio-temporal data has spatial autocorrelation, and the potential spatial correlation information cannot be captured sufficiently by simple feature extraction, so that the spatial information needs to be further mined deeply according to features of different scales. Second, marine physics space-time shows complex non-stationarity in both time and space, such as el nino and southern billow phenomena, which cause abnormal increases in ocean surface temperature. The existing method cannot fully model the non-stationarity in the ocean surface temperature in time and space at the same time, so that uncertainty in space-time dynamics cannot be inferred.

Disclosure of Invention

Aiming at the defects existing in the prior art, the invention provides a marine surface temperature sequence prediction method based on space-time double-flow non-stationary perception, a prediction network based on double flow is constructed, and firstly, the time non-stationary characteristics in the space-time sequence are captured through a diagonal cyclic neural network stacked by MIM modules; then constructing a parallel time non-stationary feature modeling sub-network and a space non-stationary feature modeling sub-network, and simultaneously mining non-stationary features in time and space; and finally, the time comprehensive characteristics and the space comprehensive characteristics obtained from the two subnetworks are adaptively learned by a space-time fusion module based on stacked cross attention, so that the space-time correlation of the sea surface temperature is effectively excavated by the weight of the time dependence and the space dependence on the sea temperature field, a final SST prediction result is obtained, and the accuracy of SST prediction is improved.

In order to solve the technical problems, the invention adopts the following technical scheme:

a marine surface temperature sequence prediction method based on space-time double-flow non-stationary perception comprises the following steps: s1, acquiring a satellite remote sensing image of ocean surface temperature;

s2, constructing and training a marine surface temperature sequence prediction network based on space-time double-flow non-stationary perception, wherein the marine surface temperature sequence prediction network comprises a space non-stationary characteristic modeling sub-network, a time non-stationary characteristic modeling sub-network and a stacking cross attention space-time fusion module;

the time non-stationary characteristic modeling network consists of ST-LSTM modules and MIM modules, and comprises multiple layers of network structures with the same structure, wherein each layer of network structure comprises a plurality of layers of network structuresThe network structure comprises an ST-LSTM module and a plurality of MIM modules, wherein the modules of the upper layer and the modules of the lower layer are stacked and interconnected through a saw-tooth path, the ST-LSTM module or the MIM module at the corresponding position of each layer is connected through a vertical path, and an input ocean surface temperature satellite remote sensing image X is input ₁ ,X ₂ ,...,X _t The method comprises the steps of obtaining the SST time sequence characteristics of the ocean surface temperature through ST-LSTM of each layer, transmitting the output hidden states of two vertically adjacent modules to the next layer of modules through diagonal paths, capturing non-stationary information in the SST time sequence characteristics of the ocean surface temperature in MIM modules by a differential-based method, capturing the non-stationary information through stacked diagonal structures, modeling the time sequence non-stationary characteristics of the SST of the ocean surface temperature, and finally outputting y of the last MIM module of each layer ₁ ,y ₂ ,...,y _t Polymerizing to obtain SST time comprehensive characteristics beta;

the space non-stationary characteristic modeling sub-network is composed of a space non-stationary characteristic extraction module and a space characteristic fusion module, and specifically comprises the following components:

in a space non-stationary feature extraction module, downsampling a marine surface temperature satellite remote sensing image with the size of M to form a feature map with three scales of M, (M) M/4, (M) M/8, performing differential operation on the satellite remote sensing images with the sizes of (M) M/4 and (M) M/8 through convolution operation, and capturing a small-scale non-stationary feature n; similarly, performing a differential operation on the satellite remote sensing images of M and (M) M/4 through convolution operation, and capturing a large-scale non-stationary characteristic N;

in a space feature fusion module, aggregating an original feature map S, a small-scale non-stable feature N and a large-scale non-stable feature N of M, respectively performing sigmoid function processing and tanh function processing on the obtained space aggregation features, reinforcing the non-obvious non-stable features in space by the sigmoid function to prevent loss, reserving complete space comprehensive features, and multiplying corresponding elements of two function processing results to obtain SST space comprehensive features alpha;

the SST time comprehensive feature beta obtained from the time non-stationary feature modeling sub-network is fused with the SST space comprehensive feature alpha in the space non-stationary feature modeling sub-network through a stacking-based cross attention time-space fusion module, and the SST space comprehensive feature alpha and the SST time comprehensive feature beta self-adaptively learn the weight of the SST space comprehensive feature alpha and the SST time comprehensive feature beta to the ocean surface temperature field in the stacking-based cross attention time-space fusion module, and fuse time and space information;

and S3, inputting the ocean surface temperature satellite remote sensing image to be processed into a trained ocean surface temperature sequence prediction network based on space-time double flow non-stationary sensing, and outputting an ocean surface temperature prediction result.

Further, the calculation flow of the SST spatial synthesis feature α is as follows:

o＝tanh(W _os *S+W _oN *N+W _on *n)

g＝σ(W _gs *S+W _gN *N+W _gn *n)

wherein W is _os 、W _oN 、W _on 、W _gs 、W _gN 、W _gn Representing a convolution parameter; * Representing a convolution operation; sigma is a sigmoid function, an activation function applied to the input weighted sum; tanh represents a tanh activation function; n represents a small-scale non-stationary feature; n represents a large scale non-stationary feature; s represents an original feature map;representing a Hadamard product; o represents the result of the tan h function after the original feature map S, the small-scale non-stationary feature N and the large-scale non-stationary feature N are aggregated; g represents the result of the original feature map S, the small-scale non-stationary feature N and the large-scale non-stationary feature N after aggregation by a sigmoid function.

Further, in the space-time fusion module based on stacked cross attention, specifically:

firstly, SST space integrated feature alpha in (M, M) dimension is flattened and mapped through an attention coding mechanism to be respectively used as query Q in (M multiplied by M, 1) dimension _α Sum value V _α Matrix, SST time synthesisThe feature β likewise flattens the mapping by the attention encoding mechanism as keys K in the (m×m, 1) dimension, respectively _β Sum value V _β Matrix:

Q _α ＝W _q *α,V _α ＝W _v *α

K _β ＝W _k *β,V _β ＝W _v *β

wherein W is _q ，W _k ，W _v Represents a convolution parameter, represents a convolution operation, Q _α Representing a spatial query matrix; v (V) _α Representing a matrix of spatial values, K _β Representing a time key matrix, V _β Representing a time value matrix;

for Q _α And K _β And performing matrix multiplication and modeling through each grid point of a softmax function to obtain a correlation score e of the SST time comprehensive feature beta and the SST space comprehensive feature alpha, wherein the dimension is (M multiplied by M ) and the formula is as follows:

where softmax represents the normalized exponential function,representing a matrix multiplication;

then, the space value matrix V _α Matrix multiplication with e is performed on a space value matrix V _α Modeling each grid point on the model and remodelling the result through a reshape function to obtain a spatial attention characteristic Z through the (M, M) dimension _α Similarly, the time value matrix V _β Matrix multiplication with e is performed on a time value matrix V _β Modeling each grid point on the model and remodelling the result through a reshape function to the (M, M) dimension to obtain a time attention feature Z _β ：

Where reshape denotes a function that converts tensor dimensions,represents matrix multiplication, e represents correlation scores on temporal and spatial features, V _α Representing a matrix of spatial values, V _β Representing a matrix of time values, Z _α Representing spatial attention features, Z _β Representing a temporal attention feature;

finally, a final ocean surface temperature satellite remote sensing image prediction result is obtained through the aggregation characteristics and the multi-layer perceptron, and is expressed as follows:

Z＝MLP(W _z [Z _α ,Z _β ])

wherein MLP represents a multi-layer perceptron, W _Z Representing the convolution parameters, Z _α Representing spatial attention features, Z _β And Z represents the prediction result of the marine surface temperature satellite remote sensing image.

Compared with the prior art, the invention has the advantages that:

(1) A spatially non-stationary feature modeling sub-network is designed to model spatially non-stationary features in SST data. Firstly, down-sampling an input SST space-time sequence to construct space features of three scales of small, medium and large; then differential modeling is carried out on the multi-scale space features, so that full excavation on small-scale and large-scale non-stable features and large-scale stable feature changes is realized; and finally, designing a spatial feature fusion module based on a gating mechanism, and dynamically carrying out association modeling of non-stationary features and stationary feature changes.

(2) The invention can effectively extract the non-stationary characteristics in time and space simultaneously. The invention constructs a prediction network based on double flow, firstly, capturing time non-stationary characteristics in a time-space sequence through a diagonal cyclic neural network stacked by MIM modules; then constructing a parallel time non-stationary feature modeling sub-network and a space non-stationary feature modeling sub-network, and simultaneously mining non-stationary features in time and space; and finally, the time comprehensive features and the space comprehensive features obtained from the two subnetworks are adaptively learned by a space-time fusion module based on stacked cross attention, the weight of time dependence and space dependence on sea temperature fields is effectively excavated by the space-time fusion module based on stacked cross attention, so that a final SST prediction result is obtained, and the accuracy of SST prediction is improved.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required for the description of the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.

FIG. 1 is a diagram of a network architecture of the present invention;

FIG. 2 is a schematic diagram of a stacked cross-attention based spatiotemporal fusion module of the present invention.

Detailed Description

The invention will be further described with reference to the accompanying drawings and specific examples.

The embodiment provides a marine surface temperature sequence prediction method based on space-time double-flow non-stationary sensing, which comprises the following steps:

s1, acquiring a satellite remote sensing image of ocean surface temperature;

and S2, constructing and training a marine surface temperature sequence prediction network based on space-time double-flow non-stationary sensing, and inputting a marine surface temperature satellite remote sensing image to be processed into the trained marine surface temperature sequence prediction network based on space-time double-flow non-stationary sensing, and outputting a marine surface temperature prediction result.

The following describes the structure of the marine surface temperature sequence prediction network based on space-time double-flow non-stationary sensing constructed by the embodiment with reference to fig. 1, wherein the marine surface temperature sequence prediction network comprises a space non-stationary characteristic modeling sub-network, a time non-stationary characteristic modeling sub-network and a stacked cross-attention space-time fusion module, and the composition, the function and the data processing process of each module are respectively described.

The method comprises the steps that a time non-stationary characteristic modeling sub-network is built, a circulating neural network stacked by MIM modules is used for capturing the time non-stationary characteristic, the time non-stationary characteristic is formed by ST-LSTM modules and MIM modules, the time non-stationary characteristic modeling sub-network comprises a multi-layer network structure with the same structure, each layer of network structure comprises an ST-LSTM module and a plurality of MIM modules, the modules of the upper layer are stacked and interconnected with the modules of the lower layer through sawtooth paths, the ST-LSTM modules or the MIM modules of the corresponding positions of each layer are connected through vertical paths, and the MIM modules are connected and interconnected through diagonal paths and used for modeling differential information in space-time prediction; by stacking multiple MIM modules, non-stationarity features can potentially be captured from spatio-temporal dynamics, gradually smoothing the spatio-temporal process, making predicted future SST time integration features more accurate.

Input ocean surface temperature satellite remote sensing image X ₁ ,X ₂ ,...,X _t The SST time sequence characteristics of the ocean surface temperature are obtained through ST-LSTM of each layer respectively, but the SST has non-stationary information in natural time and space, so that the output hidden states of two vertically adjacent modules are transmitted to the next layer of modules through diagonal paths, the non-stationary information in the SST time sequence characteristics of the ocean surface temperature is captured by differential operation in the MIM modules, the non-stationary information is captured by stacked diagonal structures, the time sequence non-stationary characteristics of the SST of the ocean surface temperature are modeled, and finally the output y of the last MIM module of each layer is obtained ₁ ,y ₂ ,...,y _t And polymerizing to obtain SST time comprehensive characteristics beta.

The space non-stationary characteristic modeling sub-network consists of a space non-stationary characteristic extraction module and a space characteristic fusion module, and specifically comprises the following components:

firstly, in a space non-stationary feature extraction module, due to the characteristic that the physical process change is slow in the marine physical field, a marine surface temperature satellite remote sensing image with the size of M is firstly downsampled into feature images with three scales of M, M/4, M/8, and the larger the scale of the remote sensing image is, the more abundant and obvious the features are contained. Performing differential operation on satellite remote sensing images of (M×M)/4 and (M×M)/8 through convolution operation to capture small-scale non-stationary features n; similarly, a difference operation is carried out on the satellite remote sensing images of M, M and (M, M)/4 through convolution operation, and a large-scale non-stationary characteristic N is captured. Because any non-stationary process can be decomposed into a plurality of deterministic time-varying polynomials and a zero-mean random value, the order of the time-varying polynomials can be reduced by properly applying differential operation, and the time-varying polynomials gradually tend to be stationary, so that deterministic complex trend information becomes predictable, and the problem that the non-stationary features of the space cannot be captured is solved.

And then, in a spatial feature fusion module, aggregating the original feature map S, the small-scale non-stable feature N and the large-scale non-stable feature N of M, respectively performing sigmoid function processing and tanh function processing on the obtained spatial aggregation features, reinforcing the non-stable features which are not obvious in the space by the sigmoid function to prevent loss, reserving the complete spatial comprehensive features, and finally multiplying corresponding elements of the two function processing results to obtain the SST spatial comprehensive feature alpha. The calculation flow of the SST spatial synthesis feature α is as follows:

o＝tanh(W _os *S+W _oN *N+W _on *n)

g＝σ(W _gs *S+W _gN *N+W _gn *n)

wherein W is _os 、W _oN 、W _on 、W _gs 、W _gN 、W _gn Representing a convolution parameter; * Representing a convolution operation; sigma is a sigmoid function, an activation function applied to the input weighted sum; tanh represents a tanh activation function; n represents a small-scale non-stationary feature; n represents a large scale non-stationary feature; s represents an original feature map;representing a Hadamard product; o represents the result of the tan h function after the original feature map S, the small-scale non-stationary feature N and the large-scale non-stationary feature N are aggregatedThe method comprises the steps of carrying out a first treatment on the surface of the g represents the result of the original feature map S, the small-scale non-stationary feature N and the large-scale non-stationary feature N after aggregation by a sigmoid function.

And finally, fusing the SST time comprehensive characteristic beta obtained from the time non-stationary characteristic modeling sub-network with the SST space comprehensive characteristic alpha in the space non-stationary characteristic modeling sub-network through a stacking cross attention time-space fusion module, and adaptively learning the weight of the SST space comprehensive characteristic alpha and the SST time comprehensive characteristic beta on the ocean surface temperature field in the stacking cross attention time-space fusion module, fusing important information in time and space, and filtering unimportant information, so that the prediction result is more accurate and reliable.

In conjunction with the illustration of fig. 2, within the stacked cross-attention based spatiotemporal fusion module, in particular:

firstly, SST space integrated feature alpha in (M, M) dimension is flattened and mapped through an attention coding mechanism to be respectively used as query Q in (M multiplied by M, 1) dimension _α Sum value V _α Matrix, SST time complex feature β likewise flattens the mapping by means of an attention coding mechanism as keys K in the (M1) dimension, respectively _β Sum value V _β Matrix:

Q _α ＝W _q *α,V _α ＝W _v *α

K _β ＝W _k *β,V _β ＝W _v *β

Z＝MLP(W _z [Z _α ,Z _β ])

The network model models the uncertain trend in the SST dynamic space-time process with non-stationary information, so that the model is better fitted with the SST natural space-time process, and the prediction accuracy is improved.

Wherein the training of the network model is not a design gist of the present invention, and is not described in detail herein.

In summary, the invention (1) designs a spatially non-stationary feature modeling sub-network to model spatially non-stationary features in SST data. Firstly, down-sampling an input SST space-time sequence to construct space features of three scales of small, medium and large; then differential modeling is carried out on the multi-scale space features, so that full excavation on small-scale and large-scale non-stable features and large-scale stable feature changes is realized; and finally, designing a spatial feature fusion module based on a gating mechanism, and dynamically carrying out association modeling of non-stationary features and stationary feature changes. (2) constructing a predictive network based on double flow. First, capturing temporal non-stationary features in a spatio-temporal sequence by a diagonal recurrent neural network stacked by MIM modules; then constructing a parallel time non-stationary feature modeling sub-network and a space non-stationary feature modeling sub-network, and simultaneously mining non-stationary features in time and space; and finally, the time comprehensive features and the space comprehensive features obtained from the two sub-networks are adaptively learned by a space-time fusion module based on stacked cross attention, so that the weight of time dependence and space dependence on a sea temperature field is obtained, a final SST prediction result is obtained, and the accuracy of SST prediction is improved.

It should be understood that the above description is not intended to limit the invention to the particular embodiments disclosed, but to limit the invention to the particular embodiments disclosed, and that various changes, modifications, additions and substitutions can be made by those skilled in the art without departing from the spirit and scope of the invention.

Claims

1. The marine surface temperature sequence prediction method based on space-time double-flow non-stationary perception is characterized by comprising the following steps of:

s1, acquiring a satellite remote sensing image of ocean surface temperature;

the time non-stationary characteristic modeling network consists of ST-LSTM modules and MIM modules, and comprises a multi-layer network structure with the same structure, wherein each layer of network structure comprises one ST-LSTM module and a plurality of MIM modules, the modules of the upper layer and the modules of the lower layer are stacked and interconnected through a saw-tooth path, the ST-LSTM modules or the MIM modules of the corresponding positions of each layer are connected through a vertical path, and an input ocean surface temperature satellite remote sensing image X is input ₁ ,X ₂ ,...,X _t The method comprises the steps of obtaining the SST time sequence characteristics of the ocean surface temperature through ST-LSTM of each layer, transmitting the output hidden states of two vertically adjacent modules to the next layer of modules through diagonal paths, capturing non-stationary information in the SST time sequence characteristics of the ocean surface temperature in MIM modules by a differential-based method, capturing the non-stationary information through stacked diagonal structures, modeling the time sequence non-stationary characteristics of the SST of the ocean surface temperature, and finally outputting y of the last MIM module of each layer ₁ ,y ₂ ,...,y _t Polymerizing to obtain SST time comprehensive characteristics beta;

2. The marine surface temperature sequence prediction method based on space-time double flow non-stationary sensing as set forth in claim 1, wherein the calculation flow of the SST spatial integrated feature α is as follows:

o＝tanh(W _os *S+W _oN *N+W _on *n)

g＝σ(W _gs *S+W _gN *N+W _gn *n)

3. The marine surface temperature sequence prediction method based on space-time double flow non-stationary perception according to claim 2, wherein in the stacked cross-attention based space-time fusion module, specifically:

Q _α ＝W _q *α,V _α ＝W _v *α

K _β ＝W _k *β,V _β ＝W _v *β

Z＝MLP(W _z [Z _α ,Z _β ])