CN115359310B - SIC prediction method and system based on ConvLSTM and conditional random field - Google Patents

SIC prediction method and system based on ConvLSTM and conditional random field Download PDF

Info

Publication number
CN115359310B
CN115359310B CN202210798362.4A CN202210798362A CN115359310B CN 115359310 B CN115359310 B CN 115359310B CN 202210798362 A CN202210798362 A CN 202210798362A CN 115359310 B CN115359310 B CN 115359310B
Authority
CN
China
Prior art keywords
sic
prediction
convlstm
weight matrix
pixel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210798362.4A
Other languages
Chinese (zh)
Other versions
CN115359310A (en
Inventor
张辉
任开军
李小勇
任小丽
谭家明
蓝云杰
柴星宇
徐青
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN202210798362.4A priority Critical patent/CN115359310B/en
Publication of CN115359310A publication Critical patent/CN115359310A/en
Application granted granted Critical
Publication of CN115359310B publication Critical patent/CN115359310B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • G06V10/765Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects using rules for classification or partitioning the feature space
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Multimedia (AREA)
  • Medical Informatics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Algebra (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a SIC prediction method and a system based on ConvLSTM and a conditional random field, which comprise the steps of obtaining historical SIC data of a plurality of continuous moments, carrying out normalization processing on the historical SIC data, carrying out time step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set, and inputting the sequence data sample set into a preset prediction model to obtain SIC prediction results of the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combining the fully connected conditional random field, and the SIC prediction results of the next moment output by the prediction model are obtained by taking the prediction results provided by the ConvLSTM as the priori probability of the fully connected conditional random field, so that the prediction accuracy of the whole SIC and the prediction accuracy of a boundary area are improved.

Description

SIC prediction method and system based on ConvLSTM and conditional random field
Technical Field
The invention relates to the technical field of SIC prediction, in particular to a SIC prediction method and system based on ConvLSTM and a conditional random field.
Background
In recent years, with global warming, arctic sea ice coverage is gradually reduced, and arctic regions are also receiving increasing attention. The change in sea ice concentration directly reflects the change in sea ice coverage. Predicting sea ice concentration (Sea Ice Concentration, SIC) is important for understanding the effects of climate change, resource exploitation, and new arctic planning.
The traditional sea ice prediction method mainly comprises a statistical model or a numerical model. The statistical model is driven by data, and a fitting sea ice change rule is constructed according to historical observation data. The observed data mainly includes meteorological elements (e.g., temperature, sea level pressure), marine elements (e.g., sea surface temperature, salinity), and sea ice elements (e.g., concentration, range, ice type, and thickness). However, statistical methods cannot take into account the interactions between sea ice and the atmosphere and can only build a point-by-point model, ignoring interactions between neighboring points.
In recent years, machine learning methods have begun to be applied to sea ice prediction. For example, two deep learning methods, namely MLP (Multilayer Perceptron, MLP) and LSTM (Long Short-Term Memory Network, LSTM), can be used for metaphase prediction of the north SIC, and the effect is better than that of the traditional autoregressive model. However, LSTM only considers time-series information, and cannot process spatial information. Because LSTM needs to convert two-dimensional space data into one-dimensional data for calculation, original space structural features are difficult to keep in the process, and the prediction effect is poor when the sea ice coverage area is changed severely in melting and freezing seasons. In addition, the convolution operation can well consider the spatial correlation, and SIC of the next month of the month prediction is realized through CNN (Convolutional Neural Networks, CNN). However, CNN has obvious defects that SIC at time t is completely dependent on SIC at time t-1, and information in all past time cannot be extracted.
Disclosure of Invention
The present invention aims to at least solve the technical problems existing in the prior art. Therefore, the SIC prediction method and system based on ConvLSTM and conditional random fields, and the prediction accuracy of the whole SIC and the prediction accuracy of the boundary area are improved by introducing the conditional random fields into ConvLSTM.
In a first aspect of the present invention, there is provided a SIC prediction method based on ConvLSTM and conditional random fields, comprising the steps of:
acquiring historical SIC data at a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set;
and inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result of the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combined with a full-connection conditional random field.
According to the embodiment of the invention, at least the following technical effects are achieved:
according to the method, historical SIC data of a plurality of continuous moments are obtained, normalization processing is carried out on the historical SIC data, the processed historical SIC data are spliced in a time step mode through a time sliding window, a sequence data sample set is generated, the sequence data sample set is input into a preset prediction model, and SIC prediction results of the next moment output by the prediction model are obtained, wherein the prediction model is a ConvLSTM network model combining a full-connection conditional random field, and the prediction results provided by the ConvLSTM are used as priori probabilities of the full-connection conditional random field to obtain SIC prediction results of the next moment output by the prediction model, so that the prediction accuracy of the integral SIC and the prediction accuracy of a boundary area are improved.
According to some embodiments of the invention, the internal structural formula of the ConvLSTM network model includes:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
According to some embodiments of the present invention, the inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result at a next moment output by the prediction model includes:
inputting the sequence data sample set into a ConvLSTM network model in the preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is the SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model;
obtaining a pixel set X= { X according to the first SIC prediction result 1 ,x 2 …x n Tag set l= { L of tag categories 1 ,L 2 …L k Characteristic vector f= { F } 1 ,f 2 …f n Color feature vector i= { I } 1 ,I 2 …I n Position feature vector p= { P } 1 ,p 2 …p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying labels for the kth category in the label set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of the pixel point n, f n Is the characteristic vector of the pixel point n, and the f n The method comprises the steps of including a position feature vector of the pixel point n and a color feature vector of the pixel point n;
The fully connected conditional random field satisfies gibbs distribution, and the probability function is as follows:
wherein Z (I) is a normalization factor, E (X|I) is an energy function;
defining an energy function on a full graph basis as:
E(X∣I)=∑ i ψ u (x i )+∑ i, ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if said x i =x j Time mu (x) i ,x j ) Equal to 0,w m And (3) the weight corresponding to the Gaussian kernel function of the m-th category in the tag set is m epsilon [1, k.For the m-th class of Gaussian kernel function in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ The scale of the control position information;
iteratively updating Q (x) using an average field algorithm such that the K-L divergence between the probability function P (x) and the Q (x) reaches a predetermined value, then using Q i (x i ) Instead of P (x) i ) And obtaining SIC prediction results of the next moment output by the prediction model, wherein the calculation steps are as follows:
Q(x)=∏ i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, which is a normalization factor.
According to some embodiments of the invention, the ConvLSTM network model sets 4 ConvLSTM cells, and the parameter REturn_sequences of the 4 th ConvLSTM cell is set to false.
In a second aspect of the present invention, there is provided a SIC prediction system based on ConvLSTM and conditional random fields, the SIC prediction system based on ConvLSTM and conditional random fields comprising:
the data acquisition module is used for acquiring historical SIC data at a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time-step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set;
the data output module is used for inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result of the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combined with a full-connection conditional random field.
The system obtains historical SIC data at a plurality of continuous moments, performs normalization processing on the historical SIC data, performs time-step splicing on the processed historical SIC data through a time sliding window, generates a sequence data sample set, inputs the sequence data sample set into a preset prediction model to obtain SIC prediction results at the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combining a full-connection conditional random field, and obtains SIC prediction results at the next moment output by the prediction model by taking the prediction results provided by the ConvLSTM as the priori probability of the full-connection conditional random field, thereby improving the prediction accuracy of the integral SIC and the prediction accuracy of a boundary region.
According to some embodiments of the invention, the internal structural formula of the ConvLSTM network model includes:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
According to some embodiments of the invention, the data output module comprises:
the first prediction module is used for inputting the sequence data sample set into a ConvLSTM network model in the preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is a SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model;
the data generation module is used for generating a pixel set X= { X according to the first SIC prediction result 1 ,x 2 …x n Tag set l= { L of tag categories 1 ,L 2 …L k Characteristic vector f= { F } 1 ,f 2 …f n Color feature vector i= { I } 1 ,I 2 …I n Position feature vector p= { P } 1 ,p 2 …p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying labels for the kth category in the label set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of the pixel point n, f n Is the characteristic vector of the pixel point n, and the f n The method comprises the steps of including a position feature vector of the pixel point n and a color feature vector of the pixel point n;
The fully connected conditional random field satisfies gibbs distribution, and the probability function is as follows:
wherein Z (I) is a normalization factor, E (X|I) is an energy function;
defining an energy function on a full graph basis as:
E(X∣I)=∑ i ψ u (x i )+∑ i,j ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if said x i =x j Time mu (x) i ,x j ) Equal to 0,w m For the weight corresponding to the Gaussian kernel function of the mth category in the tag set, m is E [1, k]。For the m-th class of Gaussian kernel function in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ The scale of the control position information;
a result output module for iteratively updating Q (x) using an average field algorithm to achieve a predetermined value of K-L divergence between the probability function P (x) and the Q (x), and then using Q i (x i ) Instead of P (x) i ) And obtaining SIC prediction results of the next moment output by the prediction model, wherein the calculation steps are as follows:
Q(x)=∏ i Q i (x i )
Wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, which is a normalization factor.
According to some embodiments of the invention, the ConvLSTM network model sets 4 ConvLSTM cells, and the parameter REturn_sequences of the 4 th ConvLSTM cell is set to false.
In a third aspect of the invention, there is provided a ConvLSTM and conditional random field based SIC prediction electronics comprising at least one control processor and a memory for communicative connection with the at least one control processor; the memory stores instructions executable by the at least one control processor to enable the at least one control processor to perform the ConvLSTM and conditional random field based SIC prediction method described above.
In a fourth aspect of the present invention, there is provided a computer-readable storage medium storing computer-executable instructions for causing a computer to perform the above-described ConvLSTM and conditional random field-based SIC prediction method.
It should be noted that the advantages of the second to fourth aspects of the present invention and the prior art are the same as those of the above-described SIC prediction system based on ConvLSTM and conditional random fields, and are not described in detail here.
Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
The foregoing and/or additional aspects and advantages of the invention will become apparent and may be better understood from the following description of embodiments taken in conjunction with the accompanying drawings in which:
FIG. 1 is a flow chart of a method for SIC prediction based on ConvLSTM and conditional random fields in accordance with an embodiment of the present invention;
FIG. 2 is a schematic diagram of a set of sequence data samples constructed in accordance with one embodiment of the present invention;
FIG. 3 is a diagram of a ConvLSTM network architecture provided by one embodiment of the present invention;
FIG. 4 is a comparison of ConvLSTM and ConvLSTM-CRF iterative predictions of MAE for the next 12 months provided by one embodiment of the present invention;
FIG. 5 is a comparison graph of ConvLSTM and ConvLSTM-CRF iterative predictions for a future 12 month RMSE, provided by an embodiment of the invention;
fig. 6 is a flow chart of a ConvLSTM and conditional random field based SIC prediction system in accordance with an embodiment of the present invention.
Detailed Description
Embodiments of the present invention are described in detail below, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to like or similar elements or elements having like or similar functions throughout. The embodiments described below by referring to the drawings are illustrative only and are not to be construed as limiting the invention.
In the description of the present invention, the description of first, second, etc. is for the purpose of distinguishing between technical features only and should not be construed as indicating or implying a relative importance or implicitly indicating the number of technical features indicated or implicitly indicating the precedence of the technical features indicated.
In the description of the present invention, it should be understood that the direction or positional relationship indicated with respect to the description of the orientation, such as up, down, etc., is based on the direction or positional relationship shown in the drawings, is merely for convenience of describing the present invention and simplifying the description, and does not indicate or imply that the apparatus or element referred to must have a specific orientation, be constructed and operated in a specific orientation, and thus should not be construed as limiting the present invention.
In the description of the present invention, unless explicitly defined otherwise, terms such as arrangement, installation, connection, etc. should be construed broadly and the specific meaning of the terms in the present invention can be determined reasonably by a person skilled in the art in combination with the specific content of the technical solution.
The machine learning method starts to be applied to sea ice prediction. For example, two deep learning methods, namely MLP (Multilayer Perceptron, MLP) and LSTM (Long Short-Term Memory Network, LSTM), can be used for metaphase prediction of the north SIC, and the effect is better than that of the traditional autoregressive model. However, LSTM only considers time-series information, and cannot process spatial information. Because LSTM needs to convert two-dimensional space data into one-dimensional data for calculation, original space structural features are difficult to keep in the process, and the prediction effect is poor when the sea ice coverage area is changed severely in melting and freezing seasons. In addition, the convolution operation can well consider the spatial correlation, and SIC of the next month of the month prediction is realized through CNN (Convolutional Neural Networks, CNN). However, CNN has obvious defects that SIC at time t is completely dependent on SIC at time t-1, and information in all past time cannot be extracted.
To solve the above technical drawbacks, referring to fig. 1 and 2, an embodiment of the present invention provides a SIC prediction method based on ConvLSTM and conditional random fields, including:
s101, acquiring historical SIC data of a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set.
S102, inputting the sequence data sample set into a preset prediction model to obtain SIC prediction results of the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combined with a fully connected conditional random field.
Referring to FIG. 3, in some embodiments, the internal structural formula of the ConvLSTM network model includes:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
In some embodiments, inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result at the next moment output by the prediction model, including:
and inputting the sequence data sample set into a ConvLSTM network model in a preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is the SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model.
Obtaining a pixel set X= { X according to the first SIC prediction result 1 ,x 2 …x n Tag set l= { L of tag categories 1 ,L 2 …L k Characteristic vector f= { F } 1 ,f 2 …f n Color feature vector i= { I } 1 ,I 2 …I n Position feature vector p= { P } 1 ,p 2 …p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying tags for the kth category in the tag set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of pixel point n, f n Is the characteristic vector of the pixel point n, f n The method comprises a position characteristic vector of a pixel point n and a color characteristic vector of the pixel point n.
The fully connected conditional random field satisfies the gibbs distribution with the probability function as follows:
wherein Z (I) is a normalization factor, E (X|I) is an energy function.
Defining an energy function on a full graph basis as:
E(X∣I)=∑ i ψ u (x i )+∑ i, ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if x i =x j Time mu (x) i ,x j ) Equal to 0,w m And (3) the weight corresponding to the Gaussian kernel function of the m-th category in the tag set is m epsilon [1, k.For the Gaussian kernel function of the m-th class in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ To control the scale of the location information.
Iteratively updating Q (x) using an average field algorithm such that the K-L divergence between the probability functions P (x) and Q (x) reaches a predetermined value, then using Q i (x i ) Instead of P (x) i ) The SIC prediction result of the next moment output by the prediction model is obtained, and the calculation steps are as follows:
Q(x)=∏ i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, which is a normalization factor.
In some embodiments, the ConvLSTM network model sets 4 ConvLSTM cells, and the parameter REturn_sequences of the 4 th ConvLSTM cell is set to false.
For ease of understanding by those skilled in the art, a set of preferred embodiments are provided below:
1. generating a sequence data sample set:
acquiring historical SIC data at a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set.
The historical SIC data is selected from monthly SIC data of NSIDC fourth edition, the research area is selected from North Pole, the horizontal resolution is 25×25km, the SIC coverage is 31.1N-89.8N,180E-180W, the time span is 11 months 1978 to 12 months 2020, and the total time is 506 months, wherein all missing values of land and the like in the SIC data are assigned to 0, which means that no sea ice coverage exists.
Referring to fig. 2, a sequence data sample set was constructed based on data from month 11 in 1978 to month 12 in 2020 for 506 months. From the first month, data was progressively divided as a group every 13 months. Of each set of data, the first 12 months of data were entered as a model and the 13 th month of data were used as tag values. 506 months of data are divided into 494 groups, the first 422 groups are used as training sets, the 423 th to 446 th groups are used as verification sets, 24 groups are used as Test sets, and 48 groups are used as Test sets, wherein model training is performed by using the training sets, train Set is used as the training Set, validation Set is used as the verification Set, and Test Set is used as the Test Set in fig. 2.
2. Obtaining a prediction result:
and inputting the sequence data sample set into a ConvLSTM network model in a preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is the SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model.
Obtaining a pixel set X= { X according to the first SIC prediction result 1 ,x 2 …x n Tag set l= { L of tag categories 1 ,L 2 …L k Characteristic vector f= { F } 1 ,f 2 …f n Color feature vector i= { I } 1 ,I 2 …I n Position feature vector p= { P } 1 ,p 2 …p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying tags for the kth category in the tag set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of pixel point n, f n Is the characteristic vector of the pixel point n, f n The method comprises a position feature vector of a pixel point n and a color feature vector of the pixel point n;
the fully connected conditional random field satisfies the gibbs distribution with the probability function as follows.
Wherein Z (I) is a normalization factor, E (X|I) is an energy function.
Defining an energy function on a full graph basis as:
E(X∣I)=∑ i ψ u (x i )+∑ i,j ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if x i =x j Time mu (x) i ,x j ) Equal to 0,w m For the weight corresponding to the Gaussian kernel function of the mth category in the tag set, m is E [1, k]。Gaussian kernel function for the m-th class in a tag set,f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ To control the scale of the location information.
Iteratively updating Q (x) using an average field algorithm such that the K-L divergence between the probability functions P (x) and Q (x) reaches a predetermined value, then using Q i (x i ) Instead of P (x) i ) The SIC prediction result of the next moment output by the prediction model is obtained, and the calculation steps are as follows:
Q(x)=∏ i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, which is a normalization factor.
The internal structural formula of the ConvLSTM network model comprises:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
The ConvLSTM network model sets 4 ConvLSTM cells, and the parameter REturn_sequences of the 4 th ConvLSTM cell is set to false.
3. Experiment and results:
ConvLSTM and ConvLSTM-CRF performance differences were compared experimentally using a sample set of sequence data.
The single month prediction ability and the iterative prediction ability of the test model are tested, and the single month prediction refers to the prediction of the next month T13 by inputting 12 months of data of T1-T12. The iterative prediction is to use the predicted value as input and predict the value of the next moment, for example, the predicted value T13 is used as the input for predicting the next moment, the next month T14 is predicted based on 12 months of data of T2-T13, a plurality of months in the future are iteratively predicted according to the predicted value, 48 groups of data of the training set are used for single month prediction, and data of 1 month in 2019-12 months in 2019 are used for iteratively predicting data of 12 months in the future.
The prediction results of ConvLSTM and ConvLSTM-CRF are compared by MAE (Mean Absolute Error, MAE) and RMSE (Root Mean Squared Error, RMSE), convLSTM-CRF is used as a prediction model, and the calculation method of the two evaluation indexes is as follows:
to eliminate the influence of open water and land which do not change with time, the 0 value in the predicted result is removed, wherein n is the number of non-zero values in the predicted SIC, y i Is the true value of SIC, p i Is a predicted value.
In the single month prediction experiment, convLSTM and ConvLSTM-CRF are used for respectively predicting 48 groups of data of the test set, and the season and annual average MAE and RMSE percentage values of the prediction results are shown in tables 1 and 2:
table 1 shows a comparison of seasonal average MAE and annual average MAE for ConvLSTM model and ConvLSTM-CRF model;
TABLE 1
Table 2 shows a comparison of seasonal average MAE and annual average RMSE for ConvLSTM model and ConvLSTM-CRF model;
TABLE 2
As can be seen from tables 1 and 2, the prediction of ConvLSTM-CRF is superior to ConvLSTM in that sea ice melts to produce a pool in the thawing season, both MAE and RMSE are higher than the average in the freezing season, but both MAE and RMSE of ConvLSTM-CRF are lower than those of ConvLSTM.
Referring to fig. 4 and 5, in the iterative prediction experiment, data of 1 month in 2019 to 12 months in 2019 are used to iteratively predict data of 12 months in the future. Firstly, data from 1 month in 2019 to 12 months in 2019 are input into two models, a prediction result of 1 month in 20 years is obtained, and the prediction result is used as input of the next prediction. Finally, data from 1 month in 2020 to 12 months in 2020 are obtained. The monthly MAE and RMSE percentage values of the predicted results are shown in fig. 4 and 5. As can be seen from the figure, both the MAE and MRSE of the two models grow with increasing iteration number. ConvLSTM-CRF grows more slowly, however, and its MAE and MRSE are also smaller than ConvLSTM.
The invention combines ConvLSTM with fully connected conditional random fields, and proposes a ConvLSTM-CRF model to predict future moon SIC of North. Experiments prove that the prediction precision of the ConvLSTM-CRF model is superior to that of a ConvLSTM method, and the SIC boundary range can be predicted more accurately. Especially, when SIC changes drastically in thawing and freezing seasons, the prediction result of ConvLSTM-CRF model is closer to the real situation. In the sea ice iterative prediction, the ConvLSTM-CRF model also shows a better prediction effect, in addition, the ConvLSTM-CRF model can generalize application scenes, and all SIC-like time sequence prediction problems can adopt the model, such as precipitation prediction and snowfall prediction.
According to the method, historical SIC data of a plurality of continuous moments are obtained, normalization processing is carried out on the historical SIC data, the processed historical SIC data are spliced in a time step mode through a time sliding window, a sequence data sample set is generated, the sequence data sample set is input into a preset prediction model, and SIC prediction results of the next moment output by the prediction model are obtained, wherein the prediction model is a ConvLSTM network model combining a full-connection conditional random field, and the prediction results provided by the ConvLSTM are used as priori probabilities of the full-connection conditional random field to obtain SIC prediction results of the next moment output by the prediction model, so that the prediction accuracy of the integral SIC and the prediction accuracy of a boundary area are improved.
In addition, referring to fig. 6, the present invention further provides a SIC prediction system based on ConvLSTM and conditional random fields, including a data acquisition module 1100 and a data output module 1200, where:
the data acquisition module 1100 is configured to acquire historical SIC data at a plurality of consecutive moments, normalize the historical SIC data, and perform time-step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set;
the data output module 1200 is configured to input the sequence data sample set into a preset prediction model, and obtain a SIC prediction result at a next moment output by the prediction model, where the prediction model is a ConvLSTM network model combined with a fully connected conditional random field.
The system obtains historical SIC data at a plurality of continuous moments, performs normalization processing on the historical SIC data, performs time-step splicing on the processed historical SIC data through a time sliding window, generates a sequence data sample set, inputs the sequence data sample set into a preset prediction model to obtain SIC prediction results at the next moment output by the prediction model, wherein the prediction model is a ConvLSTM network model combining a full-connection conditional random field, and obtains SIC prediction results at the next moment output by the prediction model by taking the prediction results provided by the ConvLSTM as the priori probability of the full-connection conditional random field, thereby improving the prediction accuracy of the integral SIC and the prediction accuracy of a boundary region.
In some embodiments, the internal structural formula of the ConvLSTM network model includes:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
In some embodiments, the data output module comprises:
the first prediction module is used for inputting the sequence data sample set into a ConvLSTM network model in a preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is a SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model;
the data generation module is used for generating a pixel set X= { X according to the first SIC prediction result 1 ,x 2 …x n Tag set l= { L of tag categories 1 ,L 2 …L k Characteristic vector f= { F } 1 ,f 2 …f n Color feature vector i= { I } 1 ,I 2 …I n Position feature vector p= { P } 1 ,p 2 …p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying tags for the kth category in the tag set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of pixel point n, f n Is the characteristic vector of the pixel point n, f n The method comprises a position feature vector of a pixel point n and a color feature vector of the pixel point n;
the fully connected conditional random field satisfies the gibbs distribution with the probability function as follows:
wherein Z (I) is a normalization factor, E (X|I) is an energy function;
Defining an energy function on a full graph basis as:
E(X∣I)=∑ i ψ u (x i )+∑ i,j ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if x i =x j Time mu (x) i ,x j ) Equal to 0,w m For the weight corresponding to the Gaussian kernel function of the mth category in the tag set, m is E [1, k]。For the Gaussian kernel function of the m-th class in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ The scale of the control position information;
a result output module for iteratively updating Q (x) using an average field algorithm to achieve a predetermined value of the K-L divergence between the probability functions P (x) and Q (x), and then using Q i (x i ) Instead of P (x) i ) The SIC prediction result of the next moment output by the prediction model is obtained, and the calculation steps are as follows:
Q(x)=∏ i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, which is a normalization factor.
In some embodiments, the ConvLSTM network model sets 4 ConvLSTM cells, and the parameter REturn_sequences of the 4 th ConvLSTM cell is set to false.
It should be noted that, the embodiment of the present system and the embodiment of the method described above are based on the same inventive concept, so that the relevant content of the embodiment of the method described above is also applicable to the embodiment of the present system, and will not be repeated here.
The processor and the memory may be connected by a bus or other means.
The memory, as a non-transitory computer readable storage medium, may be used to store non-transitory software programs as well as non-transitory computer executable programs. In addition, the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory optionally includes memory remotely located relative to the processor, the remote memory being connectable to the processor through a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The non-transitory software programs and instructions required to implement the ConvLSTM and conditional random field based SIC prediction method of the above embodiments are stored in a memory, which when executed by a processor, performs the ConvLSTM and conditional random field based SIC prediction method of the above embodiments, for example, performs the method steps S101 to S102 in fig. 1 described above.
The present application also provides a computer-readable storage medium storing computer-executable instructions for performing: SIC prediction methods based on ConvLSTM and conditional random fields as described above.
The computer-readable storage medium stores computer-executable instructions that are executed by a processor or controller, for example, by a processor in the above-described electronic device embodiment, which may cause the processor to perform the convls (tm) and conditional random field based SIC prediction method in the above-described embodiment, for example, to perform the method steps S101 to S102 in fig. 1 described above.
Those of ordinary skill in the art will appreciate that all or some of the steps, systems, and methods disclosed above may be implemented as software, firmware, hardware, and suitable combinations thereof. Some or all of the physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor, or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit. Such software may be distributed on computer readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). The term computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program elements or other data, as known to those skilled in the art. Computer storage media includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital Versatile Disks (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by a computer. Furthermore, as is well known to those of ordinary skill in the art, communication media typically embodies computer readable instructions, data structures, program elements or other data in a modulated data signal such as a carrier wave or other transport mechanism and may include any information delivery media.
The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the above embodiments, and various changes can be made within the knowledge of one of ordinary skill in the art without departing from the spirit of the present invention.

Claims (8)

1. A method for SIC prediction based on ConvLSTM and conditional random fields, the method comprising:
acquiring historical SIC data at a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set;
inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result of the next moment output by the prediction model, wherein the prediction model refers to a ConvLSTM network model combined with a full-connection conditional random field, and specifically comprises the following steps:
inputting the sequence data sample set into a ConvLSTM network model in the preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is the SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model;
Obtaining a pixel set X= { X according to the first SIC prediction result 1 ,x 2 ...x n Tag set l= { L of tag categories 1 ,L 2 ...L k Characteristic vector f= { F } 1 ,f 2 ...f n Color feature vector i= { I } 1 ,I 2 ...I n Position feature vector p= { P } 1 ,p 2 ...p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying labels for the kth category in the label set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of the pixel point n, f n Is the characteristic vector of the pixel point n, and the f n The method comprises the steps of including a position feature vector of the pixel point n and a color feature vector of the pixel point n;
the fully connected conditional random field satisfies gibbs distribution, and the probability function is as follows:
wherein Z (I) is a normalization factor, E (X|I) energy function;
defining an energy function on a full graph basis as:
E(X|I)=∑ i ψ u (x i )+∑ i,j ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if said x i =x j Time mu (x) i ,x j ) Equal to 0,w m For the weight corresponding to the Gaussian kernel function of the mth category in the tag set, m is E [1, k],For the m-th class of Gaussian kernel function in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ To control the scale of the position information, W (1) Weight corresponding to Gaussian kernel function of first category in label set, w (2) The weight corresponding to the Gaussian kernel function of the second category in the tag set is given;
iteratively updating Q (x) using an average field algorithm such that the K-L divergence between the probability function P (x) and the Q (x) reaches a predetermined value, then using Q i (x i ) Instead of P (x) i ) And obtaining SIC prediction results of the next moment output by the prediction model, wherein the calculation steps are as follows:
Q(x)=П i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, ψ, as a normalization factor u (x i ) As a unitary potential function, w m And the weight corresponding to the Gaussian kernel function of the m-th category in the tag set.
2. The method for SIC prediction based on ConvLSTM and conditional random fields according to claim 1, wherein the internal structural formula of the ConvLSTM network model comprises:
i t =σ(W xi *x t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
Wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Is an output gate at the time t,X t is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
3. The method for predicting SIC based on ConvLSTM and conditional random field according to claim 2, wherein the ConvLSTM network model sets 4 ConvLSTM cells, and parameter return_sequences of the 4 th ConvLSTM cell is set to false.
4. A ConvLSTM and conditional random field based SIC prediction system, the system comprising:
the data acquisition module is used for acquiring historical SIC data at a plurality of continuous moments, carrying out normalization processing on the historical SIC data, and carrying out time-step splicing on the processed historical SIC data through a time sliding window to generate a sequence data sample set;
the data output module is used for inputting the sequence data sample set into a preset prediction model to obtain a SIC prediction result of the next moment output by the prediction model, wherein the prediction model refers to a ConvLSTM network model combined with a full-connection conditional random field, and specifically comprises the following steps:
the first prediction module is used for inputting the sequence data sample set into a ConvLSTM network model in the preset prediction model to obtain a first SIC prediction result, wherein the first SIC prediction result is a SIC prediction result of the next moment output by the ConvLSTM network model in the prediction model;
The data generation module is used for generating a pixel set X= { X according to the first SIC prediction result 1 ,x 2 ...x n Tag set l= { L of tag categories 1 ,L 2 ...L k Characteristic vector f= { F } 1 ,f 2 ...f n Color feature vector i= { I } 1 ,I 2 ...I n Position feature vector p= { P } 1 ,p 2 ...p n Pixel i is classified as label x i Probability P (x) i ) Wherein x is n Classification label for pixel n, L k Classifying labels for the kth category in the label set, p n Is the position characteristic vector of the pixel point n, I n Is the color feature vector of the pixel point n, f n Is the characteristic vector of the pixel point n, and the f n The method comprises the steps of including a position feature vector of the pixel point n and a color feature vector of the pixel point n;
the fully connected conditional random field satisfies gibbs distribution, and the probability function is as follows:
wherein Z (I) is a normalization factor, E (X|I) energy function;
defining an energy function on a full graph basis as:
E(X|I)=∑ i ψ u (x i )+∑ i,j ψ p (x i ,x j )
ψ u (x i )=-log P(x i )
wherein, psi is u (x i ) As a unitary potential function, ψ p (x i ,x j ) As a binary potential function, μ (x i ,x j ) As a tag comparison function, if x i ≠x j Time mu (x) i ,x j ) Equal to 1, if said x i =x j Time mu (x) i ,x j ) Equal to 0,w m For the weight corresponding to the Gaussian kernel function of the mth category in the tag set, m is E [1, k],For the m-th class of Gaussian kernel function in the tag set, f i Is the characteristic vector of the pixel point i, f j Is the eigenvector of pixel j, p i Is the position characteristic vector of the pixel point i, p j Is the position characteristic vector of the pixel point j, I i Is the color feature vector of pixel point I, I j Is the color feature vector of the pixel point j, theta α To control the scale of the position information, θ β To control the scale of the color information, θ γ To control the scale of the position information, W (1) Weight corresponding to Gaussian kernel function of first category in label set, w (2) The weight corresponding to the Gaussian kernel function of the second category in the tag set is given;
a result output module for iteratively updating Q (x) using an average field algorithm to achieve a predetermined value of K-L divergence between the probability function P (x) and the Q (x), and then using Q i (x i ) Instead of P (x) i ) Obtaining the next moment of the output of the prediction modelThe SIC prediction result of (2) is calculated as follows:
Q(x)=П i Q i (x i )
wherein Q is i (x i ) For pixel i, classification label x i I is the classification label of pixel i, l' is the prediction classification label of pixel i, Z i Mu (l, l') is a tag comparison function, ψ, as a normalization factor u (x i ) As a unitary potential function, w m And the weight corresponding to the Gaussian kernel function of the m-th category in the tag set.
5. The SIC prediction system of claim 4, wherein the internal structural formula of the ConvLSTM network model includes:
i t =σ(W xi *X t +W hi *H t-1 +W ci ⊙C t-1 +b i )
f t =σ(W xf *X t +W hf *H t-1 +W cf ⊙C t-1 +b f )
C t =f t ⊙C t-1 +i t ⊙tanh(W xc *X t +W hc *H t-1 +b c )
o t =σ(W xo *X t +W ho *H t-1 +W co ⊙C t +b o )
H t =o t ⊙tanh(C t )
Wherein σ is the gate, which is the convolution operator, as well as the inner product, tanh is the hyperbolic tangent function, i t An input gate at time t, f t Forgetting door at t moment, C t Memory cell at time t, o t Output gate at time t, X t Is a two-dimensional image at the time t, W xi Is X as a two-dimensional image related to an input door t Weight matrix of (H) t As the network predicted value at time t, W hi Is H as a hidden state related to the input door t-1 Weight matrix, W of (2) ci The memory cell associated with the input gate is C t-1 Weight matrix of b) i To input the bias value of the gate, W xf Is X as a two-dimensional image related to a forgetting door t Weight matrix, W of (2) hf Is H as a hidden state related to the forgetful door t-1 Weight matrix, W of (2) cf The memory unit related to the forgetting gate is C t-1 Weight matrix of b) f Is the bias value of the forgetting gate, W xc Is X as a two-dimensional image related to a memory unit t Weight matrix, W of (2) hc Is associated with a memory cell and has a hidden state H t-1 Weight matrix of b) c Is the bias value of the memory cell, W xo Is X as a two-dimensional image related to an output gate t Weight matrix, W of (2) ho Is H as a hidden state related to an output door t-1 Weight matrix, W of (2) co The memory cell associated with the output gate is C t Weight matrix of b) o To output the gate bias value.
6. The SIC prediction system of claim 5, wherein the ConvLSTM network model sets 4 convlstmcells, and the parameter return_sequences of the 4 th ConvLSTMcell is set to false.
7. A ConvLSTM and conditional random field based SIC prediction apparatus comprising at least one control processor and a memory for communication with the at least one control processor; the memory stores instructions executable by the at least one control processor to enable the at least one control processor to perform a ConvLSTM and conditional random field based SIC prediction method according to any one of claims 1 to 3.
8. A computer-readable storage medium, characterized by: the computer-readable storage medium stores computer-executable instructions for causing a computer to perform a ConvLSTM and conditional random field based SIC prediction method according to any one of claims 1 to 3.
CN202210798362.4A 2022-07-08 2022-07-08 SIC prediction method and system based on ConvLSTM and conditional random field Active CN115359310B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210798362.4A CN115359310B (en) 2022-07-08 2022-07-08 SIC prediction method and system based on ConvLSTM and conditional random field

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210798362.4A CN115359310B (en) 2022-07-08 2022-07-08 SIC prediction method and system based on ConvLSTM and conditional random field

Publications (2)

Publication Number Publication Date
CN115359310A CN115359310A (en) 2022-11-18
CN115359310B true CN115359310B (en) 2023-09-01

Family

ID=84032693

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210798362.4A Active CN115359310B (en) 2022-07-08 2022-07-08 SIC prediction method and system based on ConvLSTM and conditional random field

Country Status (1)

Country Link
CN (1) CN115359310B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116091779B (en) * 2023-04-07 2023-06-27 湖北一方科技发展有限责任公司 Method and system for predicting based on hydrologic data

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110097115A (en) * 2019-04-28 2019-08-06 南开大学 A kind of saliency object detecting method based on attention metastasis
CN112508936A (en) * 2020-12-22 2021-03-16 中国科学院空天信息创新研究院 Remote sensing image change detection method based on deep learning
CN113344932A (en) * 2021-06-01 2021-09-03 电子科技大学 Semi-supervised single-target video segmentation method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110097115A (en) * 2019-04-28 2019-08-06 南开大学 A kind of saliency object detecting method based on attention metastasis
CN112508936A (en) * 2020-12-22 2021-03-16 中国科学院空天信息创新研究院 Remote sensing image change detection method based on deep learning
CN113344932A (en) * 2021-06-01 2021-09-03 电子科技大学 Semi-supervised single-target video segmentation method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《Monthly Arctic Sea Ica Concentration Prediction Based on the ConvLSTM Model》;Li PL et al;《IEEE》;正文第二、三部分 *

Also Published As

Publication number Publication date
CN115359310A (en) 2022-11-18

Similar Documents

Publication Publication Date Title
Gasparin et al. Deep learning for time series forecasting: The electric load case
Khodayar et al. Convolutional graph autoencoder: A generative deep neural network for probabilistic spatio-temporal solar irradiance forecasting
US11835677B2 (en) Systems and methods for selecting global climate simulation models for training neural network climate forecasting models
US20230128989A1 (en) Systems and methods of data preprocessing and augmentation for neural network climate forecasting models
CN111406267B (en) Neural architecture search using performance prediction neural networks
US20220146708A1 (en) Methods and systems for climate forecasting using artificial neural networks
US10386544B2 (en) Solar power forecasting using mixture of probabilistic principal component analyzers
Corchado et al. Data mining using example-based methods in oceanographic forecast models
US11205099B2 (en) Training neural networks using data augmentation policies
US20170228871A1 (en) System and method for labelling aerial images
CN115359310B (en) SIC prediction method and system based on ConvLSTM and conditional random field
Mbuvha et al. Separable shadow Hamiltonian hybrid Monte Carlo for Bayesian neural network inference in wind speed forecasting
Demertzis et al. Geo-AI to aid disaster response by memory-augmented deep reservoir computing
US11880767B2 (en) Increasing accuracy and resolution of weather forecasts using deep generative models
Zakhrouf et al. Novel insights for streamflow forecasting based on deep learning models combined the evolutionary optimization algorithm
Toms et al. Testing the reliability of interpretable neural networks in geoscience using the Madden–Julian oscillation
Lee et al. Confidence‐aware deep learning forecasting system for daily solar irradiance
CN116861262B (en) Perception model training method and device, electronic equipment and storage medium
CN114926701A (en) Model training method, target detection method and related equipment
CN117494871A (en) Ship track prediction method considering ship interaction influence
Spezia et al. An evolutionary Monte Carlo method for the analysis of turbidity high‐frequency time series through Markov switching autoregressive models
CN116824583A (en) Weak supervision video scene graph generation method and device and electronic equipment
Lee et al. Model‐agnostic online forecasting for PV power output
Thuemmel et al. Inductive biases in deep learning models for weather prediction
Scharenbroich et al. A Bayesian framework for storm tracking using a hidden-state representation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant