CN114385619A - Multi-channel ocean observation time sequence scalar data missing value prediction method and system - Google Patents

Multi-channel ocean observation time sequence scalar data missing value prediction method and system Download PDF

Info

Publication number
CN114385619A
CN114385619A CN202210285171.8A CN202210285171A CN114385619A CN 114385619 A CN114385619 A CN 114385619A CN 202210285171 A CN202210285171 A CN 202210285171A CN 114385619 A CN114385619 A CN 114385619A
Authority
CN
China
Prior art keywords
sequence
time
scalar data
attention module
observation time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210285171.8A
Other languages
Chinese (zh)
Other versions
CN114385619B (en
Inventor
常文庆
董火民
李响
王英龙
赵志刚
王春晓
武鲁
王金伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Computer Science Center National Super Computing Center in Jinan
Original Assignee
Shandong Computer Science Center National Super Computing Center in Jinan
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Computer Science Center National Super Computing Center in Jinan filed Critical Shandong Computer Science Center National Super Computing Center in Jinan
Priority to CN202210285171.8A priority Critical patent/CN114385619B/en
Publication of CN114385619A publication Critical patent/CN114385619A/en
Application granted granted Critical
Publication of CN114385619B publication Critical patent/CN114385619B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2474Sequence data queries, e.g. querying versioned data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Fuzzy Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention belongs to the field of computer systems based on specific calculation models, and provides a method and a system for predicting missing values of multi-channel ocean observation time sequence scalar data, which are used for acquiring ocean observation time sequence scalar data with ocean missing values; obtaining a marine missing value prediction result by adopting a TA-RNN model based on the marine observation time sequence scalar data; the TA-RNN model comprises a convolution attention module, a space attention module and a time attention module, wherein the convolution attention module is used for refining the ocean observation time series scalar data; the space attention module is used for capturing dynamic space correlation of the refined ocean observation time sequence scalar data; the temporal attention module is configured to capture dynamic temporal correlations between different time intervals in the spatial attention module output data.

Description

Multi-channel ocean observation time sequence scalar data missing value prediction method and system
Technical Field
The invention belongs to the field of computer systems based on specific calculation models, and particularly relates to a method and a system for predicting missing values of multi-channel ocean observation time sequence scalar data.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
Ocean monitoring relies on widely deployed ocean buoys and observation stations that integrate various types of ocean sensors. The marine ecosystem has a complex structure, so that marine observation data has complexity and diversity. Missing values refer to clustering, grouping, deletion, or truncation of data due to lack of information in the original data, which means that some characteristic value or values in the data are incomplete. Due to the fact that the ocean observation data such as chlorophyll, wind speed, dissolved oxygen, salinity, temperature, oxygen content, wind speed and turbidity are collected in a cooperative mode through a buoy system, a navigation system and a database system, all collection systems are easily interfered by external environment factors, and missing values exist in the data. These data have an impact on the accuracy of downstream applications, such as ocean data assimilation and intelligent data mining. The traditional methods such as mathematical statistics, empirical prediction and the like cannot achieve the expected target on ocean observation data with the characteristics of multiple factors, irregularity, complexity and the like. Therefore, the accurate marine observation data prediction model is researched by taking data as drive, and irreplaceable effects are exerted on filling missing values of marine observation time sequence scalar data.
Disclosure of Invention
In order to solve the technical problems in the background art, the invention provides a method and a system for predicting missing values of multi-channel ocean observation time sequence scalar data, which predict the future change trend of the multi-channel ocean observation time sequence scalar data through historical data of the multi-channel ocean observation time sequence scalar data, and use the predicted data in filling of the missing values.
In order to achieve the purpose, the invention adopts the following technical scheme:
the invention provides a method for predicting missing values of multi-channel ocean observation time sequence scalar data.
A multi-channel ocean observation time sequence scalar data missing value prediction method comprises the following steps:
obtaining ocean observation time sequence scalar data with ocean deficiency values;
obtaining a marine missing value prediction result by adopting a TA-RNN model based on the marine observation time sequence scalar data;
the TA-RNN model comprises a convolution attention module, a space attention module and a time attention module, wherein the convolution attention module is used for refining the ocean observation time series scalar data; the space attention module is used for capturing dynamic space correlation of the refined ocean observation time sequence scalar data; the temporal attention module is configured to capture dynamic temporal correlations between different time intervals in the spatial attention module output data.
A second aspect of the invention provides a multi-channel ocean observation time series scalar data missing value prediction system.
A multi-channel ocean observation time series scalar data missing value prediction system, comprising:
a data acquisition module configured to: obtaining ocean observation time sequence scalar data with ocean deficiency values;
a prediction module configured to: obtaining a marine missing value prediction result by adopting a TA-RNN model based on the marine observation time sequence scalar data;
the TA-RNN model comprises a convolution attention module, a space attention module and a time attention module, wherein the convolution attention module is used for refining the ocean observation time series scalar data; the space attention module is used for capturing dynamic space correlation of the refined ocean observation time sequence scalar data; the temporal attention module is configured to capture dynamic temporal correlations between different time intervals in the spatial attention module output data.
Compared with the prior art, the invention has the beneficial effects that:
according to the three-stage attention-based recurrent neural network (TA-RNN) model, in the first stage, a convolution attention module is adopted to carry out thinning operation on an input sequence, so that the new input sequence has stronger representation capability; in the second stage, a space attention module is adopted to enable the model to selectively capture the dynamic correlation among different input sequences; and in the third stage, a time attention module is adopted, so that the model can adaptively capture the dynamic time correlation between different time intervals in the input sequence.
The method and the device can accurately fill the missing value, thereby avoiding the problems of inaccurate filling of the missing value, larger error and the like.
The method overcomes the defect that the existing missing value filling can only depend on single-channel data for filling, and aims at marine multi-channel observation time sequence scalar data, and fills the missing value existing in the chlorophyll sequence through the correlation between chlorophyll and marine observation time sequence scalar data such as depth, temperature, conductivity, salinity, oxygen content, dissolved oxygen concentration, chlorophyll (including the missing value), turbidity, PH value, wind speed and the like. Because ocean data is abundant and diverse, in most scenes, a target sequence usually does not exist independently but exists with numerous time sequences, a specific scene data set is formed together, missing value filling is carried out on a multichannel ocean observation time sequence scalar data set, and the actual situation of the data set acquired by an ocean acquisition system is more similar.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, are included to provide a further understanding of the invention, and are incorporated in and constitute a part of this specification, illustrate exemplary embodiments of the invention and together with the description serve to explain the invention and not to limit the invention.
FIG. 1 is a flow chart of a method for predicting missing values of multi-channel marine observation time series scalar data according to an embodiment of the present invention;
FIG. 2 is a flow diagram illustrating missing value padding according to an embodiment of the present invention;
FIG. 3 is a diagram of a recursive neural network model framework based on three-stage attention according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of a convolution attention module (CBAM) shown in an embodiment of the present invention;
FIG. 5 is a schematic diagram of a channel attention module shown in an embodiment of the present invention;
FIG. 6 is a schematic diagram of a spatial attention module shown in an embodiment of the present invention;
FIG. 7 is a diagram showing chlorophyll sequence with deletion values according to an embodiment of the present invention;
FIG. 8 is a graph showing the effect of chlorophyll sequence prediction on a sample set without missing values according to an embodiment of the present invention;
FIG. 9 is a filled chlorophyll sequence diagram according to an embodiment of the present invention;
FIG. 10 is a diagram showing a chlorophyll sequence containing deletion values, a part of which has a length of 50 according to an embodiment of the present invention;
FIG. 11 is a chlorophyll sequence diagram after linear interpolation processing according to an embodiment of the present invention;
fig. 12 is a diagram illustrating the effect of filling chlorophyll deletion after model prediction according to an embodiment of the present invention.
Detailed Description
The invention is further described with reference to the following figures and examples.
It is to be understood that the following detailed description is exemplary and is intended to provide further explanation of the invention as claimed. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the invention. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.
It is noted that the flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of methods and systems according to various embodiments of the present disclosure. It should be noted that each block in the flowchart or block diagrams may represent a module, a segment, or a portion of code, which may comprise one or more executable instructions for implementing the logical function specified in the respective embodiment. It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
As introduced in the background art, most of the currently common methods for filling missing values of marine multichannel observation scalar data adopt fixed values, medians and modes to fill the missing values, but the problems that the filled missing values are inaccurate, large errors exist and the like can occur. According to the three-stage attention-based recurrent neural network (TA-RNN) model, in the first stage, a convolution attention module is adopted to carry out thinning operation on an input sequence, so that the new input sequence has stronger representation capability; in the second stage, a space attention module is adopted to enable the model to selectively capture the dynamic correlation among different input sequences; and in the third stage, a time attention module is adopted. Enabling the model to adaptively capture the dynamic time correlation between different time intervals in the input sequence. The method and the device can accurately fill the missing value, thereby avoiding the problems of inaccurate filling of the missing value, larger error and the like.
Lack of current deep learningThe value filling algorithm has the defect that missing value filling cannot be carried out on missing values of multi-channel ocean observation time sequence scalar data. The main current way to fill in missing values is E2GAN, but for sensor input to E2When the GAN is used for missing value filling, most of the GAN only has two columns of data, time and detection values. The method is basically absent in an actual ocean scene, and a plurality of sensors are integrated on an ocean buoy to work simultaneously, so that the data acquired by the ocean sensors are basically multi-channel data. The method aims at ocean multi-channel observation time sequence scalar data, adopts a three-stage attention-based recurrent neural network model, predicts the current value of a target sequence by using the past value of the target sequence and the current value and the past value of other sequences related to the target sequence, and fills the predicted value in the missing value position of the current data set.
The invention provides a three-stage attention-based recurrent neural network model for accurately predicting a multichannel ocean data missing value, wherein the three-stage attention-based recurrent neural network model is shown in figure 3, and the three-stage attention modules are respectively as follows:
(1) and the convolution attention module refines the original input sequence and increases the characterization capability of the original input sequence. The convolution attention module is proposed in 2018, mixes the space attention and the channel attention in the convolution module, is a lightweight and general module with good portability, and is used for processing a multi-channel input sequence.
(2) A spatial attention module that enables the model to selectively capture dynamic spatial correlations between different input sequences.
(3) A time attention module that enables the model to adaptively capture dynamic time correlations between different time intervals in the input sequence.
As shown in FIG. 3, a convolution attention module that takes the original input sequence
Figure 960010DEST_PATH_IMAGE001
Refining to generate new input sequence
Figure 807880DEST_PATH_IMAGE002
After convolution attention operation, the characterization capability of the original input sequence is increased; a spatial attention module capable of selectively capturing dynamic correlations between different input sequences; the gate control cycle unit can learn the hidden layer representation of the input sequence and update the hidden state at the current moment according to the input sequence and the hidden state at the previous moment; time attention module. It can adaptively capture the dynamic time correlation between different time intervals in the sequence.
Specific embodiments of the invention are described below from various embodiments:
example one
As shown in fig. 1, the present embodiment provides a method for predicting missing values of multi-channel ocean observation time series scalar data.
Here we use a multi-channel marine observation time series scalar dataset with chlorophyll deficiency values for the canadian ocean network, the multi-channel marine observation time series scalar dataset comprising: marine observation time sequence scalar data such as depth, temperature, conductivity, salinity, oxygen content, dissolved oxygen concentration, chlorophyll (containing a deletion value), turbidity, a pH value, wind speed and the like, wherein a chlorophyll sequence with the deletion value is shown in figure 7, wherein an x axis represents the length of the chlorophyll sequence, a y axis represents the value of the chlorophyll, and the deletion value is filled in a data set by using a fixed value 999 as shown in a circle. With reference to this data set, the technical solution of this embodiment is: the multi-channel ocean observation time series scalar data missing value prediction based on the three-stage attention recurrent neural network prediction model, as shown in FIG. 2, comprises the following steps:
(1) the data set is used as the input of the model, and data preprocessing is firstly carried out on the data set to obtain an initial sequence. The pretreatment stage comprises:
(1-1) processing chlorophyll data to be filled in a linear interpolation mode to obtain initial data;
(1-2) constructing a sample set without missing values, inputting the sample set without missing values into a model for training, and calculating corresponding numerical values by adopting a loss function.
(2) Taking chlorophyll sequence as the target sequence to be predicted, and measuring the correlation between other sequences and the target sequence through Pearson correlation coefficient. By calculating the quotient of covariance and standard deviation between the target sequence and the sequences such as depth, temperature, conductivity, salinity, oxygen content, dissolved oxygen concentration, chlorophyll (containing deletion value), turbidity, PH value, wind speed and the like, seven sequences of the depth, the wind speed, the oxygen content, the dissolved oxygen, the turbidity, the temperature and the salinity are selected to form an input sequence together with the chlorophyll sequence
Figure 733110DEST_PATH_IMAGE003
Wherein n represents the number of different types of sequences,
Figure 19735DEST_PATH_IMAGE004
which represents the size of the length of the input sequence,
Figure 346811DEST_PATH_IMAGE005
the multichannel sequence composed of seven sequences including depth, wind speed, oxygen content, dissolved oxygen, turbidity, temperature and salinity is shown.
(3) Decomposing the input sequence after (2) into chlorophyll sequences
Figure 427900DEST_PATH_IMAGE006
And a multichannel sequence consisting of seven sequences including depth, wind speed, oxygen content, dissolved oxygen, turbidity, temperature and salinity
Figure 840427DEST_PATH_IMAGE007
(4) The multi-channel sequence
Figure 602846DEST_PATH_IMAGE005
Input to the CBAM module, which is shown in fig. 4. First polymerizing signatures by average pooling and maximum pooling operationsAnd generating two different spatial context descriptors to respectively represent the average pool characteristic and the maximum pool characteristic according to the spatial information of the rays:
Figure 112325DEST_PATH_IMAGE008
and
Figure 567577DEST_PATH_IMAGE009
(5) as shown in FIG. 5, the two descriptors are input into a shared network composed of a multi-layer perceptron and a hidden layer to generate a channel attention map
Figure 529717DEST_PATH_IMAGE010
Namely:
Figure 95828DEST_PATH_IMAGE011
in the formula
Figure 131917DEST_PATH_IMAGE012
A sigmoid function is represented as a function,
Figure 820387DEST_PATH_IMAGE013
and
Figure 207506DEST_PATH_IMAGE014
representing multi-layer perceptron weights.
(6) Inputting the original sequence
Figure 311728DEST_PATH_IMAGE005
Element-by-element multiplication operations are performed with the sequence subject to the channel attention mapping. Obtaining a new input sequence
Figure 530220DEST_PATH_IMAGE015
Namely:
Figure 327275DEST_PATH_IMAGE016
in the formula,
Figure 936111DEST_PATH_IMAGE017
representing element-by-element multiplication.
(7) As shown in fig. 6, the newly generated sequence is
Figure 171920DEST_PATH_IMAGE015
Applying the average pooling and maximum pooling operations along the channel axis, aggregating the feature mapped channel information by two pooling operations, generating two spatial context descriptors:
Figure 182601DEST_PATH_IMAGE018
and
Figure 212874DEST_PATH_IMAGE019
. And concatenate them to generate the efficient feature descriptor, on which we apply the convolutional layer to generate the spatial attention map
Figure 309006DEST_PATH_IMAGE020
Namely:
Figure 755031DEST_PATH_IMAGE021
in the formula,
Figure 948115DEST_PATH_IMAGE012
a sigmiod activation function is represented,
Figure 86972DEST_PATH_IMAGE022
representing a filter size of
Figure 732717DEST_PATH_IMAGE023
The convolution operation of (1).
(8) Carrying out element-by-element multiplication operation on the new input sequence obtained in the step (6) and the sequence subjected to space attention mapping to obtain final refined output
Figure 982433DEST_PATH_IMAGE024
Namely:
Figure 967706DEST_PATH_IMAGE025
(9) output after thinning
Figure 339782DEST_PATH_IMAGE024
Generating a new input sequence by a spatial attention mechanism as an input to a spatial attention module
Figure 410506DEST_PATH_IMAGE026
Namely:
Figure 463913DEST_PATH_IMAGE027
in the formula,
Figure 366010DEST_PATH_IMAGE028
representing the kth input sequence
Figure 846670DEST_PATH_IMAGE029
Figure 139111DEST_PATH_IMAGE030
Attention weight representing hidden state of encoder at time t, attention weight
Figure 58525DEST_PATH_IMAGE030
Performing SoftMax function standardization to obtain
Figure 752812DEST_PATH_IMAGE031
Is the encoder hidden state at time t-1,
Figure 466690DEST_PATH_IMAGE032
and
Figure 512006DEST_PATH_IMAGE033
is a matrix of parameters that needs to be learned,
Figure 907215DEST_PATH_IMAGE034
is an attention weight that measures the importance of the kth input feature at time t.
(10) We take attention to the weight, we can update the input sequence and the encoder hidden state at time t, i.e.:
Figure 518325DEST_PATH_IMAGE035
(11) the hidden states of the decoder and the encoder at the t-1 moment and the hidden states of the encoder at the t moment are input into a time attention module, and a context vector is obtained through a time attention mechanism
Figure 606367DEST_PATH_IMAGE036
Namely:
Figure 873400DEST_PATH_IMAGE037
Figure 892476DEST_PATH_IMAGE038
in the formula,
Figure 561354DEST_PATH_IMAGE039
is a matrix of parameters that needs to be learned,
Figure 882614DEST_PATH_IMAGE040
is that
Figure 371364DEST_PATH_IMAGE041
The hidden state of the decoder is at the moment,
Figure 373955DEST_PATH_IMAGE042
is the hidden state of the encoder at time t-1,
Figure 694078DEST_PATH_IMAGE043
is hidden form of encoder at time tThe state of the optical disk is changed into a state,
Figure 858343DEST_PATH_IMAGE044
indicating attention weight of decoder at time t
Figure 99969DEST_PATH_IMAGE044
Performing SoftMax function standardization to obtain
Figure 968568DEST_PATH_IMAGE045
Attention weights that measure the importance of the ith input feature at time t,
Figure 346460DEST_PATH_IMAGE046
is a context vector.
(12) When obtaining the context vector at the time t
Figure 947205DEST_PATH_IMAGE046
Combining them with the target time series and updating the decoder hidden state at time t
Figure 472864DEST_PATH_IMAGE047
Namely:
Figure 817258DEST_PATH_IMAGE048
in the formula,
Figure 377552DEST_PATH_IMAGE049
and b is a parameter matrix mapping the connection to the decoder input,
Figure 883620DEST_PATH_IMAGE050
is the input to the decoder at time t-1,
Figure 99838DEST_PATH_IMAGE051
is the calculated context vector and the context vector,
Figure 44660DEST_PATH_IMAGE052
indicating a connection operation,
Figure 397144DEST_PATH_IMAGE053
Is a new input after a linear transformation,
Figure 74113DEST_PATH_IMAGE040
is the hidden state of the decoder at time t-1.
(13) Finally, the context vector is added
Figure 574364DEST_PATH_IMAGE054
Implicit with time-T decoder
Figure 260561DEST_PATH_IMAGE055
Concatenate to become the hidden state of the new decoder from which the final prediction is made:
Figure 529868DEST_PATH_IMAGE056
in the form of matrix
Figure 643318DEST_PATH_IMAGE057
Sum vector
Figure 568548DEST_PATH_IMAGE058
Mapping connections
Figure 855173DEST_PATH_IMAGE059
Finally we use a linear variation (
Figure 182249DEST_PATH_IMAGE060
And
Figure 466600DEST_PATH_IMAGE061
) Generating the final chlorophyll prediction result. The predicted effect graph is shown in fig. 8:
(14) the final filling result is obtained by filling the predicted chlorophyll data into the data set with the chlorophyll missing value, and the result is shown in fig. 9, in which the x axis represents the length of the chlorophyll sequence, the y axis represents the value of the chlorophyll concentration, and the circled portion represents the value after filling the missing value.
Here we take a portion of the length 50 chlorophyll sequence with missing values, as shown in FIG. 10, where the x-axis represents the length of the chlorophyll sequence and the y-axis represents the value of chlorophyll. The circled portion indicates the deletion value of the chlorophyll sequence, where the deletion value is indicated by 999 constant.
The linear interpolation of the chlorophyll sequence is shown in fig. 11, in which the x-axis represents the length of the chlorophyll sequence and the y-axis represents the value of chlorophyll. The circled portion indicates the result of filling the missing values of the chlorophyll sequence after linear interpolation.
The results of model prediction of chlorophyll sequence are shown in fig. 12, in which the x-axis represents the length of chlorophyll sequence and the y-axis represents the value of chlorophyll. The circled portion represents the result of filling in the deletion values of chlorophyll sequences as predicted by the model.
Comparing fig. 10, 11, and 12, we can see that the accuracy of the recursive neural network model based on three-stage attention for missing value padding is higher than that of linear interpolation.
The present embodiment includes the following advantages:
(1) the current value of the chlorophyll sequence is predicted based on the previous value, the current value and the past value of the depth, the wind speed, the oxygen content, the dissolved oxygen, the turbidity, the temperature and the salinity sequence of the chlorophyll sequence, and the defect that the existing missing value filling technology can only fill data in a marine single-channel observation time sequence scale data set is overcome.
(2) In the embodiment, the space attention module is used for replacing the original input attention module, and the dynamic space correlation among different input sequences can be selectively captured, so that the model can focus on the characteristics related to the prediction task, the prediction accuracy of the model is improved, the training cost of the model is reduced, and the accuracy of the model for filling missing values is improved.
(3) The embodiment uses the convolution attention module to refine the input sequence, compared with the original input attention module of the DA-RNN, the convolution attention module can refine the input sequence, and the characterization capability of the input sequence is enhanced. The problem of gradient decline of the model in training mass data is solved, the prediction performance is not reduced due to the increase of data volume, and the method has good stability. The model can effectively fill large batches of data sets with missing values.
Example two
The embodiment provides a multi-channel ocean observation time sequence scalar data missing value prediction system.
The technical scheme of the embodiment comprises the following modules:
1. acquisition and preprocessing module
Acquiring a multichannel ocean observation time sequence scalar data set with a chlorophyll deficiency value, and preprocessing the data set, wherein the preprocessing process comprises the following steps:
(1) and processing the part caused by chlorophyll sequence deletion in a linear interpolation mode, constructing a sample set without chlorophyll deletion values, inputting the sample set without the deletion values into the model of the invention for training, and calculating corresponding values by adopting a loss function.
(2) Taking the chlorophyll sequence as a target sequence to be predicted, and measuring the correlation between the depth, temperature, conductivity, salinity, oxygen content, dissolved oxygen concentration, chlorophyll (including a deletion value), turbidity, PH value, wind speed and other sequences in the marine multichannel data set and the chlorophyll sequence through Pearson correlation coefficients. By calculating the quotient of covariance and standard deviation between the target sequence and other sequences, we selected seven sequences of depth, wind speed, oxygen content, dissolved oxygen, turbidity, temperature, salinity, which are most related to chlorophyll sequence, and the target sequence together to form the input sequence:
Figure 410285DEST_PATH_IMAGE062
(3) decomposing the data after (2) into chlorophyll sequences
Figure 438284DEST_PATH_IMAGE063
And is composed ofNew input sequence composed of depth, wind speed, oxygen content, dissolved oxygen, turbidity, temperature and salt
Figure 947763DEST_PATH_IMAGE064
Where n represents the number of different types of sequences in the new input sequence,
Figure 403015DEST_PATH_IMAGE065
indicating the input sequence length.
2. Convolution attention module
Will be provided with
Figure 302838DEST_PATH_IMAGE066
As an input, the convolution attention module (CBAM) infers a one-dimensional channel attention map in turn
Figure 931265DEST_PATH_IMAGE067
And two-dimensional spatial attention mapping
Figure 967354DEST_PATH_IMAGE068
. The overall process can be expressed as follows:
Figure 593508DEST_PATH_IMAGE069
Figure 777365DEST_PATH_IMAGE070
wherein,
Figure 147166DEST_PATH_IMAGE071
representing element-by-element multiplication, in which the channel attention value is propagated along the spatial dimension,
Figure 303341DEST_PATH_IMAGE072
is the output of the final refinement.
The specific calculation process is that firstly, the spatial information of the feature mapping is aggregated by average pooling and maximum pooling operation to generate twoThe different spatial context descriptors represent the average pool characteristic and the maximum pool characteristic, respectively:
Figure 162712DEST_PATH_IMAGE073
and
Figure 771548DEST_PATH_IMAGE074
the two descriptors are then sent to a shared network to generate a channel attention map
Figure 741778DEST_PATH_IMAGE075
The shared network consists of a multilayer perceptron and a hidden layer, and after the shared layer is applied to each descriptor, the shared layer outputs a feature vector by using element summation and combination, and the channel attention calculation formula is as follows:
Figure 752460DEST_PATH_IMAGE076
wherein,
Figure 720416DEST_PATH_IMAGE077
a sigmoid function is represented as a function,
Figure 878865DEST_PATH_IMAGE078
and
Figure 590469DEST_PATH_IMAGE079
representing multi-layer perceptron weights.
Computation space note that we first apply the average pooling and max pooling operations along the channel axis and concatenate them to generate the efficient feature descriptors. Applying pool operations along the channel axis can effectively highlight the information region. On concatenated feature descriptors, we apply convolutional layers to generate spatial attention maps
Figure 721236DEST_PATH_IMAGE068
And generating two spatial context descriptors by aggregating the channel information of the feature mapping through two pool operations:
Figure 922410DEST_PATH_IMAGE080
and
Figure 240259DEST_PATH_IMAGE081
the spatial attention is calculated as follows:
Figure 552291DEST_PATH_IMAGE082
wherein,
Figure 537565DEST_PATH_IMAGE083
a sigmiod activation function is represented,
Figure 847324DEST_PATH_IMAGE084
representing a filter size of
Figure 980365DEST_PATH_IMAGE085
The convolution operation of (1). The input features are preprocessed through a convolution attention mechanism, so that the input features are refined, and the characterization capability of the input features is enhanced.
3. Space attention module
Output after thinning
Figure 33771DEST_PATH_IMAGE072
Generating a new input sequence by a spatial attention mechanism as an input to a spatial attention module
Figure 873551DEST_PATH_IMAGE086
Namely:
Figure 416528DEST_PATH_IMAGE087
in the formula,
Figure 974548DEST_PATH_IMAGE088
representing the kth input sequence
Figure 566067DEST_PATH_IMAGE089
Figure 588249DEST_PATH_IMAGE090
Attention weight representing hidden state of encoder at time t, attention weight
Figure 239811DEST_PATH_IMAGE090
Performing SoftMax function standardization to obtain
Figure 81865DEST_PATH_IMAGE091
Is the encoder hidden state at time t-1,
Figure 477074DEST_PATH_IMAGE092
and
Figure 291446DEST_PATH_IMAGE093
is a matrix of parameters that needs to be learned,
Figure 910646DEST_PATH_IMAGE094
is an attention weight that measures the importance of the kth input feature at time t. Through a spatial attention mechanism, the model is enabled to selectively capture dynamic spatial correlations between different input features.
4. Encoder for encoding a video signal
The encoder is essentially an RNN, which encodes an input sequence into a feature representation in machine translation. For the input sequence after the space attention operation
Figure 177680DEST_PATH_IMAGE095
The encoder is used for learning from
Figure 376580DEST_PATH_IMAGE096
To
Figure 373355DEST_PATH_IMAGE097
Mapping (at time t):
Figure 366718DEST_PATH_IMAGE098
wherein,
Figure 121048DEST_PATH_IMAGE099
representing the hidden state of the encoder at time t, m representing the size of the hidden state,
Figure 185956DEST_PATH_IMAGE100
representing a non-linear mapping function, here we use gated round-robin units (GRUs) as
Figure 709341DEST_PATH_IMAGE101
To capture long term dependencies in the sequence. The GRU consists of 2 gates: reset door
Figure 670344DEST_PATH_IMAGE102
Updating door
Figure 911969DEST_PATH_IMAGE103
. The update process of the GRU is as follows:
Figure 452672DEST_PATH_IMAGE104
wherein,
Figure 158460DEST_PATH_IMAGE105
encoder hidden state for time t-1
Figure 759205DEST_PATH_IMAGE106
And input of the current time t
Figure 284865DEST_PATH_IMAGE096
The connection of (a) to (b),
Figure 629258DEST_PATH_IMAGE107
are parameters that need to be learned.
Figure 923974DEST_PATH_IMAGE077
A sigmoid activation function is represented,
Figure 695620DEST_PATH_IMAGE108
representing element-by-element multiplication.
5. Time attention module
In the decoding stage, a time attention mechanism is used for modeling dynamic time correlation among different time intervals in an input sequence, the hidden states of a decoder and an encoder at the t-1 moment and the hidden states of the encoder at the t moment are input into a time attention module, and a context vector is obtained through the time attention mechanism
Figure 646259DEST_PATH_IMAGE109
The attention weight of each decoder hidden state at time t is defined as follows:
Figure 591081DEST_PATH_IMAGE110
wherein,
Figure 943565DEST_PATH_IMAGE111
is a matrix of parameters that needs to be learned,
Figure 948430DEST_PATH_IMAGE112
is that
Figure 386365DEST_PATH_IMAGE113
The hidden state of the decoder is at the moment,
Figure 72561DEST_PATH_IMAGE114
is the hidden state of the encoder at time t-1,
Figure 341868DEST_PATH_IMAGE115
is the hidden state of the encoder at time t,
Figure 455318DEST_PATH_IMAGE116
indicating attention weight of decoder at time t
Figure 442865DEST_PATH_IMAGE116
Performing SoftMax function scalingStandardized processing to obtain
Figure 667173DEST_PATH_IMAGE117
Attention weights that measure the importance of the ith input feature at time t,
Figure 790987DEST_PATH_IMAGE118
is a context vector.
6. Decoder
When obtaining the context vector at the time t
Figure 809759DEST_PATH_IMAGE118
We combine them with the target time series and update the new hidden states of the decoder at time t
Figure 550182DEST_PATH_IMAGE119
Figure 578181DEST_PATH_IMAGE120
Figure 822080DEST_PATH_IMAGE121
And b is a parameter matrix mapping the connection to the decoder input,
Figure 277332DEST_PATH_IMAGE122
is the input to the decoder at time t-1,
Figure 973893DEST_PATH_IMAGE123
is the calculated context vector and the context vector,
Figure 602320DEST_PATH_IMAGE124
it is shown that the connection operation is performed,
Figure 966305DEST_PATH_IMAGE125
is a new input after a linear transformation,
Figure 654776DEST_PATH_IMAGE112
is the hidden state of the decoder at time t-1. We will use context vectors
Figure 776316DEST_PATH_IMAGE126
And a hidden state
Figure 208434DEST_PATH_IMAGE127
Concatenate to become the hidden state of the new decoder from which the final prediction is made:
Figure 364609DEST_PATH_IMAGE128
wherein, the matrix
Figure 958401DEST_PATH_IMAGE129
Sum vector
Figure 895133DEST_PATH_IMAGE130
Mapping connections
Figure 803046DEST_PATH_IMAGE131
Finally we use a linear variation (
Figure 876044DEST_PATH_IMAGE132
And
Figure 844000DEST_PATH_IMAGE133
) Generating the final chlorophyll prediction result.
7. Model validation
As shown in fig. 11, after the prediction result is obtained, the loss value between the prediction result and the true value of the interpolated multi-channel data set is calculated by using the mean square error, and the network parameter of the model is adjusted to obtain the final chlorophyll prediction result.
8. Missing value filling
And filling the final chlorophyll prediction result into a missing value unit of the multi-channel data set to obtain a filling result.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A multi-channel ocean observation time sequence scalar data missing value prediction method is characterized by comprising the following steps:
obtaining ocean observation time sequence scalar data with ocean deficiency values;
obtaining a marine missing value prediction result by adopting a TA-RNN model based on the marine observation time sequence scalar data;
the TA-RNN model comprises a convolution attention module, a space attention module and a time attention module, wherein the convolution attention module is used for refining the ocean observation time series scalar data; the space attention module is used for capturing dynamic space correlation of the refined ocean observation time sequence scalar data; the temporal attention module is configured to capture dynamic temporal correlations between different time intervals in the spatial attention module output data.
2. The multi-channel ocean observation time series scalar data missing value prediction method of claim 1, comprising, after the obtaining the ocean observation time series scalar data with ocean missing values: and preprocessing the ocean observation time sequence scalar data with the ocean deficiency value to obtain an initial sequence.
3. The method of multi-channel ocean observation time series scalar data missing value prediction according to claim 1, comprising, before the employing the TA-RNN model: and if the marine observation time sequence scalar data with the marine deficiency value is a chlorophyll sequence, selecting a depth sequence, a wind speed sequence, an oxygen content sequence, a dissolved oxygen sequence, a turbidity sequence, a temperature sequence and a salt sequence, and constructing a multichannel sequence according to the depth sequence, the wind speed sequence, the oxygen content sequence, the dissolved oxygen sequence, the turbidity sequence, the temperature sequence, the salt sequence and the chlorophyll sequence.
4. The multi-channel ocean observation time series scalar data missing value prediction method according to claim 3, characterized in that a convolution attention module is adopted to obtain a channel attention map and a space attention map according to the multi-channel sequence; multiplying the sequence of the channel attention mapping of the multichannel sequence by the multichannel sequence element by element to obtain an initial refined sequence; and multiplying the initial refined sequence by the spatial attention mapping sequence of the initial refined sequence element by element to obtain the final refined sequence.
5. The method for predicting missing values of multi-channel ocean observation time series scalar data according to claim 4, wherein based on the final refined sequence, a spatial attention module is adopted to capture dynamic spatial correlation between different input features in the final refined sequence to obtain an input sequence.
6. The method of claim 5, wherein the encoder is adapted to learn the mapping from the input sequence to the hidden state of the encoder at time t based on the input sequence to obtain the hidden state of the encoder at time t.
7. The method for predicting missing values of multi-channel ocean observation time sequence scalar data according to claim 6, characterized in that according to the hidden state of the encoder at the time t, a time attention module is adopted to capture the dynamic time correlation among different time intervals in the sequence of the hidden state of the encoder at the time t; the specific process adopting the time attention module comprises the following steps:
determining attention weight of each input feature at the time t according to the hidden state of the encoder at the time t and the hidden state of the decoder at the time t-1; determining an attention weight of a certain input feature to a predicted value at time t based on the attention weight of each input feature at time t; and obtaining a weighted sum of all encoder hidden states, namely a context vector, based on the attention weights of all input features to the predicted values at the time t and the hidden states of the encoder at the time t.
8. The method of claim 7, wherein the context vector at time t is determined and the target sequence at time t-1 is combined to update the decoder's hidden state at time t.
9. The method of claim 8, wherein the context vector at time T is concatenated with the updated decoder's hidden state at time T to form a new decoder's hidden state, and a missing chlorophyll sequence is predicted.
10. A multi-channel ocean observation time series scalar data missing value prediction system, comprising:
a data acquisition module configured to: obtaining ocean observation time sequence scalar data with ocean deficiency values;
a prediction module configured to: obtaining a marine missing value prediction result by adopting a TA-RNN model based on the marine observation time sequence scalar data;
the TA-RNN model comprises a convolution attention module, a space attention module and a time attention module, wherein the convolution attention module is used for refining the ocean observation time series scalar data; the space attention module is used for capturing dynamic space correlation of the refined ocean observation time sequence scalar data; the temporal attention module is configured to capture dynamic temporal correlations between different time intervals in the spatial attention module output data.
CN202210285171.8A 2022-03-23 2022-03-23 Multi-channel ocean observation time sequence scalar data missing value prediction method and system Active CN114385619B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210285171.8A CN114385619B (en) 2022-03-23 2022-03-23 Multi-channel ocean observation time sequence scalar data missing value prediction method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210285171.8A CN114385619B (en) 2022-03-23 2022-03-23 Multi-channel ocean observation time sequence scalar data missing value prediction method and system

Publications (2)

Publication Number Publication Date
CN114385619A true CN114385619A (en) 2022-04-22
CN114385619B CN114385619B (en) 2022-07-15

Family

ID=81205385

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210285171.8A Active CN114385619B (en) 2022-03-23 2022-03-23 Multi-channel ocean observation time sequence scalar data missing value prediction method and system

Country Status (1)

Country Link
CN (1) CN114385619B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115935139A (en) * 2023-01-09 2023-04-07 吉林大学 Space field interpolation method for ocean observation data

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107577649A (en) * 2017-09-26 2018-01-12 广州供电局有限公司 The interpolation processing method and device of missing data
US20200089650A1 (en) * 2018-09-14 2020-03-19 Software Ag Techniques for automated data cleansing for machine learning algorithms
CN111339234A (en) * 2020-02-26 2020-06-26 上海澜启信息科技有限公司 Method, device and storage medium for repairing missing positioning data samples
US20200349169A1 (en) * 2019-05-03 2020-11-05 Accenture Global Solutions Limited Artificial intelligence (ai) based automatic data remediation
CN112232600A (en) * 2020-11-17 2021-01-15 烟台海颐软件股份有限公司 Short-term power load prediction method and system based on variational modal decomposition
CN112527788A (en) * 2020-12-17 2021-03-19 北京中恒博瑞数字电力科技有限公司 Method and device for detecting and cleaning abnormal value of transformer monitoring data
CN112685950A (en) * 2020-12-02 2021-04-20 山东省计算中心(国家超级计算济南中心) Method, system and equipment for detecting abnormality of ocean time sequence observation data
CN113094357A (en) * 2021-04-23 2021-07-09 大连理工大学 Traffic missing data completion method based on space-time attention mechanism
US11223543B1 (en) * 2020-09-29 2022-01-11 Dell Products L.P. Reconstructing time series datasets with missing values utilizing machine learning
CN113962432A (en) * 2021-09-10 2022-01-21 国网江苏省电力有限公司电力科学研究院 Wind power prediction method and system integrating three-dimensional convolution and light-weight convolution threshold unit
CN113988951A (en) * 2021-11-19 2022-01-28 中核第四研究设计工程有限公司 Commodity recommendation learning model construction method based on tensor decomposition and collaborative filtering

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107577649A (en) * 2017-09-26 2018-01-12 广州供电局有限公司 The interpolation processing method and device of missing data
US20200089650A1 (en) * 2018-09-14 2020-03-19 Software Ag Techniques for automated data cleansing for machine learning algorithms
US20200349169A1 (en) * 2019-05-03 2020-11-05 Accenture Global Solutions Limited Artificial intelligence (ai) based automatic data remediation
CN111339234A (en) * 2020-02-26 2020-06-26 上海澜启信息科技有限公司 Method, device and storage medium for repairing missing positioning data samples
US11223543B1 (en) * 2020-09-29 2022-01-11 Dell Products L.P. Reconstructing time series datasets with missing values utilizing machine learning
CN112232600A (en) * 2020-11-17 2021-01-15 烟台海颐软件股份有限公司 Short-term power load prediction method and system based on variational modal decomposition
CN112685950A (en) * 2020-12-02 2021-04-20 山东省计算中心(国家超级计算济南中心) Method, system and equipment for detecting abnormality of ocean time sequence observation data
CN112527788A (en) * 2020-12-17 2021-03-19 北京中恒博瑞数字电力科技有限公司 Method and device for detecting and cleaning abnormal value of transformer monitoring data
CN113094357A (en) * 2021-04-23 2021-07-09 大连理工大学 Traffic missing data completion method based on space-time attention mechanism
CN113962432A (en) * 2021-09-10 2022-01-21 国网江苏省电力有限公司电力科学研究院 Wind power prediction method and system integrating three-dimensional convolution and light-weight convolution threshold unit
CN113988951A (en) * 2021-11-19 2022-01-28 中核第四研究设计工程有限公司 Commodity recommendation learning model construction method based on tensor decomposition and collaborative filtering

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIANG LI等: ""ECNN: One Online Deep Learning Model for Streaming Ocean Data Prediction"", 《ACM ICEA "21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND ITS EMERGING APPLICATIONS》, 31 December 2021 (2021-12-31), pages 170 - 175, XP058872467, DOI: 10.1145/3491396.3506519 *
孙晓丽 等: ""基于改进神经过程的缺失数据填充算法"", 《中国科学院大学学报》, 15 March 2021 (2021-03-15), pages 280 - 287 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115935139A (en) * 2023-01-09 2023-04-07 吉林大学 Space field interpolation method for ocean observation data

Also Published As

Publication number Publication date
CN114385619B (en) 2022-07-15

Similar Documents

Publication Publication Date Title
CN112347859B (en) Method for detecting significance target of optical remote sensing image
CN111428789A (en) Network traffic anomaly detection method based on deep learning
CN110427654B (en) Landslide prediction model construction method and system based on sensitive state
CN110298387A (en) Incorporate the deep neural network object detection method of Pixel-level attention mechanism
CN113128558B (en) Target detection method based on shallow space feature fusion and adaptive channel screening
CN110210513B (en) Data classification method and device and terminal equipment
CN115885289A (en) Modeling dependency with global self-attention neural networks
CN113627093B (en) Underwater mechanism trans-scale flow field characteristic prediction method based on improved Unet network
CN110765854A (en) Video motion recognition method
CN115390164B (en) Radar echo extrapolation forecasting method and system
CN110930378B (en) Emphysema image processing method and system based on low data demand
CN112347531B (en) Brittle marble Dan Sanwei crack propagation path prediction method and system
CN109272497A (en) Method for detecting surface defects of products, device and computer equipment
CN114385619B (en) Multi-channel ocean observation time sequence scalar data missing value prediction method and system
CN112949944A (en) Underground water level intelligent prediction method and system based on space-time characteristics
CN116861262A (en) Perception model training method and device, electronic equipment and storage medium
CN115953902B (en) Traffic flow prediction method based on multi-view space-time diagram convolutional network
CN117197632A (en) Transformer-based electron microscope pollen image target detection method
CN115438575B (en) Analysis method for high-precision airfoil flow field prediction
CN114239934A (en) Power grid load sequence prediction method and system based on generative countermeasure network
CN111382761B (en) CNN-based detector, image detection method and terminal
Wang et al. Research on Semantic Segmentation Algorithm for Multiscale Feature Images Based on Improved DeepLab v3+
CN118395385B (en) Ocean environment prediction method and system based on remote sensing and hydrologic sampling data fusion
CN118378178B (en) Transformer fault identification method and system based on residual map convolution neural network
CN116393188B (en) Microfluidic chip and method suitable for capturing circulating tumor cells

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant