CN112949896A - Time sequence prediction method based on fusion sequence decomposition and space-time convolution - Google Patents
- Publication number
- CN112949896A (application CN202011525943.8A)
- Authority
- CN
- China
- Prior art keywords
- sequence
- time
- dimensional
- decomposition
- block
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
- G06N3/045—Combinations of networks
- G06N3/048—Activation functions
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
- G06N3/084—Backpropagation, e.g. using gradient descent
Abstract
The invention provides a time sequence prediction method based on fused sequence decomposition and space-time convolution, which predicts through a neural network model (SDBRNN, Series-Decomposition-Block RNN) fusing a sequence decomposition strategy with three-dimensional time sequence convolution. Aiming at the uncertain-period problem of the original STL decomposition, an improved algorithm based on trend-seasonal decomposition is provided. When the time sequence block is constructed, the components are combined according to their correlation, so that the three-dimensional convolution module can better extract spatio-temporal features. Meanwhile, the three-dimensional convolution block is a module embedded in the LSTM cell that replaces the gate-update operation of the LSTM, so that the LSTM can learn the spatio-temporal characteristics of the data. Experiments prove the rationality of the time sequence block structure and the effectiveness of the model.
Description
Technical Field
The invention belongs to the field of data analysis, and particularly relates to a time sequence prediction algorithm based on fusion sequence decomposition and space-time convolution.
Background Art
Time sequence data describe how the characteristics of an object change over time; studying time series helps people understand an object's historical development pattern and predict its future trend, for example stock-price fluctuation, road traffic-flow prediction and user-behaviour analysis. The essential characteristics of a sequence can be portrayed by its mean, variance, covariance and so on, dividing sequences into stationary and non-stationary time series. For the traditional time-series prediction problem, many machine-learning methods such as support vector machines and decision-tree algorithms have appeared. Meanwhile, with the rapid development of deep learning, more and more researchers apply neural networks to time-series prediction. For example, the Recurrent Neural Network (RNN) introduces the temporal characteristics of the sequence into the network and makes predictions on detrended time-series data. Many neural-network variants based on the RNN, such as the Long Short-Term Memory network (LSTM), not only solve the vanishing-gradient problem of the RNN but also use a forget-gate mechanism to give the sequence a memory of long-term dependencies. In addition, convolutional networks have also been applied to time-series prediction: although, limited by the convolution-kernel size, a convolutional network cannot capture temporal characteristics as well as an RNN, with reasonable structural design its prediction effect is sometimes better; for example, the Temporal Convolutional Network (TCN) not only trains faster than the LSTM but also captures local temporal features more strongly.
Existing temporal-convolution prediction methods usually predict each observation point separately. Although effective, this ignores the relations among observation points: for multi-point time series with strong correlation, spatial dependence exists.
Disclosure of Invention
The invention aims to provide a novel time sequence prediction method using a neural network model (SDBRNN, Series-Decomposition-Block RNN) that fuses a sequence decomposition strategy with three-dimensional time sequence convolution.
Therefore, the invention adopts the following technical scheme:
A time sequence prediction method based on fused sequence decomposition and space-time convolution, characterized in that, based on an STL decomposition strategy, one-dimensional time sequence data are decomposed and spliced into a three-dimensional data block with temporal and spatial dimensions; the high-dimensional spatio-temporal features of the three-dimensional data block are extracted with a three-dimensional convolution model; and finally back-propagation learning is performed on the high-dimensional spatio-temporal features with an LSTM sequence module.
The method carries out time sequence prediction through a sequence Decomposition Block Recurrent Neural Network model SDBRNN (Series-Decomposition-Block Recurrent Neural Network); the SDBRNN model comprises two modules, namely a time sequence Block construction module and a space-time fusion prediction module based on a decomposition strategy, wherein the space-time fusion prediction module comprises a three-dimensional convolution module and a Block-LSTM module.
Further, the sequential block construction is divided into an STL decomposition stage and a component reassembly stage.
Further, in the STL decomposition stage, the period n(p) of the original one-dimensional input sequence is determined first.
Further, when determining the period n(p), candidate periods n(P)_i (i = 1, 2, …) are first input and the original sequence is STL-decomposed under each period, giving the different decomposition results of formula (1):
O_v = T_v^(i) + S_v^(i) + R_v^(i),  v = 1, 2, …, N  (1)
For each decomposition result, the DTW distance d_i to the original sequence O_v is computed; the period parameter n(P)_k (k ∈ {1, 2, …}) with the minimum d_i is taken as the STL decomposition parameter, obtaining the trend component T_v, seasonal component S_v and residual component R_v of the original sequence, expressed as:
O_v = T_v + S_v + R_v,  v = 1, 2, …, N  (2).
Further, the specific operation of the STL decomposition comprises the following six steps:
Step 1, detrending: remove the trend from the original sequence by subtracting the trend estimate of the previous iteration, as shown in formula (3):
D_v = O_v − T_v^(k)  (3)
Step 2, cycle-subseries smoothing: apply a LOESS smoothing regression to each periodic subsequence of D_v, extending the sequence by one period n(p) at each end; the resulting sequence of length N + 2n(p) is recorded as C_v^(k+1), where v = 1 − n(p), …, N + n(p);
Step 3, low-pass filtering of the periodic subsequences: apply to C_v^(k+1) moving averages of widths n(p), n(p) and 3 in succession, then a further LOESS regression, obtaining a sequence L_v^(k+1) of length N, i.e. the low-pass trend of the periodic subsequences;
Step 4, detrending of the smoothed cycle-subseries: the seasonal component S_v^(k+1) is the difference of C_v^(k+1) and L_v^(k+1), as shown in formula (4):
S_v^(k+1) = C_v^(k+1) − L_v^(k+1)  (4)
Step 5, deseasonalizing and trend smoothing: subtract the seasonal component S_v^(k+1) from the original sequence O_v, then perform a LOESS regression with smoothing parameter n(t) to obtain the trend component T_v^(k+1), as shown in formula (5):
T_v^(k+1) = LOESS_n(t)(O_v − S_v^(k+1))  (5)
Step 6, subtract the components T_v and S_v obtained in the steps above from the original sequence O_v to obtain the residual component, as shown in formula (6):
R_v = O_v − T_v − S_v  (6)
Further, the STL decomposition yields four equal-length sequences, the three components and the original sequence itself, from which a two-dimensional matrix M_N×h = [O_v, T_v, S_v, R_v] is constructed, where N is the single-sequence length and h = 4 is the number of components;
and the two-dimensional matrices of different observation points are spliced to obtain the three-dimensional time sequence block with temporal and spatial characteristics.
Furthermore, in the space-time fusion prediction module, the three-dimensional convolution module extracts high-dimensional characteristics by performing three-dimensional convolution operation on the time sequence block, and replaces the gate updating operation of the LSTM; the Block-LSTM module takes the output of the three-dimensional convolution module as the input of the LSTM module, and at the moment, the cell data of the Block-LSTM has both temporal and spatial characteristics.
Further, the core of the LSTM module is the cell state C_t transmitted between LSTM cells, and C_t is regulated by three gating mechanisms: the forget gate f_t, the input gate I_t and the output gate O_t. The forget gate f_t applies a sigmoid activation to the product of the weight matrix W_f and the concatenation of the previous hidden-layer information h_{t−1} and the current input x_t, determining whether the cell retains or discards the previous state information. The input gate I_t first applies a sigmoid activation to the product of the weight matrix W_i and [h_{t−1}, x_t] to determine the information to update, while a tanh activation of the product of the weight matrix W_C and [h_{t−1}, x_t] yields the candidate cell information C̃_t; the new cell state C_t is the sum of the product of f_t and C_{t−1} and the product of I_t and C̃_t. The output gate O_t applies a sigmoid activation to the product of W_o and [h_{t−1}, x_t], and the current hidden-layer state h_t is the product of O_t and tanh(C_t);
the LSTM module propagates the error term back to each earlier step through the hidden state h_t, the cell state C_t and the gradient δ_t; according to the back-propagation principle, the gradients of the forget-gate f_t, input-gate I_t and output-gate O_t parameters W_f, W_I and W_o are derived, with the computation propagated as shown in formula (7):
Further, the three-dimensional convolution kernel slides over the time sequence block and performs convolution with its elements, producing a high-order three-dimensional feature block; this feature block is evenly divided into four smaller feature blocks serving as the forget gate f_t, input gate I_t, output gate O_t and candidate cell C̃_t of the LSTM cell. Each gating mechanism in each Block-LSTM cell thus carries the timing characteristics of the data block: the input gate I_t is spliced with the previous hidden-layer information h_{t−1} and activated with a sigmoid; the forget gate f_t is spliced with h_{t−1} and activated with a sigmoid; the candidate cell C̃_t is spliced with h_{t−1} and activated with tanh; and the new cell state C_t is the product of C̃_t and I_t plus the product of f_t and the previous cell state C_{t−1}. The cell operation of the SDBRNN is shown in formula (8), where * denotes the convolution operation.
The method carries out time sequence prediction through the sequence decomposition block recurrent neural network model SDBRNN. It is suited to data sets whose observation points are logically related and strongly correlated in reality; richer features can be extracted through simultaneous spatio-temporal convolution, improving the prediction effect on the sequence.
Drawings
FIG. 1 is a schematic diagram of a network structure of a recurrent neural network model of the present invention.
FIG. 2-1 is an exploded view of the STL structure of the timing block of the present invention, and FIG. 2-2 is a schematic view of the timing block structure of the present invention.
FIG. 3 is a schematic diagram of the structure of the SDBRNN cell of the present invention.
Detailed Description
The invention provides a time sequence prediction method based on fused sequence decomposition and space-time convolution: first, based on an STL decomposition strategy, one-dimensional time sequence data are decomposed and spliced into a three-dimensional data block with temporal and spatial dimensions; next, the high-dimensional spatio-temporal features of the three-dimensional data block are extracted with a three-dimensional convolution model; finally, back-propagation learning is performed on the high-dimensional spatio-temporal features with an LSTM sequence module.
Aiming at the problem of uncertain period in the original STL decomposition, the invention provides an improved algorithm based on trend seasonal decomposition; when the time sequence block is constructed, the components are combined according to the correlation, so that the three-dimensional convolution module can better extract the space-time characteristics; meanwhile, the three-dimensional convolution module is embedded in the LSTM cell and replaces the gate updating operation of the LSTM, so that the LSTM can learn the space-time characteristics of data. Experiments prove the rationality of the time sequence block structure and the effectiveness of the model.
As shown in FIG. 1, the invention provides a sequence Decomposition Block Recurrent Neural Network model (SDBRNN) by utilizing the correlation of space-time dimensions and the advantage of feature enhancement brought by multi-dimensional fusion. The SDBRNN model can be mainly divided into two modules, namely a time sequence Block construction module and a space-time fusion prediction module based on a decomposition strategy, wherein the space-time fusion prediction module comprises a three-dimensional convolution module and a Block-LSTM module.
The time sequence block construction decomposes and recombines the features within a time sequence and splices the different sequences in the order of their observation points, forming a three-dimensional data time sequence block with temporal characteristics, as follows:
as shown in fig. 2-1 and 2-2, the sequential block construction can be divided into an STL decomposition stage and a component reassembly stage. In the STL decomposition stage, the period n (p) of the original one-dimensional input sequence needs to be determined first. Sequence period is often takenDepending on the data itself, if the sampling interval of the traffic speed data set is 5 minutes, there are 288 sample points in a day, and the subjective cycle is one day, n (p) is 288, and the longer cycle may be one week or even one month. How to select the period not only affects the sequence decomposition effect, but also affects the final prediction accuracy. In order to optimize the value of the period n (p), firstly, inputting an alternative period n (P)i(i ═ 1,2, …), the original sequence was subjected to STL decomposition in different periods, and the obtained different decomposition results were as shown in formula (1).
For each decomposition result, the DTW distance d_i to the original sequence O_v is computed; the period parameter n(P)_k (k ∈ {1, 2, …}) with the minimum d_i is taken as the STL decomposition parameter, obtaining the trend component T_v, seasonal component S_v and residual component R_v of the original sequence, which can be expressed as:
O_v = T_v + S_v + R_v,  v = 1, 2, …, N  (2)
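The period-selection procedure described above can be sketched in a few lines. This is an illustrative sketch only, not the patented implementation: `dtw_distance` is a textbook dynamic-programming DTW, and `simple_decompose` is a crude moving-average stand-in for STL (both function names are ours).

```python
import numpy as np

def dtw_distance(a, b):
    """Classic O(len(a)*len(b)) dynamic-time-warping distance."""
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]

def simple_decompose(o, period):
    """Stand-in for STL: moving-average trend + per-phase seasonal means."""
    kernel = np.ones(period) / period
    trend = np.convolve(o, kernel, mode="same")   # centred moving average
    detrended = o - trend
    phase_means = np.array([detrended[i::period].mean() for i in range(period)])
    seasonal = np.resize(phase_means, len(o))     # tile over the series
    residual = o - trend - seasonal
    return trend, seasonal, residual

def select_period(o, candidates):
    """Pick the candidate period whose trend+seasonal reconstruction
    is closest (in DTW distance) to the original series."""
    best, best_d = None, np.inf
    for p in candidates:
        t, s, _ = simple_decompose(o, p)
        d = dtw_distance(o, t + s)
        if d < best_d:
            best, best_d = p, d
    return best
```

In practice a real STL implementation (e.g. `statsmodels.tsa.seasonal.STL`) would replace `simple_decompose`.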
The specific operation of the STL decomposition comprises the following six steps:
Step 1, detrending: remove the trend from the original sequence by subtracting the trend estimate of the previous iteration, as shown in formula (3):
D_v = O_v − T_v^(k)  (3)
Step 2, cycle-subseries smoothing: apply a LOESS smoothing regression to each periodic subsequence of D_v, extending the sequence by one period n(p) at each end; the resulting sequence of length N + 2n(p) is recorded as C_v^(k+1), where v = 1 − n(p), …, N + n(p);
Step 3, low-pass filtering of the periodic subsequences: apply to C_v^(k+1) moving averages of widths n(p), n(p) and 3 in succession, then a further LOESS regression, obtaining a sequence L_v^(k+1) of length N, i.e. the low-pass trend of the periodic subsequences;
Step 4, detrending of the smoothed cycle-subseries: the seasonal component S_v^(k+1) is the difference of C_v^(k+1) and L_v^(k+1), as shown in formula (4):
S_v^(k+1) = C_v^(k+1) − L_v^(k+1)  (4)
Step 5, deseasonalizing and trend smoothing: subtract the seasonal component S_v^(k+1) from the original sequence O_v, then perform a LOESS regression with smoothing parameter n(t) to obtain the trend component T_v^(k+1), as shown in formula (5):
T_v^(k+1) = LOESS_n(t)(O_v − S_v^(k+1))  (5)
Step 6, subtract the components T_v and S_v obtained in the steps above from the original sequence O_v to obtain the residual component, as shown in formula (6):
R_v = O_v − T_v − S_v  (6)
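One inner-loop pass over the six steps can be sketched as follows, under the simplifying assumption (ours, not the patent's) that every LOESS regression is replaced by a plain moving average; function and parameter names are likewise illustrative.

```python
import numpy as np

def moving_average(x, w):
    """Length-preserving moving average; a crude stand-in for the LOESS
    smoother used by the real STL."""
    return np.convolve(x, np.ones(w) / w, mode="same")

def stl_inner_pass(o, period, trend=None, n_l=None, n_t=None):
    """One pass over the six steps; `trend` is the estimate from the
    previous iteration (zero on the first pass). Assumes len(o) is a
    multiple of `period` to keep the subseries bookkeeping simple."""
    n = len(o)
    assert n % period == 0
    n_l = n_l or period            # low-pass smoothing width n(l)
    n_t = n_t or period + 1        # trend smoothing width n(t)
    if trend is None:
        trend = np.zeros(n)

    # Step 1: detrend, D_v = O_v - T_v
    d = o - trend

    # Step 2: smooth each cycle-subseries, extending one period each side
    c = np.empty(n + 2 * period)
    for phase in range(period):
        sub = moving_average(d[phase::period], 3)
        c[phase::period] = np.concatenate(([sub[0]], sub, [sub[-1]]))

    # Step 3: low-pass filter C: moving averages n(p), n(p), 3, then smooth
    low = moving_average(moving_average(moving_average(c, period), period), 3)
    low = moving_average(low, n_l)[period:period + n]   # back to length N

    # Step 4: seasonal component S_v = C_v - L_v
    s = c[period:period + n] - low

    # Step 5: deseasonalise, then smooth to get the trend T_v
    t = moving_average(o - s, n_t)

    # Step 6: residual R_v = O_v - T_v - S_v
    return t, s, o - t - s
```

The real STL iterates this pass, feeding each trend estimate back into step 1 until convergence.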
Different from existing methods that predict and learn each component independently, in the component recombination stage the model constructs, from the three STL components and the original sequence itself, four sequences of equal length, a two-dimensional matrix M_N×h = [O_v, T_v, S_v, R_v], where N is the single-sequence length and h = 4 is the number of components.
Clearly, the order in which the 4 components are combined affects the resulting two-dimensional matrix: by the nature of the convolution operation, for the matrix M_N×h given above, the middle row vectors T_v and S_v participate in more convolution computations than the edge row vectors O_v and R_v. The prediction effect of the different orderings is demonstrated in the experimental section herein. This yields the two-dimensional matrix of a single observation point after STL decomposition and component recombination; finally, the two-dimensional matrices of different observation points are spliced to obtain the three-dimensional time sequence block with temporal and spatial characteristics.
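The component recombination and splicing just described can be sketched as follows; `decompose` is again a toy moving-average stand-in for STL, and all names are illustrative rather than taken from the patent.

```python
import numpy as np

def decompose(o, period):
    """Toy stand-in for STL: moving-average trend plus phase-mean
    seasonal component; residual is whatever remains."""
    trend = np.convolve(o, np.ones(period) / period, mode="same")
    detr = o - trend
    seasonal = np.resize(np.array([detr[i::period].mean()
                                   for i in range(period)]), len(o))
    return trend, seasonal, o - trend - seasonal

def build_time_block(series_by_point, period):
    """For each observation point, build the 4 x N matrix [O, T, S, R]
    (the correlated rows T and S sit in the middle, as described above),
    then stack the matrices over points into a 3-D time sequence block
    of shape (num_points, 4, N)."""
    return np.stack([np.stack([o, *decompose(o, period)])
                     for o in series_by_point])
```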
The spatio-temporal fusion prediction module is described below, including a three-dimensional convolution module and an LSTM module. 1) A three-dimensional convolution module: the high-dimensional characteristics are extracted by performing three-dimensional convolution operation on the time sequence block, and the gate updating operation of the LSTM is replaced; 2) LSTM module: the output of the three-dimensional convolution module is used as the input of the LSTM module, and at the moment, the cell data of the LSTM has both time and space characteristics.
The core of the LSTM is the cell state C_t transmitted between LSTM cells, and C_t is regulated by three gating mechanisms: the forget gate f_t, the input gate I_t and the output gate O_t. The forget gate f_t applies a sigmoid activation to the product of the weight matrix W_f and the concatenation of the previous hidden-layer information h_{t−1} and the current input x_t, determining whether the cell retains or discards the previous state information. The input gate I_t first applies a sigmoid activation to the product of the weight matrix W_i and [h_{t−1}, x_t] to determine the information to update, while a tanh activation of the product of the weight matrix W_C and [h_{t−1}, x_t] yields the candidate cell information C̃_t; the new cell state C_t is the sum of the product of f_t and C_{t−1} and the product of I_t and C̃_t. The output gate O_t applies a sigmoid activation to the product of W_o and [h_{t−1}, x_t], and the current hidden-layer state h_t is the product of O_t and tanh(C_t).
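The gate equations above correspond to the following minimal numpy sketch of a single LSTM step. The weight layout and names are ours (real implementations typically fuse the four matrices into one).

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_cell(x_t, h_prev, c_prev, W, b):
    """One standard LSTM step following the gate equations above.
    W["f"], W["i"], W["c"], W["o"] act on the concatenation [h_prev, x_t]."""
    z = np.concatenate([h_prev, x_t])
    f_t = sigmoid(W["f"] @ z + b["f"])        # forget gate f_t
    i_t = sigmoid(W["i"] @ z + b["i"])        # input gate I_t
    c_hat = np.tanh(W["c"] @ z + b["c"])      # candidate cell ~C_t
    o_t = sigmoid(W["o"] @ z + b["o"])        # output gate O_t
    c_t = f_t * c_prev + i_t * c_hat          # C_t = f_t*C_{t-1} + I_t*~C_t
    h_t = o_t * np.tanh(c_t)                  # h_t = O_t * tanh(C_t)
    return h_t, c_t
```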
The LSTM propagates the error term back to each earlier step through the hidden state h_t, the cell state C_t and the gradient δ_t; the key is that the parameter gradients are computed from the partial derivatives of the loss function. According to the back-propagation principle, the gradients of the forget-gate f_t, input-gate I_t and output-gate O_t parameters W_f, W_I and W_o are derived; the computation propagates as shown in formula (7).
However, a plain LSTM cannot exploit the rich spatial relationships that exist in multi-observation-point time series. The spatio-temporal fusion convolutional network proposed by the invention combines the spatial extraction capability of the convolutional layer with the temporal extraction capability of the LSTM, and is called Block-LSTM. Unlike a simple convolutional-layer + LSTM network, the three-dimensional convolution of the Block-LSTM is realised at the bottom layer inside the LSTM cell and replaces the gate-update and cell-update operations of the LSTM.
As shown in the three-dimensional convolution module of fig. 3, as the three-dimensional convolution kernel slides over the time sequence block and performs convolution with its elements, a high-order three-dimensional feature block is obtained; this feature block is evenly divided into four smaller feature blocks serving as the forget gate f_t, input gate I_t, output gate O_t and candidate cell C̃_t of the LSTM cell. Each gating mechanism in each Block-LSTM cell thus carries the timing characteristics of the data block. The input gate I_t is spliced with the previous hidden-layer information h_{t−1} and activated with a sigmoid; the forget gate f_t is spliced with h_{t−1} and activated with a sigmoid; the candidate cell C̃_t is spliced with h_{t−1} and activated with tanh; and the new cell state C_t is the product of C̃_t and I_t plus the product of f_t and the previous cell state C_{t−1}. The cell operation of the SDBRNN is shown in formula (8), where * denotes the convolution operation.
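A minimal sketch of such a Block-LSTM step, under two simplifying assumptions that are ours and not the patent's: one small kernel per gate (equivalent in spirit to splitting one large feature block into four) and the splice with h_{t−1} modelled as elementwise addition.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def conv3d(x, k):
    """Naive 'valid' 3-D convolution (cross-correlation, deep-learning
    convention) of a block x with a kernel k."""
    ox, oy, oz = (np.array(x.shape) - np.array(k.shape)) + 1
    out = np.empty((ox, oy, oz))
    for i in range(ox):
        for j in range(oy):
            for l in range(oz):
                out[i, j, l] = np.sum(
                    x[i:i + k.shape[0], j:j + k.shape[1], l:l + k.shape[2]] * k)
    return out

def block_lstm_cell(block, h_prev, c_prev, kernels):
    """One Block-LSTM step: a 3-D convolution per gate replaces the dense
    gate updates; each gate feature is combined with h_prev before the
    usual LSTM activations and state update."""
    f_t = sigmoid(conv3d(block, kernels["f"]) + h_prev)    # forget gate
    i_t = sigmoid(conv3d(block, kernels["i"]) + h_prev)    # input gate
    c_hat = np.tanh(conv3d(block, kernels["c"]) + h_prev)  # candidate cell
    o_t = sigmoid(conv3d(block, kernels["o"]) + h_prev)    # output gate
    c_t = f_t * c_prev + i_t * c_hat
    h_t = o_t * np.tanh(c_t)
    return h_t, c_t
```

With a (points, components, time) block and a 2 × 2 × 2 kernel, the hidden and cell states take the valid-convolution output shape.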
By comparison, it can be found that the gate mechanism of Block-LSTM cells works in a manner similar to LSTM, with the most important change being the replacement of state updates between cells by three-dimensional convolution operations. The Block-LSTM can not only extract spatial features by utilizing the correlation among multiple sequences like a CNN network, but also establish the time sequence relation among the sequences by utilizing the LSTM, thereby achieving the space-time prediction.
The SDBRNN model carries out prediction verification on 3 data sets, including two traffic flow data sets PeMS-Bay and Seattle and a power data set Solar-Energy. The three data sets are all multi-observation point time series data sets. The comparison method and parameter settings were as follows:
1) LSTM: a special RNN, the long short-term memory network with a gate mechanism, which works well on long-term dependence problems; it serves as a contrast to the STL decomposition module and three-dimensional convolution module of the model herein. In the comparative experiment, the number of LSTM layers was 2 and the hidden-layer sizes were [16, 32].
2) GRU: a special RNN with a simpler mechanism than the LSTM; it also handles long-term problems well and the model is easier to train. In the comparative experiment, the number of GRU layers was 2 and the hidden-layer sizes were [16, 32].
3) WaveNet: a sequence-generation model with a wider temporal receptive field than an ordinary CNN structure, effective for speech generation and text prediction. In the comparative experiment, the number of WaveNet layers was 4, the convolution kernel size 2, the residual-channel coefficient 32 and the skip-channel coefficient 128.
4) TCN: the temporal convolutional network. Dilated convolution gives it a more flexible receptive field, and its residual structure keeps its gradients more stable during training. In the comparative experiment, the number of TCN layers was 2, the convolution kernel size 2 and the dilation coefficient 2.
5) ConvLSTM: a variant LSTM that extracts spatial features with two-dimensional convolution; it extracts spatial features well and contrasts with the three-dimensional convolution module of the model herein. In the comparative experiment, the number of ConvLSTM layers was 2, the convolution kernel size 2 × 2 and the hidden-layer sizes [16, 32].
The comparison parameters are 3 conventional evaluation indexes, namely RMSE, MAE and MAPE.
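The three evaluation indexes can be computed as follows (standard definitions; MAPE is reported in percent, as in the experiments).

```python
import numpy as np

def rmse(y, y_hat):
    """Root mean squared error."""
    return float(np.sqrt(np.mean((y - y_hat) ** 2)))

def mae(y, y_hat):
    """Mean absolute error."""
    return float(np.mean(np.abs(y - y_hat)))

def mape(y, y_hat):
    """Mean absolute percentage error, in percent; assumes y contains
    no zeros."""
    return float(np.mean(np.abs((y - y_hat) / y)) * 100.0)
```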
TABLE 1 results of the experimental predictions
As the table shows, on the PeMS-Bay and Seattle traffic data sets the TCN model performs best in short-term prediction with a MAPE of 2.94%, against 5.50% for the SDBRNN model; for medium- and long-term prediction the SDBRNN proposed herein performs best, predicting MAPE values of 3.47% and 2.54% at 6 and 12 time slices, about 1 and 2 percentage points better than TCN respectively. This indicates that extracting the different components of the sequence with the STL decomposition module improves the long-term prediction effect but is of no obvious help for short-term prediction. This is not hard to understand: the trend component T_v and seasonal component S_v of the STL decomposition represent the overall level of the data, so the longer the sequence and the more cycles it contains, the better the decomposition works, whereas STL has no good strategy for extracting local features. The advantage of the SDBRNN is therefore more apparent at longer prediction steps when comparing the different models. In particular, comparing ConvLSTM and SDBRNN shows the SDBRNN model to be about 2 percentage points better on long-term prediction, because the convolution module of ConvLSTM only performs two-dimensional convolution, i.e. spatial feature extraction, at a single moment, while the three-dimensional convolution module of the SDBRNN extracts spatio-temporal features simultaneously; this verifies the effectiveness of the SDBRNN model for long-term time-series prediction.
Claims (10)
1. A time sequence prediction method based on fused sequence decomposition and space-time convolution, characterized in that, based on an STL decomposition strategy, one-dimensional time sequence data are decomposed and spliced into a three-dimensional data block with temporal and spatial dimensions; the high-dimensional spatio-temporal features of the three-dimensional data block are extracted with a three-dimensional convolution model; and finally back-propagation learning is performed on the high-dimensional spatio-temporal features with an LSTM sequence module.
2. The method of claim 1, wherein the method performs time sequence prediction by using a sequence-Decomposition Block Recurrent Neural Network (SDBRNN) model; the SDBRNN model comprises two modules, namely a time sequence Block construction module and a space-time fusion prediction module based on a decomposition strategy, wherein the space-time fusion prediction module comprises a three-dimensional convolution module and a Block-LSTM module.
3. The method of claim 2, wherein the time sequence block construction is divided into an STL decomposition stage and a component recombination stage.
4. A method of temporal prediction based on fused sequence decomposition with spatio-temporal convolution according to claim 3, characterized in that, in the STL decomposition stage, the period n(p) of the original one-dimensional input sequence is determined first.
5. The method as claimed in claim 4, wherein, when determining the period n(p), candidate periods n(P)_i (i = 1, 2, …) are first input and the original sequence is STL-decomposed under each period, giving the different decomposition results of formula (1):
O_v = T_v^(i) + S_v^(i) + R_v^(i),  v = 1, 2, …, N  (1)
for each decomposition result, the DTW distance d_i to the original sequence O_v is computed; the period parameter n(P)_k (k ∈ {1, 2, …}) with the minimum d_i is taken as the STL decomposition parameter, obtaining the trend component T_v, seasonal component S_v and residual component R_v of the original sequence, expressed as:
O_v = T_v + S_v + R_v,  v = 1, 2, …, N  (2).
6. the method of claim 3, wherein the STL decomposition operation comprises the following six steps:
step 1, performing trend removing processing on an original sequence, namely subtracting the trend of the sequence at the last moment, as shown in formula (3):
step 2, periodic subsequence smoothing (Cycle-subseries smoothing); to DtMaking periodic subsequence smooth regression operation, making 1 period N (p) respectively extended before and after the sequence, and recording the sequence whose length is N +2N (p)Wherein v ═ 1-N (p), …, N + N (p);
step 3, periodic subsequence Low-throughput filteringRespectively carrying out the running average operation of the parameters N (p), and 3 for three times, and carrying out LOESS regression again to obtain the sequence with the length of NI.e. a periodic subsequenceLow flux trend of (d);
step 4, smoothing the periodic subsequence trend (regression of smoothed cycles-subseries), and seasonally formingIs composed ofAnda difference of (d); as shown in formula (4):
step 5, deseasonalizing and trend smoothing: the seasonal component S_v^(k+1) is subtracted from the original sequence O_v, and a LOESS regression with smoothing coefficient n(t) is applied to obtain the trend component T_v^(k+1), as shown in formula (5):
T_v^(k+1) = LOESS_n(t)(O_v - S_v^(k+1)) (5)
step 6, the residual component is obtained by subtracting the trend component T_v and the seasonal component S_v obtained in the above steps from the original sequence O_v, as shown in formula (6):
R_v = O_v - T_v - S_v (6).
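The six steps of one inner-loop pass can be sketched as follows, with moving averages standing in for the LOESS regressions and the one-period extension of step 2 omitted for brevity (both simplifications are mine, not the patent's):

```python
import numpy as np

def moving_average(x, w):
    # Moving average standing in for a LOESS regression (simplification).
    return np.convolve(x, np.ones(w) / w, mode="same")

def stl_inner_pass(O, p, T_prev=None):
    N = len(O)
    if T_prev is None:
        T_prev = np.zeros(N)                   # zero trend on the first pass
    D = O - T_prev                             # step 1: detrend
    C = np.empty(N)
    for ph in range(p):                        # step 2: smooth each cycle-subseries
        C[ph::p] = moving_average(D[ph::p], 3)
    # step 3: low-pass filter with moving averages of length p, p and 3
    L = moving_average(moving_average(moving_average(C, p), p), 3)
    S = C - L                                  # step 4: seasonal component
    T = moving_average(O - S, max(3, p))       # step 5: deseasonalize, smooth trend
    R = O - T - S                              # step 6: residual component
    return T, S, R
```

By construction the three returned components sum back to the original sequence, matching formula (2).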
7. The method of claim 3, wherein the STL decomposition yields four sequences of equal length, namely the three components and the original sequence itself, from which a two-dimensional matrix M_(N×h) = [O_v, T_v, S_v, R_v] is constructed, where N is the single-sequence length and h = 4 is the number of components;
the two-dimensional matrices of the different observation points are then spliced to obtain a three-dimensional time sequence block with both temporal and spatial characteristics.
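The construction of the block can be sketched as follows; `decompose` stands for any function returning the three components of a one-dimensional series, and the function name is illustrative:

```python
import numpy as np

def build_block(sequences_by_point, decompose):
    # sequences_by_point: iterable of 1-D arrays, one per observation point.
    # decompose: any callable returning (T, S, R) for a 1-D series.
    mats = []
    for O in sequences_by_point:
        T, S, R = decompose(O)
        mats.append(np.stack([O, T, S, R], axis=1))  # N x 4 matrix [O, T, S, R]
    return np.stack(mats, axis=0)                    # points x N x 4 block
```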
8. The method of claim 3, wherein, in the spatio-temporal fusion prediction module, the three-dimensional convolution module extracts high-dimensional features by performing a three-dimensional convolution operation on the time sequence block in place of the gate-update operation of the LSTM; the Block-LSTM module takes the output of the three-dimensional convolution module as the input of the LSTM module, so that the cell data of the Block-LSTM carry both temporal and spatial characteristics.
9. The method of claim 8, wherein the core of the LSTM module is the cell state C_t transmitted through each LSTM cell, and the cell state C_t is controlled by three gating mechanisms: the forget gate f_t, the input gate I_t and the output gate O_t. The forget gate f_t applies a sigmoid activation to the product of the weight matrix W_f and the concatenation of the previous hidden state h_(t-1) with the current input x_t, and determines whether the cell retains or discards the previous state information. The input gate I_t first applies a sigmoid activation to the product of the weight matrix W_i and [h_(t-1), x_t] to determine the information to be updated, while a tanh activation of the product of the weight matrix W_C and [h_(t-1), x_t] yields the candidate cell information C̃_t; the new cell state C_t is the sum of the product of f_t with C_(t-1) and the product of I_t with C̃_t. The output gate O_t applies a sigmoid activation to the product of W_o and [h_(t-1), x_t], and the current hidden state h_t is the product of O_t and tanh(C_t);
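The gate equations described above are those of a standard LSTM cell and can be written out as a short sketch (generic illustration; the weight names W_f, W_i, W_C, W_o follow the claim):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_cell(x_t, h_prev, c_prev, W, b):
    # One LSTM step over the concatenation z = [h_(t-1), x_t].
    z = np.concatenate([h_prev, x_t])
    f_t = sigmoid(W["f"] @ z + b["f"])    # forget gate: keep/discard old state
    i_t = sigmoid(W["i"] @ z + b["i"])    # input gate: how much to update
    c_hat = np.tanh(W["c"] @ z + b["c"])  # candidate cell information
    c_t = f_t * c_prev + i_t * c_hat      # new cell state
    o_t = sigmoid(W["o"] @ z + b["o"])    # output gate
    h_t = o_t * np.tanh(c_t)              # new hidden state
    return h_t, c_t
```

Since the output gate lies in (0, 1) and tanh(C_t) in (-1, 1), every component of h_t is strictly inside (-1, 1).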
The LSTM module propagates the hidden state h_t and the cell state C_t, calculates the error term δ_t of each step, and, according to the back-propagation principle, derives the gradients of the forget-gate, input-gate and output-gate parameters W_f, W_I and W_o; the calculation and propagation process is shown in formula (7):
10. The method of claim 9, wherein a three-dimensional convolution kernel slides over the time sequence block and is convolved with the points of the block to obtain a high-order three-dimensional feature block, and the feature block is evenly divided into four small feature blocks serving as the forget gate f_t, input gate I_t, output gate O_t and candidate cell C̃_t of the LSTM cell; each gating mechanism in each Block-LSTM cell thus carries the timing characteristics of the data block, wherein the input gate I_t is spliced with the hidden-layer information h_(t-1) of the previous moment and activated with a sigmoid function; the forget gate f_t is spliced with h_(t-1) and activated with a sigmoid function; the candidate cell C̃_t is spliced with h_(t-1) and activated with a tanh function; and the new cell state C_t is the sum of the product of C̃_t with I_t and the product of f_t with the previous cell state C_(t-1). The cell operation of the SDBRNN is shown in formula (8), where * denotes the convolution operation;
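The division of the convolved feature block into four gate blocks can be sketched as follows. The element-wise addition of h_(t-1) inside each gate is a simplified stand-in for the splice-and-convolve operation of formula (8), which is not reproduced in the source:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def block_lstm_step(feat, h_prev, c_prev):
    # feat: high-order feature block from the 3-D convolution, shape (4*C, H, W),
    # evenly divided into the four gate blocks of the cell.
    f_raw, i_raw, o_raw, c_raw = np.split(feat, 4, axis=0)
    f_t = sigmoid(f_raw + h_prev)      # forget gate, combined with h_(t-1)
    i_t = sigmoid(i_raw + h_prev)      # input gate
    o_t = sigmoid(o_raw + h_prev)      # output gate
    c_hat = np.tanh(c_raw + h_prev)    # candidate cell block
    c_t = f_t * c_prev + i_t * c_hat   # new cell state
    h_t = o_t * np.tanh(c_t)           # new hidden state
    return h_t, c_t
```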
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011525943.8A CN112949896B (en) | 2020-12-22 | 2020-12-22 | Time sequence prediction method based on fusion sequence decomposition and space-time convolution |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112949896A true CN112949896A (en) | 2021-06-11 |
CN112949896B CN112949896B (en) | 2022-06-17 |
Family
ID=76234825
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011525943.8A Active CN112949896B (en) | 2020-12-22 | 2020-12-22 | Time sequence prediction method based on fusion sequence decomposition and space-time convolution |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112949896B (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109862585A (en) * | 2019-01-31 | 2019-06-07 | 湖北工业大学 | A kind of dynamic heterogeneous network flow prediction method based on depth Space-time Neural Network |
CN109977098A (en) * | 2019-03-08 | 2019-07-05 | 北京工商大学 | Non-stationary time-series data predication method, system, storage medium and computer equipment |
CN111260030A (en) * | 2020-01-13 | 2020-06-09 | 润联软件系统(深圳)有限公司 | A-TCN-based power load prediction method and device, computer equipment and storage medium |
CN111325976A (en) * | 2020-02-24 | 2020-06-23 | 交通运输部水运科学研究所 | Short-term traffic flow prediction method and system |
Non-Patent Citations (1)
Title |
---|
Lan Qin et al.: "Effective passenger flow forecasting using STL and ESN based on two improvement strategies", Neurocomputing * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113569479A (en) * | 2021-07-27 | 2021-10-29 | 天津大学 | Long-term multi-step control method, device and storage medium for rock fracture development of stone cave temple |
CN113569479B (en) * | 2021-07-27 | 2023-11-10 | 天津大学 | Long-term multi-step control method, device and storage medium for rock mass crack development of stone cave temple |
CN114239948A (en) * | 2021-12-10 | 2022-03-25 | 浙江省交通投资集团有限公司智慧交通研究分公司 | Deep traffic flow prediction method, medium and equipment based on time sequence decomposition unit |
CN114239948B (en) * | 2021-12-10 | 2023-07-21 | 浙江省交通投资集团有限公司智慧交通研究分公司 | Deep traffic flow prediction method, medium and equipment based on time sequence decomposition unit |
CN115545315A (en) * | 2022-10-12 | 2022-12-30 | 重庆移通学院 | PM2.5 prediction method based on three-dimensional convolutional neural network and gated cyclic unit |
CN117668477A (en) * | 2024-01-31 | 2024-03-08 | 山东科技大学 | Intelligent lightweight processing method and system for ocean big data |
CN117668477B (en) * | 2024-01-31 | 2024-04-26 | 山东科技大学 | Intelligent lightweight processing method and system for ocean big data |
Also Published As
Publication number | Publication date |
---|---|
CN112949896B (en) | 2022-06-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112949896B (en) | Time sequence prediction method based on fusion sequence decomposition and space-time convolution | |
CN110223517B (en) | Short-term traffic flow prediction method based on space-time correlation | |
CN108510741B (en) | Conv1D-LSTM neural network structure-based traffic flow prediction method | |
CN108448610B (en) | Short-term wind power prediction method based on deep learning | |
CN108024158A (en) | There is supervision video abstraction extraction method using visual attention mechanism | |
CN107066445B (en) | The deep learning method of one attribute emotion word vector | |
CN110929092B (en) | Multi-event video description method based on dynamic attention mechanism | |
CN113487088A (en) | Traffic prediction method and device based on dynamic space-time diagram convolution attention model | |
CN109389244A (en) | Tourist's number prediction technique in a kind of short-term scenic spot of multifactor perception based on GRU | |
CN108876044B (en) | Online content popularity prediction method based on knowledge-enhanced neural network | |
CN112733444A (en) | Multistep long time sequence prediction method based on CycleGAN neural network | |
CN110458181A (en) | A kind of syntax dependency model, training method and analysis method based on width random forest | |
CN110570035B (en) | People flow prediction system for simultaneously modeling space-time dependency and daily flow dependency | |
CN109829495A (en) | Timing image prediction method based on LSTM and DCGAN | |
Dong et al. | An integrated deep neural network approach for large-scale water quality time series prediction | |
CN105447510B (en) | Fluctuating wind speed prediction technique based on artificial bee colony optimization LSSVM | |
CN113112791A (en) | Traffic flow prediction method based on sliding window long-and-short term memory network | |
CN109918493A (en) | A kind of dialogue generation method based on shot and long term Memory Neural Networks | |
CN110188200A (en) | A kind of depth microblog emotional analysis method using social context feature | |
CN113935489A (en) | Variational quantum model TFQ-VQA based on quantum neural network and two-stage optimization method thereof | |
CN115952424A (en) | Graph convolution neural network clustering method based on multi-view structure | |
CN115481784A (en) | Traffic flow prediction method based on improved combination model and application | |
Liu | Language database construction method based on big data and deep learning | |
CN114596726A (en) | Parking position prediction method based on interpretable space-time attention mechanism | |
CN113919593A (en) | ME-MBiGRU short-time traffic flow prediction method based on attention mechanism |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||