CN114565149A

CN114565149A - CGA fusion model-based time series data prediction method and device and computer equipment

Info

Publication number: CN114565149A
Application number: CN202210169309.8A
Authority: CN
Inventors: 雷建军; 邓磊; 秦振宇
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2022-02-23
Filing date: 2022-02-23
Publication date: 2022-05-31

Abstract

The invention relates to the technical field of time sequence prediction, in particular to a time sequence data prediction method, a device and computer equipment based on a CGA fusion model; the method comprises the following steps: acquiring historical target data, and preprocessing the historical target data to obtain preprocessed historical target data; inputting the preprocessed historical target data into the constructed CGA fusion model to obtain a time sequence prediction result of the target data, and controlling future traffic flow according to the time sequence prediction result; the invention combines the stack type and the parallel type to fuse the multi-channel multi-scale convolution neural network and the gate control cycle unit network, and combines the multi-channel multi-scale convolution neural network, the gate control cycle unit network and the autoregressive model in a residual error mode, thereby reducing the training difficulty of the model, avoiding the loss of original information caused by a convolution structure and a GRU structure, and further improving the prediction precision.

Description

Time series data prediction method and device based on CGA fusion model and computer equipment

Technical Field

The invention belongs to the technical field of time sequence prediction, and particularly relates to a time sequence data prediction method and device based on a CGA fusion model, and computer equipment.

Background

The time series prediction task is a type of task ubiquitous in machine learning, and is widely applied to the fields of finance, industry, manufacturing, transportation and the like, such as power load prediction, traffic flow prediction, stock price prediction, weather data prediction and the like. With the rapid development of deep learning technology in recent years, the deep learning technology has proved to be an important modeling method in many fields. Meanwhile, some time series prediction methods have been developed, however, the existing time series prediction method based on the neural network has some problems, for example, the local features of the time series data on different time scales, the global time series feature of the whole data and the burst feature of the time series data are not considered at the same time, that is, the non-linear feature and the linear feature are not considered at the same time; or the single model has limited performance and limited applicable scenes, so that the prediction precision is not high. The following description will be made for a currently existing time series prediction method.

A multi-element time sequence multi-layer space-time dependency modeling method based on deep learning is disclosed in a patent with the application number of CN202010496285.8 and the name of 'multi-element time sequence multi-layer space-time dependency modeling method based on deep learning'. The method mainly utilizes a CNN-LSTM structure to preliminarily extract features and consists of a CNN space attention mechanism, a CNN channel attention mechanism, a CNN time attention mechanism and an autoregressive component. The method has the advantages that the concept of multi-layer space-time dependence is introduced, the CNN-based channel attention mechanism and the CNN-based space attention mechanism are used for respectively paying attention to space-time dependence characteristics of different layers, the redundant information is filtered, the characteristics with larger influence on the prediction result are effectively extracted, and the purpose of improving the prediction result is achieved. Although the method combines the nonlinear characteristic and the linear characteristic at the same time, the method does not consider the local characteristics of different time scales and the stacked CNN-LSTM structure, thereby increasing the training difficulty in model training.

A CNN-based time series prediction method and a model determination method are disclosed in a patent with an application number of CN201910460741.0 entitled "CNN-based time series prediction method and model determination method". The method comprises the following steps: acquiring historical time sequence data, and determining cycle parameters according to cycle characteristics of the historical time sequence data, wherein the cycle parameters comprise cycle types and corresponding cycle parameters; determining corresponding component data of the prediction time point in the historical time series data based on the prediction time point, the historical time series data, the period parameter and a preset cycle span, wherein the component data comprises closest time period data and period data; and predicting the component data by adopting the determined CNN model to obtain a prediction result corresponding to the prediction time point. The core of the method is to use the period parameters of the data, respectively use CNN to extract features through long, short periods and adjacent data, and predict after fusing the features. Although the method considers the characteristics of different time scales, the method has certain requirements on the periodicity of data, does not consider the non-periodic time-dependent characteristics and the short-term linear characteristics, and has certain limitations on the prediction effect and the applicable scene.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a time series data prediction method, a device and computer equipment based on a CGA fusion model, wherein the method comprises the following steps: acquiring historical data of target data, and preprocessing the historical target data to obtain preprocessed historical target data; the target data is traffic flow data; inputting the preprocessed historical target data into the constructed CGA fusion model to obtain a time sequence prediction result of the target data, and controlling future traffic flow according to the time sequence prediction result; the CGA fusion model comprises a first layer of multi-channel multi-scale convolution neural network, a gated circulation unit network, a second layer of multi-channel multi-scale convolution neural network and an autoregressive model;

the process of constructing the CGA fusion model comprises the following steps:

s1: inputting historical target data into a first layer of multi-channel multi-scale convolutional neural network MCMSCNN to obtain primary features;

s2: inputting the primary characteristics into a gated cycle unit network GRU to obtain time sequence characteristics;

s3: inputting the primary features into a second-layer multi-channel multi-scale convolutional neural network MCMSCNN to obtain local features;

s4: inputting historical target data into an autoregressive model AR to obtain linear characteristics;

s5: and adding the time sequence characteristics, the local characteristics and the linear characteristics to obtain a constructed CGA fusion model.

Preferably, the process of preprocessing the historical target data includes:

removing abnormal values from the historical target data and filling missing values to obtain complete target data;

carrying out normalization processing on the perfect target data to obtain normalized data;

and setting the length T of a sliding window, and sampling the normalized data by adopting the sliding window to obtain preprocessed historical target data.

Preferably, the process of obtaining the preliminary characteristics comprises: convolution cores with three different sizes are adopted to carry out convolution, activation and dropout processing on input data, and three feature tensors with different lengths, namely primary features, are obtained.

Preferably, the process of obtaining the time sequence characteristics includes: processing the primary characteristics, and inputting the processed primary characteristics into a gated cycle unit network GRU to obtain time sequence characteristics; processing the preliminary features includes:

filling two feature tensors with shorter lengths in the three feature tensors with different lengths to obtain three feature tensors with the same length;

and splicing the three feature tensors with the same length to obtain the processed primary features.

Preferably, the process of obtaining the local feature includes:

performing convolution, activation, maximum pooling and dropout processing on input data by adopting convolution kernels with three different sizes to obtain three feature tensors with different lengths;

splicing and flatten operations are carried out on the feature tensors with three different lengths, and the feature tensors are output through a linear layer to obtain local features;

preferably, the process of obtaining the time-series prediction result of the target data includes:

inputting the sampling data of the first sliding window into a CGA fusion model to obtain a predicted value at the current moment;

splicing the target data with the predicted value at the current moment, sliding a window, intercepting the next sliding window sampling data, and inputting the next sliding window sampling data into a CGA fusion model to obtain the predicted value at the next moment; and repeating the process to obtain a time sequence prediction result of the target data.

A CGA fusion model-based time series data prediction device comprises: the system comprises a data preprocessing module, a primary feature extraction module, a local feature extraction module, a time sequence feature extraction module, a linear feature capture module and a time sequence prediction module;

the data preprocessing module is used for preprocessing historical target data;

the primary feature extraction module is used for extracting features of the preprocessed historical target data and outputting primary features;

the local feature extraction module is used for extracting local features according to the primary features and outputting the local features;

the time sequence feature extraction module is used for acquiring time sequence features according to the primary features;

the linear feature capturing module is used for acquiring linear features according to historical target data;

the time sequence prediction module is used for obtaining a time sequence prediction result according to the local characteristic, the time sequence characteristic and the linear characteristic and outputting the time sequence prediction result.

A computer device comprising a processor and a memory, said memory storing computer readable instructions which, when executed by said processor, perform the steps of the above method.

The invention has the beneficial effects that: compared with the existing time sequence prediction method, the CGA fusion model provided by the invention comprehensively considers nonlinear features and linear features in the aspect of feature extraction, wherein the nonlinear features comprise local features of a plurality of time scales captured by MCMSCNN, convolution output vectors of the scales are spliced, on one hand, the time sequence features are captured globally by GRU, on the other hand, the MCMSCNN added with a pooling layer is continuously sent to extract more complex local features; the AR model can capture the linear characteristics of the short-time burst, and the prediction result is finally obtained. Because the characteristics of time sequence data extraction in the fusion model are rich, a more accurate prediction result can be obtained. In the aspect of model fusion, the invention is different from a typical stacked mode and a typical parallel mode, combines the two modes simultaneously to fuse the MCMSCNN and GRU structures, combines the MCMSCNN, the GRU and the AR in a residual error mode, reduces the training difficulty of the model, and avoids the loss of original information caused by a convolution structure and the GRU structure, thereby further improving the prediction precision.

Drawings

FIG. 1 is a schematic diagram of a CGA fusion model framework according to the present invention;

FIG. 2 is a flow chart of the construction of a CGA fusion model according to the present invention;

FIG. 3 is a diagram illustrating the multi-scale convolution according to the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

The invention provides a time series data prediction method, a time series data prediction device and computer equipment based on a CGA fusion model, and as shown in figure 1, the method comprises the following steps: acquiring historical data of target data, and preprocessing the historical target data to obtain preprocessed historical target data; the target data is traffic flow data; inputting the preprocessed historical target data into the constructed CGA fusion model to obtain a time sequence prediction result of the target data, and controlling future traffic flow according to the time sequence prediction result; the CGA fusion model comprises a first layer of multi-channel multi-scale convolution neural network, a gate control cycle unit network, a second layer of multi-channel multi-scale convolution neural network and an autoregressive model;

as shown in fig. 2, the process of constructing the CGA fusion model includes:

s5: and adding the time sequence characteristic, the local characteristic and the linear characteristic to obtain the constructed CGA fusion model.

Acquiring historical target data, and preprocessing the historical target data; specifically, removing abnormal values from the historical target data and filling missing values to obtain complete target data; carrying out normalization processing on the perfect target data to obtain normalized data; setting the length T of a sliding window, and sampling the normalized data by adopting the sliding window to obtain preprocessed historical target data; preferably, the dimensions of the historical target data may be multidimensional data, i.e., not limited to cell or multi-element time series data.

As shown in fig. 3, inputting historical target data into a first-layer multichannel multi-scale convolutional neural network MCMSCNN to obtain a primary feature; the first layer of multi-channel multi-scale convolution neural network MCMSCNN is composed of convolution kernels with various sizes, preferably, the number of the convolution kernels is 3, the sizes are respectively 1 × 2, 1 × 3 and 1 × 4, the number of the convolution kernels in each size is more than 2, and for example, the number of the convolution kernels in each size is 50; the first layer of MCMSCNN processes the historical target data as follows: performing convolution, activation and dropout processing on input data by adopting convolution cores with three different sizes to obtain three feature tensors with different lengths, wherein the three feature tensors with different lengths are primary features;

processing the primary features:

filling two feature tensors with shorter lengths in the three feature tensors with different lengths to obtain three feature tensors with the same length; splicing three feature tensors with the same length to obtain processed primary features; and inputting the processed primary characteristics into a gated cycle unit network GRU to obtain the time sequence characteristics.

After the historical target data is subjected to primary local feature extraction through the first layer of MCMSCNN, the historical target data still has a time sequence relation in a time dimension; the GRU is a recurrent neural network, is suitable for capturing time sequence characteristics, and has the advantages that the parameter quantity can be smaller under similar performance compared with an LSTM (long short term memory network); preferably, the primary signature is inputted into the gated loop unit network GRU corresponding to each time point thereof, and the timing signature can be captured. Optionally, in the gated cyclic unit neural network structure, the depth is not limited to one layer, and the gated cyclic unit structure may be replaced by other cyclic neural network structures (including, but not limited to, network structures such as RNN, LSTM, GRU, SRU, etc. of one or two directions).

Inputting the primary features into a second layer of multi-channel multi-scale convolutional neural network MCMSCNN to obtain local features; the second layer of multi-channel multi-scale convolutional neural network MCMSCNN is composed of convolution kernels with various sizes, preferably, the number of the convolution kernels is 3, the sizes are respectively 1 × 2, 1 × 3 and 1 × 4, the number of the convolution kernels in each size is more than 2, and for example, the number of the convolution kernels in each size is 50; the process of the second layer MCMSCNN for processing the primary features is as follows:

preferably, the historical target data is input into an autoregressive model AR to obtain linear characteristics:

feature_AR＝w^Tx+b

x＝(x₁,x₂,...,x_n)

wherein feature_ARRepresenting a linear characteristic, x_nThe value represented on the nth attribute of the historical target data corresponds to a time sequence and is the value of the nth time in a certain training sample, and w is equal to (w)₁,w₂,...,w_n) And b are parameters that require training.

Alternatively, the regression model may be replaced with other linear models (including but not limited to ARIMA, ARMA, MA, etc. models).

Adding the time sequence characteristics, the local characteristics and the linear characteristics to obtain a constructed CGA fusion model:

feature_all＝feature_CNN+feature_GRU+feature_AR

wherein feature_CNNLocal features (local features at various time scales) representing MCMSCNN extraction; feature_GRURepresenting the time sequence characteristics extracted by the GRU; feature_ARLinear features of AR extraction are represented.

Inputting the preprocessed historical target data into a constructed CGA fusion model, wherein the process of carrying out multi-step time sequence prediction on the CGA fusion model according to the historical target data comprises the following steps:

intercepting the first time window t by adopting a sliding window_N,…,t_N+T-1Inputting the sampled data into the CGA fusion model, and performing inverse normalization on the output of the CGA fusion model to obtain the current time t_N+TThe predicted value of (2);

splicing the target data with the predicted value of the current moment, and intercepting the next time window { t ] by a sliding window_N+1,…,t_N+TInputting the sampling data into the CGA fusion model to obtain the next time t_N+T+1The predicted value of (2); repeating the process to obtain the t of the target data_N+T+2,t_N+T+3… } time series prediction results.

If only single-step time sequence prediction is needed, a sliding window is not needed, the current sampling data is input into the CGA fusion model, and the output of the CGA fusion model is subjected to inverse normalization to obtain a time sequence prediction result of the target data at the next moment.

The user can control the future traffic flow according to the time sequence prediction result.

The invention also provides a CGA fusion model-based time series data prediction device, which comprises: the system comprises a data preprocessing module, a primary feature extraction module, a local feature extraction module, a time sequence feature extraction module, a linear feature capture module and a time sequence prediction module;

the data preprocessing module is used for preprocessing historical target data;

The invention also provides computer equipment comprising a processor and a memory, wherein the memory stores computer readable instructions, and the computer readable instructions are executed by the processor to execute the steps of the method.

Compared with the existing time sequence prediction method, the CGA fusion model comprehensively considers nonlinear features and linear features in the aspect of feature extraction, wherein the nonlinear features comprise local features of a plurality of time scales captured by MCMSCNN, convolution output vectors of the plurality of scales are spliced, on one hand, the convolution output vectors are sent to GRU to capture time sequence features globally, on the other hand, the MCMSCNN added with a pooling layer is continuously sent to the GRU, and more complex local features are extracted; the AR model can capture the linear characteristics of the short-time burst, and the prediction result is finally obtained. Because the characteristics of time sequence data extraction in the fusion model are rich, a more accurate prediction result can be obtained. In the aspect of model fusion, the method is different from a typical stacked type and a parallel type, a convolutional neural network and a gated cyclic unit neural network are fused by combining a stacking mode and a parallel mode, and an autoregressive model is combined in a residual error mode, so that the loss of original information caused by model stacking is avoided, the training difficulty is reduced, and the prediction precision is further improved.

The above-mentioned embodiments, which further illustrate the objects, technical solutions and advantages of the present invention, should be understood that the above-mentioned embodiments are only preferred embodiments of the present invention, and should not be construed as limiting the present invention, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A CGA fusion model-based time series data prediction method is characterized by comprising the following steps: acquiring historical data of target data, and preprocessing the historical target data to obtain preprocessed historical target data; the target data is traffic flow data; inputting the preprocessed historical target data into the constructed CGA fusion model to obtain a time sequence prediction result of the target data, and controlling future traffic flow according to the time sequence prediction result; the CGA fusion model comprises a first layer of multi-channel multi-scale convolution neural network, a gate control cycle unit network, a second layer of multi-channel multi-scale convolution neural network and an autoregressive model;

the process of constructing the CGA fusion model comprises the following steps:

2. The CGA fusion model-based time series data prediction method of claim 1, wherein the pre-processing of the historical target data comprises:

and setting the length T of a sliding window, and sampling the normalized data by adopting the sliding window to obtain the preprocessed historical target data.

3. The method of claim 1, wherein the step of obtaining the primary features comprises: convolution cores with three different sizes are adopted to carry out convolution, activation and dropout processing on input data, and three feature tensors with different lengths, namely primary features, are obtained.

4. The method of claim 1, wherein the step of obtaining the time series feature comprises: processing the primary characteristics, and inputting the processed primary characteristics into a gated cycle unit network GRU to obtain time sequence characteristics; processing the preliminary features includes:

5. The method of claim 1, wherein the obtaining of the local features comprises:

and splicing and flatten operations are carried out on the feature tensors with three different lengths, and the local features are obtained through linear layer output.

6. The method of claim 1, wherein obtaining the time-series prediction result of the target data comprises:

7. A time series data prediction device based on a CGA fusion model is characterized by comprising: the system comprises a data preprocessing module, a primary feature extraction module, a local feature extraction module, a time sequence feature extraction module, a linear feature capture module and a time sequence prediction module;

the data preprocessing module is used for preprocessing historical target data;

8. A computer device comprising a processor and a memory, said memory storing computer readable instructions which, when executed by said processor, perform the steps of the method of any one of claims 1 to 6.