CN111428201A - Prediction method for time series data based on empirical mode decomposition and feedforward neural network - Google Patents


Info

Publication number
CN111428201A
CN111428201A
Authority
CN
China
Prior art keywords
data
data set
variable
training
value
Prior art date
Legal status
Granted
Application number
CN202010230486.3A
Other languages
Chinese (zh)
Other versions
CN111428201B (en)
Inventor
姚若侠
刘云鹤
Current Assignee
Shaanxi Normal University
Original Assignee
Shaanxi Normal University
Priority date
Filing date
Publication date
Application filed by Shaanxi Normal University
Priority to CN202010230486.3A
Publication of CN111428201A
Application granted
Publication of CN111428201B
Legal status: Active

Classifications

    • G — PHYSICS
    • G06 — COMPUTING; CALCULATING OR COUNTING
    • G06F — ELECTRIC DIGITAL DATA PROCESSING
    • G06F 17/00 — Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F 17/10 — Complex mathematical operations
    • G06F 17/18 — Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • G06F 17/16 — Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06N — COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 — Computing arrangements based on biological models
    • G06N 3/02 — Neural networks
    • G06N 3/04 — Architecture, e.g. interconnection topology
    • G06N 3/044 — Recurrent networks, e.g. Hopfield networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Algebra (AREA)
  • Computing Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A prediction method for time series data based on empirical mode decomposition and a feedforward neural network comprises the steps of data set missing-value processing, one-hot encoding, dimensionality reduction by principal component analysis, empirical mode decomposition, data standardization, feedforward neural network training, and testing on a test set. The invention adopts principal component analysis for dimensionality reduction together with empirical mode decomposition. Dimensionality reduction decreases the number of prediction variables while the resulting data retain most of the information of the original data, and it ensures that no variable obtained after dimensionality reduction carries duplicated original information. When the feedforward neural network is trained, the eigenmode functions are used in place of the original time series data and the dimension-reduced data set is used as input, which reduces the number of variables, yields accurate results, and greatly shortens training time. The method can be used for predicting time series data.

Description

Prediction method for time series data based on empirical mode decomposition and feedforward neural network
Technical Field
The invention belongs to the field of time series data prediction, and particularly relates to EMD decomposition, PCA dimensionality reduction, BP neural network training, and related methods.
Background
Many methods are available for predicting time series data, such as vector autoregressive models, autoregressive moving average models, autoregressive integrated moving average models, and regression-based methods such as linear support vector regression. These models often assume a deterministic distribution or functional form of the time series and fail to capture complex underlying nonlinear relationships. Other models, such as Gaussian processes, incur a high computational cost when processing large-scale data.
At present, most time series data recorded in real life has no clear functional characteristics. Feeding such data directly into a neural network for training cannot yield an optimal model and consumes a large amount of time, and existing time series prediction methods can predict only a point or two ahead without achieving a satisfactory prediction effect.
Disclosure of Invention
The invention aims to overcome the defects of existing prediction methods and provides a prediction method for time series data based on empirical mode decomposition and a feedforward neural network that gives accurate prediction results with fast data processing and high precision.
The technical scheme adopted for solving the technical problems comprises the following steps:
(1) processing missing values of a data set
For a time series data set Ab, when missing values occur at more than 3 consecutive positions, the rows containing the missing values are deleted; when missing values occupy 1 to 3 consecutive positions, they are filled by the mean interpolation method among the missing-value interpolation methods, giving a time series data set A{A1, A2, A3}, where A1{x1, x2, …, xm | xi = (xi1, xi2, …, xiq)^T, i is a finite positive integer; q is the number of samples of data set A} is the class variables in the data set, A2{y1, y2, …, yn | yj = (yj1, yj2, …, yjq)^T, j is a finite positive integer} is the other variables in the data set excluding the class variables and the time series data to be predicted, and A3{z1, z2, …, zp | zl = (zl1, zl2, …, zlq)^T, l is a finite positive integer} is the time series data to be predicted in the data set.
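The missing-value rule above (delete rows when more than 3 consecutive values are missing, otherwise fill by mean interpolation) can be sketched as follows; the function name and the use of NumPy are illustrative assumptions, not part of the patent:

```python
import numpy as np

def handle_missing(values, max_fill_run=3):
    """Fill NaN runs of length <= max_fill_run with the column mean;
    drop positions belonging to longer runs, per step (1)."""
    values = np.asarray(values, dtype=float)
    isnan = np.isnan(values)
    mean = values[~isnan].mean()           # mean of the observed values
    keep = np.ones(len(values), dtype=bool)
    i = 0
    while i < len(values):
        if isnan[i]:
            j = i
            while j < len(values) and isnan[j]:
                j += 1                     # extend to the end of the NaN run
            if j - i <= max_fill_run:
                values[i:j] = mean         # mean interpolation for short gaps
            else:
                keep[i:j] = False          # delete rows with long gaps
            i = j
        else:
            i += 1
    return values[keep]
```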
(2) One-hot encoding process
The class variables A1 in the data set are processed by the one-hot encoding method: for each class variable xi in A1, the number of distinct class values is counted and consecutive natural numbers are assigned to the class values of xi, the number of natural numbers equaling the number of class values; one-hot encoding then converts A1 into a binary coding matrix B{B1, B2, …, Bm | Bi is the data obtained by one-hot encoding the class variable xi}.
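A minimal sketch of the one-hot encoding step above; the function name and the sorted class ordering are illustrative assumptions:

```python
import numpy as np

def one_hot(column):
    """Map each distinct class value to a consecutive natural number,
    then expand to a binary (one-hot) coding matrix, as in step (2)."""
    categories = sorted(set(column))                   # distinct class values
    index = {c: i for i, c in enumerate(categories)}   # class value -> natural number
    matrix = np.zeros((len(column), len(categories)), dtype=int)
    for row, value in enumerate(column):
        matrix[row, index[value]] = 1                  # single valid bit per row
    return matrix
```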
(3) Principal component analysis method for reducing dimension
The timestamp variable is removed from the other variables A2 of the data set to obtain the remaining variables A4{s1, s2, …, st | s1, s2, …, st are the variables of A2 remaining after removing the timestamp variable, t is a finite positive integer, t ≤ n}. The remaining variables A4 are plotted and their correlations observed, and A4 is reduced in dimension by the principal component analysis method to obtain a matrix P{p1, p2, …, pk | p1, p2, …, pk are the data obtained from A4 after dimensionality reduction by principal component analysis, k is a finite positive integer, k ≤ t}.
(4) Empirical mode decomposition
Empirical mode decomposition is performed on the time series data A3 in the data set to obtain a matrix I{IMF1, IMF2, …, IMFs, r | IMFe is an eigenmode function, e = 1, 2, …, s; s is the number of eigenmode functions obtained by empirical mode decomposition; r is the residue} containing the eigenmode functions and the residue.
(5) Data normalization process
Each eigenmode function IMFe and the residue r are spliced with the binary coding matrix B and the matrix P to form s new data sets Ce{IMFe, B, P} and one data set Cr{r, B, P}. All data sets Ce and the data set Cr are processed by the data standardization method to obtain the corresponding data sets De{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Ce is processed by the data standardization method, g is a finite positive integer} and Dr{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Cr is processed by the data standardization method}, and all data in Ce and Cr are projected onto the interval [−1, 1] according to the following formula:

x* = (x − xmean) / (xmax − xmin)

where x is the original value of a variable, x* is the standardized data value of each variable in data sets Ce and Cr, a value in the interval [−1, 1]; xmean is the mean of the values of each variable in Ce and Cr; xmax is the maximum of each variable's values in Ce and Cr; and xmin is the minimum of each variable's values in Ce and Cr.
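The projection formula above is plain mean normalization, which bounds the result in [−1, 1]; a one-line sketch with an illustrative function name:

```python
import numpy as np

def normalize(column):
    """Mean normalization of step (5): x* = (x - mean) / (max - min),
    which projects every value into the interval [-1, 1]."""
    x = np.asarray(column, dtype=float)
    return (x - x.mean()) / (x.max() - x.min())
```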
(6) Feed-forward neural network training
The data sets De and Dr serve as inputs to the feedforward neural network, and the samples of De and Dr are divided into a training set and a test set with a sample ratio of 450:1. The eigenmode functions IMFe and the residue r corresponding to De and Dr serve as outputs, and IMFe and r are likewise divided into a training set and a test set with a sample ratio of 450:1. The training sets are input into the feedforward neural network in turn to train the prediction models; training stops when the training-target minimum error falls below 0.001, yielding the prediction models.
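The training step can be sketched with a one-hidden-layer feedforward network trained by plain gradient descent and stopped at the patent's 0.001 error threshold; the toy data, layer sizes, and learning rate are illustrative assumptions, not values from the patent:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for one (D_e, IMF_e) input/output pair; the real inputs would
# be the standardized data sets of step (5), split 450:1 into train and test.
X = rng.uniform(-1.0, 1.0, (200, 3))
y = (0.5 * X[:, 0] - 0.3 * X[:, 2])[:, None]      # illustrative target

W1 = rng.normal(0.0, 0.5, (3, 8)); b1 = np.zeros(8)   # hidden layer
W2 = rng.normal(0.0, 0.5, (8, 1)); b2 = np.zeros(1)   # output layer
lr, mse = 0.1, np.inf
for epoch in range(20000):
    H = np.tanh(X @ W1 + b1)                      # forward pass
    out = H @ W2 + b2
    err = out - y
    mse = float((err ** 2).mean())
    if mse < 0.001:                               # stop at the error threshold
        break
    dW2 = H.T @ err / len(X); db2 = err.mean(axis=0)   # backpropagation
    dH = (err @ W2.T) * (1.0 - H ** 2)
    dW1 = X.T @ dH / len(X); db1 = dH.mean(axis=0)
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
```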
(7) Testing the test set
The test sets are input into the corresponding prediction models in turn to obtain prediction results; all prediction results are added to obtain the sum of predicted values, and the standard deviation between the sum of the prediction results and the true values is determined.
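Step (7) can be sketched as below; the patent does not spell out the exact deviation formula, so a root-mean-square deviation between the recombined prediction and the true series is assumed here, with illustrative names:

```python
import numpy as np

def combine_and_score(per_model_preds, true_values):
    """Sum the per-IMF/residue model predictions (step (7)) and report the
    root-mean-square deviation of the sum against the true series."""
    total = np.sum(per_model_preds, axis=0)        # sum of predicted IMFs + residue
    error = total - np.asarray(true_values, dtype=float)
    return total, float(np.sqrt((error ** 2).mean()))
```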
In the principal component analysis method dimensionality reduction step (3), the principal component analysis method is as follows:
1) Constructing the sample matrix
The standardized p-dimensional random vector x of the original data variables is obtained, with n samples of each variable:
x = (x1, x2, …, xp)^T
xi = (xi1, xi2, …, xin)^T, i = 1, 2, …, p
where n and p are finite positive integers with n > p. The matrix of samples is standardized as follows:
Zij = (xij − x̄j) / Sj, i = 1, 2, …, n; j = 1, 2, …, p
x̄j = (1/n) Σ(i=1..n) xij
Sj² = (1/(n − 1)) Σ(i=1..n) (xij − x̄j)²
where xij is the i-th sample of the j-th variable, Zij is the standardized value of xij, x̄j is the mean of the j-th variable, and Sj² is the sample variance of the j-th variable; the transformation yields the standardized matrix Z.
2) Determining the correlation coefficient matrix
The correlation coefficient matrix R = (rij)p×p is determined as follows:
rij = (1/(n − 1)) Σ(k=1..n) Zki Zkj, i, j = 1, 2, …, p
i.e. R = Z^T Z / (n − 1), where i and j are finite positive integers.
3) Determining the unit eigenvectors
The p characteristic roots λ1 ≥ λ2 ≥ … ≥ λp ≥ 0 of R are obtained from the characteristic equation:
|R − λIp| = 0
The value of m is obtained from the following criterion, which determines the principal components retained:
(λ1 + λ2 + … + λm) / (λ1 + λ2 + … + λp) ≥ t
where t represents the utilization rate of the information. For each λj, j = 1, 2, …, m, the system
Rbj = λj bj, bj^T bj = 1
is solved to obtain the unit eigenvector bj.
4) Converting the standardized variables into principal components
The principal components are determined as follows:
Uij = zi^T bj, i = 1, 2, …, n; j = 1, 2, …, m
where zi is the i-th row of the standardized matrix Z and Uj = (U1j, U2j, …, Unj)^T is the j-th principal component.
5) The m principal components obtained are combined by weighted summation, with the variance contribution rate of each principal component as its weight, to give the final evaluation value.
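Steps 1) to 5) above amount to an eigendecomposition of the correlation matrix; a compact sketch follows, where the function name and the default utilization rate t = 0.85 are illustrative assumptions:

```python
import numpy as np

def pca_correlation(X, t=0.85):
    """PCA as in steps 1)-5): standardize, form the correlation matrix,
    and keep the first m components whose eigenvalues reach utilization t."""
    X = np.asarray(X, dtype=float)
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # standardized matrix Z
    R = Z.T @ Z / (len(X) - 1)                         # correlation coefficient matrix
    eigvals, eigvecs = np.linalg.eigh(R)
    order = np.argsort(eigvals)[::-1]                  # descending characteristic roots
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    ratio = np.cumsum(eigvals) / eigvals.sum()         # cumulative utilization
    m = int(np.searchsorted(ratio, t) + 1)             # smallest m reaching t
    U = Z @ eigvecs[:, :m]                             # principal components
    return U, eigvals[:m]
```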
In the empirical mode decomposition step (4) of the present invention, the empirical mode decomposition method comprises the following steps:
1) Find all the local maximum and minimum points of the original time series x(t), and fit them with a cubic spline interpolation function to form the upper and lower envelopes of the data.
2) The mean m1(t) of the upper and lower envelopes is determined as follows:
m1(t) = (up(t) + low(t)) / 2
where up(t) is the upper envelope formed by the maxima and low(t) is the lower envelope formed by the minima.
3) Determining eigenmode functions
x(t)-m1(t)=h1(t)
Treating h1(t) as the new signal x(t), repeat steps 1) and 2) until h1(t) satisfies the conditions of an eigenmode function described below.
a. Over the whole time range, the number of local extreme points and the number of zero-crossings of the function are equal or differ by at most 1.
b. At any time, the mean of the upper envelope defined by the local maxima and the lower envelope defined by the local minima is zero.
4) The residual component r1(t) is determined as follows
r1(t)=x(t)-h1(t)
Where h1(t) is the first eigenmode function.
5) Taking the residual component r1(t) as new original data, and repeating the steps 1) to 4) until all eigenmode functions and 1 trend term are obtained.
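One sifting pass of steps 1) to 3) can be sketched as follows; to keep the sketch dependency-free, linear interpolation stands in for the cubic splines named in the patent, and the function name is illustrative:

```python
import numpy as np

def sift_once(x, t):
    """One EMD sifting step: locate extrema, build upper/lower envelopes,
    and subtract their mean m1(t) to get the candidate IMF h1(t)."""
    maxima = [i for i in range(1, len(x) - 1) if x[i] > x[i - 1] and x[i] > x[i + 1]]
    minima = [i for i in range(1, len(x) - 1) if x[i] < x[i - 1] and x[i] < x[i + 1]]
    up = np.interp(t, t[maxima], x[maxima])    # upper envelope up(t)
    low = np.interp(t, t[minima], x[minima])   # lower envelope low(t)
    m1 = (up + low) / 2.0                      # envelope mean m1(t)
    return x - m1                              # candidate eigenmode function h1(t)
```

Repeating this pass until the two IMF conditions hold, then subtracting the IMF and sifting the residue again, reproduces the full decomposition of steps 1) to 5).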
The invention processes the missing values of a time series data set, applies the one-hot encoding method to the class variables among the external variables of the processed data set, reduces the dimension of the remaining variables (excluding the class variables, the time series data to be predicted, and the timestamp variable) by the principal component analysis method, decomposes the time series data to be predicted into eigenmode functions and a residue by the empirical mode decomposition method, splices the results into new data sets, processes the resulting data sets by the data standardization method, and inputs them into a feedforward neural network for training to obtain prediction models with which the test set is predicted. Because the data set is processed in this way, the training process saves substantial time, the number of iterations needed to reach convergence is reduced, and the training precision is improved. The method has a good prediction effect on time series data with complex signal characteristics.
Drawings
FIG. 1 is a flow chart of embodiment 1 of the present invention.
Detailed Description
The present invention will be described in further detail below with reference to the drawings and examples, but the present invention is not limited to the embodiments described below.
Example 1
In FIG. 1, the steps of the prediction method for time series data based on empirical mode decomposition and a feedforward neural network of this embodiment are as follows:
(1) processing missing values of a data set
For a time series data set Ab, when missing values occur at more than 3 consecutive positions, the rows containing the missing values are deleted; when missing values occupy 1 to 3 consecutive positions, they are filled by the mean interpolation method among the missing-value interpolation methods, giving a time series data set A{A1, A2, A3}, where A1{x1, x2, …, xm | xi = (xi1, xi2, …, xiq)^T, i is a finite positive integer; q is the number of samples of data set A} is the class variables in the data set, A2{y1, y2, …, yn | yj = (yj1, yj2, …, yjq)^T, j is a finite positive integer} is the other variables in the data set excluding the class variables and the time series data to be predicted, and A3{z1, z2, …, zp | zl = (zl1, zl2, …, zlq)^T, l is a finite positive integer} is the time series data to be predicted in the data set.
In this embodiment, the missing values of the time series data set Ab are analyzed and processed, which improves the precision of the feedforward neural network training and yields a more accurate training result.
(2) One-hot encoding process
The class variables A1 in the data set are processed by the one-hot encoding method: for each class variable xi in A1, the number of distinct class values is counted and consecutive natural numbers are assigned to the class values of xi, the number of natural numbers equaling the number of class values; one-hot encoding then converts A1 into a binary coding matrix B{B1, B2, …, Bm | Bi is the data obtained by one-hot encoding the class variable xi}.
Because this embodiment applies the one-hot encoding method to the class variables, each encoded value has only one valid bit, and the positions of the valid bits differ, ensuring that the encoded data are mutually distinct. The distances between the class values are recalculated for data processed by the one-hot encoding method, and the processed data can be used for feedforward neural network training.
(3) Principal component analysis method for reducing dimension
The timestamp variable is removed from the other variables A2 of the data set to obtain the remaining variables A4{s1, s2, …, st | s1, s2, …, st are the variables of A2 remaining after removing the timestamp variable, t is a finite positive integer, t ≤ n}. The remaining variables A4 are plotted and their correlations observed, and A4 is reduced in dimension by the principal component analysis method to obtain a matrix P{p1, p2, …, pk | p1, p2, …, pk are the data obtained from A4 after dimensionality reduction by principal component analysis, k is a finite positive integer, k ≤ t}.
The principal component analysis method of this example is as follows:
1) Constructing the sample matrix
The standardized p-dimensional random vector x of the original data variables is obtained, with n samples of each variable:
x = (x1, x2, …, xp)^T
xi = (xi1, xi2, …, xin)^T, i = 1, 2, …, p
where n and p are finite positive integers with n > p. The matrix of samples is standardized as follows:
Zij = (xij − x̄j) / Sj, i = 1, 2, …, n; j = 1, 2, …, p
x̄j = (1/n) Σ(i=1..n) xij
Sj² = (1/(n − 1)) Σ(i=1..n) (xij − x̄j)²
where xij is the i-th sample of the j-th variable, Zij is the standardized value of xij, x̄j is the mean of the j-th variable, and Sj² is the sample variance of the j-th variable; the transformation yields the standardized matrix Z.
2) Determining the correlation coefficient matrix
The correlation coefficient matrix R = (rij)p×p is determined as follows:
rij = (1/(n − 1)) Σ(k=1..n) Zki Zkj, i, j = 1, 2, …, p
i.e. R = Z^T Z / (n − 1), where i and j are finite positive integers.
3) Determining the unit eigenvectors
The p characteristic roots λ1 ≥ λ2 ≥ … ≥ λp ≥ 0 of R are obtained from the characteristic equation:
|R − λIp| = 0
The value of m is obtained from the following criterion, which determines the principal components retained:
(λ1 + λ2 + … + λm) / (λ1 + λ2 + … + λp) ≥ t
where t represents the utilization rate of the information. For each λj, j = 1, 2, …, m, the system
Rbj = λj bj, bj^T bj = 1
is solved to obtain the unit eigenvector bj.
4) Converting the standardized variables into principal components
The principal components are determined as follows:
Uij = zi^T bj, i = 1, 2, …, n; j = 1, 2, …, m
where zi is the i-th row of the standardized matrix Z and Uj = (U1j, U2j, …, Unj)^T is the j-th principal component.
5) The m principal components obtained are combined by weighted summation, with the variance contribution rate of each principal component as its weight, to give the final evaluation value.
The principal component analysis method finds the variables that are correlated with one another and reduces the number of prediction variables through dimensionality reduction, while the resulting data contain most of the information of the original data, and it is guaranteed that no variable obtained after dimensionality reduction carries duplicated original information. When the feedforward neural network is trained, the dimension-reduced data set is used as input, which gives accurate results, reduces the number of variables, greatly shortens the training time, and improves the training precision.
(4) Empirical mode decomposition
Empirical mode decomposition is performed on the time series data A3 in the data set to obtain a matrix I{IMF1, IMF2, …, IMFs, r | IMFe is an eigenmode function, e = 1, 2, …, s; s is the number of eigenmode functions obtained by empirical mode decomposition; r is the residue} containing the eigenmode functions and the residue.
The empirical mode decomposition method of the embodiment comprises the following steps:
1) Find all the local maximum and minimum points of the original time series x(t), and fit them with a cubic spline interpolation function to form the upper and lower envelopes of the data.
2) The mean m1(t) of the upper and lower envelopes is determined as follows:
m1(t) = (up(t) + low(t)) / 2
where up(t) is the upper envelope formed by the maxima and low(t) is the lower envelope formed by the minima.
3) Determining eigenmode functions
x(t)-m1(t)=h1(t)
Treating h1(t) as the new signal x(t), repeat steps 1) and 2) until h1(t) satisfies the conditions of an eigenmode function described below.
a. Over the whole time range, the number of local extreme points and the number of zero-crossings of the function are equal or differ by at most 1.
b. At any time, the mean of the upper envelope defined by the local maxima and the lower envelope defined by the local minima is zero.
4) The residual component r1(t) is determined as follows
r1(t)=x(t)-h1(t)
Where h1(t) is the first eigenmode function.
5) Taking the residual component r1(t) as new original data, and repeating the steps 1) to 4) until all eigenmode functions and 1 trend term are obtained.
Empirical mode decomposition of the time series data A3 in the data set yields components that each have the characteristics of a single eigenmode function; in the neural network training, the eigenmode functions are used for training instead of the original time series data, which saves training time and improves training precision.
(5) Data normalization process
Each eigenmode function IMFe and the residue r are spliced with the binary coding matrix B and the matrix P to form s new data sets Ce{IMFe, B, P} and one data set Cr{r, B, P}. All data sets Ce and the data set Cr are processed by the data standardization method to obtain the corresponding data sets De{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Ce is processed by the data standardization method, g is a finite positive integer} and Dr{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Cr is processed by the data standardization method}, and all data in Ce and Cr are projected onto the interval [−1, 1] according to the following formula:

x* = (x − xmean) / (xmax − xmin)

where x is the original value of a variable, x* is the standardized data value of each variable in data sets Ce and Cr, a value in the interval [−1, 1]; xmean is the mean of the values of each variable in Ce and Cr; xmax is the maximum of each variable's values in Ce and Cr; and xmin is the minimum of each variable's values in Ce and Cr.
Processing the data sets with the standardization method in the above step and using the processed data sets for feedforward neural network training improves the convergence speed and training precision of the model.
(6) Feed-forward neural network training
The data sets De and Dr serve as inputs to the feedforward neural network, and the samples of De and Dr are divided into a training set and a test set with a sample ratio of 450:1. The eigenmode functions IMFe and the residue r corresponding to De and Dr serve as outputs, and IMFe and r are likewise divided into a training set and a test set with a sample ratio of 450:1. The training sets are input into the feedforward neural network in turn to train the prediction models; training stops when the training-target minimum error falls below 0.001, yielding the prediction models.
In this embodiment, the processed data sets are input into the feedforward neural network to obtain all the prediction models, which saves training time, reduces the number of training iterations required for convergence, and improves training precision.
(7) Testing the test set
The test sets are input into the corresponding prediction models in turn to obtain prediction results; all prediction results are added to obtain the sum of predicted values, and the standard deviation between the sum of the prediction results and the true values is determined.

Claims (3)

1. A prediction method for time series data based on empirical mode decomposition and a feedforward neural network is characterized by comprising the following steps:
(1) processing missing values of a data set
For a time series data set Ab, when missing values occur at more than 3 consecutive positions, the rows containing the missing values are deleted; when missing values occupy 1 to 3 consecutive positions, they are filled by the mean interpolation method among the missing-value interpolation methods, giving a time series data set A{A1, A2, A3}, where A1{x1, x2, …, xm | xi = (xi1, xi2, …, xiq)^T, i is a finite positive integer; q is the number of samples of data set A} is the class variables in the data set, A2{y1, y2, …, yn | yj = (yj1, yj2, …, yjq)^T, j is a finite positive integer} is the other variables in the data set excluding the class variables and the time series data to be predicted, and A3{z1, z2, …, zp | zl = (zl1, zl2, …, zlq)^T, l is a finite positive integer} is the time series data to be predicted in the data set;
(2) one-hot encoding process
The class variables A1 in the data set are processed by the one-hot encoding method: for each class variable xi in A1, the number of distinct class values is counted and consecutive natural numbers are assigned to the class values of xi, the number of natural numbers equaling the number of class values; one-hot encoding then converts the class variables A1 into a binary coding matrix B{B1, B2, …, Bm | Bi is the data obtained by one-hot encoding the class variable xi};
(3) principal component analysis method for reducing dimension
The timestamp variable is removed from the other variables A2 of the data set to obtain the remaining variables A4{s1, s2, …, st | s1, s2, …, st are the variables of A2 remaining after removing the timestamp variable, t is a finite positive integer, t ≤ n}; the remaining variables A4 are plotted and their correlations observed, and A4 is reduced in dimension by the principal component analysis method to obtain a matrix P{p1, p2, …, pk | p1, p2, …, pk are the data obtained from A4 after dimensionality reduction by principal component analysis, k is a finite positive integer, k ≤ t};
(4) empirical mode decomposition
Empirical mode decomposition is performed on the time series data A3 in the data set to obtain a matrix I{IMF1, IMF2, …, IMFs, r | IMFe is an eigenmode function, e = 1, 2, …, s; s is the number of eigenmode functions obtained by empirical mode decomposition; r is the residue} containing the eigenmode functions and the residue;
(5) data normalization process
Each eigenmode function IMFe and the residue r are spliced with the binary coding matrix B and the matrix P to form s new data sets Ce{IMFe, B, P} and one data set Cr{r, B, P}; all data sets Ce and the data set Cr are processed by the data standardization method to obtain the corresponding data sets De{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Ce is processed by the data standardization method, g is a finite positive integer} and Dr{d1, d2, …, dg | d1, d2, …, dg are the data obtained after Cr is processed by the data standardization method}, and all data in Ce and Cr are projected onto the interval [−1, 1] according to the following formula:

x* = (x − xmean) / (xmax − xmin)

where x is the original value of a variable, x* is the standardized data value of each variable in data sets Ce and Cr, a value in the interval [−1, 1]; xmean is the mean of the values of each variable in Ce and Cr; xmax is the maximum of each variable's values in Ce and Cr; and xmin is the minimum of each variable's values in Ce and Cr;
(6) feed-forward neural network training
The data sets De and Dr serve as inputs to the feedforward neural network, and the samples of De and Dr are divided into a training set and a test set with a sample ratio of 450:1; the eigenmode functions IMFe and the residue r corresponding to De and Dr serve as outputs, and IMFe and r are likewise divided into a training set and a test set with a sample ratio of 450:1; the training sets are input into the feedforward neural network in turn to train the prediction models, and training stops when the training-target minimum error falls below 0.001, yielding the prediction models;
(7) testing the test set
The test sets are input into the corresponding prediction models in turn to obtain prediction results; all prediction results are added to obtain the sum of predicted values, and the standard deviation between the sum of the prediction results and the true values is determined.
2. The prediction method for time series data based on empirical mode decomposition and a feedforward neural network according to claim 1, characterized in that in the principal component analysis dimensionality-reduction step (3), the principal component analysis method is:
(1) Constructing the sample matrix
The standardized p-dimensional random vector x of the original data variables is obtained, with n samples of each variable:
x = (x1, x2, …, xp)^T
xi = (xi1, xi2, …, xin)^T, i = 1, 2, …, p
where n and p are finite positive integers with n > p, and the matrix of samples is standardized as follows:
Zij = (xij − x̄j) / Sj, i = 1, 2, …, n; j = 1, 2, …, p
x̄j = (1/n) Σ(i=1..n) xij
Sj² = (1/(n − 1)) Σ(i=1..n) (xij − x̄j)²
where xij is the i-th sample of the j-th variable, Zij is the standardized value of xij, x̄j is the mean of the j-th variable, and Sj² is the sample variance of the j-th variable; the transformation yields the standardized matrix Z;
(2) Determining the correlation coefficient matrix
The correlation coefficient matrix R = (rij)p×p is determined as follows:
rij = (1/(n − 1)) Σ(k=1..n) Zki Zkj, i, j = 1, 2, …, p
i.e. R = Z^T Z / (n − 1), where i and j are finite positive integers;
(3) determining unit eigenvectors
The p characteristic roots λ1 ≥ λ2 ≥ … ≥ λp ≥ 0 are obtained according to the following formula:
|R − λIp| = 0
The value of m is obtained according to the following formula, determining the principal components:
(λ1 + λ2 + … + λm)/(λ1 + λ2 + … + λp) ≥ t
wherein t represents the utilization rate of the information; for each λj, solving
Rbj = λjbj
and normalizing
bj° = bj/‖bj‖
yields the unit eigenvectors bj°, j = 1, 2, …, m;
(4) Converting the normalized variables into principal components
The principal components are determined as follows:
Uij = ziᵀ bj°, j = 1, 2, …, m
wherein zi is the i-th row of the normalized matrix Z;
(5) performing weighted summation on the m principal components obtained, the weight of each being its variance contribution rate, to obtain the final evaluation value.
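For illustration, steps (1)–(5) of the principal component analysis above can be sketched in NumPy; the information utilization threshold t = 0.85 and the random test data are assumptions, not values from the patent:

```python
import numpy as np

def pca_evaluate(X, t=0.85):
    """PCA dimension reduction following steps (1)-(5):
    standardize, correlation matrix, eigen-decomposition,
    keep the m components reaching information utilization t,
    and return the variance-weighted evaluation value."""
    n, p = X.shape
    # (1) standardize each column: Z_ij = (x_ij - mean_j) / S_j
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    # (2) correlation coefficient matrix R = Z^T Z / (n - 1)
    R = Z.T @ Z / (n - 1)
    # (3) characteristic roots of |R - lambda I| = 0, sorted descending
    lam, B = np.linalg.eigh(R)
    order = np.argsort(lam)[::-1]
    lam, B = lam[order], B[:, order]
    # choose m so that the cumulative variance contribution reaches t
    ratio = lam / lam.sum()
    m = int(np.searchsorted(np.cumsum(ratio), t) + 1)
    # (4) project standardized variables onto the unit eigenvectors
    U = Z @ B[:, :m]          # principal components, shape (n, m)
    # (5) weighted sum, weights = variance contribution rates
    score = U @ ratio[:m]
    return U, score

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))  # assumed toy data
U, score = pca_evaluate(X)
```

Note that `numpy.linalg.eigh` already returns unit-norm eigenvectors, so the normalization bj° = bj/‖bj‖ of step (3) is implicit.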
3. The prediction method for time series data based on empirical mode decomposition and feedforward neural network according to claim 1, wherein in the empirical mode decomposition step (4), the steps of empirical mode decomposition are as follows:
(1) finding all maximum points and minimum points of the original time series data sequence x(t), and fitting cubic spline interpolation functions to form the upper and lower envelopes of the data;
(2) the mean m1(t) of the upper and lower envelopes is determined as follows:
m1(t) = (up(t) + low(t))/2
wherein up(t) is the upper envelope formed by the maxima, and low(t) is the lower envelope formed by the minima;
(3) determining eigenmode functions
x(t)-m1(t)=h1(t)
Treating h1(t) as a new signal x(t), steps (1) and (2) are repeated until h1(t) satisfies the following conditions of an eigenmode function:
1) over the entire time range, the number of local extreme points and the number of zero-crossing points are equal or differ by at most 1;
2) at any moment, the mean of the upper envelope defined by the local maxima and the lower envelope defined by the local minima is zero;
(4) the residual component r1(t) is determined as follows:
r1(t) = x(t) − h1(t)
wherein h1(t) is the first eigenmode function;
(5) taking the residual component r1(t) as new original data, steps (1) to (4) are repeated until all eigenmode functions and one trend term are obtained.
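The sifting procedure of steps (1)–(5) can be sketched as follows. This is a simplified illustration: a fixed number of sifting iterations stands in for checking conditions 1)–2) exactly, and SciPy's `CubicSpline` provides the cubic spline envelopes named in step (1):

```python
import numpy as np
from scipy.interpolate import CubicSpline

def envelope_mean(x):
    """Mean m1(t) of cubic-spline envelopes through maxima and minima."""
    t = np.arange(len(x))
    maxima = np.where((x[1:-1] > x[:-2]) & (x[1:-1] > x[2:]))[0] + 1
    minima = np.where((x[1:-1] < x[:-2]) & (x[1:-1] < x[2:]))[0] + 1
    if len(maxima) < 2 or len(minima) < 2:
        return None          # too few extrema: x is a trend/residual term
    up = CubicSpline(maxima, x[maxima])(t)
    low = CubicSpline(minima, x[minima])(t)
    return (up + low) / 2

def emd(x, max_imfs=10, sift_iters=20):
    """Decompose x into IMFs plus one trend term by repeated sifting."""
    imfs, r = [], x.astype(float)
    for _ in range(max_imfs):
        if envelope_mean(r) is None:
            break                        # step (5) stop: r is the trend term
        h = r.copy()
        for _ in range(sift_iters):      # steps (1)-(3): sift h
            m = envelope_mean(h)
            if m is None:
                break
            h = h - m
        imfs.append(h)
        r = r - h                        # step (4): remove the extracted IMF
    return imfs, r

t = np.linspace(0, 1, 500)
x = np.sin(2 * np.pi * 5 * t) + 0.5 * np.sin(2 * np.pi * 25 * t)  # assumed test signal
imfs, trend = emd(x)
```

By construction the decomposition is exact: the sum of the extracted IMFs plus the trend term reconstructs the original series, which is what allows the per-component forecasts of step (6) to be summed in step (7).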
CN202010230486.3A 2020-03-27 2020-03-27 Prediction method for time series data based on empirical mode decomposition and feedforward neural network Active CN111428201B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010230486.3A CN111428201B (en) 2020-03-27 2020-03-27 Prediction method for time series data based on empirical mode decomposition and feedforward neural network

Publications (2)

Publication Number Publication Date
CN111428201A true CN111428201A (en) 2020-07-17
CN111428201B CN111428201B (en) 2023-04-11

Family

ID=71555519

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112633333A (en) * 2020-12-11 2021-04-09 广州致新电力科技有限公司 Method for identifying partial discharge defects
CN113326472A (en) * 2021-05-28 2021-08-31 东北师范大学 Pattern extraction and evolution visual analysis method based on time sequence multivariable data
CN115061196A (en) * 2022-08-17 2022-09-16 成都川油瑞飞科技有限责任公司 Micro-seismic signal identification method based on empirical mode decomposition (IMF) guidance
CN116523388A (en) * 2023-04-17 2023-08-01 无锡雪浪数制科技有限公司 Data-driven quality modeling method based on industrial Internet platform
CN117131369A (en) * 2023-10-27 2023-11-28 福建福昇消防服务集团有限公司 Data processing method and system of intelligent safety management and emergency rescue integrated station
CN117668531A (en) * 2023-12-07 2024-03-08 无锡中科光电技术有限公司 EMMD-BP neural network atmospheric pollutant forecasting method based on principal component analysis

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004053659A2 (en) * 2002-12-10 2004-06-24 Stone Investments, Inc Method and system for analyzing data and creating predictive models
WO2016091017A1 (en) * 2014-12-09 2016-06-16 山东大学 Extraction method for spectral feature cross-correlation vector in hyperspectral image classification
CN106126896A (en) * 2016-06-20 2016-11-16 中国地质大学(武汉) The mixed model wind speed forecasting method learnt based on empirical mode decomposition and the degree of depth and system
CN107292453A (en) * 2017-07-24 2017-10-24 国网江苏省电力公司电力科学研究院 A kind of short-term wind power prediction method based on integrated empirical mode decomposition Yu depth belief network
CN110619384A (en) * 2019-08-13 2019-12-27 浙江工业大学 PM2.5 concentration value prediction method based on neural network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LU GUOBIN et al.: "Research on prediction of time-varying gas emission sequences based on EMD-MFOA-ELM", China Safety Science and Technology *
ZHANG LONG et al.: "Gear fault severity evaluation based on time series models and auto-associative neural networks", Journal of Vibration and Shock *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant