CN112085254B - Prediction method and model based on multi-fractal cooperative measurement gating circulation unit - Google Patents

Prediction method and model based on multi-fractal cooperative measurement gating circulation unit Download PDF

Info

Publication number
CN112085254B
CN112085254B CN202010775496.5A CN202010775496A CN112085254B CN 112085254 B CN112085254 B CN 112085254B CN 202010775496 A CN202010775496 A CN 202010775496A CN 112085254 B CN112085254 B CN 112085254B
Authority
CN
China
Prior art keywords
fractal
data
time sequence
gating
spectrum
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010775496.5A
Other languages
Chinese (zh)
Other versions
CN112085254A (en
Inventor
张冬梅
余想
李金平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN202010775496.5A priority Critical patent/CN112085254B/en
Publication of CN112085254A publication Critical patent/CN112085254A/en
Application granted granted Critical
Publication of CN112085254B publication Critical patent/CN112085254B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Human Resources & Organizations (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Strategic Management (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Data Mining & Analysis (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a prediction method and a model based on multi-fractal cooperative measurement gating circulation unit, wherein the method comprises the following steps: preprocessing the input time sequence data to obtain preprocessed time sequence data; the processed time sequence data is segmented by sliding time windows, the spectrum width delta alpha of the multi-fractal spectrum on each time window is calculated by utilizing a multi-fractal algorithm, a spectrum width matrix E of the multi-fractal spectrum of the time sequence data is obtained, an updated gate weight matrix in a gate control circulation unit is improved to a weight matrix related to the spectrum width matrix E, an improved gate control circulation unit is obtained, and a network model is constructed by utilizing the improved gate control circulation unit to predict. The MF-GRU model provided by the invention adopts a multi-fractal technology to extract fluctuation change degree characteristics of time sequence data, and a dynamic adjustment matrix is arranged to replace the change trend of a non-stationary data section in the traditional updated gate weight matrix learning data so as to improve the prediction accuracy of the non-stationary time sequence data.

Description

Prediction method and model based on multi-fractal cooperative measurement gating circulation unit
Technical Field
The invention relates to the technical field of time sequence prediction, in particular to a prediction method and model based on a multi-fractal synergetic measure gating circulation unit.
Background
Time series prediction plays an important role in various fields of society, such as stock price, air quality, voice recognition, and the like. However, the actual timing problem is not stable, so that the accuracy of predicting the non-stable data sequence by the traditional prediction method is not high.
The traditional time sequence prediction method fits time sequence trend by establishing a proper mathematical model, such as an ARMA model, an ES model, a logistic regression model and the like, but the prediction accuracy is lower due to the multi-variable dependency relationship of difficult data processing. The time sequence prediction model based on machine learning, such as support vector regression, K nearest neighbor algorithm and the like, can better process time sequence data dependent on multiple variables, but is easy to cause poor prediction effect due to over fitting. Deep neural networks, as a highly complex nonlinear system, have certain advantages in processing complex time series data of multivariate dependence. The long-term dependence problem exists in the cyclic neural network model, and a gating mechanism is introduced into Hochreater and Schmidhuber on the basis of RNN to provide a long-term and short-term memory model so as to solve the long-term dependence problem of RNN; chung et al, on the basis of which a gating mechanism is simplified, propose a gating circulation unit (Gated Recurrent Unit, GRU) so that the model has faster training efficiency and equivalent accuracy; the minimum gating unit proposed by Zhou et al further simplifies the model structure on the basis of GRU, and the training efficiency is further improved. However, the above model improves efficiency, converges in prediction performance, and has poor prediction effect on non-stationary time series data.
Disclosure of Invention
Aiming at the defects, the invention provides a prediction method and a model based on multi-fractal synergetic measurement gating circulation unit. According to the invention, the fluctuation change degree of the variable to be predicted is quantized through the spectrum width of the multi-fractal spectrum, and an improved gating circulation unit network is established to realize training and prediction of non-stationary time sequence data.
In order to solve the technical problems, the invention adopts the following technical scheme:
the prediction method based on the multi-fractal cooperative measurement gating cycle unit comprises the following steps:
step 1, preprocessing input time sequence data to obtain preprocessed time sequence data;
step 2, calculating a spectrum function f (alpha) and a spectrum width delta alpha of a multi-fractal spectrum of the preprocessed time series data by utilizing a multi-fractal algorithm, wherein for a time series data sequence X= { X with the length of n 1 ,X 2 ,…,X n Setting the length of a sliding time window as w and the step length as 1, and calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using a multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T
Step 3, improving an updated gate weight matrix in the gate control circulation unit into a weight matrix related to the spectrum width matrix E to obtain an improved gate control circulation unit;
step 4, inputting the preprocessed time sequence data into an improved gate control circulation unit to obtain predicted data; judging whether the current iteration times are greater than or equal to the preset highest iteration times, if so, turning to the step 7, otherwise, turning to the step 4;
step 5, calculating a loss function by using the predicted data and the actual data;
step 6, judging whether the loss function meets the convergence condition, if so, turning to step 7; if not, carrying out error back propagation through an optimization algorithm, updating a weight matrix and a bias matrix of the improved gating circulation unit, and turning to the step 5 after the current iteration times are increased by 1;
and 7, outputting the prediction data.
Further, the method for calculating the spectrum width by using the multi-fractal algorithm in the step 2 specifically includes the following steps:
step 31, dividing the time series data with length N into (N-w) time series with window length w by step length 1, and probability measure P of each window i Expressed as:
Figure BDA0002618207810000031
wherein I is i (w) is the sum of all sample values in the ith interval (i.e. (1, (N-w)) for a time scale of w;
if the time series has multi-fractal characteristics, the following power law relationship is satisfied within the scale-free interval:
P i (w)∝w α
wherein alpha is a mark P corresponding to the ith interval i Singular index of (w) magnitude reflecting P i (w) varying degrees of singular from interval to interval with w;
step 32, handle having the same αThe number of intervals of time scale w is noted as N α (w) decreasing with w, N α (w) increasing, if there is a multi-scale relationship for this time sequence, the following power law relationship is satisfied within the scale-free interval:
N α (w)∝w -f(α)
step 33, defining a distribution function χ of the multi-fractal system q (w) is the q-order moment of the normalized time series data:
Figure BDA0002618207810000032
wherein χ is q (w) is a time series data P reflecting normalization i (w) statistics of non-uniformity, q being a weight factor;
from the self-similarity of fractal, χ is known q (w) and w satisfy the following power law relationship in the scale-free interval:
χ q (w)∝w τ(q)
the logarithm of the two sides is calculated to obtain ln chi q (w) lnw curve, wherein the slope of the curve τ (q) is the mass index, τ (q) can be determined by ln chi q Performing least square fitting on linear points in the (w) to lnw double logarithmic curve to perform estimation;
step 34, on the premise of knowing a q-tau (q) curve, alpha (q) and f (alpha) can be obtained from q and tau (q) through Legend transformation (Legendre transformation);
Figure BDA0002618207810000041
wherein Δα=α maxmin The spectrum width of the multi-fractal spectrum of the time sequence data is obtained, and f (alpha) is a multi-fractal spectrum function of the time sequence data.
Further, the improvement method of the improved gating cycle unit in the step 3 is as follows:
splitting the updated gate weight matrix in the gate control circulation unit into two new updated gate matrices W z1 ,W z2 Respectively with activatedMultiplying the multi-fractal spectrum width matrixes sigma (E) and (1-sigma (E)) to obtain temporary gating output
Figure BDA0002618207810000042
To learn the fluctuation variation characteristics of the data in the non-stationary time series data segment:
Figure BDA0002618207810000043
Figure BDA0002618207810000044
outputting temporary gate control
Figure BDA0002618207810000045
Adding to obtain an integral updated gate z t Participation in calculations, i.e.
Figure BDA0002618207810000046
Other update rules of the improved gating loop unit are consistent with the gating loop unit:
Figure BDA0002618207810000047
a t =tanh(W h ⊙[r t *h t-1 ,x t ]+b a )
h t =h t-1 *(1-z t )+z t *a t
wherein, the liquid crystal display device comprises a liquid crystal display device,
Figure BDA0002618207810000048
representing two temporary update gates, respectively +.>
Figure BDA0002618207810000049
Representing two temporary updated gate matrices, respectively, sigma (x) beingSigmoid activation function, E is the spectrum width matrix of multi-fractal spectrum of input data, z t 、r t To represent the calculation results of the update gate and the reset gate at the time t, h t-1 And h t Output at time (t-1) and time t respectively,/represents multiplication of matrix corresponding elements,/represents point-wise multiplication, W * And b * Representing a weight matrix and a bias vector for the network.
Further, the improved gating loop cell network model comprises an input layer, a hidden layer and an output layer, wherein the input layer is used for inputting time sequence data, and a time sequence X= (X) containing n variable sequences is given 1 ,x 2 ,x 3 ,…,x n ) T T represents a time step, n represents an input dimension, and then the time sequence prediction model based on the gating weight unit is expressed as follows:
y^ T+1 =F(h 1 ,…,h T-1 ,x 1 ,…,x T )
wherein h is t In the hidden layer state, h t E, R, F () is a nonlinear mapping function which needs to be learned by a model hidden layer, y ^ T+1 Is the prediction target of the model, and is the prediction output of the output layer of the next time point; data sequence x= (X) 1 ,x 2 ,x 3 ,…,x n ) T The input matrix at the whole input layer is denoted as (x 1 ,x 2 ,x 3 ,…,x T )∈R n*T
Further, the loss functions in the step 5 and the step 6 are root mean square error loss functions.
Further, the method also comprises the step of training the improved gating cycle unit to obtain model parameters of the improved gating cycle unit, and the training method comprises the following steps:
step 11, selecting a part of data from the input time sequence data as a training set;
step 12, inputting the data of the training set into an improved gating circulation unit to obtain the predicted data of the training set;
and 13, calculating average absolute error, root mean square error and average error percentage of the predicted data and the actual data of the training set, evaluating a model by taking the average absolute error, root mean square error and average error percentage as measurement indexes to obtain optimized model parameters of the improved gating circulation unit, and taking the optimized model parameters as initial model parameters of the improved gating circulation unit.
The prediction model based on the multi-fractal cooperative measurement gating cycle unit comprises a data preprocessing unit, a spectrum width calculation unit based on a multi-fractal algorithm and an improved gating cycle unit, wherein the data preprocessing unit is used for preprocessing input data to obtain preprocessed time sequence data; the spectrum width calculation unit based on the multi-fractal algorithm is used for calculating a spectrum function f (alpha) and a spectrum width delta alpha of a multi-fractal spectrum of the preprocessed time sequence data by using the multi-fractal algorithm, and for a time sequence data sequence X= { X with a length of n 1 ,X 2 ,…,X n Setting the length of a sliding time window as w and the step length as 1, and calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using a multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T The method comprises the steps of carrying out a first treatment on the surface of the The improved gating circulation unit is used for improving the updated gating weight matrix in the gating circulation unit into a weight matrix related to the spectrum width matrix E, obtaining the improved gating circulation unit, predicting the preprocessed time sequence data, carrying out error back propagation through an optimization algorithm after each prediction, updating the weight matrix and the bias vector of the improved gating circulation unit until the number of times of prediction is greater than or equal to the highest iteration number or the loss function meets the convergence condition, and outputting the predicted data.
The beneficial effects of the invention are as follows: the invention provides an improved gating circulation unit (MF-GRU) based on the improvement of the gating circulation unit (GRU), wherein the updated gating weight matrix of the gating circulation unit (GRU) is improved to a weight matrix related to a spectrum width matrix, the fluctuation change characteristics of non-stationary time sequence data are extracted by introducing a multi-fractal technology, and a dynamic adjustment matrix is arranged to replace the traditional updated gating weight matrix so as to learn the change trend of a non-stationary data segment in the data, so that the prediction precision of the non-stationary data can be improved, and the problem that the traditional deep neural network faces the difficulty of learning the non-stationary time sequence data is solved.
The invention will now be described in detail with reference to the drawings and examples.
Drawings
FIG. 1 is a schematic flow chart of the present invention;
FIG. 2 is a schematic diagram of a modified door cycle unit;
FIG. 3 is a schematic diagram showing the influence of the number of hidden layer nodes in embodiment 1;
FIG. 4 is a schematic diagram showing the effect of the number of hidden layers in embodiment 1;
FIG. 5 is a "open reading" prediction curve for a BAC dataset;
FIG. 6 is a "open value" prediction curve for a C dataset;
FIG. 7 is a "open value" prediction curve for a JPM dataset;
FIG. 8 is a plot of "open reading" predictions for an MS data set;
fig. 9 is a "humidity" prediction curve for the USV dataset.
Detailed Description
The principles and features of the present invention are described below with reference to the drawings, the examples are illustrated for the purpose of illustrating the invention and are not to be construed as limiting the scope of the invention.
As shown in fig. 1, the prediction method based on the multi-fractal synergy metric gating cycle unit comprises the following steps:
step 1, preprocessing input time sequence data to obtain preprocessed time sequence data;
step 2, calculating a spectrum function f (alpha) and a spectrum width delta alpha of a multi-fractal spectrum of the preprocessed time series data by utilizing a multi-fractal algorithm, wherein for a time series data sequence X= { X with the length of n 1 ,X 2 ,…,X n Setting the length of a sliding time window as w and the step length as 1, and calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using a multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T
Step 3, improving an updated gate weight matrix in the gate control circulation unit into a weight matrix related to the spectrum width matrix E to obtain an improved gate control circulation unit;
step 4, inputting the preprocessed time sequence data into an improved gate control circulation unit to obtain predicted data; judging whether the current iteration times are greater than or equal to the preset highest iteration times, if so, turning to the step 7, otherwise, turning to the step 5;
step 5, calculating a loss function by using the predicted data and the actual data;
step 6, judging whether the loss function meets the convergence condition, if so, turning to step 7; if not, carrying out error back propagation through an optimization algorithm, updating a weight matrix and a bias vector of the improved gating loop unit, and turning to the step 4 after the current iteration times are increased by 1;
and 7, outputting the prediction data.
As an implementation manner, the method for calculating the spectrum width by using the multi-fractal algorithm in the step 2 specifically includes the following steps:
step 31, dividing the time series data with length N into (N-w) time series with window length w by step length 1, and probability measure P of each window i Expressed as:
Figure BDA0002618207810000071
wherein I is i (w) is the sum of all sample values in the ith interval (i.e. (1, (N-w))) for a time scale of w;
if the time series has multi-fractal characteristics, the following power law relationship is satisfied within the scale-free interval:
P i (w)∝w α
wherein alpha is a mark P corresponding to the ith interval i Singular index of (w) magnitude reflecting P i (w) varying degrees of singular from interval to interval with w;
step 32, counting the number of intervals with the same alpha and time scale w asN α (w) decreasing with w, N α (w) increasing, if there is a multi-scale relationship for this time sequence, the following power law relationship is satisfied within the scale-free interval:
N α (w)∝w -f(α)
step 33, defining a distribution function χ of the multi-fractal system q (w) is the q-order moment of the normalized time series data:
Figure BDA0002618207810000081
wherein χ is q (w) is a time series data P reflecting normalization i (w) statistics of non-uniformity, q being a weight factor;
from the self-similarity of fractal, χ is known q (w) and w satisfy the following power law relationship in the scale-free interval:
χ q (w)∝w τ(q)
the logarithm of the two sides is calculated to obtain ln chi q (w) lnw curve, wherein the slope of the curve τ (q) is the mass index, τ (q) can be determined by ln chi q Performing least square fitting on linear points in the (w) to lnw double logarithmic curve to perform estimation;
step 34, on the premise of knowing a q-tau (q) curve, alpha (q) and f (alpha) can be obtained from q and tau (q) through Legend transformation (Legendre transformation);
Figure BDA0002618207810000091
to sum up, by calculating the probability measure P i (w) the score function χ q And (w) and a quality index tau (q), and obtaining alpha and a multi-fractal spectrum function f (alpha) by adopting least square regression fitting. In the f (alpha) -alpha pattern, the spectral width Δalpha (i.e., alpha maxmin ) The size of the fractal structure reflects the non-uniformity degree of the whole fractal structure on probability measure distribution, and the fluctuation degree of time sequence data is immediately drawn. A larger Δα indicates a more uneven distribution of data within the corresponding time-series data segment,the more severe the fluctuation changes. Thus, for a time-series data sequence x= { X of length n 1 ,X 2 ,…,X n Using the length of the time window as w and the step length as 1, calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using the multi-fractal algorithm i Finally, a spectrum width matrix E=G of a multi-fractal spectrum of the time sequence data can be obtained T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T To extract the characteristic of the fluctuation degree of the time series data.
As shown in fig. 2, as an embodiment, the improved method of the improved gating cycle unit of the step 3 is as follows:
splitting the updated gate weight matrix in the gate control circulation unit into two new updated gate matrices W z1 ,W z2 Multiplying the multi-fractal spectrum width matrices sigma (E) and (1-sigma (E)) with the activated multi-fractal spectrum width matrices respectively to obtain temporary gating output
Figure BDA0002618207810000092
To learn the fluctuation variation characteristics of the data in the non-stationary time series data segment:
Figure BDA0002618207810000093
Figure BDA0002618207810000094
outputting temporary gate control
Figure BDA0002618207810000095
Adding to obtain an integral updated gate z t Participation in calculations, i.e.
Figure BDA0002618207810000096
Other update rules of the improved gating loop unit are consistent with the gating loop unit:
Figure BDA0002618207810000097
a t =tanh(W h ⊙[r t *h t-1 ,x t ]+b a )
h t =h t-1 *(1-z t )+z t *a t
wherein, the liquid crystal display device comprises a liquid crystal display device,
Figure BDA0002618207810000101
representing two temporary update gates, respectively +.>
Figure BDA0002618207810000102
Representing two temporary updated gate matrices respectively, sigma (x) being Sigmoid activation function, E being a spectral width matrix of multi-fractal spectrum of input data, z t 、r t To represent the calculation results of the update gate and the reset gate at the time t, h t-1 And h t Output at time (t-1) and time t respectively,/represents multiplication of matrix corresponding elements,/represents point-wise multiplication, W * And b * Representing a weight matrix and a bias vector for the network.
As one embodiment, the improved gated loop cell network model includes an input layer for the input of time series data, given a time series x= (X) comprising n variable sequences, a hidden layer, and an output layer 1 ,x 2 ,x 3 ,…,x n ) T T represents a time step, n represents an input dimension, and then the time sequence prediction model based on the gating weight unit is expressed as follows:
y ^ T+1 =F(h 1 ,…,h T-1 ,x 1 ,…,x T )
wherein h is t In the hidden layer state, h t E, R, F () is a nonlinear mapping function which needs to be learned by a model hidden layer, y ^ T+1 Is the prediction target of the model, and is the prediction output of the output layer of the next time point; data sequence x= (X) 1 ,x 2 ,x 3 ,…,x n ) T The input matrix at the whole input layer is denoted as (x 1 ,x 2 ,x 3 ,…,x T )∈R n*T
As an embodiment, the loss function in the step 5 and the step 6 is a root mean square error loss function.
As an embodiment, the method further comprises the step of training the improved gating cycle cell to obtain model parameters of the improved gating cycle cell, and the training method comprises the following steps:
step 11, selecting a part of data from the input time sequence data as a training set;
step 12, inputting the data of the training set into an improved gating circulation unit to obtain the predicted data of the training set;
and 13, calculating average absolute error, root mean square error and average error percentage of the predicted data and the actual data of the training set, evaluating a model by taking the average absolute error, root mean square error and average error percentage as measurement indexes to obtain optimized model parameters of the improved gating circulation unit, and taking the optimized model parameters as initial model parameters of the improved gating circulation unit.
The prediction model based on the multi-fractal cooperative measurement gating cycle unit comprises a data preprocessing unit, a spectrum width calculation unit based on a multi-fractal algorithm and an improved gating cycle unit, wherein the data preprocessing unit is used for preprocessing input data to obtain preprocessed time sequence data; the spectrum width calculation unit based on the multi-fractal algorithm is used for calculating a spectrum function f (alpha) and a spectrum width delta alpha of a multi-fractal spectrum of the preprocessed time sequence data by using the multi-fractal algorithm, and for a time sequence data sequence X= { X with a length of n 1 ,X 2 ,…,X n Setting the length of a sliding time window as w and the step length as 1, and calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using a multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T The method comprises the steps of carrying out a first treatment on the surface of the The improved gating cyclic unit is used for improving the updated gating weight matrix in the gating cyclic unit into an AND spectrumAnd updating the weight matrix and the bias vector of the improved gating circulating unit until the number of the predictions is greater than or equal to the highest iteration number or the loss function meets the convergence condition, and outputting the predicted data.
Example 1
1. Example data:
aiming at the problem of poor prediction effect of non-stationary time sequence data in the traditional deep learning method, the fluctuation change degree of the feature to be predicted is quantized by adopting a multi-fractal technology, the performances of 3 models of an improved gating circulation unit (MF-GRU), a gating circulation unit (GRU) and a long-term short-term memory network (LSTM) are subjected to comparative analysis by combining 5 non-stationary time sequence data sets, and the prediction effect based on the multi-fractal collaborative metric gating circulation unit is analyzed and evaluated.
Example data: the 5 multivariate time series data sets included 4 market data sets, 1 air humidity index data set. All data are divided into a training set according to the first 60 percent and a testing set according to the last 40 percent;
the 4 stock datasets include: the public data set recorded in the Kaggle Datasets machine learning library mainly records stock data of each big bank, and each data set has 2517 records and comprises 5 effective characteristics (closing price, opening price, lowest price, highest price and bargain). The four stock datasets are respectively noted as "BAC", "C", "JPM", "MS";
1 air humidity index prediction dataset: the public data set recorded in the UCI machine learning storage library and the machine learning library is mainly recorded in sea surface humidity data of the coastal area of Athens. The dataset has 1672 records containing 4 valid features (device number, humidity, temperature, reporting time) and is denoted "USV".
2. Metric index:
average absolute error (MAE) and average absolute percent error (MAPE), root Mean Square Error (RMSE) were chosen as metrics to evaluate model performance. Three metrics are defined as follows:
Figure BDA0002618207810000121
Figure BDA0002618207810000122
Figure BDA0002618207810000123
where n is the total number of samples, y predict Is the predicted value, y real Is a true value.
3. Parameter tuning:
the time sequence prediction based on the improved gating circulating unit needs to set parameters in the model, the setting of the model parameters can have great influence on the performance of the model, and the model parameters mainly comprise a loss function, an optimization algorithm, a learning rate, the number of hidden layer nodes, the number of hidden layer layers, the number of iterations required by model training and the like.
(1) Prediction effect when nodes with same hidden layer number and different hidden layers are used
The number of hidden layer nodes is sequentially selected from the candidate set {8,16,32,64,96,128} and tested. As a result, as shown in fig. 3, as the number of hidden layer nodes increases, the prediction performance increases first, and when the number of hidden layer nodes is 32 or 64 (the two accuracies are very similar), the highest accuracy in the candidate set parameters is reached, and when the number of hidden layer nodes exceeds 64, the accuracy starts to decrease. As shown in table 1, when the number of hidden layer nodes is selected to be 64, all of the 3 metrics are better than the result when the number of neurons is selected to be 32. The experiment set the number of hidden layer nodes to 64.
TABLE 1 influence of hidden layer node quantity schematic form "
Figure BDA0002618207810000131
Note that: the data represented by gray background is the best result
(2) Prediction effect when same hidden layer node number is different from hidden layer number
After the number of neurons is determined, the number of neurons is set as a fixed value, the number of hidden layers is sequentially selected from a hidden layer number candidate set {1,2 and 3} and tested, the experimental result is drawn into a histogram as shown in fig. 4, the prediction performance is firstly increased along with the increase of the number of hidden layers, and the highest precision in the candidate set parameters is achieved when the number of hidden layers is 2, so that the number of hidden layer numbers is set to 2. As shown in table 2, when the number of hidden layers is 2,3 metrics are optimal;
TABLE 2 influence of number of hidden layers schematic form'
Figure BDA0002618207810000132
Note that: the data represented by gray background is the best result
As can be seen from the above comparative experiments, when the number of hidden layers is 2 and the number of nodes in each hidden layer is 64, the performance of the GRU network model is optimal. The specific model parameter table is shown in table 3;
TABLE 3 parameter set Table "
Parameters (parameters) Value taking
Loss function Root Mean Square Error (RMSE)
Optimization algorithm Adam
Output layer activation function tanh
Learning rate 0.005
Number of input nodes 4
Number of output nodes 1
Number of hidden layers 2
Hidden layer node number 64
Number of iterations 4000
For fairness, all deep learning models in subsequent experiments will use the table 3 parameter settings to set 2 hidden layers, 64 nodes per hidden layer.
4. Comparison of experimental results
In order to verify the performance of the MF-GRU time sequence prediction model, a time sequence prediction model is built based on MF-GRU, LSTM, GRU unit, parameters when GRU takes the best performance and training sets and test sets with the same proportion are adopted for all models to perform time sequence prediction comparison experiments, the precision of each model is compared, and the experimental results of each model on 5 time sequence data sets are shown in table 4. Wherein gray represents the optimal result of the model, as shown in Table 4, the performance of the MF-GRU model on 5 data sets is superior to that of the traditional LSTM and GRU models;
TABLE 4 schematic representation of experimental results "
Figure BDA0002618207810000151
The 5 data set BAC, C, JPM, MS, USV prediction curves used for the experiments are shown in the corresponding graphs of fig. 5, 6, 7, 8 and 9, respectively. The MF-GRU time sequence prediction model has the most similar non-stationary data segments of data sets BAC (figure 5) 0 to 20, data set C (figure 6) 20 to 30, data set JPM (figure 7) 20 to 30, data set MS (figure 8) 90 to 100 and data set USV (figure 9) 25 to 50 with the true values, and the prediction effect is superior to other models. In addition, when the time sequence data set C is predicted, the overall prediction effect of the MF-GRU is obviously better than that of a GRU model. This is because the weight matrix of the multi-fractal spectrum width can dynamically adjust weights for non-stationary data segments to reduce errors;
the result shows that compared with LSTM and GRU models, the MF-GRU model has better prediction performance and higher precision.
The foregoing is illustrative of the best mode of carrying out the invention, and is not presented in any detail as is known to those of ordinary skill in the art. The protection scope of the invention is defined by the claims, and any equivalent transformation based on the technical teaching of the invention is also within the protection scope of the invention.

Claims (7)

1. The prediction method based on the multi-fractal cooperative measurement gating cycle unit is characterized by comprising the following steps of:
step 1, preprocessing the input air humidity index time sequence data to obtain preprocessed time sequence data;
step 2, calculating the spectrum function f (alpha) and the spectrum width delta alpha of the multi-fractal spectrum of the preprocessed air humidity index time sequence data by using a multi-fractal algorithm, wherein for a time sequence data sequence X=with the length of n
{X 1 ,X 2 ,…,X n Time window lengthThe degree is w, the step length is 1, and the spectrum width delta alpha of the multi-fractal spectrum on each time window is calculated by utilizing the multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T
Step 3, improving an updated gate weight matrix in the gate control circulation unit into a weight matrix related to the spectrum width matrix E to obtain an improved gate control circulation unit;
step 4, inputting the preprocessed air humidity index time sequence data into an improved gate control circulating unit to obtain predicted air humidity index data; judging whether the current iteration times are greater than or equal to the preset highest iteration times or not, if so, turning to the step 7, and if not, turning to the step 5;
step 5, calculating a loss function by using the predicted air humidity index data and the actual air humidity index data;
step 6, judging whether the loss function meets the convergence condition, if so, turning to step 7; if not, carrying out error back propagation through an optimization algorithm, updating a weight matrix and a bias vector of the improved gating loop unit, and turning to the step 5 after adding one to the current iteration times;
and 7, outputting predicted air humidity index data.
2. The prediction method based on multi-fractal synergy metric gating cycle unit as recited in claim 1, wherein the method for calculating the spectral width using the multi-analysis algorithm in step 2 specifically comprises the following steps:
step 31, dividing the time series data of the air humidity index with the length N into (N-w) time series with the window length w according to the step length of 1, and measuring the probability P of each window i Expressed as:
Figure QLYQS_1
wherein I is i (w) is the sum of all sample values in the ith interval at a time scale of w(i∈(1,(N-w)));
If the time series has multi-fractal characteristics, the following power law relationship is satisfied within the scale-free interval:
P i (w)∝w α
wherein alpha is a mark P corresponding to the ith interval i Singular index of (w) magnitude reflecting P i (w) varying degrees of singular from interval to interval with w;
step 32, counting the number of intervals with the same alpha and time scale w as N α (w) decreasing with w, N α (w) increasing, if there is a multi-scale relationship for this time sequence, the following power law relationship is satisfied within the scale-free interval:
N α (w)∝w -f(α)
step 33, defining a distribution function χ of the multi-fractal system q (w) is the q-order moment of the normalized time series data:
Figure QLYQS_2
wherein χ is q (w) is a time series data P reflecting normalization i (w) statistics of non-uniformity, q being a weight factor;
from the self-similarity of fractal, χ is known q (w) and w satisfy the following power law relationship in the scale-free interval:
χ q (w)∝w τ(q)
the logarithm of the two sides is calculated to obtain ln chi q (w) lnw curve, wherein the slope of the curve τ (q) is the mass index, τ (q) can be determined by ln chi q Performing least square fitting on linear points in the (w) to lnw double logarithmic curve to perform estimation;
step 34, on the premise of knowing a q-tau (q) curve, alpha (q) and f (alpha) can be obtained from q and tau (q) through Legend transformation (Legendre transformation);
Figure QLYQS_3
wherein Δα=α maxmin ) The spectrum width of the multi-fractal spectrum of the time sequence data is obtained, and f (alpha) is a multi-fractal spectrum function of the time sequence data.
3. The prediction method based on multi-fractal synergy metric gating cycle unit as recited in claim 1, wherein the improved method of the improved gating cycle unit of step 4 is as follows:
splitting the updated gate weight matrix in the gate control circulation unit into two new updated gate matrices W z1 ,W z2 Multiplying the multi-fractal spectrum width matrices sigma (E) and (1-sigma (E)) with the activated multi-fractal spectrum width matrices respectively to obtain temporary gating output
Figure QLYQS_4
To learn the fluctuation variation characteristics of the data in the non-stationary time series data segment:
Figure QLYQS_5
Figure QLYQS_6
outputting temporary gate control
Figure QLYQS_7
Adding to obtain an integral updated gate z t Participation in calculations, i.e.
Figure QLYQS_8
Other update rules of the improved gating loop unit are consistent with the gating loop unit:
Figure QLYQS_9
Figure QLYQS_10
h t =h t-1 *(1-z t )+z t *a t
wherein the method comprises the steps of
Figure QLYQS_11
Representing two temporary update gates, respectively +.>
Figure QLYQS_12
Representing two temporary updated gate matrices respectively, sigma (x) being Sigmoid activation function, E being a spectral width matrix of multi-fractal spectrum of input data, z t 、r t To represent the calculation results of the update gate and the reset gate at the time t, h t-1 And h t Output at time (t-1) and time t respectively,/represents multiplication of matrix corresponding elements,/represents point-wise multiplication, W * And b * Representing a weight matrix and a bias vector for the network.
4. The prediction method based on multi-fractal synergy metric gating cyclic unit as recited in claim 1, wherein the improved gating cyclic unit network model comprises an input layer, a hidden layer and an output layer, the input layer is used for inputting time series data, and a time series x= (X) containing n variable series is given 1 ,x 2 ,x 3 ,...,x n ) T T represents a time step, n represents an input dimension, and then the time sequence prediction model based on the gating weight unit is expressed as follows:
y^ T+1 =F(h 1 ,...,h T-1 ,x 1 ,...,x T )
wherein h is t In the hidden layer state, h t E, R, F ()'s are nonlinear mapping functions that the model hidden layer needs to learn, y ≡ T+1 Is the prediction target of the model, and is the prediction output of the output layer of the next time point; data sequence x= (X) 1 ,x 2 ,x 3 ,...,x n ) T Throughout the whole deliveryThe input matrix of the input layer is denoted as (x 1 ,x 2 ,x 3 ,...,x T )∈R n*T
5. The prediction method based on multi-fractal synergy metric gating cycle unit as recited in claim 1, wherein the loss function in step 5 and step 6 is a root mean square error loss function.
6. The prediction method based on multi-fractal synergy metric gating cycle unit as recited in claim 1, further comprising the step of training the improved gating cycle unit to obtain model parameters of the improved gating cycle unit, the training method comprising the steps of:
step 11, selecting a part of data from the time sequence data of the input air humidity index as a training set;
step 12, inputting the data of the training set into an improved gating circulation unit to obtain the predicted air humidity index time sequence data of the training set;
and 13, calculating average absolute error, root mean square error and average error percentage of the predicted data and the actual data of the training set, evaluating the model by taking the average absolute error, root mean square error and average error percentage as measurement indexes to obtain optimized model parameters of the improved gating circulation unit, and taking the optimized model parameters as initial model parameters of the improved gating circulation unit.
7. The prediction model based on the multi-fractal cooperative measurement gating circulation unit is characterized by comprising a data preprocessing unit, a spectrum width calculation unit based on a multi-fractal algorithm and an improved gating circulation unit, wherein the data preprocessing unit is used for preprocessing input air humidity index time sequence data to obtain preprocessed air humidity index time sequence data; the spectrum width calculation unit based on the multi-fractal algorithm is used for calculating a spectrum function f (alpha) and a spectrum width delta alpha of a multi-fractal spectrum of the preprocessed time sequence data by using the multi-fractal algorithm, and for a time sequence data sequence X= { X with a length of n 1 ,X 2 ,…,X n Using the length of the time window as w and the step length as 1, calculating the spectrum width delta alpha of the multi-fractal spectrum on each time window by using the multi-fractal algorithm i Obtaining a spectrum width matrix E=G of the multi-fractal spectrum T =[Δα 1 ,Δα 2 ,…,Δα n-w ] T The method comprises the steps of carrying out a first treatment on the surface of the The improved gating circulation unit is used for improving an updated gating weight matrix in the gating circulation unit into a weight matrix related to the spectrum width matrix E, obtaining the improved gating circulation unit, predicting the preprocessed air humidity index time sequence data, carrying out error counter propagation through an optimization algorithm after each prediction, updating the weight matrix and the bias vector of the improved gating circulation unit until the number of predictions is greater than or equal to the highest iteration number or when a loss function meets a convergence condition, and outputting the predicted air humidity index time sequence data.
CN202010775496.5A 2020-08-05 2020-08-05 Prediction method and model based on multi-fractal cooperative measurement gating circulation unit Active CN112085254B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010775496.5A CN112085254B (en) 2020-08-05 2020-08-05 Prediction method and model based on multi-fractal cooperative measurement gating circulation unit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010775496.5A CN112085254B (en) 2020-08-05 2020-08-05 Prediction method and model based on multi-fractal cooperative measurement gating circulation unit

Publications (2)

Publication Number Publication Date
CN112085254A CN112085254A (en) 2020-12-15
CN112085254B true CN112085254B (en) 2023-06-13

Family

ID=73735815

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010775496.5A Active CN112085254B (en) 2020-08-05 2020-08-05 Prediction method and model based on multi-fractal cooperative measurement gating circulation unit

Country Status (1)

Country Link
CN (1) CN112085254B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113783099B (en) * 2021-04-15 2024-02-20 北京科益虹源光电技术有限公司 Depth GRU-based excimer laser dose control method and device
CN113346969B (en) * 2021-06-07 2022-04-15 中山大学 Frequency spectrum sensing method and system based on gated cyclic unit
CN114785824B (en) * 2022-04-06 2024-05-14 深圳前海用友力合科技服务有限公司 Intelligent Internet of things big data transmission method and system
CN115564078A (en) * 2022-09-28 2023-01-03 电子科技大学长三角研究院(湖州) Short-time-domain prediction method, system, equipment and terminal for parking space occupancy rate of parking lot
CN116502774B (en) * 2023-06-26 2023-09-12 南京信息工程大学 Time sequence prediction method based on time sequence decomposition and Legend projection

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105719029A (en) * 2016-03-11 2016-06-29 中国农业大学 Combined wind power prediction method based on wind speed fluctuation characteristic extraction
CN106529745A (en) * 2016-12-16 2017-03-22 中国地质大学(武汉) Stock data processing method and device
CN107120111A (en) * 2017-03-24 2017-09-01 中国地质大学(武汉) A kind of appraisal procedure and system of the oil reservoir interwell communication degree based on multi-fractal
CN109816095A (en) * 2019-01-14 2019-05-28 湖南大学 Based on the network flow prediction method for improving gating cycle neural network
CN110090012A (en) * 2019-03-15 2019-08-06 上海图灵医疗科技有限公司 A kind of human body diseases detection method and testing product based on machine learning
CN110532471A (en) * 2019-08-27 2019-12-03 华侨大学 Active Learning collaborative filtering method based on gating cycle unit neural network

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025338B (en) * 2017-03-27 2020-04-03 北京工业大学 Recursive RBF neural network-based sludge bulking fault identification method
US20190138887A1 (en) * 2017-11-01 2019-05-09 Board Of Trustees Of Michigan State University Systems, methods, and media for gated recurrent neural networks with reduced parameter gating signals and/or memory-cell units

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105719029A (en) * 2016-03-11 2016-06-29 中国农业大学 Combined wind power prediction method based on wind speed fluctuation characteristic extraction
CN106529745A (en) * 2016-12-16 2017-03-22 中国地质大学(武汉) Stock data processing method and device
CN107120111A (en) * 2017-03-24 2017-09-01 中国地质大学(武汉) A kind of appraisal procedure and system of the oil reservoir interwell communication degree based on multi-fractal
CN109816095A (en) * 2019-01-14 2019-05-28 湖南大学 Based on the network flow prediction method for improving gating cycle neural network
CN110090012A (en) * 2019-03-15 2019-08-06 上海图灵医疗科技有限公司 A kind of human body diseases detection method and testing product based on machine learning
CN110532471A (en) * 2019-08-27 2019-12-03 华侨大学 Active Learning collaborative filtering method based on gating cycle unit neural network

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于SCSO-GRU模型的网络流量预测;高佰宏;刘朝晖;刘华;;计算机与现代化(04);全文 *
基于多重分形主曲线模型多目标演化算法研究;张冬梅;龚小胜;戴光明;;计算机研究与发展(09);全文 *

Also Published As

Publication number Publication date
CN112085254A (en) 2020-12-15

Similar Documents

Publication Publication Date Title
CN112085254B (en) Prediction method and model based on multi-fractal cooperative measurement gating circulation unit
CN110705692B (en) Nonlinear dynamic industrial process product prediction method of space-time attention network
CN110730046B (en) Cross-frequency-band spectrum prediction method based on deep migration learning
CN110705743B (en) New energy consumption electric quantity prediction method based on long-term and short-term memory neural network
CN110909926A (en) TCN-LSTM-based solar photovoltaic power generation prediction method
CN111860982A (en) Wind power plant short-term wind power prediction method based on VMD-FCM-GRU
CN111861013B (en) Power load prediction method and device
CN111680786B (en) Time sequence prediction method based on improved weight gating unit
Dong et al. An integrated deep neural network approach for large-scale water quality time series prediction
CN115308558B (en) Method and device for predicting service life of CMOS (complementary metal oxide semiconductor) device, electronic equipment and medium
CN111931983A (en) Precipitation prediction method and system
CN113935513A (en) CEEMDAN-based short-term power load prediction method
CN116187835A (en) Data-driven-based method and system for estimating theoretical line loss interval of transformer area
CN114694379B (en) Traffic flow prediction method and system based on self-adaptive dynamic graph convolution
CN115049115A (en) RDPG wind speed correction method considering NWP wind speed transverse and longitudinal errors
CN115062528A (en) Prediction method for industrial process time sequence data
CN113850438A (en) Public building energy consumption prediction method, system, equipment and medium
CN113159395A (en) Deep learning-based sewage treatment plant water inflow prediction method and system
CN117494545A (en) Circuit breaker aging fault rate prediction method based on hybrid learning method
CN116709409A (en) Knowledge distillation-based lightweight spectrum prediction method
CN116933025A (en) Transformer top layer oil temperature prediction method based on VMD and DBO-LSTM-AT
CN115860277A (en) Data center energy consumption prediction method and system
CN115759343A (en) E-LSTM-based user electric quantity prediction method and device
CN115796327A (en) Wind power interval prediction method based on VMD (vertical vector decomposition) and IWOA-F-GRU (empirical mode decomposition) -based models
CN114707684A (en) Improved LSTM-based raw tobacco stack internal temperature prediction algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant