CN111160650B

CN111160650B - Adaboost algorithm-based traffic flow characteristic analysis and prediction method

Info

Publication number: CN111160650B
Application number: CN201911401878.5A
Authority: CN
Inventors: 文成林; 郑乐军; 沈硕; 尉涛
Original assignee: Hangzhou Dianzi University; State Grid Hubei Electric Power Co Ltd
Current assignee: Hangzhou Dianzi University; State Grid Hubei Electric Power Co Ltd
Priority date: 2019-12-31
Filing date: 2019-12-31
Publication date: 2022-08-09
Anticipated expiration: 2039-12-31
Also published as: CN111160650A

Abstract

The invention discloses a traffic flow characteristic analysis and prediction method based on the Adaboost algorithm. To address the tendency of neural networks to fall into local optima, the invention uses the mind evolutionary computation (MEC) algorithm to perform a directed search for the optimal initial parameters of the neural network. To address the poor generalization of a neural network on new sample sets, the optimized neural networks are integrated with the Adaboost algorithm. On this basis, the weights that the Adaboost algorithm assigns to the weak predictors are readjusted using the criterion of the reciprocal of the sum of squared prediction errors, so that each predictor contributes as much as possible to the prediction accuracy of the network. The invention improves traffic flow prediction accuracy and adapts well to different traffic flow states.

Description

Adaboost algorithm-based traffic flow characteristic analysis and prediction method

Technical Field

The invention belongs to the field of intelligent traffic, and particularly relates to a traffic flow characteristic analysis and prediction method based on Adaboost algorithm.

Background

An urban traffic control system regulates the traffic flow in an urban road network so that traffic streams use intersections in a time-shared manner, which avoids traffic accidents, prevents congestion, and provides timely traffic condition information to vehicle occupants and pedestrians to improve traffic safety. To achieve this control, the system must know the real-time traffic conditions immediately. However, the same prediction method achieves different traffic flow prediction accuracy in different time periods and regions, and different prediction methods applied to the same data set can give widely differing results.

At present, traffic flow analysis mainly performs chaos identification using chaos theory tools such as the recurrence plot method, the Kolmogorov entropy and the Lyapunov exponent, in order to judge whether the traffic flow is predictable. However, most of these methods require a large sample size, and the calculation methods are not yet mature enough to give comparable measurements. Many theories and methods exist for traffic state prediction models. Existing short-term traffic flow prediction methods fall roughly into two categories: deterministic mathematical model methods and knowledge-based intelligent model methods. For example, Kalman filtering has been proposed for traffic flow prediction; it has few model parameters and is relatively simple to compute, but it has difficulty reflecting the nonlinearity and uncertainty of the traffic flow prediction process. Optimizing a neural network with a genetic algorithm alleviates problems such as slow convergence and poor generalization, but the evolutionary search over the whole population is inefficient. Early neural networks use traditional BP learning to correct the hidden-layer weights, which cannot effectively find the global minimum of multi-peak or non-differentiable functions; the random assignment of network parameters and the sensitivity to initial values make the simulation results of neural network models unstable in engineering applications. A further disadvantage of conventional BP learning is that it cannot learn online: enough samples must be accumulated for batch training, so the network parameters cannot be adjusted in real time as new samples arrive.

Disclosure of Invention

Aiming at the defects of the prior art, the invention provides a traffic flow characteristic analysis and prediction method based on Adaboost algorithm.

The technical scheme adopted by the invention for solving the technical problem is as follows:

The invention specifically comprises the following steps:

Step (1) short-term characteristic analysis of traffic flow based on the R/S analysis method

Step (1-1) calculation step:

Let a time series {x(t)}, t = 1, 2, …, M be given; the calculation proceeds as follows.

1) Divide the series into A = [M/n] equal-length subsequences of length n. I_a denotes the a-th subsequence segment, whose elements are written {x(i)}, i = 1, 2, …, n. E_a denotes the mean over the a-th subsequence segment:

2) The cumulative deviation X(i, a) of the elements of subsequence segment I_a from the mean:

3) The range of subsequence segment I_a

and the sample standard deviation

4) The rescaled range value (R/S)_n for subsequences of length n:
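The formulas belonging to steps 1) to 4) are not preserved in this text. For orientation, the textbook form of rescaled range analysis is sketched below. It is an assumption that the original uses exactly these expressions; in particular the 1/n normalisation of the standard deviation and the symbols R_a and S_a are chosen here for illustration.

```latex
\begin{aligned}
E_a &= \frac{1}{n}\sum_{i=1}^{n} x_a(i), \\
X(i,a) &= \sum_{k=1}^{i}\bigl(x_a(k) - E_a\bigr), \qquad i = 1,\dots,n, \\
R_a &= \max_{1\le i\le n} X(i,a) - \min_{1\le i\le n} X(i,a), \\
S_a &= \sqrt{\frac{1}{n}\sum_{i=1}^{n}\bigl(x_a(i) - E_a\bigr)^{2}}, \\
(R/S)_n &= \frac{1}{A}\sum_{a=1}^{A}\frac{R_a}{S_a}, \qquad A = \lfloor M/n \rfloor .
\end{aligned}
```

In this standard formulation the Hurst exponent H is the slope of the regression log (R/S)_n = log c + H log n over a range of n, and the statistic V_n = (R/S)_n / sqrt(n) is commonly used to locate the point where long memory fades.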

Step (1-2) analysis process:

Time series can be classified into three types according to the value of the Hurst exponent H:

(1) 0 < H < 0.5 indicates that the sequence is not a random walk but an anti-correlated time series, i.e. the future trend is opposite to the past trend; the closer H is to 0, the stronger the anti-persistence.

(2) H = 0.5 indicates that the sequence is a standard random walk, i.e. the future trend is unrelated to past increments.

(3) 0.5 < H < 1 indicates that the time series is persistent: a past increasing trend predicts a future increasing trend, and a past decreasing trend predicts a future decreasing trend. The closer H is to 1, the more closely the past is related to the future. The future trend of the time series can thus be analyzed quantitatively according to its persistence or anti-persistence.
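The R/S calculation and the three-way classification above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the population standard deviation (1/n) and the tolerance band around 0.5 are all hypothetical choices.

```python
import numpy as np

def rescaled_range(x, n):
    """Average R/S over the floor(len(x)/n) non-overlapping segments of length n."""
    x = np.asarray(x, dtype=float)
    A = len(x) // n
    segs = x[:A * n].reshape(A, n)
    dev = segs - segs.mean(axis=1, keepdims=True)   # deviation from segment mean E_a
    cum = np.cumsum(dev, axis=1)                     # cumulative deviation X(i, a)
    R = cum.max(axis=1) - cum.min(axis=1)            # range of each segment
    S = segs.std(axis=1)                             # population std (assumption)
    ok = S > 0                                       # skip constant segments
    return float(np.mean(R[ok] / S[ok]))

def hurst_exponent(x, sizes):
    """Least-squares slope of log (R/S)_n against log n."""
    rs = [rescaled_range(x, n) for n in sizes]
    slope, _ = np.polyfit(np.log(sizes), np.log(rs), 1)
    return float(slope)

def classify(h, tol=0.05):
    """Map H to the three cases; tol is an illustrative band around 0.5."""
    if h < 0.5 - tol:
        return "anti-persistent"
    if h > 0.5 + tol:
        return "persistent"
    return "random walk"
```

For example, `hurst_exponent(flow, [16, 32, 64, 128, 256])` on a series `flow` returns one estimate of H, which `classify` maps to one of the three cases. R/S estimates are known to be biased upward for short segments, so values slightly above 0.5 on uncorrelated data are expected.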

Step (2) traffic flow time sequence phase space reconstruction

Phase space reconstruction theory is a vital part of chaotic system analysis. Constructing a phase space from the traffic flow time series data reveals the regularities hidden in the evolution of the sequence and exposes useful internal information. Let the traffic flow time series be {x(t)}, t = 1, 2, …, N.

Let the time delay be τ and the embedding dimension be m; the m-dimensional phase space vectors constructed by the delay-coordinate phase space reconstruction method are:

X = {X(t) | X(t) = [x(t), x(t+τ), …, x(t+(m-1)τ)]^T, t = 1, 2, …, M} (6)

where X is an M × m matrix and the number of phase points in the reconstructed phase space is M = N - (m-1)τ. The M phase points form a phase portrait in the m-dimensional phase space; a phase point represents the state of the traffic flow system at a given moment, and connecting the phase points in order of increasing time describes the evolution trajectory of the traffic flow system in the m-dimensional phase space. The original one-dimensional time series prediction problem is thereby converted into the prediction of a sequence of m-dimensional phase points. Assume that the phase points {X(t), X(t-1), …, X(t-k)}, k = 1, 2, …, t-1, are known and that the phase points to be predicted at the current time t+(m-1)τ are {X(t+1), X(t+2), …, X(t+p)}, where p = 1 is called one-step prediction and p > 1 is called multi-step prediction. The prediction model can then be expressed as:

{x(t+(m-1)τ+1), …, x(t+(m-1)τ+p)} = F(X(t), …, X(t-k)) (7)

The universal approximation capability of the feedforward neural network is used to realize one-step or multi-step prediction of the traffic flow. The method uses the C-C method to calculate the embedding dimension and the delay time, and calculates the maximum Lyapunov exponent of the traffic flow with the Wolf method to judge whether the traffic flow is chaotic.
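The reconstruction in Eq. (6) and the input/target pairing of Eq. (7) can be illustrated with NumPy. This is a sketch under simplifying assumptions: the helper names are hypothetical, m and τ are passed in directly rather than estimated with the C-C method, and each target is paired with a single phase point X(t) (the case k = 0) instead of a history of phase points.

```python
import numpy as np

def delay_embed(x, m, tau):
    """Rows are X(t) = [x(t), x(t+tau), ..., x(t+(m-1)tau)] for t = 1..M, M = N-(m-1)tau."""
    x = np.asarray(x, dtype=float)
    M = len(x) - (m - 1) * tau
    if M <= 0:
        raise ValueError("series too short for this m and tau")
    idx = np.arange(M)[:, None] + tau * np.arange(m)[None, :]
    return x[idx]                                    # shape (M, m)

def make_samples(x, m, tau, p=1):
    """Pair each phase point with the p values that follow its last coordinate."""
    x = np.asarray(x, dtype=float)
    X = delay_embed(x, m, tau)
    last = np.arange(len(X)) + (m - 1) * tau         # 0-based index of x(t+(m-1)tau)
    keep = last + p < len(x)                         # drop rows without p future values
    targets = np.stack([x[last[keep] + s] for s in range(1, p + 1)], axis=1)
    return X[keep], targets
```

With τ = 1 and m = 4 the rows reduce to a plain sliding window of four consecutive flow values, and `make_samples(flow, 4, 1, p)` yields inputs and targets whose widths match a 4-input, p-output network.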

Step (3) MEC-BP fusion algorithm

The mind evolutionary computation (MEC) algorithm is a novel evolutionary algorithm that addresses the shortcomings of the genetic algorithm by simulating the evolution of human thinking. It inherits some ideas of the genetic algorithm and introduces two new operators, "similartaxis" (convergence) and "dissimilation". Similartaxis and dissimilation are responsible for local and global optimization respectively; the two operators are independent of each other yet coordinated, so improving either one improves the overall search efficiency of the algorithm, and the algorithm's directed learning and memory mechanisms give it a very strong global optimization capability.

Let t be the current iteration number of the MEC global iteration and ρ the current iteration number within a given sub-population. Each individual in a sub-population represents one set of initial weights and thresholds for the BP neural network fusion algorithm. The fitness of a single individual N_{i,j} is computed from the fusion result produced by the BP neural network fusion model after it has been trained to convergence. In the internal iteration of the sub-populations, the optimal individual N_{i,pbest} of each sub-population is selected through a local bulletin board; this individual then represents the whole sub-population in the global competition through the global bulletin board, which selects the globally optimal sub-population S_gbest and the globally optimal individual N_gbest it contains. After multiple iterations, the MEC-BP neural network model obtained by training from the initial weights and thresholds represented by the final globally optimal individual is the final multi-source traffic data fusion model.
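The interplay of the local and global bulletin boards can be sketched as a small optimizer. This is a toy illustration, not the patented procedure: `mec_minimize`, its parameters, the Gaussian scattering and the shrinking step size are all assumptions, and a generic `loss` function stands in for the fitness obtained by training a BP network from the candidate initial weights and thresholds.

```python
import numpy as np

def mec_minimize(loss, dim, n_groups=5, group_size=20, iters=30, sigma=0.5, seed=0):
    """Toy MEC loop: similartaxis inside each sub-population, dissimilation across them."""
    rng = np.random.default_rng(seed)
    centres = rng.uniform(-1.0, 1.0, size=(n_groups, dim))   # one centre per sub-population
    best_x, best_f = None, np.inf
    for _ in range(iters):
        scores = np.empty(n_groups)
        for g in range(n_groups):
            # Similartaxis: scatter individuals around the centre and keep the
            # winner (local bulletin board). The centre itself is a candidate,
            # so a sub-population never gets worse.
            scattered = centres[g] + sigma * rng.standard_normal((group_size, dim))
            cand = np.vstack([centres[g], scattered])
            f = np.array([loss(c) for c in cand])
            centres[g] = cand[f.argmin()]
            scores[g] = f.min()
        # Global bulletin board: remember the best individual seen so far.
        g_best = int(scores.argmin())
        if scores[g_best] < best_f:
            best_f, best_x = float(scores[g_best]), centres[g_best].copy()
        # Dissimilation: the worst sub-population is released and re-seeded.
        centres[int(scores.argmax())] = rng.uniform(-1.0, 1.0, size=dim)
        sigma *= 0.9                                          # narrow the local search
    return best_x, best_f
```

In an MEC-BP setting the vector passed to `loss` would be the flattened initial weights and thresholds of the network, and `loss` would return the error of the network trained from that starting point.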

Step (4) neural network integrated prediction model based on Adaboost algorithm

The adaptive boosting algorithm (Adaboost) obtains sample weights by repeatedly searching the sample feature space and continuously adjusts the weights of the training samples during iteration, increasing (reducing) the weight of samples with low (high) prediction accuracy. The weak predictors are then combined by weighted majority voting to form a strong predictor: the weight of a weak predictor with a smaller (larger) prediction error rate is increased (reduced) so that it plays a larger (smaller) role in the vote, which markedly improves the prediction performance of the learning algorithm.
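The reweighting idea can be sketched as a boosting loop for regression. Because the exact update formulas are not preserved in this text, the sketch borrows the AdaBoost.R2 rules (linear relative error, beta = xi / (1 - xi)) and combines the predictors with a normalised weighted sum; both choices are assumptions. The weak predictor is a weighted straight-line fit, a hypothetical stand-in for an MEC-optimized BP network.

```python
import numpy as np

def fit_weak(x, y, w):
    """Hypothetical weak predictor: weighted straight-line fit."""
    # np.polyfit applies w to the residuals before squaring, hence the square root.
    coef = np.polyfit(x, y, deg=1, w=np.sqrt(w))
    return lambda z: np.polyval(coef, z)

def adaboost_r2(x, y, K=10):
    """Boost up to K weak predictors; returns the strong predictor and its weights."""
    m = len(x)
    w = np.full(m, 1.0 / m)                  # uniform initial sample weights
    models, alphas = [], []
    for _ in range(K):
        h = fit_weak(x, y, w)
        err = np.abs(y - h(x))
        E = err.max()                        # maximum error on the training set
        if E == 0:                           # perfect fit: keep it and stop
            models.append(h)
            alphas.append(1.0)
            break
        rel = err / E                        # relative error of each sample
        xi = float(np.sum(w * rel))          # weighted regression error rate
        if xi >= 0.5:                        # AdaBoost.R2 stopping condition
            break
        beta = xi / (1.0 - xi)
        models.append(h)
        alphas.append(np.log(1.0 / beta))    # more accurate predictors weigh more
        w = w * beta ** (1.0 - rel)          # well-predicted samples lose weight
        w /= w.sum()
    if not models:
        raise ValueError("first weak predictor already exceeds the error bound")
    a = np.array(alphas) / np.sum(alphas)
    return (lambda z: sum(ai * h(z) for ai, h in zip(a, models))), a
```

Calling `strong, a = adaboost_r2(x, y, K=10)` returns a callable ensemble and the normalised predictor weights; the original AdaBoost.R2 uses a weighted median instead of the weighted sum shown here.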

Step (4-1) Adaboost algorithm step

Step 1: data acquisition and network initialization. Select m groups of training samples T = {(X_i, y_i)}, i = 1, 2, …, m, from the sample space and give the training samples the initial weight distribution w_{1i} = 1/m. The network structure is determined by the input and output dimensions of the samples, and the initial weights and thresholds of the neural network are obtained by optimization with the improved mind evolutionary algorithm. D(1) denotes the initial sample weights and K denotes the number of predictors.

D(1) = (w_{11}, w_{12}, …, w_{1i}, …, w_{1m}) (8)

Step 2: iterate for k = 1, 2, …, K

(a) When training the k-th weak predictor, use the weak predictor H_k(x) to train on the samples and predict the training data, output the regression error rate ξ_k, and calculate the maximum error E_k of the samples on the training set and the relative error ξ_ki of each sample:

E_k = max |y_i - H_k(X_i)| (9)

(b) Calculate the weight a_k of the weak predictor in the final predictor:

(c) Adjust the weights of the next round of training samples according to the prediction sequence weight a_k:

D(k+1) = (w_{k+1,1}, w_{k+1,2}, …, w_{k+1,m}) (13)

Step 3: after K rounds of training, K groups of weak prediction functions H_k(x) are obtained. Combining them according to the weak predictor weights gives the strong predictor h(x), as follows:

Step 4: to better determine the weight of each group of weak predictors, after the weak prediction function values H_k(x) of the K groups of weak predictors have been obtained by training the MEC-BP neural networks with the Adaboost algorithm, the weight w_k of each group of weak prediction functions is solved again using the criterion of the reciprocal of the sum of squared prediction errors, giving the accumulated strong predictor h(x) = Σ w_k * H_k(x_k, a_k). The larger the sum of squared prediction errors, the lower the prediction accuracy of that model and the lower its importance in the combined prediction; conversely, a single prediction model with a smaller sum of squared prediction errors is assigned a larger weighting coefficient in the combined prediction. The weighting coefficients are calculated as follows:

Let y_ki be the predicted value of the k-th weak predictor at the i-th time, y_i the observed value of the same prediction object at the i-th time, m the length of the time series, and E_k the sum of squared prediction errors of the k-th weak predictor.
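The weighting formula itself is not preserved in this text. A common form of the reciprocal-of-squared-error rule in combination forecasting sets E_k to the sum over i of (y_i - y_ki)^2 and w_k to (1/E_k) divided by the sum of all 1/E_j. The sketch below assumes that form; the function names are hypothetical.

```python
import numpy as np

def inverse_sse_weights(preds, y):
    """preds: (K, m) array whose row k holds y_ki; y: (m,) observed values y_i."""
    preds = np.asarray(preds, dtype=float)
    y = np.asarray(y, dtype=float)
    E = np.sum((preds - y) ** 2, axis=1)     # E_k: sum of squared errors of predictor k
    inv = 1.0 / E                            # a predictor with E_k = 0 needs special handling
    return inv / inv.sum()                   # w_k, normalised to sum to 1

def combine(preds, w):
    """Weighted sum of the K weak predictions at every time step."""
    return np.asarray(w, dtype=float) @ np.asarray(preds, dtype=float)
```

For two predictors with squared-error sums 1 and 4, the weights come out as 0.8 and 0.2, so the more accurate predictor dominates the combined forecast.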

The beneficial effects of the invention are as follows. In view of the uncertainty, complexity and strong nonlinearity of short-term traffic flow, the R/S analysis method is applied to short-term traffic flow analysis; it reveals the internal laws of microscopic traffic flow movement and quantitatively characterizes the dynamics of the traffic system. The mind evolutionary algorithm is used to optimize the selection of the initial parameters of the BP neural network, which improves the prediction accuracy of the network. Several networks optimized by the mind evolutionary algorithm are integrated with the Adaboost algorithm into an effective combined decision, which improves generalization. On this basis, the weights that the Adaboost algorithm assigns to the weak predictors are readjusted using the criterion of the reciprocal of the sum of squared prediction errors, so that each predictor contributes as much as possible to the prediction accuracy of the network. The resulting integrated neural network prediction model is then used to predict short-term traffic flow on a PeMS data set.

Drawings

FIG. 1 is a block diagram of the MEC-BP fusion algorithm structure.

FIG. 2 is a diagram of an integrated neural network architecture based on the Adaboost algorithm.

FIG. 3 shows the traffic flow time series at different statistical scales over 5 consecutive days.

FIG. 4 shows the real-part time-frequency distribution of the wavelet transform of the short-term traffic flow at a 5 min interval.

FIG. 5 shows the Hurst exponent curves at different statistical scales.

FIG. 6 shows the curves of log(R/S)_n and V_n against log n for the traffic flow time series at a statistical scale of 10 min.

FIG. 7 shows the curves of V_n against log n obtained at different statistical scales for the same time length.

FIG. 8 is a three-dimensional phase space reconstruction of traffic flow sequences of different statistical scales.

FIG. 9 is a comparison of predicted values of short-term traffic flow under different models.

FIG. 10 is a comparison of the absolute errors of the short-term traffic flow predictions under different models.

Detailed Description

The invention comprises the following steps:

Step (1-1) calculation step:

Let a time series {x(t)}, t = 1, 2, …, M be given; the calculation proceeds as follows.

3) The range of subsequence segment I_a

and the sample standard deviation

4) The rescaled range value (R/S)_n for subsequences of length n:

Step (1-2) analysis process:

Step (2) traffic flow time sequence phase space reconstruction

X = {X(t) | X(t) = [x(t), x(t+τ), …, x(t+(m-1)τ)]^T, t = 1, 2, …, M} (6)

{x(t+(m-1)τ+1), …, x(t+(m-1)τ+p)} = F(X(t), …, X(t-k)) (7)

Step (3) MEC-BP fusion algorithm

Referring to FIG. 1, t is the current iteration number of the MEC global iteration and ρ is the current iteration number within a given sub-population. Each individual in a sub-population represents one set of initial weights and thresholds for the BP neural network fusion algorithm. The fitness of a single individual N_{i,j} is computed from the fusion result produced by the BP neural network fusion model after it has been trained to convergence. In the internal iteration of the sub-populations, the optimal individual N_{i,pbest} of each sub-population is selected through a local bulletin board; this individual then represents the whole sub-population in the global competition through the global bulletin board, which selects the globally optimal sub-population S_gbest and the globally optimal individual N_gbest it contains. After multiple iterations, the MEC-BP neural network model obtained by training from the initial weights and thresholds represented by the final globally optimal individual is the final multi-source traffic data fusion model.

Step (4) neural network integrated prediction model based on Adaboost algorithm

The adaptive boosting algorithm (Adaboost) obtains sample weights by repeatedly searching the sample feature space and continuously adjusts the weights of the training samples during iteration, increasing (reducing) the weight of samples with low (high) prediction accuracy. The weak predictors are then combined by weighted majority voting to form a strong predictor: the weight of a weak predictor with a smaller (larger) prediction error rate is increased (reduced) so that it plays a larger (smaller) role in the vote, which markedly improves the prediction performance of the learning algorithm, as shown in FIG. 2.

Step (4-1) Adaboost algorithm step

D(1) = (w_{11}, w_{12}, …, w_{1i}, …, w_{1m}) (8)

Step 2: iterate for k = 1, 2, …, K

E_k = max |y_i - H_k(X_i)| (9)

D(k+1) = (w_{k+1,1}, w_{k+1,2}, …, w_{k+1,m}) (13)

Step 4: to better determine the weight of each group of weak predictors, after the weak prediction function values H_k(x) of the K groups of weak predictors have been obtained by training the MEC-BP neural networks with the Adaboost algorithm, the weight w_k of each group of weak prediction functions is solved again using the criterion of the reciprocal of the sum of squared prediction errors, giving the accumulated strong predictor h(x) = Σ w_k * H_k(x_k, a_k). The larger the sum of squared prediction errors, the lower the prediction accuracy of that model and the lower its importance in the combined prediction; conversely, a single prediction model with a smaller sum of squared prediction errors is assigned a larger weighting coefficient in the combined prediction. The weighting coefficients are calculated as follows:

Let y_ki be the predicted value of the k-th weak predictor at the i-th time, y_i the observed value of the same prediction object at the i-th time, m the length of the time series, and E_k the sum of squared prediction errors of the k-th weak predictor.

Step (5) loading a PeMS data set to carry out traffic flow simulation test

To verify the effectiveness of the invention, two types of source data from the PeMS system are used: 30-second traffic flow and lane occupancy. The system aggregates the 30-second data into data sets at 5 min, 15 min, 1 hour and other intervals. Experimental data set 1: aggregated traffic flow of a single road section over 4 consecutive working days, from May 2 to May 5, 2011, recorded at a 5 min statistical scale. Experimental data set 2: aggregated traffic flow of 3 roads over 5 consecutive days, from June 1 to June 5, 2011 (Wednesday to Sunday), with 24 continuous hours per day as the observation time; the traffic flow data are recorded at statistical scales of 5, 10, 15 and 20 min, among others, giving 1440, 720, 480 and 360 data points respectively.
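The data counts quoted for data set 2 follow directly from the aggregation arithmetic, which can be checked with a short NumPy sketch. The function name and the placeholder input are hypothetical, and summing is assumed for vehicle counts (an occupancy series would be averaged instead).

```python
import numpy as np

def aggregate_counts(counts_30s, minutes):
    """Sum consecutive 30-second vehicle counts into bins of `minutes` minutes."""
    counts_30s = np.asarray(counts_30s)
    per_bin = minutes * 2                            # two 30-second samples per minute
    n_bins = len(counts_30s) // per_bin              # incomplete trailing bin is dropped
    return counts_30s[:n_bins * per_bin].reshape(n_bins, per_bin).sum(axis=1)

# Placeholder input: 5 days of 30-second samples with one vehicle per sample.
five_days = np.ones(5 * 24 * 60 * 2, dtype=int)
sizes = {m: len(aggregate_counts(five_days, m)) for m in (5, 10, 15, 20)}
# sizes == {5: 1440, 10: 720, 15: 480, 20: 360}
```

Five days of 24-hour observation contain 7200 minutes, so the 5, 10, 15 and 20 min scales give 1440, 720, 480 and 360 points, matching the figures above.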

The similarity between the curves in FIG. 3 shows that the traffic flow variation is self-similar across scales, and the trend of the traffic flow time series at the 5 min scale shows an obvious quasi-periodic pattern. To identify the self-similarity of the traffic flow data, the data are decomposed by wavelet transform. The wavelet decomposition coefficients shown in FIG. 4 serve as a similarity index (RI): the larger the RI, the stronger the self-similarity. Because travel demand changes, the wavelet coefficients of working days (the first three days) and the weekend (the last two days) differ, which indicates that traffic flow is time-period dependent, so the traffic flow data can be divided into busy, idle and normal time periods. The experimental data show the following division: the busy periods are 7:00-9:30 and 14:30-18:30; the idle period is 0:00-5:00; the remaining time is the normal period.

A. Result of predictive analysis of short-term traffic flow based on R/S analysis method

The Hurst value obtained by the R/S analysis method is affected by the sample size. To track and compare data at different observation scales, the traffic flow sequence is accumulated in units of one day, the natural period of traffic flow, so that the information representing the variation pattern within the period is retained as fully as possible in the calculation. The traffic flow of data set 2 is analyzed as follows.

(1) FIG. 5 shows the Hurst curves for different numbers of days at different statistical scales. The Hurst exponent values all lie in the interval [0.5, 1], which indicates that the traffic flow time series has long-term memory: the overall direction of traffic flow change inherits the past overall trend, and a past increasing (decreasing) trend indicates a future increasing (decreasing) trend. Each curve shows an overall downward trend as the time length increases, i.e. the Hurst exponent decreases as the sample size grows. This shows that, within the same statistical scale, once the time series reaches a certain size, adding data damages the self-similarity of the original series. For the same time length, the Hurst exponent also decreases as the statistical scale increases: the traffic flow sequence has short-term validity, and the long memory of the time series weakens over time.

(2) Table 1 gives the Hurst exponents calculated for the same number of days (5 days) at different statistical scales in the three time periods. At the same statistical scale, the Hurst exponent of the traffic flow increases from the idle period to the busy period: the busier the traffic, the stronger the self-similarity and hence the stronger the predictability at the same time scale. Within the same time period, the Hurst exponent decreases as the statistical scale increases, and it is expected to approach 0.5 as the scale keeps growing, at which point the traffic flow no longer has fractal features, mainly because the past and the future of the time series then behave as a completely independent process.

TABLE 1 Hurst index at different time intervals and on different statistical scales for the same number of days

(3) If a time series is long-range correlated, the interdependence between time points is strong. FIG. 6 shows the curves of log(R/S)_n and V_n against log n for the traffic flow time series at a statistical scale of 10 min. From the V_n curve of the original sequence, the average cycle length of the traffic flow at the 10 min statistical scale is judged to be 207 min, i.e. on average the sequence loses its memory of the initial conditions after 207 min. The Hurst exponent after shuffling the sequence (0.6233) is smaller than that of the original sequence (0.7031), because shuffling destroys the correlation structure of the original sequence and reduces the degree of order of the traffic flow time series. After shuffling, V_n is a flat curve, which shows that the sequence has become an independent random process without long-range correlation.

(4) FIG. 7 shows the curves of V_n against log n obtained at different statistical scales for the same time length, i.e. the results for the short-term traffic flow sequence at different statistical scales. As the statistical scale decreases, the abrupt change in V_n occurs later, i.e. the long memory takes longer to disappear. In practice this long-term memory is not unlimited; it weakens gradually over time until it is lost, so short-term prediction remains feasible. When τ = 1 hour, the rising trend of the V_n curve is not obvious; the closer the Hurst exponent is to 0.5, the more noise the sequence contains and the closer it is to a random process.

(5) To describe the complexity of the traffic flow quantitatively, traffic complexity measures based on fractals, chaos and entropy are analyzed. As shown in Table 2, the Hurst exponent and the sample entropy both decrease gradually as the statistical scale increases; the sample entropy is largest at the 5 min sampling scale, where the time series is most complex. The maximum Lyapunov exponent is always positive, so the motion of the system is unstable in some direction and a chaotic attractor appears in that direction, which means that the whole system is in a chaotic state. As shown in FIG. 8, the different components of the reconstructed phase space of the traffic flow sequence show trajectories that fold repeatedly and cross each other to form dense bands, and the traffic characteristics become more pronounced as the statistical scale increases.

TABLE 2 traffic flow feature complexity analysis at different statistical scales
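The text does not state how the sample entropy in Table 2 is computed. A minimal sketch of the usual definition, SampEn(m, r) = -ln(A / B) with Chebyshev distance and self-matches excluded, is given below; the defaults m = 2 and r = 0.2 times the standard deviation are common conventions assumed here, not values taken from the source.

```python
import numpy as np

def sample_entropy(x, m=2, r=None):
    """SampEn(m, r): lower values mean a more regular, less complex series."""
    x = np.asarray(x, dtype=float)
    if r is None:
        r = 0.2 * x.std()                    # common default tolerance (assumption)
    n_templates = len(x) - m                 # same template count for both lengths

    def matched_pairs(length):
        t = np.array([x[i:i + length] for i in range(n_templates)])
        # Chebyshev distance between every pair of templates
        d = np.max(np.abs(t[:, None, :] - t[None, :, :]), axis=2)
        # Remove the diagonal (self-matches) and count each pair once
        return (np.count_nonzero(d <= r) - n_templates) / 2

    B = matched_pairs(m)                     # pairs matching over m points
    A = matched_pairs(m + 1)                 # pairs still matching over m + 1 points
    return float(-np.log(A / B))
```

The pairwise distance matrix needs memory quadratic in the series length, so this form suits series of a few thousand points at most. A regular signal such as a sine wave yields a much lower value than white noise of the same length.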

B. Model predictive analysis

A BP neural network is designed according to the characteristics of the traffic flow. The network consists of an input layer, a hidden layer and an output layer. The feature vectors and corresponding outputs are constructed with a window width of m = 4, which finally yields the samples for training the neural network.

The network structure is 4-3-p: the input layer has 4 nodes representing the traffic flow at the 4 time points preceding the current time node, the hidden layer has 3 nodes, and the output layer has p nodes giving the traffic flow predicted by the network. In the improved BP_Adaboost algorithm, a strong predictor composed of 10 groups of weak predictors predicts the data samples, with the error threshold set to 0.1. For each data set, 3 days of historical data are used to predict the traffic flow of the 4th day: the first 864 groups are training samples and the last 288 groups are test samples. The invention uses the following 3 error indexes to measure the accuracy of the combined prediction.

Where N is the length of the traffic flow data sequence, y_i is the sample output value and d_i is the sample target value. The larger the coefficient of determination (R²), the better the model, with R² ∈ [0, 1]; the smaller the mean square error (MSE) and the mean absolute error (MAD), the more reasonable the structure of the corresponding model.
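The three index formulas are not preserved in this text. The sketch below uses the common definitions, with y as the model output and d as the target, and should be read as an assumption about the intended formulas. Note that this form of R² can become negative for a model worse than predicting the mean, whereas the text restricts R² to [0, 1].

```python
import numpy as np

def r_squared(y, d):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y, d = np.asarray(y, dtype=float), np.asarray(d, dtype=float)
    ss_res = np.sum((d - y) ** 2)            # residual sum of squares
    ss_tot = np.sum((d - d.mean()) ** 2)     # total sum of squares of the targets
    return float(1.0 - ss_res / ss_tot)

def mse(y, d):
    """Mean square error between outputs y and targets d."""
    y, d = np.asarray(y, dtype=float), np.asarray(d, dtype=float)
    return float(np.mean((d - y) ** 2))

def mad(y, d):
    """Mean absolute error (abbreviated MAD in the text)."""
    y, d = np.asarray(y, dtype=float), np.asarray(d, dtype=float)
    return float(np.mean(np.abs(d - y)))
```

For targets [1, 2, 3, 4] and outputs [1, 2, 3, 5], these give R² = 0.8, MSE = 0.25 and MAD = 0.25.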

The traditional BP method, the BP_Adaboost method, the MEC-BP_Adaboost model and the improved MEC-BP_Adaboost model (the method of the invention) are trained in Matlab 2017b. The trained models are used for single-step short-term traffic flow prediction on data set 1; the results are shown in Table 3 and FIGS. 9 and 10.

TABLE 3 comparison of Performance indicators for different prediction models

As can be seen from Table 3 and FIGS. 9 and 10, the MEC-BP model reduces the mean square error and the mean absolute error by 29.8% and 3.5% respectively compared with the traditional BP model, which demonstrates the effectiveness of MEC in optimizing the initial parameters of the BP model. The BP_Adaboost model reduces the mean square error and the mean absolute error by 56.3% and 27.1% respectively compared with the traditional BP model, which shows that the Adaboost algorithm greatly improves the generalization ability of the neural network; because Adaboost uses weighted majority voting, it effectively improves the prediction accuracy of the model and avoids over-fitting. The method of the invention reduces the mean square error and the mean absolute error by 78.2% and 46.4% respectively compared with the BP model, which demonstrates that the improved method is sound for traffic flow prediction. Compared with the MEC-BP_Adaboost model, the method of the invention reduces the mean square error and the mean absolute error by 44.9% and 25.9% respectively, which shows that weighting the weak predictors by the reciprocal of the sum of squared errors yields higher prediction accuracy and improves the generalization ability of the predictor more effectively.

To better reflect the prediction effect of each weak predictor, the weight of each group of weak prediction functions is solved by the reciprocal of the sum of squared prediction errors, so that the performance of each weak predictor is better expressed and the decision performance of the whole model is improved. The weights of the weak predictors in the MEC-BP_Adaboost model and in the improved MEC-BP_Adaboost model are compared in Table 4:

TABLE 4 weight comparison of each weak predictor in the two models

The experimental results in Table 4 show that, among the weights of the 10 MEC-optimized neural networks, the 3rd, 4th and 8th neural networks of the MEC-BP_Adaboost model have the largest weight ratios, indicating that these 3 networks predict traffic flow most effectively. After the improvement by the method of the invention, the weights of the other networks, which have little influence on the model, are reduced and the influence of these 3 networks on the whole model is increased, so the valuable information they provide is fully used and the accuracy of the prediction result is maximized.

To further verify the effectiveness and generality of the model, 2-step, 3-step, 4-step and 5-step predictions are carried out on data set 2, as shown in Table 5. As the number of prediction steps increases, the prediction error of the method of the invention remains generally smaller than that of the original method, although for a given model the prediction accuracy decreases as the number of steps increases.

TABLE 5 MSE-value comparison of different prediction step sizes for different models

Claims

1. A traffic flow characteristic analysis and prediction method based on the Adaboost algorithm, specifically comprising the following steps:

Step (1-1) calculation step:

given a time series {x(t)}, t = 1, 2, …, M, the following calculations are performed;

1) divide it into [M/n] equal-length subsequences of length n; I_a denotes the a-th subsequence, on which the time series is written {x(i)}, i = 1, 2, …, n; E_a denotes the mean over the a-th subsequence:

3) the range of subsequence I_a

and the sample standard deviation

Step (1-2) analysis process:

time series are classified into three types according to the value of the Hurst exponent H:

(1) 0 < H < 0.5, which indicates that the sequence is not a random walk but an anti-persistent (negatively correlated) time series, i.e. the future trend is opposite to the past trend; the closer H is to 0, the stronger the anti-persistence;

(2) H = 0.5, which indicates that the sequence is a standard random walk, i.e. the future trend is unrelated to past increments;

(3) 0.5 < H < 1, which indicates that the time series is persistent: a past increasing trend indicates a future increasing trend, and a past decreasing trend indicates a future decreasing trend; the closer H is to 1, the more closely the future is related to the past; the future trend of the time series can be analyzed quantitatively according to its persistence or anti-persistence;
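The calculation and analysis in steps (1-1) and (1-2) can be illustrated with a minimal rescaled-range (R/S) sketch. It is not the patent's exact procedure: the cumulative-deviation step, the 1/n form of the standard deviation, the log-log least-squares fit and the function names `rescaled_range` and `hurst_exponent` are assumptions taken from the usual R/S analysis, since the corresponding formulas are not reproduced in the text above.

```python
import math
import random


def rescaled_range(segment):
    """R/S of one subsequence: range of cumulative deviations over the standard deviation."""
    n = len(segment)
    mean = sum(segment) / n                      # E_a, mean of the subsequence
    cumulative, running = [], 0.0
    for value in segment:
        running += value - mean                  # cumulative deviation (assumed step 2)
        cumulative.append(running)
    value_range = max(cumulative) - min(cumulative)             # range of I_a
    std = math.sqrt(sum((v - mean) ** 2 for v in segment) / n)  # 1/n form assumed
    return value_range / std if std > 0 else None


def hurst_exponent(series, window_sizes):
    """Estimate H as the least-squares slope of log(mean R/S) against log(n)."""
    log_n, log_rs = [], []
    for n in window_sizes:
        ratios = []
        for a in range(len(series) // n):        # [M/n] equal-length subsequences
            ratio = rescaled_range(series[a * n:(a + 1) * n])
            if ratio is not None:
                ratios.append(ratio)
        if ratios:
            log_n.append(math.log(n))
            log_rs.append(math.log(sum(ratios) / len(ratios)))
    mean_x = sum(log_n) / len(log_n)
    mean_y = sum(log_rs) / len(log_rs)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(log_n, log_rs))
    den = sum((x - mean_x) ** 2 for x in log_n)
    return num / den


# Illustrative data only: uncorrelated noise should give H in the vicinity of 0.5.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(4096)]
print(hurst_exponent(noise, [16, 32, 64, 128, 256, 512]))
```

On finite samples the R/S estimate for uncorrelated noise tends to sit somewhat above 0.5, so the three classes above are better read as ranges than as sharp thresholds.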

Step (2): phase space reconstruction of the traffic flow time series

Let the traffic flow time series be {x(t)}, t = 1, 2, …, N; the reconstructed phase space is

X = {X(t) | X(t) = [x(t), x(t+τ), …, x(t+(m-1)τ)]^T, t = 1, 2, …, M}    (6)

wherein X is an m × M matrix and the number of phase points in the reconstructed phase space is M = N - (m-1)τ; the M phase points form a phase portrait in the m-dimensional phase space, each phase point representing the state of the traffic flow system at a certain moment; connecting the phase points in order of increasing time describes the evolution trajectory of the traffic flow system in the m-dimensional phase space, so that the original one-dimensional time series prediction problem is converted into the prediction of a sequence of m-dimensional phase points; assuming that the phase points {X(t), X(t-1), …, X(t-k)}, k = 1, 2, …, t-1 are known and that, at the current time t + (m-1)τ, the phase points to be predicted are {X(t+1), X(t+2), …, X(t+p)}, where p = 1 is referred to as one-step prediction and p > 1 as multi-step prediction, the prediction model is expressed as:

{x(t+(m-1)τ+1), …, x(t+(m-1)τ+p)} = F(X(t), …, X(t-k))    (7)

one-step or multi-step prediction of traffic flow is realized by utilizing the generalization approximation capability of a feedforward neural network;
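The reconstruction in formula (6) and the one-step case (p = 1) of formula (7) can be sketched as follows. The function names `reconstruct_phase_space` and `one_step_pairs` are hypothetical, indices are 0-based rather than 1-based, and each training pair uses a single phase point X(t) as input (k = 0), which is a simplification of the general model.

```python
def reconstruct_phase_space(series, m, tau):
    """Return the M = N - (m - 1) * tau delay vectors X(t) = [x(t), x(t + tau), ...]."""
    count = len(series) - (m - 1) * tau
    if count <= 0:
        raise ValueError("series too short for this embedding dimension and delay")
    return [[series[t + j * tau] for j in range(m)] for t in range(count)]


def one_step_pairs(series, m, tau):
    """Pair each phase point X(t) with the next scalar value x(t + (m - 1) * tau + 1)."""
    pairs = []
    for t, point in enumerate(reconstruct_phase_space(series, m, tau)):
        target_index = t + (m - 1) * tau + 1
        if target_index < len(series):           # the last phase point has no target
            pairs.append((point, series[target_index]))
    return pairs


# Illustrative values only: a short ramp makes the indexing easy to follow.
for point, target in one_step_pairs(list(range(10)), m=3, tau=2):
    print(point, "->", target)
```

Pairs built this way could serve as input and output samples for the feedforward network that approximates F.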

Step (3): MEC-BP fusion

Let t be the current iteration number of the MEC global iteration and ρ the current iteration number within a given sub-population; each individual in a sub-population represents one set of initial weights and thresholds of the BP neural network fusion algorithm, and the fitness index of a single individual N_i,j is calculated from the fusion result produced by the converged BP neural network fusion model after training; in the internal iteration of the sub-populations, the optimal individual N_i,pbest is selected from each sub-population through a local bulletin board, and this individual then represents its whole sub-population in the global competition through a global bulletin board, where the globally optimal sub-population S_gbest and the globally optimal individual N_gbest it contains are selected; after multiple iterations, the MEC-BP neural network model trained from the initial weights and thresholds represented by the final globally optimal individual is the finally obtained multi-source traffic data fusion model;
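The local and global competition described in this step can be sketched as a small search loop. This is a simplified illustration under stated assumptions, not the patented procedure: the function `mec_search`, its parameters and the Gaussian scattering are hypothetical, and the fitness here is a toy quadratic, whereas in the method it would be derived from the error of the trained BP network.

```python
import random


def mec_search(fitness, dim, n_groups=5, group_size=20, iterations=30,
               sigma=0.3, rng=None):
    """Toy mind-evolutionary search that maximises `fitness` over a parameter vector."""
    rng = rng or random.Random(0)
    # Each sub-population is summarised by its centre (one candidate parameter vector).
    centres = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(n_groups)]
    best, best_score = None, float("-inf")
    for _ in range(iterations):
        board = []                                # global bulletin board
        for centre in centres:
            # Local competition: scatter individuals around the centre, keep the winner.
            members = [centre] + [[c + rng.gauss(0, sigma) for c in centre]
                                  for _ in range(group_size - 1)]
            winner = max(members, key=fitness)
            board.append((fitness(winner), winner))
        board.sort(key=lambda entry: entry[0], reverse=True)
        if board[0][0] > best_score:
            best_score, best = board[0]
        # Global competition: the weakest sub-population is replaced by a random one.
        centres = [winner for _, winner in board[:-1]]
        centres.append([rng.uniform(-1, 1) for _ in range(dim)])
    return best, best_score


# Hypothetical fitness: negative squared distance to a made-up optimum.
optimum = [0.5, -0.25, 0.1]
toy_fitness = lambda p: -sum((a - b) ** 2 for a, b in zip(p, optimum))
print(mec_search(toy_fitness, dim=3))
```

In the setting of this step, the returned vector would be unpacked into the initial weights and thresholds of the BP network before gradient training starts.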

Step (4): neural network ensemble prediction model based on the Adaboost algorithm

The sample weights are obtained by repeatedly searching the sample feature space; during the iterations the weights of the training samples are adjusted continuously, increasing the weight of samples predicted with low accuracy and decreasing the weight of samples predicted with high accuracy; the weak predictors are then combined by weighted majority voting to form a strong predictor, specifically as follows:

Step (4-1): Adaboost algorithm steps

Step 1: data acquisition and network initialization; select m groups of training samples T = {(X_i, y_i)} from the sample space and set the weight distribution of the training samples to w_1i = 1/m, i = 1, 2, …, m; determine the network structure according to the input and output dimensions of the samples; obtain the initial weights and thresholds of the neural network by optimization with the mind evolutionary algorithm; D(1) denotes the initial sample weights and K denotes the number of predictors;

D(1) = (w_11, w_12, …, w_1i, …, w_1m)    (8)

Step 2: iterate for k = 1, 2, …, K

E_k = max(|y_i - H_k(X_i)|)    (9)

D(k+1) = (w_k+1,1, w_k+1,2, …, w_k+1,m)    (13)
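The intermediate update formulas between (9) and (13) are not reproduced in this text, so the following sketch substitutes the well-known AdaBoost.R2 update for regression, which also normalizes errors by the maximum error of formula (9). This is an assumption and may differ from the patent's own formulas; the function name `adaboost_r2_update` is hypothetical.

```python
def adaboost_r2_update(weights, targets, predictions):
    """One AdaBoost.R2-style update of the sample weights D(k) -> D(k+1).

    Returns (new_weights, beta); a smaller beta means a more reliable weak predictor.
    """
    errors = [abs(y - p) for y, p in zip(targets, predictions)]
    e_max = max(errors)                            # E_k, as in formula (9)
    if e_max == 0:
        return list(weights), 0.0                  # perfect fit: nothing to re-weight
    relative = [e / e_max for e in errors]         # linear loss in [0, 1]
    average_loss = sum(w * r for w, r in zip(weights, relative))
    if average_loss >= 0.5:
        raise ValueError("weak predictor too poor; AdaBoost.R2 stops boosting here")
    beta = average_loss / (1.0 - average_loss)
    # Well-predicted samples (small loss) are scaled down the most, so after
    # normalisation the poorly predicted samples carry more weight.
    raw = [w * beta ** (1.0 - r) for w, r in zip(weights, relative)]
    total = sum(raw)
    return [w / total for w in raw], beta


# Illustrative values only: the fourth sample is predicted badly.
print(adaboost_r2_update([0.25] * 4, [1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 6.0]))
```

The direction of the update matches the description of step (4): samples predicted with low accuracy gain weight and samples predicted with high accuracy lose weight.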

step 3: k groups of weak predictors H are obtained after K rounds of training _k (x) Combining each weak prediction function according to the weight of the weak predictor to obtain a strong predictor h(x) Comprises the following steps:

Step 4: to better determine the weight of each group of weak predictors, after the K groups of weak predictors H_k(x) are obtained by training the MEC-BP neural networks with the Adaboost algorithm, the weight w_k of each group of weak prediction functions is solved again using the reciprocal of the sum of squared prediction errors criterion, giving the accumulated strong predictor h(x) = Σ w_k · H_k(x_k, a_k); the larger the sum of squared prediction errors, the lower the prediction accuracy of that prediction model and the lower its importance in the combined prediction, so a single prediction model with a smaller sum of squared prediction errors is given a larger weighting coefficient in the combined prediction; the weighting coefficients are calculated as follows:

let y_ki be the predicted value of the k-th weak predictor at time i, y_i the observed value of the same prediction object at time i, N the length of the time series, and E'_k the sum of squared prediction errors of the k-th weak predictor;
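With the symbols just defined, the reciprocal of the sum of squared prediction errors criterion can be sketched as follows. The normalized form w_k = (1/E'_k) / Σ_j (1/E'_j) is the usual statement of this criterion and is assumed here; the function names `sse_reciprocal_weights` and `combine` are hypothetical.

```python
def sse_reciprocal_weights(observed, predictions_per_model):
    """Weights proportional to 1 / E'_k, normalised so that they sum to 1."""
    sse = [sum((y - p) ** 2 for y, p in zip(observed, preds))   # E'_k
           for preds in predictions_per_model]
    if any(e == 0 for e in sse):
        raise ValueError("a zero-error predictor makes the reciprocal undefined")
    inverse = [1.0 / e for e in sse]
    total = sum(inverse)
    return [v / total for v in inverse]


def combine(weights, predictions_per_model):
    """Strong prediction at each time i: sum over k of w_k * y_ki."""
    return [sum(w * p for w, p in zip(weights, column))
            for column in zip(*predictions_per_model)]


# Made-up numbers: the first weak predictor is more accurate than the second.
observed = [1.0, 2.0, 3.0]
weak = [[1.1, 2.1, 3.1], [1.5, 2.5, 3.5]]
weights = sse_reciprocal_weights(observed, weak)
print(weights, combine(weights, weak))
```

Because the weights fall off with the squared error, a weak predictor with one fifth of the absolute error receives about twenty-five times the weight, which is why the more accurate networks dominate the combined prediction.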