CN109598935A

CN109598935A - A kind of traffic data prediction technique based on ultra-long time sequence

Info

Publication number: CN109598935A
Application number: CN201811532408.8A
Authority: CN
Inventors: 李建元; 章步镐; 杨柏林; 陆俊杰; 田彦
Original assignee: Enjoyor Co Ltd
Current assignee: Yinjiang Technology Co ltd
Priority date: 2018-12-14
Filing date: 2018-12-14
Publication date: 2019-04-09
Anticipated expiration: 2038-12-14
Also published as: CN109598935B

Abstract

The present invention relates to a kind of traffic data prediction techniques based on ultra-long time sequence, first, position mark processing is carried out to historical traffic data, the method for carrying out position mark processing to historical traffic data is as follows: determining the corresponding relationship for the data space that individual data position and ultra-long time sequence are formed in traffic data ultra-long time sequence；Position mark is made to the traffic data in traffic data ultra-long time sequence according to corresponding relationship；The ultra-long time sequence refers to across multiple time serieses for dividing the period；Finally, choosing the historical traffic data with position mark as input data, traffic data prediction is carried out.Using a kind of method arbitrary in neural net model method, the method for moving average, exponential smoothing, AR modelling when traffic data is predicted.The present invention improves traffic data precision of prediction by the correlation of Data Position in perception traffic data ultra-long time sequence.

Description

Traffic data prediction method based on ultra-long time sequence

Technical Field

The invention relates to the field of traffic data prediction, in particular to a traffic data prediction method based on an ultra-long time sequence.

Background

The development of real-time dynamic traffic makes real-time accurate traffic data popular. However, traffic data is greatly affected by the time factor. Mainly divided into active factors and passive factors. The active factors are: weekday and non-weekday, seasonal changes, time lapse, and the like. The passive factors are: holidays, activities that lead traffic changes, etc.

The traffic data time sequence can be divided according to time periods of minutes, hours, days, weeks, months, years and the like, and traffic data prediction mainly utilizes traffic data of historical time periods to predict traffic data of future time periods. The existing traffic prediction methods, such as a moving average method, an exponential smoothing method, an AR model method and the like, have weak perception on an ultra-long time sequence spanning multiple divided time periods; therefore, it is necessary to design a traffic data prediction method based on an ultra-long time series.

Disclosure of Invention

The invention aims to overcome the defects and provides a traffic data prediction method based on an overlong time sequence.

The invention achieves the aim through the following technical scheme: a traffic data prediction method based on an ultra-long time sequence comprises the following steps:

(1) the method for carrying out the position marking processing on the historical traffic data comprises the following steps:

(1.1) determining the corresponding relation between a single data position in the traffic data super-long time sequence and a data space formed by the super-long time sequence;

(1.2) marking the position of the traffic data in the traffic data super-long time sequence according to the corresponding relation; the ultra-long time sequence refers to a time sequence spanning multiple divided time periods;

(2) and selecting historical traffic data with position marks as input data to predict the traffic data.

Preferably, in the step (1.1), any one of a regression model method, a user setting method and a neural network model method can be adopted to determine the corresponding relation between a single data position in the traffic data super-long time sequence and a data space formed by the super-long time sequence; preferably, a regression model method is used, the formula is as follows:

Q＝∑(y-∑(w^px^p+b^p))＝min

wherein y is traffic data at a single data position, p represents a data space position, and w and b which minimize the Q function are solved as the corresponding relation between the single data position and the data space.

Preferably, in the step (1.2), the traffic data in the traffic data super-long time sequence is marked according to the corresponding relation, wherein w and b are obtained by adopting a regression model, and different grades are divided according to the size of w and are used as position marks.

Preferably, the divided time interval is determined according to a periodic phenomenon presented by traffic data, a plurality of groups of traffic data time sequences with a time window as a unit are selected, the average similarity is calculated, and when the average similarity meets a set threshold value, the time window can be used as the divided time interval; the similarity can be calculated by cosine similarity, Pearson correlation coefficient and Jaccard similarity coefficient.

Preferably, in the step (2), the traffic data prediction is performed by using any one of a neural network model method, a moving average method, an exponential smoothing method, and an AR model method.

Preferably, the traffic data includes, but is not limited to, data collected by traffic devices, data converted from data collected by traffic devices; the data collected by the traffic equipment includes, but is not limited to, traffic flow and traffic speed, and the data converted according to the data collected by the traffic equipment includes, but is not limited to, traffic state and traffic index.

Preferably, the neural network model method preferably adopts an LSTM long-term neural network to predict traffic data, specifically as follows:

(i) performing traffic flow prediction by setting an input driving sequence, and establishing a linear regression model for a predicted value and a remote traffic flow value;

(ii) after training, screening out the flow value of which the position mark meets the requirement, and marking the position of the remote flow value with large weight obtained by screening;

(iii) adding the traffic flow value with the position mark and the weight into an LSTM network input layer, carrying out back propagation training of LSTM +, firstly calculating the output value of each neuron in a forward direction, secondly calculating the error term value of each neuron in a reverse direction, and finally calculating the gradient of each weight according to the corresponding error term and updating the weight value;

(iv) and predicting the traffic flow in a future preset period by using the trained LSTM network model.

Preferably, in step (i), the drive sequence of the setting input is X ═ X (X)_1，1，1，x_1，1，2，...，x_kk，nn，mm，y_{kk，nn+1，1}，y_{kk，nn+1，2}，...，y_{kk，nn+1，t-1，})，

Wherein y represents data of the prediction day, and the flow prediction problem is essentially that the flow y is built and predicted through the driving sequence X_{kk，nn+1，t}To (3) is performed.

Preferably, in the step (i), when a linear regression model is established, data around the time interval t of each day is selected, and the regression model is as follows:

wherein,d is the step size of the image,and b are the weight and bias of the regression model, respectively.

Preferably, the LSTM network is a special RNN recurrent neural network, and compared with RNN, an "input gate", "forget gate", "output gate", "input gate", "forget gate", "output gate" are added, and LSTM cell states are as follows:

an input gate:

forget the door:

cell state:

an output gate:

where I represents the length of the input current sequence, C represents the number of LSTM cells, P represents the number of labeled high impact traffic values, l represents the input status,indicating the state of the forgetting gate, o indicating the state of the output gate, f and g being activation functions,represents the c-th cell state value at time t, tg_pThe marked traffic flow value representing the pth weighted, the final output is:

the invention has the beneficial effects that: the method and the device improve the traffic data prediction accuracy by sensing the relevance of the data positions in the traffic data super-long time sequence, marking the importance of the data positions, and screening the traffic data of the positions to serve as the basis of traffic data prediction. Preferably, the traffic flow value of the important position is added into the LSTM, so that the LSTM model not only has a long-term and short-term memory function, but also has a certain memory function for an ultra-long term, and the traffic flow prediction is more accurate.

Drawings

FIG. 1 is a schematic flow diagram of the process of the present invention.

Detailed Description

The invention will be further described with reference to specific examples, but the scope of the invention is not limited thereto:

example 1: as shown in fig. 1, a traffic data prediction method based on an ultra-long time series includes the following steps:

the traffic data includes data collected by traffic equipment, such as traffic flow, traffic speed, and the like, and data converted according to the data collected by the traffic equipment, such as traffic state, traffic index, and the like.

The ultra-long time series refers to a time series spanning multiple divided periods, such as: the 4 divided time periods are respectively 1 minute, 1 hour, 1 day and 1 week, x [ K, M, N, L ] represents traffic data of the nth hour and the ith minute on the mth day of the kth week, K, M, N and L are integers, K is an optional range [1-K ], M is an optional range [1-M ],1< M <8, N is an optional range [1-N ],1< N <25, L is an optional range [1-L ], and 1< L < 61; due to different division time intervals, the ultra-long time sequence forms a data space [1-K,1-M,1-N,1-L ], [ K, M, N, L ] corresponding to a certain data space position. Similarly, the time series spanning 3 divided periods also form a lengthy time series.

The divided time interval is determined according to the periodic phenomenon presented by the traffic data, a plurality of groups of traffic data time sequences with the time window as a unit are selected, the average similarity is calculated, and when the average similarity meets a set threshold value, the time window can be used as the divided time interval. In this embodiment, a set of historical traffic data sets is obtained: { (x)₁,t₁),(x₂,t₂),…(x_j,t_j),…,(x₃₀₀,t₃₀₀)},t₂-t₁1 hour, the time window TK is 12 hours, and a plurality of groups of traffic data time sequences taking TK as a unit are taken, taking 3 groups as an example, x_TK1＝{x₁,x₂,…,x₁₂},x_TK2＝{x₂₅,x₂₆,…,x₃₇},x_TK3＝{x₁₃,x₁₄,…,x₂₄And the selection can be carried out sequentially at 12-hour intervals or randomly.

The similarity can be calculated by cosine similarity, Pearson correlation coefficient, Jaccard similarity coefficient and the like. S<x_TK1,x_TK2>Is a time sequence x_TK1And x_TK2The similarity of (c). Average similarity S is calculated_TK＝(S<x_TK1,x_TK2>+S<x_TK1,x_TK3>+S<x_TK2,x_TK3>) A/3; the average similarity satisfies a set threshold, and a 12-hour time window may be used as the divided period.

Similarly, the time window TK is 24 hours, the average similarity satisfies a set threshold, and the 24 hour time window may be used as the divided period.

The method may use a regression model, user settings, a neural network model, and the like to determine a correspondence between a single data position in the traffic data super-long time sequence and a data space formed by the super-long time sequence, in this embodiment, the regression model is preferably used, and the formula is as follows:

Q＝∑(y-∑(w^px^p+b^p))＝min

wherein y is traffic data at a single data position, p represents a data space position, and w and b which minimize the Q function are solved as the corresponding relation between the single data position and the data space. And marking the position of the traffic data in the traffic data super-long time sequence according to the corresponding relation, acquiring w and b by adopting a regression model, and dividing different levels according to the size of w to be used as the position mark.

(2) And selecting historical traffic data with position marks as input data to predict the traffic data. The traffic data prediction may be performed by using a neural network model method, a moving average method, an exponential smoothing method, an AR model method, or the like.

In the present embodiment, a neural network model method is used below to predict traffic data; specifically, an LSTM long-time neural network is adopted for traffic data prediction, and the method specifically comprises the following steps:

(i) the traffic flow data is subjected to weaker perception analysis through an LSTM network, and the characteristic enhancement is performed on the high-influence flow value far away from the predicted value; performing traffic flow prediction by setting an input driving sequence, and establishing a linear regression model for a predicted value and a remote traffic flow value; after training, screening out the flow value of which the position mark meets the requirement, and marking the position of the remote flow value with large weight obtained by screening;

the long and short time neural network (LSTM) is widely applied to prediction of time series, and is a special Recurrent Neural Network (RNN), compared with the RNN, the LSTM is additionally provided with an input gate, an output gate and a forgetting gate, and the gates enable the LSTM to have the function of long-term memory. In traffic flow data prediction, predicted traffic flow data is closely related to flow data of the first few time stamps and also related to flow data near the same time stamp in the first few days of working days, however, due to the existence of LSTM error accumulation, LSTM has weak data perception on an ultra-long sequence, which makes LSTM have weak perception capability on prediction data of the first few days with certain influence in traffic flow, thereby leading to suboptimal analysis and prediction.

The scheme comprises two main steps, firstly, a linear regression model is established for far-distance traffic flow values and predicted values to sense the influence of the far-distance traffic flow values on the predicted values, then, the traffic flow values with position marks meeting requirements are screened out, and the screened traffic flow values with the position marks are added into an LSTM input layer.

It is assumed here that the driving sequence of the input is

X＝(x_1，11，x_1，1，2，...，x_kk，nn，mm，y_{kk，nn+1，1}，y_{kk，nn+1，2}，...，y_{kk，nn+1，t-1，})，

For the convenience of distinguishing, data of the predicted day is represented by y, and the flow prediction problem is to establish and predict the flow y through a driving sequence X_{kk，nn+1，t}To (3) is performed.

Relatively speaking, the flow value near the same time interval n days before has a higher influence on the predicted value, and the farther the flow value is from the time interval t, the weaker the influence is, so in order to reduce the redundancy of the data amount, when performing regression, data near the time interval t every day is selected, and the regression model is as follows:

whereind is the step size of the image,and b are the weight and the offset of the regression model respectively, and after the training is finished, the positions of the flow values with large weights are marked.

In this example, regression was used for weight screening to predict the kth Tuesday 12: traffic data of 00. Setting the step length d as 1, and after the training is finished, the internal corresponding weight matrix is as follows:

first week

Second week

When training is completed, each position will have a corresponding weight, which is represented by < weight, tag > to indicate whether to mark or not, and weight represents the corresponding weight, and the above example marks positions with weights greater than 0.13, and these marked position data represent high correlation positions.

(ii) Adding the traffic flow value with the position mark and the weight into an LSTM network input layer, carrying out back propagation training of LSTM +, firstly calculating the output value of each neuron in a forward direction, secondly calculating the error term value of each neuron in a reverse direction, and finally calculating the gradient of each weight according to the corresponding error term and updating the weight value;

during the training of the LSTM, these marked weighted traffic data are added to the LSTM network, and the states of the input gate, the forgetting gate, the output gate, and the LSTM cell in the LSTM network are as follows:

an input gate:

forget the door:

cell state:

an output gate:

(iii) and predicting the traffic flow in a future preset period by using the trained LSTM network model.

Example 2: a traffic data prediction method based on an ultra-long time sequence comprises the following steps:

Q＝∑(y-∑(w^px^p+b^p))＝min

In this embodiment, a moving average method is used to predict traffic data of 8:00 on kth +1 th week, which is specifically as follows:

ultra-long time series of historical traffic data:

week 1

Week 2

x_1,1,1Traffic data at 1:00 on 1 week Monday, x_kk,nn,mmTraffic data for the mth hour from 1 o integer on the nnth day from the monday on the kk week.

Historical traffic data with position markers is formed<x_kk,nn,mm,tg_kk,nn,mm>tg_kk,nn,mmIs x_kk,nn,mmLocation markers for the data.

Predict the traffic data of 1:00 every kth +1 week, select tg_kk,nn,mmHistorical traffic data satisfying a threshold { x_kk,7,24,x_kk,7,23,x_kk,7,22,x_kk,7,1,x_kk,6,24,x_kk,6,23,x_kk-1,7,24,…}，x_kk,7,24,x_kk,7,23,x_kk,7,22First 3 hours, x, of 1:00 on kk +1 week_kk,7,1,x_kk,6,24,x_kk,6,23The time x around 1:00 of the first 1 day_kk-1,7,24About 1:00 for the first 2 days. The importance of different time positions on data prediction is reflected.

And performing average operation on the R historical traffic data, and sequentially sliding until all the data are processed to finally obtain an average value serving as a predicted value of the traffic data, wherein R is smaller than the time sequence length of the selected historical traffic data.

While the invention has been described in connection with specific embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

1. A traffic data prediction method based on an ultra-long time sequence is characterized by comprising the following steps:

2. The traffic data prediction method based on the ultra-long time series according to claim 1, characterized in that: in the step (1.1), any one of a regression model method, a user setting method and a neural network model method can be adopted to determine the corresponding relation between a single data position in the traffic data super-long time sequence and a data space formed by the super-long time sequence; preferably, a regression model method is used, the formula is as follows:

Q＝∑(y-∑(w^px^p+b^p))＝min

3. The traffic data prediction method based on the ultra-long time series according to claim 1, characterized in that: and (3) marking the position of the traffic data in the traffic data super-long time sequence according to the corresponding relation in the step (1.2), wherein a regression model is adopted to obtain w and b, and different grades are divided according to the size of w and are used as position marks.

4. The traffic data prediction method based on the ultra-long time series according to claim 1, characterized in that: the divided time interval is determined according to the periodic phenomenon presented by the traffic data, a plurality of groups of traffic data time sequences with the time window as a unit are selected, the average similarity is calculated, and when the average similarity meets a set threshold value, the time window can be used as the divided time interval; the similarity can be calculated by cosine similarity, Pearson correlation coefficient and Jaccard similarity coefficient.

5. The traffic data prediction method based on the ultra-long time series according to claim 1, characterized in that: in the step (2), the traffic data is predicted by adopting any one of a neural network model method, a moving average method, an exponential smoothing method and an AR model method.

6. The traffic data prediction method based on the ultra-long time series according to claim 1, characterized in that: the traffic data comprises but is not limited to data collected by traffic equipment and data converted according to the data collected by the traffic equipment; the data collected by the traffic equipment includes, but is not limited to, traffic flow and traffic speed, and the data converted according to the data collected by the traffic equipment includes, but is not limited to, traffic state and traffic index.

7. The traffic data prediction method based on the ultra-long time series according to claim 5, characterized in that: the neural network model method preferably adopts an LSTM long-time neural network to predict traffic data, and specifically comprises the following steps:

8. The method of claim 7, wherein the traffic data is predicted based on the very long time seriesCharacterized in that: in step (i), the input driving sequence is set to X ═ X_1，1，1，x_1，1，2，…，x_kk，nn，mm，y_kk，nn+1,1，y_kk，nn+1,2，…,y_{kk，nn+1,t-1}Y) where y represents the data of the predicted day and the flow prediction problem is essentially the creation and prediction of flow y by driving sequence X_{kk，nn+1，t}To (3) is performed.

9. The traffic data prediction method based on the ultra-long time series according to claim 7, characterized in that: when a linear regression model is established, selecting data near a time interval t every day, wherein the regression model is as follows:

10. The traffic data prediction method based on the ultra-long time series according to claim 7, characterized in that: the LSTM network is a special RNN recurrent neural network, and compared with RNNs, "input gate", "forget gate", "output gate" are added, and LSTM cell states are as follows:

an input gate:

forget the door:

cell state:

an output gate: