CN114267170A

CN114267170A - Traffic flow prediction method based on graph space-time transform model considering human mobility

Info

Publication number: CN114267170A
Application number: CN202111140879.6A
Authority: CN
Inventors: 孔祥杰; 赵振振; 沈国江; 熊海涛; 刘娜利; 刘志
Original assignee: Zhejiang University of Technology ZJUT
Current assignee: Zhejiang University of Technology ZJUT
Priority date: 2021-09-28
Filing date: 2021-09-28
Publication date: 2022-04-01

Abstract

A traffic flow prediction method based on a spatio-temporal transform model considering human mobility. The method comprises the following steps: 1) data collection: the method adopts real traffic data sets of different areas in the United states, and obtains data required by an experiment after screening, extracting and preprocessing; 2) spatial feature extraction: designing multi-view graph convolution to reveal different behavior modes of people hidden in traffic data, thereby showing an invisible graph structure in the traffic data and fully mining cross-road network characteristics on a space; 3) time characteristic extraction: performing time feature extraction by using a Transformer network with a multi-head attention mechanism; 4) constructing a model: combining time and space characteristics to construct a map space-time Transformer model for traffic flow prediction; 5) data set verification: the performance evaluation method adopts a real world traffic data set to carry out performance evaluation on the graph space-time Transformer model, finally determines the effectiveness of the graph space-time Transformer model, and is used for traffic flow prediction. Experiments show that the invention has excellent performance for treating similar problems.

Description

Traffic flow prediction method based on graph space-time transform model considering human mobility

Technical Field

The invention relates to the fields of artificial intelligence and data mining, in particular to a method for accurately predicting traffic flow and guiding the construction of an intelligent traffic system in a city.

Background

With the rapid development of economy and information technology, the problems of traffic jam, environmental pollution, resource allocation and the like are brought about by the continuously improved modern life of people and the continuously increased automobile holding capacity, the construction of an intelligent traffic system is urgent, and the problems are difficult to solve because the road information is not fully sensed in the past. Nowadays, due to the maturity of various sensor technologies and cloud computing services, the running state of a road vehicle can be accurately sensed, and then the average speed of a road is estimated. These accurate city big data provide new possibilities and ideas for solving these problems. The traffic information of the road surface is accurately predicted by using urban big data and an advanced artificial intelligence technology, so that valuable information references are provided for urban managers and planners, the traffic control capacity and the service efficiency are improved, the traveling experience of urban residents is optimized, and the problems and challenges encountered in urban development can be better handled. Meanwhile, the construction and development of a distributed platform and cloud computing also technically enable the capability of processing the large-scale urban data.

Data mining is a process that combines multiple techniques, such as artificial intelligence, machine learning, pattern recognition, statistics, databases, etc., to reveal implicit, previously unknown, and potentially valuable information from large amounts of data. There is now a lot of data that can be widely used, and there is an urgent need to convert these data into useful information and knowledge for guiding various fields such as city planning, market analysis and scientific exploration.

The timely and accurate traffic flow prediction can better establish an intelligent traffic system, provide accurate traffic information for urban residents, and bring excellent travel experience. However, due to the mobility of humans, people have different patterns of behavior on different days. People are more likely to drive from residential areas to work areas on weekdays, and prefer to go from residential areas to leisure areas such as parks, places of interest, or shopping malls on weekends, which results in people selecting different roads to reach destinations. Therefore, traffic flow shows strong cross-road network characteristics in terms of space. On the other hand, the traffic information is generated not only in relation to the previous time but also in relation to a period of history, but the data inputted in time series cannot directly represent the internal relation in different time scales, in other words, the traffic flow in different times has different influences on the traffic flow in the future, and therefore, the traffic flow shows a strong random disturbance characteristic in time. The urban traffic information data reflects different behaviors of urban residents from the side, people can characterize the mobility of people hidden in the traffic data by using a data mining method, and then the dynamic space-time traffic data is accurately modeled, so that new opportunities are brought to the construction of an intelligent traffic system.

Disclosure of Invention

The present invention is to overcome the above disadvantages of the prior art, and provides a traffic flow prediction method based on a spatiotemporal transform model considering human mobility.

The invention is just applying artificial intelligence method to model the traffic information data in the city. And the urban traffic flow is predicted, the change of the urban road in a period of time in the future can be mastered, the running state of the city is known, the traffic control efficiency is further improved, and the method has important practical significance for building an intelligent traffic system.

The invention achieves the aim through the following technical scheme: the traffic flow prediction method based on the graph space-time Transformer model considering human mobility is characterized by comprising the following steps of:

(1) regarding the road sensor nodes as graph nodes, regarding traffic information on the sensors as node attributes, representing the traffic information by graph structure data, and defining problems;

(2) modeling a plurality of behavior modes hidden in traffic information, and mining cross-road network characteristics of traffic data on the space by applying a multi-view graph convolution neural network (GCN) model;

(3) combining the characteristics obtained in the step (2), performing time characteristic representation on traffic information by using a Transformer neural network, and capturing the random disturbance of traffic data in time so as to determine a graph space-time Transformer model for traffic flow prediction;

(4) combining the space-time characteristics obtained in the step (3), carrying out experimental verification on real traffic data sets in two different areas, and finally determining the effectiveness of the graph space-time transform model;

(5) and (4) adopting the graph space-time transform model determined in the step (4) for traffic flow prediction.

Further, the step (1) specifically comprises the following steps:

1a) normalizing the original traffic data, and representing a road network by G-V, E and W, wherein V represents all road sets, E represents an edge set, and W represents the connection between nodes;

1b) counting the traffic data of different roads by five minutes, and expressing the traffic data as a characteristic matrix of nodes

h represents the length of the historical time, and N represents the number of nodes;

1c) given a road network G, a feature matrix

Aiming at finding a mapping function capable of learning spatiotemporal features from historical traffic information

Enabling it to predict the next traffic information.

Further, the step (2) specifically comprises the following steps:

2a) carrying out behavior mode division according to the traffic data obtained by statistics in the step (1), wherein people have different activity tracks on working days and rest days, so that the data are divided into two types according to the working days and the rest days, and the behavior mode hidden in the traffic information is represented;

2b) combining the data divided in the step (2a), calculating the traffic information similarity by using a Pearson correlation coefficient, wherein for the traffic information vector U, V of any two roads, the calculation formula is as follows:

wherein

The mean of the feature vectors is represented. Finally, a matrix A representing the similarity of any two road traffic information is obtained, and the more similar the two road traffic information is, the more consistent the behavior patterns of people are;

2c) for any two road nodes v, according to the physical characteristics of the road network itself_i,v_jThe weights are expressed as follows:

where θ represents the standard deviation of the data, dist (v)_i,v_j)²Representing the distance between any two nodes. Finally, obtaining a matrix D representing the physical characteristics of the road network;

2d) combining the plurality of feature matrices obtained in the steps (2b) and (2c), and characterizing different behavior patterns by using a GCN model, wherein a fast convolution formula on the graph structure data is as follows:

wherein

I_NRepresenting a unit matrix, D is a degree matrix D ═ Σ_jA_ij，λ_maxIs L^sysMaximum eigenvalue of the matrix, T_k(. cndot.) denotes a Chebyshev polynomial of order k. θ represents a vector of chebyshev coefficients. Meanwhile, the parameter vector is optimized by utilizing the neural network, and the layer-wise graph convolution neural network is adopted to extract the characteristics. The formula is as follows:

H^l+1＝σ(g_θH^lW^l+b^l) (4)

wherein H^lRepresenting the output of the neural network with residual structure, g_θThe constant term can be obtained by calculation, the parameter term to be optimized is optimized by the neural network parameters, and finally the spatial feature embedded vector representation of the traffic information data can be obtained.

Further, the step (3) specifically comprises the following steps:

3a) constructing an attention query vector Q, a key vector K and a value vector V by adopting linear projection in combination with the spatial feature representation obtained in the step (2);

Q＝ZW^Q,K＝ZW^K,V＝ZW^V (5)

3b) using the scaled dot product function as the calculation of the attention mechanism fraction, and combining the query vector Q, the key vector K, and the value vector V with different linear projections to construct a plurality of attention functions, forming a multi-headed attention mechanism.

MultiHead＝Concat(head₁,…,head_n)W^O (6)

Wherein

The scaled dot product function of the attention mechanism fractional computation can be expressed as

3c) And (3) combining the multi-head attention representation obtained in the step (3b), constructing a two-layer feedforward neural network, and adding residual connection to relieve the problem of gradient disappearance under a deep network.

FFN＝Wσ(WX+b)+b (7)

Where σ denotes the activation function, W and b denote learnable network parameters;

3d) building an encoder module and a decoder module by combining the step (3b) and the step (3c), stacking the plurality of encoder modules and the decoder module, and building a Transformer neural network for extracting the time characteristics of the traffic data;

3e) combining the step (2) and the step (3d), and combining the space-time characteristics to construct a map space-time transform model for traffic flow prediction.

Further, the step (4) specifically includes the following steps:

4a) and (3) combining the traffic data space characteristics extracted in the step (2) and the traffic data time characteristics extracted in the step (3), constructing a loss function, and training a graph space-time Transformer model and optimizing parameters.

Wherein N represents the number of samples, Y_iAnd

representing a real traffic information value and a predicted traffic information value;

4b) the traffic prediction problem is a classical regression problem. Therefore, in order to evaluate the prediction performance of the model, the Mean Absolute Error (MAE), the Mean Absolute Percentage Error (MAPE), and the Root Mean Square Error (RMSE) were selected as indicators. For MAE, RMSE and MAPE indices, smaller values indicate better prediction performance;

4c) inputting the real traffic data sets of two different regions into the model, training in a batch processing mode, and observing the performance of the model. Finally, a graph space-time Transformer model with the characteristic traffic data space characteristics and time trends can be obtained and applied to traffic flow prediction;

the innovation of the invention is that:

(1) a traffic prediction framework considering human mobility is provided, and the traffic data can be found out in cross-road network characteristics in space and random disturbance characteristics in time.

(2) Different adjacent matrixes under multiple views are constructed to reveal a graph structure which cannot be seen by traffic data, and a Transformer network is applied to simulate a time trend.

(3) Experimental validation was performed on two real traffic data sets in the united states.

The invention has the advantages that:

(1) and carrying out display modeling on different behavior modes hidden in the traffic data, and directly connecting nodes which are logically related to each other.

(2) The method does not strictly depend on data input according to time sequence, gives different weights to traffic flow at different time by multi-head attention, and better represents the time dynamics of traffic data.

Drawings

FIG. 1 is an overall flow chart of the present invention.

FIG. 2 is a flow chart of a multi-view convolutional neural network model used in the present invention.

FIG. 3 is a flow chart of a Transformer neural network model used in the present invention.

FIGS. 4(a) -4 (d) are graphs of results of different filter sizes in the example of METR-LA data set of the present invention, where FIG. 4(a) is a graph of the variation of training set loss function of spatio-temporal Transformer model at different filter sizes, FIG. 4(b) is a graph of the variation of validation set loss function of spatio-temporal Transformer model at different filter sizes, FIG. 4(c) is a graph of the variation of model evaluation indicator RMSE of spatio-temporal Transformer model at different filter sizes, and FIG. 4(d) is a graph of the variation of test set loss function of spatio-temporal Transformer model at different filter sizes.

Fig. 5(a) -5 (d) are graphs showing results of different behavior patterns in the example of the METR-LA data set according to the present invention, wherein fig. 5(a) is a graph showing the variation of the training set loss function of the spatio-temporal Transformer model in different behavior patterns, fig. 5(b) is a graph showing the variation of the verification set loss function of the spatio-temporal Transformer model in different behavior patterns, fig. 5(c) is a graph showing the variation of the test set loss function of the spatio-temporal Transformer model in different behavior patterns, and fig. 5(d) is a graph showing the variation of the model evaluation indicator RMSE of the spatio-temporal Transformer model in different behavior patterns.

Fig. 6(a) to fig. 6(d) are result graphs of different training cycles in the example of the METR-LA dataset according to the present invention, where fig. 6(a) is a result graph of a change in a real value and a predicted value in the METR-LA dataset when the spatio-temporal Transformer model is trained for 1 cycle, fig. 6(b) is a result graph of a change in a real value and a predicted value in the METR-LA dataset when the spatio-temporal Transformer model is trained for 15 cycles, fig. 6(c) is a result graph of a change in a real value and a predicted value in the METR-LA dataset when the METR-LA dataset model is trained for 30 cycles, and fig. 6(d) is a result graph of a change in a real value and a predicted value in the METR-LA dataset when the METR-spatio-temporal Transformer model is trained for 50 cycles.

FIGS. 7(a) -7 (d) are graphs of results of different filter sizes in the example of the PEMS-BAY dataset according to the present invention, where FIG. 7(a) is a graph of the variation of the training set loss function of the spatio-temporal Transformer model at different filter sizes, FIG. 7(b) is a graph of the variation of the validation set loss function of the spatio-temporal Transformer model at different filter sizes, FIG. 7(c) is a graph of the variation of the test set loss function of the spatio-temporal Transformer model at different filter sizes, and FIG. 7(d) is a graph of the variation of the model evaluation indicator RMSE of the spatio-temporal Transformer model at different filter sizes.

Fig. 8(a) -8 (d) are graphs showing results of different behavior patterns in the example of the PEMS-BAY data set according to the present invention, wherein fig. 8(a) is a graph showing the variation of the training set loss function of the spatio-temporal Transformer model in different behavior patterns, fig. 8(b) is a graph showing the variation of the verification set loss function of the spatio-temporal Transformer model in different behavior patterns, fig. 8(c) is a graph showing the variation of the test set loss function of the spatio-temporal Transformer model in different behavior patterns, and fig. 8(d) is a graph showing the variation of the model evaluation indicator RMSE of the spatio-temporal Transformer model in different behavior patterns.

Fig. 9(a) to 9(d) are result graphs of different training cycles in the example of the PEMS-BAY dataset according to the present invention, where fig. 9(a) is a result graph of changes of real values and predicted values in the PEMS-BAY dataset when the spatio-temporal Transformer model is trained for 1 cycle, fig. 9(b) is a result graph of changes of real values and predicted values in the PEMS-BAY dataset when the spatio-temporal Transformer model is trained for 15 cycles, fig. 9(c) is a result graph of changes of real values and predicted values in the PEMS-BAY dataset when the spatio-temporal Transformer model is trained for 30 cycles, and fig. 9(d) is a result graph of changes of real values and predicted values in the PEMS-BAY dataset when the spatio-temporal Transformer model is trained for 50 cycles.

Detailed Description

The invention is further described below in connection with examples of traffic flow predictions in los angeles and california, usa.

The overall framework of the traffic flow prediction method in this example is shown in fig. 1, and specifically includes the following steps:

(1) the statistics of the data set used in the present invention are as follows:

METR-LA the traffic data set contains the traffic speeds of 207 road sensors on the los Angeles expressway, USA. Furthermore, the readings of all sensors deployed on the road are summarized into a 5 minute window. We used data from 4/2/2012 to 5/31/2012 with a total of 17,280 time slices of 5 minutes per sensor.

PEMS-BAY the traffic data set is from the California department of transportation (Caltrans) Performance measurement System (PeMS). It contains the traffic speed of 325 road sensors in the bay. In addition, the readings of all sensors deployed on the road are also aggregated into a 5 minute window. We used data from 3/6/2017 to 4/30/2017 with 16,128 total 5 minute timeslices for each sensor.

(2) Taking traffic speed as an example, dividing data of two regions into two categories of working days and resting days according to time, and constructing a multi-view structure adjacency matrix, wherein the method specifically comprises the following steps:

a) physical road network distance matrix:

for any two roads v_i,v_jAnd a certain distance is formed between the real road networks, and the distance is coded to represent the geometric association relationship between the real road networks. The calculation is carried out by using a threshold Gaussian kernel function, and the calculation formula is as follows:

the closer the physical distance between the two road sections is, the larger the value of the Gaussian kernel function is, and finally, a road network distance matrix D representing any two road sections is obtained, the threshold value is set to be 0.1, and the size of the matrix is n multiplied by n.

b) A weekday and weekday similarity matrix:

traffic information is the result of human activity, and people take different modes of activity on different days. Therefore, we classify the data into two categories according to weekday and weekend. Then, we analyze the correlation between the road sensors by using the threshold Pearson correlation, and for the traffic information vector U, V of any two roads, the calculation formula is as follows: .

The more similar the traffic information of the two road sections is, the larger the value of the Pearson correlation coefficient is, the more similar the behavior patterns of people on the road sections are, and finally, a similarity matrix A representing the working day and the rest day of any two road sections is obtained, the threshold value is set to be 0.6, and the size of the similarity matrix is n multiplied by n.

(3) After the multi-view adjacency matrix is constructed, a GCN model is needed to aggregate node features through edge characteristics, and a new node representation is generated, wherein the adopted GCN model has the following formula:

H^l+1＝σ(g_θH^lW^l+b^l) (4)

And new nodes generated by GCN models under different views are fused, and the connection relation between the nodes is dynamically adjusted through a neural network, so that an adjacency matrix with more beneficial connections is generated, and the method has important significance for representing the cross-road network spatial characteristics of traffic data.

(4) After the GCN extracts the characteristics, a Transformer network is adopted, and the method specifically comprises the following steps:

a) construct the query vector Q, key vector K, and value vector V of attention using linear projection.

Q＝ZW^Q,K＝ZW^K,V＝ZW^V (5)

b) Using the scaled dot product function as the calculation of the attention mechanism fraction, and combining the query vector Q, the key vector K, and the value vector V with different linear projections to construct a plurality of attention functions, forming a multi-headed attention mechanism.

MultiHead＝Concat(head₁,…,head_n)W^O (6)

Wherein

c) And constructing two layers of feedforward neural networks, and adding residual connection to relieve the problem of gradient disappearance under a deep network.

FFN＝Wσ(WX+b)+b (7)

Where σ denotes the activation function and W and b denote learnable network parameters.

d) Combining the step (b) and the step (c) to construct an encoder module and a decoder module, overlapping a plurality of encoder modules and decoder modules to construct a transform neural network, extracting time characteristics of traffic data, and finally combining GCN models in different views to determine a map space-time transform model for traffic flow prediction

(5) Graph spatiotemporal Transformer model performance evaluation was performed on two real traffic data sets:

a) different filter sizes, in experiments we found the filter size of the Transformer network to be very important. Therefore, to obtain the best performance for each data set, we selected the number of filter sizes of the transform network from [128,256,512,1024,2048], and analyzed the variation in prediction accuracy. The training loss, validation loss, test loss, and RMSE for the METR-LA and PEMS-BAY datasets are shown in fig. 4 and 7, respectively.

b) Different behavior patterns, modeling different behavior patterns caused by human movement and inputting the different behavior patterns into the multi-view graph convolutional network, and hopefully, mining spatial features of the cross-road network. Therefore, to more intuitively illustrate the effectiveness and necessity of multiple behavioral patterns, we have eliminated the weekday pattern and weekend pattern modules from the method. The results of the ablation experiments for the METR-LA and PEMS-BAY datasets are shown in FIGS. 5 and 8, respectively.

c) Different training periods, in order to better understand the traffic prediction results, we visualize the prediction results of the proposed method. We compare the predicted traffic speed values for different training sessions with the ground truth values on these two data sets, and the experimental results are shown in fig. 6 and fig. 9, respectively.

Claims

1. The traffic flow prediction method based on the graph space-time Transformer model considering human mobility comprises the following steps:

2. The traffic flow prediction method based on a spatiotemporal Transformer model considering human fluidity according to claim 1, characterized in that: the step (1) specifically comprises the following steps:

1c) given a road network G, a feature matrix

Enabling it to predict the next traffic information.

3. The traffic flow prediction method based on a spatiotemporal Transformer model considering human fluidity according to claim 1, characterized in that: the step (2) specifically comprises the following steps:

wherein

where θ represents the standard deviation of the data, dist (v)_i,v_j)²Representing the distance between any two nodes. Finally obtaining the physical characteristics of the characterized road networkA characterized matrix D;

wherein

I_NRepresenting a unit matrix, D is a degree matrix D ═ Σ_jA_ij，λ_maxIs L^sysMaximum eigenvalue of the matrix, T_k(. h) represents a Chebyshev polynomial of order k; θ represents a vector of chebyshev coefficients. Meanwhile, the parameter vector is optimized by using the neural network, the layer-wise graph convolution neural network is used for feature extraction, and the formula is as follows:

H^l+1＝σ(g_θH^lW^l+b^l) (4)

4. The traffic flow prediction method based on a spatiotemporal Transformer model considering human fluidity according to claim 1, characterized in that: the step (3) specifically comprises the following steps:

Q＝ZW^Q,K＝ZW^K,V＝ZW^V (5)

3b) adopting a scaling dot product function as the calculation of the attention mechanism fraction, and combining the query vector Q, the key vector K and the value vector V with different linear projections to construct a plurality of attention functions to form a multi-head attention mechanism;

MultiHead＝Concat(head₁,…,head_n)W^O (6)

wherein the head_i＝Attention(QW_i ^Q,KW_i ^K,VW_i ^V) Note that the scaled dot product function of the fractional computation of the mechanism can be expressed as

3c) Constructing two layers of feedforward neural networks by combining the multi-head attention expression obtained in the step (3b), and adding residual connection to relieve the problem of gradient disappearance under a deep network;

FFN＝Wσ(WX+b)+b (7)

3e) combining the step (2) and the step (3d), and combining the spatio-temporal characteristics to construct a diagram spatio-temporal transform model.

5. The traffic flow prediction method based on a spatiotemporal Transformer model considering human fluidity according to claim 1, characterized in that: the step (4) specifically comprises the following steps:

4a) combining the traffic data space characteristics extracted in the step (2) and the traffic data time characteristics extracted in the step (3), constructing a loss function, and training a graph space-time Transformer model and optimizing parameters;

wherein N represents the number of samples, Y_iAnd

4b) the traffic prediction problem is a classical regression problem. Therefore, in order to evaluate the prediction performance of the model, the Mean Absolute Error (MAE), the Mean Absolute Percent Error (MAPE), and the Root Mean Square Error (RMSE) were selected as indicators; for MAE, RMSE and MAPE indices, smaller values indicate better prediction performance;

4c) inputting real traffic data sets of two different regions into a model, training in a batch processing mode, and observing the performance of the model; finally, a space-time Transformer model with a graph representing the spatial characteristics and the time trend of the traffic data can be obtained and applied to traffic flow prediction.