CN115482656B

CN115482656B - Traffic flow prediction method by using space dynamic graph convolutional network

Info

Publication number: CN115482656B
Application number: CN202210559673.5A
Authority: CN
Inventors: 周腾; 李桦樱; 杨舒敏
Original assignee: Shantou University
Current assignee: Shantou University
Priority date: 2022-05-23
Filing date: 2022-05-23
Publication date: 2023-09-26
Anticipated expiration: 2042-05-23
Also published as: CN115482656A

Abstract

The invention discloses a traffic flow prediction method by using a space dynamic graph convolution network, which firstly provides a novel attention fusion network comprising a space attention and a gating mechanism so as to generate a dynamic graph at a time node. The network aims to more effectively capture spatial dependencies through dynamic node embedding. Secondly, a dynamic diffusion map convolution loop module is presented that combines bi-directional diffusion convolution with a gated loop unit to capture both global spatial and temporal correlations. After the modules, the invention adds a residual connection switching network with zero initialization to extract long-term features. Finally, the space dynamic graph convolution network is evaluated on two real traffic data sets through a large number of experiments, and the experiments show that the space dynamic graph convolution network is superior to the current most advanced method.

Description

Traffic flow prediction method by using space dynamic graph convolutional network

Technical Field

The invention relates to the technical field of traffic data analysis, in particular to a traffic flow prediction method by using a space dynamic graph convolution network.

Background

In intelligent transportation systems (Intelligent Transport System, ITS), spatiotemporal prediction is an important task in real life. Everyone is working to improve the accuracy of predicting future traffic flows. It is beneficial for many applications including autopilot operation, energy and smart grid optimization, logistics and supply chain management. In this context, we have successfully improved predictive performance by addressing dynamic time complexity.

To capture complex time-dependence of historical time series of traffic nodes, recurrent neural networks (Recurrent Neural Network, RNNs) are introduced to predict traffic flow. At the same time, there is a significant spatial correlation between adjacent nodes of the traffic network. Researchers apply convolutional neural networks (Convolutional Neural Networks, CNN) to extract spatial correlations. CNNs can handle mesh structures well. The space-time cyclic convolutional network (Spatiotemporal recurrent convolutional network, SRCN) inherits the advantages of the deep convolutional neural network (deep convolutional neural networks, DCNN) and long short-term memory (LSTM) neural network.

Road networks are a typical non-European type of data, i.e. topology. These methods are poor at capturing non-euclidean correlations to predict traffic time series. To address this problem, most methods use graph rolling networks (Graph Convolutional Networks, GCN) to process spatio-temporal information. The graph convolutional neural network effectively extracts non-grid local spatial features through a Laplacian matrix. The network of graph convolution in combination with a recurrent neural network or Gate-CNN (gated convolutional network) is widely adopted in space-time graph modeling in traffic networks. The network achieves better performance in spatial and temporal modeling than convolutional and recurrent neural networks.

However, in the above prior art, there are the following main drawbacks.

First, the dynamic of efficiently extracting traffic data in both the temporal and spatial dimensions remains challenging. Most methods of graph convolution employ static graphs and fixed adjacency matrices to capture the spatial correlation of node feature representations. However, each node of the traffic network is dynamically changing. Graph convolution based on static graphs cannot dynamically capture the characteristics of nodes. The space-time diagram convolutional network based on the laplacian matrix (space-temporal graph convolutional networks, STGCN) proposed by big Yu et al ignores many useful spatial relationships hidden in traffic data. Gating time convolutional layer (Gated temporal convolution layer, gated TCN) and adaptive adjacency matrix are proposed in Graph WaveNet network to extract spatial and long distance time information, respectively, but it cannot mine correlation between different dynamic graphs

Second, the current network does not have sufficient time series acquisition. The cyclic neural based network is prone to problems of gradient extinction and explosion, so the capture effect is not ideal in long sequence models. Attention-based networks perform well in capturing long-term complexity, but do not efficiently extract local information.

The main reason for the above-mentioned disadvantages is that:

most of the applications of graph convolutional neural networks and recurrent neural networks in traffic flow prediction use a pre-defined matrix and static graph structure, which is constructed based on the distance between sensors in the road nodes. Traffic flow data is highly nonlinear and dynamically changes from time to time due to a variety of factors, so that the correlation between road networks dynamically changes over time, such as: for a certain period of time, two road nodes are physically connected, but in logical space they are weakly connected, weakly correlated. The dynamic graph obtained according to the current road data can truly reflect the strong and weak correlation between nodes, and plays a key role in learning expression of space nodes.

The cyclic neural network involves a large number of derivative operations on time sequences during back propagation, whether the gradient disappears or the gradient explodes, and is derived from the fact that the network structure is too deep, so that the network weight is unstable, and basically, the cyclic neural network is due to the continuous multiplication effect in the back propagation of the gradient. The long-short distance neural network forgets a part of the gradient through the gate function, so that the probability of gradient disappearance can be greatly reduced, but the problem of gradient explosion can not be solved yet. Attention-based networks focus more on the similarity of sequences of time nodes before and after than on local time similarities.

Disclosure of Invention

The technical problem to be solved by the embodiment of the invention is to provide a traffic flow prediction method by using a space dynamic graph convolution network. The method can effectively solve the problems that in traffic flow prediction, nodes of a node relation restriction map fixed in a static map further feature learning and long-time dependency capturing of a circulating neural network are insufficient.

In order to solve the technical problems, an embodiment of the present invention provides a traffic flow prediction method using a space dynamic graph convolutional network, which is characterized by comprising the following steps:

step 1: construction network diagram structureGEstablishing an adjacency matrix by calculating the distance between the sensors and the paired road network and utilizing a threshold Gaussian kernelA；

Step 2: graph fusion network construction, traffic flow data for current time nodeX _t First, capturing a global graph node characteristic MAH under the current time by using a spatial attention mechanism ^t Secondly, adopting a gating mechanism to make each node characteristic H of the previous time period _t-1 Node characteristic MAH obtained by spatial attention mechanism under current time ^t Performing adaptive fusion to obtain attention-based image filtering AF _t ；

Step 3: a dynamic diffusion convolution module construction using bi-directional diffusion convolution to capture more of the effects from upstream and downstream traffic flows;

step 4: the time correlation module is constructed by a graph rolling circulation unit and a residual error connection conversion network with zero initialization, wherein the graph rolling circulation unit captures time dependence by using a gating circulation unit and replaces a matrix multiplication part in the gating circulation unit by using a dynamic diffusion convolution module, and the output H of each graph rolling circulation unit at time t _t The characteristics of long-time dependence of the residual connection conversion network mining node with zero initialization can be input;

step 5: prediction layer output, HO derived from last layer residual connection transition network output with zero initialization ^’(i) And obtaining the final traffic flow prediction output through linear transformation.

2. The traffic flow prediction method using a spatially dynamic graph convolutional network according to claim 1, wherein the adjacency matrix of step 1AThe formula expression is:

where V represents a set of nodes, E represents a set of edges,representative node v _i And node v _j The distance between K is the threshold value for controlling the sparsity of the adjacency matrix A, sigma is the standard deviation of the distance, +.>An exponential function based on a natural constant e is represented.

Further, the formula expression of the spatial attention mechanism in the step 2 is:

wherein the traffic flow dataLinearly transformed into three matrices Q _t =X _t W _Q ，K _t =X _t W _K And V is equal to _t =X _t W _V ；W _Q ，W _K ，W _V Three learnable parameter matrices; softmax (·) represents the normalized exponential function; attention (&) is a function represented by an implementation Attention mechanism to obtain node characteristics; w (W) ^o Is a learnable projection matrix; />Indicating the output of the ith attention at time t and Concat (·) indicating the splice operation.

Further, the graph filtering AF in the step 2 _t The formula expression of (2) is:

Wz,1，W _z,2 ，W _h ，b _h ，W _z,l are all learnable parameters; sigma (·) represents the activation function, which will randomly initialize the model parametersAttention filtering AF _t The corresponding elements are multiplied to obtain a dynamic graph node, and the formula is as follows:

the parameter beta is a super parameter for controlling the saturation rate of the activation function;<>representing the Hadamard inner product; tanh (·) is the activation function;seen as source dynamic node embedding, +.>The dynamic adjacency matrix DA is obtained by calculating the similarity between the source dynamic node embedding and the target dynamic node embedding ^t

ReLU (·) is an activation function, and by constructing a graph fusion network, a corresponding dynamic adjacency matrix is generated at each time node.

Further, the formulation in step 3 to capture more effects from upstream and downstream traffic flows using bi-directional diffusion convolution is:

wherein ,representing a transfer matrix; />Are non-outbound diagonal matrices; a represents a static adjacency matrix; /> and />Representing a transfer matrix P _f ，P _b Power series of (a); θ _k,f ，θ _k,b Is a learnable parameter that extracts relationships on spatial nodes from multiple angles, as shown below:

representing a dynamic forward transfer matrix; />Representing a dynamic reverse transfer matrix; θ _k,f ，θ _k,b ，θ _k,df ，θ _k,db Are all learnable parameters.

The embodiment of the invention has the following beneficial effects:

1. the static graph adjacency matrix can not completely reflect the node relation under each time node, and has certain limitation on graph node characteristic extraction. The attention fusion network provided by the invention generates attention force diagram filtering at each time node, and the filtering and the learning parameters are combined to generate dynamic node embedding; by calculating the similarity of embedding the dynamic nodes, a dynamic adjacency matrix under each time node can be obtained. Compared with a static fixed adjacency matrix, the dynamic adjacency matrix under each time node reflects the road network node relation according to the current traffic data, so that node characteristic learning is performed.

2. The traditional cyclic neural network has poor effect of capturing long-time dependence, and gradient explosion can occur. The present invention captures the dependency of each node in the time dimension by combining a graph convolution loop unit with a connection transition network with zero initialization residuals. In the graph convolution loop unit, the time and space dependency capture can be performed simultaneously, and the neuron forward propagation capacity of the converter is improved by connecting the zero initialization residual error to the converter so as to explore the deeper long-time dependency.

Drawings

FIG. 1 is a structure of a spatially dynamic graph convolutional neural network;

FIG. 2 is a schematic diagram of an attention fusion network;

fig. 3 is a transducer architecture with zero initialization residual connection.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings, for the purpose of making the objects, technical solutions and advantages of the present invention more apparent.

As shown in FIG. 1, the traffic flow prediction method using the space dynamic graph convolutional network can effectively solve the problems of further feature learning of nodes of a node relation restriction graph fixed in a static graph and insufficient long-time dependency capturing of a cyclic neural network in traffic flow prediction. The invention realizes traffic flow prediction according to the following steps:

step 1: constructing a network diagram structure g= (V, E, a). The adjacency matrix A is established by calculating the distance between the sensors and the paired road network and utilizing a threshold Gaussian kernel, and the formula is expressed as follows:

(1)

v represents the set of nodes, E represents the set of edges,representative node v _i And node v _j The distance between K is the threshold value for controlling the sparsity of the adjacency matrix A, sigma is the standard deviation of the distance, +.>An exponential function based on a natural constant e is represented.

Step 2: and (5) constructing a graph fusion network. The graph fusion network is used to generate a dynamic adjacency matrix. Traffic flow data for current time nodeFirst, using the spatial attention mechanism, as shown in fig. 2, the global graph node feature MAH at the current time is captured as follows ^t 。

Here, traffic flow dataLinearly transformed into three matrices Q _t =X _t W _Q ，K _t =X _t W _K And V is equal to _t =X _t W _V ；W _Q ，W _K ，W _V Three learnable parameter matrices; softmax (·) represents the normalized exponential function; attention (&) is a function represented by an implementation Attention mechanism to obtain node characteristics; w (W) ^o Is a learnable projection matrix; />Indicating the output of the ith attention at time t and Concat (·) indicating the splice operation.

Then, the gating mechanism is adopted to characteristic each node in the previous time periodNode characteristic MAH obtained by spatial attention mechanism under current time ^t Performing adaptive fusion to obtain attention-based image filtering AF _t The formula is as follows:

（4）

Wz,1，W _z,2 ，W _h ，b _h ，W _z,l are all learnable parameters; sigma (·) represents the activation function, which will randomly initialize the model parametersAttention filtering AF _t Corresponding elementMultiplying to obtain a dynamic graph node, wherein the formula is as follows:

（5）

the parameter beta is a super parameter for controlling the saturation rate of the activation function;<>representing the Hadamard inner product; tanh (·) is the activation function;seen as source dynamic node embedding, +.>The dynamic adjacency matrix DA is obtained by calculating the similarity between the source dynamic node embedding and the target dynamic node embedding ^t 。

ReLU (·) is the activation function. And 2, constructing a fusion network through the step 2, and generating a corresponding dynamic adjacency matrix under each time node. The following steps describe how to combine static and dynamic graphs to perform graph node feature learning.

Step 3: and (5) constructing a dynamic diffusion convolution module. The diffusion convolution recurrent neural network (Diffusion Convolutional Recurrent Neural Network, DCRNN) uses bi-directional diffusion convolution to capture more of the effects from upstream and downstream traffic flows, as shown in (7).

（7）

wherein ,representing a transfer matrix; />Are non-outbound diagonal matrices; as shown in (1), A represents a static adjacency matrix; /> and />Representing a transfer matrix P _f ，P _b Power series of (a); θ _k,f ，θ _k,b Is a learnable parameter. The invention improves on the basis of (7), considers the diffusion convolution of the dynamic graph, extracts the relation on the space nodes from a plurality of angles, and has the following formula:

(8)

representing a dynamic forward transfer matrix; />Representing a dynamic reverse transfer matrix; θ _k,f ，θ _k,b ，θ _k,df ，θ _k,db All the learnable parameters are learnable parameters. Equation (8) shows that the bidirectional diffusion convolution of the static diagram and the bidirectional diffusion convolution process of the dynamic diagram together form a dynamic diffusion convolution module.

Step 4: and (5) constructing a time correlation module. The time correlation module consists of a graph convolution loop unit (Graph Convolution Recurrent Unit, GCRU) and a residual connection conversion network (rezero transformer) with zero initialization, as shown in fig. 3. In the graph convolution loop unit, the present invention uses a gating loop unit to capture the time dependence. Wherein the dynamic diffusion convolution module replaces a matrix multiplication part in the gating cyclic unit. The process will be the current timeTraffic flow data->The graph convolution loop unit outputs H at time t-1 _t-1 And (3) performing splicing, and putting the spliced yarns into a graph convolution circulation unit, wherein the formula is expressed as follows:

(9)

r ^(t) is a reset gate, u ^(t) Is an update gate. ⋆ G represents a dynamic diffusion convolution as shown in (8).Is a learnable parameter of the diffusion convolution layer. The temporal and spatial features of the node can be captured simultaneously by means of a graph convolution loop unit.

Next, the output H of each graph convolution loop unit at time t _t The characteristics of long-term dependence of the residual connection switching network mining node with zero initialization are input. Residual connection conversion networks (rezerotransformers) with zero initialization are improved over conversion networks (transformers). It promotes forward propagation of the conversion network (transformer) by means of the dynamic equidistant idea to promote its dependency capture at long-term nodes. It removes the original converter normalization process while preserving the position coding, multi-headed attention mechanism and forward propagating modules and uses residual connection with zero initialization (recro). Position-coding PE _t Is represented by a sine function and a cosine function:

H _t,i representing the output of the graph convolution unit at time t node i, d _model Representing the dimensions of the hidden layer. Initially, the nodes at each time are spliced after position codingAs input to the layer 1 rezero switching network. In->Layer rezero converter, input H ^,(i-1) Output HO which is the upper layer i-1 ^,(i-1) Input is firstly processed by a multi-head attention mechanism, and then residual connection is performed to obtain H ^,i And obtaining final output HO of the layer through forward propagation and residual connection ^,(i) 。

Alpha is a learnable parameter that the multi-headed attention mechanism shares with forward propagation.

Step 5: and outputting a prediction layer. HO derived from last layer residual connection transition network output with zero initialization ^,(i) Obtaining the final traffic flow prediction output through linear transformationThe formula is expressed as follows:

W _o and b_o Is a learnable parameter.

Finally, a space dynamic graph rolling network can be built through the 5 steps to conduct traffic flow prediction tasks.

Compared with the prior art, the invention has the following advantages:

most of the existing methods adopt a traffic road Network graph structure constructed based on distance, a space-time graph convolution Network (space-Temporal Graph Convolutional Networks, STGCN) carries out graph convolution learning node characteristics based on a static graph structure, meanwhile, a plurality of types of static graphs are constructed from a plurality of semantic angles, such as a road Network graph constructed based on time similarity, a road Network graph constructed based on POI (point of interest) and the like, a graph structure based on road risks is constructed by a geographic semantic space-time Network (A Geographical and Semantic spatial-temporal Network, GSNet), the graph structure based on distance and the graph structure based on POI carries out graph convolution to carry out traffic accident prediction. Such methods are summarized as follows:

(13) The formula is to construct the corresponding graph structure according to different semantic angles.Representing an adjacency matrix corresponding to the corresponding semantics; />Representing the corresponding similarity function. (14) is a graph convolution formula. The graph Fourier radix U is a matrix formed by eigenvectors of the normalized graph Laplace matrix L. Drawing Laplace matrix>，I _n Is an identity matrix, D is a diagonal matrix, wherein +.>⋀ is a diagonal matrix, in which the eigenvalues of L are stored. />Representing a filter, and a represents the constructed adjacency matrix therein.

However, the graph structures are static graphs, and the limited node relation representation limits feature learning in the graph node space. To overcome the difficulty of a static graph with limited representation of node relationships, an adaptive matrix is proposed that can be updated in an end-to-end learning process. The graph WaveNet proposes an adaptive matrix to learn graph nodes, but the adaptive matrix cannot achieve dynamics, and in each cycle, the graph of each time node is static, and there is a possibility that node relations with large area missing exist. Graph wavenet uses diffusion convolution with adaptive adjacency matrix and static graph to perform graph node feature learning summarized as follows:

for formula (15), E ₁ Embedding for the source node; e (E) ₂ Embedding for the target node; sosftmax (·) is the normalization function and ReLU (·) is the activation function;is an adaptive adjacency matrix. For formula (16), p ^k A power series representing a transition matrix;is a learnable parameter; x is a traffic signal.

In long-term dependent capture, most use is made of recurrent neural networks, which are prone to gradient extinction or gradient explosion in long-sequence-dependent capture. A diffusion convolution recurrent neural network (Diffusion Convolutional Recurrnet Neural Network, DCRNN) employs a gated loop unit and replaces the matrix multiplied portion of the gated loop unit with diffusion convolution to capture both spatial and temporal dependencies. The time convolution network (Temporal Convolutional Network, TCN) is a generic structure that extracts features in the time dimension, and Graph WaveNet proposes to combine time convolution with a gatekeeper to capture the time dependence. However, to capture a long-term sequence over time, TCN requires the superposition of multiple convolutional layers to connect any two positions in the sequence. But this compromises the ability of TCNs to learn long-term dependencies.

In the space dynamic graph rolling network, the self-adaptive adjacency matrix of (15) is different from the static graph constructed by the formula (13). Capturing the characteristics of the current time node by using a door mechanism in an attention fusion network, adaptively fusing the characteristics of the current time node with the characteristics of the previous time node to obtain attention filtering, generating dynamic node embedding pairs by the attention filtering according to formulas (5) and (6), and obtaining a dynamic adjacent matrix DA corresponding to each time node by calculating the similarity of the dynamic node embedding pairs ^t . As shown in equation (8), the dynamic adjacency matrix and the static adjacency matrix are applied in a bi-directional diffusion convolution to capture the correlation of the nodes of the upstream and downstream spatial effects. Meanwhile, in a graph convolution cyclic unit (GCRU), matrix multiplication of the gating cyclic unit is replaced with a dynamic diffusion convolution module to capture the correlation of time and space simultaneously. In long-term dependency capture, unlike the past methods, as shown in equation (10), a residual join (recro) with zero initialization is applied into the conversion network (transducer) to explore deep long-term dependencies.

The above disclosure is only a preferred embodiment of the present invention, and it is needless to say that the scope of the invention is not limited thereto, and therefore, the equivalent changes according to the claims of the present invention still fall within the scope of the present invention.

Claims

1. A method for traffic flow prediction using a spatial dynamic graph convolutional network, comprising the steps of:

step 1: construction network diagram structureGBy calculating the distance between the sensors and the roads, an adjacency matrix is established by using a threshold Gaussian kernelA；

Step 2: graph fusion network construction, traffic flow data for current time nodeX _t First, capturing a global graph node characteristic MAH under the current time by using a spatial attention mechanism ^t Secondly, adopting a gating mechanism to make each node characteristic H of the previous time period _t-1 With the section obtained by the spatial attention mechanism at the current timePoint characteristics MAH ^t Performing adaptive fusion to obtain attention-based image filtering AF _t The method comprises the steps of carrying out a first treatment on the surface of the Will randomly initialize model parametersAttention filtering AF _t Multiplying corresponding elements to obtain dynamic graph nodes, and obtaining dynamic adjacency matrix DA according to the dynamic graph nodes ^t The method comprises the steps of carrying out a first treatment on the surface of the Through the construction of the graph fusion network, a corresponding dynamic adjacency matrix is generated under each time node;

step 3: a dynamic diffusion convolution module construction using bi-directional diffusion convolution to capture more of the effects from upstream and downstream traffic flows; obtaining the bidirectional diffusion convolution of the static diagram according to the static adjacent matrix A, obtaining the bidirectional diffusion convolution of the dynamic diagram according to the dynamic adjacent matrix DAt, and forming a dynamic diffusion convolution module together by the bidirectional diffusion convolution of the static diagram and the bidirectional diffusion convolution process of the dynamic diagram;

step 4: the time correlation module is constructed by a graph rolling circulation unit and a residual error connection conversion network with zero initialization, wherein the graph rolling circulation unit captures time dependence by using a gating circulation unit and replaces a matrix multiplication part in the gating circulation unit by using a dynamic diffusion convolution module, and the output H of each graph rolling circulation unit at time t _t Is input with the characteristic of long-time dependence of the residual connection conversion network mining node with zero initialization to obtain output HO ^’(i) ；

；

3. The traffic flow prediction method using the spatial dynamic convolutional network according to claim 2, wherein the formula of the spatial attention mechanism in step 2 is:

；

wherein the traffic flow data X _t Linearly transformed into three matrices Q _t =X _t W _Q ，K _t =X _t W _K And V is equal to _t =X _t W _V ；W _Q ，W _K ，W _V Three learnable parameter matrices; softmax (·) represents the normalized exponential function; attention (&) is a function represented by an implementation Attention mechanism to obtain node characteristics; w (W) ^o Is a learnable projection matrix;indicating the output of the ith attention at time t and Concat (·) indicating the splice operation.

4. Traffic flow prediction using spatially dynamic graph convolutional network of claim 3The method is characterized in that the image filtering AF in the step 2 _t The formula expression of (2) is:

；

Wz,1，W _z,2 ，W _h ，b _h are all learnable parameters; sigma (·) represents the activation function, which will randomly initialize the model parametersAttention filtering AF _t The corresponding elements are multiplied to obtain a dynamic graph node, and the formula is as follows:

；

the parameter beta is a super parameter for controlling the saturation rate of the activation function;< >representing the Hadamard inner product; tanh (·) is the activation function;seen as source dynamic node embedding, +.>The dynamic adjacency matrix DA is obtained by calculating the similarity between the source dynamic node embedding and the target dynamic node embedding ^t

；

5. The traffic flow prediction method using a spatial dynamic graph convolution network according to claim 4, wherein the formula for capturing more influence from upstream and downstream traffic flows using bi-directional diffusion convolution in step 3 is:

；

wherein ,representing a transfer matrix; />Are non-outbound diagonal matrices; /> and />Representing a transfer matrix P _f ，P _b Power series of (a); />Representing a dynamic forward transfer matrix; />Representing a dynamic reverse transfer matrix; θ _k,f ，θ _k,b ，θ _k,df ，θ _k,db Are all learnable parameters.