CN116307152A - Traffic prediction method for space-time interactive dynamic graph attention network - Google Patents

Traffic prediction method for space-time interactive dynamic graph attention network Download PDF

Info

Publication number
CN116307152A
CN116307152A CN202310209521.7A CN202310209521A CN116307152A CN 116307152 A CN116307152 A CN 116307152A CN 202310209521 A CN202310209521 A CN 202310209521A CN 116307152 A CN116307152 A CN 116307152A
Authority
CN
China
Prior art keywords
traffic
network
representing
graph
dynamic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310209521.7A
Other languages
Chinese (zh)
Inventor
韩雪
丁治明
郭黎敏
陈雅君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202310209521.7A priority Critical patent/CN116307152A/en
Publication of CN116307152A publication Critical patent/CN116307152A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • General Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Marketing (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Development Economics (AREA)
  • Data Mining & Analysis (AREA)
  • Primary Health Care (AREA)
  • Quality & Reliability (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Operations Research (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a traffic prediction method of a space-time interactive dynamic graph attention network. The invention provides an interactive learning module, which is used for carrying out staggered downsampling on acquired traffic data according to time intervals, generating a new dynamic diagram through an interactive learning strategy and synchronously capturing the space-time dependence of the partitioned traffic data. The invention also provides a dynamic graph attention module which captures the spatial correlation of dynamic changes in the traffic network on the basis of the dynamic graph generation module. The dynamic graph generating module generates a dynamic graph structure based on the historical information and the input data, and the structure can deeply mine dynamic relevance in the network topology. The invention can improve the accuracy of traffic prediction, relieve traffic pressure and improve the toughness of urban traffic.

Description

Traffic prediction method for space-time interactive dynamic graph attention network
Technical Field
The invention belongs to the technical field of traffic prediction and the field of deep learning, and particularly relates to a traffic prediction method of a space-time interactive dynamic graph attention network.
Background
The control and the guidance of the traffic flow are important research directions of intelligent traffic, and the real-time accurate traffic flow prediction is the basis of traffic flow induction, is an important means for vehicle evacuation in a congestion area, and is one of the core contents of intelligent traffic. Short-time traffic flow prediction relies on a big data environment, and traffic flow in a certain road section or even in the whole city range in a future period can be accurately predicted by means of artificial intelligence, cloud computing and the like, so that the travel efficiency and the safety of urban vehicles are effectively improved, urban traffic management departments can conveniently and fully balance traffic loads of road networks, full coordination of traffic resources is realized, the transportation efficiency of the road networks is maximized, the traffic jam degree is effectively reduced, and therefore, great practical application value and profound practical significance are provided for developing and deeply researching the theory and the method of traffic flow prediction of the road networks.
In recent years, with the continuous development of socioeconomic level, large cities are developing toward smart cities, and Intelligent Transportation Systems (ITS) have received considerable attention as an important component in smart cities. The traffic prediction is taken as an important component of an intelligent traffic system, and can provide support in aspects of traffic jam control, reasonable path planning and the like. The existing methods for solving the urban traffic prediction problem mainly comprise three types: a statistical-based predictive model, a nonlinear theoretical model, and an intelligent theory-based predictive model. For a statistical-based prediction model, a historical averaging method is the earliest statistical-based prediction model, the method assumes that the traffic flow at a certain point is operated according to a certain rule, the current traffic flow measured at the point and the historical data are added and averaged, and the obtained value is taken as the traffic flow value at the next moment of the point. These methods have great limitations on modeling of nonlinear data, and perform poorly in the practical practice of current data rapid growth. The nonlinear theory model is a model based on the nonlinear theory such as chaos theory, synergetic theory, system dynamics and the like, and is expressed in the traffic flow prediction field as analyzing the nonlinear characteristics of traffic flow by utilizing various nonlinear theory, so that the nonlinear model is established. There are typically models based on mutation theory, chaotic models, wavelet analysis, etc. Prediction model based on intelligent theory: the traffic flow prediction model based on the intelligent theory is based on a modern big data analysis technology, has no fixed model structure and no fixed relation between input and output, extracts valuable information through training of different modes of input information, and predicts traffic flow by using the extracted information. Currently, researchers have used KNNs, automatic encoders, etc. to identify traffic patterns, GNNs, GCNs, etc. to model traffic road network structures and extract spatial features, LSTM, etc. to model temporal correlations of traffic flows, etc.
While current deep learning exhibits superior performance in terms of the ability of modeling traffic flow data and extracting complex data features, there are still some drawbacks: the partial model only considers the time correlation, but ignores the space correlation; the dynamic change of the traffic condition is ignored by part of the model, and only the static adjacency matrix is established, so that the prediction capability of the model is reduced; the partial model has the problem of gradient disappearance or gradient explosion, so that data training is difficult to perform.
Aiming at the challenges, the invention provides a traffic prediction method of a space-time interactive dynamic graph attention network to realize efficient and accurate prediction of urban traffic flow data.
Disclosure of Invention
Aiming at the defects of the current model, the invention creatively provides a new traffic prediction method of a space-time interactive dynamic graph attention network, which can synchronously capture time correlation and space correlation, and can learn space-time dependence through an interactive learning structure and a dynamic graph convolution network, thereby realizing accurate and effective traffic flow prediction.
In order to achieve the above purpose, the invention adopts the following technical scheme:
a traffic prediction method for a spatio-temporal interactive dynamic graph attention network, the method comprising the steps of:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
Preferably, the preprocessing operation of the historical traffic flow data in step 1 includes the following steps:
step 1: the invention adopts a Z-Score method to avoid the influence of abnormal values and extreme values, thereby obtaining preprocessed application data.
Preferably, step 2 includes the following steps for traffic network information and traffic prediction problem definition:
step 2-1: defining a traffic network as a weighted directed graph g= (V, E, a), wherein V represents a set of nodes, |v|=n, which is a set of road segments or sensors in the traffic network; e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;
Figure BDA0004112105860000021
the weighted adjacency matrix of the graph G is represented, and the element values in the a are represented by the distance between two nodes, which can be specifically represented by:
Figure BDA0004112105860000022
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation;
step (a)2-2: defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequence
Figure BDA0004112105860000031
On the basis of which future traffic sequences of the traffic network are predicted
Figure BDA0004112105860000032
The traffic prediction problem is defined as:
Figure BDA0004112105860000033
wherein,,
Figure BDA0004112105860000034
representing the observation value of graph G at time step t,/for>
Figure BDA0004112105860000035
N represents the number of nodes in the traffic network, and C represents the number of characteristic channels; t represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
Preferably, the step 3 of constructing the interactive learning module includes the following:
step 3: for most time series data, the interleaved downsampled subsequence still retains most of the information of the original sequence due to trending and proximity; the invention adopts a processing mode of staggered downsampling to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume that
Figure BDA0004112105860000036
For the input traffic flow data, the subsequences of the divided parity index sections of X are respectively expressed as
Figure BDA0004112105860000037
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X′ odd +tanh(DGAT(Conv(X′ even )))
X even_out =X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,
Figure BDA0004112105860000038
and->
Figure BDA0004112105860000039
Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>
Figure BDA00041121058600000310
For a real number set, C represents the number of characteristic channels, N represents the number of nodes in a traffic network, and T represents the length of a given historical time sequence; />
Figure BDA00041121058600000311
And->
Figure BDA00041121058600000312
Representing a traffic flow subsequence finally obtained after two interactive learning; as indicated by the letter Hadamard product, tan h is the activation function, DGAT is the dynamic attention model, conv is the one-dimensional convolution;
preferably, step 4 of constructing the dynamic graph attention model includes the following:
step 4-1: the structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to better explore the urban road with deeper levelThe spatial characteristics of the road network improve the performance of capturing spatial heterogeneity by the model; wherein the graph generator module is to hide features
Figure BDA00041121058600000313
And a predefined initial adjacency matrix->
Figure BDA00041121058600000314
Figure BDA00041121058600000315
As input; at input +.>
Figure BDA00041121058600000316
After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
in the process, ,
Figure BDA0004112105860000041
representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->
Figure BDA0004112105860000042
Representing input features in a road network, +.>
Figure BDA0004112105860000043
Representing an initial weighted adjacency matrix of each node in the road network;
step 4-2: in addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Figure BDA0004112105860000044
Figure BDA0004112105860000045
Wherein,,
Figure BDA0004112105860000046
and->
Figure BDA0004112105860000047
Is a trainable parameter of the model, +.>
Figure BDA0004112105860000048
For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
Figure BDA0004112105860000049
Step 4-3: using an adaptive fusion architecture will A apt And A', and fusing the adjacent matrix A obtained after fusing dyn Input into GAT for dynamic association simulation between nodes and exploring undiscovered node connections in the road network, the fusion operation may be defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
Step 4-4: the GAT module is defined as follows:
Figure BDA00041121058600000410
wherein X is in To input data A apt The initial value is an adjacency matrix predefined by a priori map
Figure BDA00041121058600000411
Representing a parameter matrix.
Preferably, the step 5 of evaluating the traffic prediction model of the time-space interactive dynamic graph attention network includes the following steps:
step 5: the invention adopts average absolute error MAE (representing the average value of absolute error between actual value and predicted value), average absolute percent error MAPE (representing the average value of absolute percent error between actual value and predicted value) and root mean square error RMSE (representing the arithmetic square root of mean square error between actual value and predicted value) as the evaluation index for evaluating the prediction performance of the module, and the evaluation index is calculated as follows:
Figure BDA00041121058600000412
Figure BDA00041121058600000413
Figure BDA0004112105860000051
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,
Figure BDA0004112105860000052
represents Y i And the corresponding time corresponds to the predicted value of the traffic flow of the node.
Compared with the prior art, the invention has the following beneficial effects and advantages:
most of the current research is limited to mining time and space characteristics respectively, so that the model cannot learn the characteristics of traffic flow data comprehensively. The invention provides a new traffic prediction model of a space-time interactive dynamic graph attention network, which embeds a graph attention mechanism into an interactive learning structure and synchronously captures time and space correlations. The time-space dependency can be learned through the interactive learning structure and the dynamic graph attention network, so that the characteristics of the nodes are comprehensively learned, and the prediction accuracy is further improved.
Drawings
FIG. 1 is a general framework of a traffic prediction model of a spatio-temporal interactive dynamic graph attention network provided by the invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention.
The method obtains a large amount of traffic flow data from sensors of the urban road network, performs data cleaning on the traffic flow data, and obtains the attributes such as longitude and latitude, flow value, predicted starting time, predicted ending time and the like after finishing. The method is widely applicable to various time sequence prediction fields based on the dynamic graph attention neural network, and effectively processes complex time sequence data. FIG. 1 is a general framework diagram of the model of the present invention, which effectively improves the overall predictive performance of the model. The specific implementation is as follows:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
Specifically, the preprocessing operation for the historical traffic flow data includes the following steps: the invention adopts a Z-Score method to avoid the influence of abnormal values and extreme values, thereby obtaining preprocessed application data.
Specifically, the definition of traffic network information and traffic prediction problems includes the following:
(1) Defining a traffic network as a weighted directed graph g= (V, E, a), where V represents a set of nodes, |v|=n, which is a road segment or sensor in the traffic networkIs a collection of (3); e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;
Figure BDA0004112105860000061
the weighted adjacency matrix of the graph G is represented, and the element values in the a are represented by the distance between two nodes, which can be specifically represented by:
Figure BDA0004112105860000062
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation;
defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequence
Figure BDA0004112105860000063
On the basis of which future traffic sequences of the traffic network are predicted
Figure BDA0004112105860000064
The traffic prediction problem is defined as:
Figure BDA0004112105860000065
wherein,,
Figure BDA0004112105860000066
representing the observation value of graph G at time step t,/for>
Figure BDA0004112105860000067
N represents the number of nodes in the traffic network, and C represents the number of characteristic channels; t represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
Specifically, constructing the interactive learning module includes the following: for most time series data, the interleaved downsampled subsequence still retains most of the information of the original sequence due to trending and proximity; the invention adopts a processing mode of staggered downsampling to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume that
Figure BDA0004112105860000068
Figure BDA0004112105860000069
For the input traffic flow data, the subsequences of the divided parity index sections of X are respectively expressed as
Figure BDA00041121058600000610
Figure BDA00041121058600000611
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X odd +tanh(DGAT(Conv(X′ even )))
X even_out =X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,
Figure BDA00041121058600000612
and->
Figure BDA00041121058600000613
Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>
Figure BDA0004112105860000071
For a real number set, C represents the number of characteristic channels, N represents the number of nodes in a traffic network, and T represents the length of a given historical time sequence; />
Figure BDA0004112105860000072
And->
Figure BDA0004112105860000073
Representing a traffic flow subsequence finally obtained after two interactive learning; as a result, hadamard product, tanh as the activation function, DGAT as the dynamic attention model, and Conv as the one-dimensional convolution operation.
Specifically, constructing the dynamic graph attention model includes the following:
(1) The structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to better explore the spatial characteristics of a deeper urban road network, and improves the performance of capturing spatial heterogeneity of the model; wherein the graph generator module is to hide features
Figure BDA0004112105860000074
And a predefined initial adjacency matrix
Figure BDA0004112105860000075
Figure BDA0004112105860000076
As input; at input +.>
Figure BDA0004112105860000077
Thereafter (I)>
Figure BDA0004112105860000078
After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
wherein,,
Figure BDA0004112105860000079
representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->
Figure BDA00041121058600000710
Representing input features in a road network, +.>
Figure BDA00041121058600000711
Representing an initial weighted adjacency matrix of each node in the road network;
(2) In addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Figure BDA00041121058600000712
Figure BDA00041121058600000713
Figure BDA00041121058600000714
Wherein,,
Figure BDA00041121058600000715
and->
Figure BDA00041121058600000716
Is a trainable parameter of the model, +.>
Figure BDA00041121058600000717
For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
Figure BDA00041121058600000718
(3) Using an adaptive fusion architecture will A apt And A', fusing, and obtaining the adjacentConnection matrix A dyn Input into GAT for dynamic association simulation between nodes and exploring undiscovered node connections in the road network, the fusion operation may be defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
(4) The GAT module is defined as follows:
Figure BDA00041121058600000719
wherein X is in To input data A apt The initial value is an adjacency matrix predefined by a priori map
Figure BDA00041121058600000720
Representing a parameter matrix.
Specifically, evaluating a traffic prediction model of a time-space interactive dynamic graph attention network includes: the invention adopts average absolute error MAE (representing the average value of absolute error between actual value and predicted value), average absolute percent error MAPE (representing the average value of absolute percent error between actual value and predicted value) and root mean square error RMSE (representing the arithmetic square root of mean square error between actual value and predicted value) as the evaluation index for evaluating the prediction performance of the module, and the evaluation index is calculated as follows:
Figure BDA0004112105860000081
Figure BDA0004112105860000082
Figure BDA0004112105860000083
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,
Figure BDA0004112105860000084
represents Y i And the corresponding time corresponds to the predicted value of the traffic flow of the node.
The foregoing is merely illustrative of the present invention, and the scope of the invention is not limited thereto, as all other examples obtained by those skilled in the art without making any inventive effort fall within the scope of the invention.

Claims (6)

1. A traffic prediction method for a spatio-temporal interactive dynamic graph attention network, comprising the steps of:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
2. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: in the implementation of the step 1, the abnormal value of the traffic data acquired from the sensor is processed, and the Z-Score method is adopted to avoid the abnormal value and the extreme value, so that the preprocessed application data is obtained.
3. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of step 2 are as follows,
step 2-1: defining a traffic network as a weighted directed graph g= (V, E, a), wherein V represents a set of nodes, each node representing a road segment or sensor in the traffic network, |v|=n; e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;
Figure QLYQS_1
representing the weighted adjacency matrix of fig. G, the element values in a are representing the distance between two nodes, expressed as:
Figure QLYQS_2
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation.
Step 2-2: defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequence
Figure QLYQS_3
On the basis of which future traffic sequences of the traffic network are predicted
Figure QLYQS_4
The traffic prediction problem is defined as:
Figure QLYQS_5
wherein,,
Figure QLYQS_6
representing the observation value of graph G at time step t,/for>
Figure QLYQS_7
N represents the number of nodes in the traffic network, and C represents the number of characteristic channelsThe method comprises the steps of carrying out a first treatment on the surface of the T represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
4. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of the step 3 are as follows: the processing mode of staggered downsampling is adopted to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume that
Figure QLYQS_8
For the input traffic flow data, the subsequences of X divided by parity index interval are respectively expressed as +.>
Figure QLYQS_9
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X′ odd +tanh(DGAT(Conv(X′ even )))
X even_out X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,
Figure QLYQS_10
and->
Figure QLYQS_11
Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>
Figure QLYQS_12
For real number set, C represents featureThe number of tracks, N represents the number of nodes in the traffic network, and T represents the length of a given historical time sequence; />
Figure QLYQS_13
And->
Figure QLYQS_14
Representing a traffic flow subsequence finally obtained after two interactive learning; as a result, hadamard product, tanh as the activation function, DGAT as the dynamic attention model, and Conv as the one-dimensional convolution operation.
5. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of the step 4 are as follows, and the step 4-1 is as follows: the structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to explore the spatial characteristics of a deeper urban road network, so that the performance of capturing spatial heterogeneity of the model is improved; the graph generator module will hide the features
Figure QLYQS_15
And a pre-defined initial weighted adjacency matrix +.>
Figure QLYQS_16
As input; at input +.>
Figure QLYQS_17
After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
wherein,,
Figure QLYQS_18
representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->
Figure QLYQS_19
Representing input features in a road network, +.>
Figure QLYQS_20
Representing an initial weighted adjacency matrix of each node in the road network;
step 4-2: in addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Figure QLYQS_21
Figure QLYQS_22
Figure QLYQS_23
Wherein,,
Figure QLYQS_24
and->
Figure QLYQS_25
Is a trainable parameter of the model, +.>
Figure QLYQS_26
For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
Figure QLYQS_27
Step 4-3: using an adaptive fusion architecture will A apt And A', and then fusing the obtained adjacency matrix A dyn Input into GAT for dynamic association simulation between nodes and exploration of undiscovered node connections in the road network, the fusion operation is defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
Step 4-4: the GAT module is defined as follows:
Figure QLYQS_28
wherein X is in To input data A apt The initial value is an adjacency matrix predefined by a priori map
Figure QLYQS_29
Representing a parameter matrix.
6. The traffic prediction method of a spatiotemporal interactive dynamic graph attention network of claim 1, wherein collected historical traffic flow data sets of each traffic node are divided into a training set, a validation set and a test set;
step 5: the average absolute error MAE is the average value representing the absolute error between the actual value and the predicted value, the average absolute percent error MAPE is the average value representing the absolute percent error between the actual value and the predicted value, and the root mean square error RMSE is the arithmetic square root representing the mean square error between the actual value and the predicted value, which is used as an evaluation index for evaluating the predicted performance of the module, and is calculated as follows:
Figure QLYQS_30
Figure QLYQS_31
Figure QLYQS_32
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,
Figure QLYQS_33
represents Y i A predicted value of traffic flow of the corresponding node at the corresponding moment;
the performance of the invention is proved to be in a better level after the results of the three evaluation indexes are calculated and compared with the current main stream model.
CN202310209521.7A 2023-03-07 2023-03-07 Traffic prediction method for space-time interactive dynamic graph attention network Pending CN116307152A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310209521.7A CN116307152A (en) 2023-03-07 2023-03-07 Traffic prediction method for space-time interactive dynamic graph attention network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310209521.7A CN116307152A (en) 2023-03-07 2023-03-07 Traffic prediction method for space-time interactive dynamic graph attention network

Publications (1)

Publication Number Publication Date
CN116307152A true CN116307152A (en) 2023-06-23

Family

ID=86833673

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310209521.7A Pending CN116307152A (en) 2023-03-07 2023-03-07 Traffic prediction method for space-time interactive dynamic graph attention network

Country Status (1)

Country Link
CN (1) CN116307152A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116994427A (en) * 2023-07-04 2023-11-03 重庆邮电大学 Road condition prediction method based on big data
CN117131938A (en) * 2023-10-26 2023-11-28 合肥工业大学 Dynamic implicit relation mining method and system based on graph deep learning
CN117690318A (en) * 2024-02-04 2024-03-12 北京航空航天大学 Traffic flow prediction method based on air traffic space-time dynamic diagram

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116994427A (en) * 2023-07-04 2023-11-03 重庆邮电大学 Road condition prediction method based on big data
CN117131938A (en) * 2023-10-26 2023-11-28 合肥工业大学 Dynamic implicit relation mining method and system based on graph deep learning
CN117131938B (en) * 2023-10-26 2024-01-19 合肥工业大学 Dynamic implicit relation mining method and system based on graph deep learning
CN117690318A (en) * 2024-02-04 2024-03-12 北京航空航天大学 Traffic flow prediction method based on air traffic space-time dynamic diagram
CN117690318B (en) * 2024-02-04 2024-04-12 北京航空航天大学 Traffic flow prediction method based on air traffic space-time dynamic diagram

Similar Documents

Publication Publication Date Title
CN110570651B (en) Road network traffic situation prediction method and system based on deep learning
Zhao et al. Deep temporal convolutional networks for short-term traffic flow forecasting
CN110164128B (en) City-level intelligent traffic simulation system
CN107529651B (en) Urban traffic passenger flow prediction method and equipment based on deep learning
CN113313947B (en) Road condition evaluation method of short-term traffic prediction graph convolution network
CN116307152A (en) Traffic prediction method for space-time interactive dynamic graph attention network
CN114299723B (en) Traffic flow prediction method
CN114220271A (en) Traffic flow prediction method, equipment and storage medium based on dynamic space-time graph convolution cycle network
Zhang et al. Curb-gan: Conditional urban traffic estimation through spatio-temporal generative adversarial networks
CN114802296A (en) Vehicle track prediction method based on dynamic interaction graph convolution
CN115440032B (en) Long-short-period public traffic flow prediction method
CN110570035B (en) People flow prediction system for simultaneously modeling space-time dependency and daily flow dependency
Ou et al. Learn, assign, and search: real-time estimation of dynamic origin-destination flows using machine learning algorithms
CN114495507B (en) Traffic flow prediction method integrating space-time attention neural network and traffic model
CN112598165B (en) Urban functional area transfer flow prediction method and device based on private car data
CN113159403B (en) Intersection pedestrian track prediction method and device
CN114202120A (en) Urban traffic travel time prediction method aiming at multi-source heterogeneous data
CN113837148A (en) Pedestrian trajectory prediction method based on self-adjusting sparse graph transform
Haputhanthri et al. Short-term traffic forecasting using LSTM-based deep learning models
CN115376103A (en) Pedestrian trajectory prediction method based on space-time diagram attention network
CN113159371B (en) Unknown target feature modeling and demand prediction method based on cross-modal data fusion
CN116578858A (en) Air compressor fault prediction and health degree evaluation method and system based on graphic neural network
Lin et al. Network-wide multi-step traffic volume prediction using graph convolutional gated recurrent neural network
Hu et al. Hybrid process neural network based on spatio-temporal similarities for short-term traffic flow prediction
Di et al. An automatic and integrated self-diagnosing system for the silting disease of drainage pipelines based on SSAE-TSNE and MS-LSTM

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination