CN116307152A - Traffic prediction method for space-time interactive dynamic graph attention network - Google Patents
Traffic prediction method for space-time interactive dynamic graph attention network Download PDFInfo
- Publication number
- CN116307152A CN116307152A CN202310209521.7A CN202310209521A CN116307152A CN 116307152 A CN116307152 A CN 116307152A CN 202310209521 A CN202310209521 A CN 202310209521A CN 116307152 A CN116307152 A CN 116307152A
- Authority
- CN
- China
- Prior art keywords
- traffic
- network
- representing
- graph
- dynamic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000002452 interceptive effect Effects 0.000 title claims abstract description 42
- 238000000034 method Methods 0.000 title claims abstract description 27
- 239000011159 matrix material Substances 0.000 claims description 39
- 102100036869 Diacylglycerol O-acyltransferase 1 Human genes 0.000 claims description 21
- 108050004099 Diacylglycerol O-acyltransferase 1 Proteins 0.000 claims description 21
- 230000003044 adaptive effect Effects 0.000 claims description 9
- 238000011156 evaluation Methods 0.000 claims description 9
- 230000007246 mechanism Effects 0.000 claims description 7
- 230000006870 function Effects 0.000 claims description 6
- 230000004927 fusion Effects 0.000 claims description 6
- 238000012549 training Methods 0.000 claims description 6
- 238000007781 pre-processing Methods 0.000 claims description 5
- 230000002159 abnormal effect Effects 0.000 claims description 4
- 238000012360 testing method Methods 0.000 claims description 4
- 101001095088 Homo sapiens Melanoma antigen preferentially expressed in tumors Proteins 0.000 claims description 3
- 102100037020 Melanoma antigen preferentially expressed in tumors Human genes 0.000 claims description 3
- 230000004913 activation Effects 0.000 claims description 3
- 238000012545 processing Methods 0.000 claims description 3
- 230000011218 segmentation Effects 0.000 claims description 3
- 238000004088 simulation Methods 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 3
- 238000010200 validation analysis Methods 0.000 claims 1
- 238000010586 diagram Methods 0.000 abstract description 2
- 238000013135 deep learning Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000005291 chaos (dynamical) Methods 0.000 description 1
- 230000000739 chaotic effect Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000004880 explosion Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Tourism & Hospitality (AREA)
- Economics (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Molecular Biology (AREA)
- General Business, Economics & Management (AREA)
- Software Systems (AREA)
- Marketing (AREA)
- Mathematical Physics (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Development Economics (AREA)
- Data Mining & Analysis (AREA)
- Primary Health Care (AREA)
- Quality & Reliability (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Educational Administration (AREA)
- Operations Research (AREA)
- Traffic Control Systems (AREA)
Abstract
The invention discloses a traffic prediction method of a space-time interactive dynamic graph attention network. The invention provides an interactive learning module, which is used for carrying out staggered downsampling on acquired traffic data according to time intervals, generating a new dynamic diagram through an interactive learning strategy and synchronously capturing the space-time dependence of the partitioned traffic data. The invention also provides a dynamic graph attention module which captures the spatial correlation of dynamic changes in the traffic network on the basis of the dynamic graph generation module. The dynamic graph generating module generates a dynamic graph structure based on the historical information and the input data, and the structure can deeply mine dynamic relevance in the network topology. The invention can improve the accuracy of traffic prediction, relieve traffic pressure and improve the toughness of urban traffic.
Description
Technical Field
The invention belongs to the technical field of traffic prediction and the field of deep learning, and particularly relates to a traffic prediction method of a space-time interactive dynamic graph attention network.
Background
The control and the guidance of the traffic flow are important research directions of intelligent traffic, and the real-time accurate traffic flow prediction is the basis of traffic flow induction, is an important means for vehicle evacuation in a congestion area, and is one of the core contents of intelligent traffic. Short-time traffic flow prediction relies on a big data environment, and traffic flow in a certain road section or even in the whole city range in a future period can be accurately predicted by means of artificial intelligence, cloud computing and the like, so that the travel efficiency and the safety of urban vehicles are effectively improved, urban traffic management departments can conveniently and fully balance traffic loads of road networks, full coordination of traffic resources is realized, the transportation efficiency of the road networks is maximized, the traffic jam degree is effectively reduced, and therefore, great practical application value and profound practical significance are provided for developing and deeply researching the theory and the method of traffic flow prediction of the road networks.
In recent years, with the continuous development of socioeconomic level, large cities are developing toward smart cities, and Intelligent Transportation Systems (ITS) have received considerable attention as an important component in smart cities. The traffic prediction is taken as an important component of an intelligent traffic system, and can provide support in aspects of traffic jam control, reasonable path planning and the like. The existing methods for solving the urban traffic prediction problem mainly comprise three types: a statistical-based predictive model, a nonlinear theoretical model, and an intelligent theory-based predictive model. For a statistical-based prediction model, a historical averaging method is the earliest statistical-based prediction model, the method assumes that the traffic flow at a certain point is operated according to a certain rule, the current traffic flow measured at the point and the historical data are added and averaged, and the obtained value is taken as the traffic flow value at the next moment of the point. These methods have great limitations on modeling of nonlinear data, and perform poorly in the practical practice of current data rapid growth. The nonlinear theory model is a model based on the nonlinear theory such as chaos theory, synergetic theory, system dynamics and the like, and is expressed in the traffic flow prediction field as analyzing the nonlinear characteristics of traffic flow by utilizing various nonlinear theory, so that the nonlinear model is established. There are typically models based on mutation theory, chaotic models, wavelet analysis, etc. Prediction model based on intelligent theory: the traffic flow prediction model based on the intelligent theory is based on a modern big data analysis technology, has no fixed model structure and no fixed relation between input and output, extracts valuable information through training of different modes of input information, and predicts traffic flow by using the extracted information. Currently, researchers have used KNNs, automatic encoders, etc. to identify traffic patterns, GNNs, GCNs, etc. to model traffic road network structures and extract spatial features, LSTM, etc. to model temporal correlations of traffic flows, etc.
While current deep learning exhibits superior performance in terms of the ability of modeling traffic flow data and extracting complex data features, there are still some drawbacks: the partial model only considers the time correlation, but ignores the space correlation; the dynamic change of the traffic condition is ignored by part of the model, and only the static adjacency matrix is established, so that the prediction capability of the model is reduced; the partial model has the problem of gradient disappearance or gradient explosion, so that data training is difficult to perform.
Aiming at the challenges, the invention provides a traffic prediction method of a space-time interactive dynamic graph attention network to realize efficient and accurate prediction of urban traffic flow data.
Disclosure of Invention
Aiming at the defects of the current model, the invention creatively provides a new traffic prediction method of a space-time interactive dynamic graph attention network, which can synchronously capture time correlation and space correlation, and can learn space-time dependence through an interactive learning structure and a dynamic graph convolution network, thereby realizing accurate and effective traffic flow prediction.
In order to achieve the above purpose, the invention adopts the following technical scheme:
a traffic prediction method for a spatio-temporal interactive dynamic graph attention network, the method comprising the steps of:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
Preferably, the preprocessing operation of the historical traffic flow data in step 1 includes the following steps:
step 1: the invention adopts a Z-Score method to avoid the influence of abnormal values and extreme values, thereby obtaining preprocessed application data.
Preferably, step 2 includes the following steps for traffic network information and traffic prediction problem definition:
step 2-1: defining a traffic network as a weighted directed graph g= (V, E, a), wherein V represents a set of nodes, |v|=n, which is a set of road segments or sensors in the traffic network; e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;the weighted adjacency matrix of the graph G is represented, and the element values in the a are represented by the distance between two nodes, which can be specifically represented by:
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation;
step (a)2-2: defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequenceOn the basis of which future traffic sequences of the traffic network are predictedThe traffic prediction problem is defined as:
wherein,,representing the observation value of graph G at time step t,/for>N represents the number of nodes in the traffic network, and C represents the number of characteristic channels; t represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
Preferably, the step 3 of constructing the interactive learning module includes the following:
step 3: for most time series data, the interleaved downsampled subsequence still retains most of the information of the original sequence due to trending and proximity; the invention adopts a processing mode of staggered downsampling to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume thatFor the input traffic flow data, the subsequences of the divided parity index sections of X are respectively expressed as
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X′ odd +tanh(DGAT(Conv(X′ even )))
X even_out =X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,and->Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>For a real number set, C represents the number of characteristic channels, N represents the number of nodes in a traffic network, and T represents the length of a given historical time sequence; />And->Representing a traffic flow subsequence finally obtained after two interactive learning; as indicated by the letter Hadamard product, tan h is the activation function, DGAT is the dynamic attention model, conv is the one-dimensional convolution;
preferably, step 4 of constructing the dynamic graph attention model includes the following:
step 4-1: the structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to better explore the urban road with deeper levelThe spatial characteristics of the road network improve the performance of capturing spatial heterogeneity by the model; wherein the graph generator module is to hide featuresAnd a predefined initial adjacency matrix-> As input; at input +.>After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
in the process, ,representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->Representing input features in a road network, +.>Representing an initial weighted adjacency matrix of each node in the road network;
step 4-2: in addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Wherein,,and->Is a trainable parameter of the model, +.>For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
Step 4-3: using an adaptive fusion architecture will A apt And A', and fusing the adjacent matrix A obtained after fusing dyn Input into GAT for dynamic association simulation between nodes and exploring undiscovered node connections in the road network, the fusion operation may be defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
Step 4-4: the GAT module is defined as follows:
wherein X is in To input data A apt The initial value is an adjacency matrix predefined by a priori mapRepresenting a parameter matrix.
Preferably, the step 5 of evaluating the traffic prediction model of the time-space interactive dynamic graph attention network includes the following steps:
step 5: the invention adopts average absolute error MAE (representing the average value of absolute error between actual value and predicted value), average absolute percent error MAPE (representing the average value of absolute percent error between actual value and predicted value) and root mean square error RMSE (representing the arithmetic square root of mean square error between actual value and predicted value) as the evaluation index for evaluating the prediction performance of the module, and the evaluation index is calculated as follows:
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,represents Y i And the corresponding time corresponds to the predicted value of the traffic flow of the node.
Compared with the prior art, the invention has the following beneficial effects and advantages:
most of the current research is limited to mining time and space characteristics respectively, so that the model cannot learn the characteristics of traffic flow data comprehensively. The invention provides a new traffic prediction model of a space-time interactive dynamic graph attention network, which embeds a graph attention mechanism into an interactive learning structure and synchronously captures time and space correlations. The time-space dependency can be learned through the interactive learning structure and the dynamic graph attention network, so that the characteristics of the nodes are comprehensively learned, and the prediction accuracy is further improved.
Drawings
FIG. 1 is a general framework of a traffic prediction model of a spatio-temporal interactive dynamic graph attention network provided by the invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention.
The method obtains a large amount of traffic flow data from sensors of the urban road network, performs data cleaning on the traffic flow data, and obtains the attributes such as longitude and latitude, flow value, predicted starting time, predicted ending time and the like after finishing. The method is widely applicable to various time sequence prediction fields based on the dynamic graph attention neural network, and effectively processes complex time sequence data. FIG. 1 is a general framework diagram of the model of the present invention, which effectively improves the overall predictive performance of the model. The specific implementation is as follows:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
Specifically, the preprocessing operation for the historical traffic flow data includes the following steps: the invention adopts a Z-Score method to avoid the influence of abnormal values and extreme values, thereby obtaining preprocessed application data.
Specifically, the definition of traffic network information and traffic prediction problems includes the following:
(1) Defining a traffic network as a weighted directed graph g= (V, E, a), where V represents a set of nodes, |v|=n, which is a road segment or sensor in the traffic networkIs a collection of (3); e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;the weighted adjacency matrix of the graph G is represented, and the element values in the a are represented by the distance between two nodes, which can be specifically represented by:
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation;
defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequenceOn the basis of which future traffic sequences of the traffic network are predictedThe traffic prediction problem is defined as:
wherein,,representing the observation value of graph G at time step t,/for>N represents the number of nodes in the traffic network, and C represents the number of characteristic channels; t represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
Specifically, constructing the interactive learning module includes the following: for most time series data, the interleaved downsampled subsequence still retains most of the information of the original sequence due to trending and proximity; the invention adopts a processing mode of staggered downsampling to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume that For the input traffic flow data, the subsequences of the divided parity index sections of X are respectively expressed as
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X odd +tanh(DGAT(Conv(X′ even )))
X even_out =X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,and->Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>For a real number set, C represents the number of characteristic channels, N represents the number of nodes in a traffic network, and T represents the length of a given historical time sequence; />And->Representing a traffic flow subsequence finally obtained after two interactive learning; as a result, hadamard product, tanh as the activation function, DGAT as the dynamic attention model, and Conv as the one-dimensional convolution operation.
Specifically, constructing the dynamic graph attention model includes the following:
(1) The structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to better explore the spatial characteristics of a deeper urban road network, and improves the performance of capturing spatial heterogeneity of the model; wherein the graph generator module is to hide featuresAnd a predefined initial adjacency matrix As input; at input +.>Thereafter (I)>After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
wherein,,representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->Representing input features in a road network, +.>Representing an initial weighted adjacency matrix of each node in the road network;
(2) In addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Wherein,,and->Is a trainable parameter of the model, +.>For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
(3) Using an adaptive fusion architecture will A apt And A', fusing, and obtaining the adjacentConnection matrix A dyn Input into GAT for dynamic association simulation between nodes and exploring undiscovered node connections in the road network, the fusion operation may be defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
(4) The GAT module is defined as follows:
wherein X is in To input data A apt The initial value is an adjacency matrix predefined by a priori mapRepresenting a parameter matrix.
Specifically, evaluating a traffic prediction model of a time-space interactive dynamic graph attention network includes: the invention adopts average absolute error MAE (representing the average value of absolute error between actual value and predicted value), average absolute percent error MAPE (representing the average value of absolute percent error between actual value and predicted value) and root mean square error RMSE (representing the arithmetic square root of mean square error between actual value and predicted value) as the evaluation index for evaluating the prediction performance of the module, and the evaluation index is calculated as follows:
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,represents Y i And the corresponding time corresponds to the predicted value of the traffic flow of the node.
The foregoing is merely illustrative of the present invention, and the scope of the invention is not limited thereto, as all other examples obtained by those skilled in the art without making any inventive effort fall within the scope of the invention.
Claims (6)
1. A traffic prediction method for a spatio-temporal interactive dynamic graph attention network, comprising the steps of:
step 1: acquiring historical traffic flow data of each traffic node and preprocessing to obtain application data;
step 2: traffic network information and traffic prediction problem definition;
step 3: constructing an interactive learning module;
step 4: constructing a dynamic graph attention model;
step 5: the collected traffic flow data set is divided into a training set, a verification set and a test set, and the average absolute error, the average absolute percentage error and the root mean square error are used as evaluation indexes to evaluate the traffic prediction model of the space-time interactive dynamic graph attention network.
2. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: in the implementation of the step 1, the abnormal value of the traffic data acquired from the sensor is processed, and the Z-Score method is adopted to avoid the abnormal value and the extreme value, so that the preprocessed application data is obtained.
3. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of step 2 are as follows,
step 2-1: defining a traffic network as a weighted directed graph g= (V, E, a), wherein V represents a set of nodes, each node representing a road segment or sensor in the traffic network, |v|=n; e represents a set of edges, edges in the graph G represent connection relations among nodes, and weight values of the edges represent distances among the nodes;representing the weighted adjacency matrix of fig. G, the element values in a are representing the distance between two nodes, expressed as:
wherein d ij E is the distance between two nodes in the road network 1 Super-parameters for controlling the sparsity of the adjacent matrix A; sigma is the standard deviation.
Step 2-2: defining a traffic prediction problem: the traffic flow prediction problem is aimed at passing through the history observation sequenceOn the basis of which future traffic sequences of the traffic network are predictedThe traffic prediction problem is defined as:
wherein,,representing the observation value of graph G at time step t,/for>N represents the number of nodes in the traffic network, and C represents the number of characteristic channelsThe method comprises the steps of carrying out a first treatment on the surface of the T represents the length of a given historical time series, T' represents the length of the time series to be predicted; f represents a learning function that can predict future flow sequences from historical observation sequences.
4. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of the step 3 are as follows: the processing mode of staggered downsampling is adopted to enable the two sub-sequences after segmentation to share the parameter weight in the dynamic graph attention module; assume thatFor the input traffic flow data, the subsequences of X divided by parity index interval are respectively expressed as +.>
The outputs after the first interactive learning are respectively:
X′ odd =tanh(DGAT(Conv(X even )))⊙X odd
X′ even =tanh(DGAT(Conv(X odd )))⊙X even
the final outputs after the second interactive learning are respectively:
X odd_out =X′ odd +tanh(DGAT(Conv(X′ even )))
X even_out X′ even +tanh(DGAT(Conv(X′ odd )))
wherein,,and->Respectively, traffic flow subsequences obtained after the first interactive learning, and +.>For real number set, C represents featureThe number of tracks, N represents the number of nodes in the traffic network, and T represents the length of a given historical time sequence; />And->Representing a traffic flow subsequence finally obtained after two interactive learning; as a result, hadamard product, tanh as the activation function, DGAT as the dynamic attention model, and Conv as the one-dimensional convolution operation.
5. The traffic prediction method of a spatio-temporal interactive dynamic graph attention network of claim 1, characterized by: the implementation steps of the step 4 are as follows, and the step 4-1 is as follows: the structure of the DGAT module consists of two main modules: a graph generator module and a graph annotation force module; the DGAT module utilizes the generated graph structure to explore the spatial characteristics of a deeper urban road network, so that the performance of capturing spatial heterogeneity of the model is improved; the graph generator module will hide the featuresAnd a pre-defined initial weighted adjacency matrix +.>As input; at input +.>After the treatment of the graph attention mechanism, the graph attention mechanism is transmitted to the MLP to generate a matrix A':
A′=SoftMax(MLP(GAT(X,A)))
wherein,,representing an adjacency matrix with spatiotemporal features, MLP representing a multi-layer perceptron, GAT representing a graph-annotating force operation,/->Representing input features in a road network, +.>Representing an initial weighted adjacency matrix of each node in the road network;
step 4-2: in addition to the adjacency matrix A' generated by the graph generator, the present invention also defines an adaptive adjacency matrix
Wherein,,and->Is a trainable parameter of the model, +.>For real number set, C represents characteristic channel number, N represents node number in traffic network, A apt The initial value is the adjacency matrix predefined by the prior map +.>
Step 4-3: using an adaptive fusion architecture will A apt And A', and then fusing the obtained adjacency matrix A dyn Input into GAT for dynamic association simulation between nodes and exploration of undiscovered node connections in the road network, the fusion operation is defined as follows:
A dy =αA apt +(1-α)A′
wherein A is dy And alpha is a learnable adaptive parameter factor for the fused dynamic adjacency matrix.
Step 4-4: the GAT module is defined as follows:
6. The traffic prediction method of a spatiotemporal interactive dynamic graph attention network of claim 1, wherein collected historical traffic flow data sets of each traffic node are divided into a training set, a validation set and a test set;
step 5: the average absolute error MAE is the average value representing the absolute error between the actual value and the predicted value, the average absolute percent error MAPE is the average value representing the absolute percent error between the actual value and the predicted value, and the root mean square error RMSE is the arithmetic square root representing the mean square error between the actual value and the predicted value, which is used as an evaluation index for evaluating the predicted performance of the module, and is calculated as follows:
wherein N represents the number of samples, Y i A true value representing the traffic of a certain node at a certain moment,represents Y i A predicted value of traffic flow of the corresponding node at the corresponding moment;
the performance of the invention is proved to be in a better level after the results of the three evaluation indexes are calculated and compared with the current main stream model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310209521.7A CN116307152A (en) | 2023-03-07 | 2023-03-07 | Traffic prediction method for space-time interactive dynamic graph attention network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310209521.7A CN116307152A (en) | 2023-03-07 | 2023-03-07 | Traffic prediction method for space-time interactive dynamic graph attention network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116307152A true CN116307152A (en) | 2023-06-23 |
Family
ID=86833673
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310209521.7A Pending CN116307152A (en) | 2023-03-07 | 2023-03-07 | Traffic prediction method for space-time interactive dynamic graph attention network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116307152A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116994427A (en) * | 2023-07-04 | 2023-11-03 | 重庆邮电大学 | Road condition prediction method based on big data |
CN117131938A (en) * | 2023-10-26 | 2023-11-28 | 合肥工业大学 | Dynamic implicit relation mining method and system based on graph deep learning |
CN117690318A (en) * | 2024-02-04 | 2024-03-12 | 北京航空航天大学 | Traffic flow prediction method based on air traffic space-time dynamic diagram |
-
2023
- 2023-03-07 CN CN202310209521.7A patent/CN116307152A/en active Pending
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116994427A (en) * | 2023-07-04 | 2023-11-03 | 重庆邮电大学 | Road condition prediction method based on big data |
CN117131938A (en) * | 2023-10-26 | 2023-11-28 | 合肥工业大学 | Dynamic implicit relation mining method and system based on graph deep learning |
CN117131938B (en) * | 2023-10-26 | 2024-01-19 | 合肥工业大学 | Dynamic implicit relation mining method and system based on graph deep learning |
CN117690318A (en) * | 2024-02-04 | 2024-03-12 | 北京航空航天大学 | Traffic flow prediction method based on air traffic space-time dynamic diagram |
CN117690318B (en) * | 2024-02-04 | 2024-04-12 | 北京航空航天大学 | Traffic flow prediction method based on air traffic space-time dynamic diagram |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110570651B (en) | Road network traffic situation prediction method and system based on deep learning | |
Zhao et al. | Deep temporal convolutional networks for short-term traffic flow forecasting | |
CN110164128B (en) | City-level intelligent traffic simulation system | |
CN107529651B (en) | Urban traffic passenger flow prediction method and equipment based on deep learning | |
CN113313947B (en) | Road condition evaluation method of short-term traffic prediction graph convolution network | |
CN116307152A (en) | Traffic prediction method for space-time interactive dynamic graph attention network | |
CN114299723B (en) | Traffic flow prediction method | |
CN114220271A (en) | Traffic flow prediction method, equipment and storage medium based on dynamic space-time graph convolution cycle network | |
Zhang et al. | Curb-gan: Conditional urban traffic estimation through spatio-temporal generative adversarial networks | |
CN114802296A (en) | Vehicle track prediction method based on dynamic interaction graph convolution | |
CN115440032B (en) | Long-short-period public traffic flow prediction method | |
CN110570035B (en) | People flow prediction system for simultaneously modeling space-time dependency and daily flow dependency | |
Ou et al. | Learn, assign, and search: real-time estimation of dynamic origin-destination flows using machine learning algorithms | |
CN114495507B (en) | Traffic flow prediction method integrating space-time attention neural network and traffic model | |
CN112598165B (en) | Urban functional area transfer flow prediction method and device based on private car data | |
CN113159403B (en) | Intersection pedestrian track prediction method and device | |
CN114202120A (en) | Urban traffic travel time prediction method aiming at multi-source heterogeneous data | |
CN113837148A (en) | Pedestrian trajectory prediction method based on self-adjusting sparse graph transform | |
Haputhanthri et al. | Short-term traffic forecasting using LSTM-based deep learning models | |
CN115376103A (en) | Pedestrian trajectory prediction method based on space-time diagram attention network | |
CN113159371B (en) | Unknown target feature modeling and demand prediction method based on cross-modal data fusion | |
CN116578858A (en) | Air compressor fault prediction and health degree evaluation method and system based on graphic neural network | |
Lin et al. | Network-wide multi-step traffic volume prediction using graph convolutional gated recurrent neural network | |
Hu et al. | Hybrid process neural network based on spatio-temporal similarities for short-term traffic flow prediction | |
Di et al. | An automatic and integrated self-diagnosing system for the silting disease of drainage pipelines based on SSAE-TSNE and MS-LSTM |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |