CN110991713A - Irregular area flow prediction method based on multi-graph convolution sum GRU - Google Patents
Irregular area flow prediction method based on multi-graph convolution sum GRU Download PDFInfo
- Publication number
- CN110991713A CN110991713A CN201911148344.6A CN201911148344A CN110991713A CN 110991713 A CN110991713 A CN 110991713A CN 201911148344 A CN201911148344 A CN 201911148344A CN 110991713 A CN110991713 A CN 110991713A
- Authority
- CN
- China
- Prior art keywords
- regions
- time
- area
- flow
- irregular
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000001788 irregular Effects 0.000 title claims abstract description 46
- 238000000034 method Methods 0.000 title claims abstract description 24
- 239000011159 matrix material Substances 0.000 claims abstract description 55
- 238000010586 diagram Methods 0.000 claims abstract description 21
- 238000013528 artificial neural network Methods 0.000 claims abstract description 16
- 230000004927 fusion Effects 0.000 claims abstract description 14
- 238000012549 training Methods 0.000 claims abstract description 9
- 230000006870 function Effects 0.000 claims description 15
- 230000003993 interaction Effects 0.000 claims description 15
- 239000013598 vector Substances 0.000 claims description 10
- 238000004364 calculation method Methods 0.000 claims description 6
- 230000015654 memory Effects 0.000 claims description 6
- 238000013507 mapping Methods 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 claims description 3
- 230000004913 activation Effects 0.000 claims description 3
- 230000002457 bidirectional effect Effects 0.000 claims description 3
- 238000010606 normalization Methods 0.000 claims description 3
- 238000005457 optimization Methods 0.000 claims description 3
- 230000002441 reversible effect Effects 0.000 claims description 3
- 238000000547 structure data Methods 0.000 claims description 3
- 238000012360 testing method Methods 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 3
- 230000004931 aggregating effect Effects 0.000 claims description 2
- 238000003062 neural network model Methods 0.000 claims description 2
- 238000000611 regression analysis Methods 0.000 claims 2
- 238000013527 convolutional neural network Methods 0.000 description 4
- 238000013136 deep learning model Methods 0.000 description 2
- 230000006403 short-term memory Effects 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000036962 time dependent Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
- G06F18/253—Fusion techniques of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Tourism & Hospitality (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Mathematical Physics (AREA)
- Development Economics (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Business, Economics & Management (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Marketing (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Game Theory and Decision Science (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Entrepreneurship & Innovation (AREA)
- Computing Systems (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Biophysics (AREA)
- Educational Administration (AREA)
- Primary Health Care (AREA)
- Biomedical Technology (AREA)
- Fuzzy Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Remote Sensing (AREA)
Abstract
The invention discloses an irregular area flow prediction method based on multi-graph convolution and GRU, which comprises the following steps: dividing a region into N unconnected irregular regions; step two, performing space-time simplification on the historical track data, and calculating to obtain the inflow and outflow of all the regions at each time step; establishing a plurality of correlation diagrams among the regions, constructing a corresponding adjacent matrix, and expressing diversified spatial correlation among irregular regions; designing a multi-graph convolution neural network based on the correlation diagram among the regions, fusing diversified spatial correlation characteristics among the regions, and obtaining a multi-graph convolution fusion result; step five, based on the multi-graph convolution fusion result, capturing time correlation by adopting a GRU neural network; and sixthly, selecting a proper loss function, training to obtain a prediction model, and predicting through the prediction model to obtain the inflow and outflow of each region.
Description
Technical Field
The invention relates to the field of traffic flow prediction, in particular to an irregular area flow prediction method based on multi-graph volume sum GRU.
Background
Traffic flow prediction is an important component of intelligent transportation systems. The purpose of regional flow prediction is to predict future flow values in urban areas based on given historical data, and accurate prediction can help traffic managers to control and manage flow in advance.
Regional flow prediction methods typically utilize spatial and temporal correlations between regions. Conventional regional flow prediction uses time series prediction methods such as autoregressive moving average model (ARIMA), time-varying poisson model, vector autoregressive model. The prediction accuracy is low only by considering the time-dependent correlation. With the rise of deep learning, researchers use deep learning models to predict flow. Compared with the traditional method, the model of the long Short-Term memory network LSTM (Long Short Term memory) and the gated cyclic unit GRU (gated Recurrent Unit) has better effect on the Short-Term time sequence prediction. However, they still focus on temporal associations only. To better capture the spatio-temporal correlation, researchers have proposed predicting regional traffic using Convolutional Neural Network (CNN) and residual neural network based methods, first dividing cities into grids and then predicting traffic at the grid level. However, these methods can only predict the traffic in regular areas.
Cities can be divided into meaningful areas based on road network information or administrative boundaries, which are often irregular and have complex topologies that carry more semantic information than regular grids. The grid-based prediction model cannot predict the traffic demand of the irregular area, and the usability of the prediction result is reduced.
Disclosure of Invention
In order to overcome the defects of the prior art and improve the accuracy of flow prediction in irregular areas, the invention adopts the following technical scheme:
the irregular area flow prediction method based on the multi-graph convolution sum GRU comprises the following steps:
dividing a region into N unconnected irregular regions;
step two, performing space-time simplification on the historical track data, and calculating to obtain the inflow and outflow of all the regions at each time step;
establishing a plurality of correlation diagrams among the regions, constructing a corresponding adjacent matrix, and expressing diversified spatial correlation among irregular regions;
designing a Multi-graph convolution neural Network (MGCN) based on the correlation diagram among the regions, and fusing diversified spatial correlation characteristics among the regions to obtain a Multi-graph convolution fusion result;
step five, based on the multi-graph convolution fusion result, capturing time correlation by adopting a GRU neural network;
selecting a proper loss function, training to obtain a prediction model, and predicting through the prediction model to obtain the inflow and the outflow of each region;
and step one, dividing the region into N unconnected irregular regions by adopting an irregular region division method based on the road network structure data of the region.
The second step is that the whole time period is divided into a plurality of time steps according to unit time, and then the original historical track data is mapped into the area according to the unit time based on the area division result, so as to obtain a simplified track:
TRSimp=(startRegion,startDate,startHour,endRegion,endDate,endHour)
wherein startRegion is the ID of the departure area, startDate is the departure date, startHour is the departure unit time, endRegion is the ID of the arrival area, endDate is the arrival date, endHour is the arrival unit time, and then the entering amount of each area is obtained by aggregation simplified track calculationAnd outflow volumeWherein saidRepresents the amount of entry of zone i at the t-th time step, saidRepresents the outflow of zone i at the t-th time step, based finally on theAnd saidCalculating to obtain the entering amount of all the areas under the time step tAnd outflow volume
Establishing different association graphs to represent diversified spatial associations among irregular areas, wherein the spatial associations comprise a distance graph, a flow interaction graph and a flow association graph, the different association graphs are represented by G (V, E), and a node V is represented by G (V, E)iE.g. V represents an irregular area, an edge (V)i,vj) E encodes the degree of association between irregular regions, represented by the adjacency matrix A E RN×NRepresents;
the weight of an edge of the distance map is the distance between two regions, and the adjacent matrix element A in the distance mapdThe values of (i, j) are calculated as follows:
the distance between said dist (i,j) representing the distance between the center points of the regions i and j, and the adjacency matrix AdNormalized to [0, 1 ]]And based on a predefined distance threshold thresdThe A is addeddConvert to an 0/1 matrix if Ad(i,j)≤thresdIndicating that the distance between the regions i and j is very close, let A bed(i, j) ═ 1, otherwise the said Ad(i,j)=0;
The flow interaction diagram indicates whether there is frequent bidirectional flow between the two areas by aiming at the TRSimpAggregating the data to obtain the flow value fn (i, j) from the area i to the area j and the flow value fn (j, i) from the area j to the area i in the whole analysis time period, wherein the flow interaction diagram is adjacent to the matrix element AinterThe values of (i, j) are calculated as follows:
the adjacency matrix AinterNormalized to [0, 1 ]]And based on a predefined traffic interaction threshold thresinterThe A is addedinterConvert to an 0/1 matrix if Ainter(i,j)≥thresinterIndicating that the interaction between the regions i and j is very strong, let A beinter(i, j) ═ 1, otherwise the said Ainter(i,j)=0;
The flow correlation diagram indicates the time correlation of flow among the areas, the historical flow value of each area in each time step is obtained, the time sequence of the inlet flow and the outlet flow of each area in the time period is set according to the time period needing to be analyzed, and for the area i, the time sequence is expressed as:the T represents the length of a time slot, correlation between the region i and the region j is calculated by adopting a Pearson correlation coefficient, and the adjacent matrix element A in the flow correlation diagramcorrThe values of (i, j) are calculated asThe following:
h isiAnd h is saidjSaid time sequence representing said regions i and j will listen to said adjacency matrix AcorrNormalized to [0, 1 ]]And based on a predefined flow-related threshold threscorrWill hear the statement AcorrConvert to an 0/1 matrix if Acorr(i,j)≥threscorrIndicating that the regions i and j have similar time usage patterns, let Acorr(i, j) is 1, otherwise said Acorr(i,j)=0。
Based on a plurality of association graphs among the regions, the multi-graph convolution neural network is proposed to fully mine useful information hidden in different graphs and capture complex spatial dependence association, and the method comprises the following steps:
(1) using Graph Convolutional neural Network (GCN) model f (X)t′A) processing each graph at time step t':
said Xt′=(It′,Ot′) Is the input of the model at the t' time step, representing the ingress and egress of the region,is an adjacency matrix with self-flow, said INIs an identity matrix, theIs a diagonal matrix in whichThe W isd、Winter、WcorrIs a trainable weight matrix, said tanh (-) represents an activation function;
(2) adding additional attribute attr to each time stept′Coding the flow influence factors at each time step;
(3) combining a plurality of the association graphs and the additional attributes by adopting a Full Connected Layer (FCL) to fuse various spatial association characteristics among the areas;
FCL(Xt′,Ad,Ainter,Acorr,attrt′)=Wfcl[fd(Xt′,Ad),finter(Xt′,Ainter),fcorr(Xt′,Acorr),attrt′]
the W isfclFor the weight matrix, the FCL (-) is the multi-graph convolution fusion result.
The additional attribute comprises a date attributeAn hour attributeWeather PropertiesAnd temperature propertyThe above-mentionedThe dimension is 7 dimensions, and represents the week number of one week; the above-mentionedThe dimension is 24 dimensions, which represents the first hour of the day; the above-mentionedIs divided into 8 categories including sunny, cloudy, light rain, medium rain, heavy rain, light snow, medium snow and heavy snow; the above-mentionedIs divided into 8 levels from 10 DEG F to 90 DEG F, each 10 DEG F corresponds to one level, all the additional attributes use one-hot coding, and vectors are obtained by connection
Based on the multi-graph convolution fusion result FCL (·), capturing time sequence correlation in history by adopting a GRU neural network, wherein the GRU takes a hidden state at a time step t '-1 and the multi-graph convolution result as input to obtain a flow state at a time step t', and a calculation process is as follows:
ut′=σ(Wu[FCL(·),ht′-1]+bu)
rt′=σ(Wr[FCL(·),ht′-1]+br)
ct'=tanh(Wc[FCL(·),(rt'⊙ht'-1)]+bc)
ht′=ut′ht′-1+(1-ut′)ct′
h ist'-1Represents the hidden state at said time step t' -1, said ut'To update the gate, the amount of which memory of the last moment is saved to the current time step is defined, rt'To reset the gate, it is decided how to combine the new input information with the previous memory, ct'For the stored content at the t' time step, the ht'Is the output state at the t 'time step, the FCL (-) is at the t' time stepThe multi-graph convolution result, the Wu、Wr、WcIs a weight matrix, said bu、br、bcIs a deviation vector, the sigma is a sigmoid function, and the ⊙ is an element-by-element multiplication operation.
Step six, using Smooth L1 as the loss function, training the weight matrix and the deviation vector by adopting a back propagation and Adam optimization algorithm based on a training set to obtain the prediction model, wherein the target of the prediction model is based on input historical data [ (I)0,O0),(I1,O1),...,(It-1,Ot-1)]Learning a function f (-) and mapping the inlet amount and the outlet amount of each area in the historical data to obtain the inlet amount and the outlet amount of the next time stepSo thatSaid (I)t,Ot) And selecting a model with the minimum root mean square error as a final prediction model for the real flow value of the next time step t according to the verification set, predicting by adopting the final prediction model based on the test set, and carrying out reverse normalization on an output result to obtain a final prediction result.
The invention has the advantages and beneficial effects that:
the invention provides a novel deep learning model facing irregular area traffic prediction, which is suitable for predicting the inflow and outflow of irregular areas in cities, uses a plurality of association graphs to encode diversified spatial associations among the irregular areas, designs a multi-graph convolution neural network to fuse the associations among the irregular areas to capture spatial dependence, and then adopts a GRU neural network to capture dynamic time sequence association, thereby effectively capturing complex time-space association at the same time, improving the practicability of prediction results, and simultaneously improving the prediction accuracy of irregular area traffic flow, so that the prediction results better assist the management and control of pedestrian and vehicle flow in urban areas.
Drawings
FIG. 1 is a flow chart of irregular area traffic prediction in accordance with the present invention.
FIG. 2 is a diagram of an irregular area traffic prediction model based on multi-graph convolution and GRU in the present invention.
Detailed Description
The invention is described in detail below with reference to the figures and the embodiments.
As shown in fig. 1, the method for predicting irregular area traffic based on multi-graph convolution and GRU of the present invention includes the following steps:
dividing a city into N unconnected irregular areas;
step two, performing space-time simplification on the historical track data, and calculating to obtain the inflow and outflow of all the regions at each time step;
establishing a plurality of correlation diagrams among the regions, constructing a corresponding adjacent matrix, and expressing diversified spatial correlation among irregular regions;
designing a multi-graph convolution neural network based on the correlation diagram among the regions, fusing diversified spatial correlation characteristics among the regions, and obtaining a multi-graph convolution fusion result;
step five, based on the multi-graph convolution fusion result, capturing time correlation by adopting a GRU neural network;
selecting a proper loss function, training to obtain a prediction model, and predicting through the prediction model to obtain the inflow and the outflow of each region;
and step one, dividing the city into N unconnected irregular areas by adopting an irregular area dividing method based on the road network structure data of the city.
Step two, firstly, dividing the whole time period into a plurality of time steps according to hours, then mapping the original historical track data into the region according to hours based on the region division result to obtain a simplified track:
TRSimp=(startRegion,startDate,startHour,endRegion,endDate,endHour)
wherein startRegion is the ID of the departure area, startDate is the departure date, startHour is the departure hour, endRegion is the ID of the arrival area, endDate is the arrival date, endHour is the arrival hour, and then the entering amount of each area is obtained by aggregating simplified track calculationAnd outflow volumeWherein saidRepresents the amount of entry of zone i at the t-th time step, saidRepresents the outflow of zone i at the t-th time step, based finally on theAnd saidCalculating to obtain the entering amount of all the areas under the time step tAnd outflow volume
Establishing different association graphs to represent diversified spatial associations among irregular areas, wherein the spatial associations comprise a distance graph, a flow interaction graph and a flow association graph, the different association graphs are represented by G (V, E), and a node V is represented by G (V, E)iE.g. V represents an irregular area, an edge (V)i,vj) E encodes the degree of association between irregular regions, represented by the adjacency matrix A E RN×NRepresents;
the distance map is used to map the distance between the two objects,the weight of an edge is the distance between two regions, such that adjacent regions are connected by a higher-weight edge, said distance map being bounded by the elements A of the adjacency matrixdThe values of (i, j) are calculated as follows:
the dist (i, j) represents the distance between the center points of the regions i and j, and the adjacency matrix AdNormalized to [0, 1 ]]And based on a predefined distance threshold thresdThe A is addeddConvert to an 0/1 matrix if Ad(i,j)≤thresdIndicating that the distance between the regions i and j is very close, let A bed(i, j) ═ 1, otherwise the said Ad(i,j)=0;
The flow interaction diagram indicates whether there is frequent bidirectional flow between the two areas by aiming at the TRSimpAggregating the data to obtain the flow value fn (i, j) from the area i to the area j and the flow value fn (j, i) from the area j to the area i in the whole analysis time period, wherein the flow interaction diagram is adjacent to the matrix element AinterThe values of (i, j) are calculated as follows:
the adjacency matrix AinterNormalized to [0, 1 ]]And based on a predefined traffic interaction threshold thresinterThe A is addedinterConvert to an 0/1 matrix if Ainter(i,j)≥thresinterIndicating that the interaction between the regions i and j is very strong, let A beinter(i, j) ═ 1, otherwise the said Ainter(i,j)=0;
The flow correlation diagram indicates the time correlation of the flow among the zones, the historical flow value of each zone at each time step is obtained, and assuming that the analysis time period is 1 year, each zone has a time sequence with the length of 17520(365 by 24 by 2)The hourly ingress and egress for the year is recorded and for the region i, the time series is expressed as:the T represents the time period length, the correlation between the area i and the area j is calculated by adopting a Pearson correlation coefficient, and the adjacent matrix element A in the flow correlation diagramcorrThe values of (i, j) are calculated as follows:
h isiAnd h is saidjRepresenting said time series of said regions i and j, said adjacency matrix AcorrNormalized to [0, 1 ]]And based on a predefined flow-related threshold threscorrWill hear the statement AcorrConvert to an 0/1 matrix if Acorr(i,j)≥threscorrIndicating that the regions i and j have similar time usage patterns, let Acorr(i, j) is 1, otherwise said Acorr(i,j)=0。
As shown in fig. 2, the fourth step is to propose the multi-map convolutional neural network to fully mine useful information hidden in different maps and capture complex spatially dependent relationships based on a plurality of relationship maps among regions, and includes the following steps:
(1) using a graph convolution neural network model f (X)t′A) processing each graph at time step t':
said Xt′=(It′,Ot′) Is the input of the model at the t' time step, representing the ingress and egress of the region,is an adjacency matrix with self-flow, said INIs an identity matrix, theIs a diagonal matrix in whichThe W isd、Winter、WcorrIs a trainable weight matrix, said tanh (-) represents an activation function;
(2) adding additional attribute attr to each time stept′Coding the flow influence factors at each time step;
(3) combining a plurality of association graphs and the additional attributes by adopting a full connection layer to fuse various spatial association characteristics among the regions;
FCL(Xt′,Ad,Ainter,Acorr,attrt′)=Wfcl[fd(Xt′,Ad),finter(Xt′,Ainter),fcorr(Xt′,Acorr),attrt′]
the W isfclFor the weight matrix, the FCL (-) is the multi-graph convolution fusion result.
The additional attribute comprises a date attributeAn hour attributeWeather PropertiesAnd temperaturePropertiesThe above-mentionedThe dimension is 7 dimensions, and represents the week number of one week; the above-mentionedThe dimension is 24 dimensions, which represents the first hour of the day; the above-mentionedIs divided into 8 categories including sunny, cloudy, light rain, medium rain, heavy rain, light snow, medium snow and heavy snow; the above-mentionedIs divided into 8 levels from 10 DEG F to 90 DEG F, each 10 DEG F corresponds to one level, all the additional attributes use one-hot coding, and vectors are obtained by connection
The fifth step, based on the multi-graph convolution fusion result FCL (-), capturing the time sequence correlation in the traffic flow history by using a GRU neural network, taking the hidden state at the t '-1 time step and the multi-graph convolution result as input by the GRU, obtaining the flow state at the t' time step, and calculating the flow as follows:
ut′=σ(Wu[FCL(·),ht′-1]+bu)
rt′=σ(Wr[FCL(·),ht′-1]+br)
ct'=tanh(Wc[FCL(·),(rt'⊙ht'-1)]+bc)
ht′=ut′ht′-1+(1-ut′)ct′
h ist'-1Represents said time step t' -1Hidden state of u, the ut'To update the gate, the amount of which memory of the last moment is saved to the current time step is defined, rt'To reset the gate, it is decided how to combine the new input information with the previous memory, ct'For the stored content at the t' time step, the ht'Is said t'Output state at time step, FCL (-) being the multi-graph convolution result at time step t', Wu、Wr、WcIs a weight matrix, said bu、br、bcIs a deviation vector, the sigma is a sigmoid function, and the ⊙ is an element-by-element multiplication operation.
Step six, using Smooth L1 as the loss function, training the weight matrix and the deviation vector by adopting a back propagation and Adam optimization algorithm based on a training set to obtain the prediction model, wherein the target of the prediction model is based on input historical data [ (I)0,O0),(I1,O1),...,(It-1,Ot-1)]Learning a function f (-) and mapping the inlet amount and the outlet amount of each area in the historical data to obtain the inlet amount and the outlet amount of the next time stepSo thatSaid (I)t,Ot) And selecting a model with the minimum Root Mean Square Error (RMSE) as a final prediction model for the real flow value of the next time step t according to the verification set, predicting by adopting the final prediction model based on the test set, and performing reverse normalization on an output result to obtain a final prediction result.
Claims (8)
1. The irregular area flow prediction method based on the multi-graph convolution sum GRU is characterized by comprising the following steps of:
dividing a region into N unconnected irregular regions;
step two, performing space-time simplification on the historical track data, and calculating to obtain the inflow and outflow of all the regions at each time step;
establishing a plurality of correlation diagrams among the regions, constructing a corresponding adjacent matrix, and expressing diversified spatial correlation among irregular regions;
designing a multi-graph convolution neural network based on the correlation diagram among the regions, fusing diversified spatial correlation characteristics among the regions, and obtaining a multi-graph convolution fusion result;
step five, based on the multi-graph convolution fusion result, capturing time correlation by adopting a GRU neural network;
and sixthly, selecting a proper loss function, training to obtain a prediction model, and predicting through the prediction model to obtain the inflow and outflow of each region.
2. The irregular area traffic prediction method based on multi-graph convolution and GRU as claimed in claim 1, wherein in the first step, the region is divided into N unconnected irregular areas by an irregular area division method based on road network structure data of the region.
3. The irregular area traffic prediction method based on multi-map convolution and GRU as claimed in claim 1, wherein in step two, the whole time period of processing is firstly divided into a plurality of time steps according to unit time, and then based on the area division result, the original historical track data is mapped into the area according to unit time, so as to obtain a simplified track:
TRSimp=(startRegion,startDate,startHour,endRegion,endDate,endHour)
wherein startRegion is the ID of the departure area, startDate is the departure date, startHour is the departure unit time, endRegion is the ID of the arrival area, endDate is the arrival date, endHour is the arrival unit time, and then the entering amount of each area is obtained by aggregating simplified track calculationAnd outflow volumeWherein saidRepresents the amount of entry of zone i at the t-th time step, saidRepresents the outflow of zone i at the t-th time step, based finally on theAnd saidCalculating to obtain the entering amount of all the areas under the time step tAnd outflow volume
4. The irregular area traffic prediction method based on multi-graph convolution and GRU (generalized regression analysis) of claim 3, wherein in the third step, different association graphs are established to represent diversified spatial associations between irregular areas, including distance graphs, traffic interaction graphs and traffic association graphs, the different association graphs are all represented by G ═ V (E), and node V is represented by G ═ EiE.g. V represents an irregular region, an edge (V)i,vj) E encodes the degree of association between irregular regions, represented by the adjacency matrix A E RN×NRepresents;
the weight of an edge of the distance map is the distance between two regions, and the adjacent matrix element A in the distance mapdValue of (i, j)The calculation is as follows:
the dist (i, j) represents the distance between the center points of the regions i and j, and the adjacency matrix AdNormalized to [0, 1 ]]And based on a predefined distance threshold thresdThe A is addeddConvert to an 0/1 matrix if Ad(i,j)≤thresdIndicating that the distance between the regions i and j is very close, let A bed(i, j) ═ 1, otherwise the said Ad(i,j)=0;
The flow interaction diagram indicates whether there is frequent bidirectional flow between the two areas by aiming at the TRSimpAggregating the data to obtain the flow value fn (i, j) from the area i to the area j and the flow value fn (j, i) from the area j to the area i in the whole analysis time period, wherein the flow interaction diagram is adjacent to the matrix element AinterThe values of (i, j) are calculated as follows:
the adjacency matrix AinterNormalized to [0, 1 ]]And based on a predefined traffic interaction threshold thresinterThe A is addedinterConvert to an 0/1 matrix if Ainter(i,j)≥thresinterIndicating that the interaction between the regions i and j is very strong, let A beinter(i, j) ═ 1, otherwise the said Ainter(i,j)=0;
The flow correlation diagram indicates the time correlation of flow among the areas, the historical flow value of each area in each time step is obtained, the time sequence of the inlet flow and the outlet flow of each area in the time period is set according to the time period needing to be analyzed, and for the area i, the time sequence is expressed as:the T represents the time period length, the correlation between the area i and the area j is calculated by adopting a Pearson correlation coefficient, and the adjacent matrix element A in the flow correlation diagramcorrThe values of (i, j) are calculated as follows:
h isiAnd h is saidjRepresenting said time series of said regions i and j, said adjacency matrix AcorrNormalized to [0, 1 ]]And based on a predefined flow-related threshold threscorrThe A is addedcorrConvert to an 0/1 matrix if Acorr(i,j)≥threscorrIndicating that the regions i and j have similar time usage patterns, let Acorr(i, j) ═ 1, otherwise the said Acorr(i,j)=0。
5. The irregular area traffic prediction method based on multi-graph convolution and GRU according to claim 4, wherein the fourth step proposes the multi-graph convolution neural network to fully mine useful information hidden in different graphs and capture complex spatial dependency relationships based on a plurality of the correlation graphs among areas, and comprises the following steps:
(1) using a graph convolution neural network model f (X)t′A) processing each graph at time step t':
said Xt′=(It′,Ot′) Is the input of the model at the t' time step, representing the ingress and egress of the region,is an adjacency matrix with self-flow, said INIs an identity matrix, theIs a diagonal matrix in whichThe W isd、Winter、WcorrIs a trainable weight matrix, said tanh (-) represents an activation function;
(2) adding additional attribute attr to each time stept′Coding the flow influence factors at each time step;
(3) combining a plurality of association graphs and the additional attributes by adopting a full connection layer to fuse various spatial association characteristics among the regions;
FCL(Xt′,Ad,Ainter,Acorr,attrt′)=Wfcl[fd(Xt′,Ad),finter(Xt′,Ainter),fcorr(Xt′,Acorr),attrt′]
the W isfclFor the weight matrix, the FCL (-) is the multi-graph convolution fusion result.
6. The method of claim 5, wherein the additional attributes comprise a date attributeAn hour attributeWeather PropertiesAnd temperature propertyThe above-mentionedThe dimension is 7 dimensions, and represents the week number of one week; the above-mentionedThe dimension is 24 dimensions, which represents the hours of the day; the above-mentionedIs divided into 8 categories including sunny, cloudy, light rain, medium rain, heavy rain, light snow, medium snow and heavy snow; the above-mentionedIs divided into 8 levels from 10 DEG F to 90 DEG F, each 10 DEG F corresponds to one level, all the additional attributes use one-hot coding, and vectors are obtained by connection
7. The irregular area traffic prediction method based on multi-map convolution and GRU according to claim 5, wherein in the fifth step, based on the multi-map convolution fusion result FCL (-) and using GRU neural network to capture time sequence correlation in history, the GRU takes hidden state at time step t '-1 and multi-map convolution result as input to obtain traffic state at time step t', and the calculation flow is as follows:
ut'=σ(Wu[FCL(·),ht'-1]+bu)
rt'=σ(Wr[FCL(·),ht'-1]+br)
ct'=tanh(Wc[FCL(·),(rt'⊙ht'-1)]+bc)
ht'=ut'ht'-1+(1-ut')ct'
h ist'-1Represents the hidden state at said time step t' -1, said ut'To update the gate, the amount of which memory of the last moment is saved to the current time step is defined, rt'To reset the gate, it is decided how to combine the new input information with the previous memory, ct'For the stored content at the t' time step, the ht'For the output state at the t 'time step, the FCL (-) is the multi-graph convolution result at the t' time step, the Wu、Wr、WcIs a weight matrix, said bu、br、bcIs a deviation vector, the sigma is a sigmoid function, and the ⊙ is an element-by-element multiplication operation.
8. The irregular area traffic prediction method based on multi-graph convolution and GRU (generalized regression analysis) of claim 7, wherein in the sixth step, the weight matrix and the deviation vector are trained by using a back propagation and Adam optimization algorithm based on a training set by using Smooth L1 as the loss function to obtain the prediction model, and the target of the prediction model is based on input historical data [ (I)0,O0),(I1,O1),...,(It-1,Ot-1)]Learning a function f (-) and mapping the inlet amount and the outlet amount of each area in the historical data to obtain the inlet amount and the outlet amount of the next time stepSo thatSaid (I)t,Ot) Is composed ofAnd selecting a model with the minimum root mean square error as a final prediction model according to the verification set for the real flow value of the next time step t, predicting by adopting the final prediction model based on the test set, and carrying out reverse normalization on the output result to obtain a final prediction result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911148344.6A CN110991713B (en) | 2019-11-21 | 2019-11-21 | Irregular area flow prediction method based on multi-graph convolution sum GRU |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911148344.6A CN110991713B (en) | 2019-11-21 | 2019-11-21 | Irregular area flow prediction method based on multi-graph convolution sum GRU |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110991713A true CN110991713A (en) | 2020-04-10 |
CN110991713B CN110991713B (en) | 2022-04-01 |
Family
ID=70085720
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911148344.6A Active CN110991713B (en) | 2019-11-21 | 2019-11-21 | Irregular area flow prediction method based on multi-graph convolution sum GRU |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110991713B (en) |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111583263A (en) * | 2020-04-30 | 2020-08-25 | 北京工业大学 | Point cloud segmentation method based on joint dynamic graph convolution |
CN111639787A (en) * | 2020-04-28 | 2020-09-08 | 北京工商大学 | Spatio-temporal data prediction method based on graph convolution network |
CN111651504A (en) * | 2020-06-03 | 2020-09-11 | 湖南大学 | Multi-element time sequence multilayer space-time dependence modeling method based on deep learning |
CN111695679A (en) * | 2020-06-09 | 2020-09-22 | 北京百度网讯科技有限公司 | Method and device for predicting input and output amount, electronic device and storage medium |
CN111882925A (en) * | 2020-07-27 | 2020-11-03 | 交通运输部水运科学研究所 | Shipping traffic flow prediction system based on information propagation diagram and recurrent neural network |
CN112200351A (en) * | 2020-09-24 | 2021-01-08 | 深圳市综合交通运行指挥中心 | Urban area passenger flow volume prediction method based on mobile phone signaling data |
CN112383516A (en) * | 2020-10-29 | 2021-02-19 | 博雅正链(北京)科技有限公司 | Graph neural network construction method and abnormal flow detection method based on graph neural network |
CN112419710A (en) * | 2020-10-22 | 2021-02-26 | 深圳云天励飞技术股份有限公司 | Traffic congestion data prediction method, traffic congestion data prediction device, computer equipment and storage medium |
CN112562312A (en) * | 2020-10-21 | 2021-03-26 | 浙江工业大学 | GraphSAGE traffic network data prediction method based on fusion characteristics |
CN112561118A (en) * | 2020-10-29 | 2021-03-26 | 北京水慧智能科技有限责任公司 | Municipal pipe network water flow prediction method based on GRU neural network |
CN112801355A (en) * | 2021-01-20 | 2021-05-14 | 南京航空航天大学 | Data prediction method based on multi-graph fusion space-time attention of long-short-term space-time data |
CN112819213A (en) * | 2021-01-22 | 2021-05-18 | 华南理工大学 | Expressway freight volume prediction method and system based on deep learning network |
CN112911626A (en) * | 2021-02-01 | 2021-06-04 | 福州大学 | Wireless network flow prediction method based on multi-graph convolution |
WO2021221563A1 (en) * | 2020-04-30 | 2021-11-04 | Grabtaxi Holdings Pte. Ltd. | Method for predicting the destination location of a vehicle |
CN114358213A (en) * | 2022-03-08 | 2022-04-15 | 湖南大学 | Error ablation processing method, system and medium for nonlinear time series data prediction |
CN114358375A (en) * | 2021-11-29 | 2022-04-15 | 重庆邮电大学 | Crowd density prediction method and system based on big data |
CN114973653A (en) * | 2022-04-27 | 2022-08-30 | 中国计量大学 | Traffic flow prediction method based on space-time graph convolution network |
CN115018553A (en) * | 2022-06-30 | 2022-09-06 | 东南大学 | Regional logistics single quantity prediction system and method based on deep learning |
CN116405976A (en) * | 2023-06-06 | 2023-07-07 | 中国民用航空飞行学院 | ADS-B-based data bidirectional communication optimization method and system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107967532A (en) * | 2017-10-30 | 2018-04-27 | 厦门大学 | The Forecast of Urban Traffic Flow Forecasting Methodology of integration region vigor |
WO2018155397A1 (en) * | 2017-02-24 | 2018-08-30 | 株式会社日立製作所 | Congestion forecasting system and pedestrian simulation device |
CN109389244A (en) * | 2018-09-06 | 2019-02-26 | 浙江鸿程计算机系统有限公司 | Tourist's number prediction technique in a kind of short-term scenic spot of multifactor perception based on GRU |
CN110119482A (en) * | 2019-05-13 | 2019-08-13 | 杭州电子科技大学 | Based on the crowd of POI and multi-source mobile data collection trip mode visible analysis method |
-
2019
- 2019-11-21 CN CN201911148344.6A patent/CN110991713B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018155397A1 (en) * | 2017-02-24 | 2018-08-30 | 株式会社日立製作所 | Congestion forecasting system and pedestrian simulation device |
CN107967532A (en) * | 2017-10-30 | 2018-04-27 | 厦门大学 | The Forecast of Urban Traffic Flow Forecasting Methodology of integration region vigor |
CN109389244A (en) * | 2018-09-06 | 2019-02-26 | 浙江鸿程计算机系统有限公司 | Tourist's number prediction technique in a kind of short-term scenic spot of multifactor perception based on GRU |
CN110119482A (en) * | 2019-05-13 | 2019-08-13 | 杭州电子科技大学 | Based on the crowd of POI and multi-source mobile data collection trip mode visible analysis method |
Non-Patent Citations (3)
Title |
---|
僧德文 等: "基于多图卷积网络和门控循环单元的不规则区域交通流量预测(英文)", 《FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING》 * |
王敬昌 等: "基于门控循环单元的多因素感知短期游客人数预测模型", 《浙江大学学报(工学版)》 * |
薛佳瑶等: "基于卷积循环神经网络的城市区域车流量预测模型", 《信息工程大学学报》 * |
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111639787A (en) * | 2020-04-28 | 2020-09-08 | 北京工商大学 | Spatio-temporal data prediction method based on graph convolution network |
CN111639787B (en) * | 2020-04-28 | 2024-03-15 | 北京工商大学 | Spatio-temporal data prediction method based on graph convolution network |
CN111583263B (en) * | 2020-04-30 | 2022-09-23 | 北京工业大学 | Point cloud segmentation method based on joint dynamic graph convolution |
US11815360B2 (en) | 2020-04-30 | 2023-11-14 | Grabtaxi Holdings Pte. Ltd. | Method for predicting the destination location of a vehicle |
CN111583263A (en) * | 2020-04-30 | 2020-08-25 | 北京工业大学 | Point cloud segmentation method based on joint dynamic graph convolution |
WO2021221563A1 (en) * | 2020-04-30 | 2021-11-04 | Grabtaxi Holdings Pte. Ltd. | Method for predicting the destination location of a vehicle |
CN111651504B (en) * | 2020-06-03 | 2021-10-08 | 湖南大学 | Multi-element time sequence multilayer space-time dependence modeling method based on deep learning |
CN111651504A (en) * | 2020-06-03 | 2020-09-11 | 湖南大学 | Multi-element time sequence multilayer space-time dependence modeling method based on deep learning |
CN111695679A (en) * | 2020-06-09 | 2020-09-22 | 北京百度网讯科技有限公司 | Method and device for predicting input and output amount, electronic device and storage medium |
CN111695679B (en) * | 2020-06-09 | 2023-12-29 | 北京百度网讯科技有限公司 | Method and device for predicting access amount, electronic equipment and storage medium |
CN111882925A (en) * | 2020-07-27 | 2020-11-03 | 交通运输部水运科学研究所 | Shipping traffic flow prediction system based on information propagation diagram and recurrent neural network |
CN112200351A (en) * | 2020-09-24 | 2021-01-08 | 深圳市综合交通运行指挥中心 | Urban area passenger flow volume prediction method based on mobile phone signaling data |
CN112562312A (en) * | 2020-10-21 | 2021-03-26 | 浙江工业大学 | GraphSAGE traffic network data prediction method based on fusion characteristics |
CN112419710A (en) * | 2020-10-22 | 2021-02-26 | 深圳云天励飞技术股份有限公司 | Traffic congestion data prediction method, traffic congestion data prediction device, computer equipment and storage medium |
CN112419710B (en) * | 2020-10-22 | 2022-07-26 | 深圳云天励飞技术股份有限公司 | Traffic congestion data prediction method, traffic congestion data prediction device, computer equipment and storage medium |
CN112561118A (en) * | 2020-10-29 | 2021-03-26 | 北京水慧智能科技有限责任公司 | Municipal pipe network water flow prediction method based on GRU neural network |
CN112383516A (en) * | 2020-10-29 | 2021-02-19 | 博雅正链(北京)科技有限公司 | Graph neural network construction method and abnormal flow detection method based on graph neural network |
CN112801355A (en) * | 2021-01-20 | 2021-05-14 | 南京航空航天大学 | Data prediction method based on multi-graph fusion space-time attention of long-short-term space-time data |
CN112801355B (en) * | 2021-01-20 | 2022-05-24 | 南京航空航天大学 | Data prediction method based on multi-graph fusion space-time attention of long-short-term space-time data |
CN112819213A (en) * | 2021-01-22 | 2021-05-18 | 华南理工大学 | Expressway freight volume prediction method and system based on deep learning network |
CN112819213B (en) * | 2021-01-22 | 2022-06-14 | 华南理工大学 | Highway freight volume prediction method and system based on deep learning network |
CN112911626A (en) * | 2021-02-01 | 2021-06-04 | 福州大学 | Wireless network flow prediction method based on multi-graph convolution |
CN114358375A (en) * | 2021-11-29 | 2022-04-15 | 重庆邮电大学 | Crowd density prediction method and system based on big data |
CN114358375B (en) * | 2021-11-29 | 2024-05-24 | 重庆邮电大学 | Crowd density prediction method and system based on big data |
CN114358213B (en) * | 2022-03-08 | 2022-06-10 | 湖南大学 | Error ablation processing method, system and medium for nonlinear time series data prediction |
CN114358213A (en) * | 2022-03-08 | 2022-04-15 | 湖南大学 | Error ablation processing method, system and medium for nonlinear time series data prediction |
CN114973653A (en) * | 2022-04-27 | 2022-08-30 | 中国计量大学 | Traffic flow prediction method based on space-time graph convolution network |
CN114973653B (en) * | 2022-04-27 | 2023-12-19 | 中国计量大学 | Traffic flow prediction method based on space-time diagram convolutional network |
CN115018553A (en) * | 2022-06-30 | 2022-09-06 | 东南大学 | Regional logistics single quantity prediction system and method based on deep learning |
CN115018553B (en) * | 2022-06-30 | 2024-05-07 | 东南大学 | Regional logistics single quantity prediction system and method based on deep learning |
CN116405976A (en) * | 2023-06-06 | 2023-07-07 | 中国民用航空飞行学院 | ADS-B-based data bidirectional communication optimization method and system |
CN116405976B (en) * | 2023-06-06 | 2023-09-22 | 中国民用航空飞行学院 | ADS-B-based data bidirectional communication optimization method and system |
Also Published As
Publication number | Publication date |
---|---|
CN110991713B (en) | 2022-04-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110991713B (en) | Irregular area flow prediction method based on multi-graph convolution sum GRU | |
CN109658695B (en) | Multi-factor short-term traffic flow prediction method | |
Lin et al. | Quantifying uncertainty in short-term traffic prediction and its application to optimal staffing plan development | |
CN115240425B (en) | Traffic prediction method based on multi-scale space-time fusion graph network | |
CN112489426B (en) | Urban traffic flow space-time prediction scheme based on graph convolution neural network | |
CN113268916A (en) | Traffic accident prediction method based on space-time graph convolutional network | |
CN111612243A (en) | Traffic speed prediction method, system and storage medium | |
CN111242292B (en) | OD data prediction method and system based on deep space-time network | |
US20240054321A1 (en) | Traffic prediction | |
CN111242395B (en) | Method and device for constructing prediction model for OD (origin-destination) data | |
CN113298319B (en) | Traffic speed prediction method based on skip map attention gating cycle network | |
CN112071062A (en) | Driving time estimation method based on graph convolution network and graph attention network | |
US20240143999A1 (en) | Multi-modal data prediction method based on causal markov model | |
CN112488185A (en) | Method, system, electronic device and readable storage medium for predicting vehicle operating parameters including spatiotemporal characteristics | |
CN114692984A (en) | Traffic prediction method based on multi-step coupling graph convolution network | |
CN112862177A (en) | Urban area concentration degree prediction method, equipment and medium based on deep neural network | |
CN115206092A (en) | Traffic prediction method of BiLSTM and LightGBM model based on attention mechanism | |
CN114565187A (en) | Traffic network data prediction method based on graph space-time self-coding network | |
CN115936069A (en) | Traffic flow prediction method based on space-time attention network | |
CN115376317A (en) | Traffic flow prediction method based on dynamic graph convolution and time sequence convolution network | |
CN117252307B (en) | Traffic prediction method, traffic prediction device, computer equipment and storage medium | |
CN114572229A (en) | Vehicle speed prediction method, device, medium and equipment based on graph neural network | |
CN113327417A (en) | Traffic flow prediction method based on 3D dynamic space-time residual convolution associated network | |
CN114566048B (en) | Traffic control method based on multi-view self-adaptive space-time diagram network | |
CN115796030A (en) | Traffic flow prediction method based on graph convolution |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |