CN115482666B

CN115482666B - Multi-graph convolution neural network traffic prediction method based on data fusion

Info

Publication number: CN115482666B
Application number: CN202211110596.1A
Authority: CN
Inventors: 王兴起; 叶佳妮; 邵艳利; 魏丹; 陈滨; 方景龙
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2022-09-13
Filing date: 2022-09-13
Publication date: 2024-05-07
Anticipated expiration: 2042-09-13
Also published as: CN115482666A

Abstract

The invention discloses a multi-graph convolution neural network traffic prediction method based on data fusion, which aims at solving the problem that the space-time characteristics of nodes are not considered when a traffic topological graph is constructed by a current main flow model, and 3D convolution is utilized to form a 3DRepVGG component to extract the space-time characteristics of European traffic flow data and take the space-time characteristics as the characteristics of each node in the traffic topological graph; the periodic gating logic unit is designed to process non-European traffic flow data, so that the capability of the model for extracting time characteristics is improved, the regional traffic condition is identified and quantified by using a clustering algorithm, the generalization capability of the model is improved, and the parameter quantity is reduced; and constructing a plurality of traffic topological graphs according to the regional traffic state, the non-European traffic flow data and the node characteristic data, enhancing the capability of the model in extracting the remote space characteristics, and further improving the prediction precision.

Description

Multi-graph convolution neural network traffic prediction method based on data fusion

Technical Field

The invention belongs to the field of traffic flow prediction, and relates to a multi-graph convolution neural network traffic prediction method based on data fusion.

Background

Currently, most traffic flow prediction models treat traffic flow data as an European data structure, i.e. a city is divided into grid areas of regular shape, each grid value representing the number of vehicles in the area. The benefit of this European structure is that it is straightforward and intuitive, and the traffic volume per zone can be clearly understood, but the drawbacks are also apparent, and the structure is not indicative of zone-to-zone vehicle flow. Along with the construction of expressways in cities, vehicles can flow more conveniently across areas, and dependency relationship can be generated between two areas which are far apart. Deployment of a large number of advanced sensors on roads also makes it possible to collect inter-regional vehicle flow information. By means of the vehicle flow information between the areas, the change of urban traffic can be more intuitively known. The areas will not only affect adjacent areas, but also vehicle flows with areas that are far apart, and even geographically adjacent areas will not be affected by the areas in certain time periods, and new traffic flow data will often exhibit non-European structural features, preserving effective long-range spatial correlation.

In general, european traffic flow data contains fewer long-range spatial features and because of the shared nature of the convolution kernel parameters, the field of view of the model extraction features is smaller. In recent years, in a model using european traffic flow data as a data set, in order to extract a distant spatial correlation, researchers often adopt two methods of hole convolution and deepening network depth, and although this method can obtain global spatial features to a certain extent, the possibility of extracting invalid information by the model is increased, so that the prediction accuracy of the model is affected. In addition, when facing a training set of a small scale, the prediction model with a deeper structure often has problems such as gradient explosion or gradient disappearance. In summary, it is insufficient to predict regional traffic data for the next time step using only European traffic flow data, and non-European traffic flow data is required to construct a traffic topology map. However, since traffic flow also has multiple characteristics, constructing a non-European topology map with only a single characteristic is not sufficient to fully characterize traffic flow data. In addition, node features in non-European data generally only have features such as flow or speed, but have no space-time features, and most mainstream prediction models often ignore the features. Therefore, how to effectively process non-European structures and consider various characteristics and node characteristics of traffic flow to accurately and comprehensively characterize traffic flow data to construct an accurate traffic flow prediction model has yet to be studied.

Disclosure of Invention

Aiming at the defects of the prior art, the invention discloses a multi-graph convolution neural network traffic prediction method based on data fusion, so as to realize efficient and accurate prediction of large-scale traffic flow data. The main content is as follows: (1) Aiming at the space-time characteristics of nodes are not considered when a traffic topological graph is constructed by a current main flow model, 3D convolution is utilized to form a 3DRepVGG component to extract the space-time characteristics of European traffic flow data, the space-time characteristics are used as each node characteristic (2) in the traffic topological graph, a periodic gating logic unit is designed to process non-European traffic flow data, the capacity of the model for extracting time characteristics is improved, the regional traffic condition is identified and quantified by a clustering algorithm, the generalization capacity of the model is improved, and the quantity of parameters is reduced. (3) And constructing a plurality of traffic topological graphs according to the regional traffic state, the non-European traffic flow data and the node characteristic data, enhancing the capability of the model in extracting the remote space characteristics, and further improving the prediction precision.

The invention comprises two steps: constructing a multi-graph convolution neural network traffic prediction model based on data fusion, and training and testing the multi-graph convolution neural network traffic prediction model based on data fusion.

Step 1: multi-graph convolution neural network traffic prediction model construction based on data fusion

The construction of the multi-graph convolution neural network traffic prediction model based on data fusion comprises 3 steps: non-European node characteristic information is obtained, a traffic topological graph is constructed, and a multi-graph convolution neural network is constructed;

Step 1-1: non-European node characteristic information acquisition

3D convolution is utilized to form a 3DRepVGG component to extract the space-time characteristics of European traffic flow data and serve as non-European node characteristic information;

The 3DRepVGG component contains three layers RepVGG operations, each layer RepVGG operation contains two 3D convolutions, one 3D hole convolution and activation operations; the formula for the 3DRepVGG component is as follows:

Wherein, sigma is the activation operation, The output of the operations for layer RepVGG, which is also the input of the operations for layer RepVGG, conv ¹ is a 3D convolution of 1 x 1 structure, conv ³ is a 3D convolution of a 3x 3 structure and dconv ³ is a 3D hole convolution of a 3x 3 structure.

The spatio-temporal feature Y _N is finally obtained by the 3DRepVGG component and serves as a non-euro node feature V.

Step 1-2: traffic topology construction

In order to fully express the traffic flow characteristics, the prediction accuracy of the model is further improved, and a traffic flow similarity topological graph and a traffic flow influence topological graph are respectively constructed to comprehensively describe the characteristics of traffic flow. The construction of the traffic topology map is divided into 4 steps: non-European traffic flow direction data acquisition, regional traffic state division, traffic flow direction similarity topological graph construction and traffic flow direction influence topological graph.

Step 1-2-1: non-European traffic flow data acquisition

The periodic gating logic unit is designed to obtain periodic non-European traffic flow data. The period gating logic unit consists of two pooling layers, a one-dimensional rolling and activating operation, and the formula of the unit is as follows:

Wherein X _NEur is non-European traffic flow data, AP is average pooling operation, MP is maximum pooling operation, delta ₁ and delta ₂ are self-defined learnable variables, sigma is activating operation, CNN is one-dimensional convolution, Is periodic traffic flow data.

Step 1-2-2: regional traffic state partitioning

And dividing the traffic state of the area by using a K-means clustering algorithm. The method comprises the following specific steps:

(1) Traffic flow data preprocessing. The number of vehicles flowing out of the area, the number of vehicles flowing in the area, and the number of area connections (the number of areas flowing with other vehicles) are obtained from the non-European traffic flow data, and a3 XH X W sample set is formed.

(2) Initializing a cluster class center. Setting the number K of needed cluster types, and randomly generating K cluster type centers Ck, K E (1..K) by an algorithm.

(3) Sample variance values are calculated. And calculating the difference value between the sample point and the cluster center point according to the Euclidean distance, wherein the class to which each sample belongs is the class represented by the cluster center with the smallest distance.

(4) Updating the cluster center. The average value of all samples in the current cluster is calculated and used as a new cluster center.

(5) Repeating the step (3) and the step (4) until all sample points are not changed in category.

And finally obtaining the original cluster labels C _k of the areas.

Step 1-2-3: traffic flow similarity topology graph construction

And constructing a traffic flow similarity topological graph by the node characteristic data, the periodic traffic flow data and the regional traffic state data. The definition of the traffic flow similarity value is as follows:

Wherein, Is the traffic state of the vehicle outflow zone,/>Is the traffic state of the vehicle inflow region, C _max is the maximum value of the clusters, k _all is the number of the clusters, and the total traffic state classification value of the current traffic state cluster is the upper part of the cluster. e _S (i, j) is the similarity value of the traffic flow direction of the area i to the area j, namely the edge feature in the traffic topology. E _s is the traffic flow similarity of the city, which is composed of the traffic flow similarity values of all areas. Finally, the traffic flow similarity topology is denoted as G _s＝(V,E_s).

Step 1-2-4: traffic flow direction influencing topological graph construction

And constructing a traffic flow similarity topological graph by the node characteristic data and the periodic traffic flow data. The traffic flow direction influence degree value is defined as follows:

Where e _(i,j) is the vehicle flow data between region i and region j, For the total flow value of the area i, the numerator is the difference value of the two, and e _P (i, j) is the traffic flow direction influence degree value of the area i and the area j. E _P is the degree of influence of the traffic flow direction of the city, which is composed of the traffic flow direction influence values of all areas. Finally, the traffic flow direction impact topology is denoted as G _P＝(V,E_P).

Step 1-3: multi-graph convolution neural network construction

After two traffic topological graphs are constructed, a multi-graph convolution neural network is designed to extract the characteristics of the two topological graphs.

The multi-graph convolutional neural network is composed of a graph convolutional neural network, an activation operation and a full connection layer. The two topological graphs are respectively subjected to graph convolution neural network and activation operation to extract features, and then the features are fused and further learned through a full connection layer to obtain the features of traffic flow data. The specific formula is as follows:

Y_G＝Θ(Θ(o₁×Y_S+o₂×Y_P))

wherein, Q _S is a node degree matrix obtained according to the traffic flow direction similarity topological graph G _S, and Q _P is a degree matrix obtained according to the traffic flow direction influence topological graph G _p; f is the total operator of the convolution operation and the activation function, and two operations are needed for G _S and G _p to obtain Y _S and Y _P, respectively. Θ is a fully connected operation, o ₁ and o ₂ are custom learnable variables, and Y _G is a feature of the traffic topology map.

And finally, fusing the features extracted by the 3DRepVGG component with the features of the traffic topological graph to obtain a final prediction result.

Y＝q₁×Y_N+q₂×Y_G

Wherein q ₁ and q ₂ are custom learnable variables.

Step 2: multi-graph convolution neural network traffic prediction model training and testing based on data fusion

Training and testing of a multi-graph convolution neural network traffic prediction model based on data fusion are divided into two steps: model training and model testing;

Step 2-1: model training

The traffic flow data is divided into a training set and a testing set, after the training set data are ordered according to time sequence, part of data are extracted as input according to the requirements of each component in each training round.

The Adam optimization algorithm is adopted to adjust parameters during model training; when the training times reach the set k value, the model stops training;

step 2-2: model testing

And predicting in the test set by using the trained model, and comparing the obtained predicted value with a real observed value to obtain the prediction precision.

The evaluation index of the model performance is as follows:

mean absolute error (Mean Absolute Error, MAE): the result is the average of the absolute errors between the actual and predicted values, and the formula is as follows:

Mean absolute percent error (Mean Absolute Percentage Error, MAPE): the result is the average of the absolute percentage error between the actual and predicted values, as follows:

Root mean square error (Root Mean Square Error, RMSE): the result is the arithmetic square root of the mean square error between the actual and predicted values, as follows:

after the model obtains three evaluation indexes, the model is compared with the current main stream model, and the performance of the model is proved to be in the leading position.

The method of the invention, namely the multi-graph convolution neural network traffic prediction model based on data fusion, can realize better extraction of the characteristics of traffic flow data under the condition of using European traffic flow data and non-European traffic flow data at the same time. Based on the trained parameter model, a high-precision prediction result can be obtained. Meanwhile, the invention has the following characteristics:

1) The important characteristic of non-European traffic flow data is the long-distance spatial characteristic, in the method of the invention, a multi-graph convolution neural network is formed by utilizing the graph convolution neural network and the full-connection layer, and the characteristic of the traffic topological graph is extracted by utilizing the spectrogram theory, so that the model can model urban traffic more comprehensively.

2) At present, most researches do not consider the space-time characteristics of the nodes, the flow or the speed of the nodes are generally used as single characteristics of the nodes, the space-time characteristics of the nodes are ignored, and the model cannot comprehensively learn the characteristics of traffic flow data. The invention adopts 3DRepVGG components to extract the space-time characteristics of European traffic flow data and takes the space-time characteristics as the space-time characteristics of the nodes, so that the model can learn the characteristics of the nodes more comprehensively and further improve the prediction precision.

3) Aiming at the defect of insufficient time feature extraction capability of the graph convolutional neural network, the invention designs the periodic gating logic unit to firstly process non-European traffic flow data to obtain periodic traffic flow data to construct a traffic topological graph, and improves the time feature extraction capability of the model, thereby helping the model to obtain more accurate prediction.

4) Aiming at the characteristic that a single traffic flow topological graph cannot comprehensively express traffic flow data, the traffic flow similarity topological graph and the traffic flow influence topological graph are respectively constructed, traffic flow is respectively modeled from two angles of traffic flow similarity and traffic flow influence degree, and the model is helped to learn the characteristics of different traffic flow characteristics and traffic flow influence degrees of similar areas, so that the prediction accuracy of the model is further improved.

Drawings

Fig. 1 is a general flow chart of the present invention.

Fig. 2 is a model flow diagram of the present invention.

Fig. 3 is a block diagram of 3DRepVGG of the present invention.

FIG. 4 is a block diagram of the cycle gating logic component of the present invention.

Fig. 5 is a multi-graph convolutional neural network structure of the present invention.

Detailed Description

As shown in fig. 1, the present invention comprises two steps: constructing a multi-graph convolution neural network traffic prediction model based on data fusion, and training and testing the multi-graph convolution neural network traffic prediction model based on data fusion.

The specific flow is shown in fig. 2, and the construction of the multi-graph convolution neural network traffic prediction model based on data fusion comprises 3 steps: non-European node characteristic information is obtained, a traffic topological graph is constructed, and a multi-graph convolution neural network is constructed;

Step 1-1: non-European node characteristic information acquisition

As shown in fig. 3, 3D convolution is used to form a 3DRepVGG component to extract the space-time characteristics of the european traffic flow data and to serve as non-european node characteristic information;

Step 1-2: traffic topology construction

In order to fully express traffic flow characteristics and further improve the prediction accuracy of the model, the invention respectively constructs a traffic flow similarity topological graph and a traffic flow influence topological graph to comprehensively describe the characteristics of traffic flow. The construction of the traffic topology map is divided into 4 steps: non-European traffic flow direction data acquisition, regional traffic state division, traffic flow direction similarity topological graph construction and traffic flow direction influence topological graph.

Step 1-2-1: non-European traffic flow data acquisition

As shown in fig. 4, the period gating logic is designed to obtain periodic non-european traffic flow data. The period gating logic unit consists of two pooling layers, a one-dimensional rolling and activating operation, and the formula of the unit is as follows:

Step 1-2-2: regional traffic state partitioning

And finally obtaining the original cluster labels C _k of the areas.

Step 1-2-3: traffic flow similarity topology graph construction

Step 1-2-4: traffic flow direction influencing topological graph construction

Step 1-3: multi-graph convolution neural network construction

As shown in fig. 5, the multi-graph convolutional neural network is composed of a graph convolutional neural network, an activation operation, and a full connection layer. The two topological graphs are respectively subjected to graph convolution neural network and activation operation to extract features, and then the features are fused and further learned through a full connection layer to obtain the features of traffic flow data. The specific formula is as follows:

Y_G＝Θ(Θ(o₁×Y_S+o₂×Y_P))

Wherein, Q _S is a degree matrix of each node obtained according to the traffic similarity topological graph G _S, and Q _P is a degree matrix obtained according to the traffic similarity topological graph G _p. f is the total operator of the convolution operation and the activation function, and two operations are needed for G _S and G _p to obtain Y _S and Y _P, respectively. Θ is the fully connected operation, o ₁ and o ₂ are the custom learnable variables, Y _G is the characteristics of the traffic topology map

Y＝q₁×Y_N+q₂×Y_G

Wherein q ₁ and q ₂ are custom learnable variables.

Step 2-1: model training

The data set is selected from taxi driving data of 1 month and 1 day in 2015, and New York. The dataset divided new york city into a 10x 20 grid plot, data collected every half hour, data divided into two formats: the first is data of European structure, and regional traffic data of the whole city is defined as tensor form: Where T is the time step and C is the traffic type (input traffic and output traffic). The second type of data is data with a non-European structure, and the traffic flow data among the whole urban areas is defined as tensor form: /(I) Representing the value of the vehicle flow between the region (i, j) and the other region at time step t.

The data of the first 40 days in the data set is used as a training set, and part of data is extracted as input according to the requirements of each component in each training round. The periodicity component extracts periodicity data for 4 time points as input (t=4), the near term component extracts 16 time point data for near term data as input (t=16), and the trend component extracts 16 time point data one week apart from the near term data as input (t=16).

The parameters are adjusted by adopting an Adam optimization algorithm during model training, the first-order exponential decay rate is set to 0.9, the second-order exponential decay rate is set to 0.999, the learning rate is set to 1e-3, and the weight decay is set to 0.005. When the number of training times reached 600 times set, the model stopped training.

Step 2-2: model testing

The evaluation index of the model performance is as follows:

The data of the first 40 days in the data set is used as a test set, a trained model is used for prediction in the test set, the obtained predicted value is compared with the actual observed value to obtain the prediction precision, and three indexes of MAE, MAPE and RMSE are adopted as the evaluation indexes of the model performance. After the model obtains three evaluation indexes, the three evaluation indexes are compared with the current mainstream model. The model provided by the invention is in the leading position on most indexes, the data of three indexes of the model are 5.51, 0.27 and 14.09 respectively, and experimental results prove that the invention is effective.

Claims

1. The traffic prediction method of the multi-graph convolution neural network based on data fusion is characterized by comprising the following steps of: the method specifically comprises the following steps:

step one: multi-graph convolution neural network traffic prediction model construction based on data fusion

The construction of the multi-graph convolution neural network traffic prediction model based on data fusion comprises 3 steps: non-European node characteristic information is obtained, a traffic topological graph is constructed, and a multi-graph convolution neural network is constructed; the traffic topological graph consists of a traffic flow direction similarity topological graph and a traffic flow direction influence topological graph;

the multi-graph convolution neural network is constructed specifically as follows:

After two traffic topological graphs are constructed, a multi-graph convolution neural network is designed to extract the characteristics of the two topological graphs;

The multi-graph convolution neural network consists of a graph convolution neural network, an activation operation and a full connection layer; the two topological graphs are respectively subjected to graph convolution neural network and activation operation to extract features, and then the features are fused and further learned through a full connection layer to obtain the features of traffic flow direction data; the specific formula is as follows:

Y_G＝Θ(Θ(o₁×Y_S+o₂×Y_P))

Wherein, Q _S is a node degree matrix obtained according to the traffic flow direction similarity topological graph G _S, and Q _P is a degree matrix obtained according to the traffic flow direction influence topological graph G _p; f is the total operator of the graph convolution operation and the activation function, and G _S and G _p can obtain Y _S and Y _P through two operations respectively; Θ is a fully connected operation, o ₁ and o ₂ are custom learnable variables, and Y _G is a feature of a traffic topology map;

finally, fusing the features extracted by the 3DRepVGG component with the features of the traffic topological graph to obtain a final prediction result;

Y＝q₁×Y_N+q₂×Y_G

Wherein q ₁ and q ₂ are self-defined learnable variables, and Y _N is a characteristic obtained by non-European node characteristic information;

step two: multi-graph convolution neural network traffic prediction model training and testing based on data fusion

The non-European node characteristic information is obtained specifically as follows:

Wherein, sigma is the activation operation, The output of the operations for layer RepVGG, which is also the input of the operations for layer RepVGG, conv ¹ is a 3D convolution of 1 x 1 structure, conv ³ is a 3D convolution of a3 x 3 structure, dconv ³ is a 3D hole convolution of a3 x 3 structure;

2. The data fusion-based multi-graph convolutional neural network traffic prediction method of claim 1, wherein the method comprises the steps of: the traffic topology map is constructed specifically as follows:

The construction of the traffic topology map is divided into 4 steps: non-European traffic flow direction data acquisition, regional traffic state division, traffic flow direction similarity topological graph construction and traffic flow direction influence topological graph;

step 1: non-European traffic flow data acquisition

The method comprises the steps of designing a period gating logic unit to obtain periodic non-European traffic flow direction data; the period gating logic unit consists of two pooling layers, a one-dimensional rolling and activating operation, and the formula of the unit is as follows:

Wherein X _NEur is non-European traffic flow data, AP is average pooling operation, MP is maximum pooling operation, delta ₁ and delta ₂ are self-defined learnable variables, sigma is activating operation, CNN is one-dimensional convolution, Is periodic traffic flow data;

Step2: regional traffic state partitioning

Dividing the traffic state of the area by using a K-means clustering algorithm; the method comprises the following specific steps:

(1) Traffic flow data preprocessing

Acquiring the number of vehicles flowing out of a region, the number of vehicles flowing in of the region and the number of region connection, namely the number of regions flowing with other vehicles, from non-European traffic flow data to form a3 XH XW sample set;

(2) Initializing cluster class centers

Setting the number K of needed clusters, and randomly generating K cluster centers Ck and K E (1..K) by an algorithm;

(3) Calculating a sample difference value

Calculating the difference value between the sample point and the cluster center point according to the Euclidean distance, wherein the class to which each sample belongs is the class represented by the cluster center with the smallest distance;

(4) Updating cluster centers

Calculating the average value of all samples in the current cluster, and taking the average value as a new cluster center;

(5) Repeating step (3) and step (4) until all sample points no longer change category;

finally, the original cluster labels C _k of all the areas are obtained;

Step 3: traffic flow similarity topology graph construction

Constructing a traffic flow similarity topological graph by node characteristic data, periodic traffic flow data and regional traffic state data; the definition of the traffic flow similarity value is as follows:

Wherein, Is the traffic state of the vehicle outflow zone,/>Is the traffic state of the vehicle inflow region, C _max is the maximum value of clusters, k _all is the number of clusters, and the total traffic state classification value of the current traffic state cluster is divided; e _S (i, j) is the similarity value of the traffic flow direction of the area i to the area j, namely the edge characteristic in the traffic topological graph; e _s is the similarity of the traffic flow directions of cities, and is composed of the traffic flow direction similarity values of all areas; finally, the traffic flow similarity topological graph is represented as G _s＝(V,E_s), and V is a non-European node characteristic;

Step 4: traffic flow direction influencing topological graph construction

Constructing a traffic flow similarity topological graph by the node characteristic data and the periodic traffic flow data; the traffic flow direction influence degree value is defined as follows:

Where e _(i,j) is the vehicle flow data between region i and region j, For the total flow value of the region i, the numerator is the difference value of the two, and e _P (i, j) is the traffic flow direction influence degree value of the region i and the region j; e _P is the influence degree of the traffic flow direction of the city, and is composed of the traffic flow direction influence values of all areas; finally, the traffic flow direction impact topology is denoted as G _P＝(V,E_P).

3. The data fusion-based multi-graph convolutional neural network traffic prediction method of claim 1, wherein the method comprises the steps of: the model training is specifically as follows:

dividing traffic flow data into a training set and a testing set, sequencing the training set data according to time sequence, and extracting partial data as input according to the requirements of each component in each training round;

The Adam optimization algorithm is adopted to adjust parameters during model training; when the training times reach the set k value, the model stops training.

4. The data fusion-based multi-graph convolutional neural network traffic prediction method of claim 1, wherein the method comprises the steps of: the model test specifically comprises the following steps:

Predicting in the test set by using the trained model, and comparing the obtained predicted value with a real observed value to obtain prediction precision;

The evaluation index of the model performance is as follows:

Average absolute error: the result is the average of the absolute errors between the actual and predicted values, and the formula is as follows:

average absolute percentage error: the result is the average of the absolute percentage error between the actual and predicted values, as follows:

Root mean square error: the result is the arithmetic square root of the mean square error between the actual and predicted values, as follows: