CN111754019A

CN111754019A - A road segment feature representation learning algorithm based on spatiotemporal graph information maximization model

Info

Publication number: CN111754019A
Application number: CN202010382570.7A
Authority: CN
Inventors: 刘威; 何枷瑜; 王海明; 朱怀杰; 余建兴; 印鉴; 邱爽
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2020-05-08
Filing date: 2020-05-08
Publication date: 2020-10-09
Anticipated expiration: 2040-05-08
Also published as: CN111754019B

Abstract

The invention provides a road section feature representation learning algorithm based on a space-time diagram information maximization model, which considers the timeliness of road section states, deeply excavates the time information of road sections, adopts a maximization mutual information mechanism, extracts the mutual influence and interaction relation among the road section information, the time information and the traffic information and utilizes the relation in the learning algorithm based on a neural network. The obtained road section represents the real-time global traffic condition, and learns the upstream and downstream real-time dependency relationship among the road sections, so that the precision of travel time prediction is greatly improved.

Description

A road segment feature representation learning algorithm based on spatiotemporal graph information maximization model

技术领域technical field

本发明涉及图神经网络等相关领域，更具体地，涉及一种基于时空图信息最大化模型的路段特征表示学习算法。The invention relates to related fields such as graph neural networks, and more particularly, to a road segment feature representation learning algorithm based on a spatiotemporal graph information maximization model.

背景技术Background technique

随着机动车数量激增，城市交通拥堵状况日益严峻，并引出出行效率低下、资源浪费等一系列问题。旅行时间预测在交通管理、路径规划、拼车、车辆派单等应用都起着至关重要的作用。现如今几乎所有旅行服务应用都有这项功能，比如谷歌地图，百度地图，滴滴等。在准确的旅行时间估计支撑下，用户可以合理规划个人出现路径，避免在拥堵路段浪费时间。同时，城市亦可合理进行路径指引，有效减缓拥堵问题。因此，许多研究人员致力于及时有效的旅行时间估计。然而，由于旅行时间估计的复杂性，提供准确的估计仍然是一项挑战性的任务。有一种方法被应用于大多数旅行时间预测的任务中，那就是基于路段的旅行时间预测。它能大程度缓解基于路径的旅行时间预测所带来的数据稀疏的风险，得到了很多关注。往年关于基于路段的旅行时间预测的方法存在很多问题，其中一个很大的问题就是关于路段特征表示的学习不够准确，导致预测的精度不尽如人意。随着神经网络的兴起，Embedding被应用到路段特征表示中，并在提取路段静态信息如道路类型、路段间拓扑关系等中起到了很大的作用，其中就有非朴素的网络表示学习方法，该方法基于图卷积神经网络很好的学习到带属性的网络节点特征表示。但是，路网是一个复杂的网络，该方法在路网上还是存在以下几个问题，那就是1)它会导致邻接路段不可区分。目前所提出的方法的目标都是使节点表示和邻接节点表示更相近，但在实际的路网中会出现两条相邻路段只有其中一条拥堵的情况，那么它们应该更可区分；2)它没有考虑路段和交通条件的相互影响。全局的交通条件会影响着路网中每一条路段，而某些关键路段的状态也反过来决定交通状况；3)它没有考虑路段本身的时变性。在不同的时间段，路段的状态可能不一致，例如路段在早上拥堵而下午畅通。With the surge in the number of motor vehicles, urban traffic congestion is becoming more and more severe, and a series of problems such as low travel efficiency and waste of resources are introduced. Travel time prediction plays a vital role in applications such as traffic management, route planning, carpooling, and vehicle dispatching. Almost all travel service apps now have this feature, such as Google Maps, Baidu Maps, Didi, etc. With the support of accurate travel time estimates, users can reasonably plan their personal appearance paths and avoid wasting time in congested road sections. At the same time, the city can also reasonably guide the route to effectively alleviate the congestion problem. Therefore, many researchers work on timely and efficient travel time estimation. However, providing accurate estimates remains a challenging task due to the complexity of travel time estimation. One approach that is used in most travel time prediction tasks is segment-based travel time prediction. It can greatly alleviate the risk of data sparsity brought by path-based travel time prediction, and has received a lot of attention. In previous years, there have been many problems in the method of road segment-based travel time prediction. One of the big problems is that the learning of road segment feature representation is not accurate enough, resulting in unsatisfactory prediction accuracy. With the rise of neural networks, Embedding has been applied to road segment feature representation, and has played a great role in extracting road segment static information such as road type, topological relationship between road segments, etc. Among them, there are non-naive network representation learning methods. This method is based on the graph convolutional neural network to learn the network node feature representation with attributes very well. However, the road network is a complex network, and this method still has the following problems on the road network, that is, 1) it will cause the adjacent road segments to be indistinguishable. The goal of the proposed methods is to make the node representation and the adjacent node representation more similar, but in the actual road network, there will be two adjacent road segments only one of which is congested, so they should be more distinguishable; 2) it The interaction of road segments and traffic conditions is not considered. The global traffic conditions will affect each road segment in the road network, and the state of some key road segments also determines the traffic conditions in turn; 3) It does not consider the time-varying of the road segment itself. In different time periods, the status of the road segment may be inconsistent, for example, the road segment is congested in the morning and clear in the afternoon.

发明内容SUMMARY OF THE INVENTION

本发明提供一种基于时空图信息最大化模型的路段特征表示学习算法，该方法可实现学习路段的实时特征表示。The invention provides a road section feature representation learning algorithm based on a spatiotemporal map information maximization model, which can realize the real-time feature representation of the learned road section.

为了达到上述技术效果，本发明的技术方案如下：In order to achieve above-mentioned technical effect, technical scheme of the present invention is as follows:

一种基于时空图信息最大化模型的路段特征表示学习算法，包括以下步骤：A road segment feature representation learning algorithm based on a spatiotemporal graph information maximization model, comprising the following steps:

S1：从路网中提取路段属性，生成路段初始向量，并基于历史数据中的轨迹构造时间邻接矩阵；S1: Extract the attributes of the road segment from the road network, generate the initial vector of the road segment, and construct a temporal adjacency matrix based on the trajectory in the historical data;

S2：对交通状况采用CNN和max-pooling操作，提取对应的交通状态/流表示；S2: Use CNN and max-pooling operations on traffic conditions to extract the corresponding traffic state/flow representation;

S3：将S1、S2和S3的数据输入到编码器进行训练，获得实时的路段表示；S3: Input the data of S1, S2 and S3 into the encoder for training to obtain the real-time road segment representation;

S4：将得到的路段表示作为目标，通过全连接层得到路段的动态表示。S4: Take the obtained road segment representation as the target, and obtain the dynamic representation of the road segment through the fully connected layer.

进一步地，所述步骤S1的具体过程是：Further, the specific process of the step S1 is:

S11：进行数据预处理，通过路网获取每条路段的静态属性，在这里使用路段类型、车道数、是否为单行路这三个属性；S11: Perform data preprocessing, obtain the static attributes of each road section through the road network, and use the three attributes of road section type, number of lanes, and whether it is a one-way road here;

S12：对该三个属性生成对应的one-hot向量，进行拼接后通过全连接层得到路段初始向量R＝{r₁,r₂,…,r_N}；S12: generate the corresponding one-hot vector for the three attributes, and obtain the initial vector R={r ₁ ,r ₂ ,...,r _N } through the fully connected layer after splicing;

S13：将历史数据的路段轨迹按时间段分割，根据不同时间段的轨迹得到时间邻接矩阵A^(t)，即如果在某一时间段内，从历史轨迹中得到某些路段被多次行驶并存在上下游关系，则对应的路段则具有邻接关系，而不是简单的从拓扑关系确定邻接关系。S13: Divide the road segment trajectories of the historical data into time segments, and obtain the time adjacency matrix A ^(t) according to the trajectories of different time periods, that is, if in a certain time period, some road segments obtained from the historical trajectories have been driven multiple times and If there is an upstream-downstream relationship, the corresponding road segment has an adjacency relationship, instead of simply determining the adjacency relationship from the topological relationship.

进一步地，所述步骤S2的具体过程是：Further, the specific process of the step S2 is:

S21：将对应城市进行网格划分，计算对应网格的拥堵情况、交通流；S21: divide the corresponding city into a grid, and calculate the congestion situation and traffic flow of the corresponding grid;

S22：将网格数据输入到CNN中得到交通状态、交通流的表示；S22: Input grid data into CNN to obtain the representation of traffic state and traffic flow;

基于CNN，从网格数据中学习到可以放映实时交通情况的表示；使用同样的方法得到网格流入流出的表示；具体计算公式如下：Based on CNN, a representation that can show real-time traffic conditions is learned from grid data; the same method is used to obtain a representation of grid inflow and outflow; the specific calculation formula is as follows:

s^(t)＝CNN(S^(t))。s ^(t) =CNN(S ^(t) ).

进一步地，所述步骤S3的具体过程是：Further, the specific process of the step S3 is:

S31：利用图卷积神经网络，从路段初始向量以及时间邻接矩阵得到路段的邻接表示h^(t)；S31: Using a graph convolutional neural network, obtain the adjacency representation h ^(t) of the road segment from the initial vector of the road segment and the time adjacency matrix;

S32：通过负采样，重复S31的步骤得到损坏的路段邻接表示

S32: Through negative sampling, repeat the steps of S31 to obtain the adjacent representation of damaged road sections

S33：利用readout函数对路段的邻接表示进行归纳，得到图的全局表示g^(t)；S33: Use the readout function to summarize the adjacency representation of the road segment to obtain the global representation g ^(t) of the graph;

S34：将图的全局表示、交通状态、流入以及流出进行拼接，得到实时的图高阶归纳

S34: Splicing the global representation, traffic state, inflow, and outflow of the graph to obtain a real-time high-order graph induction

S35：将得到的邻接表示、负采样邻接表示、图高阶归纳根据以下目标函数使用梯度下降最大化进行模型的训练，训练稳定后得到的路段的邻接表示即为最终的路段表示，函数公式如下：S35: Use the obtained adjacency representation, negative sampling adjacency representation, and graph high-order induction to train the model using gradient descent maximization according to the following objective function, and the adjacency representation of the road segment obtained after the training is stabilized is the final road segment representation, and the function formula is as follows :

进一步地，实际上就是基于正样本和负样本之间的Jensen-Shannon divergence即J-S散度，最大化路段表示和全局表示的交互信息，那么得到的邻接表示更趋向于保留全局图表示的交互信息，发现和保留局部级别的相似性如具有相似结构特征的远距离路段。Further, in fact, based on the Jensen-Shannon divergence between the positive samples and the negative samples, that is, the J-S divergence, to maximize the interaction information between the road segment representation and the global representation, the obtained adjacency representation tends to retain the interaction information represented by the global graph. , find and preserve local-level similarity such as long-distance road segments with similar structural features.

进一步地，所述步骤S4的具体过程是：Further, the specific process of the step S4 is:

S41：考虑到路段表示具有时间周期规律，将路段静态表示和one-hot编码的时间通过全连接层映射到一个低维表示，获取动态路段表示；S41: Considering that the road segment representation has a time period regularity, map the static representation of the road segment and the time of one-hot encoding to a low-dimensional representation through a fully connected layer to obtain a dynamic road segment representation;

S42：使用训练稳定后的编码器获取路段表示；S42: Obtain the road segment representation using the stabilized encoder after training;

S43：利用L损失最小化两者的差别，并优化全连接层的参数，训练稳定后，便可基于全连接层得到路段的动态表示；S43: Use the L loss to minimize the difference between the two, and optimize the parameters of the fully connected layer. After the training is stable, the dynamic representation of the road segment can be obtained based on the fully connected layer;

S44：实际上就是根据路段状态本身的周期性，以及考虑到数据稀疏问题，对路段动态表示进行压缩，主要公式如下：S44: In fact, according to the periodicity of the state of the road segment itself, and considering the problem of data sparse, the dynamic representation of the road segment is compressed. The main formula is as follows:

H^(t)＝ε(R+A^(t))H ^(t) = ε(R+A ^(t) )

与现有技术相比，本发明技术方案的有益效果是：Compared with the prior art, the beneficial effects of the technical solution of the present invention are:

本发明考虑路段状态的时间性，深入挖掘了路段的时间信息，并采用了最大化互信息机制，把路段信息、时间信息和交通信息三者之间相互影响、相互作用的关系提取并利用到基于神经网络的学习算法中。得到的路段表示更好的反映实时的全局交通情况，并学习到路段间上下游实时的依赖关系，大大提高了旅行时间预测的精度。The present invention considers the temporality of the road section state, deeply excavates the time information of the road section, and adopts the maximizing mutual information mechanism to extract and utilize the mutual influence and interaction relationship among the road section information, time information and traffic information. in learning algorithms based on neural networks. The obtained road segment representation better reflects the real-time global traffic situation, and learns the real-time upstream and downstream dependencies between road segments, which greatly improves the accuracy of travel time prediction.

附图说明Description of drawings

图1为本发明方法流程图；Fig. 1 is the flow chart of the method of the present invention;

图2为本发明流程示意图。Figure 2 is a schematic flow chart of the present invention.

具体实施方式Detailed ways

附图仅用于示例性说明，不能理解为对本专利的限制；The accompanying drawings are for illustrative purposes only, and should not be construed as limitations on this patent;

为了更好说明本实施例，附图某些部件会有省略、放大或缩小，并不代表实际产品的尺寸；In order to better illustrate this embodiment, some parts of the drawings are omitted, enlarged or reduced, which do not represent the size of the actual product;

对于本领域技术人员来说，附图中某些公知结构及其说明可能省略是可以理解的。It will be understood by those skilled in the art that some well-known structures and their descriptions may be omitted from the drawings.

下面结合附图和实施例对本发明的技术方案做进一步的说明。The technical solutions of the present invention will be further described below with reference to the accompanying drawings and embodiments.

如图1所示，一种基于时空图信息最大化模型的路段特征表示学习算法，包括以下步骤：As shown in Figure 1, a road segment feature representation learning algorithm based on a spatiotemporal graph information maximization model includes the following steps:

步骤S2的具体过程是：The specific process of step S2 is:

s^(t)＝CNN(S^(t))。s ^(t) =CNN(S ^(t) ).

步骤S3的具体过程是：The specific process of step S3 is:

S32：通过负采样，重复S31的步骤得到损坏的路段邻接表示

S32: Through negative sampling, repeat the steps of S31 to obtain the adjacent representation of the damaged road segment

步骤S4的具体过程是：The specific process of step S4 is:

H^(t)＝ε(R+A^(t))H ^(t) = ε(R+A ^(t) )

如图2所示，本发明的核心目的是路段表示在交通预测上的作用。那么首先要研究路段表示对旅行时间预测的影响，并确定数据集，我们采用的是滴滴出行“盖亚”数据开发计划提供的城市出行数据集——成都以及西安数据集，发布在https://gaia.didichuxing.com，以及哈尔滨数据集，由DeepGTT发布。表1为三组数据集的路段数量以及轨迹数量。As shown in Figure 2, the core purpose of the present invention is the role of road segment representation in traffic prediction. Then we must first study the impact of road segment representation on travel time prediction, and determine the data set. We use the urban travel data set provided by Didi Chuxing's "Gaia" data development plan - the Chengdu and Xi'an data sets, published at https: //gaia.didichuxing.com, and the Harbin dataset, published by DeepGTT. Table 1 shows the number of road segments and trajectories of the three datasets.

然后要确定旅行时间预测的评判标准，这里采用在该领域常用的RMSE和MAE来表示模型的预测效果。即当我们对路径的预测时间以及真实时间进行差值评判。Then we need to determine the evaluation criteria for travel time prediction. Here, RMSE and MAE, which are commonly used in this field, are used to represent the prediction effect of the model. That is, when we evaluate the difference between the predicted time of the path and the real time.

根据评判标准，我们将三组数据集都分为训练集、验证集和测试集，其中对应滴滴数据集，训练集为前17天的轨迹，最后10天数据作为测试集，其余数据作为验证集；哈尔滨数据集则训练集为前3天的轨迹，最后1天数据作为测试集，其余数据作为验证集。According to the judging criteria, we divided the three sets of data sets into training set, validation set and test set, which corresponds to the Didi data set, the training set is the trajectory of the first 17 days, the last 10 days of data is used as the test set, and the rest of the data is used as the verification set For the Harbin data set, the training set is the trajectory of the first 3 days, the data of the last day is used as the test set, and the rest of the data is used as the validation set.

表1、数据集的维度信息与交互信息Table 1. Dimensional information and interaction information of the dataset

在本专利之前，常用的路段表示学习的方法都是基于非朴素的网络表示学习算法，该算法虽然对路网的路段静态属性进行了学习，但没有考虑路段的时间因素以及路网的交通因素，对旅行时间预测的精度影响还是比较明显的。所有我们提出的算法就采用的一种基于卷积神经网络的网络表示学习算法以及采用最大化互信息来考虑路段、路网全局以及交通状况之间的相互作用。Before this patent, the commonly used methods of road segment representation learning are all based on non-naive network representation learning algorithms. Although this algorithm learns the static attributes of road segments, it does not consider the time factors of road segments and the traffic factors of road networks. , the impact on the accuracy of travel time prediction is still relatively obvious. All our proposed algorithms use a convolutional neural network-based network representation learning algorithm and maximize mutual information to consider the interactions between road segments, road network globals, and traffic conditions.

为了和以前方法做对比，我们同样计算了这些方法在三个数据集上的RMSE和MAE的表现，训练集、验证集和测试集的分割方式同样和我们的方法保持一致。To compare with previous methods, we also calculated the RMSE and MAE performance of these methods on three datasets, and the splits of training, validation, and test sets are also consistent with our method.

另外，为了测试模型每个部分对于模型的作用，我们进行了消融实验，分别产生了以下三个模型变体：1)ST-DGI/S：去掉路段的静态信息，即将路段看作无属性的节点，对路段表示进行随机初始化；2)ST-DGI/T：忽视路段的时间因素；3)ST-DGI/G：忽视路网的交通因素，只使用路网图的归纳表示。In addition, in order to test the effect of each part of the model on the model, we conducted ablation experiments and produced the following three model variants: 1) ST-DGI/S: Remove the static information of the road segment, that is, treat the road segment as attributeless 2) ST-DGI/T: ignore the time factor of the road segment; 3) ST-DGI/G: ignore the traffic factor of the road network and only use the inductive representation of the road network map.

表2、多种模型在两组数据集上的表现Table 2. The performance of various models on the two datasets

从结果可以看出我们的发明相较于以前方法有比较明显的提升，这从很大程度是因为本发明从路段的时空属性出发，通过最大化互信息机制，最大程度地提取局部路段与全局路网的交互信息，从而学习到路段与交通情况的相互关系以及路段间动态的依赖关系。我们通过消融实验也可以确认到三个因素对于模型效果的重要性。基于准确的动态路段表示，我们才得以提高旅行时间预测的准确性。It can be seen from the results that our invention has obvious improvement compared with the previous methods, which is largely because the present invention starts from the spatiotemporal attributes of the road segments, and maximizes the extraction of local road segments and global segments by maximizing the mutual information mechanism. The interactive information of the road network is used to learn the relationship between road segments and traffic conditions, as well as the dynamic dependencies between road segments. We can also confirm the importance of three factors to the model effect through ablation experiments. Based on an accurate dynamic segment representation, we were able to improve the accuracy of travel time predictions.

相同或相似的标号对应相同或相似的部件；The same or similar reference numbers correspond to the same or similar parts;

附图中描述位置关系的用于仅用于示例性说明，不能理解为对本专利的限制；The positional relationship described in the accompanying drawings is only for exemplary illustration, and should not be construed as a limitation on this patent;

显然，本发明的上述实施例仅仅是为清楚地说明本发明所作的举例，而并非是对本发明的实施方式的限定。对于所属领域的普通技术人员来说，在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。凡在本发明的精神和原则之内所作的任何修改、等同替换和改进等，均应包含在本发明权利要求的保护范围之内。Obviously, the above-mentioned embodiments of the present invention are only examples for clearly illustrating the present invention, rather than limiting the embodiments of the present invention. For those of ordinary skill in the art, changes or modifications in other different forms can also be made on the basis of the above description. There is no need and cannot be exhaustive of all implementations here. Any modification, equivalent replacement and improvement made within the spirit and principle of the present invention shall be included within the protection scope of the claims of the present invention.

Claims

1. a road segment feature representation learning algorithm based on a spatiotemporal map information maximization model, is characterized in that, comprises the following steps:

S1: Extract the attributes of the road segment from the road network, generate the initial vector of the road segment, and construct a temporal adjacency matrix based on the trajectory in the historical data;

S2: Use CNN and max-pooling operations on traffic conditions to extract the corresponding traffic state/flow representation;

S3: Input the data of S1, S2 and S3 into the encoder for training to obtain the real-time road segment representation;

S4: Take the obtained road segment representation as the target, and obtain the dynamic representation of the road segment through the fully connected layer.

2. the road section feature representation learning algorithm based on the spatiotemporal map information maximization model according to claim 1, is characterized in that, the concrete process of described step S1 is:

S11: Perform data preprocessing, obtain the static attributes of each road section through the road network, and use the three attributes of road section type, number of lanes, and whether it is a one-way road here;

S12: generate the corresponding one-hot vector for the three attributes, and obtain the initial vector R={r ₁ ,r ₂ ,...,r _N } through the fully connected layer after splicing;

S13: Divide the road segment trajectories of the historical data into time segments, and obtain the time adjacency matrix A ^(t) according to the trajectories of different time periods, that is, if in a certain time period, some road segments obtained from the historical trajectories have been driven multiple times and If there is an upstream-downstream relationship, the corresponding road segment has an adjacency relationship, instead of simply determining the adjacency relationship from the topological relationship.

3. the road section feature representation learning algorithm based on the spatiotemporal graph information maximization model according to claim 2, is characterized in that, the concrete process of described step S2 is:

S21: divide the corresponding city into a grid, and calculate the congestion situation and traffic flow of the corresponding grid;

S22: Input grid data into CNN to obtain the representation of traffic state and traffic flow;

Based on CNN, a representation that can display real-time traffic conditions is learned from grid data; the same method is used to obtain a representation of grid inflow and outflow; the specific calculation formula is as follows:

S ^(t) =CNN(S ^(t) ).

4. the road section feature representation learning algorithm based on the spatiotemporal graph information maximization model according to claim 3, is characterized in that, the concrete process of described step S3 is:

S31: Using a graph convolutional neural network, obtain the adjacency representation h ^(t) of the road segment from the initial vector of the road segment and the time adjacency matrix;

S33: Use the readout function to summarize the adjacency representation of the road segment to obtain the global representation g ^(t) of the graph;

S34: Splicing the global representation, traffic state, inflow and outflow of the graph to obtain a real-time high-order induction of the graph

S35: Use the obtained adjacency representation, negative sampling adjacency representation, and graph high-order induction to train the model using gradient descent maximization according to the following objective function, and the adjacency representation of the road segment obtained after the training is stabilized is the final road segment representation, and the function formula is as follows :

5. The road segment feature representation learning algorithm based on the spatiotemporal graph information maximization model according to claim 4, characterized in that, in fact, it is based on the Jensen-Shannon divergence between the positive samples and the negative samples, that is, the J-S divergence, which maximizes the The interaction information between the road segment representation and the global representation, then the obtained adjacency representation tends to retain the interaction information represented by the global graph, and finds and preserves local-level similarity such as long-distance road segments with similar structural features.

6. The road segment feature representation learning algorithm based on the spatiotemporal map information maximization model according to claim 5, is characterized in that, the concrete process of described step S4 is:

S41: Considering that the road segment representation has a time period regularity, map the static representation of the road segment and the time of one-hot encoding to a low-dimensional representation through a fully connected layer to obtain a dynamic road segment representation;

S42: Obtain the road segment representation using the stabilized encoder after training;

S43: Use the L loss to minimize the difference between the two, and optimize the parameters of the fully connected layer. After the training is stable, the dynamic representation of the road segment can be obtained based on the fully connected layer;

S44: In fact, according to the periodicity of the state of the road segment itself, and considering the problem of data sparse, the dynamic representation of the road segment is compressed. The main formula is as follows:

H ^(t) = ε(R+A ^(t) )