WO2023029234A1 - Method for bus arrival time prediction when lacking data - Google Patents

Method for bus arrival time prediction when lacking data Download PDF

Info

Publication number
WO2023029234A1
WO2023029234A1 PCT/CN2021/132828 CN2021132828W WO2023029234A1 WO 2023029234 A1 WO2023029234 A1 WO 2023029234A1 CN 2021132828 W CN2021132828 W CN 2021132828W WO 2023029234 A1 WO2023029234 A1 WO 2023029234A1
Authority
WO
WIPO (PCT)
Prior art keywords
bus
information
node
graph
attention
Prior art date
Application number
PCT/CN2021/132828
Other languages
French (fr)
Chinese (zh)
Inventor
马佳曼
罗喜伶
蒋淑园
金晨
Original Assignee
北京航空航天大学杭州创新研究院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京航空航天大学杭州创新研究院 filed Critical 北京航空航天大学杭州创新研究院
Publication of WO2023029234A1 publication Critical patent/WO2023029234A1/en

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications

Definitions

  • the invention relates to the field of bus arrival time prediction, in particular to a bus arrival time prediction method for missing data, which is used to provide accurate data for bus lines with uncertain departure times, problems with GPS equipment, etc. arrival time forecast.
  • the bus network is very important to the rapid development of urban transportation. Due to the characteristics of economy and environmental protection, buses are still the main solution for urban travel. The main factors hindering passengers from choosing public transport are their long waiting time and uncertainty of journey time, so accurate prediction of bus arrival time is very important to solve this problem. It can also help reduce traffic congestion and be used in other comprehensive intelligent transportation applications, such as trip planning.
  • the present invention provides a method for predicting the arrival time of buses with missing data, which can accurately and effectively learn and predict each route in the public transportation network
  • the travel time can not only improve the accuracy of arrival time prediction for bus lines with few current travel records, but also provide estimated travel time for bus lines in the design stage without historical records.
  • the invention provides a method for predicting the arrival time of buses facing missing data, which comprises the following steps:
  • the multi-head spatio-temporal graph attention network prediction model includes sequential A connected spatial attention module and a temporal attention module;
  • the spatial attention module is an attention module of a multi-head graph attention network with a mask, which applies a graph attention network to learn the spatial dependencies between different nodes, and
  • a multi-head map attention block with a mask is used to learn the global and local spatial dependencies between bus lines in different situations;
  • the temporal attention module includes an LSTM layer and a transformer layer for local time-dependent learning and global time-dependent learning;
  • using the obtained important geographic locations to represent the geographic structure of each route is specifically: using weights to represent the positions of intersections and stations, so as to represent the geographic structure of each bus route.
  • the node extraction is specifically: select a geographic location between every two adjacent stations of each bus route, and use the two station information and the selected geographic location information set between them as node s information to represent the road segment between two adjacent stations.
  • selecting a geographic location between each two adjacent stations to constitute node information is specifically: selecting the intersection with the farthest distance from the two adjacent stations and the largest flow of people in the bus route, and The intersection information and the geographic location information of two adjacent stations constitute node information.
  • edge extraction is specifically:
  • the construction method of the adjacency matrix A is specifically as follows: establishing an edge graph of geographical structure similarity, extracting the position information and node length information of the nodes according to the three important geographic location information contained in the extracted nodes, and using the DTW algorithm Do the similarity comparison, establish the geographical structure similarity adjacency matrix A g between the nodes; then, extract the urban functional category of each node according to the building category information contained in the data of the urban interest points near the three geographical locations in each node , according to the similarity of urban functions, the adjacency matrix A f of the urban functional area division relationship between nodes is established; finally, according to the distance relationship between bus routes, a third geographical distance adjacency matrix A d is designed; the edge in the adjacency matrix
  • the weights of are normalized and range between 0 and 1.
  • step 3) is specifically: for the bus lines with lack of data, use the multi-head space-time graph attention network prediction model and according to the similarity of the bus operation mode, learn the complete historical data in the first h time periods bus operation patterns; and then predict bus arrival times for bus lines with lack of data.
  • the present invention utilizes a density-based clustering method to locate important travel locations (such as stations, intersections) in the route according to the geographical structure of each public transport route and the space-time dependency between the routes ; Construct a weighted public transport network graph with traffic importance based on the mined important travel locations.
  • a multi-attention graph neural network is provided to learn the correlation between bus routes from both spatial and temporal perspectives.
  • an attention module multi-head GAT
  • a multi-head masked graph attention network is established, which can learn global and local routes in three views of city function, route distance, and bus structure similarity The importance of learning and effectively combining multiple influencing factors to combine learning travel modes.
  • LSTM long short-term memory
  • transformer layers to learn the bus operation mode of the far and recent time.
  • LSTM long short-term memory
  • transformer layers to learn the bus operation mode of the far and recent time.
  • LSTM long short-term memory
  • the invention can not only improve the prediction accuracy rate of the arrival time of the bus lines with few current travel records, but also provide estimated travel time between stations for the lines in the design stage without historical records.
  • Figure 1 is a flow chart of the bus arrival time prediction method with missing data
  • Fig. 2 is the schematic diagram of the pseudo-code of the density-based clustering method in the embodiment of the present invention.
  • Fig. 3 is a schematic diagram of a pseudocode of a method for selecting a geographic location between sites in an embodiment of the present invention.
  • the flow chart of the method of the present invention is shown in Fig. 1, and it is made up of two main parts, respectively is the construction of public transport network map and the construction based on multi-head attention map network model.
  • the GPS information of the historical bus running track, the location information of public transportation stations and the data of urban points of interest (POI) are integrated, and then the present invention proposes a density-based clustering method to automatically discover important geographic locations (including stations and Intersections), and use the excavated geographic location to represent the geographical structure of each route; according to the similarity of geographical structure (such as the distance between bus stops, the number of intersections), the distance between bus routes, and the division of urban functional areas , to construct three kinds of public transportation network graphs.
  • the present invention establishes a multi-head space-time graph attention network prediction model, including spatial and temporal attention modules to predict arrival time.
  • the spatial attention module the present invention designs a multi-head graph attention block with a mask to learn the global and local spatial dependencies between bus lines in different situations.
  • the multi-head mechanism performs ensemble learning on the travel times of adjacent bus line segments with complete historical travel records.
  • the designed masking mechanism to mask the weighted adjacency matrix composed of different neighbor bus line segments, the attention blocks can be focused on the selected neighbors and the computation time is reduced. For example, when a target route has few unstable histories, its own input history travel time is necessarily ignored.
  • the present invention constructs a time attention through an LSTM layer and a transformer layer module to obtain global (long distance) and local (nearest) temporal patterns for travel time prediction.
  • a time attention through an LSTM layer and a transformer layer module to obtain global (long distance) and local (nearest) temporal patterns for travel time prediction.
  • spatio-temporal attention it is possible to learn the bus operation mode (X th ,...,X t ) with complete historical data in the first h time period; according to the similarity of the operation mode, the arrival of the bus with the lack of historical data Time X t+1 forecast.
  • the present invention constructs a transportation network G from a new graphical view, and proposes a density-based clustering method (DBSCAN) to automatically discover important geographic locations ( Stations and intersections), and use the obtained geographic location to represent the geographic structure (Geo-structure) of each route. Then, based on the geographical structure of the discovered bus routes, the present invention constructs a public transport network G, where a node S in G represents a travel segment between every two adjacent stops.
  • DBSCAN density-based clustering method
  • the present invention utilizes a density-based clustering algorithm (DBSCAN) to extract the geographic structure of public transportation, including the exact location and density of intersections and stops.
  • DBSCAN density-based clustering algorithm
  • the process does not directly use the marked station information, but extracts the locations from the DBSCAN algorithm for two reasons: First, since the proposed travel segment is station-based, there are cases where two routes have the same stations but different paths . Therefore, it is inaccurate to extract the geographical structure of the route using only the location of the station. Second, the waiting time at each station and intersection is usually affected by the number of passengers and traffic lights, which can lead to large differences in travel times for different routes.
  • the present invention uses a density-based clustering method (DBSCAN) to find a dense area (Geo-structure discovery), without setting the number of clusters or a fixed shape.
  • DBSCAN density-based clustering method
  • the specific steps are shown in Figure 2 (Algorithm 1) in Figure 2, GPS point p contains longitude and latitude information (long, lat), and c is an important point in the bus network obtained after algorithmic clustering, including longitude, latitude, GPS number and station information (long, lat, num, st) .
  • the density-based clustering algorithm needs two parameters: the scanning radius eps and the minimum number of points minPts.
  • the parameters are set according to the number of GPS points with speed values around 0 in the database to find all important geographic locations along the way, including stations and intersections. . Stations and intersections are then weighted according to the number of GPS within each excavated geographic location.
  • the present invention proposes a node information representation method.
  • the method design first selects an important geographical location between every two adjacent stations of each bus route, and the two station information
  • the method for selecting the geographic location between sites is shown in Figure 3 (Algorithm 2).
  • step 3 traverses all the collections of clusters C obtained from DBSCAN.
  • steps 4-6 among the two adjacent c with sites, find the one with the farthest distance from the site and the largest flow of people c i+1 is one of the information in s
  • step 7 gathers two sites and c i+1 as the information of s.
  • the invention establishes an edge graph according to the node information to represent the relationship between road sections (nodes). Since many influencing factors will affect the travel mode of the public transport system, the present invention constructs three adjacency matrices A to represent different relationships. First, establish the edge graph of geographical structure similarity relationship, extract the location information of the road section (node) and the length information of the road section according to the three important geographical location information contained in the extracted nodes, use the DTW algorithm to compare the similarity, and establish the geographical relationship between nodes Structurally similar adjacency matrix A g ; then, according to the building category information contained in the data of urban interest points near the three positions in each node (set as the collection of building urban function category information within a distance of 100 meters in the radius of latitude and longitude of the node), extract The urban function category of each road section (node), according to the similarity of urban functions, establishes the adjacency matrix A f of the urban functional area division relationship between the nodes; finally, due to
  • the present invention proposes a multi-head spatio-temporal graph attention network prediction model for predicting the bus travel time of the whole city with limited data.
  • This model can effectively learn and predict the travel time of various bus lines in the city, especially for suburban lines and routes without any historical bus travel records. It can help update existing public transport systems, adjust outdated timetables for routes in developing regions, and help select new routes and design new ones, providing travel times for each route under normal and irregular traffic conditions.
  • This model contains two modules, the spatial attention module and the temporal attention module.
  • the present invention applies a graph attention network (GAT) to learn the spatial dependencies between different travel segments.
  • GAT graph attention network
  • GCN graph convolutional network
  • the graph attention layer is the fundamental part of GAT, which can learn the correlation between each node and update the hidden features of each pair of nodes. Node features are denoted as h t i in time interval t.
  • h t i is the input travel time record and embedded temporal information for segment s i .
  • the attention coefficient e t ij of s i and s j can be expressed as:
  • W is the learnable parameter of layer l
  • a(.) is the function to calculate the correlation.
  • the present invention utilizes the LeakyReLU active function to train the feed-forward neural network. For each layer, the output is normalized to [0,1] by the softmax function:
  • the present invention extends spatial attention to a masked multi-head attention mechanism, which has a learnable
  • the K independent attention heads of L are concatenated to achieve the final spatial attention result:
  • the present invention adds a mask attention mechanism to the adjacency matrix, which is used to pay attention to the bus lines with complete historical operation data and learn the operation mode.
  • the representation of mask m at layer l is:
  • X is the input data of the bus running time
  • X' is the output after adding the attention mechanism
  • X l ' is the output of the l layer after the mask mechanism is added
  • the present invention connects a temporal attention module. Since the travel time of each bus trip is greatly affected by real-time traffic conditions. For example, when traffic conditions are normal, the travel time of the target route's past long-distance history for the same time period in the current time period may be highly similar. However, when traffic congestion occurs, the travel pattern may be erratic, but still may have a similar pattern to the most recent time period. Therefore, for different traffic situations, both global (far time) and local (nearest) time travel patterns need to be considered.
  • a recurrent neural network is a type of artificial neural network that is particularly well suited for capturing temporal dependencies in sequence learning.
  • RNN recurrent neural network
  • LSTM Long Short-Term Memory
  • Wi ix Wi h
  • bi learnable parameter matrix and bias vector of the recurrent layer, ⁇ standard sigmoid function.
  • the input gate it of LSTM can be expressed as:
  • the present invention introduces a transformer layer in the temporal attention module.
  • a transformer layer for a single-head attention layer, there are usually three types of vector Q, query K, key, and value V for each node in the public transport network graph.
  • the hidden subspace learning process can be expressed as:
  • W Q , W K , W V are the parameters of the science department, and the output Attention of the global time attention is calculated according to the scaled dot product attention, where d K is the scaling factor, expressed as:
  • the present invention uses a linear layer for prediction.
  • is a learnable parameter in the model.
  • the present invention uses bus trajectory and POI information, and these data are all obtained from the traffic department of a certain city.
  • Bus trajectories include location, timestamp, speed, and bus ID information.
  • the average sampling frequency is 30 seconds per point, with a daily volume of about 300,000 points generated by 278 individual lines.
  • the POI dataset consists of building locations and categories (social functions).
  • three lines were selected as target lines to evaluate the performance and robustness of the method of the present invention.
  • the three test bus lines selected by the present invention are located in different areas of the city, including developed central areas, remote areas, and paths connecting the center and the suburbs. The results are then averaged.
  • the present invention also deletes all their histories and treats them as three routes under design (without any historical travel time records) to test the travel time estimation performance of new routes (site locations and routes are designed), This facilitates the evaluation of the proposed model, here named MAGTTE for the inventive method.
  • Travel time is predicted by calculating the historical average travel time of bus lines in each time period (15 minutes).
  • ⁇ E-knn This is a proposed model based on the weighted enhanced k-NN method, which uses the k records most similar to the current traffic condition to identify the traffic condition and predict the travel time.
  • the similarity of the travel mode is set to be above 90/%, and the target road segment is k neighbors.
  • RnnTTE The model is based on an LSTM neural network and contains a fully connected LSTM layer with 128 hidden units.
  • DeepTTE This is a model that combines geo-conv layers and lstm layers to predict travel time.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Strategic Management (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Human Resources & Organizations (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Chemical & Material Sciences (AREA)
  • Analytical Chemistry (AREA)
  • Educational Administration (AREA)
  • Primary Health Care (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Traffic Control Systems (AREA)

Abstract

A method for bus arrival time prediction when lacking data, comprising: finding important geographic locations by using a density-based clustering method; using the found geographic locations to represent a geographic structure of each route to form node information; according to geographic structure similarity, constructing weighted edge information provided with traffic importance so as to form a public transportation network graph; and according to the public transportation network graph, establishing multi-head spatial and temporal graph attention network prediction models to learn the correlation between bus routes from the perspective of space and time, wherein a spatial attention module is a multi-head graph attention module provided with a mask, and a temporal attention module is built by an LSTM layer and a transformer layer. The described method may effectively learn and predict the travel time for each route in a public transportation network, which may not only improve the accuracy of arrival time prediction for bus routes that have few current travel records, but also provide the estimated travel time for bus routes which are in the design stage and do not have historical records.

Description

一种面向有缺失数据的公交车到站时间预测方法A bus arrival time prediction method for missing data 技术领域technical field
本发明涉及公交车到站时间预测领域,具体涉及一种面向有缺失数据的公交车到站时间预测方法,用于对出发时间不确定、GPS设备有问题等有数据缺失情况的公交线路提供准确的到站时间预测。The invention relates to the field of bus arrival time prediction, in particular to a bus arrival time prediction method for missing data, which is used to provide accurate data for bus lines with uncertain departure times, problems with GPS equipment, etc. arrival time forecast.
背景技术Background technique
公交车网络对快速发展的城市交通至关重要,由于经济,环保等特点,目前公交车仍是城市出行的主要方案。阻碍乘客选择公交的主要因素是其漫长的等待时间和旅程时间的不确定性,因此准确预测公交车的到站时间对于解决这个问题非常重要。它还可以帮助减少交通拥堵,并被用于其他的综合智能交通应用,如行程规划。The bus network is very important to the rapid development of urban transportation. Due to the characteristics of economy and environmental protection, buses are still the main solution for urban travel. The main factors hindering passengers from choosing public transport are their long waiting time and uncertainty of journey time, so accurate prediction of bus arrival time is very important to solve this problem. It can also help reduce traffic congestion and be used in other comprehensive intelligent transportation applications, such as trip planning.
现有的公交车到达或旅行时间预测方法,依赖大量的历史旅行记录和公共交通路线的广泛覆盖。因此,现有方法无法或难以准确预测有数据缺失的公交车到站时间,其原因主要包括:Existing bus arrival or travel time prediction methods rely on a large amount of historical travel records and extensive coverage of public transport routes. Therefore, existing methods cannot or are difficult to accurately predict the arrival time of buses with missing data. The main reasons include:
1)数据稀疏性:对于预测郊区(记录稀少)和正在考虑和设计(没有记录)的交通路线的旅行时间预测,主要问题是它们有大量的缺失数据,无法从历史记录中学习到准确的运行模式,从而无法预测到站时间;1) Data sparsity: The main problem with travel time predictions for predicting suburban (sparse records) and traffic routes under consideration and design (no records) is that they have a lot of missing data and cannot learn accurate runs from historical records mode, so that the arrival time cannot be predicted;
2)独立的旅行模式:从空间角度看,为扩大服务覆盖面积,城市公交网络中的公交线路不会有过多重复和重叠,导致公交线路之间难以互相学习运行模式。即使部分站点共享同一道路的部分,由于需求不同,乘客数量的不同,停车的顺序和位置不同,交通网络中的路线具有相对不同和独立的旅行模式;2) Independent travel mode: From the perspective of space, in order to expand the service coverage area, the bus lines in the urban bus network will not have too much repetition and overlap, making it difficult for bus lines to learn from each other. Even if some stations share parts of the same road, routes in the transportation network have relatively different and independent travel patterns due to different needs, different numbers of passengers, and different parking sequences and locations;
3)复杂的交通状况:公共交通线路的旅行模式比私人车辆更复杂,因为它们也有独特的交通信息。除了受到道路长度、方向和交互数字的影响外,公共交通线路还受到出发时间安排、数量和停靠地点的影响。3) Complex traffic conditions: The travel patterns of public transport lines are more complex than those of private vehicles, since they also have unique traffic information. In addition to being influenced by road lengths, directions and interaction numbers, public transport routes are also influenced by departure timing, numbers and places of stops.
因此,对于具有历史数据缺失的公交车到站时间预测方法的提出是必要且重要的。Therefore, it is necessary and important to propose a bus arrival time prediction method with missing historical data.
发明内容Contents of the invention
为了解决对有数据缺失问题的公交车到达时间预测的问题,本发明提供一种面向有缺失数据的公交车到站时间预测方法,该方法可以准确有效地学习和预测公共交通网络中每条路径的旅行时间,不仅能提高当前旅行记录少的公交线路的到达时间预测准确率,还可以对没有历史记录的处于设计阶段的公交线路提供预计旅行时间。In order to solve the problem of predicting the arrival time of buses with missing data, the present invention provides a method for predicting the arrival time of buses with missing data, which can accurately and effectively learn and predict each route in the public transportation network The travel time can not only improve the accuracy of arrival time prediction for bus lines with few current travel records, but also provide estimated travel time for bus lines in the design stage without historical records.
本发明的技术方案如下:Technical scheme of the present invention is as follows:
本发明提供了一种面向有缺失数据的公交车到站时间预测方法,其包括如下步骤:The invention provides a method for predicting the arrival time of buses facing missing data, which comprises the following steps:
1)将历史公交车运行轨迹GPS信息、公共交通站点位置信息与城市兴趣点数据(数据信息包含建筑经纬度和建筑城市功能分类信息)进行数据整合;通过基于密度的聚类方法从整合后的数据中提取公交网络中重要的地理位置,利用得到的地理位置来表示每条公交路线的地理结构,进行公共交通网络图的节点抽取和边抽取,构建公共交通网络图;1) Integrate historical bus track GPS information, public transport site location information and urban interest point data (data information includes building longitude and latitude and building urban function classification information); through the density-based clustering method from the integrated data Extract important geographic locations in the bus network, use the obtained geographic locations to represent the geographic structure of each bus route, and perform node extraction and edge extraction on the public transportation network graph to construct a public transportation network graph;
2)根据构建的公共交通网络图,建立多头时空图注意力网络预测模型从空间和时间的角度学习公交路线之间的相关性;其中,所述的多头时空图注意力网络预测模型包括顺次连接的空间注意力模块和时间注意力模块;所述空间注意力模块为多头具有掩码的图注意力网络的注意力模块,其应用图注意网络来学习不同节点之间的空间依赖关系,并利用带有掩码的多头图注意块学习不同情况下公交线路之间的全局和局部空间依赖关系;所述时间注意力模块包括一个LSTM层和一个transformer层,分别用于进行局部时间依赖学习和全局时间依赖学习;2) According to the public transportation network diagram of construction, set up the multi-head spatio-temporal graph attention network prediction model from the perspective of space and time to learn the correlation between bus routes; wherein, the multi-head spatio-temporal graph attention network prediction model includes sequential A connected spatial attention module and a temporal attention module; the spatial attention module is an attention module of a multi-head graph attention network with a mask, which applies a graph attention network to learn the spatial dependencies between different nodes, and A multi-head map attention block with a mask is used to learn the global and local spatial dependencies between bus lines in different situations; the temporal attention module includes an LSTM layer and a transformer layer for local time-dependent learning and global time-dependent learning;
3)利用多头时空图注意力网络预测模型,对具有缺失数据的公交车到站时间进行预测。3) Using a multi-head spatio-temporal graph attention network prediction model to predict bus arrival times with missing data.
进一步的,所述的通过基于密度的聚类方法从整合后的数据中提取公交网络中重要的地理位置,具体为:Further, the described extraction of important geographic locations in the bus network from the integrated data through the density-based clustering method is specifically:
根据整合后的数据中速度为0的GPS点的数量来设置基于密度的聚类方法的参数,通过基于密度的聚类方法得到公交网络中重要的地理位置,并根据每个地理位置包含的GPS点的数量来确定地理位置的权重。Set the parameters of the density-based clustering method according to the number of GPS points whose speed is 0 in the integrated data, and obtain the important geographic locations in the bus network through the density-based clustering method, and according to the GPS points contained in each geographic location The number of points to determine the weight of the geographic location.
进一步的,所述的利用得到的重要的地理位置来表示每条路线的地理结构,具体为:用权重来表现交叉口和站点的位置,以代表每条公交路线的地理结构。Further, using the obtained important geographic locations to represent the geographic structure of each route is specifically: using weights to represent the positions of intersections and stations, so as to represent the geographic structure of each bus route.
进一步的,所述的节点抽取具体为:在每条公交路线的每两个相邻站点之间选择一个地理位置,将两个站点信息和其之间的被选择的地理位置信息集合作为节点s的信息,以代表两个相邻站点之间的路段。Further, the node extraction is specifically: select a geographic location between every two adjacent stations of each bus route, and use the two station information and the selected geographic location information set between them as node s information to represent the road segment between two adjacent stations.
进一步的,所述的每两个相邻站点之间选择一个地理位置构成节点信息,具体为:选择公交路线中,与两个相邻站点位置距离最远,且人流量最大的交叉口,将该交叉口信息与两个相邻站点的地理位置信息一起构成节点信息。Further, selecting a geographic location between each two adjacent stations to constitute node information is specifically: selecting the intersection with the farthest distance from the two adjacent stations and the largest flow of people in the bus route, and The intersection information and the geographic location information of two adjacent stations constitute node information.
进一步的,所述的边抽取具体为:Further, the edge extraction is specifically:
建立边图来代表路段之间的关系,其中,边上编码的权重是空间相关性强度或相似性;构建三个分别表示地理结构相似度关系、公交路线之间的距离关系、城市功能区域划分关系的邻接矩阵A,得到三种表示不同关系的公共交通网络图。Create an edge graph to represent the relationship between road sections, where the weight of the edge code is the spatial correlation strength or similarity; construct three graphs to represent the similarity relationship of geographical structure, the distance relationship between bus routes, and the division of urban functional areas. The adjacency matrix A of the relationship is used to obtain three kinds of public transportation network graphs representing different relationships.
优选的,所述邻接矩阵A的构建方法,具体为:建立地理结构相似关系边图,根据提取的节点包含的三个重要地理位置信息,提取出节点的位置信息、节点长度信息,利用DTW算法做相似度比较,建立节点之间的地理结构相似邻接矩阵A g;然后,根据每个节点中三个地理位置附近的城市兴趣点数据中包含的建筑类别信息,提取每个节点的城市功能类别,根据城市功能的相似度,建立节点之间的城市功能区域划分关系邻接矩阵A f;最后,根据公交路线之间的距离关系,设计了第三种地理距离邻接矩阵A d;邻接矩阵中边的权重经过归一化处理,范围在0到1之间。 Preferably, the construction method of the adjacency matrix A is specifically as follows: establishing an edge graph of geographical structure similarity, extracting the position information and node length information of the nodes according to the three important geographic location information contained in the extracted nodes, and using the DTW algorithm Do the similarity comparison, establish the geographical structure similarity adjacency matrix A g between the nodes; then, extract the urban functional category of each node according to the building category information contained in the data of the urban interest points near the three geographical locations in each node , according to the similarity of urban functions, the adjacency matrix A f of the urban functional area division relationship between nodes is established; finally, according to the distance relationship between bus routes, a third geographical distance adjacency matrix A d is designed; the edge in the adjacency matrix The weights of are normalized and range between 0 and 1.
进一步的,所述的步骤3)具体为:对具有缺乏数据的公交线路,利用多头时空图注意力网络预测模型并根据公交车运行模式的相似度,学习前h个时间段内具有完整历史数据的公交车运行模式;继而预测具有缺乏数据的公交线路的公交车到站时间。Further, the step 3) is specifically: for the bus lines with lack of data, use the multi-head space-time graph attention network prediction model and according to the similarity of the bus operation mode, learn the complete historical data in the first h time periods bus operation patterns; and then predict bus arrival times for bus lines with lack of data.
与现有技术相比,本发明根据每条公共交通线路的地理结构与线路之间的空间-时间依赖关系,利用基于密度聚类方法来定位路线中的重要旅行地点(如车站、交叉口);根据挖掘的重要旅行地点,构建一个具有交通重要性的有权重的公共交通网络图。基于构建的交通网络图,提供了一个多注意图神经网络,从空间和时间的角度学习公交路线之间的相关性。在空间注意力模块中,建立了多头 具有掩码的图注意力网络的注意力模块(multi-head GAT),可以学习全局和局部路线在城市功能,路线距离,公交结构相似度三种视图上的重要性,并有效地结合多种影响因素来组合学习的旅行模式。在时间注意力模块中,提出用长短期记忆(LSTM)和transformer层来学习较远和最近时间的公交运行模式。结合空间和时间注意力模块,准确推断和学习具有稀疏和无历史数据的公交路线的旅行(到达)时间。本发明不仅能提高当前旅行记录少的公交线路的到达时间的预测准确率,还可以对没有历史记录的处于设计阶段的线路提供预计的站与站之间的旅行时间。Compared with the prior art, the present invention utilizes a density-based clustering method to locate important travel locations (such as stations, intersections) in the route according to the geographical structure of each public transport route and the space-time dependency between the routes ; Construct a weighted public transport network graph with traffic importance based on the mined important travel locations. Based on the constructed traffic network graph, a multi-attention graph neural network is provided to learn the correlation between bus routes from both spatial and temporal perspectives. In the spatial attention module, an attention module (multi-head GAT) with a multi-head masked graph attention network is established, which can learn global and local routes in three views of city function, route distance, and bus structure similarity The importance of learning and effectively combining multiple influencing factors to combine learning travel modes. In the temporal attention module, it is proposed to use long short-term memory (LSTM) and transformer layers to learn the bus operation mode of the far and recent time. Combine spatial and temporal attention modules to accurately infer and learn travel (arrival) times for bus routes with sparse and history-free data. The invention can not only improve the prediction accuracy rate of the arrival time of the bus lines with few current travel records, but also provide estimated travel time between stations for the lines in the design stage without historical records.
附图说明Description of drawings
图1为面向有缺失数据的公交车到站时间预测方法的流程图;Figure 1 is a flow chart of the bus arrival time prediction method with missing data;
图2为本发明实施例中基于密度的聚类方法的伪代码示意图;Fig. 2 is the schematic diagram of the pseudo-code of the density-based clustering method in the embodiment of the present invention;
图3为本发明实施例中选择站点之间的地理位置方法的伪代码示意图。Fig. 3 is a schematic diagram of a pseudocode of a method for selecting a geographic location between sites in an embodiment of the present invention.
具体实施方式Detailed ways
下面结合具体实施方式对本发明做进一步阐述和说明。The present invention will be further elaborated and described below in combination with specific embodiments.
本发明方法的流程图如图1所示,其由两个主要部分组成,分别是公共交通网络图构建和基于多头注意力图网络模型的构建。首先,将历史公交车运行轨迹GPS信息、公共交通站点位置信息与城市兴趣点数据(POI)数据整合,然后本发明提出了一种基于密度的聚类方法来自动发现重要地理位置(包括站点和交叉口),并利用挖掘的地理位置来表示每条路线的地理结构;根据地理结构相似度(如公交站点之间的距离,十字路口数量)、公交路线之间的距离,和城市功能区域划分,构建三种公共交通网络图。根据构建的公共交通网络图,本发明建立多头时空图注意力网络预测模型,包括空间和时间注意力模块来预测到站时间。在空间注意力模块中,本发明设计一个带有掩码的多头图注意块,以学习不同情况下公交线路之间的全局和局部空间依赖关系。多头机制对拥有完整历史旅行记录的相邻公交线路段的旅行时间进行了集合学习。通过设计的掩码机制,对不同邻居公交线路段组成的加权相邻矩阵进行屏蔽,注意力区块可以集中在选定的邻 居上,并减少计算时间。例如,当目标路线的不稳定历史记录很少时,其自身的输入历史旅行时间就有必要被忽略。在时间注意力模块中,为了准确模拟不同条件下以往历史数据的时间影响(如交通拥堵和天气条件下的正常和异常情况),本发明通过一个LSTM层和一个transformer层构建了一个时间注意力模块,以获得全局(远距离)和局部(最近)的时间模式,用于旅行时间预测。通过时空注意力,可以学习前h个时间段内具有完整历史数据的公交车运行模式(X t-h,…,X t);根据运行模式的相似度,对具有缺乏历史数据的公交车进行到站时间X t+1预测。 The flow chart of the method of the present invention is shown in Fig. 1, and it is made up of two main parts, respectively is the construction of public transport network map and the construction based on multi-head attention map network model. First of all, the GPS information of the historical bus running track, the location information of public transportation stations and the data of urban points of interest (POI) are integrated, and then the present invention proposes a density-based clustering method to automatically discover important geographic locations (including stations and Intersections), and use the excavated geographic location to represent the geographical structure of each route; according to the similarity of geographical structure (such as the distance between bus stops, the number of intersections), the distance between bus routes, and the division of urban functional areas , to construct three kinds of public transportation network graphs. According to the constructed public transportation network graph, the present invention establishes a multi-head space-time graph attention network prediction model, including spatial and temporal attention modules to predict arrival time. In the spatial attention module, the present invention designs a multi-head graph attention block with a mask to learn the global and local spatial dependencies between bus lines in different situations. The multi-head mechanism performs ensemble learning on the travel times of adjacent bus line segments with complete historical travel records. Through the designed masking mechanism to mask the weighted adjacency matrix composed of different neighbor bus line segments, the attention blocks can be focused on the selected neighbors and the computation time is reduced. For example, when a target route has few unstable histories, its own input history travel time is necessarily ignored. In the time attention module, in order to accurately simulate the time influence of past historical data under different conditions (such as normal and abnormal situations under traffic jams and weather conditions), the present invention constructs a time attention through an LSTM layer and a transformer layer module to obtain global (long distance) and local (nearest) temporal patterns for travel time prediction. Through spatio-temporal attention, it is possible to learn the bus operation mode (X th ,…,X t ) with complete historical data in the first h time period; according to the similarity of the operation mode, the arrival of the bus with the lack of historical data Time X t+1 forecast.
以下对本发明的公共交通网络图构建和多注意力图网络的预测模型的构建进行展开说明。The construction of the public transport network graph and the prediction model of the multi-attention graph network of the present invention are described below.
一、公共交通网络图的构建1. Construction of public transportation network map
由于公共交通的结构通常由一连串相连的路段或站点表示,现有的基于网格的地图分割对旅行时间的预测不起作用。于是,通过考虑公共交通的空间和时间特征,本发明从一个新的图形视图中构建了一个交通网络G,并提出了一种基于密度的聚类方法(DBSCAN)来自动发现重要的地理位置(站点和交叉口),并利用得到的地理位置来表示每条路线的地理结构(Geo-structure)。然后,基于发现的公交路线的地理结构,本发明构建了公共交通网络G,其中G中的节点S代表每两个相邻站点之间的旅行段。Since the structure of public transportation is usually represented by a succession of connected road segments or stops, existing grid-based map segmentation is ineffective for travel time prediction. Thus, by considering the spatial and temporal characteristics of public transportation, the present invention constructs a transportation network G from a new graphical view, and proposes a density-based clustering method (DBSCAN) to automatically discover important geographic locations ( Stations and intersections), and use the obtained geographic location to represent the geographic structure (Geo-structure) of each route. Then, based on the geographical structure of the discovered bus routes, the present invention constructs a public transport network G, where a node S in G represents a travel segment between every two adjacent stops.
(1)基于DBSCAN的公共交通地理结构挖掘(1) Mining the geographical structure of public transportation based on DBSCAN
首先,本发明利用基于密度的聚类算法(DBSCAN)来提取公共交通的地理结构,包括叉路口和站点的准确位置和密度。过程不直接使用标记的车站信息,而是从DBSCAN算法中提取位置,原因有二:首先,由于提出的旅行段是基于站点的,存在两条路线有相同的站点,但有不同的路径的情况。因此,只用车站的位置来提取路线的地理结构是不准确的。其次,每个车站和路口的等待时间通常受到乘客数量和交通信号灯的影响,这可能导致不同路线的旅行时间有很大的差异。所以,本发明通过基于密度的聚类方法(DBSCAN),寻找密集区域(Geo-structure discovery),不需要设定集群数量或固定形状,具体步骤如图2所示(Algorithm 1)图2中,GPS点p包含经度和纬度信息(long,lat),c是经过算法聚类后得出的公交网络中的重要点,包含经度,纬度,GPS数量和站点信 息(long,lat,num,st)。基于密度的聚类算法需要两个参数:扫描半径eps和最小点数minPts,参数根据数据库中速度值在0左右的GPS点的数量来设置,以找到沿途的所有重要地理位置,包括车站和交叉口。然后,根据每个挖掘出的地理位置范围内GPS数量来确定站点和交叉口的权重。First, the present invention utilizes a density-based clustering algorithm (DBSCAN) to extract the geographic structure of public transportation, including the exact location and density of intersections and stops. The process does not directly use the marked station information, but extracts the locations from the DBSCAN algorithm for two reasons: First, since the proposed travel segment is station-based, there are cases where two routes have the same stations but different paths . Therefore, it is inaccurate to extract the geographical structure of the route using only the location of the station. Second, the waiting time at each station and intersection is usually affected by the number of passengers and traffic lights, which can lead to large differences in travel times for different routes. Therefore, the present invention uses a density-based clustering method (DBSCAN) to find a dense area (Geo-structure discovery), without setting the number of clusters or a fixed shape. The specific steps are shown in Figure 2 (Algorithm 1) in Figure 2, GPS point p contains longitude and latitude information (long, lat), and c is an important point in the bus network obtained after algorithmic clustering, including longitude, latitude, GPS number and station information (long, lat, num, st) . The density-based clustering algorithm needs two parameters: the scanning radius eps and the minimum number of points minPts. The parameters are set according to the number of GPS points with speed values around 0 in the database to find all important geographic locations along the way, including stations and intersections. . Stations and intersections are then weighted according to the number of GPS within each excavated geographic location.
(2)公共交通网络图的节点抽取和边抽取(2) Node extraction and edge extraction of public transportation network graph
2.1)节点抽取2.1) Node extraction
在提取了公交网络的重要地理位置后,本发明提出了一种节点信息表示方法,方法设计首先在每条公交路线的每两个相邻站点之间选择一个重要地理位置,由两个站点信息和之间的重要地理位置信息的集合作为节点s的信息,以代表从一个站点到下一个站点的旅行段。选择站点之间的地理位置方法如图3(Algorithm 2)所示。图3中,步骤3遍历所有从DBSCAN中得到的聚类C的集合,步骤4-6中,在相邻的两个拥有站点的c中,找到与站点位置距离最远,且人流量最大的c i+1作为s中的信息之一,步骤7集合两个站点和c i+1作为s的信息。 After extracting the important geographical location of the bus network, the present invention proposes a node information representation method. The method design first selects an important geographical location between every two adjacent stations of each bus route, and the two station information The collection of important geographic location information between and is used as the information of node s to represent the travel segment from one station to the next. The method for selecting the geographic location between sites is shown in Figure 3 (Algorithm 2). In Figure 3, step 3 traverses all the collections of clusters C obtained from DBSCAN. In steps 4-6, among the two adjacent c with sites, find the one with the farthest distance from the site and the largest flow of people c i+1 is one of the information in s, step 7 gathers two sites and c i+1 as the information of s.
2.2)邻接矩阵(边)建立2.2) Adjacency matrix (edge) establishment
本发明根据节点的信息建立边图来代表路段(节点)之间的关系。由于许多影响因素会影响公交系统的出行模式,本发明构建了三个邻接矩阵A来表示不同的关系。首先建立地理结构相似关系边图,根据提取的节点包含的三个重要地理位置信息,提取出路段(节点)的位置信息,路段长度信息,利用DTW算法做相似度比较,建立节点之间的地理结构相似邻接矩阵A g;然后,根据每个节点中三个位置附近的城市兴趣点数据中包含的建筑类别信息(设置为节点经纬度半径范围100米距离内的建筑城市功能类别信息集合),提取每个路段(节点)的城市功能类别,根据城市功能的相似度,建立节点之间的城市功能区域划分关系邻接矩阵A f;最后,由于在一定空间地理范围内,道路上的交通情况可能比较相似。根据这一考量,本发明根据公交路线之间的距离关系,设计了第三种地理距离邻接矩阵A d。邻接矩阵中边的权重本发明设计经过归一化处理,范围在0到1之间。 The invention establishes an edge graph according to the node information to represent the relationship between road sections (nodes). Since many influencing factors will affect the travel mode of the public transport system, the present invention constructs three adjacency matrices A to represent different relationships. First, establish the edge graph of geographical structure similarity relationship, extract the location information of the road section (node) and the length information of the road section according to the three important geographical location information contained in the extracted nodes, use the DTW algorithm to compare the similarity, and establish the geographical relationship between nodes Structurally similar adjacency matrix A g ; then, according to the building category information contained in the data of urban interest points near the three positions in each node (set as the collection of building urban function category information within a distance of 100 meters in the radius of latitude and longitude of the node), extract The urban function category of each road section (node), according to the similarity of urban functions, establishes the adjacency matrix A f of the urban functional area division relationship between the nodes; finally, due to a certain spatial geographical range, the traffic conditions on the road may be compared resemblance. According to this consideration, the present invention designs a third geographic distance adjacency matrix A d according to the distance relationship between bus routes. The weight of the edge in the adjacency matrix is designed in the present invention after normalization processing, and the range is between 0 and 1.
二、多头时空图注意力网络预测模型构建2. Construction of multi-head spatio-temporal graph attention network prediction model
在构建公交网络图后,本发明提出了多头时空图注意力网络预测模型,用于利用有限的数据预测整个城市的公交车旅行时间。这个模型可以有效地学习和预 测全市范围内各条公交线路的行驶时间,特别是对于郊区的线路和没有任何历史公交行驶记录的路径。它可以帮助更新现有的公共交通系统,调整发展中地区线路的过时时间表,并帮助选择新的线路和设计新的线路,提供正常和非正常交通条件下每条线路的旅行时间。这个模型包含两个模块,即空间注意力模块和时间注意力模块。After constructing the bus network graph, the present invention proposes a multi-head spatio-temporal graph attention network prediction model for predicting the bus travel time of the whole city with limited data. This model can effectively learn and predict the travel time of various bus lines in the city, especially for suburban lines and routes without any historical bus travel records. It can help update existing public transport systems, adjust outdated timetables for routes in developing regions, and help select new routes and design new ones, providing travel times for each route under normal and irregular traffic conditions. This model contains two modules, the spatial attention module and the temporal attention module.
2.1)空间注意力模块2.1) Spatial attention module
在空间注意力模块中,本发明应用图注意网络(GAT)来学习不同旅行段之间的空间依赖关系。与图卷积网络(GCN)相比,GAT的动态图处理能力和归纳学习能力更适合在数据有限的情况下进行城市范围内的旅行时间预测。图关注层是GAT的基础部分,它可以学习每节点之间的相关性,并更新每对节点的隐藏特征。节点特征在时间间隔t中表示为h t i。在第一层,h t i是s i段的输入旅行时间记录和嵌入的时间信息。s i和s j的注意系数e t ij可以表示为: In the spatial attention module, the present invention applies a graph attention network (GAT) to learn the spatial dependencies between different travel segments. Compared with graph convolutional network (GCN), GAT's dynamic graph processing ability and inductive learning ability are more suitable for city-wide travel time prediction with limited data. The graph attention layer is the fundamental part of GAT, which can learn the correlation between each node and update the hidden features of each pair of nodes. Node features are denoted as h t i in time interval t. In the first layer, h t i is the input travel time record and embedded temporal information for segment s i . The attention coefficient e t ij of s i and s j can be expressed as:
Figure PCTCN2021132828-appb-000001
Figure PCTCN2021132828-appb-000001
其中W是l层的可学习参数,a(.)是计算相关性的函数。本发明利用LeakyReLU主动函数来训练前馈神经网络。对于每一层,通过softmax函数将输出归一化为[0,1]:where W is the learnable parameter of layer l, and a(.) is the function to calculate the correlation. The present invention utilizes the LeakyReLU active function to train the feed-forward neural network. For each layer, the output is normalized to [0,1] by the softmax function:
Figure PCTCN2021132828-appb-000002
Figure PCTCN2021132828-appb-000002
为了获得更丰富的旅行模式组合,用于有缺失数据的公交车线路准确学习具有完整历史数据的公交线路运行模式,本发明将空间注意扩展为有掩码的多头注意机制,其由具有可学习的K个独立注意力头被串联起来以达到最终的空间注意力结果:In order to obtain a richer combination of travel modes, which is used to accurately learn the operation mode of bus lines with complete historical data for bus lines with missing data, the present invention extends spatial attention to a masked multi-head attention mechanism, which has a learnable The K independent attention heads of L are concatenated to achieve the final spatial attention result:
Figure PCTCN2021132828-appb-000003
Figure PCTCN2021132828-appb-000003
其中σ为softmax函数。在每个注意力头中,本发明对邻接矩阵加入掩码注意力机制,用于关注具有完整历史运行数据的公交线路,并学习运行模式。掩码m在l层的表示为:where σ is the softmax function. In each attention head, the present invention adds a mask attention mechanism to the adjacency matrix, which is used to pay attention to the bus lines with complete historical operation data and learn the operation mode. The representation of mask m at layer l is:
Figure PCTCN2021132828-appb-000004
Figure PCTCN2021132828-appb-000004
Figure PCTCN2021132828-appb-000005
Figure PCTCN2021132828-appb-000005
其中γ为节点i和节点j的注意力a的阈值,对l层加了掩码后的X输出表示为:Where γ is the threshold of the attention a of node i and node j, and the X output after adding a mask to layer l is expressed as:
Figure PCTCN2021132828-appb-000006
Figure PCTCN2021132828-appb-000006
其中X为公交运行时间的输入数据,X’为加入注意力机制后的输出,X l’为在掩码机制加入后第l层的输出,
Figure PCTCN2021132828-appb-000007
为邻接矩阵于掩码矩阵的Hadamard乘积。
Among them, X is the input data of the bus running time, X' is the output after adding the attention mechanism, X l ' is the output of the l layer after the mask mechanism is added,
Figure PCTCN2021132828-appb-000007
is the Hadamard product of the adjacency matrix and the mask matrix.
2.2)时间注意力模块2.2) Temporal attention module
在学习空间依赖性后,本发明连接一个时间注意力模块。由于每个公交车行程的旅行时间受实时交通状况的影响很大。例如,当交通状况正常时,目标路线同一时间段的以往远距离历史记录在当前时间段的行驶时间可能高度相似。然而,当交通拥堵发生时,旅行模式可能是不稳定的,但仍然可能与最近的时间段有类似的模式。因此,对于不同的交通状况,全局(远时间)和局部(最近)的时间旅行模式都需要考虑。After learning spatial dependencies, the present invention connects a temporal attention module. Since the travel time of each bus trip is greatly affected by real-time traffic conditions. For example, when traffic conditions are normal, the travel time of the target route's past long-distance history for the same time period in the current time period may be highly similar. However, when traffic congestion occurs, the travel pattern may be erratic, but still may have a similar pattern to the most recent time period. Therefore, for different traffic situations, both global (far time) and local (nearest) time travel patterns need to be considered.
2.2.1局部时间依赖学习2.2.1 Local time-dependent learning
递归神经网络(RNN)是一种人工神经网络,特别适合捕捉序列学习中的时间依赖性。然而,以前的研究表明,由于梯度消失和爆炸的问题,RNN通常很难训练长序列。为了克服这些缺点,LSTM(Long Short-Term Memory)通过引入一个输入门和一个遗忘门来自动确定最佳时间滞后。因此,本发明建立了一个基于LSTM的模型来关注局部时间信息,其中h t-1是LSTM单元的输入向量,W ix,W ih和b i是递归层的可学习参数矩阵和偏置向量,σ标准sigmoid函数。LSTM的输入门i t可以表示为: A recurrent neural network (RNN) is a type of artificial neural network that is particularly well suited for capturing temporal dependencies in sequence learning. However, previous studies have shown that RNNs are often difficult to train for long sequences due to the problem of vanishing and exploding gradients. To overcome these shortcomings, LSTM (Long Short-Term Memory) automatically determines the optimal time lag by introducing an input gate and a forget gate. Therefore, the present invention builds an LSTM-based model to focus on local temporal information, where h t-1 is the input vector of the LSTM unit, Wi ix , Wi h and bi are the learnable parameter matrix and bias vector of the recurrent layer, σ standard sigmoid function. The input gate it of LSTM can be expressed as:
i t=σ(W ixx t+W ihh t-1+b i)        (7) i t =σ(W ix x t +W ih h t-1 +b i ) (7)
2.2.2全局时间依赖学习2.2.2 Global time-dependent learning
为了从全局角度发现时间上的公交旅行模式信息,本发明在时间注意力模块中引入了transformer层。对于单头注意力层,对于公共交通网络图中的每个节点通常有三种类型的向量Q,查询K、键和值V。隐藏子空间学习过程可以表述 为:In order to discover temporal bus travel pattern information from a global perspective, the present invention introduces a transformer layer in the temporal attention module. For a single-head attention layer, there are usually three types of vector Q, query K, key, and value V for each node in the public transport network graph. The hidden subspace learning process can be expressed as:
Q=X iW Q,K=X iW K,V=X iW V.            (8) Q=X i W Q , K=X i W K , V=X i W V . (8)
W Q,W K,W V为科学系参数,全局时间注意力的输出Attention是根据缩放的点积注意力计算的,,其中d K为缩放因子,表示为: W Q , W K , W V are the parameters of the science department, and the output Attention of the global time attention is calculated according to the scaled dot product attention, where d K is the scaling factor, expressed as:
Figure PCTCN2021132828-appb-000008
Figure PCTCN2021132828-appb-000008
2.3)公交到达时间预测层2.3) Bus arrival time prediction layer
当得到高维时空特征后,本发明使用线性层进行预测。通过最小化期望输出的预测值X’ t+1和真实值之间的均方误差L来训练预测X t+1,使用均方误差(MSE)损失来训练多头时空图注意力网络预测模型,可以表述为: After obtaining the high-dimensional spatio-temporal features, the present invention uses a linear layer for prediction. Train the prediction X t+1 by minimizing the mean square error L between the predicted value X' t+1 of the expected output and the true value, and use the mean square error (MSE) loss to train the multi-head spatiotemporal graph attention network prediction model, Can be expressed as:
Figure PCTCN2021132828-appb-000009
Figure PCTCN2021132828-appb-000009
其中θ是模型中的可学习参数。where θ is a learnable parameter in the model.
在本发明方法的实验验证环节,本发明使用了公交车轨迹和POI信息,这些数据都是从某城市的交通部门获得的。公交车轨迹包括位置、时间戳、速度和公交车ID信息。平均采样频率为每点30秒,每天的数量为278条单独线路产生的约30万个点。POI数据集由建筑位置和类别(社会功能)组成。使用交叉验证法,选择三条线路作为目标线路,以评估本本发明方法的性能和稳健性。本发明选择的三条测试公交线路位于城市的不同区域,其中包括发达的中心区域、偏远地区以及连接中心和郊区的路径。然后对结果进行平均。在三条测试公交线路的每条轨迹中随机删除40%、60%和80%的GPS记录点,以测试不同程度的记录稀少的线路的预测精度。本发明还删除了它们的所有历史记录,将其视为三条正在设计中的路线(没有任何历史旅行时间记录),以测试新路线的旅行时间估计性能(站点位置和路径是设计好的),这有助于评估所提出的模型,此处将本发明方法取名为MAGTTE。In the experimental verification link of the method of the present invention, the present invention uses bus trajectory and POI information, and these data are all obtained from the traffic department of a certain city. Bus trajectories include location, timestamp, speed, and bus ID information. The average sampling frequency is 30 seconds per point, with a daily volume of about 300,000 points generated by 278 individual lines. The POI dataset consists of building locations and categories (social functions). Using the cross-validation method, three lines were selected as target lines to evaluate the performance and robustness of the method of the present invention. The three test bus lines selected by the present invention are located in different areas of the city, including developed central areas, remote areas, and paths connecting the center and the suburbs. The results are then averaged. 40%, 60%, and 80% of the GPS recorded points were randomly deleted from each trajectory of the three test bus lines to test the prediction accuracy for different degrees of sparsely recorded lines. The present invention also deletes all their histories and treats them as three routes under design (without any historical travel time records) to test the travel time estimation performance of new routes (site locations and routes are designed), This facilitates the evaluation of the proposed model, here named MAGTTE for the inventive method.
对比实验方法:Comparative experimental methods:
●历史平均模型(HA)。通过计算每个时间段(15分钟)内公交线路的历史平均行驶时间来预测行驶时间。• Historical average model (HA). Travel time is predicted by calculating the historical average travel time of bus lines in each time period (15 minutes).
●空间-时间人工神经网络(ST-ANN)●Spatial-temporal artificial neural network (ST-ANN)
●卡尔曼滤波(KF)●Kalman filter (KF)
●支持向量回归(SVR)● Support Vector Regression (SVR)
●E-knn:这是一个基于加权增强的k-NN方法提出的模型,它使用与当前交通状况最相似的k条记录来识别交通状况并预测出行时间。在这里,将出行模式的相似度设定为90/%以上,目标路段为k个邻居。● E-knn: This is a proposed model based on the weighted enhanced k-NN method, which uses the k records most similar to the current traffic condition to identify the traffic condition and predict the travel time. Here, the similarity of the travel mode is set to be above 90/%, and the target road segment is k neighbors.
●RnnTTE:该模型基于LSTM神经网络,包含一个具有128个隐藏单元的全连接LSTM层。RnnTTE: The model is based on an LSTM neural network and contains a fully connected LSTM layer with 128 hidden units.
DeepTTE:这是一个模型结合了geo-conv层和lstm层来预测旅行时间。DeepTTE: This is a model that combines geo-conv layers and lstm layers to predict travel time.
实验结果如图4所示,结果表明,在设计过程中,本发明方法在旅行时间预测问题领域的表现优于现有的其他先进的方法,无论是稀疏的记录还是路线。它证明了所提出的MAGTTE能够有效地预测整个城市的公共交通出行时间,MAPE错误率最小,并用于更新和发展公共交通路径。The experimental results are shown in Fig. 4, and the results show that the proposed method outperforms other existing state-of-the-art methods in the field of travel time prediction problems during the design process, whether it is sparse records or routes. It demonstrates that the proposed MAGTTE can effectively predict the public transit travel time of the whole city with the minimum MAPE error rate, and is used to update and develop public transit routes.
上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对本发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation modes of the present invention, and the descriptions thereof are relatively specific and detailed, but should not be construed as limiting the patent scope of the present invention. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the present invention should be determined by the appended claims.

Claims (7)

  1. 一种面向有缺失数据的公交车到站时间预测方法,其特征在于,包括如下步骤:A method for predicting bus arrival time facing missing data, characterized in that it comprises the steps of:
    1)将历史公交车运行轨迹GPS信息、公共交通站点位置信息与城市兴趣点数据进行数据整合;通过基于密度的聚类方法从历史公交车运行轨迹GPS信息中提取公交网络中重要的地理位置,利用得到的地理位置来表示每条公交路线的地理结构,根据公共交通站点位置信息和城市兴趣点数据设计公共交通网络图的节点抽取和边权重表示方法,构建公共交通网络图;1) Data integration of GPS information of historical bus trajectories, location information of public transport stations and urban point of interest data; extraction of important geographic locations in the bus network from GPS information of historical bus trajectories through a density-based clustering method, Use the obtained geographic location to represent the geographical structure of each bus route, design the node extraction and edge weight representation methods of the public transportation network graph according to the location information of public transportation stations and the data of urban interest points, and construct the public transportation network graph;
    2)根据构建的公共交通网络图,建立多头时空图注意力网络预测模型从空间和时间的角度学习公交路线之间的相关性;其中,所述的多头时空图注意力网络预测模型包括顺次连接的空间注意力模块和时间注意力模块;所述空间注意力模块为具有掩码的多头图注意力模块,其中多头设计用于学习不同情况下公交线路之间的全局和局部空间依赖关系,掩码设计用于注意重要的具有完全历史数据的公交线路旅行规律;所述时间注意力模块包括一个LSTM层和一个transformer层,分别用于进行局部时间依赖学习和全局时间依赖学习;2) According to the public transportation network diagram of construction, set up the multi-head spatio-temporal graph attention network prediction model from the perspective of space and time to learn the correlation between bus routes; wherein, the multi-head spatio-temporal graph attention network prediction model includes sequential A connected spatial attention module and a temporal attention module; the spatial attention module is a multi-head map attention module with a mask, wherein the multi-head design is used to learn global and local spatial dependencies between bus lines in different situations, The mask is designed to pay attention to important bus line travel rules with complete historical data; the temporal attention module includes an LSTM layer and a transformer layer for local time-dependent learning and global time-dependent learning, respectively;
    3)利用多头时空图注意力网络预测模型,对具有缺失数据的公交车到站时间进行预测;3) Using the multi-head spatio-temporal graph attention network prediction model to predict the arrival time of buses with missing data;
    所述的步骤3)具体为:对具有缺乏数据的公交线路,利用多头时空图注意力网络预测模型并根据公交车运行模式的相似度,学习前h个时间段内具有完整历史数据的公交车运行模式;继而预测具有缺乏数据的公交线路的公交车到站时间。Described step 3) is specifically: for the bus route that lacks data, utilize multi-head spatio-temporal graph attention network prediction model and according to the similarity of bus operation mode, learn the bus that has complete historical data in the preceding h time period mode of operation; in turn predicting bus arrival times for bus lines with lack of data.
  2. 根据权利要求1所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,通过基于密度的聚类方法从历史公交车运行轨迹GPS信息中提取公交网络中重要的地理位置,具体为:According to the method for predicting the arrival time of buses with missing data according to claim 1, it is characterized in that, extracting the important geographic location in the bus network from the GPS information of the historical bus running track by a density-based clustering method, specifically for:
    根据整合后的数据中速度为0的GPS点的数量来设置基于密度的聚类方法的参数,通过基于密度的聚类方法得到公交网络中重要的地理位置,并根据每个地理位置包含的GPS点的数量来确定地理位置的权重。Set the parameters of the density-based clustering method according to the number of GPS points whose speed is 0 in the integrated data, and obtain the important geographic locations in the bus network through the density-based clustering method, and according to the GPS points contained in each geographic location The number of points to determine the weight of the geographic location.
  3. 根据权利要求1所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,所述的利用得到的重要的地理位置来表示每条路线的地理结构,具体为: 用权重来表现交叉口和站点的位置,以代表每条公交路线的地理结构。According to the method for predicting the arrival time of buses with missing data according to claim 1, it is characterized in that, the important geographic location obtained by the use is used to represent the geographical structure of each route, specifically: using weights to represent The location of intersections and stops to represent the geographic structure of each transit route.
  4. 根据权利要求1所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,所述的节点抽取具体为:在每条公交路线的每两个相邻站点之间选择一个地理位置,将两个站点信息和其之间的被选择的地理位置信息集合作为节点s的信息,以代表两个相邻站点之间的路段。The method for predicting bus arrival time facing missing data according to claim 1, wherein said node extraction is specifically: selecting a geographic location between every two adjacent stations of each bus route , take the two site information and the selected geographic location information set between them as the information of node s to represent the road section between two adjacent sites.
  5. 根据权利要求4所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,所述的每两个相邻站点之间选择一个地理位置构成节点信息,具体为:选择公交路线中,与两个相邻站点位置距离最远,且人流量最大的交叉口,将该交叉口信息与两个相邻站点的地理位置信息一起构成节点信息。According to the method for predicting the arrival time of buses with missing data according to claim 4, it is characterized in that a geographic location is selected between each two adjacent stations to form node information, specifically: selecting a bus route , the intersection with the farthest distance from the two adjacent stations and the largest flow of people, the intersection information and the geographic location information of the two adjacent stations constitute node information.
  6. 根据权利要求5所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,所述的边权重表示方法具体为:According to the method for predicting the arrival time of buses with missing data according to claim 5, it is characterized in that, the described edge weight representation method is specifically:
    建立边图来代表路段之间的关系,其中,边上编码的权重是空间相关性强度或相似性;构建三个分别表示地理结构相似度关系、公交路线之间的距离关系、城市功能区域划分关系的邻接矩阵A,得到三种表示不同关系的公共交通网络图。Create an edge graph to represent the relationship between road sections, where the weight of the edge code is the spatial correlation strength or similarity; construct three graphs to represent the similarity relationship of geographical structure, the distance relationship between bus routes, and the division of urban functional areas. The adjacency matrix A of the relationship is used to obtain three kinds of public transportation network graphs representing different relationships.
  7. 根据权利要求6所述的面向有缺失数据的公交车到站时间预测方法,其特征在于,所述邻接矩阵A的构建方法,具体为:建立地理结构相似关系边图,根据提取的节点包含的三个重要地理位置信息,提取出节点的位置信息、节点长度信息,利用DTW算法做相似度比较,建立节点之间的地理结构相似邻接矩阵A g;然后,根据每个节点中三个地理位置附近的城市兴趣点数据中包含的建筑类别信息,提取每个节点的城市功能类别,根据城市功能的相似度,建立节点之间的城市功能区域划分关系邻接矩阵A f;最后,根据公交路线之间的距离关系,设计了第三种地理距离邻接矩阵A d;邻接矩阵中边的权重经过归一化处理,范围在0到1之间。 According to the method for predicting the arrival time of buses with missing data according to claim 6, it is characterized in that the construction method of the adjacency matrix A is specifically: establishing a geographical structure similarity relationship edge graph, according to the node included in the extraction Three important geographic location information, extract the node location information, node length information, use the DTW algorithm to do similarity comparison, and establish the geographic structure similarity adjacency matrix A g between nodes; then, according to the three geographic location information in each node According to the building category information contained in the nearby urban interest point data, the urban function category of each node is extracted, and the urban functional area division relationship adjacency matrix A f between nodes is established according to the similarity of urban functions; finally, according to the bus route The distance relationship among them, a third geographical distance adjacency matrix A d is designed; the weight of the edge in the adjacency matrix is normalized, and the range is between 0 and 1.
PCT/CN2021/132828 2021-09-01 2021-11-24 Method for bus arrival time prediction when lacking data WO2023029234A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111022557.1 2021-09-01
CN202111022557.1A CN113470365B (en) 2021-09-01 2021-09-01 Bus arrival time prediction method oriented to missing data

Publications (1)

Publication Number Publication Date
WO2023029234A1 true WO2023029234A1 (en) 2023-03-09

Family

ID=77867139

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/132828 WO2023029234A1 (en) 2021-09-01 2021-11-24 Method for bus arrival time prediction when lacking data

Country Status (2)

Country Link
CN (1) CN113470365B (en)
WO (1) WO2023029234A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116432870A (en) * 2023-06-13 2023-07-14 齐鲁工业大学(山东省科学院) Urban flow prediction method
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116542374A (en) * 2023-05-05 2023-08-04 北京蔚行科技有限公司 Public transportation arrival time prediction method based on neural network and Kalman filtering
CN117057848A (en) * 2023-08-28 2023-11-14 淮阴工学院 Takeout order demand prediction method based on GAT-GPR
CN117235556A (en) * 2023-11-13 2023-12-15 浙江大学城乡规划设计研究院有限公司 Traffic manager-oriented multi-mode traffic composite network construction method
CN117334072A (en) * 2023-12-01 2024-01-02 青岛城运数字科技有限公司 Bus arrival time prediction method and device
CN117435935A (en) * 2023-09-13 2024-01-23 广州大学 Personnel group prediction method and device based on self-supervision graph attention network
CN117688453A (en) * 2024-02-02 2024-03-12 山东科技大学 Traffic flow prediction method based on space-time embedded attention network
CN117910660A (en) * 2024-03-18 2024-04-19 华中科技大学 Bus arrival time prediction method and system based on GPS data and space-time correlation

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113470365B (en) * 2021-09-01 2022-01-14 北京航空航天大学杭州创新研究院 Bus arrival time prediction method oriented to missing data
CN113936462A (en) * 2021-10-18 2022-01-14 广州交信投科技股份有限公司 Bus road condition prediction method and system based on ASTGCN algorithm
CN114495484B (en) * 2021-12-17 2023-10-27 北京航空航天大学杭州创新研究院 Bus stop position recommendation method based on multi-source data hierarchical graph clustering algorithm
CN114358428A (en) * 2022-01-06 2022-04-15 深圳大学 Inter-bus station travel time prediction method based on space-time characteristics
CN114841268B (en) * 2022-05-06 2023-04-18 国网江苏省电力有限公司营销服务中心 Abnormal power customer identification method based on Transformer and LSTM fusion algorithm
CN115424467B (en) * 2022-08-19 2023-10-24 贵阳移动金融发展有限公司 Information acquisition system based on public transportation
CN116071939B (en) * 2023-03-24 2023-06-16 华东交通大学 Traffic signal control model building method and control method
CN116596126A (en) * 2023-04-27 2023-08-15 苏州大学 Bus string prediction method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111554118A (en) * 2020-04-24 2020-08-18 深圳职业技术学院 Dynamic prediction method and system for bus arrival time
CN112071062A (en) * 2020-09-14 2020-12-11 山东理工大学 Driving time estimation method based on graph convolution network and graph attention network
CN112150207A (en) * 2020-09-30 2020-12-29 武汉大学 Online taxi appointment order demand prediction method based on space-time context attention network
US20210248460A1 (en) * 2020-02-07 2021-08-12 Uatc, Llc Systems and Methods for Optimized Multi-Agent Routing Between Nodes
CN113470365A (en) * 2021-09-01 2021-10-01 北京航空航天大学杭州创新研究院 Bus arrival time prediction method oriented to missing data

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6902481B2 (en) * 2017-11-10 2021-07-14 株式会社日立製作所 Resource arbitration system and resource arbitration device
CN109410580A (en) * 2018-11-15 2019-03-01 山东管理学院 A kind of real-time arrival time prediction technique of public transport and system
CN109584552B (en) * 2018-11-28 2021-04-30 青岛大学 Bus arrival time prediction method based on network vector autoregressive model
CN110889546B (en) * 2019-11-20 2020-08-18 浙江省交通规划设计研究院有限公司 Attention mechanism-based traffic flow model training method
CN111899511A (en) * 2020-08-03 2020-11-06 西南交通大学 Bus arrival time prediction method for AVL data of collinear line

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210248460A1 (en) * 2020-02-07 2021-08-12 Uatc, Llc Systems and Methods for Optimized Multi-Agent Routing Between Nodes
CN111554118A (en) * 2020-04-24 2020-08-18 深圳职业技术学院 Dynamic prediction method and system for bus arrival time
CN112071062A (en) * 2020-09-14 2020-12-11 山东理工大学 Driving time estimation method based on graph convolution network and graph attention network
CN112150207A (en) * 2020-09-30 2020-12-29 武汉大学 Online taxi appointment order demand prediction method based on space-time context attention network
CN113470365A (en) * 2021-09-01 2021-10-01 北京航空航天大学杭州创新研究院 Bus arrival time prediction method oriented to missing data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Master's Thesis", 1 June 2020, HARBIN INSTITUTE OF TECHNOLOGY, CN, article BOLATOVA MARZHAN: "Study on Prediction Method of Bus Arrival Time of Urban Public Transport", pages: 1 - 84, XP009544136, DOI: 10.27061/d.cnki.ghgdu.2020.002789 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116542438B (en) * 2023-03-28 2024-01-30 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116542374A (en) * 2023-05-05 2023-08-04 北京蔚行科技有限公司 Public transportation arrival time prediction method based on neural network and Kalman filtering
CN116432870B (en) * 2023-06-13 2023-10-10 齐鲁工业大学(山东省科学院) Urban flow prediction method
CN116432870A (en) * 2023-06-13 2023-07-14 齐鲁工业大学(山东省科学院) Urban flow prediction method
CN117057848A (en) * 2023-08-28 2023-11-14 淮阴工学院 Takeout order demand prediction method based on GAT-GPR
CN117057848B (en) * 2023-08-28 2024-05-31 淮阴工学院 Takeout order demand prediction method based on GAT-GPR
CN117435935A (en) * 2023-09-13 2024-01-23 广州大学 Personnel group prediction method and device based on self-supervision graph attention network
CN117235556B (en) * 2023-11-13 2024-02-27 浙江大学城乡规划设计研究院有限公司 Traffic manager-oriented multi-mode traffic composite network construction method
CN117235556A (en) * 2023-11-13 2023-12-15 浙江大学城乡规划设计研究院有限公司 Traffic manager-oriented multi-mode traffic composite network construction method
CN117334072A (en) * 2023-12-01 2024-01-02 青岛城运数字科技有限公司 Bus arrival time prediction method and device
CN117334072B (en) * 2023-12-01 2024-02-23 青岛城运数字科技有限公司 Bus arrival time prediction method and device
CN117688453A (en) * 2024-02-02 2024-03-12 山东科技大学 Traffic flow prediction method based on space-time embedded attention network
CN117688453B (en) * 2024-02-02 2024-04-30 山东科技大学 Traffic flow prediction method based on space-time embedded attention network
CN117910660A (en) * 2024-03-18 2024-04-19 华中科技大学 Bus arrival time prediction method and system based on GPS data and space-time correlation

Also Published As

Publication number Publication date
CN113470365B (en) 2022-01-14
CN113470365A (en) 2021-10-01

Similar Documents

Publication Publication Date Title
WO2023029234A1 (en) Method for bus arrival time prediction when lacking data
Zhang et al. Semi-supervised hierarchical recurrent graph neural network for city-wide parking availability prediction
Sun et al. DxNAT—Deep neural networks for explaining non-recurring traffic congestion
Xing et al. Traffic state estimation of urban road networks by multi-source data fusion: Review and new insights
CN112489426B (en) Urban traffic flow space-time prediction scheme based on graph convolution neural network
Elhenawy et al. Dynamic travel time prediction using data clustering and genetic programming
Ma et al. Multi-attention graph neural networks for city-wide bus travel time estimation using limited data
EP2590151A1 (en) A framework for the systematic study of vehicular mobility and the analysis of city dynamics using public web cameras
CN110851741A (en) Taxi passenger carrying hot spot identification recommendation algorithm
Tang et al. Exploring urban travel patterns using density-based clustering with multi-attributes from large-scaled vehicle trajectories
CN104318324A (en) Taxi GPS (Global Positioning System) record based airport bus station and path planning method
Barnes et al. Bustr: Predicting bus travel times from real-time traffic
Liu et al. Exploring time-dependent traffic congestion patterns from taxi trajectory data
Wang et al. GraphTTE: Travel time estimation based on attention-spatiotemporal graphs
Nie et al. Public curb parking demand estimation with poi distribution
Qaddoura et al. Temporal prediction of traffic characteristics on real road scenarios in Amman
Hongsakham et al. Estimating road traffic congestion from cellular handoff information using cell-based neural networks and K-means clustering
Wei et al. Turn-level network traffic bottleneck identification using vehicle trajectory data
Herring Real-time traffic modeling and estimation with streaming probe data using machine learning
Hu et al. A novel method for the detection of road intersections and traffic rules using big floating car data
Li et al. Personalized travel time prediction using a small number of probe vehicles
Gupta et al. Lstm based real-time smart parking system
CN115062708A (en) Abnormal node detection method based on track deviation point embedding and depth clustering
Tran et al. Tleta: Deep transfer learning and integrated cellular knowledge for estimated time of arrival prediction
Kakarla et al. Travel Time Prediction and Route Performance Analysis in BRTS based on Sparse GPS Data

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21955762

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21955762

Country of ref document: EP

Kind code of ref document: A1