CN109035761B - A Travel Time Estimation Method Based on Assisted Supervised Learning - Google Patents

A Travel Time Estimation Method Based on Assisted Supervised Learning Download PDF

Info

Publication number
CN109035761B
CN109035761B CN201810658375.5A CN201810658375A CN109035761B CN 109035761 B CN109035761 B CN 109035761B CN 201810658375 A CN201810658375 A CN 201810658375A CN 109035761 B CN109035761 B CN 109035761B
Authority
CN
China
Prior art keywords
time
neural network
grid
vector
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201810658375.5A
Other languages
Chinese (zh)
Other versions
CN109035761A (en
Inventor
孙未未
章瀚元
吴昊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fudan University
Original Assignee
Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fudan University filed Critical Fudan University
Priority to CN201810658375.5A priority Critical patent/CN109035761B/en
Publication of CN109035761A publication Critical patent/CN109035761A/en
Application granted granted Critical
Publication of CN109035761B publication Critical patent/CN109035761B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0125Traffic data processing
    • G08G1/0129Traffic data processing for creating historical data or processing based on historical data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions
    • G08G1/0137Measuring and analyzing of parameters relative to traffic conditions for specific applications

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Chemical & Material Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Traffic Control Systems (AREA)

Abstract

本发明属于智能交通技术领域,具体为一种基于辅助监督学习的行程时间估计方法。其从海量历史轨迹数据中寻找统计规律,通过端到端的深度学习模型对整个行程的时间进行整体的估计;步骤包括:特征提取和表示阶段,对轨迹数据进行预处理,分别抽取它的时间和空间特征,驾驶状态特征,短时间和长时间的交通状况特征;训练和预测阶段,将这些提取的特征用统一的双向循环神经网络进行训练和预测;循环神经网络每一步都输出通过当前小区域的时间开销;这些小区域的时间开销的总和即为总路径的时间开销。同时,还引入双向区间损失函数来约束中间时间开销。本方法可高效准确地对城市中的车辆行程时间进行估计,在实际环境下具有较好的效果。

Figure 201810658375

The invention belongs to the technical field of intelligent transportation, in particular to a travel time estimation method based on auxiliary supervision learning. It looks for statistical rules from massive historical trajectory data, and estimates the overall travel time through an end-to-end deep learning model; the steps include: feature extraction and representation stage, preprocessing the trajectory data, and extracting its time and time respectively. Spatial features, driving state features, short-term and long-term traffic status features; in the training and prediction stage, these extracted features are trained and predicted with a unified bidirectional recurrent neural network; the output of the recurrent neural network passes through the current small area at each step. The time cost of these small areas is the sum of the time cost of the total path. At the same time, a bidirectional interval loss function is also introduced to constrain the intermediate time overhead. The method can efficiently and accurately estimate the travel time of vehicles in the city, and has a good effect in the actual environment.

Figure 201810658375

Description

基于辅助监督学习的行程时间估计方法A Travel Time Estimation Method Based on Assisted Supervised Learning

技术领域technical field

本发明属于智能交通技术领域,具体涉及一种基于辅助监督学习的行程时间估计方法。The invention belongs to the technical field of intelligent transportation, and in particular relates to a travel time estimation method based on auxiliary supervision learning.

背景技术Background technique

行程时间估计是城市交通领域一个必不可少的重要技术,可以为人们的出行通勤提供帮助,也可以为政府规划决策提供支持。但这并不是一个简单的小问题,而是会受到各种动态因素的影响,如交通动态,路口状况,司机驾驶行为的变化和历史周期性的数据演化等等。这些因素导致行程时间估计存在不确定性和难度。随着支持GPS的移动设备的发展和普及,目前已经有大量的轨迹数据在源源不断地产生,并且覆盖城市的各个角落。有了这些海量的历史轨迹数据,我们可以挖掘数据背后的内在规律,通过构建算法模型来学习出行程时间的变化的周期和趋势,从而更加准确地推断当前查询轨迹所需的时间开销。Travel time estimation is an essential and important technology in the field of urban transportation, which can help people commute and support government planning decisions. But this is not a simple small problem, but will be affected by various dynamic factors, such as traffic dynamics, intersection conditions, changes in driver driving behavior and historical periodic data evolution, etc. These factors lead to uncertainty and difficulty in estimating travel time. With the development and popularization of GPS-enabled mobile devices, a large amount of trajectory data has been continuously generated, covering every corner of the city. With these massive historical trajectory data, we can mine the inherent laws behind the data, and learn the cycle and trend of travel time changes by building an algorithm model, so as to more accurately infer the time cost required for the current query trajectory.

目前已有的方法大多采用分而治之(divide-and-conquer)的方法,主要是通过将路径分解一系列的路段或者子路径这两类。At present, most of the existing methods adopt the divide-and-conquer method, which mainly decomposes the path into a series of road segments or sub-paths.

(1)基于单一路段的方法:(1) Method based on a single road segment:

基于单路段的方法主要通过估计每一条单一路段的轨迹经过时的平均速度,进而根据路段长度计算出经过的平均时间开销,最后将各个路段的时间和累加得到总的时间。但这种方法没有考虑路段之间的路口时间开销。另外,这种估计严重依赖于高质量的速度数据,而这往往在轨迹数据中无法得到。The single-segment-based method mainly estimates the average speed of each single-segment trajectory when it passes, and then calculates the average time cost according to the length of the segment, and finally accumulates the time of each segment to obtain the total time. But this method does not consider the intersection time overhead between road segments. Additionally, this estimation relies heavily on high-quality velocity data, which is often unavailable in trajectory data.

(2)基于子路径的方法:(2) Method based on sub-path:

基于子路径的方法主要通过将路径分割成一系列的子路径方法,使得路口的时间开销也得到考虑。主要思路都是对历史数据中丰富的公共子路径信息进行拼接和挖掘。尽管这种方法可以克服单一路段方法的许多缺陷,但它仍然是基于启发式设计,而不是直接将行程时间作为算法优化目标。The sub-path-based method mainly divides the path into a series of sub-path methods, so that the time cost of the intersection is also considered. The main idea is to splicing and mining the rich public sub-path information in historical data. Although this approach can overcome many of the shortcomings of the single-segment approach, it is still based on heuristic design rather than directly targeting travel time as an algorithmic optimization objective.

总而言之,目前已有的方法无法达到令人满意的准确性有两个方面的原因。一个是它们没有把路径看成一个整体,而是拆分成各个子块。在这一拆分过程中,损失了很多有用的信息。并且,它们没有充分利用轨迹数据特有的中间监督标签,也就是每一个中间GPS采样点的时间戳信息。另一方面,随着深度学习技术的发展和繁荣,更多的问题可以通过端到端一体式地解决,相较于传统启发式模型要更为高效。并且,深度学习有着强大的表征能力,与手工模型相比,可以捕捉到更多的潜在特征,能够处理行程估计问题中各种复杂的动态性。To sum up, there are two reasons why the existing methods cannot achieve satisfactory accuracy. One is that they don't think of the path as a whole, but split into individual sub-blocks. During this split, a lot of useful information is lost. Moreover, they do not take full advantage of the intermediate supervision labels specific to trajectory data, that is, the timestamp information of each intermediate GPS sample point. On the other hand, with the development and prosperity of deep learning technology, more problems can be solved in an end-to-end integrated manner, which is more efficient than traditional heuristic models. Moreover, deep learning has a powerful representation ability, which can capture more latent features than manual models, and can handle various complex dynamics in the itinerary estimation problem.

发明内容SUMMARY OF THE INVENTION

本发明的目的是针对传统的两类行程时间估计技术的局限性,提出一种基于辅助监督学习的历史轨迹的行程时间估计方法,以克服现有技术的不足。The purpose of the present invention is to propose a travel time estimation method based on historical trajectories of auxiliary supervised learning in view of the limitations of the traditional two types of travel time estimation techniques, so as to overcome the deficiencies of the prior art.

本发明方法从海量历史轨迹数据中寻找统计规律,通过端到端的深度学习模型对整个行程的时间进行整体的估计。基本步骤包括:特征提取和表示阶段,对轨迹数据进行预处理,分别抽取它的各方面特征;训练和预测阶段,将这些提取的特征用一个统一的双向循环神经网络进行训练和预测;循环神经网络每一步都输出通过当前小区域的时间开销;这些小区域的时间开销的总和即为总路径的时间开销;为了更加有效地进行训练,还引入了双向区间损失函数来约束中间时间开销。The method of the invention searches for statistical laws from massive historical trajectory data, and makes an overall estimation of the time of the entire journey through an end-to-end deep learning model. The basic steps include: feature extraction and representation stage, preprocessing trajectory data, and extracting its various features; training and prediction stage, using these extracted features to train and predict with a unified bidirectional recurrent neural network; recurrent neural network Each step of the network outputs the time cost of passing through the current small area; the sum of the time cost of these small areas is the time cost of the total path; for more efficient training, a bidirectional interval loss function is also introduced to constrain the intermediate time cost.

本发明提出的基于辅助监督学习的历史轨迹的行程时间估计方法,分为如下三个阶段:The travel time estimation method based on the historical trajectory of the auxiliary supervision learning proposed by the present invention is divided into the following three stages:

(一)特征提取和表示阶段,对历史轨迹数据进行预处理,抽取它的各方面特征(包括时间特征和空间特征,驾驶状态特征,短时间和长时间的交通状况特征等)。具体步骤为:(1) Feature extraction and representation stage, preprocessing the historical trajectory data and extracting its various features (including temporal and spatial features, driving state features, short- and long-term traffic conditions, etc.). The specific steps are:

步骤(1),在城市范围内,根据经纬度坐标对网格进行细粒度划分,形成一个个相邻的矩形小区域。将按时间顺序排序,由GPS坐标组成的轨迹序列中的每一个坐标点映射到对应的小区域中,形成一个由网格坐标组成的序列。对于相邻轨迹点距离较远,落在不连续的小区域内的情况,可以地图匹配等算法得到中间经过路径,补全这部分不连续的区域信息。In step (1), within the city limits, fine-grained division is performed on the grid according to the latitude and longitude coordinates to form adjacent small rectangular areas. Sorting in chronological order, each coordinate point in the trajectory sequence composed of GPS coordinates is mapped to the corresponding small area, forming a sequence composed of grid coordinates. For the case where the adjacent track points are far apart and fall in a discontinuous small area, the intermediate passing path can be obtained by algorithms such as map matching, and the information of this part of the discontinuous area can be completed.

步骤(2),对于每一个网格,挖掘它不同方面的特征。首先,使用嵌入向量技术来挖掘潜在语义信息。嵌入向量技术在自然语言处理和社交网络等领域等到了广泛的使用,主要是利用低维的实数向量来代表每一个词或者事物的语义信息,通过向量空间中的距离关系来衡量实物之间的对应关系。本发明利用嵌入向量技术来表征每一个网格小区域在不同空间以及不同时间段的语义信息。这些信息包含了城市不同的功能区域(例如居民区,商业区或工业区等等)空间区位信息,也包括了早高峰,周末等时间信息。具体地,利用低维向量来表示每一个网格的空间向量Vsp,将一天划分成多个时间桶(例如一个小时一个桶),每一条轨迹根据具体落入的时间桶来得到时间向量Vtp。对Vsp和Vtp进行随机初始化,之后在模型训练时跟着模型一起训练。Step (2), for each grid, excavate its features in different aspects. First, the embedding vector technique is used to mine latent semantic information. Embedding vector technology has been widely used in natural language processing and social networks. It mainly uses low-dimensional real-number vectors to represent the semantic information of each word or thing, and measures the distance between objects through the distance relationship in the vector space. Correspondence. The present invention utilizes the embedded vector technology to represent the semantic information of each small grid area in different spaces and different time periods. This information includes the spatial location information of different functional areas of the city (such as residential areas, commercial areas or industrial areas, etc.), as well as time information such as morning peak hours and weekends. Specifically, a low-dimensional vector is used to represent the space vector V sp of each grid, and a day is divided into multiple time buckets (for example, one bucket per hour), and each trajectory obtains a time vector V according to the time bucket it falls into. tp . Randomly initialize V sp and V tp , and then train with the model during model training.

步骤(3),司机在开车时,在不同的行驶状态时,行驶的速度和驾驶行为都会发生变化。例如,车辆在行驶路径的中间部分时,会更倾向于行驶在大路或者高架上,这时速度会更快。而在刚出发或者快到终点时,由于行驶在小路或者人多的区域,往往速度就会变慢。具体地,使用四维向量Vdri来表示当前行驶阶段是出发阶段,中途阶段,还是结束阶段,以及在各个阶段已经行驶的比例。例如,Vdri=(1,0,0,0.2)表示司机行驶在开始阶段,占了总行程的20%。In step (3), when the driver is driving, in different driving states, the driving speed and driving behavior will change. For example, when the vehicle is in the middle part of the driving path, it will be more inclined to drive on the road or on the elevated road, and then the speed will be faster. When you just start or approach the end point, the speed tends to slow down due to driving on a small road or in a crowded area. Specifically, the four-dimensional vector V dri is used to represent whether the current travel stage is a departure stage, an intermediate stage, or an end stage, as well as the proportions of which have been traveled in each stage. For example, V dri =(1,0,0,0.2) means that the driver is driving at the beginning, accounting for 20% of the total trip.

步骤(4),在一个区域内的交通状况,往往随着时间演变会有周期性和规律性的变化。例如,如果一个路段在8点到8点半都很堵,那么8点35分它也可能很堵。也就是说,过去短时间内的交通状况信息,对预测当前的交通状态很有帮助。定义该短时间的交通状况特征为Vshort。与此同时,长时间周期性的交通状况变化也能帮助预测当前交通状况,例如工作日和周末的交通状况变化规律。定义该长时间的交通状况特征为Vlong。具体来说,In step (4), the traffic conditions in an area often have periodic and regular changes over time. For example, if a road segment is congested between 8:00 and 8:30, it may also be congested by 8:35. That is to say, the traffic situation information in a short period of time in the past is very helpful for predicting the current traffic state. The traffic condition characteristic of this short time is defined as V short . At the same time, long-term periodic changes in traffic conditions can also help predict current traffic conditions, such as the regularity of traffic conditions on weekdays and weekends. The traffic condition characteristic for this long time is defined as V long . Specifically,

定义:

Figure BDA0001706094160000031
definition:
Figure BDA0001706094160000031

表示在过去第j个时间区间内,当前小区域gi的交通状况,其中vj表示历史平均速度,nj表示历史轨迹数据数量,leni/vj表示粗略估计的通过时间。将这些交通状况特征按照历史时间顺序输入到一个子循环神经网络中,可以抽取出交通状况特征。Represents the current traffic condition of the small area g i in the jth time interval in the past, where v j represents the historical average speed, n j represents the number of historical trajectory data, and len i /v j represents the rough estimated transit time. These traffic condition features are input into a sub-recurrent neural network according to the historical time sequence, and the traffic condition features can be extracted.

另外,由于历史数据在不同空间区域分布不均衡,有些区域轨迹经过数量较少,可能会对估计的准确性造成影响。为了解决这一数据稀疏问题,将邻接小区域的交通状况信息也考虑进来,即In addition, due to the uneven distribution of historical data in different spatial regions, the number of trajectories in some regions is small, which may affect the accuracy of the estimation. In order to solve this data sparse problem, the traffic condition information of adjacent small areas is also taken into account, namely

定义:

Figure BDA0001706094160000032
definition:
Figure BDA0001706094160000032

表示距离gi距离不超过d的网格集合,收集它们过去短时的交通状况特征,一起输入到神经网络中。其中,x,y表示网格的坐标,gj表示除gi以外的其他网格。Represents a set of grids whose distance from gi does not exceed d, and collects their short-term traffic conditions in the past and inputs them into the neural network together. Among them, x, y represent the coordinates of the grid, and g j represents other grids except gi .

(二)训练阶段,将历史轨迹数据中提取的特征输入到一个统一的双向循环神经网络(bidirectional LSTM,参考文献:Graves A,Schmidhuber J.Framewise phonemeclassification with bidirectional LSTM and other neural network architectures[J].Neural Networks,2005,18(5-6):602-610.)进行训练,并且以双向区间损失函数作为训练的约束;具体步骤为:(2) In the training phase, the features extracted from the historical trajectory data are input into a unified bidirectional recurrent neural network (bidirectional LSTM, reference: Graves A, Schmidhuber J. Framewise phonemeclassification with bidirectional LSTM and other neural network architectures [J]. Neural Networks, 2005, 18(5-6): 602-610.) for training, and the two-way interval loss function is used as the training constraint; the specific steps are:

步骤(1),构建循环神经网络。定义网络隐层为

Figure BDA0001706094160000033
输入数据为
Figure BDA0001706094160000034
那么,第t步的输入数据为xt,第t步得到的计算结果为ht,则有:Step (1), build a recurrent neural network. Define the hidden layer of the network as
Figure BDA0001706094160000033
The input data is
Figure BDA0001706094160000034
Then, the input data of the t-th step is x t , and the calculation result obtained in the t-th step is h t , then there are:

ht=φ(xt·Wx+ht-1·Wh+b) (3)h t =φ(x t ·W x +h t-1 ·W h +b) (3)

其中,

Figure BDA0001706094160000035
是输入数据的权重矩阵(weight matrix),
Figure BDA0001706094160000036
是隐层的权重矩阵,
Figure BDA0001706094160000037
是偏置参数(bias)。φ表示一个非线性激活函数,可以是sigmoid函数,ReLU函数,tanh函数等等。in,
Figure BDA0001706094160000035
is the weight matrix of the input data,
Figure BDA0001706094160000036
is the weight matrix of the hidden layer,
Figure BDA0001706094160000037
is the bias parameter (bias). φ represents a nonlinear activation function, which can be a sigmoid function, a ReLU function, a tanh function, and so on.

也就是说隐状态可以表示为函数:That is, the hidden state can be represented as a function:

ht=f(ht-1,xt) (4)h t =f(h t-1 ,x t ) (4)

在这基础上,定义遗忘门为:On this basis, the forget gate is defined as:

ft=σ(Wf·[ht-1,xt]+bf) (5)f t =σ(W f ·[h t-1 ,x t ]+b f ) (5)

输入门为:The input gate is:

it=σ(Wi·[ht-1,xt]+bi) (6)i t =σ(W i ·[h t-1 ,x t ]+b i ) (6)

输出门为:The output gate is:

ot=σ(Wo[ht-1,xt]+bo) (7)o t =σ(W o [h t-1 ,x t ]+b o ) (7)

记忆单元的更新为:The update of the memory unit is:

Figure BDA0001706094160000041
Figure BDA0001706094160000041

Figure BDA0001706094160000042
Figure BDA0001706094160000042

隐层的更新为:The update of the hidden layer is:

ht=Ot·tanh(Ct) (10)h t =O t ·tanh(C t ) (10)

其中,Wf、Wi、Wo

Figure BDA0001706094160000043
分别表示遗忘门、输入门、输出门和记忆单元的权重矩阵,bf、bi、bo
Figure BDA0001706094160000044
则是对应的偏置参数。σ()为一个非线性的激活函数,例如
Figure BDA0001706094160000045
是一个sigmoid函数,
Figure BDA0001706094160000046
是一个双曲正切函数,f( )表示一个包含各层参数的抽象神经网络函数。定义循环神经网络对应的参数为WN;从[-α,α]的均匀分布中对循环神经网络中的每个权重参数进行初始化,其中,α是为一个超参数,设定范围为0.01到1。Among them, W f , Wi , W o ,
Figure BDA0001706094160000043
respectively represent the weight matrix of forgetting gate, input gate, output gate and memory unit, b f , b i , b o ,
Figure BDA0001706094160000044
is the corresponding bias parameter. σ() is a nonlinear activation function, such as
Figure BDA0001706094160000045
is a sigmoid function,
Figure BDA0001706094160000046
is a hyperbolic tangent function, and f( ) represents an abstract neural network function containing parameters of each layer. Define the parameter corresponding to the recurrent neural network as W N ; initialize each weight parameter in the recurrent neural network from the uniform distribution of [-α,α], where α is a hyperparameter, and the setting range is 0.01 to 1.

双向循环神经网络同时使用一个正向的循环神经网络和一个反向的循环神经网络进行计算。其中正向循环神经网络根据序列的顺序依次将之前步骤提取的网格特征输入,而反向循环神经网络则将序列逆序后输入网格特征。这么做的优点在于,可以使得神经网络同时观察到当前网格距离起点和终点的位置距离,从而拥有一个整体的特征。定义它的隐变量为正向和反向网络的拼接

Figure BDA0001706094160000047
其中
Figure BDA0001706094160000048
表示正向循环神经网络的隐层,
Figure BDA0001706094160000049
表示反向循环神经网络的隐层。Bidirectional RNNs use both a forward RNN and a reverse RNN for computation. The forward recurrent neural network sequentially inputs the grid features extracted in the previous steps according to the sequence of the sequence, while the reverse recurrent neural network inputs the grid features in reverse order of the sequence. The advantage of this is that the neural network can simultaneously observe the position distance of the current grid from the starting point and the ending point, so as to have an overall feature. Define its latent variables as the concatenation of forward and reverse networks
Figure BDA0001706094160000047
in
Figure BDA0001706094160000048
represents the hidden layer of the forward recurrent neural network,
Figure BDA0001706094160000049
Represents the hidden layer of an inverse recurrent neural network.

步骤(2),将历史轨迹数据中提取的特征,即空间特征

Figure BDA00017060941600000410
时间特征
Figure BDA00017060941600000411
驾驶状态特征
Figure BDA00017060941600000412
历史上短时间和长时间的交通状态特征
Figure BDA00017060941600000413
Figure BDA00017060941600000414
拼接成一个统一的特征向量:Step (2), the features extracted from the historical trajectory data, that is, the spatial features
Figure BDA00017060941600000410
temporal features
Figure BDA00017060941600000411
Driving state characteristics
Figure BDA00017060941600000412
Historical short- and long-term traffic state characteristics
Figure BDA00017060941600000413
and
Figure BDA00017060941600000414
Concatenated into a unified feature vector:

Figure BDA00017060941600000415
Figure BDA00017060941600000415

在每一个经过的小网格输入到双向循环神经网络,以得到经过该网格的通过时间,即WT·hi+b。总的行程的时间开销

Figure BDA00017060941600000416
为:Input to the bidirectional recurrent neural network at each passing small grid to get the transit time of passing through the grid, namely W T · h i +b. total travel time
Figure BDA00017060941600000416
for:

Figure BDA00017060941600000417
Figure BDA00017060941600000417

定义

Figure BDA0001706094160000051
分别为计算总时间开销的权重矩阵和偏置参数。WT表示W矩阵的转置。definition
Figure BDA0001706094160000051
are the weight matrix and bias parameters for calculating the total time cost, respectively. W T represents the transpose of the W matrix.

步骤(3),定义轨迹经过各个网格序列的真实时间开销向量为T。顺序的真实时间开销向量为Tf,逆序的真实时间开销向量为Tb。则神经网络估计得到的时间开销向量为:In step (3), the real time cost vector of the trajectory passing through each grid sequence is defined as T. The sequential real time cost vector is T f , and the reversed real time cost vector is T b . Then the time cost vector estimated by the neural network is:

Figure BDA0001706094160000052
Figure BDA0001706094160000052

Figure BDA0001706094160000053
Figure BDA0001706094160000053

使用双向区间损失函数对模型进行辅助监督学习,使其不仅学习整条路径的时间开销,同时可以学习各个中间阶段的通行时间。定义双向区间损失函数为:The model is assisted with supervised learning using a bidirectional interval loss function, so that it not only learns the time cost of the entire path, but also learns the transit time of each intermediate stage. The two-way interval loss function is defined as:

Figure BDA0001706094160000054
Figure BDA0001706094160000054

其中,M表示轨迹是否经过小区域的掩码,[]表示向量每个元素间的操作。Among them, M represents whether the trajectory passes through the mask of the small area, and [] represents the operation between each element of the vector.

步骤(4),训练的目标是,最小化损失函数L,即:Step (4), the goal of training is to minimize the loss function L, that is:

Figure BDA0001706094160000055
Figure BDA0001706094160000055

其中,θ表示模型的训练参数,ε表示时间和空间上的嵌入向量,S是训练集的大小。最后,使用基于时间顺序的反向传播算法对模型进行参数的更新和优化。反向传播算法参考文献:Chauvin Y,Rumelhart D E.Backpropagation:theory,architectures,andapplications[M].Psychology Press,2013.where θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the parameters of the model are updated and optimized using a time-order-based backpropagation algorithm. Backpropagation Algorithm References: Chauvin Y, Rumelhart D E. Backpropagation:theory,architectures,andapplications[M].Psychology Press,2013.

(三)预测阶段,用双向循环神经网络对查询路径中提取的特征进行推断并估计行程时间;具体步骤为:(3) In the prediction stage, the bidirectional recurrent neural network is used to infer the features extracted from the query path and estimate the travel time; the specific steps are:

步骤(1),给定一条没有时间戳标记的真实行程作为查询路径,根据经过的实际路径,得到其映射的网格序列。对于每一个经过的小网格,使用特征提取和表示阶段抽取得到的时空特征Vsp和Vtp,驾驶状态特征Vdri,和历史上短时间和长时间的交通状态特征Vshort和Vlong,作为该网格的总特征表示V。其中,时空特征的嵌入向量使用经过训练过程的参数更新后的向量信息。短时和长时间的交通状态特征使用经过训练的子循环神经网络进行特征挖掘。In step (1), a real itinerary without timestamp is given as a query path, and its mapped grid sequence is obtained according to the actual path passed. For each passing small grid, using the spatiotemporal features Vsp and Vtp extracted from the feature extraction and representation stage, the driving state feature Vdri , and the historical short and long time traffic state features Vshort and Vlong , Denote V as the total feature of this mesh. Among them, the embedding vector of the spatiotemporal feature uses the vector information updated by the parameters of the training process. Short- and long-term traffic state features are mined using a trained sub-recurrent neural network.

步骤(2),在每一个经过的网格,将抽取的各方面特征输入到已经经过训练的双向循环神经网络中,得到当前的隐变量ht,那么经过当前区域的估计时间为WT·ht+b。而总的时间开销估计值为:Step (2), in each passing grid, input the extracted features into the bidirectional recurrent neural network that has been trained to obtain the current hidden variable h t , then the estimated time passing through the current area is W T · ht +b. And the total time cost estimate is:

Figure BDA0001706094160000056
Figure BDA0001706094160000056

其中,n表示经过的总网格数目,

Figure BDA0001706094160000057
为经过训练得到的,计算总时间开销的权重矩阵和偏置参数。WT表示W矩阵的转置。where n is the total number of grids passed,
Figure BDA0001706094160000057
For the weight matrix and bias parameters obtained by training, the total time overhead is calculated. W T represents the transpose of the W matrix.

总的来说,本发明方法有以下几个优点。首先,利用端到端(end-to-end)基于历史数据训练的深度学习方法,直接学习出整条路径的特征并估计出整体的通行时间。我们定义了一个双向区间损失函数,可以在监督整体的路径时间的基础上,同时辅助监督通过中间路段的时间开销。这种引入辅助监督的方法既丰富了路径的样本信息,又可以使得反向传播算法对参数更新时传播信号可以更加准确。其次,提出了一个特征抽取结构,通过提取时空嵌入向量,行驶状态,以及短时间和长时间的交通状况等不同维度的动态特征,能有效地估计出路径的通行时间。最后,在实际环境下经过实验验证,具有比已有方法更好的实验结果。In general, the method of the present invention has the following advantages. First, an end-to-end deep learning method based on historical data training is used to directly learn the characteristics of the entire route and estimate the overall transit time. We define a bidirectional interval loss function that can supervise the overall path time while assisting in supervising the time cost of passing intermediate road segments. This method of introducing auxiliary supervision not only enriches the sample information of the path, but also enables the backpropagation algorithm to propagate the signal more accurately when the parameters are updated. Secondly, a feature extraction structure is proposed, which can effectively estimate the travel time of the path by extracting the dynamic features of different dimensions such as the spatiotemporal embedding vector, the driving state, and the short-time and long-time traffic conditions. Finally, it has been experimentally verified in the actual environment, and has better experimental results than existing methods.

如表1所示,我们用真实的历史轨迹数据进行实验,包括波尔图和上海两个城市。我们用路段平均时间法,子路径动态规划法,网格全连接网络法,网格卷积网络法等已有方法进行对比。其中,路段平均时间法通过统计每个路段的平均通过时间,直接累加得到结果。子路径动态规划法[Yilun Wang,Yu Zheng,and Yexiang Xue.Travel timeestimation of a path using sparse trajecto-ries.In Proceedings of the 20thInternational Conference on Knowledge Discovery and Data Mining(SIGKDD),pages25–34,2014.]利用动态规划找到子路径的最优拼接方法。网格全连接网络法和网格卷积网络法将N×N的整体网格作为输入,分别用全连接网络(Multi-Layer Perceptron)和卷积神经网络(Convolutional Neural Network)进行优化和估计。我们使用MAE,RMSE,MAPE三个误差度量指标衡量方法的好坏。As shown in Table 1, we conduct experiments with real historical trajectory data, including the cities of Porto and Shanghai. We compare the existing methods such as the road segment average time method, the sub-path dynamic programming method, the grid fully connected network method, and the grid convolution network method. Among them, the average time of the road segment is obtained by directly accumulating the average passing time of each road segment. Subpath Dynamic Programming [Yilun Wang, Yu Zheng, and Yexiang Xue. Travel timeestimation of a path using sparse trajecto-ries. In Proceedings of the 20th International Conference on Knowledge Discovery and Data Mining (SIGKDD), pages25–34, 2014.] Use dynamic programming to find the optimal splicing method of subpaths. The grid fully connected network method and the grid convolutional network method take the N×N overall grid as input, and use the fully connected network (Multi-Layer Perceptron) and the convolutional neural network (Convolutional Neural Network) respectively for optimization and estimation. We use MAE, RMSE, MAPE three error metrics to measure the quality of the method.

Figure BDA0001706094160000061
Figure BDA0001706094160000061

Figure BDA0001706094160000062
Figure BDA0001706094160000062

Figure BDA0001706094160000063
Figure BDA0001706094160000063

其中,y表示真实值,

Figure BDA0001706094160000064
表示估计值,n表示样本总数。由表1结果可知,本发明方法在各项指标都要远好于已有的对比方法。例如,在上海数据集上,本发明方法估计MAE误差只有126秒,MAPE误差是13.3%,而已有最好方法的MAE误差在168秒,MAPE误差是19.1%。where y represents the true value,
Figure BDA0001706094160000064
represents the estimated value, and n represents the total number of samples. It can be seen from the results in Table 1 that the method of the present invention is far better than the existing comparative method in each index. For example, on the Shanghai dataset, the method of the present invention estimates that the MAE error is only 126 seconds, and the MAPE error is 13.3%, while the MAE error of the best method is 168 seconds, and the MAPE error is 19.1%.

表1Table 1

Figure BDA0001706094160000065
Figure BDA0001706094160000065

附图说明Description of drawings

图1表示一个真实轨迹样本,包含每一个中间GPS轨迹点的时间戳信息,一共经过720s。Figure 1 shows a real trajectory sample, including the timestamp information of each intermediate GPS trajectory point, after a total of 720s.

图2表示需要查询的路径样本,仅包含具体经过的路径信息,不包含任何时间戳信息。Figure 2 shows the path sample to be queried, which only contains the specific path information passed, and does not contain any timestamp information.

具体实施方式Detailed ways

下面结合具体实例来说明本发明的具体实施过程:The specific implementation process of the present invention is described below in conjunction with specific examples:

如图1中的历史轨迹用于训练,并估计图2中的行程时间。Historical trajectories in Figure 1 are used for training, and travel times in Figure 2 are estimated.

一、预处理阶段,特征提取和表示阶段,对轨迹数据进行预处理,抽取它的各方面特征。以图1为例,具体步骤为:1. The preprocessing stage, the feature extraction and representation stage, preprocesses the trajectory data and extracts all aspects of its features. Taking Figure 1 as an example, the specific steps are:

(1)在城市范围内,进行细粒度网格划分,分成一个个相邻的小区域。如图1中,将地图划分成5×6个网格。将轨迹序列中的每一个坐标点映射到对应的小区域中,形成一个网格序列,即g={g1,g2,…,g10}。(1) Within the city limits, fine-grained grid division is performed and divided into adjacent small areas. As shown in Figure 1, the map is divided into 5×6 grids. Each coordinate point in the trajectory sequence is mapped to the corresponding small area to form a grid sequence, ie g={g 1 , g 2 ,...,g 10 }.

(2)对于每一个网格,挖掘它不同方面的特征。例如,对于g1,使用随机向量

Figure BDA0001706094160000071
Figure BDA0001706094160000072
来表示时空语义信息。即:(2) For each grid, excavate its features in different aspects. For example, for g 1 , use a random vector
Figure BDA0001706094160000071
and
Figure BDA0001706094160000072
to represent spatiotemporal semantic information. which is:

Figure BDA0001706094160000073
Figure BDA0001706094160000073

Figure BDA00017060941600000711
Figure BDA00017060941600000711

其次,使用四维向量

Figure BDA0001706094160000074
来表示当前行驶阶段是出发阶段,中途阶段,还是结束阶段,以及在各个阶段已经行驶的比例,即:Second, use a four-dimensional vector
Figure BDA0001706094160000074
To indicate whether the current driving stage is a departure stage, a midway stage, or an end stage, and the proportion of travel in each stage, namely:

Figure BDA0001706094160000075
Figure BDA0001706094160000075

最后,使用过去短时间和长时间的交通状况信息来预测当前的交通状况特征

Figure BDA0001706094160000076
Figure BDA0001706094160000077
具体来说,定义
Figure BDA0001706094160000078
为过去的第1到6个时间区间(5min)内,当前区域g1的交通状况。例如
Figure BDA0001706094160000079
表示历史平均速度是10m/s,共有8条历史轨迹,平均通过时间估计为20m/s。将
Figure BDA00017060941600000710
将这些交通状况特征按照历史时间顺序输入到一个子循环神经网络中,将最后输出的隐层向量h6作为交通状况特征。Finally, use the short and long time traffic condition information in the past to predict the current traffic condition characteristics
Figure BDA0001706094160000076
and
Figure BDA0001706094160000077
Specifically, define
Figure BDA0001706094160000078
is the traffic condition of the current area g 1 in the past 1st to 6th time interval (5min). E.g
Figure BDA0001706094160000079
It means that the historical average speed is 10m/s, there are 8 historical trajectories, and the average transit time is estimated to be 20m/s. Will
Figure BDA00017060941600000710
These traffic condition features are input into a sub-recurrent neural network according to the historical time sequence, and the last output hidden layer vector h6 is used as the traffic condition feature.

二、训练阶段,具体步骤为:Second, the training phase, the specific steps are:

(1)建立双向循环神经网络(Bi-directional LSTM)模型。随机初始化模型的各项参数,包括遗忘门,输入门,输出门的矩阵参数和偏置参数。(1) Establish a bi-directional recurrent neural network (Bi-directional LSTM) model. Randomly initialize various parameters of the model, including forget gate, input gate, output gate matrix parameters and bias parameters.

(2)将历史轨迹数据中提取的特征,即空间特征Vsp,时间特征Vsp,驾驶状态特征Vdri,历史上短时间和长时间的交通状态特征Vshort和Vlong,拼接成一个统一的特征向量。以网格g1为例(2) The features extracted from the historical trajectory data, that is, the spatial feature V sp , the temporal feature V sp , the driving state feature V dri , the historical short-time and long-time traffic state features V short and V long , are spliced into a unified eigenvectors of . Take grid g 1 as an example

Figure BDA0001706094160000081
Figure BDA0001706094160000081

(3)在每一个经过的小网格输入到双向循环神经网络,以得到经过该网格的通过时间,即WT·hi+b。总的行程的时间开销为:(3) Input to the bidirectional recurrent neural network at each passing small grid to obtain the transit time of passing through the grid, namely W T · h i +b. The total travel time cost is:

Figure BDA0001706094160000082
Figure BDA0001706094160000082

例如,定义W=(0.1,0.3,..,0.7),10个网格隐层变量值为h1=(0.8,0.3,…,0.2),…,h10=(0.7,0.4,..0.5),偏置值b=0.7,则:For example, define W=(0.1,0.3,..,0.7), 10 grid hidden layer variable values h 1 =(0.8,0.3,...,0.2),...,h 10 =(0.7,0.4,.. 0.5), the offset value b=0.7, then:

Figure BDA0001706094160000083
Figure BDA0001706094160000083

(4)定义轨迹经过各个网格序列的真实时间开销向量为T。顺序的真实时间开销向量为Tf=(70,120,…,720),逆序的真实时间开销向量为Tb=(720,640,…,50)。则神经网络估计得到的时间开销向量为:(4) Define the real time cost vector of the trajectory passing through each grid sequence as T. The sequential real time cost vector is T f =(70,120,...,720), and the reversed real time cost vector is T b =(720,640,...,50). Then the time cost vector estimated by the neural network is:

Figure BDA0001706094160000084
Figure BDA0001706094160000084

Figure BDA0001706094160000085
Figure BDA0001706094160000085

使用双向区间损失函数对模型进行辅助监督学习,使其不仅学习整条路径的时间开销,同时可以学习各个中间阶段的通行时间。定义双向区间损失函数为:The model is assisted with supervised learning using a bidirectional interval loss function, so that it not only learns the time cost of the entire path, but also learns the transit time of each intermediate stage. The two-way interval loss function is defined as:

Figure BDA0001706094160000086
Figure BDA0001706094160000086

其中,M表示轨迹是否经过小区域的掩码,[]表示向量每个元素间的操作。Among them, M represents whether the trajectory passes through the mask of the small area, and [] represents the operation between each element of the vector.

(5)最小化损失函数L,即:(5) Minimize the loss function L, namely:

Figure BDA0001706094160000087
Figure BDA0001706094160000087

其中,θ表示模型的训练参数,ε表示时间和空间上的嵌入向量,S是训练集的大小。最后,使用基于时间顺序的反向传播算法对模型进行参数的更新和优化。where θ represents the training parameters of the model, ε represents the embedding vector in time and space, and S is the size of the training set. Finally, the parameters of the model are updated and optimized using a time-order-based backpropagation algorithm.

三.预测阶段,具体步骤为(以图2为例):3. Prediction stage, the specific steps are (take Figure 2 as an example):

(1)给定一条没有时间戳标记的真实行程作为查询路径g={g1,g2,…,g8},根据经过的实际路径,得到其映射的网格序列。对于每一个经过的小网格g1~g8,使用特征提取和表示阶段抽取得到的时空特征Vsp和Vtp,驾驶状态特征Vdri,和历史上短时间和长时间的交通状态特征Vshort和Vlong,作为该网格的总特征表示V。其中,时空特征的嵌入向量使用经过训练过程的参数更新后的向量信息。短时和长时间的交通状态特征使用经过训练的子循环神经网络进行特征挖掘。(1) Given a real itinerary without time stamp as the query path g={g 1 , g 2 , ..., g 8 }, according to the actual path passed, the grid sequence of its mapping is obtained. For each passing small grid g 1 -g 8 , use the spatiotemporal features V sp and V tp extracted in the feature extraction and representation stages, the driving state feature V dri , and the historical short- and long-term traffic state features V short and V long , denote V as the overall feature of the grid. Among them, the embedding vector of the spatiotemporal feature uses the vector information updated by the parameters of the training process. Short- and long-term traffic state features are mined using a trained sub-recurrent neural network.

(2)在每一个经过的网格,将抽取的各方面特征输入到已经经过训练的双向循环神经网络中,得到当前的隐变量ht,那么经过当前区域的估计时间为WT·ht+b。而总的时间开销估计值为;(2) In each passing grid, input the extracted features into the bidirectional recurrent neural network that has been trained to obtain the current hidden variable h t , then the estimated time passing through the current area is W T · h t +b. And the total time cost estimate is;

Figure BDA0001706094160000091
Figure BDA0001706094160000091

其中,W和b均由之前训练过程中得到的参数。Among them, W and b are the parameters obtained in the previous training process.

Claims (1)

1. A travel time estimation method based on auxiliary supervised learning is characterized by comprising three stages:
the method comprises the steps of (I) feature extraction and representation, wherein historical track data are preprocessed, and various features of the historical track data are extracted;
in the training stage, the features extracted from the historical track data are input into a uniform bidirectional cyclic neural network for training, and a bidirectional interval loss function is used as the constraint of training;
a prediction stage, deducing the features extracted from the query path by using a bidirectional cyclic neural network and estimating the travel time;
the specific steps of the characteristic extraction and representation stage are as follows:
step (1), in an urban area, carrying out fine-grained division on grids according to longitude and latitude coordinates to form adjacent rectangular small areas; mapping each coordinate point in a track sequence consisting of historical GPS coordinates sequenced according to a time sequence into a corresponding small region to form a sequence consisting of grid coordinates;
step (2), for each grid, excavating the characteristics of different aspects of the grid; firstly, representing semantic information of each grid small region in different spaces and different time periods by using an embedded vector technology; the information comprises spatial location information of different functional areas of a city, and also comprises information of early peak, weekend time; in particular, the space vector V of each mesh is represented by a low-dimensional vectorspDividing a day into a plurality of time buckets, and obtaining a time vector V according to the time bucket in which each track fallstp(ii) a To VspAnd VtpCarrying out random initialization, and then training the model along with the model during model training;
step (3) of using the four-dimensional vector VdriTo indicate whether the current driving phase is a starting phase, a midway phase or an ending phase, and the proportion of driving in each phase;
step (4), defining the short-time traffic condition characteristic as VshortDefining a long-term traffic condition characteristic as VlongIn the case of a liquid crystal display device, in particular,
definition of:
Figure FDA0002929081460000011
Indicating the current small area g in the past jth time intervaliIn which v isjRepresenting historical average speed, njIndicates the amount of historical track data, leni/vjRepresenting a coarse estimated transit time; inputting the traffic condition characteristics into a sub-cyclic neural network according to a historical time sequence so as to extract the traffic condition characteristics;
in addition, traffic condition information of adjacent small areas is taken into account, i.e.
Defining:
Figure FDA0002929081460000012
represents the distance giCollecting the traffic condition characteristics of the grid set with the distance not exceeding d in the past short time, and inputting the traffic condition characteristics into the neural network together; wherein x, y represent the coordinates of the grid, gjRepresents except giOther meshes than the mesh;
the second training stage comprises the following specific steps:
step (1), constructing a recurrent neural network; defining a network hidden layer as
Figure FDA0002929081460000021
Input data as
Figure FDA0002929081460000022
Then, the input data of the t step is xtAnd the calculation result obtained in the t step is htThen, there are:
ht=φ(xt·Wx+ht-1·Wh+b) (3)
wherein,
Figure FDA0002929081460000023
is a weight matrix of the input data,
Figure FDA0002929081460000024
is a weight matrix of the hidden layer(s),
Figure FDA0002929081460000025
is a bias parameter;
i.e. the hidden state is represented as a function:
ht=f(ht-1,xt) (4)
on this basis, define forgetting the door to be:
ft=σ(Wf·[ht-1,xt]+bf) (5)
the input gates are:
it=σ(Wi·[ht-1,xt]+bi) (6)
the output gate is:
ot=σ(Wo[ht-1,xt]+bo) (7)
the memory cell is updated by:
Figure FDA0002929081460000026
Figure FDA0002929081460000027
the hidden layer is updated as follows:
ht=Ot·tanh(Ct) (10)
wherein,
Figure FDA0002929081460000028
weight matrices respectively representing the forgetting gate, the input gate, the output gate and the memory cell,
Figure FDA0002929081460000029
then it is the corresponding bias parameter; σ () is a nonlinear activation function; f () represents an abstract neural network function containing parameters of each layer, and the corresponding parameter of the recurrent neural network is defined as WNFrom [ - α, α]Initializing each element in the uniform distribution, wherein alpha is a hyper-parameter and is set to be in a range of 0.01 to 1;
the bidirectional cyclic neural network simultaneously uses the forward cyclic neural network and the reverse cyclic neural network for calculation; the forward circulation neural network inputs the grid features extracted in the previous step according to the sequence of the sequence in sequence, and the reverse circulation neural network inputs the grid features after the sequence is in reverse order; its hidden variables are defined as the concatenation of the forward and reverse networks, i.e.:
Figure FDA00029290814600000210
step (2), extracting the feature in the historical track data, namely the spatial feature VspTime characteristic VtpDriving state characteristic VdriHistorically short and long time traffic status features VshortAnd VlongAnd splicing into a unified feature vector:
V=(Vsp,Vtp,Vdri,Vshort,Vlong) (11)
at each small grid passing through, the two-way recurrent neural network is input to obtain the passing time of the grid, namely WT·hi+ b, total travel time overhead
Figure FDA0002929081460000031
Comprises the following steps:
Figure FDA0002929081460000032
Figure FDA0002929081460000033
to calculate the total timeWeight matrix and bias parameter of the overhead, WTRepresents the transpose of the W matrix;
step (3), defining the real time overhead vector of the track passing through each grid sequence as T; the sequential real time overhead vector is TfThe real time overhead vector of the reverse order is Tb(ii) a The time overhead vector estimated by the neural network is:
Figure FDA0002929081460000034
Figure FDA0002929081460000035
the model is subjected to auxiliary supervised learning by using a bidirectional interval loss function, so that the time overhead of the whole path can be learned, and the transit time of each intermediate stage can be learned; the two-way interval loss function is defined as:
Figure FDA0002929081460000036
wherein, M represents whether the track passes through the mask of a small region, and [ ] represents the operation among each element of the vector;
step (4), the goal of training is to minimize the loss function L, i.e.:
Figure FDA0002929081460000037
wherein, theta represents the training parameter of the model, epsilon represents the embedding vector on time and space, and S is the size of the training set; finally, updating and optimizing parameters of the model by using a time sequence-based back propagation algorithm;
the third step of the prediction stage comprises the following steps:
step (1), a real journey without a time stamp mark is givenAs a query path, obtaining a grid sequence mapped by the actual path according to the actual path; for each small grid passing through, using space-time characteristics V obtained by characteristic extraction and expression stage extractionspAnd VtpDriving state characteristic VdriAnd historical short and long term traffic condition characteristics VshortAnd VlongV is represented as a total feature of the mesh; the embedded vector of the space-time characteristics uses vector information updated by parameters in a training process; carrying out feature mining on the traffic state features of short time and long time by using the trained sub-cycle neural network;
step (2), inputting the extracted characteristics of all aspects into the trained bidirectional cyclic neural network in each passing grid to obtain the current hidden variable htThen the estimated time to pass through the current region is WT·ht+ b, and the total time overhead estimate is:
Figure FDA0002929081460000038
where n represents the total number of grids passed,
Figure FDA0002929081460000039
for the trained weight matrix and bias parameters, W, of the total time overheadTRepresenting the transpose of the W matrix.
CN201810658375.5A 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning Expired - Fee Related CN109035761B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810658375.5A CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Publications (2)

Publication Number Publication Date
CN109035761A CN109035761A (en) 2018-12-18
CN109035761B true CN109035761B (en) 2021-06-04

Family

ID=64611043

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810658375.5A Expired - Fee Related CN109035761B (en) 2018-06-25 2018-06-25 A Travel Time Estimation Method Based on Assisted Supervised Learning

Country Status (1)

Country Link
CN (1) CN109035761B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109799533B (en) * 2018-12-28 2021-07-27 中国石油化工股份有限公司 Reservoir prediction method based on bidirectional circulation neural network
CN109783771B (en) * 2019-01-22 2021-01-29 清华大学 Processing method and device for converting track sequence into image matrix and storage medium
CN111721306B (en) * 2019-03-20 2022-07-05 北京嘀嘀无限科技发展有限公司 Road matching method and device, electronic equipment and readable storage medium
CN110232169B (en) * 2019-05-09 2022-01-04 北京航空航天大学 Track denoising method based on bidirectional long-time and short-time memory model and Kalman filtering
DE102019209736A1 (en) * 2019-07-03 2021-01-07 Robert Bosch Gmbh Procedure for evaluating possible trajectories
CN110488835B (en) * 2019-08-28 2020-12-08 北京航空航天大学 An intelligent local path planning method for unmanned systems based on double back-propagation neural networks
CN110675632B (en) * 2019-11-11 2021-11-30 重庆邮电大学 Vehicle short-time trajectory prediction control method aiming at multi-feature space and data sparseness
CN110942211A (en) * 2019-12-12 2020-03-31 武汉中海庭数据技术有限公司 Prediction arrival time prediction method and device based on deep neural network
CN111047107B (en) * 2019-12-23 2022-05-10 北京百度网讯科技有限公司 Road traffic time prediction method, device, electronic equipment and storage medium
CN111339156B (en) * 2020-02-07 2023-09-26 京东城市(北京)数字科技有限公司 Method, apparatus and computer readable storage medium for long-term determination of business data
CN111738500B (en) * 2020-06-11 2024-01-12 大连海事大学 Navigation time prediction method and device based on deep learning
CN112101132B (en) * 2020-08-24 2022-04-19 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112017436B (en) * 2020-09-09 2021-09-28 中国科学院自动化研究所 Method and system for predicting urban traffic travel time
CN112859898B (en) * 2021-01-18 2022-03-22 中山大学 Aircraft trajectory prediction method based on two-channel bidirectional neural network
CN115394088B (en) * 2022-10-31 2023-04-07 江苏博宇鑫信息科技股份有限公司 Crossing traffic time prediction method based on space-time attention mechanism
CN116542438B (en) * 2023-03-28 2024-01-30 大连海事大学 A repair method for bus passenger start and end point estimation based on ground truth-free

Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156531B (en) * 2016-08-04 2020-09-01 复旦大学 Travel time estimation method based on low-sampling historical track
CN108073941A (en) * 2016-11-17 2018-05-25 江南大学 A kind of image, semantic generation method based on deep learning
CN107480786B (en) * 2017-08-07 2021-04-30 复旦大学 Output state limitation-based recurrent neural network track likelihood probability calculation method

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103280110A (en) * 2013-06-08 2013-09-04 北京云星宇交通工程有限公司 Method and device for predicting expressway travel time
CN104157142A (en) * 2014-08-27 2014-11-19 河海大学 Urban path travel time forecasting method based on floating vehicle data
CN104269059A (en) * 2014-10-15 2015-01-07 河海大学 City path travel time forecasting method based on multi-source data fusion
CN104299442A (en) * 2014-10-15 2015-01-21 河海大学 Urban route travel time forecasting method based on pattern matching
CN104408958A (en) * 2014-11-11 2015-03-11 河海大学 Urban dynamic route travel time predication method
CN104615897A (en) * 2015-02-13 2015-05-13 北方工业大学 Road section travel time estimation method based on low-frequency GPS data
CN105006147A (en) * 2015-06-19 2015-10-28 武汉大学 Road segment travel time deducing method based on road space-time incidence relation
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN105679021A (en) * 2016-02-02 2016-06-15 重庆云途交通科技有限公司 Travel time fusion prediction and query method based on traffic big data
CN106096767A (en) * 2016-06-07 2016-11-09 中国科学院自动化研究所 A kind of link travel time prediction method based on LSTM
CN106023588A (en) * 2016-06-15 2016-10-12 重庆云途交通科技有限公司 Traffic big data-based travel time extraction, prediction and query method
CN107945507A (en) * 2016-10-13 2018-04-20 腾讯科技(深圳)有限公司 Travel Time Estimation Method and device
CN106940930A (en) * 2016-11-01 2017-07-11 深圳市城市交通规划设计研究中心有限公司 Motorway journeys time prediction system and Forecasting Methodology
KR20180064692A (en) * 2016-12-06 2018-06-15 현대자동차주식회사 A navigator, a vehicle the navigator is installed in, a system for determining a route, a method for controlling the same
WO2018109516A1 (en) * 2016-12-13 2018-06-21 日産自動車株式会社 Automatic driving assistance method and device
CN106981198A (en) * 2017-05-24 2017-07-25 北京航空航天大学 Deep learning network model and its method for building up for predicting travel time
CN107688556A (en) * 2017-07-24 2018-02-13 中山大学 A kind of real-time travel time computation method based on function type principal component analysis

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
Long short-term memory neural network for traffic speed;Xiaolei Ma,Zhimin Tao, Yinhai Wang, Haiyang Yu;《Transportation Research Part C》;20150430(第54期);全文 *
Short-term travel time prediction by deep learning: A comparison of different LSTM-DNN models;Yangdong Liu;《2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC)》;20171031;全文 *
WhenWill You Arrive? Estimating Travel Time Based on Deep Neural Networks;Dong Wang, Junbo Zhang, Wei Cao, Jian Li, Yu Zheng;《Microsoft》;20180131;全文 *
利用浮动车大数据进行稀疏路段行程时间推断;张发明,朱欣焰,呙维,胡涛;《武汉大学学报· 信息科学版》;20170131;第42卷(第1期);全文 *
基于时空贝叶斯模型的行程时间可靠性预测;杨庆芳,韦学武,林赐云,李志林,刘翔宇;《华南理工大学学报( 自然科学版)》;20160430;第44卷(第4期);全文 *

Also Published As

Publication number Publication date
CN109035761A (en) 2018-12-18

Similar Documents

Publication Publication Date Title
CN109035761B (en) A Travel Time Estimation Method Based on Assisted Supervised Learning
Majumdar et al. Congestion prediction for smart sustainable cities using IoT and machine learning approaches
CN109285346B (en) Urban road network traffic state prediction method based on key road sections
CN111653088B (en) Vehicle driving quantity prediction model construction method, prediction method and system
CN110709908B (en) Computer system and method for state prediction of traffic system
CN106971547B (en) A Short-Term Traffic Flow Prediction Method Considering Spatio-temporal Correlations
Yuan et al. Driving with knowledge from the physical world
Wang et al. Efficient metropolitan traffic prediction based on graph recurrent neural network
CN105513359B (en) A kind of urban expressway traffic method for estimating state based on smart mobile phone movement detection
CN104408913B (en) A kind of traffic flow three parameter real-time predicting method considering temporal correlation
CN109670277A (en) A kind of travel time prediction method based on multi-modal data fusion and multiple-model integration
CN110223517A (en) Short-term traffic flow forecast method based on temporal correlation
WO2018122803A1 (en) Smart road traffic anomaly detection method
CN106652534A (en) Method for predicting arrival time of bus
CN108983788A (en) The unmanned sanitation cart intelligence control system and method excavated based on big data
CN106781489A (en) A kind of road network trend prediction method based on recurrent neural network
Chen et al. A multiscale-grid-based stacked bidirectional GRU neural network model for predicting traffic speeds of urban expressways
Mikluščák et al. Using neural networks for route and destination prediction in intelligent transport systems
Xu et al. Trip-oriented travel time prediction (TOTTP) with historical vehicle trajectories
CN112785077A (en) Travel demand prediction method and system based on space-time data
CN116052427B (en) Inter-city inter-regional mobility prediction method and device based on private car travel track data
CN113537626B (en) Method for predicting neural network combined time sequence by aggregating information difference
Xu et al. A taxi dispatch system based on prediction of demand and destination
Zhang et al. A hybrid deep learning approach for urban expressway travel time prediction considering spatial-temporal features
CN114267170A (en) Traffic flow prediction method based on graph space-time transform model considering human mobility

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20210604