CN108230724A - A kind of urban mass-transit system Vehicle station name announcing missing data method for repairing and mending based on maximum probability estimation - Google Patents
A kind of urban mass-transit system Vehicle station name announcing missing data method for repairing and mending based on maximum probability estimation Download PDFInfo
- Publication number
- CN108230724A CN108230724A CN201810094908.1A CN201810094908A CN108230724A CN 108230724 A CN108230724 A CN 108230724A CN 201810094908 A CN201810094908 A CN 201810094908A CN 108230724 A CN108230724 A CN 108230724A
- Authority
- CN
- China
- Prior art keywords
- data
- station
- bus
- time
- missing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 238000005315 distribution function Methods 0.000 claims description 6
- 230000011218 segmentation Effects 0.000 claims description 3
- 230000000694 effects Effects 0.000 abstract description 5
- 238000007418 data mining Methods 0.000 abstract description 3
- 238000004891 communication Methods 0.000 abstract description 2
- 238000011160 research Methods 0.000 abstract description 2
- 238000012360 testing method Methods 0.000 description 9
- 238000012545 processing Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G08—SIGNALLING
- G08G—TRAFFIC CONTROL SYSTEMS
- G08G1/00—Traffic control systems for road vehicles
- G08G1/123—Traffic control systems for road vehicles indicating the position of vehicles, e.g. scheduled vehicles; Managing passenger vehicles circulating according to a fixed timetable, e.g. buses, trains, trams
- G08G1/133—Traffic control systems for road vehicles indicating the position of vehicles, e.g. scheduled vehicles; Managing passenger vehicles circulating according to a fixed timetable, e.g. buses, trains, trams within the vehicle ; Indicators inside the vehicles or at stops
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
- G06N5/041—Abduction
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Data Mining & Analysis (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Traffic Control Systems (AREA)
Abstract
本发明针对城市公交车载报站系统中存在数据缺失的问题,提供了一种基于极大概率估计的城市公交系统车载报站缺失数据修补方法,包括以下步骤:通过分析缺失数据的特征以及公交乘客的刷卡行为特点并结合历史数据,构造了以行程时间作为后验条件参数集合的极大概率估计模型,借助乘客刷卡数据准确推断了城市公交系统中自动报站设备的缺失数据。本发明所提供的数据修补方法,克服了现有技术中受城市建筑阴影效应影响,公交车载自动报站系统易出现定位失败或通信丢包,导致报站信息不完整严重影响后续数据挖掘效果的缺陷,提高了乘客公交出行轨迹还原的成功率,增加了城市公交客流分布研究的信息有效性。
Aiming at the problem of missing data in the urban bus station announcement system, the present invention provides a method for repairing the missing data of the urban bus station announcement system based on maximum probability estimation, which includes the following steps: by analyzing the characteristics of the missing data and the bus Combining the characteristics of passengers' card swiping behavior with historical data, a maximum probability estimation model with travel time as a set of posterior condition parameters was constructed, and the missing data of automatic station announcement equipment in the urban bus system was accurately inferred with the help of passenger card swiping data. The data repair method provided by the present invention overcomes the influence of the shadow effect of urban buildings in the prior art, and the automatic bus station reporting system is prone to positioning failure or communication packet loss, resulting in incomplete station reporting information and seriously affecting the subsequent data mining effect It improves the success rate of passenger bus travel trajectory restoration and increases the information validity of urban bus passenger flow distribution research.
Description
技术领域technical field
本发明涉及智能化公共交通技术领域,涉及一种基于极大概率估计的城市公交系统车载报站缺失数据修补方法。The invention relates to the technical field of intelligent public transportation, and relates to a method for repairing missing data of on-vehicle call stations in urban public transportation systems based on maximum probability estimation.
背景技术Background technique
公交客流分布数据,是优化城市公交运营管理的基础,是制约城市公交管理水平的关键,也是城市公交研究者关注的重点。长期以来,管理者一般通过抽样调查方法,获得有限的公交出行数据,费时费力、成本高昂。近年来,随着自动收费系统(Automated FareCollection,AFC)与自动报站系统(Automated Vehicle Locating,AVL)的广泛应用,城市公交系统积累了大量运营管理数据。如何通过数据挖掘技术,从公交系统现有数据中低成本地获取公交客流信息,成为国内外城市公交研究者关注的重点。The distribution data of bus passenger flow is the basis for optimizing the operation and management of urban public transport, the key to restricting the level of urban public transport management, and the focus of urban public transport researchers. For a long time, managers have generally obtained limited bus travel data through sampling surveys, which is time-consuming, laborious and costly. In recent years, with the widespread application of Automated FareCollection (AFC) and Automated Vehicle Locating (AVL), urban public transport systems have accumulated a large amount of operational management data. How to obtain bus passenger flow information from the existing data of the bus system at low cost through data mining technology has become the focus of urban bus researchers at home and abroad.
在获取公交客流的相关研究中,将乘客乘车刷卡数据转换为乘客在公交系统中的运动轨迹是其中最为关键的环节。然而,现有的公交刷卡系统往往将设计重点放在运营清分功能上,而忽视记录刷卡站点,特别是目前使用最为广泛的一票制公交,只能记录乘客的上车时刻与车号、线路号,无上下车站点信息。因此,在数据处理过程中,往往需要先要将刷卡记录结合GPS行车记录系统的报站记录进行时间匹配来推断公交乘客刷卡上车站点,再基于最短距离换乘假设、连续出行链假设推断乘客的下车站点,最后,对于无法找到下车站点的刷卡记录,通过站点吸引权法估计下车站点。In the relevant research on obtaining bus passenger flow, the most critical link is to convert the passenger card swiping data into the passenger's movement trajectory in the bus system. However, the existing bus card swiping system often focuses on the operation and sorting function, and ignores the recording of the card swiping station, especially the most widely used one-ticket bus at present, which can only record the passenger boarding time and bus number, Line number, no information about getting on and off the bus. Therefore, in the process of data processing, it is often necessary to first match the card swiping records with the station reporting records of the GPS driving record system to infer the bus passengers’ swiping card boarding stations, and then infer passengers based on the shortest distance transfer assumption and continuous travel chain assumption. Finally, for the card swiping records where the drop-off site cannot be found, the drop-off site is estimated by the site attraction right method.
上述数据处理过程严重依赖车辆报站数据,然而,车载自动报站系统严重依赖GPS模块与2G移动通讯模块获取、上传位置信息,该类设备受城市建筑阴影效应影响严重,易出现定位失败或通信丢包,导致报站信息不完整。通过分析某城市的公交车辆AVL报站数据、班次调度数据以及AFC刷卡数据,发现在统计时间段内全市平均缺失数据占比6.25%,且数据缺失率高的区域位于该城市人口稠密区域。虽然数据缺失率不到10%,但后续的公交乘客上车站点匹配结果表明,缺失数据导致超过25%的刷卡数据无法匹配出上车站点。由此可见,城市公交系统中普遍存在的AVL报站数据缺失情况,已显著影响了乘客出行轨迹还原进程,并严重制约着公交客流挖掘效果。The above data processing process relies heavily on the vehicle station reporting data. However, the vehicle automatic station reporting system relies heavily on the GPS module and 2G mobile communication module to obtain and upload location information. This type of equipment is seriously affected by the shadow effect of urban buildings, and is prone to positioning failure or communication failure. Packets are lost, resulting in incomplete station reporting information. By analyzing the AVL station reporting data, shift scheduling data and AFC credit card data of a certain city, it is found that the average missing data in the city accounts for 6.25% during the statistical period, and the areas with high data missing rates are located in densely populated areas of the city. Although the data missing rate is less than 10%, the subsequent matching results of bus passengers' boarding stations show that more than 25% of the card swiping data cannot be matched to the boarding stations due to missing data. It can be seen that the lack of AVL station reporting data that is common in urban public transport systems has significantly affected the process of passenger travel trajectory restoration and seriously restricted the mining effect of bus passenger flow.
发明内容Contents of the invention
本发明的目的在于针对城市公交自动报站系统中普遍存在的数据丢失情况,提供一种基于极大概率估计的城市公交系统车载报站缺失数据修补方法。该方法通过分析缺失数据的特征以及公交乘客的刷卡行为特点,构造了以行程时间作为后验条件参数集合的极大概率估计模型,准确推断了城市公交系统缺失的车载报站数据,为开展后续的数据挖掘提供必要支撑。The purpose of the present invention is to provide a method for repairing the missing data of the urban public transport system based on the estimation of the maximum probability for the ubiquitous data loss situation in the automatic station announcement system of the urban public transport. By analyzing the characteristics of the missing data and the characteristics of bus passengers' credit card swiping behavior, this method constructs a maximum probability estimation model with travel time as the posterior condition parameter set, and accurately infers the missing on-board station announcement data of the urban bus system. Data mining provides the necessary support.
为了达到上述目的,本发明采用以下技术方案:In order to achieve the above object, the present invention adopts the following technical solutions:
本发明一种基于极大概率估计的城市公交系统车载报站缺失数据修补方法,包括以下步骤:A method for repairing missing data of vehicle-mounted station announcements in urban public transport systems based on maximum probability estimation of the present invention comprises the following steps:
S1、按每一次发车,对数据源2“AVL车辆报站数据”进行归类整理,将每一次发车的AVL报站数据根据公交线路站序排列:{S0,S1,S2,…,Si},其中Si对应车辆到达Si站点的AVL信息;S1. According to each departure, classify and organize the data source 2 "AVL vehicle station announcement data", and arrange the AVL station announcement data of each departure according to the station sequence of the bus line: {S 0 , S 1 , S 2 ,… ,S i }, where S i corresponds to the AVL information of the vehicle arriving at the S i site;
S2、将步骤S1中的某一次发车数据,与数据源1“公交调度数据”对应公交线路的行车站序进行对比;S2. Comparing the departure data of a certain time in step S1 with the sequence of stops of the bus line corresponding to the data source 1 "bus scheduling data";
S21、若站点信息完整,则不存在数据缺失,无需修复;S21. If the site information is complete, there is no missing data, and there is no need to repair;
S22、若站点信息不完整,则说明存在AVL数据缺失的情况,锁定存在数据丢失的站点区间Sl={S0,SL0,SL1,…,SLk,…,S1},并计算出该缺失数据区间的行程时间,记为tTRIP;S22. If the site information is incomplete, it means that there is a lack of AVL data, lock the site interval S l = {S 0 , S L0 , S L1 ,...,S Lk ,...,S 1 } with data loss, and calculate The travel time out of the missing data interval, denoted as t TRIP ;
S3、根据数据源3提供的“乘客AFC刷卡数据”,查询搭乘本次行程公交班车从S0到Si途中的乘客刷卡数据,并利用阈值分割法提取出有乘客刷卡乘车的站点中第1个刷卡事件的刷卡时间戳,其中SLk站第一个上车乘客的刷卡时刻记为 S3. According to the "passenger AFC card swiping data" provided by data source 3, query the card swiping data of passengers taking the bus on this journey from S 0 to S i , and use the threshold segmentation method to extract the first station where passengers swiped their cards to board the bus The card-swiping time stamp of a card-swiping event, where the card-swiping moment of the first boarding passenger at S Lk station is recorded as
S4,借助历史数据,分别计算从S0出发到SL0,SL1,…,SLk,满足控制条件Θ的行程时间概率分布函数:优选的,Θ=tTRIP∈[tTRIP-0.1×tTRIP,tTRIP+0.1×tTRIP];S4, with the help of historical data, calculate the travel time probability distribution function from S 0 to S L0 , S L1 ,...,S Lk satisfying the control condition Θ: Preferably, Θ=t TRIP ∈[t TRIP -0.1×t TRIP ,t TRIP +0.1×t TRIP ];
S5、将Sl行程中每个站点SLk第1个刷卡事件的刷卡时间戳减去S0的出站时刻,得到每个站点的行程时间,对任一行程时间,代入由S4得到的各站时间分布函数,得到概率值,输出最大值的函数所对应的站点即为时刻车辆到达的车站,数据推断完成。S5. The card swiping time stamp of the first card swiping event of each station S Lk in the itinerary of S l Subtract the outbound time of S 0 to obtain the travel time of each station. For any travel time, substitute the time distribution function of each station obtained by S4 to obtain the probability value, and the station corresponding to the function that outputs the maximum value is The station where the vehicle arrives at the moment, and the data inference is completed.
作为优选的技术方案,所述数据源1“公交调度数据”包括:a)班次编号;b)车牌号;c)线路号;d)发车时刻;e)站序;As a preferred technical solution, the data source 1 "bus scheduling data" includes: a) shift number; b) license plate number; c) line number; d) departure time; e) station sequence;
所述数据源2“AVL车辆报站数据”包括:a)车牌号;b)线路号;c)车站名;d)进站时刻;e)出站时刻;The data source 2 "AVL vehicle station reporting data" includes: a) license plate number; b) line number; c) station name; d) entry time; e) exit time;
所述数据源3“乘客AFC刷卡数据”包括:a)车牌号;b)线路号;c)车站名;d)刷卡时刻;e)乘客标识;f)站点匹配状态。The data source 3 "passenger AFC swiping card data" includes: a) license plate number; b) line number; c) station name; d) card swiping time; e) passenger identification; f) station matching status.
作为优选的技术方案,步骤S22具体为:As a preferred technical solution, step S22 is specifically:
通过对比当前车次的AVL报站数据与完整的公交线路行车站序,锁定存在数据丢失的站点区间,标记缺失的站点编号,假设Sl={S0,SL0,SL1,…,SLk,…,S1},为本次发车过程中车辆连续经过的一段站点区间,其中,S0与S1为数据完整的站点,SL0,SL1,…,SLk为S0与S1之间报站数据缺失的站点。By comparing the AVL station reporting data of the current train number with the complete bus line station sequence, lock the station interval with data loss, and mark the missing station number, assuming S l ={S 0 ,S L0 ,S L1 ,…,S Lk ,...,S 1 }, is a section of station intervals that vehicles pass through continuously during this departure, where S 0 and S 1 are stations with complete data, S L0 , S L1 ,...,S Lk are S 0 and S 1 Stations with missing station reporting data.
作为优选的技术方案,步骤S3具体为:As a preferred technical solution, step S3 is specifically:
由于乘客刷卡为短间隔连续事件,可设定时间阈值作为分割相邻车站Si,Si+1乘客的参考,即,若相邻刷卡记录间隔时间小于设定时间,则将该相邻刷卡记录视作同一车站的刷卡记录;否则后一条刷卡记录为下一站的第一条刷卡记录。Since passenger card swiping is a short-interval continuous event, the time threshold can be set as a reference for segmenting adjacent stations S i , S i+1 passengers, that is, if the interval between adjacent card swiping records is less than the set time, the adjacent card swiping The record is regarded as the card swiping record at the same station; otherwise, the last card swiping record is the first card swiping record at the next station.
作为优选的技术方案,步骤S4中,所述历史数据为不少于四周的历史AVL数据。As a preferred technical solution, in step S4, the historical data is historical AVL data of not less than four weeks.
作为优选的技术方案,步骤S4中,某存在上车乘客的未知站点SLk,第一个刷卡上车的乘客产生的刷卡时间戳为则该班次,公交车在该站的进站时刻可等价为 As a preferred technical solution, in step S4, at an unknown station S Lk where there are passengers boarding the bus, the time stamp of swiping the card generated by the first passenger who boards the bus is Then for this frequency, the arrival time of the bus at this station can be equivalent to
作为优选的技术方案,步骤S4中,假设服从高斯分布:As a preferred technical solution, in step S4, it is assumed that Follow a Gaussian distribution:
则,在Θ确定后,即可查询历史AVL数据中所有从站点S0到站点SLK且满足控制条件Θ的行程,计算模型参数时间的均值μ和标准差σ。Then, after Θ is determined, all trips from site S 0 to site S LK in the historical AVL data that satisfy the control condition Θ can be queried, and the mean value μ and standard deviation σ of the model parameter time can be calculated.
本发明与现有技术相比,具有如下优点和有益效果:Compared with the prior art, the present invention has the following advantages and beneficial effects:
1、本发明通过分析缺失数据的特征以及公交乘客的刷卡行为特点,构造了以行程时间作为后验条件参数集合的极大概率估计模型,充分发挥海量数据的优势,特别适合样本量巨大的场合。1. The present invention constructs a maximum probability estimation model with travel time as a set of posterior condition parameters by analyzing the characteristics of missing data and the card swiping behavior of bus passengers, and fully utilizes the advantages of massive data, especially suitable for occasions with huge sample sizes .
2、本发明采用的方法,构造某一运营车辆一次行程中连续经过的站点集St={S0,SL0,SL1,...,SLK,S1},其中SL0,SL1,...,SLK是数据完整站点S0和S1之间发生数据缺失的站点,可以适用于连续缺失多个站点的修复,修复效率高。2. The method adopted in the present invention is to construct a station set S t = {S 0 , S L0 , S L1 ,..., S LK , S 1 } that a certain operating vehicle passes through continuously in one trip, where S L0 , S L1 ,...,S LK are sites with missing data between the data complete sites S 0 and S 1 , which can be applied to the repair of multiple consecutive missing sites with high repair efficiency.
3、相比传统的方法,本发明数据还原准确率高,在连续缺失站点少于等于8个的情况下,能保持85%以上的还原准确率。3. Compared with the traditional method, the data restoration accuracy rate of the present invention is high, and the restoration accuracy rate of more than 85% can be maintained in the case of less than or equal to 8 consecutive missing sites.
附图说明Description of drawings
图1是本实施例公交系统AVL缺失数据修补方法的整体工作流程图。Fig. 1 is the overall working flow chart of the method for repairing missing AVL data of the public transportation system in this embodiment.
图2是本实施例不同后验证条件下数据还原成功率。FIG. 2 shows the success rate of data restoration under different post-verification conditions in this embodiment.
具体实施方式Detailed ways
下面结合实施例及附图对本发明作进一步详细的描述,但本发明的实施方式不限于此。The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings, but the embodiments of the present invention are not limited thereto.
实施例Example
以某城市公交数据为例,提取2017年3月5日的AVL车辆报站数据为基础数据源,扫描AVL报站数据中每辆车的报站数据,以15%的概率随机选择开始站点,并取出之后8个站点的AVL报站数据加入备选测试集,共计取得94750条测试数据。对备选测试集中每一比数据,除第一个与最后一个报站记录(S0、S1)外的所有报站记录进站时刻加入均匀分布的随机干扰(范围:0-120s),得到测试集。测试集中每笔测试数据包含的关键信息字段如表1所示:Taking the bus data of a certain city as an example, extract the AVL vehicle station announcement data on March 5, 2017 as the basic data source, scan the station announcement data of each vehicle in the AVL station announcement data, and randomly select the starting station with a probability of 15%. After taking out the AVL station reporting data of the 8 stations and adding them to the candidate test set, a total of 94,750 pieces of test data were obtained. For each ratio of data in the candidate test set, uniformly distributed random interference (range: 0-120s) is added to all station reporting records except the first and last station reporting records (S 0 , S 1 ) when entering the station, get the test set. The key information fields contained in each test data in the test set are shown in Table 1:
表1 测试数据的关键信息字段Table 1 Key information fields of test data
因此,缺失数据推断任务为扫描测试集中所有记录,并根据其第一条与最后一条报站记录(S0、S1)及包含扰动的时间戳TL0‘,TL1’,…,TLk‘推断出SL0,SL1,…,SLk。Therefore, the missing data inference task is to scan all records in the test set, and according to the first and last station reporting records (S 0 , S 1 ) and timestamps T L0 ',T L1 ',…,T Lk containing disturbances 'Deduce S L0 , S L1 ,...,S Lk .
本发明提供一种基于极大概率估计的公交系统AVL缺失数据修补方法,其流程如图1所示,包括以下步骤:The present invention provides a method for repairing AVL missing data in public transport systems based on maximum probability estimation, the process of which is as shown in Figure 1, comprising the following steps:
第一步,按每一次发车,提取其AVL数据,并按照时间先后排序。The first step is to extract the AVL data of each departure and sort them in chronological order.
第二步,对比该线路的行车站序,1)若站点信息完整,则不存在数据缺失,无需修复;2)若站点信息不完整,则说明存在AVL数据缺失的情况,需要进行修复。对需要修复的情况,构造Sl={S0,SL0,SL1,…,SLk,…,S1}为该次行程中连续经过的站点区间,其中,S0与S1为数据完整的站点,SL0,SL1,…,SLk为S0与S1之间缺失报站数据站点。The second step is to compare the station sequence of the line. 1) If the station information is complete, there is no data loss and no need to repair; 2) if the station information is incomplete, it means that there is AVL data missing and needs to be repaired. For the situation that needs to be repaired, construct S l = {S 0 , S L0 , S L1 ,...,S Lk ,...,S 1 } as the station intervals that pass continuously in this trip, where S 0 and S 1 are data The complete stations, S L0 , S L1 ,..., S Lk are stations with missing station reporting data between S 0 and S 1 .
第三步,根据数据源3提供的“乘客AFC刷卡数据”,查询搭乘本次行程公交班车从S0到Si途中的乘客刷卡数据,并利用阈值分割法提取出有乘客刷卡乘车的站点中第1个刷卡事件的刷卡时间戳,其中SLk站第一个上车乘客的刷卡时刻记为 In the third step, according to the "passenger AFC card swiping data" provided by data source 3, query the card swiping data of passengers taking the bus from S 0 to S i on this journey, and use the threshold segmentation method to extract the stations where passengers swiped their cards to board the bus The card swiping time stamp of the first card swiping event in , where the card swiping time of the first boarding passenger at S Lk station is recorded as
第四步,借助历史数据,分别计算从S0出发到SL0,SL1,…,SLk,满足控制条件Θ的行程时间概率分布函数:Θ=tTRIP∈[tTRIP-0.1×tTRIP,tTRIP+0.1×tTRIP]为经过试验的优选方案,并可以根据实际情况灵活修改;故Θ可以是其它控制条件。The fourth step is to use historical data to calculate the travel time probability distribution function from S 0 to S L0 , S L1 ,...,S Lk satisfying the control condition Θ: Θ=t TRIP ∈[t TRIP -0.1×t TRIP ,t TRIP +0.1×t TRIP ] is the optimal scheme after testing, and can be flexibly modified according to the actual situation; therefore Θ can be other control conditions.
第五步,将Sl行程中每个站点SLk第1个刷卡事件的刷卡时间戳减去S0的出站时刻,得到每个站点的行程时间,对任一行程时间,代入由S4得到的各站时间分布函数,得到概率值,输出最大值的函数所对应的站点即为时刻车辆到达的车站,数据推断完成。The fifth step, the time stamp of the first card swiping event of each station S Lk in the itinerary of S l Subtract the outbound time of S 0 to obtain the travel time of each station. For any travel time, substitute the time distribution function of each station obtained by S4 to obtain the probability value, and the station corresponding to the function that outputs the maximum value is the time The station where the vehicle arrives, and the data inference is completed.
以不同的控制条件构建的数据修补模型性能对比如图2所示,本发明采用基于行程时间的极大概率估计模型,相对与先验概率模型,以及基于开始时刻等其它概率模型,本发明所提供的优选方案数据修补效果最佳,在连续缺失站点数小于等于8的情况下保持了85%以上的修复准确度。The performance comparison of data patching models constructed with different control conditions is shown in Figure 2. The present invention uses a maximum probability estimation model based on travel time, relative to the prior probability model, and other probability models based on the start time. The optimal scheme provided has the best data repair effect, and maintains a repair accuracy of more than 85% when the number of consecutive missing stations is less than or equal to 8.
上述实施例为本发明较佳的实施方式,但本发明的实施方式并不受上述实施例的限制,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。The above-mentioned embodiment is a preferred embodiment of the present invention, but the embodiment of the present invention is not limited by the above-mentioned embodiment, and any other changes, modifications, substitutions, combinations, Simplifications should be equivalent replacement methods, and all are included in the protection scope of the present invention.
Claims (7)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810094908.1A CN108230724B (en) | 2018-01-31 | 2018-01-31 | A Method of Repairing Missing Data of Vehicle-mounted Stations in Urban Public Transport System Based on Maximum Probability Estimation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810094908.1A CN108230724B (en) | 2018-01-31 | 2018-01-31 | A Method of Repairing Missing Data of Vehicle-mounted Stations in Urban Public Transport System Based on Maximum Probability Estimation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108230724A true CN108230724A (en) | 2018-06-29 |
CN108230724B CN108230724B (en) | 2020-05-22 |
Family
ID=62670264
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810094908.1A Expired - Fee Related CN108230724B (en) | 2018-01-31 | 2018-01-31 | A Method of Repairing Missing Data of Vehicle-mounted Stations in Urban Public Transport System Based on Maximum Probability Estimation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108230724B (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109523821A (en) * | 2018-11-30 | 2019-03-26 | 湖南智慧畅行交通科技有限公司 | It is a kind of that point calculating method leaving from station is arrived based on city bus GPS information |
CN109615036A (en) * | 2018-11-30 | 2019-04-12 | 深圳大学 | Fine particle exposure risk assessment method based on bus IC card swiping system |
CN109727474A (en) * | 2019-01-29 | 2019-05-07 | 苏州工业园区测绘地理信息有限公司 | A kind of bus based on fused data precise recognition method out of the station |
CN110738845A (en) * | 2019-09-20 | 2020-01-31 | 江苏大学 | bus GPS data complement method and complement system based on abnormal data processing |
CN112767685A (en) * | 2020-12-22 | 2021-05-07 | 荆门汇易佳信息科技有限公司 | Public transport passenger flow analysis system based on positioning and card swiping information |
CN113344268A (en) * | 2021-06-02 | 2021-09-03 | 合肥泰瑞数创科技有限公司 | Urban traffic trip data analysis method |
CN116611984A (en) * | 2023-07-11 | 2023-08-18 | 鹏城实验室 | Travel data processing method, system, equipment and medium under multiple modes |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105574137A (en) * | 2015-12-15 | 2016-05-11 | 华南理工大学 | Public transportation multi-source data fusion based IC card swiping station matching method |
US20160231129A1 (en) * | 2015-02-05 | 2016-08-11 | Moovit App Global Ltd | Public and ordered transportation trip planning |
CN106448233A (en) * | 2016-08-19 | 2017-02-22 | 大连理工大学 | Big data based bus route time table collaborative optimization method |
CN106816029A (en) * | 2016-10-15 | 2017-06-09 | 广东探金电子科技有限公司 | A kind of control system of automatic bus stop announcing |
-
2018
- 2018-01-31 CN CN201810094908.1A patent/CN108230724B/en not_active Expired - Fee Related
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160231129A1 (en) * | 2015-02-05 | 2016-08-11 | Moovit App Global Ltd | Public and ordered transportation trip planning |
CN105574137A (en) * | 2015-12-15 | 2016-05-11 | 华南理工大学 | Public transportation multi-source data fusion based IC card swiping station matching method |
CN106448233A (en) * | 2016-08-19 | 2017-02-22 | 大连理工大学 | Big data based bus route time table collaborative optimization method |
CN106816029A (en) * | 2016-10-15 | 2017-06-09 | 广东探金电子科技有限公司 | A kind of control system of automatic bus stop announcing |
Non-Patent Citations (1)
Title |
---|
胡继华等: "结合出行链的公交IC卡乘客下车站点判断概率模型", 《交通运输系统工程与信息》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109523821A (en) * | 2018-11-30 | 2019-03-26 | 湖南智慧畅行交通科技有限公司 | It is a kind of that point calculating method leaving from station is arrived based on city bus GPS information |
CN109615036A (en) * | 2018-11-30 | 2019-04-12 | 深圳大学 | Fine particle exposure risk assessment method based on bus IC card swiping system |
CN109727474A (en) * | 2019-01-29 | 2019-05-07 | 苏州工业园区测绘地理信息有限公司 | A kind of bus based on fused data precise recognition method out of the station |
CN110738845A (en) * | 2019-09-20 | 2020-01-31 | 江苏大学 | bus GPS data complement method and complement system based on abnormal data processing |
CN112767685A (en) * | 2020-12-22 | 2021-05-07 | 荆门汇易佳信息科技有限公司 | Public transport passenger flow analysis system based on positioning and card swiping information |
CN113344268A (en) * | 2021-06-02 | 2021-09-03 | 合肥泰瑞数创科技有限公司 | Urban traffic trip data analysis method |
CN113344268B (en) * | 2021-06-02 | 2022-04-19 | 合肥泰瑞数创科技有限公司 | Urban traffic trip data analysis method |
CN116611984A (en) * | 2023-07-11 | 2023-08-18 | 鹏城实验室 | Travel data processing method, system, equipment and medium under multiple modes |
CN116611984B (en) * | 2023-07-11 | 2024-02-02 | 鹏城实验室 | Travel data processing method, system, equipment and medium under multiple modes |
Also Published As
Publication number | Publication date |
---|---|
CN108230724B (en) | 2020-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108230724A (en) | A kind of urban mass-transit system Vehicle station name announcing missing data method for repairing and mending based on maximum probability estimation | |
CN110533912B (en) | Driving behavior detection method and device based on block chain | |
CN101540098B (en) | Matching method based on bus IC card line | |
CN108335377B (en) | GIS technology-based automatic check method for road inspection vehicle service | |
CN104933869B (en) | Anti-noise type satellite positioning data-based method for identifying vehicle violation behaviors | |
CN102324128B (en) | Method for predicting OD (Origin-Destination) passenger flow among bus stations on basis of IC (Integrated Circuit)-card record and device | |
US20230274633A1 (en) | Method and system for extracting od locations of vehicle on expressway | |
CN103035123B (en) | Abnormal data acquisition methods and system in a kind of traffic track data | |
CN108288321A (en) | Method is determined based on the volume of the flow of passengers above and below the bus station of IC card data and vehicle GPS information | |
CN102157075A (en) | Method for predicting bus arrivals | |
CN108806301B (en) | Automatic identification method for bus information | |
CN109272168A (en) | Urban rail transit passenger flow change trend prediction method | |
CN111932925A (en) | Method, device and system for determining travel passenger flow of public transport station | |
CN100590674C (en) | A system for predicting and querying the real-time status of buses | |
CN102156732A (en) | Station matching method of bus IC card data based on characteristic stations | |
CN103838868B (en) | Urban bus arrival time predicting method based on multi-bus-route operating data fusion | |
CN114363842B (en) | Bus passenger departure station prediction method and device based on mobile phone signaling data | |
CN112734219A (en) | Vehicle transportation driving behavior analysis method and system | |
CN117115562B (en) | Inbound railway freight train carriage counting method based on deep learning | |
CN112036757A (en) | Parking transfer parking lot site selection method based on mobile phone signaling and floating car data | |
CN105792134A (en) | Carpooling method and device | |
KR20120068580A (en) | Trip pattern analysis system of the public transportation using the transportation card data and the trip pattern analysis methods of the same | |
Cyril et al. | Electronic ticket machine data analytics for public bus transport planning | |
CN110164136B (en) | Fake-licensed vehicle identification method | |
CN114037392B (en) | Intelligent dispatching distance matrix generation method and device based on vehicle trajectory |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200522 |
|
CF01 | Termination of patent right due to non-payment of annual fee |