WO2020186770A1 - Visual method for analyzing taxi pick-up or drop-off features - Google Patents

Visual method for analyzing taxi pick-up or drop-off features Download PDF

Info

Publication number
WO2020186770A1
WO2020186770A1 PCT/CN2019/115056 CN2019115056W WO2020186770A1 WO 2020186770 A1 WO2020186770 A1 WO 2020186770A1 CN 2019115056 W CN2019115056 W CN 2019115056W WO 2020186770 A1 WO2020186770 A1 WO 2020186770A1
Authority
WO
WIPO (PCT)
Prior art keywords
taxi
getting
data
boarding
travel
Prior art date
Application number
PCT/CN2019/115056
Other languages
French (fr)
Chinese (zh)
Inventor
吕伟韬
杨树
李璐
潘阳阳
Original Assignee
江苏智通交通科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 江苏智通交通科技有限公司 filed Critical 江苏智通交通科技有限公司
Publication of WO2020186770A1 publication Critical patent/WO2020186770A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/40Business processes related to the transportation industry

Definitions

  • the invention relates to a visualized method for analyzing the characteristics of taxi getting on and off.
  • the research on the hot spots and travel purpose of taxis is the key to taxi scheduling and route optimization.
  • the traditional visual display of taxi travel mode is mainly heat map or cold hotspot drawing research.
  • the drawing of the taxi boarding and boarding heat map mainly depends on the two parameters of the kernel size and the search radius.
  • Each pick-up point is regarded as the center of the two-dimensional Gaussian distribution.
  • the kernel size determines the discrete type of the two-dimensional Gaussian distribution
  • the search radius determines the area
  • the number of the inner pick-up and drop-off points can be obtained by summing these two-dimensional Gaussian distributions to obtain the hotspot or heat map, but the optimal solution of this data method comes from two parameters, and the values of the two parameters are determined by experience .
  • taxi pick-up and drop-off points are generally along urban roads, while traditional heat map or cold and hot research and drawing uses each pick-and-drop point as the center of a two-dimensional Gaussian distribution.
  • the Gaussian distribution has no direction constraint, which means that In the heat map, there are also getting on and off behavior in areas without road coverage.
  • this visual display method of heat map and heat map can only display regional hot spots, and cannot display time information. It is necessary to manually filter the data collection statistical time period before visual display.
  • the purpose of the present invention is to provide a visualized taxi boarding and boarding feature analysis method, so that users can effectively grasp the taxi boarding and boarding travel feature information in the road network and avoid the traditional
  • the heat map display method needs to manually determine the empirical mode of parameter information, and solves the problem of how to display the time and space information of the road and the car together in the existing technology, so that the traffic control user can clearly grasp the information of the taxi alight and the car on each road problem.
  • a visualized taxi boarding and boarding feature analysis method by extracting the boarding and boarding information from the taxi GPS trajectory data, clustering analysis of taxi travel features based on the K-medoids algorithm, and then analyzing the average fluctuation trend curve based on the road section.
  • the pattern shape of the extracted and clustered daily commuting travel purpose feature analysis, and finally the taxi travel feature information of each road section is presented in the GIS electronic map, including the following steps:
  • step S3 Use k-medoids to perform cluster analysis on the frequency domain values transformed in step S2 to determine the type of taxi travel, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode;
  • step S5 Based on the travel characteristics determined in step S3 and the commuter travel purpose of each road segment determined in step S4, mark the characteristic information of the taxi boarding and boarding travel purpose of each road segment in the GIS electronic map.
  • step S1 is specifically:
  • step S11 the taxi getting on and off data is extracted, specifically,
  • the taxi status changes from empty to heavy, indicating that there are passengers
  • the status of the taxi carrying passengers from heavy to empty means that the passengers get off, and then the points of getting on and off are extracted from the GPS track data and arranged in time series.
  • step S12 the average daily getting on and off status of each road section at each time period is obtained, specifically,
  • S121 Determine the road section according to the urban road network information, the distance between adjacent intersections is the road section, and then rely on the longitude and latitude (lng i , lat i ) in the GPS trajectory data to divide the GPS trajectory data into each road section; Time sorts and sorts the time information t i in the GPS trajectory data, and then integrates the number of getting on and off vehicles in each time period of each road segment;
  • S122 Solve the average value of taxis getting on and off at each time period of each day, and obtain the average number of getting on and off vehicles at each time of each road section each day, and arrange them in time series.
  • the specific Fourier formula is:
  • represents frequency
  • t represents time
  • e- i ⁇ t is a complex variable function
  • f(t) is the amount of data for getting on and off cars
  • F( ⁇ ) is the frequency value of getting on and off cars.
  • step S3 the taxi travel mode is determined, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode, specifically,
  • This kind of visualized taxi boarding and boarding feature analysis method is compared with the traditional heat map display method. This method does not need to manually set the kernel size and search radius parameter values based on experience, and can avoid the traditional heat map from configuring the kernel through manual experience
  • the size and search radius result in the visualization of the area of the pick-up and drop-off point.
  • This kind of visualized taxi boarding and boarding feature analysis method uses Fourier transform to convert the time series boarding and boarding data in the GPS trajectory data into frequency values. Spatio-temporal information will not be lost during the conversion, which is more convenient for data clustering analysis.
  • the method of the present invention uses the K-medoids center point method to perform cluster analysis on GPS getting on and off trajectory data, and divides the four situations of daily commuting travel mode, holiday travel mode, mixed travel mode and data missing travel mode, and based on getting on and off.
  • the trend graph shape (English letters) of the average fluctuation trend of the data divides the characteristic information of the taxi travel purpose, so that the characteristic information of the taxi travel is displayed by superimposing the English letters on the GIS electronic map.
  • this kind of visualized taxi boarding and boarding feature analysis method by extracting and integrating GPS trajectory information, analyzing taxi boarding and boarding data, clustering analysis of taxi travel patterns with K-medoids algorithm, and showing the trend graph of the fluctuations of the data on boarding and alighting
  • the shape of the English letters analyzes travel characteristics, thereby forming an intuitive and clear innovative visual display of taxi boarding and boarding characteristics.
  • This kind of visualized taxi boarding and boarding feature analysis method is compared with the spatial clustering analysis of traditional heat map, through k-medoids center point clustering analysis of the boarding and boarding data on each road section and the average fluctuation trend graph of the boarding and boarding data Draw, find the information of taxi boarding and boarding travel patterns and travel characteristics, so as to facilitate the traffic planning managers to perform taxi scheduling and route optimization.
  • FIG. 1 is a schematic flowchart of a visualized method for analyzing the characteristics of getting on and off a taxi according to an embodiment of the present invention.
  • Fig. 2 is a schematic diagram of four types of entertainment travel (U-type), work travel (M-type), dining travel (W-type), and mixed-type (VVV-type) in the situation of getting on a taxi in the embodiment.
  • Fig. 3 is a schematic diagram of the distribution of taxi boarding points in a certain city as a specific example in the embodiment.
  • Fig. 4 is a histogram from time domain to frequency domain for a certain road section in a certain city on a certain two days in an embodiment.
  • Figure 5 shows the four modes of daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode after clustering analysis on the amount of data on the bus in each day and each time period of a certain road section statistical period (quarter) in the embodiment. Illustration of the situation.
  • Fig. 6 is a schematic diagram of a curve drawn in the daily commuting travel mode in the embodiment.
  • Fig. 7 is a schematic diagram illustrating an embodiment of the recreational travel (U-type), work travel (M-type), dining travel (W-type), and hybrid (VVV-type) types superimposed on the map in the embodiment.
  • a visualized taxi boarding and boarding feature analysis method by extracting the boarding and boarding information from the taxi GPS trajectory data, clustering analysis of taxi travel features based on the K-medoids algorithm, and then analyzing the average fluctuation trend curve based on the road section.
  • the pattern shape of analyzes the characteristics of the trip purpose of the daily commuting class after the extraction and clustering, and finally presents the trip feature information of each road section in the GIS electronic map, as shown in Figure 1.
  • a heavy vehicle means that there are passengers getting on the bus, and the status of a taxi from heavy to empty means that the passengers get off the bus, and then the points of getting on and off from the GPS track data (arranged in time series) are extracted.
  • S121 Determine the road section according to the urban road network information (the distance between adjacent intersections is the road section), and then rely on the longitude and latitude (lng i , lat i ) in the GPS trajectory data to divide the GPS trajectory data into each road section; according to the date Sort and sort the time information t i in the GPS trajectory data with time, and then integrate the number of getting on and off vehicles in each time period of each road segment;
  • S122 Solve the average value of taxis getting on and off at each time of the day, and get the average number of getting on and off each road each day, and arrange them in time series. For example, extracting the boarding data in section A, average the data of boarding at the same time on section A to get the average fluctuation trend of boarding in section A in one day.
  • the specific Fourier formula is:
  • step S3 Use k-medoids to perform cluster analysis on the frequency domain values transformed in step S2 to determine the type of taxi trips, including daily commuting trips, holiday trips, mixed trips and missing data.
  • S31 Respectively integrate the frequency data of getting on and off the vehicle in each time period of each road section of the sample number of the statistical time period, and determine the number of clusters k of the sample clustering analysis;
  • GPS trajectory data in the frequency cluster analysis of GPS trajectory data, it can be divided into four categories, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode. Among them, the difference between weekdays and weekends is less and the default is Daily commuting travel patterns, while the changes between days of the week are small.
  • the characteristic information of the taxi boarding and boarding travel purpose of each road segment is marked in the GIS electronic map.
  • This kind of visualized taxi boarding and boarding feature analysis method extracts the GPS trajectory information of taxi vehicles, sorts the data of each road section in time, and clusters the taxi trips through Fourier transform and k-medoids algorithm
  • the purpose of taxi trips on the road section is displayed in a visual way in the shape of English letters, so as to display and process in the GIS electronic map, so that users can effectively grasp the characteristics of taxis getting on and off the road network and avoid the traditional heat map display method Need to manually determine the empirical model of parameter information.
  • the innovative visual display method of taxi boarding and boarding features of the embodiment is compared with the spatial clustering analysis of the traditional heat map, and the k-medoids center point clustering analysis of the boarding and boarding data on each road section and the average fluctuation trend graph of the boarding and boarding data are performed Draw, find the information of taxi boarding and boarding travel patterns and travel characteristics, so as to facilitate traffic planning managers in taxi scheduling and route optimization.
  • Step S1 Select a certain city as the research object, obtain the operating data of taxis in a certain quarter of the city, including GPS trajectory data, and extract the boarding data for research according to the status of the heavy/empty vehicle.
  • the distribution is shown in Figure 3.
  • Step S2 collect statistics on the data of each time of the day, and convert it to frequency value through Fourier transform, draw a histogram from time domain to frequency domain, and obtain the frequency index after conversion.
  • the histogram of a certain road section is shown in Figure 4.
  • Step S3 Perform cluster analysis on the amount of vehicle boarding data in each day and each time period of a certain road section statistical period (quarter), and divide four modes, as shown in FIG. 5.
  • Step S4 Further draw the fluctuation trend of the number of taxis on the day in each mode. Taking daily commuting as an example, the drawn curve is shown in FIG. 6.
  • the peak time period for each type of travel mode is determined as follows:
  • Step S5 Further superimpose each type of data on the map, and the specific information is shown in FIG. 7.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Remote Sensing (AREA)
  • Human Resources & Organizations (AREA)
  • Software Systems (AREA)
  • Probability & Statistics with Applications (AREA)
  • Mathematical Physics (AREA)
  • Fuzzy Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Traffic Control Systems (AREA)
  • Navigation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a visual method for analyzing taxi pick-up or drop-off features. The method comprises: extracting, from taxi GPS path data, information about taxi pick-up or drop-off; performing taxi travel feature clustering analysis by means of a K-medoids algorithm; then, performing daily commuting travel destination feature analysis after performing extraction and clustering on the pattern shape of an average fluctuation trend curve of taxi pick-up or drop-off road segments; and finally displaying, on an electronic GIS map, taxi travel feature information of each road segment. The method eliminates the need for personnel to manually configure kernel size and search radius parameter values, and avoids personnel having to manually configure kernel size and a search radius visualization of regions of taxi pick-up or drop-off positions on a conventional heat map. Taxi travel destination feature information is acquired by means of the shape of an average fluctuation trend diagram of taxi pick-up or drop-off data, then taxi travel feature information is superimposed and displayed on an electronic GIS map, thereby achieving a novel visual display method for intuitively and clearly displaying taxi pick-up or drop-off features.

Description

可视化的出租车上下车特征分析方法Visualized analysis method of taxi getting on and off characteristics 技术领域Technical field
本发明涉及一种可视化的出租车上下车特征分析方法。The invention relates to a visualized method for analyzing the characteristics of taxi getting on and off.
背景技术Background technique
出租车上下车热点区域及出行目的的研究是出租车调度和路径优化的关键,而传统出租车出行模式可视化展示方式主要为热力图或冷热点绘制研究。出租车上下车热力图的绘制主要取决于内核尺寸和搜索半径两个参数,其中每一个上下车点看作二维高斯分布的中心,内核尺寸决定二维高斯分布的离散类型,搜索半径决定区域内上下车点的数量,从而通过对这些二维高斯分布的求和得到热点或热力图,但是这种数据方法的最优解来自两个参数,而两个参数的取值均是由经验决定。The research on the hot spots and travel purpose of taxis is the key to taxi scheduling and route optimization. The traditional visual display of taxi travel mode is mainly heat map or cold hotspot drawing research. The drawing of the taxi boarding and boarding heat map mainly depends on the two parameters of the kernel size and the search radius. Each pick-up point is regarded as the center of the two-dimensional Gaussian distribution. The kernel size determines the discrete type of the two-dimensional Gaussian distribution, and the search radius determines the area The number of the inner pick-up and drop-off points can be obtained by summing these two-dimensional Gaussian distributions to obtain the hotspot or heat map, but the optimal solution of this data method comes from two parameters, and the values of the two parameters are determined by experience .
同时出租车上下车点一般沿着城市道路沿线,而传统热力图或冷热点的研究绘制是将每个上下车点作为二维高斯分布的中心,高斯分布没有方向约束,这就意味着绘制出的热力图中,在没有道路覆盖的区域也存在上下车行为。另一方面,这种热力图和热点图的可视化展示方式仅能展现出区域热点,针对时间信息无法展示,在可视化展示前均需人工筛选数据的收集统计时间段。At the same time, taxi pick-up and drop-off points are generally along urban roads, while traditional heat map or cold and hot research and drawing uses each pick-and-drop point as the center of a two-dimensional Gaussian distribution. The Gaussian distribution has no direction constraint, which means that In the heat map, there are also getting on and off behavior in areas without road coverage. On the other hand, this visual display method of heat map and heat map can only display regional hot spots, and cannot display time information. It is necessary to manually filter the data collection statistical time period before visual display.
综上所述,针对传统出租车上下车可视化展示方法(热力图/热点图),急需提供一种将时空信息一起展示的道路上下车可视化展示方法,使交管用户能够清楚掌握各条道路上出租车上下车模式信息。To sum up, in view of the traditional visual display method of taxi getting on and off (heat map/heat map), it is urgent to provide a visual display method of getting on and off the road that displays time and space information together, so that traffic control users can clearly grasp the rental on each road Car getting on and off mode information.
发明内容Summary of the invention
针对传统出租车上下车热点区域热力图的可视化展现模式,本发明的目的是提供一种可视化的出租车上下车特征分析方法,使用户有效掌握路网内出租车上下车出行特征信息,避免传统热力图展示方式需要人工确定参数信息的经验化模式,并解决现有技术中存在的如何将道路上下车进行时空信息一起展示,使交管用户能够清楚掌握各条道路上出租车上下车模式信息的问题。Aiming at the visual display mode of the heat map of the traditional taxi boarding and boarding hotspots, the purpose of the present invention is to provide a visualized taxi boarding and boarding feature analysis method, so that users can effectively grasp the taxi boarding and boarding travel feature information in the road network and avoid the traditional The heat map display method needs to manually determine the empirical mode of parameter information, and solves the problem of how to display the time and space information of the road and the car together in the existing technology, so that the traffic control user can clearly grasp the information of the taxi alight and the car on each road problem.
本发明的技术解决方案是:The technical solution of the present invention is:
一种可视化的出租车上下车特征分析方法,通过提取出租车GPS轨迹数据中的上下车信息,基于K-medoids算法的出租车出行特征聚类分析,进而对基于路段上下车平均波动趋势曲线图的图案形状对提取聚类后的日常通勤类的出行 目的特征分析,最终在GIS电子地图中呈现出各路段出租车出行特征信息,包括以下步骤,A visualized taxi boarding and boarding feature analysis method, by extracting the boarding and boarding information from the taxi GPS trajectory data, clustering analysis of taxi travel features based on the K-medoids algorithm, and then analyzing the average fluctuation trend curve based on the road section. The pattern shape of the extracted and clustered daily commuting travel purpose feature analysis, and finally the taxi travel feature information of each road section is presented in the GIS electronic map, including the following steps:
S1、提取出租车的GPS轨迹数据并整合汇总,得到每条路段平均每天各时段的上下车情况;S1. Extract the GPS trajectory data of taxis and integrate them to get the average daily getting on and off status of each road section;
S2、基于傅里叶变换将上下车数据的时间序列从时域转换为频域数值;S2, based on the Fourier transform, convert the time series of the vehicle getting on and off data from the time domain to the frequency domain value;
S3、利用k-medoids将步骤S2中变换后频域数值进行聚类分析,确定出租车出行类型,包括日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式;S3. Use k-medoids to perform cluster analysis on the frequency domain values transformed in step S2 to determine the type of taxi travel, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode;
S4、提取日常通勤出行模式下的各时段出租车上下车数量,绘制出当日各时段上下车数波动趋势图,基于波动趋势图的图案形状确定时段下出租车出行的出行目的;S4. Extract the number of taxis getting on and off at each time period in the daily commuting travel mode, draw a fluctuation trend graph of the number of cars on and off at each time period of the day, and determine the travel purpose of the taxi trip during the time period based on the pattern shape of the fluctuation trend graph;
S5、基于步骤S3确定的出行特征以及步骤S4确定的各路段通勤出行目的,在GIS电子地图中标注出各路段的出租车上下车出行目的特征信息。S5. Based on the travel characteristics determined in step S3 and the commuter travel purpose of each road segment determined in step S4, mark the characteristic information of the taxi boarding and boarding travel purpose of each road segment in the GIS electronic map.
进一步地,步骤S1具体为,Further, step S1 is specifically:
S11、接入出租车车载终端,获取出租车GPS轨迹数据,提取出租车上下车数据;S11. Connect to the taxi on-board terminal, obtain the taxi GPS trajectory data, and extract the taxi getting on and off data;
S12、整合每条路段每天的出租车GPS轨迹数据,得到每条路段平均每天各时段的上下车情况。S12. Integrate the daily GPS trajectory data of each road section to obtain the average daily getting on and off status of each road section.
进一步地,步骤S11中,提取出租车上下车数据,具体为,Further, in step S11, the taxi getting on and off data is extracted, specifically,
首先对出租车轨迹信息进行数据清洗工作再进行数据分析;出租车车辆的GPS轨迹数据是按其时间进行排序所行成的时间序列traj={p 1,p 2,…,p i,…,p n},其中,p i=(lng i,lat i,t i),lng i表示车辆在第i个位置时的经度,lat i表示车辆在第i个位置时的纬度,t i表示车辆在第i个位置时的时刻;出租车上下车情况根据出租车GPS轨迹数据中的载客状态包括重车和空车确定,即出租车载客状态由空车变为重车则说明有乘客上车,出租车载客状态由重车变为空车则说明乘客下车,进而从GPS轨迹数据中提取出上下车点并按时间序列排列。 First, perform data cleaning work on taxi trajectory information and then perform data analysis; the GPS trajectory data of taxi vehicles is a time sequence traj = {p 1 , p 2 ,..., p i ,..., sorted by time. p n}, where, p i = (lng i, lat i, t i), lng i represents the vehicle longitude at the i-th position, lat i represents the latitude of the vehicle at the time of the i-th position, t i represents the vehicle At the time of the i-th position; the taxi boarding and disembarking conditions are determined according to the passenger loading status in the taxi GPS trajectory data, including heavy and empty vehicles. That is, the taxi status changes from empty to heavy, indicating that there are passengers When getting on the bus, the status of the taxi carrying passengers from heavy to empty means that the passengers get off, and then the points of getting on and off are extracted from the GPS track data and arranged in time series.
进一步地,步骤S12中,得到每条路段平均每天各时段的上下车情况,具体为,Further, in step S12, the average daily getting on and off status of each road section at each time period is obtained, specifically,
S121、根据城市路网信息确定路段,相邻交叉口之间的距离即为路段,进而依托GPS轨迹数据中的经纬度(lng i,lat i)将GPS轨迹数据分至各路段上;根据日期和时间将GPS轨迹数据中的时刻信息t i进行分类排序,进而整合每条路段每天各时间段内上下车数量; S121. Determine the road section according to the urban road network information, the distance between adjacent intersections is the road section, and then rely on the longitude and latitude (lng i , lat i ) in the GPS trajectory data to divide the GPS trajectory data into each road section; Time sorts and sorts the time information t i in the GPS trajectory data, and then integrates the number of getting on and off vehicles in each time period of each road segment;
S122、求解出每天各时段出租车上下车均值,得到每条路段平均每天各时段的上下车数量,并按时间序列排列。S122. Solve the average value of taxis getting on and off at each time period of each day, and obtain the average number of getting on and off vehicles at each time of each road section each day, and arrange them in time series.
进一步地,基于傅里叶变换将上下车数据的时间序列从时域转换为频域数值,具体为,基于时空数据变换特征,通过傅里叶变换公示将上下车数据情况值N t={p 1,p 2,…,p j}转换为频率数值ω(t)={ω 1,ω 2,…,ω j},即针对步骤S1求得各时段内上下车数量转换到各时段频率数值,具体傅里叶公式为: Further, the time series of the getting on and off data is converted from the time domain to the frequency domain value based on the Fourier transform, specifically, based on the characteristics of the spatio-temporal data transformation, the condition value of the getting on and off data is publicized through the Fourier transform N t ={p 1 , p 2 ,..., p j } is converted to frequency value ω(t)={ω 1 , ω 2 ,..., ω j }, that is, the number of vehicles getting on and off in each period is converted to the frequency value of each period for step S1 , The specific Fourier formula is:
Figure PCTCN2019115056-appb-000001
Figure PCTCN2019115056-appb-000001
式中,ω代表频率,t代表时间,e -iωt为复变函数,f(t)为上下车数据量,F(ω)为上下车数量频率数值。 In the formula, ω represents frequency, t represents time, e- iωt is a complex variable function, f(t) is the amount of data for getting on and off cars, and F(ω) is the frequency value of getting on and off cars.
进一步地,步骤S3中,确定出租车出行模式,包括日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式,具体为,Further, in step S3, the taxi travel mode is determined, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode, specifically,
S31、分别对统计时间段样本数量各路段各时间段内的上下车频率数据进行整合,确定样聚类分析的簇个数k;S31. Integrate the frequency data of getting on and off the vehicle in each time period of each road section of the sample number in the statistical time period respectively, and determine the number of clusters k of the sample clustering analysis;
S32、在统计时间范围内样本点中随机选择k个点作为初始中心点,分别计算剩下点距离k个中心点的欧氏距离从而划分出簇,进一步根据划分的簇再求解簇内剩下的点与其其他样本点的曼哈顿距离之和,并将曼哈顿距离之和最小的点作为簇的中心点;不断重复求解簇直到中心点不再改变,得到聚类结果,即出租车GPS上下车频率数据的划分情况。S32. Randomly select k points from the sample points in the statistical time range as the initial center points, calculate the Euclidean distances between the remaining points and the k center points respectively to divide the clusters, and then solve the remaining clusters according to the divided clusters. The sum of the Manhattan distance between the point and other sample points, and the point with the smallest sum of Manhattan distance as the center point of the cluster; the cluster is solved repeatedly until the center point no longer changes, and the clustering result is obtained, that is, the frequency of taxi GPS getting on and off The division of data.
本发明的有益效果是:The beneficial effects of the present invention are:
一、该种可视化的出租车上下车特征分析方法,与传统热力图的展现方式相比较,该方法不用根据经验人工设定内核尺寸和搜索半径参数数值,能够避免传统热力图通过人工经验配置内核尺寸和搜素半径导致上下车点区域可视化的情 况。1. This kind of visualized taxi boarding and boarding feature analysis method is compared with the traditional heat map display method. This method does not need to manually set the kernel size and search radius parameter values based on experience, and can avoid the traditional heat map from configuring the kernel through manual experience The size and search radius result in the visualization of the area of the pick-up and drop-off point.
二、该种可视化的出租车上下车特征分析方法,通过傅里叶变换将GPS轨迹数据中的时间序列上下车数据转换为频率数值,时空信息在转换中不会丢失,更便于数据的聚类分析。2. This kind of visualized taxi boarding and boarding feature analysis method uses Fourier transform to convert the time series boarding and boarding data in the GPS trajectory data into frequency values. Spatio-temporal information will not be lost during the conversion, which is more convenient for data clustering analysis.
三、本发明方法通过K-medoids中心点方法对GPS上下车轨迹数据进行聚类分析,划分出日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失出行模式四种情况,并基于上下车数据的平均波动趋势的趋势图形状(英文字母)划分出出租车出行目的特征信息,从而通过在GIS电子地图中叠加英文字母展现出出租车出行特征信息。3. The method of the present invention uses the K-medoids center point method to perform cluster analysis on GPS getting on and off trajectory data, and divides the four situations of daily commuting travel mode, holiday travel mode, mixed travel mode and data missing travel mode, and based on getting on and off. The trend graph shape (English letters) of the average fluctuation trend of the data divides the characteristic information of the taxi travel purpose, so that the characteristic information of the taxi travel is displayed by superimposing the English letters on the GIS electronic map.
四、该种可视化的出租车上下车特征分析方法,通过将GPS轨迹信息提取整合,分析出租车上下车数据,以K-medoids算法聚类分析出租车出行模式,以上下车数据波动趋势图的英文字母形状分析出行特征,从而形成直观清晰的出租车上下车特征的创新可视化展示方式。Fourth, this kind of visualized taxi boarding and boarding feature analysis method, by extracting and integrating GPS trajectory information, analyzing taxi boarding and boarding data, clustering analysis of taxi travel patterns with K-medoids algorithm, and showing the trend graph of the fluctuations of the data on boarding and alighting The shape of the English letters analyzes travel characteristics, thereby forming an intuitive and clear innovative visual display of taxi boarding and boarding characteristics.
五、该种可视化的出租车上下车特征分析方法,对比传统热力图的空间聚类分析,通过对各路段上的上下车数据进行k-medoids中心点聚类分析以及上下车数据平均波动趋势图绘制,找到出租车上下车出行模式及出行特征规律信息,从而方便交通规划管理者进行出租车调度和路径优化。5. This kind of visualized taxi boarding and boarding feature analysis method is compared with the spatial clustering analysis of traditional heat map, through k-medoids center point clustering analysis of the boarding and boarding data on each road section and the average fluctuation trend graph of the boarding and boarding data Draw, find the information of taxi boarding and boarding travel patterns and travel characteristics, so as to facilitate the traffic planning managers to perform taxi scheduling and route optimization.
附图说明Description of the drawings
图1是本发明实施例可视化的出租车上下车特征分析方法的流程示意图。FIG. 1 is a schematic flowchart of a visualized method for analyzing the characteristics of getting on and off a taxi according to an embodiment of the present invention.
图2是实施例中出租车上车情况的娱乐出行(U型)、工作出行(M型)、就餐出行(W型)和混合型(VVV型)四个类型的示意图。Fig. 2 is a schematic diagram of four types of entertainment travel (U-type), work travel (M-type), dining travel (W-type), and mixed-type (VVV-type) in the situation of getting on a taxi in the embodiment.
图3是实施例中具体示例的某市的出租车上车点位分布示意图。Fig. 3 is a schematic diagram of the distribution of taxi boarding points in a certain city as a specific example in the embodiment.
图4是实施例中某市某两日某路段时域到频域的直方图。Fig. 4 is a histogram from time domain to frequency domain for a certain road section in a certain city on a certain two days in an embodiment.
图5是实施例中对某路段统计时段(季度)各日各时段内的上车数据量进行聚类分析后划分出日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式四种模式情况的说明示意图。Figure 5 shows the four modes of daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode after clustering analysis on the amount of data on the bus in each day and each time period of a certain road section statistical period (quarter) in the embodiment. Illustration of the situation.
图6是实施例中对日常通勤出行模式下绘制的曲线的示意图。Fig. 6 is a schematic diagram of a curve drawn in the daily commuting travel mode in the embodiment.
图7是实施例中将娱乐出行(U型)、工作出行(M型)、就餐出行(W型)和混合型(VVV型)各类型叠加到地图后的说明示意图。Fig. 7 is a schematic diagram illustrating an embodiment of the recreational travel (U-type), work travel (M-type), dining travel (W-type), and hybrid (VVV-type) types superimposed on the map in the embodiment.
具体实施方式detailed description
下面结合附图详细说明本发明的优选实施例。The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
实施例Example
一种可视化的出租车上下车特征分析方法,通过提取出租车GPS轨迹数据中的上下车信息,基于K-medoids算法的出租车出行特征聚类分析,进而对基于路段上下车平均波动趋势曲线图的图案形状对提取聚类后的日常通勤类的出行目的特征分析,最终在GIS电子地图中呈现出各路段出租车出行特征信息,如图1,具体步骤如下:A visualized taxi boarding and boarding feature analysis method, by extracting the boarding and boarding information from the taxi GPS trajectory data, clustering analysis of taxi travel features based on the K-medoids algorithm, and then analyzing the average fluctuation trend curve based on the road section. The pattern shape of, analyzes the characteristics of the trip purpose of the daily commuting class after the extraction and clustering, and finally presents the trip feature information of each road section in the GIS electronic map, as shown in Figure 1. The specific steps are as follows:
S1.提取出租车GPS轨迹数据并整合汇总,得到每条路段平均每天各时段的上下车情况。S1. Extract the taxi GPS trajectory data and integrate them to obtain the average daily getting on and off status of each road section.
S11.接入出租车车载终端,获取出租车GPS轨迹数据,提取出租车上/下车数据。S11. Connect to the taxi on-board terminal, obtain taxi GPS trajectory data, and extract taxi boarding/getting off data.
优选地,在提取时首先对出租车轨迹信息进行数据清洗工作再进行数据分析;出租车车辆的GPS轨迹数据是按其时间进行排序所行成的时间序列traj={p 1,p 2,…,p i,…,p n},其中,p i=(lng i,lat i,t i),lng i表示车辆在第i个位置时的经度,lat i表示车辆在第i个位置时的纬度,t i表示车辆在第i个位置时的时刻;出租车上下车情况根据出租车GPS轨迹数据中的载客状态(重车和空车)确定,即出租车载客状态由空车变为重车则说明有乘客上车,出租车载客状态由重车变为空车则说明乘客下车,进而从GPS轨迹数据中提取出上下车点(按时间序列排列)。 Preferably, when extracting the taxi trajectory information, first perform data cleaning work and then perform data analysis; the GPS trajectory data of taxi vehicles is a time sequence traj = {p 1 , p 2 ,... , P i ,..., p n }, where p i = (lng i , lat i , t i ), lng i represents the longitude of the vehicle at the i-th position, and lat i represents the vehicle’s longitude at the i-th position Latitude, t i represents the time when the vehicle is at the i-th position; the taxi getting on and off is determined according to the passenger status (heavy and empty) in the taxi GPS trajectory data, that is, the taxi's passenger status changes from empty A heavy vehicle means that there are passengers getting on the bus, and the status of a taxi from heavy to empty means that the passengers get off the bus, and then the points of getting on and off from the GPS track data (arranged in time series) are extracted.
S12.整合每条路段每天的上下车GPS轨迹数据,得到每条路段平均每天各时段的上下车情况,其中路段为相邻交叉口之间的距离。S12. Integrate the daily GPS trajectory data of each road section to get on and off the vehicle at each time of the day on each road section. The road section is the distance between adjacent intersections.
S121.根据城市路网信息确定路段(相邻交叉口之间的距离即为路段),进而依托GPS轨迹数据中的经纬度(lng i,lat i)将GPS轨迹数据分至各路段上;根据日期和时间将GPS轨迹数据中的时刻信息t i进行分类排序,进而整合每条路段每天各时间段内上下车数量; S121. Determine the road section according to the urban road network information (the distance between adjacent intersections is the road section), and then rely on the longitude and latitude (lng i , lat i ) in the GPS trajectory data to divide the GPS trajectory data into each road section; according to the date Sort and sort the time information t i in the GPS trajectory data with time, and then integrate the number of getting on and off vehicles in each time period of each road segment;
S122.求解出每天各时段出租车上下车均值,得到每条路段平均每天各时段的上下车数量,并按时间序列排列。如提取出A路段中的上车数据,对A路段 同时段上车数据求均值得到A路段平均一天内的上车波动趋势情况。S122. Solve the average value of taxis getting on and off at each time of the day, and get the average number of getting on and off each road each day, and arrange them in time series. For example, extracting the boarding data in section A, average the data of boarding at the same time on section A to get the average fluctuation trend of boarding in section A in one day.
S2.基于傅里叶变换将上下车数据的时间序列从时域转换为频域数值。S2. Convert the time series of the vehicle getting on and off data from the time domain to the frequency domain value based on the Fourier transform.
具体来说,基于时空数据变换特征,通过傅里叶变换公示将上下车数据情况值N t={p 1,p 2,…,p j}转换为频率数值ω(t)={ω 1,ω 2,…,ω j},即针对S1步骤求得各时段内上下车数量转换到各时段频率数值,具体傅里叶公式为: Specifically, based on the characteristics of spatio-temporal data transformation, through Fourier transform publicity, the data condition value N t ={p 1 , p 2 ,..., p j } is converted into a frequency value ω(t)={ω 1 , ω 2 ,..., ω j }, that is, the number of vehicles getting on and off in each time period is converted to the frequency value of each time period for the S1 step. The specific Fourier formula is:
Figure PCTCN2019115056-appb-000002
Figure PCTCN2019115056-appb-000002
S3.利用k-medoids将步骤S2中变换后频域数值进行聚类分析,确定出租车出行类型,包括日常通勤出行、节假日出行、混合出行和数据缺失。S3. Use k-medoids to perform cluster analysis on the frequency domain values transformed in step S2 to determine the type of taxi trips, including daily commuting trips, holiday trips, mixed trips and missing data.
S31.分别对统计时间段样本数量各路段各时间段内的上下车频率数据进行整合,确定样聚类分析的簇个数k;S31. Respectively integrate the frequency data of getting on and off the vehicle in each time period of each road section of the sample number of the statistical time period, and determine the number of clusters k of the sample clustering analysis;
S32.在统计时间范围内样本点中随机选择k个点作为初始中心点,分别计算剩下点距离k个中心点的欧氏距离从而划分出簇,进一步根据划分的簇再求解簇内剩下的点与其其他样本点的曼哈顿距离之和,并将曼哈顿距离之和最小的点作为簇的中心点;不断重复求解簇直到中心点不再改变,得到聚类结果,即出租车GPS上下车频率数据的划分情况。S32. Randomly select k points from the sample points in the statistical time range as the initial center points, calculate the Euclidean distance between the remaining points and the k center points respectively to divide the clusters, and then solve the remaining clusters according to the divided clusters The sum of the Manhattan distance between the point and other sample points, and the point with the smallest sum of Manhattan distance as the center point of the cluster; the cluster is solved repeatedly until the center point no longer changes, and the clustering result is obtained, that is, the frequency of taxi GPS getting on and off The division of data.
具体来说,在对GPS轨迹数据频率聚类分析时,可以划分为四类,包括日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式,其中工作日和周末差别较少均默认为日常通勤出行模式,同时一周内各日之间的变化较小。Specifically, in the frequency cluster analysis of GPS trajectory data, it can be divided into four categories, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode. Among them, the difference between weekdays and weekends is less and the default is Daily commuting travel patterns, while the changes between days of the week are small.
S4.提取日常通勤出行模式下的各时段出租车上下车数量,绘制出当日各时段上下车数波动趋势图,基于波动趋势图的图案形状确定时段下出租车出行的出行目的。S4. Extract the number of taxis getting on and off at each time period in the daily commuting travel mode, draw a fluctuation trend graph of the number of cars getting on and off at each time period of the day, and determine the travel purpose of the taxi trip during the time period based on the pattern shape of the fluctuation trend graph.
以出租车上车情况为例,具体包括趋势图呈现“U型”形状的娱乐出行、趋势图呈现“M型”形状的工作出行、趋势图呈现“W型”形状的就餐出行和趋势图呈现“VVV型”形状的混合型,如图2所示。Take the taxi boarding situation as an example, specifically including entertainment trips with a "U-shaped" trend graph, work trips with a "M-shape" trend graph, dining trips with a "W-shape" trend graph, and trend graph presentation "VVV type" shape hybrid type, as shown in Figure 2.
S5.基于GIS电子地图呈现各路段出租车上下车的出行特征信息。S5. Based on the GIS electronic map, it presents the travel characteristic information of taxis getting on and off each road section.
具体来说,基于步骤S3确定的出行特征以及步骤S4确定的各路段通勤出行目的,在GIS电子地图中标注出各路段的出租车上下车出行目的特征信息。Specifically, based on the travel characteristics determined in step S3 and the commuter travel purpose of each road segment determined in step S4, the characteristic information of the taxi boarding and boarding travel purpose of each road segment is marked in the GIS electronic map.
该种可视化的出租车上下车特征分析方法,提取出出租车车辆的GPS轨迹信息,将各路段上下车数据按时间排序,通过傅里叶变换和k-medoids算法将出租车出行进行聚类分析,针对日常通勤模式以英文字母形状的可视化方式展现出路段出租车出行目的,从而在GIS电子地图中展示处理,使用户有效掌握路网内出租车上下车出行特征信息,避免传统热力图展示方式需要人工确定参数信息的经验化模式。This kind of visualized taxi boarding and boarding feature analysis method extracts the GPS trajectory information of taxi vehicles, sorts the data of each road section in time, and clusters the taxi trips through Fourier transform and k-medoids algorithm In view of the daily commuting mode, the purpose of taxi trips on the road section is displayed in a visual way in the shape of English letters, so as to display and process in the GIS electronic map, so that users can effectively grasp the characteristics of taxis getting on and off the road network and avoid the traditional heat map display method Need to manually determine the empirical model of parameter information.
实施例的创新的出租车上下车特征可视化展现方式,对比传统热力图的空间聚类分析,通过对各路段上的上下车数据进行k-medoids中心点聚类分析以及上下车数据平均波动趋势图绘制,找到出租车上下车出行模式及出行特征规律信息,从而方便交通规划管理者进行出租车调度和路径优化。The innovative visual display method of taxi boarding and boarding features of the embodiment is compared with the spatial clustering analysis of the traditional heat map, and the k-medoids center point clustering analysis of the boarding and boarding data on each road section and the average fluctuation trend graph of the boarding and boarding data are performed Draw, find the information of taxi boarding and boarding travel patterns and travel characteristics, so as to facilitate traffic planning managers in taxi scheduling and route optimization.
实施例的一个具体示例如下:A specific example of the embodiment is as follows:
步骤S1、选取某市作为研究对象,获取该市某季度内出租车的运营数据包括GPS轨迹数据,根据车辆重车/空车状态确定提取出上车数据进行研究,其中出租车上车点位分布如图3所示。Step S1. Select a certain city as the research object, obtain the operating data of taxis in a certain quarter of the city, including GPS trajectory data, and extract the boarding data for research according to the status of the heavy/empty vehicle. The distribution is shown in Figure 3.
步骤S2、按路段对每日各时段的上车数据进行统计,并通过傅里叶变换转换为频率数值,绘制出时域到频域的直方图,得到转换后的频率指数,其中某两日某路段的直方图如图4所示。Step S2, according to the road section, collect statistics on the data of each time of the day, and convert it to frequency value through Fourier transform, draw a histogram from time domain to frequency domain, and obtain the frequency index after conversion. The histogram of a certain road section is shown in Figure 4.
步骤S3、对某路段统计时段(季度)各日各时段内的上车数据量进行聚类分析,划分出四种模式情况,如图5所示。Step S3: Perform cluster analysis on the amount of vehicle boarding data in each day and each time period of a certain road section statistical period (quarter), and divide four modes, as shown in FIG. 5.
步骤S4、进一步分别对各模式下当日出租车上车数量波动趋势进行绘制,以日常通勤为例,其绘制的曲线如图6所示。Step S4: Further draw the fluctuation trend of the number of taxis on the day in each mode. Taking daily commuting as an example, the drawn curve is shown in FIG. 6.
其中在日常通勤出行模式下,针对全天各时间段内确定各类型出行方式其高峰时间段如下:Among them, in the daily commuting travel mode, the peak time period for each type of travel mode is determined as follows:
Figure PCTCN2019115056-appb-000003
Figure PCTCN2019115056-appb-000003
Figure PCTCN2019115056-appb-000004
Figure PCTCN2019115056-appb-000004
步骤S5、进一步将数据将各类型叠加到地图上,具体信息如图7所示。Step S5: Further superimpose each type of data on the map, and the specific information is shown in FIG. 7.

Claims (6)

  1. 一种可视化的出租车上下车特征分析方法,其特征在于:通过提取出租车GPS轨迹数据中的上下车信息,基于K-medoids算法的出租车出行特征聚类分析,进而对基于路段上下车平均波动趋势曲线图的图案形状对提取聚类后的日常通勤类的出行目的特征分析,最终在GIS电子地图中呈现出各路段出租车出行特征信息,包括以下步骤,A visualized taxi boarding and boarding feature analysis method, which is characterized by: extracting the boarding and boarding information in the taxi GPS trajectory data, clustering analysis of taxi travel features based on the K-medoids algorithm, and then averaging The pattern shape of the fluctuation trend graph analyzes the characteristics of the daily commuting travel purpose after the extraction and clustering, and finally presents the taxi travel characteristic information of each road section in the GIS electronic map, including the following steps:
    S1、提取出租车的GPS轨迹数据并整合汇总,得到每条路段平均每天各时段的上下车情况;S1. Extract the GPS trajectory data of taxis and integrate them to obtain the average daily getting on and off status of each road section;
    S2、基于傅里叶变换将上下车数据的时间序列从时域转换为频域数值;S2, based on the Fourier transform, convert the time series of the vehicle getting on and off data from the time domain to the frequency domain value;
    S3、利用k-medoids将步骤S2中变换后频域数值进行聚类分析,确定出租车出行类型,包括日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式;S3. Use k-medoids to perform cluster analysis on the frequency domain values transformed in step S2 to determine the type of taxi travel, including daily commuting travel mode, holiday travel mode, mixed travel mode and data missing mode;
    S4、提取日常通勤出行模式下的各时段出租车上下车数量,绘制出当日各时段上下车数波动趋势图,基于波动趋势图的图案形状确定时段下出租车出行的出行目的;S4. Extract the number of taxis getting on and off at each time period in the daily commuting travel mode, draw a fluctuation trend graph of the number of cars on and off at each time period of the day, and determine the travel purpose of the taxi trip during the time period based on the pattern shape of the fluctuation trend graph;
    S5、基于步骤S3确定的出行特征以及步骤S4确定的各路段通勤出行目的,在GIS电子地图中标注出各路段的出租车上下车出行目的特征信息。S5. Based on the travel characteristics determined in step S3 and the commuter travel purpose of each road segment determined in step S4, mark the characteristic information of the taxi boarding and boarding travel purpose of each road segment in the GIS electronic map.
  2. 如权利要求1所述的可视化的出租车上下车特征分析方法,其特征在于:步骤S1具体为,The visualized taxi getting on and off characteristic analysis method according to claim 1, wherein the step S1 is specifically:
    S11、接入出租车车载终端,获取出租车GPS轨迹数据,提取出租车上下车数据;S11. Connect to the taxi on-board terminal, obtain the taxi GPS trajectory data, and extract the taxi getting on and off data;
    S12、整合每条路段每天的出租车GPS轨迹数据,得到每条路段平均每天各时段的上下车情况。S12. Integrate the daily GPS trajectory data of each road section to obtain the average daily getting on and off status of each road section.
  3. 如权利要求2所述的可视化的出租车上下车特征分析方法,其特征在于:步骤S11中,提取出租车上下车数据,具体为,The visualized taxi getting on and off characteristic analysis method according to claim 2, characterized in that: in step S11, extracting taxi getting on and off data, specifically,
    首先对出租车轨迹信息进行数据清洗工作再进行数据分析;出租车车辆的GPS轨迹数据是按其时间进行排序所行成的时间序列traj={p 1,p 2,…,p i,…,p n},其中,p i=(lng i,lat i,t i),lng i表示车辆在第i个位置时的经度,lat i表示车辆在第i个位置时的纬度,t i表示车辆在第i个位置时的时刻;出租车上下车情况根 据出租车GPS轨迹数据中的载客状态包括重车和空车确定,即出租车载客状态由空车变为重车则说明有乘客上车,出租车载客状态由重车变为空车则说明乘客下车,进而从GPS轨迹数据中提取出上下车点并按时间序列排列。 First, perform data cleaning work on taxi trajectory information and then perform data analysis; the GPS trajectory data of taxi vehicles is a time sequence traj = {p 1 , p 2 ,..., p i ,..., sorted by time. p n}, where, p i = (lng i, lat i, t i), lng i represents the vehicle longitude at the i-th position, lat i represents the latitude of the vehicle at the time of the i-th position, t i represents the vehicle At the time of the i-th position; the taxi boarding and disembarking conditions are determined according to the passenger loading status in the taxi GPS trajectory data, including heavy and empty vehicles. That is, the taxi status changes from empty to heavy, indicating that there are passengers When getting on the bus, the status of the taxi carrying passengers from heavy to empty means that the passengers get off, and then the points of getting on and off are extracted from the GPS track data and arranged in time series.
  4. 如权利要求2所述的可视化的出租车上下车特征分析方法,其特征在于:步骤S12中,得到每条路段平均每天各时段的上下车情况,具体为,The visualized taxi getting on and off characteristic analysis method according to claim 2, characterized in that: in step S12, the getting on and off conditions of each road section on average every day are obtained, specifically:
    S121、根据城市路网信息确定路段,相邻交叉口之间的距离即为路段,进而依托GPS轨迹数据中的经纬度(lng i,lat i)将GPS轨迹数据分至各路段上;根据日期和时间将GPS轨迹数据中的时刻信息t i进行分类排序,进而整合每条路段每天各时间段内上下车数量; S121. Determine the road section according to the urban road network information, the distance between adjacent intersections is the road section, and then rely on the longitude and latitude (lng i , lat i ) in the GPS trajectory data to divide the GPS trajectory data into each road section; Time sorts and sorts the time information t i in the GPS trajectory data, and then integrates the number of getting on and off vehicles in each time period of each road segment;
    S122、求解出每天各时段出租车上下车均值,得到每条路段平均每天各时段的上下车数量,并按时间序列排列。S122. Solve the average value of taxis getting on and off at each time period of each day, and obtain the average number of getting on and off vehicles at each time of each road section each day, and arrange them in time series.
  5. 如权利要求1-4任一项所述的可视化的出租车上下车特征分析方法,其特征在于:步骤S2中,基于傅里叶变换将上下车数据的时间序列从时域转换为频域数值,具体为,基于时空数据变换特征,通过傅里叶变换公示将上下车数据情况值N t={p 1,p 2,…,p j}转换为频率数值ω(t)={ω 1,ω 2,…,ω j},即针对步骤S1求得各时段内上下车数量转换到各时段频率数值,具体傅里叶公式为: The visualized taxi boarding and boarding feature analysis method according to any one of claims 1 to 4, characterized in that: in step S2, the time series of the boarding and boarding data is converted from the time domain to the frequency domain value based on Fourier transform Specifically, based on the characteristics of spatio-temporal data transformation, through Fourier transform publicity, the vehicle getting on and off data condition value N t ={p 1 , p 2 ,..., p j } is converted into a frequency value ω(t)={ω 1 , ω 2 ,..., ω j }, that is, the number of vehicles getting on and off in each period is converted to the frequency value of each period for step S1. The specific Fourier formula is:
    Figure PCTCN2019115056-appb-100001
    Figure PCTCN2019115056-appb-100001
    式中,ω代表频率,t代表时间,e -iωt为复变函数,f(t)为上下车数据量,F(ω)为上下车数量频率数值。 In the formula, ω represents frequency, t represents time, e- iωt is a complex variable function, f(t) is the amount of data for getting on and off cars, and F(ω) is the frequency value of getting on and off cars.
  6. 如权利要求1-4任一项所述的可视化的出租车上下车特征分析方法,其特征在于:步骤S3中,确定出租车出行模式,包括日常通勤出行模式、节假日出行模式、混合出行模式和数据缺失模式,具体为,The visualized taxi getting on and off characteristic analysis method according to any one of claims 1-4, characterized in that: in step S3, the taxi travel mode is determined, including daily commuting travel mode, holiday travel mode, mixed travel mode and Data missing pattern, specifically,
    S31、分别对统计时间段样本数量各路段各时间段内的上下车频率数据进行整合,确定样聚类分析的簇个数k;S31. Integrate the frequency data of getting on and off the vehicle in each time period of each road section of the sample number in the statistical time period respectively, and determine the number of clusters k of the sample clustering analysis;
    S32、在统计时间范围内样本点中随机选择k个点作为初始中心点,分别计算剩下点距离k个中心点的欧氏距离从而划分出簇,进一步根据划分的簇再求解 簇内剩下的点与其其他样本点的曼哈顿距离之和,并将曼哈顿距离之和最小的点作为簇的中心点;不断重复求解簇直到中心点不再改变,得到聚类结果,即出租车GPS上下车频率数据的划分情况。S32. Randomly select k points from the sample points in the statistical time range as the initial center points, calculate the Euclidean distances between the remaining points and the k center points respectively to divide the clusters, and then solve the remaining clusters according to the divided clusters. The sum of the Manhattan distance between the point and other sample points, and the point with the smallest sum of Manhattan distance as the center point of the cluster; the cluster is solved repeatedly until the center point no longer changes, and the clustering result is obtained, that is, the frequency of taxi GPS getting on and off The division of data.
PCT/CN2019/115056 2019-03-21 2019-11-01 Visual method for analyzing taxi pick-up or drop-off features WO2020186770A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910219988.3 2019-03-21
CN201910219988.3A CN109885639B (en) 2019-03-21 2019-03-21 Visual taxi getting-on and getting-off characteristic analysis method

Publications (1)

Publication Number Publication Date
WO2020186770A1 true WO2020186770A1 (en) 2020-09-24

Family

ID=66933606

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/115056 WO2020186770A1 (en) 2019-03-21 2019-11-01 Visual method for analyzing taxi pick-up or drop-off features

Country Status (2)

Country Link
CN (1) CN109885639B (en)
WO (1) WO2020186770A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112380906A (en) * 2020-10-19 2021-02-19 上汽通用五菱汽车股份有限公司 Method for determining user address based on driving data
CN112734532A (en) * 2021-01-15 2021-04-30 东南大学 Commuting behavior identification method based on shared electric bicycle borrowing and returning point data
CN112905729A (en) * 2021-03-05 2021-06-04 亿海蓝(北京)数据技术股份公司 Thermodynamic diagram generation method and device for track data, electronic equipment and storage medium
CN114579884A (en) * 2022-02-16 2022-06-03 中冶南方城市建设工程技术有限公司 Shared bicycle trip display method and system in urban built-up area
CN116029624A (en) * 2022-07-21 2023-04-28 大连海事大学 Goods source place identification method integrating truck track and POI data

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110428604B (en) * 2019-07-30 2022-04-22 山东交通学院 Taxi illegal parking monitoring and early warning method based on track and map data
KR20210042537A (en) * 2019-10-10 2021-04-20 엘지전자 주식회사 Method of estimating position in local area in large sapce and robot and cloud server implementing thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529754A (en) * 2016-06-27 2017-03-22 江苏智通交通科技有限公司 Taxi operation condition assessment method based on big data analysis
CN107577725A (en) * 2017-08-22 2018-01-12 长安大学 A kind of urban taxi passenger trip characteristicses visual analysis method
CN108959466A (en) * 2018-06-20 2018-12-07 淮阴工学院 Taxi hot spot method for visualizing and system based on BCS-DBSCAN
CN109254861A (en) * 2018-09-17 2019-01-22 江苏智通交通科技有限公司 OD requirement extract and its analysis method for reliability based on track data

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010250673A (en) * 2009-04-17 2010-11-04 Ntt Docomo Inc Taxi user extraction device and taxi user extraction method
CN106384509A (en) * 2016-10-08 2017-02-08 大连理工大学 Urban road driving time distribution estimation method considering taxi operation states
CN108242149B (en) * 2018-03-16 2020-06-30 成都智达万应科技有限公司 Big data analysis method based on traffic data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106529754A (en) * 2016-06-27 2017-03-22 江苏智通交通科技有限公司 Taxi operation condition assessment method based on big data analysis
CN107577725A (en) * 2017-08-22 2018-01-12 长安大学 A kind of urban taxi passenger trip characteristicses visual analysis method
CN108959466A (en) * 2018-06-20 2018-12-07 淮阴工学院 Taxi hot spot method for visualizing and system based on BCS-DBSCAN
CN109254861A (en) * 2018-09-17 2019-01-22 江苏智通交通科技有限公司 OD requirement extract and its analysis method for reliability based on track data

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112380906A (en) * 2020-10-19 2021-02-19 上汽通用五菱汽车股份有限公司 Method for determining user address based on driving data
CN112380906B (en) * 2020-10-19 2024-05-31 上汽通用五菱汽车股份有限公司 Method for determining user address based on driving data
CN112734532A (en) * 2021-01-15 2021-04-30 东南大学 Commuting behavior identification method based on shared electric bicycle borrowing and returning point data
CN112905729A (en) * 2021-03-05 2021-06-04 亿海蓝(北京)数据技术股份公司 Thermodynamic diagram generation method and device for track data, electronic equipment and storage medium
CN112905729B (en) * 2021-03-05 2024-01-30 亿海蓝(北京)数据技术股份公司 Thermodynamic diagram generation method and device for track data, electronic equipment and storage medium
CN114579884A (en) * 2022-02-16 2022-06-03 中冶南方城市建设工程技术有限公司 Shared bicycle trip display method and system in urban built-up area
CN114579884B (en) * 2022-02-16 2024-05-31 中冶南方城市建设工程技术有限公司 Method and system for displaying travel of shared bicycle in urban built-up area
CN116029624A (en) * 2022-07-21 2023-04-28 大连海事大学 Goods source place identification method integrating truck track and POI data
CN116029624B (en) * 2022-07-21 2024-02-06 大连海事大学 Goods source place identification method integrating truck track and POI data

Also Published As

Publication number Publication date
CN109885639B (en) 2022-12-23
CN109885639A (en) 2019-06-14

Similar Documents

Publication Publication Date Title
WO2020186770A1 (en) Visual method for analyzing taxi pick-up or drop-off features
CN107067722B (en) Method for constructing vehicle running condition
WO2018064931A1 (en) Method for estimating travel time distribution of taxi on urban roads when operating states of taxis are considered
Cao et al. The gaps in satisfaction with transit services among BRT, metro, and bus riders: Evidence from Guangzhou
US7440846B2 (en) Vehicle information collection system having point issuing device
CN104021671B (en) The determination methods of the road real-time road that a kind of svm combines with fuzzy Judgment
CN112270460A (en) Goods source station identification method for overweight truck based on multi-source data
CN103295394B (en) Method based on generalized GPS (global position system) data for determining passenger-waiting station alternative addresses of taxis
US8700244B2 (en) System for providing information on power consumption amount of electric vehicle
CN110298500A (en) A kind of urban transportation track data set creation method based on taxi car data and city road network
DE112004002635T5 (en) Route search method for a navigation device
CN106651027B (en) Internet regular bus route optimization method based on social network
CN109993215B (en) Route guidance method based on traffic big data
CN107798867A (en) A kind of congested traffic network equalization methods based on electric automobile and diesel locomotive mixed flow
CN109829573A (en) A kind of intelligent paths planning method merging user driving habits
DE102010017485A1 (en) Method for reducing fuel consumption and travel costs
CN109583611A (en) Customization bus station site selecting method based on net about car data
CN110807917A (en) Highway intelligent security system based on cloud calculates
EP2288872A1 (en) Method and device for calculating a navigation route to associated destinations
CN108648453A (en) A method of traffic trip data portrait is carried out based on mobile phone location fresh information
CN108510168A (en) Commerial vehicle paths planning method based on traffic accident correlation rule
CN108399742B (en) Traffic situation heat map visualization method based on traffic saturation
CN109740957A (en) A kind of urban traffic network node-classification method
CN116882609A (en) Customized bus route multi-objective optimization method based on improved track clustering algorithm
Rodrigues et al. Hybrid context-aware multimodal routing

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19920501

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19920501

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 19920501

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 28.04.2022)

122 Ep: pct application non-entry in european phase

Ref document number: 19920501

Country of ref document: EP

Kind code of ref document: A1