CN110298500B - Urban traffic track data set generation method based on taxi data and urban road network - Google Patents

Urban traffic track data set generation method based on taxi data and urban road network Download PDF

Info

Publication number
CN110298500B
CN110298500B CN201910532080.8A CN201910532080A CN110298500B CN 110298500 B CN110298500 B CN 110298500B CN 201910532080 A CN201910532080 A CN 201910532080A CN 110298500 B CN110298500 B CN 110298500B
Authority
CN
China
Prior art keywords
region
area
road
data
vehicle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910532080.8A
Other languages
Chinese (zh)
Other versions
CN110298500A (en
Inventor
孔祥杰
马凯
商迪
侯明良
郝欣宇
冯嘉伟
夏锋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian University of Technology
Original Assignee
Dalian University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian University of Technology filed Critical Dalian University of Technology
Priority to CN201910532080.8A priority Critical patent/CN110298500B/en
Publication of CN110298500A publication Critical patent/CN110298500A/en
Application granted granted Critical
Publication of CN110298500B publication Critical patent/CN110298500B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • G06Q10/043Optimisation of two dimensional placement, e.g. cutting of clothes or wood
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • Development Economics (AREA)
  • Theoretical Computer Science (AREA)
  • Marketing (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Traffic Control Systems (AREA)

Abstract

A city traffic track data set generation method based on taxi data and a city road network belongs to the field of traffic transportation. The model consists of a preparation layer, a generation layer and a verification layer, generates track data of private cars based on taxi data and urban road network information for the first time, and provides a set of complete verification method to verify the accuracy of the generated data set. The invention provides a city functional area dividing method based on an adjacent road dividing method, which is used for dividing a city into different functional areas. Meanwhile, a regional population weight opportunity model is provided, so that population movement patterns can be more accurately depicted. In order to verify the authenticity and the accuracy of the invention, models based on macroscopic and microscopic angles are respectively provided at a verification layer to verify the authenticity and the accuracy of a data set generated by a verification model. After verification of the verification layer, the method has higher authenticity and accuracy according to the track data of the private car generated by the taxi GPS data and the urban vehicle network information.

Description

Urban traffic track data set generation method based on taxi data and urban road network
Technical Field
The invention belongs to the field of traffic transportation, and relates to a method for generating a social vehicle track data set based on floating vehicle track data in the field of traffic big data.
Background
The Internet of things is an important component of a new generation of information technology and is an important stage of development of the information era. With more and more Vehicles accessing the Internet of Things (Internet of Things), the popularity and application of Internet of Vehicles (Internet of Vehicles) technology is rising. The human movement trajectory data is the basis of vehicle communication in the internet of vehicles and can reflect individual movement patterns and social migration rules in space-time environments. Through the track data of social vehicles (such as private cars) and floating vehicles (such as taxis, buses and the like), complex and various Vehicle Social Networks (VSNs) can be constructed. The trajectory data set of the floating car can be easily obtained from the internet. However, due to the protection of personal privacy and some relevant government policies, it is almost impossible for general researchers to obtain the trajectory data of social vehicles, which greatly hinders the researchers' intensive research and development progress in the related fields.
Disclosure of Invention
The invention aims to overcome the defects of the existing data set Generation research, and provides a three-layer RDMP (Region Division and Mobility Pattern based Vehicle track transaction Model) Vehicle track data set Generation Model which comprises a preparation layer, a Generation layer and a verification layer.
The technical scheme of the invention is as follows:
a method for generating urban traffic track data sets based on taxi data and urban road networks is completed on three layers of RDMP vehicle track data set generation models, and the three layers of RDMP vehicle track data set generation models comprise the following steps:
s1: in a preparation layer, preprocessing original taxi track data, and deleting useless and abnormal data; meanwhile, the method for dividing the urban functional area based on the adjacent road division method is used for dividing the urban area, and the urban road network is constructed, and the method comprises the following steps:
s1.1: pretreatment: deleting useless track data in an empty taxi state, clearing paradoxical data caused by equipment precision and statistical errors, and obtaining available taxi track data;
s1.2: based on an adjacent road segmentation method, a researched city is divided into different areas by utilizing a human moving track and POI interest points, and the traveling of all vehicles is converted into circulation among the different areas, wherein the specific process is as follows:
s1.2.1: using a DMR-based probabilistic topic model of unsupervised learning, taking the whole city as a document, taking different functions owned by each region as topics, taking the movement patterns among the regions as words, taking the characteristic vectors of the POIs as metadata of the document, and taking the frequency density v of the ith POIs in the region r i,r The calculation formula is as follows:
Figure BDA0002100072270000021
wherein, num i Representing the number of types of POIs in area r, S r Represents the area of the region r, and the POI feature vector of the region r is marked as x r =(v 1,r ,v 2,r ,...,v i,r ,...,v F,r 1), metadata representing the region r, where F is the number of POI categories in r, and the last vector 1 is the default feature; the subject distribution of the region r is a K-dimensional vector theta r =(θ r,1 ,θ r,2 ,...,θ r,e ...,θ r,K ),θ r,e Representing the proportion of the subject e in the area r;
s1.2.2: clustering the theme distribution obtained in the S1.2.1 by using a k-means clustering algorithm, inputting the coordinate information of the starting point and the end point of each trip into a kernel density estimation KDE model, and quantifying the functional strength in a functional area; setting n regions, and calculating the functional strength of the region r by using a nuclear density estimator through a nuclear density estimation model KDE model:
Figure BDA0002100072270000022
Figure BDA0002100072270000023
wherein, d i,r Represents the distance from the region i to the region R, R represents the bandwidth, KF (·) represents a gaussian kernel function; after the estimation of the function intensity is finished, the divided areas are annotated to reflect the actual functions of the city, and the number attribute of the area r is defined as K a
S1.2.3: region clustering
Using adjacent road segmentation method, regarding important road as line segment and grid as node, calculating Euclidean distance from each grid to road, and recording road number K nearest to each grid l Clustering each grid in the rasterized map after the calculation is finished, and clustering K a And K l The nodes with the same value are taken as a cluster;
the judgment principle of the important road is as follows: the urban map is rasterized, grids are divided according to the longitude and latitude of 0.001 × 0.001, and roads with the average traffic volume of more than one hundred thousand in the whole day are extracted and taken as important roads.
S1.3: according to an actual research area, cutting and layering original files downloaded from an Open Street Map website, manually modifying road travel limits in the real world, updating the latest traffic conditions, and constructing a road network of a city;
s2: in the generation layer:
s2.1: and calculating the quantity of the social vehicles in each area according to the static proportion of the floating vehicles to the social vehicles in different functional areas, wherein the calculation formula is as follows:
Figure BDA0002100072270000024
Figure BDA0002100072270000031
wherein, SA i The sum of private car trips in the area i; SG j Representing the number of taxis, N, in each grid j contained in region i i Indicating the number of meshes divided in the area i, alpha i Representing the ratio of private cars to taxis, SR, in region i w Representing the number of taxis on road w in grid j within region i, n j Represents the number of roads in grid j;
s2.2: providing a Regional Population Weight Opportunity (RPWO) model, and calculating an origin-destination (OD) matrix of social vehicles, wherein the method comprises the following steps:
s2.2.1: setting the attraction force of an end area j to a start area i and the distance R between the center of gravity of the start area i and the center of gravity of the end area j with the center of gravity of the area j as the center of a circle ij Total population Q in a circular area of radius ji In inverse proportion;
s2.2.2: for center of gravity of
Figure BDA0002100072270000032
The calculation method of (2) is as follows:
Figure BDA0002100072270000033
wherein L represents the number of meshes of the region; x is the number of l And y l Respectively representing the relative longitude and relative latitude of the grid within the area;
s2.2.3: calculating the attraction force A of the end region j to the start region i according to the barycenter of the start region and the end region ji The calculation method is as follows:
Figure BDA0002100072270000034
Figure BDA0002100072270000035
wherein Q ji Is represented by R ij The total number of private cars in a circular area with the radius and the center of gravity of the endpoint area j as the center of a circle; n is a radical of hydrogen s Is the number of regions contained in the circle, beta r Representing the proportion of the area r in the circular area; the total area of the region r is S r The area of the region r in the circular region is
Figure BDA0002100072270000036
P r Is the number of private cars of the region r, A ji Representing the attraction force, o, of the end region j relative to the start region i j Representing the number of private cars in the destination area j, and M representing the total number of private cars in the whole city;
s2.2.4: based on the attractions of the respective zones, the traffic volume from zone i to zone j is calculated:
Figure BDA0002100072270000037
wherein, SA i The number of private car trips from the area i is calculated by S2.1, and n is the number of areas divided in the city;
s2.3: and generating simulation track data by using an SUMO simulation tool in combination with the OD matrix, wherein the steps are as follows:
s2.3.1: classifying the urban roads according to the divided areas; the road network file comprises the longitude and latitude of each road connection point, the longitude and latitude are used for calculating which area the road belongs to, and the road ID contained in each area is written in the road network file;
s2.3.2: using an OD2TRIPS plug-in an SUMO tool, importing an O/D matrix, decomposing the O/D matrix into single vehicle journey, inputting the O/D matrix generated by S2.2, a road network file and a road list contained in an area according to the specific situation and data of a city, setting a generation time period, a trip proportion of each time period and vehicle type parameters, and generating an xml file of a series of vehicle journey information, wherein each journey information comprises a vehicle ID, a departure time, a departure place ID and a destination ID;
s2.3.3: in order to generate the path track information of private vehicles, a DUAROUTER plug-in an SUMO tool is used, a road network file and travel information generated by OD2TRIPS are input, a simulation time period and a shortest path calculation method are set, a simulated vehicle path is generated, the simulated vehicle path comprises a vehicle ID, travel time and the passing condition of a road, and finally vehicle travel track information is generated;
s2.3.4: in order to generate information of a vehicle in a specified time interval every second, wherein the information comprises longitude and latitude positions, driving angles, instantaneous speeds and road numbers, a Trace File Generation plugin in an SUMO tool is used, corresponding information is input in a DUAROUTER function, corresponding configuration files are written at the same time, the time interval is set to delta t minutes, and vehicle tracking files in different time periods are generated;
s3: in the verification layer, a verification model is designed from the macroscopic aspect and the microscopic aspect respectively, and the accuracy and the authenticity of generated data are verified, wherein the verification model comprises the following steps:
s3.1: in the macro model, analyzing and comparing the generated track data with the actual traffic condition, including traffic flow comparison, trip range comparison and traffic condition comparison;
s3.2: in a microscopic verification model, the authenticity of generated data is analyzed and evaluated from the aspects of acceleration and relative distance, and a method for quantitatively detecting track data is designed, and the method comprises the following steps:
s3.2.1: analyzing acceleration, verifying generated track data acceleration and gradient value J thereof e [m/s 3 ]When the result precision is in a reasonable range, the generated data set is proved to have higher internal rationality, and acceleration gradient analysis is carried out by utilizing three indexes: j. the design is a square e Greater than +/-3 m/s 3 Percentage of trace data of threshold, data set J e Maximum, data set J e A minimum value;
s3.2.2: consistency analysis, which verifies the authenticity of the generated data set from the distance between vehicles, and when an abnormally small distance interval exists in the data set, the accuracy of the data set is questionable;
s3.2.3: the distance between the pair of vehicles is calculated as follows:
Figure BDA0002100072270000041
Figure BDA0002100072270000051
represents the vehicle separation of the vehicles v1 and v2 at time T,
Figure BDA0002100072270000052
and
Figure BDA0002100072270000053
meaning that the starting points of the roads are respectively to the point
Figure BDA0002100072270000054
And
Figure BDA0002100072270000055
i.e. the projection of the road alignment of the actual vehicle position;
s3.2.4: the distance between vehicles is directly calculated from projection coordinates corresponding to geographic coordinates of the vehicles by setting that the vehicles always run along a straight line on a road:
Figure BDA0002100072270000056
when in use
Figure BDA0002100072270000057
At least in a moment when the distance between two vehicles is below 5m, the two vehicles collide to cause traffic accidents, therefore, the detection is carried out
Figure BDA0002100072270000058
The proportion of the time in the total data set is used for verifying the accuracy of the data set.
The invention has the beneficial effects that: according to the method, the generation and verification of the private car movement track data are realized by constructing a three-layer vehicle track data set generation model. The invention provides a novel method for dividing functional areas of cities by combining a main road network and travel track data. Meanwhile, the flow pattern of the urban vehicles is deeply analyzed, an inter-area population weight opportunity model is provided, the relation among different urban functional areas is considered, and the inter-urban vehicle migration pattern and law are well described. In addition, the invention constructs a verification model from two aspects of macroscopic view and microcosmic view, and verifies that the private car track data generated by the provided data set generation method has accuracy and use value. The invention provides a new method for generating a private car track data set, which can provide powerful data support for car networking and traffic research work.
Drawings
Fig. 1 is an RDMP model proposed by the present invention, which is a three-layer vehicle trajectory data set generation model, and is composed of a preparation layer, a generation layer, and a verification layer.
Fig. 2 shows the ARS region partitioning method proposed by us. Taking Beijing as an example, a city can be divided into 153 areas.
Fig. 3 is a traffic distribution proportion diagram on the main road comparing the data of the private car in five rings of beijing city generated by the invention with the real data.
Fig. 4 is a comparison graph of travel time of private car data and real data in five rings in beijing city generated by the present invention. Wherein, (a) is the ratio of the travel time from 1 point to 12 points; and (b) is the ratio of travel time from 13 to 24.
Fig. 5 is a comparison graph of the travel distance between the data of the private car in five rings of beijing city and the real data generated by the invention.
FIG. 6 is a comparison between a simulated path generated using a Baidu map API navigation service and a data set generated by the present invention. The result shows that the distribution of the travel route is basically the same within 5km and between 5km and 10km. Wherein, (a) represents that the travel route is within 5km, and (b) represents that the travel route is 5-10km.
FIG. 7 is a comparison of the true data set and the generated data set in terms of acceleration. Wherein, (a) is a vehicle acceleration profile of a real data set in the range of 7; (b) A vehicle acceleration profile for the real dataset at 7; (c) To generate a vehicle acceleration profile for the data set at 7; (d) To generate a vehicle acceleration profile for the data set at 7.
FIG. 8 is a plot of the relative distance of the real data set from the generated data set.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be described in further detail below.
The present example uses a taxi track dataset from beijing, month 11 of 2012 (containing about 2.7 ten thousand GPS records for over 100 billion taxis), and provides a method for generating a social vehicle track dataset based on floating vehicle track data. The core of the invention is a three-layer vehicle track data set generation model (RDMP), and the structure of the model is shown in figure 1. The model consists of a preparation layer, a generation layer and a verification layer, and can generate the track data of private cars based on taxi GPS data and urban vehicle network information. The method comprises the following steps:
s1 is an implementation scheme of a preparation layer of the RDMP model, and in the layer, data are preprocessed to obtain available taxi track data. By preparing the layers, a foundation is laid for the next work.
S1.1: and preprocessing the data set. In the example, we used taxi track data from 11 months 2012 in china beijing, which contains about 2.7 tens of thousands of taxis in excess of 100 billion GPS records. The taxi's location information is updated by the GPS device at a frequency of 1 time every 11 seconds. The original data file is stored in a text document named at the storage time. The taxi has two travel modes, namely a passenger carrying mode and an empty mode. Only in the passenger carrying mode, the running track of the taxi can be regular and can be similar to that of a private carThe travel mode of (1). In contrast, in the empty mode, the taxi only runs purposelessly, which is meaningless for our research. Therefore, when the taxi has no passenger, we delete the useless trajectory data. Furthermore, some data is apparently paradoxical due to device accuracy and statistical errors. For example, some journeys are too long or too short to obtain a valuable analysis result, and this part of the erroneous data also needs to be cleared. We then process the condensed data to obtain each vehicle trip. We extract the same vehicle ID into one file and sort them by travel time. Thereby obtaining the running track of a single taxi. It is worth mentioning that our research focuses on the vehicle movement pattern within the five rings of Beijing: (
Figure BDA0002100072270000062
And
Figure BDA0002100072270000063
). Therefore, the vehicle track data with the longitude and the latitude not within the five-ring range is eliminated.
TABLE 1 proportion of taxi to private car on main traffic trunk in Beijing City
Figure BDA0002100072270000061
Figure BDA0002100072270000071
S1.2: and (5) dividing the region. The areas of Beijing are divided, and the traveling of vehicles is simplified into traveling among different areas. The invention provides an adjacent road segmentation method (ARS) which is based on the utilization of travel tracks and POIs (geographical interest points) of people and is combined with main traffic lines of a road network to carry out regional division and comprises the following specific steps:
s1.2.1: identifying the function of a single area, an area with multiple functions, as if it contained multiple functions, using a topic model-based approachA document of a subject. The city to be researched is taken as a document as a whole, and different functions owned by each region are taken as subjects. Meanwhile, the moving mode between the regions is taken as a word, and the POIs characteristic vector is taken as the metadata of the document. In the present invention, a DMR-based topic model in unsupervised learning is used, based on the Late Dirichlet Allocation (LDA) and Dirichlet Multinominal Regression (DMR). The POIs characteristic vector and the movement mode are combined, and the function of the region is comprehensively researched from two aspects. For each region r, the number of different classes of POIs in the region is statistically available. Frequency density v of i-th POIs in region r i,r The calculation formula is as follows:
Figure BDA0002100072270000072
wherein, num i Representing the number of types of POIs in area r, S r Representing the area of region r. In addition, for region r, its POI feature vector may be denoted as x r =(v 1,r ,v 2,r ,...,v i,r ,...,v F,r 1), representing the metadata of the region r, F is the number of POI classes in r, and the last vector 1 is the default feature. After parameter estimation using the DMR model, for region r, its topic distribution is a K-dimensional vector θ r =(θ r,1 ,θ r,2 ,...,θ r,e ...,θ r,K ),θ r,e Representing the proportion of the subject e in the region r.
S1.2.2: and (4) clustering the functional regions by using a k-means algorithm which is a classic clustering algorithm in unsupervised machine learning. Experiments confirm that the clustering result is optimal when the final clustering value is set to 9 for Beijing. To quantify the popularity and extent of the area within a functional zone, the functional strength of each functional zone is estimated. The popularity of a functional area is potentially linked to traffic volume, which represents human movement patterns meaning the functional strength of the area. And inputting the coordinate information of the starting point and the end point of each travel into a kernel density estimation KDE model, and quantifying the function intensity in the function area. Assuming n regions, the functional strength of region s is calculated by the KDE model using the kernel density estimator:
Figure BDA0002100072270000081
Figure BDA0002100072270000082
wherein, d i,r Represents the distance of the region i from the region R, R represents the bandwidth, KF (·) represents the gaussian kernel function. After the estimation of the function strength is finished, the divided areas are annotated to reflect the actual functions of the city, and the number attribute of the divided areas is defined as K a . We then consider region labeling from four perspectives. First, we ranked the frequencies of POIs in the functional regions based on the average frequency density of the feature vectors of POIs in each region, and ranked the frequency of all the functional regions containing the POIs. Second, we count the most frequent movement patterns within each functional zone. Thirdly, we explore the most representative POIs in each functional kernel by using the functional strength, and then carry out regional annotation, and fourthly, we carry out artificial marking according to the actual situation, such as the historical sites of interest.
S1.2.3: firstly, the functional area is rasterized, and is divided into a large number of grids on the map projection according to the latitude and longitude of 0.001. The relative length and width of each grid is defined as 1, and there is a fixed ID. If a plurality of different areas are contained in one grid, the grid is classified into the functional area with the largest occupied area. Aiming at Beijing city, a functional area depth division method based on an adjacent road division method is provided. And screening 18 main urban roads according to the road traffic flow information counted by the government, and numbering according to Arabic numerals 1-18. Each important road is marked in the rasterized map and treated as a line segment. At the same time, we consider each mesh approximation as a node. Subsequently, the euclidean distance from each node (mesh) to the line segment (road) is calculated, and the road number Kl closest to each mesh is recorded. After the distance calculation is finished, each grid node has two attribute values, namely the functional area number Ka and the nearest road number Kl. And clustering each grid in the rasterized map, and taking the nodes with the same Ka and kl values as a cluster. Finally, 153 subdivided functional areas are obtained. Due to the fact that the proportion of taxis to private cars is different between different roads, in the dividing method, the same region not only has the same urban function, but also the proportion of the taxis to the private cars is the same.
S1.3 road network description. The city road network map data may be downloaded free of charge from an OpenStreetMap (OSM) or other open source website. The OSM data may be uploaded by any user so that everyone can maintain and modify the map data. In our study, we downloaded the OSM file from beijing, including road, underground, various construction facility information, reflecting the geographic information of the city. However, due to the open source nature, there may be some errors between the downloaded data and the actual situation. In order to build an accurate simulated road network, the road topology is modified to match the real world. We modify it using Java OpenStreetMap (JOSM) technology, which is a free tool for editing open street map geographic information. Furthermore, since the template of the present invention consists in simulating and generating private cars, not all vehicles, we have eliminated redundant information on railways, sidewalks, etc. And after relevant data such as railway data and the like are deleted, comparing the map data with the real world and modifying the map data.
And S2 is an implementation scheme of a generation layer of the RDMP model, and a private car track data set is generated by utilizing track data obtained by preprocessing of the previous layer in the step 1). The invention provides a regional-based population weight opportunity (RPWO) model, which predicts the private traffic volume origin-destination matrix (ODmatrix) of two regions by using the traffic volume of each divided region. We then use the SUMO simulation tool to convert the matrix between the entire regions into microscopic trajectory paths for each private car.
S2.1: and calculating the traffic volume between the areas. The traffic volume of a vehicle refers to the number of vehicles passing a certain section of a road in a certain time. The traffic volume can reflect the total traffic flow of the road, and has important research value. In the invention, the traffic volume of the vehicle movement between the divided areas is predicted by calculating the traffic volume in each area and utilizing the inter-area population weight opportunity model provided by the invention. A simulated social vehicle dataset is then generated based on the predicted amount of traffic between the regions. In step 1) the beijing has been divided into different regions. The daily traffic volume in each area is now calculated. The traffic volume in the area is accumulated by the traffic volume of all roads in the area, so the traffic volume of a single road is calculated at the beginning. The traffic volume on the road is determined by social vehicles and floating vehicles, but the obtained data is only the track data of the floating vehicles (taxis). Therefore, we need to calculate the traffic volume of the social vehicle (private car) by the traffic volume of the floating vehicle. The proportion of the social vehicles and the floating vehicles on each road is different, and the traffic volume of the private car can be easily calculated through the existing taxi traffic volume according to the proportion relation. Based on the information provided in the "annual report of traffic development in beijing in 2012", we obtained different ratios of the vehicles moving on the main traffic roads (shown in table 1) to the social vehicles. Because we divide the area based on these major traffic roads, we can assume that all roads within the same area have the same proportion of floating vehicles and social vehicles as the major traffic roads. We calculate the number of social vehicles in each functional zone by the following formula:
Figure BDA0002100072270000091
Figure BDA0002100072270000092
wherein, SA i The sum of private car trips in the area i; SG j Representative regionNumber of taxis in each grid j contained in i, N i Indicating the number of meshes divided in the area i, alpha i Representing the ratio of private cars to taxis, SR, in region i w Representing the number of taxis on road w in grid j within region i, n j Representing the number of roads in grid j. The number of private cars in each functional area is obtained through the two formulas.
S2.2: and (3) a space-time interaction model. After the traffic volume of each area in Beijing is obtained, the invention continues to construct the human trip mode under the city scale. Despite the long history of building human movement models, researchers still lack highly accurate methods of predicting urban movement patterns, especially if the data types are not sufficiently diverse. The present invention proposes an inter-area population weight opportunity model (RPWO) to capture the potential drivers of human movement patterns on a city scale, which does not depend on any adjustable parameters. It is worth mentioning that the results of the study on the model show that the model is very suitable for the population flow pattern between cities, but not between countries, which indicates the diversity of human flow at different spatial scales. Different from the traditional population weight opportunity model, the method takes region factors into consideration when a human migration model is considered, and performs cluster-cluster research. That is, the present invention studies the migration pattern of human between different regions by a region division method. The method comprises the following specific steps:
s2.2.1: in the RPWO model, the attraction force of an end region j to a start region i and the distance R between the center of gravity of the start region i and the center of gravity of the end region j around the center of gravity of the region j are set ij Total population Q in a circular area of radius ji In inverse proportion;
s2.2.2: to center of gravity of
Figure BDA0002100072270000101
The calculation method of (2) is as follows:
Figure BDA0002100072270000102
where L represents the number of grid areas of the area (0.001 latitude and longitude as division). x is the number of l And y l Respectively, the relative longitude and relative latitude of the grid nodes within the area.
S2.2.3: then, the attraction force to the start point by the end point (i.e., the start point area to end point area traffic volume) can be calculated from the center of gravity of the start point and the end point area:
Figure BDA0002100072270000103
Figure BDA0002100072270000104
wherein Q is ji Is represented by R ij The total number of private cars in a circular area with the radius and the center of gravity of the endpoint area j as the center of a circle; n is a radical of s Is the number of regions contained in the circle, beta r Represents the proportion of the region r in the circular region; the total area of the region r is S r The area of the region r in the circular region is
Figure BDA0002100072270000105
P r Is the number of private cars of the region r, A ji Representing the attraction force, o, of the end region j relative to the start region i j Representing the number of private cars in the destination area j, and M representing the total number of private cars in the whole city;
s2.2.4: based on the attractions of the respective zones, the traffic volume from zone i to zone j is calculated:
Figure BDA0002100072270000106
wherein, SA i The number of private car trips from the area i is calculated by S2.1, and n is the number of areas divided in the city;
s2.3: the specific implementation of trajectory simulation and data generation is as follows:
s2.3.1: the invention firstly classifies the urban roads according to the divided areas. The road network file contains the longitude and latitude of each road connection point, the longitude and latitude are used for calculating which area the road belongs to, and the road ID contained in each area is written in the road network file.
S2.3.2: in S2.2, the urban traffic volume O/D matrix is obtained, and the data set simulation generation work is carried out by combining the modified road network file. To achieve this goal, we use the SUMO tool. First, we use the OD2TRIPS plug-in the SUMO tool, import the O/D matrix and decompose it into individual vehicle TRIPS. The traffic volume O/D matrix may reflect the overall people's travel patterns within the city. Further, from a microscopic perspective, the individual vehicle path information generated by the matrix may reflect the individual's movement pattern. In the OD2TRIPS plug-in, O/D matrix information, a road network file and a road list contained in an area are input, and a generation time period, a trip proportion of each time period and a generation vehicle type parameter are set. Then, an xml file of a series of pieces of vehicle travel information each including a vehicle ID, a departure time, a departure place ID, and a destination ID is generated.
S2.3.3: with the help of OD2TRIPS, it is not obvious enough to generate origin and destination information for each vehicle trip. Therefore, we have used the DUAROUTER plug-in the SUMO tool, which uses the shortest path to calculate the vehicle path that the SUMO may use, generating path trajectory information for the vehicle. Inputting road network files and travel information generated by OD2TRIPS, setting a simulation time period and a shortest path calculation method, and finally generating vehicle travel track information comprising vehicle ID, travel time and road passing condition. The vehicle track information can help us to analyze urban road traffic and regional travel modes. Our generated daily trace dataset size is about 3GB.
S2.3.4: in addition to vehicle trajectory information, microscopic information of the instantaneous speed, relative position, latitude and longitude of the vehicle in a unit time period (per second) is also important for data verification in the present invention. We use the Trace File Generation plugin in the SUMO tool to generate vehicle information including latitude and longitude position, driving angle, instantaneous speed, road number of the vehicle every second within a specified time interval. We input the corresponding information in the durauter function while writing the corresponding configuration file, set the time interval to 15 minutes, and generate vehicle tracking files for different time periods. The size of this type of data set generated a day is about 200GB.
S3 is an embodiment of the validation layer of the RDMP model in the present invention. After social vehicle trajectory data are generated in the step 2), a verification model is designed, and the accuracy and the authenticity of the generated data are verified. In the verification layer, the invention verifies the data from the macroscopic aspect and the microscopic aspect respectively.
S3.1: in the macro model, the generated track data is analyzed and compared with the actual traffic condition described in the Beijing traffic development annual report of 2012.
S3.1.1: and comparing the traffic flow. FIG. 3 is a comparison diagram of traffic flow of Beijing major roads. The actual data used was the main road traffic flow data published by the beijing city traffic development research center in 2012. The results show that the all day traffic of the west and east tetracyclic rings is in the front, both actual and generated data. The traffic flow of the west pentacycle and the south dicyclo is low, which indicates that the traffic load is light. From the overall comparison results, the data generated by the method are better matched with the actual data except for the south five-ring and the east five-ring.
S3.1.2: and (4) trip range. For human movement studies, travel time distribution and distance distribution are two key parameters. Through studying the travel amount in different periods, researchers can provide a better travel optimization scheme, relieve road conditions and improve travel efficiency. In addition, the travel distance distribution of researchers can also play an important role in road planning and travel prediction. Thus, in the present invention, we use the generated trajectory data set to analyze the travel time and distance distribution. The accuracy of the simulation data was evaluated by comparison with the actual data. Fig. 4 is a distribution of the travel amount of the resident travel time. The data involved in the comparison included official statistics of the generated data, the first half of the year 2012 and the second half of the year 2012. The result shows that the generated trajectory data has the same travel characteristics as the actual data. Furthermore, as can be seen from fig. 7, 7. The traffic volume in these two periods is about 50% of the total daily traffic volume. In terms of resident travel distance distribution, fig. 5 shows our analysis results. We compare the generated data with official trip distance distribution data of the first and second year of 2012. From the distribution of travel distances, the number of travels is inversely proportional to the total distance. As the distance increases, the number of trips decreases. In the aspect of driving, people prefer short-distance travel, and the shortest distance is 0-5 kilometers and accounts for more than 40%. In consideration of specific conditions, when the travel distance is too long, people can select travel modes such as trains and subways instead of driving by considering factors such as oil consumption and time.
S3.1.3: traffic conditions. From a macroscopic traffic flow situation, the overall distance traveled by the vehicle and the time traveled are regular. In the present invention, we validate the generated data set using the navigation service of Baidu Map APIs. Today, as mobile devices are used more and more, everyone has experience with the map service application. In the software, the positions of a departure place and a destination are input, and a proper travel mode, such as walking, public transport, private car and the like, is selected to obtain the estimated route length and travel time. These results are based on real-world historical data, including data provided by the department of transportation, GPS generated vehicle trajectory data, and various applications and electronic devices that can transmit the position fix. It is worth mentioning that currently, mainstream Map service providers such as Google Map and Baidu Map provide corresponding api interfaces for software developers, which facilitates direct service invocation. Navigation services can be used to verify two important indicators in traffic flow:
the route length, the indicator representing the length of the journey. Using Baidu Map APIs, we input the coordinates of the start and end points of each trip in the generated dataset into the routing function. Through the navigation service, a corresponding path and a corresponding travel length can be generated;
travel time the indicator represents travel time. Similar to the method of obtaining the route length, we use the navigation service to estimate the duration of travel. As can be seen from fig. 5, more than 70% of the vehicles travel less than 10km. Therefore, we focus on the driving of vehicles within 10km. FIG. 6 is a scatter plot of travel time and route length predicted by our generated data and Baidu Maps APIs. Fig. 6 (a) shows the driving condition of the vehicle within a range of 5km, and fig. 6 (b) shows the driving condition of the vehicle within a range of 5-10km. We note that the travel time generated by the present invention overlaps to a large extent with the evaluation provided by the navigation service in both route length ranges. This also reflects that the generated vehicle speed distribution is closer to the vehicle speed distribution of the real vehicle. In addition, compared with the distance of 5-10km, the generated travel time data is closer to the real data in the short-distance travel within 5km, which proves that the model is more suitable for generating travel tracks with shorter distances.
S3.2: in the microscopic model, whether the dataset considered is itself contrary to reality. The acceleration analysis is first performed, starting with the calculation of the vehicle acceleration and its gradient, to analyze whether the data set is reliable. Then, consistency analysis is carried out, vehicle pairs are randomly extracted, and the relative distance between the two vehicles is analyzed. The invention designs a method for quantitatively detecting track data. In this section, we selected the early peak time of the working day (7 00) as the study time, in particular 7. In addition, we used four sets of observations in total for comparison, with each set of data being separated by 15 minutes. These trajectory data are from 7.
S3.2.1: and (6) analyzing the acceleration. Vehicle acceleration is an important component in studying vehicle dynamics and traffic flow. Therefore, it is necessary to verify the accuracy of the acceleration of the generated trajectory data. If the accuracy of the result is within a reasonable range, the generated data set is proved to have high intrinsic reasonableness. There is an obvious way to validate the acceleration data, that is, to check its distribution throughout the data set. During data acquisition and estimation, two types of problems can be clearly found, namely infeasible extrema and distributed irregular shapes. FIG. 7 is a graph of acceleration distribution frequencies for a raw data set and a generated data set. As can be seen from fig. 7, the frequency distribution of the vehicle acceleration is a normal distribution regardless of the real data or the generated data, which proves that the method provided by the present invention is feasible.
In addition to the distribution of the acceleration values, the acceleration gradient can also be an important indicator of the data quality. Acceleration gradient J e [m/s 3 ]Representing the change in acceleration with time, is the derivative of the acceleration. In the present invention, we consider that the concentration is within. + -. 3m/s 3 The actual acceleration gradient values on the left and right are acceptable values under study. Therefore, we propose three indices for acceleration gradient analysis:
·J e greater than +/-3 m/s 3 A threshold percentage of trajectory data;
j maximum in dataset;
the j minimum in the dataset.
The statistical results of the acceleration gradient errors obtained by analysis are shown in table 2. As can be seen, +/-3 m/s 3 Other J e The percentages of the values were 2.99% (raw data (7-00-7)) and 3.28% (raw data (7. This means that the error statistics for j are fairly good. In the lateral comparison, the error of the data set is less than 10%, which shows that the data set is relatively accurate. In addition, the maximum and minimum j values of the selected data both reach relatively unreasonable values.
TABLE 2 acceleration gradient error statistics of true and generated trajectories
Figure BDA0002100072270000131
Figure BDA0002100072270000141
S3.2.2: and (5) analyzing consistency. The vehicle must maintain a reasonable distance from other vehicles during travel. Otherwise, traffic accidents are likely to occur. This situation is also reflected in the vehicle trajectory data set. The invention verifies the authenticity of the generated data set from the distance between the vehicles. The accuracy of a data set is questionable if there are many unusually small distance intervals in the data set.
When we are concerned with two vehicles in close proximity to each other, the distance between the pair of vehicles can be used to quantify the error in estimating the trajectory data. In fact, at a certain moment, the distance between a pair of vehicles can be measured directly from the positions of two vehicles at that moment.
In our study we assume that it is the length of the road segment between spaced vehicles. Formally defined as:
Figure BDA0002100072270000142
Figure BDA0002100072270000143
represents the vehicle separation of vehicles v1 and v2 at time T,
Figure BDA0002100072270000144
and
Figure BDA0002100072270000145
meaning that the starting points of the roads are respectively to the point
Figure BDA0002100072270000146
And
Figure BDA0002100072270000147
i.e. the road-aligned projection of the actual vehicle position.
To simplify the calculation process, we assume that the vehicle always travels in a straight line on the road. Then, the distance between the vehicles is directly calculated from the projection coordinates corresponding to the geographic coordinates of the vehicles:
Figure BDA0002100072270000148
under the general condition of
Figure BDA0002100072270000149
When the vehicle drops below 5m at least in one moment, the two vehicles collide with each other, resulting in traffic accidents. Thus, by detecting
Figure BDA00021000722700001410
The proportion of the time in the total data set can verify the accuracy of the data set. If there are a large number of outliers in the dataset, the dataset is most likely to be problematic. For the consistency analysis, the following statistics are meaningful:
this is the total number of vehicle pairs in each selected data set;
average vehicle separation of vehicle pairs in the data set;
number and proportion of vehicles with speed below 5 meters;
vehicle pair maximum separation.
The statistical results of the analyzed consistency are shown in table 3. We specify that two vehicles within 50m of each other can be considered a set of vehicle pairs. From the calculation results, we can know that the total number of vehicle pairs is about 4000 to 5000. In all selected data sets, we used the original data sets at 7. For the data set we generated, at two time periods 7-00-7. It can be seen that the abnormal data in the generated data set is kept in a small range, and the generated data is proved to have high authenticity. Further, in the generated data set, the average vehicle-to-vehicle distance was about 27m, which is not much different from the actual average value (about 22 m). Fig. 8 reflects the frequency of occurrence of the vehicle to the pitch length. Obviously, the four groups of observation data have very similar distribution, which means that the generated data set has a vehicle movement pattern similar to real data, and the model provided by the invention is proved to be good in effect.
TABLE 3 consistency (Interval indicator) error statistics for real and generated trajectories
Figure BDA0002100072270000151
While the invention has been described in connection with specific embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (2)

1. A method for generating an urban traffic track data set based on taxi data and an urban road network is characterized in that the method for generating the urban traffic track data set is completed on a three-layer RDMP vehicle track data set generation model, and the three-layer RDMP vehicle track data set generation model comprises the following steps:
s1: in a preparation layer, preprocessing original taxi track data, and deleting useless and abnormal data; meanwhile, the method for dividing the urban functional area based on the adjacent road division method is used for dividing the urban area, and the urban road network is constructed, and the method comprises the following steps:
s1.1: pretreatment: deleting useless track data in an empty state, clearing up paradoxical data caused by equipment precision and statistical errors, and obtaining available taxi track data;
s1.2: based on an adjacent road segmentation method, a researched city is divided into different areas by using a human moving track and POI interest points, and the travel of all vehicles is converted into circulation among the different areas, and the specific process is as follows:
s1.2.1: using DMR-based probability topic model of unsupervised learning, using city as a document, using different functions of each region as topics, using movement patterns between regions as words, using POIs feature vector as metadata of document, and frequency density v of the ith POIs in region r i,r The calculation formula is as follows:
Figure FDA0002100072260000011
wherein, num i Representing the number of types of POIs in area r, S r Representing the area of the region r, the POI feature vector of the region r is marked as x r =(v 1,r ,v 2,r ,...,v i,r ,...,v F,r 1), metadata representing the region r, where F is the number of POI categories in r, and the last vector 1 is the default feature; the subject distribution of the region r is a K-dimensional vector theta r =(θ r,1 ,θ r,2 ,...,θ r,e ...,θ r,K ),θ r,e Representing the proportion of the subject e in the region r;
s1.2.2: clustering the theme distribution obtained in the S1.2.1 by using a k-means clustering algorithm, inputting the coordinate information of the starting point and the end point of each trip into a KDE model, and quantifying the functional strength in a functional area; setting n regions, and calculating the functional strength of the region r by using a nuclear density estimator through a nuclear density estimation model KDE model:
Figure FDA0002100072260000012
Figure FDA0002100072260000013
wherein d is i,r Represents the distance from the region i to the region R, R represents the bandwidth, KF (·) represents a gaussian kernel function; after the estimation of the function intensity is finished, the divided areas are annotated to reflect the actual functions of the city, and the number attribute of the area r is defined as K a
S1.2.3: region clustering
Using adjacent road segmentation method, regarding important road as line segment and grid as node, calculating Euclidean distance from each grid to road, and recordingRecording the road number K with the nearest distance of each grid l Clustering each grid in the rasterized map after the calculation is finished, and clustering K a And K l The nodes with the same value are taken as a cluster;
s1.3: according to an actual research area, cutting and layering original files downloaded from an Open Street Map website, manually modifying road travel limits in the real world, updating the latest traffic conditions, and constructing a road network of a city;
s2: in the generation layer:
s2.1: and calculating the quantity of the social vehicles in each area according to the static proportion of the floating vehicles to the social vehicles in different functional areas, wherein the calculation formula is as follows:
Figure FDA0002100072260000021
Figure FDA0002100072260000022
wherein, SA i The sum of private car trips in the area i; SG j Representing the number of taxis, N, in each grid j contained in region i i Indicating the number of meshes divided in the area i, alpha i Representing the ratio of private cars to taxis, SR, in region i w Representing the number of taxis on road w in grid j within region i, n j Represents the number of roads in grid j;
s2.2: providing a Regional Population Weight Opportunity (RPWO) model, and calculating an origin-destination (OD) matrix of social vehicles, wherein the method comprises the following steps:
s2.2.1: setting the attraction force of an end point area j to a start point area i, and the distance R between the center of gravity of the start point area i and the center of gravity of the end point area j with the center of gravity of the area j as the center ij Total population Q in circular area of radius ji In inverse proportion;
s2.2.2: to center of gravity of
Figure FDA0002100072260000023
The calculation method of (2) is as follows:
Figure FDA0002100072260000024
wherein L represents the number of meshes of the region; x is a radical of a fluorine atom l And y l Respectively representing the relative longitude and relative latitude of the grid within the area;
s2.2.3: calculating attraction A of the end point region j to the start point region i based on the center of gravity of the start point region and the end point region ji The calculation method is as follows:
Figure FDA0002100072260000025
Figure FDA0002100072260000026
wherein Q is ji Is represented by R ij The total number of private cars in a circular area with the radius and the center of gravity of the endpoint area j as the center of a circle; n is a radical of s Is the number of regions contained in the circle, beta r Represents the proportion of the region r in the circular region; the total area of the region r is S r The area of the region r in the circular region is
Figure FDA0002100072260000031
P r Is the number of private cars of the region r, A ji Representing the attraction force, o, of the end region j relative to the start region i j Representing the number of private cars in the destination area j, and M representing the total number of private cars in the whole city;
s2.2.4: based on the attractions of the respective zones, the traffic volume from zone i to zone j is calculated:
Figure FDA0002100072260000032
wherein, SA i The number of private car trips from the area i is calculated by S2.1, and n is the number of areas divided in the city;
s2.3: and generating simulation track data by using an SUMO simulation tool in combination with the OD matrix, wherein the steps are as follows:
s2.3.1: classifying the urban roads according to the divided areas; the road network file comprises the longitude and latitude of each road connection point, the longitude and latitude are used for calculating which area the road belongs to, and the road ID contained in each area is written in the road network file;
s2.3.2: using an OD2TRIPS plug-in an SUMO tool, importing an O/D matrix, decomposing the O/D matrix into single vehicle journey, inputting the O/D matrix generated by S2.2, a road network file and a road list contained in an area according to the specific situation and data of a city, setting a generation time period, a trip proportion of each time period and vehicle type parameters, and generating an xml file of a series of vehicle journey information, wherein each journey information comprises a vehicle ID, a departure time, a departure place ID and a destination ID;
s2.3.3: in order to generate the path track information of private vehicles, a DUAROUTER plug-in an SUMO tool is used, a road network file and travel information generated by OD2TRIPS are input, a simulation time period and a shortest path calculation method are set, a simulated vehicle path is generated, the simulated vehicle path comprises a vehicle ID, travel time and the passing condition of a road, and finally vehicle travel track information is generated;
s2.3.4: in order to generate information of a vehicle, including longitude and latitude positions, driving angles, instantaneous speeds and road numbers, generated every second within a specified time interval, using a Trace File Generation plugin in an SUMO tool, inputting corresponding information in a DUAROUTER function, simultaneously writing a corresponding configuration File, setting the time interval to delta t minutes, and generating vehicle tracking files of different time periods;
s3: in the verification layer, a verification model is designed from the macroscopic aspect and the microscopic aspect respectively, and the accuracy and the authenticity of generated data are verified, wherein the verification model comprises the following steps:
s3.1: in the macro model, analyzing and comparing the generated track data with the actual traffic condition, including traffic flow comparison, trip range comparison and traffic condition comparison;
s3.2: in a microscopic verification model, the authenticity of generated data is analyzed and evaluated from the aspects of acceleration and relative distance, and a method for quantitatively detecting track data is designed, and the method comprises the following steps:
s3.2.1: analyzing acceleration, verifying generated track data acceleration and gradient value J thereof e [m/s 3 ]When the result precision is in a reasonable range, the generated data set is proved to have higher internal rationality, and acceleration gradient analysis is carried out by utilizing three indexes: j. the design is a square e Greater than +/-3 m/s 3 Percentage of trace data of threshold, data set J e Maximum, data set J e A minimum value;
s3.2.2: consistency analysis, namely verifying the authenticity of a generated data set from the distance between vehicles, and when an abnormally small distance interval exists in the data set, the accuracy of the data set is doubtful;
s3.2.3: the distance between the pair of vehicles is calculated as follows:
Figure FDA0002100072260000041
Figure FDA0002100072260000042
represents the vehicle separation of the vehicles v1 and v2 at time T,
Figure FDA0002100072260000043
and
Figure FDA0002100072260000044
meaning that the starting points of the roads are respectively to the point
Figure FDA0002100072260000045
And
Figure FDA0002100072260000046
i.e. the projection of the road alignment of the actual vehicle position;
s3.2.4: the distance between vehicles is directly calculated from projection coordinates corresponding to geographic coordinates of the vehicles by setting that the vehicles always run along a straight line on a road:
Figure FDA0002100072260000047
when in use
Figure FDA0002100072260000048
At least in a moment when the distance between the two vehicles is below 5m, the two vehicles collide to cause traffic accidents, therefore, the detection is carried out
Figure FDA0002100072260000049
The proportion of the time to the total data set is used for verifying the accuracy of the data set.
2. The method for generating an urban traffic trajectory data set based on taxi data and an urban road network according to claim 1, wherein the principle of judging important roads is as follows: the urban map is rasterized, grids are divided according to the longitude and latitude of 0.001 × 0.001, and roads with the average traffic volume of more than one hundred thousand in the whole day are extracted and taken as important roads.
CN201910532080.8A 2019-06-19 2019-06-19 Urban traffic track data set generation method based on taxi data and urban road network Active CN110298500B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910532080.8A CN110298500B (en) 2019-06-19 2019-06-19 Urban traffic track data set generation method based on taxi data and urban road network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910532080.8A CN110298500B (en) 2019-06-19 2019-06-19 Urban traffic track data set generation method based on taxi data and urban road network

Publications (2)

Publication Number Publication Date
CN110298500A CN110298500A (en) 2019-10-01
CN110298500B true CN110298500B (en) 2022-11-08

Family

ID=68028301

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910532080.8A Active CN110298500B (en) 2019-06-19 2019-06-19 Urban traffic track data set generation method based on taxi data and urban road network

Country Status (1)

Country Link
CN (1) CN110298500B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111104792B (en) * 2019-12-13 2023-05-23 浙江工业大学 Traffic track data semantic analysis and visualization method based on topic model
CN111126713B (en) * 2019-12-31 2023-05-09 方正国际软件(北京)有限公司 Space-time hot spot prediction method and device based on bayonet data and controller
CN111401663B (en) * 2020-04-12 2021-04-27 广州通达汽车电气股份有限公司 Method and device for updating public transport space-time OD matrix in real time
CN111583641A (en) * 2020-04-30 2020-08-25 北京嘀嘀无限科技发展有限公司 Road congestion analysis method, device, equipment and storage medium
CN111651502B (en) * 2020-06-01 2021-09-14 中南大学 City functional area identification method based on multi-subspace model
CN111966770A (en) * 2020-07-21 2020-11-20 中国地质大学(武汉) Urban street function identification method and system based on geographic semantic word embedding
CN112101132B (en) * 2020-08-24 2022-04-19 西北工业大学 Traffic condition prediction method based on graph embedding model and metric learning
CN112559909B (en) * 2020-12-18 2022-06-21 浙江工业大学 Business area discovery method based on GCN embedded spatial clustering model
CN112632208B (en) * 2020-12-25 2022-12-16 际络科技(上海)有限公司 Traffic flow trajectory deformation method and device
CN112906948B (en) * 2021-02-02 2023-12-22 湖南大学 Urban area attraction prediction method, device and medium based on private car track big data
CN113111514B (en) * 2021-04-12 2022-05-27 清华大学深圳国际研究生院 Vehicle microscopic driving scene simulation method and computer readable storage medium
CN113343781B (en) * 2021-05-17 2022-02-01 武汉大学 City functional area identification method using remote sensing data and taxi track data
CN113298144A (en) * 2021-05-24 2021-08-24 中南大学 Urban three-generation space identification and situation analysis method based on multi-source data
CN113254565B (en) * 2021-06-21 2021-09-28 浙江口碑网络技术有限公司 Region identification method and device, computer equipment and computer readable storage medium
CN114724362B (en) * 2022-03-23 2022-12-27 中交信息技术国家工程实验室有限公司 Vehicle track data processing method
CN115995151B (en) * 2023-03-06 2023-12-22 北京白龙马云行科技有限公司 Network vehicle-closing abnormal behavior detection method applied to city management

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006234569A (en) * 2005-02-24 2006-09-07 Matsushita Electric Ind Co Ltd Generation method and device of traffic information, and reproduction method and device
CN103531024A (en) * 2013-10-28 2014-01-22 武汉旭云科技有限公司 Dynamic traffic network urban road feature model and modeling method thereof
CN105117595A (en) * 2015-08-19 2015-12-02 大连理工大学 Floating car data based private car travel data integration method
GB201711408D0 (en) * 2016-12-30 2017-08-30 Maxu Tech Inc Early entry
CN107392245A (en) * 2017-07-19 2017-11-24 南京信息工程大学 A kind of taxi trajectory clustering algorithm Tr OPTICS
CN107609107A (en) * 2017-09-13 2018-01-19 大连理工大学 A kind of trip co-occurrence phenomenon visual analysis method based on multi-source Urban Data
GB201803663D0 (en) * 2017-01-10 2018-04-25 Beijing Didi Infinity Technology & Dev Co Ltd No details
CN108681795A (en) * 2018-05-23 2018-10-19 华南理工大学 Electric vehicle charging load spatio-temporal prediction method under urban road network and the constraint of user's Trip chain
CN109166317A (en) * 2018-10-29 2019-01-08 东北林业大学 Method is determined by the time based on the urban transportation path of state feature
CN109272175A (en) * 2018-11-15 2019-01-25 山东管理学院 A kind of data collection system and method based on Urban Migrant network
CN109686091A (en) * 2019-01-17 2019-04-26 中南大学 A kind of magnitude of traffic flow based on multisource data fusion fills up algorithm

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8799038B2 (en) * 2011-09-07 2014-08-05 National Tsing Hua University Dynamic taxi-sharing system and sharing method thereof

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006234569A (en) * 2005-02-24 2006-09-07 Matsushita Electric Ind Co Ltd Generation method and device of traffic information, and reproduction method and device
CN103531024A (en) * 2013-10-28 2014-01-22 武汉旭云科技有限公司 Dynamic traffic network urban road feature model and modeling method thereof
CN105117595A (en) * 2015-08-19 2015-12-02 大连理工大学 Floating car data based private car travel data integration method
GB201711408D0 (en) * 2016-12-30 2017-08-30 Maxu Tech Inc Early entry
GB201803663D0 (en) * 2017-01-10 2018-04-25 Beijing Didi Infinity Technology & Dev Co Ltd No details
CN107392245A (en) * 2017-07-19 2017-11-24 南京信息工程大学 A kind of taxi trajectory clustering algorithm Tr OPTICS
CN107609107A (en) * 2017-09-13 2018-01-19 大连理工大学 A kind of trip co-occurrence phenomenon visual analysis method based on multi-source Urban Data
CN108681795A (en) * 2018-05-23 2018-10-19 华南理工大学 Electric vehicle charging load spatio-temporal prediction method under urban road network and the constraint of user's Trip chain
CN109166317A (en) * 2018-10-29 2019-01-08 东北林业大学 Method is determined by the time based on the urban transportation path of state feature
CN109272175A (en) * 2018-11-15 2019-01-25 山东管理学院 A kind of data collection system and method based on Urban Migrant network
CN109686091A (en) * 2019-01-17 2019-04-26 中南大学 A kind of magnitude of traffic flow based on multisource data fusion fills up algorithm

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
TWIT:社交网络中局部信任值的双向计算;李凤岐等;《计算机工程与应用》;20160215(第04期);全文 *
基于出租车轨迹数据的交通异常识别算法;王雷等;《科学技术与工程》;20181118(第32期);全文 *
基于城市交通大数据的车辆类别挖掘及应用分析;纪丽娜等;《计算机应用》;20190121(第05期);全文 *

Also Published As

Publication number Publication date
CN110298500A (en) 2019-10-01

Similar Documents

Publication Publication Date Title
CN110298500B (en) Urban traffic track data set generation method based on taxi data and urban road network
Zhao et al. Uncovering the spatiotemporal patterns of CO2 emissions by taxis based on Individuals' daily travel
Kong et al. RMGen: A tri-layer vehicular trajectory data generation model exploring urban region division and mobility pattern
CN110836675B (en) Decision tree-based automatic driving search decision method
CN104282142B (en) Bus station arrangement method based on taxi GPS data
Shen et al. Identification of accident blackspots on rural roads using grid clustering and principal component clustering
CN112036757A (en) Parking transfer parking lot site selection method based on mobile phone signaling and floating car data
CN110837973A (en) Human trip selection information mining method based on traffic trip data
Deng et al. Heterogenous trip distance-based route choice behavior analysis using real-world large-scale taxi trajectory data
Guo et al. Mass rapid transit ridership forecast based on direct ridership models: A case study in Wuhan, China
Zhang et al. How road network transformation may be associated with reduced carbon emissions: An exploratory analysis of 19 major Chinese cities
Yang et al. Carbon dioxide emissions from commuter traffic in Xi'an, China
Bi et al. Mining taxi pick-up hotspots based on grid information entropy clustering algorithm
Tian et al. Spatial analysis of commuting carbon emissions in main urban area of Beijing: A GPS trajectory-based approach
André et al. Characterisation of an urban bus network for environmental purposes
Ma et al. Finding optimal reconstruction plans for separating trucks and passenger vehicles systems at urban intersections considering environmental impacts
Crawford et al. Analysing spatial intrapersonal variability of road users using point-to-point sensor data
Dunne et al. A large scale method for extracting geographical features on bus routes from OpenStreetMap and assessment of their impact on bus speed and reliability
Boitor et al. A new tool for the evaluation of co2 emissions from road traffic: a case study in cluj-napoca, romania
Sabet et al. A cost-effective methodology to compare travel time and speed: a tale of 11 cities
Chandrasiri et al. Application of GPS/GIS Based Travel Mode Detection Method for Energy Efficient Transportation Sector
Mannini et al. FCD data for on‐street parking search time estimation
Kartika Visual exploration of spatial-temporal traffic congestion patterns using floating car data
An et al. Discovering public transit riders’ travel pattern from GPS data: a case study in harbin
Xu et al. Congestion detection and distribution pattern analysis based on spatiotemporal density clustering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant