CN115691128A - Bus stop passenger flow calculation method based on multi-source bus data combined mining - Google Patents
Bus stop passenger flow calculation method based on multi-source bus data combined mining Download PDFInfo
- Publication number
- CN115691128A CN115691128A CN202211327367.5A CN202211327367A CN115691128A CN 115691128 A CN115691128 A CN 115691128A CN 202211327367 A CN202211327367 A CN 202211327367A CN 115691128 A CN115691128 A CN 115691128A
- Authority
- CN
- China
- Prior art keywords
- bus
- longitude
- latitude
- gps
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Abstract
The invention discloses a bus stop passenger flow calculation method based on multi-source bus data combined mining, which comprises the following steps of: acquiring longitude and latitude of an IC-GPS corresponding to an IC card number; and the running direction in the latitude and longitude of the IC-GPS; acquiring a trained bus card swiping longitude and latitude set; acquiring the distance between the elements in the trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS and the name of a passenger boarding station; acquiring the name of a passenger getting-off station according to the joint card swiping data of the bus track; and calculating the passenger flow. According to the invention, the longitude and latitude of the bus stop are trained based on multi-source bus data, so that the accurate name of the bus stop is obtained through the trained accurate longitude and latitude of the bus stop, and meanwhile, the bus travel origin-destination of a passenger is deduced according to the bus and track joint card swiping data of the passenger, so that the passenger flow distribution condition among the bus stops is estimated, and the problems of low precision and poor transportability of the traditional method are solved.
Description
Technical Field
The invention relates to the technical field of multi-source bus big data processing and analysis, in particular to a bus stop passenger flow calculation method based on multi-source bus data combined mining.
Background
With the increasing diversification of the travel demands of passengers, the travel data of the sea level come along with the diversification of the travel demands of the passengers. Even so, public transportation remains the primary option for most passengers to exclude car travel. However, due to the asymmetry of the boarding and disembarking information of the passengers, that is, the disembarking stations of the passengers cannot be confirmed, the passenger flow between the bus stations is difficult to obtain accurately in practice. The prior art lacks effective integration and integrated processing of multi-type bus data, and large data processing relies on a platform tool, so that the cost is high; and as a result, reliability is difficult to verify.
Disclosure of Invention
The invention provides a bus stop passenger flow calculation method based on multi-source bus data combined mining, and aims to solve the technical problem.
In order to realize the purpose, the technical scheme of the invention is as follows:
a bus stop passenger flow calculation method based on multi-source bus data combined mining comprises the following steps:
s1: acquiring multi-source public transportation data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, bus card swiping data and track card swiping data;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track card swiping data comprises track transaction time;
the traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
s2: acquiring the longitude and latitude of an IC-GPS corresponding to the card number of the IC card according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
s4, acquiring a trained bus card swiping longitude and latitude set by adopting a density-based DBSCAN spatial clustering algorithm according to the bus card swiping longitude and latitude and the running direction of the bus in the IC-GPS longitude and latitude; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
s5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the name of a passenger getting-on station;
s6: acquiring bus track combined card swiping data according to the bus card swiping data and the track card swiping data;
s7: acquiring the transaction time of getting off the bus of the passenger according to the bus track combined card swiping data and the trained bus card swiping longitude and latitude set so as to acquire the name of the bus stop of the passenger;
s8: and obtaining the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
Further, in S2, the method for acquiring the latitude and longitude of the IC-GPS includes: s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number, and recording as bus transaction time-GPS data;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data is the bus transaction time-GPS data corresponding to the same GPS license plate number as the bus card swiping license plate number corresponding to the IC card number;
s23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the IC-GPS longitude and latitude are longitude and latitude in the public transportation transaction time-license plate-GPS data.
Further, in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is acquired:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
s32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the first station longitude and latitude in the uplink direction, and the second starting station longitude and latitude are the first station longitude and latitude in the downlink direction;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating stationAnd the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
S34: and acquiring the running direction of the bus in the latitude and longitude of the IC-GPS.
Further, the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS comprises the following steps:
when in useThen the running direction of the bus of the IC card number at the bus transaction time is uplink and is recorded as dir i =a;
When the temperature is higher than the set temperatureThen the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
Wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number is shown, namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number is shown; dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number is the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time; and i is the serial number of the GPS time corresponding to the GPS license plate number.
Further, in S5, the method for obtaining the distance between the element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
wherein: (lambda. Alpha. 1 ,) Is IC-GPS longitude and latitude; (lambda. Alpha. 2 ,) The longitude and latitude of any element in the set of the longitude and latitude of the bus card swiping; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
Further, in S7, the method for acquiring the get-off station of the passenger includes:
and selecting the bus transaction time or the track transaction time in the bus track joint card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track joint card swiping data so as to obtain the getting-off station of the passenger.
Has the beneficial effects that: according to the bus stop passenger flow calculation method based on multi-source bus data combined mining, the longitude and latitude of a bus stop can be trained based on multi-source bus data, so that the accurate name of the bus stop is obtained through the trained accurate longitude and latitude of the bus stop, meanwhile, the bus travel origin and destination of a passenger are deduced according to the bus and track combined card swiping data of the passenger, the passenger flow distribution condition among the bus stops is further estimated, and the problems of low precision and poor transportability of a traditional method are solved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flow chart of a bus stop passenger flow calculation method according to the invention;
FIG. 2 is a graph comparing accuracy of different estimation methods according to an embodiment of the present invention;
fig. 3 is a schematic diagram illustrating the distribution of the number of passengers getting on and off between the intermediate stations in the embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.
The embodiment provides a bus stop passenger flow calculation method based on multi-source bus data combined mining, as shown in fig. 1, the method comprises the following steps:
s1: acquiring multi-source bus data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, and bus card swiping data and track card swiping data corresponding to the IC card numbers;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track swiping data comprises track transaction time; the bus transaction time is the card swiping time of the IC card number used by the passenger for swiping the card when getting on the bus, and the track transaction time is the card swiping time of the passenger when taking a track transportation bus, such as a subway.
The traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
specifically, in the embodiment, in the bus card swiping data, only the card swiping data of the passengers getting on the bus is included, and the card swiping data of the passengers getting off the bus is not included, so that the passengers cannot know the stop of the passengers getting off the bus, and therefore the passenger flow of the bus cannot be obtained.
Specifically, the traffic GPS data includes information such as a GPS license plate number, a line number, GPS time, GPS longitude and latitude, an angle, and a speed; as shown in table 1: the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate numbers, as shown in a table 2, and the track card swiping data comprises track transaction time, as shown in a table 3; the traffic GPS data, the bus card swiping data and the track card swiping data are all read out through vehicle-mounted equipment; the traffic GPS data comprises all lines of a certain bus operation period and the operation data of all license plate number buses at all times.
TABLE 1 sample of bus GPS data
ID | License plate number | Line number | GPS time | Longitude (G) | Latitude | Angle of rotation | Speed of rotation | Self numbering |
1 | XX13893F | 376 | 2022-02-23 09:44:52 | 106.591 | 29.583 | 104 | 31 | 19264 |
2 | XX13893F | 376 | 2022-02-23 09:45:02 | 106.591 | 29.583 | 104 | 31 | 19264 |
3 | XX69722 | 248 | 2022-02-23 13:44:27 | 106.455 | 29.537 | 0 | 0 | 20679 |
4 | XX60917 | 271 | 2022-02-23 12:43:14 | 106.334 | 29.622 | 77 | 24 | 20316 |
… | … | … | … | … | … | … | … | … |
TABLE 2 passenger's bus IC card data sample of swiping card
ID | Public transport IC card number | Line number | Vehicle number | Public transport IC card transaction time | Type of transaction | Card type |
1 | 4000003905088724 | 183 | 83243 | 2022-02-23 07:38:03 | 8482 | 0 |
2 | 4000000210132482 | 585 | 5431 | 2022-02-23 07:37:57 | 8451 | 0 |
3 | 4000030110703166 | 320 | 33967 | 2022-02-23 07:38:03 | 8452 | 0 |
4 | 4000000211327983 | 313 | 14436 | 2022-02-23 07:37:54 | 8451 | 0 |
… | … | … | … | … | … | … |
TABLE 3 passenger track IC card data sample
ID | Track IC card number | Track station numbering | Track station longitude | Track station latitude | Track IC card transaction time |
1 | 907813224 | 603 | 106.271 | 29.264 | 2022-02-21 06:19:39 |
2 | 907813224 | 603 | 106.465 | 29.313 | 2022-02-21 06:21:22 |
3 | 908172350 | 604 | 106.392 | 29.921 | 2022-02-21 06:18:08 |
4 | 908302272 | 606 | 106.337 | 29.872 | 2022-02-21 06:21:11 |
… | … | … | … |
S2: acquiring the longitude and latitude of an IC-GPS corresponding to the card number of the IC card according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
preferably, in S2, the method for acquiring the latitude and longitude of the IC-GPS includes:
s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number according to the bus card swiping data, and recording the GPS data as bus transaction time-GPS data;
specifically, according to the bus transaction time in the bus card swiping data, the GPS time in the traffic GPS data, which is the same as the bus transaction time, is found, and all GPS data corresponding to the GPS time, namely bus transaction time-GPS data, are obtained, wherein the GPS data corresponding to all GPS license plates and corresponding to the GPS time are included;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data is the bus transaction time-GPS data corresponding to the same GPS license plate number as the bus card swiping license plate number corresponding to the IC card number;
specifically, the bus transaction time-GPS data comprises GPS data corresponding to all GPS license plate numbers at the GPS time; therefore, according to the bus card-swiping license plate number of the IC card number when swiping the card, the GPS license plate number which is the same as the bus card-swiping license plate number is found, and the corresponding traffic GPS data, namely the bus transaction time-license plate-GPS data, can be obtained.
S23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the longitude and latitude of the IC-GPS is the longitude and latitude in the bus transaction time-license plate-GPS data, namely the GPS longitude and latitude in the bus transaction time-license plate-GPS data corresponding to the bus transaction time of the IC card number.
Specifically, according to the bus transaction time, the bus transaction time-GPS data and the bus transaction time-license plate-GPS data, a link between the bus card swiping data corresponding to the IC card number and the traffic GPS data can be established, and the longitude and latitude of the IC card number in the traffic GPS data of the same license plate number corresponding to the bus transaction time can be obtained;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
preferably, in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is acquired:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
specifically, the uplink and downlink directions of the line number corresponding to the IC card number are acquired through a third-party open source data platform in the uplink and downlink directions of the sub-vehicle operation, namely the uplink and downlink directions of the line number of the bus on which the passenger rides, the name of the uplink station in the uplink direction, the longitude and latitude of the uplink station, the name of the downlink station in the downlink direction and the longitude and latitude of the downlink station; the uplink and downlink directions of the line number are defined by a third-party platform, the name of an uplink station and the name of a downlink station are acquired from a bus inquiry website through a web crawler technology, and the longitude and latitude of the uplink station and the longitude and latitude of the downlink station are respectively inquired and acquired through inputting the name of the uplink station and the name of the downlink station through an electronic map API (application program interface).
S32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the uplink direction starting station longitude and latitude, and the second starting station longitude and latitude are the downlink direction starting station longitude and latitude;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating stationAnd the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
S34: and acquiring the running direction of the bus in the longitude and latitude of the IC-GPS.
Preferably, the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS comprises the following steps:
when the temperature is higher than the set temperatureThen the running direction of the IC card number in the bus at the bus transaction time is uplink and is recorded as dir i =a;
When in useThen the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
The distance threshold in this embodiment is taken as 0.05km;
wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number (namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number); dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number (namely the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time at the moment); i is corresponding to GPS license plate numberThe number of GPS time, a, is a constant.
S4, acquiring a trained bus card swiping longitude and latitude set and a trained bus card swiping station set according to the bus card swiping longitude and latitude in the bus card swiping data of the traffic operation data and a density-based DBSCAN spatial clustering algorithm; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
firstly, determining the running directions of all bus card swiping longitudes and latitudes of the bus operation data through the method of S3, and acquiring an uplink-trained bus card swiping longitude and latitude set and a downlink-trained bus card swiping longitude and latitude set;
specifically, a density-based DBSCAN spatial clustering algorithm is adopted to train the bus card swiping longitude and latitude with the uplink/downlink running direction in traffic operation data respectively, so that longitude and latitude information is spontaneously aggregated into data clusters, the longitude and latitude average value of a point in each cluster is used as the bus card swiping longitude and latitude, and then a trained station longitude and latitude set can be obtained, wherein the trained station longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set. Specifically, the density-based DBSCAN spatial clustering algorithm is the prior art, and is not developed here. Recording the longitude and latitude set of the trained station as STOP _ lonT and STOP _ lat T (ii) a Specifically, the trained bus card swiping longitude and latitude are input into an electronic map API (application program interface) interface to inquire the station name, namely the bus card swiping station corresponding to the trained bus card swiping longitude and latitude can be obtained, and then the trained bus card swiping station set is obtained and recorded as STOP _ name T ;
S5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the passenger boarding station longitude and latitude and the passenger boarding station name;
specifically, according to the longitude and latitude of the IC-GPS and the running direction of the bus in the longitude and latitude of the IC-GPS and the distance between the elements in the trained station longitude and latitude set, specifically, when the running direction of the bus in the longitude and latitude of the IC-GPS is uplink, the distance between the elements in the uplink trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS is calculated, when the running direction of the bus in the longitude and latitude of the IC-GPS is downlink, the distance between the elements in the downlink trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS is calculated, and the station longitude and latitude in the trained station longitude and latitude set corresponding to the minimum distance is the longitude and latitude of the passenger boarding station, that is the longitude and latitude of the passenger boarding station, and the station name in the trained station name set corresponding to the longitude and latitude of the passenger boarding station is the boarding station of the passenger. Specifically, longitude and latitude of a passenger boarding station are input into an API (application program interface) of an electronic map to inquire the name of the station, namely the name of the passenger boarding station can be obtained;
preferably, in S5, the method for obtaining the distance between the element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
wherein: (lambda 1 ,) Is IC-GPS longitude and latitude; (lambda 2 ,) The longitude and latitude of any element in the bus card swiping longitude and latitude set are obtained; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
S6: acquiring bus track joint card swiping data according to the bus card swiping data and the track card swiping data;
specifically, based on a hash algorithm, a hash value is obtained according to the IC card number; sorting the bus card swiping data and the track card swiping data corresponding to the hash value according to the card swiping time (namely the bus transaction time and the track transaction time) from small to large to obtain the bus track joint card swiping data;
specifically, in this embodiment, the last 9 bits of the IC card number are taken as hash values and a hash table is constructed, where each hash value corresponds to an entity IC card. Storing bus card swiping data and track card swiping data corresponding to the same hash value in a data dictionary corresponding to the hash value, assigning a card swiping type field of the data to be a bus or a track according to a data source, and sorting the bus card swiping data and the track card swiping data in the data dictionary from first to last according to card swiping time to generate ordered bus and track combined card swiping data, namely ordered data of the same entity card from small to large according to the card swiping time, so that all the bus card swiping data and the track card swiping data corresponding to the card number of the entity card can be quickly retrieved through the hash value; when the same hash value corresponds to more than one IC card number, the 10 digits behind the IC card number are used as the hash value.
S7: acquiring the transaction time of getting-off of the passenger according to the bus track combined card swiping data, and acquiring the longitude and latitude of the passenger getting-off station and the name of the passenger getting-off station according to the trained bus card swiping longitude and latitude set;
specifically, according to the bus track joint card swiping data and the trained station longitude and latitude set, deducting and matching are carried out on the getting-off stations of the passengers, and the getting-off stations of the passengers are obtained; so as to obtain the passenger flow distribution among bus stops; the method for acquiring the passenger getting-off station comprises the following steps:
and selecting the bus transaction time or the track transaction time in the bus track combined card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track combined card swiping data, so that the bus card swiping longitude and latitude or the track card swiping longitude and latitude corresponding to the next transaction time can be obtained, and the element with the minimum distance from the element in the trained station longitude and latitude set in the same running direction is used as the longitude and latitude of the passenger getting-off station, so that the passenger getting-off station can be obtained.
Specifically, if the current bus track is combined with the next card swiping dataReading the card data, if the transaction time interval is less than the time threshold (10 s), considering that the card-holding passenger at the current station carries out the replacement reading on other passengers, so the next bus track combined card reading data is not used as the data for acquiring the get-off station, otherwise, the next card reading data longitude and latitude information and the card reading type information of the current bus track combined card reading data are used as the get-off longitude and latitude of the passenger, and in the trained station longitude and latitude information, the get-off longitude and latitude (STOP _ lon longitude and latitude) are calculated a ,STOP_lat a ) The longitude and latitude (STOP _ lon) of each station after being trained in the corresponding uplink and downlink directions T ,STOP l at T ) Distance between, resulting in a data set dist a Selecting the index ind corresponding to the smallest element a Corresponding siteAs a drop-off station for the passenger; if the card swiping type information is a track, recording the matching mode of the passenger as track matching; if the card swiping type information is public transport, the matching mode of the passenger is recorded as public transport matching;
if the next card swiping data of the current card swiping data is empty, selecting a random station after the passenger gets on the bus station as a get-off station in the trained station information table according to the current uplink and downlink direction of the passenger, complementing the name of the get-off station and the longitude and latitude information of the get-off station, and endowing a random matching mode for the passenger;
if the information of the current card swiping data is missing, the current card swiping data is regarded as being unsuccessfully matched, and the matching mode is recorded as no matching; the bus stop names and the longitude and latitude of all passengers are matched, and the obtained bus trip data of the passengers are shown in table 4:
table 4 passenger bus trip data sample
S8: and acquiring the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
Preferably, in S8, the method for acquiring the passenger flow volume between two stations is as follows:
the number of passengers getting on the bus at the mth station is obtained as follows:
b m =count(B k =(STOP_name T ) m )
wherein: b m The number of persons getting on the bus at the mth bus getting on station; b is k Represents the boarding station name of passenger k, and k represents the number of the passenger; count () represents a count; (STOP _ name) T ) m The name of the bus card swiping station after the training of the mth station is obtained;
the number of people getting off at the nth station site is obtained as follows:
a n =count(A k =(STOP_name T ) n )
wherein: a is a n Representing the number of the passengers getting on the bus at the nth bus getting-on station; a. The k The name of the get-off station of a passenger k is represented, and the k represents the number of the passenger; (STOP _ name) T ) n The name of the bus card swiping station after the training of the nth station is obtained;
obtaining the passenger flow P between the station of the mth station and the station of the nth station mn The following were used:
P mn =count((A k =(STOP_name T ) n )∩(B k =(STOP_name T ) m ))。
fig. 2 shows the ratio of various matching modes in S7, which can illustrate that the success rate of matching passengers on/off bus stations according to multi-source bus data can reach 88.60% in this embodiment.
Fig. 3 shows the distribution of the number of passengers getting on and off between the stops in the embodiment, and the passenger flow distribution between the stops is represented by a chord between the stop names, which can explain that the embodiment can perform refined estimation on the passenger flow between the bus stops.
The invention carries out reasonable processing and mining on multi-source bus data comprising bus card swiping data and track card swiping data, provides a method for calculating the bus passenger flow, and effectively integrates and integrally processes multi-type bus data by mining the card swiping data of the passengers of the combined bus and subway and taking the card swiping position of the bus or the subway at the next transaction time of the current transaction time as the bus getting-off station of the passenger; the reliability of the estimation result is high.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.
Claims (6)
1. A bus stop passenger flow calculation method based on multi-source bus data combined mining is characterized by comprising the following steps:
s1: acquiring multi-source bus data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, bus card swiping data and track card swiping data;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track card swiping data comprises track transaction time;
the traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
s2: acquiring the IC-GPS longitude and latitude corresponding to the IC card number according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
s4, acquiring a trained bus card swiping longitude and latitude set by adopting a density-based DBSCAN spatial clustering algorithm according to the bus card swiping longitude and latitude and the running direction of the bus in the IC-GPS longitude and latitude; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
s5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the name of a passenger getting-on station;
s6: acquiring bus track combined card swiping data according to the bus card swiping data and the track card swiping data;
s7: acquiring the transaction time of getting off the bus of the passenger according to the bus track combined card swiping data and the trained bus card swiping longitude and latitude set so as to acquire the name of the passenger getting-off station;
s8: and acquiring the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
2. The method for calculating the passenger flow at the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S2, the method for acquiring the longitude and latitude of the IC-GPS is as follows:
s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number, and recording as bus transaction time-GPS data;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data are bus transaction time-GPS data corresponding to a GPS license plate number which is the same as a bus card swiping license plate number corresponding to the IC card number;
s23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the IC-GPS longitude and latitude are longitude and latitude in the public transportation transaction time-license plate-GPS data.
3. The method for calculating the passenger flow at the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is obtained:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
s32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the uplink direction starting station longitude and latitude, and the second starting station longitude and latitude are the downlink direction starting station longitude and latitude;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating stationAnd the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
S34: and acquiring the running direction of the bus in the latitude and longitude of the IC-GPS.
4. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 3, wherein the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS is as follows:
when in useThen the running direction of the bus of the IC card number at the bus transaction time is uplink and is recorded as dir i =a;
When in useThen the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
Wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number is shown, namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number is shown; dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number is the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time; i is the serial number of the GPS time corresponding to the GPS license plate number, and a is a constant.
5. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S5, the method for obtaining the distance between the elements in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
wherein:is IC-GPS longitude and latitude;the longitude and latitude of any element in the bus card swiping longitude and latitude set are obtained; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
6. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S7, the method for obtaining the getting-off stop of the passenger comprises the following steps:
and selecting the bus transaction time or the track transaction time in the bus track combined card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track combined card swiping data so as to obtain the getting-off station of the passenger.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211327367.5A CN115691128B (en) | 2022-10-27 | 2022-10-27 | Bus stop passenger flow calculation method based on multi-source bus data combined mining |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211327367.5A CN115691128B (en) | 2022-10-27 | 2022-10-27 | Bus stop passenger flow calculation method based on multi-source bus data combined mining |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115691128A true CN115691128A (en) | 2023-02-03 |
CN115691128B CN115691128B (en) | 2023-07-18 |
Family
ID=85100180
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211327367.5A Active CN115691128B (en) | 2022-10-27 | 2022-10-27 | Bus stop passenger flow calculation method based on multi-source bus data combined mining |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115691128B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116542438A (en) * | 2023-03-28 | 2023-08-04 | 大连海事大学 | Bus passenger starting and stopping point estimation and repair method based on non-reference real phase |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101540098A (en) * | 2009-04-30 | 2009-09-23 | 北京工业大学 | Matching method based on public transport IC card lines |
CN105574137A (en) * | 2015-12-15 | 2016-05-11 | 华南理工大学 | Public transportation multi-source data fusion based IC card swiping station matching method |
CN107818412A (en) * | 2017-10-18 | 2018-03-20 | 福州大学 | A kind of extensive bus passenger OD parallel calculating methods based on MapReduce |
CN110084442A (en) * | 2019-05-16 | 2019-08-02 | 重庆大学 | A kind of method of joint public transport and the progress passenger flow OD calculating of rail traffic brushing card data |
CN110853156A (en) * | 2019-11-18 | 2020-02-28 | 西南交通大学 | Passenger OD identification method integrating bus GPS track and IC card data |
CN111402618A (en) * | 2020-03-27 | 2020-07-10 | 北京嘀嘀无限科技发展有限公司 | Method and device for determining boarding station, storage medium and electronic equipment |
CN111932867A (en) * | 2020-06-18 | 2020-11-13 | 东南大学 | Multisource data-based bus IC card passenger getting-off station derivation method |
WO2021159865A1 (en) * | 2020-02-11 | 2021-08-19 | 罗普特科技集团股份有限公司 | Data calibration-based bus route prediction method and system |
-
2022
- 2022-10-27 CN CN202211327367.5A patent/CN115691128B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101540098A (en) * | 2009-04-30 | 2009-09-23 | 北京工业大学 | Matching method based on public transport IC card lines |
CN105574137A (en) * | 2015-12-15 | 2016-05-11 | 华南理工大学 | Public transportation multi-source data fusion based IC card swiping station matching method |
CN107818412A (en) * | 2017-10-18 | 2018-03-20 | 福州大学 | A kind of extensive bus passenger OD parallel calculating methods based on MapReduce |
CN110084442A (en) * | 2019-05-16 | 2019-08-02 | 重庆大学 | A kind of method of joint public transport and the progress passenger flow OD calculating of rail traffic brushing card data |
CN110853156A (en) * | 2019-11-18 | 2020-02-28 | 西南交通大学 | Passenger OD identification method integrating bus GPS track and IC card data |
WO2021159865A1 (en) * | 2020-02-11 | 2021-08-19 | 罗普特科技集团股份有限公司 | Data calibration-based bus route prediction method and system |
CN111402618A (en) * | 2020-03-27 | 2020-07-10 | 北京嘀嘀无限科技发展有限公司 | Method and device for determining boarding station, storage medium and electronic equipment |
CN111932867A (en) * | 2020-06-18 | 2020-11-13 | 东南大学 | Multisource data-based bus IC card passenger getting-off station derivation method |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116542438A (en) * | 2023-03-28 | 2023-08-04 | 大连海事大学 | Bus passenger starting and stopping point estimation and repair method based on non-reference real phase |
CN116542438B (en) * | 2023-03-28 | 2024-01-30 | 大连海事大学 | Bus passenger starting and stopping point estimation and repair method based on non-reference real phase |
Also Published As
Publication number | Publication date |
---|---|
CN115691128B (en) | 2023-07-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110942198B (en) | Passenger path identification method and system for rail transit operation | |
CN111653096B (en) | Urban trip mode identification method based on mobile phone signaling data | |
CN109903553B (en) | Multi-source data mining bus station identification and inspection method | |
CN108242149A (en) | A kind of big data analysis method based on traffic data | |
CN109410568B (en) | Get-off site presumption method and system based on user portrait and transfer rule | |
CN102044146B (en) | Method for reckoning getting-on stops on basis of data of one-ticket public-transport integrated circuit (IC) card | |
CN110704993A (en) | Customized bus route design method for relieving subway passenger flow pressure | |
CN104809344A (en) | IC (Integrated Circuit) card data-based estimation method for passenger flow in bus station interval | |
CN110298516B (en) | Method and device for splitting overlong bus line based on passenger flow OD data, mobile terminal equipment and server | |
CN109547930B (en) | Method and device for analyzing urban rail transit passenger flow source based on operator data | |
CN112005562B (en) | Method, computing system and computer readable medium for determining vehicle congestion using real-time location data | |
Huang et al. | A method for bus OD matrix estimation using multisource data | |
CN109903555B (en) | Bus passenger getting-off data prediction method and system based on big data | |
CN114363842B (en) | Bus passenger departure station prediction method and device based on mobile phone signaling data | |
CN107590239B (en) | Method for measuring connection radius of public bicycle at subway station based on IC card data | |
CN110969861B (en) | Vehicle identification method, device, equipment and computer storage medium | |
JP2020071778A (en) | Vehicle dispatch system, server, and information processing method | |
CN112036757A (en) | Parking transfer parking lot site selection method based on mobile phone signaling and floating car data | |
CN115691128A (en) | Bus stop passenger flow calculation method based on multi-source bus data combined mining | |
CN111581325B (en) | K-means station area division method based on space-time influence distance | |
CN114358808A (en) | Public transport OD estimation and distribution method based on multi-source data fusion | |
CN107578619B (en) | Method for measuring public bicycle service range of subway station based on IC card data | |
CN113177742A (en) | Public transport service method, system, terminal and storage medium based on intelligent transportation | |
CN115496316A (en) | Multi-source data-based bus line passenger flow dynamic distribution method | |
CN113408833A (en) | Public traffic key area identification method and device and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |