CN115691128A - Bus stop passenger flow calculation method based on multi-source bus data combined mining - Google Patents

Bus stop passenger flow calculation method based on multi-source bus data combined mining Download PDF

Info

Publication number
CN115691128A
CN115691128A CN202211327367.5A CN202211327367A CN115691128A CN 115691128 A CN115691128 A CN 115691128A CN 202211327367 A CN202211327367 A CN 202211327367A CN 115691128 A CN115691128 A CN 115691128A
Authority
CN
China
Prior art keywords
bus
longitude
latitude
gps
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211327367.5A
Other languages
Chinese (zh)
Other versions
CN115691128B (en
Inventor
李欣
李怀悦
李炎皓
乔靖元
袁昀
胡笳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dalian Maritime University
Original Assignee
Dalian Maritime University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dalian Maritime University filed Critical Dalian Maritime University
Priority to CN202211327367.5A priority Critical patent/CN115691128B/en
Publication of CN115691128A publication Critical patent/CN115691128A/en
Application granted granted Critical
Publication of CN115691128B publication Critical patent/CN115691128B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a bus stop passenger flow calculation method based on multi-source bus data combined mining, which comprises the following steps of: acquiring longitude and latitude of an IC-GPS corresponding to an IC card number; and the running direction in the latitude and longitude of the IC-GPS; acquiring a trained bus card swiping longitude and latitude set; acquiring the distance between the elements in the trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS and the name of a passenger boarding station; acquiring the name of a passenger getting-off station according to the joint card swiping data of the bus track; and calculating the passenger flow. According to the invention, the longitude and latitude of the bus stop are trained based on multi-source bus data, so that the accurate name of the bus stop is obtained through the trained accurate longitude and latitude of the bus stop, and meanwhile, the bus travel origin-destination of a passenger is deduced according to the bus and track joint card swiping data of the passenger, so that the passenger flow distribution condition among the bus stops is estimated, and the problems of low precision and poor transportability of the traditional method are solved.

Description

Bus stop passenger flow calculation method based on multi-source bus data combined mining
Technical Field
The invention relates to the technical field of multi-source bus big data processing and analysis, in particular to a bus stop passenger flow calculation method based on multi-source bus data combined mining.
Background
With the increasing diversification of the travel demands of passengers, the travel data of the sea level come along with the diversification of the travel demands of the passengers. Even so, public transportation remains the primary option for most passengers to exclude car travel. However, due to the asymmetry of the boarding and disembarking information of the passengers, that is, the disembarking stations of the passengers cannot be confirmed, the passenger flow between the bus stations is difficult to obtain accurately in practice. The prior art lacks effective integration and integrated processing of multi-type bus data, and large data processing relies on a platform tool, so that the cost is high; and as a result, reliability is difficult to verify.
Disclosure of Invention
The invention provides a bus stop passenger flow calculation method based on multi-source bus data combined mining, and aims to solve the technical problem.
In order to realize the purpose, the technical scheme of the invention is as follows:
a bus stop passenger flow calculation method based on multi-source bus data combined mining comprises the following steps:
s1: acquiring multi-source public transportation data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, bus card swiping data and track card swiping data;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track card swiping data comprises track transaction time;
the traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
s2: acquiring the longitude and latitude of an IC-GPS corresponding to the card number of the IC card according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
s4, acquiring a trained bus card swiping longitude and latitude set by adopting a density-based DBSCAN spatial clustering algorithm according to the bus card swiping longitude and latitude and the running direction of the bus in the IC-GPS longitude and latitude; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
s5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the name of a passenger getting-on station;
s6: acquiring bus track combined card swiping data according to the bus card swiping data and the track card swiping data;
s7: acquiring the transaction time of getting off the bus of the passenger according to the bus track combined card swiping data and the trained bus card swiping longitude and latitude set so as to acquire the name of the bus stop of the passenger;
s8: and obtaining the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
Further, in S2, the method for acquiring the latitude and longitude of the IC-GPS includes: s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number, and recording as bus transaction time-GPS data;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data is the bus transaction time-GPS data corresponding to the same GPS license plate number as the bus card swiping license plate number corresponding to the IC card number;
s23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the IC-GPS longitude and latitude are longitude and latitude in the public transportation transaction time-license plate-GPS data.
Further, in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is acquired:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
s32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the first station longitude and latitude in the uplink direction, and the second starting station longitude and latitude are the first station longitude and latitude in the downlink direction;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating station
Figure BDA0003912407340000031
And the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
Figure BDA0003912407340000032
S34: and acquiring the running direction of the bus in the latitude and longitude of the IC-GPS.
Further, the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS comprises the following steps:
when in use
Figure BDA0003912407340000033
Then the running direction of the bus of the IC card number at the bus transaction time is uplink and is recorded as dir i =a;
When the temperature is higher than the set temperature
Figure BDA0003912407340000034
Then the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
If it is
Figure BDA0003912407340000035
Or
Figure BDA0003912407340000036
Then dir i =dir i-1
Wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number is shown, namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number is shown; dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number is the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time; and i is the serial number of the GPS time corresponding to the GPS license plate number.
Further, in S5, the method for obtaining the distance between the element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
Figure BDA0003912407340000041
wherein: (lambda. Alpha. 1 ,
Figure BDA0003912407340000042
) Is IC-GPS longitude and latitude; (lambda. Alpha. 2 ,
Figure BDA0003912407340000043
) The longitude and latitude of any element in the set of the longitude and latitude of the bus card swiping; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
Further, in S7, the method for acquiring the get-off station of the passenger includes:
and selecting the bus transaction time or the track transaction time in the bus track joint card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track joint card swiping data so as to obtain the getting-off station of the passenger.
Has the beneficial effects that: according to the bus stop passenger flow calculation method based on multi-source bus data combined mining, the longitude and latitude of a bus stop can be trained based on multi-source bus data, so that the accurate name of the bus stop is obtained through the trained accurate longitude and latitude of the bus stop, meanwhile, the bus travel origin and destination of a passenger are deduced according to the bus and track combined card swiping data of the passenger, the passenger flow distribution condition among the bus stops is further estimated, and the problems of low precision and poor transportability of a traditional method are solved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a flow chart of a bus stop passenger flow calculation method according to the invention;
FIG. 2 is a graph comparing accuracy of different estimation methods according to an embodiment of the present invention;
fig. 3 is a schematic diagram illustrating the distribution of the number of passengers getting on and off between the intermediate stations in the embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.
The embodiment provides a bus stop passenger flow calculation method based on multi-source bus data combined mining, as shown in fig. 1, the method comprises the following steps:
s1: acquiring multi-source bus data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, and bus card swiping data and track card swiping data corresponding to the IC card numbers;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track swiping data comprises track transaction time; the bus transaction time is the card swiping time of the IC card number used by the passenger for swiping the card when getting on the bus, and the track transaction time is the card swiping time of the passenger when taking a track transportation bus, such as a subway.
The traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
specifically, in the embodiment, in the bus card swiping data, only the card swiping data of the passengers getting on the bus is included, and the card swiping data of the passengers getting off the bus is not included, so that the passengers cannot know the stop of the passengers getting off the bus, and therefore the passenger flow of the bus cannot be obtained.
Specifically, the traffic GPS data includes information such as a GPS license plate number, a line number, GPS time, GPS longitude and latitude, an angle, and a speed; as shown in table 1: the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate numbers, as shown in a table 2, and the track card swiping data comprises track transaction time, as shown in a table 3; the traffic GPS data, the bus card swiping data and the track card swiping data are all read out through vehicle-mounted equipment; the traffic GPS data comprises all lines of a certain bus operation period and the operation data of all license plate number buses at all times.
TABLE 1 sample of bus GPS data
ID License plate number Line number GPS time Longitude (G) Latitude Angle of rotation Speed of rotation Self numbering
1 XX13893F 376 2022-02-23 09:44:52 106.591 29.583 104 31 19264
2 XX13893F 376 2022-02-23 09:45:02 106.591 29.583 104 31 19264
3 XX69722 248 2022-02-23 13:44:27 106.455 29.537 0 0 20679
4 XX60917 271 2022-02-23 12:43:14 106.334 29.622 77 24 20316
TABLE 2 passenger's bus IC card data sample of swiping card
ID Public transport IC card number Line number Vehicle number Public transport IC card transaction time Type of transaction Card type
1 4000003905088724 183 83243 2022-02-23 07:38:03 8482 0
2 4000000210132482 585 5431 2022-02-23 07:37:57 8451 0
3 4000030110703166 320 33967 2022-02-23 07:38:03 8452 0
4 4000000211327983 313 14436 2022-02-23 07:37:54 8451 0
TABLE 3 passenger track IC card data sample
ID Track IC card number Track station numbering Track station longitude Track station latitude Track IC card transaction time
1 907813224 603 106.271 29.264 2022-02-21 06:19:39
2 907813224 603 106.465 29.313 2022-02-21 06:21:22
3 908172350 604 106.392 29.921 2022-02-21 06:18:08
4 908302272 606 106.337 29.872 2022-02-21 06:21:11
S2: acquiring the longitude and latitude of an IC-GPS corresponding to the card number of the IC card according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
preferably, in S2, the method for acquiring the latitude and longitude of the IC-GPS includes:
s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number according to the bus card swiping data, and recording the GPS data as bus transaction time-GPS data;
specifically, according to the bus transaction time in the bus card swiping data, the GPS time in the traffic GPS data, which is the same as the bus transaction time, is found, and all GPS data corresponding to the GPS time, namely bus transaction time-GPS data, are obtained, wherein the GPS data corresponding to all GPS license plates and corresponding to the GPS time are included;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data is the bus transaction time-GPS data corresponding to the same GPS license plate number as the bus card swiping license plate number corresponding to the IC card number;
specifically, the bus transaction time-GPS data comprises GPS data corresponding to all GPS license plate numbers at the GPS time; therefore, according to the bus card-swiping license plate number of the IC card number when swiping the card, the GPS license plate number which is the same as the bus card-swiping license plate number is found, and the corresponding traffic GPS data, namely the bus transaction time-license plate-GPS data, can be obtained.
S23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the longitude and latitude of the IC-GPS is the longitude and latitude in the bus transaction time-license plate-GPS data, namely the GPS longitude and latitude in the bus transaction time-license plate-GPS data corresponding to the bus transaction time of the IC card number.
Specifically, according to the bus transaction time, the bus transaction time-GPS data and the bus transaction time-license plate-GPS data, a link between the bus card swiping data corresponding to the IC card number and the traffic GPS data can be established, and the longitude and latitude of the IC card number in the traffic GPS data of the same license plate number corresponding to the bus transaction time can be obtained;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
preferably, in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is acquired:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
specifically, the uplink and downlink directions of the line number corresponding to the IC card number are acquired through a third-party open source data platform in the uplink and downlink directions of the sub-vehicle operation, namely the uplink and downlink directions of the line number of the bus on which the passenger rides, the name of the uplink station in the uplink direction, the longitude and latitude of the uplink station, the name of the downlink station in the downlink direction and the longitude and latitude of the downlink station; the uplink and downlink directions of the line number are defined by a third-party platform, the name of an uplink station and the name of a downlink station are acquired from a bus inquiry website through a web crawler technology, and the longitude and latitude of the uplink station and the longitude and latitude of the downlink station are respectively inquired and acquired through inputting the name of the uplink station and the name of the downlink station through an electronic map API (application program interface).
S32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the uplink direction starting station longitude and latitude, and the second starting station longitude and latitude are the downlink direction starting station longitude and latitude;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating station
Figure BDA0003912407340000082
And the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
Figure BDA0003912407340000081
S34: and acquiring the running direction of the bus in the longitude and latitude of the IC-GPS.
Preferably, the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS comprises the following steps:
when the temperature is higher than the set temperature
Figure BDA0003912407340000083
Then the running direction of the IC card number in the bus at the bus transaction time is uplink and is recorded as dir i =a;
When in use
Figure BDA0003912407340000084
Then the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
If it is
Figure BDA0003912407340000085
Or
Figure BDA0003912407340000086
Then dir i =dir i-1
The distance threshold in this embodiment is taken as 0.05km;
wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number (namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number); dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number (namely the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time at the moment); i is corresponding to GPS license plate numberThe number of GPS time, a, is a constant.
S4, acquiring a trained bus card swiping longitude and latitude set and a trained bus card swiping station set according to the bus card swiping longitude and latitude in the bus card swiping data of the traffic operation data and a density-based DBSCAN spatial clustering algorithm; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
firstly, determining the running directions of all bus card swiping longitudes and latitudes of the bus operation data through the method of S3, and acquiring an uplink-trained bus card swiping longitude and latitude set and a downlink-trained bus card swiping longitude and latitude set;
specifically, a density-based DBSCAN spatial clustering algorithm is adopted to train the bus card swiping longitude and latitude with the uplink/downlink running direction in traffic operation data respectively, so that longitude and latitude information is spontaneously aggregated into data clusters, the longitude and latitude average value of a point in each cluster is used as the bus card swiping longitude and latitude, and then a trained station longitude and latitude set can be obtained, wherein the trained station longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set. Specifically, the density-based DBSCAN spatial clustering algorithm is the prior art, and is not developed here. Recording the longitude and latitude set of the trained station as STOP _ lonT and STOP _ lat T (ii) a Specifically, the trained bus card swiping longitude and latitude are input into an electronic map API (application program interface) interface to inquire the station name, namely the bus card swiping station corresponding to the trained bus card swiping longitude and latitude can be obtained, and then the trained bus card swiping station set is obtained and recorded as STOP _ name T
S5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the passenger boarding station longitude and latitude and the passenger boarding station name;
specifically, according to the longitude and latitude of the IC-GPS and the running direction of the bus in the longitude and latitude of the IC-GPS and the distance between the elements in the trained station longitude and latitude set, specifically, when the running direction of the bus in the longitude and latitude of the IC-GPS is uplink, the distance between the elements in the uplink trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS is calculated, when the running direction of the bus in the longitude and latitude of the IC-GPS is downlink, the distance between the elements in the downlink trained bus card swiping longitude and latitude set and the longitude and latitude of the IC-GPS is calculated, and the station longitude and latitude in the trained station longitude and latitude set corresponding to the minimum distance is the longitude and latitude of the passenger boarding station, that is the longitude and latitude of the passenger boarding station, and the station name in the trained station name set corresponding to the longitude and latitude of the passenger boarding station is the boarding station of the passenger. Specifically, longitude and latitude of a passenger boarding station are input into an API (application program interface) of an electronic map to inquire the name of the station, namely the name of the passenger boarding station can be obtained;
preferably, in S5, the method for obtaining the distance between the element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
Figure BDA0003912407340000091
wherein: (lambda 1 ,
Figure BDA0003912407340000092
) Is IC-GPS longitude and latitude; (lambda 2 ,
Figure BDA0003912407340000093
) The longitude and latitude of any element in the bus card swiping longitude and latitude set are obtained; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
S6: acquiring bus track joint card swiping data according to the bus card swiping data and the track card swiping data;
specifically, based on a hash algorithm, a hash value is obtained according to the IC card number; sorting the bus card swiping data and the track card swiping data corresponding to the hash value according to the card swiping time (namely the bus transaction time and the track transaction time) from small to large to obtain the bus track joint card swiping data;
specifically, in this embodiment, the last 9 bits of the IC card number are taken as hash values and a hash table is constructed, where each hash value corresponds to an entity IC card. Storing bus card swiping data and track card swiping data corresponding to the same hash value in a data dictionary corresponding to the hash value, assigning a card swiping type field of the data to be a bus or a track according to a data source, and sorting the bus card swiping data and the track card swiping data in the data dictionary from first to last according to card swiping time to generate ordered bus and track combined card swiping data, namely ordered data of the same entity card from small to large according to the card swiping time, so that all the bus card swiping data and the track card swiping data corresponding to the card number of the entity card can be quickly retrieved through the hash value; when the same hash value corresponds to more than one IC card number, the 10 digits behind the IC card number are used as the hash value.
S7: acquiring the transaction time of getting-off of the passenger according to the bus track combined card swiping data, and acquiring the longitude and latitude of the passenger getting-off station and the name of the passenger getting-off station according to the trained bus card swiping longitude and latitude set;
specifically, according to the bus track joint card swiping data and the trained station longitude and latitude set, deducting and matching are carried out on the getting-off stations of the passengers, and the getting-off stations of the passengers are obtained; so as to obtain the passenger flow distribution among bus stops; the method for acquiring the passenger getting-off station comprises the following steps:
and selecting the bus transaction time or the track transaction time in the bus track combined card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track combined card swiping data, so that the bus card swiping longitude and latitude or the track card swiping longitude and latitude corresponding to the next transaction time can be obtained, and the element with the minimum distance from the element in the trained station longitude and latitude set in the same running direction is used as the longitude and latitude of the passenger getting-off station, so that the passenger getting-off station can be obtained.
Specifically, if the current bus track is combined with the next card swiping dataReading the card data, if the transaction time interval is less than the time threshold (10 s), considering that the card-holding passenger at the current station carries out the replacement reading on other passengers, so the next bus track combined card reading data is not used as the data for acquiring the get-off station, otherwise, the next card reading data longitude and latitude information and the card reading type information of the current bus track combined card reading data are used as the get-off longitude and latitude of the passenger, and in the trained station longitude and latitude information, the get-off longitude and latitude (STOP _ lon longitude and latitude) are calculated a ,STOP_lat a ) The longitude and latitude (STOP _ lon) of each station after being trained in the corresponding uplink and downlink directions T ,STOP l at T ) Distance between, resulting in a data set dist a Selecting the index ind corresponding to the smallest element a Corresponding site
Figure BDA0003912407340000111
As a drop-off station for the passenger; if the card swiping type information is a track, recording the matching mode of the passenger as track matching; if the card swiping type information is public transport, the matching mode of the passenger is recorded as public transport matching;
if the next card swiping data of the current card swiping data is empty, selecting a random station after the passenger gets on the bus station as a get-off station in the trained station information table according to the current uplink and downlink direction of the passenger, complementing the name of the get-off station and the longitude and latitude information of the get-off station, and endowing a random matching mode for the passenger;
if the information of the current card swiping data is missing, the current card swiping data is regarded as being unsuccessfully matched, and the matching mode is recorded as no matching; the bus stop names and the longitude and latitude of all passengers are matched, and the obtained bus trip data of the passengers are shown in table 4:
table 4 passenger bus trip data sample
Figure BDA0003912407340000112
S8: and acquiring the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
Preferably, in S8, the method for acquiring the passenger flow volume between two stations is as follows:
the number of passengers getting on the bus at the mth station is obtained as follows:
b m =count(B k =(STOP_name T ) m )
wherein: b m The number of persons getting on the bus at the mth bus getting on station; b is k Represents the boarding station name of passenger k, and k represents the number of the passenger; count () represents a count; (STOP _ name) T ) m The name of the bus card swiping station after the training of the mth station is obtained;
the number of people getting off at the nth station site is obtained as follows:
a n =count(A k =(STOP_name T ) n )
wherein: a is a n Representing the number of the passengers getting on the bus at the nth bus getting-on station; a. The k The name of the get-off station of a passenger k is represented, and the k represents the number of the passenger; (STOP _ name) T ) n The name of the bus card swiping station after the training of the nth station is obtained;
obtaining the passenger flow P between the station of the mth station and the station of the nth station mn The following were used:
P mn =count((A k =(STOP_name T ) n )∩(B k =(STOP_name T ) m ))。
fig. 2 shows the ratio of various matching modes in S7, which can illustrate that the success rate of matching passengers on/off bus stations according to multi-source bus data can reach 88.60% in this embodiment.
Fig. 3 shows the distribution of the number of passengers getting on and off between the stops in the embodiment, and the passenger flow distribution between the stops is represented by a chord between the stop names, which can explain that the embodiment can perform refined estimation on the passenger flow between the bus stops.
The invention carries out reasonable processing and mining on multi-source bus data comprising bus card swiping data and track card swiping data, provides a method for calculating the bus passenger flow, and effectively integrates and integrally processes multi-type bus data by mining the card swiping data of the passengers of the combined bus and subway and taking the card swiping position of the bus or the subway at the next transaction time of the current transaction time as the bus getting-off station of the passenger; the reliability of the estimation result is high.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; and the modifications or the substitutions do not make the essence of the corresponding technical solutions depart from the scope of the technical solutions of the embodiments of the present invention.

Claims (6)

1. A bus stop passenger flow calculation method based on multi-source bus data combined mining is characterized by comprising the following steps:
s1: acquiring multi-source bus data; the multi-source public transportation data comprises traffic operation data and traffic GPS data;
the traffic operation data comprises a plurality of IC card numbers, bus card swiping data and track card swiping data;
the bus card swiping data comprises bus transaction time, bus card swiping longitude and latitude and bus card swiping license plate number; the track card swiping data comprises track transaction time;
the traffic GPS data comprises GPS time, GPS longitude and latitude and GPS license plate number;
s2: acquiring the IC-GPS longitude and latitude corresponding to the IC card number according to the bus card swiping data and the traffic GPS data; the longitude and latitude of the IC-GPS are traffic GPS data corresponding to the bus transaction time corresponding to the IC card number;
s3: acquiring the running direction of the bus in the longitude and latitude of the IC-GPS according to the longitude and latitude of the IC-GPS;
s4, acquiring a trained bus card swiping longitude and latitude set by adopting a density-based DBSCAN spatial clustering algorithm according to the bus card swiping longitude and latitude and the running direction of the bus in the IC-GPS longitude and latitude; the trained bus card swiping longitude and latitude set comprises an uplink trained bus card swiping longitude and latitude set and a downlink trained bus card swiping longitude and latitude set;
s5: according to the IC-GPS longitude and latitude and the trained bus card swiping longitude and latitude set, acquiring the distance between an element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude to acquire the name of a passenger getting-on station;
s6: acquiring bus track combined card swiping data according to the bus card swiping data and the track card swiping data;
s7: acquiring the transaction time of getting off the bus of the passenger according to the bus track combined card swiping data and the trained bus card swiping longitude and latitude set so as to acquire the name of the passenger getting-off station;
s8: and acquiring the passenger flow between the two stations according to the name of the passenger boarding station and the name of the passenger disembarking station.
2. The method for calculating the passenger flow at the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S2, the method for acquiring the longitude and latitude of the IC-GPS is as follows:
s21: acquiring GPS data corresponding to the same GPS time as the bus transaction time corresponding to the IC card number, and recording as bus transaction time-GPS data;
s22: acquiring bus transaction time-license plate-GPS data according to the bus transaction time-GPS data, wherein the bus transaction time-license plate-GPS data are bus transaction time-GPS data corresponding to a GPS license plate number which is the same as a bus card swiping license plate number corresponding to the IC card number;
s23: acquiring longitude and latitude of an IC-GPS (integrated circuit-global positioning system) according to the bus transaction time-license plate-GPS data; the IC-GPS longitude and latitude are longitude and latitude in the public transportation transaction time-license plate-GPS data.
3. The method for calculating the passenger flow at the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S3, the running direction of the bus in the longitude and latitude of the IC-GPS is obtained:
s31: acquiring the uplink direction and the downlink direction of a line number corresponding to the IC card number, and the uplink station longitude and latitude in the uplink direction and the downlink station longitude and latitude in the downlink direction through a third-party open source data platform;
s32: acquiring the longitude and latitude of a first originating station and the longitude and latitude of a second originating station according to the longitude and latitude of an uplink station and the longitude and latitude of a downlink station; the first starting station longitude and latitude are the uplink direction starting station longitude and latitude, and the second starting station longitude and latitude are the downlink direction starting station longitude and latitude;
s33: obtaining the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the first originating station
Figure FDA0003912407330000021
And the distance between the longitude and latitude of the IC-GPS and the longitude and latitude of the second originating station
Figure FDA0003912407330000022
S34: and acquiring the running direction of the bus in the latitude and longitude of the IC-GPS.
4. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 3, wherein the method for acquiring the running direction of the bus in the longitude and latitude of the IC-GPS is as follows:
when in use
Figure FDA0003912407330000031
Then the running direction of the bus of the IC card number at the bus transaction time is uplink and is recorded as dir i =a;
When in use
Figure FDA0003912407330000032
Then the running direction of the bus of the IC card number at the bus transaction time is downward and is recorded as dir i =-a;
If it is
Figure FDA0003912407330000033
Or
Figure FDA0003912407330000034
Then dir i =dir i-1
Wherein, dir i The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the ith bus transaction time of the bus card swiping license plate number is shown, namely the running direction of the GPS longitude and latitude corresponding to the ith GPS time in the traffic GPS data corresponding to the bus card swiping license plate number is shown; dir i-1 The running direction of the GPS longitude and latitude in the traffic GPS data corresponding to the i-1 bus transaction time of the bus card swiping license plate number is the running direction of the GPS longitude and latitude of the last GPS time of the corresponding GPS time; i is the serial number of the GPS time corresponding to the GPS license plate number, and a is a constant.
5. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S5, the method for obtaining the distance between the elements in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude is as follows:
Figure FDA0003912407330000035
wherein:
Figure FDA0003912407330000036
is IC-GPS longitude and latitude;
Figure FDA0003912407330000037
the longitude and latitude of any element in the bus card swiping longitude and latitude set are obtained; r is the radius of the earth; d is the distance between any element in the trained bus card swiping longitude and latitude set and the IC-GPS longitude and latitude.
6. The method for calculating the passenger flow of the bus stop based on the multi-source bus data combined mining as claimed in claim 1, wherein in S7, the method for obtaining the getting-off stop of the passenger comprises the following steps:
and selecting the bus transaction time or the track transaction time in the bus track combined card swiping data of the next transaction time of the bus transaction time as the getting-off time of the card-holding passenger of the IC card number according to the bus track combined card swiping data so as to obtain the getting-off station of the passenger.
CN202211327367.5A 2022-10-27 2022-10-27 Bus stop passenger flow calculation method based on multi-source bus data combined mining Active CN115691128B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211327367.5A CN115691128B (en) 2022-10-27 2022-10-27 Bus stop passenger flow calculation method based on multi-source bus data combined mining

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211327367.5A CN115691128B (en) 2022-10-27 2022-10-27 Bus stop passenger flow calculation method based on multi-source bus data combined mining

Publications (2)

Publication Number Publication Date
CN115691128A true CN115691128A (en) 2023-02-03
CN115691128B CN115691128B (en) 2023-07-18

Family

ID=85100180

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211327367.5A Active CN115691128B (en) 2022-10-27 2022-10-27 Bus stop passenger flow calculation method based on multi-source bus data combined mining

Country Status (1)

Country Link
CN (1) CN115691128B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101540098A (en) * 2009-04-30 2009-09-23 北京工业大学 Matching method based on public transport IC card lines
CN105574137A (en) * 2015-12-15 2016-05-11 华南理工大学 Public transportation multi-source data fusion based IC card swiping station matching method
CN107818412A (en) * 2017-10-18 2018-03-20 福州大学 A kind of extensive bus passenger OD parallel calculating methods based on MapReduce
CN110084442A (en) * 2019-05-16 2019-08-02 重庆大学 A kind of method of joint public transport and the progress passenger flow OD calculating of rail traffic brushing card data
CN110853156A (en) * 2019-11-18 2020-02-28 西南交通大学 Passenger OD identification method integrating bus GPS track and IC card data
CN111402618A (en) * 2020-03-27 2020-07-10 北京嘀嘀无限科技发展有限公司 Method and device for determining boarding station, storage medium and electronic equipment
CN111932867A (en) * 2020-06-18 2020-11-13 东南大学 Multisource data-based bus IC card passenger getting-off station derivation method
WO2021159865A1 (en) * 2020-02-11 2021-08-19 罗普特科技集团股份有限公司 Data calibration-based bus route prediction method and system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101540098A (en) * 2009-04-30 2009-09-23 北京工业大学 Matching method based on public transport IC card lines
CN105574137A (en) * 2015-12-15 2016-05-11 华南理工大学 Public transportation multi-source data fusion based IC card swiping station matching method
CN107818412A (en) * 2017-10-18 2018-03-20 福州大学 A kind of extensive bus passenger OD parallel calculating methods based on MapReduce
CN110084442A (en) * 2019-05-16 2019-08-02 重庆大学 A kind of method of joint public transport and the progress passenger flow OD calculating of rail traffic brushing card data
CN110853156A (en) * 2019-11-18 2020-02-28 西南交通大学 Passenger OD identification method integrating bus GPS track and IC card data
WO2021159865A1 (en) * 2020-02-11 2021-08-19 罗普特科技集团股份有限公司 Data calibration-based bus route prediction method and system
CN111402618A (en) * 2020-03-27 2020-07-10 北京嘀嘀无限科技发展有限公司 Method and device for determining boarding station, storage medium and electronic equipment
CN111932867A (en) * 2020-06-18 2020-11-13 东南大学 Multisource data-based bus IC card passenger getting-off station derivation method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116542438A (en) * 2023-03-28 2023-08-04 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase
CN116542438B (en) * 2023-03-28 2024-01-30 大连海事大学 Bus passenger starting and stopping point estimation and repair method based on non-reference real phase

Also Published As

Publication number Publication date
CN115691128B (en) 2023-07-18

Similar Documents

Publication Publication Date Title
CN110942198B (en) Passenger path identification method and system for rail transit operation
CN111653096B (en) Urban trip mode identification method based on mobile phone signaling data
CN109903553B (en) Multi-source data mining bus station identification and inspection method
CN108242149A (en) A kind of big data analysis method based on traffic data
CN109410568B (en) Get-off site presumption method and system based on user portrait and transfer rule
CN102044146B (en) Method for reckoning getting-on stops on basis of data of one-ticket public-transport integrated circuit (IC) card
CN110704993A (en) Customized bus route design method for relieving subway passenger flow pressure
CN104809344A (en) IC (Integrated Circuit) card data-based estimation method for passenger flow in bus station interval
CN110298516B (en) Method and device for splitting overlong bus line based on passenger flow OD data, mobile terminal equipment and server
CN109547930B (en) Method and device for analyzing urban rail transit passenger flow source based on operator data
CN112005562B (en) Method, computing system and computer readable medium for determining vehicle congestion using real-time location data
Huang et al. A method for bus OD matrix estimation using multisource data
CN109903555B (en) Bus passenger getting-off data prediction method and system based on big data
CN114363842B (en) Bus passenger departure station prediction method and device based on mobile phone signaling data
CN107590239B (en) Method for measuring connection radius of public bicycle at subway station based on IC card data
CN110969861B (en) Vehicle identification method, device, equipment and computer storage medium
JP2020071778A (en) Vehicle dispatch system, server, and information processing method
CN112036757A (en) Parking transfer parking lot site selection method based on mobile phone signaling and floating car data
CN115691128A (en) Bus stop passenger flow calculation method based on multi-source bus data combined mining
CN111581325B (en) K-means station area division method based on space-time influence distance
CN114358808A (en) Public transport OD estimation and distribution method based on multi-source data fusion
CN107578619B (en) Method for measuring public bicycle service range of subway station based on IC card data
CN113177742A (en) Public transport service method, system, terminal and storage medium based on intelligent transportation
CN115496316A (en) Multi-source data-based bus line passenger flow dynamic distribution method
CN113408833A (en) Public traffic key area identification method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant