CN114021883A - Dispatching method for subway transfer shared bicycle in peak period - Google Patents

Dispatching method for subway transfer shared bicycle in peak period Download PDF

Info

Publication number
CN114021883A
CN114021883A CN202111142142.8A CN202111142142A CN114021883A CN 114021883 A CN114021883 A CN 114021883A CN 202111142142 A CN202111142142 A CN 202111142142A CN 114021883 A CN114021883 A CN 114021883A
Authority
CN
China
Prior art keywords
data
shared bicycle
points
bicycle
python
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111142142.8A
Other languages
Chinese (zh)
Inventor
吴涛
吴鼎新
赵奕娇
沙雯怡
庄俊杰
张新茹
陈瑶
葛格
章宇轩
柴树林
谢乐恬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huaiyin Institute of Technology
Original Assignee
Huaiyin Institute of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huaiyin Institute of Technology filed Critical Huaiyin Institute of Technology
Priority to CN202111142142.8A priority Critical patent/CN114021883A/en
Publication of CN114021883A publication Critical patent/CN114021883A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0631Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06315Needs-based resource requirements planning or analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0645Rental transactions; Leasing transactions

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Databases & Information Systems (AREA)
  • Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Computation (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Educational Administration (AREA)
  • Traffic Control Systems (AREA)

Abstract

The invention discloses a dispatching method for a subway transfer shared bicycle in a peak period, which comprises the steps of acquiring subway station data and shared bicycle data of a research area in advance, and carrying out data cleaning and data analysis on the shared bicycle data; dividing the research area into areas by using a Geohash algorithm, converting the areas into a coordinate system, and obtaining the distance between the areas through coordinates so as to determine the distance problem during scheduling; classifying the shared bicycle by using a DBSCAN clustering algorithm, wherein a clustering result provides a scheduling object for implementing a greedy algorithm; and selecting the optimal substructure by using a greedy algorithm and combining the prior data to obtain a local optimal scheduling scheme. The invention is not only beneficial to relieving serious traffic pressure, but also has positive reference significance for solving the problem of the tide of the bicycle in other areas, and promotes the development of a rail transit station shared bicycle connection system to a healthier and more reasonable direction.

Description

Dispatching method for subway transfer shared bicycle in peak period
Technical Field
The invention belongs to the field of traffic, and particularly relates to a dispatching method of a subway transfer shared bicycle in a peak period.
Background
The shared bicycle is an emerging vehicle, has the characteristics of convenience and no pollution, and is rapidly developed after being pushed out. The shared bicycle greatly solves the problem of the last kilometer, and gradually becomes one of the important connection modes for public transport travel.
The subway connection sharing bicycle improves the traveling efficiency of people, however, around some subway stations, the following problems exist in connection of the subway and the sharing bicycle: if the car is returned at a fixed rental location, or the rental location has no car, and the rental location is full, the travel experience of the pedestrians is seriously influenced.
Particularly, in the peak period, the phenomenon that one vehicle is difficult to share is more serious. Meanwhile, the shared bicycle is parked around some stations, so that the phenomenon of parking congestion occurs, and the traffic service capability of corresponding areas is weakened. And management departments lack detailed demand forecasting support, and the refined management of 'one station for one strategy' is difficult to be carried out only by the total quantity of regional trips.
Therefore, a method is needed to alleviate the problems of disordered parking and uneven distribution of the shared bicycle at the subway station entrance and solve the problem of tide of the shared bicycle.
Disclosure of Invention
The purpose of the invention is as follows: aiming at the defects in the prior art, the invention provides a dispatching method of a subway transfer shared bicycle in a peak period, so as to relieve the disordered parking of the shared bicycle at the current subway station port and solve the problem of uneven distribution of the shared bicycle.
The technical scheme is as follows: the invention provides a dispatching method of a subway transfer shared bicycle in a peak period, which specifically comprises the following steps:
(1) crawling subway station data in the targeted area on a hectogram application program by utilizing a Python crawler;
(2) the method comprises the steps that shared bicycle data in a targeted area are crawled by a Python crawler and a related website, and data cleaning and data analysis are conducted on the shared bicycle data;
(3) dividing the regions into regions by using a Geohash algorithm, establishing a coordinate system, and obtaining the distance between the regions through coordinates in the coordinate system;
(4) the shared bicycle data are classified by using a DBSCAN clustering algorithm, the shared bicycle data can be divided into three types of core points, boundary points and noise points according to a clustering result, the core points are areas with large usage of the shared bicycle, the noise points are areas with small usage, the usage of the shared bicycle of the noise points is between the core points and the noise points, and the clustering result provides a scheduling object for implementing the greedy algorithm;
(5) and selecting an optimal substructure by using a greedy algorithm and combining the regional distance data provided by the Geohash algorithm and the scheduling object data provided by the DBSCAN clustering algorithm to obtain a local optimal scheduling scheme.
Further, the step (1) is realized as follows:
the method comprises the steps of establishing a Baidu map application from a Baidu map open platform by utilizing a Python crawler, applying for a signaling AK, opening a POI module of an API (application program interface), selecting a retrieval mode, setting a retrieval city, retrieving a keyword subway, substituting an obtained URL (uniform resource locator) and the Baidu signaling AK into the Python, and utilizing Python coding to obtain required data.
Further, the step (3) includes the steps of:
(31) obtaining a region Geohash code according to the geographical position information; obtaining a regional Geohash code by utilizing a Geohash six-bit code and setting precision;
(32) according to the Geohash code graph, establishing a plane rectangular coordinate system as a distance coordinate system, making a point line graph, and obtaining the distance of a corresponding area according to the distance between points; converting the longitude and latitude of the shared bicycle order information into a Geohash code, and recording the bicycle stock at each position according to the bicycle lock opening and closing:
Figure BDA0003284148920000021
wherein, LOCK _ STAUS represents the locking and unlocking record of the bicycle, 0 is unlocking, 1 is locking, and underflow represents the regional stock; inflow-1 represents the regional inventory-1; inflow +1 represents the zone inventory + 1.
Further, the step (4) comprises the steps of:
(41) calculating Euclidean distance between data in the set area:
setting data in the shared bicycle gathering area as two tuples, and calculating the Euclidean distance between the data in the two tuples:
Figure BDA0003284148920000022
where d is the Euclidean distance between the data in two tuples, x1Longitude, y, for a set of shared bicycle data in the first tuple1For a group of shared numbers of single cars in the first tupleAccording to the latitude; x is the number of2Longitude for a set of shared bicycle data within the second tuple; y is2A latitude for a set of shared bicycle data within the second tuple;
(42) selecting a radius parameter r and a neighborhood density threshold minPts, and utilizing Python coding to realize clustering:
using Python coding to take a data set packaged in sklern as a parameter, setting two lists by combining the number of the obtained Euclidean distance calculation points, putting the accessed points into one list, putting the points which are not accessed into the other list, and selecting a proper radius r and a neighborhood density threshold Eps for traversal processing; if the list for storing the points which are not accessed yet has points, the traversal processing is continued until the list reaches an empty set; if there is no access point, randomly selecting a no access point, if the number of objects in the neighborhood of the point is larger than a specified threshold, indicating that the point is a core object, adding the accessed points to the points in the neighborhood of the point, removing the no access points, finding out the core objects of the points, and placing the core objects in a list, which is called core point; putting the remaining points in the neighborhood into a list, called boundary points; the other points are placed in a list called noise points;
(43) evaluation with reference to the contour coefficients yields the following formula:
Figure BDA0003284148920000031
wherein a (i) is the average distance from the sample i to other samples in the same cluster, b (i) is the average distance from the sample i to some other cluster CjIf s (i) is close to 1, the average distance of all samples in (1) indicates that the sample i is reasonably clustered; if the clustering is reasonable, ending; and unreasonable, selecting a new radius r and a neighborhood density threshold Eps, and repeating the steps.
Further, the step (5) includes the steps of:
(51) acquiring a shared bicycle gathering area and flow:
substituting the shared bicycle data of each type of clusters of the DBSCAN clustering algorithm into Python, obtaining the area and the central longitude and latitude of each bicycle aggregation area by using the Python, and numbering each shared bicycle aggregation area; time extraction is carried out on order data, and specific regional flow statistics is completed, namely the flow of different times in a certain range is counted;
(52) determining an optimization index:
the optimization index is min (1-global single vehicle utilization rate) + scheduling cost
Min (1-global bicycle utilization) + min (number of dispatching bicycles)
+ min (number of dispatching bicycles) + min (distance dispatching bicycle)
(53) Selecting an optimal substructure, and implementing a greedy algorithm by using Python coding:
comparing the number of the contained bicycles and the bicycle flow of each shared bicycle aggregation area by utilizing Python codes, calculating the value of subtracting the number of the contained shared bicycles from the bicycle flow of each aggregation area, and putting the value into two lists according to the positive and negative of the obtained value, wherein if the value is 0, the value can not be considered; maximizing the optimization index to be used as an optimal substructure of a greedy algorithm; randomly selecting one from the list for storing positive values as an initial target, and selecting data from the list for storing negative values to enable the sum of the positive values and the negative values to be close to 0, wherein the selected number of the negative values can be selected more; and sequentially proceeding downwards until the lists for storing positive values all finish scheduling, and obtaining a local optimal solution.
Has the advantages that: compared with the prior art, the invention has the beneficial effects that: 1. according to the invention, a Geohash algorithm is utilized to divide a region to be researched into regions, the regions are converted into a coordinate system, and the distance between the regions is obtained through coordinates, so that the distance problem during dispatching is determined; 2. according to the invention, a DBSCAN clustering algorithm and a greedy algorithm are combined to obtain a more reasonable and effective scheduling scheme, and an unreasonable or less reasonable scheduling scheme obtained by a single algorithm due to error is eliminated; 3. the rationality and effectiveness of the scheduling scheme are verified by using analog simulation, and if the rationality and effectiveness are reasonable, the scheduling scheme is ended; otherwise, returning to the algorithm step; 4. the dispatching method for solving the problem of subway transfer shared bicycle in the peak period can relieve serious traffic pressure, and has positive reference significance in solving the problem of tide of the shared bicycle.
Drawings
FIG. 1 is a flow chart of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings.
The invention provides a dispatching method of a subway transfer shared bicycle in a peak period, which specifically comprises the following steps as shown in figure 1:
step 1: the method comprises the steps of establishing a Baidu map application on a Baidu map open platform by utilizing a Python crawler, applying for a signaling AK, opening a POI module of an API (application program interface), selecting a rectangular retrieval mode, setting a retrieval city, retrieving a keyword subway, substituting an obtained URL (uniform resource locator) and the Baidu signaling AK into the Python, and utilizing Python coding to obtain required data.
And in the crawling process, attention is paid to conversion between the Baidu coordinates and the Mars coordinates, and data visualization is realized by using a Gaode map API after data cleaning.
Step 2: and crawling the shared bicycle data in the targeted area by using a Python crawler and a related website, and performing data cleaning and data analysis on the shared bicycle data.
In the embodiment, when cleaning the shared bicycle data, the order data of the shared bicycle is crawled by using Python in a digital Chinese website, if one cycle of unlocking and locking occurs in one order, the order is a group of successful orders, and if the other cycle of continuously unlocking for multiple times or continuously locking for multiple times occurs, the order is defined as a problem order. When a problem order is found, the first unlocking record and the last locking record are stored, and other redundant records are deleted, so that the accuracy of the number of vehicles is ensured.
Then, the shared bicycle data is analyzed, in this embodiment, the shared bicycle data is calculated from 6 in the morning: 00 to night 21: 00 quantitative analysis is carried out every 1 hour, and the analysis determines the peak time of the connection of the sharing single vehicle.
And step 3: dividing the research area into areas by using a Geohash algorithm, establishing a coordinate system, obtaining the distance between the areas through coordinates in the coordinate system, and determining the distance between the areas when scheduling is performed by using a greedy algorithm, wherein the specific process comprises the following steps:
(3.1) aiming at the geographical position information, obtaining a region Geohash code: in this embodiment, the geo-hash six-bit code is used to set the precision, the latitude and longitude information of a region is converted into a string code which can be sorted and compared, i.e., the geo-hash code of the region is obtained, and a geo-hash code map is drawn by using drawing software (such as a high-grade map API).
And (3.2) converting the longitude and latitude of the shared bicycle into a Geohash code, and recording the storage amount of the shared bicycle, wherein the specific process comprises the steps of establishing a plane rectangular coordinate system as a distance coordinate system of the shared bicycle according to a Geohash code diagram, manufacturing a point line diagram, and obtaining the distance between areas according to the distance between points (the distance between the areas can be used in the dispatching distance in the greedy algorithm optimization index). According to the order information of the shared bicycle, the latitude and longitude of each order information is converted into a Geohash code (the Geohash code is that the latitude and longitude information of a region is converted into a character string code which can be sequenced and compared), and the bicycle storage at each position is recorded according to the bicycle lock, namely the bicycle storage at each position is recorded according to the bicycle lock
Figure BDA0003284148920000051
The LOCK _ STAUS represents a single-vehicle LOCK-unlocking record, 0 is unlocking, 1 is locking, inflow represents area stock, inflow-1 represents that a shared single vehicle leaves the area because of unlocking, the area stock is-1, and inflow +1 represents that the shared single vehicle stays in the area because of locking, and the area stock is + 1.
The division of the area is that the longitude and latitude conversion is based on the dichotomy idea, namely, the earth dimension interval is divided into two intervals, the left interval is 0, the right interval is 1, and the division is continuously carried out according to the precision requirement. Dividing the earth dimension interval of-90, 90 to obtain two intervals of-90, 0 and 0,90, wherein the left interval of-90, 0 is 0 and the right interval of 0,90 is 1, and obtaining table 1.
Table 1 partitioning a region into regions using a Geohash algorithm
bit min mid max
1 -90 0 90
0 0 45 90
1 0 22.5 45
1 22.5 33.75 45
And 4, step 4: the shared bicycle data are classified by using a DBSCAN clustering algorithm, the shared bicycle data can be classified into three types of core points, boundary points and noise points according to clustering results, the core points are areas with large shared bicycle usage, the noise points are areas with small usage, the shared bicycle usage of the noise points is between the core points and the noise points, and the clustering results provide scheduling objects for implementing the greedy algorithm.
The DBSCAN clustering algorithm of the invention utilizes numpy, random and math libraries in Python and refers to data set databases packaged in sklearn to implement the DBSCAN clustering algorithm, and the specific process is as follows:
(4.1) calculating the Euclidean distance between two groups of shared bicycle data in the set area:
setting data in the shared bicycle gathering area as two tuples, and calculating the Euclidean distance between the data in the two tuples:
Figure BDA0003284148920000061
wherein d is the Euclidean distance between data in two tuples; x is the number of1Longitude, y, for a set of shared bicycle data in the first tuple1Latitude of a group of shared bicycle data in the first tuple; x is the number of2Longitude for a set of shared bicycle data within the second tuple; y is2The latitude of a set of shared bicycle data within the second tuple.
(4.2) selecting a radius parameter r and a neighborhood density threshold minPts of a DBSCAN clustering algorithm, and clustering by utilizing Python coding:
and (2) taking a data set packaged in the sklern as a parameter by utilizing Python coding, combining the number of Euclidean distance calculation points obtained in the step (1), constructing two lists by utilizing Python, putting accessed points into one list, putting points which are not accessed into the other list, and performing traversal processing on a row with the radius r and the neighborhood density threshold value minPts. If there are more points on the list that have not been accessed, the traversal process continues until the list reaches an empty set. If there is no access point, randomly selecting a no access point, if the number of objects in the neighborhood of the point is larger than a specified threshold, indicating that the point is a core object, adding the accessed points to the points in the neighborhood of the point, removing the no access points, finding out the core objects of the points, and placing the core objects in a list, which is called core point; putting the remaining points in the neighborhood into a list, called boundary points; the other points are placed in a list called noise points, i.e. the data is classified into three categories, core points, boundary points and noise points.
(4.3) evaluating the rationality of the DBSCAN clustering algorithm:
the invention refers to the contour coefficient for evaluation, and the evaluation is carried out by using the following formula:
Figure BDA0003284148920000071
wherein, a (i) is the average distance from the sample i to other samples in the same cluster, b (i) is the average distance from the sample i to all samples in some other cluster, and s (i) is the rationality value of the clustering result, and if s (i) is closer to 1, the more rational the clustering of the sample i is. Generally, it is reasonable to cluster s (i) greater than 0.85.
If the DBSCAN cluster is reasonable, ending; and unreasonable, selecting a new radius r and a neighborhood density threshold value minPts, and repeating the steps. The radius r is typically of the order of ± 10 from the calculated euclidean distance mean.
And 5: the optimal substructure of the greedy algorithm is selected by utilizing the greedy algorithm and combining the prior data to obtain a local optimal scheduling scheme, and the specific process is as follows:
(5.1) acquiring the gathering area and the flow rate of the shared bicycle:
and substituting the shared bicycle data of each type of clusters in the DBSCAN clustering algorithm into Python, obtaining the area and the central longitude and latitude of each shared bicycle aggregation area by using the Python, and numbering each shared bicycle aggregation area. In this embodiment, order data is extracted from order data time, and specific statistics of the flow rate of the shared vehicles in the area is completed, that is, the flow rate of the shared vehicle in the area where the shared vehicles are gathered at different times in a certain range is counted.
(5.2) determining an optimization index when the shared bicycle is dispatched:
when the method for dispatching the shared bicycle is used for solving the connection problem, the cost is considered, and the specific weighing factors of dispatching comprise: the usage rate of the shared bicycle after scheduling; scheduling cost in the scheduling process; scheduling the number of the shared bicycles; and dispatching the number of the shared bicycle. The optimization indexes of the shared bicycle scheduling method are as follows:
the optimization index is min (1-global single vehicle utilization rate) + scheduling cost
Min (1-global bicycle utilization) + min (number of dispatching bicycles)
+ min (number of dispatching bicycles) + min (distance dispatching bicycle)
(5.3) selecting an optimal substructure, and using Python coding to realize a greedy algorithm:
the invention compares the number of the contained single vehicles and the single vehicle flow of each shared single vehicle gathering area by utilizing Python codes, and calculates the difference value of the single vehicle flow of each gathering area minus the number of the contained shared single vehicles. The formula is as follows:
n=flow-amount
and n is a difference value, flow is the flow of the shared bicycles in one aggregation area, and amount is the number of the shared bicycles in one aggregation area.
And constructing two lists by utilizing Python, and putting the two lists according to the positive and negative properties of the obtained difference value n, namely putting a positive value in the difference value n into one list, putting a negative value in the difference value n into the other list, and if the value is 0, not considering the putting into the list. And maximizing the optimized value of the optimization index to be used as an optimal substructure of the greedy algorithm. Randomly selecting one from the list for storing positive values as an initial target, and selecting data from the list for storing negative values to enable the sum of the positive values and the negative values to be close to 0, wherein the selection number of the negative values can be selected more. And sequentially proceeding downwards until the lists for storing positive values all finish scheduling, and obtaining a local optimal solution, namely an optimal scheduling scheme.
And verifying the effectiveness of the obtained scheduling scheme by using analog modeling, wherein the specific process is as follows:
1) inputting the longitude and latitude of a subway station, and establishing a model according to data:
in this embodiment, two models before and after the optimization of the subway station are constructed, one model is a control group model, and the other model is an experimental group model. Firstly, finding a research area by using an electronic map GIS (geographic information system) of the analog, inputting the longitude and latitude of subway stations of the research area into the electronic map GIS, and setting the number of people flowing out of each station per hour as the average value of the number of people using a sharing bicycle to plug in each station per hour. A shared bicycle gathering area is set up according to the data near each station. At this point, the preliminary model building of the two models is completed. On the basis of the preliminary model, maintaining an original scheduling scheme for shared single vehicles in a research area in a group model; and dispatching the shared single vehicles in the research area in the experimental group model according to the result obtained by the greedy algorithm.
2) And running a control group model and an experimental group model, comparing the number of people staying in the two groups of stations, and verifying the rationality and effectiveness of the scheduling scheme.
TABLE 2 comparison of results after model run
Figure BDA0003284148920000091
After 24 hours of operation, the amount of retained pedestrians at the station is compared. If the number of detained people at each site of the experimental group model is reduced by more than 15 percent compared with that of the control group model, the scheduling scheme is reasonable and effective, and the process is finished; if the reduction is not realized or the reduction degree is not 15%, the scheduling scheme is unreasonable and ineffective, and the greedy algorithm step 5 is returned. The result of the model operation is shown in table 2, and the rationality and effectiveness of the scheduling scheme of the present invention can be seen from table 2.

Claims (5)

1. A dispatching method for subway transfer shared bicycle in peak period is characterized by comprising the following steps:
(1) crawling subway station data in the targeted area on a hectogram application program by utilizing a Python crawler;
(2) the method comprises the steps that shared bicycle data in a targeted area are crawled by a Python crawler and a related website, and data cleaning and data analysis are conducted on the shared bicycle data;
(3) dividing the regions into regions by using a Geohash algorithm, establishing a coordinate system, and obtaining the distance between the regions through coordinates in the coordinate system;
(4) the shared bicycle data are classified by using a DBSCAN clustering algorithm, the shared bicycle data can be divided into three types of core points, boundary points and noise points according to a clustering result, the core points are areas with large usage of the shared bicycle, the noise points are areas with small usage, the usage of the shared bicycle of the noise points is between the core points and the noise points, and the clustering result provides a scheduling object for implementing the greedy algorithm;
(5) and selecting an optimal substructure by using a greedy algorithm and combining the regional distance data provided by the Geohash algorithm and the scheduling object data provided by the DBSCAN clustering algorithm to obtain a local optimal scheduling scheme.
2. The method for dispatching the subway docking sharing single vehicle in the rush hour according to claim 1, wherein said step (1) is implemented as follows:
the method comprises the steps of establishing a Baidu map application from a Baidu map open platform by utilizing a Python crawler, applying for a signaling AK, opening a POI module of an API (application program interface), selecting a retrieval mode, setting a retrieval city, retrieving a keyword subway, substituting an obtained URL (uniform resource locator) and the Baidu signaling AK into the Python, and utilizing Python coding to obtain required data.
3. The method for dispatching the subway docking sharing single vehicle during the rush hour according to claim 1, wherein said step (3) comprises the steps of:
(31) obtaining a region Geohash code according to the geographical position information; obtaining a regional Geohash code by utilizing a Geohash six-bit code and setting precision;
(32) according to the Geohash code graph, establishing a plane rectangular coordinate system as a distance coordinate system, making a point line graph, and obtaining the distance of a corresponding area according to the distance between points; converting the longitude and latitude of the shared bicycle order information into a Geohash code, and recording the bicycle stock at each position according to the bicycle lock opening and closing:
Figure FDA0003284148910000011
wherein, LOCK _ STAUS represents the locking and unlocking record of the bicycle, 0 is unlocking, 1 is locking, and underflow represents the regional stock; inflow-1 represents the regional inventory-1; inflow +1 represents the zone inventory + 1.
4. The method for dispatching the subway docking sharing single vehicle during the rush hour according to claim 1, wherein said step (4) comprises the steps of:
(41) calculating Euclidean distance between data in the set area:
setting data in the shared bicycle gathering area as two tuples, and calculating the Euclidean distance between the data in the two tuples:
Figure FDA0003284148910000021
where d is the Euclidean distance between the data in two tuples, x1Longitude, y, for a set of shared bicycle data in the first tuple1Latitude of a group of shared bicycle data in the first tuple; x is the number of2Longitude for a set of shared bicycle data within the second tuple; y is2A latitude for a set of shared bicycle data within the second tuple;
(42) selecting a radius parameter r and a neighborhood density threshold minPts, and utilizing Python coding to realize clustering:
using Python coding to take a data set packaged in sklern as a parameter, setting two lists by combining the number of the obtained Euclidean distance calculation points, putting the accessed points into one list, putting the points which are not accessed into the other list, and selecting a proper radius r and a neighborhood density threshold Eps for traversal processing; if the list for storing the points which are not accessed yet has points, the traversal processing is continued until the list reaches an empty set; if there is no access point, randomly selecting a no access point, if the number of objects in the neighborhood of the point is larger than a specified threshold, indicating that the point is a core object, adding the accessed points to the points in the neighborhood of the point, removing the no access points, finding out the core objects of the points, and placing the core objects in a list, which is called core point; putting the remaining points in the neighborhood into a list, called boundary points; the other points are placed in a list called noise points;
(43) evaluation with reference to the contour coefficients yields the following formula:
Figure FDA0003284148910000022
wherein a (i) is the average distance from the sample i to other samples in the same cluster, b (i) is the average distance from the sample i to some other cluster CjIf s (i) is close to 1, the average distance of all samples in (1) indicates that the sample i is reasonably clustered; if the clustering is reasonable, ending; and unreasonable, selecting a new radius r and a neighborhood density threshold Eps, and repeating the steps.
5. The method for dispatching of the rush hour subway docking sharing bicycle according to claim 1, wherein said step (5) comprises the steps of:
(51) acquiring a shared bicycle gathering area and flow:
substituting the shared bicycle data of each type of clusters of the DBSCAN clustering algorithm into Python, obtaining the area and the central longitude and latitude of each bicycle aggregation area by using the Python, and numbering each shared bicycle aggregation area; time extraction is carried out on order data, and specific regional flow statistics is completed, namely the flow of different times in a certain range is counted;
(52) determining an optimization index:
the optimization index is min (1-global single vehicle utilization rate) + scheduling cost
Min (1-global bicycle usage) + min (number of scheduled bicycles) + min (scheduled bicycle distance)
(53) Selecting an optimal substructure, and implementing a greedy algorithm by using Python coding:
comparing the number of the contained bicycles and the bicycle flow of each shared bicycle aggregation area by utilizing Python codes, calculating the value of subtracting the number of the contained shared bicycles from the bicycle flow of each aggregation area, and putting the value into two lists according to the positive and negative of the obtained value, wherein if the value is 0, the value can not be considered; maximizing the optimization index to be used as an optimal substructure of a greedy algorithm; randomly selecting one from the list for storing positive values as an initial target, and selecting data from the list for storing negative values to enable the sum of the positive values and the negative values to be close to 0, wherein the selected number of the negative values can be selected more; and sequentially proceeding downwards until the lists for storing positive values all finish scheduling, and obtaining a local optimal solution.
CN202111142142.8A 2021-09-28 2021-09-28 Dispatching method for subway transfer shared bicycle in peak period Pending CN114021883A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111142142.8A CN114021883A (en) 2021-09-28 2021-09-28 Dispatching method for subway transfer shared bicycle in peak period

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111142142.8A CN114021883A (en) 2021-09-28 2021-09-28 Dispatching method for subway transfer shared bicycle in peak period

Publications (1)

Publication Number Publication Date
CN114021883A true CN114021883A (en) 2022-02-08

Family

ID=80054879

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111142142.8A Pending CN114021883A (en) 2021-09-28 2021-09-28 Dispatching method for subway transfer shared bicycle in peak period

Country Status (1)

Country Link
CN (1) CN114021883A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114022195A (en) * 2021-10-21 2022-02-08 淮阴工学院 Express industry delivery network planning method based on density and combined with cell data
CN115438898A (en) * 2022-05-25 2022-12-06 珠海优特电力科技股份有限公司 First object distribution method and device, storage medium and electronic device
CN115619051A (en) * 2022-12-19 2023-01-17 北京阿帕科蓝科技有限公司 Shared vehicle allocation method, system and computer storage medium
CN116703132A (en) * 2023-08-08 2023-09-05 北京阿帕科蓝科技有限公司 Management method and device for dynamic scheduling of shared vehicles and computer equipment
CN116824861A (en) * 2023-08-24 2023-09-29 北京亦庄智能城市研究院集团有限公司 Method and system for scheduling sharing bicycle based on multidimensional data of urban brain platform

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109508865A (en) * 2018-10-19 2019-03-22 东南大学 The dispositions method of bicycle is shared in subway station radiation scope based on space-time use pattern
CN110147885A (en) * 2019-05-07 2019-08-20 浙江大学城市学院 A kind of shared bicycle of improved adaptive GA-IAGA parks point distribution method
CN110210749A (en) * 2019-05-28 2019-09-06 东南大学 A kind of urban railway station is connected shared bicycle injected volume and placement position determines method
CN110472999A (en) * 2019-07-17 2019-11-19 深圳市综合交通运行指挥中心 Passenger flow pattern analysis method and device based on subway and shared bicycle data
CN110704993A (en) * 2019-09-11 2020-01-17 东南大学 Customized bus route design method for relieving subway passenger flow pressure
CN111190982A (en) * 2019-12-25 2020-05-22 航天信息股份有限公司 Shared bicycle management method and system based on big data
CN112734532A (en) * 2021-01-15 2021-04-30 东南大学 Commuting behavior identification method based on shared electric bicycle borrowing and returning point data
WO2021135208A1 (en) * 2019-12-31 2021-07-08 苏宁云计算有限公司 Delivery path planning method and system taking order aggregation degree into consideration
CN114022195A (en) * 2021-10-21 2022-02-08 淮阴工学院 Express industry delivery network planning method based on density and combined with cell data
CN116663843A (en) * 2023-06-15 2023-08-29 武汉理工大学 Shared bicycle comprehensive scheduling method and system for early-peak subway station connection

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109508865A (en) * 2018-10-19 2019-03-22 东南大学 The dispositions method of bicycle is shared in subway station radiation scope based on space-time use pattern
CN110147885A (en) * 2019-05-07 2019-08-20 浙江大学城市学院 A kind of shared bicycle of improved adaptive GA-IAGA parks point distribution method
CN110210749A (en) * 2019-05-28 2019-09-06 东南大学 A kind of urban railway station is connected shared bicycle injected volume and placement position determines method
CN110472999A (en) * 2019-07-17 2019-11-19 深圳市综合交通运行指挥中心 Passenger flow pattern analysis method and device based on subway and shared bicycle data
CN110704993A (en) * 2019-09-11 2020-01-17 东南大学 Customized bus route design method for relieving subway passenger flow pressure
CN111190982A (en) * 2019-12-25 2020-05-22 航天信息股份有限公司 Shared bicycle management method and system based on big data
WO2021135208A1 (en) * 2019-12-31 2021-07-08 苏宁云计算有限公司 Delivery path planning method and system taking order aggregation degree into consideration
CN112734532A (en) * 2021-01-15 2021-04-30 东南大学 Commuting behavior identification method based on shared electric bicycle borrowing and returning point data
CN114022195A (en) * 2021-10-21 2022-02-08 淮阴工学院 Express industry delivery network planning method based on density and combined with cell data
CN116663843A (en) * 2023-06-15 2023-08-29 武汉理工大学 Shared bicycle comprehensive scheduling method and system for early-peak subway station connection

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XIE XINXIN 等: "Research on optimizing the layout of parking site for shared bicycles", 《TRANSPORTATIONREA RESEARCH BOARD》, 17 September 2020 (2020-09-17), pages 2588 - 2599 *
靳爽;庞明宝;: "基于K-means的城市轨道交通社区接驳共享单车停靠点规划", 科学技术与工程, no. 30, 28 October 2019 (2019-10-28) *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114022195A (en) * 2021-10-21 2022-02-08 淮阴工学院 Express industry delivery network planning method based on density and combined with cell data
CN115438898A (en) * 2022-05-25 2022-12-06 珠海优特电力科技股份有限公司 First object distribution method and device, storage medium and electronic device
CN115619051A (en) * 2022-12-19 2023-01-17 北京阿帕科蓝科技有限公司 Shared vehicle allocation method, system and computer storage medium
CN116703132A (en) * 2023-08-08 2023-09-05 北京阿帕科蓝科技有限公司 Management method and device for dynamic scheduling of shared vehicles and computer equipment
CN116703132B (en) * 2023-08-08 2023-12-12 北京阿帕科蓝科技有限公司 Management method and device for dynamic scheduling of shared vehicles and computer equipment
CN116824861A (en) * 2023-08-24 2023-09-29 北京亦庄智能城市研究院集团有限公司 Method and system for scheduling sharing bicycle based on multidimensional data of urban brain platform
CN116824861B (en) * 2023-08-24 2023-12-05 北京亦庄智能城市研究院集团有限公司 Method and system for scheduling sharing bicycle based on multidimensional data of urban brain platform

Similar Documents

Publication Publication Date Title
CN114021883A (en) Dispatching method for subway transfer shared bicycle in peak period
CN105788260B (en) A kind of bus passenger OD projectional techniques based on intelligent public transportation system data
CN110390349A (en) Bus passenger flow volume based on XGBoost model predicts modeling method
CN109543934B (en) Method for evaluating comprehensive index of urban public transport network
CN110836675B (en) Decision tree-based automatic driving search decision method
CN111079875A (en) Public transport passenger flow monitoring method and device based on multi-source data and storage medium
CN104376327B (en) A kind of clustering method of public bicycles lease point
CN110472999B (en) Passenger flow mode analysis method and device based on subway and shared bicycle data
CN111881939B (en) Shared single-vehicle parking area layout method based on clustering algorithm
CN112734532B (en) Commute behavior identification method based on shared electric bicycle borrowing and returning point data
CN105868861A (en) Bus passenger flow evolution analysis method based on time-space data fusion
CN112036757B (en) Mobile phone signaling and floating car data-based parking transfer parking lot site selection method
CN111914940B (en) Shared vehicle station clustering method, system, device and storage medium
CN113344240A (en) Shared bicycle flow prediction method and system
CN110245774A (en) A method of regular service route optimization is carried out according to employee's home address
CN111190942A (en) Urban road parking spot overall analysis method based on data mining technology
CN115662124A (en) GPS track data road section flow matching method based on network coding
CN115496316A (en) Multi-source data-based bus line passenger flow dynamic distribution method
CN116611586A (en) Newly built road network flow prediction method and system based on double-layer heterogeneous network
CN113850310A (en) Shared bicycle electronic fence planning method based on block subdivision and maximum area coverage
CN114022195A (en) Express industry delivery network planning method based on density and combined with cell data
CN113868553B (en) Layered taxi passenger carrying recommendation method and system
CN115294773A (en) Bus lane optimal configuration method, electronic device and storage medium
CN113723459B (en) Bus operation time interval division and one-way point recommendation method based on time aggregation
CN112308339B (en) Processing method and device of charging data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination