WO2017133627A1 - User commuter track management method, device and system - Google Patents

User commuter track management method, device and system Download PDF

Info

Publication number
WO2017133627A1
WO2017133627A1 PCT/CN2017/072697 CN2017072697W WO2017133627A1 WO 2017133627 A1 WO2017133627 A1 WO 2017133627A1 CN 2017072697 W CN2017072697 W CN 2017072697W WO 2017133627 A1 WO2017133627 A1 WO 2017133627A1
Authority
WO
WIPO (PCT)
Prior art keywords
trajectory
commute
user
cluster
navigation path
Prior art date
Application number
PCT/CN2017/072697
Other languages
French (fr)
Chinese (zh)
Inventor
韦薇
谢思远
范贤友
王启贵
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2017133627A1 publication Critical patent/WO2017133627A1/en

Links

Images

Classifications

    • G06Q50/40
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/29Geographical information databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Definitions

  • the present disclosure relates to the field of user commuting, for example, to a user commute trajectory management method, apparatus, and system.
  • Commuting commuting is the focus of urban transportation. In order to improve the traffic experience of users commuting to work, the concept of commuter transportation is proposed. Commuting traffic connects family position and working position with strong regularity.
  • the travel rules of commuting traffic of relevant urban residents usually adopt macroscopic surveys and research. For example, the questionnaires are used, and the travel rules of commuting traffic of relevant urban residents are not timely, the sample size is insufficient, and the user's commuting cannot be fully understood and controlled.
  • the present disclosure provides a user commute trajectory management method, device and system, which can fully understand and control user commuting.
  • the present disclosure provides a user commute trajectory management method, including:
  • the travel data includes a time stamp corresponding to the stay point and the stay point;
  • the regularity track of each user's user is output to the commuter road management system.
  • calculating a regularity track of each user according to each user's travel data includes:
  • the commute feature is calculated based on the travel data of each user, and the commute features include a home location, a work location, and a commute time period;
  • the stay point in the travel data, and the time stamp corresponding to the stop point The travel data belonging to the commute time period is filtered, and the stay points in the filtered travel data are sorted by time stamp to generate a commute trajectory;
  • the navigation path is used as the commuting regular trajectory of the cluster cluster, according to the cluster cluster
  • the regular trajectory of commuting generates a regular trajectory of the user.
  • the commute time includes a commute time and a commute time
  • the commute trajectory includes a commute trajectory and a commute trajectory
  • the navigation path is generated according to a time stamp corresponding to the home location, the work location, the frequent commute trajectory, and the frequent commute trajectory.
  • the method further includes: if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold, reselecting the frequent trajectory trajectory and generating a new navigation path, calculating and determining the new navigation path and representing the commute Whether the trajectory similarity between the trajectories is greater than the second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, the new navigation path is used as the commuting rule of the cluster cluster a trajectory, if the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, performing a step of reselecting frequent points of the commute trajectory and generating a new navigation path until the new navigation path The trajectory similarity to the representative commute trajectory is greater than the second threshold.
  • the method further includes: calculating a navigation estimation duration of the navigation path, and determining whether the navigation estimation duration of the navigation path and the representative commute are performed when determining whether the trajectory similarity between the navigation path and the representative commutation trajectory is greater than a second threshold. Whether the difference of the duration of the trajectory is smaller than the third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, and the navigation estimation duration and representative of the navigation path If the duration difference of the commute trajectory is less than the third threshold, the navigation path is taken as the commuting regular trajectory of the cluster cluster.
  • generating a user regular trajectory according to the commuting regular trajectory of the clustering cluster comprises: calculating a proportion of the number of commuting trajectories of each cluster cluster, and occupying a number of commuting trajectories of the cluster cluster Compared with the weight of the commuting regular trajectory as the clustering cluster, the set of commuting regular trajectories of each clustering cluster is taken as the user regular trajectory; and if the clustering cluster is one, the commuting regular trajectory according to the clustering cluster Generating the user's regular trajectory includes: using the commuting regular trajectory of the cluster cluster as the user's regular trajectory.
  • the method further includes: deleting the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
  • determining a representative commute trajectory of each cluster cluster includes: calculating, by one by one, a sum of trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster, and a commuting trajectory having the largest sum of similar trajectories As a representative commuter track.
  • the calculation of the trajectory similarity between the trajectories comprises: calculating the trajectory similarity between the trajectory A and the trajectory B into all the locations of the trajectory A path, and the minimum distance from the point of the trajectory B path is less than the fifth threshold
  • the number a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the points of the trajectory B path; and the calculated trajectory A and the trajectory B
  • the trajectory similarity between (a'+b')/(a+b) a is the number of locations of the trajectory A pathway, b is the number of locations of the trajectory A pathway, and / represents the division operation.
  • the method further includes: calculating, by using an outlier detection algorithm, a trajectory anomaly coefficient, and deleting a commutation trajectory whose trajectory anomaly coefficient is greater than a sixth threshold.
  • the method further includes: selecting, from all the users, the commute users that are identifiable by the home location and the work location, and generating the traversable trajectory of the identifiable commute user.
  • the commuting user that can be identified from all the users by the home location and the work location includes: calculating a discrete entropy of the travel location of each user according to the travel data of each user, and the discrete entropy of the travel location is less than a seventh threshold.
  • the user acts as a commute user; and obtains and according to the travel data of each commute user, identifies the home location and work location of each commute user, and deletes at least one unrecognizable commuter user in the home location and the work location.
  • the present disclosure provides a user commute trajectory management apparatus, including:
  • Obtaining a module configured to obtain, by using a positioning system, travel data of each user, where the travel data includes a time stamp corresponding to the stay point and the stay point;
  • a processing module configured to calculate a regularity trajectory of each user according to each user's travel data
  • the output module is configured to output a regular trajectory of each user to the commuter road management system.
  • the processing module includes:
  • the commute feature calculation sub-module is configured to calculate a commute feature according to each user's travel data, and the commute feature includes a home location, a work location, and a commute time period;
  • the commute trajectory generation sub-module is configured to filter the travel data belonging to the commute time period according to the commute time of the commute feature, the stay point in the travel data, and the time stamp corresponding to the stop point, and select the stay point in the filtered travel data according to the time. Poke sorting generates a commute trajectory;
  • the commuter cluster management sub-module is configured to calculate a trajectory similarity between each two commute trajectories of each user, and a commute trajectory of each user's two commute trajectories with a trajectory similarity greater than a first threshold
  • the class generates a cluster of clusters and determines a representative commute trajectory for each cluster of clusters
  • the navigation path generation sub-module is configured to select a frequent commutation trajectory from the stay points of all the commutation trajectories of each cluster cluster according to the selection strategy, according to the home location, the work location, the frequent points of the commute trajectory, and the time stamp corresponding to the frequent points of the commute trajectory Generate a navigation path;
  • the commute regular track sub-module is set to calculate the trajectory similarity between the navigation path and the representative commute trajectory. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as the clustering group for commuting The regular trajectory generates a user regular trajectory according to the commuting regular trajectory of the cluster cluster.
  • the commute time includes a commute time and an off-duty commute time
  • the commute trajectory includes a commute trajectory and an off-duty commute trajectory
  • the navigation path generation sub-module is set to generate a user if the cluster cluster is generated by the commute trajectory of the commute
  • the home position is the starting point
  • the working position is the ending point
  • the clustering cluster is generated by the commuting trajectory trajectory clustering, the generation starts from the user's working position, the family position is the ending point, and The commute navigation path that frequently passes through the commute trajectory in turn.
  • the commute regular trajectory sub-module is further configured to: if the trajectory similarity between the navigation path and the representative commute trajectory is less than a second threshold, triggering the navigation path generating sub-module to re-select the commuting trajectory frequently Pointing and generating a new navigation path, calculating and determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the The second threshold is used as a commuting regular trajectory of the cluster cluster. If the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop performs reselection. The step of frequently commuting the trajectory and generating a new navigation path until the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold.
  • the navigation path generation sub-module is further configured to calculate a navigation estimation duration of the navigation path
  • the commute regular trajectory sub-module is further configured to: when determining whether the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, Determining whether the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, and the navigation estimation duration of the navigation path is If the duration difference representing the commute trajectory is less than the third threshold, the navigation path is used as the commuting regular trajectory of the cluster cluster.
  • the commute regular trajectory sub-module is set to calculate the proportion of commuting trajectories of each cluster cluster, and the proportion of the commuting trajectories of the cluster clusters is used as the commuting regularity of the cluster clusters.
  • the weight of the trajectory, the set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory; and if the cluster cluster is one, the commute regular trajectory sub-module is set to use the commuting regular trajectory of the cluster cluster as the user Regular trajectory.
  • the commuting regular trajectory sub-module is further configured to delete the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
  • the commuter cluster management sub-module is configured to calculate the sum of the trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster one by one, and the commute trajectory with the largest sum of trajectories is used as the representative commuter trajectory .
  • the processing module further includes a trajectory similarity calculation sub-module, configured to calculate that the trajectory similarity between the trajectory A and the trajectory B is the trajectory A path, and the minimum distance from the point of the trajectory B path is less than the fifth threshold.
  • the number a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the locations of the trajectory B path; and the calculated trajectory A and
  • the trajectory similarity between the trajectories B is (a'+b')/(a+b), a is the number of locations of the trajectory A pathway, b is the number of locations of the trajectory A pathway, and / represents a division operation.
  • the commute trajectory generating sub-module is further configured to calculate a trajectory abnormality coefficient by using an outlier detection algorithm, and delete a commute trajectory whose trajectory abnormal coefficient is greater than a sixth threshold.
  • the commute trajectory generating sub-module is further configured to screen the commute users that are identifiable from the home location and the working location from all the users, and generate a commensurate commute trajectory of the commute user.
  • the commute trajectory generation sub-module is configured to calculate a discrete entropy of each user's travel location according to the travel data of each user, and use a user whose travel location discrete entropy is less than a seventh threshold as a commute user; and obtain and according to each commute The travel data of the user identifies the home location and work location of each commute user, and deletes at least one unrecognizable commuter user of the home location and the work location.
  • the present disclosure provides a user commute trajectory management system, including: a positioning system, a commuter road management system, and a user commute trajectory management apparatus provided by the present disclosure; wherein the positioning system is configured to monitor user travel data, and the travel data includes a stay point and The time stamp corresponding to the stay point; the user commute trajectory management device is configured to acquire the travel data of each user through the positioning system, calculate the regularity trajectory of each user according to the travel data of each user, and output the user rule of each user The trajectory to the commuter road management system; and the commuter road management system is set to manage the commuting road according to the user's regular trajectory.
  • the present disclosure also provides a non-transitory computer readable storage medium storing computer executable instructions arranged to perform the user commute trajectory management method described above.
  • the present disclosure also provides an electronic device, including:
  • At least one processor At least one processor
  • the memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to cause the at least one processor to perform the method described above.
  • the present disclosure provides a user commute trajectory management method, which acquires travel data of a user through a positioning system, such as a positioning device or a communication base station on a user terminal, calculates a user regular trajectory according to user travel data, and outputs, in the process, Without user research, the management of user commuting is enhanced, and a more comprehensive understanding and control of user commuting can be achieved, improving the user experience.
  • a positioning system such as a positioning device or a communication base station on a user terminal
  • FIG. 1 is a schematic structural diagram of a user commute trajectory management system according to a first embodiment of the present disclosure
  • FIG. 2 is a schematic structural diagram of a processing module in a first embodiment of the present disclosure
  • FIG. 3 is a flowchart of a user commute trajectory management method according to a second embodiment of the present disclosure
  • FIG. 5 is a flowchart of a user commute trajectory management method according to a third embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram of calculation of trajectory similarity in a third embodiment of the present disclosure.
  • FIG. 7 is a schematic structural diagram of hardware of an electronic device according to an embodiment of the present disclosure.
  • FIG. 1 is a schematic structural diagram of a user commute trajectory management system according to a first embodiment of the present disclosure.
  • the user commute trajectory management system provided by the present disclosure includes: user commute trajectory management provided by the present disclosure.
  • the positioning system 2 is configured to monitor the travel data of the user, and the travel data includes a time stamp corresponding to the stay point and the stay point.
  • the positioning system 2 may include a base station, a Global Positioning System (GPS), a Beidou positioning system, and a Wireless Fidelity (WiFi) or Bluetooth based positioning system.
  • GPS Global Positioning System
  • Beidou positioning system Beidou positioning system
  • WiFi Wireless Fidelity
  • Bluetooth Bluetooth based positioning system.
  • the user commute trajectory management device 1 is configured to acquire travel data of each user through the positioning system, calculate a regularity trajectory of each user according to travel data of each user, and output a regular trajectory of each user to commute road management.
  • System 3 The commuter road management system 3 is arranged to manage commuting roads according to the user's regular trajectory.
  • the user commute trajectory management apparatus 1 may include an acquisition module 11 , a processing module 12 , and an output module 13 .
  • the obtaining module 11 is configured to acquire travel data of each user through a positioning system, wherein the travel data includes a time stamp corresponding to the stay point and the stay point.
  • the processing module 12 is arranged to calculate a user regularity trajectory for each user based on the travel data of each user.
  • the output module 13 is arranged to output a user trajectory of each user to the commuter road management system.
  • the processing module 12 in the foregoing embodiment may include: a commute feature calculation sub-module 121, a commute trajectory generation sub-module 122, a commuter cluster management sub-module 123, and a navigation path generation sub-module. 124 and the commuter regular track sub-module 125.
  • the commute feature calculation sub-module 121 is configured to calculate a commute feature based on travel data for each user, wherein the commute feature includes a home location, a work location, and a commute time period.
  • the commute trajectory generation sub-module 122 is configured to filter the travel data belonging to the commute time period according to the commute period of the commute feature, the stay point in the travel data, and the time stamp corresponding to the stay point, and select the stay point in the filtered travel data according to the time.
  • the stamp ordering generates a commute trajectory.
  • the commuter cluster management sub-module 123 is configured to calculate a trajectory similarity between each two commute trajectories of each user, and gather a commute trajectory with a trajectory similarity between each two commute trajectories of each user that is greater than a first threshold.
  • the class generates a cluster of clusters and determines the representative commuter trajectory of the cluster cluster.
  • the navigation path generation sub-module 124 is configured to select a frequent point of the commute trajectory from the stay points of all the commutation trajectories of the cluster cluster according to the selection strategy, and generate navigation according to the time position corresponding to the home location, the work position, the frequent points of the commute trajectory, and the frequent points of the commute trajectory. path.
  • the commute regular trajectory sub-module 125 is configured to calculate a trajectory similarity between the navigation path and the representative commute trajectory. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as a clustering cluster for commuting The regular trajectory generates a user regular trajectory according to the commuting regular trajectory of the cluster cluster.
  • the commute time period in the above embodiment may include a commute time and an off-duty commute period
  • the commute trajectory includes a commute trajectory and an off-duty commute trajectory
  • the navigation path generation sub-module 124 may be configured to: if the cluster cluster is from work
  • the commutation trajectory clustering generation generates a work navigation path starting from the user's home position, the work position as the end point, and the frequent frequent commute trajectory; and if the cluster cluster is generated by the commute trajectory clustering, the user is generated.
  • the working position is the starting point
  • the family position is the ending point
  • the commute regular trajectory sub-module 125 in the above embodiment may further be configured to trigger the navigation path generation sub-module to re-select the commute trajectory if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold. Frequently generating a new navigation path, calculating and determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than The second threshold is used, and the new navigation path is used as a commuting regular trajectory of the cluster cluster. If the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop execution is performed again. The step of selecting a frequent commute trajectory and generating a new navigation path until the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold.
  • the navigation path generation sub-module 124 in the above embodiment may further be configured to calculate a navigation estimation duration of the navigation path
  • the commute regular trajectory sub-module 125 may also be configured to determine between the navigation path and the representative commute trajectory. If the trajectory similarity is greater than the second threshold, determine whether the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commutation trajectory is greater than the second
  • the threshold value and the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory are smaller than the third threshold, and the navigation path is used as the commuting regular trajectory of the cluster cluster.
  • the commute regular trajectory sub-module 125 is configured to calculate the proportion of the number of commute trajectories of each cluster cluster, and the proportion of the commutation trajectories of the cluster clusters As the weight of the clustering commute regular trajectory, the set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory; and if the clustering cluster is one, the commuting regular trajectory sub-module 125 is set to cluster The regular trajectory of the commute of the cluster serves as the regular trajectory of the user.
  • the commute regular trajectory sub-module 125 is further configured to delete the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
  • the commuter cluster management sub-module 123 in the above embodiment is configured to calculate the sum of the trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster one by one, and maximize the sum of trajectory similarities.
  • the commuter trajectory serves as a commuter trajectory.
  • the processing module 12 in the above embodiment may further include a trajectory similarity calculation sub-module 126 configured to calculate the trajectory similarity between the trajectory A and the trajectory B as all locations of the trajectory A path.
  • the minimum distance from the point of the trajectory B path is less than the number a' of the fifth threshold; the trajectory similarity between the calculated trajectory B and the trajectory A is the smallest distance from the point of the trajectory A path in all the locations of the trajectory B path
  • the commute trajectory generating sub-module 121 in the foregoing embodiment may further be configured to calculate a trajectory abnormal coefficient by an outlier detection algorithm, and delete a commute whose trajectory abnormal coefficient is greater than a sixth threshold. Track.
  • the commute trajectory generation sub-module 121 in the foregoing embodiment may further be configured to filter out commute users whose home location and work location are identifiable from all users before generating the commute trajectory, and generate an identifiable commute. User's commute trajectory.
  • the commute trajectory generation sub-module 121 in the above embodiment is configured to calculate a discrete entropy of the travel location of each user according to the travel data of each user, and use the user whose travel location discrete entropy is less than the sixth threshold as the commute user. And obtaining and determining, according to the travel data of each commute user, a home location and a work location of each commute user, and deleting at least one unrecognizable commuter user of the home location and the work location.
  • FIG. 3 is a flowchart of a method for managing a user commute trajectory according to a second embodiment of the present disclosure.
  • step 310 the travel data of each user is obtained by the positioning system, wherein the travel data includes a time stamp corresponding to the stay point and the stay point.
  • step 320 the user's regularity trajectory for each user is calculated based on the travel data for each user.
  • step 330 the user's regular trajectory for each user is output to the commuter road management system.
  • calculating the user regularity trajectory of each user according to the travel data of each user in the above embodiment may include the following steps.
  • a commute feature is calculated based on the travel data, wherein the commute feature includes a home location, a work location, and a commute time period.
  • step 420 according to the commute time of the commute feature, the stay point in the travel data, and the timestamp corresponding to the stop point, the travel data belonging to the commute time period is filtered, and the stay points in the filtered travel data are sorted according to the time stamp. Commuter trajectory.
  • step 430 the trajectory similarity between each two commute trajectories of each user is calculated, the commute trajectory with the trajectory similarity greater than the first threshold is clustered to generate a cluster cluster, and the representative commute trajectory of the cluster cluster is determined.
  • step 440 the frequent points of the commute trajectory are selected from the stay points of all the commutation trajectories of the cluster cluster according to the selection strategy, and the navigation path is generated according to the time stamp corresponding to the home position, the working position, the frequent points of the commute trajectory and the frequent points of the commute trajectory.
  • step 450 the trajectory similarity between the navigation path and the representative commuter trajectory is calculated. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as the commuting regular trajectory of the cluster cluster. According to the regularity trajectory of the clustering cluster, the user's regular trajectory is generated.
  • the commute time period in the above embodiment includes commuting time and commuting to work.
  • the commute trajectory includes a commute trajectory and an off-duty commute trajectory
  • step 440 includes: if the cluster cluster is generated by the commuting trajectory of the commute, the generation starts from the user's home position, the working position is the end point, and the commute trajectory is frequently followed. The work navigation path; and if the cluster cluster is generated by the commute trajectory clustering, the work navigation route starting from the user's work position, the family position as the end point, and the frequent commute trajectory in turn is generated.
  • the step 450 in the above embodiment may further include: if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold, reselecting the frequent points of the commute trajectory and generating a new navigation path, and calculating Determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold. If the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, the new navigation path is used as a cluster cluster.
  • the commuting regularity trajectory if the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop performs the steps of reselecting the frequent points of the commute trajectory and generating a new navigation path until the new navigation path and The trajectory similarity between the representative commute trajectories is greater than the second threshold.
  • the method in the foregoing embodiment may further include: calculating a navigation estimation duration of the navigation path, and determining whether the navigation path is greater than whether the trajectory similarity between the navigation path and the representative commutation trajectory is greater than a second threshold. Whether the difference between the estimated duration and the duration of the commute trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, and the navigation estimation duration of the navigation path and the representative commute trajectory If the time difference is smaller than the third threshold, the navigation path is used as the commuting regular track of the cluster.
  • generating a user regular trajectory according to the commuting regular trajectory of the clustering cluster may include: calculating a proportion of the number of commuting trajectories of each cluster cluster, The number of commuting trajectories of the cluster clusters is the weight of the regular trajectory of the commuting clusters, and the set of commuting regular trajectories of each cluster cluster is taken as the regular trajectory of the user; and if the clustering cluster is one, according to The regularity trajectory of the commuting regularity trajectory of the clustering cluster includes the regular trajectory of the commuting of the clustering cluster as the regular trajectory of the user.
  • the number of commute trajectories of each cluster cluster is a ratio of the number of commute trajectories in each cluster cluster to the number of all commute trajectories.
  • the user commute trajectory management method may further include: deleting cluster clusters whose occupation trajectory number is smaller than a fourth threshold.
  • the representative commute trajectory for determining each cluster cluster in the above embodiment may be packaged. Including: calculating the sum of the trajectories similarity between each commute trajectory and other commuting trajectories in the cluster cluster one by one, and taking the commute trajectory with the largest sum of trajectories similarity as the representative commuter trajectory.
  • the calculation of the trajectory similarity between the trajectories in the above embodiment may include: calculating the trajectory similarity between the trajectory A and the trajectory B as the distance from the point of the trajectory B path among all the locations of the trajectory A path The minimum value is less than the number of the fifth threshold a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the locations of the trajectory B path And the trajectory similarity between the calculated trajectory A and the trajectory B is (a'+b')/(a+b), where a is the number of locations of the trajectory A pathway, and b is the number of locations of the trajectory A pathway, / Indicates the division operation.
  • the method in the foregoing embodiment may further include: calculating a trajectory abnormality coefficient by using an outlier detection algorithm, and deleting a commutation trajectory whose trajectory abnormal coefficient is greater than a sixth threshold.
  • the method in the foregoing embodiment may further include: selecting, from all users, a commute user that is identifiable by the home location and the work location, and generating a commensurable commute trajectory of the commute user.
  • the commuting user that can identify the home location and the work location from all the users in the foregoing embodiment may include: calculating the discrete entropy of each user's travel location according to the travel data of each user, and traveling a user whose location discrete entropy is less than a sixth threshold is used as a commute user; and obtains and identifies a home location and a work location of each commute user according to travel data of each commute user, and if not recognized, deletes the home location and the work At least one unrecognizable commuter user in the location.
  • the relevant commuter survey method is to use the questionnaire. This method has poor timeliness, insufficient sample size, and poor commutation statistics.
  • This embodiment provides a user commute trajectory management system, which may include: a storage device (or a distributed storage device), a collection device, a positioning system, a user commute trajectory management device provided by the present disclosure, and a commuter road management system.
  • the obtaining of the commute information by the positioning system involved in the present application may include: statistic of commuting characteristics, such as home position, working position, commute duration, and radius of gyration, by means of GPS, Beidou positioning system or mobile network positioning data; Card (Integrated Circuit Card, IC)
  • the data obtains the commuter travel characteristic information of the residents, such as commuter travel time, commuter travel distance and transfer characteristics, etc.; or, obtains commute information by monitoring the live broadcast of the camera at the early peak and the late peak hours.
  • the collecting device is configured to collect GPS data, data of a Beidou positioning system, mobile network location signaling data, or call bill data, and extract a user identifier, a time stamp, and a location field.
  • the location field may be longitude and latitude.
  • the location field may be a location area number and a base station cell number.
  • Each piece of position data is the place or stop point in the trip, collectively referred to as the stop point.
  • the data collected by the collection device can be pre-processed, including de-privacy processing, and the privacy-removal process can replace the user identifier with a meaningless identifier.
  • base stations with close spherical distances can also be combined by base station clustering.
  • the storage device (or distributed storage device) can store a variety of data, including data and calculation results of the collection device, the user commute trajectory management device, and the commuter road management system.
  • the user commute trajectory management device can calculate the discrete entropy of each user's travel location based on the travel data of each user.
  • Entropy is a measure of chaos and disorder. The smaller the entropy (closer to 0), the higher the stability and the stronger the predictability.
  • the travel location discrete entropy is the discrete optional entropy of the user's travel location. The discrete entropy of the travel location can be analyzed from the time and location data of each user's travel, and the discrete entropy of the travel location per hour and 7 days per user is calculated, and the calculation result is saved in the storage device.
  • the commute users proposed by the present disclosure may be typical office workers with small discrete entropy during office hours and nighttime sleep periods on weekdays, such users typically having stable home locations and work locations. Calculate the regular commute trajectory of such users, and calculate the commute characteristics of each commute user for each commute user.
  • the commute features may include family location, work location, commute time and working hours, commute time and off-duty time. The calculation result is saved to the storage device.
  • the travel data of each commute user from the family location to the work location during the commute time of the working day is extracted, and each commuter user is extracted from the working position during the commute time of the working day.
  • Travel data for home location Time-stamped travel data for the same user can form a sequence of commute travel trajectories.
  • the user commute trajectory management device can process the commute travel trajectory sequence of each commute user, wherein the user commute trajectory management device can separately process the commute commute data and the commute commute data.
  • the trajectory abnormality coefficient is calculated by the outlier detection algorithm, and the commutation trajectory whose trajectory abnormal coefficient is greater than the sixth threshold is deleted.
  • the trajectory anomaly coefficient indicates the difference between the trajectory and other travel trajectory data of the user.
  • the commute trajectory with the trajectory anomaly coefficient greater than the sixth threshold may be identified, and the identified commute trajectory may be regarded as an irregular abnormal trajectory, and the irregularity abnormal trajectory is discarded in the preprocessing of the commuter trajectory clustering.
  • the commute trajectory from which the irregularity trajectory is removed is recorded as set A, and the trajectory similarity between each two commute trajectories in the set A is calculated, and the commute trajectory with the trajectory similarity between each two commute trajectories being greater than the first threshold is performed.
  • One or more cluster clusters Ci form a merged set C, and calculate a frequent point of the commute trajectory of each cluster cluster Ci in each of C, and each commute trajectory is frequently accompanied by a commute trajectory number and an hour tag of the frequent trajectory of the commute trajectory, and Calculate the proportion of the number of commute trajectories for each cluster cluster.
  • For each commute track tr in each cluster cluster Ci calculate the sum of the trajectory similarities between the commute trajectories other than the commute trajectory tr in Ci, tr_dist, and set the commute trajectory with the largest tr_dist as the cluster cluster Ci's representative commuter track Cij.
  • the frequent points of the commute trajectory may be a position that the user frequently passes and stays in the commute trajectory, and the frequent selection strategy of the commute trajectory may be that the number of frequent points is equal to the average number of stay points of each commute trajectory in the cluster cluster.
  • the value, the frequent point may also be a stay point where the number of commute tracks accounts for more than 0.2.
  • the hour tag of the frequent commute track may be the time stamp of the commute track at the stay point, the hour tag may be accurate to the hour, and if there are multiple timestamps, the average is averaged and the average is accurate to the hour.
  • a navigation path Di is generated which takes the family position as the starting point and the working position as the end point, and sequentially passes through each commute trajectory frequently.
  • the GPS coordinates of the stay point of the navigation path Di and the navigation estimation duration of the navigation path can be extracted, the trajectory similarity between the representative commute trajectory Cij of the cluster cluster Ci and the corresponding navigation path Di, and the duration of the commute trajectory Cij can be calculated.
  • the navigation path Di and the navigation estimation duration can be obtained through a path planning and navigation function application programming interface (API) provided by the location service provider's map open platform.
  • API application programming interface
  • the duration representing the commute trajectory Cij is the time difference between the start time stamp and the end point time stamp representing the commute trajectory Cij. If the trajectory similarity between the commute trajectory Cij and the navigation path Di is greater than the second threshold, and the difference between the duration of the commute trajectory Cij and the navigation estimation duration of the navigation path Di is less than the third threshold, the navigation may be The path Di serves as a commute regular trajectory for the user.
  • the second threshold and the third threshold may be set according to the accuracy requirements of the user or the developer. Because there may be multiple clusters in the merged set C, there may be multiple navigation paths, and there are multiple commuter regular trajectories.
  • Extract the GPS coordinates of the navigation path Di route point where, extract The number of the navigation path Di route points may be an integer part of the average value of the number of track stay points in the cluster cluster Ci, and the extracted navigation path Di route point is a point with a certain interval between the frequent points of the commute track.
  • the regularity of commuting can also be determined by comprehensive similarity.
  • the comprehensive similarity F f (x, y), x is the trajectory similarity, y is the difference of the commute duration, and f (x, y) is a function of the two variables x and y.
  • the weight of the influence of x on F is greater than the influence of y on F.
  • F is in the threshold range, it indicates that the merged track and the merged track have a high degree of coincidence of the navigation path, and the navigation path is a regular track of the user. If F is not within the threshold range, it indicates that the navigation path corresponding to the merged track and the merged track has a large difference, and the frequent points of the commutation trajectory of the cluster cluster can be re-selected and a new navigation path is generated, and F is recalculated. If the number of commuting trajectories of the cluster cluster is too small (for example, less than 0.1), the cluster cluster trajectory may be discarded.
  • the commute regular trajectory of the obtained cluster cluster is the user's regular trajectory of the user. If there are multiple clusters in the merged set C, calculate the proportion of the number of commuting trajectories of each cluster cluster, and the proportion of the number of commuting trajectories of the cluster clusters is used as the weight of the commuting regular trajectory of the cluster cluster, and each will be The set of commuting regular trajectories of the clustering clusters serves as the user's regular trajectory.
  • the present disclosure also proposes a user commute trajectory management method.
  • step 510 data preprocessing is performed.
  • Data pre-processing can be to process the collected raw user travel location data into a specific format.
  • the attribute data including the anonymously processed user identifier, the user location (the GPS coordinates of the base station cell number to be converted into the base station location), and the talk time may be extracted from the original data.
  • the original call data is usually massive and redundant.
  • preprocessing and filtering out certain data the amount of data can be reduced and the efficiency of subsequent processing can be improved.
  • base stations with very close spherical distances can be combined, and smooth handover can be used to suppress frequent handover of base stations, reduce data volume, and enhance data validity.
  • step 520 a user travel stability feature is calculated.
  • the discrete entropy of each user's travel location may be calculated according to the travel data of each user, and the user's travel stability is measured by using discrete entropy.
  • the discrete entropy of the travel location is defined as follows:
  • R i is the empirical probability of the user at the location R i
  • R i may be the base station number. The greater the discrete entropy of the travel location, the lower the regularity of the motion of the identified user. Calculate the discrete entropy of the user's travel location on weekdays, and calculate the discrete entropy of each user's travel location on weekdays by hour.
  • step 530 the city commute time period is extracted.
  • Pattern[24][7] describes the characteristics of the user's travel location
  • Pattern[24][7] indicates the most frequent stopover time of 24 hours per day per week. It is called an array of hourly stops.
  • the most frequent stopover location can be the location with the longest stay time during that time period.
  • the two-dimensional array Pattern_go[24][7] is used to describe the change of the user's stop location.
  • the hourly element value is 0, and the current hour is different from the previous hour.
  • the hour element value is 1.
  • Analyze the two-dimensional array Pattern_go[24][7] to find out which days of the week the workday is, and to know the commute time, that is, the morning peak and the late peak commute travel time.
  • the statistical value distribution of the 24 hours of working days in Pattern_go[24][7] presents a double peak, and the double peak periods correspond to the morning peak and the late peak commuting time respectively.
  • the morning and evening peak commute times are the city commuting hours.
  • step 540 the commuter user is screened.
  • a user who can define the travel date of the workday with a discrete entropy less than a certain threshold is a commute user.
  • step 550 the user commute feature is extracted.
  • the user commute feature description can be a four-tuple of ⁇ home location, work location, time taken to work, time taken to work ⁇ .
  • the OD Olin-Destination
  • the commute OD between the home location and the work location is a travel mode.
  • the base stations that are close to each other are merged to form a new call location point; according to the call periodicity, the call frequency of each location point is calculated; according to the frequency of the call, the location points are filtered to delete the location where the call is sparse; And the location point where the call frequency is the most frequent among the Tnight data is D and O, that is, the work position and the home position; and the commute OD of each user is output as the home position and the work position in the user commute feature.
  • Call frequency refers to the percentage of calls that the user has made at that location as a percentage of the total number of calls for that user.
  • the time taken to work on the road may refer to the average value of the time taken by the commuter user from the home position to the work position during the commute time on the working day from the pre-processed data; the time taken after the work is taken from the pre-processed The data is used to calculate the average time taken by the commuter user from the working position to the home position during the commute time of the working day.
  • step 560 an irregularity trajectory is detected.
  • t is the trajectory to be tested
  • T is the set of trajectories to be compared
  • m is the number of runs
  • is the number of samples per run
  • array n[m] is used to count the value obtained for each run
  • array n[m] is initialized. Is 0.
  • H(i) ln(i)+0.57721566, where 0.57721566 is the Euler constant and / represents the division operation.
  • S is also referred to as a trajectory anomaly coefficient, wherein the commute trajectory in which the trajectory anomaly coefficient S exceeds the sixth threshold is an abnormal trajectory, and the abnormal trajectory can be deleted.
  • step 570 commutation trajectory clustering is performed.
  • the commute trajectory except the regular trajectory obtained by the above-mentioned irregular trajectory detection is recorded as set A, and the trajectory similarity between each two commute trajectories in the set A is calculated and the trajectories with high trajectory similarity are combined and merged.
  • Set C calculates the frequent points of the commute trajectory of each cluster cluster in C and the number of commute trajectories of each cluster cluster.
  • the algorithm description for calculating the trajectory similarity in the commute trajectory to work includes:
  • the trajectory similarity between the calculated trajectory A and the trajectory B is: (Tdis(A, B, limit_len) + Tdis(B, A, limit_len))/(a+b), and the trajectory similarity ranges from 0. Between 1 and 1. In extreme cases, the A and B tracks are coincident or very similar, and the trajectory similarity is 1.
  • a is the number of trajectory A passing locations (including non-adjacent identical locations)
  • b is the number of trajectory B passing locations (including non-adjacent identical locations)
  • limit_len is the distance threshold, and / represents the division operation.
  • the trajectory A is recorded as ⁇ a1, a2, a3, a4>, passing 4 locations along the way;
  • the trajectory B is recorded as ⁇ b1, b2, b3, b4, b5>, passing 5 locations along the way ;limit_len is set to 2.
  • step 580 the commute trajectory map is matched.
  • the navigation path is generated, the GPS coordinates of the navigation path approach point are extracted, and the navigation estimation duration of the navigation path is calculated, and the cluster cluster is calculated.
  • Sexual trajectory representing the trajectory similarity between the commute trajectory and the navigation path, and the difference between the duration representing the commute trajectory and the navigation estimation duration of the navigation path does not satisfy the above condition, and the C clustering in step 570 can be reselected.
  • the commuter trajectory of the cluster is frequent. If the number of commuting trajectories of the cluster cluster is too small (for example, less than 0.1), the cluster cluster trajectory may be discarded.
  • step 590 a user regular commute trajectory is generated.
  • the commute regular trajectory of the obtained cluster cluster may be the user's regular trajectory of the user. If there are multiple cluster clusters in the merged set C, calculate the proportion of the number of commuting trajectories of each cluster cluster, and the proportion of the number of commuting trajectories of the cluster clusters as the weight of the commuting regular trajectory of the cluster clusters, The set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory.
  • the solution is: the traffic of several common lines going to work in the commute users in the A location and the working location in the B location. how many.
  • the embodiment provides a solution for solving the problem in the scenario 1, and the solution includes: calculating a commute feature of the user, screening a user of the commute user whose work position is A, and working position B, and calculating The number of users; the commute trajectory of each user in the collection user is extracted, the commute trajectory is an ordered sequence of the base station of the working route, and the GPS coordinates of the user location are obtained by the mapping relationship between the cell number of the base station and the GPS coordinates of the base station location; Each user in the user uses the outlier detection algorithm to detect the irregular commute trajectory and eliminates it; for each user Ui in the set users, the trajectory clustering and map matching are performed on the commute trajectory except the regular commute trajectory.
  • the set of the regular trajectories Tra_i is recorded as Tra_users; according to the above method, trajectory clustering and map matching are performed on the Tra_users trajectory set to obtain weighted
  • One or more commute regular trajectories Tra_users_j the weight of the commute regular trajectory Tra_users_j is the sum of the weights of the commuting regular trajectories in the corresponding cluster clusters
  • the set of commuting regular trajectories Tra_users_j is the line set CC from A to B
  • Each line CCi in the CC is Tra_users_j, and the traffic of each line is the weight of Tra_users_j.
  • the GPS data of a large number of vehicles is collected to solve the problem: the number of vehicles passing through a section r to work, the proportion of traffic per hour, extracting user information and regular commute information, such as the distribution of family locations, working position What is the distribution, and what is the habitual route of commuting.
  • r may be a highway, or it may be a bridge or other connected section.
  • This embodiment provides a solution for solving the problem in the scenario 2, the solution includes: screening the road segment set R related to the road segment r, calculating the latitude and longitude range of the R Region_R; calculating the user commute feature; extracting the user commute trajectory, each commute trajectory Corresponding to one hour time label, the user trajectory containing the Region_R and the user in the commute trajectory are filtered, and the user trajectory set is recorded as T_R, and the user set is recorded as Users. For each user Ui in Users, the eligible commuting trajectory is recorded. The weight Ui_num is not followed by the users in Users. All the commute tracks of each user Ui in the user set Users are recorded as T; after counting, the number of vehicles per hour can be obtained; and each user Ui in Users is extracted.
  • the commute feature can derive specific user information; obtain regular commute information from the user.
  • the obtaining the regular commute information of the user may include: detecting, for each Ui, an irregular commute trajectory of the user Ui for the user set User and the commute trajectory T; and clustering clusters and map matching of the user's commute trajectory, Get one or more commuter regular trajectories with weights from the user.
  • the present disclosure provides a user commute trajectory management method, which acquires travel data of a user through a positioning system, such as a positioning device or a communication base station on a user terminal, calculates a user regular trajectory according to user travel data, and outputs, in the process, Without user research, the management of user commuting is enhanced, and the user's commuting can be more fully understood and controlled, improving the user experience.
  • a positioning system such as a positioning device or a communication base station on a user terminal
  • the disclosure also collects the user travel location data, extracts the user commute trajectory, detects the irregular commute trajectory by the outlier detection algorithm, and performs the clustering and map matching on the commute trajectory of the regular commute trajectory, thereby realizing the user regularity.
  • the calculation system and method of the trajectory is beneficial to the fine management of the commuter road section, and is convenient for grasping some key traffic information, such as the load level of each road section during the peak of the commute, such as one road section closure, regulation or restriction, etc. How many citizens travel, and the distribution of these citizens.
  • Embodiments of the present disclosure also provide a non-transitory computer readable storage medium storing computer executable instructions arranged to perform the method of any of the above embodiments.
  • the embodiment of the present disclosure further provides a hardware structure diagram of an electronic device.
  • the electronic device includes:
  • At least one processor 70 which is exemplified by a processor 70 in FIG. 7; and a memory 71, may further include a communication interface 72 and a bus 73.
  • the processor 70, the communication interface 72, and the memory 71 can complete communication with each other through the bus 73.
  • Communication interface 72 can be used for information transfer.
  • Processor 30 may invoke logic instructions in memory 31 to perform the methods of the above-described embodiments.
  • logic instructions in the memory 71 described above may be implemented in the form of a software functional unit and sold or used as a stand-alone product, and may be stored in a computer readable storage medium.
  • the memory 71 is a computer readable storage medium and can be used to store a software program, a computer executable program, a program instruction or a module corresponding to the method in the embodiment of the present disclosure.
  • the processor 70 executes the functional application and the data processing by executing a software program, an instruction or a module stored in the memory 71, that is, implementing the method in the above method embodiment.
  • the memory 71 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the terminal device, and the like. Further, the memory 71 may include a high speed random access memory, and may also include a nonvolatile memory.
  • the technical solution of the present disclosure may be embodied in the form of a software product stored in a storage medium, including one or more instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) Performing all or part of the steps of the method of the embodiments of the present disclosure.
  • the foregoing storage medium may be a non-transitory storage medium, including: a USB flash drive, a mobile hard disk, a read-only memory (ROM), and a random storage memory (Random-Access).
  • a medium that can store program code such as a memory, a RAM, or an optical disk, or a transient storage medium.
  • the user commute trajectory management method, device and system provided by the disclosure obtain the travel data of the user through the positioning system, calculate the regular trajectory of the user according to the travel data of the user, and output, without user research, and enhance the management of the user's commuting, A more comprehensive understanding and control of user commuting has improved the user experience.

Abstract

A method, a device and a system for user commuter track management, the method comprising: acquiring, by means of a positioning system, travel data of each user , the travel data including a stay point and a time stamp corresponding to the stay point (310); calculating, according to the travel data of each user, a user regularity track of each user (320); and outputting the user regularity track of each user to a commuter road management system (330).

Description

用户通勤轨迹管理方法、装置及系统User commute trajectory management method, device and system 技术领域Technical field
本公开涉及用户通勤领域,例如涉及一种用户通勤轨迹管理方法、装置及系统。The present disclosure relates to the field of user commuting, for example, to a user commute trajectory management method, apparatus, and system.
背景技术Background technique
上下班通勤是城市交通的重点,为了提高用户上下班的交通体验,提出了通勤交通这一概念,通勤交通连接家庭位置与工作位置,规律性强。相关城市居民通勤交通的出行规律通常采用宏观性的调查和研究,如采用调查问卷,相关城市居民通勤交通的出行规律时效性差,样本量不足,不能对用户通勤进行全面了解和掌控。Commuting commuting is the focus of urban transportation. In order to improve the traffic experience of users commuting to work, the concept of commuter transportation is proposed. Commuting traffic connects family position and working position with strong regularity. The travel rules of commuting traffic of relevant urban residents usually adopt macroscopic surveys and research. For example, the questionnaires are used, and the travel rules of commuting traffic of relevant urban residents are not timely, the sample size is insufficient, and the user's commuting cannot be fully understood and controlled.
因此,本领域技术人员亟待提供一种用户通勤轨迹管理方法,以解决用户出行规律采用宏观调查和研究导致不能对用户通勤进行全面了解和掌控的技术问题。Therefore, those skilled in the art urgently need to provide a user commute trajectory management method to solve the technical problem that the user travel rules can not fully understand and control the user's commuting by using macro investigation and research.
发明内容Summary of the invention
本公开提供了一种用户通勤轨迹管理方法、装置及系统,可以对用户通勤进行更全面了解和掌控。The present disclosure provides a user commute trajectory management method, device and system, which can fully understand and control user commuting.
本公开提供了一种用户通勤轨迹管理方法,包括:The present disclosure provides a user commute trajectory management method, including:
通过定位系统获取每个用户的出行数据,其中,出行数据包括停留点及停留点对应的时间戳;Obtaining travel data of each user by using a positioning system, wherein the travel data includes a time stamp corresponding to the stay point and the stay point;
根据每个用户的出行数据计算每个用户的用户规律性轨迹;以及Calculate the user's regular trajectory for each user based on the travel data of each user;
输出每个用户的用户规律性轨迹至通勤道路管理系统。The regularity track of each user's user is output to the commuter road management system.
可选的,根据每个用户的出行数据计算每个用户的用户规律性轨迹包括:Optionally, calculating a regularity track of each user according to each user's travel data includes:
根据每个用户的出行数据计算通勤特征,通勤特征包括家庭位置、工作位置以及通勤时段;The commute feature is calculated based on the travel data of each user, and the commute features include a home location, a work location, and a commute time period;
根据通勤特征的通勤时段、出行数据中的停留点及停留点对应的时间戳, 筛选得到属于通勤时段的出行数据,将筛选得到的出行数据中的停留点按照时间戳排序生成通勤轨迹;According to the commute time of the commute feature, the stay point in the travel data, and the time stamp corresponding to the stop point, The travel data belonging to the commute time period is filtered, and the stay points in the filtered travel data are sorted by time stamp to generate a commute trajectory;
计算每个用户的每两个通勤轨迹之间的轨迹相似度,将每个用户的每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定每个聚类簇的代表通勤轨迹;Calculating a trajectory similarity between each two commute trajectories of each user, and clustering the commute trajectories of each user's two commute trajectories with a trajectory similarity greater than the first threshold to generate a cluster cluster, and determining each Representative clustering trajectories;
根据选择策略从每个聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据家庭位置、工作位置、通勤轨迹频繁点及通勤轨迹频繁点对应的时间戳生成导航路径;以及Selecting a frequent frequency of the commute trajectory from the stay points of all the commutation trajectories of each cluster cluster according to the selection strategy, and generating a navigation path according to the time stamp corresponding to the home location, the working position, the frequency of the commute trajectory, and the frequent points of the commute trajectory;
计算导航路径与代表通勤轨迹之间的轨迹相似度,若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,则将导航路径作为聚类簇的通勤规律性轨迹,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹。Calculating the trajectory similarity between the navigation path and the representative commute trajectory. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as the commuting regular trajectory of the cluster cluster, according to the cluster cluster The regular trajectory of commuting generates a regular trajectory of the user.
可选的,通勤时段包括上班通勤时段及下班通勤时段,通勤轨迹包括上班通勤轨迹及下班通勤轨迹,根据家庭位置、工作位置、通勤轨迹频繁点及通勤轨迹频繁点对应的时间戳生成导航路径包括:若聚类簇由上班通勤轨迹聚类生成,则生成以用户的家庭位置为起点、工作位置为终点以及依次通过通勤轨迹频繁点的上班导航路径;以及若聚类簇由下班通勤轨迹聚类生成,则生成以用户的工作位置为起点、家庭位置为终点以及依次通过通勤轨迹频繁点的下班导航路径。Optionally, the commute time includes a commute time and a commute time, and the commute trajectory includes a commute trajectory and a commute trajectory, and the navigation path is generated according to a time stamp corresponding to the home location, the work location, the frequent commute trajectory, and the frequent commute trajectory. : If the cluster cluster is generated by the commute trajectory clustering, the work navigation route starting from the user's home position, the work position as the end point, and the frequent frequent commute trajectory is generated; and if the cluster cluster is clustered by the commute trajectory The generation generates a work-by-work navigation path starting from the user's work position, the home position as the end point, and frequent frequent commute trajectories.
可选的,所述方法还包括:若导航路径与代表通勤轨迹之间的轨迹相似度小于第二阈值,则重新选择通勤轨迹频繁点并生成新导航路径,计算并判断新导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将新导航路径作为聚类簇的通勤规律性轨迹,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值。Optionally, the method further includes: if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold, reselecting the frequent trajectory trajectory and generating a new navigation path, calculating and determining the new navigation path and representing the commute Whether the trajectory similarity between the trajectories is greater than the second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, the new navigation path is used as the commuting rule of the cluster cluster a trajectory, if the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, performing a step of reselecting frequent points of the commute trajectory and generating a new navigation path until the new navigation path The trajectory similarity to the representative commute trajectory is greater than the second threshold.
可选的,所述还包括:计算导航路径的导航预估时长,在判断导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值时,判断导航路径的导航预估时长与代表通勤轨迹的时长差值是否小于第三阈值;以及若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,且导航路径的导航预估时长与代表 通勤轨迹的时长差值小于第三阈值,则将导航路径作为聚类簇的通勤规律性轨迹。Optionally, the method further includes: calculating a navigation estimation duration of the navigation path, and determining whether the navigation estimation duration of the navigation path and the representative commute are performed when determining whether the trajectory similarity between the navigation path and the representative commutation trajectory is greater than a second threshold. Whether the difference of the duration of the trajectory is smaller than the third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, and the navigation estimation duration and representative of the navigation path If the duration difference of the commute trajectory is less than the third threshold, the navigation path is taken as the commuting regular trajectory of the cluster cluster.
可选的,若聚类簇为多个,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹包括:计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若聚类簇为一个,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹包括:将聚类簇的通勤规律性轨迹作为用户规律性轨迹。Optionally, if the clustering cluster is multiple, generating a user regular trajectory according to the commuting regular trajectory of the clustering cluster comprises: calculating a proportion of the number of commuting trajectories of each cluster cluster, and occupying a number of commuting trajectories of the cluster cluster Compared with the weight of the commuting regular trajectory as the clustering cluster, the set of commuting regular trajectories of each clustering cluster is taken as the user regular trajectory; and if the clustering cluster is one, the commuting regular trajectory according to the clustering cluster Generating the user's regular trajectory includes: using the commuting regular trajectory of the cluster cluster as the user's regular trajectory.
可选的,若聚类簇为多个,方法还包括:删除通勤轨迹数量占比小于第四阈值的聚类簇。Optionally, if the clustering cluster is multiple, the method further includes: deleting the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
可选的,确定每个聚类簇的代表通勤轨迹包括:逐一计算聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将该轨迹相似度之和最大的通勤轨迹作为代表通勤轨迹。Optionally, determining a representative commute trajectory of each cluster cluster includes: calculating, by one by one, a sum of trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster, and a commuting trajectory having the largest sum of similar trajectories As a representative commuter track.
可选的,轨迹之间的轨迹相似度的计算包括:计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。Optionally, the calculation of the trajectory similarity between the trajectories comprises: calculating the trajectory similarity between the trajectory A and the trajectory B into all the locations of the trajectory A path, and the minimum distance from the point of the trajectory B path is less than the fifth threshold The number a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the points of the trajectory B path; and the calculated trajectory A and the trajectory B The trajectory similarity between (a'+b')/(a+b), a is the number of locations of the trajectory A pathway, b is the number of locations of the trajectory A pathway, and / represents the division operation.
可选的,在生成通勤轨迹之后,所述方法还包括:通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。Optionally, after generating the commutation trajectory, the method further includes: calculating, by using an outlier detection algorithm, a trajectory anomaly coefficient, and deleting a commutation trajectory whose trajectory anomaly coefficient is greater than a sixth threshold.
可选的,在生成通勤轨迹之前,所述方法还包括:从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户,生成可识别的通勤用户的通勤轨迹。Optionally, before the generating the commuter trajectory, the method further includes: selecting, from all the users, the commute users that are identifiable by the home location and the work location, and generating the traversable trajectory of the identifiable commute user.
可选的,从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户包括:根据每个用户的出行数据计算每个用户的出行地点离散熵,将出行地点离散熵小于第七阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别每个通勤用户的家庭位置和工作位置,删除家庭位置和工作位置中至少一个不可识别的通勤用户。Optionally, the commuting user that can be identified from all the users by the home location and the work location includes: calculating a discrete entropy of the travel location of each user according to the travel data of each user, and the discrete entropy of the travel location is less than a seventh threshold. The user acts as a commute user; and obtains and according to the travel data of each commute user, identifies the home location and work location of each commute user, and deletes at least one unrecognizable commuter user in the home location and the work location.
本公开提供了一种用户通勤轨迹管理装置,包括: The present disclosure provides a user commute trajectory management apparatus, including:
获取模块,设置为通过定位系统获取每个用户的出行数据,其中,出行数据包括停留点及停留点对应的时间戳;Obtaining a module, configured to obtain, by using a positioning system, travel data of each user, where the travel data includes a time stamp corresponding to the stay point and the stay point;
处理模块,设置为根据每个用户的出行数据计算每个用户的用户规律性轨迹;以及a processing module configured to calculate a regularity trajectory of each user according to each user's travel data;
输出模块,设置为输出每个用户的用户规律性轨迹至通勤道路管理系统。The output module is configured to output a regular trajectory of each user to the commuter road management system.
可选的,处理模块包括:Optionally, the processing module includes:
通勤特征计算子模块,设置为根据每个用户的出行数据计算通勤特征,通勤特征包括家庭位置、工作位置以及通勤时段;The commute feature calculation sub-module is configured to calculate a commute feature according to each user's travel data, and the commute feature includes a home location, a work location, and a commute time period;
通勤轨迹生成子模块,设置为根据通勤特征的通勤时段、出行数据中的停留点及停留点对应的时间戳,筛选得到属于通勤时段的出行数据,将筛选得到的出行数据中的停留点按照时间戳排序生成通勤轨迹;The commute trajectory generation sub-module is configured to filter the travel data belonging to the commute time period according to the commute time of the commute feature, the stay point in the travel data, and the time stamp corresponding to the stop point, and select the stay point in the filtered travel data according to the time. Poke sorting generates a commute trajectory;
通勤聚类管理子模块,设置为计算每个用户的每两个通勤轨迹之间的轨迹相似度,将每个用户的每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定每个聚类簇的代表通勤轨迹;The commuter cluster management sub-module is configured to calculate a trajectory similarity between each two commute trajectories of each user, and a commute trajectory of each user's two commute trajectories with a trajectory similarity greater than a first threshold The class generates a cluster of clusters and determines a representative commute trajectory for each cluster of clusters;
导航路径生成子模块,设置为根据选择策略从每个聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据家庭位置、工作位置、通勤轨迹频繁点及通勤轨迹频繁点对应的时间戳生成导航路径;The navigation path generation sub-module is configured to select a frequent commutation trajectory from the stay points of all the commutation trajectories of each cluster cluster according to the selection strategy, according to the home location, the work location, the frequent points of the commute trajectory, and the time stamp corresponding to the frequent points of the commute trajectory Generate a navigation path;
通勤规律轨迹子模块,设置为计算导航路径与代表通勤轨迹之间的轨迹相似度,若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,则将导航路径作为聚类簇的通勤规律性轨迹,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹。The commute regular track sub-module is set to calculate the trajectory similarity between the navigation path and the representative commute trajectory. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as the clustering group for commuting The regular trajectory generates a user regular trajectory according to the commuting regular trajectory of the cluster cluster.
可选的,通勤时段包括上班通勤时段及下班通勤时段,通勤轨迹包括上班通勤轨迹及下班通勤轨迹,导航路径生成子模块设置为,若聚类簇由上班通勤轨迹聚类生成,则生成以用户的家庭位置为起点、工作位置为终点以及依次通过通勤轨迹频繁点的上班导航路径;以及若聚类簇由下班通勤轨迹聚类生成,则生成以用户的工作位置为起点、家庭位置为终点以及依次通过通勤轨迹频繁点的下班导航路径。Optionally, the commute time includes a commute time and an off-duty commute time, the commute trajectory includes a commute trajectory and an off-duty commute trajectory, and the navigation path generation sub-module is set to generate a user if the cluster cluster is generated by the commute trajectory of the commute The home position is the starting point, the working position is the ending point, and the commuting navigation path frequently passing through the commute trajectory; and if the clustering cluster is generated by the commuting trajectory trajectory clustering, the generation starts from the user's working position, the family position is the ending point, and The commute navigation path that frequently passes through the commute trajectory in turn.
可选的,通勤规律轨迹子模块还设置为若导航路径与代表通勤轨迹之间的轨迹相似度小于第二阈值,则触发导航路径生成子模块重新选择通勤轨迹频繁 点并生成新导航路径,计算并判断新导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将新导航路径作为聚类簇的通勤规律性轨迹,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值。Optionally, the commute regular trajectory sub-module is further configured to: if the trajectory similarity between the navigation path and the representative commute trajectory is less than a second threshold, triggering the navigation path generating sub-module to re-select the commuting trajectory frequently Pointing and generating a new navigation path, calculating and determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the The second threshold is used as a commuting regular trajectory of the cluster cluster. If the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop performs reselection. The step of frequently commuting the trajectory and generating a new navigation path until the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold.
可选的,导航路径生成子模块还设置为计算导航路径的导航预估时长,通勤规律轨迹子模块还设置为在判断导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值时,判断导航路径的导航预估时长与代表通勤轨迹的时长差值是否小于第三阈值;以及若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值、且导航路径的导航预估时长与代表通勤轨迹的时长差值小于第三阈值,则将导航路径作为聚类簇的通勤规律性轨迹。Optionally, the navigation path generation sub-module is further configured to calculate a navigation estimation duration of the navigation path, and the commute regular trajectory sub-module is further configured to: when determining whether the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, Determining whether the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, and the navigation estimation duration of the navigation path is If the duration difference representing the commute trajectory is less than the third threshold, the navigation path is used as the commuting regular trajectory of the cluster cluster.
可选的,若聚类簇为多个,通勤规律轨迹子模块设置为计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若聚类簇为一个,通勤规律轨迹子模块设置为将聚类簇的通勤规律性轨迹作为用户规律性轨迹。Optionally, if there are multiple clustering clusters, the commute regular trajectory sub-module is set to calculate the proportion of commuting trajectories of each cluster cluster, and the proportion of the commuting trajectories of the cluster clusters is used as the commuting regularity of the cluster clusters. The weight of the trajectory, the set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory; and if the cluster cluster is one, the commute regular trajectory sub-module is set to use the commuting regular trajectory of the cluster cluster as the user Regular trajectory.
可选的,若聚类簇为多个,通勤规律轨迹子模块还设置为删除通勤轨迹数量占比小于第四阈值的聚类簇。Optionally, if the clustering cluster is multiple, the commuting regular trajectory sub-module is further configured to delete the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
可选的,通勤聚类管理子模块设置为逐一计算聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将该轨迹相似度之和最大的通勤轨迹作为代表通勤轨迹。Optionally, the commuter cluster management sub-module is configured to calculate the sum of the trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster one by one, and the commute trajectory with the largest sum of trajectories is used as the representative commuter trajectory .
可选的,处理模块还包括轨迹相似度计算子模块,设置为计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。Optionally, the processing module further includes a trajectory similarity calculation sub-module, configured to calculate that the trajectory similarity between the trajectory A and the trajectory B is the trajectory A path, and the minimum distance from the point of the trajectory B path is less than the fifth threshold. The number a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the locations of the trajectory B path; and the calculated trajectory A and The trajectory similarity between the trajectories B is (a'+b')/(a+b), a is the number of locations of the trajectory A pathway, b is the number of locations of the trajectory A pathway, and / represents a division operation.
可选的,通勤轨迹生成子模块在生成通勤轨迹之后,还设置为通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。 Optionally, after the commute trajectory is generated, the commute trajectory generating sub-module is further configured to calculate a trajectory abnormality coefficient by using an outlier detection algorithm, and delete a commute trajectory whose trajectory abnormal coefficient is greater than a sixth threshold.
可选的,通勤轨迹生成子模块在生成通勤轨迹之前,还设置为从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户,生成可识别的通勤用户的通勤轨迹。Optionally, before the commute trajectory is generated, the commute trajectory generating sub-module is further configured to screen the commute users that are identifiable from the home location and the working location from all the users, and generate a commensurate commute trajectory of the commute user.
可选的,通勤轨迹生成子模块设置为根据每个用户的出行数据计算每个用户的出行地点离散熵,将出行地点离散熵小于第七阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别每个通勤用户的家庭位置和工作位置,删除所述家庭位置和所述工作位置中至少一个不可识别的通勤用户。Optionally, the commute trajectory generation sub-module is configured to calculate a discrete entropy of each user's travel location according to the travel data of each user, and use a user whose travel location discrete entropy is less than a seventh threshold as a commute user; and obtain and according to each commute The travel data of the user identifies the home location and work location of each commute user, and deletes at least one unrecognizable commuter user of the home location and the work location.
本公开提供了一种用户通勤轨迹管理系统,包括:定位系统、通勤道路管理系统以及本公开提供的用户通勤轨迹管理装置;其中,定位系统设置为监听用户的出行数据,出行数据包括停留点及停留点对应的时间戳;用户通勤轨迹管理装置设置为通过定位系统获取每个用户的出行数据,根据每个用户的出行数据计算每个用户的用户规律性轨迹,以及输出每个用户的用户规律性轨迹至通勤道路管理系统;以及通勤道路管理系统设置为根据用户规律性轨迹管理通勤道路。本公开还提供了一种非暂态计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令设置为执行上述用户通勤轨迹管理方法。The present disclosure provides a user commute trajectory management system, including: a positioning system, a commuter road management system, and a user commute trajectory management apparatus provided by the present disclosure; wherein the positioning system is configured to monitor user travel data, and the travel data includes a stay point and The time stamp corresponding to the stay point; the user commute trajectory management device is configured to acquire the travel data of each user through the positioning system, calculate the regularity trajectory of each user according to the travel data of each user, and output the user rule of each user The trajectory to the commuter road management system; and the commuter road management system is set to manage the commuting road according to the user's regular trajectory. The present disclosure also provides a non-transitory computer readable storage medium storing computer executable instructions arranged to perform the user commute trajectory management method described above.
本公开还提供了一种电子设备,包括:The present disclosure also provides an electronic device, including:
至少一个处理器;以及At least one processor;
与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器执行上述的方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to cause the at least one processor to perform the method described above.
本公开提供了一种用户通勤轨迹管理方法,通过定位系统,如用户终端上的定位设备或者通信基站等获取用户的出行数据,根据用户出行数据计算用户规律性轨迹并输出,在该过程中,不用进行用户调研,增强了对用户通勤的管理力度,可以对用户通勤进行更全面的了解和掌控,改善了用户的使用体验。The present disclosure provides a user commute trajectory management method, which acquires travel data of a user through a positioning system, such as a positioning device or a communication base station on a user terminal, calculates a user regular trajectory according to user travel data, and outputs, in the process, Without user research, the management of user commuting is enhanced, and a more comprehensive understanding and control of user commuting can be achieved, improving the user experience.
附图说明DRAWINGS
图1为本公开第一实施例提供的用户通勤轨迹管理系统的结构示意图;1 is a schematic structural diagram of a user commute trajectory management system according to a first embodiment of the present disclosure;
图2为本公开第一实施例中处理模块的结构示意图;2 is a schematic structural diagram of a processing module in a first embodiment of the present disclosure;
图3为本公开第二实施例提供的用户通勤轨迹管理方法的流程图; FIG. 3 is a flowchart of a user commute trajectory management method according to a second embodiment of the present disclosure;
图4为本公开第二实施例中处理出行数据这一步骤的流程图;4 is a flow chart showing the steps of processing travel data in the second embodiment of the present disclosure;
图5为本公开第三实施例提供的用户通勤轨迹管理方法的流程图;FIG. 5 is a flowchart of a user commute trajectory management method according to a third embodiment of the present disclosure;
图6为本公开第三实施例中轨迹相似度计算示意图;以及6 is a schematic diagram of calculation of trajectory similarity in a third embodiment of the present disclosure;
图7是本公开实施例提供的电子设备的硬件结构示意图。FIG. 7 is a schematic structural diagram of hardware of an electronic device according to an embodiment of the present disclosure.
具体实施方式detailed description
现通过具体实施方式结合附图的方式对本公开做出可选的诠释说明。在不冲突的情况下,以下实施例以及实施例中的特征可以相互任意组合。Alternative explanations of the present disclosure are now made by way of specific embodiments in conjunction with the accompanying drawings. The features of the following embodiments and embodiments may be arbitrarily combined with each other without conflict.
第一实施例First embodiment
图1为本公开第一实施例提供的用户通勤轨迹管理系统的结构示意图,由图1可知,在本实施例中,本公开提供的用户通勤轨迹管理系统包括:本公开提供的用户通勤轨迹管理装置1、定位系统2以及通勤道路管理系统3。其中,定位系统2设置为监听用户的出行数据,出行数据包括停留点及停留点对应的时间戳。定位系统2可以包括基站、全球定位系统(Global Positioning System,GPS)、北斗定位系统以及基于无线保真(Wireless Fidelity,WiFi)或者蓝牙的定位系统等。用户通勤轨迹管理装置1设置为通过定位系统获取每个用户的出行数据,根据每个用户的出行数据计算每个用户的用户规律性轨迹,以及输出每个用户的用户规律性轨迹至通勤道路管理系统3。通勤道路管理系统3设置为根据用户规律性轨迹管理通勤道路。FIG. 1 is a schematic structural diagram of a user commute trajectory management system according to a first embodiment of the present disclosure. As shown in FIG. 1 , in the embodiment, the user commute trajectory management system provided by the present disclosure includes: user commute trajectory management provided by the present disclosure. The device 1, the positioning system 2, and the commuter road management system 3. The positioning system 2 is configured to monitor the travel data of the user, and the travel data includes a time stamp corresponding to the stay point and the stay point. The positioning system 2 may include a base station, a Global Positioning System (GPS), a Beidou positioning system, and a Wireless Fidelity (WiFi) or Bluetooth based positioning system. The user commute trajectory management device 1 is configured to acquire travel data of each user through the positioning system, calculate a regularity trajectory of each user according to travel data of each user, and output a regular trajectory of each user to commute road management. System 3. The commuter road management system 3 is arranged to manage commuting roads according to the user's regular trajectory.
在一些实施例中,如图1所示,本公开提供的用户通勤轨迹管理装置1可以包括:获取模块11、处理模块12和输出模块13。In some embodiments, as shown in FIG. 1 , the user commute trajectory management apparatus 1 provided by the present disclosure may include an acquisition module 11 , a processing module 12 , and an output module 13 .
获取模块11设置为通过定位系统获取每个用户的出行数据,其中,出行数据包括停留点及停留点对应的时间戳。The obtaining module 11 is configured to acquire travel data of each user through a positioning system, wherein the travel data includes a time stamp corresponding to the stay point and the stay point.
处理模块12设置为根据每个用户的出行数据计算每个用户的用户规律性轨迹。The processing module 12 is arranged to calculate a user regularity trajectory for each user based on the travel data of each user.
输出模块13设置为输出每个用户的用户规律性轨迹至通勤道路管理系统。The output module 13 is arranged to output a user trajectory of each user to the commuter road management system.
在一些实施例中,如图2所示,上述实施例中的处理模块12可以包括:通勤特征计算子模块121、通勤轨迹生成子模块122、通勤聚类管理子模块123、导航路径生成子模块124以及通勤规律轨迹子模块125。 In some embodiments, as shown in FIG. 2, the processing module 12 in the foregoing embodiment may include: a commute feature calculation sub-module 121, a commute trajectory generation sub-module 122, a commuter cluster management sub-module 123, and a navigation path generation sub-module. 124 and the commuter regular track sub-module 125.
通勤特征计算子模块121设置为根据每个用户的出行数据计算通勤特征,其中,通勤特征包括家庭位置、工作位置以及通勤时段。The commute feature calculation sub-module 121 is configured to calculate a commute feature based on travel data for each user, wherein the commute feature includes a home location, a work location, and a commute time period.
通勤轨迹生成子模块122设置为根据通勤特征的通勤时段、出行数据中的停留点及停留点对应的时间戳,筛选得到属于通勤时段的出行数据,将筛选得到的出行数据中的停留点按照时间戳排序生成通勤轨迹。The commute trajectory generation sub-module 122 is configured to filter the travel data belonging to the commute time period according to the commute period of the commute feature, the stay point in the travel data, and the time stamp corresponding to the stay point, and select the stay point in the filtered travel data according to the time. The stamp ordering generates a commute trajectory.
通勤聚类管理子模块123设置为计算每个用户的每两个通勤轨迹之间的轨迹相似度,将每个用户的每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定聚类簇的代表通勤轨迹。The commuter cluster management sub-module 123 is configured to calculate a trajectory similarity between each two commute trajectories of each user, and gather a commute trajectory with a trajectory similarity between each two commute trajectories of each user that is greater than a first threshold. The class generates a cluster of clusters and determines the representative commuter trajectory of the cluster cluster.
导航路径生成子模块124设置为根据选择策略从聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据家庭位置、工作位置、通勤轨迹频繁点及通勤轨迹频繁点对应的时间戳生成导航路径。The navigation path generation sub-module 124 is configured to select a frequent point of the commute trajectory from the stay points of all the commutation trajectories of the cluster cluster according to the selection strategy, and generate navigation according to the time position corresponding to the home location, the work position, the frequent points of the commute trajectory, and the frequent points of the commute trajectory. path.
通勤规律轨迹子模块125设置为计算导航路径与代表通勤轨迹之间的轨迹相似度,若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,则将导航路径作为聚类簇的通勤规律性轨迹,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹。The commute regular trajectory sub-module 125 is configured to calculate a trajectory similarity between the navigation path and the representative commute trajectory. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as a clustering cluster for commuting The regular trajectory generates a user regular trajectory according to the commuting regular trajectory of the cluster cluster.
在一些实施例中,上述实施例中的通勤时段可以包括上班通勤时段及下班通勤时段,通勤轨迹包括上班通勤轨迹及下班通勤轨迹,导航路径生成子模块124可以设置为,若聚类簇由上班通勤轨迹聚类生成,则生成以用户的家庭位置为起点、工作位置为终点以及依次通过通勤轨迹频繁点的上班导航路径;以及若聚类簇由下班通勤轨迹聚类生成,则生成以用户的工作位置为起点、家庭位置为终点以及依次通过通勤轨迹频繁点的下班导航路径。In some embodiments, the commute time period in the above embodiment may include a commute time and an off-duty commute period, the commute trajectory includes a commute trajectory and an off-duty commute trajectory, and the navigation path generation sub-module 124 may be configured to: if the cluster cluster is from work The commutation trajectory clustering generation generates a work navigation path starting from the user's home position, the work position as the end point, and the frequent frequent commute trajectory; and if the cluster cluster is generated by the commute trajectory clustering, the user is generated. The working position is the starting point, the family position is the ending point, and the off-duty navigation path that is frequently followed by the commute trajectory.
在一些实施例中,上述实施例中的通勤规律轨迹子模块125还可以设置为若导航路径与代表通勤轨迹之间的轨迹相似度小于第二阈值,则触发导航路径生成子模块重新选择通勤轨迹频繁点并生成新导航路径,计算并判断新导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将新导航路径作为聚类簇的通勤规律性轨迹,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值。 In some embodiments, the commute regular trajectory sub-module 125 in the above embodiment may further be configured to trigger the navigation path generation sub-module to re-select the commute trajectory if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold. Frequently generating a new navigation path, calculating and determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold, if the trajectory similarity between the new navigation path and the representative commute trajectory is greater than The second threshold is used, and the new navigation path is used as a commuting regular trajectory of the cluster cluster. If the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop execution is performed again. The step of selecting a frequent commute trajectory and generating a new navigation path until the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold.
在一些实施例中,上述实施例中的导航路径生成子模块124还可以设置为计算导航路径的导航预估时长,通勤规律轨迹子模块125还可以设置为在判断导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值时,判断导航路径的导航预估时长与代表通勤轨迹的时长差值是否小于第三阈值;以及若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值、且导航路径的导航预估时长与代表通勤轨迹的时长差值小于第三阈值,则将导航路径作为聚类簇的通勤规律性轨迹。In some embodiments, the navigation path generation sub-module 124 in the above embodiment may further be configured to calculate a navigation estimation duration of the navigation path, and the commute regular trajectory sub-module 125 may also be configured to determine between the navigation path and the representative commute trajectory. If the trajectory similarity is greater than the second threshold, determine whether the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commutation trajectory is greater than the second The threshold value and the difference between the navigation estimation duration of the navigation path and the duration of the representative commutation trajectory are smaller than the third threshold, and the navigation path is used as the commuting regular trajectory of the cluster cluster.
在一些实施例中,若上述实施例中的聚类簇为多个,通勤规律轨迹子模块125设置为计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若聚类簇为一个,通勤规律轨迹子模块125设置为将聚类簇的通勤规律性轨迹作为用户规律性轨迹。In some embodiments, if there are multiple cluster clusters in the foregoing embodiment, the commute regular trajectory sub-module 125 is configured to calculate the proportion of the number of commute trajectories of each cluster cluster, and the proportion of the commutation trajectories of the cluster clusters As the weight of the clustering commute regular trajectory, the set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory; and if the clustering cluster is one, the commuting regular trajectory sub-module 125 is set to cluster The regular trajectory of the commute of the cluster serves as the regular trajectory of the user.
在一些实施例中,若上述实施例中的聚类簇为多个,通勤规律轨迹子模块125还设置为删除通勤轨迹数量占比小于第四阈值的聚类簇。In some embodiments, if there are multiple cluster clusters in the foregoing embodiment, the commute regular trajectory sub-module 125 is further configured to delete the cluster cluster whose occupation trajectory number is smaller than the fourth threshold.
在一些实施例中,上述实施例中的通勤聚类管理子模块123设置为逐一计算聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将轨迹相似度之和最大的通勤轨迹作为代表通勤轨迹。In some embodiments, the commuter cluster management sub-module 123 in the above embodiment is configured to calculate the sum of the trajectory similarities between each commute trajectory and other commute trajectories in the cluster cluster one by one, and maximize the sum of trajectory similarities. The commuter trajectory serves as a commuter trajectory.
在一些实施例中,如图2所述,上述实施例中的处理模块12还可以包括轨迹相似度计算子模块126,设置为计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。In some embodiments, as shown in FIG. 2, the processing module 12 in the above embodiment may further include a trajectory similarity calculation sub-module 126 configured to calculate the trajectory similarity between the trajectory A and the trajectory B as all locations of the trajectory A path. The minimum distance from the point of the trajectory B path is less than the number a' of the fifth threshold; the trajectory similarity between the calculated trajectory B and the trajectory A is the smallest distance from the point of the trajectory A path in all the locations of the trajectory B path The value b' is less than the fifth threshold; and the trajectory similarity between the calculated trajectory A and the trajectory B is (a'+b')/(a+b), where a is the number of locations of the trajectory A path, b For the number of locations in the trajectory A route, / indicates the division operation.
在一些实施例中,上述实施例中的通勤轨迹生成子模块121在生成通勤轨迹之后,还可以设置为通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。In some embodiments, after the commute trajectory is generated, the commute trajectory generating sub-module 121 in the foregoing embodiment may further be configured to calculate a trajectory abnormal coefficient by an outlier detection algorithm, and delete a commute whose trajectory abnormal coefficient is greater than a sixth threshold. Track.
在一些实施例中,上述实施例中的通勤轨迹生成子模块121在生成通勤轨迹之前,还可以设置为从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户,生成可识别的通勤用户的通勤轨迹。 In some embodiments, the commute trajectory generation sub-module 121 in the foregoing embodiment may further be configured to filter out commute users whose home location and work location are identifiable from all users before generating the commute trajectory, and generate an identifiable commute. User's commute trajectory.
在一些实施例中,上述实施例中的通勤轨迹生成子模块121设置为根据每个用户的出行数据计算每个用户的出行地点离散熵,将出行地点离散熵小于第六阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别每个通勤用户的家庭位置和工作位置,删除所述家庭位置和所述工作位置中至少一个不可识别的通勤用户。In some embodiments, the commute trajectory generation sub-module 121 in the above embodiment is configured to calculate a discrete entropy of the travel location of each user according to the travel data of each user, and use the user whose travel location discrete entropy is less than the sixth threshold as the commute user. And obtaining and determining, according to the travel data of each commute user, a home location and a work location of each commute user, and deleting at least one unrecognizable commuter user of the home location and the work location.
第二实施例Second embodiment
图3为本公开第二实施例提供的用户通勤轨迹管理方法的流程图。FIG. 3 is a flowchart of a method for managing a user commute trajectory according to a second embodiment of the present disclosure.
在步骤310中,通过定位系统获取每个用户的出行数据,其中,出行数据包括停留点及停留点对应的时间戳。In step 310, the travel data of each user is obtained by the positioning system, wherein the travel data includes a time stamp corresponding to the stay point and the stay point.
在步骤320中,根据每个用户的出行数据计算每个用户的用户规律性轨迹。In step 320, the user's regularity trajectory for each user is calculated based on the travel data for each user.
在步骤330中,输出每个用户的用户规律性轨迹至通勤道路管理系统。In step 330, the user's regular trajectory for each user is output to the commuter road management system.
在一些实施例中,如图4所示,上述实施例中的根据每个用户的出行数据计算每个用户的用户规律性轨迹可以包括以下步骤。In some embodiments, as shown in FIG. 4, calculating the user regularity trajectory of each user according to the travel data of each user in the above embodiment may include the following steps.
在步骤410中,根据出行数据计算通勤特征,其中,通勤特征包括家庭位置、工作位置以及通勤时段。In step 410, a commute feature is calculated based on the travel data, wherein the commute feature includes a home location, a work location, and a commute time period.
在步骤420中,根据通勤特征的通勤时段、出行数据中的停留点及停留点对应的时间戳,筛选得到属于通勤时段的出行数据,将筛选得到的出行数据中的停留点按照时间戳排序生成通勤轨迹。In step 420, according to the commute time of the commute feature, the stay point in the travel data, and the timestamp corresponding to the stop point, the travel data belonging to the commute time period is filtered, and the stay points in the filtered travel data are sorted according to the time stamp. Commuter trajectory.
在步骤430中,计算每个用户的每两个通勤轨迹之间的轨迹相似度,将轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定聚类簇的代表通勤轨迹。In step 430, the trajectory similarity between each two commute trajectories of each user is calculated, the commute trajectory with the trajectory similarity greater than the first threshold is clustered to generate a cluster cluster, and the representative commute trajectory of the cluster cluster is determined.
在步骤440中,根据选择策略从聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据家庭位置、工作位置、通勤轨迹频繁点及通勤轨迹频繁点对应的时间戳生成导航路径。In step 440, the frequent points of the commute trajectory are selected from the stay points of all the commutation trajectories of the cluster cluster according to the selection strategy, and the navigation path is generated according to the time stamp corresponding to the home position, the working position, the frequent points of the commute trajectory and the frequent points of the commute trajectory.
在步骤450中,计算导航路径与代表通勤轨迹之间的轨迹相似度,若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,则将导航路径作为聚类簇的通勤规律性轨迹,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹。In step 450, the trajectory similarity between the navigation path and the representative commuter trajectory is calculated. If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the navigation path is used as the commuting regular trajectory of the cluster cluster. According to the regularity trajectory of the clustering cluster, the user's regular trajectory is generated.
在一些实施例中,上述实施例中的通勤时段包括上班通勤时段及下班通勤 时段,通勤轨迹包括上班通勤轨迹及下班通勤轨迹,步骤440包括:若聚类簇由上班通勤轨迹聚类生成,则生成以用户的家庭位置为起点、工作位置为终点以及依次通过通勤轨迹频繁点的上班导航路径;以及若聚类簇由下班通勤轨迹聚类生成,则生成以用户的工作位置为起点、家庭位置为终点以及依次通过通勤轨迹频繁点的下班导航路径。In some embodiments, the commute time period in the above embodiment includes commuting time and commuting to work. During the time period, the commute trajectory includes a commute trajectory and an off-duty commute trajectory, and step 440 includes: if the cluster cluster is generated by the commuting trajectory of the commute, the generation starts from the user's home position, the working position is the end point, and the commute trajectory is frequently followed. The work navigation path; and if the cluster cluster is generated by the commute trajectory clustering, the work navigation route starting from the user's work position, the family position as the end point, and the frequent commute trajectory in turn is generated.
在一些实施例中,上述实施例中的步骤450还可以包括:若导航路径与代表通勤轨迹之间的轨迹相似度小于第二阈值,则重新选择通勤轨迹频繁点并生成新导航路径,计算并判断新导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值,若新导航路径与代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将新导航路径作为聚类簇的通勤规律性轨迹,若新导航路径与代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值。In some embodiments, the step 450 in the above embodiment may further include: if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold, reselecting the frequent points of the commute trajectory and generating a new navigation path, and calculating Determining whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than a second threshold. If the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, the new navigation path is used as a cluster cluster. The commuting regularity trajectory, if the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the loop performs the steps of reselecting the frequent points of the commute trajectory and generating a new navigation path until the new navigation path and The trajectory similarity between the representative commute trajectories is greater than the second threshold.
在一些实施例中,上述实施例中的方法还可以包括:计算导航路径的导航预估时长,在判断导航路径与代表通勤轨迹之间的轨迹相似度是否大于第二阈值时,判断导航路径的导航预估时长与代表通勤轨迹的时长差值是否小于第三阈值;以及若导航路径与代表通勤轨迹之间的轨迹相似度大于第二阈值,且导航路径的导航预估时长与代表通勤轨迹的时长差值小于第三阈值,则将导航路径作为聚类簇的通勤规律性轨迹。In some embodiments, the method in the foregoing embodiment may further include: calculating a navigation estimation duration of the navigation path, and determining whether the navigation path is greater than whether the trajectory similarity between the navigation path and the representative commutation trajectory is greater than a second threshold. Whether the difference between the estimated duration and the duration of the commute trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, and the navigation estimation duration of the navigation path and the representative commute trajectory If the time difference is smaller than the third threshold, the navigation path is used as the commuting regular track of the cluster.
在一些实施例中,若上述实施例中的聚类簇为多个,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹可以包括:计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若聚类簇为一个,根据聚类簇的通勤规律性轨迹生成用户规律性轨迹包括:将聚类簇的通勤规律性轨迹作为用户规律性轨迹。In some embodiments, if the clustering clusters in the foregoing embodiment are multiple, generating a user regular trajectory according to the commuting regular trajectory of the clustering cluster may include: calculating a proportion of the number of commuting trajectories of each cluster cluster, The number of commuting trajectories of the cluster clusters is the weight of the regular trajectory of the commuting clusters, and the set of commuting regular trajectories of each cluster cluster is taken as the regular trajectory of the user; and if the clustering cluster is one, according to The regularity trajectory of the commuting regularity trajectory of the clustering cluster includes the regular trajectory of the commuting of the clustering cluster as the regular trajectory of the user.
其中,所述每个聚类簇的通勤轨迹数量占比为所述每个聚类簇中通勤轨迹的数量与所有通勤轨迹的数量的比值。The number of commute trajectories of each cluster cluster is a ratio of the number of commute trajectories in each cluster cluster to the number of all commute trajectories.
在一些实施例中,若上述实施例中的聚类簇为多个,用户通勤轨迹管理方法还可以包括:删除通勤轨迹数量占比小于第四阈值的聚类簇。In some embodiments, if the clustering clusters in the foregoing embodiment are multiple, the user commute trajectory management method may further include: deleting cluster clusters whose occupation trajectory number is smaller than a fourth threshold.
在一些实施例中,上述实施例中的确定每个聚类簇的代表通勤轨迹可以包 括:逐一计算聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将轨迹相似度之和最大的通勤轨迹作为代表通勤轨迹。In some embodiments, the representative commute trajectory for determining each cluster cluster in the above embodiment may be packaged. Including: calculating the sum of the trajectories similarity between each commute trajectory and other commuting trajectories in the cluster cluster one by one, and taking the commute trajectory with the largest sum of trajectories similarity as the representative commuter trajectory.
在一些实施例中,上述实施例中的轨迹之间的轨迹相似度的计算可以包括:计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。In some embodiments, the calculation of the trajectory similarity between the trajectories in the above embodiment may include: calculating the trajectory similarity between the trajectory A and the trajectory B as the distance from the point of the trajectory B path among all the locations of the trajectory A path The minimum value is less than the number of the fifth threshold a'; the trajectory similarity between the calculated trajectory B and the trajectory A is the number b' of the distance minimum from the point of the trajectory A path among all the locations of the trajectory B path And the trajectory similarity between the calculated trajectory A and the trajectory B is (a'+b')/(a+b), where a is the number of locations of the trajectory A pathway, and b is the number of locations of the trajectory A pathway, / Indicates the division operation.
在一些实施例中,上述实施例中的方法在生成通勤轨迹之后,还可以包括:通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。In some embodiments, after the method of generating the commutation trajectory, the method in the foregoing embodiment may further include: calculating a trajectory abnormality coefficient by using an outlier detection algorithm, and deleting a commutation trajectory whose trajectory abnormal coefficient is greater than a sixth threshold.
在一些实施例中,上述实施例中的方法在生成通勤轨迹之前,还可以包括:从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户,生成可识别的通勤用户的通勤轨迹。In some embodiments, the method in the foregoing embodiment may further include: selecting, from all users, a commute user that is identifiable by the home location and the work location, and generating a commensurable commute trajectory of the commute user.
在一些实施例中,上述实施例中的从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户可以包括:根据每个用户的出行数据计算每个用户的出行地点离散熵,将出行地点离散熵小于第六阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别每个通勤用户的家庭位置和工作位置,若不可识别,删除所述家庭位置和所述工作位置中至少一个不可识别的通勤用户。In some embodiments, the commuting user that can identify the home location and the work location from all the users in the foregoing embodiment may include: calculating the discrete entropy of each user's travel location according to the travel data of each user, and traveling a user whose location discrete entropy is less than a sixth threshold is used as a commute user; and obtains and identifies a home location and a work location of each commute user according to travel data of each commute user, and if not recognized, deletes the home location and the work At least one unrecognizable commuter user in the location.
第三实施例Third embodiment
相关的通勤调查方法是采用调查问卷,这种方法时效性差,样本量不足,通勤统计效果很差。The relevant commuter survey method is to use the questionnaire. This method has poor timeliness, insufficient sample size, and poor commutation statistics.
本实施例提供了一种用户通勤轨迹管理系统,该系统可以包括:存储装置(或分布式存储装置)、采集装置、定位系统、本公开提供的用户通勤轨迹管理装置以及通勤道路管理系统。This embodiment provides a user commute trajectory management system, which may include: a storage device (or a distributed storage device), a collection device, a positioning system, a user commute trajectory management device provided by the present disclosure, and a commuter road management system.
本申请所涉及的通过定位系统获取通勤信息可以包括:通过GPS、北斗定位系统或移动网络定位数据统计通勤特征,如家庭位置、工作位置、通勤时长以及回转半径等;通过公交和轨道交通集成电路卡(Integrated Circuit Card,IC) 数据获取居民通勤出行特征信息,如通勤出行时间、通勤出行距离和换乘特性等;或者,通过监控摄像头直播早高峰和晚高峰时段关键路段路况获取通勤信息。The obtaining of the commute information by the positioning system involved in the present application may include: statistic of commuting characteristics, such as home position, working position, commute duration, and radius of gyration, by means of GPS, Beidou positioning system or mobile network positioning data; Card (Integrated Circuit Card, IC) The data obtains the commuter travel characteristic information of the residents, such as commuter travel time, commuter travel distance and transfer characteristics, etc.; or, obtains commute information by monitoring the live broadcast of the camera at the early peak and the late peak hours.
可选的,采集装置设置为采集GPS数据、北斗定位系统的数据、移动网络位置信令数据或者通话话单数据,提取用户标识、时间戳以及位置字段。对于GPS数据和北斗定位系统数据,位置字段可以是经度和纬度,对于移动网络位置信令数据或者通话话单数据,位置字段可以是位置区编号和基站小区编号。每一条位置数据是出行中经过的地点或停留点,统称停留点。对采集装置采集的数据可以进行预处理,包括去隐私化处理,去隐私化处理可以把用户标识替换为无意义的标识符号。数据的预处理中还可以通过基站聚类把球面距离很近的基站进行合并。Optionally, the collecting device is configured to collect GPS data, data of a Beidou positioning system, mobile network location signaling data, or call bill data, and extract a user identifier, a time stamp, and a location field. For GPS data and Beidou positioning system data, the location field may be longitude and latitude. For mobile network location signaling data or call CDR data, the location field may be a location area number and a base station cell number. Each piece of position data is the place or stop point in the trip, collectively referred to as the stop point. The data collected by the collection device can be pre-processed, including de-privacy processing, and the privacy-removal process can replace the user identifier with a meaningless identifier. In the preprocessing of data, base stations with close spherical distances can also be combined by base station clustering.
存储装置(或分布式存储装置)可以存储多种数据,包括采集装置、用户通勤轨迹管理装置以及通勤道路管理系统的数据和计算结果。The storage device (or distributed storage device) can store a variety of data, including data and calculation results of the collection device, the user commute trajectory management device, and the commuter road management system.
用户通勤轨迹管理装置可以根据每个用户的出行数据计算每个用户的出行地点离散熵。熵是混乱和无序的量度,熵越小(越接近0),稳定性越高,可预测性越强。出行地点离散熵为用户的出行地点的离散的可选的熵。可以从每个用户的出行的时间和地点数据分析出行地点离散熵,计算每个用户一周7天每小时的出行地点离散熵,将计算结果保存到存储装置中。The user commute trajectory management device can calculate the discrete entropy of each user's travel location based on the travel data of each user. Entropy is a measure of chaos and disorder. The smaller the entropy (closer to 0), the higher the stability and the stronger the predictability. The travel location discrete entropy is the discrete optional entropy of the user's travel location. The discrete entropy of the travel location can be analyzed from the time and location data of each user's travel, and the discrete entropy of the travel location per hour and 7 days per user is calculated, and the calculation result is saved in the storage device.
本公开提出的通勤用户可以为在工作日的办公时段和夜间睡眠时段离散熵小的典型上班族,这样的用户通常有稳定的家庭位置和工作位置。计算这类用户的规律性通勤轨迹,对每一个通勤用户,计算每个通勤用户的通勤特征,通勤特征可以包括家庭位置、工作位置、上班通勤时段和上班时长以及下班通勤时段和下班时长,将计算结果保存到存储装置中。The commute users proposed by the present disclosure may be typical office workers with small discrete entropy during office hours and nighttime sleep periods on weekdays, such users typically having stable home locations and work locations. Calculate the regular commute trajectory of such users, and calculate the commute characteristics of each commute user for each commute user. The commute features may include family location, work location, commute time and working hours, commute time and off-duty time. The calculation result is saved to the storage device.
可选的,对于每个通勤用户,提取出每个通勤用户在工作日上班通勤时段内从家庭位置到工作位置的出行数据,提取出每个通勤用户在工作日下班通勤时段内从工作位置到家庭位置的出行数据。同一用户的带时间戳的出行数据可以形成通勤出行轨迹序列。Optionally, for each commute user, the travel data of each commute user from the family location to the work location during the commute time of the working day is extracted, and each commuter user is extracted from the working position during the commute time of the working day. Travel data for home location. Time-stamped travel data for the same user can form a sequence of commute travel trajectories.
用户通勤轨迹管理装置可以对每个通勤用户的通勤出行轨迹序列进行处理,其中,用户通勤轨迹管理装置可以对上班通勤数据和下班通勤数据分别处理。 The user commute trajectory management device can process the commute travel trajectory sequence of each commute user, wherein the user commute trajectory management device can separately process the commute commute data and the commute commute data.
可选的,通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。所述轨迹异常系数表明该轨迹与与该用户其他的出行轨迹数据的区别。可以把轨迹异常系数大于第六阈值的通勤轨迹识别出来,并把识别出来的通勤轨迹作为非规律性异常轨迹,在通勤轨迹聚类的预处理中将非规律性异常轨迹丢弃。去除非规律性异常轨迹的通勤轨迹记为集合A,计算集合A中每两个通勤轨迹之间的轨迹相似度,把每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹进行聚类,并生成聚类簇Ci。一个或多个聚类簇Ci形成合并集合C,计算C中每一个聚类簇Ci的通勤轨迹频繁点,每个通勤轨迹频繁点附带有通勤轨迹序号和该通勤轨迹频繁点的小时标签,并计算每一个聚类簇的通勤轨迹数量占比。对于每个聚类簇Ci中每条通勤轨迹tr,计算与Ci中除通勤轨迹tr之外的通勤轨迹之间的轨迹相似度之和tr_dist,把tr_dist最大的那条通勤轨迹设置为聚类簇Ci的代表通勤轨迹Cij。Optionally, the trajectory abnormality coefficient is calculated by the outlier detection algorithm, and the commutation trajectory whose trajectory abnormal coefficient is greater than the sixth threshold is deleted. The trajectory anomaly coefficient indicates the difference between the trajectory and other travel trajectory data of the user. The commute trajectory with the trajectory anomaly coefficient greater than the sixth threshold may be identified, and the identified commute trajectory may be regarded as an irregular abnormal trajectory, and the irregularity abnormal trajectory is discarded in the preprocessing of the commuter trajectory clustering. The commute trajectory from which the irregularity trajectory is removed is recorded as set A, and the trajectory similarity between each two commute trajectories in the set A is calculated, and the commute trajectory with the trajectory similarity between each two commute trajectories being greater than the first threshold is performed. Cluster and generate cluster cluster Ci. One or more cluster clusters Ci form a merged set C, and calculate a frequent point of the commute trajectory of each cluster cluster Ci in each of C, and each commute trajectory is frequently accompanied by a commute trajectory number and an hour tag of the frequent trajectory of the commute trajectory, and Calculate the proportion of the number of commute trajectories for each cluster cluster. For each commute track tr in each cluster cluster Ci, calculate the sum of the trajectory similarities between the commute trajectories other than the commute trajectory tr in Ci, tr_dist, and set the commute trajectory with the largest tr_dist as the cluster cluster Ci's representative commuter track Cij.
其中,通勤轨迹频繁点可以是用户的通勤轨迹中经常经过和停留的位置,通勤轨迹频繁点的选择策略,可以是频繁点的数量等于该聚类簇中每条通勤轨迹的停留点数量的平均值,频繁点也可以是通勤轨迹数量占比超过0.2的停留点。通勤轨迹频繁点的小时标签可以是通勤轨迹在该停留点的时间戳,所述小时标签可以精确到小时,若存在多个时间戳则取平均值且平均值精确到小时。The frequent points of the commute trajectory may be a position that the user frequently passes and stays in the commute trajectory, and the frequent selection strategy of the commute trajectory may be that the number of frequent points is equal to the average number of stay points of each commute trajectory in the cluster cluster. The value, the frequent point may also be a stay point where the number of commute tracks accounts for more than 0.2. The hour tag of the frequent commute track may be the time stamp of the commute track at the stay point, the hour tag may be accurate to the hour, and if there are multiple timestamps, the average is averaged and the average is accurate to the hour.
根据C中每一个聚类簇Ci的通勤轨迹频繁点按照通勤轨迹频繁点的时间戳构成的序列,生成以家庭位置为起点、工作位置为终点,依次经过每个通勤轨迹频繁点的导航路径Di。可以提取导航路径Di的停留点的GPS坐标和导航路径的导航预估时长,计算聚类簇Ci的代表通勤轨迹Cij和对应的导航路径Di之间的轨迹相似度,以及代表通勤轨迹Cij的时长与导航路径Di的导航预估时长之间的差值。其中,可以通过位置服务提供商的地图开放平台提供的路径规划和导航功能应用程序编程接口(Application Programming Interface,API),获取导航路径Di和导航预估时长。代表通勤轨迹Cij的时长为代表通勤轨迹Cij的起点时间戳和终点时间戳之间的时间差。若代表通勤轨迹Cij和导航路径Di之间的轨迹相似度大于第二阈值,且代表通勤轨迹Cij的时长与导航路径Di的导航预估时长之间的差值小于第三阈值,则可将导航路径Di作为该用户的一条通勤规律性轨迹。其中,第二阈值和第三阈值可以根据用户或者开发者的精度要求进行设置。因为合并集合C中可能有多个聚类簇,所以可能有多条导航路径,也就有多条通勤规律性轨迹。提取导航路径Di途径点的GPS坐标,其中,提取 的导航路径Di途径点的数量可以为聚类簇Ci中轨迹停留点数量的平均值的整数部分,提取的导航路径Di途径点是通勤轨迹频繁点之间的有一定间隔的点。According to the sequence of the time-of-day trajectory of each cluster cluster Ci in C, according to the sequence formed by the time stamp of the frequent trajectory of the commute trajectory, a navigation path Di is generated which takes the family position as the starting point and the working position as the end point, and sequentially passes through each commute trajectory frequently. . The GPS coordinates of the stay point of the navigation path Di and the navigation estimation duration of the navigation path can be extracted, the trajectory similarity between the representative commute trajectory Cij of the cluster cluster Ci and the corresponding navigation path Di, and the duration of the commute trajectory Cij can be calculated. The difference between the estimated length of the navigation and the navigation path Di. The navigation path Di and the navigation estimation duration can be obtained through a path planning and navigation function application programming interface (API) provided by the location service provider's map open platform. The duration representing the commute trajectory Cij is the time difference between the start time stamp and the end point time stamp representing the commute trajectory Cij. If the trajectory similarity between the commute trajectory Cij and the navigation path Di is greater than the second threshold, and the difference between the duration of the commute trajectory Cij and the navigation estimation duration of the navigation path Di is less than the third threshold, the navigation may be The path Di serves as a commute regular trajectory for the user. The second threshold and the third threshold may be set according to the accuracy requirements of the user or the developer. Because there may be multiple clusters in the merged set C, there may be multiple navigation paths, and there are multiple commuter regular trajectories. Extract the GPS coordinates of the navigation path Di route point, where, extract The number of the navigation path Di route points may be an integer part of the average value of the number of track stay points in the cluster cluster Ci, and the extracted navigation path Di route point is a point with a certain interval between the frequent points of the commute track.
通勤规律性轨迹还可以通过综合相似度的进行确定。综合相似度F=f(x,y),x是轨迹相似度,y是通勤时长之差,f(x,y)是关于x、y两个变量的函数。F=f(x,y)可以是二元一次线性函数,则F=f(x,y)=ax+by+c,其中,a、b和c为常数。在两个变量中,x对F的影响权重比y对F的影响权重大。若F在阈值范围内,则说明合并轨迹和合并轨迹对应的导航路径重合度高,该导航路径为用户的规律性轨迹。若F不在阈值范围内,则说明合并轨迹和合并轨迹对应的导航路径差异大,可以重新选择聚类簇的通勤轨迹频繁点并生成新导航路径,重新计算得到F。若该聚类簇的通勤轨迹数量占比太小(比如小于0.1)也可以选择丢弃该聚类簇轨迹。The regularity of commuting can also be determined by comprehensive similarity. The comprehensive similarity F = f (x, y), x is the trajectory similarity, y is the difference of the commute duration, and f (x, y) is a function of the two variables x and y. F = f (x, y) can be a binary linear function, then F = f (x, y) = ax + by + c, where a, b and c are constants. Among the two variables, the weight of the influence of x on F is greater than the influence of y on F. If F is in the threshold range, it indicates that the merged track and the merged track have a high degree of coincidence of the navigation path, and the navigation path is a regular track of the user. If F is not within the threshold range, it indicates that the navigation path corresponding to the merged track and the merged track has a large difference, and the frequent points of the commutation trajectory of the cluster cluster can be re-selected and a new navigation path is generated, and F is recalculated. If the number of commuting trajectories of the cluster cluster is too small (for example, less than 0.1), the cluster cluster trajectory may be discarded.
若上述合并集合C中只有一个聚类簇,则得到的该聚类簇的通勤规律性轨迹就是该用户的用户规律性轨迹。若上述合并集合C中有多个聚类,计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹。If there is only one cluster cluster in the above merged set C, the commute regular trajectory of the obtained cluster cluster is the user's regular trajectory of the user. If there are multiple clusters in the merged set C, calculate the proportion of the number of commuting trajectories of each cluster cluster, and the proportion of the number of commuting trajectories of the cluster clusters is used as the weight of the commuting regular trajectory of the cluster cluster, and each will be The set of commuting regular trajectories of the clustering clusters serves as the user's regular trajectory.
如图5所示,本公开还提出一种用户通勤轨迹管理方法。As shown in FIG. 5, the present disclosure also proposes a user commute trajectory management method.
在步骤510中,进行数据预处理。In step 510, data preprocessing is performed.
数据预处理可以是将收集到的原始用户出行位置数据处理成特定的格式。以移动网络用户数据为例,可以从原始数据中抽取包括匿名处理后的用户标识、用户位置(基站小区编号需转换为基站位置的GPS坐标)以及通话时间的属性域。原始的通话数据通常是海量且冗余的,通过预处理筛选出一定的数据,可以减小数据量,提高后续处理的效率。在数据预处理中,可以将球面距离很近的基站进行合并,通过平滑处理抑制基站频繁切换,减少数据量,增强数据有效性。Data pre-processing can be to process the collected raw user travel location data into a specific format. Taking the mobile network user data as an example, the attribute data including the anonymously processed user identifier, the user location (the GPS coordinates of the base station cell number to be converted into the base station location), and the talk time may be extracted from the original data. The original call data is usually massive and redundant. By preprocessing and filtering out certain data, the amount of data can be reduced and the efficiency of subsequent processing can be improved. In data preprocessing, base stations with very close spherical distances can be combined, and smooth handover can be used to suppress frequent handover of base stations, reduce data volume, and enhance data validity.
在步骤520中,计算用户出行稳定性特征。In step 520, a user travel stability feature is calculated.
可选的,从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户时,可以根据每个用户的出行数据计算每个用户的出行地点离散熵,采用离散熵衡量用户出行稳定性,出行地点离散熵定义如下: Optionally, when the commuting users whose home location and work location are identifiable are selected from all users, the discrete entropy of each user's travel location may be calculated according to the travel data of each user, and the user's travel stability is measured by using discrete entropy. The discrete entropy of the travel location is defined as follows:
Figure PCTCN2017072697-appb-000001
Figure PCTCN2017072697-appb-000001
其中,p(Ri)为用户在位置Ri的经验概率,对于移动网络用户数据,Ri可以为基站号。出行地点离散熵越大,标识用户的运动规律性就越低。计算用户在工作日的出行位置离散熵,并按小时计算每个用户在工作日的出行地点离散熵。Where p(R i ) is the empirical probability of the user at the location R i , and for the mobile network user data, R i may be the base station number. The greater the discrete entropy of the travel location, the lower the regularity of the motion of the identified user. Calculate the discrete entropy of the user's travel location on weekdays, and calculate the discrete entropy of each user's travel location on weekdays by hour.
在步骤530中,提取城市通勤时段。In step 530, the city commute time period is extracted.
计算用户出行地点特征。定义一种数据结构path-pattern=<h_1,R_1><h_2,R_2>…<h_n,R_n>,其中h_i表示第i个时间片(预设将一天的24个小时均等划分为24个时间片,每个时间片1个小时),i=1,2,…,24,R_i表示用户第i个时间片的用户的位置,数据结构path-pattern可表示用户的移动轨迹。Calculate the characteristics of the user's travel location. Define a data structure path-pattern=<h_1,R_1><h_2,R_2>...<h_n,R_n>, where h_i represents the ith time slice (preset to divide the 24 hours of the day into 24 time slices , each time slice 1 hour), i = 1, 2, ..., 24, R_i represents the user's position of the i-th time slice, and the data structure path-pattern can represent the user's movement track.
可选的,采用24行7列的二维数组Pattern[24][7]描述用户出行地点特征,Pattern[24][7]表示以一周为周期每天24个小时每小时最频繁的停留地点,称为每小时停留地点数组。最频繁的停留地点可以是该时段内停留时间最长的地点。Optionally, a two-dimensional array of 24 rows and 7 columns, Pattern[24][7], describes the characteristics of the user's travel location, and Pattern[24][7] indicates the most frequent stopover time of 24 hours per day per week. It is called an array of hourly stops. The most frequent stopover location can be the location with the longest stay time during that time period.
可选的,采用二维数组Pattern_go[24][7]描述用户停留地点的变化,当前小时较上一小时停留地点相同时该小时元素值为0,当前小时与上一小时停留地点不相同时,该小时元素值为1。分析二维数组Pattern_go[24][7]可以了解工作日是一个星期的哪几天,可以了解通勤时段,即早高峰和晚高峰通勤出行时段。可选地,Pattern_go[24][7]中工作日24小时的统计数值分布呈现双峰,双峰时段分别对应早高峰和晚高峰通勤出行时段。早高峰和晚高峰通勤出行时段即是城市通勤时段。Optionally, the two-dimensional array Pattern_go[24][7] is used to describe the change of the user's stop location. When the current hour is the same as the previous hour, the hourly element value is 0, and the current hour is different from the previous hour. , the hour element value is 1. Analyze the two-dimensional array Pattern_go[24][7] to find out which days of the week the workday is, and to know the commute time, that is, the morning peak and the late peak commute travel time. Optionally, the statistical value distribution of the 24 hours of working days in Pattern_go[24][7] presents a double peak, and the double peak periods correspond to the morning peak and the late peak commuting time respectively. The morning and evening peak commute times are the city commuting hours.
在步骤540中,筛选通勤用户。In step 540, the commuter user is screened.
可以定义工作日的出行地点离散熵小于特定阈值的用户为通勤用户。A user who can define the travel date of the workday with a discrete entropy less than a certain threshold is a commute user.
在步骤550中,提取用户通勤特征。In step 550, the user commute feature is extracted.
用户通勤特征描述可以为{家庭位置,工作位置,上班路途占用时间,下班路途占用时间}的四元组。The user commute feature description can be a four-tuple of {home location, work location, time taken to work, time taken to work}.
针对上述步骤540中筛选出的通勤用户,可以计算通勤OD(Origin-Destination),在家庭位置与工作位置之间的通勤OD是一种出行模式。For the commute user selected in the above step 540, the OD (Origin-Destination) can be calculated, and the commute OD between the home location and the work location is a travel mode.
以通话数据为例,将每个用户的通话数据T分为两个集合,Tday和Tnight, 分别代表白天和夜晚的通话数据,其中,T={<手机号,通话基站,通话时间>},{}表示一个用户的至少一个通话数据,每个通话数据是一个包括手机号、通话基站和通话时间的三元组序列;分别对Tday和Tnight通话数据按照通话基站进行划分,每条通话记录都等同于一条定位记录;按通话次数将基站的从大到小排列,通过基站聚类把球面距离很近的基站进行合并,形成新的通话位置点;根据通话周期性,计算每个位置点的通话频繁度;根据通话频繁度,对位置点进行筛选,删除通话稀疏的位置点;将Tday和Tnight数据中通话频繁度最大的位置点作为D和O,即工作位置和家庭位置;以及输出每个用户的通勤OD,作为用户通勤特征中的家庭位置,工作位置。通话频繁度是指该用户发生在该位置点的通话次数占该用户总通话次数的百分比。Taking the call data as an example, the call data T of each user is divided into two sets, Tday and Tnight, Representing the day and night call data, respectively, where T={<mobile number, call base station, talk time>}, {} represents at least one call data of a user, and each call data is a mobile phone number, a call base station, and The triple sequence of the call time; the Tday and Tnight call data are respectively divided according to the call base station, and each call record is equivalent to a positioning record; the base stations are arranged from large to small according to the number of calls, and the spherical surface is clustered by the base station. The base stations that are close to each other are merged to form a new call location point; according to the call periodicity, the call frequency of each location point is calculated; according to the frequency of the call, the location points are filtered to delete the location where the call is sparse; And the location point where the call frequency is the most frequent among the Tnight data is D and O, that is, the work position and the home position; and the commute OD of each user is output as the home position and the work position in the user commute feature. Call frequency refers to the percentage of calls that the user has made at that location as a percentage of the total number of calls for that user.
将通勤用户中家庭位置和工作位置中至少一个无法识别的用户去掉。Remove at least one unrecognizable user from the home location and work location of the commuter user.
上班路途占用时间,可以指从经过预处理后的数据里计算通勤用户在工作日上班通勤时段内从家庭位置到工作位置所占用时间的平均值;下班路途占用时间,可以指从经过预处理后的数据里计算通勤用户在工作日下班通勤时段内从工作位置到家庭位置所占用时间的平均值。The time taken to work on the road may refer to the average value of the time taken by the commuter user from the home position to the work position during the commute time on the working day from the pre-processed data; the time taken after the work is taken from the pre-processed The data is used to calculate the average time taken by the commuter user from the working position to the home position during the commute time of the working day.
在步骤560中,检测非规律性轨迹。In step 560, an irregularity trajectory is detected.
针对同一个用户,可以在用户在上班通勤时段从家庭位置到工作位置的轨迹中检测并剔除少量非规律性的轨迹。For the same user, a small number of irregular trajectories can be detected and eliminated in the trajectory of the user from the home position to the work position during the commute time.
t为待测试的轨迹,T为待比较的轨迹集合,m为运行次数,ψ为每次运行的样本数量,数组n[m]用于统计每次运行得到的值,数组n[m]初始化为0。t is the trajectory to be tested, T is the set of trajectories to be compared, m is the number of runs, ψ is the number of samples per run, array n[m] is used to count the value obtained for each run, and array n[m] is initialized. Is 0.
处理步骤包括:n[m]=0;for i=1to m do;The processing steps include: n[m]=0; for i=1to m do;
Figure PCTCN2017072697-appb-000002
Figure PCTCN2017072697-appb-000002
把每个t的n[m]数组作为中间结果,进行运算:Take the n[m] array of each t as an intermediate result and perform the operation:
S=2-E((n(t))/C(N))S=2 -E((n(t))/C(N)) ;
其中C(N)=2H(N-1)-2(N-1)/N;←表示赋值。Where C(N)=2H(N-1)-2(N-1)/N; ← denotes an assignment.
H(i)=ln(i)+0.57721566,其中,0.57721566为欧拉常数以及/表示除法运算。H(i)=ln(i)+0.57721566, where 0.57721566 is the Euler constant and / represents the division operation.
经过此运算,S值越高表示越偏离于主流的通勤轨迹。S也称为轨迹异常系数,其中,轨迹异常系数S超过第六阈值的通勤轨迹是异常轨迹,可以删除异常轨迹。After this operation, the higher the S value, the more deviation from the mainstream commuter trajectory. S is also referred to as a trajectory anomaly coefficient, wherein the commute trajectory in which the trajectory anomaly coefficient S exceeds the sixth threshold is an abnormal trajectory, and the abnormal trajectory can be deleted.
在步骤570中,进行通勤轨迹聚类。In step 570, commutation trajectory clustering is performed.
经过上述非规律性轨迹检测得到的除非规律性轨迹之外的通勤轨迹记为集合A,计算集合A中每两个通勤轨迹之间的轨迹相似度并把轨迹相似度高的轨迹进行合并得到合并集合C,计算C中每一个聚类簇的通勤轨迹频繁点和每一个聚类簇的通勤轨迹数量占比。The commute trajectory except the regular trajectory obtained by the above-mentioned irregular trajectory detection is recorded as set A, and the trajectory similarity between each two commute trajectories in the set A is calculated and the trajectories with high trajectory similarity are combined and merged. Set C calculates the frequent points of the commute trajectory of each cluster cluster in C and the number of commute trajectories of each cluster cluster.
其中,计算上班通勤轨迹中轨迹相似度的算法描述包括:The algorithm description for calculating the trajectory similarity in the commute trajectory to work includes:
计算轨迹A与轨迹B的轨迹相似度Tdis(A,B,limit_len),即A途经的所有地点中,与轨迹B途经的地点的欧氏距离最小值小于第五阈值limit_len的个数a’,a’小于或等于A的元素个数;Calculating the trajectory similarity Tdis(A, B, limit_len) of the trajectory A and the trajectory B, that is, the minimum value of the Euclidean distance of the location passing through the trajectory B is less than the number a' of the fifth threshold limit_len, among all the locations where the A passes, a' is less than or equal to the number of elements of A;
计算轨迹B与轨迹A的轨迹相似度Tdis(B,A,limit_len),即B途经的所有地点中,与轨迹A途经的点的欧氏距离最小值小于limit_len的个数b’,b’小于或等于B的元素个数;以及Calculate the trajectory similarity Tdis(B, A, limit_len) of the trajectory B and the trajectory A, that is, among all the locations where the B passes, the minimum Euclidean distance of the point passing through the trajectory A is smaller than the number b' of the limit_len, b' is smaller than Or the number of elements equal to B;
计算轨迹A与轨迹B之间的轨迹相似度即为:(Tdis(A,B,limit_len)+Tdis(B,A,limit_len))/(a+b),轨迹相似度的取值范围在0和1之间。极端情况下,A、B轨迹重合或非常相似,轨迹相似度为1。The trajectory similarity between the calculated trajectory A and the trajectory B is: (Tdis(A, B, limit_len) + Tdis(B, A, limit_len))/(a+b), and the trajectory similarity ranges from 0. Between 1 and 1. In extreme cases, the A and B tracks are coincident or very similar, and the trajectory similarity is 1.
其中,a为轨迹A途经地点的个数(包含非相邻的相同地点),b为轨迹B途经地点的个数(包含非相邻的相同地点),limit_len为距离阈值,/表示除法运算。Where a is the number of trajectory A passing locations (including non-adjacent identical locations), b is the number of trajectory B passing locations (including non-adjacent identical locations), limit_len is the distance threshold, and / represents the division operation.
举例说明,如图6所示,轨迹A记为<a1,a2,a3,a4>,沿途经过4个地点;轨迹B记为<b1,b2,b3,b4,b5>,沿途经过5个地点;limit_len设置为2。a1距离轨迹B上的b1点距离最近,a1与b1点的距离为1;a2距离轨迹B上的b2点距 离最近,a2与b2点的距离为2;a3距离轨迹B上的b3点距离最近,a3与b3点的距离为3;a4距离轨迹B上的b3点距离最近,a4与b3点的距离为2;那么Tdis(A,B,limit_len)=1+1+0+1=3。b1距离轨迹A上的a1点距离最近,b1与a1点的距离为1;b2距离轨迹A上的a1点距离最近,b2与a1点的距离为1;b3距离轨迹A上的a4点距离最近,b3与a4点的距离为2;b4距离轨迹A上的a4点距离最近,b4与a4点的距离为8;b5距离轨迹A上的a4点距离最近,b5与a4点的距离为3;那么Tdis(B,A,limit_len)=1+1+1+0+0=3。轨迹A和轨迹B之间的轨迹相似度为(3+3)/(4+5)=0.667。For example, as shown in FIG. 6, the trajectory A is recorded as <a1, a2, a3, a4>, passing 4 locations along the way; the trajectory B is recorded as <b1, b2, b3, b4, b5>, passing 5 locations along the way ;limit_len is set to 2. A1 is closest to the distance b1 on the trajectory B, and the distance between the a1 and b1 points is 1; a2 is the distance from the b2 point on the trajectory B Recently, the distance between a2 and b2 is 2; a3 is closest to b3 on track B, the distance between a3 and b3 is 3; a4 is closest to b3 on track B, and the distance between a4 and b3 is 2; then Tdis (A, B, limit_len) = 1 + 1 + 0 + 1 = 3. B1 is closest to the distance a1 on the trajectory A, the distance between the b1 and the a1 point is 1; b2 is the closest distance from the a1 point on the trajectory A, the distance between the b2 and the a1 point is 1; b3 is the closest distance from the a4 point on the trajectory A , b3 and a4 point distance is 2; b4 is closest to the distance a4 on the track A, the distance between b4 and a4 is 8; b5 is closest to the distance a4 on the track A, and the distance between b5 and a4 is 3; Then Tdis(B, A, limit_len) = 1+1 + 1 + 0 + 0 = 3. The trajectory similarity between the trajectory A and the trajectory B is (3+3) / (4 + 5) = 0.667.
在步骤580中,匹配通勤轨迹地图。In step 580, the commute trajectory map is matched.
根据步骤570的C中每一个聚类簇的通勤轨迹频繁点、家庭位置和工作位置生成导航路径,提取导航路径途径点的GPS坐标,并计算导航路径的导航预估时长,计算聚类簇的代表通勤轨迹和对应的导航路径之间的轨迹相似度和时长差。若代表通勤轨迹和导航路径之间的轨迹相似度大于第二阈值,且代表通勤轨迹的时长与导航路径的导航预估时长之间的差值小于第三阈值,则导航路径为该用户通勤规律性轨迹;代表通勤轨迹和导航路径之间的轨迹相似度,以及代表通勤轨迹的时长与导航路径的导航预估时长之间的差值不满足上述条件,可以重新选择步骤570的C中聚类簇的通勤轨迹频繁点。若该聚类簇的通勤轨迹数量占比太小(比如小于0.1)也可以选择丢弃该聚类簇轨迹。According to the frequent trajectory of the commuting trajectory, the home position and the working position of each cluster cluster in step C, the navigation path is generated, the GPS coordinates of the navigation path approach point are extracted, and the navigation estimation duration of the navigation path is calculated, and the cluster cluster is calculated. Represents the trajectory similarity and duration difference between the commute trajectory and the corresponding navigation path. If the trajectory similarity between the commute trajectory and the navigation path is greater than the second threshold, and the difference between the duration of the commute trajectory and the navigation estimation duration of the navigation path is less than the third threshold, the navigation path is the commuting rule of the user. Sexual trajectory; representing the trajectory similarity between the commute trajectory and the navigation path, and the difference between the duration representing the commute trajectory and the navigation estimation duration of the navigation path does not satisfy the above condition, and the C clustering in step 570 can be reselected. The commuter trajectory of the cluster is frequent. If the number of commuting trajectories of the cluster cluster is too small (for example, less than 0.1), the cluster cluster trajectory may be discarded.
在步骤590中,生成用户规律性通勤轨迹。In step 590, a user regular commute trajectory is generated.
对于上述合并集合C中只有一个聚类簇时,得到的该聚类簇的通勤规律性轨迹可以是该用户的用户规律性轨迹。若上述合并集合C中有多个聚类簇时,计算每个聚类簇的通勤轨迹数量占比,将聚类簇的通勤轨迹数量占比作为聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹。When there is only one cluster cluster in the above merged set C, the commute regular trajectory of the obtained cluster cluster may be the user's regular trajectory of the user. If there are multiple cluster clusters in the merged set C, calculate the proportion of the number of commuting trajectories of each cluster cluster, and the proportion of the number of commuting trajectories of the cluster clusters as the weight of the commuting regular trajectory of the cluster clusters, The set of commuting regular trajectories of each cluster cluster is taken as the user regular trajectory.
现结合场景1和场景2说明本公开的应用。The application of the present disclosure will now be described in connection with scenario 1 and scenario 2.
在场景1中,采集移动网络位置信令数据(包括定期位置更新信令数据)之后,求解:家庭位置在A地和工作位置在B地的通勤用户中,上班的几条常见线路的客流量是多少。In scenario 1, after collecting the mobile network location signaling data (including the periodic location update signaling data), the solution is: the traffic of several common lines going to work in the commute users in the A location and the working location in the B location. how many.
本实施例提供了一种求解场景1中问题的方案,该方案包括:计算用户的通勤特征,筛选家庭位置为A,且工作位置为B的通勤用户集合users,计算得 出用户数量;提取集合users中每个用户的通勤轨迹,上班通勤轨迹是上班途径的基站的有序序列,通过基站小区编号和基站位置的GPS坐标的映射关系得到用户位置的GPS坐标;对集合users中每个用户,采用离群点检测算法检测出非规律性通勤轨迹并剔除;对集合users中每个用户Ui,对除非规律性通勤轨迹之外的通勤轨迹进行轨迹聚类和地图匹配,得到每个用户的带权重的1条或多条规律性轨迹Tra_i,该规律性轨迹Tra_i的集合记为Tra_users;按照上述的方法,对Tra_users轨迹集合进行轨迹聚类和地图匹配,得到带权重的1条或多条通勤规律性轨迹Tra_users_j,通勤规律性轨迹Tra_users_j的权重为对应聚类簇中通勤规律性轨迹的权重之和,通勤规律性轨迹Tra_users_j的集合即是从A到B的线路集合CC,CC中的每一条线路CCi就是Tra_users_j,每条线路的客流量就是Tra_users_j的权重。The embodiment provides a solution for solving the problem in the scenario 1, and the solution includes: calculating a commute feature of the user, screening a user of the commute user whose work position is A, and working position B, and calculating The number of users; the commute trajectory of each user in the collection user is extracted, the commute trajectory is an ordered sequence of the base station of the working route, and the GPS coordinates of the user location are obtained by the mapping relationship between the cell number of the base station and the GPS coordinates of the base station location; Each user in the user uses the outlier detection algorithm to detect the irregular commute trajectory and eliminates it; for each user Ui in the set users, the trajectory clustering and map matching are performed on the commute trajectory except the regular commute trajectory. Obtaining one or more regular trajectories Tra_i with weights for each user, the set of the regular trajectories Tra_i is recorded as Tra_users; according to the above method, trajectory clustering and map matching are performed on the Tra_users trajectory set to obtain weighted One or more commute regular trajectories Tra_users_j, the weight of the commute regular trajectory Tra_users_j is the sum of the weights of the commuting regular trajectories in the corresponding cluster clusters, and the set of commuting regular trajectories Tra_users_j is the line set CC from A to B Each line CCi in the CC is Tra_users_j, and the traffic of each line is the weight of Tra_users_j.
在场景2中,在采集了大量车辆的GPS数据,求解:上班经过一路段r的车辆,每小时车量占比,提取用户信息和规律性通勤信息,比如家庭位置分布是怎样的,工作位置分布是怎样的,通勤出行的习惯性线路是怎样的。r可能是一段高速路,也可能是一大桥或是其他的一段连通路段。In scene 2, the GPS data of a large number of vehicles is collected to solve the problem: the number of vehicles passing through a section r to work, the proportion of traffic per hour, extracting user information and regular commute information, such as the distribution of family locations, working position What is the distribution, and what is the habitual route of commuting. r may be a highway, or it may be a bridge or other connected section.
本实施例提供了一种求解场景2中问题的方案,该方案包括:筛选与路段r相关的路段集合R,计算R的经纬度范围Region_R;计算用户通勤特征;提取用户通勤轨迹,每个通勤轨迹分别对应一个小时时间标签,筛选上班通勤轨迹中包含Region_R的用户轨迹和用户,该用户轨迹集合记为T_R,该用户集合记为Users,对于Users中的每个用户Ui,记录符合条件的通勤轨迹的权重Ui_num,不在Users中的用户后续不再关注,用户集合Users中每个用户Ui的全部通勤轨迹记为T;经过统计即可得出每小时的车辆数量;提取Users中每个用户Ui的通勤特征则可得出特定的用户信息;获取用户的规律性通勤信息。,其中,获取用户的规律性通勤信息可以包括:对于用户集合Users和通勤轨迹T,对于每一个Ui,检测用户Ui的非规律性通勤轨迹;以及用户的通勤轨迹的聚类簇和地图匹配,得到用户带权重的1条或多条通勤规律性轨迹。This embodiment provides a solution for solving the problem in the scenario 2, the solution includes: screening the road segment set R related to the road segment r, calculating the latitude and longitude range of the R Region_R; calculating the user commute feature; extracting the user commute trajectory, each commute trajectory Corresponding to one hour time label, the user trajectory containing the Region_R and the user in the commute trajectory are filtered, and the user trajectory set is recorded as T_R, and the user set is recorded as Users. For each user Ui in Users, the eligible commuting trajectory is recorded. The weight Ui_num is not followed by the users in Users. All the commute tracks of each user Ui in the user set Users are recorded as T; after counting, the number of vehicles per hour can be obtained; and each user Ui in Users is extracted. The commute feature can derive specific user information; obtain regular commute information from the user. The obtaining the regular commute information of the user may include: detecting, for each Ui, an irregular commute trajectory of the user Ui for the user set User and the commute trajectory T; and clustering clusters and map matching of the user's commute trajectory, Get one or more commuter regular trajectories with weights from the user.
本公开提供了一种用户通勤轨迹管理方法,通过定位系统,如用户终端上的定位设备或者通信基站等获取用户的出行数据,根据用户出行数据计算用户规律性轨迹并输出,在该过程中,不用进行用户调研,增强了对用户通勤的管理力度,可以对用户通勤进行更全面了解和掌控,改善了用户的使用体验。 The present disclosure provides a user commute trajectory management method, which acquires travel data of a user through a positioning system, such as a positioning device or a communication base station on a user terminal, calculates a user regular trajectory according to user travel data, and outputs, in the process, Without user research, the management of user commuting is enhanced, and the user's commuting can be more fully understood and controlled, improving the user experience.
本公开还通过采集用户出行位置数据,提取用户通勤轨迹,通过离群点检测算法检测出非规律性通勤轨迹,对除非规律性通勤轨迹的通勤轨迹进行聚类和地图匹配,可以实现用户规律性轨迹的计算系统和方法。本公开实施例提供的技术方案,有利于对通勤路段进行精细化的管理,便于掌握一些关键性交通信息,比如通勤高峰时每个路段的负荷水平,比如一路段封闭、管制或者限行等将影响多少市民的出行,以及这些市民的分布情况等。The disclosure also collects the user travel location data, extracts the user commute trajectory, detects the irregular commute trajectory by the outlier detection algorithm, and performs the clustering and map matching on the commute trajectory of the regular commute trajectory, thereby realizing the user regularity. The calculation system and method of the trajectory. The technical solution provided by the embodiments of the present disclosure is beneficial to the fine management of the commuter road section, and is convenient for grasping some key traffic information, such as the load level of each road section during the peak of the commute, such as one road section closure, regulation or restriction, etc. How many citizens travel, and the distribution of these citizens.
本公开实施例还提供了一种非暂态计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令设置为执行上述任一实施例中的方法。Embodiments of the present disclosure also provide a non-transitory computer readable storage medium storing computer executable instructions arranged to perform the method of any of the above embodiments.
本公开实施例还提供了一种电子设备的硬件结构示意图。参见图7,该电子设备包括:The embodiment of the present disclosure further provides a hardware structure diagram of an electronic device. Referring to FIG. 7, the electronic device includes:
至少一个处理器(processor)70,图7中以一个处理器70为例;和存储器(memory)71,还可以包括通信接口(Communications Interface)72和总线73。其中,处理器70、通信接口72、存储器71可以通过总线73完成相互间的通信。通信接口72可以用于信息传输。处理器30可以调用存储器31中的逻辑指令,以执行上述实施例的方法。At least one processor 70, which is exemplified by a processor 70 in FIG. 7; and a memory 71, may further include a communication interface 72 and a bus 73. The processor 70, the communication interface 72, and the memory 71 can complete communication with each other through the bus 73. Communication interface 72 can be used for information transfer. Processor 30 may invoke logic instructions in memory 31 to perform the methods of the above-described embodiments.
此外,上述的存储器71中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。In addition, the logic instructions in the memory 71 described above may be implemented in the form of a software functional unit and sold or used as a stand-alone product, and may be stored in a computer readable storage medium.
存储器71作为一种计算机可读存储介质,可用于存储软件程序、计算机可执行程序,如本公开实施例中的方法对应的程序指令或模块。处理器70通过运行存储在存储器71中的软件程序、指令或模块,从而执行功能应用以及数据处理,即实现上述方法实施例中的方法。The memory 71 is a computer readable storage medium and can be used to store a software program, a computer executable program, a program instruction or a module corresponding to the method in the embodiment of the present disclosure. The processor 70 executes the functional application and the data processing by executing a software program, an instruction or a module stored in the memory 71, that is, implementing the method in the above method embodiment.
存储器71可包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序;存储数据区可存储根据终端设备的使用所创建的数据等。此外,存储器71可以包括高速随机存取存储器,还可以包括非易失性存储器。The memory 71 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to usage of the terminal device, and the like. Further, the memory 71 may include a high speed random access memory, and may also include a nonvolatile memory.
本公开的技术方案可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括一个或多个指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本公开实施例所述方法的全部或部分步骤。而前述的存储介质可以是非暂态存储介质,包括:U盘、移动硬盘、只读存储器(Read-only Memory,ROM)、随机存储存储器(Random-Access  Memory,RAM)、磁碟或者光盘等多种可以存储程序代码的介质,也可以是暂态存储介质。The technical solution of the present disclosure may be embodied in the form of a software product stored in a storage medium, including one or more instructions for causing a computer device (which may be a personal computer, a server, a network device, etc.) Performing all or part of the steps of the method of the embodiments of the present disclosure. The foregoing storage medium may be a non-transitory storage medium, including: a USB flash drive, a mobile hard disk, a read-only memory (ROM), and a random storage memory (Random-Access). A medium that can store program code, such as a memory, a RAM, or an optical disk, or a transient storage medium.
工业实用性Industrial applicability
本公开提供的用户通勤轨迹管理方法、装置及系统,通过定位系统获取用户的出行数据,根据用户出行数据计算用户规律性轨迹并输出,无需进行用户调研,增强了对用户通勤的管理力度,可以对用户通勤进行更全面的了解和掌控,改善了用户的使用体验。 The user commute trajectory management method, device and system provided by the disclosure obtain the travel data of the user through the positioning system, calculate the regular trajectory of the user according to the travel data of the user, and output, without user research, and enhance the management of the user's commuting, A more comprehensive understanding and control of user commuting has improved the user experience.

Claims (26)

  1. 一种用户通勤轨迹管理方法,包括:A user commute trajectory management method includes:
    通过定位系统获取每个用户的出行数据,其中,所述出行数据包括停留点及所述停留点对应的时间戳;Obtaining, by the positioning system, travel data of each user, where the travel data includes a stay point and a time stamp corresponding to the stay point;
    根据所述每个用户的出行数据计算所述每个用户的用户规律性轨迹;以及Calculating a regularity trajectory of the user of each user according to the travel data of each user;
    输出所述每个用户的用户规律性轨迹至通勤道路管理系统。The user's regular trajectory of each user is output to the commuter road management system.
  2. 如权利要求1所述的方法,其中,所述根据所述每个用户的出行数据计算所述每个用户的用户规律性轨迹包括:The method of claim 1, wherein the calculating the user regularity trajectory of each user according to the travel data of each user comprises:
    根据所述每个用户的出行数据计算通勤特征,所述通勤特征包括家庭位置、工作位置以及通勤时段;Calculating a commute feature according to the travel data of each user, the commute feature including a home location, a work location, and a commute time period;
    根据所述通勤特征的通勤时段、所述出行数据中的停留点及所述停留点对应的时间戳,筛选得到属于所述通勤时段的所述出行数据,将筛选得到的所述出行数据中的停留点按照时间戳排序生成通勤轨迹;And selecting, according to the commute period of the commute feature, the stay point in the travel data, and the timestamp corresponding to the stay point, the travel data that belongs to the commute time period, and the selected travel data in the travel data The stop points are sorted by time stamp to generate a commute trajectory;
    计算所述每个用户的每两个通勤轨迹之间的轨迹相似度,将所述每个用户的每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定每个聚类簇的代表通勤轨迹;Calculating a trajectory similarity between each two commute trajectories of each user, and generating a cluster cluster by commuting trajectory clustering between each two commute trajectories of each user that is greater than a first threshold And determining the representative commute trajectory of each cluster cluster;
    根据选择策略从所述每个聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据所述家庭位置、所述工作位置、所述通勤轨迹频繁点及所述通勤轨迹频繁点对应的时间戳生成导航路径;以及Selecting, according to the selection strategy, a frequent point of the commute trajectory from the stay points of all the commutation trajectories of each cluster cluster, according to the home location, the working position, the frequent points of the commute trajectory, and the frequent points of the commute trajectory The timestamp generates a navigation path;
    计算所述导航路径与所述代表通勤轨迹之间的轨迹相似度,若所述导航路径与所述代表通勤轨迹之间的轨迹相似度大于第二阈值,则将所述导航路径作为所述聚类簇的通勤规律性轨迹,根据所述聚类簇的通勤规律性轨迹生成所述用户规律性轨迹。Calculating a trajectory similarity between the navigation path and the representative commute trajectory, and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, using the navigation path as the convergence The commuting regularity trajectory of the clusters generates the regular trajectory of the user according to the commuting regular trajectory of the cluster cluster.
  3. 如权利要求2所述的方法,其中,所述通勤时段包括上班通勤时段及 下班通勤时段,所述通勤轨迹包括上班通勤轨迹及下班通勤轨迹,所述根据所述家庭位置、所述工作位置、所述通勤轨迹频繁点及所述通勤轨迹频繁点对应的时间戳生成导航路径包括:若所述聚类簇由所述上班通勤轨迹聚类生成,则生成以用户的所述家庭位置为起点、所述工作位置为终点以及依次通过所述通勤轨迹频繁点的上班导航路径;以及若所述聚类簇由所述下班通勤轨迹聚类生成,则生成以用户的所述工作位置为起点、所述家庭位置为终点以及依次通过所述通勤轨迹频繁点的下班导航路径。The method of claim 2 wherein said commuting time comprises commuting time to work and During the commute time, the commute trajectory includes a commute trajectory and an off-duty commute trajectory, and the navigation path is generated according to the home location, the working location, the frequent trajectory of the commute trajectory, and the time stamp corresponding to the frequent trajectory of the commute trajectory The method includes: if the clustering cluster is generated by the commuting trajectory trajectory, generating a working navigation path starting from the family position of the user, the working position being an end point, and frequently passing through the commuting trajectory; And if the clustering cluster is generated by the off-duty commute trajectory clustering, generating an off-duty navigation path starting from the working position of the user, the family location as an end point, and frequently passing through the commuting trajectory in turn.
  4. 如权利要求2所述的方法,所述方法还包括:若所述导航路径与所述代表通勤轨迹之间的轨迹相似度小于所述第二阈值,则重新选择所述通勤轨迹频繁点并生成新导航路径,计算并判断所述新导航路径与所述代表通勤轨迹之间的轨迹相似度是否大于所述第二阈值,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将所述新导航路径作为所述聚类簇的通勤规律性轨迹,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择所述通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值。The method of claim 2, further comprising: if the trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold, reselecting the frequent trajectory of the commute trajectory and generating a new navigation path, calculating and determining whether a trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, if a trajectory similarity between the new navigation path and the representative commute trajectory If the second threshold is greater than the second threshold, the new navigation path is used as a commute regular trajectory of the cluster cluster, if the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second The threshold is performed by repeating the step of reselecting the frequent points of the commute trajectory and generating a new navigation path until the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold.
  5. 如权利要求2所述的方法,所述方法还包括:计算所述导航路径的导航预估时长,在判断所述导航路径与所述代表通勤轨迹之间的轨迹相似度是否大于所述第二阈值时,判断所述导航路径的导航预估时长与所述代表通勤轨迹的时长差值是否小于第三阈值;以及若所述导航路径与所述代表通勤轨迹之间的轨迹相似度大于第二阈值,且所述导航路径的导航预估时长与所述代表通勤轨迹的时长差值小于第三阈值,则将所述导航路径作为所述聚类簇的通勤规律性轨迹。 The method of claim 2, the method further comprising: calculating a navigation estimation duration of the navigation path, determining whether a trajectory similarity between the navigation path and the representative commute trajectory is greater than the second And determining, by the threshold, whether a difference between the navigation estimation duration of the navigation path and the duration of the representative commute trajectory is less than a third threshold; and if the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second And a threshold value, and the difference between the navigation estimation duration of the navigation path and the duration of the representative commute trajectory is less than a third threshold, and the navigation path is used as a commuting regular trajectory of the cluster cluster.
  6. 如权利要求2所述的方法,其中,若所述聚类簇为多个,所述根据所述聚类簇的通勤规律性轨迹生成所述用户规律性轨迹包括:计算每个聚类簇的通勤轨迹数量占比,将所述聚类簇的通勤轨迹数量占比作为所述聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若所述聚类簇为一个,所述根据所述聚类簇的通勤规律性轨迹生成所述用户规律性轨迹包括:将所述聚类簇的通勤规律性轨迹作为所述用户规律性轨迹。The method according to claim 2, wherein if the cluster cluster is plural, the generating the user regularity trajectory according to the commuting regular trajectory of the cluster cluster comprises: calculating each cluster cluster The number of commute trajectories is proportional, the number of commuting trajectories of the cluster clusters is taken as the weight of the commuting regular trajectory of the cluster clusters, and the set of commuting regular trajectories of each cluster cluster is taken as the user regularity. a trajectory; and if the cluster cluster is one, the generating the user regular trajectory according to the commuting regular trajectory of the cluster cluster comprises: using the commutation regular trajectory of the cluster cluster as the user rule Sexuality.
  7. 如权利要求6所述的方法,其中,若所述聚类簇为多个,所述方法还包括:删除通勤轨迹数量占比小于第四阈值的聚类簇。The method according to claim 6, wherein if the cluster of clusters is plural, the method further comprises: deleting cluster clusters in which the number of commutation trajectories is less than a fourth threshold.
  8. 如权利要求2所述的方法,其中,所述确定每个聚类簇的代表通勤轨迹包括:逐一计算所述聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将所述轨迹相似度之和最大的通勤轨迹作为所述代表通勤轨迹。The method of claim 2, wherein said determining a representative commute trajectory of each cluster cluster comprises: calculating a sum of trajectory similarities between each commute trajectory and other commute trajectories in said cluster cluster one by one, The commute trajectory having the largest sum of the trajectories of the trajectories is taken as the representative commute trajectory.
  9. 如权利要求2所述的方法,其中,所述轨迹之间的轨迹相似度的计算包括:The method of claim 2 wherein the calculation of the trajectory similarity between the trajectories comprises:
    计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;Calculating the trajectory similarity between the trajectory A and the trajectory B is the number a' of the distance minimum from the point of the trajectory B path among all the locations of the trajectory A path;
    计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及Calculating the trajectory similarity between the trajectory B and the trajectory A is the number b' of the minimum value of the distance from the point of the trajectory A path among all the points of the trajectory B path;
    计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。The trajectory similarity between the calculated trajectory A and the trajectory B is (a'+b')/(a+b), a is the number of locations of the trajectory A pathway, b is the number of locations of the trajectory A pathway, and / represents division Operation.
  10. 如权利要求2所述的方法,在生成通勤轨迹之后,所述方法还包括:通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。 The method of claim 2, after generating the commute trajectory, the method further comprising: calculating a trajectory anomaly coefficient by an outlier detection algorithm, and deleting a commute trajectory of the trajectory anomaly coefficient greater than a sixth threshold.
  11. 如权利要求2至10任一项所述的方法,在生成通勤轨迹之前,所述方法还包括:从所有用户中筛选出所述家庭位置和所述工作位置均可识别的通勤用户,生成所述可识别的通勤用户的通勤轨迹。The method according to any one of claims 2 to 10, before the generating a commute trajectory, the method further comprises: selecting, from all users, the commuter user that is identifiable by the home location and the work location, generating a location The traversable trajectory of the identifiable commuter user.
  12. 如权利要求11所述的方法,其中,所述从所有用户中筛选出家庭位置和工作位置均可识别的通勤用户包括:根据每个用户的出行数据计算所述每个用户的出行地点离散熵,将出行地点离散熵小于第七阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别所述每个通勤用户的家庭位置和工作位置,删除所述家庭位置和所述工作位置中至少一个不可识别的通勤用户。The method of claim 11, wherein the screening of the commute users identifiable from the home location and the work location from all the users comprises: calculating the discrete entropy of the travel location of each of the users according to the travel data of each user And identifying, as a commute user, a user whose travel location has a discrete entropy smaller than a seventh threshold; and obtaining and identifying, according to travel data of each commute user, a home location and a work location of each commute user, deleting the home location and the At least one unrecognizable commuter user in the work location.
  13. 一种用户通勤轨迹管理装置,包括:A user commute trajectory management device includes:
    获取模块,设置为通过定位系统获取每个用户的出行数据,其中,所述出行数据包括停留点及所述停留点对应的时间戳;Obtaining a module, configured to acquire travel data of each user by using a positioning system, where the travel data includes a stay point and a time stamp corresponding to the stay point;
    处理模块,设置为根据所述每个用户的出行数据计算所述每个用户的用户规律性轨迹;以及a processing module, configured to calculate a user regularity trajectory of each user according to the travel data of each user;
    输出模块,设置为输出所述每个用户的用户规律性轨迹至通勤道路管理系统。The output module is configured to output the regularity trajectory of each user of the user to the commuter road management system.
  14. 如权利要求13所述的装置,其中,所述处理模块包括:The apparatus of claim 13 wherein said processing module comprises:
    通勤特征计算子模块,设置为根据所述每个用户的出行数据计算通勤特征,所述通勤特征包括家庭位置、工作位置以及通勤时段;a commute feature calculation sub-module, configured to calculate a commute feature according to the travel data of each user, where the commute feature includes a home location, a work location, and a commute time period;
    通勤轨迹生成子模块,设置为根据所述通勤特征的通勤时段、所述出行数据中的停留点及所述停留点对应的时间戳,筛选得到属于所述通勤时段的所述出行数据,将筛选得到的所述出行数据中的停留点按照时间戳排序生成通勤轨迹; The commuting trajectory generating sub-module is configured to: according to the commute period of the commute feature, the stay point in the travel data, and the timestamp corresponding to the stay point, filter the travel data belonging to the commute period, and filter The obtained staying points in the travel data are sorted by time stamp to generate a commute trajectory;
    通勤聚类管理子模块,设置为计算所述每个用户的每两个通勤轨迹之间的轨迹相似度,将所述每个用户的每两个通勤轨迹之间的轨迹相似度大于第一阈值的通勤轨迹聚类生成聚类簇,并确定每个聚类簇的代表通勤轨迹;a commuting cluster management sub-module, configured to calculate a trajectory similarity between each two commute trajectories of each user, and the trajectory similarity between each two commute trajectories of each user is greater than a first threshold The commute trajectory clustering generates cluster clusters and determines representative commuter trajectories of each cluster cluster;
    导航路径生成子模块,设置为根据选择策略从所述每个聚类簇所有通勤轨迹的停留点中选择通勤轨迹频繁点,根据所述家庭位置、所述工作位置、所述通勤轨迹频繁点及所述通勤轨迹频繁点对应的时间戳生成导航路径;以及a navigation path generating submodule, configured to select a frequent point of the commute trajectory from the stay points of all the commuting trajectories of each cluster cluster according to the selection policy, according to the home location, the working location, the frequent points of the commute trajectory, and Generating a navigation path by a timestamp corresponding to a frequent point of the commute trajectory;
    通勤规律轨迹子模块,设置为计算所述导航路径与所述代表通勤轨迹之间的轨迹相似度,若所述导航路径与所述代表通勤轨迹之间的轨迹相似度大于第二阈值,则将所述导航路径作为所述聚类簇的通勤规律性轨迹,根据所述聚类簇的通勤规律性轨迹生成所述用户规律性轨迹。a commutation regular trajectory sub-module, configured to calculate a trajectory similarity between the navigation path and the representative commute trajectory, if a trajectory similarity between the navigation path and the representative commute trajectory is greater than a second threshold, The navigation path is used as a commuting regular trajectory of the cluster cluster, and the user regular trajectory is generated according to the commuting regular trajectory of the cluster cluster.
  15. 如权利要求14所述的装置,其中,所述通勤时段包括上班通勤时段及下班通勤时段,所述通勤轨迹包括上班通勤轨迹及下班通勤轨迹,所述导航路径生成子模块设置为,若所述聚类簇由所述上班通勤轨迹聚类生成,则生成以用户的所述家庭位置为起点、所述工作位置为终点以及依次通过所述通勤轨迹频繁点的上班导航路径;以及若所述聚类簇由所述下班通勤轨迹聚类生成,则生成以用户的所述工作位置为起点、所述家庭位置为终点以及依次通过所述通勤轨迹频繁点的下班导航路径。The device of claim 14, wherein the commute period includes a commute time and an off-duty commute, the commute trajectory includes a commute trajectory and an off-duty trajectory, and the navigation path generation sub-module is set to, if Generating cluster clusters by the commuting trajectory trajectory, generating a work navigation path starting from the user's home position, the work position being an end point, and frequently passing through the commute trajectory; and The cluster is generated by the off-duty commute trajectory clustering, and the off-duty navigation path is generated starting from the working position of the user, the home location as the end point, and the frequent passing point through the commute trajectory.
  16. 如权利要求14所述的装置,其中,所述通勤规律轨迹子模块还设置为若所述导航路径与所述代表通勤轨迹之间的轨迹相似度小于所述第二阈值,则触发所述导航路径生成子模块重新选择所述通勤轨迹频繁点并生成新导航路径,计算并判断所述新导航路径与所述代表通勤轨迹之间的轨迹相似度是否大于所述第二阈值,若所述新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值,则将所述新导航路径作为所述聚类簇的通勤规律性轨迹,若 所述新导航路径与所述代表通勤轨迹之间的轨迹相似度不大于所述第二阈值,则循环执行重新选择所述通勤轨迹频繁点并生成新导航路径的步骤,直至新导航路径与所述代表通勤轨迹之间的轨迹相似度大于所述第二阈值。The apparatus of claim 14, wherein the commute regular trajectory sub-module is further configured to trigger the navigation if a trajectory similarity between the navigation path and the representative commute trajectory is less than the second threshold The path generation sub-module reselects the frequent points of the commute trajectory and generates a new navigation path, and calculates and determines whether the trajectory similarity between the new navigation path and the representative commute trajectory is greater than the second threshold, if the new If the trajectory similarity between the navigation path and the representative commute trajectory is greater than the second threshold, the new navigation path is used as a commuting regular trajectory of the cluster cluster, if If the trajectory similarity between the new navigation path and the representative commute trajectory is not greater than the second threshold, the step of reselecting the frequent points of the commute trajectory and generating a new navigation path is performed cyclically until the new navigation path and the location The trajectory similarity between the representative commute trajectories is greater than the second threshold.
  17. 如权利要求14所述的装置,其中,所述导航路径生成子模块还设置为计算所述导航路径的导航预估时长,所述通勤规律轨迹子模块还设置为在判断所述导航路径与所述代表通勤轨迹之间的轨迹相似度是否大于所述第二阈值时,判断所述导航路径的导航预估时长与所述代表通勤轨迹的时长差值是否小于第三阈值;以及若所述导航路径与所述代表通勤轨迹之间的轨迹相似度大于第二阈值、且所述导航路径的导航预估时长与所述代表通勤轨迹的时长差值小于第三阈值,则将所述导航路径作为所述聚类簇的通勤规律性轨迹。The apparatus according to claim 14, wherein the navigation path generation sub-module is further configured to calculate a navigation estimation duration of the navigation path, and the commute regular trajectory sub-module is further configured to determine the navigation path and the location Determining whether the difference between the navigation estimation duration of the navigation path and the duration of the representative commute trajectory is less than a third threshold when the trajectory similarity between the commute trajectories is greater than the second threshold; and if the navigation is And the trajectory similarity between the path and the representative commute trajectory is greater than a second threshold, and the difference between the navigation estimation duration of the navigation path and the duration of the representative commute trajectory is less than a third threshold, and the navigation path is used as The commuting regular trajectory of the cluster cluster.
  18. 如权利要求14所述的装置,其中,若所述聚类簇为多个,所述通勤规律轨迹子模块设置为计算每个聚类簇的通勤轨迹数量占比,将所述聚类簇的通勤轨迹数量占比作为所述聚类簇的通勤规律性轨迹的权重,将每个聚类簇的通勤规律性轨迹组成的集合作为用户规律性轨迹;以及若所述聚类簇为一个,所述通勤规律轨迹子模块设置为将所述聚类簇的通勤规律性轨迹作为所述用户规律性轨迹。The apparatus according to claim 14, wherein if the clustering cluster is plural, the commuting regular trajectory sub-module is configured to calculate a proportion of the number of commuting trajectories of each cluster cluster, and the clustering cluster The number of commuting trajectories is used as the weight of the commuting regular trajectory of the cluster cluster, and the set of the commuting regular trajectories of each cluster cluster is taken as the user regular trajectory; and if the cluster cluster is one, The commute regular trajectory sub-module is configured to use the commuting regular trajectory of the cluster cluster as the user regular trajectory.
  19. 如权利要求18所述的装置,其中,若所述聚类簇为多个,所述通勤规律轨迹子模块还设置为删除通勤轨迹数量占比小于第四阈值的聚类簇。The apparatus according to claim 18, wherein if the clustering cluster is plural, the commuting regular trajectory sub-module is further configured to delete a cluster cluster whose occupation trajectory number is smaller than a fourth threshold.
  20. 如权利要求14所述的装置,其中,所述通勤聚类管理子模块设置为逐一计算所述聚类簇中每个通勤轨迹与其他通勤轨迹之间的轨迹相似度之和,将所述轨迹相似度之和最大的通勤轨迹作为所述代表通勤轨迹。The apparatus according to claim 14, wherein said commuter cluster management sub-module is configured to calculate a sum of trajectory similarities between each commute trajectory and other commute trajectories in said cluster cluster one by one, said trajectory The commute trajectory with the largest sum of similarities serves as the representative commute trajectory.
  21. 如权利要求14所述的装置,其中,所述处理模块还包括轨迹相似度计算子模块,设置为计算轨迹A与轨迹B的轨迹相似度为轨迹A途径的所有地 点中,与轨迹B途径的点的距离最小值小于第五阈值的个数a’;计算轨迹B与轨迹A的轨迹相似度为轨迹B途径的所有地点中,与轨迹A途径的点的距离最小值小于第五阈值的个数b’;以及计算轨迹A与轨迹B之间的轨迹相似度为(a’+b’)/(a+b),a为轨迹A途径的地点个数,b为轨迹A途径的地点个数,/表示除法运算。The apparatus of claim 14, wherein the processing module further comprises a trajectory similarity calculation sub-module configured to calculate a trajectory similarity between the trajectory A and the trajectory B as all the trajectory A paths In the point, the minimum distance from the point of the trajectory B path is smaller than the number a' of the fifth threshold; the trajectory similarity between the calculated trajectory B and the trajectory A is the distance from the point of the trajectory A path among all the locations of the trajectory B path The minimum value is less than the number b' of the fifth threshold; and the trajectory similarity between the calculated trajectory A and the trajectory B is (a'+b')/(a+b), where a is the number of locations of the trajectory A path, b is the number of locations of the trajectory A route, and / represents the division operation.
  22. 如权利要求14所述的装置,其中,所述通勤轨迹生成子模块在生成通勤轨迹之后,设置为通过离群点检测算法计算轨迹异常系数,删除所述轨迹异常系数大于第六阈值的通勤轨迹。The apparatus according to claim 14, wherein the commute trajectory generation sub-module is configured to calculate a trajectory abnormality coefficient by an outlier detection algorithm after the commutation trajectory is generated, and delete a commute trajectory whose trajectory anomaly coefficient is greater than a sixth threshold .
  23. 如权利要求14至22任一项所述的装置,其中,所述通勤轨迹生成子模块在生成通勤轨迹之前,设置为从所有用户中筛选出所述家庭位置和所述工作位置均可识别的通勤用户,生成所述可识别的通勤用户的通勤轨迹。The apparatus according to any one of claims 14 to 22, wherein the commute trajectory generation sub-module is configured to filter out, from all users, the home location and the work location to be identifiable before generating a commute trajectory The commuter user generates a commute trajectory of the identifiable commute user.
  24. 如权利要求23所述的装置,其中,所述通勤轨迹生成子模块设置为根据每个用户的出行数据计算所述每个用户的出行地点离散熵,将出行地点离散熵小于第七阈值的用户作为通勤用户;以及获取并根据每个通勤用户的出行数据,识别所述每个通勤用户的家庭位置和工作位置,删除所述家庭位置和所述工作位置中至少一个不可识别的通勤用户。The apparatus of claim 23, wherein the commute trajectory generation sub-module is configured to calculate, according to each user's travel data, a discrete entropy of the travel location of each user, and a user whose travel location discrete entropy is less than a seventh threshold As a commuter user; and obtaining and according to the travel data of each commute user, identifying the home location and the work location of each commute user, deleting at least one unrecognizable commuter user of the home location and the work location.
  25. 一种用户通勤轨迹管理系统,包括:定位系统、通勤道路管理系统以及如权利要求13至24任一项所述的用户通勤轨迹管理装置;其中,所述定位系统设置为监听用户的出行数据,所述出行数据包括停留点及所述停留点对应的时间戳;所述用户通勤轨迹管理装置设置为通过所述定位系统获取每个用户的出行数据,根据所述每个用户的出行数据计算所述每个用户的用户规律性轨迹以及输出所述每个用户的用户规律性轨迹至所述通勤道路管理系统;以及所述通勤道路管理系统设置为根据所述用户规律性轨迹管理通勤道路。 A user commute trajectory management system, comprising: a positioning system, a commuter road management system, and the user commute trajectory management device according to any one of claims 13 to 24; wherein the positioning system is configured to monitor user travel data, The travel data includes a stay point and a time stamp corresponding to the stay point; the user commute trajectory management device is configured to acquire travel data of each user through the positioning system, and calculate the travel data according to the travel data of each user. Describe the user regularity trajectory of each user and output the user regularity trajectory of each user to the commute road management system; and the commuter road management system is configured to manage the commute road according to the user regular trajectory.
  26. 一种非暂态计算机可读存储介质,存储有计算机可执行指令,所述计算机可执行指令设置为执行权利要求1-12中任一项的用户通勤轨迹管理方法。 A non-transitory computer readable storage medium storing computer executable instructions arranged to perform the user commute trajectory management method of any of claims 1-12.
PCT/CN2017/072697 2016-02-03 2017-01-25 User commuter track management method, device and system WO2017133627A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610077984.2A CN107038168A (en) 2016-02-03 2016-02-03 A kind of user's commuting track management method, apparatus and system
CN201610077984.2 2016-02-03

Publications (1)

Publication Number Publication Date
WO2017133627A1 true WO2017133627A1 (en) 2017-08-10

Family

ID=59499386

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/072697 WO2017133627A1 (en) 2016-02-03 2017-01-25 User commuter track management method, device and system

Country Status (2)

Country Link
CN (1) CN107038168A (en)
WO (1) WO2017133627A1 (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109405840A (en) * 2017-08-18 2019-03-01 中兴通讯股份有限公司 Map data updating method, server and computer readable storage medium
CN110222873A (en) * 2019-05-14 2019-09-10 重庆邮电大学 A kind of subway station passenger flow forecast method based on big data
CN110909037A (en) * 2019-10-09 2020-03-24 中国人民解放军战略支援部队信息工程大学 Frequent track mode mining method and device
CN111143779A (en) * 2019-12-31 2020-05-12 斑马网络技术有限公司 Method and device for obtaining commuting rule, electronic equipment and storage medium
CN111380547A (en) * 2018-12-29 2020-07-07 沈阳美行科技有限公司 Mature path track determining method and device, computer equipment and storage medium
CN111414437A (en) * 2019-01-08 2020-07-14 阿里巴巴集团控股有限公司 Method and device for generating line track
CN111784742A (en) * 2020-06-29 2020-10-16 杭州海康威视数字技术股份有限公司 Cross-lens tracking method and device for pedestrians
CN111859167A (en) * 2019-04-29 2020-10-30 北京四维图新科技股份有限公司 Path acquisition method, device, system and storage medium
CN112541012A (en) * 2020-01-02 2021-03-23 北京融信数联科技有限公司 City position balance demographic method based on mobile communication big data and entropy value
CN113190769A (en) * 2021-05-08 2021-07-30 北京百度网讯科技有限公司 Commuting characteristic data determination method and device, electronic equipment and storage medium
CN113724494A (en) * 2021-07-30 2021-11-30 东南大学 Customized bus demand area identification method
CN113984077A (en) * 2021-10-26 2022-01-28 武汉大学 Method and system for acquiring power inspection navigation path
CN114882066A (en) * 2022-07-11 2022-08-09 浙江大华技术股份有限公司 Target tracking method and related device, electronic equipment and storage medium
US11442133B2 (en) * 2018-03-21 2022-09-13 Sigfox Method and system for geolocation of group-evolving terminals
CN111859167B (en) * 2019-04-29 2024-04-30 北京四维图新科技股份有限公司 Path acquisition method, device, system and storage medium

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107121690B (en) * 2017-03-31 2019-05-28 景致惠通工程咨询(武汉)有限公司 A kind of dwell point recognition methods and device based on parameter of doing more physical exercises
CN107958031B (en) * 2017-11-20 2021-07-06 上海市城市建设设计研究总院(集团)有限公司 Resident travel OD distribution extraction method based on fusion data
CN108256733A (en) * 2017-12-12 2018-07-06 北京荣之联科技股份有限公司 The judgment method and device of operational vehicle
CN108647735B (en) * 2018-05-15 2021-01-12 广州杰赛科技股份有限公司 User travel rule analysis method, device, equipment and storage medium
CN109413587A (en) * 2018-09-20 2019-03-01 广州纳斯威尔信息技术有限公司 User trajectory prediction technique based on WiFi log
CN110751531A (en) * 2018-11-13 2020-02-04 北京嘀嘀无限科技发展有限公司 Track identification method and device and electronic equipment
CN111369787A (en) * 2018-12-26 2020-07-03 杭州海康威视系统技术有限公司 Vehicle track prediction method and device and electronic equipment
CN111380541B (en) * 2018-12-29 2022-09-13 沈阳美行科技股份有限公司 Interest point determination method and device, computer equipment and storage medium
CN109948070B (en) * 2019-03-13 2022-08-09 深圳市同行者科技有限公司 Method for analyzing and determining positions of home and company, storage medium and terminal
CN110084308B (en) * 2019-04-30 2022-05-17 杭州远眺科技有限公司 Method for constructing commuting path based on vehicle track analysis
CN111341135B (en) * 2019-12-24 2021-05-14 中山大学 Mobile phone signaling data travel mode identification method based on interest points and navigation data
CN111310834A (en) * 2020-02-19 2020-06-19 深圳市商汤科技有限公司 Data processing method and device, processor, electronic equipment and storage medium
CN111432344B (en) * 2020-03-27 2020-12-08 杭州华量软件有限公司 Fishing boat space-time trajectory correction method and device based on adjacent ships
CN112639764A (en) * 2020-03-31 2021-04-09 华为技术有限公司 Automobile travel rule analysis method, server and system
CN111640303B (en) 2020-05-29 2021-09-28 青岛大学 City commuting path identification method and equipment
WO2022041262A1 (en) * 2020-08-31 2022-03-03 苏州大成电子科技有限公司 Big data-based method for calculating anchor point of urban rail transit user
CN113556698A (en) * 2021-07-21 2021-10-26 智慧足迹数据科技有限公司 Information clustering method, device, equipment and medium based on mobile phone signaling
CN114117261B (en) * 2022-01-29 2022-04-22 腾讯科技(深圳)有限公司 Track detection method and device, electronic equipment and storage medium
CN115440040B (en) * 2022-09-02 2023-09-22 重庆大学 Commuter vehicle identification method based on expressway traffic data
CN117332376B (en) * 2023-12-01 2024-02-27 北京航空航天大学 Method and system for identifying commuter and mode based on mobile phone signaling data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101964148A (en) * 2009-07-24 2011-02-02 日立(中国)研究开发有限公司 Road traffic information recording server and GPS (Global Positioning System) user terminal
CN102595323A (en) * 2012-03-20 2012-07-18 北京交通发展研究中心 Method for obtaining resident travel characteristic parameter based on mobile phone positioning data
CN102611985A (en) * 2011-01-24 2012-07-25 国际商业机器公司 Method and equipment used for providing travelling information
CN102629297A (en) * 2012-03-06 2012-08-08 北京建筑工程学院 Traveler activity rule analysis method based on stroke recognition
CN103279802A (en) * 2013-04-17 2013-09-04 吉林大学 Method for predicting daily activity-travel time of commuter
CN104270714A (en) * 2014-09-11 2015-01-07 华为技术有限公司 Method and device for determining action trails of user

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101964148A (en) * 2009-07-24 2011-02-02 日立(中国)研究开发有限公司 Road traffic information recording server and GPS (Global Positioning System) user terminal
CN102611985A (en) * 2011-01-24 2012-07-25 国际商业机器公司 Method and equipment used for providing travelling information
CN102629297A (en) * 2012-03-06 2012-08-08 北京建筑工程学院 Traveler activity rule analysis method based on stroke recognition
CN102595323A (en) * 2012-03-20 2012-07-18 北京交通发展研究中心 Method for obtaining resident travel characteristic parameter based on mobile phone positioning data
CN103279802A (en) * 2013-04-17 2013-09-04 吉林大学 Method for predicting daily activity-travel time of commuter
CN104270714A (en) * 2014-09-11 2015-01-07 华为技术有限公司 Method and device for determining action trails of user

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109405840A (en) * 2017-08-18 2019-03-01 中兴通讯股份有限公司 Map data updating method, server and computer readable storage medium
CN109405840B (en) * 2017-08-18 2023-09-08 中兴通讯股份有限公司 Map data updating method, server and computer readable storage medium
US11442133B2 (en) * 2018-03-21 2022-09-13 Sigfox Method and system for geolocation of group-evolving terminals
CN111380547A (en) * 2018-12-29 2020-07-07 沈阳美行科技有限公司 Mature path track determining method and device, computer equipment and storage medium
CN111414437A (en) * 2019-01-08 2020-07-14 阿里巴巴集团控股有限公司 Method and device for generating line track
CN111414437B (en) * 2019-01-08 2023-06-20 阿里巴巴集团控股有限公司 Method and device for generating line track
CN111859167B (en) * 2019-04-29 2024-04-30 北京四维图新科技股份有限公司 Path acquisition method, device, system and storage medium
CN111859167A (en) * 2019-04-29 2020-10-30 北京四维图新科技股份有限公司 Path acquisition method, device, system and storage medium
CN110222873B (en) * 2019-05-14 2023-10-31 北京城建智控科技股份有限公司 Subway station passenger flow volume prediction method based on big data
CN110222873A (en) * 2019-05-14 2019-09-10 重庆邮电大学 A kind of subway station passenger flow forecast method based on big data
CN110909037B (en) * 2019-10-09 2024-02-13 中国人民解放军战略支援部队信息工程大学 Frequent track mode mining method and device
CN110909037A (en) * 2019-10-09 2020-03-24 中国人民解放军战略支援部队信息工程大学 Frequent track mode mining method and device
CN111143779A (en) * 2019-12-31 2020-05-12 斑马网络技术有限公司 Method and device for obtaining commuting rule, electronic equipment and storage medium
CN111143779B (en) * 2019-12-31 2023-07-25 斑马网络技术有限公司 Method, device, electronic equipment and storage medium for obtaining commute rule
CN112541012A (en) * 2020-01-02 2021-03-23 北京融信数联科技有限公司 City position balance demographic method based on mobile communication big data and entropy value
CN111784742B (en) * 2020-06-29 2023-08-29 杭州海康威视数字技术股份有限公司 Pedestrian cross-lens tracking method and device
CN111784742A (en) * 2020-06-29 2020-10-16 杭州海康威视数字技术股份有限公司 Cross-lens tracking method and device for pedestrians
CN113190769B (en) * 2021-05-08 2023-09-19 北京百度网讯科技有限公司 Communication characteristic data determining method, device, electronic equipment and storage medium
CN113190769A (en) * 2021-05-08 2021-07-30 北京百度网讯科技有限公司 Commuting characteristic data determination method and device, electronic equipment and storage medium
CN113724494B (en) * 2021-07-30 2022-06-07 东南大学 Customized bus demand area identification method
CN113724494A (en) * 2021-07-30 2021-11-30 东南大学 Customized bus demand area identification method
CN113984077A (en) * 2021-10-26 2022-01-28 武汉大学 Method and system for acquiring power inspection navigation path
CN114882066B (en) * 2022-07-11 2022-11-22 浙江大华技术股份有限公司 Target tracking method and related device, electronic equipment and storage medium
CN114882066A (en) * 2022-07-11 2022-08-09 浙江大华技术股份有限公司 Target tracking method and related device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN107038168A (en) 2017-08-11

Similar Documents

Publication Publication Date Title
WO2017133627A1 (en) User commuter track management method, device and system
CN107305590B (en) Urban traffic trip characteristic determination method based on mobile phone signaling data
Alexander et al. Origin–destination trips by purpose and time of day inferred from mobile phone data
Trasarti et al. Mining mobility user profiles for car pooling
CN110868689B (en) Standing population determining method and device and electronic equipment
KR102097426B1 (en) Analysis method of fluidized population information capable of providing real-time fluidized population data by pcell algorithm
Caceres et al. Traffic flow estimation models using cellular phone data
Papandrea et al. On the properties of human mobility
CN108415975B (en) BDCH-DBSCAN-based taxi passenger carrying hot spot identification method
Fekih et al. A data-driven approach for origin–destination matrix construction from cellular network signalling data: a case study of Lyon region (France)
CN104902438B (en) A kind of statistical method and its system based on mobile communication terminal analysis passenger flow characteristic information
Fiadino et al. Steps towards the extraction of vehicular mobility patterns from 3G signaling data
US8843157B2 (en) Dynamic travel behavior estimation in mobile network
EP2608181B1 (en) Method for detecting traffic
Lin et al. Vehicle re-identification with dynamic time windows for vehicle passage time estimation
CN112770265B (en) Pedestrian identity information acquisition method, system, server and storage medium
Sinnott et al. Accident black spot identification and verification through social media
Guo et al. An algorithm for analyzing the city residents' activity information through mobile big data mining
Chen et al. A travel mode identification framework based on cellular signaling data
Batran et al. Urban travel time estimation in greater maputo using mobile phone big data
Dash et al. CDR-To-MoVis: Developing a mobility visualization system from CDR data
Tomaras et al. An efficient technique for event location identification using multiple sources of urban data
Yadlowsky et al. Link density inference from cellular infrastructure
Schmitt et al. A Method to Suggest Alternative Routes Based on Analysis of Automobiles' Trajectories
Saliba et al. Vehicular traffic flow intensity detection and prediction through mobile data usage

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17746947

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17746947

Country of ref document: EP

Kind code of ref document: A1