CN104615881A - User normal track analysis method based on movable position application - Google Patents

User normal track analysis method based on movable position application Download PDF

Info

Publication number
CN104615881A
CN104615881A CN201510050458.2A CN201510050458A CN104615881A CN 104615881 A CN104615881 A CN 104615881A CN 201510050458 A CN201510050458 A CN 201510050458A CN 104615881 A CN104615881 A CN 104615881A
Authority
CN
China
Prior art keywords
bunch
point
user
core
track
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510050458.2A
Other languages
Chinese (zh)
Other versions
CN104615881B (en
Inventor
陈磊
李名臣
史波良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NANJING FIBERHOME INFORMATION DEVELOPMENT Co Ltd
Original Assignee
NANJING FIBERHOME INFORMATION DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NANJING FIBERHOME INFORMATION DEVELOPMENT Co Ltd filed Critical NANJING FIBERHOME INFORMATION DEVELOPMENT Co Ltd
Priority to CN201510050458.2A priority Critical patent/CN104615881B/en
Publication of CN104615881A publication Critical patent/CN104615881A/en
Application granted granted Critical
Publication of CN104615881B publication Critical patent/CN104615881B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a user normal track analysis method based on a movable position application. The user normal track analysis method based on the movable position application includes the following steps that firstly, a user track set P is input; secondly, an orderly convex polygonal track set Q comprising all Pn track points is obtained from the track set P; thirdly, the area of the convex polygon Q is calculated; fourthly, the cluster radius R and cluster density T are determined; fifthly, all core clusters are recorded; sixthly, the core clusters are combined; seventhly, recursion is executed until all the core clusters can not be combined, and then it is judged as convergence; eighthly, the points not contained in the core clusters are judged as noise points, the cluster radius R serves as the threshold value, and the noise points are gathered into a plurality of noise clusters through a proximity clustering principle; ninthly, intra-cluster track point election is carried out on the core clusters and the noise clusters, and it is guaranteed that one point is elected from one cluster.

Description

A kind of user's normality trajectory analysis method based on shift position application
Technical field
The present invention relates to Mobile data trajectory analysis field, be that one carries out mining analysis for User Activity track data, and then obtain the method for user's normality track.
Background technology
Along with the progress of science and technology, Mobile solution changes daily life dearly, for application developer, in order to provide better more humane service to user, usually need to carry out analysis mining to various user behavior data, and User Activity track data this wherein important one just.
At present, being directed to track data, to carry out the algorithm of mining analysis more, and clustering algorithm is also first-selected.But developer is mostly based on respective development requirement, all specific settings are done to the enter factor of clustering algorithm, caused the result applicability of clustering algorithm not to be very wide.
Summary of the invention
Goal of the invention: technical matters to be solved by this invention is for the deficiencies in the prior art, provides a kind of user's normality trajectory analysis method based on shift position application.
In order to solve the problems of the technologies described above, the invention discloses a kind of user's normality trajectory analysis method based on shift position application, comprising the following steps:
Step 1, input user trajectory set P, and object clustering cluster number K, track set P comprises the raw position data of tracing point;
Step 2, for Pn tracing point in track set P, obtains the convex polygon track set Q comprising whole Pn tracing point that in track set P one group is orderly;
Step 3, calculates the area of convex polygon Q;
Step 4, determines bunch radius R and bunch density T;
Step 5, with any one tracing point M in track set P for the center of circle, the some number comprised within bunch radius R is greater than T, then thinking that M is core point, take M as the center of circle, and institute's pointed set that R comprises for radius is combined into core bunch C; If the some number comprised within bunch radius R is less than a bunch density T, then think that M is general point; Points all in traversal P, records all cores bunch;
Step 6, suppose with M1 be core point formed core bunch C1 in comprise T1 tracing point, be comprise T2 tracing point in the core bunch C2 of core point formation with M2, if C1 and C2 two bunches of public tracks are counted, minimum tracing point in more than two bunches bunch counts 50%, then C1 and C2 is merged, form new bunch C12;
Step 7, recurrence performs step 7, until cannot merge between all cores bunch, then judges convergence;
Step 8, by not being included in the point in core bunch, is judged to be noise point; With bunch radius R for threshold value, take the principle of cluster nearby, noise point is gathered into several noises bunch;
Step 9, tracing point election in core bunch and noise bunch being carried out bunch, ensures that cluster elects a bit.
In the present invention, described raw position data comprises: user account, longitude, latitude and acquisition time.
In the present invention, in step 2, the algorithm of convex hull ConvexHull that increases income is used to obtain the orderly convex polygon track set Q comprising whole Pn tracing point of in track set P one group.
In the present invention, in step 3, computing method are: convex polygon Q is divided into several triangles, calculate triangle area and are added.
In the present invention, in step 9, longitude and latitude weighted mean is carried out for all tracing points in each bunch, generate bunch virtual center point, then the distance of this virtual center point is arrived a little in compute cluster, normality tracing point after election distance smallest point merges as this bunch, elects the normality tracing point after all bunches of merging and exports.
The present invention adopts BA-DBScan (Base area DBscan) algorithm when doing mining analysis, according to existing subscriber's track data, the method of dynamic calculation bunch density and bunch radius is carried out by footprint, and then the extraction realized user's normality track, improve the applicability of arithmetic result greatly.
Accompanying drawing explanation
To do the present invention below in conjunction with the drawings and specific embodiments and further illustrate, above-mentioned and/or otherwise advantage of the present invention will become apparent.
Fig. 1 is invention overview flow chart.
Fig. 2 is user A initial trace figure.
Fig. 3 is user A scope of activities figure.
Fig. 4 is user A normality trajectory diagram.
Embodiment
The present invention, from existing subscriber's track data, excavates and extracts user's normality event trace.The User Activity track data collected with Mobile solution, for sample, adopts BA-DBScan as user trajectory Data Clustering Algorithm, then to bunch carrying out a bunch inner track point election after cluster success, is exported by the point elected as normality tracing point;
The input of algorithm: trace information set P, object clustering cluster number K;
The output of algorithm: normality track set;
Introduce concrete steps below:
1> carries out pre-service to raw position data, as inputting data after removing abnormal longitude and latitude data; Ensure to comprise the information such as user account, longitude, latitude, acquisition time in input data.
2>, again for the track set P of wherein user A, supposes to count as Pn containing track in P.Use the algorithm of convex hull ConvexHull that increases income to obtain the orderly convex polygon track set Q of in P one group, ensure that Pn tracing point is all comprised by Q;
3> calculates the area of convex polygon Q.Convex polygon is divided into several triangles, calculates triangle area and be added;
P is triangle semi-perimeter:
p=(ab+bc+ac)/2;
Wherein a, b, c are an Atria summit respectively, and ab, bc, ac are the Atria limit length of side respectively, be converted into the mode that Mercator's coordinate calculates distance again can calculate the length of side by 2 longitudes and latitudes;
Triangle Sqabc areal calculation formula (Heron's formula) be made up of any 3 a, b, c:
Sqabc = p ( p - ab ) ( p - bc ) ( p - ac ) ,
The area Sqn computing formula of convex polygon area Q:
Sqn = Σ n = 3 n Sq 1 ( n - 1 ) n ,
Wherein n refers to the number of vertex of convex polygon Q, and the area of Q equals n-2 triangle area cumulative sum;
4> determines bunch radius and bunch density of clustering algorithm:
Bunch radius:
R = Sqn / K / Π ,
Bunch density:
T=Pn/K,
5> is with any point M in the track set P of user A for the center of circle, and the some number comprised within radius R is greater than T, then thinking that M is core point, take M as the center of circle, and institute's pointed set that R comprises for radius is combined into core bunch C; If the some number comprised within radius R is less than T, then think that R is general point.Points all in traversal P, records all cores bunch information;
If comprise T1 tracing point in the core bunch C1 that 6> take M1 as core point to be formed, be comprise T2 tracing point in the core bunch C2 of core point formation with M2, what simultaneously C1 and C2 two bunches of public tracks counted minimum tracing point in more than two bunches bunch counts 50%, then C1 and C2 is merged, form new bunch C12;
7> recurrence performs step 6, until all bunches all relatively independent, cannot merge, then think algorithm convergence success, the core bunch information after record merges;
8>, for the point be not included in core bunch, is referred to as noise point.Cluster principle is nearby: mark all noise points for " non-cluster ", calculate the distance of any two " non-cluster " noise points, if distance is less than a bunch radius R, then these two noise points being gathered into noise bunch, is " cluster " with these two noise points of tense marker; If distance is greater than a bunch radius R, then continue to calculate other " non-cluster " noise point distances, cycle calculations, until all " non-cluster " noise points are all greater than a bunch radius R with the distance of all " cluster " noise points, cycle calculations terminates.Remaining " non-cluster " noise point is independently become noise bunch separately, records all noises bunch information;
Tracing point election in the core produced above bunch and noise bunch are carried out bunch by 9>, ensures that cluster is selected a bit.Electoral machinery is as follows: carry out longitude and latitude weighted mean for all tracing points in each bunch, generate bunch virtual center point, then arrive a little the distance of this virtual center point in compute cluster, the normality tracing point after election distance smallest point merges as this bunch also exports.
By above several step, the normality track of user A can be obtained.
Embodiment 1
Fig. 1 is overview flow chart of the present invention, and concrete implementing procedure is described below:
1> prepares a collection of existing subscriber's track sample data, after cleaning, ensures must comprise the information such as user account, longitude, latitude, acquisition time in data;
Extract the initial trace data of user A in the sample data of 2> from step 1 as input trajectory data, the original input trajectory information of concrete user A as shown in Figure 2;
After 3> gets the initial trace figure of user A, utilizing the algorithm of convex hull ConvexHull that increases income to calculate the scope of activities of user A, as shown in Figure 3, is a convex pentagon surrounded by A.B.C.D.E coordinate points.
4> calculates the scope of activities area of user A, i.e. convex pentagon area.Can be three triangles by convex pentagon cutting.Wherein triangle length of side ab, bc, ac can be converted into Mercator's coordinate by two summit latitude and longitude coordinates and calculate.
Triangle semi-perimeter:
P=(ab+bc+ac)/2
Triangle area:
Sabc = p ( p - ab ) ( p - bc ) ( p - ac )
Convex pentagon area:
Sabcdef=Sabc+Sacd+Sade
5> goes out the enter factor of BA-DBScan algorithm according to scope of activities areal calculation, i.e. bunch radius and bunch density;
User A initial trace data acquisition and object cluster number K input as algorithm by 4>, execution algorithm, until algorithm convergence, Output rusults bunch, and election in carrying out bunch result bunch, obtain final user A normality track, algorithm terminates.The normality trajectory diagram of user A, as Fig. 4, can find, the present invention remains the normality track of user A preferably.
The invention provides a kind of user's normality trajectory analysis method based on shift position application; the method and access of this technical scheme of specific implementation is a lot; the above is only the preferred embodiment of the present invention; should be understood that; for those skilled in the art; under the premise without departing from the principles of the invention, can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.The all available prior art of each ingredient not clear and definite in the present embodiment is realized.

Claims (5)

1., based on user's normality trajectory analysis method of shift position application, it is characterized in that, comprise the following steps:
Step 1, input user trajectory set P, and object clustering cluster number K, track set P comprises the raw position data of tracing point;
Step 2, for Pn tracing point in track set P, obtains the convex polygon track set Q comprising whole Pn tracing point that in track set P one group is orderly;
Step 3, calculates the area of convex polygon Q;
Step 4, determines bunch radius R and bunch density T;
Step 5, with any one tracing point M in track set P for the center of circle, the some number comprised within bunch radius R is greater than T, then thinking that M is core point, take M as the center of circle, and institute's pointed set that R comprises for radius is combined into core bunch C; If the some number comprised within bunch radius R is less than a bunch density T, then think that M is general point; Points all in traversal P, records all cores bunch;
Step 6, suppose with M1 be core point formed core bunch C1 in comprise T1 tracing point, be comprise T2 tracing point in the core bunch C2 of core point formation with M2, if C1 and C2 two bunches of public tracks are counted, the track of minimum tracing point in more than two bunches bunch is counted 50%, then C1 and C2 is merged, form new bunch C12;
Step 7, recurrence performs step 7, until cannot merge between all cores bunch, then judges convergence;
Step 8, by not being included in the point in core bunch, is judged to be noise point; With bunch radius R for threshold value, take the principle of cluster nearby, noise point is gathered into several noises bunch;
Step 9, tracing point election in core bunch and noise bunch being carried out bunch, ensures that cluster elects a bit.
2. a kind of user's normality trajectory analysis method based on shift position application according to claim 1, it is characterized in that, described raw position data comprises: user account, longitude, latitude and acquisition time.
3. a kind of user's normality trajectory analysis method based on shift position application according to claim 1, it is characterized in that, in step 2, the algorithm of convex hull ConvexHull that increases income is used to obtain the orderly convex polygon track set Q comprising whole Pn tracing point of in track set P one group.
4. a kind of user's normality trajectory analysis method based on shift position application according to claim 1, it is characterized in that, in step 3, computing method are: convex polygon Q is divided into several triangles, calculate triangle area and are added.
5. a kind of user's normality trajectory analysis method based on shift position application according to claim 1, it is characterized in that, in step 9, longitude and latitude weighted mean is carried out for all tracing points in each bunch, generate bunch virtual center point, then arrive a little the distance of this virtual center point in compute cluster, the normality tracing point after election distance smallest point merges as this bunch, elects the normality tracing point after all bunches of merging and exports.
CN201510050458.2A 2015-01-30 2015-01-30 A kind of user's normality trajectory analysis method based on shift position application Active CN104615881B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510050458.2A CN104615881B (en) 2015-01-30 2015-01-30 A kind of user's normality trajectory analysis method based on shift position application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510050458.2A CN104615881B (en) 2015-01-30 2015-01-30 A kind of user's normality trajectory analysis method based on shift position application

Publications (2)

Publication Number Publication Date
CN104615881A true CN104615881A (en) 2015-05-13
CN104615881B CN104615881B (en) 2017-12-22

Family

ID=53150322

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510050458.2A Active CN104615881B (en) 2015-01-30 2015-01-30 A kind of user's normality trajectory analysis method based on shift position application

Country Status (1)

Country Link
CN (1) CN104615881B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105321341A (en) * 2015-12-03 2016-02-10 北京航空航天大学 Resource supply method based on city moving mode
CN107369069A (en) * 2017-07-07 2017-11-21 成都理工大学 A kind of Method of Commodity Recommendation based on triangle area computation schema
WO2018059315A1 (en) * 2016-09-30 2018-04-05 阿里巴巴集团控股有限公司 Method and device for determining areas of interest based on geolocation data
CN108122012A (en) * 2017-12-28 2018-06-05 百度在线网络技术(北京)有限公司 Definite method, apparatus, equipment and the storage medium of resident dot center point
CN109948070A (en) * 2019-03-13 2019-06-28 深圳市同行者科技有限公司 The analysis of a kind of family and company position determines method, storage medium and terminal
CN110245715A (en) * 2019-06-20 2019-09-17 国网湖南省电力有限公司 User's division methods for accurate cutting load
CN110457315A (en) * 2019-07-19 2019-11-15 国家计算机网络与信息安全管理中心 A kind of group's accumulation mode analysis method and system based on user trajectory data

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102164405A (en) * 2010-12-17 2011-08-24 东软集团股份有限公司 Method and system for quickly positioning
CN102607553A (en) * 2012-03-06 2012-07-25 北京建筑工程学院 Travel track data-based stroke identification method
CN103218442A (en) * 2013-04-22 2013-07-24 中山大学 Method and system for life mode analysis based on mobile device sensor data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102164405A (en) * 2010-12-17 2011-08-24 东软集团股份有限公司 Method and system for quickly positioning
CN102607553A (en) * 2012-03-06 2012-07-25 北京建筑工程学院 Travel track data-based stroke identification method
CN103218442A (en) * 2013-04-22 2013-07-24 中山大学 Method and system for life mode analysis based on mobile device sensor data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
冯少荣等: "基于密度的DBSCAN聚类算法的研究及应用", 《计算机工程与应用》 *
桂智明等: "基于出租车轨迹的并行城市热点区域发现", 《华中科技大学学报(自然科学版)》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105321341A (en) * 2015-12-03 2016-02-10 北京航空航天大学 Resource supply method based on city moving mode
WO2018059315A1 (en) * 2016-09-30 2018-04-05 阿里巴巴集团控股有限公司 Method and device for determining areas of interest based on geolocation data
US10943364B2 (en) 2016-09-30 2021-03-09 Advanced New Technologies Co., Ltd. Method and device for determining areas of interest based on geolocation data
US11087490B2 (en) 2016-09-30 2021-08-10 Advanced New Technologies Co., Ltd. Method and device for determining areas of interest based on geolocation data
CN107369069A (en) * 2017-07-07 2017-11-21 成都理工大学 A kind of Method of Commodity Recommendation based on triangle area computation schema
CN107369069B (en) * 2017-07-07 2020-06-05 成都理工大学 Commodity recommendation method based on triangular area calculation mode
CN108122012A (en) * 2017-12-28 2018-06-05 百度在线网络技术(北京)有限公司 Definite method, apparatus, equipment and the storage medium of resident dot center point
CN108122012B (en) * 2017-12-28 2020-11-24 百度在线网络技术(北京)有限公司 Method, device and equipment for determining center point of stationary point and storage medium
CN109948070A (en) * 2019-03-13 2019-06-28 深圳市同行者科技有限公司 The analysis of a kind of family and company position determines method, storage medium and terminal
CN109948070B (en) * 2019-03-13 2022-08-09 深圳市同行者科技有限公司 Method for analyzing and determining positions of home and company, storage medium and terminal
CN110245715A (en) * 2019-06-20 2019-09-17 国网湖南省电力有限公司 User's division methods for accurate cutting load
CN110457315A (en) * 2019-07-19 2019-11-15 国家计算机网络与信息安全管理中心 A kind of group's accumulation mode analysis method and system based on user trajectory data

Also Published As

Publication number Publication date
CN104615881B (en) 2017-12-22

Similar Documents

Publication Publication Date Title
CN104615881A (en) User normal track analysis method based on movable position application
CN107241512B (en) Intercity Transportation trip mode judgment method and equipment based on data in mobile phone
CN103593430A (en) Clustering method based on mobile object spatiotemporal information trajectory subsections
CN107301254A (en) A kind of road network hot spot region method for digging
CN103544379B (en) Landed Typhoon similarity analysis method based on GIS
CN103747419B (en) A kind of indoor orientation method based on signal strength difference and dynamic linear interpolation
US11009362B2 (en) Generating trail network maps
CN113139760B (en) Typhoon risk comprehensive evaluation method and system based on wind and rain big data
CN106339716A (en) Mobile trajectory similarity matching method based on weighted Euclidean distance
CN107818534B (en) Human activity network region division method with space constraint
CN104239453B (en) Data processing method and device
CN109688532A (en) A kind of method and device dividing city function region
CN105355042A (en) Road network extraction method based on taxi GPS
CN110716935A (en) Track data analysis and visualization method and system based on online taxi appointment travel
CN109034187A (en) A kind of subscriber household work address excavation process
CN107133236A (en) The method and apparatus of geographical position track optimizing
CN107908636A (en) A kind of method that mankind's activity spatiotemporal mode is excavated using social media
CN109947758A (en) A kind of route crash analysis method in Behavior-based control track library
CN107071721A (en) Stop point extracting method based on mobile phone location data
CN106802958B (en) Conversion method and system of the CAD data to GIS data
CN104219088A (en) Hive-based network alarm information OLAP method
CN105844031A (en) Mobile phone positioning data based urban traffic corridor identification method
CN104075709B (en) Method and apparatus is determined based on the motion track of sparse track point data
CN106412835A (en) Method for identifying user travel mode and apparatus thereof
CN106846795A (en) The acquisition methods and device in the crowd is dense region

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant