CN106294485B - Determine the method and device in significant place - Google Patents

Determine the method and device in significant place Download PDF

Info

Publication number
CN106294485B
CN106294485B CN201510307160.5A CN201510307160A CN106294485B CN 106294485 B CN106294485 B CN 106294485B CN 201510307160 A CN201510307160 A CN 201510307160A CN 106294485 B CN106294485 B CN 106294485B
Authority
CN
China
Prior art keywords
attribute
significant place
place
potential
conditional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510307160.5A
Other languages
Chinese (zh)
Other versions
CN106294485A (en
Inventor
丁强
牛新征
张洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
University of Electronic Science and Technology of China
Original Assignee
Huawei Technologies Co Ltd
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd, University of Electronic Science and Technology of China filed Critical Huawei Technologies Co Ltd
Priority to CN201510307160.5A priority Critical patent/CN106294485B/en
Publication of CN106294485A publication Critical patent/CN106294485A/en
Application granted granted Critical
Publication of CN106294485B publication Critical patent/CN106294485B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides the method and devices in the significant place of determination, are related to mobile internet technical field.It is invented to solve the problems, such as in the prior art lower to the accuracy of identification in significant place.The method comprising the steps of one: determining that significant place determines attribute and step 2 according to sample position track: determining that attribute determines the significant place in location track to be processed according to significant place.Wherein, step 1 specifically includes: determining the potential significant place on sample position track;The value for obtaining corresponding first attribute set in each potential significant place, obtains attribute information table;According to attribute information table, significant place is chosen from the first attribute set using default Feature Selection algorithm and determines attribute;Obtain the threshold range that significant place determines attribute.Step 2 specifically includes: determining at least one potential significant place in location track to be processed;Significant place is determined that potential significant place of the value of attribute in threshold range is determined as significant place.

Description

Determine the method and device in significant place
Technical field
The present invention relates to mobile internet technical fields, more particularly to determine the method and device in significant place.
Background technique
Prevailing with mobile Internet, the behavioural analysis of mobile terminal user has become the focus of research.Wherein, with In the behavioural analysis of family, the position analysis tool of user has very important significance, and can be used based on the position acquisition of mobile terminal user The more position of family access times namely significant place are to provide more intelligent clothes for user when user is in significant place Business.
A kind of determination method in significant place is provided in the prior art, during the realization of this method, first to obtaining The location track got is pre-processed, and rejecting is clearly not the point in significant place, then uses clustering algorithm to remaining point Handled each point the point labeled as noise spot and significant place, labeled as significant place be determine significantly Point.There may be judge non-significant place by accident in the identification process in significant place for the method in the existing significant place of this determination The problem of for significant place, thus the prior art is lower to the accuracy of identification in significant place.
Summary of the invention
The present invention provides a kind of method and device for determining significant place, in the prior art to significant place to solve The lower problem of resolution.
In order to achieve the above objectives, the present invention adopts the following technical scheme:
In a first aspect, the present invention provides a kind of methods for determining significant place, which comprises according to sample position Track determines that significant place determines attribute and determines according to the significant place decision attribute significant in location track to be processed Place;
Wherein, described to determine that significant place determines attribute according to sample position track, it specifically includes:
At least one potential significant place on the sample position track is obtained by default clustering algorithm;
The value for obtaining corresponding first attribute set in each potential significant place, obtains attribute information table, and described One attribute set includes the conditional attribute and a decision attribute of predetermined number;
According to the attribute information table, chosen significantly from first attribute set using default Feature Selection algorithm Point determines attribute;
Obtain the threshold range that the significant place determines attribute;
It is described to determine that attribute determines the significant place in location track to be processed according to the significant place, it specifically includes:
At least one potential significant place in the location track to be processed is obtained by the default clustering algorithm;
The significant place is determined that potential significant place of the value of attribute in the threshold range is determined as significantly Point.
With reference to first aspect, described according to the attribute information table in the first implementation of first aspect, it uses Default Feature Selection algorithm chooses significant place from first attribute set and determines attribute, specifically includes:
There is fault tolerance using what default Feature Selection algorithm calculated each conditional attribute in first attribute set P norm value;
The corresponding conditional attribute of maximum value in all values of P norm described in being calculated with fault tolerance is true It is set to significant place and determines attribute.
The first implementation with reference to first aspect, it is described using pre- in second of implementation of first aspect If Feature Selection algorithm calculates the value of the P norm with fault tolerance of each conditional attribute in first attribute set, tool Body includes:
Using the first condition attribute in first attribute set as attribute to be assessed, remaining conditional attribute is constituted Set is determined as the second attribute set, and the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed described second Whether the sharing in game for each destination subset in attribute set wins, and obtains judgement knot corresponding with each destination subset Fruit, the destination subset include at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
With reference to first aspect, in the third implementation of first aspect, the default clustering algorithm includes apart from threshold Value, points three parameters of threshold value and time threshold;
It is described to be obtained in the sample position track or the location track to be processed at least by default clustering algorithm One potential significant place, specifically includes:
Place not labeled in the location track is used as to starting point, the label include be labeled as it is potential significantly Point or noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and institute The time interval for stating starting point is all larger than the time threshold, then the starting point and all target points is determined as one Cluster, and the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
With reference to first aspect or the first implementation of first aspect, second of implementation, the third realization side Any one implementation in formula includes speed in first attribute set in the 4th kind of implementation of first aspect Record number in the sample position track of degree, acceleration, potential significant place, the potential significant place are in the sample The time period of stay and Orientation differences standard deviation of the number of days, the potential significant place that occur in this location track.
Second aspect, the present invention also provides a kind of device for determining significant place, described device includes:
Processing module, for by preset clustering algorithm obtain on the sample position track at least one is potential significant Place;
Module is obtained, for obtaining the value of corresponding first attribute set in each potential significant place, obtains attribute Information table, first attribute set include the conditional attribute and a decision attribute of predetermined number;
The processing module, is also used to according to the attribute information table, using default Feature Selection algorithm from described first Significant place is chosen in attribute set determines attribute;
The acquisition module is also used to obtain the threshold range that the significant place determines attribute;
The processing module is also used to obtain in the location track to be processed at least by the default clustering algorithm One potential significant place;
The processing module is also used to determining the significant place into that the value of attribute is potential aobvious in the threshold range Touchdown point is determined as significant place.
In conjunction with second aspect, in the first implementation of second aspect, the acquisition module is specifically used for using pre- If Feature Selection algorithm calculates the value of the P norm with fault tolerance of each conditional attribute in first attribute set;
The corresponding conditional attribute of maximum value in all values of P norm described in being calculated with fault tolerance is true It is set to significant place and determines attribute.
In conjunction with the first implementation of second aspect, in second of implementation of second aspect, the acquisition mould Block, also particularly useful for:
Using the first condition attribute in first attribute set as attribute to be assessed, remaining conditional attribute is constituted Set is determined as the second attribute set, and the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed described second Whether the sharing in game for each destination subset in attribute set wins, and obtains judgement knot corresponding with each destination subset Fruit, the destination subset include at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
In conjunction with second aspect, in the third implementation of second aspect, the default clustering algorithm includes apart from threshold Value, points three parameters of threshold value and time threshold;
The processing module, is specifically used for:
Place not labeled in the location track is used as to starting point, the label include be labeled as it is potential significantly Point or noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and institute The time interval for stating starting point is all larger than the time threshold, then the starting point and all target points is determined as one Cluster, and the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
In conjunction with the first of second aspect or second aspect implementation, second implementation, the third realization side Any one implementation in formula, in the 4th kind of implementation of second aspect, described for obtaining module and obtaining It include record number in the sample position track of speed, acceleration, potential significant place in one attribute set, described latent The time period of stay and orientation of the number of days, the potential significant place that occur in the sample position track in significant place Change standard deviation.
The method and device in the significant place of determination provided in an embodiment of the present invention, passes through clustering algorithm for sample position track It obtains potential significant place, then goes out the decision attribute and its threshold range in significant place by Feature Selection algorithm picks, so Location track to be processed is handled again afterwards, by the decision in significant place in the potential significant place in location track to be processed The potential significant place that the value of attribute meets threshold range is determined as significant place, and will be selected in the prior art by clustering algorithm The point selected out is compared directly as significant place, and the present invention after clustering algorithm due to using Feature Selection algorithm, thus energy The False Rate in significant place is enough reduced, and then can be improved the accuracy for determining significant place.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.
Fig. 1 is a kind of flow diagram for the method for determining significant place provided in an embodiment of the present invention;
Fig. 2 be it is provided in an embodiment of the present invention it is a kind of by preset clustering algorithm determine in location track it is potential significantly The flow diagram of the method for point;
Fig. 3 is aobvious using default Feature Selection algorithm picks to be provided in an embodiment of the present invention a kind of according to attribute information table Touchdown point determines the flow diagram of the method for attribute;
Fig. 4 is a kind of value for the P norm with fault tolerance for calculating each conditional attribute provided in an embodiment of the present invention Method flow diagram;
Fig. 5 is a kind of structural schematic diagram for the device for determining significant place provided in an embodiment of the present invention;
Fig. 6 is the structural schematic diagram of another device for determining significant place provided in an embodiment of the present invention.
Specific embodiment
Below in conjunction with the attached drawing in the present embodiment, the technical solution in the present embodiment is clearly and completely described, Obviously, described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.Based in the present invention Embodiment, every other embodiment obtained by those of ordinary skill in the art without making creative efforts, all Belong to the scope of protection of the invention.
The embodiment of the invention provides a kind of methods for determining significant place, which comprises according to sample position rail Mark determines that significant place determines attribute and to determine that attribute determines according to the significant place significant in location track to be processed Point.
Wherein, the sample position track is to have had determined which place is the location track in significant place.It is described to Handling location track is to have not determined the location track in significant place.The sample position track and the location track to be processed The location information in a period of time where various time points mobile terminal is contained as a kind of location track to be passed through in other words The route crossed.In general, having GPS sensor on existing mobile terminal, the mobile terminal of user is can receive and recorded Continuous GPS latitude and longitude coordinates point is to form location track signified in GPS track i.e. the embodiment of the present invention.
" the determining that significant place determines attribute according to sample position track " specifically includes following step 101 to step 104;" the determining that attribute determines the significant place in location track to be processed according to the significant place " specifically includes following Step 105 and 106.
As shown in Figure 1, the method in the significant place of determination provided in an embodiment of the present invention specifically includes:
101: at least one potential significant place on the sample position track is obtained by default clustering algorithm.
Wherein, the basic object of clustering algorithm is that will meet point one class of formation of specified conditions, common clustering algorithm Including density-based spatial clustering algorithm (Density-Based Spatial Clustering of Applications With Noise, abbreviation DBSCAN) algorithm etc..
Point in location track can be divided into noise spot and potential significant according to its probability for being likely to become significant place Point, signified potential significant place are the biggish point of probability as significant place.
102: obtaining the value of corresponding first attribute set in each potential significant place, obtain attribute information table, institute State the conditional attribute and a decision attribute that the first attribute set includes predetermined number.
Wherein, the value of each conditional attribute in the first attribute set and decision attribute can be by analyzing sample position track It directly reads or is obtained after certain operation.Such as: when the record number in track is set as item in the potential place in place When part attribute, it can potentially put appearance number by directly searching and obtain.For another example: when speed is a conditional attribute, In It, can be by obtaining the pole including the potential significant place for some potential significant place after potential significant place has been determined Segment path and the path corresponding time, the speed of the mobile terminal of this short time is obtained simultaneously by speed calculation formula Using the speed as the speed in the potential significant place.
As an optional implementation, may include in first attribute set speed, acceleration, it is potential significantly The day that point occurs in the sample position track in the record number in the sample position track, the potential significant place The time period of stay and Orientation differences standard deviation in several, the described potential significant place totally 6 conditional attributes.Signified decision attribute Refer to whether the potential significant place is significant place.
It include above-mentioned 6 items in the first attribute set table altogether to include 10 potential significant places in sample position track For part attribute and 1 decision attribute, following table table one gives the corresponding attribute information table in potential significant place.
Table one: attribute information table
Wherein, if in significant place column, " 1 " represents the potential significant place as significant place, and " 0 " represents should Potential significant place is not significant place.
Above-mentioned table one simply shows the way of realization of attribute information table, wherein the value of each conditional attribute can be according to reality Situation obtains after obtaining.
103: according to the attribute information table, being chosen from first attribute set using default Feature Selection algorithm aobvious Touchdown point determines attribute.
Wherein, signified default Feature Selection algorithm can be the P norm with fault tolerance.
104: obtaining the threshold range that the significant place determines attribute.
In a kind of specific implementation of this step, by obtain multiple places (both included non-significant place and also including Significant place) the corresponding significant place determines the value of attribute;If significant place determines some section packet of the value of attribute The number in the significant place contained is most, then the section is determined as the threshold range that significant place determines attribute.With by speed this For one conditional attribute determines attribute as significant place, the determination process specifically:
Obtain the velocity amplitude in multiple places on sample position track;
Section comprising the largest number of velocity amplitudes in significant place is determined as to the threshold range of this conditional attribute of speed.
105: by the default clustering algorithm obtain in the location track to be processed at least one is potential significantly Point.
106: the significant place is determined that potential significant place of the value of attribute in the threshold range is determined as showing Touchdown point.
The method in the significant place of determination provided in an embodiment of the present invention is obtained by clustering algorithm sample position track latent In significant place, then go out by Feature Selection algorithm picks the decision attribute and its threshold range in significant place, it is then right again Location track to be processed is handled, by the decision attribute in significant place in the potential significant place in location track to be processed The potential significant place that value meets threshold range is determined as significant place, and will be selected in the prior art by clustering algorithm Point is compared directly as significant place, and the present invention after clustering algorithm due to using Feature Selection algorithm, it is thus possible to reduce The False Rate in significant place, and then can be improved the accuracy for determining significant place.
It should be noted that determining attribute and the threshold of the decision attribute to improve the significant place of the embodiment of the present invention It is worth the accuracy of the selection of range, it can be by obtaining multiple sample position tracks and using step 101 identical to step 104 Method obtains one corresponding with each sample position track threshold range for determining attribute and the decision attribute, then carries out Final significant place, which is obtained, after analysis processing determines attribute and its threshold range.
A kind of implementation as the potential significant place in acquisition location track described in step 101 and step 105: Signified default clustering algorithm is the variant of DBSCAN algorithm, the default clustering algorithm include distance threshold, points threshold value and when Between three parameters of threshold value.
It is described to be obtained in the sample position track or the location track to be processed at least by default clustering algorithm One potential significant place, as shown in Fig. 2, specifically including:
201: using the place not being labeled in the location track as starting point.
Wherein, the label includes being labeled as potential significant place or noise spot.
202: searching the target point for being less than or equal to the distance threshold at a distance from the starting point.
It illustrates as one, the value of the distance threshold can be 100m.
203: if the quantity of the target point found is more than or equal to the points threshold value, and each target point It is all larger than the time threshold with the time interval of the starting point, then is determined as the starting point and all target points One cluster, and the starting point is labeled as potential significant place.
It illustrates as one, which can be 5;The time threshold can be 10 minutes.
If being unsatisfactory for above-mentioned condition, the starting point is labeled as noise spot.
204: after all the points in the cluster are marked, the place that successively marks other not to be labeled.
Step 201 is repeated to step 203, successively all the points in cluster are handled, when all the points are marked in cluster After having remembered (can be described as the cluster sufficiently to be extended), processing is marked to other unlabelled points.
This clustering algorithm used in the embodiment of the present invention is the variant of DBSCAN algorithm, it not only consider track midpoint and The distance between point, and the distance between putting point is distance on track rather than the direct range of two points, so more Close to real-life truth.And replace Minimum Area number to determine cluster using minimum time in cluster process, then User equipment can be prevented simply because a certain cause specific, it has to stop short time herein, and significant place is caused to judge by accident.
As step 103 " according to the attribute information table, using default Feature Selection algorithm from first attribute set A kind of specific implementation of the middle significant place decision attribute of selection ", as shown in figure 3, the process specifically includes:
301: there is tolerance using what default Feature Selection algorithm calculated each conditional attribute in first attribute set The value of the P norm of ability.
302: the corresponding condition category of maximum value in all values of the P norm described in being calculated with fault tolerance Property be determined as significant place and determine attribute.
Wherein, as shown in figure 4, in above-mentioned steps 301, the P norm with fault tolerance of each conditional attribute is calculated The detailed process of value are as follows:
401: using the first condition attribute in first attribute set as attribute to be assessed, by remaining conditional attribute structure At set be determined as the second attribute set, the first condition attribute is any conditional attribute.
402: selecting game playing algorithm using the user property based on fuzzy entropy, successively judge the attribute to be assessed described Whether the sharing in game for each destination subset in the second attribute set wins, and obtains judgement corresponding with each destination subset As a result, the destination subset includes at least two conditional attributes.
The basic principle of the realization process of this step is to be calculated using the user property selection game method based on fuzzy entropy Conditional mutual information, judges whether attribute to be assessed wins in sharing in game for attribute set.
403: according to all judging results, calculating the value of the P norm with fault tolerance of the attribute to be assessed.
401 are repeated the above steps to step 403, the corresponding P norm with fault tolerance of each conditional attribute can be obtained Value.
Assuming that sharing N-1 conditional attribute Ni and 1 decision attribute D in the first attribute set, potential location information is selected Either condition attribute Ni in table is assessed as attribute to be assessed, then remaining N-2 conditional attribute constitutes the second attribute Set.Second attribute set is corresponding with multiple subsets, any one for selecting in second attribute set contains at least two The attribute set Si of conditional attribute, to assess in the situation known to conditional attribute Ni, attribute set Si and final decision attribute Extent of information sharing.Its calculation formula is as follows:
MI(Si;D | Ni)=E (Si | Ni)-E (Si | D, Ni) (1)
Wherein, conditional entropy E (Si | Ni) is referred to as when known to Ni, the entropy of attribute set Si, E (Si | D, Ni) indicate to When evaluation condition attribute Ni and decision attribute D are existed simultaneously, the entropy of attribute set Si.
If in formula (1), MI (Si;D | Ni) value be greater than 0, then explanation increases in the presence of the attribute Ni to be assessed The information sharing of attribute set Si and decision attribute D claim Ni to win in sharing in game for attribute set Si.
According to the calculation method of formula (1), it is corresponding in each objective attribute target attribute subset can successively to obtain attribute Ni to be assessed Whether sharing in game for Si wins.
In addition, conditional entropy E (Si | Ni) calculation method are as follows:
E (Si | Ni)=- (∑ log2(|[xi]|/n))/n (2)
Wherein, | [xi] |=∑ rij, rijFor any two conditional attribute value x in Si setiWith xjSimilarity relation, n is The number of Si conditional attribute.rijFor example following formula (3) of calculation formula shown in:
Wherein, in formula (3), | | xi-xj| | it is the P norm with fault tolerance.
According to attribute to be assessed sharing whether to win in game and calculate attribute to be assessed in each objective attribute target attribute subset The value of P norm (Banzhaf) with fault tolerance.
In formula (1), as MI (Si;D | Ni) be greater than 0, i.e. conditional attribute Ni wins in sharing in game for attribute set Si When, count Δi(Si)=1, otherwise Δi(Si)=0.
Then the Banzhaf value calculating method of conditional attribute Ni is as follows:
The corresponding Banzhaf value of each conditional attribute is finally obtained, Banzhaf value is higher, illustrates it to attribute set Contribution it is bigger.Select the corresponding conditional attribute of Banzhaf value peak as determine the most important attribute in significant place namely Significant place determines attribute.
Since potential significant place is obtained by clustering algorithm, it is understood that there may be the sparse situation of sample, and this feature With the P norm of fault tolerance used in extracting, the situation that more to cope with sample than traditional P norm sparse is kept away simultaneously Equivalence relation sample discretization bring information loss is exempted from.
In addition, before step 101 and step 105, due to having drift when the location track got according to GPS sensor understands It moves, thus the shift point for needing for these to be evident as noise spot is rejected, for example moment has floated to Shanghai from Beijing, this is clearly Mistake, this GPS latitude and longitude coordinates point just needs to weed out.
As the realization of the above method, the embodiment of the invention also provides a kind of devices for determining significant place, such as Fig. 5 institute Show, which includes:
Processing module 501, for by preset clustering algorithm obtain on the sample position track at least one is potential Significant place;
Module 502 is obtained, for obtaining the value of corresponding first attribute set in each potential significant place, is belonged to Property information table, first attribute set includes the conditional attribute and a decision attribute of predetermined number;
The processing module 501, is also used to according to the attribute information table, using default Feature Selection algorithm from described Significant place is chosen in one attribute set determines attribute;
The acquisition module 502 is also used to obtain the threshold range that the significant place determines attribute;
The processing module 501 is also used to obtain in the location track to be processed by the default clustering algorithm At least one potential significant place;
The processing module 501 is also used to the significant place determining that the value of attribute is latent in the threshold range It is determined as significant place in significant place.
Further, the acquisition module 502 is specifically used for calculating first attribute using default Feature Selection algorithm The value of the P norm with fault tolerance of each conditional attribute in set;
The corresponding conditional attribute of maximum value in all values of P norm described in being calculated with fault tolerance is true It is set to significant place and determines attribute.
Further, the acquisition module 502, also particularly useful for:
Using the first condition attribute in first attribute set as attribute to be assessed, remaining conditional attribute is constituted Set is determined as the second attribute set, and the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed described second Whether the sharing in game for each destination subset in attribute set wins, and obtains judgement knot corresponding with each destination subset Fruit, the destination subset include at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
Further, the default clustering algorithm includes distance threshold, points three parameters of threshold value and time threshold;Then institute Processing module 501 is stated, is specifically used for:
Place not labeled in the location track is used as to starting point, the label include be labeled as it is potential significantly Point or noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and institute The time interval for stating starting point is all larger than the time threshold, then the starting point and all target points is determined as one Cluster, and the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
It further, include speed, acceleration in first attribute set for obtaining the acquisition of module 502, potential Significant place goes out in the sample position track in the record number in the sample position track, the potential significant place The time period of stay and Orientation differences standard deviation of existing number of days, the potential significant place.
The device in the significant place of determination provided in an embodiment of the present invention is obtained by clustering algorithm sample position track latent In significant place, then go out by Feature Selection algorithm picks the decision attribute and its threshold range in significant place, it is then right again Location track to be processed is handled, by the decision attribute in significant place in the potential significant place in location track to be processed The potential significant place that value meets threshold range is determined as significant place, and will be selected in the prior art by clustering algorithm Point is compared directly as significant place, and the present invention after clustering algorithm due to using Feature Selection algorithm, it is thus possible to reduce The False Rate in significant place, and then can be improved the accuracy for determining significant place.
As the realization of the above method, the embodiment of the invention also provides a kind of devices for determining significant place, such as Fig. 6 institute Show, which includes processor 601, memory 602 and bus 603, and processor 601 and memory 602 are connected by bus 603 It connects.Wherein:
Processor 601, for by preset clustering algorithm obtain on the sample position track at least one is potential aobvious Touchdown point;
The value for obtaining corresponding first attribute set in each potential significant place, obtains attribute information table, and described One attribute set includes the conditional attribute and a decision attribute of predetermined number;
According to the attribute information table, chosen significantly from first attribute set using default Feature Selection algorithm Point determines attribute;
Obtain the threshold range that the significant place determines attribute;
At least one potential significant place in the location track to be processed is obtained by the default clustering algorithm;
The significant place is determined that potential significant place of the value of attribute in the threshold range is determined as significantly Point.
Further, the processor 601 is specifically used for calculating first property set using default Feature Selection algorithm The value of the P norm with fault tolerance of each conditional attribute in conjunction;
The corresponding conditional attribute of maximum value in all values of P norm described in being calculated with fault tolerance is true It is set to significant place and determines attribute.
Further, the processor 601, also particularly useful for:
Using the first condition attribute in first attribute set as attribute to be assessed, remaining conditional attribute is constituted Set is determined as the second attribute set, and the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed described second Whether the sharing in game for each destination subset in attribute set wins, and obtains judgement knot corresponding with each destination subset Fruit, the destination subset include at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
Further, the processor 601, is specifically used for:
Place not labeled in the location track is used as to starting point, the label include be labeled as it is potential significantly Point or noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and institute The time interval for stating starting point is all larger than the time threshold, then the starting point and all target points is determined as one Cluster, and the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
It further, include speed, acceleration in first attribute set that the processor 601 obtains, potential aobvious Touchdown point occurs in the sample position track in the record number in the sample position track, the potential significant place Number of days, the potential significant place time period of stay and Orientation differences standard deviation.
Memory 602 is for program used in 601 implementation procedure of storage processor.
The device in the significant place of determination provided in an embodiment of the present invention is obtained by clustering algorithm sample position track latent In significant place, then go out by Feature Selection algorithm picks the decision attribute and its threshold range in significant place, it is then right again Location track to be processed is handled, by the decision attribute in significant place in the potential significant place in location track to be processed The potential significant place that value meets threshold range is determined as significant place, and will be selected in the prior art by clustering algorithm Point is compared directly as significant place, and the present invention after clustering algorithm due to using Feature Selection algorithm, it is thus possible to reduce The False Rate in significant place, and then can be improved the accuracy for determining significant place.
It should be noted that processor 601 described in the embodiment of the present invention can be a processor, it is also possible to multiple The general designation of processing element.For example, the processor 601 can be central processing unit (Central Processing Unit, abbreviation CPU), it is also possible to specific integrated circuit (Application Specific Integrated Circuit, abbreviation ASIC), Or be arranged to implement one or more integrated circuits of the embodiment of the present invention, such as: one or more microprocessors (digital signal processor, abbreviation DSP), or, one or more field programmable gate array (Field Programmable Gate Array, abbreviation FPGA).
Memory 602 can be a storage device, be also possible to the general designation of multiple memory elements, and for storing and can hold Line program code etc..And memory 602 may include random access memory (RAM), also may include nonvolatile memory (non- Volatile memory), for example, magnetic disk storage, flash memory (Flash) etc..
It is total that bus 603 can be industry standard architecture (Industry Standard Architecture, ISA) Line, external equipment interconnection (Peripheral Component, PCI) bus or extended industry-standard architecture (Extended Industry Standard Architecture, EISA) bus etc..It is total that the bus 603 can be divided into address bus, data Line, control bus etc..Only to be indicated with a thick line in Fig. 6, it is not intended that an only bus or a type convenient for indicating The bus of type.
Through the above description of the embodiments, it is apparent to those skilled in the art that the present invention can borrow Help software that the mode of required common hardware is added to realize, naturally it is also possible to which the former is more preferably by hardware, but in many cases Embodiment.Based on this understanding, the portion that technical solution of the present invention substantially in other words contributes to the prior art Dividing can be embodied in the form of software products, which stores in a readable storage medium, such as count The floppy disk of calculation machine, hard disk or CD etc., including some instructions are used so that computer equipment (it can be personal computer, Server or the network equipment etc.) execute method described in each embodiment of the present invention.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain Lid is within protection scope of the present invention.

Claims (6)

1. a kind of method for determining significant place, which is characterized in that the described method includes: being determined according to sample position track significant Place determines attribute and determines that attribute determines the significant place in location track to be processed according to the significant place;
Wherein, described to determine that significant place determines attribute according to sample position track, it specifically includes:
At least one potential significant place on the sample position track is obtained by default clustering algorithm;
The value for obtaining corresponding first attribute set in each potential significant place, obtains attribute information table, described first belongs to Property set includes the conditional attribute and a decision attribute of predetermined number;Wherein, the conditional attribute include speed, acceleration, Record number of the potential significant place in the sample position track, the potential significant place are in the sample position track The number of days of middle appearance, the potential significant place time period of stay and Orientation differences standard deviation;The decision attribute refers to Whether the potential significant place is significant place;
According to the attribute information table, significant place is chosen from first attribute set using default Feature Selection algorithm and is determined Determine attribute;It specifically includes: calculating having for each conditional attribute in first attribute set using default Feature Selection algorithm The value of the P norm of fault tolerance;Maximum value in all values of P norm described in being calculated with fault tolerance is corresponding Conditional attribute be determined as significant place and determine attribute;
Obtain the threshold range that the significant place determines attribute;
It is described to determine that attribute determines the significant place in location track to be processed according to the significant place, it specifically includes:
At least one potential significant place in the location track to be processed is obtained by the default clustering algorithm;
Significant place is determined that potential significant place of the value of attribute in the threshold range is determined as significant place.
2. the method according to claim 1, wherein described calculate described first using default Feature Selection algorithm The value of the P norm with fault tolerance of each conditional attribute in attribute set, specifically includes:
Using the first condition attribute in first attribute set as attribute to be assessed, the set that remaining conditional attribute is constituted It is determined as the second attribute set, the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed in second attribute Whether the sharing in game for each destination subset in set wins, and obtains judging result corresponding with each destination subset, institute Stating destination subset includes at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
3. the method according to claim 1, wherein the default clustering algorithm includes distance threshold, points threshold Value and three parameters of time threshold;
It is described that at least one on the sample position track or the location track to be processed is obtained by default clustering algorithm Potential significant place, specifically includes:
Place not labeled in the location track is used as to starting point, the label include be labeled as potential significant place or Noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and described The time interval of initial point is all larger than the time threshold, then the starting point and all target points is determined as a cluster, And the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
4. a kind of device for determining significant place, which is characterized in that described device includes:
Processing module, for obtaining at least one potential significant place on sample position track by default clustering algorithm;
Module is obtained, for obtaining the value of corresponding first attribute set in each potential significant place, obtains attribute information Table, first attribute set include the conditional attribute and a decision attribute of predetermined number;Wherein, the conditional attribute includes Record number in the sample position track of speed, acceleration, potential significant place, the potential significant place are described The time period of stay and Orientation differences standard deviation of the number of days, the potential significant place that occur in sample position track;It is described Decision attribute refers to whether the potential significant place is significant place;
The processing module, is also used to according to the attribute information table, using default Feature Selection algorithm from first attribute Significant place is chosen in set determines attribute;
The acquisition module is specifically used for calculating each condition category in first attribute set using default Feature Selection algorithm The value of the P norm with fault tolerance of property;In all values of P norm described in being calculated with fault tolerance most It is worth corresponding conditional attribute greatly and is determined as significant place decision attribute;
The acquisition module is also used to obtain the threshold range that the significant place determines attribute;
The processing module, at least one for being also used to obtain in location track to be processed by the default clustering algorithm are potential Significant place;
The processing module, be also used to determine in the significant place value of attribute in the threshold range it is potential significantly Point is determined as significant place.
5. device according to claim 4, which is characterized in that the acquisition module, also particularly useful for:
Using the first condition attribute in first attribute set as attribute to be assessed, the set that remaining conditional attribute is constituted It is determined as the second attribute set, the first condition attribute is any conditional attribute;
Game playing algorithm is selected using the user property based on fuzzy entropy, successively judges the attribute to be assessed in second attribute Whether the sharing in game for each destination subset in set wins, and obtains judging result corresponding with each destination subset, institute Stating destination subset includes at least two conditional attributes;
According to all judging results, the value of the P norm with fault tolerance of the attribute to be assessed is calculated.
6. device according to claim 4, which is characterized in that the default clustering algorithm includes distance threshold, points threshold Value and three parameters of time threshold;
The processing module, is specifically used for:
Place not labeled in the location track is used as to starting point, the label include be labeled as potential significant place or Noise spot;
Search the target point for being less than or equal to the distance threshold at a distance from the starting point;
If the quantity of the target point found is more than or equal to the points threshold value, and each target point and described The time interval of initial point is all larger than the time threshold, then the starting point and all target points is determined as a cluster, And the starting point is labeled as potential significant place;
After all the points in the cluster are marked, the place that successively marks other not to be labeled.
CN201510307160.5A 2015-06-05 2015-06-05 Determine the method and device in significant place Active CN106294485B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510307160.5A CN106294485B (en) 2015-06-05 2015-06-05 Determine the method and device in significant place

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510307160.5A CN106294485B (en) 2015-06-05 2015-06-05 Determine the method and device in significant place

Publications (2)

Publication Number Publication Date
CN106294485A CN106294485A (en) 2017-01-04
CN106294485B true CN106294485B (en) 2019-11-01

Family

ID=57659648

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510307160.5A Active CN106294485B (en) 2015-06-05 2015-06-05 Determine the method and device in significant place

Country Status (1)

Country Link
CN (1) CN106294485B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107491895A (en) * 2017-08-30 2017-12-19 国信优易数据有限公司 A kind of shared bicycle parks determination method and device a little
CN112230253B (en) * 2020-10-13 2021-07-09 电子科技大学 Track characteristic anomaly detection method based on public slice subsequence
CN113254570B (en) * 2021-07-15 2021-10-01 浙江网商银行股份有限公司 Data identification method and device
CN114995628B (en) * 2021-10-13 2023-08-11 荣耀终端有限公司 Space gesture recognition method and related equipment thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103218442A (en) * 2013-04-22 2013-07-24 中山大学 Method and system for life mode analysis based on mobile device sensor data
CN103577547A (en) * 2013-10-12 2014-02-12 优视科技有限公司 Webpage type identification method and device
CN104252527A (en) * 2014-09-02 2014-12-31 百度在线网络技术(北京)有限公司 Method and device for determining resident point information of mobile subscriber
CN104636354A (en) * 2013-11-07 2015-05-20 华为技术有限公司 Position point of interest clustering method and related device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103218442A (en) * 2013-04-22 2013-07-24 中山大学 Method and system for life mode analysis based on mobile device sensor data
CN103577547A (en) * 2013-10-12 2014-02-12 优视科技有限公司 Webpage type identification method and device
CN104636354A (en) * 2013-11-07 2015-05-20 华为技术有限公司 Position point of interest clustering method and related device
CN104252527A (en) * 2014-09-02 2014-12-31 百度在线网络技术(北京)有限公司 Method and device for determining resident point information of mobile subscriber

Also Published As

Publication number Publication date
CN106294485A (en) 2017-01-04

Similar Documents

Publication Publication Date Title
KR102205096B1 (en) Transaction risk detection method and apparatus
TWI706273B (en) Uniform resource locator (URL) attack detection method, device and electronic equipment
CN109816397B (en) Fraud discrimination method, device and storage medium
CN106294485B (en) Determine the method and device in significant place
CN106649331A (en) Business district recognition method and equipment
CN105910612A (en) Personalized navigation method and system
CN109299981A (en) A kind of advertisement recommended method and device
CN110688589A (en) Store arrival identification method and device, electronic equipment and readable storage medium
CN108846097A (en) The interest tags representation method of user, article recommended method and device, equipment
CN107148553A (en) Method and system for improving Inertial Measurement Unit sensor signal
CN108834077A (en) Tracking limited region dividing method, device and electronic equipment based on user's mobility
CN113328994B (en) Malicious domain name processing method, device, equipment and machine readable storage medium
CN105990170A (en) Wafer yield analysis method and device
CN103324949B (en) The method and apparatus identifying object from image
JP2022126678A (en) Method, apparatus and system for retrieving image, electronic device, computer-readable storage medium, and computer program
CN109344176B (en) False comment detection method based on double-cycle graph
CN114117261A (en) Track detection method and device, electronic equipment and storage medium
CN111121803B (en) Method and device for acquiring common stop points of road
Galba et al. Public transportation bigdata clustering
CN116011504A (en) Fraud recognition method and device based on sensor data, electronic equipment and medium
CN102779120A (en) Method, system and device for determining field information of station and judging correlation
CN105354597B (en) A kind of classification method and device of game articles
EP3590025A2 (en) Smart context subsampling on-device system
CN108074301A (en) A kind of system of real name method and device of transportation card
CN108460057A (en) A kind of user's stroke method for digging and device based on unsupervised learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant