CN107194412A - A kind of method of processing data, device, equipment and computer-readable storage medium - Google Patents

A kind of method of processing data, device, equipment and computer-readable storage medium Download PDF

Info

Publication number
CN107194412A
CN107194412A CN201710260035.2A CN201710260035A CN107194412A CN 107194412 A CN107194412 A CN 107194412A CN 201710260035 A CN201710260035 A CN 201710260035A CN 107194412 A CN107194412 A CN 107194412A
Authority
CN
China
Prior art keywords
label
characteristic data
user characteristic
user
tags
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710260035.2A
Other languages
Chinese (zh)
Inventor
张晓迪
徐云峰
陈承泽
陈炜于
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710260035.2A priority Critical patent/CN107194412A/en
Publication of CN107194412A publication Critical patent/CN107194412A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a kind of method of processing data, device, equipment and computer-readable storage medium, the method for wherein processing data includes:Obtain the user characteristic data of the relevant information containing customer location;According to division of the customer location relevant information on geographical position, the set of tags corresponding to user characteristic data is determined;The user characteristic data is predicted using the set of tags corresponding classification submodel, the label of user is obtained;Wherein, each set of tags corresponds to a classification submodel respectively.The present invention is by obtaining with after the set of tags corresponding to user characteristic data, being predicted using with the classification submodel corresponding to set of tags to user characteristic data, so as to improve predetermined speed and prediction accuracy to user characteristic data.

Description

A kind of method of processing data, device, equipment and computer-readable storage medium
【Technical field】
The present invention relates to Map Services technical field, more particularly to a kind of method of processing data, device, equipment and calculating Machine storage medium.
【Background technology】
Existing disaggregated model, algorithmically there is SVM (support vector machine, SVMs), LR The multiple choices such as (logistic regression, logistic regression), decision tree, but depended on when realizing training and predicting Single model covers all tag along sort and feature.Therefore, prior art works as classification when carrying out the training of disaggregated model When larger, the involved tag along sort of problem and many features, then the scale of required training data also can be in disastrously Increase, so as to cause disaggregated model in finite time it is difficult to complete training, and then influence the ageing of model application and change For development efficiency.In addition, prior art is when carrying out the prediction of disaggregated model, when the tag along sort in single model and feature rule When mould is larger, predetermined speed of model can be influenceed, if in the scene of real-time estimate is carried out using disaggregated model, the prediction of model Speed can influence the response speed of real-time system.
【The content of the invention】
In view of this, the invention provides a kind of method of processing data, device, equipment and computer-readable storage medium, energy Enough lift the predetermined speed and prediction accuracy of disaggregated model to user data.
The present invention there is provided a kind of method of processing data, the side to solve the technical scheme that technical problem is used Method includes:Obtain the user characteristic data of the relevant information containing customer location;According to the customer location relevant information in geographical position The division put, determines the set of tags corresponding to user characteristic data;Using the corresponding classification submodel of the set of tags to institute State user characteristic data to be predicted, obtain the label of user;Wherein, each set of tags corresponds to a classification submodel respectively.
According to one preferred embodiment of the present invention, the classification submodel is previously obtained using following training method:Obtain Take label and the user characteristic data associated with label;According to division of each label on geographical position, to the label It is grouped;Each set of tags is included into label and the user characteristic data associated with label is as training data, point The corresponding classification submodel of each set of tags is not trained.
According to one preferred embodiment of the present invention, the division according to each label on geographical position, to the label When being grouped, the label in boundaries of packets is divided to the multiple set of tags closed on.
According to one preferred embodiment of the present invention, it is described in the label for being included each set of tags and associated with label User characteristic data as training data when, further comprise:Confidence level is less than to the user characteristics of default confidence threshold value Data are filtered out from training data.
According to one preferred embodiment of the present invention, the label includes point-of-interest, or area-of-interest.
According to one preferred embodiment of the present invention, the customer location relevant information includes gps data, Wifi information and IP At least one of address.
According to one preferred embodiment of the present invention, the drawing on geographical position according to the customer location relevant information Point, determine that the set of tags corresponding to user characteristic data includes:The geographical position letter included according to the user characteristic data Breath, spatial index or polymerization are carried out to the user characteristic data;According to spatial index or polymerization result, determine that the user is special Levy the corresponding set of tags of data.
According to one preferred embodiment of the present invention, the classification submodel using corresponding to the set of tags is to the user Characteristic is predicted, and is obtained the label of user and is included:If the classification submodel only one of which, by the user characteristics Data are sent to the classification submodel, according to predicting the outcome for the classification submodel, obtain the label of user;Or, if described Classification submodel has multiple, then sends the characteristic of each classification submodel of correspondence in user characteristic data to corresponding point In class submodel, according to the prediction amalgamation result of multiple classification submodels, the label of user is obtained.
According to one preferred embodiment of the present invention, the prediction amalgamation result according to multiple classification submodels obtains user's Label includes:The confidence level of user characteristic data determines to predict the outcome according to corresponding to each classification submodel, obtains the mark of user Label;Or, determine to predict the outcome according to the confidence level predicted the outcome obtained by each classification submodel, obtain the label of user.
According to one preferred embodiment of the present invention, the confidence level of the user characteristic data is according to geographical position, the frequency of occurrences Or at least one of signal intensity etc. is determined.
The present invention is to provide a kind of device of processing data, described device to solve the technical scheme that technical problem is used Including:Acquiring unit, the user characteristic data for obtaining the relevant information containing customer location;Determining unit, for according to described Division of the customer location relevant information on geographical position, determines the set of tags corresponding to user characteristic data;Predicting unit, is used The user characteristic data is predicted in using the set of tags corresponding classification submodel, the label of user is obtained;Its In, each set of tags corresponds to a classification submodel respectively.
According to one preferred embodiment of the present invention, described device also includes training unit, for using following training method instruction Get classification submodel:Obtain label and the user characteristic data associated with label;According to each label in geographical position On division, the label is grouped;Each set of tags is included into label and the user characteristics associated with label The corresponding classification submodel of each set of tags is respectively trained as training data in data.
According to one preferred embodiment of the present invention, the training unit is for the division according to each label on geographical position When being grouped to the label, the label in boundaries of packets is divided to the multiple set of tags closed on.
According to one preferred embodiment of the present invention, the training unit in the label for each set of tags to be included and It is specific to perform when the user characteristic data associated with label is as training data:Confidence level is less than default confidence threshold value User characteristic data filtered out from training data.
According to one preferred embodiment of the present invention, the label includes point-of-interest, or area-of-interest.
According to one preferred embodiment of the present invention, the customer location relevant information includes gps data, Wifi information and IP At least one of address.
According to one preferred embodiment of the present invention, the determining unit for according to the customer location relevant information on ground The division on position is managed, it is specific to perform when determining the set of tags corresponding to user characteristic data:According to the user characteristic data Comprising geographical location information, spatial index or polymerization are carried out to the user characteristic data;According to spatial index or polymerization As a result, the corresponding set of tags of the user characteristic data is determined.
According to one preferred embodiment of the present invention, the predicting unit is for utilizing classification corresponding to the set of tags Model is predicted to the user characteristic data, when obtaining the label of user, specific to perform:If the classification submodel only has One, then the user characteristic data is sent to the classification submodel, according to predicting the outcome for the classification submodel, used The label at family;Or, if the classification submodel has multiple, by the spy of each classification submodel of correspondence in user characteristic data Levy data to send into corresponding classification submodel, according to the prediction amalgamation result of multiple classification submodels, obtain the mark of user Label.
According to one preferred embodiment of the present invention, the predicting unit according to the prediction of multiple classification submodels for merging It is specific to perform when as a result obtaining the label of user:The confidence level of user characteristic data is determined according to corresponding to each classification submodel Predict the outcome, obtain the label of user;Or, prediction knot is determined according to the confidence level predicted the outcome obtained by each classification submodel Really, the label of user is obtained.
According to one preferred embodiment of the present invention, the confidence level of the user characteristic data is according to geographical position, the frequency of occurrences Or at least one of signal intensity etc. is determined.
As can be seen from the above technical solutions, after the present invention is by the set of tags corresponding to acquisition and user characteristic data, User characteristic data is predicted using with the classification submodel corresponding to set of tags, so as to improve to user characteristic data Predetermined speed and prediction accuracy.
【Brief description of the drawings】
The method flow diagram that Fig. 1 provides for one embodiment of the invention.
The labeled packet schematic diagram that Fig. 2 provides for one embodiment of the invention.
The structure drawing of device that Fig. 3 provides for one embodiment of the invention.
The block diagram for the computer system/server that Fig. 4 provides for one embodiment of the invention.
【Embodiment】
In order that the object, technical solutions and advantages of the present invention are clearer, below in conjunction with the accompanying drawings with specific embodiment pair The present invention is described in detail.
The term used in embodiments of the present invention is the purpose only merely for description specific embodiment, and is not intended to be limiting The present invention." one kind ", " described " and "the" of singulative used in the embodiment of the present invention and appended claims It is also intended to including most forms, unless context clearly shows that other implications.
It should be appreciated that term "and/or" used herein is only a kind of incidence relation for describing affiliated partner, represent There may be three kinds of relations, for example, A and/or B, can be represented:Individualism A, while there is A and B, individualism B these three Situation.In addition, character "/" herein, it is a kind of relation of "or" to typically represent forward-backward correlation object.
Depending on linguistic context, word as used in this " if " can be construed to " ... when " or " when ... When " or " in response to determining " or " in response to detection ".Similarly, depending on linguistic context, phrase " if it is determined that " or " if detection (condition or event of statement) " can be construed to " when it is determined that when " or " in response to determine " or " when the detection (condition of statement Or event) when " or " in response to detection (condition or event of statement) ".
The method flow diagram that Fig. 1 provides for one embodiment of the invention, as shown in fig. 1, methods described includes:
In 101, the user characteristic data of the relevant information containing customer location is obtained.
In this step, acquired user characteristic data is the data containing customer location relevant information, wherein user The relevant information of position includes at least one of gps data, Wifi information and IP address.It is i.e. special by acquired user The geographical position of user can be learnt by levying data.
Alternatively, during one of the present embodiment implements, the customer location included in user characteristic data Relevant information can be one, for example, only include gps data, or only include Wifi information, then or only comprising IP address.With Customer location relevant information included in the characteristic of family can be multiple, such as comprising gps data and Wifi information, or Person includes Wifi information and IP address, or comprising gps data and IP address, then or includes gps data, Wifi information And IP address.Wherein, gps data is the information for directly reflecting customer location;Wifi information or IP address all include ground Positional information is managed, after being positioned to Wifi information or IP address, the information of customer location is obtained.
In 102, according to division of the customer location relevant information on geographical position, user characteristic data institute is determined Corresponding set of tags.
In this step, according to geographical location information contained in user characteristic data, user characteristic data is carried out Spatial index or polymerization, then according to spatial index or polymerization result, determine the set of tags corresponding to user characteristic data.
Wherein, set of tags is divided previously according to each label on geographical position.Specifically, obtaining each After label, according to division of each label on geographical position, the close label in geographical position is divided into one group, i.e., each label All include all labels in a certain geographic range in group.
Alternatively, during one of the present embodiment implements, it can obtain and prestore when obtaining label Label, can also obtain label by way of real time scan or search.Wherein, label can be POI (point of Interest, point-of-interest), or AOI (area of interest, area-of-interest).
And when being grouped according to geographical position to each label, divide the close label in geographical position into according to preset rules One group, so as to complete the packet to label.Longitude and latitude encryption algorithm can be used to be divided, such as using Geohash algorithms, Encoded by the geographical position to acquired label, then divide all labels with identical coding into one group.May be used also To be drawn using the spacial analytical method based on GIS (Geographic InformationSystem, GIS-Geographic Information System) Point, each label in a certain geographic range is polymerize according to different-diameter scope, by each condensed together Label divides one group into.When being grouped to label, in addition it is also necessary to the number of tags under each set of tags or each set of tags Geographic range limited, so that it is guaranteed that having suitable number of tags or the suitable geographical model of covering in each set of tags Enclose, it is to avoid because number of tags is excessive or the excessive training data scale brought of geographic range becomes big unfavorable factor.
The schematic diagram for the labeled packet that Fig. 2 provides for one embodiment of the invention.As shown in Figure 2, label is POI, is used Geohash algorithms are grouped to acquired label.For example, in figure Geohash values for wtw3jnn set of tags in, have " also Occupy pet shop ", it is " clean rich to wash during Geohash values in Hu Guan " and " beautiful China photograph " three labels, figure are wtw3jnj set of tags There are " Building 26, Yongtai road Lane 595 55-57 " and " Building 27, Yongtai road Lane 595 58-59 " two labels.
In this step, using the method similar with dividing set of tags, spatial index or sky are carried out to user characteristic data Between polymerize, so that it is determined which set of tags is user characteristic data particularly belong to., will be with user for example using Geohash algorithms Corresponding geographical position is encoded in characteristic, obtains geohash values, then determines the use according to the geohash values Set of tags belonging to the characteristic of family.
Alternatively, during one of the present embodiment implements, according to the geographical location information of user characteristic data Identified set of tags can clearly indicate that user characteristic data belongs to only one of which, i.e. acquired geographical location information Any group label.The set of tags according to determined by the geographical location information of user characteristic data can also have multiple, and for example user is special The geographical location information for levying data is located at the boundary of packet label, then by the set of tags corresponding to multigroup label in border It is used as the set of tags corresponding to user characteristic data;Or for another example contain multiple geographical location information in user characteristic data When, for example user data includes gps data and Wifi information, and the geographical location information obtained according to gps data is A, and according to The geographical location information that Wifi information is obtained is B, then it is special set of tags corresponding to geographical location information A and B to be all defined as into user Levy the set of tags corresponding to data.
In 103, the user characteristic data is predicted using the set of tags corresponding classification submodel, obtained The label of user.
In this step, predict that used classification submodel is previously according to training data training during user characteristic data Obtain.And train classification submodel used in training data be each submodel included in label and with each label Associated user characteristic data.
After the packet to label is completed so that every group of label is corresponded under a submodel, i.e., each submodel respectively Include whole labels in a certain geographic range.As shown in Figure 2, such as corresponding submodels of set of tags wtw3jnm are son The corresponding submodel of model A, set of tags wtw3jjv is submodel B, by that analogy.
Label according to included in each submodel, obtains the user characteristic data associated with each label.Then Using the label in submodel and the user characteristic data associated with each label as the submodel training data.Its In, it can learn which label corresponding to this feature data be by user characteristic data, and acquired user characteristics Data can be the characteristic containing customer location relevant information, such as containing in gps data, Wifi information and IP address At least one.Can also be the characteristic for not containing customer location relevant information, for example user uses the model of mobile phone, product Board etc..
, can also be to the training data acquired in each submodel before being trained using training data to submodel Processing is optimized, so that training data used in each submodel is more accurate, and is avoided when training submodel Introduce unnecessary training data.Optimization processing to sub- model training data includes:To included in submodel label it is excellent Change and the optimization pair the user characteristic data associated with label.
Wherein, label included in submodel is optimized for:Label at boundaries of packets is respectively divided to facing Near multiple set of tags.When carrying out the division of set of tags in the geographical position according to label, it is possible in the packet of set of tags There is label in boundary, then the label at set of tags boundaries of packets is divided to the multiple set of tags closed on, so that The division of set of tags is more accurate.For example, as shown in Figure 2, label " children's theme park " is located at the set of tags closed on Wtw3jnm and set of tags wtw3jjv boundary, then label " children's theme park " is respectively divided to set of tags Wtw3jnm and set of tags wtw3jjv, so that submodel corresponding with set of tags wtw3jnm and set of tags wtw3jjv A and submodel B include label " children's theme park ".
Pair user characteristic data associated with label is optimized for:Confidence level is less than to the user of default confidence threshold value Characteristic is filtered out from training data, i.e., the user characteristic data corresponding to each label in each set of tags is carried out Filtering, so as to avoid submodel from introducing unnecessary training data.Wherein, the confidence level of user characteristic data can be according to geography At least one of position, the frequency of occurrences or signal intensity etc. are determined.
Alternatively, can be the user that will be far from label physical location during one of the present embodiment implements Characteristic is filtered, and although some characteristic of such as user is associated with a label, but this feature data of user Physical location not near the label, then by user's this feature data filtering.Can also be that will only be put in user characteristic data The higher characteristic of reliability retains, and such as confidence level of Wifi information is the intensity of Wifi signals, then only retains Wifi information The higher data of middle Wifi signal intensities, the weaker Wifi information of signal intensity is filtered.Can also be will be only a small number of The characteristic that user possesses is filtered, i.e., filtered the relatively low user characteristic data of the frequency of occurrences.Such as user's hand The only several users of the characteristic of type number possess, then are filtered the characteristic of the user mobile phone model.
Submodel is trained using the training data after optimization processing, so as to obtain classification submodel so that classification Submodel can be in training data the user characteristic data associated with label determine corresponding label.In training When, each submodel is trained using traditional disaggregated model algorithm, and because the training process of each submodel is completely only It is vertical, therefore multiple submodels are concurrently trained using distributed type assemblies technology, so that needed for considerably reducing training submodel Time.Submodel of classifying is obtained after the training to submodel is completed, it is possible to using classification submodel to user characteristics number According to being predicted.
Using the classification submodel determined in step 102 corresponding to set of tags, user characteristic data is predicted, from And obtain the label of user.I.e. by this step, POI that just can be according to where acquired user characteristic data determines user Or AOI.
In a step 102, if according to determined by the geographical location information of user characteristic data set of tags only one of which, To should set of tags classification submodel also only one of which, according to predicting the outcome for the classification submodel, determine user characteristics number According to corresponding label.If in a step 102, the set of tags according to determined by the geographical location information of user characteristic data has many When individual, the classification submodel of the multiple set of tags of correspondence also has multiple, then needs the progress that predicts the outcome to multiple classification submodels Merge, predicted the outcome merging to predict the outcome as final, so that it is determined that the label corresponding to user characteristic data.
Alternatively, during one of the present embodiment implements, can according to user corresponding to disaggregated model it is special The confidence level for levying data determines to predict the outcome.Wherein, the confidence level of user characteristic data can be according to customer location relevant information In at least one of geographical position, the frequency of occurrences or signal intensity etc. determine.Such as confidence level is user characteristic data During the intensity of middle customer location relevant information, if containing gps data and Wifi information, root in acquired user characteristic data One is obtained according to gps data to predict the outcome, obtaining another according to Wifi information predicts the outcome, but Wifi in user data The signal of information is weaker, then will be given up by predicting the outcome obtained by Wifi information, by predicting the outcome for being obtained according to gps data It is defined as the label corresponding to user characteristic data.In another example confidence level is customer location relevant information in user characteristic data During the frequency of occurrences, if containing IP address and Wifi information in acquired user characteristic data, one is obtained according to IP address Predict the outcome, obtaining another according to Wifi information predicts the outcome, but the frequency of occurrences of IP address is less than Wifi information The frequency of occurrences, then will be given up by predicting the outcome obtained by IP address, by the determination that predicts the outcome obtained according to Wifi information For the label corresponding to user characteristic data.
It can also be training in advance order models, confidence level sequence is carried out to predicting the outcome, so that according to predicting the outcome Confidence level height determines to predict the outcome.For example, being found when using training data training classification submodel, instructed using gps data User tag obtained by practicing is the most accurate, next to that using Wifi information, then gps data will be used pre- obtained by being predicted The confidence level for surveying result is set to highest, and the confidence level of Wifi information is next, then by obtained by confidence level highest gps data The label being defined as corresponding to user characteristic data that predicts the outcome.
From the foregoing it can be that user characteristic data can be predicted under multiple classification submodels simultaneously, then will Predicting the outcome obtained by multiple classification submodels merge obtain it is final predict the outcome, and due to each classification submodel Scale it is all smaller, therefore predetermined speed of Forecasting Methodology provided by the present invention is better than predetermined speed of conventional model.
Structure drawing of device provided in an embodiment of the present invention is described in detail below.As shown in Figure 3, described device includes: Training unit 31, acquiring unit 32, determining unit 33 and predicting unit 34.
Training unit 31, for obtaining submodel of classifying previously according to training data training.
The used training data when training submodel of training unit 31 includes each label and closed with each label The user characteristic data of connection.Therefore training unit 31 using training data train submodel before, it is necessary to obtain each label with And the user characteristic data associated with label.
After training unit 31 obtains each label, according to the division on each label geographical position, set of tags is obtained.Specifically Ground, training unit 31 is after each label is obtained, according to division of each label on geographical position, by the close label in geographical position It is divided into all labels included in one group, i.e., each set of tags in a certain geographic range.
Alternatively, during one of the present embodiment implements, training unit 31 can obtain pre- when obtaining label The label first stored, can also obtain label by way of real time scan or search.Wherein, label can be POI (point Of interest, point-of-interest), or AOI (area ofinterest, area-of-interest).
And training unit 31 is according to geographical position to each label when being grouped, according to preset rules by geographical position phase Near label divides one group into, so as to complete the packet to label.Training unit 31 can use longitude and latitude encryption algorithm to be drawn Point, such as using Geohash algorithms, encoded by the geographical position to acquired label, then there will be identical coding All labels divide one group into.Training unit 31 can also use based on GIS (Geographic Information System, GIS-Geographic Information System) spacial analytical method divided, using different polymerization diameters, by a certain geographic range Each label is polymerize, and divides each label condensed together into one group.
Training unit 31 to each label when being grouped, in addition it is also necessary to the number of tags under each set of tags or each The geographic range of set of tags covering is limited, so that it is guaranteed that having suitable number of tags or covering to close in each set of tags Suitable geographic range, it is to avoid because number of tags is excessive or the excessive training data scale brought of geographic range becomes greatly unfavorable Factor.
Training unit 31 is after the packet to label is completed so that every group of label corresponds to a submodel respectively, i.e., each All include whole labels in a certain geographic range under submodel.Label according to included in each submodel, training is single Member 31 obtains the user characteristic data associated with each label.Then training unit 31 by the label in submodel and with it is every The associated user characteristic data of individual label as the submodel training data.Wherein, it can be obtained by user characteristic data Know which label corresponding to this feature data be, and acquired user characteristic data can be related containing customer location The characteristic of information, such as containing at least one in gps data, Wifi information and IP address.Can also be not contain The characteristic of customer location relevant information, such as user uses the model of mobile phone, brand.
Training unit 31, can also be to acquired in each submodel before being trained using training data to submodel Training data optimize processing so that training data used in each submodel is more accurate, and avoid in instruction Unnecessary training data is introduced when practicing submodel.The optimization processing of the sub- model training data of 31 pairs of training unit includes:Antithetical phrase The optimization of label and the optimization pair the user characteristic data associated with label included in model.
Wherein, training unit 31 is optimized for label included in submodel:By the label at boundaries of packets point The multiple set of tags closed on are not divided to.When carrying out the division of set of tags in the geographical position according to label, it is possible in mark There is label at the boundaries of packets of label group, then the label at set of tags boundaries of packets is divided to the multiple labels closed on Group, so that the division of set of tags is more accurate.For example, as shown in Figure 2, label " children's theme park " is located at what is closed on Set of tags wtw3jnm and set of tags wtw3jjv boundary, then label " children's theme park " is respectively divided to set of tags Wtw3jnm and set of tags wtw3jjv, so that submodel corresponding with set of tags wtw3jnm and set of tags wtw3jjv A and submodel B include label " children's theme park ".
Training unit 31 pairs of user characteristic datas associated with label are optimized for:Confidence level is less than default confidence level The user characteristic data of threshold value is filtered out from training data, i.e., special to the user corresponding to each label in each set of tags Levy data to be filtered, so as to avoid submodel from introducing unnecessary training data.Wherein, the confidence level of user characteristic data can To be determined according at least one of geographical position, the frequency of occurrences or signal intensity etc..
Alternatively, during one of the present embodiment implements, training unit 31 can will be far from label actual bit The user characteristic data put is filtered, and although some characteristic of such as user is associated with a label, but user should The physical location of characteristic is not near the label, then by user's this feature data filtering.Training unit 31 only can also will The characteristic that confidence level is higher in user characteristic data retains, and such as confidence level of Wifi information is the intensity of Wifi signals, Then only retain the higher data of Wifi signal intensities in Wifi information, the weaker Wifi information of signal intensity is filtered.Instruction The characteristic that only a few users possess can also be filtered by practicing unit 31, such as characteristic of user mobile phone model Only several users possess, then are filtered the characteristic of the user mobile phone model.
Training unit 31 is trained using the training data after optimization processing to submodel, so as to obtain classification submodule Type so that the user characteristic data associated with label that classification submodel can be in training data determines corresponding Label.Training unit 31 is in training, and each submodel is trained using traditional disaggregated model algorithm, and due to every height The training process of model is completely independent, therefore concurrently trains multiple submodels using distributed type assemblies technology, so that greatly Reduce the time needed for training submodel.Submodel of classifying is obtained after the training to submodel is completed, it is possible to use and divide Class submodel is predicted to user characteristic data.
Acquiring unit 32, the user characteristic data for obtaining the relevant information containing customer location.
User characteristic data acquired in acquiring unit 32 is the data containing customer location relevant information, wherein user position The relevant information put includes at least one of gps data, Wifi information and IP address.I.e. by acquired in acquiring unit 32 User characteristic data can learn the geographical position of user.
Alternatively, during one of the present embodiment implements, in the user characteristic data that acquiring unit 32 is obtained Comprising customer location relevant information can be one, for example only include gps data, or only include Wifi information, then or Person only includes IP address.The customer location relevant information included in user characteristic data that acquiring unit 32 is obtained can be many It is individual, such as comprising gps data and Wifi information, either comprising Wifi information and IP address or comprising gps data and IP address, then or include gps data, Wifi information and IP address.Wherein, gps data is directly reflection customer location Information;Wifi information or IP address all include geographical location information, after being positioned to Wifi information or IP address, obtain Obtain the information of customer location.
Determining unit 33, for the division according to the customer location relevant information on geographical position, determines that user is special Levy the set of tags corresponding to data.
Determining unit 33 is carried out empty according to geographical location information contained in user characteristic data to user characteristic data Between index or polymerize, then according to spatial index or polymerization result, determine the set of tags corresponding to user characteristic data.
Determining unit 33 carries out space using the method similar with the division of training unit 31 set of tags to user characteristic data Index or spatial clustering, so that it is determined which set of tags is user characteristic data particularly belong to.For example using Geohash algorithms, It will be encoded with geographical position corresponding in user characteristic data, geohash values obtained, then according to the geohash values Determine the set of tags belonging to the user characteristic data.
Alternatively, during one of the present embodiment implements, determining unit 33 is according to the ground of user characteristic data Managing set of tags determined by positional information can clearly indicate that user is special with only one of which, i.e. acquired geographical location information Levy which group label is data belong to.The set of tags according to determined by the geographical location information of user characteristic data of determining unit 33 also may be used Multiple to have, the geographical location information of such as user characteristic data is located at the boundary of packet label, then by many of border Set of tags corresponding to group label is used as the set of tags corresponding to user characteristic data;Or for another example in user characteristic data During containing multiple geographical location information, for example user data includes gps data and Wifi information, the ground obtained according to gps data Reason positional information is A, and the geographical location information obtained according to Wifi information is B, then by corresponding to geographical location information A and B Set of tags is all defined as the set of tags corresponding to user characteristic data.
Predicting unit 34, it is pre- for being carried out using the corresponding classification submodel of the set of tags to the user characteristic data Survey, obtain the label of user.
Using set of tags determined by determining unit 33, it is right with set of tags institute that predicting unit 34 is determined by training unit 31 The classification submodel answered, then classification submodel is predicted to user characteristic data determined by the utilization of predicting unit 34, from And obtain the label of user.I.e. by predicting unit 34, user place just can be determined according to acquired user characteristic data POI or AOI.
If it is determined that the set of tags only one of which according to determined by the geographical location information of user characteristic data of unit 33, then right Should set of tags classification submodel also only one of which, predicting unit 34 predicts the outcome according to the classification submodel, it is determined that with Label corresponding to the characteristic of family.If it is determined that the label according to determined by the geographical location information of user characteristic data of unit 33 When group has multiple, the classification submodel of the multiple set of tags of correspondence also has multiple, then predicting unit 34 is needed to multiple classification submodules Predicting the outcome for type is merged, and is predicted the outcome merging to predict the outcome as final, so that it is determined that user characteristic data institute Corresponding label.
Alternatively, during one of the present embodiment implements, predicting unit 34 can be right according to disaggregated model institute The confidence level of user characteristic data is answered to determine to predict the outcome.Wherein, the confidence level of user characteristic data can be according to customer location At least one of geographical position, the frequency of occurrences or signal intensity in relevant information etc. are determined.Such as confidence level is user In characteristic during the intensity of customer location relevant information, if in acquired user characteristic data containing gps data and Wifi information, obtains one according to gps data and predicts the outcome, obtaining another according to Wifi information predicts the outcome, but uses The signal of Wifi information is weaker in user data, then will be given up by predicting the outcome obtained by Wifi information, will be obtained according to gps data To the label that is defined as corresponding to user characteristic data of predicting the outcome.In another example confidence level is user position in user characteristic data When putting the frequency of occurrences of relevant information, if containing IP address and Wifi information in acquired user characteristic data, according to IP Address obtains one and predicted the outcome, and obtaining another according to Wifi information predicts the outcome, but the frequency of occurrences of IP address will Less than the frequency of occurrences of Wifi information, then it will be given up by predicting the outcome obtained by IP address, by what is obtained according to Wifi information Predict the outcome and be defined as label corresponding to user characteristic data.
Predicting unit 34 can carry out confidence level sequence, so that according to pre- with training in advance order models to predicting the outcome The confidence level height for surveying result determines to predict the outcome.For example, being found when using training data training classification submodel, use User tag obtained by gps data training is the most accurate, next to that using Wifi information, then gps data will be used to carry out pre- The confidence level predicted the outcome obtained by surveying is set to highest, and the confidence level of Wifi information is next, then by confidence level highest GPS Predicting the outcome obtained by data is defined as the label corresponding to user characteristic data.
From the foregoing it can be that predicting unit 34 can be carried out user characteristic data under multiple classification submodels simultaneously Prediction, then predicting the outcome obtained by multiple classification submodels is merged obtain it is final predict the outcome, and due to every The scale of individual classification submodel is all smaller, therefore predetermined speed of Forecasting Methodology provided by the present invention is better than conventional model Predetermined speed.
Fig. 4 shows the frame suitable for being used for the exemplary computer system/server 012 for realizing embodiment of the present invention Figure.The computer system/server 012 that Fig. 4 is shown is only an example, to the function of the embodiment of the present invention and should not be used Range band carrys out any limitation.
As shown in figure 4, computer system/server 012 is showed in the form of universal computing device.Computer system/clothes The component of business device 012 can include but is not limited to:One or more processor or processing unit 016, system storage 028, the bus 018 of connection different system component (including system storage 028 and processing unit 016).
Bus 018 represents the one or more in a few class bus structures, including memory bus or Memory Controller, Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.Lift For example, these architectures include but is not limited to industry standard architecture (ISA) bus, MCA (MAC) Bus, enhanced isa bus, VESA's (VESA) local bus and periphery component interconnection (PCI) bus.
Computer system/server 012 typically comprises various computing systems computer-readable recording medium.These media can be appointed The usable medium what can be accessed by computer system/server 012, including volatibility and non-volatile media, movably With immovable medium.
System storage 028 can include the computer system readable media of form of volatile memory, for example, deposit at random Access to memory (RAM) 030 and/or cache memory 032.Computer system/server 012 may further include other Removable/nonremovable, volatile/non-volatile computer system storage medium.Only as an example, storage system 034 can For reading and writing immovable, non-volatile magnetic media (Fig. 4 is not shown, is commonly referred to as " hard disk drive ").Although in Fig. 4 It is not shown, it can provide for the disc driver to may move non-volatile magnetic disk (such as " floppy disk ") read-write, and pair can The CD drive of mobile anonvolatile optical disk (such as CD-ROM, DVD-ROM or other optical mediums) read-write.In these situations Under, each driver can be connected by one or more data media interfaces with bus 018.Memory 028 can include At least one program product, the program product has one group of (for example, at least one) program module, and these program modules are configured To perform the function of various embodiments of the present invention.
Program/utility 040 with one group of (at least one) program module 042, can be stored in such as memory In 028, such program module 042 includes --- but being not limited to --- operating system, one or more application program, other The realization of network environment is potentially included in each or certain combination in program module and routine data, these examples.Journey Sequence module 042 generally performs function and/or method in embodiment described in the invention.
Computer system/server 012 can also with one or more external equipments 014 (for example keyboard, sensing equipment, Display 024 etc.) communication, in the present invention, computer system/server 012 is communicated with outside radar equipment, can also be with One or more enables a user to the equipment communication interacted with the computer system/server 012, and/or with causing the meter Any equipment (such as network interface card, modulation that calculation machine systems/servers 012 can be communicated with one or more of the other computing device Demodulator etc.) communication.This communication can be carried out by input/output (I/O) interface 022.Also, computer system/clothes Business device 012 can also pass through network adapter 020 and one or more network (such as LAN (LAN), wide area network (WAN) And/or public network, such as internet) communication.As illustrated, network adapter 020 by bus 018 and computer system/ Other modules communication of server 012.It should be understood that although not shown in the drawings, computer system/server 012 can be combined Using other hardware and/or software module, include but is not limited to:Microcode, device driver, redundant processing unit, outside magnetic Dish driving array, RAID system, tape drive and data backup storage system etc..
Processing unit 016 is stored in program in system storage 028 by operation, thus perform various function application with And data processing, a kind of method of processing data is for example realized, can be included:
Obtain the user characteristic data of the relevant information containing customer location;
According to division of the customer location relevant information on geographical position, the mark corresponding to user characteristic data is determined Label group;
The user characteristic data is predicted using the set of tags corresponding classification submodel, the mark of user is obtained Label;
Wherein, each set of tags corresponds to a classification submodel respectively.
Above-mentioned computer program can be arranged in computer-readable storage medium, i.e., the computer-readable storage medium is encoded with Computer program, the program by one or more computers when being performed so that one or more computers are performed in the present invention State the method flow shown in embodiment and/or device operation.For example, by the method stream of said one or multiple computing devices Journey, can include:
Obtain the user characteristic data of the relevant information containing customer location;
According to division of the customer location relevant information on geographical position, the mark corresponding to user characteristic data is determined Label group;
The user characteristic data is predicted using the set of tags corresponding classification submodel, the mark of user is obtained Label;
Wherein, each set of tags corresponds to a classification submodel respectively.
Over time, the development of technology, medium implication is more and more extensive, and the route of transmission of computer program is no longer limited by Tangible medium, directly can also be downloaded from network etc..Any combination of one or more computer-readable media can be used. Computer-readable medium can be computer-readable signal media or computer-readable recording medium.Computer-readable storage medium Matter for example may be-but not limited to-system, device or the device of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, or Combination more than person is any.The more specifically example (non exhaustive list) of computer-readable recording medium includes:With one Or the electrical connections of multiple wires, portable computer diskette, hard disk, random access memory (RAM), read-only storage (ROM), Erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read-only storage (CD-ROM), light Memory device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer-readable recording medium can Be it is any include or storage program tangible medium, the program can be commanded execution system, device or device use or Person is in connection.
Computer-readable signal media can be included in a base band or as the data-signal of carrier wave part propagation, Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including --- but It is not limited to --- electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be Any computer-readable medium beyond computer-readable recording medium, the computer-readable medium can send, propagate or Transmit for being used or program in connection by instruction execution system, device or device.
The program code included on computer-readable medium can be transmitted with any appropriate medium, including --- but do not limit In --- wireless, electric wire, optical cable, RF etc., or above-mentioned any appropriate combination.
It can be write with one or more programming languages or its combination for performing the computer that the present invention is operated Program code, described program design language includes object oriented program language-such as Java, Smalltalk, C++, Also including conventional procedural programming language-such as " C " language or similar programming language.Program code can be with Fully perform, partly perform on the user computer on the user computer, as independent software kit execution, a portion Divide part execution or the execution completely on remote computer or server on the remote computer on the user computer. Be related in the situation of remote computer, remote computer can be by the network of any kind --- including LAN (LAN) or Wide area network (WAN) is connected to subscriber computer, or, it may be connected to outer computer (is for example provided using Internet service Business comes by Internet connection).
Using technical scheme provided by the present invention, by obtaining with after the set of tags corresponding to user characteristic data, making User characteristic data is predicted with the classification submodel corresponding to set of tags, so as to improve to user characteristic data Predetermined speed and the degree of accuracy.
In several embodiments provided by the present invention, it should be understood that disclosed system, apparatus and method can be with Realize by another way.For example, device embodiment described above is only schematical, for example, the unit Divide, only a kind of division of logic function there can be other dividing mode when actually realizing.
The unit illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs 's.
In addition, each functional unit in each embodiment of the invention can be integrated in a processing unit, can also That unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated list Member can both be realized in the form of hardware, it would however also be possible to employ hardware adds the form of SFU software functional unit to realize.
The above-mentioned integrated unit realized in the form of SFU software functional unit, can be stored in an embodied on computer readable and deposit In storage media.Above-mentioned SFU software functional unit is stored in a storage medium, including some instructions are to cause a computer Equipment (can be personal computer, server, or network equipment etc.) or processor (processor) perform the present invention each The part steps of embodiment methods described.And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (Read- Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disc or CD etc. it is various Can be with the medium of store program codes.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the invention, all essences in the present invention God is with principle, and any modification, equivalent substitution and improvements done etc. should be included within the scope of protection of the invention.

Claims (22)

1. a kind of method of processing data, it is characterised in that methods described includes:
Obtain the user characteristic data of the relevant information containing customer location;
According to division of the customer location relevant information on geographical position, the label corresponding to user characteristic data is determined Group;
The user characteristic data is predicted using the set of tags corresponding classification submodel, the label of user is obtained;
Wherein, each set of tags corresponds to a classification submodel respectively.
2. according to the method described in claim 1, it is characterised in that the classification submodel is advance using following training method Obtain:
Obtain label and the user characteristic data associated with label;
According to division of each label on geographical position, the label is grouped;
Label that each set of tags is included and the user characteristic data associated with label are instructed respectively as training data Practice the corresponding classification submodel of each set of tags.
3. method according to claim 2, it is characterised in that in the division according to each label on geographical position, to institute When stating label and being grouped, the label in boundaries of packets is divided to the multiple set of tags closed on.
4. method according to claim 2, it is characterised in that the label for being included each set of tags and with When the associated user characteristic data of label is as training data, further comprise:
The user characteristic data that confidence level is less than default confidence threshold value is filtered out from training data.
5. according to the method described in claim 1, it is characterised in that the label includes point-of-interest, or area-of-interest.
6. according to the method described in claim 1, it is characterised in that the customer location relevant information includes gps data, Wifi At least one of information and IP address.
7. according to the method described in claim 1, it is characterised in that it is described according to the customer location relevant information in geographical position The division put, determines that the set of tags corresponding to user characteristic data includes:
The geographical location information included according to the user characteristic data, to the user characteristic data carry out spatial index or Polymerization;
According to spatial index or polymerization result, the corresponding set of tags of the user characteristic data is determined.
8. according to the method described in claim 1, it is characterised in that the classification submodel using corresponding to the set of tags The user characteristic data is predicted, obtaining the label of user includes:
If the classification submodel only one of which, the user characteristic data is sent to the classification submodel, according to this point Predicting the outcome for class submodel, obtains the label of user;Or,
If the classification submodel has multiple, the characteristic of each classification submodel of correspondence in user characteristic data is sent Into corresponding classification submodel, according to the prediction amalgamation result of multiple classification submodels, the label of user is obtained.
9. method according to claim 8, it is characterised in that the prediction amalgamation result according to multiple classification submodels Obtaining the label of user includes:
The confidence level of user characteristic data determines to predict the outcome according to corresponding to each classification submodel, obtains the label of user;Or Person,
Confidence level according to being predicted the outcome obtained by each classification submodel determines to predict the outcome, and obtains the label of user.
10. the method according to claim 4 or 9, it is characterised in that the confidence level of the user characteristic data is according to geographical At least one of position, the frequency of occurrences or signal intensity etc. are determined.
11. a kind of device of processing data, it is characterised in that described device includes:
Acquiring unit, the user characteristic data for obtaining the relevant information containing customer location;
Determining unit, for the division according to the customer location relevant information on geographical position, determines user characteristic data Corresponding set of tags;
Predicting unit, for being predicted using the corresponding classification submodel of the set of tags to the user characteristic data, is obtained To the label of user;
Wherein, each set of tags corresponds to a classification submodel respectively.
12. device according to claim 11, it is characterised in that described device also includes training unit, for using such as Lower training method training obtains submodel of classifying:
Obtain label and the user characteristic data associated with label;
According to division of each label on geographical position, the label is grouped;
Label that each set of tags is included and the user characteristic data associated with label are instructed respectively as training data Practice the corresponding classification submodel of each set of tags.
13. device according to claim 12, it is characterised in that the training unit for according to each label in geography When division on position is grouped to the label, the label in boundaries of packets is divided to the multiple set of tags closed on.
14. device according to claim 12, it is characterised in that the training unit is for each set of tags to be wrapped It is specific to perform when the label that contains and the user characteristic data associated with label are as training data:
The user characteristic data that confidence level is less than default confidence threshold value is filtered out from training data.
15. device according to claim 11, it is characterised in that the label includes point-of-interest, or region of interest Domain.
16. device according to claim 11, it is characterised in that the customer location relevant information include gps data, At least one of Wifi information and IP address.
17. device according to claim 11, it is characterised in that the determining unit is for according to the customer location Division of the relevant information on geographical position, it is specific to perform when determining the set of tags corresponding to user characteristic data:
The geographical location information included according to the user characteristic data, to the user characteristic data carry out spatial index or Polymerization;
According to spatial index or polymerization result, the corresponding set of tags of the user characteristic data is determined.
18. device according to claim 11, it is characterised in that the predicting unit is for utilizing the set of tags institute Corresponding classification submodel is predicted to the user characteristic data, specific to perform when obtaining the label of user:
If the classification submodel only one of which, the user characteristic data is sent to the classification submodel, according to this point Predicting the outcome for class submodel, obtains the label of user;Or,
If the classification submodel has multiple, the characteristic of each classification submodel of correspondence in user characteristic data is sent Into corresponding classification submodel, according to the prediction amalgamation result of multiple classification submodels, the label of user is obtained.
19. device according to claim 18, it is characterised in that the predicting unit is for according to multiple classification submodules It is specific to perform when the prediction amalgamation result of type obtains the label of user:
The confidence level of user characteristic data determines to predict the outcome according to corresponding to each classification submodel, obtains the label of user;Or Person,
Confidence level according to being predicted the outcome obtained by each classification submodel determines to predict the outcome, and obtains the label of user.
20. the device according to claim 14 or 19, it is characterised in that the confidence level of the user characteristic data is according to ground At least one of position, the frequency of occurrences or signal intensity etc. is managed to determine.
21. a kind of equipment, it is characterised in that the equipment includes:
One or more processors;
Storage device, for storing one or more programs,
When one or more of programs are by one or more of computing devices so that one or more of processors are real The existing method as described in any in claim 1-10.
22. a kind of storage medium for including computer executable instructions, the computer executable instructions are by computer disposal For performing the method as described in any in claim 1-10 when device is performed.
CN201710260035.2A 2017-04-20 2017-04-20 A kind of method of processing data, device, equipment and computer-readable storage medium Pending CN107194412A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710260035.2A CN107194412A (en) 2017-04-20 2017-04-20 A kind of method of processing data, device, equipment and computer-readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710260035.2A CN107194412A (en) 2017-04-20 2017-04-20 A kind of method of processing data, device, equipment and computer-readable storage medium

Publications (1)

Publication Number Publication Date
CN107194412A true CN107194412A (en) 2017-09-22

Family

ID=59871340

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710260035.2A Pending CN107194412A (en) 2017-04-20 2017-04-20 A kind of method of processing data, device, equipment and computer-readable storage medium

Country Status (1)

Country Link
CN (1) CN107194412A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021986A (en) * 2017-10-27 2018-05-11 平安科技(深圳)有限公司 Electronic device, multi-model sample training method and computer-readable recording medium
CN108922079A (en) * 2018-06-05 2018-11-30 阿里巴巴集团控股有限公司 A kind of data processing method, article detection method, device
CN109063736A (en) * 2018-06-29 2018-12-21 考拉征信服务有限公司 Data classification method, device, electronic equipment and computer readable storage medium
CN109102159A (en) * 2018-07-18 2018-12-28 平安科技(深圳)有限公司 Passenger's rating model generation method, device, computer equipment and storage medium
CN109199604A (en) * 2018-08-31 2019-01-15 浙江大学宁波理工学院 A kind of pedicle screw based on multiple features most preferably enters independent positioning method
CN110059771A (en) * 2019-05-10 2019-07-26 合肥工业大学 A kind of interactive vehicle data classification method in the case where sequence is supported
CN110490625A (en) * 2018-05-11 2019-11-22 北京京东尚科信息技术有限公司 User preference determines method and device, electronic equipment, storage medium
CN110598122A (en) * 2018-06-13 2019-12-20 百度在线网络技术(北京)有限公司 Social group mining method, device, equipment and storage medium
CN111629319A (en) * 2019-02-28 2020-09-04 中国移动通信有限公司研究院 Position prediction method and device
CN112699206A (en) * 2021-03-23 2021-04-23 上海迹寻科技有限公司 User position and residence analysis method and device
CN114065947A (en) * 2021-11-15 2022-02-18 深圳大学 Data access speculation method and device, storage medium and electronic equipment

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102047249A (en) * 2008-05-27 2011-05-04 高通股份有限公司 Method and apparatus for aggregating and presenting data associated with geographic locations
CN103093237A (en) * 2013-01-15 2013-05-08 中国科学院自动化研究所 Face detecting method based on structural model
CN103455545A (en) * 2012-06-04 2013-12-18 国际商业机器公司 Location estimation of social network users
CN103945533A (en) * 2014-05-15 2014-07-23 济南嘉科电子技术有限公司 Big data based wireless real-time position positioning method
CN104463633A (en) * 2014-12-19 2015-03-25 成都品果科技有限公司 User segmentation method based on geographic position and interest point information
CN106131959A (en) * 2016-08-11 2016-11-16 电子科技大学 A kind of dual-positioning method divided based on Wi Fi signal space
CN106462627A (en) * 2014-12-29 2017-02-22 谷歌公司 Analyzing semantic places and related data from a plurality of location data reports
CN106507476A (en) * 2016-11-22 2017-03-15 上海斐讯数据通信技术有限公司 A kind of WiFi localization methods and server and location model construction method

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102047249A (en) * 2008-05-27 2011-05-04 高通股份有限公司 Method and apparatus for aggregating and presenting data associated with geographic locations
CN103455545A (en) * 2012-06-04 2013-12-18 国际商业机器公司 Location estimation of social network users
CN103093237A (en) * 2013-01-15 2013-05-08 中国科学院自动化研究所 Face detecting method based on structural model
CN103945533A (en) * 2014-05-15 2014-07-23 济南嘉科电子技术有限公司 Big data based wireless real-time position positioning method
CN104463633A (en) * 2014-12-19 2015-03-25 成都品果科技有限公司 User segmentation method based on geographic position and interest point information
CN106462627A (en) * 2014-12-29 2017-02-22 谷歌公司 Analyzing semantic places and related data from a plurality of location data reports
CN106131959A (en) * 2016-08-11 2016-11-16 电子科技大学 A kind of dual-positioning method divided based on Wi Fi signal space
CN106507476A (en) * 2016-11-22 2017-03-15 上海斐讯数据通信技术有限公司 A kind of WiFi localization methods and server and location model construction method

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021986A (en) * 2017-10-27 2018-05-11 平安科技(深圳)有限公司 Electronic device, multi-model sample training method and computer-readable recording medium
CN110490625A (en) * 2018-05-11 2019-11-22 北京京东尚科信息技术有限公司 User preference determines method and device, electronic equipment, storage medium
CN108922079A (en) * 2018-06-05 2018-11-30 阿里巴巴集团控股有限公司 A kind of data processing method, article detection method, device
CN110598122B (en) * 2018-06-13 2022-04-01 百度在线网络技术(北京)有限公司 Social group mining method, device, equipment and storage medium
CN110598122A (en) * 2018-06-13 2019-12-20 百度在线网络技术(北京)有限公司 Social group mining method, device, equipment and storage medium
CN109063736A (en) * 2018-06-29 2018-12-21 考拉征信服务有限公司 Data classification method, device, electronic equipment and computer readable storage medium
CN109102159A (en) * 2018-07-18 2018-12-28 平安科技(深圳)有限公司 Passenger's rating model generation method, device, computer equipment and storage medium
CN109199604B (en) * 2018-08-31 2020-12-01 浙江大学宁波理工学院 Multi-feature-based pedicle screw optimal entry point positioning method
CN109199604A (en) * 2018-08-31 2019-01-15 浙江大学宁波理工学院 A kind of pedicle screw based on multiple features most preferably enters independent positioning method
CN111629319A (en) * 2019-02-28 2020-09-04 中国移动通信有限公司研究院 Position prediction method and device
CN111629319B (en) * 2019-02-28 2022-05-31 中国移动通信有限公司研究院 Position prediction method and device
CN110059771B (en) * 2019-05-10 2021-01-15 合肥工业大学 Interactive vehicle data classification method under ordering support
CN110059771A (en) * 2019-05-10 2019-07-26 合肥工业大学 A kind of interactive vehicle data classification method in the case where sequence is supported
CN112699206A (en) * 2021-03-23 2021-04-23 上海迹寻科技有限公司 User position and residence analysis method and device
CN112699206B (en) * 2021-03-23 2021-07-13 上海迹寻科技有限公司 User position and residence analysis method and device
CN114065947A (en) * 2021-11-15 2022-02-18 深圳大学 Data access speculation method and device, storage medium and electronic equipment
CN114065947B (en) * 2021-11-15 2022-07-22 深圳大学 Data access speculation method and device, storage medium and electronic equipment

Similar Documents

Publication Publication Date Title
CN107194412A (en) A kind of method of processing data, device, equipment and computer-readable storage medium
CN107067003A (en) Extracting method, device, equipment and the computer-readable storage medium of region of interest border
CN105091890B (en) Method and apparatus for recommended candidate path
CN107133645B (en) Method, equipment and storage medium for predicting order cancelling behavior of passenger
CN110378483A (en) The system and method for training machine learning model being deployed on analog platform
CN104737565A (en) Method relating to predicting the future state of a mobile device user
CN106875066A (en) With the Forecasting Methodology of car travel behaviour, device, server and storage medium
CN106971279A (en) Estimate method, device, equipment and the storage medium of driver's order behavior
CN107038157A (en) Identification error detection method, device and storage medium based on artificial intelligence
CN109710709A (en) Interest point data processing method, device, electronic equipment and storage medium
CN107133697A (en) Estimate method, device, equipment and the storage medium of driver's order wish
US11288533B2 (en) Systems and methods for identifying a service qualification of a unit of a community
CN106934026A (en) Trip services mining method, device, equipment and storage medium
CN110097121A (en) A kind of classification method of driving trace, device, electronic equipment and storage medium
CN111044045A (en) Navigation method and device based on neural network and terminal equipment
CN106767835A (en) Localization method and device
CN111522838A (en) Address similarity calculation method and related device
EP3443482A1 (en) Classifying entities in digital maps using discrete non-trace positioning data
CN110471999A (en) Trajectory processing method, apparatus, equipment and medium
US20190171745A1 (en) Open ended question identification for investigations
US20200286243A1 (en) Iterative approach for weakly-supervised action localization
CN109635868A (en) Determination method, apparatus, electronic equipment and the storage medium of barrier classification
US11220269B2 (en) Fording depth estimation
JP2021047184A (en) Method, apparatus, device and computer readable storage medium to be used in navigation
US20180357190A1 (en) Memory sharing for application offload from host processor to integrated sensor hub

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination