WO2018191918A1 - System and method for learning-based group tagging - Google Patents
System and method for learning-based group tagging Download PDFInfo
- Publication number
- WO2018191918A1 WO2018191918A1 PCT/CN2017/081279 CN2017081279W WO2018191918A1 WO 2018191918 A1 WO2018191918 A1 WO 2018191918A1 CN 2017081279 W CN2017081279 W CN 2017081279W WO 2018191918 A1 WO2018191918 A1 WO 2018191918A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- users
- data
- subset
- platform
- data fields
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/23—Updating
- G06F16/2379—Updates performed during online database operations; commit processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
- G06F18/24147—Distances to closest patterns, e.g. nearest neighbour classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F7/00—Methods or arrangements for processing data by operating upon the order or content of the data handled
- G06F7/06—Arrangements for sorting, selecting, merging, or comparing data on individual record carriers
- G06F7/20—Comparing separate sets of record carriers arranged in the same sequence to determine whether at least some of the data in one set is identical with that in the other set or sets
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
- G06N5/025—Extracting rules from data
Definitions
- This disclosure generally relates to approaches and techniques for user tagging and learning-based tagging.
- a platform may provide various services to users. To facilitate user service and management, it is desirable to organize the users in groups. This process can bring many challenges, especially if the number of users becomes large.
- a computing system for group tagging may comprise one or more processors accessible to platform data and a memory storing instructions that, when executed by the one or more processors, cause the computing system to perform a method.
- the platform data may comprise a plurality of users and a plurality of associated data fields.
- the method may comprise: obtaining a first subset of users and one or more first tags associated with the first subset of users, determining, respectively for one or more of the associated data fields, at least a difference between the first subset of users and at least a part of the plurality of users, in response to determining the difference exceeding a first threshold, determining the corresponding data field as a key data field, determining data of the corresponding one or more key data fields associated with the first subset of users as positive samples, obtaining, based on the one or more key data fields, a second subset of users and associated data from the platform data as negative samples, and training a rule model with the positive and negative samples to obtain a trained group tagging rule model.
- the platform data may comprise tabular data corresponding to each of the plurality of users, and the data fields may comprise at least one of data dimension or data metric.
- the plurality of users may be users of the platform, the platform may be a vehicle information platform, and the data fields may comprise at least one of a location, a number of uses, a transaction amount, or a number of complaints.
- obtaining a first subset of users may comprise receiving identifications of the first subset of users from one or more analysts without full access to the platform data.
- the platform data may not comprise the first tags before the server obtaining the first subset of users.
- the difference may be a Kullback-Leibler divergence.
- the second subset of users may be different from the first subset of users over a third threshold based on a similarity measurement with respect to the one or more key data fields.
- the rule model may be a decision tree model.
- the trained group tagging rule model may determine whether to assign one or more of the plurality of users the first tags.
- the server is further configured to perform applying the trained group tagging rule model to tag the plurality of users and new users added to the plurality of users.
- a group tagging method may comprise obtaining a first subset of a plurality of entities of a platform.
- the first subset of entities may be tagged with first tags, and platform data may comprise data of the plurality of entities with respect to a one or more data fields.
- the group tagging method may further comprise determining at least a difference between data of one or more data fields of the first subset of entities and that of some other entities of the plurality of entities.
- the group tagging method may further comprise, in response to determining the difference exceeding a first threshold, obtaining corresponding data associated with the first subset of entities as positive samples, and corresponding data associated with a second subset of the plurality of entities as negative samples.
- the group tagging method may further comprise training a rule model with the positive and negative samples to obtain a trained group tagging rule model.
- the trained group tagging rule model may determine if an existing or new entity is entitled to the first tag.
- FIGURE 1 illustrates an example environment for group tagging, in accordance with various embodiments.
- FIGURE 2 illustrates an example system for group tagging, in accordance with various embodiments.
- FIGURE 3A illustrates example platform data, in accordance with various embodiments.
- FIGURE 3B illustrates example platform data with first tags, in accordance with various embodiments.
- FIGURE 3C illustrates example platform data with determined positive and negative samples and key data fields, in accordance with various embodiments.
- FIGURE 3D illustrates example platform data with tagged groups, in accordance with various embodiments.
- FIGURE 4A illustrates a flowchart of an example method for group tagging, in accordance with various embodiments.
- FIGURE 4B illustrates a flowchart of another example method for group tagging, in accordance with various embodiments.
- FIGURE 5 illustrates a block diagram of an example computer system in which any of the embodiments described herein may be implemented.
- Group tagging is essential to effective user management. This method can bring a large amount of data into order, and create a basis for further data manipulation, analysis derivation, and value creation. Without group tagging, data processing becomes inefficient, especially when the data volume scales up. Even if a small portion of the data may be tagged manually based on certain “local tagging rules, ” such rules are not verified across the global data and may not be appropriate to use globally as is. Further, for various reasons such as data security, limited job responsibility, and lack of skill background, analysts who have direct user interactions to collect first-hand data and perform manual tagging may not be allowed to access the global data, further limiting the extrapolating of the “local tagging rules” to “global tagging rules. ”
- operation and customer service analysts may directly interact with customers and accumulate the first-hand data.
- the analysts may also create certain “local tagging rules” based on the interactions, for example, categorizing users of certain similar background or characteristics together.
- the analysts have restricted authorization to the entire platform data and may not access all information associated each user.
- engineers who have access to the platform data may lack the customer interaction experiences and bases for creating “global tagging rules. ” Therefore, it is desirable to utilize the first-hand interaction, refine the “local tagging rules, ” and obtain “global tagging rules” which are appropriate and applicable to the platform data in large-scale.
- a computing system may perform a group tagging method.
- the group tagging method may comprise obtaining a first subset of a plurality of entities (e.g., users, objects, virtual representations, etc. ) of a platform.
- the first subset of entities may be each tagged with a first tag following a tagging rule, which may be deemed as a “local tagging rule, ” and platform data may comprise data of the plurality of entities with respect to a one or more data fields.
- the group tagging method may further comprise determining at least a difference between data of one or more data fields of the first subset of entities and that of some other entities of the plurality of entities.
- the group tagging method may further comprise, in response to determining the difference exceeding a first threshold in certain data field (s) of the one or more data fields, obtaining corresponding data associated with the first subset of entities as positive samples, and obtaining corresponding data associated with a second subset of the plurality of entities of which the data is substantially different from that of the first subset of entities in the certain data field (s) as negative samples.
- the substantial difference can be determined based on a similarity measurement method.
- the group tagging method may further comprise training a rule model with the positive and negative samples to obtain a trained group tagging rule model.
- the trained group tagging rule model can be applied to a part or all of the platform data to determine if an existing or new entity is entitled to the first tag. This determination can be deemed as a “global tagging rule. ”
- the entities may comprise users of a platform.
- a computing system for group tagging may comprise a server accessible to platform data.
- the platform data may comprise a plurality of users and a plurality of associated data fields.
- the server may comprise one or more processors accessible to platform data and a memory storing instructions that, when executed by the one or more processors, cause the computing system to obtain a first subset of users and one or more first tags associated with the first subset of users.
- the instruction may further cause the computing system to determine, respectively for one or more of the associated data fields, at least a difference between the first subset of users and at least a part of the plurality of users.
- the instruction may further cause the computing system to, in response to determining the difference exceeding a first threshold, determine the corresponding data field as a key data field.
- the instruction may further cause the computing system to determine data of the corresponding one or more key data fields associated with the first subset of users as positive samples.
- the instruction may further cause the computing system to obtain, based on the one or more key data fields, a second subset of users and associated data from the platform data as negative samples, the associated data of the second subset of users being substantially different from that of the first subset of entities.
- the instruction may further cause the computing system to train a rule model with the positive and negative samples to reach a second accuracy threshold (e.g., a threshold of predetermined 98%accurate) to obtain a trained group tagging rule model.
- a second accuracy threshold e.g., a threshold of predetermined 98%accurate
- the platform may be a vehicle information platform.
- the platform data may comprise tabular data corresponding to each of the plurality of users, and the data fields may comprise at least one of data dimension or data metric.
- the plurality of users may be users of the platform, and the data fields may comprise at least one of a location of the user, a number of uses of the platform service by the user, a transaction amount, or a number of complaints.
- FIG. 1 illustrates an example environment 100 for group tagging, in accordance with various embodiments.
- the example environment 100 can comprise at least one computing system 102 that includes one or more processors 104 and memory 106.
- the memory 106 may be non-transitory and computer-readable.
- the memory 106 may store instructions that, when executed by the one or more processors 104, cause the one or more processors 104 to perform various operations described herein.
- the environment 100 may also include one or more computing devices 110, 111, 112, and 120 (e.g., cellphone, tablet, computer, wearable device (smart watch) , etc. ) coupled to the system 102.
- the computing devices may transmit/receive data to/from the system 102 according to their access and authorization levels.
- the environment 100 may further include one or more data stores (e.g., data stores 108 and 109) that are accessible to the system 102. The data in the data stores may be associated with different access authorization levels.
- the system 102 may be referred to as an information platform (e.g., a vehicle information platform providing information of vehicles, which can be provided by one party to service another party, shared by multiple parties, exchanged among multiple parties, etc. ) .
- Platform data may be stored in the data stores (e.g., data stores 108, 109, etc. ) and/or the memory 106.
- the computing device 120 may be associated with a user of the platform (e.g., a user’s cellphone installed with an Application of the platform) .
- the computing device 120 may have no access to the data stores, except for which processed and fed by the platform.
- the computing devices 110 and 111 may be associated with analysts with limited access and authorization to the platform data.
- the computing device 112 may be associated with engineers with full access and authorization to the platform data.
- the system 102 and one or more of the computing devices may be integrated in a single device or system.
- the system 102 and the computing devices may operate as separate devices.
- the computing devices 110, 111, and 112 may be computers or mobile devices, and the system 102 may be a server.
- the data store (s) may be anywhere accessible to the system 102, for example, in the memory 106, in the computing devices 110, 111, or 112, in another device (e.g., network storage device) coupled to the system 102, or another storage location (e.g., cloud-based storage system, network file system, etc. ) , etc.
- system 102 the computing devices 110, 111, 112, and 120, and/or the data stores 108 and 109 may be able to communicate with one another through one or more wired or wireless networks (e.g., the Internet) through which data can be communicated.
- wired or wireless networks e.g., the Internet
- FIG. 2 illustrates an example system 200 for group tagging, in accordance with various embodiments.
- the computing device 120 may interact with the system 102 (e.g., registering new users, ordering services, transacting payments, etc. ) , and the corresponding information may be stored at least as a part of platform data 202 in the data stores 108, 109 and/or the memory 106, and accessible to the system 102. Further interactions among the system 200 are described below with references to FIGs. 3A-D.
- FIG. 3A illustrates example platform data 300, in accordance with various embodiments.
- the description of FIG. 3A is intended to be illustrative and may be modified in various ways according to the implementation.
- the platform data may be stored in one or more formats such as tables, objects, etc.
- the platform data may comprise tabular data corresponding to each of the plurality of entities (e.g., Users such as User A, B, C, etc. ) of the platform.
- the system 102 may be accessible to platform data comprising a plurality of users and a plurality of associated data fields (e.g., “City, ” “Device, ” “Number of use, ” “Payment, ” “Complaints, ” etc. ) .
- platform data comprising a plurality of users and a plurality of associated data fields (e.g., “City, ” “Device, ” “Number of use, ” “Payment, ” “Complaints, ” etc. ) .
- account information e.g., address, city, phone number, payment method, etc.
- user history e.g., device used to access the platform, number of service uses, payment transaction, complaints made, etc.
- the account information and user history may be stored in the various data fields associated with the user.
- the data fields may be presented as data columns.
- the data fields may include dimensions and metrics.
- the dimensions may comprise attributes of the data. For example, “City” indicates the city location of a user, and “Device” indicates the device used to access the platform.
- the metrics may comprise quantitative measurements. For example, “number of use” indicates a number of times the user has used the platform service, “Payment” indicates a total amount of transaction between the user and the platform, and “Complaints” indicates a number of times the user have complained to the platform.
- analysts and engineers (or other groups of people) of the platform may have different access levels to the platform data.
- the analysts may include operation, customer service, and technical support teams.
- the analysts may only have access to data in “Users, ” “City, ” and “Complaints” columns and only have authorization to edit the “Complaints” column.
- the engineers may include data scientists, back-end engineers, and researcher teams. The engineers may have full access and authorization to edit all columns of the platform data 300.
- computing devices 110 and 111 may be controlled and operated by analysts with limited access and authorization to the platform data. Based on user interaction or other experiences, the analysts may determine “local rules” to tag some users. For example, the analysts may tag a first user subset of the platform users and submit the tag information 204 (e.g., user IDs for the first user subset) to the system 102.
- FIG. 3B illustrates example platform data 310 with first tags, in accordance with various embodiments. The description of FIG. 3B is intended to be illustrative and may be modified in various ways according to the implementation.
- the platform data 310 is similar to the platform data 300 described above, except for the addition of the first tags C1.
- the system 102 may obtain a first subset of users from the plurality of users and the one or more first tags associated with the first subset of users (e.g., by receiving the first user subset and tag information 204) .
- the platform data may not comprise the first tags before the system 102 (e.g., server) obtaining the first subset of users.
- the system 102 may incorporate the obtained information (e.g., the tag information 204) to the platform data (e.g., by adding the “Group tag” column to the platform data 300) .
- the first user subset identified by the analysts may include “User A” corresponding to “14” complaints and “User B” corresponding to “19” complaints.
- the analysts may have tagged both “User A” and “User B” as “C1. ”
- tagging “User A” and “User B as “C1” may be referred to as a “local rule, ” and it is to be determined how this “local rule” can be synthesized and extrapolated to other platform users as a “global rule. ”
- computing device 112 may be controlled and operated by engineers with full access and authorization to the platform data. Based on the “local rules” and the platform data, the engineers may send queries 206 (e.g., instructions, commands, etc. ) to the system 102 to perform the learning-based group tagging.
- queries 206 e.g., instructions, commands, etc.
- FIG. 3C illustrates example platform data 320 with determined positive and negative samples and key data fields, in accordance with various embodiments. The description of FIG. 3C is intended to be illustrative and may be modified in various ways according to the implementation.
- the platform data 320 is similar to the platform data 310 described above.
- the system 102 may determine, respectively for one or more of the associated data fields, at least a difference between the first subset of users and at least a part of the plurality of users. For example, the system 102 may determine, respectively for one or more of the “City, ” “Device, “Number of use, ” “Payment, ” and “Complaints” columns, at least a difference (e.g., a Kullback-Leibler divergence) between the data of the first subset of users (e.g., User A and User B) and the data of at least a part of the platform users (e.g., all platform users, all platform users except for User A and User B, the next 500 users, etc. ) .
- a difference e.g., a Kullback-Leibler divergence
- the system 102 may determine the corresponding data field as a key data field, and determine data of the corresponding one or more key data fields associated with the first subset of users as positive samples.
- This first threshold may be predetermined.
- the predetermined threshold or other property may be preset by the system (e.g., the system 102) or operators (e.g., analysts, engineers, etc. ) associated with the system. For example, by analyzing the “Payment” data of the first user subset against that of other platform users (e.g., all other platform users) , the system 102 may determine that the difference exceeds a first predetermined threshold (e.g., above an average of 500 of all other platform users) .
- a first predetermined threshold e.g., above an average of 500 of all other platform users
- the platform 102 may determine the “Payment” data field as a key data field and obtain “User A-Payment 1500-Group Tag C1” and “User B-payment 823-Group Tag C1” as positive samples.
- the key data fields may include more than one data field, and the data fields can include dimension and/or metric, such as “City” and “Payment. ”
- “User A-City XYZ-Payment 1500-Group Tag C1” and “User B-City XYZ-payment 823-Group Tag C1” may be used as positive samples.
- the first predetermined threshold for data field “City” may be that cities in different provinces or states.
- the system 102 may obtain a second subset of users from the plurality of users and associated data of the second subset of users from the platform data as negative samples.
- the system 102 may assign a tag to the negative samples for training. For example, the system 102 may obtain “User C-City KMN-Payment 25-Group Tag NC1” and “User D-City KMN-payment 118-Group Tag NC1” as negative samples.
- the second subset of users may be different from the first subset of users over a third threshold (e.g., a third predetermined threshold) based on a similarity measurement with respect to the one or more key data fields.
- the similarity measurement can determine how similar a group of users is to another group, by obtaining a “distance” among the one or more key data fields associated with different users or user groups and comparing with distance thresholds.
- the similarity measurement can be implemented by various methods, such as (standardized) Euclidean distance method, Manhattan distance method, Chebyshev distance method, Minkowski distance method, Mahalanobis distance method, Cosine method, Hamming distance method, Jaccard similarity coefficient method, correlation coefficient and distance method, information entropy method, etc.
- the “distance” between two users S and T is if the user S has a property m1 for a data field and the user T has a property m2 for the same data field.
- the distance between two users S and T is if the user S has properties m1 and n1 for two data fields respectively and the other user T has properties m2 and n2 for the corresponding data fields.
- many methods can be used to obtain the “distance” between two groups of users. For example, every pair of users from two groups can be compared, user properties of users in each group can be averaged or otherwise represented by one representing user to compare with that of another representing user, etc.
- the distances among the plurality of uses or user groups can be determined, and a second subset of users sufficiently away (having a “distance” above a preset threshold) from the first subset of users can be determined.
- the data associated with the second subset of users can be used as negative samples.
- various properties (m1, n1, .... ) of a user S and various properties (m2, n2, .... ) of another user T can be treated as vectors.
- the “distance” between the two users is the angle between the two vectors.
- the “distance” between users S (m1, n1) and T (m2, n2) is ⁇ , where cos ⁇ is in the range between -1 and 1. The closer cos ⁇ is to 1, the more similar the two users are to each other. The same principle applies with even more data fields. Further, many methods can be used to obtain the “distance” between two groups of users.
- every pair of users from two groups can be compared, user properties of users in each group can be averaged or otherwise represented by one representing user to compare with that of another representing user, etc.
- the distances among the plurality of uses or user groups can be determined, and a second subset of users sufficiently away (having a “distance” above a preset threshold) from the first subset of users can be determined.
- the data associated with the second subset of users can be used as negative samples.
- the Euclidean distance method, Cosine method, or another similarity measurement method can also be directly used or modified into a k-nearest neighbor method.
- the k-nearest neighbor determination can be used for classification or regression based on the “distance” determination.
- an object e.g., platform user
- the object can be classified by a majority vote of its neighbors, with the object being assigned to the class most common among its k nearest neighbor.
- square root differences between data of the first subset users and data of other users can be calculated, and users corresponding to a difference from the first subset users above a third predetermined threshold can be used as negative samples.
- objects e.g., platform users
- properties e.g., data fields
- Each portion of congregated data points may be determined as a classified group by the k-nearest neighbor method, such that a group corresponding to the negative samples are away from another group corresponding to the positive samples above the third predetermined threshold. For example, if a user corresponds to two data fields, the user can be mapped on a x-y plane with each axis corresponding to a data field.
- An area corresponding to the positive samples on the x-y plane is away from another area corresponding to the negative samples for a distance above the third predetermined threshold.
- data points can classified by the k-nearest neighbor method, and the negative samples can be determined based on a substantial difference from the positive samples.
- the system 102 may train a rule model (e.g., a decision tree rule model) with the positive and negative samples until reaching a second accuracy threshold to obtain a trained group tagging rule model.
- a rule model e.g., a decision tree rule model
- the second accuracy threshold may be preset.
- the depth of the decision tree model may be preset (e.g., three levels of depth to limit the complexity) .
- the number of decision trees may be preset to add “or” conditions for decision making (e.g., parallel decision trees can represent “or” conditions and branches in the same decision tree can represent “and” conditions for determining group tagging decisions) .
- the decision tree model can have more flexibility in decision making, thus improving its accuracy.
- the decision tree rule model can be based on decision tree learning which uses a decision tree as a predictive model.
- the predictive model may map observations about an item (e.g., data field values of a platform user) to conclusions of the item’s target value (e.g., tag C1) .
- target value e.g., tag C1
- the trained rule model can comprise logic algorithms to automatically tag other samples.
- the logic algorithms may be consolidated based at least in part on decisions made at each level or depth of each tree.
- the trained group tagging rule model may determine whether to assign one or more of the plurality of users the first tags, and tag one or more of the platform users and/or new users added to the platform, as shown in FIG. 3D.
- the description of FIG. 3D is intended to be illustrative and may be modified in various ways according to the implementation.
- system 102 may tag “User C” and “User D” as “C2, ” and tag “User E” as “C1. ”
- the train model may also include “City” as a key data field with a more significant weight than that of “Payment.
- the system 102 may tag a new user “User F” as “C1, ” even though the new user has no transaction with the platform yet.
- the group tagging rule can be used to both analyze existing data and predict group tags for new data.
- computing device 111 (or computing device 110) can view the group tags by sending query 208 and receive tagged user 210. Further, the computing device may refine the trained group tagging rule model via the query 208, for example, by correcting the tags for one or more users. If computing device 120 registers a new user with the system 102, the “global tagging rule” can be applied to predictively tag the new user.
- the “local tagging rules” having a high level of reliability and accuracy can be synthesized by comparing with other platform data to obtain “global tagging rules. ”
- the “global tagging rules” incorporate the characteristics defined in the “local tagging rules” and are applicable across the platform data. The process can be automated by the learning process described above, thus achieving the group tagging task unattainable by the analysts with a high efficiency.
- FIG. 4A illustrates a flowchart of an example method 400, according to various embodiments of the present disclosure.
- the method 400 may be implemented in various environments including, for example, the environment 100 of FIG. 1.
- the operations of method 400 presented below are intended to be illustrative. Depending on the implementation, the example method 400 may include additional, fewer, or alternative steps performed in various orders or in parallel.
- the example method 400 may be implemented in various computing systems or devices including one or more processors of one or more servers.
- a first subset of users may be obtained from a plurality of users, and one or more first tags associated with the first subset of users may be obtained.
- the plurality of users and a plurality of associated data fields may be a part of platform data.
- the first subset may be obtained first-hand from analysts or operators.
- at least a difference between the first subset of users and at least a part of the plurality of users may be determined respectively for one or more of the associated data fields.
- the corresponding data field may be determined as a key data field.
- the block 406 may be performed for one or more of the associated data fields to obtain one or more key data fields.
- data of the corresponding the one or more key data fields associated with the first subset of users may be obtained as positive samples.
- a second subset of users may be obtained from the plurality of users, and associated data from the platform data may be obtained as negative samples.
- the negative samples may be substantially different from the positive samples, and can be obtained as discussed above.
- a rule model may be trained with the positive and negative samples to reach a second accuracy threshold to obtain a trained group tagging rule model.
- the trained group tagging rule model can be applied to tag the plurality of users and new users added to the plurality of users, such that the users can be automatically organized in desirable categories.
- FIG. 4B illustrates a flowchart of an example method 420, according to various embodiments of the present disclosure.
- the method 420 may be implemented in various environments including, for example, the environment 100 of FIG. 1.
- the operations of method 420 presented below are intended to be illustrative. Depending on the implementation, the example method 420 may include additional, fewer, or alternative steps performed in various orders or in parallel.
- the example method 420 may be implemented in various computing systems or devices including one or more processors of one or more servers.
- a first subset of a plurality of entities of a platform is obtained.
- the first subset of entities are tagged with first tags, and platform data comprises data of the plurality of entities with respect to a one or more data fields.
- at least a difference is determined between data of one or more data fields of the first subset of entities and that of some other entities of the plurality of entities.
- corresponding data associated with the first subset of entities as positive samples are obtained, and corresponding data associated with a second subset of the plurality of entities as negative samples are obtained.
- the negative samples may be substantially different from the positive samples, and can be obtained as discussed above.
- a rule model is trained with the positive and negative samples to obtain a trained group tagging rule model. The trained group tagging rule model determines if an existing or new entity is entitled to the first tag.
- the techniques described herein are implemented by one or more special-purpose computing devices.
- the special-purpose computing devices may be hard-wired to perform the techniques, or may include circuitry or digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the techniques, or may include one or more hardware processors programmed to perform the techniques pursuant to program instructions in firmware, memory, other storage, or a combination.
- ASICs application-specific integrated circuits
- FPGAs field programmable gate arrays
- Such special-purpose computing devices may also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the techniques.
- the special-purpose computing devices may be desktop computer systems, server computer systems, portable computer systems, handheld devices, networking devices or any other device or combination of devices that incorporate hard-wired and/or program logic to implement the techniques.
- Computing device (s) are generally controlled and coordinated by operating system software.
- Conventional operating systems control and schedule computer processes for execution, perform memory management, provide file system, networking, I/O services, and provide a user interface functionality, such as a graphical user interface ( “GUI” ) , among other things.
- GUI graphical user interface
- FIG. 5 is a block diagram that illustrates a computer system 500 upon which any of the embodiments described herein may be implemented.
- the system 500 may correspond to the system 102 described above.
- the computer system 500 includes a bus 502 or other communication mechanism for communicating information, one or more hardware processors 504 coupled with bus 502 for processing information.
- Hardware processor (s) 504 may be, for example, one or more general purpose microprocessors.
- the processor (s) 504 may correspond to the processor 104 described above.
- the computer system 500 also includes a main memory 506, such as a random access memory (RAM) , cache and/or other dynamic storage devices, coupled to bus 502 for storing information and instructions to be executed by processor 504.
- Main memory 506 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 504. Such instructions, when stored in storage media accessible to processor 504, render computer system 500 into a special-purpose machine that is customized to perform the operations specified in the instructions.
- the computer system 500 further includes a read only memory (ROM) 508 or other static storage device coupled to bus 502 for storing static information and instructions for processor 504.
- ROM read only memory
- a storage device 510 such as a magnetic disk, optical disk, or USB thumb drive (Flash drive) , etc., is provided and coupled to bus 502 for storing information and instructions.
- the main memory 506, the ROM 508, and/or the storage 510 may correspond to the memory 106 described above.
- the computer system 500 may implement the techniques described herein using customized hard-wired logic, one or more ASICs or FPGAs, firmware and/or program logic which in combination with the computer system causes or programs computer system 500 to be a special-purpose machine. According to one embodiment, the techniques herein are performed by computer system 500 in response to processor (s) 504 executing one or more sequences of one or more instructions contained in main memory 506. Such instructions may be read into main memory 506 from another storage medium, such as storage device 510. Execution of the sequences of instructions contained in main memory 506 causes processor (s) 504 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions.
- the main memory 506, the ROM 508, and/or the storage 510 may include non-transitory storage media.
- non-transitory media, ” and similar terms, as used herein refers to any media that store data and/or instructions that cause a machine to operate in a specific fashion. Such non-transitory media may comprise non-volatile media and/or volatile media.
- Non-volatile media includes, for example, optical or magnetic disks, such as storage device 510.
- Volatile media includes dynamic memory, such as main memory 506.
- non-transitory media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge, and networked versions of the same.
- the computer system 500 also includes a communication interface 518 coupled to bus 502.
- Communication interface 518 provides a two-way data communication coupling to one or more network links that are connected to one or more local networks.
- communication interface 518 may be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line.
- ISDN integrated services digital network
- communication interface 518 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN (or WAN component to communicated with a WAN) .
- LAN local area network
- Wireless links may also be implemented.
- communication interface 518 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
- the computer system 500 can send messages and receive data, including program code, through the network (s) , network link and communication interface 518.
- a server might transmit a requested code for an application program through the Internet, the ISP, the local network and the communication interface 518.
- the received code may be executed by processor 504 as it is received, and/or stored in storage device 510, or other non-volatile storage for later execution.
- processors may be temporarily configured (e.g., by software) or permanently configured to perform the relevant operations.
- processors may constitute processor-implemented engines that operate to perform one or more operations or functions described herein.
- the methods described herein may be at least partially processor-implemented, with a particular processor or processors being an example of hardware.
- a particular processor or processors being an example of hardware.
- the operations of a method may be performed by one or more processors or processor-implemented engines.
- the one or more processors may also operate to support performance of the relevant operations in a “cloud computing” environment or as a “software as a service” (SaaS) .
- SaaS software as a service
- at least some of the operations may be performed by a group of computers (as examples of machines including processors) , with these operations being accessible via a network (e.g., the Internet) and via one or more appropriate interfaces (e.g., an Application Program Interface (API) ) .
- API Application Program Interface
- processors or processor-implemented engines may be located in a single geographic location (e.g., within a home environment, an office environment, or a server farm) . In other example embodiments, the processors or processor-implemented engines may be distributed across a number of geographic locations.
- the term “or” may be construed in either an inclusive or exclusive sense. Moreover, plural instances may be provided for resources, operations, or structures described herein as a single instance. Additionally, boundaries between various resources, operations, engines, and data stores are somewhat arbitrary, and particular operations are illustrated in a context of specific illustrative configurations. Other allocations of functionality are envisioned and may fall within a scope of various embodiments of the present disclosure. In general, structures and functionality presented as separate resources in the example configurations may be implemented as a combined structure or resource. Similarly, structures and functionality presented as a single resource may be implemented as separate resources. These and other variations, modifications, additions, and improvements fall within a scope of embodiments of the present disclosure as represented by the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.
- Conditional language such as, among others, “can, ” “could, ” “might, ” or “may, ” unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without user input or prompting, whether these features, elements and/or steps are included or are to be performed in any particular embodiment.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Computational Linguistics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Medical Informatics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Machine Translation (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (20)
- A computing system for group tagging, comprising:one or more processors accessible to platform data, wherein the platform data comprises a plurality of users and a plurality of associated data fields; anda memory storing instructions that, when executed by the one or more processors, cause the computing system to perform:obtaining a first subset of users and one or more first tags associated with the first subset of users;determining, respectively for one or more of the associated data fields, at least a difference between the first subset of users and at least a part of the plurality of users;in response to determining the difference exceeding a first threshold, determining the corresponding data field as a key data field;determining data of the corresponding one or more key data fields associated with the first subset of users as positive samples;obtaining, based on the one or more key data fields, a second subset of users and associated data from the platform data as negative samples; andtraining a rule model with the positive and negative samples to obtain a trained group tagging rule model.
- The system of claim 1, wherein:the platform data comprises tabular data corresponding to each of the plurality of users; andthe data fields comprises at least one of data dimension or data metric.
- The system of claim 1, wherein:the plurality of users are users of the platform;the platform is a vehicle information platform; andthe data fields comprise at least one of a location, a number of uses, a transaction amount, or a number of complaints.
- The system of claim 1, wherein obtaining a first subset of users comprises receiving identifications of the first subset of users from one or more analysts without full access to the platform data.
- The system of claim 1, wherein the platform data does not comprise the first tags before obtaining the first subset of users.
- The system of claim 1, wherein the difference is a Kullback-Leibler divergence.
- The system of claim 1, wherein the second subset of users are different from the first subset of users over a third threshold based on a similarity measurement with respect to the one or more key data fields.
- The system of claim 1, wherein the rule model is a decision tree model.
- The system of claim 1, wherein the trained group tagging rule model determines whether to assign one or more of the plurality of users the first tags.
- The system of claim 1, wherein the instruction cause to system to further perform:applying the trained group tagging rule model to tag the plurality of users and new users added to the plurality of users.
- A group tagging method, comprising:obtaining a first subset of users from a plurality of users and one or more first tags associated with the first subset of users, wherein the plurality of users and a plurality of associated data fields are a part of platform data;determining, respectively for one or more of the associated data fields, at least a difference between the first subset of users and at least a part of the plurality of users;in response to determining the difference exceeding a first threshold, determining the corresponding data field as a key data field;determining data of the corresponding one or more key data fields associated with the first subset of users as positive samples;obtaining, based on the one or more key data fields, a second subset of users and associated data from the platform data as negative samples; andtraining a rule model with the positive and negative samples to obtain a trained group tagging rule model.
- The method of claim 11, wherein:the platform data comprises tabular data corresponding to each of the plurality of users; andthe data fields comprises at least one of data dimension or data metric.
- The method of claim 11, wherein:the plurality of users are users of the platform;the platform is a vehicle information platform; andthe data fields comprise at least one of a location, a number of uses, a transaction amount, or a number of complaints.
- The method of claim 11, wherein obtaining a first subset of users comprises receiving identifications of the first subset of users from one or more analysts without full access to the platform data.
- The method of claim 11, wherein the platform data does not comprise the first tags before obtaining the first subset of users.
- The method of claim 11, wherein the difference is a Kullback-Leibler divergence.
- The method of claim 11, wherein the second subset of users are different from the first subset of users over a third threshold based on a similarity measurement with respect to the one or more key data fields.
- The method of claim 11, wherein the rule model is a decision tree model.
- The method of claim 11, wherein further comprising:applying the trained group tagging rule model to tag the plurality of users and new users added to the plurality of users.
- A group tagging method, comprising:obtaining a first subset of a plurality of entities of a platform, wherein the first subset of entities are tagged with first tags, and platform data comprises data of the plurality of entities with respect to a one or more data fields;determining at least a difference between data of one or more data fields of the first subset of entities and that of some other entities of the plurality of entities;in response to determining the difference exceeding a first threshold, obtaining corresponding data associated with the first subset of entities as positive samples, and corresponding data associated with a second subset of the plurality of entities as negative samples; andtraining a rule model with the positive and negative samples to obtain a trained group tagging rule model, wherein the trained group tagging rule model determines if an existing or new entity is entitled to the first tag.
Priority Applications (13)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010790992.8A CN111931845B (en) | 2017-04-20 | 2017-04-20 | System and method for determining user group similarity |
SG11201811624QA SG11201811624QA (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
KR1020187038157A KR102227593B1 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
BR112018077404A BR112018077404A8 (en) | 2017-04-20 | 2017-04-20 | LEARNING-BASED GROUP MARKING SYSTEM AND METHOD |
CA3029428A CA3029428A1 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
CN201780051176.1A CN109690571B (en) | 2017-04-20 | 2017-04-20 | Learning-based group tagging system and method |
JP2018569002A JP2019528506A (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
PCT/CN2017/081279 WO2018191918A1 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
AU2017410367A AU2017410367B2 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
EP17906489.4A EP3461287A4 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
TW107113535A TW201843609A (en) | 2017-04-20 | 2018-04-20 | System and method for learning-based group tagging |
US15/979,556 US20180307720A1 (en) | 2017-04-20 | 2018-05-15 | System and method for learning-based group tagging |
PH12018550213A PH12018550213A1 (en) | 2017-04-20 | 2018-12-26 | System and method for learning-based group tagging |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2017/081279 WO2018191918A1 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/979,556 Continuation US20180307720A1 (en) | 2017-04-20 | 2018-05-15 | System and method for learning-based group tagging |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018191918A1 true WO2018191918A1 (en) | 2018-10-25 |
Family
ID=63853929
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/081279 WO2018191918A1 (en) | 2017-04-20 | 2017-04-20 | System and method for learning-based group tagging |
Country Status (12)
Country | Link |
---|---|
US (1) | US20180307720A1 (en) |
EP (1) | EP3461287A4 (en) |
JP (1) | JP2019528506A (en) |
KR (1) | KR102227593B1 (en) |
CN (2) | CN111931845B (en) |
AU (1) | AU2017410367B2 (en) |
BR (1) | BR112018077404A8 (en) |
CA (1) | CA3029428A1 (en) |
PH (1) | PH12018550213A1 (en) |
SG (1) | SG11201811624QA (en) |
TW (1) | TW201843609A (en) |
WO (1) | WO2018191918A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP6499372B1 (en) * | 2017-07-31 | 2019-04-10 | 楽天株式会社 | PROCESSING SYSTEM, PROCESSING DEVICE, PROCESSING METHOD, PROGRAM, AND INFORMATION RECORDING MEDIUM |
US11354351B2 (en) * | 2019-01-31 | 2022-06-07 | Chooch Intelligence Technologies Co. | Contextually generated perceptions |
CN114430489A (en) * | 2020-10-29 | 2022-05-03 | 武汉斗鱼网络科技有限公司 | Virtual prop compensation method and related equipment |
CN112559900B (en) * | 2021-02-26 | 2021-06-04 | 深圳索信达数据技术有限公司 | Product recommendation method and device, computer equipment and storage medium |
CN115604027B (en) * | 2022-11-28 | 2023-03-14 | 中南大学 | Network fingerprint identification model training method, identification method, equipment and storage medium |
CN115859118B (en) * | 2022-12-23 | 2023-08-11 | 摩尔线程智能科技(北京)有限责任公司 | Data acquisition method and device, electronic equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104504460A (en) * | 2014-12-09 | 2015-04-08 | 北京嘀嘀无限科技发展有限公司 | Method and device for predicating user loss of car calling platform |
CN105354343A (en) * | 2015-12-24 | 2016-02-24 | 成都陌云科技有限公司 | User characteristic mining method based on remote dialogue |
CN105488697A (en) * | 2015-12-09 | 2016-04-13 | 焦点科技股份有限公司 | Potential customer mining method based on customer behavior characteristics |
CN105608194A (en) * | 2015-12-24 | 2016-05-25 | 成都陌云科技有限公司 | Method for analyzing main characteristics in social media |
CN105631749A (en) * | 2015-12-24 | 2016-06-01 | 成都陌云科技有限公司 | User portrait calculation method based on statistical data |
JP2016181040A (en) * | 2015-03-23 | 2016-10-13 | 日本電信電話株式会社 | Data analyzer, method and program |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6963870B2 (en) * | 2002-05-14 | 2005-11-08 | Microsoft Corporation | System and method for processing a large data set using a prediction model having a feature selection capability |
US20090077081A1 (en) * | 2007-09-19 | 2009-03-19 | Joydeep Sen Sarma | Attribute-Based Item Similarity Using Collaborative Filtering Techniques |
JP2009157606A (en) | 2007-12-26 | 2009-07-16 | Toyota Central R&D Labs Inc | Driver status estimation device and program |
JP5342606B2 (en) * | 2011-06-27 | 2013-11-13 | 株式会社日立ハイテクノロジーズ | Defect classification method and apparatus |
US20140309876A1 (en) * | 2013-04-15 | 2014-10-16 | Flextronics Ap, Llc | Universal vehicle voice command system |
US9053185B1 (en) * | 2012-04-30 | 2015-06-09 | Google Inc. | Generating a representative model for a plurality of models identified by similar feature data |
DE202013100073U1 (en) * | 2012-12-21 | 2014-04-01 | Xerox Corp. | User profiling to estimate the printing performance |
CN104111946B (en) * | 2013-04-19 | 2018-08-07 | 腾讯科技(深圳)有限公司 | Clustering method based on user interest and device |
US9870465B1 (en) * | 2013-12-04 | 2018-01-16 | Plentyoffish Media Ulc | Apparatus, method and article to facilitate automatic detection and removal of fraudulent user information in a network environment |
CN104090888B (en) * | 2013-12-10 | 2016-05-11 | 深圳市腾讯计算机系统有限公司 | A kind of analytical method of user behavior data and device |
JP2015184823A (en) * | 2014-03-20 | 2015-10-22 | 株式会社東芝 | Model parameter calculation device, model parameter calculation method, and computer program |
US10193775B2 (en) * | 2014-10-09 | 2019-01-29 | Splunk Inc. | Automatic event group action interface |
US9558344B2 (en) * | 2015-03-18 | 2017-01-31 | International Business Machines Corporation | Proximity based authentication for striped data |
US10037506B2 (en) * | 2015-04-27 | 2018-07-31 | Xero Limited | Benchmarking through data mining |
US10097973B2 (en) * | 2015-05-27 | 2018-10-09 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
CN106250382A (en) * | 2016-01-28 | 2016-12-21 | 新博卓畅技术(北京)有限公司 | A kind of metadata management automotive engine system and implementation method |
CN105959745B (en) * | 2016-05-25 | 2019-10-22 | 北京铭嘉实咨询有限公司 | Advertisement placement method and system |
JP6632476B2 (en) * | 2016-06-16 | 2020-01-22 | 株式会社Zmp | Network system |
CN106296343A (en) * | 2016-08-01 | 2017-01-04 | 王四春 | A kind of e-commerce transaction monitoring method based on the Internet and big data |
CN106296305A (en) * | 2016-08-23 | 2017-01-04 | 上海海事大学 | Electric business website real-time recommendation System and method under big data environment |
US20180157663A1 (en) * | 2016-12-06 | 2018-06-07 | Facebook, Inc. | Systems and methods for user clustering |
-
2017
- 2017-04-20 AU AU2017410367A patent/AU2017410367B2/en active Active
- 2017-04-20 JP JP2018569002A patent/JP2019528506A/en active Pending
- 2017-04-20 CN CN202010790992.8A patent/CN111931845B/en active Active
- 2017-04-20 CA CA3029428A patent/CA3029428A1/en not_active Abandoned
- 2017-04-20 KR KR1020187038157A patent/KR102227593B1/en active IP Right Grant
- 2017-04-20 CN CN201780051176.1A patent/CN109690571B/en active Active
- 2017-04-20 EP EP17906489.4A patent/EP3461287A4/en not_active Withdrawn
- 2017-04-20 SG SG11201811624QA patent/SG11201811624QA/en unknown
- 2017-04-20 WO PCT/CN2017/081279 patent/WO2018191918A1/en unknown
- 2017-04-20 BR BR112018077404A patent/BR112018077404A8/en not_active Application Discontinuation
-
2018
- 2018-04-20 TW TW107113535A patent/TW201843609A/en unknown
- 2018-05-15 US US15/979,556 patent/US20180307720A1/en not_active Abandoned
- 2018-12-26 PH PH12018550213A patent/PH12018550213A1/en unknown
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104504460A (en) * | 2014-12-09 | 2015-04-08 | 北京嘀嘀无限科技发展有限公司 | Method and device for predicating user loss of car calling platform |
JP2016181040A (en) * | 2015-03-23 | 2016-10-13 | 日本電信電話株式会社 | Data analyzer, method and program |
CN105488697A (en) * | 2015-12-09 | 2016-04-13 | 焦点科技股份有限公司 | Potential customer mining method based on customer behavior characteristics |
CN105354343A (en) * | 2015-12-24 | 2016-02-24 | 成都陌云科技有限公司 | User characteristic mining method based on remote dialogue |
CN105608194A (en) * | 2015-12-24 | 2016-05-25 | 成都陌云科技有限公司 | Method for analyzing main characteristics in social media |
CN105631749A (en) * | 2015-12-24 | 2016-06-01 | 成都陌云科技有限公司 | User portrait calculation method based on statistical data |
Non-Patent Citations (1)
Title |
---|
See also references of EP3461287A4 * |
Also Published As
Publication number | Publication date |
---|---|
AU2017410367A1 (en) | 2019-01-31 |
CA3029428A1 (en) | 2018-10-25 |
CN111931845A (en) | 2020-11-13 |
KR20190015410A (en) | 2019-02-13 |
EP3461287A4 (en) | 2019-05-01 |
CN111931845B (en) | 2024-06-21 |
AU2017410367B2 (en) | 2020-09-10 |
EP3461287A1 (en) | 2019-04-03 |
CN109690571B (en) | 2020-09-18 |
PH12018550213A1 (en) | 2019-10-28 |
JP2019528506A (en) | 2019-10-10 |
KR102227593B1 (en) | 2021-03-15 |
BR112018077404A8 (en) | 2023-01-31 |
BR112018077404A2 (en) | 2019-04-09 |
CN109690571A (en) | 2019-04-26 |
SG11201811624QA (en) | 2019-01-30 |
TW201843609A (en) | 2018-12-16 |
US20180307720A1 (en) | 2018-10-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2017410367B2 (en) | System and method for learning-based group tagging | |
WO2022126971A1 (en) | Density-based text clustering method and apparatus, device, and storage medium | |
US11526799B2 (en) | Identification and application of hyperparameters for machine learning | |
US11074434B2 (en) | Detection of near-duplicate images in profiles for detection of fake-profile accounts | |
US11562002B2 (en) | Enabling advanced analytics with large data sets | |
KR102079860B1 (en) | Text address processing method and device | |
US8321358B2 (en) | Interpersonal relationships analysis system and method which computes distances between people in an image | |
WO2017215370A1 (en) | Method and apparatus for constructing decision model, computer device and storage device | |
US11282035B2 (en) | Process orchestration | |
US20160378868A1 (en) | System and method for large scale crowdsourcing of map data cleanup and correction | |
WO2016015444A1 (en) | Target user determination method, device and network server | |
US11570214B2 (en) | Crowdsourced innovation laboratory and process implementation system | |
JP2017515184A (en) | Determining temporary transaction limits | |
CN111046237B (en) | User behavior data processing method and device, electronic equipment and readable medium | |
CN104077723B (en) | A kind of social networks commending system and method | |
US20150278403A1 (en) | Methods and systems for modeling crowdsourcing platform | |
US9754015B2 (en) | Feature rich view of an entity subgraph | |
WO2018120726A1 (en) | Data mining based modeling method, system, electronic device and storage medium | |
CN112470172B (en) | Computational efficiency of symbol sequence analysis using random sequence embedding | |
JP5903376B2 (en) | Information recommendation device, information recommendation method, and information recommendation program | |
US20140244521A1 (en) | Systems and methods for legal data processing | |
Banushkina et al. | Fep1d: a script for the analysis of reaction coordinates | |
US10521415B2 (en) | Method and system for providing weighted evaluation | |
US20200272852A1 (en) | Clustering | |
CN117078411A (en) | Method, device, equipment and storage medium for determining fund flow path |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17906489 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 20187038157 Country of ref document: KR Kind code of ref document: A Ref document number: 2018569002 Country of ref document: JP Kind code of ref document: A Ref document number: 3029428 Country of ref document: CA |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112018077404 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 2017906489 Country of ref document: EP Effective date: 20181226 |
|
ENP | Entry into the national phase |
Ref document number: 2017410367 Country of ref document: AU Date of ref document: 20170420 Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 112018077404 Country of ref document: BR Kind code of ref document: A2 Effective date: 20181228 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |