WO2020135420A1 - 对用户进行分类的方法和装置 - Google Patents
对用户进行分类的方法和装置 Download PDFInfo
- Publication number
- WO2020135420A1 WO2020135420A1 PCT/CN2019/127930 CN2019127930W WO2020135420A1 WO 2020135420 A1 WO2020135420 A1 WO 2020135420A1 CN 2019127930 W CN2019127930 W CN 2019127930W WO 2020135420 A1 WO2020135420 A1 WO 2020135420A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- interactive
- target user
- record data
- behavior record
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Definitions
- the present disclosure relates to the technical field of data analysis, and in particular to a method and device for classifying users.
- each user can be surveyed in the form of a questionnaire, and users can answer the questions in the questionnaire one by one. Relevant personnel will collect the completed questionnaires for recycling, and then classify users based on the answers to the questions in the questionnaire. Users belonging to the same classification category have similar attributes. Furthermore, in the future, things that meet the attributes of different users can be recommended for different users in a targeted manner.
- a method for classifying users including:
- the determining user characteristic data corresponding to the target user based on the target user's behavior record data and the other user's behavior record data includes:
- user characteristic data corresponding to the target user is determined.
- the determining user characteristic data corresponding to the target user based on the user relationship network includes:
- the determining user characteristic data corresponding to the target user based on the user relationship network includes:
- the determining user characteristic data corresponding to the target user based on the target user's behavior record data and the other user's behavior record data includes:
- the behavior record data of the target user For each interactive user of the at least one interactive user, in the behavior record data of the target user, obtain behavior record data corresponding to the interaction behavior of the target user and the interactive user, and the target user's Non-interactive behavior record data, and from the interactive user behavior record data, obtain the behavior record data corresponding to the interactive behavior of the interactive user and the target user, and the non-interactive behavior record data of the interactive user, as Related action record data between the target user and the interactive user, based on the related action record data, determining interactive feature data corresponding to the interactive user;
- the average value of the interactive feature data corresponding to all interactive users is determined as the user feature data corresponding to the target user.
- the determining user characteristic data corresponding to the target user based on the target user's behavior record data and the other user's behavior record data includes:
- the intermediate user feature data corresponding to the target user Based on the intermediate user feature data corresponding to the target user, the intermediate user feature data corresponding to each interactive user, and the feature weights corresponding to the target user and each interactive user, determine the weighted average of each intermediate user feature data As user characteristic data corresponding to the target user.
- an apparatus for classifying users includes:
- the acquisition module is used to acquire the behavior record data of the target user and the behavior record data of other users associated with the target user;
- a determining module configured to determine user characteristic data corresponding to the target user based on the target user's behavior record data and the other user's behavior record data;
- the classification module is used to input user feature data corresponding to the target user into the user classification network model to obtain user classification information corresponding to the target user.
- the determination module is configured to:
- user characteristic data corresponding to the target user is determined.
- the determination module is configured to:
- the determination module is configured to:
- the determination module is configured to:
- the behavior record data of the target user For each interactive user of the at least one interactive user, in the behavior record data of the target user, obtain behavior record data corresponding to the interaction behavior of the target user and the interactive user, and the target user's Non-interactive behavior record data, and from the interactive user behavior record data, obtain the behavior record data corresponding to the interactive behavior of the interactive user and the target user, and the non-interactive behavior record data of the interactive user, as Related action record data between the target user and the interactive user, based on the related action record data, determining interactive feature data corresponding to the interactive user;
- the average value of the interactive feature data corresponding to all interactive users is determined as the user feature data corresponding to the target user.
- the determination module is configured to:
- a computer device Based on the intermediate user feature data corresponding to the target user, the intermediate user feature data corresponding to each interactive user, and the feature weights corresponding to the target user and each interactive user, determine the weighted average of each intermediate user feature data As user characteristic data corresponding to the target user.
- the computer device includes a processor, a communication interface, a memory, and a communication bus, where:
- the processor, the communication interface, and the memory complete communication with each other through the communication bus;
- the memory is used to store computer programs
- the processor is used to execute the program stored on the memory to implement the above method for classifying users.
- a computer-readable storage medium in which a computer program is stored, and when the computer program is executed by a processor, the above method for classifying users is implemented.
- the behavior record data of the target user and the behavior record data of other users associated with the target user can be obtained and the behavior record data can be extracted
- the user's characteristic data can be used to determine the user classification information corresponding to the target user based on the user's characteristic data and the user classification network model.
- Fig. 1 is a schematic flowchart of a method for classifying users according to an exemplary embodiment
- Fig. 2 is a schematic flowchart of a method for classifying users according to an exemplary embodiment
- Fig. 3 is a schematic structural diagram of an apparatus for classifying users according to an exemplary embodiment
- Fig. 4 is a schematic structural diagram of a computer device according to an exemplary embodiment.
- a piece of behavior record data may be generated accordingly. For example, when user A makes a call to user B, for the user corresponding to user A, a record of the behavior of making a call to user B at a certain time will be generated. Or, when user A makes a purchase online, for the user corresponding to user A, a piece of behavior record data of a certain shopping site spending a certain amount of money on a certain day will be generated. Or alternatively, user A and user C watch a movie together. For the user corresponding to user A, a record of behavior data of a certain movie theater and someone watching a certain movie at a certain time will be generated.
- certain attributes of the user are often reflected in his personal behavior or interaction with other people.
- the occupation of user A is a teacher, and user A often uses the phone number corresponding to user A to call user B who is also a teacher to communicate, etc.
- user A makes an online purchase, he often uses the user A's corresponding
- the shopping account buys some books, stationery and other supplies; and, user A often purchases two or more literary movie tickets through the ticket purchasing account corresponding to user A.
- the user A can be classified according to the personal behavior of the user A or the behavior generated by interacting with other people, for example, determining that the user A belongs to a teacher, a policeman or a doctor.
- An exemplary embodiment of the present disclosure provides a method for classifying users. As shown in FIG. 1, the processing flow of the method may include the following steps:
- Step S110 Obtain the behavior record data of the target user and the behavior record data of other users associated with the target user.
- a behavior record database may be established, and a large number of users' behavior record data are stored in the behavior record database. These user behavior record data can be obtained from the network, or can be collected through a dedicated device.
- element information such as event information, time when the event occurred, and location where the event occurred can be recorded. If the event is a personal event, such as taking a vehicle, other users may not be recorded in the behavior record data. If the event is an interaction type event, such as user A calling user B, user B can be recorded in the behavior record data.
- Step S120 Determine user characteristic data corresponding to the target user based on the target user's behavior record data and other user's behavior record data.
- the user characteristic data corresponding to the target user may be determined based on the target user's behavior record data and other user's behavior record data.
- a user relationship network can be constructed based on the target user's behavior record data and other user's behavior record data, and based on the user relationship network, user feature data corresponding to the target user can be extracted.
- the user relationship network may be a homogeneous relationship network or a heterogeneous relationship network. Specifically, you can choose to build a homogenous relationship network or a heterogeneous relationship network according to the application scenario of classification processing and the data situation of the behavior record data.
- Step S130 Input user feature data corresponding to the target user into the user classification network model to obtain user classification information corresponding to the target user.
- the user classification network model may be trained in advance, and the user classification network model may be a classifier, which may include a deep classification network model and a machine learning model.
- the user classification network model may be a classifier, which may include a deep classification network model and a machine learning model.
- the user feature data corresponding to the target user may include different types of feature data.
- four different types of user feature data corresponding to the target user are introduced.
- the user feature data corresponding to the four different types of target users may include basic feature data, spatio-temporal feature data, fusion feature data, and community feature data.
- Step S120 may include: determining the interaction relationship between the target user and other users based on the target user's behavior record data and other user's behavior record data; establishing a user relationship network based on the user's interaction relationship; based on the user Relationship network to determine the user characteristic data corresponding to the target user.
- At least one user group having an interactive relationship may be determined based on the target user's behavior record data and other user's behavior record data; the two users included in the at least one user group are determined to be two network nodes, and Connect two network nodes belonging to the same user group to establish a user relationship network; based on the user relationship network, determine the user characteristic data corresponding to the target user.
- At least one user group having an interactive relationship may be determined. For example, user A makes a call to user B, user A makes a call to user C, user B makes a call to user D, etc.
- User A and user B have an interactive relationship, and they can be regarded as a user group 01, and so on, user A
- There is an interactive relationship with user C they can be used as a user group 02
- there is an interactive relationship between user B and user D they can be used as a user group 03.
- two users included in at least one user group may be determined as two network nodes, and two network nodes belonging to the same user group may be connected to obtain a user relationship network.
- user characteristic data corresponding to the target user can be determined based on the user relationship network.
- the user characteristic data includes network structure characteristic data
- the step of determining the user characteristic data corresponding to the target user based on the user relationship network may include: determining the network structure characteristic data of the user relationship network as the user characteristic data corresponding to the target user.
- statistical calculation may be performed on the edges between the network nodes in the user relationship network and the network nodes with connection relationships to obtain the network structure characteristic data of the user relationship network.
- the network structure characteristic data such as the first-order degree, second-order degree, cluster coefficient, neighbor coefficient, degree centrality, intermediary, in-degree centrality, out-degree centrality, embedding (embedding) characteristics of the user relationship network can be determined.
- the network structure characteristic data corresponding to point A in the user relationship network is taken as an example for description.
- the network structure characteristic data corresponding to other network nodes in the user relationship network is determined in a similar way to point A, and the network structure corresponding to other network nodes
- the determination method of the feature data please refer to the determination method of the network structure feature data corresponding to point A.
- Point A may be used as the network node corresponding to the target user described in the method provided in the embodiments of the present disclosure.
- the first-order degree may be the number of first-order neighbor network nodes directly connected to point A. For example, there are 4 first-order neighbor network nodes at point A, and the first-order degree of point A is 4.
- the second-order degree may be the number of neighbor network nodes directly connected to the first-order neighbor network nodes at point A.
- the cluster coefficient can be calculated by 2M/K(K-1). Where K is the first-order degree of point A, and M is the number of edges between each first-order neighbor network node of point A.
- the neighbor coefficient may be the number of network nodes that have a connection relationship between the first-order neighbor network nodes at point A.
- Degree centrality can be determined in the following way. First, determine the difference between the number of edges between point A and each first-order neighbor network node and 1, and then determine the quotient of the first-order degree of point A divided by the difference as the degree of centrality.
- In-degree centrality can be determined in the following manner.
- the edges in the user relationship network may have directions, and the in-degree centrality of point A may be the number of edges entering point A.
- Outgoing degree centrality can be determined in the following way.
- the edges in the user relationship network can have directions, and the out-degree centrality of point A can be the number of edges starting from point A.
- the network structure characteristic data corresponding to other network nodes in the user relationship network is determined in a similar way to point A.
- the determination method of network structure characteristic data corresponding to other network nodes please refer to the determination method of the network structure characteristic data corresponding to point A, here No longer.
- the first-order degree, second-order degree, cluster coefficient, neighbor coefficient, degree centrality, intermediary, in-degree centrality, out-degree centrality, embedding features, etc. in the above 1) to 9) can be used as the network structure of the user relationship network
- Feature data is user feature data corresponding to the target user.
- Step S120 may include: determining at least one interactive user that has an interactive relationship with the target user based on the target user's behavior record data and other user's behavior record data; for each interactive user in the at least one interactive user, the target Obtain the behavior record data corresponding to the interactive behavior of the target user and the interactive user and the non-interactive behavior record data of the target user from the user's behavior record data, and obtain the interaction between the interactive user and the target user in the interactive user's behavior record data The behavior record data corresponding to the behavior and the non-interactive behavior record data of the interactive user are used as the related behavior record data between the target user and the interactive user. Based on the related behavior record data, the interactive feature data corresponding to the interactive user are determined; all interactive users The average value of the corresponding interactive feature data is determined as the user feature data corresponding to the target user.
- At least one interactive user having an interactive relationship with the target user may be determined based on the target user's behavior record data and other user's behavior record data.
- the target user's non-interactive behavior record data and the interactive user's non-interactive behavior record data can be determined from the target user's behavior record data and other user's behavior record data , Behavior record data corresponding to the interactive behavior of the target user and the interactive user, and behavior record data corresponding to the interactive behavior of the interactive user and the target user.
- A place, time, event
- A, B place, time, event
- B place, time, event
- A, B place, time, event
- a and B are two different users, A is the target user, and B is the interactive user.
- a (location, time, event) may be the target user's non-interactive behavior record data.
- a and B (location, time, event) may be behavior record data corresponding to the interaction behavior of the target user and the interactive user, or behavior record data corresponding to the interaction behavior of the interactive user and the target user.
- B (location, time, event) can be the interactive user's non-interactive behavior record data.
- the related behavior record data between the target user and the interactive user can be sorted in chronological order to obtain the behavior sequence between the target user and the interactive user. For example, A (place, time, event)-"A, B (place, time, event)-"B (place, time, event)-"A, B (place, time, event).
- the associated user characteristic data between the target user and the interactive user may be determined.
- the characteristics of the behavior sequence between the target user and the interactive user can be extracted from the three dimensions of time, space (location) and event, as the associated user characteristic data between the target user and the interactive user. Specifically, it can be statistically calculated the time interval between each element in the sequence, the time span between the first element and the last element in the sequence, the number of occurrences of different locations in the sequence, and the number of occurrences of different events in the sequence.
- three-dimensional intersection features can be constructed.
- the above method may be repeatedly performed until the associated user characteristic data between the target user and all interactive users are determined.
- A, B feature 1, feature 2, feature 3
- A, C feature 1, feature 2, feature 3
- A is the target user
- B and C are two interactive users.
- embedding feature extraction algorithms such as LSTM (Long Short-Term Memory, Long-Short-Term Memory Network) and RNN (Recurrent Neural Network) can be used to extract the embedding of the behavior sequence between the target user and the interactive user feature.
- LSTM Long Short-Term Memory, Long-Short-Term Memory Network
- RNN Recurrent Neural Network
- the sum value of the associated user feature data between the target user and all interactive users can be determined, and the quotient of the sum value divided by the number of interactive users can be determined as the user feature data corresponding to the target user.
- the user characteristic data corresponding to the target user is calculated by formula 1.
- Fea mean (A) is the user characteristic data corresponding to the target user A.
- Fea(A, B), Fea(A, C), Fea(A, F), Fea(A, G) are the related user characteristic data between the target user A and the interactive users B, C, F, G, etc. .
- n is the number of interactive users.
- Step S120 may include: determining at least one interactive user having an interactive relationship with the target user based on the target user's behavior record data and other user's behavior record data; based on the target user's behavior record data, and at least one interactive user's Behavior record data to determine the interaction relationship between the target user and at least one interactive user; based on the interaction relationship between users, establish a user relationship network; based on the user relationship network and label propagation algorithm, determine the target user and each interactive user Corresponding feature weights; based on the target user's behavior record data and other user's behavior record data, determine the intermediate user feature data corresponding to the target user and the intermediate user feature data corresponding to each interactive user; based on the target user's corresponding intermediate The user characteristic data, the intermediate user characteristic data corresponding to each interactive user and the characteristic weight corresponding to the target user and each interactive user, and determine the weighted average of each intermediate user characteristic data as the user characteristic data corresponding to the target user.
- the basic feature data and the spatio-temporal feature data can be extracted by the two methods described above, and then the two different types of feature data can be stitched and combined to obtain the intermediate user feature data corresponding to the target user.
- the above method can be repeated, and intermediate user characteristic data corresponding to at least one interactive user can be obtained. Since some behaviors of the interactive user can also affect the behavior of the target user, some attributes of the interactive user are also possessed by the target user. Therefore, at least one interactive user who has influence on the behavior of the target user can correspond to the characteristics of the intermediate user
- the data and the intermediate user feature data corresponding to the target user are fused to determine the user feature data corresponding to the target user.
- At least one interactive user having an interactive relationship with the target user may be determined based on the target user's behavior record data and other user's behavior record data. Then, the target user and all users in at least one interactive user who have an interactive relationship may be connected to establish a user relationship network. Based on the user relationship network and the pageRank (tag propagation) algorithm, the feature weights corresponding to the target user and at least one interactive user are determined. The characteristic weights corresponding to the target user and at least one interactive user respectively indicate the importance of each user in the user relationship network. If any user and other users have more intensive interactions, then any one user is very large To a certain extent, the corresponding feature weight is relatively large.
- a product of intermediate user feature data corresponding to the target user and corresponding feature weights may be determined, and a product of intermediate user feature data corresponding to at least one interactive user and corresponding feature weights may be determined.
- the quotient of the sum of all products divided by the total number of target users and at least one interactive user can be determined as the user characteristic data corresponding to the target user.
- the sum of all products can also be directly determined as the user characteristic data corresponding to the target user.
- the fusion strategy can be adjusted adaptively to improve the accuracy of classifying the target users.
- fusion feature data can also be extracted in other ways. For example, through structure2vec (a data processing algorithm), GCN (Graph Convolutional Neural Network, graph convolutional neural network), GNN (Graph Neural Network, graph neural network), GeniePath (a graph neural network with an adaptive receiving path) Network) algorithm, etc., to fuse a variety of feature data to determine the user feature data corresponding to the target user.
- GCN Graph Convolutional Neural Network, graph convolutional neural network
- GNN Graph Neural Network, graph neural network
- GeniePath a graph neural network with an adaptive receiving path
- the following will introduce the method of fusing various feature data through structure2vec algorithm to determine the user feature data corresponding to the target user.
- the structure2vec algorithm can also correspond to two different fusion strategies.
- the first way can be achieved by formula 2.
- Formula 2 determines the user feature data corresponding to the target user by averaging the user feature data of at least one interactive user.
- ⁇ is a constant
- W (t) is the feature weight corresponding to each interactive user
- N(i) is the number of interactive users
- Formula 3 determines the user feature data corresponding to the target user by summing the user feature data of at least one interactive user.
- ⁇ is a constant
- W (t) is the feature weight corresponding to each interactive user
- N(i) is the number of interactive users
- Formula 4 determines the user characteristic data corresponding to the target user by normalizing the user characteristic data of at least one interactive user.
- ⁇ is a constant
- W (t) is the feature weight corresponding to each interactive user
- N(i) is the number of interactive users
- N(j) is the number of user feature data
- the step of determining the user characteristic data corresponding to the target user may include: dividing the user relationship network based on the community discovery algorithm to obtain multiple sub-user relationship networks; determining the target sub-user relationship network to which the target user belongs; obtaining the target The community attribute information corresponding to the sub-user relationship network is used as the user characteristic data corresponding to the target user.
- the user relationship network is a user relationship network containing a large number of users, and can be based on community discovery algorithms such as Louvain (a community discovery algorithm), MNMF (a community discovery algorithm), and ComE (a community discovery algorithm) Algorithms, etc., divide the user relationship network to obtain multiple sub-user relationship networks.
- the target sub-user relationship network to which the target user belongs can be determined.
- community attribute information corresponding to the target sub-user relationship network can be obtained as user characteristic data corresponding to the target user.
- the community attribute information may be information such as the characteristics of the community, the nature of the community, the label of the community, and the identification of the community.
- User feature data can include basic feature data, spatio-temporal feature data, fusion feature data, and community feature data. Through the above feature data, users can be described from multiple angles. When the angle of description is more, the data described is more sufficient. , The more the user characteristic data can reflect the characteristics of the user, and the classification of the user determined based on the user characteristic data is more accurate.
- the behavior record data of the target user and the behavior record data of other users associated with the target user can be obtained and the behavior record data can be extracted
- the user's characteristic data can be used to determine the user classification information corresponding to the target user based on the user's characteristic data and the user classification network model.
- an exemplary embodiment of the present disclosure provides a method for classifying users. As shown in FIG. 2, the processing flow of the method may include the following steps:
- Step S210 Analyze the behavior record data of the target user and the behavior record data of other users, and determine to construct a homogeneous relationship network or a heterogeneous relationship network.
- step S220 regardless of whether the homogeneous network or the heterogeneous network is constructed, four different types of user feature data such as basic feature data, spatio-temporal feature data, fusion feature data, and community feature data of the target user can be determined.
- Step S230 After determining four different types of user feature data of the target user, such as basic feature data, spatio-temporal feature data, fusion feature data and community feature data, these four different types of user feature data can be fused to obtain User characteristic data corresponding to the fused target user.
- Step S240 processing the user feature data corresponding to the fused target user through a feature engineering processing method, such as performing feature selection and feature extraction on the user feature data corresponding to the fused target user to retain useful user features Data, clearing out valuable user characteristic data.
- a feature engineering processing method such as performing feature selection and feature extraction on the user feature data corresponding to the fused target user to retain useful user features Data, clearing out valuable user characteristic data.
- step S250 it is judged by experience whether the deep classification network model or the machine learning model is used to classify the target user based on user feature data corresponding to the target user after feature engineering processing.
- step S260 user feature data corresponding to the target user after feature engineering processing may be input into the deep classification network model or the machine learning model to obtain user classification information corresponding to the target user.
- the value of the feature data is higher, and thus the accuracy of the determined user classification information can be improved.
- the behavior record data of the target user and the behavior record data of other users associated with the target user can be obtained and the behavior record data can be extracted
- the user's characteristic data can be used to determine the user classification information corresponding to the target user based on the user's characteristic data and the user classification network model.
- the apparatus includes:
- the obtaining module 210 is used to obtain the behavior record data of the target user and the behavior record data of other users associated with the target user;
- the determining module 220 is configured to determine user characteristic data corresponding to the target user based on the target user's behavior record data and the other user's behavior record data;
- the classification module 230 is configured to input user characteristic data corresponding to the target user into the user classification network model to obtain user classification information corresponding to the target user.
- the determination module 220 is configured to:
- user characteristic data corresponding to the target user is determined.
- the determination module 220 is configured to:
- the determination module 220 is configured to:
- the determination module 220 is configured to:
- the behavior record data of the target user For each interactive user of the at least one interactive user, in the behavior record data of the target user, obtain behavior record data corresponding to the interaction behavior of the target user and the interactive user, and the target user's Non-interactive behavior record data, and from the interactive user behavior record data, obtain the behavior record data corresponding to the interactive behavior of the interactive user and the target user, and the non-interactive behavior record data of the interactive user, as Related action record data between the target user and the interactive user, based on the related action record data, determining interactive feature data corresponding to the interactive user;
- the average value of the interactive feature data corresponding to all interactive users is determined as the user feature data corresponding to the target user.
- the determination module 220 is configured to:
- the intermediate user feature data corresponding to the target user Based on the intermediate user feature data corresponding to the target user, the intermediate user feature data corresponding to each interactive user, and the feature weights corresponding to the target user and each interactive user, determine the weighted average of each intermediate user feature data As user characteristic data corresponding to the target user.
- the behavior record data of the target user and the behavior record data of other users associated with the target user can be obtained and the behavior record data can be extracted
- the user's characteristic data can be used to determine the user classification information corresponding to the target user based on the user's characteristic data and the user classification network model.
- the device for classifying users provided in the above embodiments only uses the division of the above functional modules as an example for classifying users.
- the above functions can be allocated by different The functional module is completed, that is, the internal structure of the computer device is divided into different functional modules to complete all or part of the functions described above.
- the device for classifying users provided in the above embodiments belongs to the same concept as the method embodiment for classifying users. For the specific implementation process, see the method embodiments, and details are not described here.
- FIG. 4 shows a schematic structural diagram of a computer device 1900 provided by an exemplary embodiment of the present disclosure.
- the computer device 1900 may have a relatively large difference due to different configuration or performance, and may include one or more central processing units (CPU) 1910 and one or more memories 1920. Wherein, at least one instruction is stored in the memory 1920, and the at least one instruction is loaded and executed by the processor 1910 to implement the method for classifying users described in the foregoing embodiments.
- CPU central processing units
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
Claims (14)
- 一种对用户进行分类的方法,其特征在于,所述方法包括:获取目标用户的行为记录数据、以及与目标用户相关联的其他用户的行为记录数据;基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的用户特征数据;将所述目标用户对应的用户特征数据输入到用户分类网络模型中,得到所述目标用户对应的用户分类信息。
- 根据权利要求1所述的方法,其特征在于,所述基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的用户特征数据,包括:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户和所述其他用户中用户之间的互动关系;基于所述用户之间的互动关系,建立用户关系网络;基于所述用户关系网络,确定所述目标用户对应的用户特征数据。
- 根据权利要求2所述的方法,其特征在于,所述基于所述用户关系网络,确定所述目标用户对应的用户特征数据,包括:确定所述用户关系网络的网络结构特征数据,作为所述目标用户对应的用户特征数据。
- 根据权利要求2所述的方法,其特征在于,所述基于所述用户关系网络,确定所述目标用户对应的用户特征数据,包括:基于社区发现算法,对所述用户关系网络进行划分,得到多个子用户关系网络;确定所述目标用户所属的目标子用户关系网络;获取所述目标子用户关系网络对应的社区属性信息,作为所述目标用户对应的用户特征数据。
- 根据权利要求1所述的方法,其特征在于,所述基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的用户特征数据,包括:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定与所述目标用户之间存在互动关系的至少一个互动用户;对于所述至少一个互动用户中的每个互动用户,在所述目标用户的行为记录数据中,获取所述目标用户与所述互动用户的互动行为对应的行为记录数据,以及所述目标用户的非互动行为记录数据,并在所述互动用户的行为记录数据中,获取所述互动用户与所述目标用户的互动行为对应的行为记录数据,以及所述互动用户的非互动行为记录数据,作为所述目标用户和所述互动用户之间的关联行为记录数据,基于所述关联行为记录数据,确定所述互动用户对应的互动特征数据;将所有互动用户对应的互动特征数据的平均值,确定为所述目标用户对应的用户特征数据。
- 根据权利要求1所述的方法,其特征在于,所述基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的用户特征数据,包括:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定与所述目标用户之间存在互动关系的至少一个互动用户;基于所述目标用户的行为记录数据、以及所述至少一个互动用户的行为记录数据,确定所述目标用户和所述至少一个互动用户中用户之间的互动关系;基于所述用户之间的互动关系,建立用户关系网络;基于所述用户关系网络以及标签传播算法,确定所述目标用户以及每个互动用户分别对应的特征权重;基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的中间用户特征数据、以及每个互动用户对应的中间用户特征数据;基于所述目标用户对应的中间用户特征数据、所述每个互动用户对应的中间用户特征数据和所述目标用户以及每个互动用户分别对应的特征权重,确定各中间用户特征数据的加权平均值,作为所述目标用户对应的用户特征数据。
- 一种对用户进行分类的装置,其特征在于,所述装置包括:获取模块,用于获取目标用户的行为记录数据、以及与目标用户相关联的 其他用户的行为记录数据;确定模块,用于基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的用户特征数据;分类模块,用于将所述目标用户对应的用户特征数据输入到用户分类网络模型中,得到所述目标用户对应的用户分类信息。
- 根据权利要求7所述的装置,其特征在于,所述确定模块,用于:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户和所述其他用户中用户之间的互动关系;基于所述用户之间的互动关系,建立用户关系网络;基于所述用户关系网络,确定所述目标用户对应的用户特征数据。
- 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:确定所述用户关系网络的网络结构特征数据,作为所述目标用户对应的用户特征数据。
- 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:基于社区发现算法,对所述用户关系网络进行划分,得到多个子用户关系网络;确定所述目标用户所属的目标子用户关系网络;获取所述目标子用户关系网络对应的社区属性信息,作为所述目标用户对应的用户特征数据。
- 根据权利要求7所述的装置,其特征在于,所述确定模块,用于:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定与所述目标用户之间存在互动关系的至少一个互动用户;对于所述至少一个互动用户中的每个互动用户,在所述目标用户的行为记录数据中,获取所述目标用户与所述互动用户的互动行为对应的行为记录数据,以及所述目标用户的非互动行为记录数据,并在所述互动用户的行为记录数据中,获取所述互动用户与所述目标用户的互动行为对应的行为记录数据,以及所述互动用户的非互动行为记录数据,作为所述目标用户和所述互动用户之间的关联行为记录数据,基于所述关联行为记录数据,确定所述互动用户对应的互动特征数据;将所有互动用户对应的互动特征数据的平均值,确定为所述目标用户对应 的用户特征数据。
- 根据权利要求7所述的装置,其特征在于,所述确定模块,用于:基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定与所述目标用户之间存在互动关系的至少一个互动用户;基于所述目标用户的行为记录数据、以及所述至少一个互动用户的行为记录数据,确定所述目标用户和所述至少一个互动用户中用户之间的互动关系;基于所述用户之间的互动关系,建立用户关系网络;基于所述用户关系网络以及标签传播算法,确定所述目标用户以及每个互动用户分别对应的特征权重;基于所述目标用户的行为记录数据、以及所述其他用户的行为记录数据,确定所述目标用户对应的中间用户特征数据、以及每个互动用户对应的中间用户特征数据;基于所述目标用户对应的中间用户特征数据、所述每个互动用户对应的中间用户特征数据和所述目标用户以及每个互动用户分别对应的特征权重,确定各中间用户特征数据的加权平均值,作为所述目标用户对应的用户特征数据。
- 一种计算机设备,其特征在于,所述计算机设备包括处理器、通信接口、存储器和通信总线,其中:所述处理器、所述通信接口和所述存储器通过所述通信总线完成相互间的通信;所述存储器,用于存放计算机程序;所述处理器,用于执行所述存储器上所存放的程序,以实现权利要求1-6任一所述的方法步骤。
- 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质内存储有计算机程序,所述计算机程序被处理器执行时实现权利要求1-6任一所述的方法步骤。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811584209.1A CN111353001B (zh) | 2018-12-24 | 2018-12-24 | 对用户进行分类的方法和装置 |
CN201811584209.1 | 2018-12-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020135420A1 true WO2020135420A1 (zh) | 2020-07-02 |
Family
ID=71128441
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2019/127930 WO2020135420A1 (zh) | 2018-12-24 | 2019-12-24 | 对用户进行分类的方法和装置 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111353001B (zh) |
WO (1) | WO2020135420A1 (zh) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111522866B (zh) * | 2020-07-03 | 2020-11-03 | 支付宝(杭州)信息技术有限公司 | 一种可信子图的挖掘方法、装置以及设备 |
CN117574286B (zh) * | 2024-01-11 | 2024-05-24 | 阿里健康科技(杭州)有限公司 | 标签值的确定方法、装置、设备和存储介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102769781A (zh) * | 2012-07-17 | 2012-11-07 | 青岛海信传媒网络技术有限公司 | 推荐电视节目的方法及装置 |
WO2015069607A2 (en) * | 2013-11-08 | 2015-05-14 | Microsoft Technology Licensing, Llc | Hierarchical statistical model for behavior prediction and classification |
CN105045916A (zh) * | 2015-08-20 | 2015-11-11 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | 一种手机游戏推荐系统及其推荐方法 |
CN106934412A (zh) * | 2015-12-31 | 2017-07-07 | 中国科学院深圳先进技术研究院 | 一种用户行为分类方法及系统 |
CN108280757A (zh) * | 2017-02-13 | 2018-07-13 | 腾讯科技(深圳)有限公司 | 用户信用评估方法及装置 |
CN109063130A (zh) * | 2018-08-02 | 2018-12-21 | 广州友米科技有限公司 | 基于用户行为合并展示社交分享信息的方法 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009043020A2 (en) * | 2007-09-28 | 2009-04-02 | The Trustees Of Dartmouth College | System and method for injecting sensed presence into social networking applications |
EP2384476A4 (en) * | 2009-01-30 | 2013-01-23 | Cbs Interactive Inc | PERSONALIZATION MACHINE FOR CREATING A USER PROFILE |
CN103412930A (zh) * | 2013-08-17 | 2013-11-27 | 北京品友互动信息技术有限公司 | 一种互联网用户属性识别方法 |
CN103593393B (zh) * | 2013-09-13 | 2017-01-04 | 微梦创科网络科技(中国)有限公司 | 基于微博互动关系挖掘社交圈的方法及装置 |
CN103745002B (zh) * | 2014-01-24 | 2017-01-18 | 中国科学院信息工程研究所 | 一种基于行为特征与内容特征融合的水军识别方法及系统 |
CN104809132B (zh) * | 2014-01-27 | 2018-07-31 | 阿里巴巴集团控股有限公司 | 一种获取网络主体社交关系类型的方法及装置 |
CN104298719B (zh) * | 2014-09-23 | 2018-02-27 | 新浪网技术(中国)有限公司 | 基于社交行为进行用户的类别划分、广告投放方法和系统 |
CN104951544A (zh) * | 2015-06-19 | 2015-09-30 | 百度在线网络技术(北京)有限公司 | 用户数据处理方法、用户数据的提供方法和系统 |
US11200257B2 (en) * | 2016-04-25 | 2021-12-14 | Securboration, Inc. | Classifying social media users |
CN107368488A (zh) * | 2016-05-12 | 2017-11-21 | 阿里巴巴集团控股有限公司 | 一种确定用户行为偏好的方法、推荐信息的展示方法和装置 |
CN106548210B (zh) * | 2016-10-31 | 2021-02-05 | 腾讯科技(深圳)有限公司 | 基于机器学习模型训练的信贷用户分类方法及装置 |
CN108073933B (zh) * | 2016-11-08 | 2021-05-25 | 杭州海康威视数字技术股份有限公司 | 一种目标检测方法及装置 |
CN108055646B (zh) * | 2017-05-19 | 2021-02-05 | 上海掌门科技有限公司 | 一种用于推荐用户的方法与设备 |
CN108197224B (zh) * | 2017-12-28 | 2020-11-20 | 广州虎牙信息科技有限公司 | 用户群分类方法、存储介质以及终端 |
CN108804633B (zh) * | 2018-06-01 | 2021-10-08 | 腾讯科技(深圳)有限公司 | 基于行为语义知识网络的内容推荐方法 |
-
2018
- 2018-12-24 CN CN201811584209.1A patent/CN111353001B/zh active Active
-
2019
- 2019-12-24 WO PCT/CN2019/127930 patent/WO2020135420A1/zh active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102769781A (zh) * | 2012-07-17 | 2012-11-07 | 青岛海信传媒网络技术有限公司 | 推荐电视节目的方法及装置 |
WO2015069607A2 (en) * | 2013-11-08 | 2015-05-14 | Microsoft Technology Licensing, Llc | Hierarchical statistical model for behavior prediction and classification |
CN105045916A (zh) * | 2015-08-20 | 2015-11-11 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | 一种手机游戏推荐系统及其推荐方法 |
CN106934412A (zh) * | 2015-12-31 | 2017-07-07 | 中国科学院深圳先进技术研究院 | 一种用户行为分类方法及系统 |
CN108280757A (zh) * | 2017-02-13 | 2018-07-13 | 腾讯科技(深圳)有限公司 | 用户信用评估方法及装置 |
CN109063130A (zh) * | 2018-08-02 | 2018-12-21 | 广州友米科技有限公司 | 基于用户行为合并展示社交分享信息的方法 |
Also Published As
Publication number | Publication date |
---|---|
CN111353001A (zh) | 2020-06-30 |
CN111353001B (zh) | 2023-08-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111784455B (zh) | 一种物品推荐方法及推荐设备 | |
US10311106B2 (en) | Social graph visualization and user interface | |
WO2018041168A1 (zh) | 信息推送方法、存储介质和服务器 | |
CN102890696B (zh) | 基于社交网络的上下文排序 | |
WO2020048084A1 (zh) | 资源推荐方法、装置、计算机设备及计算机可读存储介质 | |
US8290818B1 (en) | System for recommending item bundles | |
US9959563B1 (en) | Recommendation generation for infrequently accessed items | |
US8285602B1 (en) | System for recommending item bundles | |
US20170206276A1 (en) | Large Scale Recommendation Engine Based on User Tastes | |
Li et al. | A multi-theoretical kernel-based approach to social network-based recommendation | |
US8832091B1 (en) | Graph-based semantic analysis of items | |
CN109255586B (zh) | 一种面向电子政务办事的在线个性化推荐方法 | |
CN111506820B (zh) | 推荐模型、方法、装置、设备及存储介质 | |
CN109903103B (zh) | 一种推荐物品的方法和装置 | |
US20180307733A1 (en) | User characteristic extraction method and apparatus, and storage medium | |
US20160314145A1 (en) | Using image recognition to locate resources | |
Gelli et al. | Beyond the product: Discovering image posts for brands in social media | |
WO2020135420A1 (zh) | 对用户进行分类的方法和装置 | |
Akhil et al. | A survey of recommender system types and its classification. | |
TW201903705A (zh) | 用於基於種子監督學習提供推薦的系統和方法 | |
CN112560105B (zh) | 保护多方数据隐私的联合建模方法及装置 | |
US10929905B2 (en) | Method, system and machine-readable medium for online task exchange | |
CN113327132A (zh) | 多媒体推荐方法、装置、设备及存储介质 | |
CN109299368B (zh) | 一种用于环境信息资源ai智能个性化推荐的方法及系统 | |
Johari et al. | The Hybrid Recommender System of the Indonesian Online Market Products using IMDb weight rating and TF-IDF |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19902647 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19902647 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19902647 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 020222) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19902647 Country of ref document: EP Kind code of ref document: A1 |