WO2019051962A1 - 社交平台用户的现实关系匹配方法、装置及可读存储介质 - Google Patents
社交平台用户的现实关系匹配方法、装置及可读存储介质 Download PDFInfo
- Publication number
- WO2019051962A1 WO2019051962A1 PCT/CN2017/109222 CN2017109222W WO2019051962A1 WO 2019051962 A1 WO2019051962 A1 WO 2019051962A1 CN 2017109222 W CN2017109222 W CN 2017109222W WO 2019051962 A1 WO2019051962 A1 WO 2019051962A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- model
- vector
- users
- text
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 34
- 239000013598 vector Substances 0.000 claims abstract description 199
- 230000003993 interaction Effects 0.000 claims abstract description 91
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 29
- 238000012545 processing Methods 0.000 claims abstract description 17
- 230000006870 function Effects 0.000 claims description 27
- 238000012549 training Methods 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims description 6
- 230000009466 transformation Effects 0.000 claims description 5
- 238000013507 mapping Methods 0.000 claims description 4
- 230000002452 interceptive effect Effects 0.000 abstract description 9
- 238000004458 analytical method Methods 0.000 description 9
- 238000003672 processing method Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 238000003062 neural network model Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000003012 network analysis Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 238000012954 risk control Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
- G06F16/353—Clustering; Classification into predefined classes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Definitions
- the present application relates to the field of computer network technologies, and in particular, to a real-world relationship matching method for a social platform user, a data processing device, and a computer-readable storage medium.
- the usual method is to measure the distance between nodes according to the network structure, or use the clustering method to find the cluster to divide the nodes, and calculate the similarity between users through different algorithms in the social topology network structure. To determine the relationship between users.
- the clustering method to find the cluster to divide the nodes, and calculate the similarity between users through different algorithms in the social topology network structure. To determine the relationship between users.
- due to the characteristics of social networks although many nodes are close to each other, they may be just online friends, and they have never met in offline or real life.
- the main purpose of the present application is to provide a real-life relationship matching method, a data processing device, and a computer-readable storage medium for an online social user, aiming at solving the technical problem of how to accurately calculate and identify an intimate relationship of an online user in real life.
- a real-life relationship matching method for a social platform user includes the following steps:
- the step of establishing a user vector model according to the account information and the TransE model of each of the users includes:
- the objective function of the user vector model includes:
- the h p is a weighted non-linear transformed output of the vector inputs v 1 and v 2 of the two user users of the user vector model
- the W 1 is the vector input v 1 of the user
- the W 2 is the weight of the vector input v 2 of the user.
- the step of acquiring text interaction information between users on the social platform, and establishing a text relationship prediction model according to text interaction information between the users and a convolutional neural network model includes:
- Obtaining text interaction information m between users on the social platform wherein the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h , t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t;
- the trained Word2Vec algorithm trains the low-dimensional vector established by each of the words, repeatedly extracts the words and calculates an implicit feature vector, obtains a maximum value of the implicit feature vector for each dimension, and The maximum value of the implicit feature vector is classified.
- the objective function of the text relationship prediction model includes:
- the output result of the user vector model and the output result of the text relationship prediction model are projected into the same space for linear joint output to obtain a realistic relationship prediction classification of the user on the social platform.
- the steps include:
- the preset linear joint algorithm function And outputting the output result h p of the user vector model and the output result of the text relationship prediction model to a preset space, and outputting a realistic relationship prediction classification of the user on the social platform; wherein To project the output result h p of the user vector model to the parameters of the preset space, A parameter for projecting an output result of the text relationship prediction model to the preset space.
- the present application also provides a data processing apparatus comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, the processor implementing the program to implement the steps:
- the step of establishing a user vector model according to the account information and the TransE model of each of the users includes:
- the objective function of the user vector model includes:
- the h p is a weighted non-linear transformed output of the vector inputs v 1 and v 2 of the two user users of the user vector model
- the W 1 is the vector input v 1 of the user
- the W 2 is the weight of the vector input v 2 of the user.
- the step of acquiring text interaction information between users on the social platform, and establishing a text relationship prediction model according to text interaction information between the users and a convolutional neural network model includes:
- Obtaining text interaction information m between users on the social platform wherein the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h , t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t;
- the objective function of the text relationship prediction model includes:
- the output result of the user vector model and the output result of the text relationship prediction model are projected into the same space for linear joint output to obtain a realistic relationship prediction classification of the user on the social platform.
- the steps include:
- the preset linear joint algorithm function And outputting the output result h p of the user vector model and the output result of the text relationship prediction model to a preset space, and outputting a realistic relationship prediction classification of the user on the social platform; wherein To project the output result h p of the user vector model to the parameters of the preset space, A parameter for projecting an output result of the text relationship prediction model to the preset space.
- the application further provides a computer readable storage medium having stored thereon a computer program, the program being implemented by the processor to implement the steps:
- the step of establishing a user vector model according to the account information and the TransE model of each of the users includes:
- the objective function of the user vector model includes:
- the h p is a weighted non-linear transformed output of the vector inputs v 1 and v 2 of the two user users of the user vector model
- the W 1 is the vector input v 1 of the user
- the W 2 is the weight of the vector input v 2 of the user.
- the step of acquiring text interaction information between users on the social platform, and establishing a text relationship prediction model according to text interaction information between the users and a convolutional neural network model includes:
- Obtaining text interaction information m between users on the social platform wherein the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h , t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t;
- the objective function of the text relationship prediction model includes:
- the output result of the user vector model and the output result of the text relationship prediction model are projected into the same space for linear joint output to obtain
- the steps of predicting the classification of the real relationship of the users on the social platform include:
- the preset linear joint algorithm function And outputting the output result h p of the user vector model and the output result of the text relationship prediction model to a preset space, and outputting a realistic relationship prediction classification of the user on the social platform; wherein To project the output result h p of the user vector model to the parameters of the preset space, A parameter for projecting an output result of the text relationship prediction model to the preset space.
- the user vector model is established according to the account information of the user on the social platform and the TransE model, and the degree of relationship between the two user entities is predicted; then the text interaction information and convolution between the users on the social platform are performed.
- the neural network model establishes a text relationship prediction model, and obtains a predicted classification of the real relationship between the users on the social platform; by projecting the output result of the user vector model and the output result of the text relationship prediction model to the same
- the space performs a linear joint output to obtain a realistic relationship prediction result of the user on the social platform.
- the intimate relationship prediction is based on the interaction between the user's account information and the interactive text information between the users, and the intimate relationship prediction is performed according to the interaction behavior between the people on the social platform; the user's personal information analysis is effectively solved, and usually only the outlet is analyzed. At the same time, it solves the problem that the analysis of interactive text information between users cannot accurately analyze the actual relationship between users. For example, when using interactive text information analysis, usually only two users communicate with each other intimately. The two users are identified as having the disadvantage of having an intimate relationship.
- FIG. 1 is a flowchart of a method for matching a real relationship of a social platform user in a first embodiment of the present application
- step S10 is a sub-flow diagram of step S10 of the real relationship matching method of the social platform user in FIG. 1;
- FIG. 4 is a schematic structural diagram of a module of a data processing apparatus according to an embodiment of the present application.
- FIG. 1 is a flowchart of a method for a real-world relationship matching method 100 of a social platform user in a first embodiment of the present application, where the data processing method 100 includes the following steps:
- Step S10 Acquire user account information of the user on the social platform, and establish a user vector model according to the account information of each user and the TransE model.
- the user on the social platform may be a registered user on the same platform, or may be a user on a different social platform on a social platform.
- the social platform may be various social softwares or social networking websites in the Internet, and is not limited herein.
- the account information of the user includes, but is not limited to, the user's name, nickname, gender, age, hobbies, work experience, personal signature, tag, place of origin, resident address, email address, telephone number, social account number, and the like.
- the TransE model is a distributed vector representation based on entities and relationships.
- the relationship relation in each triple instance (head, relation, tail) is regarded as the spatial connection relationship from the entity head to the entity tail.
- h, r And t the vector of head, relation, and tail
- the user vector model is established according to the profile information of each user in the Weibo and the TransE model.
- the profile information of the user Zhang San includes: name-Zhang San, gender-female;
- a triad ⁇ Zhangsan, gender, female> is established, wherein h-zhangsan, t-female is the entity in the account information of the user, and r-sex is connected to the entity h - Zhang San, t-female relationship;
- user Li Si's profile information includes: name - Li Si, gender - male; according to the Li Si's profile information to create a triad ⁇ Li Si, Gender, male>, where h- ⁇ , t-male is the entity in the user's account information, r-sex is the relationship between the entity h-Zhang San, t-male; or the user Wang Wu Profile information includes: Name-Wang Wu,
- the dissimilarity d(h+r,t) of the true triple should be smaller than the false triplet d(h'+r,t) or d(h+r,t '), that is, when two entities have many identical relationships, their low-dimensional vectors will be very similar.
- Step S20 Obtain text interaction information between users on the social platform, and establish a text relationship prediction model according to text interaction information between the users and a convolutional neural network model.
- text information can be exchanged between users.
- the microblog platform two users can reply to each other @ or each other, the social information.
- the textual interaction information between users on the platform can be used as a data basis for judging the actual relationship between users.
- text interaction information m between users on the social platform may be defined, wherein the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h,t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t.
- a text relationship prediction model may be established to obtain a prediction classification of a real relationship between users based on text interaction information between the users.
- Step S30 projecting the output result of the user vector model and the output result of the text relationship prediction model to the same space for linear joint output, to obtain a realistic relationship prediction result of the user on the social platform.
- the output of the user vector model is the addition output h p of the vector inputs v 1 and v 2 of the two users of the user vector model; the text relationship prediction model outputs the user's a prediction classification r of the actual relationship between the two; and outputting the output result h p of the user vector model and the output result of the text relationship prediction model to the same space for linear joint output to obtain the user on the social platform Realistic relationship prediction results.
- the social network user's real relationship matching method 100 predicts the degree of relationship between two user entities by establishing a user vector model according to the user's account information and the TransE model on the social platform;
- the text interaction information between the users on the platform and the convolutional neural network model establishes a text relationship prediction model to obtain a predicted classification of the real relationship between the users on the social platform; by outputting the user vector model and The output of the text relationship prediction model is projected to the same space for linear joint output to obtain a realistic relationship prediction result of the user on the social platform.
- the intimate relationship prediction is based on the interaction between the user's account information and the interactive text information between the users, and the intimate relationship prediction is performed according to the interaction behavior between the people on the social platform; the user's personal information analysis is effectively solved, and usually only the outlet is analyzed. At the same time, it solves the problem that the analysis of interactive text information between users cannot accurately analyze the actual relationship between users. For example, when using interactive text information analysis, usually only two users communicate with each other intimately. The two users are identified as having the disadvantage of having an intimate relationship.
- step S10 establishing a user vector model according to each user account information and a TransE model may include:
- Step S101 Create a triplet ⁇ h, r, t> according to the account information of each user, where h, t is an entity in the account information of the user, and r is connected between the entities h, t Relationship;
- Step S102 Map a relationship between each entity and entity in each of the triples ⁇ h, r, t> to a low-dimensional vector including a relationship between each entity and an entity according to a TransE model.
- a triplet ⁇ h, r, t> may be established according to account information of each of the users, where h, t is an entity in the account information of the user, and r is connected to the entity.
- the relationship between each entity and entity in the group ⁇ h, r, t> is mapped to a low dimensional vector containing the relationship between each entity and entity.
- a user vector model is established according to personal profile information and TransE model of each user in the microblog, for example, when the social platform is Weibo, according to each user of the Weibo.
- the profile information and the TransE model establish a user vector model.
- the user profile information of the user Zhang San includes: name-Zhang San, gender-female; and establish a triad based on the profile information of Zhang San ⁇ Zhang San, gender, female >, wherein h- ⁇ , t-female is the entity in the user's account information, r-sex is the relationship between the entity h-Zhang San, t-female; and for example, the user Li Si's individual
- the brief information includes: name-Li Si, gender-male; according to the profile information of Li Si, create a triad ⁇ Li Si, gender, male>, where h- ⁇ , t-male is the user's account information In the entity, r-sex is the relationship between the entity h-Zhang San, t-male; or the personal profile information of the user Wang Wu includes: name-Wang Wu, occupation-teacher; according to the individual of Wang Wu Profile information Set up a triad ⁇ Wang Wu, occupation, teacher>, among them, h- ⁇ ,
- the first and last entities in the triple are regarded as points in the space, and the relationship between the connected entities can be obtained, and the corresponding network map can be obtained.
- the relationship between each entity and entity in each of the triples ⁇ h, r, t> is mapped to a low dimensional vector containing the relationship between each entity and the entity.
- the objective function of the user vector model may include:
- the dissimilarity d(h+r,t) of the true triple should be smaller than the false triplet d(h'+r,t) or d(h+r,t '), when two entities have many identical relationships, their low-dimensional vectors will be very similar.
- v 1 and v 2 are vector inputs of two users of the user vector model
- an output h p of the user vector model is the two vectors v 1 and v 2 vector obtained by weighted nonlinear transformation
- step S20 the acquiring text interaction information between users on the social platform, according to text interaction information and volume between the users
- the neural network model establishes a text relationship prediction model, which can include:
- Step S201 Obtain text interaction information m between users on the social platform, where the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h,t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t;
- Step S202 establishing a low-dimensional vector for each of the words according to the convolutional neural network model, training the low-dimensional vector established by each of the words by the trained Word2Vec algorithm, repeatedly extracting the words, and calculating the implicit
- the feature vector obtains a maximum value of the implicit feature vector for each dimension and classifies the maximum value of the implicit feature vector.
- the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h, t> ; m represents Contains textual interaction information for entities h, t, and M represents a collection of textual interaction information containing entities h, t.
- the user posts a text message "Happy Valentine's Day! Dear @ ⁇ "
- step S202 according to a convolutional neural network model (Convolutional Neural Networks, CNN) establishes a low-dimensional vector for each of the words, trains the low-dimensional vector established by each of the words through the trained Word2Vec algorithm, and repeats the extraction through a convolutional layer.
- the term calculates an implicit feature vector, obtains a maximum value of the implicit feature vector for each dimension by a maximum pooling layer, and classifies a maximum value of the implicit feature vector.
- the objective function of the text relationship prediction model includes:
- the CNN model can be used to capture the relative global features of the textual interaction information. For example, the difference between "Valentine's Day is a holiday" and "Happy Valentine's Day” can be recognized; thereby improving the accuracy of the textual relationship prediction model. .
- the step S30 may include: according to a preset linear joint algorithm function: And outputting the output result h p of the user vector model and the output result of the text relationship prediction model to a preset space, and outputting a realistic relationship prediction classification of the user on the social platform; wherein To project the output result h p of the user vector model to the parameters of the preset space, A parameter for projecting an output result of the text relationship prediction model to the preset space.
- the preset linear joint algorithm function is consistent with the target function type of the text relationship prediction model, and the output result h p of the user vector model and the text relationship prediction are introduced by introducing a deviation algorithm.
- the output of the model is projected into a preset space in a preset linear joint algorithm, and the output obtains a realistic relationship prediction classification of the user on the social platform, which can more accurately calculate and identify the intimacy of the online user in real life. relationship.
- the social relationship user's realistic relationship matching method in the above embodiment may be from social The real-life relationship of online users is accurately mined among hundreds of millions of users in the platform, and the realistic relationship matching method of the social platform users can be applied to the fields of financial products and public security monitoring.
- the internal data of a financial company is generally a single user data, and there is no correlation between the user and the user.
- the degree of relationship between people is very important. For example, in the risk control model, assuming that a user borrows, there is no bad record in itself, but his family or close friend has a bad credit record. Then, when it comes to credit evaluation, it should be more careful.
- the financial company establishes the user's close friend matching network according to the social account information in the external data user data, such as Sina Weibo, WeChat, etc., and can record the bad credit records of the user, his family, acquaintances, and friends on the matching network. To better control risk and prevent potential losses.
- FIG. 4 is a block diagram of a data processing apparatus 200 according to an embodiment of the present application.
- the data processing apparatus 200 includes a memory 201, a processor 202, and a computer program stored on the memory and operable on the processor 202.
- the processor 202 implements the program to implement the following steps:
- Step S10 Acquire user account information of the user on the social platform, and establish a user vector model according to the account information of each user and the TransE model;
- Step S20 Obtain text interaction information between users on the social platform, and establish a text relationship prediction model according to text interaction information between the users and a convolutional neural network model;
- Step S30 projecting the output result of the user vector model and the output result of the text relationship prediction model to the same space for linear joint output, to obtain a realistic relationship prediction result of the user on the social platform.
- the data processing device effectively solves the problem of only analyzing the personal information of the user, and usually can only analyze the shortcomings of the online friends; at the same time, it solves the problem that the analysis of the interactive text information between the users cannot be accurate.
- Analysis of the actual relationship between users for example, when using interactive text information analysis, usually as long as two users communicate with each other intimately, the two users are determined to have the disadvantage of having an intimate relationship.
- the data processing device 200 may be an electronic product having a data processing function such as a server, a computer, a portable computer device, a mobile phone, or a tablet computer.
- the step S10 may include:
- Step S101 Create a triplet ⁇ h, r, t> according to the account information of each user, where h, t is an entity in the account information of the user, and r is connected between the entities h, t Relationship;
- Step S102 Map a relationship between each entity and entity in each of the triples ⁇ h, r, t> to a low-dimensional vector including a relationship between each entity and an entity according to a TransE model.
- the objective function of the user vector model may include:
- v 1 and v 2 are vector inputs of two users of the user vector model
- an output h p of the user vector model is the two vectors v 1 and v 2 vector obtained by weighted nonlinear transformation
- the step S20 may include:
- Step S201 Obtain text interaction information m between users on the social platform, where the text interaction information m includes a plurality of words ⁇ u 1 , u 2 , u 3 , ... u n ⁇ , m ⁇ M ⁇ h,t> ;m represents textual interaction information containing entities h,t, and M represents a collection of textual interaction information containing entities h,t;
- Step S202 establishing a low-dimensional vector for each of the words according to the convolutional neural network model, training the low-dimensional vector established by each of the words by the trained Word2Vec algorithm, repeatedly extracting the words, and calculating the implicit Feature vector, get each Dimensioning the maximum value of the implicit feature vector and classifying the maximum value of the implicit feature vector.
- the objective function of the text relationship prediction model includes:
- the step S30 may include: according to a preset linear joint algorithm function: And outputting the output result h p of the user vector model and the output result of the text relationship prediction model to a preset space, and outputting a realistic relationship prediction classification of the user on the social platform; wherein To project the output result h p of the user vector model to the parameters of the preset space, A parameter for projecting an output result of the text relationship prediction model to the preset space.
- the present application also provides a computer readable storage medium having stored thereon a computer program, which, when executed by a processor, can implement the data processing method 100, the data processing method 102, the data processing method 103, and the data processing method as described above. Step 104.
- the technical solution of the present application which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM as described above). , a disk, an optical disk, including a number of instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in the various embodiments of the present application.
- a terminal device which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Strategic Management (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- General Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- Marketing (AREA)
- General Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Primary Health Care (AREA)
- General Health & Medical Sciences (AREA)
- Development Economics (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (20)
- 一种社交平台用户的现实关系匹配方法,包括步骤:获取社交平台上的用户的账号信息,根据每一所述用户的账号信息和TransE模型建立用户向量模型;获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文本交互信息和卷积神经网络模型建立文本关系预测模型;将所述用户向量模型的输出结果以及所述文本关系预测模型的输出结果投射到同一个空间进行线性联合输出,以得到所述社交平台上的用户的现实关系预测结果。
- 如权利要求1所述的社交平台用户的现实关系匹配方法,其中,所述根据每一所述用户的账号信息和TransE模型建立用户向量模型的步骤包括:根据每一所述用户的账号信息建立一个三元组<h,r,t>,其中,h,t是用户的账号信息中的实体,r是连接所述实体h,t之间的关系;根据TransE模型把所述每一个三元组<h,r,t>中每个实体和实体之间的关系映射为包含每个实体和实体之间的关系的低维度向量。
- 如权利要求2所述的社交平台用户的现实关系匹配方法,其中,所述用户向量模型的输出结果hp=tanh(W1v1+W2v2),其中,所述v1和v2为所述用户向量模型的两个用户的向量输入,所述hp为所述用户向量模型的两个用户的向量输入v1和v2的加权非线性转化输出经偏差修正的输出结果,所述W1为所述用户的向量输入v1的权重,所述W2为所 述用户的向量输入v2的权重。
- 如权利要求1所述的社交平台用户的现实关系匹配方法,其中,所述获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文本交互信息和卷积神经网络模型建立文本关系预测模型的步骤包括:获取所述社交平台上的用户之间的文本交互信息m,其中,所述文本交互信息m包括多个词语{u1,u2,u3,...un},m∈M<h,t>;m代表包含实体h,t的文本交互信息,M代表包含实体h,t的文本交互信息的集合;根据卷积神经网络模型对所述每一个词语建立低维度向量,通过已训练好的Word2Vec算法对所述每一个词语建立的低维度向量进行训练,重复提取所述词语并计算隐含特征向量,获取每一维所述隐含特征向量的最大值,并将所述隐含特征向量的最大值进行分类。
- 如权利要求5所述的社交平台用户的现实关系匹配方法,其中,所述文本关系预测模型的输出结果r=max{hi},其中,hi=tanh(W-1ui-1+W0ui+W1ui+1),所述u为所述用户之间的文本交互信息中的词语,所述W-1代表ui-1的权重,所述W0代表ui的权重,所述W1代表ui+1的权重。
- 一种数据处理装置,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,其中,所述处理器执行所述程序时实现步骤:获取社交平台上的用户的账号信息,根据每一所述用户的账号信息和TransE模型建立用户向量模型;获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文本交互信息和卷积神经网络模型建立文本关系预测模型;将所述用户向量模型的输出结果以及所述文本关系预测模型的输出结果投射到同一个空间进行线性联合输出,以得到所述社交平台上的用户的现实关系预测结果。
- 如权利要求9所述的数据处理装置,其中,所述根据每一所述用户的账号信息和TransE模型建立用户向量模型的步骤包括:根据每一所述用户的账号信息建立一个三元组<h,r,t>,其中,h,t是用户的账号信息中的实体,r是连接所述实体h,t之间的关系;根据TransE模型把所述每一个三元组<h,r,t>中每个实体和实体之间的关系映射为包含每个实体和实体之间的关系的低维度向量;其中,所述用户向量模型的目标函数包括:
- 如权利要求10所述的数据处理装置,其中,所述用户向量模型的输出结果hp=tanh(W1v1+W2v2),其中,所述v1和v2为所述用户向量模型的两个用户的向量输入,所述hp为所述用户向量模型的两个用户的向量输入v1和v2的加权非线性转化输出经偏差修正的输出结果,所述W1为所述用户的向量输入v1的权重,所述W2为所述用户的向量输 入v2的权重。
- 如权利要求9所述的数据处理装置,其中,所述获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文本交互信息和卷积神经网络模型建立文本关系预测模型的步骤包括:获取所述社交平台上的用户之间的文本交互信息m,其中,所述文本交互信息m包括多个词语{u1,u2,u3,...un},m∈M<h,t>;m代表包含实体h,t的文本交互信息,M代表包含实体h,t的文本交互信息的集合;根据卷积神经网络模型对所述每一个词语建立低维度向量,通过已训练好的Word2Vec算法对所述每一个词语建立的低维度向量进行训练,重复提取所述词语并计算隐含特征向量,获取每一维所述隐含特征向量的最大值,并将所述隐含特征向量的最大值进行分类;其中,所述文本关系预测模型的目标函数包括:
- 如权利要求12所述的数据处理装置,其中,所述文本关系预测模型的输出结果r=max{hi},其中,hi=tanh(W-1ui-1+W0ui+W1ui+1),所述u为所述用户之间的文本交互信息中的词语,所述W-1代表ui-1的权重,所述W0代表ui的权重,所述W1代表ui+1的权重。
- 一种计算机可读存储介质,其上存储有计算机程序,其中,该程序被处理器执行时实现步骤:获取社交平台上的用户的账号信息,根据每一所述用户的账号信息和TransE模型建立用户向量模型;获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文本交互信息和卷积神经网络模型建立文本关系预测模型;将所述用户向量模型的输出结果以及所述文本关系预测模型的输出结果投射到同一个空间进行线性联合输出,以得到所述社交平台上的用户的现实关系预测结果。
- 如权利要求15所述的计算机可读存储介质,其中,所述根据每一所述用户的账号信息和TransE模型建立用户向量模型的步骤包括:根据每一所述用户的账号信息建立一个三元组<h,r,t>,其中,h,t是用户的账号信息中的实体,r是连接所述实体h,t之间的关系;根据TransE模型把所述每一个三元组<h,r,t>中每个实体和实体之间的关系映射为包含每个实体和实体之间的关系的低维度向量;其中,所述用户向量模型的目标函数包括:
- 如权利要求16所述的计算机可读存储介质,其中,所述用户向量模型的输出结果hp=tanh(W1v1+W2v2),其中,所述v1和v2为所述用户向量模型的两个用户的向量输入,所述hp为所述用户向量模型的两个用户的向量输入v1和v2的加权非线性转化输出经偏差修正的输出结果,所述W1为所述用户的向量输入v1的权重,所述W2为所述用户的向量输入v2的权重。
- 如权利要求15所述的计算机可读存储介质,其中,所述获取所述社交平台上的用户之间的文本交互信息,根据所述用户之间的文 本交互信息和卷积神经网络模型建立文本关系预测模型的步骤包括:获取所述社交平台上的用户之间的文本交互信息m,其中,所述文本交互信息m包括多个词语{u1,u2,u3,...un},m∈M<h,t>;m代表包含实体h,t的文本交互信息,M代表包含实体h,t的文本交互信息的集合;根据卷积神经网络模型对所述每一个词语建立低维度向量,通过已训练好的Word2Vec算法对所述每一个词语建立的低维度向量进行训练,重复提取所述词语并计算隐含特征向量,获取每一维所述隐含特征向量的最大值,并将所述隐含特征向量的最大值进行分类;其中,所述文本关系预测模型的目标函数包括:
- 如权利要求18所述的计算机可读存储介质,其中,所述文本关系预测模型的输出结果r=max{hi},其中,hi=tanh(W-1ui-1+W0ui+W1ui+1),所述u为所述用户之间的文本交互信息中的词语,所述W-1代表ui-1的权重,所述W0代表ui的权重,所述W1代表ui+1的权重。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710827452.0A CN107741953B (zh) | 2017-09-14 | 2017-09-14 | 社交平台用户的现实关系匹配方法、装置及可读存储介质 |
CN201710827452.0 | 2017-09-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019051962A1 true WO2019051962A1 (zh) | 2019-03-21 |
Family
ID=61235804
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/109222 WO2019051962A1 (zh) | 2017-09-14 | 2017-11-03 | 社交平台用户的现实关系匹配方法、装置及可读存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107741953B (zh) |
WO (1) | WO2019051962A1 (zh) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108537624B (zh) * | 2018-03-09 | 2021-08-03 | 西北大学 | 一种基于深度学习的旅游服务推荐方法 |
CN108399509A (zh) * | 2018-04-12 | 2018-08-14 | 阿里巴巴集团控股有限公司 | 确定业务请求事件的风险概率的方法及装置 |
CN109947817B (zh) * | 2019-03-01 | 2021-06-18 | 武汉达梦数据库有限公司 | 一种基于关联图谱的六度空间关系分析方法及系统 |
CN110674637B (zh) * | 2019-09-06 | 2023-07-11 | 腾讯科技(深圳)有限公司 | 一种人物关系识别模型训练方法、装置、设备及介质 |
CN111369375A (zh) * | 2020-03-17 | 2020-07-03 | 深圳市随手金服信息科技有限公司 | 一种社交关系确定方法、装置、设备及存储介质 |
CN111797333B (zh) * | 2020-06-04 | 2021-04-20 | 南京擎盾信息科技有限公司 | 舆情传播任务展示的方法及装置 |
CN111797197B (zh) * | 2020-06-04 | 2021-03-26 | 南京擎盾信息科技有限公司 | 舆情分析的方法及装置 |
CN112085114A (zh) * | 2020-09-14 | 2020-12-15 | 杭州中奥科技有限公司 | 线上线下身份匹配方法、装置、设备及存储介质 |
CN112685614B (zh) * | 2021-03-17 | 2021-06-18 | 中国电子科技集团公司第三十研究所 | 一种社交媒体机器人群体快速检测方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140288999A1 (en) * | 2013-03-12 | 2014-09-25 | Correlor Technologies Ltd | Social character recognition (scr) system |
CN104615608A (zh) * | 2014-04-28 | 2015-05-13 | 腾讯科技(深圳)有限公司 | 一种数据挖掘处理系统及方法 |
CN104657434A (zh) * | 2015-01-30 | 2015-05-27 | 中国科学院信息工程研究所 | 一种社交网络结构构建方法 |
CN105741175A (zh) * | 2016-01-27 | 2016-07-06 | 电子科技大学 | 一种对在线社交网络中账户进行关联的方法 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104657369A (zh) * | 2013-11-19 | 2015-05-27 | 深圳市腾讯计算机系统有限公司 | 用户属性信息的生成方法及系统 |
US20160042282A1 (en) * | 2014-08-11 | 2016-02-11 | Rashied Baradaran Amini | Relationship evaluator |
CN106570764A (zh) * | 2016-11-09 | 2017-04-19 | 广州杰赛科技股份有限公司 | 一种用户关系预测方法及装置 |
-
2017
- 2017-09-14 CN CN201710827452.0A patent/CN107741953B/zh active Active
- 2017-11-03 WO PCT/CN2017/109222 patent/WO2019051962A1/zh active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140288999A1 (en) * | 2013-03-12 | 2014-09-25 | Correlor Technologies Ltd | Social character recognition (scr) system |
CN104615608A (zh) * | 2014-04-28 | 2015-05-13 | 腾讯科技(深圳)有限公司 | 一种数据挖掘处理系统及方法 |
CN104657434A (zh) * | 2015-01-30 | 2015-05-27 | 中国科学院信息工程研究所 | 一种社交网络结构构建方法 |
CN105741175A (zh) * | 2016-01-27 | 2016-07-06 | 电子科技大学 | 一种对在线社交网络中账户进行关联的方法 |
Non-Patent Citations (1)
Title |
---|
WORD2VEC + TRANSE, 7 July 2016 (2016-07-07), XP055583514, Retrieved from the Internet <URL:http://www.cnblogs.com/chenbjin/p/5644457.html> * |
Also Published As
Publication number | Publication date |
---|---|
CN107741953A (zh) | 2018-02-27 |
CN107741953B (zh) | 2020-01-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019051962A1 (zh) | 社交平台用户的现实关系匹配方法、装置及可读存储介质 | |
US11659050B2 (en) | Discovering signature of electronic social networks | |
Alzahrani et al. | Designing a network intrusion detection system based on machine learning for software defined networks | |
CN111431742B (zh) | 网络信息检测方法、装置、存储介质和计算机设备 | |
US9477685B1 (en) | Finding untagged images of a social network member | |
CN109933666B (zh) | 一种好友自动分类方法、装置、计算机设备和存储介质 | |
US11487791B2 (en) | Latent feature extraction from a network graph | |
CN111401700B (zh) | 一种数据分析方法、装置、计算机系统及可读存储介质 | |
CN107085616B (zh) | Lbsn中一种基于多维属性挖掘的虚假评论可疑地点检测方法 | |
US20170099291A1 (en) | Protecting privacy in an online setting | |
Yousefnezhad et al. | Automated iot device identification based on full packet information using real-time network traffic | |
US11755602B2 (en) | Correlating parallelized data from disparate data sources to aggregate graph data portions to predictively identify entity data | |
CN111400452A (zh) | 文本信息分类处理方法、电子设备及计算机可读存储介质 | |
Alterazi et al. | Prevention of cyber security with the internet of things using particle swarm optimization | |
US20180293312A1 (en) | Computerized Method and System for Organizing Video Files | |
Salim et al. | Data analytics of social media 3.0: Privacy protection perspectives for integrating social media and Internet of Things (SM-IoT) systems | |
CN112395979A (zh) | 基于图像的健康状态识别方法、装置、设备及存储介质 | |
de la Torre-Abaitua et al. | A compression-based method for detecting anomalies in textual data | |
CN113420212A (zh) | 基于深度特征学习的推荐方法、装置、设备及存储介质 | |
Alterkavı et al. | Novel authorship verification model for social media accounts compromised by a human | |
Kao et al. | A novel two-stage deep learning structure for network flow anomaly detection | |
CN111480348B (zh) | 用于基于音频的增强现实的系统和方法 | |
WO2019080404A1 (zh) | 跨社交平台用户匹配方法、数据处理装置及可读存储介质 | |
CN113158030A (zh) | 异地兴趣点的推荐方法、装置、电子设备及存储介质 | |
CN112270174A (zh) | 一种谣言检测方法及计算机可读存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 09/10/2020) |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 09/10/2020) |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17925357 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17925357 Country of ref document: EP Kind code of ref document: A1 |