CN113434782B - Cross-social network user identity recognition method based on joint embedded learning model - Google Patents
Cross-social network user identity recognition method based on joint embedded learning model Download PDFInfo
- Publication number
- CN113434782B CN113434782B CN202110718740.9A CN202110718740A CN113434782B CN 113434782 B CN113434782 B CN 113434782B CN 202110718740 A CN202110718740 A CN 202110718740A CN 113434782 B CN113434782 B CN 113434782B
- Authority
- CN
- China
- Prior art keywords
- user
- upg
- representing
- node
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 20
- 230000006870 function Effects 0.000 claims abstract description 16
- 238000013528 artificial neural network Methods 0.000 claims abstract description 4
- 239000013598 vector Substances 0.000 claims description 14
- 238000005070 sampling Methods 0.000 claims description 9
- 238000010586 diagram Methods 0.000 claims description 8
- 238000005516 engineering process Methods 0.000 claims description 4
- 238000012549 training Methods 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 claims description 3
- 238000011478 gradient descent method Methods 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 3
- 239000011159 matrix material Substances 0.000 claims description 3
- 230000007246 mechanism Effects 0.000 claims description 3
- 238000012544 monitoring process Methods 0.000 claims description 3
- 230000008569 process Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 2
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/01—Social networking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Biophysics (AREA)
- Business, Economics & Management (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- General Business, Economics & Management (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses a cross-social-network user identity recognition method based on a joint embedded learning model. Firstly, selecting candidate paired user pairs from two social networks by utilizing the similarity of user names and network structures; then, constructing a user pair network graph by taking all candidate paired user pairs as nodes; secondly, on the basis of the constructed UPG and the labeled user pair data, a labeled matched user information label, structure information and attribute information are fused to build a combined embedded learning model, and the model is designed into a deep neural network structure with 1 input and 2 outputs; and finally, performing minimum learning on the loss function of the combined embedded model by using a random gradient descent algorithm, predicting the user pair to be predicted by using the parameters of the model after learning is finished, and judging whether the output is the same user or not. The method and the system can effectively predict whether two users from different networks are the same user, and play a vital role in related application of cross-social networks in commerce.
Description
Technical Field
The invention relates to the field of user relationship mining for social networks. In particular to a cross-social network user identity recognition method based on a joint embedded learning model.
Background
From early email, BBS, to today's Social Media Networks (SMNs), more and more users are becoming accustomed to daily interaction and information acquisition on Social networks. People often need to register as users of a different website in order to enjoy services provided by the website. It is a common phenomenon that a common user owns virtual accounts of multiple different social networking sites. Because each social network site is independent, the data information is not shared, and a uniform identity identifier is lacked on the network to uniquely identify the netizen, a plurality of social network site accounts belonging to the same netizen are not directly related. In order to obtain a complete image (profile) of a user, data of the user on different social networks needs to be integrated, which breaks through the association of user identities across social platforms, i.e., the identification of accounts of the user on multiple social networks. In recent years, social network identification methods based on representation learning have been prevalent, and researchers have begun to identify users on multiple social networks by using algorithms based on network embedding, however, the following problems still exist in the realization of the cross-social network user identification technology based on representation learning:
1. the existing expression-based learning method belongs to a supervised learning mode and an unsupervised learning mode, wherein the former needs a large amount of Labeled data (Labeled data), the Labeled data is difficult to obtain, and a large amount of manpower is consumed; the latter does not require labeling data, but the obtained effect is often unsatisfactory.
2. The accuracy of user identity recognition can be improved by comprehensively utilizing modal data such as attribute information of a user, structural information of a network, label information of the user and the like, but how to embed the information into a uniform vector space is a difficult problem;
3. the existing user identity correlation method based on representation learning usually splits a task into two steps of embedded learning and identity recognition of nodes, so that label information of a user cannot be effectively integrated.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a cross-social-network user identity association method based on a joint embedding model.
The technical scheme adopted by the invention for solving the technical problem comprises the following steps:
step 2, all the candidate pairing user pairs P ═ { P ═ PiIs node, if user is piTwo users inAre respectively neighbors of the other party, then piAnd pjAn edge exists between the two, and a user-to-network graph UPG is constructed by taking the edge as a principle;
step 3, fusing labeled paired user information labels, structure information and attribute information to build a combined embedded learning model on the basis of the constructed user pair network graph UPG and labeled user pair data, and designing the combined embedded learning model into a deep neural network structure with 1 input and 2 outputs;
and 4, performing minimum learning on the loss function of the combined embedded learning model by using a random gradient descent algorithm, predicting the user pair to be predicted by using the model after learning is finished, and judging whether the output is the same user or not.
Further, the step 1 is specifically realized as follows:
1-1.GA=(UA,EA,XA) Representing social networks A, UASet of users representing social network A, EASet of user relationships, X, representing social network AAA matrix of user attributes representing social network a,representing user i in social network A; gB=(UB,EB,XB) Representing a social network B, and the rest parameters have similar meanings;
1-2, acquiring data of different social network platforms by using a crawler;
1-3. pairs are from social network G respectivelyAAnd GBTo a userUser name n ofkAnd njCalculating the similarity according to a formula (1), and adding a user pair with the similarity larger than 0.8 into the candidate paired user pair set P;
wherein, lev (n)k,nj) Represents the Levenshtein distance, l (n)k) Representing a user name nkThe character length of (d);
1-4, expanding neighbor nodes by taking each pair of users in the user pair set P as a seed user pair, selecting user pairs with r common neighbors (known pairs) from the neighbor nodes of the seed user pair, adding the user pairs into the P, and setting different r values according to different data sets.
Further, the step 2 is specifically realized as follows:
2-1.UPG=(UUPG,EUPG) Representing a user versus network diagram, UUPGRepresenting a set of nodes, EUPGRepresenting a set of relationships between nodes; pairing candidate users piNode as UPG and is recorded as u'i,u'i∈UUPG;
2-2. supposeAndtwo nodes in the UPG have an edge between the two nodes if the following relationship exists between the two nodes;
Further, the step 3 is specifically realized as follows:
3-1, marking accurate mapping account numbers of the users in another network by text analysis and matching technology and combining manual judgment through partial user attribute information crawled by a crawler; the marked user matching pairs are used as the monitoring information of model training;
3-2, every two users in the candidate pairing user set generated in the step 2-1 are pairedAndthe attributes of (1) are subjected to feature conversion through one-hot coding and are respectively recorded asAndthe attributes comprise a user name, a gender, a graduation institution and a geographic location;
3-3, constructing a joint embedded learning model for the network aiming at the constructed user; vector the attributes of two users in a nodePerforming a splicing operation, noteAnd d isiAs input to a joint embedded learning model; the output has a left branch and a right branch, and the left branch utilizes a multilayer perceptron model to output a node label yiThe probability values are 0 and 1, wherein 1 represents that two users in the node are the same user, and 0 represents that two users in the node are different users; outputting the predicted probability value of the Context node by the right branch by using a skipgram model;
the mth layer of the skipgram model is represented as:
wherein δ (·) represents a sigmoid function, WmAnd bmWeights and biases parameters for m layers; formula (4) and formula (5) represent the m +1 th layers of the left and right branches, respectively;the weights parameter representing the left branch of the (m + 1) th layer,the weights parameter of the right branch of the (m + 1) th layer is represented,andand so on;
the last layer of the left branch of the model is designed as softmax layer, and the input of the layer is:
the last layer of the right branch of the model is designed as a softmax layer, and the input of the layer is as follows:
where k represents the number of layers of the left branch implicit layer and k' represents the number of layers of the right branch implicit layer.
Further, the step 4 is specifically realized as follows:
4-1. the left branch of the joint embedding learning model is a multi-layer perceptual model, and the loss function of the branch is defined as:
whereinRepresenting a tagged node in UPG, p (y)i|di) Represents given diUnder the condition of yiIs calculated as follows:
the right branch adopts a negative sampling mechanism to define a loss function as follows:
where δ (·) stands for sigmoid function, n ═ UUPGL, u 'represents node u'iThe context node of (a) is selected,representing randomly selected t negative samples;
4-2, calculating parameters by adopting a mini-batch gradient descent method; setting the value of the left branch's batch b1Set to 200, the value of batch of the right branch b2Is 200; slave UUPGMiddle random sampling b1The labeled nodes, and calculate L(L)According to the gradient value of the parameter WmAnd bm,Andupdating;
4-3 from UUPGMiddle random sampling b2A node and calculateAccording to the gradient value of the parameter WmAnd bm,Andupdating of (1);
4-4, returning to the step 4-2, and iterating for 100 times;
4-5 input node u 'to be predicted in UPG'jCalculating according to the step 3-2 to obtain the attribute vectors of the two users in the node, and splicing the attribute vectors to obtain a vector djInputting the data into a joint embedding learning model, and calculating to obtain a node u 'to be predicted'jThe label of (1).
The invention has the following beneficial effects:
the invention focuses on how to implement network embedding method, effectively integrates key factors of user identity identification, and realizes user identity identification on two social platforms. The cross-social platform identity association plays a crucial role in business cross-social network applications, such as user behavior analysis of multiple social networks, information service push of cross-social networks, cross-platform friend recommendation, network security governance of government offices and enterprises and the like. The method and the system can effectively predict whether two users from different networks are the same user, and play a vital role in the related application of cross-social networks in commerce.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a diagram of an example of a candidate paired user pair generation;
FIG. 3 is an exemplary diagram of a user generating a network graph;
FIG. 4 is an exemplary diagram of a joint embedding model;
Detailed Description
The invention will be further explained with reference to the drawings.
As shown in FIG. 1, the method for identifying the user identity across the social network based on the joint embedded learning model comprises the following steps:
step 2, all candidate pairing user pairs P ═ { P ═ PiIs node, if user is piTwo users in the system are respectively neighbors of the other party, then piAnd pjAn edge exists between the two, and a User Pair network Graph (UPG) is constructed by taking the edge as a principle;
step 3, on the basis of the constructed UPG and labeled user pair data (labeled user pairs), labeled paired user information labels, structure information and attribute information are fused to build a joint embedded learning model, and the model is designed into a deep neural network structure with 1 input and 2 outputs;
and 4, learning the loss function minimization of the combined embedded model by using a random gradient descent algorithm, predicting the user pair to be predicted by using the model after learning is finished, and judging whether the output is the same user or not.
The specific implementation process of the step 1 is as follows:
1-1.GA=(UA,EA,XA) Representing social networks A, UASet of users representing social network A, EASet of user relationships, X, representing social network AAA matrix of user attributes representing social network a,representing a user in social network A; gB=(UB,EB,XB) Representing social network B, the remaining parameters are similar in meaning. The invention utilizes web crawlers to microblog from the green sea (G)A) And known as (G)B) The new wave net comprises about 1.23 x 10 user nodes5The human network contains about 1.95 x 10 user data5. The user information common to the two networks includes user name and natureThe university, the graduation institution and the location.
And 1-2, data of different social network platforms are obtained by using a crawler.
1-3. pairs are from social network G respectivelyAAnd GBTo a userUser name string nkAnd njCalculating the similarity according to the following formula, selecting the user pairs with the similarity more than 0.8 to be added into the candidate paired user pair set P,
wherein, lev (n)k,nj) Represents the Levenshtein distance, l (n)k) Representing a user name nkThe character length of (2). For example, the user name "vio" and "violet" have a similarity of 0.5.
1-3, taking each pair of users in the P as a seed user pair to expand neighbor nodes, selecting the user pairs with r common neighbors (known pairs) from the neighbor nodes of the seed user pair to be added into the P, and setting different r values according to different data sets. In this step, the present invention provides the example shown in FIG. 2. In FIG. 2, assume thatFor user pairs with a username similarity greater than 0.8, let r be 2, according to which step it will beFour user pairs are used as candidate pairing user pairs to be added into P, and finally
The specific implementation process of the step 2 is as follows:
2-1.UPG=(UUPG,EUPG) Representing a user versus network diagram, UUPGRepresenting a set of nodes, EUPGRepresenting a set of relationships between nodes. Pairing candidate users piNode as UPG and is recorded as u'i,u'i∈UUPG;
2-2. supposeAndfor two nodes in a UPG, there is an edge between the two nodes if there is a relationship between them.
The present invention provides step 2 with a user-to-network graph generated by the two social networks shown in FIG. 2, with the results shown in FIG. 3. According to step 2-1 and step 2-2, the generated user-to-network graph contains 6 nodes and 8 edges.
The specific implementation process of the step 3 is as follows:
and 3-1, marking the accurate mapping account of the user in another network by using partial user attribute information (such as account information of other platforms, mobile phones and mailboxes provided by the user in personal introduction) crawled by a crawler, text analysis and matching technology and manual judgment. And the marked user matching pairs are used as the monitoring information of model training.
3-2, every two users in the candidate pairing user set generated in the step 2-1 are pairedAndthe attributes (user name, gender, college and geography) of (1) are subjected to feature conversion by one-hot coding and are respectively recorded asAnd
specifically, aiming at the attribute of the user name, Chinese characters are unified into pinyin, capital letters are unified into lowercase letters, special characters such as underlines and the like are removed, and then a plurality of character substrings are intercepted from the user nameAnd performing one-hot coding on the character substring. For example, for a user name "violet", several character substrings { "vio", "iol", "ole", "let" } with a length of 3 may be truncated. And directly implementing one-hot coding according to the classifiable attributes such as gender, geographic position, graduation colleges and the like. For example, there are only two options in gender, "male" and "female", then the "male" attribute may be encoded as {10}, the "female" attribute may be encoded as {01}, and the remaining attributes are similar.
3-3. as shown in FIG. 4, a joint embedding model is built for the built user to the network. Attribute vectors (denoted as attribute vectors) for two users in a node) Performing a splicing operation, noteAnd as input to the joint embedding model; the output has a left branch and a right branch, and the left branch utilizes a multilayer perceptron model to output a node label y representing predictioniProbability values of 0 and 1 (1 in the nodeThe two users in the node are the same user, 0 represents that the two users in the node are different users), and the probability value of the predicted Context node is output by the right branch by using a skipgram model. The mth layer of the model is represented as:
wherein δ (·) represents a sigmoid function, WmAnd bmAre the weights and biases parameters for the m layers. The latter two formulas represent the (m + 1) th layers of the left branch and the right branch respectively;the weights parameter representing the left branch of the (m + 1) th layer,the weights parameter of the right branch of the (m + 1) th layer is represented,andand so on.
The last layer of the left branch (node label prediction) of the model is designed as the softmax layer, and the inputs of the layer are:
the last layer of the right branch (node label prediction) of the model is designed as the softmax layer, and the inputs of the layer are:
where k represents the number of layers of the left branch implicit layer and k' represents the number of layers of the right branch implicit layer.
The specific implementation process of the step 4 is as follows:
4-1. the left branch of the joint embedding model is a multi-layer perceptual model, and the loss function of the branch is defined as:
whereinRepresenting a tagged node in UPG, p (y)i|di) Represents given diUnder the condition of yiIs calculated as follows:
the right branch adopts a negative sampling mechanism to define a loss function as follows:
where δ (·) stands for sigmoid function, n ═ UUPGL, u 'represent all points u'iThe context node of (a) is selected,representing t negative samples chosen at random. The remaining parameters are referred to in step 3-3.
4-2, calculating parameters by adopting a mini-batch gradient descent method. Setting the value of the left branch's batch b1Set to 200, right-handedbatch value b2Is 200, randomly sampling b1Node with label, and calculate ^ L(L)By a gradient value of the parameter WmAnd bm,Andupdating;
4-3 from UUPGMiddle sampling b2A node and calculateBy a gradient value of the parameter WmAnd bm,Andupdating of (1);
4-4 returns to step 4-2 and iterates 100 times.
4-5 input node u 'to be predicted in UPG'jCalculating according to the step 3-2 to obtain the attribute vectors of the two users in the node, and splicing the attribute vectors to obtain a vector djInputting the data into a joint embedding model, and calculating to obtain a node u 'to be predicted'jThe label of (1).
In step 4, taking the crawl of the user data of the Xinlang microblog and the known net user data as an example, 7325 user data pairs are extracted from the user data, wherein the 7325 user data pairs comprise 2213 labeled data, 30% of the labeled data are extracted to serve as model training data, and the rest are taken as test data. And aiming at the network pair, constructing a user-to-network diagram, constructing a joint embedded model according to the diagram 4, and performing parameter learning on the model. And (4) carrying out user identity correlation and calculating accuracy aiming at the test data pair, wherein the finally obtained accuracy reaches 84.7%.
Claims (3)
1. The cross-social network user identity recognition method based on the joint embedded learning model is characterized by comprising the following steps of:
step 1, aiming at social network GAAnd GBThe user selects candidate paired user pairs from the two social networks by utilizing the user name similarity and the network structure;
step 2, all the candidate pairing user pairs P ═ { P ═ PiIs node, if user is piTwo users in the system are respectively neighbors of the other party, then piAnd pjAn edge exists between the two, and a user-to-network graph UPG is constructed by taking the edge as a principle;
step 3, fusing labeled paired user information labels, structure information and attribute information to build a combined embedded learning model on the basis of the constructed user pair network graph UPG and labeled user pair data, and designing the combined embedded learning model into a deep neural network structure with 1 input and 2 outputs;
step 4, performing minimum learning on the loss function of the combined embedded learning model by using a random gradient descent algorithm, predicting the user pair to be predicted by using the model after learning is finished, and judging whether the output is the same user;
the step 3 is realized as follows:
3-1, marking accurate mapping account numbers of the users in another network by text analysis and matching technology and combining manual judgment through partial user attribute information crawled by a crawler; the marked user matching pairs are used as the monitoring information of model training;
3-2, every two users in the candidate pairing user set generated in the step 2-1 are pairedAndthe attributes of (1) are subjected to feature conversion through one-hot coding and are respectively recorded asAndthe attributes comprise a user name, a gender, a graduation institution and a geographic location;
3-3, constructing a joint embedded learning model for the network aiming at the constructed user; vector the attributes of two users in a nodePerforming a splicing operation, noteAnd d isiAs input to a joint embedded learning model; the output has a left branch and a right branch, and the left branch utilizes a multilayer perceptron model to output a node label yiThe probability values are 0 and 1, wherein 1 represents that two users in the node are the same user, and 0 represents that two users in the node are different users; outputting the predicted probability value of the Context node by the right branch by using a skipgram model;
the mth layer of the skipgram model is represented as:
wherein δ (·) represents a sigmoid function, WmAnd bmWeights and biases parameters for m layers; formula (4) and formula (5) represent the m +1 th layers of the left and right branches, respectively;the weights parameter representing the left branch of the (m + 1) th layer,the weights parameter of the right branch of the (m + 1) th layer is represented,andand so on;
the last layer of the left branch of the model is designed as softmax layer, and the input of the layer is:
the last layer of the right branch of the model is designed as a softmax layer, and the input of the layer is as follows:
where k represents the number of layers of the left branch implicit layer and k' represents the number of layers of the right branch implicit layer.
2. The method for identifying the user identity across the social network based on the joint embedded learning model according to claim 1, wherein the step 1 is implemented as follows:
1-1.GA=(UA,EA,XA) Representing social networks A, UASet of users representing social network A, EASet of user relationships, X, representing social network AAA matrix of user attributes representing social network a,representing user i in social network A; gB=(UB,EB,XB) Representing a social network B, and the rest parameters have similar meanings;
1-2, acquiring data of different social network platforms by using a crawler;
1-3. pairs are from social network G respectivelyAAnd GBTo a userUser name n ofkAnd njCalculating the similarity according to a formula (1), and adding a user pair with the similarity larger than 0.8 into the candidate paired user pair set P;
wherein, lev (n)k,nj) Represents the Levenshtein distance, l (n)k) Representing a user name nkThe character length of (d);
1-4, expanding neighbor nodes by taking each pair of users in the user pair set P as a seed user pair, selecting user pairs with r common neighbors (known pairs) from the neighbor nodes of the seed user pairs, adding the user pairs into the P, and setting different r values according to different data sets;
the step 2 is realized as follows:
2-1.UPG=(UUPG,EUPG) Representing a user versus network diagram, UUPGRepresenting a set of nodes, EUPGRepresenting a set of relationships between nodes; pairing candidate users piNode as UPG and is recorded as u'i,u'i∈UUPG;
2-2. supposeAndtwo nodes in the UPG have an edge between the two nodes if the following relationship exists between the two nodes;
3. The method for identifying the user identity across the social network based on the joint embedded learning model according to claim 2, wherein the step 4 is implemented as follows:
4-1. the left branch of the joint embedding learning model is a multi-layer perceptual model, and the loss function of the branch is defined as:
whereinRepresenting a tagged node in UPG, p (y)i|di) Represents given diUnder the condition of yiIs calculated as follows:
the right branch adopts a negative sampling mechanism to define a loss function as follows:
wherein δ (-) represents sigmoid function, n ═ UUPGL, u 'represents node u'iThe context node of (a) is selected,representing randomly selected t negative samples;
4-2, calculating parameters by adopting a mini-batch gradient descent method; setting the value of the left branch's batch b1Set to 200, the value of batch of the right branch b2Is 200; slave UUPGMiddle random sampling b1The labeled nodes, and calculate L(L)According to the gradient value of the parameter WmAnd bm,Andupdating;
4-3 from UUPGMiddle random sampling b2A node and calculateAccording to the gradient value of the parameter WmAnd bm,Andupdating of (1);
4-4, returning to the step 4-2, and iterating for 100 times;
4-5 input node u 'to be predicted in UPG'jCalculating according to the step 3-2 to obtain the attribute vectors of the two users in the node, and splicing the attribute vectors to obtain a vector djInputting the data into a joint embedding learning model, and calculating to obtain a node u 'to be predicted'jThe label of (1).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110718740.9A CN113434782B (en) | 2021-06-28 | 2021-06-28 | Cross-social network user identity recognition method based on joint embedded learning model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110718740.9A CN113434782B (en) | 2021-06-28 | 2021-06-28 | Cross-social network user identity recognition method based on joint embedded learning model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113434782A CN113434782A (en) | 2021-09-24 |
CN113434782B true CN113434782B (en) | 2022-03-01 |
Family
ID=77755095
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110718740.9A Active CN113434782B (en) | 2021-06-28 | 2021-06-28 | Cross-social network user identity recognition method based on joint embedded learning model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113434782B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114581254A (en) * | 2022-02-28 | 2022-06-03 | 中国人民解放军战略支援部队信息工程大学 | Multi-feature fusion cross-social network user identity association method |
CN114663245A (en) * | 2022-03-16 | 2022-06-24 | 南京信息工程大学 | Cross-social network identity matching method |
CN114817757B (en) * | 2022-04-02 | 2023-07-21 | 广州大学 | Cross-social network virtual identity association method based on graph rolling network |
CN116776193B (en) * | 2023-05-17 | 2024-08-06 | 广州大学 | Method and device for associating virtual identities across social networks based on attention mechanism |
CN118585833A (en) * | 2024-08-06 | 2024-09-03 | 中国人民解放军国防科技大学 | Network asset owner identification method and device based on large language model and contrast learning |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140108152A1 (en) * | 2012-10-12 | 2014-04-17 | Google Inc. | Managing Social Network Relationships Between A Commercial Entity and One or More Users |
CN109753602B (en) * | 2018-12-04 | 2020-12-25 | 中国科学院计算技术研究所 | Cross-social network user identity recognition method and system based on machine learning |
CN110347932B (en) * | 2019-06-04 | 2021-11-23 | 中国科学院信息工程研究所 | Cross-network user alignment method based on deep learning |
-
2021
- 2021-06-28 CN CN202110718740.9A patent/CN113434782B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN113434782A (en) | 2021-09-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113434782B (en) | Cross-social network user identity recognition method based on joint embedded learning model | |
CN110609891B (en) | Visual dialog generation method based on context awareness graph neural network | |
CN110097125B (en) | Cross-network account association method based on embedded representation | |
WO2023000574A1 (en) | Model training method, apparatus and device, and readable storage medium | |
CN111753024B (en) | Multi-source heterogeneous data entity alignment method oriented to public safety field | |
CN108268643A (en) | A kind of Deep Semantics matching entities link method based on more granularity LSTM networks | |
CN109753602B (en) | Cross-social network user identity recognition method and system based on machine learning | |
CN104731962A (en) | Method and system for friend recommendation based on similar associations in social network | |
CN109857871B (en) | User relationship discovery method based on social network mass contextual data | |
CN107330461A (en) | Collaborative filtering recommending method based on emotion with trust | |
WO2018112696A1 (en) | Content pushing method and content pushing system | |
CN112084373B (en) | Graph embedding-based multi-source heterogeneous network user alignment method | |
CN113628059B (en) | Associated user identification method and device based on multi-layer diagram attention network | |
CN112988917B (en) | Entity alignment method based on multiple entity contexts | |
CN107391542A (en) | A kind of open source software community expert recommendation method based on document knowledge collection of illustrative plates | |
CN113095948B (en) | Multi-source heterogeneous network user alignment method based on graph neural network | |
CN110472226A (en) | A kind of network security situation prediction method and device of knowledge based map | |
CN110569355B (en) | Viewpoint target extraction and target emotion classification combined method and system based on word blocks | |
CN113806630A (en) | Attention-based multi-view feature fusion cross-domain recommendation method and device | |
CN112884045B (en) | Classification method of random edge deletion embedded model based on multiple visual angles | |
CN109960755B (en) | User privacy protection method based on dynamic iteration fast gradient | |
CN109492027B (en) | Cross-community potential character relation analysis method based on weak credible data | |
CN110136017A (en) | A kind of group's discovery method based on data enhancing and nonnegative matrix sparse decomposition | |
CN113283243B (en) | Entity and relationship combined extraction method | |
CN112749566B (en) | Semantic matching method and device for English writing assistance |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230726 Address after: Room 801, 85 Kefeng Road, Huangpu District, Guangzhou City, Guangdong Province Patentee after: Guangzhou Dayu Chuangfu Technology Co.,Ltd. Address before: Hangzhou City, Zhejiang province 310036 Xiasha Higher Education Park forest Street No. 16 Patentee before: HANGZHOU NORMAL UNIVERSITY QIANJIANG College |