CN113628059A - Associated user identification method and device based on multilayer graph attention network - Google Patents

Associated user identification method and device based on multilayer graph attention network Download PDF

Info

Publication number
CN113628059A
CN113628059A CN202110794741.1A CN202110794741A CN113628059A CN 113628059 A CN113628059 A CN 113628059A CN 202110794741 A CN202110794741 A CN 202110794741A CN 113628059 A CN113628059 A CN 113628059A
Authority
CN
China
Prior art keywords
network
attention
matrix
embedded
node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110794741.1A
Other languages
Chinese (zh)
Other versions
CN113628059B (en
Inventor
胡瑞敏
肖益林
吴俊杭
甄宇
任灵飞
胡文怡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN202110794741.1A priority Critical patent/CN113628059B/en
Publication of CN113628059A publication Critical patent/CN113628059A/en
Application granted granted Critical
Publication of CN113628059B publication Critical patent/CN113628059B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention provides a method and a device for identifying associated users based on a multilayer graph attention network, wherein the method firstly simulates structure noise and attribute noise through a random permutation matrix so as to improve the adaptivity of a model; a correlation user identification model is established, and the importance of each neighbor node to the target node is calculated based on an attention mechanism; calculating to obtain an embedded vector of each node by using the normalized attention coefficient and the weight matrix, and fusing the embedded vectors of all target nodes into an embedded matrix; fusing multilayer embedded matrixes obtained by the multilayer graph attention network according to the weight, and obtaining a final embedded matrix based on a greedy strategy; and performing correlation identification on the final embedded matrix to obtain a final result. Because the structure noise and the attribute noise are simulated in advance, the self-adaptability of the model is greatly improved; the embedding result of the multi-layer graph attention network is effectively extracted, so that the identification accuracy of the associated user is improved, and the error is effectively reduced.

Description

Associated user identification method and device based on multilayer graph attention network
Technical Field
The invention relates to the technical field of associated user identification, in particular to an associated user identification method and device based on a multilayer graph attention network.
Background
The associated user identification is a technology for detecting associated users in a plurality of social networks. Associated users refer to users in different social networks, but they belong to the same natural person in the real world. The associated user identification is often applied to the problems of recommendation systems, criminal behavior prediction, personalized services and 'cold start', so that the associated user identification becomes a hot spot of the current social network research; however, node information in the social network is various, and the network structure is large and complex, so that feature extraction of the social network becomes a difficult point. Most of the related user identification algorithms at present are based on a deep Random Walk-based Social network feature extraction method proposed by Perozzi, Al-Rfou and Skiona in the literature (deep Walk: one Learning of Social responses [ C ]. Proceedings of the 20th ACM SIGKDD interactive conference on Knowledge and data mining-KDD'14,2014:701 and 710.), wherein Perozzi represents a vertex sequence as a multi-dimensional vector by using a natural language processing tool (word2vec) on the basis of the vertex sequence extracted by the Random Walk algorithm (Random Walk), and finally obtains a better feature extraction effect. However, the feature extraction method based on random walk cannot directly process the non-european structure of the social network, but converts the non-european structure into a vertex sequence to perform feature extraction, which results in errors.
Just because the random walk algorithm causes great errors and noises in feature extraction, the method of graph neural network, which can directly extract features of non-european structures (e.g., social networks), is becoming a hot spot. How to combine the graph neural network with the associated user identification, the social network characteristics are efficiently extracted, and the associated user identification is accurately performed, which becomes a difficult problem. There are also related studies aimed at solving this problem, such as Chen et al in the literature (h.chen, h.yin, x.sun, t.chen, b.gabrys, and k.music, "Multi-level graph relational network for cross-platform and link prediction," in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,2020, pp.1503-Data 1.) propose to simultaneously use simple graph convolutional neural networks and hyper graph convolutional neural networks for feature extraction of social networks, and un et al in the literature (h.t.trunk, t.van video, n.t.tam, h.yin, m.m.idlich, n.q.v.and, "tung mapping network" propose to stabilize the recognition of neural networks by IEEE network-link recognition method, IEEE-based on the product of interest-Data Mining,2020, pp.1503-Data), and "the correlation of multiple-network recognition" 14-Data Mining "2020, and extension network recognition by IEEE-network-link recognition method, and the stability-node recognition method of IEEE, increase by IEEE-map recognition. In the process of implementing the present invention, the inventors of the present application find that the above algorithms still have some problems, and define the weights of all neighbor nodes as 1, neglecting the difference of the weights of different neighbor nodes to the target node, thereby causing the feature extraction process to be inaccurate.
By combining the above analysis, although various improved associated user identification methods achieve certain detection effects, the problem of low identification accuracy still needs to be improved.
Disclosure of Invention
The invention provides a method and a device for identifying associated users based on a multilayer graph attention network, which are used for solving or at least partially solving the technical problem of low identification precision in the prior art.
In order to solve the above technical problem, a first aspect of the present invention provides a method for identifying an associated user based on a multi-layer graph attention network, including:
s1: acquiring two social networks, taking one of the social networks as a source social network and the other social network as a target social network, and performing data reinforcement learning on the two social networks, wherein the social networks comprise three kinds of information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
s2: constructing a correlation user identification model based on a multilayer graph attention network, wherein the correlation user identification model based on the multilayer graph attention network comprises the multilayer graph attention network, an embedded fusion module and an output module, and the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
s3: training a correlation user identification model based on the multilayer diagram attention network by using the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedded matrix, and obtaining a model corresponding to the optimal embedded matrix as the trained correlation user identification model based on the multilayer diagram attention network;
s4: and performing relevant user identification on the input social network by using a trained relevant user identification model based on the multilayer graph attention network.
In one embodiment, the data-enhanced learning of the two social networks in step S1 includes: the data enhancement is realized by simulating structural noise and attribute noise through a random permutation matrix.
In one embodiment, the calculation process of the node relation extracting module in step S2 includes:
based on an attention mechanism, calculating the importance of each neighbor node to the target node, and taking the importance as an attention coefficient, wherein the calculation formula is as follows:
euv=a(WFu,WFv)
wherein ,euvDenotes an attention coefficient calculated with a node u as a target node, W denotes a weight matrix, FuFeature vector, F, representing node uvA feature vector representing node v, a () representing the attention mechanism;
the attention coefficient is normalized by a softmax function:
Figure BDA0003162230740000031
wherein the expression represents that attention coefficients are normalized by a softmax function, and alpha'uvDenotes the normalized attention coefficient, MuThe neighborhood representing node u, i.e. all neighbor nodes, represents an exponential function with e as base, exp (e)uv) Is represented by e as base, euvIs a function of the index of the light,
Figure BDA0003162230740000032
means to perform the operation on all the neighbor nodes and sum;
activating the normalized attention coefficient by using a LeakyReLU activation function:
Figure BDA0003162230740000033
wherein the above formula represents activating the normalized attention coefficient by using LeakyReLU function as an activation function, alphauvIndicating the attention coefficient after activation, the LeakyReLU indicates the activation function,
Figure BDA0003162230740000041
weight vector representing attention mechanism parameterization [ | | ·]The concatenation of the vectors is represented and,
Figure BDA0003162230740000042
a formulation representing the attention mechanism a (·).
In one embodiment, the calculation process of the feature fusion module in step S2 includes:
calculating to obtain an embedded vector F 'of each node by using the activated attention coefficient and the weight matrix'u
Figure BDA0003162230740000043
Where σ denotes the activation function, αuvDenotes the attention coefficient after activation, FvA feature vector representing node v;
fusing the embedded vectors of all target nodes into an embedded matrix:
Figure BDA0003162230740000044
wherein the multi-level graph attention network comprises a plurality of graph attention networks, each graph attention network obtaining a corresponding embedded matrix, H(l)Representing the l-th layer graph attention network derived embedded matrix,
Figure BDA0003162230740000045
the embedded vector of node 1 obtained by the l-th layer graph attention network is shown.
In one embodiment, the calculation process of embedding the fusion module in step S2 includes;
obtaining the association identification score of each graph attention layer through a hierarchical alignment matrix:
Figure BDA0003162230740000046
fusing the layered alignment matrix according to a preset weight, and obtaining a final associated user identification matrix by adopting a greedy strategy:
Figure BDA0003162230740000047
Figure BDA0003162230740000048
wherein ,
Figure BDA0003162230740000051
the layer l representing the source social network is aware of the network derived embedded matrix,
Figure BDA0003162230740000052
an embedding matrix, S, representing the Lth layer of the target social network, derived by the attention network(l)A hierarchical alignment matrix representing the graph attention network of the ith layer, an association recognition score, theta, representing the graph attention layer(l)Representing the weight of the l-th level hierarchical alignment matrix.
In one embodiment, the loss function in step S3 is:
Figure BDA0003162230740000053
wherein u and upRepresenting an original network G and a reinforcement learned network GpOf the same corresponding node, H(l)(u) node embedding, H, at level l of node u(l)(up) Representing a node upAnd embedding nodes of the l-th layer.
The optimal embedding matrix obtained by minimizing the objective function is:
Figure BDA0003162230740000054
wherein ,
Figure BDA0003162230740000055
representing a regularized Laplacian matrix, | | · |. non-woven phosphorFRepresents the Frobenius norm,
Figure BDA0003162230740000056
the layer I of the representation target social network is aware of the embedded matrix derived by the force network.
Based on the same inventive concept, a second aspect of the present invention provides an associated user identification device based on a multi-layer graph attention network, comprising:
the data enhancement module is used for acquiring two social networks, taking one of the two social networks as a source social network and the other as a target social network, and performing data enhancement learning on the two social networks, wherein the social networks comprise three information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
the model building module is used for building an associated user identification model based on the multilayer graph attention network, wherein the associated user identification model based on the multilayer graph attention network comprises the following steps: the system comprises a multilayer graph attention network, an embedded fusion module and an output module, wherein the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
the model training module is used for training the associated user recognition model based on the multilayer diagram attention network by taking the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedding matrix, obtaining a model corresponding to the optimal embedding matrix and taking the model as the trained associated user recognition model based on the multilayer diagram attention network;
and the associated user identification module is used for performing associated user identification on the input social network by utilizing the trained associated user identification model based on the multilayer graph attention network.
One or more technical solutions in the embodiments of the present application have at least one or more of the following technical effects:
the invention provides a multi-layer graph attention network-based associated user identification method, which constructs a multi-layer graph attention network-based associated user identification model, comprises a multi-layer graph attention network and an embedded fusion module, wherein the multi-layer graph attention network in the model is used for calculating the importance of each neighbor node to a target node based on an attention mechanism, calculating an embedded vector of each node by using an activated attention coefficient and a weight matrix, and fusing the embedded vectors of all the target nodes into an embedded matrix, so that the weights of different neighbor nodes to the target node can be considered in the process of extracting social network characteristics, and the accuracy of characteristic extraction is greatly improved; and further fusing the embedded matrixes of the source social network and the target social network according to preset weights through an embedded fusion module to obtain an associated user identification matrix representing the friend relationship between users, namely weighting and fusing the multi-layer graph embedding in the associated identification, and using the fused embedded matrix for associated identification, thereby greatly improving the identification accuracy.
Furthermore, the structural noise and the attribute noise are simulated by adopting data reinforcement learning, so that the adaptability of the associated user identification model based on the multilayer graph attention network is greatly improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is an overall flowchart of a method for identifying associated users based on a multi-layer graph attention network in the implementation of the present invention.
Detailed Description
The invention aims to provide a method for identifying associated users based on a multilayer graph attention network, which solves the problems of low identification precision, high error rate and low self-adaptability of the existing similar algorithm
The main concept of the invention is as follows:
firstly, performing data reinforcement learning on two social networks, then constructing a multi-layer graph attention network-based associated user identification model, then training the identification model to obtain a trained model, and finally identifying associated users by using the identification model; calculating to obtain an embedded vector of each node by using the normalized and activated attention coefficient and the weight matrix, and fusing the embedded vectors of all target nodes into an embedded matrix; fusing the embedded matrixes obtained by the multilayer graph attention network according to the weight, and obtaining a final embedded matrix based on a greedy strategy; and performing correlation identification on the final embedded matrix to obtain a final result.
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
The embodiment of the invention provides a method for identifying associated users based on a multilayer graph attention network, which comprises the following steps:
s1: acquiring two social networks, taking one of the social networks as a source social network and the other social network as a target social network, and performing data reinforcement learning on the two social networks, wherein the social networks comprise three kinds of information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
s2: constructing a correlation user identification model based on a multilayer graph attention network, wherein the correlation user identification model based on the multilayer graph attention network comprises the multilayer graph attention network, an embedded fusion module and an output module, and the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
s3: training a correlation user identification model based on the multilayer diagram attention network by using the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedded matrix, and obtaining a model corresponding to the optimal embedded matrix as the trained correlation user identification model based on the multilayer diagram attention network;
s4: and performing relevant user identification on the input social network by using a trained relevant user identification model based on the multilayer graph attention network.
In particular, since the associated user identification is to identify an associated user between two social networks, one of the social networks is typically defined as a source social network and the other is defined as a target social network. In the two social networks obtained in step S1, either one of the two social networks may be used as the source social network, and the other social network may be used as the target social network. The attribute information of the user includes the age, sex, place of residence, and the like of the user. The connecting edges between the nodes represent the friend relationship between the users, and if the two users are friends, the connecting edges exist. Structural noise refers to errors in social network structural information, and attribute noise refers to the ghosting that exists in social network attribute information. The purpose of this step is to simulate the structural noise and the attribute noise existing in the real social network, so as to improve the adaptability of the model to different noises.
In the step S2, when the multi-layer graph attention network performs feature extraction, the importance of each neighbor node to the target node is calculated based on the attention mechanism, where the target node refers to the node that is calculating the attention coefficient, and the neighbor nodes are all nodes that have a friend relationship with the target node. For example, for a source social network, if 100 nodes are included, node 1 may be first used as a target node, and the importance of the neighbor node of node 1 to the target node is calculated by using an attention mechanism, and then similar calculation is performed by using other nodes as target nodes in turn. For the target social network, the calculation process is similar, and the description is omitted here.
The weight matrix represents the functional relationship between the input and the output, and is continuously updated in the process of deep learning. The embedded vector represents a vector representation obtained by fusing the attention coefficient, the network structure and other embedded information on the basis of the feature vector, and the embedded vector can more accurately reflect the information of the node. And merging the embedded vectors of each node in the social network, and obtaining a matrix called an embedded matrix.
The multi-layer graph attention network comprises a plurality of attention networks, wherein each layer graph attention network can obtain an embedded matrix, the first layer graph attention network corresponds to the embedded matrix obtained through the first-order neighbor friend relationship, the second layer graph attention network corresponds to the embedded matrix obtained through the second-order neighbor friend relationship, and the like (wherein the output of the previous layer graph attention network is used as the input of the next layer graph attention network). The first-order neighbors are nodes which are directly friends with the target node, the second-order neighbors are nodes which are friends with the target node, and the like.
In one embodiment, the data-enhanced learning of the two social networks in step S1 includes: the data enhancement is realized by simulating structural noise and attribute noise through a random permutation matrix.
In a specific implementation process, data enhancement can be realized through the following steps:
step 1.1, simulating structural noise: randomly deleting connecting edges in the source social network by using the probability Ps, and simulating the structure noise;
step 1.2, simulating attribute noise: for the binary attribute, randomly moving the position of a non-zero item by a probability Pa; for the real-value attribute Fij, randomly sampling in the range of [0, Pa x Fij ] to replace the original attribute value;
wherein, the binary attribute only exists in two forms of 0 or 1, and the non-zero item is 1;
step 1.3, noise addition is carried out through the random permutation matrix, and an adjacent matrix A of the enhanced network is obtainedp
Ap=PAPT
Wherein, the random permutation matrix is P, Pij ═ 1 indicates that the node i of the enhanced network corresponds to the node j in the original network, and a represents the adjacency matrix of the original network.
The adaptability of the model can be improved by simulating the structure noise and the attribute noise through the random permutation matrix.
In one embodiment, the calculation process of the node relation extracting module in step S2 includes:
based on an attention mechanism, calculating the importance of each neighbor node to the target node, and taking the importance as an attention coefficient, wherein the calculation formula is as follows:
euv=a(WFu,WFv)
wherein ,euvDenotes an attention coefficient calculated with a node u as a target node, W denotes a weight matrix, FuFeature vector, F, representing node uvA feature vector representing node v, a () representing the attention mechanism;
the attention coefficient is normalized by a softmax function:
Figure BDA0003162230740000101
wherein the expression represents that attention coefficients are normalized by a softmax function, and alpha'uvDenotes the normalized attention coefficient, MuThe neighborhood representing node u, i.e. all neighbor nodes, represents an exponential function with e as base, exp (e)uv) Is represented by e as base, euvIs a function of the index of the light,
Figure BDA0003162230740000102
means to perform the operation on all the neighbor nodes and sum;
activating the normalized attention coefficient by using a LeakyReLU activation function:
Figure BDA0003162230740000103
wherein the above formula represents activating the normalized attention coefficient by using LeakyReLU function as an activation function, alphauvIndicating the attention coefficient after activation, the LeakyReLU indicates the activation function,
Figure BDA0003162230740000104
weight vector representing attention mechanism parameterization [ | | ·]The concatenation of the vectors is represented and,
Figure BDA0003162230740000105
a formulation representing the attention mechanism a (·).
Specifically, the weight matrix is a parameter of the associated user identification model, and is continuously updated in the model training process. The eakyReLU activation function is used to improve the non-linearity of the model.
In one embodiment, the calculation process of the feature fusion module in step S2 includes:
calculating to obtain an embedded vector F 'of each node by using the activated attention coefficient and the weight matrix'u
Figure BDA0003162230740000106
Where σ denotes the activation function, αuvDenotes the attention coefficient after activation, FvA feature vector representing node v;
fusing the embedded vectors of all target nodes into an embedded matrix:
Figure BDA0003162230740000111
wherein the multi-level graph attention network comprises a plurality of graph attention networks, each graph attention network obtaining a corresponding embedded matrix, H(l)Representing the l-th layer graph attention network derived embedded matrix,
Figure BDA0003162230740000112
the embedded vector of node 1 obtained by the l-th layer graph attention network is shown.
In one embodiment, the calculation process of embedding the fusion module in step S2 includes;
obtaining the association identification score of each graph attention layer through a hierarchical alignment matrix:
Figure BDA0003162230740000113
fusing the layered alignment matrix according to a preset weight, and obtaining a final associated user identification matrix by adopting a greedy strategy:
Figure BDA0003162230740000114
Figure BDA0003162230740000115
wherein ,
Figure BDA0003162230740000116
the layer l representing the source social network is aware of the network derived embedded matrix,
Figure BDA0003162230740000117
an embedding matrix, S, representing the Lth layer of the target social network, derived by the attention network(l)A hierarchical alignment matrix representing the attention network of the ith layer, storing the associated identification score, θ, of the attention layer of the graph(l)Representing the weight of the l-th level hierarchical alignment matrix.
Specifically, the hierarchical alignment matrix is an associated user identification matrix obtained by using only an embedded matrix of a certain layer of graph attention network, and the final associated user identification matrix is obtained by weighting and fusing the hierarchical alignment matrix. For example, if user u is associatedUser prediction, where v is the associated user of u. Through the model of the invention, the association scores between all other users and u can be obtained, and if the association score of u and v is ranked at the top q (which can be set according to practical situations, such as 1, 5, 10), v is considered as the associated user of u. The preset weight is a proportion set according to actual conditions, theta(l)May be 50%, 60%, 70%, etc.
In the method disclosed by the invention, a two-layer graph attention network is taken as an example, a source social network is input into a correlation user identification model of a multi-layer graph attention network, an embedded matrix of the source social network is obtained through a first-layer graph attention network, then the embedded matrix obtained through the first-layer graph attention network is taken as input to continue the calculation of an attention coefficient and an embedded vector, and the embedded matrix of a second-layer graph attention network is obtained by fusing the embedded vector. And then, in an embedded fusion module, setting a preset weight for an embedded matrix of each layer of graph attention network for fusion to obtain a relevant user identification matrix, wherein in the calculation process, the previous layer of graph attention network is used as the input of the next layer of graph attention network, and the influence of each layer of graph attention network is considered in the embedded fusion, so that the difference of the importance of different neighbor nodes on the target node is fully considered, the accuracy of feature extraction is improved, and the identification accuracy of the model is further improved.
In one embodiment, the loss function in step S3 is:
Figure BDA0003162230740000121
wherein u and upRepresenting an original network G and a reinforcement learned network GpOf the same corresponding node, H(l)(u) node embedding, H, at level l of node u(l)(up) Representing a node upAnd embedding nodes of the l-th layer.
The optimal embedding matrix obtained by minimizing the objective function is:
Figure BDA0003162230740000122
wherein ,
Figure BDA0003162230740000123
representing a regularized Laplacian matrix, | | · |. non-woven phosphorFRepresents the Frobenius norm,
Figure BDA0003162230740000124
the layer I of the representation target social network is aware of the embedded matrix derived by the force network.
Specifically, the self-adaptability of the model is improved by minimizing a loss function, the influence of structural noise and attribute noise on the model is reduced, and the optimal embedded matrix is the optimal parameter of the model.
Please refer to fig. 1, which is a flowchart of an associated user identification method based on a multi-layer graph attention network according to an embodiment. And respectively carrying out noise simulation, attention coefficient calculation, embedded matrix calculation and embedded matrix fusion operation on the source social network and the target social network through the associated user identification model. Then obtaining the correlation identification score of each graph attention layer through layering of the matrix, fusing the layered alignment matrixes according to the weight, and obtaining a final correlation user identification matrix by adopting a greedy strategy; and finally, performing associated user identification by using the associated user identification matrix, and outputting a detection result.
And calculating key indexes of the final associated user identification matrix to obtain a final result:
Figure BDA0003162230740000131
where Precision @ q represents the probability of the standard answer appearing in top-q candidates,
Figure BDA0003162230740000132
representing a pair of associated users in a real answer,
Figure BDA0003162230740000133
representing the matching scores of the pair of associated users in the model,
Figure BDA0003162230740000134
representing a user
Figure BDA0003162230740000135
The set of users that match the top q ranks in the model, # { truean formallinks } represents all the associated user pairs.
The applicant runs on a Computer of Intel (R) core (TM) i7-7700K CPU @4.20GHz, 2080Ti GPU, and uses the disclosed Data set Douban of flex-online and documents (D.Koutra, H.Tong, and D.lube, "Big-alignment: Fast binary alignment," in2013IEEE 13th International Conference Data mining.IEEE,2013, pp.389-398.) (F.Zhou, L.Liu, K.Zhang, G.Tracjjjkvski, J.Wu, and D.Zhang, "deep: A deep approximation approach for use alignment," in IEEE FOCOM 8-model correlation, Sa 2018, Sa.2018, and D.8. echo of the published correlation, "I.8. for use alignment," I, J.J.Wu, and D.8. for comparison, "I.8 J.M.8, J.8. for comparison," Big-alignment, "I.8. for use of the same, M.8, M.J.J., Crime behavior prediction, personalized service and the like.
Compared with the prior art, the invention has the beneficial effects that:
according to the method, the attention network of the multilayer graph is utilized, the weights of different neighbor nodes to the target node are effectively considered in the process of extracting the social network characteristics, and the accuracy rate of characteristic extraction is greatly improved; before feature extraction, the method adopts data reinforcement learning to simulate structure noise and attribute noise, thereby greatly improving the adaptivity of the model; in the correlation identification, the invention carries out weighting and fusion on the embedding of the multilayer graph, and uses the fused embedding matrix to carry out correlation identification, thereby greatly improving the identification accuracy.
Example two
Based on the same inventive concept, the present embodiment provides an associated user identification device based on a multi-layer graph attention network, including:
the data enhancement module is used for acquiring two social networks, taking one of the two social networks as a source social network and the other as a target social network, and performing data enhancement learning on the two social networks, wherein the social networks comprise three information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
the model building module is used for building an associated user identification model based on the multilayer graph attention network, wherein the associated user identification model based on the multilayer graph attention network comprises the following steps: the system comprises a multilayer graph attention network, an embedded fusion module and an output module, wherein the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
the model training module is used for training the associated user recognition model based on the multilayer diagram attention network by taking the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedding matrix, obtaining a model corresponding to the optimal embedding matrix and taking the model as the trained associated user recognition model based on the multilayer diagram attention network;
and the associated user identification module is used for performing associated user identification on the input social network by utilizing the trained associated user identification model based on the multilayer graph attention network.
Since the apparatus described in the second embodiment of the present invention is an apparatus used for implementing the method for identifying associated users based on a multi-layer graph attention network in the first embodiment of the present invention, a person skilled in the art can understand the specific structure and the deformation of the apparatus based on the method described in the first embodiment of the present invention, and thus the details are not described herein again. All the devices adopted in the method of the first embodiment of the present invention belong to the protection scope of the present invention.
The above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (7)

1. A method for identifying associated users based on a multi-layer graph attention network is characterized by comprising the following steps:
s1: acquiring two social networks, taking one of the social networks as a source social network and the other social network as a target social network, and performing data reinforcement learning on the two social networks, wherein the social networks comprise three kinds of information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
s2: constructing a correlation user identification model based on a multilayer graph attention network, wherein the correlation user identification model based on the multilayer graph attention network comprises the multilayer graph attention network, an embedded fusion module and an output module, and the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
s3: training a correlation user identification model based on the multilayer diagram attention network by using the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedded matrix, and obtaining a model corresponding to the optimal embedded matrix as the trained correlation user identification model based on the multilayer diagram attention network;
s4: and performing relevant user identification on the input social network by using a trained relevant user identification model based on the multilayer graph attention network.
2. The method for identifying associated users according to claim 1, wherein the step of performing data-enhanced learning on two social networks in step S1 comprises: the data enhancement is realized by simulating structural noise and attribute noise through a random permutation matrix.
3. The method for identifying associated users according to claim 1, wherein the calculation process of the node relation extracting module in step S2 includes:
based on an attention mechanism, calculating the importance of each neighbor node to the target node, and taking the importance as an attention coefficient, wherein the calculation formula is as follows:
euv=a(WFu,WFv)
wherein ,euvDenotes an attention coefficient calculated with a node u as a target node, W denotes a weight matrix, FuFeature vector, F, representing node uvA feature vector representing node v, a () representing the attention mechanism;
the attention coefficient is normalized by a softmax function:
Figure FDA0003162230730000021
wherein the expression represents that attention coefficients are normalized by a softmax function, and alpha'uvDenotes the normalized attention coefficient, MuThe neighborhood representing node u, i.e. all neighbor nodes, represents an exponential function with e as base, exp (e)uv) Is represented by e as base, euvIs a function of the index of the light,
Figure FDA0003162230730000022
means to perform the operation on all the neighbor nodes and sum;
activating the normalized attention coefficient by using a LeakyReLU activation function:
Figure FDA0003162230730000023
wherein the above formula represents activating the normalized attention coefficient by using LeakyReLU function as an activation function, alphauvIndicating the attention coefficient after activation, the LeakyReLU indicates the activation function,
Figure FDA0003162230730000024
weight vector representing attention mechanism parameterization [ | | ·]The concatenation of the vectors is represented and,
Figure FDA0003162230730000025
a formulation representing the attention mechanism a (·).
4. The method for identifying correlated users according to claim 1, wherein the calculation process of the feature fusion module in step S2 includes:
calculating to obtain an embedded vector F 'of each node by using the activated attention coefficient and the weight matrix'u
Figure FDA0003162230730000026
Where σ denotes the activation function, αuvDenotes the attention coefficient after activation, FvA feature vector representing node v;
fusing the embedded vectors of all target nodes into an embedded matrix:
Figure FDA0003162230730000031
wherein the multi-level graph attention network comprises a plurality of graph attention networks, each graph attention network obtaining a corresponding embedded matrix, H(l)Representing the embedding matrix obtained by the l-th layer graph attention network, F1 lThe embedded vector of node 1 obtained by the l-th layer graph attention network is shown.
5. The method for identifying associated users according to claim 1, wherein the step S2 of embedding calculation of the fusion module includes;
obtaining the association identification score of each graph attention layer through a hierarchical alignment matrix:
Figure FDA0003162230730000032
fusing the layered alignment matrix according to a preset weight, and obtaining a final associated user identification matrix by adopting a greedy strategy:
Figure FDA0003162230730000033
Figure FDA0003162230730000034
wherein ,
Figure FDA0003162230730000035
the layer l representing the source social network is aware of the network derived embedded matrix,
Figure FDA0003162230730000036
an embedding matrix, S, representing the Lth layer of the target social network, derived by the attention network(l)A hierarchical alignment matrix representing the graph attention network of the ith layer, an association recognition score, theta, representing the graph attention layer(l)And representing the weight of the l-th layer hierarchical alignment matrix, and S represents the final associated user identification matrix.
6. The associated user identification method of claim 1, wherein the loss function in step S3 is:
Figure FDA0003162230730000037
wherein u and upRepresenting an original network G and a reinforcement learned network GpOf the same corresponding node, H(l)(u) node embedding, H, at level l of node u(l)(up) Representing a node upAnd embedding nodes of the l-th layer.
The optimal embedding matrix obtained by minimizing the objective function is:
Figure FDA0003162230730000041
wherein ,
Figure FDA0003162230730000042
representing a regularized Laplacian matrix, | | · |. non-woven phosphorFDenotes the Frobenius norm, Ht (l)The layer I of the representation target social network is aware of the embedded matrix derived by the force network.
7. An associated user identification device based on a multi-layer graph attention network, comprising:
the data enhancement module is used for acquiring two social networks, taking one of the two social networks as a source social network and the other as a target social network, and performing data enhancement learning on the two social networks, wherein the social networks comprise three information, namely nodes, connecting edges among the nodes and feature vectors of the nodes, the nodes represent users existing in the social networks, the connecting edges among the nodes represent friend relationships among the users, and the feature vectors of the nodes represent vector representations obtained by encoding attribute information of the users through a thermal encoding technology;
the model building module is used for building an associated user identification model based on the multilayer graph attention network, wherein the associated user identification model based on the multilayer graph attention network comprises the following steps: the system comprises a multilayer graph attention network, an embedded fusion module and an output module, wherein the multilayer graph attention network comprises a node relation extraction module and a feature fusion module; the node relation extraction module is used for calculating the importance of each neighbor node to the target node based on an attention mechanism, namely an attention coefficient, and then carrying out normalization and activation processing to obtain an activated attention coefficient; the feature fusion module is used for calculating to obtain an embedded vector of each node by using the activated attention coefficient and the weight matrix, fusing the embedded vectors of all target nodes into an embedded matrix, and expressing the embedded vector by fusing the attention coefficient and the network structure information on the basis of the feature vector; the embedded fusion module is used for fusing embedded matrixes of the source social network and the target social network according to preset weights and obtaining an associated user identification matrix representing the friend relationship between users based on a greedy strategy; the output module is used for obtaining an identification result according to the associated user identification matrix;
the model training module is used for training the associated user recognition model based on the multilayer diagram attention network by taking the network data subjected to data enhancement as training data, minimizing a loss function to obtain an optimal embedding matrix, obtaining a model corresponding to the optimal embedding matrix and taking the model as the trained associated user recognition model based on the multilayer diagram attention network;
and the associated user identification module is used for performing associated user identification on the input social network by utilizing the trained associated user identification model based on the multilayer graph attention network.
CN202110794741.1A 2021-07-14 2021-07-14 Associated user identification method and device based on multi-layer diagram attention network Active CN113628059B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110794741.1A CN113628059B (en) 2021-07-14 2021-07-14 Associated user identification method and device based on multi-layer diagram attention network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110794741.1A CN113628059B (en) 2021-07-14 2021-07-14 Associated user identification method and device based on multi-layer diagram attention network

Publications (2)

Publication Number Publication Date
CN113628059A true CN113628059A (en) 2021-11-09
CN113628059B CN113628059B (en) 2023-09-15

Family

ID=78379739

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110794741.1A Active CN113628059B (en) 2021-07-14 2021-07-14 Associated user identification method and device based on multi-layer diagram attention network

Country Status (1)

Country Link
CN (1) CN113628059B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114067349A (en) * 2022-01-12 2022-02-18 阿里巴巴达摩院(杭州)科技有限公司 Target object processing method and device
CN114386524A (en) * 2022-01-17 2022-04-22 深圳市城图科技有限公司 Power equipment identification method for dynamic self-adaptive graph layering simulation learning
CN116248522A (en) * 2023-05-11 2023-06-09 中国电信股份有限公司四川分公司 Service willingness and deterministic network slice association method
CN116541273A (en) * 2023-04-13 2023-08-04 中国兵器工业信息中心 Binary code similarity detection method and system based on graph attention

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105095475A (en) * 2015-08-12 2015-11-25 武汉大学 Incomplete attribute tagged pedestrian re-identification method and system based on two-level fusion
US9317567B1 (en) * 2011-02-16 2016-04-19 Hrl Laboratories, Llc System and method of computational social network development environment for human intelligence
CN107194438A (en) * 2017-05-24 2017-09-22 武汉大学 A kind of depth characteristic method for expressing based on multiple stack own coding
CN110837602A (en) * 2019-11-05 2020-02-25 重庆邮电大学 User recommendation method based on representation learning and multi-mode convolutional neural network
US20200356858A1 (en) * 2019-05-10 2020-11-12 Royal Bank Of Canada System and method for machine learning architecture with privacy-preserving node embeddings
CN111931903A (en) * 2020-07-09 2020-11-13 北京邮电大学 Network alignment method based on double-layer graph attention neural network
CN112035745A (en) * 2020-09-01 2020-12-04 重庆大学 Recommendation algorithm based on counterstudy and bidirectional long-short term memory network
CN112084335A (en) * 2020-09-09 2020-12-15 电子科技大学 Social media user account classification method based on information fusion
CN112100514A (en) * 2020-08-31 2020-12-18 浙江工业大学 Social network service platform friend recommendation method based on global attention mechanism representation learning
JP2020205029A (en) * 2019-06-17 2020-12-24 大連海事大学 High-speed network representation learning method based on broad learning system

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9317567B1 (en) * 2011-02-16 2016-04-19 Hrl Laboratories, Llc System and method of computational social network development environment for human intelligence
CN105095475A (en) * 2015-08-12 2015-11-25 武汉大学 Incomplete attribute tagged pedestrian re-identification method and system based on two-level fusion
CN107194438A (en) * 2017-05-24 2017-09-22 武汉大学 A kind of depth characteristic method for expressing based on multiple stack own coding
US20200356858A1 (en) * 2019-05-10 2020-11-12 Royal Bank Of Canada System and method for machine learning architecture with privacy-preserving node embeddings
JP2020205029A (en) * 2019-06-17 2020-12-24 大連海事大学 High-speed network representation learning method based on broad learning system
CN110837602A (en) * 2019-11-05 2020-02-25 重庆邮电大学 User recommendation method based on representation learning and multi-mode convolutional neural network
CN111931903A (en) * 2020-07-09 2020-11-13 北京邮电大学 Network alignment method based on double-layer graph attention neural network
CN112100514A (en) * 2020-08-31 2020-12-18 浙江工业大学 Social network service platform friend recommendation method based on global attention mechanism representation learning
CN112035745A (en) * 2020-09-01 2020-12-04 重庆大学 Recommendation algorithm based on counterstudy and bidirectional long-short term memory network
CN112084335A (en) * 2020-09-09 2020-12-15 电子科技大学 Social media user account classification method based on information fusion

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张志扬;张凤荔;陈学勤;王瑞锦;: "基于分层注意力的信息级联预测模型", 计算机科学, no. 06 *
杨远奇;: "基于注意力机制的神经网络贝叶斯群组推荐算法", 数字技术与应用, no. 08 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114067349A (en) * 2022-01-12 2022-02-18 阿里巴巴达摩院(杭州)科技有限公司 Target object processing method and device
CN114386524A (en) * 2022-01-17 2022-04-22 深圳市城图科技有限公司 Power equipment identification method for dynamic self-adaptive graph layering simulation learning
CN116541273A (en) * 2023-04-13 2023-08-04 中国兵器工业信息中心 Binary code similarity detection method and system based on graph attention
CN116541273B (en) * 2023-04-13 2024-01-26 中国兵器工业信息中心 Binary code similarity detection method and system based on graph attention
CN116248522A (en) * 2023-05-11 2023-06-09 中国电信股份有限公司四川分公司 Service willingness and deterministic network slice association method
CN116248522B (en) * 2023-05-11 2023-09-22 中国电信股份有限公司四川分公司 Service willingness and deterministic network slice association method

Also Published As

Publication number Publication date
CN113628059B (en) 2023-09-15

Similar Documents

Publication Publication Date Title
WO2023065545A1 (en) Risk prediction method and apparatus, and device and storage medium
CN111444340B (en) Text classification method, device, equipment and storage medium
CN111061856B (en) Knowledge perception-based news recommendation method
CN111695415B (en) Image recognition method and related equipment
CN113628059A (en) Associated user identification method and device based on multilayer graph attention network
CN110659723B (en) Data processing method and device based on artificial intelligence, medium and electronic equipment
CN111291190B (en) Training method of encoder, information detection method and related device
WO2022105117A1 (en) Method and device for image quality assessment, computer device, and storage medium
CN113627447B (en) Label identification method, label identification device, computer equipment, storage medium and program product
CN110705301A (en) Entity relationship extraction method and device, storage medium and electronic equipment
WO2023065859A1 (en) Item recommendation method and apparatus, and storage medium
WO2022105118A1 (en) Image-based health status identification method and apparatus, device and storage medium
CN114330966A (en) Risk prediction method, device, equipment and readable storage medium
CN111783903A (en) Text processing method, text model processing method and device and computer equipment
CN114282059A (en) Video retrieval method, device, equipment and storage medium
CN111523586A (en) Noise-aware-based full-network supervision target detection method
CN113723238B (en) Face lightweight network model construction method and face recognition method
CN117114063A (en) Method for training a generative large language model and for processing image tasks
Du et al. Structure tuning method on deep convolutional generative adversarial network with nondominated sorting genetic algorithm II
CN111079930A (en) Method and device for determining quality parameters of data set and electronic equipment
CN116975350A (en) Image-text retrieval method, device, equipment and storage medium
CN115620342A (en) Cross-modal pedestrian re-identification method, system and computer
CN112529637B (en) Service demand dynamic prediction method and system based on context awareness
CN114494809A (en) Feature extraction model optimization method and device and electronic equipment
CN114186039A (en) Visual question answering method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant