CN114626890A - Abnormal user detection method based on graph structure learning - Google Patents
- Publication number: CN114626890A (application CN202210275577.8A)
- Authority
- CN
- China
- Prior art keywords: graph, graph structure, learning, node, matrix
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G06Q30/0201: Market modelling; Market analysis; Collecting market data
- G06N3/045: Combinations of networks
- G06N3/084: Backpropagation, e.g. using gradient descent
- G06Q30/018: Certifying business or products
Abstract
The invention discloses an abnormal user detection method based on graph structure learning. A graph neural network training layer builds the graph neural network model through a model definition method and learns low-dimensional vector representations of the nodes, while a model optimization method defines several constraint functions to jointly learn the model weights and the graph structure features, enhancing the robustness of the model. The method learns multiple graph structures from the node representations to mine latent node information, improves the quality of the node low-dimensional vectors through an attention mechanism, and greatly improves the accuracy of abnormal user detection when the sample classes are imbalanced.
Description
Technical Field
The invention relates to the application of graph neural networks to abnormal user detection, and in particular to an abnormal user detection method based on graph structure learning.
Background
In the e-commerce field, abnormal transactions such as fraudulent and false transactions occur frequently, and every year a great number of people suffer economic losses from them. For an e-commerce enterprise, using settled order data and data mining to detect abnormal transaction behavior in advance, and blocking the transaction before or as it happens, safeguards users' property and greatly reduces the damage caused by abnormal transactions.
Graph neural network models for abnormal user detection already exist. A distinctive feature of the task is that the training samples are extremely imbalanced: normal users usually form the overwhelming majority. Current graph neural network models for the task typically use negative sampling or data augmentation. Negative sampling makes the proportions of normal and abnormal users roughly equal by discarding normal users, which fails to make full use of valuable normal data; data augmentation learns the characteristics of abnormal users and synthesizes new ones until the proportion of abnormal users is roughly equal to that of normal users. A new method is therefore needed that improves the accuracy of abnormal user detection by improving the robustness of the model.
Disclosure of Invention
To address the difficulty of anomaly detection, the invention provides an abnormal user detection method based on graph structure learning.
A method for detecting abnormal users based on graph structure learning comprises the following steps:
S1, capturing user transaction behavior data and converting it into a graph structure;
S2, training the graph neural network and learning the model weight coefficients until the objective function converges;
S3, generating multiple graph structures to express multiple kinds of information;
S4, detecting new user transaction behavior data with the trained graph neural network model.
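As a minimal sketch of the conversion in step S1, the snippet below turns transaction pairs into a symmetric adjacency matrix. The function name and the pair-list input format are illustrative assumptions, not specified by the patent; node features and labels would be attached separately.

```python
import numpy as np

def transactions_to_adjacency(edges, n_users):
    """Build a symmetric adjacency matrix from (user_i, user_j)
    transaction pairs: each pair of interacting users gets an edge."""
    A = np.zeros((n_users, n_users))
    for i, j in edges:
        A[i, j] = A[j, i] = 1.0
    return A
```

In practice the edge list would be extracted from cleaned user logs, as the data reading layer described later does.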
S2 includes a graph neural network model definition method, which defines a graph convolution layer for each of several graph structures and generates low-dimensional vector representations of the nodes through graph convolution networks. An attention fusion method is built for the characteristics of the abnormal user detection task: attention is paid only to the important low-dimensional vectors, so as to generate a minimal sufficient vector.
S21, the graph convolution layer is defined as follows:
H(k)=Relu(SH(k-1)W(k)),
representing the processed adjacency matrix S, which is suitable for convolution operations, wherein D represents the degree matrix of the adjacency matrix S,represents the adjacency matrix S plus the identity matrix I; h(k)Represents the data characteristics of the k-th layer graph convolution network, where H(0)The original data characteristics; w is a group ofkRepresenting weight coefficients of a k-th layer graph convolution network, wherein a Relu () function represents a nonlinear activation function; for graph convolution of l layers, the l-th layer uses softmax () activation function and the prediction matrix Z ═ Hl。
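The layer definition above can be sketched directly; this assumes the standard symmetric normalization S = D^(-1/2)(A + I)D^(-1/2) that the text describes, with the function names chosen here for illustration.

```python
import numpy as np

def normalize_adjacency(A):
    """S = D^{-1/2} (A + I) D^{-1/2}: add self-loops, then
    symmetrically normalize by the degree matrix D of A + I."""
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return (A_hat * d_inv_sqrt[:, None]) * d_inv_sqrt[None, :]

def gcn_layer(S, H, W):
    """One graph convolution layer: H^(k) = ReLU(S H^(k-1) W^(k))."""
    return np.maximum(S @ H @ W, 0.0)
```

Stacking l such layers and replacing the final ReLU with a row-wise softmax yields the prediction matrix Z = H^(l).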
Three graph structures are learned in the graph structure learning layer and applied to the graph convolution layers. The prediction matrix of the n nodes under the u-th graph structure is written Z^u ∈ R^(n×c), where Z^u_ic is the probability that node i belongs to class c under that structure, u ∈ {A, S_f, S_d, S_s}, and n is the number of nodes.
notably, the difference between the proposed graph convolution model and the mainstream graph convolution model is the challenge of dealing with the anomaly detection task by learning a variety of graph structures.
S22, for the four graph convolutions, an attention fusion method fuses the four learned low-dimensional vectors of each node into the single low-dimensional vector representation most beneficial to anomaly detection.
The attention fusion method is defined as follows.
First, define the importance coefficient of node i in the node low-dimensional vector Z^g generated by graph structure g, in terms of the largest and second-largest entries of the vector of node i in Z^g (both easy to obtain) and an artificial parameter λ ∈ (0,1). Notably, the attention fusion method has two advantages: 1. it accelerates the training of the graph neural network model, because if the prediction Z of some graph structure has a high maximum value and a large gap between its largest and second-largest entries, the method captures this and uses it to guide the optimization of the model parameters; 2. it requires no new weight coefficients, which effectively alleviates overfitting. The importance coefficients of node i under the other graph structures are obtained in the same way.
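The importance coefficient's exact formula is not reproduced in the text, so the following is a hypothetical form that matches the description: it rewards a high maximum entry and a large gap between the largest and second-largest entries. The function name and the precise combination are assumptions.

```python
import numpy as np

def importance(z_i, lam=0.5):
    """Hypothetical importance coefficient of node i under one graph
    structure: a mix of the top prediction value and the top-1/top-2
    margin, as the surrounding text describes. lam is the artificial
    parameter lambda in (0, 1)."""
    top = np.sort(z_i)[::-1]                      # entries in descending order
    z_max, z_sec = top[0], top[1]
    return lam * z_max + (1.0 - lam) * (z_max - z_sec)
```

Under this form a confident, well-separated prediction row scores higher than an ambiguous one, which is the behavior the text attributes to the coefficient.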
S23, obtaining a final prediction matrix of the graph neural network model based on the importance coefficient:
where R and K both represent the number of graph structures, ε ∈ (0,1) is an artificial parameter, and Z ∈ Rn*c. The attention fusion framework is not only applicable to the graph convolution form, but also to the fusion between any multiple low-dimensional node vectors generated by different graph structures as a low-dimensional vector fusion framework.
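A hedged sketch of the fusion step: each structure's prediction row is weighted by a hypothetical importance coefficient (top value plus top-1/top-2 margin, per the description) and the weights are plainly normalized per node. The patent's exact weighting with the parameter ε is not reproduced in the text, so this is one plausible instantiation.

```python
import numpy as np

def importance(z_i, lam=0.5):
    """Hypothetical importance: top value plus top-1/top-2 margin."""
    top = np.sort(z_i)[::-1]
    return lam * top[0] + (1.0 - lam) * (top[0] - top[1])

def fuse_predictions(Z_list, lam=0.5):
    """Fuse the per-structure prediction matrices row by row: weight each
    structure's row for node i by its normalized importance and sum."""
    Z = np.zeros_like(Z_list[0])
    for i in range(Z.shape[0]):
        w = np.array([importance(Zg[i], lam) for Zg in Z_list])
        Z[i] = (w / w.sum()) @ np.stack([Zg[i] for Zg in Z_list])
    return Z
```

Because each fused row is a convex combination of probability rows, the result stays a valid prediction matrix while leaning toward the more confident structures.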
S2 also includes a graph neural network model optimization method. It obtains the objective function by defining several constraint functions, optimizes the graph neural network model by back propagation under the guidance of the training set, and learns the prediction matrix for anomaly detection, thereby enhancing the robustness of the model. Specifically:
S201, define a consistency constraint function: reducing the loss L_u increases the similarity of the three prediction matrices and strengthens the commonality among them.
S202, defining an independence constraint function:
n is the number of the prediction matrix Z; matrix arrayWherein I is an n-order identity matrix, and the personality of the eigenvector Z can be amplified by calculating a matrix G; through LdTo enhance the difference between the generated graph structure and the original adjacency matrix a, ensuring that useful information with individuality can be captured.
S203, optimizing the weight coefficient W of the neural network model of the graph by defining a cross entropy loss function,assuming that the training set is L, the true label for each node L ∈ L is YlThe prediction label is a prediction matrix Z epsilon Rn*cThe nodes in all training sets are expressed as:
S204, combining the anomaly detection task with the constraints gives the objective function:
L = L_t + τL_u + υL_d,
where τ, υ ∈ (0,1) are artificial parameters weighting the consistency and independence constraint functions. Under the guidance of the training set, the graph neural network model is optimized by back propagation, and low-dimensional vector representations of the nodes for anomaly detection are learned.
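The objective L = L_t + τL_u + υL_d can be sketched as follows. The cross-entropy L_t follows the text; the exact forms of L_u and L_d are not reproduced there, so mean pairwise squared distance and a negated distance from A are used as assumed stand-ins with the described effects (pulling predictions together, pushing structures away from A).

```python
import numpy as np

def cross_entropy(Z, y, train_idx):
    """L_t: cross-entropy over labeled training nodes; rows of Z are
    softmax probabilities, y holds integer class labels."""
    return -np.mean(np.log(Z[train_idx, y[train_idx]] + 1e-12))

def consistency_loss(Z_list):
    """L_u (assumed form): mean pairwise squared distance between the
    prediction matrices; shrinking it pulls them together."""
    pairs = [(i, j) for i in range(len(Z_list)) for j in range(i + 1, len(Z_list))]
    return np.mean([np.mean((Z_list[i] - Z_list[j]) ** 2) for i, j in pairs])

def independence_loss(S_list, A):
    """L_d (assumed form): negative mean squared distance from the
    original adjacency A; minimizing it pushes each learned structure
    away from A."""
    return -np.mean([np.mean((S - A) ** 2) for S in S_list])

def total_loss(Z_list, S_list, A, y, train_idx, tau=0.5, ups=0.5):
    """L = L_t + tau * L_u + ups * L_d, as in the patent's objective."""
    return (cross_entropy(Z_list[0], y, train_idx)
            + tau * consistency_loss(Z_list)
            + ups * independence_loss(S_list, A))
```

Back propagation on this scalar would then update both the layer weights W and the learned structures, as described.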
S3 includes a graph structure learning method comprising feature-based graph structure learning, walk-based graph structure learning, and subgraph-based graph structure learning.
S31, the graph structure based on the characteristics SfThe learning method comprises the following steps:
firstly, fixing the weight coefficient W of the graph neural network, and taking out the vector representation H of the node as H ═ H0,H1,…,HlS to construct a feature map Sf={F0,F1,…,FlIn which FkIs the eigenvector H passing through the k-th layerkThe generated adjacency matrix based on the characteristics characterizes the similarity of k-th order neighbors, and the calculation formula is as follows:
obtaining an adjacency matrix F of each layer through the formula, wherein alpha and beta epsilon (0,1) are artificial parameters;
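Since the formula for F^k is not reproduced in the text, one plausible instantiation uses cosine similarity between the layer-k feature vectors, thresholded at α (the role of β is omitted in this sketch); the function name is illustrative.

```python
import numpy as np

def feature_graph(H_k, alpha=0.3):
    """Hypothetical F^k: cosine similarity between the layer-k feature
    vectors of every node pair, with entries below the threshold alpha
    zeroed out and self-loops removed."""
    Hn = H_k / (np.linalg.norm(H_k, axis=1, keepdims=True) + 1e-12)
    F = Hn @ Hn.T                 # pairwise cosine similarity
    F[F < alpha] = 0.0            # sparsify weak similarities
    np.fill_diagonal(F, 0.0)
    return F
```

Applied to each H^k in turn, this produces one similarity-based adjacency matrix per layer, i.e. per neighbor order.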
Notably, this calculation updates the feature vectors and the graph structure through iterative optimization: good feature vectors produce a graph structure that matches objective reality, and a graph structure closer to the ground truth is in turn more conducive to generating low-dimensional vector representations suited to the anomaly detection task; the two are optimized alternately until convergence.
S32, graph structure S based on wanderingdThe learning method comprises the following steps:
the wandering-based graph structure SdGenerated by the random walk of the nodes of the original adjacency matrix A and contains the global information of the graph structure, and therefore passes through the graph structure SdThe learned prediction matrix expresses global information of the data. The calculation formula is as follows:
here, α represents a transition probability in random walk, τ represents a walk cost, and the value becomes larger with the number of walks. In the task of abnormality detection, by learning a graph structure S having global informationdThe accuracy of the prediction matrix is improved.
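A hedged sketch of one common random-walk construction consistent with this description: a truncated diffusion of the transition matrix, where the restart probability plays the role of the transition parameter α and the geometric factor (1 − α)^t acts as a walk cost that grows with the number of steps. The patent's exact formula is not reproduced in the text.

```python
import numpy as np

def walk_graph(A, alpha=0.15, steps=4):
    """Hypothetical S_d: truncated random-walk diffusion
    S_d = sum_t alpha * (1 - alpha)^t * P^t over `steps` walk lengths."""
    deg = A.sum(axis=1)
    deg[deg == 0] = 1.0                      # avoid division by zero for isolated nodes
    P = A / deg[:, None]                     # row-stochastic transition matrix
    S, P_t, coef = np.zeros_like(P), np.eye(A.shape[0]), alpha
    for _ in range(steps):
        S += coef * P_t
        P_t = P_t @ P
        coef *= (1.0 - alpha)
    return S
```

Longer walks contribute with geometrically decaying weight, so S_d mixes local and global connectivity in one dense matrix.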
S33, the graph structure S based on the subgraphsThe learning method comprises the following steps:
for the original graph adjacency matrix A, a subgraph S is generated by randomly reserving a certain edgesAnd sub-graph SsPutting the graph volume layer into the graph volume layer to learn a prediction matrix Zs=f(X,Ss) Through ZsEvaluation subgraph SsThe probability of connecting edges between each pair of nodes in the set. The probability of connecting edges between the node i and the target node j is as follows:
where W iss∈R2c*1Representation application graph structure SsA mapping vector of bs∈R2c*1A vector of the offset is represented, and,representing node i application graph structure SsIn order to save space and time, only the limited range K of the node i is consideredsAnd (c) neighbor nodes in the set, wherein k is an artificial parameter.
After the node edge probabilities ρ_s have been computed, the graph structure S_s becomes:
S_s = S_s + μ_s ρ_s,
where μ_s ∈ (0,1) is an artificial parameter.
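A minimal sketch of the edge-probability evaluation and of the update S_s = S_s + μ_s ρ_s. The concatenate-and-sigmoid scoring is an assumption consistent with the stated shapes of W_s and b_s (the bias is treated here as a scalar for simplicity); the function names are illustrative.

```python
import numpy as np

def edge_probability(z_i, z_j, W_s, b_s):
    """rho_s(i, j): score the concatenated predictions of nodes i and j
    (a 2c-dimensional vector) with the mapping vector W_s and bias b_s,
    squashed into (0, 1) by a sigmoid."""
    x = np.concatenate([z_i, z_j])
    return 1.0 / (1.0 + np.exp(-(x @ W_s + b_s)))

def update_subgraph(S_s, rho, mu_s=0.5):
    """The patent's update rule: S_s <- S_s + mu_s * rho."""
    return S_s + mu_s * rho
```

In a full implementation ρ would only be evaluated for pairs within the k-hop range K_s of each node, as the text specifies.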
In the abnormal user detection method based on graph structure learning, abnormal users are judged by a graph neural network. Because most nodes belong to the non-abnormal class and the sample classes are severely imbalanced, the method re-expresses the adjacency matrix of the original graph through a graph structure learning model and improves robustness against edge attacks by learning information from multiple aspects. To fuse the various kinds of information into the minimal sufficient information most helpful for identifying abnormal nodes, the invention proposes an attention-based fusion method: importance is computed and the low-dimensional vector representations learned from each aspect are fused by weight, which reduces the noise in the final prediction matrix and improves the discrimination of abnormal nodes. For model optimization, several constraint functions are proposed that update the graph structure learning and the node vector representations in a direction beneficial to the anomaly detection task, reducing the negative effect of sample class imbalance on the graph convolution network.
Drawings
Fig. 1 is a flowchart of an abnormal user detection method according to a first embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example one
The embodiment provides an abnormal user detection system and method based on graph structure learning.
The data reading layer acquires transaction data of normal and abnormal users and converts it into graph structures; it comprises a data cleaning module, a data labeling module, and a data partitioning module. The data cleaning module examines the user log data, handles invalid and missing values, deletes duplicate information, corrects existing errors, and ensures data consistency; user information is extracted from the cleaned logs to form the nodes of a graph, and the relationships between users form the edges, thereby constructing the graph. The data labeling module labels known data by class, marking normal and abnormal users. The data partitioning module divides the labeled data into training, validation, and test sets in a 1:1:3 ratio.
The graph neural network training layer learns the parameters of the graph neural network model so that it can identify abnormal users; it comprises a model definition module and a model optimization module. The model definition module defines a graph convolution layer for each of several graph structures to learn low-dimensional node representations, and fuses the vectors learned by each graph convolution network by importance through an attention fusion mechanism, markedly reducing the probability of detection errors. The model optimization module optimizes the model parameters by defining a loss function, improving the identification of abnormal users.
The graph structure learning layer relearns the topology of the graph data: by defining several graph structure learning models, the original adjacency matrix is learned into several adjacency matrices that express different kinds of information, making the topology of the graph more objective and correct and markedly improving detection performance. The layer mines latent link relations from node features, fuses several more objective and correct graph structures, and reduces the negative influence of structural noise on the graph neural network.
The abnormal user detection layer handles human-machine interaction for the system and comprises a data definition and transmission module, a system management module, a file data management module, and an anomaly detection result display module. The data definition and transmission module provides an interactive interface for data cleaning, labeling, partitioning, and similar tasks; the system management module manages the system and its daily operation and maintenance; the result display module provides a visual interface for the abnormal user detection results.
The detection method comprises the following steps:
S1, capturing user transaction behavior data through the data reading layer and converting it into a graph structure;
S2, training the graph neural network on the transaction behavior data and learning the model weight coefficients until the objective function converges;
S3, generating multiple graph structures through the graph structure learning layer to express multiple aspects of the information, making the topology of the graph more objective and correct and improving the robustness of the graph neural network model;
S4, detecting new user transaction behavior data at the abnormal user detection layer with the trained graph neural network model.
Step S2 comprises the graph neural network model definition method and the graph neural network model optimization method of the graph neural network training layer.
The model definition method defines a graph convolution layer for each of the several graph structures and generates low-dimensional vector representations of the nodes through the graph convolution networks. An attention fusion method is built for the characteristics of the abnormal user detection task: attention is paid only to the important low-dimensional vectors, so as to generate a minimal sufficient vector.
The graph convolution layer is defined as follows:
H^(k) = ReLU(S H^(k-1) W^(k)),
where S = D^(-1/2) (A + I) D^(-1/2) is the processed adjacency matrix suitable for the convolution operation, D is the degree matrix of A + I, and A + I is the adjacency matrix plus the identity matrix I; H^(k) is the data-feature matrix of the k-th graph convolution layer, with H^(0) the original data features; W^(k) is the weight matrix of the k-th layer, and ReLU() is the nonlinear activation function. For a graph convolution of l layers, the l-th layer uses the softmax() activation function, and the prediction matrix is Z = H^(l).
Three graph structures are learned in the graph structure learning layer and applied to the graph convolution layers. The prediction matrix of the n nodes under the u-th graph structure is written Z^u ∈ R^(n×c), where Z^u_ic is the probability that node i belongs to class c under that structure, u ∈ {A, S_f, S_d, S_s}, and n is the number of nodes.
For the four graph convolutions, an attention fusion method fuses the four learned low-dimensional vectors of each node into the single low-dimensional vector representation most beneficial to anomaly detection.
The attention fusion method is defined as follows.
First, define the importance coefficient of node i in the node low-dimensional vector Z^g generated by graph structure g, in terms of the largest and second-largest entries of the vector of node i in Z^g (both easy to obtain) and an artificial parameter λ ∈ (0,1).
Finally, the prediction matrix of the final graph neural network model is obtained from the importance coefficients, where R and K both denote the number of graph structures, ε ∈ (0,1) is an artificial parameter, and Z ∈ R^(n×c).
The graph neural network model optimization method obtains the objective function by defining several constraint functions, optimizes the graph neural network model by back propagation under the guidance of the training set, and learns low-dimensional vector representations of the nodes for anomaly detection, enhancing the robustness of the model.
A consistency constraint function is defined to strengthen the commonality among the three prediction matrices: reducing the loss L_u increases their similarity. An independence constraint function is defined to increase the difference between each generated graph structure and the original adjacency matrix, ensuring that they capture different information:
where n is the order of the prediction matrix Z; the matrix G is built from Z and the n-order identity matrix I, and computing G amplifies the individuality of the feature matrix Z. L_d thus strengthens the difference between each generated graph structure and the original adjacency matrix A, ensuring that distinctive useful information can be captured.
The weight coefficients W of the graph neural network model are optimized by defining a cross-entropy loss function. Let the training set be L; the true label of each node l ∈ L is Y_l, and the predicted label comes from the prediction matrix Z ∈ R^(n×c). Over all nodes in the training set this is expressed as:
combining the abnormal detection task and the constraint condition to obtain the following objective function:
L=Lt+τLu+υLd,
here, τ and υ e (0,1) are artificial parameters of the coherence constraint function and the independence constraint function. Under the guidance of a training set, the graph neural network model is optimized through back propagation, and low-dimensional vector representations of nodes for anomaly detection are learned.
Step S3 further comprises the graph structure learning method of the graph structure learning layer, which includes feature-based, walk-based, and subgraph-based graph structure learning.
In particular, the feature-based graph structure S_f is learned as follows:
First, fix the weight coefficients W of the graph neural network and take out the node vector representations H = {H^0, H^1, …, H^l} to construct the feature graphs S_f = {F^0, F^1, …, F^l}, where F^k is the feature-based adjacency matrix generated from the layer-k feature matrix H^k; it characterizes the similarity of k-th-order neighbors. The calculation yields the adjacency matrix F of each layer, where α, β ∈ (0,1) are artificial parameters.
In particular, the walk-based graph structure S_d is learned as follows:
S_d is generated by random walks over the nodes of the original adjacency matrix A and contains global information about the graph structure, so the prediction matrix learned through S_d expresses global information about the data. In the calculation, α denotes the transition probability of the random walk and τ denotes the walk cost, whose value grows with the number of steps. In the anomaly detection task, learning the graph structure S_d with global information improves the accuracy of the prediction matrix.
In particular, the subgraph-based graph structure S_s is learned as follows:
From the original adjacency matrix A, generate a subgraph S_s by randomly retaining some of the edges, put S_s into the graph convolution layers to learn the nodes' prediction matrix Z_s = f(X, S_s), and use Z_s to evaluate the edge probability between every node pair in S_s. The edge probability between node i and a target node j is computed from the predictions of nodes i and j under S_s, where W_s ∈ R^(2c×1) is the mapping vector applied to graph structure S_s and b_s ∈ R^(2c×1) is the bias vector. To save space and time, only the neighbor nodes within the limited k-hop range K_s of node i are considered, where k is an artificial parameter.
After the node edge probabilities ρ_s have been computed, the graph structure S_s becomes:
S_s = S_s + μ_s ρ_s,
where μ_s ∈ (0,1) is an artificial parameter.
Therefore, while the foregoing is directed to embodiments of the present invention, other and further embodiments of the invention may be devised without departing from the basic scope thereof, and the scope thereof is determined by the claims that follow.
Claims (4)
1. An abnormal user detection method based on graph structure learning, characterized by comprising the following steps:
S1, capturing user transaction behavior data and converting it into a graph structure;
S2, training the graph neural network and learning the model weight coefficients until the objective function converges;
S3, generating multiple graph structures to express multiple kinds of information;
S4, detecting new user transaction behavior data with the trained graph neural network model.
2. The abnormal user detection method based on graph structure learning according to claim 1,
in S2, the graph neural network model is defined as follows:
S21, defining a graph convolution layer:
H^(k) = ReLU(S H^(k-1) W^(k)),
where S = D^(-1/2) Â D^(-1/2) is the processed adjacency matrix suitable for the convolution operation, D is the degree matrix of Â, and Â = A + I is the adjacency matrix A plus the identity matrix I; H^(k) is the data feature of the k-th graph convolution layer, with H^(0) the original data features; W^(k) is the weight matrix of the k-th graph convolution layer, and ReLU() is the nonlinear activation function; for a graph convolution network of l layers, the l-th layer uses the softmax() activation function, and the prediction matrix is Z = H^(l);
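A minimal numpy sketch of the layer defined above, under the standard graph convolution reading in which S is the symmetrically normalized adjacency matrix with self-loops; the function names and toy dimensions are illustrative, not part of the claim.

```python
import numpy as np

def normalize_adjacency(A):
    """S = D^(-1/2) (A + I) D^(-1/2): self-loops plus symmetric degree
    normalization, as in the standard graph convolution layer."""
    A_hat = A + np.eye(A.shape[0])
    d = A_hat.sum(axis=1)
    D_inv_sqrt = np.diag(1.0 / np.sqrt(d))
    return D_inv_sqrt @ A_hat @ D_inv_sqrt

def relu(x):
    return np.maximum(x, 0.0)

def softmax(x):
    e = np.exp(x - x.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def gcn_forward(A, X, weights):
    """H^(k) = ReLU(S H^(k-1) W^(k)) for the hidden layers; the last layer
    uses softmax, giving the prediction matrix Z = H^(l)."""
    S = normalize_adjacency(A)
    H = X
    for W in weights[:-1]:
        H = relu(S @ H @ W)
    return softmax(S @ H @ weights[-1])  # rows of Z are class distributions
```

Because Â includes self-loops, every node degree is at least 1, so the inverse square root is always defined.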
3 graph structures are learned in the graph structure learning layer and applied to the graph convolution layer; the prediction matrix of the n nodes under the u-th graph structure is denoted Z^u ∈ R^(n*c), where Z^u_ic is the probability that node i belongs to class c under the u-th graph structure, u ∈ {A, S_f, S_d, S_s}, and n is the number of nodes;
S22, for the 4 graph convolutions, fusing the 4 learned low-dimensional vectors of each node into the single low-dimensional vector representation most beneficial to anomaly detection, using an attention fusion method;
the attention fusion method is defined as follows:
first, the importance coefficient of node i in the low-dimensional node representation Z_g generated by graph structure g is defined:
where the two terms are, respectively, the maximum and the second-largest entry of node i's vector in the low-dimensional representation Z_g, and λ ∈ (0,1) is an artificial parameter;
S23, obtaining the prediction matrix of the final graph neural network model based on the importance coefficients:
where r and K both denote the number of graph structures, ε ∈ (0,1) is an artificial parameter, and Z ∈ R^(n*c).
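The importance-coefficient and fusion equations are not reproduced in this text, so the sketch below assumes a common confidence-margin form (the gap between each node's largest and second-largest prediction entries, scaled by λ) and a per-node normalized weighted sum; the role of ε in the final combination is not recoverable here and is omitted.

```python
import numpy as np

def importance(Z_g, lam=0.5):
    """Assumed importance of node i under structure g: lam times the gap
    between the largest and second-largest entries of its prediction vector."""
    top2 = np.sort(Z_g, axis=1)[:, -2:]      # per row: [second max, max]
    return lam * (top2[:, 1] - top2[:, 0])

def fuse_predictions(Z_list, lam=0.5):
    """Per-node attention fusion: weight each structure's prediction for
    node i by its normalized importance coefficient, then sum."""
    W = np.stack([importance(Z, lam) for Z in Z_list])   # shape (K, n)
    W = W / np.maximum(W.sum(axis=0, keepdims=True), 1e-12)
    return sum(w[:, None] * Z for w, Z in zip(W, Z_list))
```

Because each node's weights over the K structures sum to 1, fusing row-stochastic prediction matrices yields another row-stochastic matrix.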
3. The abnormal user detection method based on graph structure learning according to claim 1 or 2,
the S2 further includes a method for optimizing the graph neural network model:
S201, defining a consistency constraint function:
by reducing the loss function L_u, the similarity of the three prediction matrices is increased and the commonality among the prediction matrices is strengthened;
S202, defining an independence constraint function:
where n is the number of rows of the prediction matrix Z; the matrix G, in which I is the identity matrix of order n, amplifies the individuality of the feature representation Z; through L_d, the difference between each generated graph structure and the original adjacency matrix A is enlarged, so that structure-specific useful information can be captured;
S203, defining a cross-entropy loss function to optimize the weight coefficients W of the graph neural network model; assuming the training set is L, the true label of each node l ∈ L is Y_l, and the predicted label comes from the prediction matrix Z ∈ R^(n*c), the loss over all training-set nodes is expressed as:
S204, combining the anomaly detection task and the constraint terms, the following objective function is obtained:
L = L_t + τL_u + υL_d,
where τ, υ ∈ (0,1) are the artificial parameters of the consistency constraint function and the independence constraint function, respectively.
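The three loss terms can be sketched as follows. The printed equations for L_u and L_d are not reproduced in this text, so the concrete forms below (mean pairwise squared Frobenius distance for the consistency term, negated distance to A for the independence term) are assumptions chosen only to match the stated behavior: minimizing L_u pulls the prediction matrices together, and minimizing L_d pushes the generated structures away from A.

```python
import numpy as np

def cross_entropy(Z, y, train_idx):
    """L_t: cross-entropy over the labelled training nodes."""
    p = Z[train_idx, y[train_idx]]
    return -np.log(np.clip(p, 1e-12, None)).mean()

def consistency(Z_list):
    """L_u (assumed form): mean pairwise squared Frobenius distance
    between the prediction matrices."""
    total, pairs = 0.0, 0
    for i in range(len(Z_list)):
        for j in range(i + 1, len(Z_list)):
            total += np.sum((Z_list[i] - Z_list[j]) ** 2)
            pairs += 1
    return total / max(pairs, 1)

def independence(S_gen, A):
    """L_d (assumed form): negated mean distance to the original adjacency
    matrix, so minimizing L_d enlarges the difference from A."""
    return -np.mean([np.sum((S - A) ** 2) for S in S_gen])

def objective(Z, Z_list, S_gen, A, y, train_idx, tau=0.5, upsilon=0.5):
    """L = L_t + tau * L_u + upsilon * L_d, with tau, upsilon in (0, 1)."""
    return (cross_entropy(Z, y, train_idx)
            + tau * consistency(Z_list)
            + upsilon * independence(S_gen, A))
```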
4. The abnormal user detection method based on graph structure learning according to claim 1,
in S3, a graph structure learning method is included; the graph structure learning method comprises feature-based graph structure learning, walk-based graph structure learning, and subgraph-based graph structure learning;
S31, the feature-based graph structure S_f is learned as follows:
first, the weight coefficients W of the graph neural network are fixed and the node vector representations H = {H^(0), H^(1), …, H^(l)} are taken to construct the feature graphs S_f = {F^(0), F^(1), …, F^(l)}, where F^(k) is the feature-based adjacency matrix generated from the k-th layer feature vectors H^(k); it characterizes the similarity of k-th order neighbors, and its calculation formula is as follows:
the adjacency matrix F of each layer is obtained through the above formula, where α, β ∈ (0,1) are artificial parameters;
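The similarity formula for F^(k) is not reproduced in this text; a common instantiation, sketched below under that assumption, scores node pairs by cosine similarity of their layer-k features and keeps only pairs above a threshold α (β, which the claim also names, is omitted because its role is not recoverable here).

```python
import numpy as np

def feature_graph(H, alpha=0.5):
    """F^(k) (assumed form): cosine similarity between the layer-k feature
    vectors, keeping only pairs whose similarity exceeds alpha."""
    norms = np.linalg.norm(H, axis=1, keepdims=True)
    Hn = H / np.clip(norms, 1e-12, None)
    F = Hn @ Hn.T
    np.fill_diagonal(F, 0.0)            # no self-similarity edges
    return np.where(F > alpha, F, 0.0)

def feature_graphs(H_list, alpha=0.5):
    """S_f = {F^(0), ..., F^(l)}: one feature-based adjacency per layer."""
    return [feature_graph(H, alpha) for H in H_list]
```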
S32, the walk-based graph structure S_d is learned as follows:
the walk-based graph structure S_d is generated by random walks of the nodes over the original adjacency matrix A and contains the global information of the graph structure; the prediction matrix learned through S_d therefore expresses the global information of the data; the calculation formula is as follows:
where α represents the transition probability in the random walk and τ represents the walk cost, whose value grows with the number of walk steps; in the anomaly detection task, learning the graph structure S_d with global information improves the accuracy of the prediction matrix;
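The walk formula for S_d is likewise not reproduced in this text; the sketch below assumes a common truncated random-walk diffusion in which P = D^(-1)A is the transition matrix and longer walks contribute with geometrically shrinking weight, standing in for the growing walk cost τ. The function name, parameters, and step count are illustrative.

```python
import numpy as np

def walk_graph(A, alpha=0.15, steps=4):
    """S_d (assumed form): truncated random-walk diffusion. P = D^(-1) A is
    the row-stochastic transition matrix; each extra step is discounted, and
    the sum mixes multi-hop (global) connectivity into one matrix."""
    d = A.sum(axis=1)
    P = A / np.clip(d[:, None], 1e-12, None)
    S_d = np.zeros_like(A, dtype=float)
    P_t = np.eye(A.shape[0])
    for t in range(1, steps + 1):
        P_t = P_t @ P                       # t-step transition probabilities
        S_d += alpha * (1 - alpha) ** (t - 1) * P_t
    return S_d
```

Each row of S_d then carries total mass 1 - (1 - α)^steps, distributed over nodes reachable within `steps` hops, which is how nodes with no direct edge still become connected in S_d.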
S33, the subgraph-based graph structure S_s is learned as follows:
for the original adjacency matrix A, a subgraph S_s is generated by randomly retaining a subset of edges, and the subgraph S_s is fed into the graph convolution layer to learn the node prediction matrix Z_s = f(X, S_s); Z_s is then used to evaluate the edge probability between each node pair; the edge probability between node i and target node j is:
where W_s ∈ R^(2c*1) is the mapping vector applied to graph structure S_s, b_s ∈ R^(2c*1) is the offset vector, and the remaining term denotes the low-dimensional representation of node i under graph structure S_s; to save space and time, only the neighbor nodes within the limited range K_s of node i are considered, where k is an artificial parameter;
after the node edge probability ρ_s is computed, the graph structure S_s is updated as:
S_s = S_s + μ_s ρ_s,
where μ_s ∈ (0,1) is an artificial parameter.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210275577.8A CN114626890A (en) | 2022-03-21 | 2022-03-21 | Abnormal user detection method based on graph structure learning |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114626890A true CN114626890A (en) | 2022-06-14 |
Family
ID=81904048
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210275577.8A Pending CN114626890A (en) | 2022-03-21 | 2022-03-21 | Abnormal user detection method based on graph structure learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114626890A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115545467A (en) * | 2022-09-30 | 2022-12-30 | 广东工业大学 | Risk commodity identification model based on graph neural network |
CN115545467B (en) * | 2022-09-30 | 2024-01-23 | 广东工业大学 | Risk commodity identification model based on graphic neural network |
CN116646072A (en) * | 2023-05-18 | 2023-08-25 | 肇庆医学高等专科学校 | Training method and device for prostate diagnosis neural network model |
CN116993433A (en) * | 2023-07-14 | 2023-11-03 | 重庆邮电大学 | Internet E-commerce abnormal user detection method based on big data |
CN116708029A (en) * | 2023-08-04 | 2023-09-05 | 烟台大学 | Method, system, equipment and storage medium for detecting abnormal nodes of blockchain |
CN116708029B (en) * | 2023-08-04 | 2023-10-20 | 烟台大学 | Method, system, equipment and storage medium for detecting abnormal nodes of blockchain |
CN117093928A (en) * | 2023-10-18 | 2023-11-21 | 南开大学 | Self-adaptive graph node anomaly detection method based on spectral domain graph neural network |
CN117520995A (en) * | 2024-01-03 | 2024-02-06 | 中国海洋大学 | Abnormal user detection method and system in network information platform |
CN117520995B (en) * | 2024-01-03 | 2024-04-02 | 中国海洋大学 | Abnormal user detection method and system in network information platform |
CN117910519A (en) * | 2024-03-20 | 2024-04-19 | 烟台大学 | Graph application method, system and recommendation method for generating evolutionary graph to fight against network |
CN117910519B (en) * | 2024-03-20 | 2024-06-07 | 烟台大学 | Recommendation method for generating countermeasure network by evolutionary graph |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20220614 |