CN111476261A

CN111476261A - Community-enhanced graph convolution neural network method

Info

Publication number: CN111476261A
Application number: CN201911288719.9A
Authority: CN
Inventors: 刘彦北; 王祺; 肖志涛; 张芳; 耿磊; 吴骏; 王雯
Original assignee: Tianjin Polytechnic University
Current assignee: Tianjin Polytechnic University
Priority date: 2019-12-16
Filing date: 2019-12-16
Publication date: 2020-07-31

Abstract

The invention provides a community-enhanced graph convolution neural network method comprising the following steps: (1) inputting a feature matrix formed from the node feature vectors of the graph data, together with the adjacency matrix of the graph; (2) calculating the modularity matrix of the graph; (3) extracting features of the graph data with a graph convolutional neural network, and passing the last hidden-layer representation through a softmax function to obtain the final graph representation; (4) defining an objective function for detecting the community structure and fusing it with the cross-entropy function into a unified loss function; (5) optimizing the parameters by minimizing the loss function, so that the graph representation is updated iteratively. The method captures the local and global structure information of the graph simultaneously when learning the graph representation, and can be widely applied in technical fields that need to analyze irregular graph data, such as social networks, traffic networks and citation networks.

Description

Community-enhanced graph convolution neural network method

Technical Field

The invention belongs to the field of data mining and provides a community-enhanced graph convolution neural network method that can be used for semi-supervised classification problems on graph data such as citation networks and social networks.

Background

Graph data has an irregular and variable structure and is used to represent objects and the relationships among them. It is applied in many fields, such as social networks, traffic networks and biochemical molecular networks, and graph data representation has become an increasingly popular area of research. Graph neural networks are the most popular graph representation methods based on deep learning. In particular, graph convolutional network methods generalize the convolution operation from traditional image data to graph data and have shown remarkable learning capability.

Graph convolutional neural network methods can be divided into spectral-based and spatial-based methods. Spectral-based methods define the graph convolution by introducing filters from the perspective of graph signal processing, working with the spectral representation of the graph; the convolution can then be interpreted as removing noise from the graph signal. Spatial-based methods define the graph convolution through the spatial relations of the nodes: an operator is defined directly on the neighborhood of each node, so that the graph convolution aggregates feature information from the neighbor nodes.

However, most graph convolutional neural network approaches focus primarily on the local structure of the network, i.e., the relationship or similarity between a node and its neighbor nodes, while the global structure, another important characteristic of the network, is ignored. Specifically, a graph convolutional neural network performs convolution on the graph by aggregating information from neighboring nodes, fusing the relationships between nodes to extract local features of the graph. The model does not take into account the global structure, i.e., the relationships between the communities formed by the nodes. Nodes are densely connected within their communities and sparsely connected between communities; in a social network, for example, users belonging to the same organization are closely related and users of different organizations are only distantly related. The representations of nodes within one community should therefore be more similar to each other than the representations of nodes in different communities.

In order to solve the problems, the invention provides a community enhanced graph convolution neural network method.

Disclosure of Invention

The invention provides a community-enhanced graph convolution neural network method that acquires the local and global information of a graph simultaneously and learns an effective graph representation. Local feature information of the graph is extracted by the main framework of a graph convolution neural network model, and global feature information is extracted by introducing community-structure constraints into the network. Experimental results show that the method outperforms existing graph convolution neural network methods.

The technical scheme for realizing the invention comprises the following steps:

Step 1: inputting a feature matrix X ∈ R^{n×m} composed of the node feature vectors of the graph data and the adjacency matrix A ∈ R^{n×n} of the graph, where n represents the number of nodes and m represents the number of features;

Step 2: calculating the modularity matrix B ∈ R^{n×n} of the graph;

Step 3: extracting features of the graph data by adopting a graph convolutional neural network, and passing the last hidden-layer representation through a softmax function to obtain the final graph representation;

Step 4: defining, by using the modularity matrix, an objective function L_com for detecting the community structure so as to obtain global information, and merging it with the cross-entropy loss function L_cro for node classification into a unified loss function L_c;

Step 5: optimizing the parameters with the back-propagation algorithm, minimizing the loss function L_c until it converges, so that the graph representation is updated iteratively.

Compared with the prior art, the invention has the beneficial effects that:

The invention provides a community-enhanced graph convolution neural network method that obtains the local and global structure information of the graph simultaneously when learning the graph representation. By defining a new loss function, it unifies the extraction of local and global information in a single network model, solves the problem that global structure information is neglected, and improves the ability to learn and represent graph data.

Drawings

FIG. 1 is a general frame diagram, namely an abstract figure;

FIG. 2 is a schematic diagram of a community enhanced graph convolution neural network;

FIG. 3 is a visualization result graph;

FIG. 4 shows the data set parameters;

FIG. 5 shows the semi-supervised classification results;

FIG. 6 shows the node clustering result.

Detailed Description

The present invention will be described in further detail with reference to specific embodiments. FIG. 1 shows a flow chart of the community-enhanced graph convolution neural network learning algorithm of the present invention. As shown in FIG. 1, the algorithm includes:

1. inputting a feature matrix formed by node feature vectors of graph data and an adjacency matrix of a graph;

2. calculating a modularity matrix of the graph;

3. extracting features of the graph data by adopting a graph convolutional neural network, and obtaining a final graph representation by the last layer of hidden representation through a softmax function;

4. defining a target function for detecting a community structure and fusing the target function and a cross entropy function into a unified loss function;

5. optimizing the parameters by minimizing the loss function, so that the graph representation is updated iteratively.

The following describes a specific implementation process of the technical solution of the present invention with reference to the accompanying drawings.

1. Data set

The invention is evaluated on three standard citation network data sets, Cora, Citeseer and Pubmed; FIG. 4 lists the parameters of each data set. In these data sets, nodes correspond to documents and edges correspond to citation relationships between documents; the edges are treated as undirected. The features of each node are the elements of the bag-of-words representation of the corresponding document, and the label of each node is the research field of the document. In the experiments, only 20 labeled nodes per class are used to train the model, and label prediction performance is evaluated on 1000 test nodes.
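The training and test selection described above can be sketched as follows. This is a minimal illustration, assuming nodes are drawn uniformly at random; the text does not state how the 20 nodes per class or the 1000 test nodes are chosen, and `make_split`, its parameters and the synthetic labels are hypothetical names introduced only for this example.

```python
import numpy as np

def make_split(labels, per_class=20, num_test=1000, seed=0):
    """Return boolean train/test masks over the nodes.

    labels: 1-D integer array with one class id per node.
    ASSUMPTION: random selection; the source does not specify the split.
    """
    rng = np.random.default_rng(seed)
    n = labels.shape[0]
    train_mask = np.zeros(n, dtype=bool)
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        chosen = rng.choice(idx, size=min(per_class, idx.size), replace=False)
        train_mask[chosen] = True
    # Test nodes are drawn only from nodes not used for training.
    rest = np.flatnonzero(~train_mask)
    test_idx = rng.choice(rest, size=min(num_test, rest.size), replace=False)
    test_mask = np.zeros(n, dtype=bool)
    test_mask[test_idx] = True
    return train_mask, test_mask

# Toy usage with synthetic labels: 3000 nodes, 3 classes.
labels = np.random.default_rng(1).integers(0, 3, size=3000)
train_mask, test_mask = make_split(labels)
```

The masks can then be used to restrict the classification loss to the training nodes and the evaluation to the test nodes.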

2. Community-enhanced graph convolution neural network method

2.1 graph data definition

Graph data is denoted G = (V, E), where V is the set of vertices and E is the set of edges. The node feature matrix is denoted X ∈ R^{n×m} and the adjacency matrix is denoted A ∈ R^{n×n}, where n represents the number of nodes and m represents the number of features.
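As a small illustration of these definitions, the sketch below builds a symmetric adjacency matrix from an undirected edge list. The helper `adjacency_from_edges`, the toy edge list and the placeholder feature matrix are assumptions made for this example and do not come from the data sets.

```python
import numpy as np

def adjacency_from_edges(edges, n):
    """Build a symmetric 0/1 adjacency matrix A (n x n) from undirected edges."""
    A = np.zeros((n, n), dtype=float)
    for i, j in edges:
        if i != j:  # self-loops are ignored in this sketch
            A[i, j] = 1.0
            A[j, i] = 1.0
    return A

# Toy graph: a triangle (0, 1, 2) with one extra node attached to node 2.
edges = [(0, 1), (1, 2), (2, 0), (2, 3)]
A = adjacency_from_edges(edges, n=4)
X = np.eye(4)  # placeholder feature matrix with n = 4 nodes and m = 4 features
```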

2.2 calculation of the modularity matrix

The modularity matrix B ∈ R^{n×n} of the input graph is calculated. The elements of the matrix are given by:

B_{ij} = A_{ij} - \frac{k_i k_j}{2e}    (1)

where A_{ij} is an element of the adjacency matrix, k_i is the degree of the i-th node and e is the number of edges.
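The calculation can be sketched in a few lines of NumPy, assuming the standard modularity matrix B_ij = A_ij - k_i k_j / (2e) built from the quantities defined above. The function name and the toy graph (two triangles joined by one edge) are illustrative only.

```python
import numpy as np

def modularity_matrix(A):
    """Modularity matrix B with B_ij = A_ij - k_i * k_j / (2e)."""
    k = A.sum(axis=1)   # node degrees k_i
    e = A.sum() / 2.0   # number of undirected edges
    return A - np.outer(k, k) / (2.0 * e)

# Toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by the edge (2, 3).
A = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]:
    A[i, j] = A[j, i] = 1.0
B = modularity_matrix(A)
```

Each row of B sums to zero, because the degree term removes exactly the number of edges a node would be expected to have under random rewiring. This is a convenient sanity check on any implementation.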

2.3 local feature acquisition

Local features are extracted with a graph convolutional neural network to obtain the hidden-layer representations of the graph:

H^{(l+1)} = \sigma\left(\tilde{D}^{-1/2} \tilde{A} \tilde{D}^{-1/2} H^{(l)} W^{(l)}\right)    (2)

where \tilde{A} = A + I is the adjacency matrix with self-connections added, \tilde{D} is the diagonal degree matrix of \tilde{A} with \tilde{D}_{ii} = \sum_j \tilde{A}_{ij}, and I is the identity matrix. σ(·) denotes an activation function, W^{(l)} is the weight matrix to be learned in training, l is the layer index, and H^{(0)} = X. The last hidden-layer representation H^{(L)} is passed to a softmax function to obtain the final graph representation, namely the label prediction:

Z = \mathrm{softmax}\left(H^{(L)}\right)    (3)
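A forward pass of this kind can be sketched as below. The sketch assumes the widely used symmetric normalization of the adjacency matrix with added self-loops and a two-layer network with ReLU on the first layer only; the function names, the random weights and the toy graph are assumptions for illustration, not the implementation of the invention.

```python
import numpy as np

def normalize_adjacency(A):
    """Return D~^{-1/2} (A + I) D~^{-1/2} (ASSUMED normalization)."""
    A_tilde = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
    return A_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def softmax(H):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    shifted = H - H.max(axis=1, keepdims=True)
    expd = np.exp(shifted)
    return expd / expd.sum(axis=1, keepdims=True)

def gcn_forward(A, X, W0, W1):
    """Two-layer graph convolution followed by softmax."""
    A_hat = normalize_adjacency(A)
    H1 = np.maximum(A_hat @ X @ W0, 0.0)  # first layer with ReLU
    H2 = A_hat @ H1 @ W1                  # last hidden representation H^(L)
    return softmax(H2)                    # Z: one row of class scores per node

# Toy usage: 4 nodes, 3 features, 32 hidden units, 2 classes.
rng = np.random.default_rng(0)
A = np.zeros((4, 4))
for i, j in [(0, 1), (1, 2), (2, 0), (2, 3)]:
    A[i, j] = A[j, i] = 1.0
X = rng.normal(size=(4, 3))
W0 = rng.normal(scale=0.1, size=(3, 32))
W1 = rng.normal(scale=0.1, size=(32, 2))
A_hat = normalize_adjacency(A)
Z = gcn_forward(A, X, W0, W1)
```

Each row of Z is a probability distribution over the classes, so the predicted label of a node is the index of the largest entry in its row.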

The cross-entropy loss function is used as the objective function for node classification:

L_{cro} = -\sum_{i \in \mathcal{L}} \sum_{k=1}^{K} Y_{ik} \ln Z_{ik}    (4)

where \mathcal{L} is the index set of the labeled nodes, K is the number of node classes, and Y_{i·} is the label indicator of the i-th labeled node.
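A cross-entropy loss restricted to the labeled nodes can be sketched as follows. The sketch assumes one-hot label indicators and a sum (rather than a mean) over the labeled nodes; the function name, the small `eps` guard against log(0) and the toy values are illustrative assumptions.

```python
import numpy as np

def masked_cross_entropy(Z, Y, labeled_idx, eps=1e-12):
    """Cross-entropy summed over the labeled nodes only.

    Z: (n, K) predicted class probabilities.
    Y: (n, K) one-hot label indicators.
    labeled_idx: indices of the nodes whose labels are available.
    """
    Z_l = Z[labeled_idx]
    Y_l = Y[labeled_idx]
    return float(-np.sum(Y_l * np.log(Z_l + eps)))

# Toy usage: 3 nodes, 2 classes, only nodes 0 and 1 are labeled.
Z = np.array([[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]])
Y = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
loss = masked_cross_entropy(Z, Y, labeled_idx=[0, 1])
```

Node 2 does not contribute to the loss even though a label row exists for it, which mirrors the semi-supervised setting in which most labels are hidden during training.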

2.4 acquisition of Global information

A community structure is introduced into the network to obtain the global information of the graph. Modularity is an index that measures the strength of a community structure, and the objective function L_com for detecting the community structure, equation (5), is defined by using the modularity matrix B.

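The two objective functions are combined to obtain the final loss function:

L_c = L_{cro} - \alpha L_{com}    (6)

where α is a hyperparameter that controls the influence of the community structure. Because L_com enters with a negative sign, minimizing L_c rewards larger values of L_com.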
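The combination of the two terms can be sketched as below. The exact form of L_com is not available in this text, so the sketch assumes a trace-form modularity objective, Tr(Z^T B Z), which is common in the community detection literature and is not necessarily the form used by the invention. All function names and the toy graph are hypothetical.

```python
import numpy as np

def modularity_matrix(A):
    """B_ij = A_ij - k_i * k_j / (2e)."""
    k = A.sum(axis=1)
    e = A.sum() / 2.0
    return A - np.outer(k, k) / (2.0 * e)

def community_objective(Z, B):
    """ASSUMED form of L_com: Tr(Z^T B Z); larger means stronger communities."""
    return float(np.trace(Z.T @ B @ Z))

def unified_loss(l_cro, l_com, alpha):
    """L_c = L_cro - alpha * L_com."""
    return l_cro - alpha * l_com

# Toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by the edge (2, 3).
A = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]:
    A[i, j] = A[j, i] = 1.0
B = modularity_matrix(A)

# Hard assignments that follow the triangles versus assignments that cut across them.
Z_good = np.eye(2)[[0, 0, 0, 1, 1, 1]]
Z_mixed = np.eye(2)[[0, 1, 0, 1, 0, 1]]
l_com_good = community_objective(Z_good, B)
l_com_mixed = community_objective(Z_mixed, B)
```

Under this assumption the assignment that matches the two triangles scores higher than the mixed one, so with the same classification loss it also yields the smaller unified loss.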

2.5 optimization of parameters

The neural network weights are updated iteratively on the training data using back-propagation with the Adam algorithm, minimizing the loss function L_c until it converges; the graph representation is updated at each iteration.
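The Adam update rule itself can be sketched on a toy problem. In the sketch below the gradient is supplied analytically instead of by back-propagation, and the decay rates `beta1`, `beta2` and the constant `eps` are the commonly used defaults, which the text does not state; `adam_minimize` and the quadratic test function are illustrative assumptions.

```python
import numpy as np

def adam_minimize(grad_fn, w0, lr=0.01, steps=200,
                  beta1=0.9, beta2=0.999, eps=1e-8):
    """Minimize a function with Adam, given a function returning its gradient."""
    w = np.array(w0, dtype=float)
    m = np.zeros_like(w)  # running mean of the gradients
    v = np.zeros_like(w)  # running mean of the squared gradients
    for t in range(1, steps + 1):
        g = grad_fn(w)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g * g
        m_hat = m / (1.0 - beta1 ** t)  # bias correction
        v_hat = v / (1.0 - beta2 ** t)
        w = w - lr * m_hat / (np.sqrt(v_hat) + eps)
    return w

# Toy usage: minimize f(w) = ||w - target||^2, whose gradient is 2 (w - target).
target = np.array([1.0, -2.0])
w = adam_minimize(lambda w: 2.0 * (w - target), w0=[0.0, 0.0], lr=0.1, steps=500)
```

In the method described here, `grad_fn` would be replaced by the gradient of L_c with respect to the weight matrices, obtained by back-propagation.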

3. Model parameter setting

The nonlinear activation function is ReLU, the model has 2 hidden layers, dropout is 0.5, the L2 regularization parameter is 5 × 10^-4, the number of hidden-layer units is 32, the learning rate is 0.01, and the number of training iterations is 200. The value of α is set to 1.2, 1.5 and 1 on the Cora, Citeseer and Pubmed data sets, respectively.
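These settings can be collected in a small configuration object, sketched below. The values are those listed above; the class name `CEGCNConfig`, its field names and the helper `config_for` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class CEGCNConfig:
    activation: str = "relu"
    num_hidden_layers: int = 2
    hidden_units: int = 32
    dropout: float = 0.5
    l2_regularization: float = 5e-4
    learning_rate: float = 0.01
    training_iterations: int = 200
    alpha: float = 1.0  # weight of the community objective

# Per-data-set values of alpha as listed in the text.
ALPHA_BY_DATASET = {"cora": 1.2, "citeseer": 1.5, "pubmed": 1.0}

def config_for(dataset):
    """Return the configuration for one of the three citation data sets."""
    return CEGCNConfig(alpha=ALPHA_BY_DATASET[dataset.lower()])

cora_config = config_for("Cora")
```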

4. Comparison algorithm

The method of the invention was compared in experiments with the following methods:

ManiReg: a semi-supervised learning model based on manifold regularization, which exploits the geometry of the marginal distribution;

SemiEmb: a semi-supervised embedding learning model;

LP: a label propagation method based on a Gaussian random field model;

DeepWalk: a graph embedding method based on random walks;

Planetoid-T: a transductive, embedding-based semi-supervised learning model;

ChebNet: a spectral graph convolutional neural network that applies a Chebyshev expansion to the graph Laplacian, which avoids computing the Laplacian eigenvectors and yields spatially localized filters;

GCN: a classical graph convolutional neural network that adopts a first-order approximation of spectral filters and provides a simplified model for semi-supervised learning;

GAT: an improved graph convolutional neural network method that employs an attention mechanism to assign different weights to the neighbors of a node when aggregating feature information.

5. Results of the experiment

FIG. 5 shows the performance of the invention on the semi-supervised classification task on the three data sets, with accuracy and F1 score as the evaluation metrics. The table shows that CE-GCN, the method of the invention, achieves the best classification performance. FIG. 6 shows the performance on the node clustering task on the three data sets. Clustering is performed with the KMeans algorithm, and the input dimension is set to 64 for all methods so that their performance can be compared. The data show that CE-GCN retains a clear advantage over the other methods. In addition, FIG. 3 shows a comparison on a data visualization task. The experimental results indicate that the method has better representation capability than the existing algorithms on tasks related to graph nodes.
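The two classification metrics can be computed as in the sketch below. The text does not say how the F1 score is averaged over classes, so macro averaging is assumed here; the function names and the toy label vectors are illustrative and are not experimental data.

```python
import numpy as np

def accuracy(y_true, y_pred):
    """Fraction of nodes whose predicted label equals the true label."""
    return float(np.mean(y_true == y_pred))

def macro_f1(y_true, y_pred, num_classes):
    """Unweighted mean of the per-class F1 scores (ASSUMED averaging)."""
    scores = []
    for c in range(num_classes):
        tp = np.sum((y_pred == c) & (y_true == c))
        fp = np.sum((y_pred == c) & (y_true != c))
        fn = np.sum((y_pred != c) & (y_true == c))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom > 0 else 0.0)
    return float(np.mean(scores))

# Toy usage with made-up labels for six test nodes and three classes.
y_true = np.array([0, 0, 1, 1, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0])
acc = accuracy(y_true, y_pred)
f1 = macro_f1(y_true, y_pred, num_classes=3)
```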

The foregoing examples are merely illustrative of the principles and efficacy of the present invention and are not intended to limit the scope of the invention, it being understood that the invention is not limited to the implementations described herein, which are described for the purpose of assisting those skilled in the art in practicing the invention. Modifications and improvements to the above-described embodiments may occur to those skilled in the art without departing from the spirit and scope of the invention, and it is intended that the invention be limited only by the appended claims.

Claims

1. A community-enhanced graph convolution neural network method, comprising the steps of:

step 1: inputting a feature matrix formed by node feature vectors of graph data and an adjacency matrix of a graph;

step 2: calculating a modularity matrix of the graph;

step 3: extracting features of the graph data by adopting a graph convolutional neural network, and passing the last hidden-layer representation through a softmax function to obtain the final graph representation;

step 4: defining an objective function for detecting the community structure by using the modularity matrix, and fusing this objective function with the cross-entropy function into a unified loss function;

step 5: optimizing the parameters by minimizing the loss function, so that the graph representation is updated iteratively.

2. The method of claim 1, wherein in step 1, a node feature matrix X ∈ R^{n×m} and an adjacency matrix A ∈ R^{n×n} are taken as inputs, where n represents the number of nodes and m represents the number of features.

3. The method of claim 1, wherein in step 2, a modularity matrix B ∈ R^{n×n} of the graph is calculated, providing data preparation for acquiring the community structure information.

4. The method of claim 1, wherein in step 3, a graph convolution neural network method is used to perform feature extraction layer by layer, new node representations are obtained by fusing neighbor information of nodes, and representations of graph data obtained by convolution layers are called hidden layer representations.

5. The method of claim 1, wherein in step 3, the last hidden layer representation is transferred to a softmax perception layer to obtain a final graph representation Z.

6. The method of claim 1, wherein in step 4, modularity is introduced into the network to measure the strength of the community structure, and the objective function L_com for detecting the community structure is defined by the modularity matrix to obtain the global information of the graph.

7. The method of claim 1, wherein in step 4, a cross-entropy loss function L_cro is used as the objective function for node classification and is fused with the objective function L_com for global information extraction into a unified loss function L_c.

8. The method of claim 1, wherein in step 5, the parameters are optimized with the back-propagation algorithm, minimizing the loss function L_c until it converges, and the graph representation is updated iteratively.