CN111353534B

CN111353534B - Graph data category prediction method based on adaptive fractional order gradient

Info

Publication number: CN111353534B
Application number: CN202010122396.2A
Authority: CN
Inventors: 骆春波; 刘子健; 罗杨; 孙文健; 王亚宁; 吴佳; 刘翔
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2020-02-27
Filing date: 2020-02-27
Publication date: 2021-01-26
Anticipated expiration: 2040-02-27
Also published as: CN111353534A

Abstract

The invention discloses a graph data category prediction method based on adaptive fractional order gradient, which comprises the steps of carrying out normalization processing on an adjacency matrix A of graph data to obtain a normalized adjacency matrix; initializing the network weight according to the preset number of layers of the network; feeding the feature matrixes H of all the nodes in the graph and the adjacency matrix A of the graph into a graph network, and calculating a mean square error loss function; updating the weight parameters and the fractional order according to the mean square error loss function to obtain a graph neural network of the self-adaptive fractional order gradient; the method solves the technical problem that the existing graph neural network method falls into local optimum, thereby obtaining an ideal optimization result.

Description

Graph data category prediction method based on adaptive fractional order gradient

Technical Field

The invention belongs to the field of graph signal processing, and particularly relates to a graph data category prediction method based on adaptive fractional order gradient.

Background

Deep learning has received a great deal of attention in dealing with non-euclidean structural data represented by graphs. In a broad sense, images, video, manifolds, etc. are different representations of a graph. Therefore, graph signal processing provides an important tool for the research fields of social science, bioinformatics, physical systems, knowledge graphs and the like.

As a non-european data processing technique, Graph Neural Networks (GNNs) take data features and adjacency relations as inputs, and focus on tasks such as node classification, edge prediction, graph clustering, and the like. The graph convolutional neural networks (GCNs) apply convolution operation to the graph networks, high-dimensional information of the graph space is extracted and aggregated, and therefore the semi-supervised node classification task is completed. These methods are usually based on the iterative update of parameters by gradient descent methods, and all have the following disadvantages: due to the non-convex loss function, the network is easy to fall into local optimum, and an ideal result cannot be obtained; the first order search method converges more slowly than the first order search method.

Disclosure of Invention

Aiming at the defects in the prior art, the graph data category prediction method based on the adaptive fractional order gradient solves the problems that the network is easy to fall into local optimum and the convergence rate of the first-order search method is slower than that of the high-order search method in the existing graph neural network method.

In order to achieve the purpose of the invention, the invention adopts the technical scheme that: a map data category prediction method based on adaptive fractional order gradient comprises the following steps:

s1, normalizing the adjacency matrix A of the graph data to obtain a normalized adjacency matrix

S2, establishing a five-layer neural network structure, initializing weight of each layer of the network, and initializing iteration times t, fractional order parameters and fractional orders, wherein t is 0;

s3, adjacent matrix to be normalized

Inputting the characteristic H of each node of the graph data into a five-layer neural network structure to obtain the prediction category of each node of the graph data;

s4, calculating a mean square error loss function value according to the prediction type of each node and the target type of each node of the graph data;

s5, judging whether the mean square error loss function value is smaller than a threshold value, if so, obtaining an optimized five-layer neural network, if not, updating the weight parameters by adopting a fractional order gradient descent method, updating the fractional order according to the fractional order parameters, increasing the value of t by 1, and jumping to the step S3;

and S6, processing the graph data by adopting the five-layer neural network after optimization to obtain the final prediction category of each node of the graph data.

Further: the step S1 includes the following steps:

s11, carrying out row summation or column summation on the adjacent matrix A to obtain a degree matrix D;

s12, adding the adjacent matrix A and the unit matrix I, and then carrying out left multiplication and right multiplication with the 0.5 power of the degree matrix D to obtain a normalized adjacent matrix

Further: adjacency matrix normalized in step S12

Comprises the following steps:

further: the five-layer neural network in the step S2 sequentially includes: an input layer, a 3-layer hidden layer and an output layer.

Further: in step S3, the input/output relationship of each layer of neural network in the five-layer neural network structure is:

wherein the content of the first and second substances,

is the input of the l-th layer neural network in the t-th iteration process,

is the output of the l-th layer neural network in the t-th iteration process,

is the input to the first layer neural network during iteration 0,

is the weight of the l-1 layer neural network in the t iteration process, 1<l is less than or equal to 5, and e is a natural logarithm.

Further: in step S5, the formula for updating the weight parameter by the fractional gradient descent method is:

wherein η is the step size factor, γ is the mean square error loss function, Γ represents the Hadamard product, Γ (·) represents the gamma function, ν_t-1Is the fractional order of t-1 iterations.

Further: in step S5, the calculation formula for updating the fractional order according to the fractional order parameter is:

wherein α is a fractional order parameter.

The invention has the beneficial effects that: compared with the traditional first-order search method, the method has higher convergence rate; because the order itself is time-varying, the order of the iteration itself is continuously updated in the process of the iteration, and the order does not fall into the current fractional order extreme point. Moreover, as the iteration process proceeds, the order gradually approaches 1, thereby ensuring the finally reached solution of the first-order problem, i.e. the ideal solution of the problem.

The method combines the advantages of fractional calculus non-local characteristics and good memorability and the design of time-varying adaptive order, so that the method can obtain more ideal optimization results.

Drawings

FIG. 1 is a flow chart of a method for predicting image data classes based on adaptive fractional order gradients.

FIG. 2 is a variation curve of a fractional order iteration process;

FIG. 3 is a comparison graph of the processing result of the method and the neural network GNN for the semi-supervised node classification dataset Cora;

FIG. 4 is a comparison graph of the processing results of the method and the neural network GNN of the graph for the semi-supervised node classification dataset Citeseer;

fig. 5 is a comparison graph of the processing result of the semi-supervised node classification data set Pubmed by the method and the neural network GNN.

Detailed Description

The following description of the embodiments of the present invention is provided to facilitate the understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and it will be apparent to those skilled in the art that various changes may be made without departing from the spirit and scope of the invention as defined and defined in the appended claims, and all matters produced by the invention using the inventive concept are protected.

As shown in fig. 1, a method for predicting a class of map data based on an adaptive fractional order gradient includes the following steps:

The step S1 includes the following steps:

Adjacency matrix normalized in step S12

Comprises the following steps:

the five-layer neural network in the step S2 sequentially includes: an input layer, a 3-layer hidden layer and an output layer.

S3, adjacent matrix to be normalized

in step S3, the input/output relationship of each layer of neural network in the five-layer neural network structure is:

wherein the content of the first and second substances,

is the input of the l-th layer neural network in the t-th iteration process,

is the output of the l-th layer neural network in the t-th iteration process,

is the input to the first layer neural network during iteration 0,

is the t-th iterationWeight of layer 1-1 neural network in generation process, 1<l is less than or equal to 5, and e is a natural logarithm.

in step S5, the formula for updating the weight parameter by the fractional gradient descent method is:

wherein η is the step size factor, γ is the mean square error loss function, Γ represents the Hadamard product, Γ (-) represents the gamma function, v_t-1Is the fractional order of t-1 iterations.

In step S5, the calculation formula for updating the fractional order according to the fractional order parameter is:

wherein α is a fractional order parameter.

Fig. 2 shows the variation curve of the adaptive fractional order of the present invention at different initial preset values with the iterative process.

The method combines the advantages of fractional calculus non-local characteristics and good memorability and the design of time-varying adaptive order, so that the method can obtain more ideal optimization results. As shown in FIGS. 3-5, the box line graphs in the graphs are the accuracy of the prediction categories obtained by the method, and it can be seen from the graphs that the accuracy of the prediction categories of the method is generally superior to the accuracy obtained by using the graph neural network GNN.

Claims

1. A graph data category prediction method based on adaptive fractional order gradient is characterized by comprising the following steps:

s3, adjacent matrix to be normalized

Inputting the characteristic H of each node of the graph data into a five-layer neural network structure to obtain the prediction category of each node of the graph data; s4, calculating a mean square error loss function value according to the prediction type of each node and the target type of each node of the graph data;

wherein η is the step size factor, γ is the mean square error loss function, Γ represents the Hadamard product, Γ (-) represents the gamma function, v_t-1Is the fractional order of the t-1 iteration process;

wherein alpha is a fractional order parameter; and S6, processing the graph data by adopting the five-layer neural network after optimization to obtain the final prediction category of each node of the graph data.

2. The adaptive fractional order gradient-based map data category prediction method of claim 1, wherein the step S1 comprises the following steps:

3. The adaptive fractional order gradient-based graph data class prediction method according to claim 2, wherein the adjacency matrix normalized in step S12

Comprises the following steps:

4. the adaptive fractional order gradient-based map data category prediction method of claim 1, wherein the five-layer neural network in step S2 sequentially comprises: an input layer, a 3-layer hidden layer and an output layer.

5. The adaptive fractional order gradient-based map data category prediction method of claim 1, wherein the input-output relationship of each layer of neural network in the five-layer neural network structure in step S3 is as follows:

wherein the content of the first and second substances,

is the input of the l-th layer neural network in the t-th iteration process,

is the output of the l-th layer neural network in the t-th iteration process,

is the input to the first layer neural network during iteration 0,