CN114756713A - Graph representation learning method based on multi-source interaction fusion

Info

Publication number: CN114756713A
Application number: CN202210267016.3A
Authority: CN
Inventors: 朱东杰; 孙云栋; 张星东; 丁卓
Original assignee: Nanjing Longyuan Information Technology Co ltd; Harbin Institute of Technology Weihai
Current assignee: Nanjing Longyuan Information Technology Co ltd; Harbin Institute of Technology Weihai
Priority date: 2022-03-17
Filing date: 2022-03-17
Publication date: 2022-07-15

Abstract

The invention discloses a graph representation learning method based on multi-source interaction fusion, comprising the following steps: extracting node attributes, node categories and the adjacency relations among nodes in a network in graph-structure form; obtaining a BFS high-order neighborhood node set and a DFS high-order neighborhood node set, respectively, using BFS-based and DFS-based meta-path high-order neighborhood node sampling algorithms; acquiring the first-order neighborhood information of each node through a first-order neighborhood information aggregation algorithm; acquiring the high-order neighborhood information of each node through a heterogeneous high-order neighborhood information aggregation algorithm; fusing the node's own information, its high-order neighborhood information and its first-order neighborhood information using a multi-source information fusion model based on a gated neural network, to obtain the multi-source interaction fusion information of the node as its final vector representation; and optimizing the parameters of the algorithm model under a multi-task optimization function. The method improves the ability to extract information from the nodes inside a meta-path and greatly enhances the ability to capture neighborhood information at different levels.

Description

Graph representation learning method based on multi-source interaction fusion

Technical Field

The invention relates to the technical field of graph data representation learning and graph data mining, in particular to a graph representation learning method based on multi-source interaction fusion.

Background

Many complex systems, such as social networks, biological networks and information networks, are represented as data in graph-structure form. Network data is often complex and therefore difficult to process: computational complexity is high, parallelism is low, and existing machine learning and deep learning methods are hard to apply directly. To process network data more efficiently, the primary challenge is to find an efficient representation of network data, so that downstream data analysis tasks such as data mining, analysis and prediction can be completed efficiently within limited space and time. Graph representation learning is a promising method for representing graph-structured data and can support a range of graph data processing and analysis tasks, such as node classification, node clustering, graph visualization and link prediction. Compared with traditional graph data representations, graph representation learning has three advantages. First, it represents graph nodes and their relations as vectors in a lower-dimensional space, which reduces dimensionality, lowers storage cost and improves computational efficiency. Second, it removes noise and redundant information while preserving the structure and topology of the graph data, which improves the extraction and mining of latent data features. Most importantly, the distance between node vectors can be used to measure the relationship between nodes, can be computed in parallel, and can be fed to state-of-the-art machine learning and deep learning algorithms, which greatly broadens the application scenarios.
Therefore, how to represent graph-structured data efficiently has become a hot topic in recent research and has given rise to the field of graph representation learning. In addition, the attribute information of graph nodes and the associations between nodes are complex and diverse in practice, so mining and learning this complex information efficiently and comprehensively has important research value for downstream tasks.

Existing graph representation learning methods have achieved great success in node classification, link prediction, recommendation systems, community discovery and other fields, and the emergence of the graph neural network (GNN) has further improved network representation learning. GNN-based network representation learning methods aggregate the information of nodes directly adjacent to a central node well, but they cannot directly aggregate high-order neighbor information in a single layer. They can aggregate distant neighbor information by iterating over multiple layers, but this approach has high complexity and suffers from problems such as the loss of indirect information and the introduction of a large amount of noise during iteration. In addition, existing GNN-based methods do not fully consider and distinguish the relationships between nodes hierarchically, although relationships at different levels and with different semantics have an important influence on the representation result in real scenarios. Finally, most existing methods for processing heterogeneous graph data are based on meta-path strategies, which extract different relationships by defining node interaction patterns, but they do not include the nodes inside a meta-path in information aggregation. For example, the APA meta-path (A stands for author, P stands for paper) in an academic citation network is used to mine two authors who jointly published the same paper; the meta-path only considers the two authors and ignores the information of the jointly published paper, so a large amount of information is lost and accurate mining of the relationship between the two authors is affected.

Disclosure of Invention

The invention provides a graph representation learning method based on multi-source interaction fusion, which aims to solve the following problems of existing graph representation learning techniques: they cannot directly aggregate high-order neighbor information in a single layer, they do not fully consider and distinguish the relationships between nodes hierarchically, and they do not include the nodes inside a meta-path in information aggregation when processing the meta-path information of heterogeneous graph data.

In order to achieve the purpose, the technical scheme of the invention is as follows:

a graph representation learning method based on multi-source interaction fusion comprises the following steps:

extracting node attributes, node categories and adjacency relations among nodes in a network in a graph structure form;

dividing the neighbor nodes of the nodes into directly adjacent first-order neighborhood nodes and indirectly adjacent high-order neighborhood nodes based on the adjacency relation among the nodes;

obtaining a BFS high-order neighborhood node set using a BFS-based meta-path high-order neighborhood node sampling algorithm, and obtaining a DFS high-order neighborhood node set using a DFS-based meta-path high-order neighborhood node sampling algorithm;

acquiring first-order neighborhood information of the node through a first-order neighborhood information aggregation algorithm;

acquiring high-order neighborhood information of the node through a heterogeneous high-order neighborhood information aggregation algorithm;

fusing self information of the nodes, high-order neighborhood information and first-order neighborhood information of the nodes by using a multi-source information fusion model based on a gated neural network to obtain multi-source interaction fusion information of the nodes as final vector representation;

and adjusting parameters of the algorithm model under the multitask optimization function until the iteration times or the precision requirement is met.

Preferably, each layer of sampling in the generation process of the BFS-based meta-path high-order neighborhood node sampling algorithm follows a meta-path mode, and each node passed by the intermediate step is retained.

Preferably, in the generating process of the DFS-based meta-path high-order neighborhood node sampling algorithm, each step of walking sampling follows a meta-path mode, and each node passed by an intermediate step is retained, and a generating policy formula is as follows:

where the random function represents the walk-with-memory strategy; vⁱ represents the currently visited node and vⁱ⁺¹ the next node that may be visited; E represents the set of all edges in the graph; r_i ∈ R (0 ≤ i < L_R) represents the i-th node type in the meta-path pattern; and L_R represents the length of the meta-path.

Preferably, the first-order neighborhood information aggregation algorithm adds the relationship between nodes while preserving the structure information aggregation capability of GNNs and the information transfer characteristics between network nodes, and defines a new node update strategy, as follows:

wherein

represents the out first-order neighbors of node i, i.e., nodes t for which there is an edge pointing from node i to node t,

represents the in first-order neighbors of node i, i.e., nodes t' for which there is an edge pointing from node t' to node i,

represents the vector representation of a node at the l-th layer of the neural network, d(l) represents the vector dimension of nodes at the l-th layer, W^l is the learnable weight matrix of the l-th layer, and different attention parameters are set for the different edge directions:

and

Preferably, obtaining the high-order neighborhood information of the node through the heterogeneous high-order neighborhood information aggregation algorithm specifically comprises the following steps:

first, when aggregating the node information of each path, attention is paid to the node information of the two endpoints of the path while the intermediate nodes along the path are also included in the path information aggregation, and all node information in each meta-path is aggregated using the proposed Inner-Attention GNN network;

then, information fusion is performed over the meta-path sequences with different attention using the proposed Inter-Attention GNN network. The Inner-Attention GNN aggregation function is as follows:

wherein N_bl(i) is the set of dl-th path neighbors that node i acquires through the DFS strategy,

represents the information aggregated within the dl-th path neighbors of node i at the l-th layer of the neural network,

represents a learnable network parameter matrix, and α_ij represents the learnable attention weight between node i and node j, which is calculated as follows:

wherein

is the learnable attention network parameter of the dl-th path, and [·‖·] is the vector concatenation operation;

the attention weight is then normalized using the SoftMax function:

finally, the Inter-Attention GNN network aggregation function is:

wherein DL is a manually set hyper-parameter representing the maximum number of paths under the DFS strategy,

is a learnable neural network parameter,

is the attention weight of the dl-th path neighborhood of node i, obtained by training the Inter-Attention GNN.

Preferably, the gated neural network-based multi-source information fusion model has a fusion function of:

wherein

m and b are learnable parameters,

and

are, respectively, the high-order neighborhood information under the DFS strategy and the high-order neighborhood information under the BFS strategy at the l-th layer of the model.

Preferably, the multi-task optimization function is a combination of an adjacency optimization task and a node label prediction task:

L = ω₁L₁ + (1 - ω₁)L₂

wherein ω₁ is a hyper-parameter representing the proportion of the main task. The adjacency optimization task serves as the main task, and its optimization function is as follows:

the node label prediction task is used as an auxiliary task, and the optimization function is as follows:

where Y represents the set of node labels in all training sets, t_i represents that the true label of the node is i, and y_i represents whether the predicted label of the node is i: if so, y_i is 1, otherwise y_i is 0.

The parameters of the model are optimized by continuously minimizing the multi-task optimization function using a back-propagation gradient algorithm.

Based on the above technical scheme, the beneficial effects of the invention are as follows. The neighbor nodes of each node in the graph are divided into directly adjacent first-order neighborhood nodes and indirectly adjacent high-order neighborhood nodes; for the high-order neighborhood nodes, a BFS high-order neighborhood node set and a DFS high-order neighborhood node set are obtained using the proposed BFS-based and DFS-based meta-path high-order neighborhood node sampling algorithms, respectively. The high-order neighborhood information and the first-order neighborhood information of the nodes are then obtained using, respectively, the proposed heterogeneous high-order neighborhood information aggregation algorithm, which includes the information of the nodes inside each path, and the first-order neighborhood information aggregation algorithm. Finally, the node's own information, its high-order neighborhood information and its first-order neighborhood information are fused using a gated neural network to obtain the multi-source interaction fusion information of the node as its final vector representation, and the whole process is optimized under multiple tasks. The method addresses the insufficient ability of existing graph neural networks to capture distant neighborhood nodes, improves the extraction of node information inside meta-paths, and greatly enhances the capture of neighborhood information at different levels.

Drawings

FIG. 1 is a flow diagram of a graph representation learning method based on multi-source interaction fusion in one embodiment;

FIG. 2 is a schematic diagram of a first-order neighborhood information aggregation method in one embodiment;

FIG. 3 is a diagram of a hierarchical attention neighborhood information aggregation algorithm in one embodiment;

FIG. 4 is a diagram illustrating a heterogeneous path information aggregation method based on meta-paths in an embodiment.

Detailed Description

The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.

The embodiment is described with reference to fig. 1, and the graph representation learning method based on multi-source interaction fusion provided by the embodiment specifically includes the following steps:

Step S1: read, from the graph data, i.e., a network in graph-structure form (such as a social network, information network, technical network or biological network), the node attribute matrix H_{n×d}, the node adjacency relation matrices under the different interaction modes r, and the class label information matrix L_{n×c} of the nodes, wherein n represents the number of nodes in the graph; d represents the dimension of the initial node attributes; r represents an interaction mode, namely a meta-path; and c represents the number of node classes. Then divide the neighbor nodes of each node in the graph into directly adjacent first-order neighborhood nodes and indirectly adjacent high-order neighborhood nodes.
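The neighbor split in step S1 can be illustrated with a minimal sketch. It assumes the adjacency relation is available as a plain dictionary and uses shortest-path hop counts to separate the two groups; the function name `split_neighbors` and the `max_order` cut-off are illustrative assumptions, not part of the method as described.

```python
from collections import deque


def split_neighbors(adj, source, max_order):
    """Return (first_order, high_order) neighbors of `source`.

    adj: dict mapping a node to an iterable of directly adjacent nodes.
    high_order maps a hop count k (2 <= k <= max_order) to the set of nodes
    whose shortest-path distance from `source` is exactly k.
    """
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == max_order:
            continue  # do not expand beyond the requested order
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    first_order = {v for v, k in dist.items() if k == 1}
    high_order = {}
    for v, k in dist.items():
        if k >= 2:
            high_order.setdefault(k, set()).add(v)
    return first_order, high_order


# Hypothetical toy graph: a - b - d - e, plus a - c.
adj = {"a": ["b", "c"], "b": ["a", "d"], "c": ["a"], "d": ["b", "e"], "e": ["d"]}
first, high = split_neighbors(adj, "a", max_order=3)
```

Here `first` holds the directly adjacent nodes, and `high` groups the indirectly adjacent nodes by order so that later steps can treat each order separately.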

Step S2: for the high-order neighborhood nodes, obtain the BFS high-order neighborhood node set V_B using the proposed BFS-based meta-path high-order neighborhood node sampling algorithm. Specifically, by setting different path lengths, the neighborhood nodes of different layers, together with their information, are acquired under all relation modes according to the BFS strategy.
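One way to picture the BFS-based sampling is the following sketch, in which each layer is restricted to the node type that the meta-path pattern prescribes and every intermediate layer is kept. The function `bfs_metapath_layers`, the cyclic reuse of a symmetric pattern such as "APA", and the toy author/paper graph are assumptions made for illustration.

```python
def bfs_metapath_layers(adj, node_type, start, pattern, depth):
    """Layer k holds the nodes first reached from `start` in k hops whose
    types follow `pattern`, repeated cyclically (e.g. "APA" -> A, P, A, P, ...)."""
    if len(pattern) < 2 or pattern[0] != pattern[-1]:
        raise ValueError("this sketch expects a symmetric meta-path pattern")
    if node_type[start] != pattern[0]:
        raise ValueError("start node type must match the first pattern type")
    period = len(pattern) - 1
    layers = [{start}]
    visited = {start}
    for k in range(1, depth + 1):
        wanted = pattern[k % period]  # node type required at hop k
        frontier = {
            v
            for u in layers[-1]
            for v in adj.get(u, ())
            if v not in visited and node_type[v] == wanted
        }
        visited |= frontier
        layers.append(frontier)  # intermediate layers are retained
    return layers


# Hypothetical author/paper graph: a1-p1, a2-p1, a2-p2, a3-p2.
adj = {
    "a1": ["p1"], "a2": ["p1", "p2"], "a3": ["p2"],
    "p1": ["a1", "a2"], "p2": ["a2", "a3"],
}
node_type = {n: n[0].upper() for n in adj}  # "a1" -> "A", "p1" -> "P"
layers = bfs_metapath_layers(adj, node_type, "a1", "APA", depth=4)
```

Keeping `layers[1]` and `layers[3]` (the papers) alongside the author layers reflects the requirement that nodes passed in intermediate steps are retained.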

Obtain the DFS high-order neighborhood node set V_D using the proposed DFS-based meta-path high-order neighborhood node sampling algorithm. Specifically, by setting different path lengths, heterogeneous meta-path nodes under all relation modes are obtained according to the DFS strategy and the intermediate node information is recorded. The generation strategy formula is as follows:

where the random function represents the walk-with-memory strategy; vⁱ represents the currently visited node and vⁱ⁺¹ the next node that may be visited; E represents the set of all edges in the graph; r_i ∈ R (0 ≤ i < L_R) represents the i-th node type in the meta-path pattern; and L_R represents the length of the meta-path.
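The DFS-based sampling can be sketched as a meta-path-guided random walk. The sketch below is one reading of the symbol definitions above, not the generation formula itself: the next node must be joined to the current node by an edge in E and have the type required by the pattern, and "memory" is interpreted here as not revisiting nodes already on the path. The function name and toy graph are hypothetical.

```python
import random


def dfs_metapath_walk(adj, node_type, start, pattern, rng):
    """Walk from `start` following the node types in `pattern`.

    Returns the full path, including intermediate nodes, or None when the
    walk reaches a dead end before the pattern is completed.
    """
    if node_type[start] != pattern[0]:
        return None
    path = [start]
    for wanted in pattern[1:]:
        candidates = [
            v
            for v in adj.get(path[-1], ())   # (v_i, v_{i+1}) must be an edge
            if node_type[v] == wanted        # type must follow the pattern
            and v not in path                # assumption: memory = no revisits
        ]
        if not candidates:
            return None
        path.append(rng.choice(candidates))
    return path


# Hypothetical author/paper graph: a1-p1, a2-p1, a2-p2, a3-p2.
adj = {
    "a1": ["p1"], "a2": ["p1", "p2"], "a3": ["p2"],
    "p1": ["a1", "a2"], "p2": ["a2", "a3"],
}
node_type = {n: n[0].upper() for n in adj}
rng = random.Random(0)
walk_apa = dfs_metapath_walk(adj, node_type, "a1", "APA", rng)
walk_long = dfs_metapath_walk(adj, node_type, "a1", "APAPA", rng)
walk_dead = dfs_metapath_walk(adj, node_type, "a1", "APAPAPA", rng)
```

Each returned path keeps the paper nodes between the authors, so the later aggregation step can use them rather than only the two endpoints.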

Step S3: acquire the first-order neighborhood information h_{i,1} of each node using the GNN-based first-order neighborhood information aggregation algorithm. The implementation principle is shown schematically in FIG. 2, in which h_i is the vector representation of node i, r_ij is the vector representation of the relationship between nodes i and j, α_ij is the attention weight calculated for nodes i and j, different colors of the solid adjacent edges represent different relationships, and different colors of the dotted lines represent the weights of different attention heads. While preserving the structure information aggregation capability of GNNs and the information transfer characteristics between network nodes, the relationships between nodes are added and a new node update strategy is defined:

wherein

represents the in first-order neighbors of node i, i.e., nodes t' for which there is an edge pointing from node t' to node i;

represents the vector representation of a node at the l-th layer of the neural network; d(l) represents the vector dimension of nodes at the l-th layer; W^l is a learnable weight matrix; and different attention parameters are set for the different edge directions:

and

Taking

as an example, the calculation method is as follows:

The SoftMax function then normalizes the attention weights:
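As a rough illustration of direction-specific attention with SoftMax normalization, the sketch below scores each neighbor with a GAT-style rule, LeakyReLU(a · [h_i ‖ h_j ‖ r_ij]), using a separate attention vector per edge direction. This scoring rule, the message h_j + r_ij, the tanh activation and the omission of the weight matrix W^l are all assumptions chosen for brevity; they are not the update formula of the method.

```python
import math


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def aggregate_direction(h_i, neighbors, attn):
    """neighbors: list of (h_j, r_ij) pairs; attn: attention vector for one
    edge direction, with length 3 * len(h_i)."""
    out = [0.0] * len(h_i)
    if not neighbors:
        return out
    # List "+" concatenates, giving [h_i || h_j || r_ij].
    scores = [leaky_relu(dot(attn, h_i + h_j + r_ij)) for h_j, r_ij in neighbors]
    alphas = softmax(scores)  # normalized attention weights
    for alpha, (h_j, r_ij) in zip(alphas, neighbors):
        for k in range(len(out)):
            out[k] += alpha * (h_j[k] + r_ij[k])  # assumed message: h_j + r_ij
    return out


def first_order_update(h_i, out_nbrs, in_nbrs, attn_out, attn_in):
    agg_out = aggregate_direction(h_i, out_nbrs, attn_out)
    agg_in = aggregate_direction(h_i, in_nbrs, attn_in)
    return [math.tanh(a + b) for a, b in zip(agg_out, agg_in)]


h_i = [1.0, 0.0]
out_nbrs = [([0.5, 0.5], [0.0, 0.5])]
in_nbrs = [([1.0, -1.0], [0.0, 0.0]), ([0.0, 1.0], [0.5, 0.0])]
h_new = first_order_update(h_i, out_nbrs, in_nbrs, [0.1] * 6, [0.0] * 6)
```

With a zero attention vector for the incoming direction, the two in-neighbors receive equal weight, which makes the result easy to check by hand.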

Step S4: acquire the high-order neighborhood information of each node through the heterogeneous high-order neighborhood information aggregation algorithm. The neighbors sampled by the BFS strategy are processed layer by layer, and a hierarchical attention mechanism is proposed to selectively aggregate different neighbor information in different layers; a schematic diagram of the hierarchical attention model is shown in FIG. 3.

First, an Inner-Attention GNN network is proposed to aggregate the neighborhood information within each layer of the neighborhood. The new aggregation function is as follows:

wherein N_bl(i) is the set of bl-th order (bl ≥ 2) neighbors that node i acquires through the BFS strategy,

represents the information aggregated within the bl-th order neighbors of node i at the l-th layer of the neural network,

represents a learnable network parameter matrix, and α_ij represents the learnable attention weight between node i and node j. The vectors of the two nodes are first input into an attention network to calculate the attention weight between them, as follows:

wherein

is the learnable attention network parameter of the bl-th order, and [·‖·] is the vector concatenation operation.

The attention weight is then normalized using the SoftMax function:

After the aggregated information of the nodes in each layer is obtained, the information of the different layers needs to be fused. An Inter-Attention GNN network is therefore proposed to fuse the aggregated information of the different layers, with the following aggregation function:

wherein BL is a manually set hyper-parameter representing the maximum order under the BFS strategy,

is a learnable neural network parameter,

is the attention weight of the bl-th layer neighborhood of node i, obtained by training the Inter-Attention GNN.
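The two-level attention described above can be sketched as follows: an inner step summarizes each BFS layer with attention over its nodes, and an inter step fuses the layer summaries with attention over layers. The scoring functions (LeakyReLU over a concatenation for the inner step, a dot product with a query vector `q` for the inter step) and the omission of the parameter matrices are assumptions made to keep the example short.

```python
import math


def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def weighted_sum(weights, vectors):
    dim = len(vectors[0])
    return [sum(w * v[k] for w, v in zip(weights, vectors)) for k in range(dim)]


def inner_attention(h_i, layer_vectors, a_bl):
    """Summarize one neighbor layer; a_bl is that layer's attention vector."""
    # List "+" concatenates, giving [h_i || h_j].
    scores = [leaky_relu(dot(a_bl, h_i + h_j)) for h_j in layer_vectors]
    return weighted_sum(softmax(scores), layer_vectors)


def inter_attention(layer_summaries, q):
    """Fuse per-layer summaries; returns (fused vector, layer weights)."""
    betas = softmax([dot(q, z) for z in layer_summaries])
    return weighted_sum(betas, layer_summaries), betas


h_i = [1.0, 0.0]
layer2 = [[1.0, 1.0], [3.0, 1.0]]   # hypothetical 2nd-order neighbors
layer3 = [[0.0, 4.0]]               # hypothetical 3rd-order neighbors
# Zero parameters make every attention distribution uniform.
z2 = inner_attention(h_i, layer2, [0.0] * 4)
z3 = inner_attention(h_i, layer3, [0.0] * 4)
fused, betas = inter_attention([z2, z3], [0.0, 0.0])
```

The same pair of functions applies to the DFS branch if each "layer" is replaced by the node sequence of one sampled meta-path.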

Similarly, the neighbors sampled by DFS are processed path by path to extract the information of different meta-paths. A meta-path specifies a relationship pattern with a certain practical meaning. For example, the APA meta-path in a citation graph can mine authors who published the same paper, even though the two authors are not directly connected in the original heterogeneous network; similarly, the APCPA meta-path can mine authors who published papers in the same conference or journal, and although the two authors may have no direct contact, their research directions may be similar.
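To make the APA example concrete, the following sketch enumerates APA meta-path instances from a hypothetical author-to-papers mapping and keeps the shared papers, i.e., the intermediate P nodes, with each author pair. The function name and data are illustrative only.

```python
from itertools import combinations


def apa_instances(author_papers):
    """Map each author pair linked by an APA meta-path to their shared papers."""
    instances = {}
    for a, b in combinations(sorted(author_papers), 2):
        shared = author_papers[a] & author_papers[b]
        if shared:
            instances[(a, b)] = shared  # keep the intermediate paper nodes
    return instances


author_papers = {"a1": {"p1"}, "a2": {"p1", "p2"}, "a3": {"p2"}}
pairs = apa_instances(author_papers)
```

Authors `a1` and `a3` share no paper and therefore form no APA instance, while the pairs that do appear carry the papers that connect them.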

In the proposed DFS path-based high-order neighborhood information aggregation algorithm, shown schematically in FIG. 4, different meta-path instances are first obtained using the meta-path information sampling strategy; second, all node information within each meta-path is fused using the proposed Inner-Attention GNN network; finally, information aggregation is performed over the meta-paths with different attention using the proposed Inter-Attention GNN network.

Step S5: fuse the node's own information, its high-order neighborhood information and its first-order neighborhood information using a gated neural network to obtain the multi-source interaction fusion information of the node. The fusion strategy is as follows:

wherein

m and b are learnable parameters.
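A gated fusion of several information sources can be sketched as below. The particular gate used here, an element-wise sigmoid(M h + b) applied to each source vector before summation, is an assumed stand-in for the fusion function; only the existence of a learnable matrix and bias is taken from the text.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(M, v):
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def gated_fusion(sources, M, b):
    """Sum the source vectors, each scaled element-wise by its own gate."""
    dim = len(b)
    fused = [0.0] * dim
    for h in sources:
        gate = [sigmoid(z + bk) for z, bk in zip(matvec(M, h), b)]
        for k in range(dim):
            fused[k] += gate[k] * h[k]
    return fused


# Hypothetical vectors: self, first-order, BFS high-order, DFS high-order.
sources = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
M = [[0.0, 0.0], [0.0, 0.0]]  # zero parameters give every gate the value 0.5
b = [0.0, 0.0]
fused = gated_fusion(sources, M, b)
```

With trained parameters the gates differ per source and per dimension, which lets the model decide how much of each information source passes into the final representation.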

Step S6: continuously optimize the parameters of the algorithm model under the multi-task optimization function until the number of iterations or the accuracy requirement is met. The adjacency optimization task is taken as the main task: by minimizing the loss function L₁, the low-dimensional vector representations of adjacent nodes become more similar and those of non-adjacent nodes become more distant. L₁ is calculated as follows:

The node label prediction task is taken as the auxiliary task: by minimizing the cross-entropy loss function L₂, the resulting low-dimensional node vectors are made to carry the label information of the nodes. L₂ is calculated as follows:

where Y represents the set of node labels in all training sets, t_i represents that the true label of the node is i, and y_i represents whether the predicted label of the node is i: if so, y_i is 1, otherwise y_i is 0.

The two tasks are combined to obtain the final loss function L, where the hyper-parameter ω₁ represents the proportion of the main task. L is calculated as follows:

L = ω₁L₁ + (1 - ω₁)L₂.
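The weighted combination of the two tasks can be sketched as follows. The combination L = ω₁L₁ + (1 - ω₁)L₂ is taken from the text; the concrete form of L₁ used here (a logistic loss over adjacent and non-adjacent node pairs, in the style of negative sampling) is an assumed stand-in, and L₂ is an ordinary cross-entropy over one-hot labels.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def adjacency_loss(z, pos_pairs, neg_pairs):
    """Assumed L1: pull adjacent pairs together, push non-adjacent pairs apart."""
    loss = 0.0
    for i, j in pos_pairs:
        loss -= math.log(sigmoid(dot(z[i], z[j])))
    for i, j in neg_pairs:
        loss -= math.log(sigmoid(-dot(z[i], z[j])))
    return loss


def label_loss(true_onehot, pred_probs, eps=1e-12):
    """L2: cross-entropy between one-hot labels and predicted probabilities."""
    return -sum(
        t * math.log(p + eps)
        for ts, ps in zip(true_onehot, pred_probs)
        for t, p in zip(ts, ps)
    )


def multitask_loss(l1, l2, omega1):
    return omega1 * l1 + (1.0 - omega1) * l2


# Hypothetical embeddings and labels.
z = {0: [0.0, 0.0], 1: [0.0, 0.0], 2: [0.0, 0.0]}
l1 = adjacency_loss(z, pos_pairs=[(0, 1)], neg_pairs=[(0, 2)])
l2 = label_loss([[0.0, 1.0]], [[0.5, 0.5]])
total = multitask_loss(l1, l2, omega1=0.7)
```

Setting `omega1` closer to 1 emphasizes the adjacency task, while smaller values give more weight to label prediction.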

The above description is only a preferred embodiment of the graph representation learning method based on multi-source interaction fusion disclosed by the present invention, and is not intended to limit the scope of protection of the embodiments of this specification. Any modification, equivalent replacement or improvement made within the spirit and principle of the embodiments of this specification shall fall within that scope of protection.

Claims

1. A graph representation learning method based on multi-source interaction fusion is characterized by comprising the following steps:

2. The graph representation learning method based on the multi-source interaction fusion of claim 1, wherein each layer of sampling in the generation process of the BFS-based meta-path high-order neighborhood node sampling algorithm follows a meta-path mode, and each node passed by an intermediate step is reserved.

3. The graph representation learning method based on multi-source interaction fusion of claim 1, wherein in the generation process of the DFS-based meta-path high-order neighborhood node sampling algorithm, each step of wandering sampling follows a meta-path mode, and each node passed by an intermediate step is retained, and a generation strategy formula is as follows:

where the random function represents the walk-with-memory strategy; vⁱ represents the currently visited node and vⁱ⁺¹ the next node that may be visited; E represents the set of all edges in the graph; r_i ∈ R (0 ≤ i < L_R) represents the i-th node type in the meta-path pattern; and L_R represents the length of the meta-path.

4. The graph representation learning method based on multi-source interactive fusion of claim 1, wherein the first-order neighborhood information aggregation algorithm adds the relationship between nodes while preserving the structure information aggregation capability of GNNs and the information transfer characteristics between network nodes, and defines a new node update strategy as follows:

wherein

and

5. The graph representation learning method based on multi-source interaction fusion of claim 1, wherein the obtaining of the high-order neighborhood information of the nodes through the heterogeneous high-order neighborhood information aggregation algorithm specifically comprises the following steps:

wherein

the attention weight is then normalized using the SoftMax function:

finally, the Inter-Attention GNN network aggregation function is:

wherein DL is a manually set hyper-parameter representing the maximum number of paths under the DFS strategy,

is a learnable neural network parameter,

6. The graph representation learning method based on multi-source interactive fusion of claim 1, wherein the multi-source information fusion model based on the gated neural network has a fusion function of:

wherein

m and b are learnable parameters,

and

are, respectively, the high-order neighborhood information under the DFS strategy and the high-order neighborhood information under the BFS strategy at the l-th layer of the model.

7. The graph representation learning method based on multi-source interaction fusion of claim 1, wherein the multi-task optimization function is a combination of an adjacency optimization task and a node label prediction task:

L = ω₁L₁ + (1 - ω₁)L₂

wherein ω₁ is a hyper-parameter representing the proportion of the main task; the adjacency optimization task serves as the main task, and its optimization function is as follows: