CN114610950B

CN114610950B - Graph network node representation method

Info

Publication number: CN114610950B
Application number: CN202011416368.8A
Authority: CN
Inventors: 沈颖
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2020-12-04
Filing date: 2020-12-04
Publication date: 2023-11-07
Anticipated expiration: 2040-12-04
Also published as: CN114610950A

Abstract

The application relates to the technical field of graph networks, in particular to a graph network node representation method, which comprises the following steps: respectively obtaining attribute interaction expression vectors and neighborhood aggregation expression vectors of target nodes in a graph network; and splicing the attribute interaction expression vector and the neighborhood aggregation expression vector to obtain a final expression vector of the target node in the graph network. In the embodiment, the neighborhood aggregation mechanism of the graph network is considered to acquire the information of the additional information supplement node from the neighbor nodes, so that the learned target node representation has richer information and stronger differentiation.

Description

Graph network node representation method

Technical Field

The application relates to the technical field of graph networks, in particular to a graph network node representation method.

Background

The efficient representation of network nodes is a key issue in performing network analysis tasks. In recent years, researchers have proposed network representation learning aimed at learning low-dimensional dense vector representations for all nodes in a high-dimensional sparse network, preserving rich network information in order to perform subsequent network analysis tasks.

In the medical field, the traditional adjacent matrix graph network analysis method only considers the adjacent relation between nodes, and the high-dimensional characteristics of the adjacent matrix graph network analysis method bring huge calculation and storage cost, so that the clinical large-scale network containing mass facts is difficult to deal with. The existing network representation learning method is used for exploring neighborhood aggregation information, and most of the network representation learning method belongs to direct push learning rather than inductive learning, and node signals which do not exist cannot be captured. Clinically, however, negative symptoms often help to rule out possible diseases. The limited representation of clinical path characteristics results in bottlenecks in practical use.

At present, a plurality of network representation learning methods not only utilize the topology structure of a network, but also explore attribute information reflecting certain characteristics of the network. In a real-world network, node attributes often have the characteristic of high-dimensional sparsity, face the problem of dimension disasters, and limit the capacity of node representation.

Disclosure of Invention

The application mainly solves the technical problem that the node representation capability in the prior art is limited.

A graph network node representation method, comprising:

respectively obtaining attribute interaction expression vectors and neighborhood aggregation expression vectors of target nodes in a graph network;

and splicing the attribute interaction expression vector and the neighborhood aggregation expression vector to obtain a final expression vector of the target node in the graph network.

The obtaining the attribute interaction expression vector of the target node in the graph network comprises the following steps:

in the embedded layer, searching and calculating an embedded vector set corresponding to the non-zero attribute of the target node through an embedded table;

calculating second-order attribute interaction by adopting second-order interaction pooling operation, and compressing an embedded vector set of the target node into a single vector;

and inputting the single vector into a full-connection layer, stacking a plurality of full-connection layers, and taking the output of the last layer as the attribute interaction representation vector of the target node.

The searching and calculating the embedded vector set corresponding to the non-zero attribute of the target node through the embedded table in the embedded layer comprises the following steps:

for graph network g= (V, E, X), where V represents the set of all nodes of the network, E represents the set of edges in the network, the number of nodes and edges in the network are |v| and |e|, respectively; the attribute matrix of all nodes is: x= [ X ] ₁ ，x ₂ ，...，x _|V| ]E|V|xF, and searching through an embedded table to obtain a target node V _i Corresponding embedded vectors of all non-zero attributes of the set S are obtained to form an embedded vector set S _i ＝[x _i1 v ₁ ，x _i2 v ₂ ，...，x _ij v _j ]Wherein x is _ij Representing a target node v _i Is the j-th dimensional attribute value of (c).

The calculating the second-order attribute interaction by adopting the second-order interaction pooling operation, and compressing the embedded vector set of the target node into a single vector comprises:

calculating second-order attribute interaction in a dot product manner by adopting second-order interaction pooling operation, and compressing the embedded vector set into a single vector t _i ：

Wherein,representing attribute->And->Second order interactions between.

1. The graph network node representation method of claim 4, wherein inputting the single vector into the fully-connected layer, stacking a plurality of fully-connected layers, and taking the output of the last layer as the attribute interaction representation vector of the target node comprises:

the full-connection layer is as follows:

wherein the method comprises the steps ofRespectively a first layer weight matrix and an offset vector, wherein sigma (·) is a nonlinear activation function, and L' is represented;

after stacking a plurality of fully connected layers, the output of the last layer is taken as a target node v _i Attribute interaction representation vector f _i 。

The obtaining the neighborhood aggregation expression vector of the target node in the graph network comprises the following steps:

calculating a target node v _i With neighbor node v _i Is related to (a)

Wherein,

all nodes v with target by softmax function _i Adjacent node v _j Is related to (a)Normalization is performed to obtain the attention weighting coefficient +.>

By taking the attentionWeighting all neighbor nodes of the target node by the weight coefficient to obtain a target node v _i Is a neighborhood aggregate representation vector h _i ；

Wherein N is _i Representation and v _i A set of neighboring neighbor nodes, whereinRepresents the layer I output of the neural network, +.>Representing a weight matrix.

The graph network node representation method further comprises the following steps: calculating the attribute smoothness lambda of the current graph network according to the final representation vector _a Smoothness lambda according to the attribute _a Judging attribute similarity between the target node and the neighbor nodes;

wherein,

further, the method further comprises: calculating label smoothness lambda of graph network node _l According to the label smoothness lambda _l Judging the validity of neighbor information;

wherein,

further, the method further comprises the following steps: smoothness lambda according to the attribute _a And label smoothness lambda _l Calculating the noise S of all nodes in the graph network;

wherein sigma ² Representation squareAnd (3) difference.

Further, the method further comprises: for node classification tasks, calculating a loss function L by the following formula, and updating the weight by minimizing the loss function L;

wherein, C is the number of labels,to mark the number of the node, Y _lc ＝softmax(z _l ) The predicted probability vector after the input softmax classifier of the vector is represented for the node.

The graph network node representation method according to the above embodiment includes: respectively obtaining attribute interaction expression vectors and neighborhood aggregation expression vectors of target nodes in a graph network; and splicing the attribute interaction expression vector and the neighborhood aggregation expression vector to obtain a final expression vector of the target node in the graph network. In the embodiment, the neighborhood aggregation mechanism of the graph network is considered to acquire the information of the additional information supplement node from the neighbor nodes, so that the learned target node representation has richer information and stronger differentiation.

Drawings

FIG. 1 is a flow chart of a method for representing a graph network node in an embodiment of the application;

fig. 2 is a schematic diagram of a processing procedure of a graph network node representation method according to an embodiment of the present application.

Detailed Description

The application will be described in further detail below with reference to the drawings by means of specific embodiments. Wherein like elements in different embodiments are numbered alike in association. In the following embodiments, numerous specific details are set forth in order to provide a better understanding of the present application. However, one skilled in the art will readily recognize that some of the features may be omitted, or replaced by other elements, materials, or methods in different situations. In some instances, related operations of the present application have not been shown or described in the specification in order to avoid obscuring the core portions of the present application, and may be unnecessary to persons skilled in the art from a detailed description of the related operations, which may be presented in the description and general knowledge of one skilled in the art.

Furthermore, the described features, operations, or characteristics of the description may be combined in any suitable manner in various embodiments. Also, various steps or acts in the method descriptions may be interchanged or modified in a manner apparent to those of ordinary skill in the art. Thus, the various orders in the description and drawings are for clarity of description of only certain embodiments, and are not meant to be required orders unless otherwise indicated.

According to the method, the neighborhood aggregation mechanism of the graph neural network is considered to acquire additional information from the neighbor nodes to supplement node self information, and the neighborhood aggregation network is designed to fuse network structure and node attribute information by utilizing attribute smoothness and label smoothness, so that the learned node representation has richer information and stronger differentiation.

According to the application, node classification task experiments are carried out on a plurality of different types of network data sets, and the node representation method is verified to be capable of improving the utilization degree of network structure and attribute information, adapting to the characteristics of different networks, keeping good performance and improving the stability of a model.

In the embodiment of the application, the representation of the nodes in the graph network is thinned to two layers of attribute interaction state feature dimension and neighborhood aggregation state feature dimension for modeling so as to acquire more node information and improve the accuracy of the node information representation.

Referring to fig. 1 and 2, the present application provides a graph network node representation method, which includes:

step 101: respectively obtaining attribute interaction expression vectors and neighborhood aggregation expression vectors of target nodes in a graph network;

and 102, splicing the attribute interaction representation vector and the neighborhood aggregation representation vector to obtain a final representation vector of the target node in the graph network.

Given graph network g= (V, E, X) in this embodiment, where V represents the set of all nodes of the network, E represents the set of edges in the network, and the number of nodes and edges in the network are |v| and |e|, respectively.

The application uses X= [ X ] ₁ ，x ₂ ，...，x _|V| ]The E|V|x F is expressed as an attribute matrix of all nodes, and the dimension of the node attribute is F. Study node v _i Attribute x of (2) _i Mapping to an embedding vector set S by an embedding layer _i ＝[x _i1 v ₁ ，x _i2 v ₂ ，...，x _ij v _j ]Wherein x is _ij Representing node v _i Is the j-th dimensional attribute value of (c).

The topology information of the graph is then represented by the adjacency matrix a= {0,1} ^|V|×|V| And (3) representing. If node v _i And node v _ij There is a connecting edge e between _ij E, then a _ij =1; conversely, a _ij =0. Node v _i N for neighbor set of (2) _i Representation, N _i ＝{v _j |e _ij ∈E}。

Wherein in step 101, obtaining the attribute interaction expression vector of the target node in the graph network includes:

(1) In the embedded layer, the application maps each dimension feature of the high-dimension sparse attribute of the graph network to a low-dimension dense vector through embedded table lookup. Because of the sparseness of the graph network attributes, most elements in the graph network are 0. In order to improve the calculation efficiency, the application only calculates the embedded vector corresponding to the non-zero attribute, namely S _i ＝[x _ij v _j |x _ij ≠0]Wherein S is _i < F. The target node may be any node in the graph network.

(2) The goal of the interaction layer is to capture all attribute interaction relations of the nodes, the application uses a factor decomposition machine, adopts a second-order interaction pooling operation to calculate second-order attribute interaction in a dot product mode, and compresses an embedded vector set into a single vector through the following formula (1);

wherein +.,representing attribute->And->Second order interactions between. By summation pooling, a k-dimensional vector can be output that encodes the second order interactions between all attribute embeddings. The k-dimensional vector encodes the second order interactions between all attribute embeddings. The second-order interactive pooling operation does not introduce any additional parameters, and at the same time, the computational complexity can be reduced to linear complexity through mathematical transformation, the k-dimensional vector is weighted by the following formula (2), and the weight formula is obtained by the following formula (3).

(3) In order to better clarify interaction among different features, the method introduces a attention-aware pooling layer in an attribute interaction network, and performs weighted summation on each attribute interaction through a formula (2):

wherein the method comprises the steps ofIs node v _i Attribute interaction item->Weight coefficient of>Representing node attribute vector dot product. For learning the weight parameter->The application introduces a multi-layer perceptron to build an attention network. The input of the multi-layer perceptron is two attribute embedded vectors v _i And v _j ：

Wherein W is _f ∈d _f ×k,b _j ∈d _f ,r _f ∈d _f Weight parameters for model training, d _f Indicating the size of the hidden layer of the attention network. To learn weight parametersOne conventional idea is to solve it directly as a parameter to be learned by gradient descent, but this approach has a significant drawback: for attribute interaction items which never appear in the training set, the weight coefficient of the attribute interaction items cannot be learned, and the generalization capability is lacked. Thus, to solve the generalization problem, a Multi-Layer Perceptron (MLP) is introduced herein to construct an attention network (Attention Network), input as two attribute-embedded vectors v _i And v _j 。

(4) In order to learn the higher-order interaction relation between the attributes, the application pools the second-order interaction vector t _i Inputting a full connection layer, stacking a plurality of full connection layers, and taking the output of the last layer as an attribute interaction expression vector f of the node _i . The definition of the full connection layer in this embodiment is as follows:

wherein the method comprises the steps ofThe first layer weight matrix and the offset vector are respectively, sigma (·) is a nonlinear activation function, and L' represents the output of the last layer after stacking a plurality of fully connected layers. After stacking multiple fully connected layers, the output of the last layer can be regarded as the target node v _i Attribute interaction representation vector f _i 。

Further, the following focuses on the acquisition of the neighborhood aggregation expression vector in the research layer of the neighborhood aggregation module.

(1) The present embodiment uses a graph convolutional neural network to learn a mapping function f (X, a) that takes as input the adjacency matrix and the attribute matrix. The propagation formula between layers of the graph roll-up neural network is as follows:

wherein the method comprises the steps ofI is an identity matrix; a is an adjacent matrix, which itself represents the link relation between each node and the adjacent node, and the matrix is obtained after I is added>The graph rolling operation object comprises information of own nodes and adjacent nodes; />Is the degree matrix of the network,/>The element value on the diagonal is perThe degree of each vertex, the remaining elements are 0. To prevent gradient extinction and gradient explosion, a symmetric normalization operation was introduced: />So that the sum of each row of the matrix is 1; w (W) ^(l) Is a trainable layer-1 weight matrix; h ^(l) Is an implicit state of the first layer, for input layer H ⁽⁽⁾⁾ =x; σ (·) represents a nonlinear activation function, e.g., RELU (·) =max (0, ·). By stacking multiple graph convolutional neural network layers, the relationship between a node and neighboring nodes that are multi-hop away can be captured.

(2) Based on a two-layer graph convolution neural network, the embodiment performs symmetrical normalization on the adjacency matrix according to the following formula (6) in a data preprocessing stage.

(3) Then the processed adjacency matrixAnd the attribute matrix X is input into a two-layer graph roll-up neural network. After two layers of graph convolution, obtaining an hidden layer matrix H= [ H ] coded by a graph convolution network ₁ ，h ₂ ，...h _|V| ]. Wherein the forward propagation model is as shown in the following formula (7):

wherein W is ⁽⁰⁾ E F x d is a first layer weight matrix for mapping the node attribute to hidden state, W ⁽¹⁾ ∈d ₀ ×d ₁ Is the weight matrix of the second hidden layer.

(4) The application provides a smoothness-aware graph annotation meaning network, which adopts a weighted sum as an aggregation function, adopts a splicing mode as a combination function, and learns weights among nodes by using a self-attention mechanism, so that the performance of different graph networks and node representations is improved.

The core of the network is a graph attention layer, and the input node vector passes through the graph attention layer to generate a new node vector. Specifically, at layer l, the target node v _i The input of (2) isThe new target node vector of its output is +.>Its neighbor node v _j Input of +.>The new neighbor node vector output by it is +.>Specifically, the vector is obtained by linearly transforming the input vector by the following formula (8).

Wherein the method comprises the steps ofAnd->For the weight matrix to be trained, d _k Is vector quantityAnd->Is a dimension of (c). />Is composed of->And->The nodes with more different attributes between adjacent nodes are obtained by subtraction calculation after linear transformation, namely, the nodes with more different attributes contain more information and higher weight is obtained.

Wherein, in step 102, obtaining a neighborhood aggregate representation vector of a target node in the graph network includes:

the target node v is calculated by the following formula (9) _i With neighbor node v _j Is related to (a)

Wherein,

wherein,the target node vector passes through the attention layer of the graph and then outputs a new target node vector, wherein +_>Is a new neighbor node vector output after the neighbor node vector passes through the attention layer of the graph.

The application introduces the masked attention into the graph structure and distributes the attention to the target node v _i Adjacent neighbor node set N _i And pair all and v by softmax function _i Adjacent node v _j Normalized to obtain the attention coefficient

The present embodiment utilizes a trainable weight matrixAnd splicing operation, namely weighting neighbor nodes by adopting attention weight coefficients, and providing an innovative graph neural network neighborhood aggregation mechanism to learn and obtain a target node v _i Is a neighborhood aggregate representation vector h _i ；

Wherein,represents the layer I output of the neural network, +.>Also representing the weight matrix, ">And->And the weight matrixes are all represented, and are randomly initialized, and the most suitable weight matrix is automatically learned by continuous iteration of training, so that the model is optimal, and an optimal result is obtained. This is an important part of neural network training. Wherein (1)>And->The outputs of the (l-1) th layers of the i-th and j-th neural network nodes, respectively.

Equation (11) is a graph neural network neighborhood aggregation mechanism provided by the application. In contrast to prior art models, in the aggregation operation, the graph nervesNetwork (GNN)Graph convolutional neural network (GCN)>Andno attention mechanism is used. Whereas the existing graphical attention network (GAT)/(GAT)>Unlike the attention mechanism used by the proposed method of the present application: the attention mechanism used by the application utilizes the network smoothness information, thereby optimizing the introduction of neighbor information; secondly, GAT uses an additive attention model, while the dot product model used by the present application is more beneficial to learning interactions between attributes. The relationship between the current node and its neighborhood nodes is learned through an aggregation mechanism.

Finally, the present application represents the attribute interaction vector f _i And neighborhood aggregate representation vector h _i Splicing to obtain final representation vector of graph network node

Wherein the attribute interactions represent vectors: taking the medical field as an example, one of complications of cerebral apoplexy is pulmonary infection, and the attribute coincidence degree between cerebral apoplexy (central node) and pulmonary infection (neighbor node) is very low, so that the introduction of the node information of pulmonary infection is beneficial to better learning the clinical characterization of cerebral apoplexy. f (f) _i Vector information representing stroke and pulmonary infection is presented herein. Neighborhood aggregation represents a vector: still taking the medical field as an example, the label of cerebral stroke is "disease", and a patient in the neighborhood with a label of "politics" suffers from this disease. The information of "politics" is not helpful in understanding the disease and may introduce noise-like information about major events in the relevant country. In this example, h _i Only stroke vectors, and not the political figure vectors.The attribute interaction representation vector and the overall vector representation of the neighborhood aggregate representation vector are considered together. It should be noted that the illustration is given here only by way of example in the medical field, and does not represent that the method of the application can only be applied in the medical field, and that the method of the application is applied in the medical field only by way of representation of network nodes, so that it has a richer information for providing reference information to a doctor, and is not directly used for diagnosis and treatment of diseases.

In the combination operation, the GCN and the GAT combine the aggregation operation and the combination operation into a whole, and aggregate the neighbor node set and the self node in a weighted aggregation mode to finally obtain the node representation. The graph SAGE and the method of the application combine the representation of the upper layer of the node with the representation of the neighbor node in a splicing mode, so that the updated final representation vector of the node is obtained, and the operation can better keep the information of the node and the relation between the input of the current neural network and the output of the upper layer of the neural network. The obtained graph network node of the embodiment is used for tasks such as downstream task link prediction, node classification and the like. Compared with the existing method, the method has the advantages that:

1) Node information of the graph network data is better characterized;

2) A new domain aggregation mechanism is proposed;

3) A joint training method for mashup tag smoothness and attribute smoothness is provided.

Furthermore, the application provides a training method of the model provided by the embodiment, and the method introduces the attribute smoothness and the label smoothness of the graph network, so that the isomorphic effect of the neighborhood network is utilized to more fully explore the interaction relationship between the heterogeneous topological structure and the node attribute information in the clinical attribute graph network.

Further, the method of the application further comprises: calculating the attribute smoothness lambda of the current graph network according to the final representation vector _a According to the attributeSmoothness lambda _a And judging attribute similarity between the target node and the neighbor nodes thereof. The method specifically comprises the following steps:

definition 1. Aggregation operation of the graph network should capture neighbor node information dissimilar to the attribute of the central node as much as possible so as to promote the introduced information quantity and enhance the representation capability of the nodes.

Given a clinical attribute map network G and its attribute matrix X, the present application defines the attribute smoothness as follows:

Z _i representing attribute interaction representation vectors, z _j Representing a neighborhood aggregate representation vector, in general, the neighbor node attributes and the attributes of the center node follow the same distribution. Lambda (lambda) _a The larger the attribute signal, the higher the frequency of the attribute signal, the more dissimilar the attribute between the nodes and the neighbor nodes in the graph network, and the larger the amount of information introduced.

Further, the method of the present application further comprises calculating the label smoothness lambda of the graph network node _l According to the label smoothness lambda _l And judging the validity of the neighbor information. The method specifically comprises the following steps:

and 2, retaining the information of the nodes in a neighborhood aggregation mechanism, and preventing excessive interference information from being introduced due to overlarge smoothness of the labels.

The label smoothness designed by the application is used for analyzing the validity of neighbor information. In the node classification task, each node v _i The label corresponding to E V is y _i . In the neighborhood aggregation mechanism, attribute input of a neighbor node can be decomposed into the following two parts according to labels:

wherein I (·) represents an indication function,neighbor node representing the same label as the current node, +.>Representing neighbor nodes of different labels than the current node. The application is summarized as follows: the neighbor nodes of the same label provide useful information, while the neighbor nodes of different labels provide interference information. Thus, given a network of attributes, the present application obtains its label smoothness by the following equation (14):

wherein lambda is _l The smaller the label the more similar the neighbor node is to the central node, the more valid information is obtained from the neighbor node.

Further, the method of the application further comprises: according to the following formula (15), the smoothness lambda is obtained according to the attribute _a And label smoothness lambda _l Calculating the noise S of all nodes in the graph network;

wherein sigma ² Representing the variance.

Specifically, the exploration thought of the noise S in the application is as follows: it is assumed that the noise of all nodes in the graph network obeys the same distribution, and that the noise distribution and the signal distribution are independent of each other. If the noise is random, the energy is defined as the variance σ ² Node v _i The first layer neighborhood attribute input isThe noise energy of the neighborhood aggregation attribute input is +.>

Given a given(true signal)>(noise) and c _i (node v) _i Information of itself), since noise is random noise, the average value is 0, then:

expected formula from varianceWeighted aggregated inputThe noise of (2) is:

further, considering the tag smoothness and the node smoothness, the noise can be expressed as:

further, in terms of the loss function, for the node classification task, the cross entropy of all labeled nodes is used as the loss function, see equation (16).

Wherein C is the number of tags to be attached,to mark the number of the node, Y _lc ＝softmax(z _l ) Representing the input of a vector for a nodePredictive probability vectors after entering the softmax classifier. The weight update is performed by minimizing the above loss function.

The foregoing description of the application has been presented for purposes of illustration and description, and is not intended to be limiting. Several simple deductions, modifications or substitutions may also be made by a person skilled in the art to which the application pertains, based on the idea of the application.

Claims

1. A graph network node representation method, comprising:

acquiring complications of cerebral apoplexy, wherein the complications are pulmonary infection, the cerebral apoplexy is taken as a central node, and the pulmonary infection is taken as a neighbor node to determine attribute interaction expression vectors of the cerebral apoplexy and the pulmonary infection;

respectively obtaining attribute interaction expression vectors of cerebral stroke and pulmonary infection of target nodes in a graph network and neighborhood aggregation expression vectors of cerebral stroke;

for graph networksThe method comprises the steps of carrying out a first treatment on the surface of the V represents the set of all nodes of the network, E represents the set of edges in the network, the number of nodes and edges in the network are +.>And->；

The attribute matrix of all nodes is:the method comprises the steps of carrying out a first treatment on the surface of the F represents the dimension of node attributes, +.>Representing the attributes of the nodes;

target node is obtained through embedded table lookupObtaining all embedded vectors corresponding to all non-zero attributes to form an embedded vector set +.>；/>Representing the target node +.>Is>A dimension attribute value;

computing second order attribute interactions in a dot product manner using a second order interaction pooling operation, compressing the set of embedded vectors into a single vector：

；

Wherein,representing attribute->And->A second order interaction relationship between the two;

inputting the single vector into the full connection layer, stacking a plurality of full connection layers, and taking the output of the last layer as a target nodeAttribute interaction representation vector +.>；

The full-connection layer is as follows:

；

wherein,is->Layer weight matrix,/->Is->Layer offset vector->As a function of the non-linear activation,representing the output of the last layer after stacking a plurality of fully connected layers, < >>Indicate->The value of a single vector of layers,indicate->The value of a single vector of layers,/>Representing a single vector of layer 0Value of->Values representing individual vectors of the last layer after a plurality of fully connected layers; computing target node->And neighbor node->Correlation of->；

；

Wherein, ELU represents the activation function,a new target node vector which is output after the target node passes through the attention layer of the graph is represented, T represents transposition,/->Representing a new neighbor node vector output by a neighbor node through a graph attention layer;

all nodes with target node through softmax functionAdjacent node->Correlation of->Normalization is performed to obtain the attention weighting coefficient +.>：

；

Weighting all neighbor nodes of the target node by adopting the attention weight coefficient to obtain the target nodeNeighborhood aggregate representation vector +.>，/>Representation and->A set of neighboring neighbor nodes; r represents->Any node in the network;

；

wherein,represents the->Layer output->Representing a weight matrix, +.>Representation and->Neighboring neighbor nodesGather (S)>And->The (th) of the (th) and (th) neural network nodes respectively>Layer output;

and splicing the attribute interaction expression vector and the neighborhood aggregation expression vector to obtain a final expression vector of the target node in the graph network, so as to display the final expression vector to a doctor.

2. The graph network node representation method of claim 1, wherein the graph network node representation method further comprises: calculating the attribute smoothness of the current graph network according to the final representation vectorSmoothness according to the attribute->Judging attribute similarity between the target node and the neighbor nodes;

wherein,。

3. the graph network node representation method of claim 2, further comprising: calculating label smoothness for graph network nodesAccording to the tag smoothness +.>Judging the validity of neighbor information;

wherein,representing the target node +.>Corresponding tag->Representing the target node +.>Adjacent node->A corresponding tag.

4. A graph network node representation method in accordance with claim 3, further comprising: smoothness according to the attributeAnd tag smoothness +.>Calculating noise for all nodes in a graph networkS；

；

Wherein,representing the variance.

5. The graph network node representation method of claim 4, further comprising: for node classification tasks, calculating a loss function L by the following formula, and updating the weight by minimizing the loss function L;

；

wherein, C is the number of labels,for marking the number of the node>The predicted probability vector after the input softmax classifier of the vector is represented for the node.