CN114625881A - Economic field knowledge graph completion algorithm based on graph attention mechanism - Google Patents

Economic field knowledge graph completion algorithm based on graph attention mechanism

Info

Publication number
CN114625881A
CN114625881A CN202111471322.0A CN202111471322A
Authority
CN
China
Prior art keywords
entity
erp
embedding
gat
equation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111471322.0A
Other languages
Chinese (zh)
Inventor
贾海涛
邢增桓
高源�
李家伟
林思远
王树臣
梁晓程
许文波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yangtze River Delta Research Institute of UESTC Huzhou
Original Assignee
Yangtze River Delta Research Institute of UESTC Huzhou
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yangtze River Delta Research Institute of UESTC Huzhou filed Critical Yangtze River Delta Research Institute of UESTC Huzhou
Priority to CN202111471322.0A priority Critical patent/CN114625881A/en
Publication of CN114625881A publication Critical patent/CN114625881A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Development Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Accounting & Taxation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Economics (AREA)
  • Finance (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an ERP-GAT-based knowledge graph completion algorithm for the economic field. The ERP-GAT algorithm adopts an encoder-decoder structure. A graph attention mechanism is introduced into the encoder: the entity embedding matrix and the relation embedding matrix are taken as input, the attention scores of the triples adjacent to each target entity are calculated, and the embedding matrices are updated. In this way the model can capture the multi-hop relations around a given entity or node as well as the rich semantic information near the given entity and the roles the entity plays in those relations, and can consolidate relation clusters that are semantically similar in the existing knowledge. The decoder uses the ConvKB model, in which a convolutional layer provides the score function that analyzes the global embedding features in each dimension and generalizes the transitional characteristics of the ERP-GAT model. Finally, compared with other existing algorithms, five indexes on the standard data set FB15K-237 and four indexes on NELL-995 are significantly improved, achieving the best results on the knowledge graph completion task.

Description

Economic field knowledge graph completion algorithm based on graph attention mechanism
Technical Field
The invention belongs to the field of natural language processing.
Background
The mainstream approach to knowledge graph completion is to infer new entities, relations, rules and knowledge from the existing ones and to predict whether a given triple is valid. Traditional knowledge-embedding models based on Convolutional Neural Networks (CNN) can learn higher-quality embeddings thanks to their parameter efficiency and their ability to model complex relations. However, a CNN considers each triple independently, ignores the rich semantic information and potential relations near a given entity in the knowledge graph, and does not model the relations between triples. The R-GCN can collect information from the neighbors of a given entity by convolving over each entity's neighborhood; however, R-GCN assigns equal weights to all neighbors, has a bottleneck when processing directed graphs, and cannot handle dynamic graphs.
Disclosure of Invention
The invention provides an economic field knowledge graph completion algorithm based on a graph attention mechanism. The contents are as follows:
(1) An ERP-GAT baseline algorithm and an improved algorithm are given first, together with the corresponding overall framework diagram.
(2) The baseline models and the improved ERP-GAT model are then tested on two public data sets (FB15K-237 and NELL-995).
(3) Finally, the effectiveness of the ERP-GAT algorithm is verified through experimental analysis; the experimental results show that the ERP-GAT algorithm effectively improves the MR, MRR and Hits@N indexes of the relation prediction task.
Drawings
FIG. 1 is an overall block diagram of the algorithm of the present invention.
FIG. 2 is the network structure of the graph attention layer of the present invention.
FIG. 3 is a process for calculating attention values for a triplet of interest using the model of the present invention.
FIG. 4 shows the data sets used in the experiments on the algorithm of the present invention.
FIG. 5 is a graph of the relation prediction results on the NELL-995 data set according to the present invention.
FIG. 6 is a graph of the relation prediction results on the FB15K-237 data set according to the present invention.
Detailed Description
The mainstream approach to knowledge graph completion is to infer new entities, relations, rules and knowledge from the existing ones and to predict whether a given triple is valid. Traditional knowledge-embedding models based on Convolutional Neural Networks (CNN) can learn higher-quality embeddings thanks to their parameter efficiency and their ability to model complex relations; however, a CNN processes each triple independently and ignores the rich semantic information and potential relations near a given entity in the knowledge graph. The R-GCN can collect information from the neighbors of a given entity by convolving over each entity's neighborhood; however, R-GCN assigns equal weights to all neighbors, has a bottleneck when processing directed graphs, and cannot handle dynamic graphs. Existing methods either learn knowledge graph embeddings from entity features alone or treat entity and relation features separately. The ERP-GAT algorithm proposed here comprehensively captures the semantic similarity relations of the single-hop and multi-hop neighbors of any given entity in the knowledge graph.
The idea of the algorithm is described below, followed by its specific steps.
First, the problems left unsolved by CNN- and GCN-based relation prediction algorithms are briefly analyzed; on that basis a solution is proposed and the design framework of the ERP-GAT algorithm is introduced (shown in FIG. 1). Then ERP-GAT is described in detail, including how it obtains the multi-hop relations around a given entity or node, the rich semantic information near the given entity and the roles played in those relations, and how it consolidates semantically similar relation clusters in the existing knowledge. Finally, experiments and result analyses are performed with the reference models (TransE, ConvKB, R-GCN, etc.) and the improved ERP-GAT model on two public data sets (FB15K-237 and NELL-995), comparing the MR, MRR and Hits@N metrics. The effectiveness of the ERP-GAT algorithm is verified through experimental analysis, and the results show that ERP-GAT effectively improves the MR, MRR and Hits@N indexes of the relation prediction task.
As shown in FIG. 1, the graph-attention-based knowledge graph completion method (ERP-GAT) first takes the two embedding matrices as input. The graph attention layer then calculates the attention scores of all triples adjacent to a given target entity and updates the embedding matrices; after the graph attention layer, the encoder is trained with its loss function. The decoder part follows: after the CNN layer, the decoder is trained with its own loss function, and the knowledge graph completion result is finally obtained.
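To make the data flow of FIG. 1 concrete, the following minimal sketch (in PyTorch) only traces the tensor shapes of the two stages; all sizes and placeholder tensors are illustrative assumptions and do not come from the patent:

```python
import torch

# Illustrative sizes (assumptions, not values from the patent).
N_e, N_r = 1000, 50          # number of entities / relations
T, P, T_out = 100, 100, 200  # input entity/relation dims, encoder output dim

# Encoder input: entity embedding matrix H and relation embedding matrix G.
H = torch.randn(N_e, T)      # row i = embedding vector of entity e_i
G = torch.randn(N_r, P)      # row k = embedding vector of relation r_k

# Stage 1 (graph attention encoder): scores the triples adjacent to each
# target entity, updates the embeddings, and is trained with its own loss.
H_prime = torch.randn(N_e, T_out)   # placeholder for the updated entity matrix
G_prime = torch.randn(N_r, T_out)   # placeholder for the updated relation matrix

# Stage 2 (ConvKB decoder): scores candidate triples from the updated
# embeddings; ranking the scores gives the completion result.
scores = torch.randn(4096)          # placeholder: one score per candidate triple
print(H.shape, G.shape, H_prime.shape, G_prime.shape, scores.shape)
```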
The method comprises the following specific steps:
Step one: calculating the attention scores of the triples adjacent to each target entity
In a knowledge graph, an entity plays different roles depending on the relation in the current triple. For example, in the triple {Liu Qiangdong, chief executive officer, JD.com} and the triple {Liu Qiangdong, husband, Zhang Zetian}, the entity "Liu Qiangdong" appears in two different triples and, because of the different relations, plays the two roles of "chief executive officer" and "husband" respectively. To handle this phenomenon, the ERP-GAT algorithm uses a graph attention layer. The graph attention layer takes two embedding matrices as input, where
H \in \mathbb{R}^{N_e \times T} represents the entity embedding matrix, whose i-th row is the embedding vector of entity e_i, N_e is the total number of entities, and T is the feature dimension of each entity embedding vector; G \in \mathbb{R}^{N_r \times P} represents the relation embedding matrix, where N_r is the total number of relations and P is the feature dimension of each relation embedding vector. The graph attention layer outputs the two corresponding updated embedding matrices, denoted H' and G' respectively.
To obtain the new embedding of an entity e_i, the ERP-GAT model applies a linear transformation to the concatenated entity and relation feature vectors of every triple connected to the target entity, producing a representation c_{ijk} that is used to learn each triple associated with entity e_i, as shown in equation (1):
\vec{c}_{ijk} = W_1 \left[ \vec{h}_i \,\Vert\, \vec{h}_j \,\Vert\, \vec{g}_k \right]    (1)
where \vec{c}_{ijk} is the vector representation of the triple t^k_{ij} = (e_i, r_k, e_j); \vec{h}_i, \vec{h}_j and \vec{g}_k are the embedding vectors of the entities e_i, e_j and the relation r_k respectively; W_1 is a linear transformation matrix.
To measure the importance of the triples associated with the target entity, the ERP-GAT model defines an importance weight \alpha_{ijk} for each triple: a linear transformation matrix W_2 is applied to \vec{c}_{ijk}, the result is passed through a LeakyReLU function to obtain the attention score, and the scores are normalized with a softmax function, as shown in equation (2):
\alpha_{ijk} = \operatorname{softmax}_{jk}\big( \operatorname{LeakyReLU}( W_2\, \vec{c}_{ijk} ) \big)    (2)
where the softmax normalization runs over all triples adjacent to entity e_i.
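As a concrete illustration of equations (1) and (2), the sketch below (PyTorch) computes c_{ijk} and alpha_{ijk} for the triples adjacent to one target entity. The sizes, the random parameters W1 and W2, and the toy neighborhood are assumptions made for the example, not values taken from the patent:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Illustrative sizes (assumptions).
N_e, N_r, T, T_c = 6, 3, 8, 16   # entities, relations, input dim, dim of c_ijk

H = torch.randn(N_e, T)          # entity embedding matrix
G = torch.randn(N_r, T)          # relation embedding matrix (P = T here)

# Triples (e_i, r_k, e_j) adjacent to the target entity e_0.
neigh = torch.tensor([[0, 1, 2],
                      [0, 0, 3],
                      [0, 2, 5]])

W1 = torch.randn(3 * T, T_c)     # linear transform of the concatenated triple
W2 = torch.randn(T_c, 1)         # linear transform producing a scalar score

# Equation (1): c_ijk = W1 [h_i || h_j || g_k]
h_i = H[neigh[:, 0]]
g_k = G[neigh[:, 1]]
h_j = H[neigh[:, 2]]
c_ijk = torch.cat([h_i, h_j, g_k], dim=-1) @ W1         # (num_triples, T_c)

# Equation (2): attention = softmax(LeakyReLU(W2 c_ijk)) over e_0's triples
b_ijk = F.leaky_relu(c_ijk @ W2, negative_slope=0.2)    # (num_triples, 1)
alpha_ijk = torch.softmax(b_ijk, dim=0)                 # normalized importance

print(alpha_ijk.squeeze(-1))     # importance of each adjacent triple
```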
step two: updating an embedded matrix
In order to solve the problems that a CNN considers each triple independently, ignores the rich semantic information and potential relations near a given entity in the knowledge graph, and does not model the relations between triples, the ERP-GAT model uses a multi-head attention mechanism to stabilize the learning process and to encapsulate more information from the triples near the given entity. Each of the M attention heads computes an updated embedding vector, and the head outputs are concatenated to obtain the updated entity embedding vector \vec{h}'_i, as shown in equation (3):
\vec{h}'_i = \Big\Vert_{m=1}^{M} \sigma \Big( \sum_{j \in \mathcal{N}_i} \sum_{k \in \mathcal{R}_{ij}} \alpha^{m}_{ijk}\, \vec{c}^{\,m}_{ijk} \Big)    (3)
where \mathcal{N}_i is the set of neighboring entities of e_i, \mathcal{R}_{ij} is the set of relations connecting e_i and e_j, \sigma is a nonlinear activation and M is the number of attention heads. Specifically, in the last graph attention layer the ERP-GAT model averages the embedding vectors of the M heads instead of concatenating them, to obtain the final output entity embedding vector, as shown in equation (4):
\vec{h}'_i = \sigma \Big( \frac{1}{M} \sum_{m=1}^{M} \sum_{j \in \mathcal{N}_i} \sum_{k \in \mathcal{R}_{ij}} \alpha^{m}_{ijk}\, \vec{c}^{\,m}_{ijk} \Big)    (4)
The relation embedding matrix is transformed with a linear transformation matrix W^R \in \mathbb{R}^{T \times T'}, where T' is the feature dimension of the output relation embedding matrix, as shown in equation (5):
G' = G\, W^R    (5)
The encoder is trained using hinge loss as the loss function, as shown in equation (6):
L = \sum_{t_{ij} \in S} \sum_{t'_{ij} \in S'} \max \big\{ d_{t_{ij}} - d_{t'_{ij}} + \gamma,\ 0 \big\}    (6)
where d_{t_{ij}} = \lVert \vec{h}_i + \vec{g}_k - \vec{h}_j \rVert_1 is the translational distance of a triple, \gamma > 0 is a margin hyper-parameter, S represents the set of correct triples and S' the set of incorrect triples.
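The following sketch illustrates equations (3), (4) and (6): the multi-head aggregation with concatenation and averaging, and the hinge loss over one valid/corrupted triple pair. The per-head attention weights and triple representations are random placeholders standing in for the quantities of step one, the activation sigma is taken to be a sigmoid, and the L1 translational distance is an assumption; none of the names come from the patent:

```python
import torch

torch.manual_seed(0)

M, num_triples, T_c = 4, 3, 16   # attention heads, triples adjacent to e_i, dim of c_ijk

# Placeholders for the per-head quantities computed in step one:
# alpha[m, n] = attention weight of the n-th adjacent triple under head m,
# c[m, n, :]  = triple representation c_ijk under head m.
alpha = torch.softmax(torch.randn(M, num_triples), dim=1)
c = torch.randn(M, num_triples, T_c)

# Weighted sum over the neighborhood for every head -> (M, T_c).
head_out = torch.einsum('mn,mnd->md', alpha, c)

# Equation (3): intermediate layers concatenate the M head outputs.
h_concat = torch.sigmoid(head_out).reshape(-1)        # (M * T_c,)

# Equation (4): the last layer averages the head outputs instead.
h_i_new = torch.sigmoid(head_out.mean(dim=0))         # (T_c,)

# Equation (6): hinge loss over a valid triple and a corrupted triple,
# using the L1 translational distance d = ||h_i + g_k - h_j||_1.
def distance(h_head, g_rel, h_tail):
    return (h_head + g_rel - h_tail).abs().sum()

h_i, g_k, h_j = h_i_new, torch.randn(T_c), torch.randn(T_c)  # valid triple (e_i, r_k, e_j)
h_j_bad = torch.randn(T_c)                                   # corrupted tail entity

gamma = 1.0
loss = torch.clamp(distance(h_i, g_k, h_j) - distance(h_i, g_k, h_j_bad) + gamma, min=0.0)
print(h_concat.shape, h_i_new.shape, loss.item())
```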
Step three: decoder
The ERP-GAT model uses ConvKB as the decoder: a convolutional layer provides the score function that analyzes the global embedding features of a triple in each dimension and generalizes the transitional characteristics of the ERP-GAT model. Each triple t^k_{ij} = (e_i, r_k, e_j) is represented as the matrix [\vec{h}_i, \vec{g}_k, \vec{h}_j] and passed through the convolution filters; the filter outputs are passed through a ReLU function and concatenated, and a final linear transformation with the weight vector W produces the score f(t^k_{ij}), as shown in equation (7):
f(t^k_{ij}) = \Big( \Big\Vert_{m=1}^{\Omega} \operatorname{ReLU}\big( [\vec{h}_i, \vec{g}_k, \vec{h}_j] * \omega^m \big) \Big) \cdot W    (7)
where \Omega denotes the number of convolution filters, \omega^m denotes the m-th convolution filter, and * denotes the convolution operation.
The decoder is trained using the soft-margin loss as the loss function, as shown in equation (8):
\mathcal{L} = \sum_{t^k_{ij} \in S \cup S'} \log \Big( 1 + \exp \big( l_{t^k_{ij}} \cdot f(t^k_{ij}) \big) \Big) + \frac{\lambda}{2} \lVert W \rVert_2^2    (8)
where l_{t^k_{ij}} = 1 for t^k_{ij} \in S, l_{t^k_{ij}} = -1 for t^k_{ij} \in S', and \lambda is the L2 regularization coefficient.
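A minimal sketch of the ConvKB score of equation (7) and the soft-margin loss of equation (8) is given below (PyTorch). The filter count, dimensions and labels are assumptions chosen for illustration; the convolution over the 3-column triple matrix [h_i, g_k, h_j] is implemented with torch.nn.Conv2d and 1x3 filters:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

T_out, Omega = 8, 4                              # embedding dim, number of filters
conv = nn.Conv2d(1, Omega, kernel_size=(1, 3))   # the convolution filters omega^m
W = nn.Linear(Omega * T_out, 1, bias=False)      # final linear transformation W

def convkb_score(h_i, g_k, h_j):
    # Stack each triple as a (T_out, 3) matrix [h_i, g_k, h_j].
    x = torch.stack([h_i, g_k, h_j], dim=-1).unsqueeze(1)  # (B, 1, T_out, 3)
    # Equation (7): concatenate the ReLU'd filter outputs, then apply W.
    feat = torch.relu(conv(x)).reshape(x.shape[0], -1)     # (B, Omega * T_out)
    return W(feat).squeeze(-1)                             # (B,) scores f(t)

B = 4
h_i, g_k, h_j = (torch.randn(B, T_out) for _ in range(3))
labels = torch.tensor([1.0, 1.0, -1.0, -1.0])   # +1 for triples in S, -1 for S'

scores = convkb_score(h_i, g_k, h_j)

# Equation (8): soft-margin loss with L2 regularization on W.
lam = 1e-3
loss = torch.log1p(torch.exp(labels * scores)).sum() + lam / 2 * W.weight.norm(p=2) ** 2
print(loss.item())
```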
step four: results and analysis of the experiments
(1) Experimental data set
To verify the effectiveness of the algorithm presented herein, the common standard data sets FB15K-237 and NELL-995, widely used by researchers in the field of knowledge graph completion, are used. The specific information of the data sets is shown in FIG. 4.
(2) Evaluation index
The indicators commonly used in knowledge graph representation learning research, namely MR, MRR, Hits@1, Hits@3 and Hits@10, are used herein. Higher MRR and Hits@N values indicate better prediction, while a lower MR value indicates better prediction. MRR is the average of the reciprocals of the ranks of the correct entities over the set of test triples Q, as shown in equation (9):
\mathrm{MRR} = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \frac{1}{\mathrm{rank}_i}    (9)
where \mathrm{rank}_i denotes the relation prediction rank of the i-th triple.
The MR calculation formula is shown in equation (10):
\mathrm{MR} = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \mathrm{rank}_i    (10)
Hits@N is the average proportion of triples whose rank in the relation prediction is not higher than N, as shown in equation (11):
\mathrm{Hits@}N = \frac{1}{|Q|} \sum_{i=1}^{|Q|} \mathbb{I}\big( \mathrm{rank}_i \le N \big)    (11)
where \mathbb{I}(\cdot) denotes the indicator function, equal to 1 if the condition is true and 0 otherwise; N is set to 1, 3 and 10 for evaluation.
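The three evaluation metrics of equations (9)-(11) can be computed directly from the list of ranks of the correct entities; a plain-Python sketch with made-up example ranks (purely illustrative, not experimental results) follows:

```python
def mr(ranks):
    # Equation (10): mean rank of the correct entity.
    return sum(ranks) / len(ranks)

def mrr(ranks):
    # Equation (9): mean reciprocal rank.
    return sum(1.0 / r for r in ranks) / len(ranks)

def hits_at_n(ranks, n):
    # Equation (11): proportion of test triples whose rank is at most n.
    return sum(1 for r in ranks if r <= n) / len(ranks)

# Example ranks of the correct entity for a handful of test triples
# (illustrative numbers only).
ranks = [1, 3, 2, 15, 7, 1, 120]
print(mr(ranks), mrr(ranks), hits_at_n(ranks, 1), hits_at_n(ranks, 3), hits_at_n(ranks, 10))
```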
(3) Experimental setup
The experimental goal herein is, for each test triple, to replace the head entity with every valid entity to form triples {e'_i, r_k, e_j}, or to replace the tail entity to form triples {e_i, r_k, e'_j}; the replaced (invalid) triples and the unique valid triple before replacement are then gathered into one candidate set, and all triples in the set are scored and ranked.
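The evaluation protocol described above (replacing the head or the tail of each test triple with every valid entity, keeping the unique valid triple in the candidate set, and ranking the whole set) can be sketched as follows. The entity identifiers and the stand-in scoring function are placeholders; in the actual experiments the ConvKB score of equation (7) would be used:

```python
def candidate_sets(test_triple, all_entities):
    """Build the head- and tail-replacement candidate sets for one test triple,
    each containing the unique valid triple plus all replaced variants."""
    e_i, r_k, e_j = test_triple
    head_set = [(e, r_k, e_j) for e in all_entities]   # {e'_i, r_k, e_j}
    tail_set = [(e_i, r_k, e) for e in all_entities]   # {e_i, r_k, e'_j}
    return head_set, tail_set

def rank_of_valid(candidates, valid, score_fn):
    # Rank candidates by score (assuming higher = more plausible) and
    # return the 1-based rank of the valid triple.
    ordered = sorted(candidates, key=score_fn, reverse=True)
    return ordered.index(valid) + 1

# Toy usage with a placeholder scoring function.
entities = ["e0", "e1", "e2", "e3"]
test = ("e0", "r1", "e2")
score_fn = lambda t: 1.0 if t == test else 0.0   # stand-in for the ConvKB score
head_set, tail_set = candidate_sets(test, entities)
print(rank_of_valid(head_set, test, score_fn), rank_of_valid(tail_set, test, score_fn))
```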
(4) Analysis of results
The ERP-GAT model is trained and tested on the standard data sets FB15K-237 and NELL-995, and the obtained experimental results are shown in FIG. 5 and FIG. 6. The results show that, compared with other existing algorithms, the ERP-GAT algorithm significantly improves the five indexes on the FB15K-237 data set and four indexes on the NELL-995 data set, achieving the best results on the knowledge graph completion task.

Claims (4)

1. An economic field knowledge graph completion algorithm based on a graph attention mechanism, comprising the following steps:
Step one: calculating the attention scores of the triples adjacent to each target entity
In a knowledge graph, an entity plays different roles depending on the relation in the current triple; for example, in the triple {Liu Qiangdong, chief executive officer, JD.com} and the triple {Liu Qiangdong, husband, Zhang Zetian}, the entity "Liu Qiangdong" appears in two different triples and, because of the different relations, plays the two roles of chief executive officer and husband respectively; to handle this phenomenon, the ERP-GAT algorithm uses a graph attention layer which takes two embedding matrices as input, wherein
H \in \mathbb{R}^{N_e \times T} represents the entity embedding matrix, whose i-th row is the embedding vector of entity e_i, N_e is the total number of entities, and T is the feature dimension of each entity embedding vector; G \in \mathbb{R}^{N_r \times P} represents the relation embedding matrix, where N_r is the total number of relations and P is the feature dimension of each relation embedding vector; the graph attention layer outputs the two corresponding updated embedding matrices, denoted H' and G' respectively;
to obtain the new embedding of an entity e_i, the ERP-GAT model applies a linear transformation to the concatenated entity and relation feature vectors of every triple connected to the target entity, producing a representation c_{ijk} that is used to learn each triple associated with entity e_i, as shown in equation (1):
\vec{c}_{ijk} = W_1 \left[ \vec{h}_i \,\Vert\, \vec{h}_j \,\Vert\, \vec{g}_k \right]    (1)
where \vec{c}_{ijk} is the vector representation of the triple t^k_{ij} = (e_i, r_k, e_j); \vec{h}_i, \vec{h}_j and \vec{g}_k are the embedding vectors of the entities e_i, e_j and the relation r_k respectively; W_1 is a linear transformation matrix;
to measure the importance of the triples associated with the target entity, the ERP-GAT model defines an importance weight \alpha_{ijk} for each triple: a linear transformation matrix W_2 is applied to \vec{c}_{ijk}, the result is passed through a LeakyReLU function to obtain the attention score, and the scores are normalized with a softmax function, as shown in equation (2):
\alpha_{ijk} = \operatorname{softmax}_{jk}\big( \operatorname{LeakyReLU}( W_2\, \vec{c}_{ijk} ) \big)    (2)
step two: updating an embedded matrix
In order to solve the problems that a CNN considers each triple independently, ignores the rich semantic information and potential relations near a given entity in the knowledge graph, and does not model the relations between triples, the ERP-GAT model uses a multi-head attention mechanism to stabilize the learning process and to encapsulate more information from the triples near the given entity; each of the M attention heads computes an updated embedding vector, and the head outputs are concatenated to obtain the updated entity embedding vector \vec{h}'_i, as shown in equation (3):
\vec{h}'_i = \Big\Vert_{m=1}^{M} \sigma \Big( \sum_{j \in \mathcal{N}_i} \sum_{k \in \mathcal{R}_{ij}} \alpha^{m}_{ijk}\, \vec{c}^{\,m}_{ijk} \Big)    (3)
specifically, in the last graph attention layer the ERP-GAT model averages the embedding vectors of the M heads instead of concatenating them, to obtain the final output entity embedding vector, as shown in equation (4):
\vec{h}'_i = \sigma \Big( \frac{1}{M} \sum_{m=1}^{M} \sum_{j \in \mathcal{N}_i} \sum_{k \in \mathcal{R}_{ij}} \alpha^{m}_{ijk}\, \vec{c}^{\,m}_{ijk} \Big)    (4)
the relation embedding matrix is transformed with a linear transformation matrix W^R \in \mathbb{R}^{T \times T'}, where T' is the feature dimension of the output relation embedding matrix, as shown in equation (5):
G' = G\, W^R    (5)
the model is trained using hinge loss as the loss function, as shown in equation (6):
L = \sum_{t_{ij} \in S} \sum_{t'_{ij} \in S'} \max \big\{ d_{t_{ij}} - d_{t'_{ij}} + \gamma,\ 0 \big\}    (6)
where d_{t_{ij}} = \lVert \vec{h}_i + \vec{g}_k - \vec{h}_j \rVert_1 is the translational distance of a triple, \gamma > 0 is a margin hyper-parameter, S represents the set of correct triples and S' the set of incorrect triples;
Step three: decoder
the ERP-GAT model uses ConvKB as the decoder: a convolutional layer provides the score function that analyzes the global embedding features of a triple in each dimension and generalizes the transitional characteristics of the ERP-GAT model; each triple t^k_{ij} = (e_i, r_k, e_j) is represented as the matrix [\vec{h}_i, \vec{g}_k, \vec{h}_j] and passed through the convolution filters, the filter outputs are passed through a ReLU function and concatenated, and a final linear transformation with the weight vector W produces the score f(t^k_{ij}), as shown in equation (7):
f(t^k_{ij}) = \Big( \Big\Vert_{m=1}^{\Omega} \operatorname{ReLU}\big( [\vec{h}_i, \vec{g}_k, \vec{h}_j] * \omega^m \big) \Big) \cdot W    (7)
where \Omega denotes the number of convolution filters, \omega^m denotes the m-th convolution filter, and * denotes the convolution operation;
the decoder is trained using the soft-margin loss as the loss function, as shown in equation (8):
\mathcal{L} = \sum_{t^k_{ij} \in S \cup S'} \log \Big( 1 + \exp \big( l_{t^k_{ij}} \cdot f(t^k_{ij}) \big) \Big) + \frac{\lambda}{2} \lVert W \rVert_2^2    (8)
where l_{t^k_{ij}} = 1 for t^k_{ij} \in S, l_{t^k_{ij}} = -1 for t^k_{ij} \in S', and \lambda is the L2 regularization coefficient.
2. the method of claim 1, wherein an attention score mechanism is used in step 1 to calculate and measure the importance of triples associated with a given entity.
3. The method of claim 1, wherein step 2 uses a multi-head attention mechanism to stabilize the learning process and to encapsulate more information in the triples around a given entity.
4. The method of claim 1, wherein ConvKB is used as the decoder in step 3.
CN202111471322.0A 2021-12-04 2021-12-04 Economic field knowledge graph completion algorithm based on graph attention machine mechanism Pending CN114625881A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111471322.0A CN114625881A (en) 2021-12-04 2021-12-04 Economic field knowledge graph completion algorithm based on graph attention machine mechanism

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111471322.0A CN114625881A (en) 2021-12-04 2021-12-04 Economic field knowledge graph completion algorithm based on graph attention machine mechanism

Publications (1)

Publication Number Publication Date
CN114625881A true CN114625881A (en) 2022-06-14

Family

ID=81898858

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111471322.0A Pending CN114625881A (en) 2021-12-04 2021-12-04 Economic field knowledge graph completion algorithm based on graph attention machine mechanism

Country Status (1)

Country Link
CN (1) CN114625881A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116306936A (en) * 2022-11-24 2023-06-23 北京建筑大学 Knowledge graph embedding method and model based on hierarchical relation rotation and entity rotation
CN116629356A (en) * 2023-05-09 2023-08-22 华中师范大学 Encoder and Gaussian mixture model-based small-sample knowledge graph completion method
CN116629356B (en) * 2023-05-09 2024-01-26 华中师范大学 Encoder and Gaussian mixture model-based small-sample knowledge graph completion method
CN117540799A (en) * 2023-10-20 2024-02-09 上海歆广数据科技有限公司 Individual case map creation and generation method and system
CN117540799B (en) * 2023-10-20 2024-04-09 上海歆广数据科技有限公司 Individual case map creation and generation method and system

Similar Documents

Publication Publication Date Title
CN114625881A (en) Economic field knowledge graph completion algorithm based on graph attention machine mechanism
CN108334948B (en) Mechanical bearing fault diagnosis technology based on wide residual error network learning model
CN109620152B (en) MutifacolLoss-densenert-based electrocardiosignal classification method
CN109034194B (en) Transaction fraud behavior deep detection method based on feature differentiation
CN110097755A (en) Freeway traffic flow amount state identification method based on deep neural network
CN107770517A (en) Full reference image quality appraisement method based on image fault type
CN112087447B (en) Rare attack-oriented network intrusion detection method
CN111178319A (en) Video behavior identification method based on compression reward and punishment mechanism
CN114841257A (en) Small sample target detection method based on self-supervision contrast constraint
CN110070116B (en) Segmented selection integration image classification method based on deep tree training strategy
CN116153495A (en) Prognosis survival prediction method for immunotherapy of esophageal cancer patient
CN109214298A (en) A kind of Asia women face value Rating Model method based on depth convolutional network
CN109034062A (en) A kind of Weakly supervised anomaly detection method based on temporal consistency
CN110826702A (en) Abnormal event detection method for multitask deep network
CN114913379B (en) Remote sensing image small sample scene classification method based on multitasking dynamic contrast learning
CN108596044B (en) Pedestrian detection method based on deep convolutional neural network
CN112668809A (en) Method for establishing autism child rehabilitation effect prediction model and method and system for predicting autism child rehabilitation effect
CN113344077A (en) Anti-noise solanaceae disease identification method based on convolution capsule network structure
CN112651360A (en) Skeleton action recognition method under small sample
CN114722216A (en) Entity alignment method based on Chinese electronic medical record knowledge graph
CN114169504B (en) Self-adaptive filtering-based graph convolution neural network pooling method
CN116527346A (en) Threat node perception method based on deep learning graph neural network theory
CN115269861A (en) Reinforced learning knowledge graph reasoning method based on generative confrontation and imitation learning
CN103281555B (en) Half reference assessment-based quality of experience (QoE) objective assessment method for video streaming service
CN112861443B (en) Advanced learning fault diagnosis method integrated with priori knowledge

Legal Events

Date Code Title Description
PB01 Publication