CN110955780B - Entity alignment method for knowledge graph - Google Patents

Entity alignment method for knowledge graph Download PDF

Info

Publication number
CN110955780B
CN110955780B CN201910968049.9A CN201910968049A CN110955780B CN 110955780 B CN110955780 B CN 110955780B CN 201910968049 A CN201910968049 A CN 201910968049A CN 110955780 B CN110955780 B CN 110955780B
Authority
CN
China
Prior art keywords
entity
distance
matrix
word
graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910968049.9A
Other languages
Chinese (zh)
Other versions
CN110955780A (en
Inventor
赵翔
曾维新
唐九阳
徐浩
谭真
殷风景
葛斌
肖卫东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National University of Defense Technology
Original Assignee
National University of Defense Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National University of Defense Technology filed Critical National University of Defense Technology
Priority to CN201910968049.9A priority Critical patent/CN110955780B/en
Publication of CN110955780A publication Critical patent/CN110955780A/en
Application granted granted Critical
Publication of CN110955780B publication Critical patent/CN110955780B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses an entity alignment method for a knowledge graph, which comprises the following steps: acquiring data of two knowledge maps; learning the structure vector of the entity by using a graph convolution network, and expressing the name of the entity as a word vector; calculating the structure distance and the word characteristic distance of the entity; fusing the two distances into a comprehensive distance to represent the similarity degree of the entities; and carrying out entity identification alignment according to the calculation result of the similarity degree. The method designs an entity alignment basic framework which integrates the structural feature and the entity name feature; and reordering the preorder alignment result by adopting a word shift distance model so as to fully mine entity name information and improve the accuracy and timeliness of entity alignment.

Description

Entity alignment method for knowledge graph
Technical Field
The invention belongs to the field of knowledge graph data processing, and particularly relates to an entity alignment method for a knowledge graph.
Background
In recent years, a large number of Knowledge Graphs (KGs) such as YAGO, DBpedia, NELL, and CN-DBpedia, zhishi. The large-scale knowledge maps play an important role in intelligent services such as question-answering systems, personalized recommendation and the like. In addition, to meet specific domain-related needs, more and more domain knowledge maps, such as medical knowledge maps, are being derived. In the process of knowledge graph construction, the trade-off between the coverage rate and the accuracy rate is inevitably needed. And any knowledge graph cannot be complete or completely correct.
In order to improve the coverage rate and accuracy of the knowledge graph, one possible method is to introduce relevant knowledge from other knowledge graphs, because the knowledge redundancies and complementation exist among the knowledge graphs constructed in different ways. For example, a constructed generic knowledge graph extracted from a web page may contain only the name of a drug, while more information may be found in a medical knowledge graph constructed based on medical data. To integrate knowledge in the external knowledge-graph into the target knowledge-graph, the most important step is to align the different knowledge-graphs. For this reason, an Entity Alignment (EA) task is proposed and receives a wide attention. The task is to find pairs of entities in different knowledge graphs that express the same meaning. And the entity pairs serve as hubs for linking different knowledge graphs to serve subsequent tasks.
At present, the mainstream entity alignment method mainly judges whether two entities point to the same thing by means of the structural features of a knowledge graph. Such methods assume that entities expressing the same meaning in different knowledge graphs have similar adjacent information. On artificially constructed data sets, this type of method achieves the best experimental results. But a recent work has indicated that these manually constructed data sets have a more dense knowledge-graph than the real-world knowledge-graph, and the structural feature-based entity alignment approach has a far less effective knowledge-graph with normal distribution.
In fact, by analyzing the distribution of entities in the real-world knowledge graph, more than half of the entities are connected to only one or two other entities. These entities are called long-tail entities (long-tail entities) and occupy most of knowledge graph entities, so that the graph as a whole presents high sparsity. This also corresponds to the knowledge of the real world knowledge map: only a few entities are frequently used and have rich adjacency information; most entities are mentioned only rarely, containing little structural information. Therefore, current entity alignment methods based on structural information do not perform well on real-world datasets.
Disclosure of Invention
In view of the above, the present invention is directed to an entity alignment method for a knowledge graph, which overcomes the disadvantage of performing entity alignment only by using entity structure information in the prior art, and makes full use of the entity structure information and the entity name information to be comprehensively used for entity alignment, thereby improving the alignment efficiency.
Based on the above purpose, the present invention provides an entity alignment method for knowledge graph, which includes the following steps:
step 1, acquiring data of two knowledge maps;
step 2, learning the structure vector of the entity by using a graph convolution network; representing the names of the entities as word vectors;
step 3, calculating the structure distance and the word characteristic distance of the entity;
step 4, fusing the two distances into a comprehensive distance to represent the similarity of the entities;
and 5, performing entity identification alignment according to the calculation result of the similarity degree to obtain a similar entity pair.
The two knowledge maps are represented as G 1 =(E 1 ,R 1 ,T 1 ) And G 2 =(E 2 ,R 2 ,T 2 ) Wherein E represents an entity, R represents a relationship,
Figure GDA0002378054520000031
representing triplets in a graph, pairs of known entities represented as
Figure GDA0002378054520000032
The entity alignment task aims to find a new entity pair by utilizing the known entity pair information and generate a final alignment result
Figure GDA0002378054520000033
Wherein the equal sign represents that the two entities point to the same real world entity;
in the step 2, two-layer graph convolution networks are used for processing two knowledge graph data and generating corresponding entity structure vectors respectively;
entity e of two knowledge graphs in step 3 1 ∈G 1 And e 2 ∈G 2 The structural distance is D in structural space s (e 1 ,e 2 )=||e 1 -e 2 || l1 /d s ,d s Is the structural matrix dimension; the word characteristic distance is D t (e 1 ,e 2 )=||ne(e 1 )-ne(e 2 )|| l1 /d t Suppose that entity e includes the word w in its name 1 ,w 2 ,...,w p Then the entity name vector may be represented as the average of these word vectors, i.e.
Figure GDA0002378054520000034
Wherein w i Is w i Word vector of d t Is the name vector matrix dimension;
the fusion formula of the comprehensive distance in the step 4 is as follows:
D(e 1 ,e 2 )=αD s (e 1 ,e 2 )+(1α)D t (e 1 ,e 2 )
where α is the hyperparameter used to adjust the weights of the two features.
Preferably, the characteristic distance is calculated by a word-shift distance model, which is intended to measure the difference between different sentences, and the word-shift distance is expressed as the minimum distance value of embedded vectors of all words in an entity that need to be shifted to reach embedded vectors of all words in another entity.
Specifically, the input of the graph convolution network is the characteristic matrix of the entity
Figure GDA0002378054520000035
And an adjacency matrix A of the graph, and the output is a feature matrix with structure information
Figure GDA0002378054520000036
N represents the number of nodes in the graph, and P and F represent the dimensions of the input and output matrix features, respectively, assuming the input of the l-th layer as the feature matrix of the nodes
Figure GDA0002378054520000041
Wherein d is l Dimension representing the characteristic matrix of the l-th layer, for the first layer, H 1 =X,d 1 = P; the first layer output is
Figure GDA0002378054520000042
Wherein
Figure GDA0002378054520000043
I is a unit matrix, and the unit matrix is,
Figure GDA0002378054520000044
is composed of
Figure GDA0002378054520000045
The diagonal matrix of (a) is,
Figure GDA0002378054520000046
is a parameter matrix of the l-th layer, d l+1 Is the dimension of the feature matrix of the next layer, the activation function σ is often set to ReLU, H for the last layer l+1 =Z,d l+1 =F。
Specifically, an initial feature matrix X is obtained by sampling from L2 regularized truncated normal distribution, and is updated through training of each layer of GCN, so that structural information in a knowledge graph is fully captured, and an output feature matrix Z is generated; the dimension of the feature matrix is always set to d s ,P=F=d l =d s And two GCNs share the feature matrix W in two layers 1 And W 2
Specifically, the training objective is to minimize the following loss values:
Figure GDA0002378054520000047
wherein [ x ]] + =max{0,x},
Figure GDA0002378054520000048
The representation is based on a known entity pair (e) 1 ,e 2 ) E is to be 1 Or e 2 And replacing the negative sample set generated by a random entity, wherein e represents a structure vector of the entity e, and gamma represents an end distance separating the positive sample from the negative sample, and performing model optimization by adopting random gradient descent.
Compared with the prior art, the invention has the following advantages and beneficial effects:
(1) An entity alignment basic framework which fuses the structural feature and the entity name feature is designed. The entity name and the structural information are mutually complemented, and the basic framework can greatly improve the alignment result of the long-tail entity, so that the overall alignment effect is optimized.
(2) And a word shift distance model is adopted to reorder the preorder alignment results so as to fully mine entity name information and improve alignment accuracy.
Drawings
Fig. 1 is a schematic overall flow chart of an embodiment of the present invention.
Detailed Description
The invention is further described with reference to the accompanying drawings, but the invention is not limited in any way, and any alterations or substitutions based on the teaching of the invention are within the scope of the invention.
As shown in fig. 1, an entity alignment method for a knowledge-graph includes the following steps:
step 1, acquiring data of two knowledge maps;
step 2, learning the structure vector of the entity by using a graph convolution network; representing the names of the entities as word vectors;
step 3, calculating the structure distance and the word characteristic distance of the entity;
step 4, fusing the two distances into a comprehensive distance to represent the similarity of the entities;
and 5, performing entity identification alignment according to the calculation result of the similarity degree to obtain a similar entity pair.
For a better understanding of the present disclosure, all possible meanings of the symbols are given. H l : layer i structural feature matrix, N: number of nodes, X: initial structural feature matrix, d l : layer i feature matrix dimension, Z: final structural feature matrix, S: known entity pair, a: adjacency matrix, d s : dimension of structural matrix, W l : layer I parameter matrix, D s : entity spacing under structural space, e: structural vector of entity e, d t : name vector matrix dimension, P: initial structural feature matrix dimension, F: final structural feature matrix dimension, N: name vector matrix of entity, D: distance between entities, G 1 : to-be-aligned knowledge map 1,G 2 : to-be-aligned knowledge map 2,e 1 :G 1 Middle entity, e 2 :G 2 Middle entity, Δ 1 :e 1 Difference between the nearest two entities, Δ 2 :e 2 Difference between the nearest two entities, θ 1 : distance difference threshold, theta 2 : a threshold number of newly added entity pairs.
A formalized description of the entity alignment problem is given by two knowledge graphs, G 1 =(E 1 ,R 1 ,T 1 ) And G 2 =(E 2 ,R 2 ,T 2 ) Wherein E represents an entity, R represents a relationship,
Figure GDA0002378054520000051
representing triplets in the map. The known entity pair is represented as
Figure GDA0002378054520000052
The entity alignment task aims to find a new entity pair by utilizing the known entity pair information and generate a final alignment result
Figure GDA0002378054520000053
Wherein equal signs indicate that two entities point to the same entityA world entity. Given an entity, the process of finding its corresponding entity in another knowledge-graph can be considered a ranking problem. That is, under a certain feature space, the degree of similarity (distance) of a given entity to all entities in another knowledge-graph is calculated and given an ordering, and the entity with the highest degree of similarity (distance is the smallest) can be regarded as an alignment result.
Taking the medical knowledge map as an example, in order to obtain more medical knowledge, a plurality of independent medical knowledge maps can be fused, and in order to better fuse the medical knowledge map, entities in the medical knowledge map need to be identified, wherein the entities include names of medicines, names of diseases and names of symptoms. The three types of entities are the most basic entities of the medical knowledge graph, the alignment of the three types of entities is made, the most basic requirements of the medical knowledge graph are met, and the extraction of other entities can be determined according to actual needs.
The embodiment captures entity adjacency structure information and generates an entity structure representation vector by using a Graph Convolution Network (GCN). The GCN is a convolutional network that acts directly on the graph structure data to generate corresponding node structure vectors by capturing the structure information around the nodes. The input of the GCN is a feature matrix of the entity
Figure GDA0002378054520000061
And the adjacency matrix a of the figure. The output is a feature matrix with structure information
Figure GDA0002378054520000062
N represents the number of nodes in the map, while P and F represent the dimensions of the input and output matrix features, respectively.
GCN models typically contain multiple GCN layers. In particular, assume that the input at layer I is a feature matrix of nodes
Figure GDA0002378054520000063
Wherein d is l Dimension representing the characteristic matrix of the l-th layer (for the first layer, H 1 =X,d 1 = P). The first layer output is
Figure GDA0002378054520000064
Wherein
Figure GDA0002378054520000065
I is an identity matrix and is a matrix of the identity,
Figure GDA0002378054520000066
is composed of
Figure GDA0002378054520000067
The diagonal matrix of (a).
Figure GDA0002378054520000068
Is a parameter matrix of the l-th layer, d l+1 Is the dimension of the next level feature matrix. The activation function σ is often set to ReLU. For the last layer, H l+1 =Z,d l+1 =F。
In this embodiment, two-layer GCNs are constructed, each GCN being configured to process a knowledge graph and generate a corresponding entity vector, where an initial feature matrix X is obtained by sampling from L2 regularized truncated normal distribution and is updated through training of each layer of the GCN, so as to fully capture structural information in the knowledge graph and generate an output feature matrix Z s (P=F=d l =d s ) And two GCNs share the feature matrix W in two layers 1 And W 2
The entity structure vectors of different knowledge-graphs are not in the same space, so it is necessary to align them into the same space using a known entity pair S. A specific training objective is to minimize the following loss values:
Figure GDA0002378054520000071
wherein [ x ]] + =max{0,x},
Figure GDA0002378054520000072
The representation is based on a known entity pair (e) 1 ,e 2 ) A 1, e 1 Or e 2 Instead, a set of negative examples generated by the random entity. e represents the structure vector of entity e. Gamma represents the end distance separating the positive and negative samples. Model optimization was performed using a random gradient descent.
Given the final structural feature matrix Z, e 1 ∈G 1 And e 2 ∈G 2 A distance D below the structural space s (e 1 ,e 2 )=||e 1 -e 2 || l1 /d s
If only the structural features are considered, the distance D between the target entity e and the structural features s The closest entity will be considered the corresponding entity of e.
Unlike the prior art, the present embodiment proposes to align with text features simultaneously. Specifically, in the text form of entity names, consider that 1) entity names are often used to identify entities and are widely available; 2) By comparing the entity names, whether the two entities are the same or not can be visually judged; 3) The method is not influenced by the scale of the training set and has stronger stability.
Although the conventional string comparison method can be used to measure the similarity between two entity names, the semantic similarity of the entity names is used in this embodiment because it is also applicable when the knowledge base is very different, such as the alignment of multi-language knowledge base. Specifically, the average word vector representation is used as an entity name vector because it is simple and general, and semantic information can be expressed without a special corpus in consideration of the simplicity and the universality. Suppose that entity e includes the word w in its name 1 ,w 2 ,...,w p Then the entity name vector may be represented as the average of these word vectors, i.e.
Figure GDA0002378054520000073
Wherein w i Is w i The word vector of (2). The name vector for all entities can be represented as N.
Similar to word vectors, similar entity names will be very close in vector space. e.g. of the type 1 ∈G 1 And e 2 ∈G 2 Distance under text feature space is D t (e 1 ,e 2 )=||ne(e 1 )-ne(e 2 )|| l1 /d t . If only the entity is consideredName characteristics, and e distance D to target entity t The closest entity will be considered as the corresponding entity to e. For cross-language entity alignment, pre-training cross-language word vectors can be utilized, thereby ensuring that cross-language entity name vectors are in the same space.
Considering that structural and name features depict entities from two different aspects, structural and semantic, respectively, they can be further combined to provide a more comprehensive alignment cue. In particular, two entities e 1 ∈G 1 And e 2 ∈G 2 The distance between them is:
D(e 1 ,e 2 )=αD s (e 1 ,e 2 )+(1-α)D t (e 1 ,e 2 )
where α is the hyperparameter used to adjust the weights of the two features. In the space after the feature fusion, the entity closest to the target entity e by the distance D is regarded as the corresponding entity of e.
The word-shift distance model aims to measure the difference between different sentences, and represents that the embedded vectors of all words in one sentence need to be shifted to reach the minimum distance value of the embedded vectors of all words in another sentence. Compared with the distance between average word vectors, the word shift distance can better depict the influence of each word in the sentence on the whole sentence, and the semantic loss caused by average operation is avoided. However, this model is time consuming due to the need to compute word-level distances, and is not suitable for large-scale data. For this reason, the method is not used to calculate the distance between entity names from the beginning, but rather it is used to reorder the preamble results.
The main contributions of the present invention are as follows:
(1) An entity alignment basic framework which fuses the structural feature and the entity name feature is designed. The entity name and the structural information are mutually complemented, and the basic framework can greatly improve the alignment result of the long-tail entity, so that the overall alignment effect is optimized.
(2) And reordering the preorder alignment result by adopting a word shift distance model so as to fully mine entity name information and improve alignment accuracy.
Aiming at the problem that the structure information of the knowledge graph is deficient in a real world data set, the invention combines the entity name information which is not influenced by the degree of the entity node with the structure information to construct an entity alignment basic framework. And on the basis of the previous step, further mining entity name information by using a word shifting distance model, reordering the preorder results and further generating a final alignment result. The model works best on widely used entity aligned data sets.
The above examples are an implementation of the method for knowledge-graph fusion, but the implementation of the method is not limited by the examples, and any other changes, modifications, substitutions, combinations, and simplifications that do not depart from the spirit and principle of the present invention should be regarded as equivalent substitutions, and are included in the scope of the present invention.

Claims (4)

1. A method for entity alignment of a knowledge graph, comprising the steps of:
step 1, acquiring data of two knowledge maps;
step 2, learning the structure vector of the entity by using a graph convolution network; representing the names of the entities as word vectors;
step 3, calculating the structure distance and the word characteristic distance of the entity;
step 4, fusing the two distances into a comprehensive distance to represent the similarity of the entities;
step 5, performing entity identification alignment according to the calculation result of the similarity degree to obtain a similar entity pair;
the two knowledge maps are represented as G 1 =(E 1 ,R 1 ,T 1 ) And G 2 =(E 2 ,R 2 ,T 2 ) Wherein E represents an entity, R represents a relationship,
Figure FDA0003828200280000013
representing triplets in a graph, known entity pairs are represented as
Figure FDA0003828200280000011
In the step 2, two-layer graph convolution networks are used for processing two knowledge graph data and generating corresponding entity structure vectors respectively;
entity e of two knowledge graphs in step 3 1 ∈G 1 And e 2 ∈G 2 The structural distance is D in structural space s (e 1 ,e 2 )=||e 1 -e 2 || l1 /d s ,d s Is the structural matrix dimension;
the word feature distance is the semantic similarity of the entity name, the average word vector is used as the entity name vector, and the distance of the entity name vector in the text feature space is calculated, specifically, the word feature distance is D t (e 1 ,e 2 )=||ne(e 1 )-ne(e 2 )|| l1 /d t Suppose that entity e includes the word w in its name 1 ,w 2 ,...,w p Then the entity name vector may be represented as the average of these word vectors, i.e.
Figure FDA0003828200280000012
Whereinw i Is w i Word vector of d t Is the name vector matrix dimension;
further, the word characteristic distance is calculated through a word moving distance model, the word moving distance model is used for measuring the difference among different sentences, and the word moving distance is represented as the minimum distance value of embedded vectors of all words in an entity which need to move to embedded vectors of all words in another entity;
the fusion formula of the comprehensive distance in the step 4 is as follows:
D(e 1 ,e 2 )=αD s (e 1 ,e 2 )+(1-α)D t (e 1 ,e 2 )
where α is the hyperparameter used to adjust the weights of the two features.
2. The entity alignment method as claimed in claim 1, wherein the graph convolution networkThe input being a feature matrix of the entity
Figure FDA0003828200280000021
And an adjacency matrix A of the graph, the output being a feature matrix into which the structure information is fused
Figure FDA0003828200280000022
N represents the number of nodes in the graph, and P and F represent the dimensions of the input and output matrix features, respectively, assuming the input of the l-th layer as the feature matrix of the nodes
Figure FDA0003828200280000023
Wherein d is l Dimension representing the characteristic matrix of the l-th layer, for the first layer, H 1 =X,d 1 =P;
The first layer output is
Figure FDA0003828200280000024
Wherein
Figure FDA0003828200280000025
I is an identity matrix and is a matrix of the identity,
Figure FDA0003828200280000026
is composed of
Figure FDA0003828200280000027
The diagonal matrix of (a) is,
Figure FDA0003828200280000028
is a parameter matrix of the l-th layer, d l+1 Is the dimension of the feature matrix of the next layer, the activation function σ is often set to ReLU, H for the last layer l+1 =Z,d l+1 =F。
3. The entity alignment method according to claim 2, wherein the initial feature matrix X is sampled from the L2 regularized truncated normal distribution and updated by GCN layer trainingFurther, fully capturing structural information in the knowledge graph and generating an output characteristic matrix Z; the dimension of the feature matrix is always set to d s ,P=F=d l =d s While two GCNs share the feature matrix W in two layers 1 And W 2
4. The entity alignment method of claim 3, wherein the training objective is to minimize the following penalty values:
Figure FDA0003828200280000029
wherein [ x ]] + =max{0,x},
Figure FDA00038282002800000210
The representation is based on a known entity pair (e) 1 ,e 2 ) A 1, e 1 Or e 2 Instead of a set of negative examples generated by a random entity,eand representing a structural vector of an entity e, and gamma represents an end distance separating positive and negative samples, and performing model optimization by adopting random gradient descent.
CN201910968049.9A 2019-10-12 2019-10-12 Entity alignment method for knowledge graph Active CN110955780B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910968049.9A CN110955780B (en) 2019-10-12 2019-10-12 Entity alignment method for knowledge graph

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910968049.9A CN110955780B (en) 2019-10-12 2019-10-12 Entity alignment method for knowledge graph

Publications (2)

Publication Number Publication Date
CN110955780A CN110955780A (en) 2020-04-03
CN110955780B true CN110955780B (en) 2022-10-14

Family

ID=69975624

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910968049.9A Active CN110955780B (en) 2019-10-12 2019-10-12 Entity alignment method for knowledge graph

Country Status (1)

Country Link
CN (1) CN110955780B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111563192B (en) * 2020-04-28 2023-05-30 腾讯科技(深圳)有限公司 Entity alignment method, device, electronic equipment and storage medium
CN111680488B (en) * 2020-06-08 2023-07-21 浙江大学 Cross-language entity alignment method based on knowledge graph multi-view information
CN112699246B (en) * 2020-12-21 2022-09-27 南京理工大学 Domain knowledge pushing method based on knowledge graph
CN112287126B (en) * 2020-12-24 2021-03-19 中国人民解放军国防科技大学 Entity alignment method and device suitable for multi-mode knowledge graph
CN112765362B (en) * 2021-01-12 2023-06-13 中国人民解放军国防科技大学 Knowledge-graph entity alignment method based on improved self-encoder and related equipment
CN113111657B (en) * 2021-03-04 2024-05-03 浙江工业大学 Cross-language knowledge graph alignment and fusion method, device and storage medium
CN113409157B (en) * 2021-05-19 2022-06-28 桂林电子科技大学 Cross-social network user alignment method and device
CN113761221B (en) * 2021-06-30 2022-02-15 中国人民解放军32801部队 Knowledge graph entity alignment method based on graph neural network
CN113656596B (en) * 2021-08-18 2022-09-20 中国人民解放军国防科技大学 Multi-modal entity alignment method based on triple screening fusion
CN113407759B (en) * 2021-08-18 2021-11-30 中国人民解放军国防科技大学 Multi-modal entity alignment method based on adaptive feature fusion
CN114386694B (en) * 2022-01-11 2024-02-23 平安科技(深圳)有限公司 Drug molecular property prediction method, device and equipment based on contrast learning
CN117172321A (en) * 2023-11-02 2023-12-05 中国科学院空天信息创新研究院 Geographic entity alignment method and device for introducing graphic neural network and electronic equipment
CN117172255B (en) * 2023-11-02 2024-02-02 中国科学院空天信息创新研究院 Geographic entity alignment method and device considering spatial semantic relation and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108595708A (en) * 2018-05-10 2018-09-28 北京航空航天大学 A kind of exception information file classification method of knowledge based collection of illustrative plates
CN108875051A (en) * 2018-06-28 2018-11-23 中译语通科技股份有限公司 Knowledge mapping method for auto constructing and system towards magnanimity non-structured text
CN109255002A (en) * 2018-09-11 2019-01-22 浙江大学 A method of it is excavated using relation path and solves knowledge mapping alignment task

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9418103B2 (en) * 2013-12-06 2016-08-16 Quixey, Inc. Techniques for reformulating search queries

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108595708A (en) * 2018-05-10 2018-09-28 北京航空航天大学 A kind of exception information file classification method of knowledge based collection of illustrative plates
CN108875051A (en) * 2018-06-28 2018-11-23 中译语通科技股份有限公司 Knowledge mapping method for auto constructing and system towards magnanimity non-structured text
CN109255002A (en) * 2018-09-11 2019-01-22 浙江大学 A method of it is excavated using relation path and solves knowledge mapping alignment task

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"Table Detection in Invoice Documents by Graph Neural Networks";Pau Riba 等;《2019 International Conference on Document Analysis and Recognition (ICDAR)》;20190925;全文 *
"基于神经网络的实体识别和关系抽取的联合模型研究";丁琛;《中国优秀硕士学位论文全文数据库 信息科技辑》;20190815;全文 *

Also Published As

Publication number Publication date
CN110955780A (en) 2020-04-03

Similar Documents

Publication Publication Date Title
CN110955780B (en) Entity alignment method for knowledge graph
CN110941722B (en) Knowledge graph fusion method based on entity alignment
CN110472065B (en) Cross-language knowledge graph entity alignment method based on GCN twin network
CN108389614B (en) Method for constructing medical image map based on image segmentation and convolutional neural network
US10096121B2 (en) Human-shape image segmentation method
WO2018077039A1 (en) Community discovery method, apparatus, server, and computer storage medium
US20230206127A1 (en) Knowledge graph fusion method based on iterative completion
CN113140254B (en) Meta-learning drug-target interaction prediction system and prediction method
JP7334205B2 (en) Destination prediction method, device, electronic device, computer-readable storage medium and computer program
CN112084373B (en) Graph embedding-based multi-source heterogeneous network user alignment method
CN104778256A (en) Rapid incremental clustering method for domain question-answering system consultations
CN114625882B (en) Network construction method for improving unique diversity of image text description
Padó et al. Who sides with whom? Towards computational construction of discourse networks for political debates
CN115438197B (en) Method and system for complementing relationship of affair knowledge graph based on double-layer heterogeneous graph
CN114048340A (en) Hierarchical fusion combined query image retrieval method
CN104572632B (en) A kind of method in the translation direction for determining the vocabulary with proper name translation
CN110569355B (en) Viewpoint target extraction and target emotion classification combined method and system based on word blocks
CN109086463A (en) A kind of Ask-Answer Community label recommendation method based on region convolutional neural networks
CN115391553A (en) Method for automatically searching time sequence knowledge graph complement model
US20230117325A1 (en) System for generating compound structure representation
Sagcan et al. Toponym recognition in social media for estimating the location of events
CN112529057A (en) Graph similarity calculation method and device based on graph convolution network
CN115330142B (en) Training method of joint capacity model, capacity demand matching method and device
CN115601745A (en) Multi-view three-dimensional object identification method facing application end
CN115860119A (en) Low-sample knowledge graph completion method and system based on dynamic meta-learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant