CN113946665A

CN113946665A - Knowledge base question-answering method for providing background information based on text

Info

Publication number: CN113946665A
Application number: CN202111066945.XA
Authority: CN
Inventors: 冯程程; 姜琳颖
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2021-09-13
Filing date: 2021-09-13
Publication date: 2022-01-18
Anticipated expiration: 2041-09-13
Also published as: CN113946665B

Abstract

The invention belongs to the technical field of deep learning and intelligent question answering, and relates to a knowledge base question answering method for providing background information based on texts, wherein the representation of an entity is learned from a retrieved knowledge base subgraph through an entity representation module, and the problem representation is updated through fusing seed entities; learning, by a document characterization module, a correct representation of a document associated with a given problem, and updating the problem representation by fusing document information; in the answer prediction module, obtaining a final entity representation, a document representation and a question representation through an entity representation module and a document representation module; and splicing the entity representations and the corresponding document representations, and calculating a dot product with the problem to obtain a matching score. The invention creatively integrates the knowledge base information and the text information into the question, realizes the purpose of providing background knowledge by utilizing the knowledge base and the text information, improves the performance of the question-answering system and has strong expansibility.

Description

Knowledge base question-answering method for providing background information based on text

Technical Field

The invention belongs to the technical field of deep learning and intelligent question answering, relates to a knowledge base question answering method for providing background information based on texts, and particularly relates to an intelligent question answering method using two data sources of texts and a knowledge base.

Background

The Knowledge Base (KB) is considered as an essential resource for answering factual questions. However, a KB that is accurately built using complex patterns requires a significant amount of manpower, which inevitably limits the coverage of the KB. In fact, KB is often incomplete, not enough to cover all the evidence needed for the problem. On the other hand, the vast amount of unstructured text on the internet can easily cover a wide range of evolving knowledge that can be commonly used in question-and-answer systems. Therefore, in order to improve the coverage of the KB, researchers believe that the KB can be extended directly using text data. Currently, there are many possible ways to use text to enhance the performance of a knowledge base question-and-answer system. One method is to split the knowledge base tuple into entities and relations, split the text into entities and remove the remaining part of the entities (called patten), encode the entities by using a Universal Schema, and use the encoded text as the input of a memory network input module, and then obtain the predicted answer through a generalization module, an output module and a response module. The document is regarded as a heterogeneous node, the heterogeneous node and an entity in a knowledge base are combined into a unified graph, and the representation of each node is obtained in a graph convolution mode and is used for performing classification and predicting answers. Recently, researchers have pioneered the use of hypergraph convolutional networks to exploit the higher-order relationships contained in text.

These methods only consider the problem of incomplete answer coverage when the knowledge base of answer relief contained in the text is used as the only information source. Another information source may serve to provide clues to answers (background knowledge) when not considered in retrieving answers in the knowledge base or text. Xiong et al believe that when searching for answers in text, the information in the knowledge base may provide background knowledge to distinguish between relevant and irrelevant information (see FIG. 1(a)) (Wenhan Xiong, Mo Yu, Shiyu Chang, Xiaooxino Guo, and William Yang Wang Wang.Impropering query in computer kbs with knowledge-aware reader. in Annual Meeting of the Association for computerized linkage, 2019.). Their approach is better at answering a first type of question, but it may encounter difficulties in answering a second type of question (see fig. 1(b)) because it is difficult to return an accurate answer when there are some constraints in the question that cannot be found directly in KB. The relationships and entities corresponding to the constraint "debug with Carolina Panthers" cannot be found directly in the subgraph of problem 2. When the answer is retrieved in the sub-graph, the algorithm does not know what the constraint "debug with Carolina Panthers" mentioned in the question refers to in the sub-graph, nor what the time mentioned in the question refers to. Thus, the system is likely to return to newtons when to do other things. From the foregoing analysis, we consider that in addition to the situation in fig. 1(a), when the answer is in a sub-graph, it is also necessary to use textual information to provide background knowledge (see fig. 1 (b)).

The invention utilizes the end-to-end neural network to improve the traditional information retrieval-based method, and combines a knowledge base and an entity link text as an information source. In addition to text data that may increase answer coverage of the knowledge base, the present invention recognizes that the knowledge base and text may complement each other by providing background knowledge to each other (see two examples in FIG. 1). The method fuses the knowledge base and the text information into the problem representation, and achieves the purposes of providing background knowledge and enhancing the problem representation. And enhances the interaction between the knowledge base, text and questions.

Disclosure of Invention

The Knowledge Base (KB) is widely used in the question-answering (QA) task to provide appropriate answers to a given question, i.e., KBQA questions. However, the knowledge base itself may be incomplete (e.g., incomplete answer coverage and incomplete context constraints in the knowledge base), which limits the overall performance of existing KBQA models. To solve this problem, the present invention proposes a new model TB-model, which enhances knowledge base coverage by text supplementation with additional answers and uses text to provide background information to enhance the representation of the problem with complex constraints, thereby improving the accuracy of answer retrieval.

The invention provides an intelligent question-answering method using two information sources of a text and a knowledge base, which comprises the following steps: and calculating the similarity of the relationship between the problem and each entity in the knowledge base subgraph to obtain a problem relationship matching score. And judging whether the neighbor entity is the subject entity, and generating the matching score of the problem neighbor according to the obtained problem relation matching score and the binary index of judging whether the neighbor entity is the subject entity. And fusing the neighbor information of each entity according to the matching score of the problem neighbor to obtain the entity representation containing the neighbor information. And integrating the seed entity characteristics into the problem characteristics, and updating the problem characteristics for the first time. The resulting entities are used to characterize the representation of each word-segmentation in the augmented document. And performing attention calculation between the question and the document to obtain a document representation containing the attention of the question. And fusing the document representation corresponding to the subject entity into the problem representation, and updating the problem representation for the second time. And splicing the entity representation with the document representation containing the entity, and matching the entity representation with the problem. And outputting the entity with the highest matching score as the answer.

The technical scheme of the invention is as follows:

a knowledge base question-answering method for providing background information based on texts is characterized by comprising the following steps:

s1, learning the representation of the entity from the retrieved knowledge base subgraph through an entity representation module, and updating the problem representation through fusing seed entities;

s2, learning the correct representation of the document related to the given question through a document representation module, and updating the question representation through fusing document information;

s3, in the answer prediction module, obtaining the final entity representation, the document representation and the question representation through the entity representation module in the step S1 and the document representation module in the step S2; and splicing the entity representations and the corresponding document representations, and calculating a dot product with the problem to obtain a matching score.

Further, step S1 specifically includes the following steps:

s101, after bi-LSTM coding, the hidden state of each word of the question and the relation is respectively

And

wherein d is_hIs the hidden layer dimension; l_qAnd l_rLength of question and length of relation, respectively; r is a real number set; h is_qHidden state for each word of the question; h is_rHidden states for each word of the relationship; a representation of the self-attention mechanism relationship is then obtained:

wherein the content of the first and second substances,

is h_rThe number of the ith row of (a),

for the trained vector, α_iIs a correlation coefficient;

a relational representation generated for a self-attention mechanism;

use of

Calculating the degree of correlation with each participle of the question, taking the obtained degree of correlation of each participle as weight, calculating the sum of the weights, and matching the question and the relation by dot product_rModeling:

wherein, beta_iIs a correlation coefficient;

j line, s, of hq_rMatching scores for questions and relationships; the ith neighbor to entity e (r)_iEi) final match score s (r)_iEi) is defined as:

wherein, I is a binary indicator and represents an entity e_iWhether or not it is a seed entity, epsilon₀Is a set of seed entities that is,

a matching score for the question and the ith relationship;

s102, obtaining a final matching score S (r) between the question and the neighbor according to the step S101_i,e_i) And controlling information propagation of the neighbors by using a linear gate function to obtain the representation of each entity containing the neighbor information:

g(x,y)＝sigmoid(W[x；y])

wherein

A representation of the entity containing the neighbor information,

and

for pre-trained tuples (e, r)_i,e_i) Head entities e and e in_iIs embedded into the Glove knowledge graph of (1),

for a trainable transformation matrix, σ (-) is the activation function, Ne is the set of all neighbors of entity e, g (x, y) is the defined function, γ^eIs a compromise parameter for the calculation of the linear gate function;

s103, updating problem representation by fusing knowledge of a knowledge base of the seed entity after the representation that each entity contains neighbor information is obtained in the step S102:

a characterization of the problem obtained from the attention mechanism,

is the average value of the seed entity characterization, γ q is the compromise parameter of the gate function calculation,

updating the problem representation by fusing knowledge of the knowledge base of the seed entity; w is a^q,

Are parameters for learning.

Further, step S2 specifically includes the following steps:

s201, fusing the text word segmentation features and the entity features corresponding to the text word segmentation features by using a conditional gate function to obtain knowledge-enhanced document word segmentation features:

wherein the content of the first and second substances,

for the ith word-segmentation of the document,

is a feature of the ith word-segmentation,

is a representation of the entity to which the ith participle is linked, and γ d is a compromise parameter for the gate function calculation,

segmenting word features for the knowledge-enhanced document;

s202, the knowledge-enhanced document word segmentation characteristics obtained in the step S201

Using Bi-LSTM encoding as input, the hidden state at the participle level is output

Wherein d is_hIs the hidden layer dimension, l_dFor document length:

h_d＝Bi-LSTM(f_d)

wherein f is_dIs composed of

The matrix of the composition is composed of a plurality of matrixes,

is f_dRow i of (1);

s203, using an altering co-annotation mechanism to solve the problem and the hidden state h of each participle of the document_q，h_dAs input, a final document representation is obtained

H＝tanh(W_xX+(W_gg)1^T)

Wherein the content of the first and second substances,

is a vector with all elements 1, W_x,

And

is a parameter of learning, X is the input document feature, g is the lead of attention derived from the question, H is the intermediate state obtained, a^xIs the attention weight of feature X;

s204, fusing the question and the document: obtaining a representation of a document

Then, the information of the document containing the entity e is gathered by taking an average as follows:

the problem of fusing document information is characterized as follows:

wherein the content of the first and second substances,

is the average of all document tokens that contain the subject entity,

is a compromise parameter for the calculation of the linear gate function.

Further, step S203 specifically includes the following steps:

first step input X ═ h_d，g＝0；

The second step X ═ h_qG is the document characteristics output in the first step;

the third step X ═ h_dG is the problem characteristic output in the second step; the output of the third step is used as the final document characterization

Further, step S3 specifically includes the following steps: the entity tokens and the corresponding document tokens are spliced and the dot product is calculated with the problem to obtain the matching scores of the entity tokens and the corresponding document tokens, as follows:

wherein s is^eTo be the matching score of the question and candidate entity e,

is a parameter matrix of the transform dimension.

The invention has the advantages that: the invention creatively integrates knowledge base information and text information into the problem, and realizes the purpose of mutually providing background knowledge by utilizing the knowledge base and the text information. The performance of the question answering system is improved. The expansibility is strong.

Drawings

FIGS. 1(a) and 1(b) are explanatory views of how the present invention works; wherein FIG. 1(a) is the answer in the document; FIG. 1(b) shows the answer in the knowledge base.

FIG. 2 is a general model diagram of the present invention.

Detailed Description

The following further describes a specific embodiment of the present invention with reference to the drawings and technical solutions.

Examples

Task definition:

specifically, the QA task considered in embodiments of the present invention requires answering the question by reading the knowledge base tuple K { (es, r, eo) } and retrieving the wikipedia document set D. To build a scalable system, the present invention considers only one sub-graph per problem. The sub-graph is retrieved by the topic (seed) entity (the entity mentioned in the problem: E0 ═ E ∈ Q }). The document set D is retrieved by an existing document retriever and further sorted by Lucene index. Entities in the document are also annotated and linked to the knowledge base entities. For each question, the TB-model attempts to retrieve the answer entity from a candidate set that includes all knowledge base entities and document set entities.

The knowledge base question-answering method based on the text provides background information and is based on a TB-model, wherein the TB-model comprises an entity representation module, a document representation module and an answer prediction module; the entity representation module is used for learning the representation of the entity from the retrieved knowledge base subgraph and updating the problem representation by fusing the seed entity; the document representation module is used for learning the correct representation of the document related to the given problem and updating the problem representation again by fusing the document information; and the answer prediction module performs answer prediction according to the information of the knowledge base, the updated questions and the documents. The overall model diagram is shown in fig. 2. The method for providing background information based on the text comprises the following specific steps:

s1, in the entity characterization module, learning the representation of the entity from the retrieved knowledge base sub-graph, and updating the problem representation by fusing the seed entity.

Specifically, step S1 includes the steps of:

s101, before representing the entity, first obtain a matching score between a neighbor and the question by using a graph attention mechanism (in this embodiment, a 1-hop entity and a relationship connected to the candidate entity are regarded as neighbors), and the process includes matching the relationship between the question and the neighbor and determining whether the question includes the entity in the neighbor.

To match the problem and neighbor relations in the isomorphic potential space, a shared bi-LSTM coding problem and participle relation is employed. After bi-LSTM encoding, the hidden state of each word of the question and the relation is respectively

And

wherein d is_hIs the hidden layer dimension; l_qAnd l_rLength of question and length of relation, respectively; r is a real number set; h is_qHidden state for each word of the question; h is_rAs a hidden state of each word of the relationship. According to

A representation of the self-attention mechanism relationship is obtained:

wherein the content of the first and second substances,

is h_rThe number of the ith row of (a),

for the trained vector, α_iIs a correlation coefficient;

is a self-attention mechanism.

Then, use

Calculating the degree of correlation with each word segmentation of the question, taking the obtained degree of correlation of each word segmentation as a weight, calculating the sum of the weights, and then performing dot product on the questionMatching score s of sum relation_rModeling, as follows:

wherein, beta_iIs a correlation coefficient;

is h_qJ (th) row s_rMatching scores for questions and relationships;

in addition to the question-relationship similarity, if one neighbor entity is the subject entity in the question, then the corresponding tuple (e, r)_i,e_i) May be more likely to be relevant to the answer to the question than other non-subject neighbors, so a determination needs to be made as to whether an entity in a neighbor is a subject entity. The ith neighbor to entity e (r)_i,e_i) S (r) of_i,e_i) Is defined as:

is the matching score of the question with the ith relationship.

S102, obtaining a final matching score S (r) between the question and the neighbor according to the step S101_i,e_i) And controlling the information propagation of the neighbors by using a linear gate function, and obtaining the representation of each entity containing the neighbor information as follows:

g(x,y)＝sigmoid(W[x；y])

wherein

And

for pre-trained tuples (e, r)_i,e_i) Head entity e and tail entity e in_iIs embedded into the Glove knowledge graph of (1),

σ (-) is the activation function for the trainable transformation matrix; ne is the set of all neighbors of entity e and g (x, y) is a defined function. In addition, γ^eIs a compromise parameter for the calculation of the linear gate function that controls how much of the original entity information should be retained.

And S103, updating the problem representation by fusing knowledge of the knowledge base of the seed entity after the representation that each entity contains the neighbor information is obtained in the step S102.

A characterization of the problem obtained from the attention mechanism,

is the mean value, gamma, of the seed entity characterization^qIs a compromise parameter of the gate function calculation,

to update the problem representation by fusing knowledge of the knowledge base of seed entities, w^q,

Are parameters for learning.

S2, in the document characterization module, the correct representation of the document associated with the given question is learned and the question representation is further updated by fusing the document information.

Specifically, step S2 includes the steps of:

s201, using the character of document word segmentation designed by predecessor

(Danqi Chen, Adam Fisch, Jason Weston, and Autoine Bordes.2017.reading with knowledge to answer optional information questions. in Proceedings of the 55th environmental Meeting of the Association for the computerized Linear constraints, ACL 2017, Vancouver, Canada, July 30-August 4, Volume 1: Long Papers, pages 1870-:

wherein the content of the first and second substances,

for the ith word-segmentation of the document,

is a feature of the ith word segmentation.

Is the ith minuteCharacterization of the entity to which the word is linked, learned by the entity characterization Module, gamma^dIs a compromise parameter of the gate function calculation,

word segmentation features for knowledge-enhanced documents.

Wherein d is_hIs the hidden layer dimension, l_dFor document length:

h_d＝Bi-LSTM(f_d)

wherein f is_dIs composed of

The matrix of the composition is composed of a plurality of matrixes,

is f_dRow i of (2).

H＝tanh(W_xX+(W_gg)1^T)

Wherein 1 ∈ R^dh is a vector with all elements 1. W_x,

And

are parameters of learning. X is the input document (question) feature, g is the lead of attention derived from the question (document), H is the intermediate state obtained, a^xIs the attention weight of feature X.

First step input X ═ h_dG is 0; the second step X ═ h_qG is the document characteristics output in the first step; the third step X ═ h_dAnd g is the problem characteristic output by the second step. The output of the third step is used as the final document characterization

Then, we aggregate the information of the document containing entity e by taking the average as follows:

the problem of fusing document information is characterized as follows:

wherein the content of the first and second substances,

is the average of all document tokens that contain the subject entity,

is a compromise parameter for the calculation of the linear gate function.

S3, in the answer prediction module, obtaining the final entity representation through the entity representation module and the document representation module

Document characterization

Problem characterization

Then; the entity tokens and the corresponding document tokens are spliced together and the dot product is calculated with the problem to obtain the matching scores of the entity tokens and the corresponding document tokens, as follows:

wherein s is^eTo be the matching score of the question and candidate entity e,

is a parameter matrix of the transform dimension.

The present invention trains the model using a binary cross entropy loss function. Excellent performance results were obtained on the webquestinsp dataset.

Therefore, the method and the device fuse the document information related to the question into the question representation through the gating function, so as to explain the constraint in the question, enhance the representation of the query and achieve the purpose of providing background information for answer retrieval.

The above description of exemplary embodiments has been presented only to illustrate the technical solution of the invention and is not intended to be exhaustive or to limit the invention to the precise form described. Obviously, many modifications and variations are possible in light of the above teaching to those skilled in the art. The exemplary embodiments were chosen and described in order to explain certain principles of the invention and its practical application to thereby enable others skilled in the art to understand, implement and utilize the invention in various exemplary embodiments and with various alternatives and modifications. It is intended that the scope of the invention be defined by the following claims and their equivalents.

Claims

1. A knowledge base question-answering method for providing background information based on texts is characterized by comprising the following steps:

s1, learning the representation of the entity from the retrieved knowledge base subgraph through the entity representation module, and updating the problem representation through fusing seed entities;

s3, obtaining a final entity representation, a document representation and a question representation in the answer prediction module through the entity representation module in the step S1 and the document representation module in the step S2; and splicing the entity representations and the corresponding document representations, and calculating a dot product with the problem to obtain a matching score.

2. The knowledge base question-answering method for providing background information based on texts as claimed in claim 1, wherein the step S1 specifically comprises the following steps:

And

wherein the content of the first and second substances,

is h_rThe number of the ith row of (a),

for the trained vector, α_iIs a correlation coefficient;

a relational representation generated for a self-attention mechanism;

use of

wherein, beta_iIs a correlation coefficient;

is h_qJ (th) row s_rMatching scores for questions and relationships;

for instance, a pair of fruitsIth neighbor of body e (r)_i,e_i) S (r) of_i,e_i) Is defined as:

a matching score for the question and the ith relationship;

g(x,y)＝sigmoid(W[x；y])

wherein

A representation of the entity containing the neighbor information,

and

a characterization of the problem obtained from the attention mechanism,

updating the problem representation by fusing knowledge of the knowledge base of the seed entity;

are parameters for learning.

3. The knowledge base question-answering method for providing background information based on texts according to claim 1 or 2, wherein the step S2 specifically comprises the following steps:

wherein the content of the first and second substances,

for the ith word-segmentation of the document,

is a feature of the ith word-segmentation,

is a representation of the entity to which the ith participle is linked, γ^dIs a compromise parameter of the gate function calculation,

segmenting word features for the knowledge-enhanced document;

Wherein d is_hIs the hidden layer dimension, l_dFor document length:

h_d＝Bi-LSTM(f_d)

wherein f is_dIs composed of

The matrix of the composition is composed of a plurality of matrixes,

is f_dRow i of (1);

H＝tanh(W_xX+(W_gg)1^T)

Wherein the content of the first and second substances,

is a vector in which all elements are 1,

and

Then go toThe information for aggregating documents containing entity e by means of averaging is as follows:

the problem of fusing document information is characterized as follows:

wherein the content of the first and second substances,

is the average of all document tokens that contain the subject entity,

is a compromise parameter for the calculation of the linear gate function.

4. The knowledge base question-answering method for providing background information based on texts according to claim 3, wherein the step S203 specifically comprises the following steps:

first step input X ═ h_d，g＝0；

5. The knowledge base question-answering method for providing background information based on texts according to claim 1, 2 or 4, wherein the step S3 specifically comprises the following steps: the entity tokens and the corresponding document tokens are spliced and the dot product is calculated with the problem to obtain the matching scores of the entity tokens and the corresponding document tokens, as follows:

wherein s is^eTo be the matching score of the question and candidate entity e,

is a parameter matrix of the transform dimension.

6. The knowledge base question-answering method for providing background information based on texts according to claim 3, wherein the step S3 specifically comprises the following steps: the entity tokens and the corresponding document tokens are spliced and the dot product is calculated with the problem to obtain the matching scores of the entity tokens and the corresponding document tokens, as follows:

wherein s is^eTo be the matching score of the question and candidate entity e,

is a parameter matrix of the transform dimension.