CN113420111A - Intelligent question-answering method and device for multi-hop inference problem - Google Patents


Info

Publication number
CN113420111A
CN113420111A (application CN202110674586.XA; granted as CN113420111B)
Authority
CN
China
Prior art keywords
question, text, prediction result, representation, sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110674586.XA
Other languages
Chinese (zh)
Other versions
CN113420111B (en)
Inventor
张学君
万辛
付瑞柳
黄远
张鹏远
刘睿霖
颜永红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
National Computer Network and Information Security Management Center
Original Assignee
Institute of Acoustics CAS
National Computer Network and Information Security Management Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS, National Computer Network and Information Security Management Center filed Critical Institute of Acoustics CAS
Priority to CN202110674586.XA priority Critical patent/CN113420111B/en
Publication of CN113420111A publication Critical patent/CN113420111A/en
Application granted granted Critical
Publication of CN113420111B publication Critical patent/CN113420111B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G06F16/3329: Natural language query formulation or dialogue systems
    • G06F16/3344: Query execution using natural language analysis
    • G06F18/214: Generating training patterns; bootstrap methods, e.g. bagging or boosting
    • G06F40/126: Character encoding
    • G06F40/289: Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/30: Semantic analysis
    • G06N3/02: Neural networks


Abstract

The embodiments of the present application disclose an intelligent question-answering method and device for multi-hop inference questions. The method comprises: acquiring a question text; semantically encoding the question text to obtain a semantic encoded representation of the question text; determining a first prediction result from the semantic encoded representation, the first prediction result being a prediction of the position of at least one question subject in the question text; determining a second prediction result from the semantic encoded representation, the second prediction result being a prediction of at least one question relation of the question text; generating a sub-question text comprising at least one sub-question from the first and second prediction results; answering the at least one sub-question in turn against a screened document that contains the corresponding answers; and determining the final answer to the question text from the answers to the at least one sub-question.

Description

Intelligent question-answering method and device for multi-hop inference problem
Technical Field
The invention relates to the technical field of natural language processing, and in particular to an intelligent question-answering method and device for multi-hop reasoning questions.
Background
In intelligent question-answering services, multi-hop inference is a key technology for analyzing complex question descriptions and retrieving multiple clues toward an answer, and it is therefore an important component of such services.
Deep learning techniques have made great progress on the traditional single-hop reading-comprehension task, reaching a level comparable to humans. However, many single-hop reading-comprehension questions can be answered by simple matching between the question and a paragraph, and involve no complex reasoning.
To push machines toward true reasoning ability, several high-quality multi-hop question-answering datasets have recently been proposed, such as HotpotQA, ComplexWebQuestions, Qangaroo WikiHop, R4C, and 2WikiMultiHopQA. These multi-hop tasks are more challenging than traditional reading comprehension, for three main reasons: first, the machine must find clues at several widely separated positions across a large number of paragraphs before inference can be completed; second, the machine must provide a degree of interpretability while reasoning; and third, the machine must adapt to complex and variable question and answer types and integrate the clues to obtain an answer.
Most traditional multi-hop question-answering models are end-to-end reading-comprehension models based on graph structures or graph neural networks. Although these models perform well on many tasks, they have notable limitations.
First, the internal reasoning mechanism of an end-to-end reading-comprehension model is opaque: an additional discriminator is usually employed to judge whether a sentence is a clue sentence, but this discriminator is not strongly correlated with the model's reasoning result, so the interpretability is not faithful. Second, although graph structures are theoretically effective for multi-hop question answering, recent experiments show that existing graph neural networks amount to a special attention mechanism and are not strictly necessary for the task: given the same additional adjacency-matrix information, the self-attention mechanism of a Transformer network alone can achieve the same or even better results.
Disclosure of Invention
Because the existing methods suffer from these problems, the embodiments of the present application provide an intelligent question-answering method and device for multi-hop reasoning questions.
In a first aspect, an embodiment of the present application provides an intelligent question-answering method for a multi-hop inference problem, including:
acquiring a question text;
performing semantic encoding on the question text to obtain a semantic encoded representation of the question text;
determining a first prediction result from the semantic encoded representation of the question text, wherein the first prediction result is a prediction of the position of at least one question subject in the question text;
determining a second prediction result according to the semantic coding representation of the question text, wherein the second prediction result is a prediction result of at least one question relation of the question text;
generating a sub-question text according to the first prediction result and the second prediction result, wherein the sub-question text comprises at least one sub-question;
sequentially answering the at least one sub-question according to a screening document to obtain an answer corresponding to the at least one sub-question, wherein the screening document comprises the answer corresponding to the at least one sub-question;
and determining a final answer of the question text according to the answer corresponding to the at least one sub-question.
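The claimed three-stage flow can be sketched end to end as follows. This is a minimal illustration only: the `extract`, `read`, and `compare` callables and the `[SEP]` separator token are hypothetical stand-ins, not components the patent specifies.

```python
def multi_hop_answer(question, extract, read, compare):
    """Hypothetical end-to-end flow of the claimed method: a relation extractor
    supplies subjects, relations, and the question type; a reader answers each
    sub-question in turn; a comparator resolves comparative questions."""
    subjects, relations, qtype = extract(question)
    finals = []                        # last-hop answer per question subject
    for subj in subjects:
        answer = subj
        for rel in relations:          # each hop reuses the previous hop's answer
            answer = read(f"{answer} [SEP] {rel}")
        finals.append(answer)
    if qtype in ("comparative", "cascaded-comparative"):
        return compare(question, finals)
    return finals[-1]
```

For a combinatorial question with one subject and two relations, the loop chains the hops; for a comparative question, the per-subject last answers become the comparator's inputs.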
In one possible implementation, the determining a first prediction result from the semantically encoded representation of the question text comprises:
performing first encoding on the semantic encoded representation of the question text to obtain a first encoded representation, which is used to locate the position of at least one question subject in the question text;
performing an attention-mechanism operation on the first encoded representation to obtain a first attention-aware representation of the semantic encoding with respect to the at least one question subject;
applying at least two pointers to the first attention-aware representation through a pointer-network architecture, and outputting probability distributions over the start point and end point corresponding to the at least one question subject;
and determining the predicted position of the at least one question subject from the start-point and end-point probability distributions.
In one possible implementation, the method further comprises:
performing max-pooling and average-pooling operations on the semantic encoded representation to obtain a max-pooling result and an average-pooling result;
concatenating the max-pooling result and the average-pooling result to obtain a pooled semantic encoded representation;
determining a probability distribution over different question types from the pooled semantic encoded representation;
and taking the question type with the largest probability as a third prediction result, wherein the third prediction result is a prediction of the question type to which the question text belongs.
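A minimal sketch of the pooling-and-concatenation step above, in pure Python with token vectors as lists of floats:

```python
def pooled_representation(token_vectors):
    """Max-pool and average-pool over the token dimension, then concatenate
    the two results, as the implementation above describes."""
    dims = range(len(token_vectors[0]))
    max_pool = [max(v[d] for v in token_vectors) for d in dims]
    avg_pool = [sum(v[d] for v in token_vectors) / len(token_vectors) for d in dims]
    return max_pool + avg_pool
```

The concatenated vector is twice the encoder's hidden size and feeds the downstream type classifier.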
In one possible implementation, the method further comprises:
for a classification-based relation extractor, determining probability distributions over candidate question relations from the pooled semantic encoded representation;
multiplying these relation probability distributions by the probability of the corresponding question type to determine the final question-relation prediction for the question text; and/or
for an extraction-based relation extractor, performing second encoding on the semantic encoded representation of the question text to obtain a second encoded representation, which is used to locate at least one question relation of the question text;
performing an attention-mechanism operation on the second encoded representation to obtain a second attention-aware representation of the semantic encoding with respect to the at least one question relation;
applying at least two pointers to the second attention-aware representation through a pointer-network architecture, and outputting probability distributions over the start point and end point corresponding to the at least one question relation;
determining, from these start-point and end-point distributions, a second prediction result corresponding to each predicted question type;
and multiplying the second prediction result corresponding to the predicted question type by the probability of that question type to determine the final question-relation prediction for the question text.
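The multiplication of per-type relation predictions by the type distribution might look like the following sketch. The dictionary layout is a hypothetical encoding of the claim, not the patent's data structure:

```python
def final_relation(per_type_relation_probs, type_probs):
    """Weight each type's relation prediction by the type's probability and
    keep the best-scoring relation, mirroring the multiplication above."""
    best_rel, best_score = None, -1.0
    for qtype, rel_probs in per_type_relation_probs.items():
        for rel, p in rel_probs.items():
            score = p * type_probs.get(qtype, 0.0)
            if score > best_score:
                best_rel, best_score = rel, score
    return best_rel
```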
In one possible implementation, the generating a subproblem text according to the first prediction result and the second prediction result includes:
determining the sub-question text from the question subject, the at least one question relation, and a first separator token; or
determining the sub-question text with different templates according to the type of the question subject and the type of the at least one question relation; or
determining the sub-question text from the question subject and the at least one question relation by means of a pre-trained generation model.
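The first two generation options can be sketched as follows. The `[SEP]` separator token and the template wording are illustrative assumptions; the patent does not fix either.

```python
def subq_concat(subject, relation, sep="[SEP]"):
    """Option 1: subject, separator token, relation."""
    return f"{subject} {sep} {relation}"

def subq_template(subject, relation):
    """Option 2: a hypothetical fill-in template chosen by subject/relation type."""
    return f"What is the {relation} of {subject}?"
```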
In one possible implementation, the sequentially answering the at least one sub-question according to the screening document to obtain an answer corresponding to the at least one sub-question includes:
splicing the sub-question text, the screened document, and a second separator token to obtain a first concatenation result;
performing word segmentation on the first concatenation result to obtain a segmentation result;
applying byte-pair encoding (BPE) to the segmentation result to obtain a BPE representation;
determining a vector representation of the first concatenation result from the BPE representation;
performing third encoding on the vector representation of the first concatenation result to obtain a third encoded representation;
determining, from the third encoded representation, probability distributions over the start point and end point of the answer to at least one sub-question contained in the sub-question text;
determining the answer to the at least one sub-question from those start-point and end-point probability distributions;
and splicing a question subject with the first question relation to obtain the first sub-question, and determining its answer (the first answer) by the above steps; splicing the first answer with the second question relation to obtain the second sub-question, and determining its answer (the second answer) by the above steps; and so on, until the (N-1)-th answer is spliced with the final question relation to obtain the final sub-question, whose answer is the N-th answer, where N is the number of sub-questions and is a positive integer.
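The reader's input construction and answer-span decoding described above can be sketched as below. Token lists, probabilities, and the `[SEP]` token are toy values, not model outputs from the patent.

```python
def reader_input(sub_question, document, sep="[SEP]"):
    """First concatenation result: sub-question, separator token, screened document."""
    return f"{sub_question} {sep} {document}"

def decode_span(tokens, start_probs, end_probs):
    """Pick the answer span maximizing P(start) * P(end) subject to end >= start."""
    best, span = -1.0, (0, 0)
    for i, ps in enumerate(start_probs):
        for j in range(i, len(end_probs)):
            if ps * end_probs[j] > best:
                best, span = ps * end_probs[j], (i, j)
    return " ".join(tokens[span[0]:span[1] + 1])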
In one possible implementation, the question types to which the question text may belong include: combinatorial, inferential, comparative, and/or cascaded-comparative;
the method further comprises the following steps:
when the question type of the question text is combinatorial or inferential, determining the answer to the last sub-question as the final answer to the question text; or
when the question type of the question text is comparative or cascaded-comparative, taking the last-sub-question answers corresponding to the at least two question subjects as comparison objects, and splicing the question text with the comparison objects to obtain a second concatenation result;
and determining the final answer to the question text from the second concatenation result.
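The comparator's second concatenation result might be built as follows; the `[SEP]` separator token is an assumed placeholder for whatever marker the implementation uses.

```python
def comparator_input(question, comparison_objects, sep="[SEP]"):
    """Second concatenation result: the question text spliced with the
    last-hop answer for each question subject."""
    return f" {sep} ".join([question] + comparison_objects)
```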
In a second aspect, an embodiment of the present application provides an intelligent question-answering device for multi-hop inference questions, including:
a relation extractor module, configured to acquire a question text; perform semantic encoding on the question text to obtain a semantic encoded representation; determine a first prediction result from the semantic encoded representation, namely a prediction of the position of at least one question subject; and determine a second prediction result from the semantic encoded representation, namely a prediction of at least one question relation of the question text;
a reader module, configured to generate a sub-question text comprising at least one sub-question from the first and second prediction results, and to answer the at least one sub-question in turn against the screened documents to obtain the corresponding answers;
and a comparator module, configured to determine the final answer to the question text from the answers to the at least one sub-question.
In one possible implementation, the question types to which the question text may belong include: combinatorial, inferential, comparative, and/or cascaded-comparative;
the determining of a second prediction result from the semantic encoded representation of the question text comprises:
determining second prediction results corresponding to each question type of the question text from the semantic encoded representation.
In one possible implementation, the comparator module is specifically configured to:
when the question type of the question text is comparative or cascaded-comparative, take the last-sub-question answers corresponding to the at least two question subjects as comparison objects, splice the question text with the comparison objects to obtain a second concatenation result, and determine the final answer to the question text from the second concatenation result.
In a third aspect, an embodiment of the present application further provides a computer device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the steps of the first aspect and its possible implementations.
In a fourth aspect, embodiments of the present application further provide a non-transitory computer-readable storage medium storing a computer program which, when executed by a processor, implements the steps of the first aspect and its possible implementations.
According to the technical scheme, a complex question is decomposed and answered through three-stage reasoning, a process that closely mirrors human behavior. Question decomposition converts a complex multi-hop reasoning question into single-hop questions that existing models solve well. Using a relation extractor to flexibly extract question relations and generate sub-questions overcomes the limited ability of traditional end-to-end reading-comprehension models to understand complex question expressions. Compared with an end-to-end model, the comparator compares the comparison objects of a comparative question directly, and therefore generalizes better across multi-hop question texts of different types.
Drawings
To illustrate the embodiments of the present application or the prior-art solutions more clearly, the drawings needed in their description are briefly introduced below. The drawings described below are only some embodiments of the present application; those skilled in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of an intelligent question-answering method for a multi-hop inference problem according to an embodiment of the present application;
fig. 2 is a schematic structural diagram of an intelligent question-answering device for a multi-hop inference problem according to an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present invention are described below with reference to the drawings. The following examples only illustrate the technical solutions more clearly and do not limit the protection scope of the present invention.
It should be noted that the term "and/or" in this application merely describes an association between objects and indicates that three relationships may exist; for example, "A and/or B" may mean: A alone, A and B together, or B alone. The terms "first", "second", "third", etc. in the description and claims are used to distinguish different objects, not to describe a particular order; for example, the first, second, and third encoded representations distinguish different encoded representations rather than ordering them. Words such as "exemplary", "for example", or "such as" present relevant concepts concretely; an embodiment or design so described is not necessarily preferred or advantageous over other embodiments or designs.
In one possible implementation, most conventional multi-hop question-answering models are end-to-end reading-comprehension models based on a graph structure or graph neural network. As noted in the Background, these models have two disadvantages in use: their internal reasoning mechanism is opaque and relies on an additional, weakly correlated discriminator for interpretability; and the graph neural network amounts to a special attention mechanism that a Transformer's self-attention, given the same adjacency-matrix information, can match or surpass.
To this end, an embodiment of the present application provides the intelligent question-answering method for multi-hop inference questions shown in fig. 1, comprising steps S102-S114. By decomposing the question, a complex multi-hop reasoning question is converted into single-hop questions that existing models solve well, overcoming the limited ability of traditional end-to-end reading-comprehension models to understand complex question expressions.
The following describes an intelligent question-answering method for multi-hop inference questions as shown in fig. 1 provided in the embodiments of the present application in detail.
In one possible implementation, the intelligent question-answering method for multi-hop inference questions provided by the embodiment of the present application is implemented by the following steps:
s102, obtaining a question text.
In the embodiment of the present application, a question text is first acquired. Illustratively, the acquired question text is "Which has the higher box office, Crouching Tiger, Hidden Dragon or Bodyguards and Assassins?".
S104, carrying out semantic coding on the question text to obtain semantic coded representation of the question text.
In the embodiment of the present application, the question text acquired in S102 is preprocessed: mis-encoded and/or meaningless characters are removed by regular expressions to obtain clean text, e.g. "Which has the higher box office, Crouching Tiger, Hidden Dragon or Bodyguards and Assassins". Using a forward-and-backward maximum-matching word-segmentation algorithm, the clean text is matched against the dictionary in both the forward and backward directions to obtain segmentation results in both directions. Byte-Pair Encoding (BPE) is applied to the segmentation result to obtain a BPE representation. A vector representation of the clean text is obtained by querying the embedding layer of a lightweight BERT (Bidirectional Encoder Representations from Transformers) model. Self-interaction is then computed with a Transformer encoding layer to obtain a semantic encoded representation of the question text suitable for the question-relation extraction task.
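The forward pass of the maximum-matching segmentation mentioned above can be illustrated with a toy matcher; the dictionary and maximum word length are made up, and the backward pass is symmetric (scan from the end of the string instead).

```python
def forward_max_match(text, vocab, max_word_len=4):
    """Forward maximum-matching segmentation: greedily take the longest
    dictionary word starting at each position, falling back to single characters."""
    out, i = [], 0
    while i < len(text):
        for j in range(min(len(text), i + max_word_len), i, -1):
            if text[i:j] in vocab or j == i + 1:
                out.append(text[i:j])
                i = j
                break
    return out
```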
S106, determining a first prediction result from the semantic encoded representation of the question text, the first prediction result being a prediction of the position of at least one question subject in the question text.
In this embodiment, a Recurrent Neural Network (RNN) encoder module is first constructed for the semantic encoded representation of the question text and performs first encoding, producing a first encoded representation used to locate at least one question subject. Next, an attention-mechanism operation on the first encoded representation yields a first attention-aware representation with respect to the at least one question subject. Then, at least two pointers are applied to the first attention-aware representation through a pointer-network architecture, outputting probability distributions over the start and end points of the at least one question subject. Finally, the position of the at least one question subject is determined from those distributions.
The following description takes the semantic encoded representation of the question text "Which has the higher box office, Crouching Tiger, Hidden Dragon or Bodyguards and Assassins?" as an example.
As described above, the acquired question text is preprocessed into the clean text "Which has the higher box office, Crouching Tiger, Hidden Dragon or Bodyguards and Assassins", which is then converted into a semantic encoded representation. The RNN encoder module performs first encoding on this representation to obtain the first encoded representation, and an attention-mechanism operation yields the first attention-aware representation with respect to the question subjects. Since this clean text contains two question subjects, four pointers are applied to the first attention-aware representation through the pointer-network architecture, outputting start-point and end-point probability distributions for both subjects. In this embodiment, the first question subject is determined to be "Crouching Tiger, Hidden Dragon" and the second "Bodyguards and Assassins".
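Decoding the four pointer outputs into two subject spans could look like the sketch below; one-hot distributions stand in for real model outputs, and taking a plain argmax per pointer is a simplifying assumption.

```python
def decode_subjects(tokens, pointer_dists):
    """Four pointer distributions -> (start, end) spans for two question
    subjects; each pointer simply takes its argmax position."""
    idx = [max(range(len(d)), key=lambda i: d[i]) for d in pointer_dists]
    return (" ".join(tokens[idx[0]:idx[1] + 1]),
            " ".join(tokens[idx[2]:idx[3] + 1]))
```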
S108, according to the semantic coding representation of the question text, determining a second prediction result, wherein the second prediction result is a prediction result of at least one question relation of the question text.
In this embodiment, a prediction of the question type to which the question text belongs is determined from the semantic encoded representation; this prediction is the third prediction result. The question types include: combinatorial, inferential, comparative, and/or cascaded-comparative. A combinatorial question chains two single-hop questions in series; an inferential question requires common-sense reasoning; a comparative question requires comparing objects; and a cascaded-comparative question combines the combinatorial and comparative forms. Max-pooling and average-pooling operations are applied to the semantic encoded representation to obtain a max-pooling result and an average-pooling result, which are concatenated into a pooled semantic encoded representation. A Multi-Layer Perceptron (MLP) and a logistic-regression layer convert the pooled representation into a probability distribution over question types, and the type with the largest probability is taken as the prediction of the question type. For the example question "Which has the higher box office, Crouching Tiger, Hidden Dragon or Bodyguards and Assassins?", the predicted probabilities of the combinatorial, inferential, comparative, and cascaded-comparative types are 10%, 10%, 70%, and 10% respectively, so the predicted question type is comparative.
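Selecting the type with the largest predicted probability is a simple argmax; a sketch with the example distribution above:

```python
def predict_question_type(type_probs):
    """Take the question type with the largest predicted probability."""
    return max(type_probs, key=type_probs.get)
```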
In the embodiment of the application, for the obtained semantic coded representation of the question text, a linear classifier or a pointer network architecture is adopted, and an independent relation extractor is constructed for each question type, so as to obtain the second prediction result corresponding to the predicted question type of the question text. In particular, the number of question relations in different question texts is not necessarily the same. In order to extract the question relations with a unified model, the number of question relations is normalized to exactly two in a preprocessing stage. For a question text with a single question relation, that relation is duplicated to give two. For question texts with more than two question relations, the relations are merged by human judgment until exactly two remain. The relation extractor therefore needs to extract prediction results for two question relations and must respect their order.
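The relation-count normalization described above can be sketched as a small preprocessing helper. The merging rule for more than two relations is a naive string join here purely for illustration; the text relies on human judgment for that step.

```python
def normalize_relations(relations):
    """Normalize an extracted relation list to exactly two relations, as the
    preprocessing stage above requires: a single relation is duplicated;
    more than two are merged (naively joined here — the patent uses human
    judgment for the merge), preserving their order."""
    if len(relations) == 1:
        return [relations[0], relations[0]]
    if len(relations) > 2:
        return [relations[0], " and ".join(relations[1:])]
    return list(relations)
```

After this step every question text carries exactly two ordered relations, so one extractor architecture can serve all inputs.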
In the embodiment of the present application, for different application scenarios, two different relation extractors are provided: a classification relation extractor and an extractive relation extractor. The classification relation extractor adopts a linear classifier structure: the previously obtained pooled semantic coded representation is converted into probability distributions predicted for the different question relations through an MLP and a logistic regression layer, and each distribution is multiplied by the probability distribution of the corresponding question type to obtain a multiplication result. The question relation with the maximum probability in the multiplication results is taken as the prediction result of the final question relation of the question text. The extractive relation extractor adopts a pointer network structure: a Recurrent Neural Network (RNN) encoder module is constructed over the semantic coded representation of the question text, and a second encoding is performed to obtain a second coded representation, which is used to search for and predict at least one question relation of the question text. An attention mechanism operation is performed on the second coded representation to obtain a second attention perception representation of the semantic coded representation for the at least one question relation. At least two pointers are applied to the second attention perception representation through the pointer network architecture, outputting probability distributions of the starting point and end point corresponding to the at least one question relation. Second prediction results respectively corresponding to the question types of the question text are determined according to these starting-point and end-point probability distributions.
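The pointer step of the extractive relation extractor can be sketched as below: each pointer is modeled as a scoring vector whose dot products with the token representations are softmax-normalized into a positional distribution. The decoding rule (argmax start, then argmax end at or after it) is a common simplification, not necessarily the patent's exact decoder.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def pointer_span(M, w_start, w_end):
    """Apply two pointers to the attention-aware token representations M
    (seq_len x d): softmax over positions gives the start-point and
    end-point probability distributions, from which a span is decoded."""
    p_start = softmax(M @ w_start)
    p_end = softmax(M @ w_end)
    start = int(p_start.argmax())
    end = start + int(p_end[start:].argmax())  # force end >= start
    return p_start, p_end, (start, end)

# Toy run over 10 token positions.
rng = np.random.default_rng(1)
M = rng.normal(size=(10, 8))
p_s, p_e, (s, e) = pointer_span(M, rng.normal(size=8), rng.normal(size=8))
```

The same mechanism serves the question-subject pointers of step S106 and the relation pointers described here.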
Further, second prediction results respectively corresponding to the question types to which the question texts belong are multiplied by the probability distribution of the corresponding question types, and the prediction result of the final question relation of the question texts is determined.
In the embodiment of the present application, for example, for the question text "which of film A and film B has the higher box office", the first question relation is "the box office of film A", the second question relation is "the box office of film B", and the third question relation is "which of the two box offices is higher"; it can then be determined that the prediction result of the final question relation is "which of the two box offices is higher".
S110, generating a subproblem text according to the first prediction result and the second prediction result, wherein the subproblem text comprises at least one subproblem.
In the embodiment of the present application, the simplest way is to splice the question subject and one of the at least one question relations, separated by a first separation identifier, as a sub-question, thereby generating the sub-question text. Alternatively, to improve readability, fixed templates are adopted: according to the type of the question subject (time, place, person) and the type of the at least one question relation, different templates are used to determine the sub-question text. Alternatively, to increase the diversity of sub-question expressions, the single-hop questions in the single-hop reading comprehension data set SQuAD are used: the question subjects and question relations of the single-hop questions are extracted, a corresponding generation model is trained to obtain a preset trained model, and the sub-question text is determined from the question subject and the at least one question relation.
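The first two sub-question generation strategies above can be sketched as follows. The separator token and the template table are illustrative assumptions; the patent does not fix their concrete values.

```python
SEP = "[R]"  # hypothetical first separation identifier

def subquestion_by_separator(subject, relation):
    """Simplest strategy: question subject and question relation joined by a
    separation identifier."""
    return f"{subject} {SEP} {relation}"

# Fixed-template strategy: a template chosen by (subject type, relation type).
TEMPLATES = {  # illustrative templates, not taken from the patent
    ("film", "box_office"): "What is the box office of {subject}?",
    ("person", "birthplace"): "Where was {subject} born?",
}

def subquestion_by_template(subject, subject_type, relation_type):
    return TEMPLATES[(subject_type, relation_type)].format(subject=subject)

q1 = subquestion_by_separator("film A", "box office")
q2 = subquestion_by_template("film A", "film", "box_office")
```

The third strategy (a generation model trained on SQuAD-style single-hop questions) would replace these rules with a learned text generator.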
Through this question decomposition, a complex multi-hop inference question is converted into single-hop questions that mature single-hop models can already solve.
And S112, sequentially answering the at least one sub-question according to a screening document to obtain an answer corresponding to the at least one sub-question, wherein the screening document comprises the answer corresponding to the at least one sub-question.
In the embodiment of the application, a lightweight BERT model (ALBERT) with an added span predictor is used as a single-hop model to answer the sub-questions one by one. Specifically, the sub-question text, a second separation identifier and the screening document are spliced in sequence to obtain a first splicing result. Using a forward-backward maximum-matching word segmentation algorithm, the first splicing result is matched against a dictionary in both the forward and the backward direction, the segmentation results of the two directions are merged, and byte-pair encoding (BPE) is performed to obtain the final BPE result. The vector representation of the first splicing result is obtained by querying the embedding layer of ALBERT. A third encoding is performed on the vector representation of the first splicing result through the self-attention-based ALBERT encoding layer, yielding a third coded representation, i.e., a coded representation of the first splicing result containing contextual semantic information. Using a fully connected layer, the probability distributions of the starting point and end point of the answer to at least one sub-question included in the sub-question text are determined from the third coded representation. The answer corresponding to the at least one sub-question is then determined according to these starting-point and end-point probability distributions.
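The bidirectional maximum-matching segmentation step above can be sketched as below. The rule for merging the two directions (keep the segmentation with fewer tokens) is a common heuristic assumed here; the patent does not spell out its merging rule.

```python
def forward_max_match(text, vocab, max_len=5):
    """Greedy forward maximum matching against a dictionary; unmatched
    single characters pass through as their own tokens."""
    tokens, i = [], 0
    while i < len(text):
        for l in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + l]
            if l == 1 or piece in vocab:
                tokens.append(piece)
                i += l
                break
    return tokens

def backward_max_match(text, vocab, max_len=5):
    """Greedy backward maximum matching against the same dictionary."""
    tokens, j = [], len(text)
    while j > 0:
        for l in range(min(max_len, j), 0, -1):
            piece = text[j - l:j]
            if l == 1 or piece in vocab:
                tokens.insert(0, piece)
                j -= l
                break
    return tokens

def segment(text, vocab):
    """Merge the two directions by keeping the result with fewer tokens
    (assumption); ties go to the forward segmentation."""
    fwd, bwd = forward_max_match(text, vocab), backward_max_match(text, vocab)
    return fwd if len(fwd) <= len(bwd) else bwd

vocab = {"ab", "bc"}
seg = segment("abc", vocab)  # forward gives ["ab", "c"], backward ["a", "bc"]
```

The merged segmentation would then be passed through BPE and the ALBERT embedding layer as described above.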
The two normalized question relations from step S108 are restored. The question subject is spliced with the first question relation to obtain the first sub-question, and the answer corresponding to the first sub-question, determined as described above, is the first answer. The first answer is then spliced with the second question relation to obtain the second sub-question, whose answer, determined as described above, is the second answer. By analogy, the (N-1)th answer is spliced with the final question relation to obtain the last sub-question, whose answer, determined as described above, is the Nth answer, where N is the number of sub-questions and is a positive integer.
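The chained answering procedure above can be sketched as a loop in which each answer seeds the next sub-question. The splicing format and the lookup-table reader are illustrative stand-ins for the ALBERT single-hop reader.

```python
def answer_chain(subject, relations, answer_fn):
    """Answer the sub-questions in sequence: the question subject is spliced
    with the first relation; each later sub-question splices the previous
    answer with the next relation. answer_fn stands in for the ALBERT-based
    single-hop reader, mapping a sub-question to its answer."""
    current = subject
    for relation in relations:
        sub_question = f"{current} {relation}"
        current = answer_fn(sub_question)
    return current  # the N-th (final) answer

# Toy single-hop reader backed by a lookup table.
toy_reader = {
    "film A box office": "5.0 billion",
    "5.0 billion rounded": "5 billion",
}.get
final = answer_chain("film A", ["box office", "rounded"], toy_reader)
```

With N relations this yields N intermediate answers, the last of which is passed on to step S114.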
The relation extractor flexibly extracts question relations to generate sub-questions, overcoming the limited ability of traditional end-to-end reading comprehension models to understand complex question expressions.
S114, determining a final answer of the question text according to the answer corresponding to the at least one sub-question.
In the embodiment of the application, when the question type of the question text is combinatorial or inferential, the answer corresponding to the last sub-question is determined as the final answer of the question text. When the question type of the question text is comparative or cascaded comparative, the answers corresponding to the last sub-questions of the at least two question subjects are taken as comparison objects (for example, times or numbers), and the question text and the comparison objects are spliced to obtain a second splicing result. The second splicing result is input into a quantity-relation comparator based on a BERT classifier model, which performs the comparison to determine the final answer of the question text.
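The comparative branch above can be sketched with a simple rule-based comparator standing in for the BERT-classifier quantity comparator; the number-parsing regex and the larger/smaller switch are illustrative assumptions.

```python
import re

def compare(subjects, comparison_objects, larger_wins=True):
    """Rule-based stand-in for the BERT quantity-relation comparator: pull
    the first number out of each comparison object and return the question
    subject whose value is larger (or smaller, for 'which is less' forms)."""
    values = [float(re.search(r"\d+(?:\.\d+)?", obj).group())
              for obj in comparison_objects]
    key = values.__getitem__
    idx = max(range(len(values)), key=key) if larger_wins \
        else min(range(len(values)), key=key)
    return subjects[idx]

winner = compare(["film A", "film B"],
                 ["5.01 billion yuan", "3.2 billion yuan"])
```

In the patented device the comparison is learned rather than rule-based, which lets it handle dates, ordinals and other comparison objects beyond plain numbers.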
Compared with an end-to-end reading comprehension model, the embodiment of the application uses the comparator to compare the comparison objects of a comparative question in a targeted manner, and generalizes better across multi-hop question texts of different question types.
Fig. 2 is a schematic structural diagram of an intelligent question-answering device for multi-hop inference questions according to an embodiment of the present application. The device is a three-stage multi-hop inference device based on complex question decomposition: following the behavior pattern of humans answering complex questions, it performs multi-hop inference by decomposing the complex question, dividing the multi-hop inference process among the relation extractor module 202, the reader module 204 and the comparator module 206. The device uses deep learning techniques to accurately extract the question subject and question relations, automatically answers each sub-question, constructs a comparator from the comparison objects contained in the question text, and automatically compares the comparison objects to answer the question.
A relationship extractor module 202, configured to obtain a question text; performing semantic coding on the problem text to obtain semantic coding representation of the problem text; determining a first prediction result according to semantic coding representation of the question text, wherein the first prediction result is a prediction result of the position of at least one question main body of the question text; determining a second prediction result according to the semantic coding representation of the question text, wherein the second prediction result is a prediction result of at least one question relation of the question text;
the reader module 204 is configured to generate a subproblem text according to the first prediction result and the second prediction result, where the subproblem text includes at least one subproblem; according to the screened documents, sequentially answering the at least one sub-question to obtain answers corresponding to the at least one sub-question;
a comparator module 206, configured to determine a final answer to the question text according to the answer corresponding to the at least one sub-question.
In one possible implementation, the question types to which the question text belongs include: combinatorial, inferential, comparative, and/or cascaded comparative;
the determining a second prediction result according to the semantic code representation of the question text includes:
and determining second prediction results respectively corresponding to the problem types of the problem texts according to the semantic coding representation of the problem texts.
In one possible implementation, the comparator module 206 is specifically configured to: aiming at the condition that the question type of the question text is a comparative type or a cascade comparative type, taking the answer corresponding to the last subproblem respectively corresponding to at least two question main bodies as a comparative object, and splicing the question text and the comparative object to obtain a second splicing result; and determining the final answer of the question text according to the second splicing result.
The embodiment of the application provides computer equipment, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein when the processor executes the computer program, the processor realizes the intelligent question-answering method for the multi-hop inference problem corresponding to the method in fig. 1.
The present application provides a non-transitory computer readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the computer program implements an intelligent question-answering method for a multi-hop inference problem as corresponding to fig. 1.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
It should be noted that: the above embodiments are only used to illustrate the technical solutions of the present application, and not to limit the same; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions in the embodiments of the present application.

Claims (10)

1. An intelligent question-answering method for multi-hop inference questions, comprising:
acquiring a question text;
carrying out semantic coding on the problem text to obtain semantic coded representation of the problem text;
determining a first prediction result according to semantic coding representation of the question text, wherein the first prediction result is a prediction result of the position of at least one question main body of the question text;
determining a second prediction result according to the semantic coding representation of the question text, wherein the second prediction result is a prediction result of at least one question relation of the question text;
generating a sub-question text according to the first prediction result and the second prediction result, wherein the sub-question text comprises at least one sub-question;
sequentially answering the at least one sub-question according to a screening document to obtain an answer corresponding to the at least one sub-question, wherein the screening document comprises the answer corresponding to the at least one sub-question;
and determining a final answer of the question text according to the answer corresponding to the at least one sub-question.
2. The method of claim 1, wherein determining a first prediction result from the semantically encoded representation of the question text comprises:
performing first coding on the semantic coding representation of the problem text to obtain a first coding representation, wherein the first coding representation is used for searching and predicting the position of at least one problem main body of the problem text;
performing an attention mechanism operation on the first encoded representation to obtain a first attention perception representation of the semantically encoded representation for the at least one question body;
applying at least two pointers to the first attention perception representation through a pointer network architecture, and outputting probability distributions of a starting point and an end point corresponding to the at least one problem body;
and determining a prediction result of the position of the at least one problem main body according to the probability distribution of the starting point and the end point corresponding to the at least one problem main body.
3. The method of claim 1, further comprising:
performing maximum average pooling operation on the semantic code representation to obtain a maximum pooling result and an average pooling result;
splicing the maximum pooling result and the average pooling result to obtain a pooling semantic code representation;
determining probability distribution of different problem types according to the pooling semantic code representation;
and determining the problem type corresponding to the maximum probability distribution as a third prediction result, wherein the third prediction result is a prediction result of the problem type to which the problem text belongs.
4. The method of claim 3, further comprising:
determining probability distribution of different problem relation predictions according to the pooling semantic code representation aiming at the classification relation extractor;
multiplying the probability distribution of the different problem relation predictions by the probability distribution of the corresponding problem types to determine the final problem relation prediction result of the problem text; and/or
for an extractive relation extractor, second coding the semantic coding representation of the question text to obtain a second coding representation, the second coding representation being used for search prediction of at least one question relationship of the question text;
performing attention mechanism operation on the second coded representation to obtain a second attention perception representation of the semantic coded representation for the at least one question relationship;
applying at least two pointers to the second attention perception representation through a pointer network architecture, and outputting probability distributions of a starting point and an end point corresponding to the at least one problem relation;
determining a second prediction result corresponding to the prediction result of the question type to which the question text belongs according to the probability distribution of the starting point and the end point corresponding to the at least one question relation;
and multiplying a second prediction result corresponding to the prediction result of the problem type to which the problem text belongs by the probability distribution of the corresponding problem type to determine the prediction result of the final problem relation of the problem text.
5. The method of claim 1, wherein generating a sub-problem text based on the first and second predictors comprises:
determining a sub-question text according to the question body, the at least one question relationship and the first separation identifier; or
Determining a sub-problem text by adopting different templates according to the type of the problem main body and the type of the at least one problem relation; or
And determining a sub-problem text according to a preset training model and through the problem main body and the at least one problem relation.
6. The method according to claim 4, wherein said sequentially answering said at least one sub-question according to a screening document to obtain an answer corresponding to said at least one sub-question comprises:
splicing the sub-question text, the screened document and the second separation identifier to obtain a first splicing result;
performing word segmentation on the first splicing result to obtain a word segmentation result;
carrying out byte-pair encoding on the word segmentation result to obtain a byte-pair encoded representation;
determining a vector representation of the first concatenation result from the byte-pair encoded representation;
performing third coding on the vector representation of the first splicing result to obtain a third coded representation;
determining a probability distribution of a starting point and an ending point of an answer to at least one sub-question included in the sub-question text according to the third coded representation;
determining an answer corresponding to at least one sub-question according to probability distribution of a starting point and an ending point of the answer of the at least one sub-question included in the sub-question text;
splicing a question main body with a first question relation to obtain a first sub-question, and determining, according to the above steps, that the answer corresponding to the first sub-question is a first answer; splicing the first answer with a second question relation to obtain a second sub-question, and determining, according to the above steps, that the answer corresponding to the second sub-question is a second answer; and by analogy, splicing an (N-1)th answer with a final question relation to obtain a last sub-question, and determining, according to the above steps, that the answer corresponding to the last sub-question is an Nth answer, wherein N is the number of sub-questions and is a positive integer.
7. The method of claim 3, wherein the question text belongs to a question type comprising: combinatorial, inferential, comparative, and/or cascaded comparative;
the method further comprises the following steps:
determining an answer corresponding to the last subproblem as a final answer of the question text aiming at the condition that the question type of the question text is a combined type or an inference type; or
Aiming at the condition that the question type of the question text is a comparative type or a cascade comparative type, taking an answer corresponding to the last subproblem respectively corresponding to at least two question main bodies as a comparative object, and splicing the question text and the comparative object to obtain a second splicing result;
and determining the final answer of the question text according to the second splicing result.
8. An intelligent question-answering device for multi-hop inference questions, comprising:
the relation extractor module is used for acquiring a question text; performing semantic coding on the problem text to obtain semantic coding representation of the problem text; determining a first prediction result according to semantic coding representation of the question text, wherein the first prediction result is a prediction result of the position of at least one question main body of the question text; determining a second prediction result according to the semantic coding representation of the question text, wherein the second prediction result is a prediction result of at least one question relation of the question text;
the reader module is used for generating a subproblem text according to the first prediction result and the second prediction result, and the subproblem text comprises at least one subproblem; according to the screened documents, sequentially answering the at least one sub-question to obtain answers corresponding to the at least one sub-question;
and the comparator module is used for determining the final answer of the question text according to the answer corresponding to the at least one sub-question.
9. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method according to any of claims 1-7 when executing the computer program.
10. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method according to any one of claims 1 to 7.
CN202110674586.XA 2021-06-17 2021-06-17 Intelligent question answering method and device for multi-hop reasoning problem Active CN113420111B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110674586.XA CN113420111B (en) 2021-06-17 2021-06-17 Intelligent question answering method and device for multi-hop reasoning problem


Publications (2)

Publication Number Publication Date
CN113420111A true CN113420111A (en) 2021-09-21
CN113420111B CN113420111B (en) 2023-08-11


Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114239553A (en) * 2021-12-23 2022-03-25 佳源科技股份有限公司 Log auditing method, device, equipment and medium based on artificial intelligence
CN117744795A (en) * 2023-12-08 2024-03-22 拓元(广州)智慧科技有限公司 Multi-agent collaborative knowledge reasoning framework and system based on large language model

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110245240A (en) * 2019-05-31 2019-09-17 国网电子商务有限公司 A kind of determination method and device of problem data answer
CN110389999A (en) * 2019-07-26 2019-10-29 北京香侬慧语科技有限责任公司 A kind of method, apparatus of information extraction, storage medium and electronic equipment
CN111143507A (en) * 2019-12-05 2020-05-12 重庆兆光科技股份有限公司 Reading understanding method based on composite problems
CN111324717A (en) * 2020-02-24 2020-06-23 武汉大学 Open domain question and answer oriented answer obtaining method and system based on machine reading understanding
CN111611361A (en) * 2020-04-01 2020-09-01 西南电子技术研究所(中国电子科技集团公司第十研究所) Intelligent reading, understanding, question answering system of extraction type machine
CN111782961A (en) * 2020-08-05 2020-10-16 中国人民解放军国防科技大学 Answer recommendation method oriented to machine reading understanding
CN112329435A (en) * 2020-11-27 2021-02-05 腾讯科技(深圳)有限公司 Text processing method and device, computer equipment and storage medium
CN112380326A (en) * 2020-10-10 2021-02-19 中国科学院信息工程研究所 Question answer extraction method based on multilayer perception and electronic device
CN112417104A (en) * 2020-12-04 2021-02-26 山西大学 Machine reading understanding multi-hop inference model and method with enhanced syntactic relation
CN112506949A (en) * 2020-12-03 2021-03-16 北京百度网讯科技有限公司 Method and device for generating query statement of structured query language and storage medium
WO2021109690A1 (en) * 2020-06-17 2021-06-10 平安科技(深圳)有限公司 Multi-type question smart answering method, system and device, and readable storage medium
CN112948546A (en) * 2021-01-15 2021-06-11 中国科学院空天信息创新研究院 Intelligent question and answer method and device for multi-source heterogeneous data source





Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant