WO2019235103A1 - Question generation device, question generation method, and program - Google Patents

Question generation device, question generation method, and program

Info

Publication number
WO2019235103A1
Authority
WO
WIPO (PCT)
Prior art keywords
question
revised
answer
generation
question sentence
Prior art date
Application number
PCT/JP2019/017805
Other languages
French (fr)
Japanese (ja)
Inventor
Atsushi Otsuka
Kyosuke Nishida
Itsumi Saito
Kosuke Nishida
Hisako Asano
Junji Tomita
Original Assignee
Nippon Telegraph and Telephone Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2018214187A external-priority patent/JP7087938B2/en
Application filed by Nippon Telegraph and Telephone Corporation
Priority to US16/972,187 priority Critical patent/US11972365B2/en
Publication of WO2019235103A1 publication Critical patent/WO2019235103A1/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • G06F16/90332Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/047Probabilistic or stochastic networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present invention relates to a question generation device, a question generation method, and a program.
  • A machine reading comprehension question answering technique is known that, for a question input in natural language, extracts the part serving as the answer from a document written in natural language (for example, Non-Patent Document 1).
  • Machine reading comprehension question answering technology uses a neural network to match the question against the answer part described in a manual or other document, and is known to be able to achieve answer accuracy equal to or better than that of humans.
  • In order to achieve high answer accuracy with machine reading comprehension question answering technology, the question content must be clear and the information necessary for answering must be included in the question without omission.
  • In an actual service using machine reading comprehension question answering technology, however, the question content may be ambiguous or the question sentence may be too short. In such cases, the answer to the question may not be uniquely determined or the answer content may be wrong, so high answer accuracy may not be achieved.
  • An embodiment of the present invention has been made in view of the above points, and an object thereof is to achieve high accuracy in answering a question.
  • A question generation device according to one embodiment receives as input a question sentence and a related document that includes the answer to the question sentence, and includes generating means for generating, using a machine learning model trained in advance, a revised question sentence in which a potentially missing part of the question sentence is supplemented with words included in a predetermined vocabulary set.
  • The first embodiment describes a question generation device 100 that generates a revised question (RQ: Revised Question) for an input question sentence (hereinafter also simply referred to as an "input question"), for the purpose of improving the answer accuracy of question answering that uses machine reading comprehension question answering technology.
  • The revised question is a question sentence with more specific content that reinforces the question content of the input question. That is, the revised question is a question whose content is clear and which includes, without omission, the information necessary for the answer.
  • By first generating the revised question of a question and then performing the question answering task using the revised question, the answer accuracy of the question answering can be improved.
  • Each embodiment described below is only an example, and the forms to which the present invention can be applied are not limited to the following embodiments.
  • The technology according to each embodiment of the present invention can be used, for example, for a service that provides an answer to a question input by a user in natural language; however, the target of use is not limited to this, and the technology can be used for various targets.
  • In the first embodiment, when an input question and a document related to the input question (hereinafter also referred to as a "related document") are given, the question generation device 100 generates a revised question.
  • The revised question of the input question is generated using a machine learning model (hereinafter also referred to as a "revised question generation model").
  • More specifically, the revised question generation model is used to match the input question against the related document, and a potentially missing part of the input question (a character string such as a word or phrase) is supplemented to generate the revised question.
  • Since the revised question is generated using the related document, it is possible, for example, to generate a revised question that can be answered by the system performing the question answering task (in other words, to prevent a revised question that the system performing the question answering task cannot answer from being generated).
  • When learning the revised question generation model, an input question used as correct answer data, a question in which a part of the input question is missing (also referred to as a "missing question"), and a related document are used.
  • In this learning, the parameters of the revised question generation model are updated so that the natural sentence obtained using the missing question and the related document approaches the input question that is the correct answer data.
  • A missing question is a question sentence related to the input related document in which part of the necessary information (a character string such as a word or phrase) is missing.
  • A natural sentence is a sentence written in a natural language.
  • The input question is a sentence written in a natural language (that is, a natural sentence), and is assumed to be a sequence of J word tokens Q = {q_0, q_1, ..., q_{J-1}}.
  • The sentence used as the input question may be, for example, a sentence in which keywords are simply listed, in addition to a natural sentence, or a sentence obtained as a speech recognition result.
  • the related document includes information serving as an answer to the input question. Examples of the related document include a manual in which an answer to an input question is described.
  • the related document is also referred to as a passage.
  • FIG. 1 is a diagram illustrating an example of a functional configuration of a question generating device 100 at the time of generating a revised question in the first embodiment of the present invention.
  • the question generation device 100 for generating a revised question in the first embodiment of the present invention includes a revised question generation unit 200.
  • the revised question generation unit 200 is realized by a learned revised question generation model (that is, a revised question generation model using parameters updated by a revised question generation model learning unit 400 described later).
  • The revised question generation unit 200 takes a question (input question) and a related document as input, and generates and outputs a revised question. More specifically, the revised question generation unit 200 generates the revised question by regarding the input question as a missing question and restoring, using the related document, the question sentence as it was before the loss.
  • the revised question generation unit 200 includes a collation unit 210 and a question restoration unit 220.
  • The matching unit 210 generates matching information between the input question and the related document.
  • The matching information is information representing the matching relationship between each word included in the input question and each word included in the related document.
  • The question restoration unit 220 uses the matching information generated by the matching unit 210, the input question, and the related document to generate (restore) a natural sentence corresponding to the question sentence as it was before the input question was subject to loss.
  • The natural sentence generated by the question restoration unit 220 becomes the revised question.
  • FIG. 2 is a diagram illustrating an example of a functional configuration of the question generation device 100 during learning according to the first embodiment of the present invention.
  • the question generation device 100 at the time of learning in the first embodiment of the present invention includes a missing question creation unit 300 and a revised question generation model learning unit 400.
  • The missing question creation unit 300 creates a missing question by taking a question (input question) as input and deleting a part of the input question.
  • the revised question generation model learning unit 400 learns a revised question generation model using the missing question created by the missing question creation unit 300, the input question, and the related document. Then, the revised question generation model learning unit 400 outputs the learned parameters of the revised question generation model.
  • The revised question generation model learning unit 400 includes a matching unit 210, a question restoration unit 220, and a parameter update unit 410.
  • the collation unit 210 and the question restoration unit 220 are as described above.
  • The parameter update unit 410 calculates the error between the natural sentence (revised question) generated by the question restoration unit 220 and the input question, and uses this error to update the parameters of the revised question generation model (parameters not yet learned) by an arbitrary optimization method.
  • The revised question generation model is trained by the parameter update unit 410 updating its parameters.
  • the revised question generation model is a machine learning model realized by a neural network.
  • all or part of the revised question generation model may be realized by a machine learning model other than the neural network.
  • at least one functional unit of the matching unit 210 and the question restoration unit 220 may be realized by a machine learning model other than the neural network.
  • FIG. 3 is a diagram illustrating an example of a hardware configuration of the question generation device 100 according to the first embodiment of the present invention.
  • The question generation device 100 includes an input device 501, a display device 502, an external I/F 503, a RAM (Random Access Memory) 504, a ROM (Read Only Memory) 505, an arithmetic device 506, a communication I/F 507, and an auxiliary storage device 508.
  • Each of these hardware components is connected via a bus B so as to be able to communicate with one another.
  • the input device 501 is, for example, a keyboard, a mouse, a touch panel, or the like, and is used by a user to input various operations.
  • the display device 502 is a display or the like, for example, and displays a processing result (for example, a revised question or the like) of the question generation device 100.
  • the question generation device 100 may not include at least one of the input device 501 and the display device 502.
  • External I / F 503 is an interface with an external device.
  • the external device includes a recording medium 503a and the like.
  • The question generation device 100 can read from and write to the recording medium 503a and the like via the external I/F 503.
  • the recording medium 503a may store one or more programs that realize each functional unit included in the question generation device 100.
  • Examples of the recording medium 503a include a flexible disk, a CD (Compact Disc), a DVD (Digital Versatile Disk), an SD memory card (Secure Digital memory card), a USB (Universal Serial Bus) memory card, and the like.
  • the RAM 504 is a volatile semiconductor memory that temporarily stores programs and data.
  • the ROM 505 is a nonvolatile semiconductor memory that can retain programs and data even when the power is turned off.
  • the ROM 505 stores, for example, settings related to an OS (Operating System), settings related to a communication network, and the like.
  • the computing device 506 is, for example, a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit), and reads a program or data from the ROM 505, the auxiliary storage device 508, or the like onto the RAM 504 and executes processing.
  • Each functional unit included in the question generation device 100 is realized, for example, by processing in which the arithmetic device 506 executes one or more programs stored in the auxiliary storage device 508.
  • the question generation device 100 may include both the CPU and the GPU as the arithmetic device 506, or may include only one of the CPU and the GPU.
  • the communication I / F 507 is an interface for connecting the question generating device 100 to a communication network.
  • One or more programs that realize each functional unit included in the question generation device 100 may be acquired (downloaded) from a predetermined server device or the like via the communication I / F 507.
  • The auxiliary storage device 508 is, for example, an HDD (Hard Disk Drive) or an SSD (Solid State Drive), and is a non-volatile storage device that stores programs and data.
  • the programs and data stored in the auxiliary storage device 508 include, for example, an OS and one or more programs that realize each functional unit included in the question generation device 100.
  • The question generation device 100 according to the first embodiment of the present invention has the hardware configuration shown in FIG. 3. Although the example illustrated in FIG. 3 describes the case where the question generation device 100 is realized by one device (computer), the present invention is not limited to this.
  • The question generation device 100 according to the first embodiment of the present invention may be realized by a plurality of devices (computers). Further, the question generation device 100 according to the first embodiment of the present invention may be realized by a device (computer) including a plurality of arithmetic devices 506 and a plurality of memories (RAM 504, ROM 505, auxiliary storage device 508, etc.).
  • FIG. 4 is a flowchart showing an example of a revision question generation process in the first embodiment of the present invention.
  • In the following description of the revised question generation process, it is assumed that the revised question generation model that realizes the revised question generation unit 200 has already been trained.
  • FIG. 5 shows an example of a revised question generation model for realizing the revised question generation unit 200 in the first embodiment of the present invention.
  • The revised question generation model is a neural network composed of three layers: an Encode Layer, a Matching Layer, and a Decode Layer.
  • The matching unit 210 is realized by the Encode Layer and the Matching Layer.
  • The question restoration unit 220 is realized by the Decode Layer.
  • The Encode Layer and the Decode Layer are layers based on the Seq2Seq language generation model.
  • The Matching Layer is a layer based on the Attention Flow Layer and the Modeling Layer used in machine reading comprehension tasks.
  • For details of Seq2Seq, refer to, for example, Reference Documents 1 and 2 below.
  • For details of the machine reading comprehension task, refer to, for example, Reference Document 3 below.
  • (Step S102) The matching unit 210 of the revised question generation unit 200 generates, in the following steps S102-1 to S102-4, the matching information: a hidden state vector h_d0 serving as the initial state of the Decoder, and a matching matrix M, which is a matching model used in machine reading comprehension tasks.
  • (Step S102-1) First, as the Word Embedding processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 converts the related document X and the input question Q into d-dimensional word vector sequences. That is, the matching unit 210 vectorizes each word token constituting the related document X and the input question Q to create word vector sequences.
  • Hereinafter, the word vector sequence of the related document X is also represented by X, and the word vector sequence of the input question Q is also represented by Q.
  • Here, the word vector sequences X and Q are generated from the input question Q and the related document X that are input; however, the present invention is not limited to this, and the word vector sequences X and Q may themselves be given as input.
  • (Step S102-2) Next, as the Passage Context processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 encodes the word vector sequence X with an RNN (Recurrent Neural Network) to obtain the context matrix H ∈ R^{2d×T} of the related document X. The column vector consisting of the elements of the t-th column of the context matrix H is denoted the context vector H_t.
  • Similarly, as the Question Context processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 encodes the word vector sequence Q with an RNN to obtain the context matrix U ∈ R^{2d×J} of the input question Q. The column vector consisting of the elements of the j-th column of the context matrix U is denoted the context vector U_j.
  • The RNN used for the Passage Context and Question Context processing may be, for example, a bi-RNN, an LSTM (Long Short Term Memory), a bi-LSTM, or the like.
  • A common set of parameters is used for the RNN used for the Passage Context processing and the RNN used for the Question Context processing.
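  • As a concrete illustration of steps S102-1 and S102-2, the following is a minimal PyTorch sketch of the Encode Layer (the module names, dimensions, and data are illustrative assumptions, not the patent's implementation): a shared embedding and a shared bi-LSTM encode the related document X and the input question Q into the context matrices H and U.

```python
import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    """Encode Layer sketch: Word Embedding + a bi-LSTM shared between
    the related document X and the input question Q (hypothetical sizes)."""
    def __init__(self, vocab_size: int, d: int):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d)
        # One set of parameters for both Passage Context and Question Context.
        self.rnn = nn.LSTM(d, d, bidirectional=True, batch_first=True)

    def forward(self, x_ids, q_ids):
        # x_ids: (1, T), q_ids: (1, J) word-token id sequences
        H, _ = self.rnn(self.embed(x_ids))   # (1, T, 2d) context of X
        U, _ = self.rnn(self.embed(q_ids))   # (1, J, 2d) context of Q
        return H, U

enc = SharedEncoder(vocab_size=1000, d=8)
H, U = enc(torch.randint(0, 1000, (1, 12)), torch.randint(0, 1000, (1, 5)))
print(H.shape, U.shape)  # torch.Size([1, 12, 16]) torch.Size([1, 5, 16])
```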
  • (Step S102-3) Next, as the processing of the Matching Layer of the revised question generation model shown in FIG. 5, the matching unit 210 generates the hidden state vector h_d0 serving as the initial state of the Decoder, as follows.
  • First, using the attention mechanism, the matching unit 210 computes from the context vector U_{J-1} and the context matrix H the attention vector H~_U ∈ R^{2d} with the related document X by equations (1) and (2):

    a_t = softmax_t(U_{J-1}^T H)    (1)
    H~_U = Σ_t a_t H_t              (2)

  • For convenience of notation, X with "~" attached above it (that is, X accented with a tilde) is written as "X~". The superscript T represents transposition, and softmax_t represents the t-th output of the softmax function. Note that the "U" attached to H~_U in equation (2) is a label, not an index.
  • Similarly, using the attention mechanism, the matching unit 210 computes from the context vector U_{J-1} and the context matrix U the attention vector U~_U ∈ R^{2d} with the input question Q by equations (3) and (4):

    b_j = softmax_j(U_{J-1}^T U)    (3)
    U~_U = Σ_j b_j U_j              (4)

  • softmax_j represents the j-th output of the softmax function, and the "U" attached to U~_U in equation (4) is likewise a label, not an index.
  • Then, using the two attention vectors H~_U and U~_U computed by equations (2) and (4), the matching unit 210 computes the hidden state vector h_d0 serving as the initial state of the Decoder by equation (5):

    h_d0 = f(W_m^T [H~_U ; U~_U] + b_m)    (5)

  • W_m ∈ R^{4d×2d} and b_m ∈ R^{2d} are parameters, f is an activation function (for example, Leaky ReLU), and [;] represents concatenation.
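  • The computation of the Decoder's initial state in step S102-3 might be sketched as follows (a hedged reading of equations (1) to (5); the dot-product score function and the tensor shapes are assumptions):

```python
import torch
import torch.nn.functional as F

d = 8
H = torch.randn(12, 2 * d)   # context matrix of X, one row per word (T=12)
U = torch.randn(5, 2 * d)    # context matrix of Q, one row per word (J=5)

def attend(query, keys):
    """Attention vector: softmax over dot-product scores, then weighted sum."""
    scores = keys @ query                   # (len,)
    return F.softmax(scores, dim=0) @ keys  # (2d,)

u_last = U[-1]              # context vector U_{J-1}
H_u = attend(u_last, H)     # attention vector with the related document, eqs. (1)-(2)
U_u = attend(u_last, U)     # attention vector with the input question, eqs. (3)-(4)

W_m = torch.randn(4 * d, 2 * d)   # parameters W_m and b_m as in eq. (5)
b_m = torch.randn(2 * d)
h_d0 = F.leaky_relu(torch.cat([H_u, U_u]) @ W_m + b_m)  # eq. (5)
print(h_d0.shape)  # torch.Size([16])
```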
  • (Step S102-4) Next, as the processing of the Matching Layer of the revised question generation model shown in FIG. 5, the matching unit 210 generates the matching matrix M as follows.
  • First, the matching unit 210 inputs the context matrix H (sequence length T) and the context matrix U (sequence length J) to the Attention Layer. As the processing of the Attention Layer, the matching unit 210 computes the similarity matrix S between the words of the related document X and the words of the input question Q.
  • Using S, the matching unit 210 computes attention in two directions: attention from the related document X to the input question Q, and attention from the input question Q to the related document X.
  • For the first direction, the matching unit 210 computes, for each word of the related document X, an attention vector weighted by the words of the input question Q. That is, by equations (7) and (8), the matching unit 210 obtains the attention vector corresponding to the t-th word of the related document X.
  • For the second direction, the matching unit 210 computes an attention vector weighted toward the words of the related document X that are strongly related to some word of the input question Q, and then arranges this attention vector into a matrix tiled for the sequence length T of the related document X. That is, by equations (9) and (10), the matching unit 210 obtains the attention vector, and forms the matrix in which T copies of the attention vector computed by equation (10) are arranged.
  • Further, using the attention vector H~_H ∈ R^{2d} obtained by taking self-attention between the context vector H_{T-1} and the context matrix H, the matching unit 210 computes the attention matrix G by equation (11).
  • The matching unit 210 may also compute the attention matrix G without using the attention vector H~_H (that is, without concatenating the attention vector H~_H in equation (11)).
  • The attention matrix G satisfies G ∈ R^{8d×T}.
  • Finally, as the processing of the Matching Model of the revised question generation model shown in FIG. 5, the matching unit 210 inputs the attention matrix G computed by equation (11) to an RNN to obtain the matching matrix M ∈ R^{2d×T}.
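  • A rough sketch of the Matching Layer processing of step S102-4, in the spirit of the Attention Flow Layer of Reference Document 3 (the similarity function and the exact composition of G in equation (11) are assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

d, T, J = 8, 12, 5
H = torch.randn(T, 2 * d)          # context matrix of the related document X
U = torch.randn(J, 2 * d)          # context matrix of the input question Q

S = H @ U.T                        # similarity matrix S (T, J); dot-product score assumed
U_tilde = F.softmax(S, dim=1) @ U  # X-to-Q attention: one vector per document word (T, 2d)
h_tilde = F.softmax(S.max(dim=1).values, dim=0) @ H   # Q-to-X attention vector (2d,)
H_tilde = h_tilde.expand(T, -1)    # tiled for the sequence length T

h_self = F.softmax(H @ H[-1], dim=0) @ H   # self-attention vector H~_H (2d,)
H_self = h_self.expand(T, -1)

G = torch.cat([H, U_tilde, H_tilde, H_self], dim=1)   # attention matrix G (T, 8d)
match_rnn = nn.LSTM(8 * d, d, bidirectional=True, batch_first=True)
M, _ = match_rnn(G.unsqueeze(0))   # matching matrix M (1, T, 2d)
print(M.shape)  # torch.Size([1, 12, 16])
```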
  • As described above, in step S102, the hidden state vector h_d0 serving as the initial state of the Decoder and the matching matrix M, which is a matching model used in machine reading comprehension tasks, are generated as the matching information.
  • However, any format such as a vector, a matrix, or a tensor may be used as the expression format of the matching information.
  • For example, a bag-of-words vector in which the elements for words matched between the input question Q and the related document X are 1 and the other word elements are 0 may be used, or information that takes into account the positions at which the words appear in the related document X may be used.
  • However, if the matching information is expressed only as a scalar value such as a similarity score, the information about which parts of the input question Q and the related document X match is lost; the matching information is therefore preferably not a scalar value.
  • (Step S103) The question restoration unit 220 of the revised question generation unit 200 uses the matching information (the hidden state vector h_d0 and the matching matrix M) generated by the matching unit 210, the input question Q, and the related document X to generate, in the following steps S103-1 to S103-7, the natural sentence that becomes the revised question RQ.
  • The word y_0 is assumed to be the token <BOS>, which indicates the beginning of a sentence.
  • Steps S103-1 to S103-7 below describe the generation of the word y_s for a given s.
  • (Step S103-1) First, as the initial state of the Decoder, the hidden state vector h_d0 calculated in step S102-3 is used.
  • (Step S103-2) Next, as the processing of the Decode Layer of the revised question generation model shown in FIG. 5, the question restoration unit 220 uses the attention mechanism to compute, by equations (12) to (15), the input z~_s ∈ R^{3d} to the LSTM that serves as the Decoder.
  • W_d ∈ R^{2d×3d} and b_d ∈ R^{2d} are parameters, and f is an activation function.
  • M_t ∈ R^{2d} is the column vector consisting of the elements of the t-th column of the matching matrix M.
  • (Step S103-3) Next, the question restoration unit 220 updates the hidden state h_ds of the Decoder according to equation (16).
  • (Step S103-4) Next, as the Decoder processing in the Decode Layer, the question restoration unit 220 inputs z~_s obtained by equation (15) to the LSTM and computes a softmax function. As a result, the generation probability distribution P_G(y_s | y_<s, X, Q) is obtained.
  • P_G(y_s | y_<s, X, Q) is the distribution of the conditional probability that, given that the first s-1 words y_<s have been generated, a word included in a specific vocabulary set determined in advance is generated as the s-th word y_s.
  • The specific vocabulary set is, for example, a set composed of words that appear frequently in general documents.
  • (Step S103-5) Next, as processing in the Decode Layer, the question restoration unit 220 uses the weights α_st obtained by equation (13) and the softmax function to compute, by equation (17), the generation probability P_C(y_s | y_<s, X, Q) that a word included in the related document X is generated as the s-th word y_s.
  • P_C(y_s | y_<s, X, Q) is an application of the concept of CopyNet.
  • CopyNet is a neural network model that makes it easy to generate (copy) an encoded word as-is by supplying the word generation probability from outside the LSTM output.
  • Introducing the generation probability P_C(y_s | y_<s, X, Q) makes it easier for a word included in the related document X to be generated as the s-th word y_s.
  • For details of CopyNet, refer to, for example, Reference Documents 5 and 6 below.
  • (Step S103-6) Next, using the weight λ_s, the question restoration unit 220 computes the final generation probability P(y_s | y_<s, X, Q) of the word y_s by equation (18):

    P(y_s | y_<s, X, Q) = λ_s P_C(y_s | y_<s, X, Q) + (1 - λ_s) P_G(y_s | y_<s, X, Q)    (18)

  • W_λ ∈ R^{1×2d} and b_λ ∈ R^{1} are parameters, and σ is the sigmoid function.
  • That is, the final generation probability P(y_s | y_<s, X, Q) is a weighted average of P_G(y_s | y_<s, X, Q) and P_C(y_s | y_<s, X, Q). Whether a word included in the related document X is copied as y_s is therefore governed by the weight λ_s.
  • (Step S103-7) Next, the question restoration unit 220 generates the word y_s based on the final generation probability P(y_s | y_<s, X, Q).
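  • Putting steps S103-2 to S103-7 together, one decoding step with the copy mechanism might be sketched as follows (the vocabulary handling, the attention input z~_s, and all shapes are simplified assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

d, T, V = 8, 12, 1000            # V: size of the specific vocabulary set
M = torch.randn(T, 2 * d)        # matching matrix, one row per document word
x_ids = torch.randint(0, V, (T,))   # word ids of the related document X

cell = nn.LSTMCell(3 * d, 2 * d)    # Decoder; input z~_s is 3d-dimensional
W_out = nn.Linear(2 * d, V)         # projection to the vocabulary distribution
W_lam = nn.Linear(2 * d, 1)         # copy-gate parameters (W_lambda, b_lambda)

h = torch.zeros(1, 2 * d)           # in practice h_{d0} from the Matching Layer
c = torch.zeros(1, 2 * d)
z_s = torch.randn(1, 3 * d)         # stand-in for the attention input z~_s

h, c = cell(z_s, (h, c))                          # eq. (16): update hidden state
alpha = F.softmax(M @ h.squeeze(0), dim=0)        # attention weights over X
P_G = F.softmax(W_out(h), dim=1).squeeze(0)       # generation distribution P_G
P_C = torch.zeros(V).scatter_add(0, x_ids, alpha) # copy distribution P_C, eq. (17)
lam = torch.sigmoid(W_lam(h)).squeeze()           # copy gate lambda_s
P = lam * P_C + (1 - lam) * P_G                   # final distribution, eq. (18)
y_s = int(P.argmax())                             # generate word y_s
print(y_s, float(P.sum()))  # P sums to ~1
```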
  • the revised question RQ is output by the revised question generation unit 200 to a predetermined output destination.
  • Examples of the predetermined output destination include the display device 502, the auxiliary storage device 508, and other programs (for example, a program that executes a question answering task).
  • As described above, the revised question RQ is created by adding information in the related document X to the input question Q.
  • With a generation model that does not use matching information, such as a plain Encoder-Decoder model, a revised question RQ that has little relation to the related document X or the input question Q may be generated.
  • In contrast, in the first embodiment, because the input question Q regarded as a missing question is matched against the related document X, a revised question RQ related to the related document X can be generated.
  • In step S103-7, one word y_s is generated for each s.
  • However, the present invention is not limited to this, and a plurality of words y_s may be generated for a certain s (or for every s), for example by using a beam search.
  • A beam search is a kind of search algorithm, such as a breadth-first search of a graph.
  • When using a beam search, the question restoration unit 220 generates, for example, B words y_s (B being the beam width) for each s.
  • The question restoration unit 220 can then generate revised questions RQ of a plurality of variations by using the beam search to output the top q candidates in order of generation score.
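  • A generic beam-search sketch, independent of the model (step_fn is a hypothetical function returning (token, log-probability) pairs for the next word):

```python
import math

def beam_search(step_fn, bos, eos, beam_width=3, max_len=20):
    """Keep the best `beam_width` partial sequences at each step."""
    beams = [([bos], 0.0)]                     # (token sequence, log score)
    finished = []
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            for tok, logp in step_fn(seq):     # log-probs of next tokens
                candidates.append((seq + [tok], score + logp))
        candidates.sort(key=lambda b: b[1], reverse=True)
        beams = []
        for seq, score in candidates[:beam_width]:
            (finished if seq[-1] == eos else beams).append((seq, score))
        if not beams:
            break
    return sorted(finished + beams, key=lambda b: b[1], reverse=True)

# Toy step function: always proposes two continuations, then end-of-sentence.
toy = lambda seq: [(1, math.log(0.6)), (2, math.log(0.4))] if len(seq) < 3 else [(0, 0.0)]
print(beam_search(toy, bos=9, eos=0, beam_width=2))
```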
  • In step S103-7, the case has been described in which y_0 is <BOS> and the revised question RQ is generated in order from the word at the beginning of the sentence. However, the present invention is not limited to this; for example, the revised question RQ may be generated in order from the word at the end of the sentence, with y_0 as <EOS>.
  • In the revised question generation process, a revised question RQ that compensates for part of the deficiencies of the input question Q regarded as a missing question may be generated, or a revised question RQ that compensates for all of the deficiencies of the input question Q may be generated.
  • Hereinafter, generating a revised question RQ that compensates for part of the deficiencies of the input question Q is referred to as "partial generation", and generating a revised question RQ that compensates for all of the deficiencies of the input question Q is referred to as "whole generation".
  • For example, suppose that the question whose content is clear and which lacks none of the information necessary for answering (hereinafter, such a question is referred to as the "whole question") is "What is the charge when plan A is canceled midway?" and the input question Q is "What is the charge?".
  • Whether the revised question generation process is partial generation or whole generation is determined by the learning data set used in the learning process of the revised question generation model, and is chosen according to the question answering task in which the revised question is used.
  • The learning data set is a set of learning data, each item of which is represented by a pair of an input question Q used as correct answer data and a related document X.
  • Further, to each word of the input question Q used as correct answer data, a label is given that is 1 if the word is included in the related document X and 0 otherwise.
  • Hereinafter, the input question Q used as correct answer data is denoted the "correct answer question Q_true".
  • FIG. 6 is a flowchart showing an example of the learning process of the revised question generation model in the first embodiment of the present invention.
  • In the learning process, the learning data set is divided into a predetermined number of mini-batches, and the parameters of the revised question generation model are updated for each mini-batch.
  • The following steps S201 to S204 are repeatedly executed using each item of learning data included in the mini-batch.
  • The following steps S205 to S206 are executed after steps S201 to S204 have been executed for all the learning data included in the mini-batch.
  • (Step S201) The missing question creation unit 300 receives as input the correct answer question Q_true included in the learning data. Further, the revised question generation model learning unit 400 receives as input the correct answer question Q_true and the related document X included in the learning data.
  • (Step S202) Next, the missing question creation unit 300 creates a question Q in which a part of the correct answer question Q_true is missing (a missing question Q).
  • When a plurality of missing questions Q can be created from one correct answer question Q_true, the missing question creation unit 300 may create all of these missing questions Q, or only some of them (including one).
  • For example, if the missing questions "Tell me the fee" and "Tell me" can be created, the missing question creation unit 300 may create missing questions Q for both "Tell me the fee" and "Tell me", or may create a missing question Q for only one of them.
  • For example, suppose the whole question is "What is the fee for canceling Plan A halfway?". With partial generation, the revised question RQ "What is the fee for canceling midway?" is generated from the input question Q "What is the fee?".
  • Next, the revised question RQ "What is the fee for canceling plan A halfway?" is generated from the input question Q "What is the fee for canceling midway?".
  • Finally, the revised question RQ "<BOS>" is generated from the input question Q "What is the fee for canceling plan A halfway?".
  • Generation of <BOS> indicates that there is no further clause that can be added (generated). From this, it can be determined that the second revised question RQ "What is the fee for canceling Plan A halfway?" is the whole question.
  • Any method can be used for creating the missing question Q. For example, the missing question Q can be created using the result of syntactic analysis of the correct answer question Q_true, such as dependency analysis or phrase structure analysis.
  • The granularity of the portion to be deleted from the correct answer question Q_true can be set arbitrarily.
  • For example, the missing question creation unit 300 creates the missing question Q "What is the fee when canceling midway?" by deleting the first clause at the beginning of the correct answer question Q_true, and the missing question Q "What is the fee?" by deleting the first two clauses at the beginning of the correct answer question Q_true.
  • As another method for creating a missing question Q, for example, two arbitrary clauses having a dependency relation may be extracted from the correct answer question Q_true, and the sentence obtained by combining the two extracted clauses according to the dependency relation may be used as the missing question Q. At this time, if the correct answer question Q_true contains a clause that has a dependency relation with the obtained missing question Q, the sentence obtained by combining the missing question Q and that clause may be used as a new missing question Q.
  • In general, the missing question Q may be created by performing phrase structure analysis, dependency analysis, or the like, and deleting clauses or words based on the analysis result.
  • For example, when the correct answer question Q_true is in English, a missing question Q in which the phrase structure below a noun phrase (NP) is missing may be created.
  • It is preferable that the missing question creation unit 300 does not create a missing question Q in which the syntactic information of the correct answer question Q_true is destroyed. For example, if the correct answer question Q_true is "Tell me the price for plan A" and the result of dependency analysis is used, it is preferable not to create a missing question Q such as "Tell me about plan A" that does not follow the dependency relations.
  • Alternatively, the missing question creation unit 300 may create the missing question Q by pattern matching, for example.
  • In this case, the missing position in the correct answer question Q_true is determined using a predetermined expression as a marker. Specifically, for example, it is conceivable to use the expression "in the case of" as the marker. In this case, if the correct answer question Q_true is "In the case of contracts of less than 2 years, what is the penalty?", the missing question Q "What is the penalty?" can be created.
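  • A toy sketch of this pattern-matching approach (the marker handling and the English example sentence are illustrative; actual use would operate on the original Japanese text):

```python
import re

def make_missing_question(q_true: str, marker: str = "in the case of") -> str:
    """Create a missing question by deleting the condition clause that the
    marker expression introduces (clause assumed to end at the next comma)."""
    pattern = re.escape(marker) + r"[^,]*,\s*"
    m = re.search(pattern, q_true, flags=re.IGNORECASE)
    if m is None:
        return q_true          # marker absent: nothing to delete
    rest = q_true[:m.start()] + q_true[m.end():]
    return (rest[0].upper() + rest[1:]) if rest else q_true

q_true = "In the case of contracts of less than 2 years, what is the penalty?"
print(make_missing_question(q_true))   # -> "What is the penalty?"
```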
  • (Step S203) The matching unit 210 of the revised question generation model learning unit 400 generates the matching information. This step S203 is the same as step S102 with the input question Q in step S102 of FIG. 4 replaced by the missing question Q, so its description is omitted.
  • (Step S204) The question restoration unit 220 of the revised question generation model learning unit 400 generates a revised question RQ. This step S204 is the same as step S103 with the input question Q in step S103 of FIG. 4 replaced by the missing question Q, so its description is omitted.
  • (Step S205) The parameter update unit 410 of the revised question generation model learning unit 400 calculates the error between the revised question RQ generated using each item of learning data included in the mini-batch and the correct answer question Q_true included in that learning data. For example, cross-entropy may be used as the error function; the error function is determined appropriately according to the revised question generation model.
  • (Step S206) The parameter update unit 410 of the revised question generation model learning unit 400 updates the parameters of the revised question generation model using the errors calculated in step S205. That is, for example, the parameter update unit 410 computes the partial differential values of the error function by back-propagation using the errors calculated in step S205 and updates the parameters with them. The revised question generation model is thereby trained.
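  • The mini-batch procedure of steps S201 to S206 can be summarized by the following training-loop skeleton (the model and data are stand-ins; only the loop structure mirrors the steps):

```python
import torch
import torch.nn as nn

# Stand-in model and data: the real inputs are (missing question, related document)
# pairs with the correct answer question Q_true as the target.
model = nn.Linear(4, 4)
optimizer = torch.optim.Adam(model.parameters())
loss_fn = nn.CrossEntropyLoss()          # cross-entropy, as suggested in step S205
dataset = [(torch.randn(4), torch.tensor(1)) for _ in range(32)]

batch_size = 8
for epoch in range(2):
    for i in range(0, len(dataset), batch_size):   # one mini-batch at a time
        batch = dataset[i:i + batch_size]
        xs = torch.stack([x for x, _ in batch])
        ys = torch.stack([y for _, y in batch])
        logits = model(xs)                 # steps S203-S204: generate revised question
        loss = loss_fn(logits, ys)         # step S205: error vs. correct question
        optimizer.zero_grad()
        loss.backward()                    # step S206: back-propagation
        optimizer.step()                   # parameter update
print("done")
```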
  • In this learning, the parameters to be learned (denoted θ) are updated so that each word y_s generated with the generation probability P matches the correct answer question Q_true.
  • For the generation probability P of the word y_s to be appropriate, λ_s must also be appropriate, as shown in equation (18). Therefore, in the first embodiment of the present invention, the revised question generation model is trained by multitask learning in which the generation probability P of the word y_s and λ_s are learned simultaneously, and the error function is the sum L(θ) = L_g + L_λ of the error L_g related to the generation probability P of the word y_s and the error L_λ related to λ_s.
  • The parameter θ is updated so that the error function L(θ) is minimized.
  • λ_s taking a value close to 1 indicates that the probability that a word included in the related document X is copied as y_s is high.
  • Here, as described above, to each word of the correct answer question Q_true, a label is given that is 1 if the word is included in the related document X and 0 otherwise.
  • λ_s thus becomes the probability that predicts whether the generated word y_s is a word included in the related document X.
  • When the revised question RQ is generated, the closer λ_s is to 1, the higher the probability is judged to be that the word to be generated is in the related document X, and the generation probability P_C is weighted more strongly.
  • The error L_λ can be calculated using binary cross-entropy, and the error L_g can be calculated using the negative log likelihood.
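  • The multitask objective L(θ) = L_g + L_λ can be written out directly as follows (all shapes and data are illustrative):

```python
import torch
import torch.nn.functional as F

S, V = 6, 1000                      # sequence length, vocabulary size
P = torch.rand(S, V).softmax(dim=1) # final generation probabilities P(y_s | .)
y_true = torch.randint(0, V, (S,))  # words of the correct answer question Q_true
lam = torch.rand(S)                 # predicted copy gates lambda_s
copy_label = torch.randint(0, 2, (S,)).float()  # 1 if word appears in X, else 0

# L_g: negative log likelihood of the correct words under P.
L_g = -torch.log(P[torch.arange(S), y_true]).mean()
# L_lambda: binary cross-entropy between lambda_s and the copy labels.
L_lam = F.binary_cross_entropy(lam, copy_label)
L = L_g + L_lam                     # L(theta) = L_g + L_lambda
print(float(L))
```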
  • The revised question generation model shown in FIG. 7 is a model that does not have a mechanism for calculating the generation probability P_C(y_s | y_<s, X, Q). In this model, the final generation probability is P(y_s | y_<s, X, Q) = P_G(y_s | y_<s, X, Q).
  • The revised question generation model shown in FIG. 8 is a model that, in addition to the model shown in FIG. 7, does not have a Matching Layer. In this model, the attention mechanism calculates the input z~_s to the Decoder using the context matrix H instead of the matching matrix M.
  • FIG. 9 shows a functional configuration of the question generation device 100 that performs the above preprocessing.
  • FIG. 9 is a diagram illustrating a modification of the functional configuration of the question generation device 100 when generating a revised question in the first embodiment of the present invention.
  • the question generation device 100 when generating a revised question may further include a related document search unit 600.
  • The related document search unit 600 receives the input question Q and a document set Y as input, and searches the document set Y for documents (related documents) X related to the input question Q. The related document search unit 600 then outputs the retrieved related documents X to the revised question generation unit 200. This makes it easy to obtain the revised question RQ even when only a document set assumed to contain the related document X is available.
  • Any search method can be used by the related document search unit 600. For example, each document in the document set Y may be scored against the input question Q, and the N' documents with the highest scores used as the related documents X.
  • The value of N' is set arbitrarily; for example, a value of about 1 to 10 is conceivable.
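  • One conventional choice for such scoring is TF-IDF with cosine similarity; the following is a sketch using scikit-learn (the corpus and query are toy stand-ins):

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

documents = [
    "Plan A can be canceled midway for a fee of 10 dollars.",
    "The special discount applies to contracts of two years or more.",
    "Opening hours are from 9 am to 5 pm.",
]
query = "What is the fee?"

vec = TfidfVectorizer()
D = vec.fit_transform(documents)            # score every document in the set Y
q = vec.transform([query])
scores = cosine_similarity(q, D).ravel()
n_prime = 2                                 # N': number of documents to keep
top = scores.argsort()[::-1][:n_prime]
print([documents[i] for i in top])          # highest-scoring related documents X
```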
  • the question generation device 100 when generating a revised question may further include a display control unit 700.
  • The display control unit 700 displays the related documents X retrieved by the related document search unit 600 and the revised questions RQ generated by the revised question generation unit 200 from those related documents X and the input question Q.
  • For example, if two related documents X_1 and X_2 are obtained from the document set Y, the revised question generation unit 200 obtains a revised question RQ_1 using the input question Q and the related document X_1, and a revised question RQ_2 using the input question Q and the related document X_2.
  • In this case, the related document search unit 600 of the question generation device 100 searches the document set Y for a plurality of related documents X (related documents X_1 and X_2). Then, the display control unit 700 of the question generation device 100 displays to the user the revised question RQ_1 "I want to know the price of Plan A" generated by the revised question generation unit 200 from the related document X_1 and the input question Q, together with a link to the related document X_1, and the revised question RQ_2 "I want to know the price when the special discount is applied" generated by the revised question generation unit 200 from the related document X_2 and the input question Q, together with a link to the related document X_2 (S12). As a result, even when the user asks an ambiguous question (input question Q), the question generation device 100 can present to the user a plurality of revised questions RQ and links to the related documents X respectively related to those revised questions RQ.
  • Alternatively, a plurality of revised questions RQ and related documents X may be presented in order.
  • In this case as well, the related document search unit 600 of the question generation device 100 searches the document set Y for a plurality of related documents X (related documents X_1 and X_2). Then, the display control unit 700 of the question generation device 100 displays, for example, a sentence asking the user to confirm whether the revised question RQ_1 "I want to know the charges for Plan A" matches the user's intention (S22).
  • Next, the display control unit 700 of the question generation device 100 displays, for example, a sentence asking the user to confirm whether the revised question RQ_2 "I want to know the charge when the special discount is applied" matches the user's intention (S24).
  • In this way, the question generation device 100 can interactively present to the user the revised questions RQ and links to the related documents X related to those revised questions RQ.
  • As described above, the question generation device 100 according to the first embodiment uses, for example, a revised question generation model realized by a neural network to generate, from an input question Q that may contain a potential deficiency, a revised question RQ that contains no deficiency. Thereby, for example, when a question answering task is performed using the revised question RQ, the answer accuracy of the question answering task can be improved.
  • At this time, a revised question RQ is generated in which words included in the related document X related to the input question Q are copied.
  • Moreover, the question generation device 100 can generate a plurality of variations of the revised question RQ for one input question Q.
  • For example, for the one input question Q "I want to know the fee", the question generation device 100 can generate variations of the revised question RQ such as "I want to know the fee for Plan A" and "I want to know the fee when the special discount is applied".
  • Therefore, the question generation device 100 can also be applied, for example, to the automatic creation and expansion of an FAQ (Frequently Asked Questions) collection.
  • In the second embodiment, question answering is performed on the input question, and N answers (N being an integer of 1 or more) are generated for the input question.
  • The question generation device 100 then generates a revised question for each of these N answers.
  • The N answers generated by the question answering are candidates for the final answer to the input question (that is, for the answer that the questioner really needs), and are also referred to as "answer candidates".
  • In other words, the input question is refined and made specific so that the answer can be uniquely determined, and a revised question is generated for each answer.
  • For example, "against the dollar" and "against the euro" are each added to the input question "What happened to the yen at 5 pm?", yielding revised question 1 "What happened to the yen against the dollar at 5 pm?" and revised question 2 "What happened to the yen against the euro at 5 pm?".
  • In the second embodiment, the revised question is generated by the following (1) and (2):
  • (1) Question answering is performed on the input question, and N answers (answer candidates) to the input question are generated.
  • (2) For each answer, a revised question for deriving that answer is generated (that is, N revised questions corresponding to the N answers are generated).
  • In the second embodiment, (1) and (2) above can be executed simultaneously, end-to-end, by the revised question generation model realized by a neural network.
  • the revised question generation model is not necessarily realized by a neural network, and all or a part of the revised question generation model may be realized by a machine learning model other than the neural network.
  • the model that performs the question response of (1) above and the model that generates the revised question of (2) above may be prepared separately and used individually or in combination.
  • In the learning of the second embodiment, an input question used as correct answer data, a question in which a part of the input question is missing (that is, a missing question), and a related document are used, and the parameters of the revised question generation model are updated so that the natural sentence obtained using the missing question and the related document approaches the input question that is the correct answer data.
  • In the revised question generation model, as in the first embodiment, the missing question is matched against the related document, and the missing portion is found in the related document and compensated for.
  • In addition, the correct answer to the input question is used as correct answer data, and the parameters of the revised question generation model are updated so that the answer to the input question approaches this correct answer data.
  • FIG. 13 is a diagram illustrating an example of a functional configuration of the question generation device 100 when generating a revised question in the second embodiment of the present invention.
  • the question generation device 100 includes a text processing unit 800, a revised question generation unit 900, and an output unit 1000.
  • The text processing unit 800 receives as input an input question and a related document written as natural sentences, and performs preprocessing for inputting the input question and the related document to the revised question generation unit 900. Specifically, the text processing unit 800 converts the input question and the related document written as natural sentences into sets of word tokens (word sequences) by performing, for example, morphological analysis. Note that at least one of the input question and the related document may be a sentence obtained as a speech recognition result.
  • The related document input to the text processing unit 800 may be one or more documents (that is, a set of related documents). In the second embodiment of the present invention, the term "related document" also covers a set of related documents.
  • The word sequence obtained from the input question is also denoted the input question Q.
  • the question generation device 100 may not include the text processing unit 800.
  • The revised question generation unit 900 performs question answering on the input question and generates a revised question corresponding to each answer (answer candidate) obtained by the question answering.
  • the revised question generation unit 900 is realized by a learned revised question generation model (that is, a revised question generation model using parameters updated by a revised question generation model learning unit 1100 described later).
  • the revised question generation unit 900 includes a question response execution unit 910 and a question generation unit 920.
  • The question response execution unit 910 receives the input question Q and the related document X as input, performs question answering, and generates answer candidates for the input question Q from the related document X.
  • The number of answer candidates generated here is not necessarily one; N answer candidates are generated, with N being an integer of 1 or more.
  • In the second embodiment, a method is used in which a description in the related document is extracted as-is to serve as an answer candidate. However, the present invention is not limited to this, and any method can be used as long as it takes a natural-sentence question and an arbitrary document (related document) as input and can obtain a natural-sentence answer.
  • the question generation unit 920 inputs the input question Q, the related document X, and N answer candidates, and generates a revised question RQ that is more detailed and specific than the input question Q. At this time, the question generation unit 920 generates a revised question RQ for each of the N answer candidates (that is, generates N revised questions RQ corresponding to each of the N answer candidates).
  • the question generation unit 920 generates the revised question RQ by adding information that can uniquely identify each answer candidate to the input question Q.
  • For example, information about conditions, such as "in the case of ...", may be described in the vicinity of the answer-candidate information in the related document X. Therefore, by adding such condition information to the input question Q, it is possible to generate a revised question RQ whose answer (answer candidate) can be uniquely determined when the condition is met.
  • Also, a named entity such as a person name or a place name can be useful information for narrowing down the answer candidates, and a revised question RQ in which such expressions are added to the input question Q may be generated.
  • Any method can be adopted for generating the revised question RQ, for discovering the information to be added to the input question Q, and for adding the information to the input question Q, as long as the revised question RQ can be generated by adding information to the input question Q. For example, a method may be used in which, after the "in the case of ..." information described above is found and extracted by pattern matching, the piece of extracted information closest to the answer (answer candidate) is added to the head of the input question Q to generate the revised question RQ.
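  • A toy sketch of that heuristic: find the condition phrase nearest to (and preceding) the answer candidate in the related document, and prepend it to the input question (all strings and the regular expression are illustrative):

```python
import re

def revise_question(input_q: str, related_doc: str, answer: str) -> str:
    """Prepend the condition phrase that precedes the answer candidate."""
    conditions = [(m.start(), m.group(0))
                  for m in re.finditer(r"[Ii]n the case of [^,.]+", related_doc)]
    pos = related_doc.find(answer)
    preceding = [c for c in conditions if c[0] <= pos]
    if pos < 0 or not preceding:
        return input_q                       # no usable condition found
    _, phrase = max(preceding, key=lambda c: c[0])   # nearest preceding condition
    return phrase + ", " + input_q[0].lower() + input_q[1:]

doc = ("In the case of contracts of less than 2 years, the penalty is 50 dollars. "
       "In the case of longer contracts, there is no penalty.")
print(revise_question("What is the penalty?", doc, "50 dollars"))
# -> "In the case of contracts of less than 2 years, what is the penalty?"
```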
  • Alternatively, the revised question RQ may be generated using a sentence generation method based on a neural network.
  • the output unit 1000 outputs N answers (answer candidates) and N revised questions RQ corresponding to each of these N answers. At this time, for example, the output unit 1000 outputs one or more pairs of a certain answer candidate and a revised question RQ corresponding to this answer candidate.
  • As the method for outputting a pair of an answer candidate and its revised question RQ, an arbitrary method can be adopted according to the user interface of the question generation device 100.
  • For example, when the question generation device 100 has a user interface that outputs answers to a screen, as in a search system, a method may be adopted in which candidates for the revised question RQ are displayed as "Maybe: ..." by a suggestion function for the input question Q input by the user (questioner), and, when the user selects one of the candidates, the answer (answer candidate) corresponding to that revised question RQ is displayed.
  • As another example, when the question generation device 100 has a voice interface, a method may be adopted in which, when the input question Q is input by the user, the device utters a confirmation such as "Perhaps you mean XX?" for the revised question RQ corresponding to the most likely answer (answer candidate) (where XX is the question content of the revised question RQ), and utters the answer (answer candidate) corresponding to that revised question RQ when the user agrees. At this time, for example, when the user disagrees with the confirmation utterance, the confirmation utterance may be repeated for the revised question RQ corresponding to the next most likely answer (answer candidate) until the user agrees.
  • Regarding the likelihood of an answer candidate, the question generation device 100 may have a function of calculating the likelihood, or the likelihood may be calculated by the question response execution unit 910 together with the generation of the answer candidate.
  • the output destination of the output unit 1000 is not limited to that described above, and may be, for example, the auxiliary storage device 508, the recording medium 503a, or other devices connected via a network.
  • FIG. 14 is a diagram illustrating an example of a functional configuration of the question generation device 100 during learning according to the second embodiment of the present invention.
  • the question generation device 100 at the time of learning in the second embodiment of the present invention includes a missing question creation unit 300 and a revised question generation model learning unit 1100.
  • The missing question creation unit 300 creates the missing question by taking the input question Q as input and deleting a part of the input question Q, as in the first embodiment.
  • The revised question generation model learning unit 1100 learns the revised question generation model using the missing question created by the missing question creation unit 300, the input question Q, the correct answer A_true to the input question Q, and the related document X. The revised question generation model learning unit 1100 then outputs the learned parameters of the revised question generation model.
  • the revised question generation model learning unit 1100 includes a question response execution unit 910, a question generation unit 920, and a parameter update unit 1110.
  • the question response execution unit 910 and the question generation unit 920 are as described above.
  • The parameter update unit 1110 calculates the error between the natural sentence (revised question RQ) generated by the question generation unit 920 and the input question Q, and the error between the answer to the input question Q produced by the question response execution unit 910 and the correct answer to the input question Q. Then, using these errors, the parameter update unit 1110 updates the parameters of the revised question generation model (parameters not yet learned) by an arbitrary optimization method.
  • The revised question generation model is trained by the parameter update unit 1110 updating its parameters.
  • the hardware configuration of the question generation device 100 according to the second embodiment of the present invention may be the same as that of the first embodiment, and a description thereof will be omitted.
  • FIG. 15 is a flowchart illustrating an example of a revision question generation process according to the second embodiment of the present invention.
  • in the revised question generation process, it is assumed that the revised question generation model realizing the revised question generation unit 900 is implemented as a neural network and has already been trained.
  • the revised question generation model is a neural network composed of a document encoding layer, a question encoding layer, a document/question matching layer, a machine reading modeling layer, a machine reading output layer, an answer vector generation layer, a decode layer, and a revised question word generation layer.
  • the question response execution unit 910 is realized by the document encoding layer, the question encoding layer, the document / question matching layer, the machine reading modeling layer, and the machine reading output layer.
  • the question generation unit 920 is realized by the answer vector generation layer, the decode layer, and the revised question word generation layer.
  • the document encoding layer, the question encoding layer, the document / question matching layer, and the machine reading modeling layer correspond to the matching unit 210 in the first embodiment.
  • the decode layer and the revised question word generation layer correspond to the question restoration unit 220 in the first embodiment.
  • the neural network realizing the revised question generation model in the second embodiment of the present invention is based on an encoder-decoder model, a neural network method for generating natural sentences, and on a machine reading model, a neural network method for generating answers in question answering.
  • in the machine reading model, the description serving as an answer candidate is extracted directly from the related document X (that is, the positions of the start point and the end point of the extracted description are estimated), thereby generating the answer candidate.
  • This machine reading model is composed of a document / question matching layer, a machine reading modeling layer, and a machine reading output layer.
  • Step S301 The text processing unit 800 inputs an input question described in a natural sentence and a related document.
  • Step S302 The text processing unit 800 converts the input question and the related document into word sequences. As described above, it is assumed hereinafter that the input question is converted into the word sequence Q of J word tokens and the related document is converted into the word sequence X of T word tokens.
  • when the input question and the related document are already given as word sequences, step S302 need not be performed.
  • Step S303 The revised question generation unit 900 generates, through the following steps S303-1 to S303-3, the state vectors h_q0 and h_M0 used as the initial states of the decode layer, as the matching information.
  • Step S303-1 First, the question response execution unit 910 of the revised question generation unit 900 inputs the related document X and the input question Q, and performs the processing of the document encoding layer and the question encoding layer of the revised question generation model shown in FIG. 16.
  • in these layers, the related document X and the input question Q are each converted (encoded) into a d-dimensional word vector sequence. That is, the question response execution unit 910 creates word vector sequences by converting each word token constituting the related document X and the input question Q into d-dimensional real vectors.
  • the question response execution unit 910 also outputs the state vector h_q0 obtained when the input question Q is encoded into the d-dimensional word vector sequence.
  • hereinafter, the word vector sequence of the related document X is denoted by H and referred to as the "document vector sequence H", and the word vector sequence of the input question Q is denoted by U and referred to as the "question vector sequence U".
  • the document vector sequence is H ∈ R^{d×T} and the question vector sequence is U ∈ R^{d×J}.
  • any method can be adopted as long as the document vector sequence and the question vector sequence can be generated.
  • for example, a method may be used in which the related document X and the input question Q are input to a word embedding layer (Word Embedding Layer) to convert each word token into a d-dimensional real vector, which is then converted into a word vector sequence by an RNN.
  • in addition, encoding using an attention mechanism (attention) may be performed.
  • since the decode layer uses the state vector h_q0 output from the question encoding layer as an initial state, the state vector h_q0 needs to be generated by some method.
  • in the example above, the state vector h_q0 is generated only in the question encoding layer, but a state vector h_x0 may also be generated in the document encoding layer, and the decode layer may use the state vector h_x0 as an initial state. That is, the decode layer can use either one or both of these state vectors as its initial state; a sketch of these encoding layers follows below.
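As a concrete illustration, the following is a minimal sketch of the document encoding layer and the question encoding layer (step S303-1), assuming PyTorch. Sharing one encoder between the document and the question, the layer sizes, and the use of a BiLSTM are all illustrative assumptions; the patent leaves the concrete encoding method open.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    def __init__(self, vocab_size, d):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d)          # word embedding layer
        # bidirectional RNN; 2 * (d // 2) = d keeps the output d-dimensional
        self.rnn = nn.LSTM(d, d // 2, bidirectional=True, batch_first=True)

    def forward(self, tokens):
        # tokens: (batch, length) word IDs -> (batch, length, d) vector sequence
        vecs, (h_n, _) = self.rnn(self.embed(tokens))
        # concatenate the final forward/backward hidden states as the state vector
        state = torch.cat([h_n[0], h_n[1]], dim=-1)       # (batch, d)
        return vecs, state

d = 128
enc = Encoder(vocab_size=10000, d=d)
X = torch.randint(0, 10000, (1, 300))   # related document, T = 300 tokens
Q = torch.randint(0, 10000, (1, 12))    # input question, J = 12 tokens
H, h_x0 = enc(X)                        # document vector sequence H (and h_x0)
U, h_q0 = enc(Q)                        # question vector sequence U, state h_q0
```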
  • Step S303-2 Next, as the processing of the document/question matching layer of the revised question generation model shown in FIG. 16, the question response execution unit 910 of the revised question generation unit 900 uses the document vector sequence H and the question vector sequence U to find and extract, within the related document X, the information related to the input question Q for machine reading. This finding and extraction is performed by collating the related document X with the input question Q, and yields a matching vector sequence G.
  • any method capable of this collation can be adopted; for example, BiDAF, which uses an attention mechanism, or QANet, which uses a CNN (Convolutional Neural Network), can be employed.
  • Step S303-3 As the processing of the machine reading modeling layer of the revised question generation model shown in FIG. 16, the question response execution unit 910 of the revised question generation unit 900 creates a machine reading modeling vector sequence M ∈ R^{d×T} from the matching vector sequence G.
  • the machine reading modeling vector sequence M is created by, for example, applying a technique using an RNN to the matching vector sequence G, as in the document encoding layer and the question encoding layer.
  • at this time, the question response execution unit 910 also generates a hidden state vector h_M0 in the same manner as in the question encoding layer. This hidden state vector h_M0 is used as an initial state of the decode layer.
  • the machine reading modeling vector sequence M corresponds to the matching matrix M in the first embodiment; a sketch of this matching and modeling processing follows below.
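Continuing the encoder sketch, the following illustrates one possible document/question matching layer and machine reading modeling layer (steps S303-2 and S303-3). The simple dot-product attention is an illustrative stand-in for BiDAF or QANet, which are named above only as examples; all shapes are assumptions.

```python
import torch
import torch.nn as nn

class MatchModel(nn.Module):
    def __init__(self, d):
        super().__init__()
        self.rnn = nn.LSTM(2 * d, d // 2, bidirectional=True, batch_first=True)

    def forward(self, H, U):
        # collate every document word with every question word
        S = torch.bmm(H, U.transpose(1, 2))         # similarity, (batch, T, J)
        A = torch.softmax(S, dim=2)                 # attention over question words
        U_tilde = torch.bmm(A, U)                   # question-aware document rep.
        G = torch.cat([H, U_tilde], dim=-1)         # matching vector sequence G
        M, (h_n, _) = self.rnn(G)                   # machine reading modeling seq. M
        h_M0 = torch.cat([h_n[0], h_n[1]], dim=-1)  # hidden state for the decoder
        return G, M, h_M0

match = MatchModel(d)
G, M, h_M0 = match(H, U)  # M: (batch, T, d), h_M0: (batch, d)
```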
  • Step S304 Next, as the processing of the machine reading output layer of the revised question generation model shown in FIG. 16, the question answer execution unit 910 of the revised question generation unit 900 generates answer candidates using the machine reading modeling vector sequence M.
  • the answer candidates are generated by extracting, from the related document X, the start point and the end point of the description serving as an answer candidate.
  • for example, the machine reading modeling vector sequence M is linearly transformed with a weight W_0 ∈ R^{1×d} to create a start point vector O_start ∈ R^T, and the softmax function is applied over the sequence length T to convert the start point vector O_start into a probability distribution P_start. Then, using this probability distribution P_start, the t_start-th (0 ≤ t_start < T) element with the highest probability is selected, and the corresponding word in the related document X is used as the start point word.
  • next, the start point vector O_start and the machine reading modeling vector sequence M are input to an RNN to create a new machine reading modeling vector sequence M′.
  • a probability distribution P_end is then obtained from the new machine reading modeling vector sequence M′ in the same way as for the start point, and the t_end-th (t_start ≤ t_end < T) element with the highest probability is selected using this probability distribution P_end, giving the end point word.
  • alternatively, P(i, k) = P_start(i) × P_end(k) may first be computed using P_start and P_end, where 0 ≤ i < T and i ≤ k < T. Then, the combinations of i and k having the top N values of P(i, k) may be used as the start points and end points. In this way, the sections corresponding to the top N combinations of i and k are extracted as N answers (answer candidates).
  • the question answer execution unit 910 may output the start point and the end point of each of the N answers (answer candidates), may output the N answers (answer candidates) themselves, or may output the start point word and the end point word of each of the N answers (answer candidates).
  • the subsequent step S305 is executed for each of the N start point and end point sets.
  • in the following, a certain set of the start point t_start and the end point t_end is referred to as "answer candidate A", and step S305 is described for this answer candidate A. A sketch of the top-N span extraction in step S304 follows below.
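Continuing the sketches above, the following illustrates the machine reading output layer (step S304): an (untrained) linear scorer gives P_start, an RNN over the concatenation of M and O_start gives a new sequence M′ and P_end, and the top-N spans come from P(i, k) = P_start(i) × P_end(k). The concrete layer shapes are assumptions.

```python
import torch
import torch.nn as nn

N = 5
w_start = nn.Linear(128, 1)                       # W_0-style start point scorer
end_rnn = nn.LSTM(128 + 1, 128, batch_first=True)
w_end = nn.Linear(128, 1)

O_start = w_start(M).squeeze(-1)                  # start point vector, (batch, T)
P_start = torch.softmax(O_start, dim=-1)          # probability distribution P_start
M_prime, _ = end_rnn(torch.cat([M, O_start.unsqueeze(-1)], dim=-1))
P_end = torch.softmax(w_end(M_prime).squeeze(-1), dim=-1)

P = P_start.unsqueeze(2) * P_end.unsqueeze(1)     # P(i, k), (batch, T, T)
P = torch.triu(P)                                 # keep only spans with i <= k
T = M.size(1)
top_p, top_idx = P.view(P.size(0), -1).topk(N, dim=-1)
starts = torch.div(top_idx, T, rounding_mode="floor")  # N start points
ends = top_idx % T                                     # N end points
```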
  • Step S305 The revised question generation unit 900 generates a revised question corresponding to the answer candidate A through the following steps S305-1 to S305-3.
  • Step S305-1 The question generation unit 920 of the revised question generation unit 900 inputs the answer candidate A (that is, the start point t_start and the end point t_end) and, as the processing of the answer vector generation layer of the revised question generation model shown in FIG. 16, creates an answer vector a ∈ R^{d_a} corresponding to the answer candidate A.
  • here, d_a represents the number of dimensions of the answer vector.
  • any method can be adopted as long as the answer vector a can be created using the answer candidate A (that is, the start point t_start and the end point t_end) as input.
  • for example, the description of the section from the start point t_start to the end point t_end may be converted once into a word sequence, this word sequence may be converted into a vector by the document encoding layer, and the result may be used as the answer vector a.
  • alternatively, the vector sequence of the section H(t_start, t_end) ∈ R^{d×l} determined by the start point t_start and the end point t_end (where l is the sequence length of the answer candidate A) may be extracted from the document vector sequence, and the answer vector a may be created by applying an RNN to the extracted vector sequence or by computing its centroid (center-of-gravity) vector.
  • furthermore, instead of using the answer (answer candidate A) extracted as-is from the related document X, a method that generates a sentence serving as the answer (answer candidate A) with reference to the description in the related document X may be used; in this case, the generated sentence (the sentence serving as the answer) is used as the input, and the answer vector a is created as the processing of the answer vector generation layer. A sketch of a simple answer vector computation follows below.
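Continuing the sketches above, one simple realization of the answer vector generation layer (step S305-1) is to take the centroid of the document vectors inside the answer span, which is one of the options mentioned above; applying an RNN to the span would be equally valid.

```python
def answer_vector(H, t_start, t_end):
    span = H[:, t_start:t_end + 1, :]   # section H(t_start, t_end) of length l
    return span.mean(dim=1)             # centroid vector a, (batch, d_a)

a = answer_vector(H, int(starts[0, 0]), int(ends[0, 0]))
```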
  • Step S305-2 As the processing of the decode layer of the revised question generation model shown in FIG. 16, the question generation unit 920 of the revised question generation unit 900 creates, by an RNN using the answer vector a, the vectors for outputting the words that constitute the revised question.
  • at this time, the state vectors h_q0 and h_M0 output from the question response execution unit 910 are used as the initial values (initial states) of the state vectors.
  • for example, the RNN may have two layers, with the initial state of the first-layer RNN set to h_q0 and the initial state of the second-layer RNN set to h_M0.
  • alternatively, the average vector of the two state vectors h_q0 and h_M0 may be used as the initial state, or only one of the two state vectors h_q0 and h_M0 may be used as the initial state.
  • further, the state vector h_x0 of the document encoding layer may also be used; that is, the initial state of the decode layer may be determined using the state vectors h_q0 and h_x0.
  • the input to the decode layer at each step is the word embedding vector of the previously generated word, where d_e represents the number of dimensions of the word embedding vectors. More precisely, a vector in which the answer vector is concatenated with the word embedding vector is used as the input.
  • any technique used in the decode layer of an Encoder-Decoder model, such as an attention mechanism or copying, may be applied to the decode layer of the revised question generation model shown in FIG. 16.
  • Step S305-3 The question generation unit 920 of the revised question generation unit 900 generates the s-th word y_s constituting the revised question from the output of the decode layer, in the same manner as an Encoder-Decoder model. That is, for example, after linearly transforming the output of the decode layer, the word generation probabilities over the related document X are computed by the softmax function. Then, for example, the word whose generation probability is the maximum is generated as the s-th word y_s. By repeating this until the word y_s = <EOS> is generated, the words constituting the revised question corresponding to the answer candidate A are generated. Note that y_0 is assumed to be <BOS>. A sketch of this decoding procedure follows below.
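Continuing the sketches above, the following illustrates the decode layer and the revised question word generation layer (steps S305-2 and S305-3): a 2-layer RNN whose layers are initialized with h_q0 and h_M0, fed at each step the previous word's embedding concatenated with the answer vector a, and generating words greedily until <EOS>. The vocabulary size and the token IDs for <BOS> and <EOS> are assumptions.

```python
import torch
import torch.nn as nn

class Decoder(nn.Module):
    def __init__(self, vocab_size, d, d_e, d_a):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_e)
        self.rnn = nn.LSTM(d_e + d_a, d, num_layers=2, batch_first=True)
        self.out = nn.Linear(d, vocab_size)

    def generate(self, h_q0, h_M0, a, bos_id=1, eos_id=2, max_len=30):
        h = torch.stack([h_q0, h_M0])   # layer-1 init h_q0, layer-2 init h_M0
        c = torch.zeros_like(h)
        y = torch.tensor([[bos_id]])    # y_0 = <BOS>
        words = []
        for _ in range(max_len):
            # previous word embedding concatenated with the answer vector a
            inp = torch.cat([self.embed(y), a.unsqueeze(1)], dim=-1)
            o, (h, c) = self.rnn(inp, (h, c))
            y = self.out(o[:, -1]).argmax(dim=-1, keepdim=True)  # word y_s
            if y.item() == eos_id:      # stop once <EOS> is generated
                break
            words.append(y.item())
        return words

dec = Decoder(vocab_size=10000, d=128, d_e=128, d_a=128)
rq_word_ids = dec.generate(h_q0, h_M0, a)   # words of one revised question RQ
```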
  • Step S306 Finally, the output unit 1000 outputs N answers (answer candidates) and N revised questions RQ corresponding to each of these N answers.
  • FIG. 17 is a flowchart showing an example of the learning process of the revised question generation model in the second embodiment of the present invention.
  • a machine-reading corpus is used to learn the revised question generation model.
  • the machine-reading corpus includes a plurality of sets of “question”, “document to be questioned”, and “answer range (or character string of the answer range) in the question target document”.
  • the "document to be questioned" included in the corpus is used as the related document X.
  • the "question" included in the corpus is used as the input question Q.
  • the "answer range in the question target document (or the character string of the answer range)" included in the corpus is used as-is as the correct answer A_true for the input question Q.
  • the input question Q and the correct answer A true of the answer to the input question Q are used as learning data for machine reading processing in the question response execution unit 910.
  • the correct answer A true of the answer is represented by a set of a start point and an end point.
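To make the corpus format concrete, the following is a hedged illustration of a single machine-reading training example as described above; the field names and content are invented for illustration, and the answer span is the (start point, end point) pair used as the correct answer A_true.

```python
# One illustrative machine-reading training example (all content hypothetical).
example = {
    "question": "What is the cancellation fee?",              # input question Q
    "document": "... The cancellation fee is 1,000 yen ...",  # related document X
    "answer_span": (4, 9),   # correct answer A_true as (start point, end point)
}
```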
  • Step S401 The text processing unit 800 inputs a plurality of learning data (that is, a learning data set) and related documents.
  • Step S402 The text processing unit 800 converts a plurality of input questions and related documents respectively included in the plurality of input learning data into a plurality of input questions Q and a related document X that are word sequences.
  • the input questions and related documents are often already expressed as word sequences, in which case this step S402 need not be performed.
  • the learning data set is divided into a predetermined number of mini-batches, and the parameters of the revised question generation model are updated for each mini-batch.
  • steps S403 to S406 are repeatedly executed using each learning data included in the mini-batch.
  • steps S407 to S409 are executed after steps S403 to S406 have been executed for all the learning data included in the mini-batch.
  • Step S403 The missing question creation unit 300 creates a missing question Q in which a part of the input question Q, which is the learning data, is deleted. Since the input question Q is the correct answer data for the missing question Q, the input question Q is hereinafter referred to as the correct question Q_true.
  • the missing question Q may be created statistically using a trained Encoder-Decoder model, or may be created by deleting clauses and phrases using syntactic information such as sentence dependencies.
  • alternatively, the missing question Q may be created using a sentence compression technique, which is one of the tasks of natural language processing. A sketch of a simple deletion-based approach follows below.
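A minimal sketch of missing-question creation (step S403) follows. Dropping a random contiguous chunk of words is the simplest possible stand-in for the Encoder-Decoder, syntax-based, and sentence-compression methods mentioned above, and is shown only to make the data flow concrete.

```python
import random

def make_missing_question(words, max_drop=3):
    n = len(words)
    drop = random.randint(1, min(max_drop, n - 1))   # how many words to delete
    start = random.randrange(n - drop + 1)           # where the deletion begins
    return words[:start] + words[start + drop:]

q_true = ["what", "is", "the", "cancellation", "fee", "for", "premium", "plans"]
q_missing = make_missing_question(q_true)   # e.g. drops "for premium plans"
```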
  • Step S404 The question response execution unit 910 of the revised question generation model learning unit 1100 generates the matching information. This step S404 is the same as step S303, with the input question Q in step S303 of FIG. 15 replaced by the missing question Q, and a description thereof is therefore omitted.
  • Step S405 The question response execution unit 910 of the revised question generation model learning unit 1100 generates answer candidates for the missing question Q.
  • this step S405 is the same as step S304, with the input question Q in step S304 of FIG. 15 replaced by the missing question Q, and a description thereof is therefore omitted.
  • Step S406 The question generation unit 920 of the revised question generation model learning unit 1100 generates a revised question RQ corresponding to each answer candidate of the missing question Q. This step S406 is the same as step S305, with the input question Q in step S305 of FIG. 15 replaced by the missing question Q.
  • Step S407 The parameter update unit 1110 of the revised question generation model learning unit 1100 calculates the first error between the revised question RQ generated using each piece of learning data included in the mini-batch and the input question Q (that is, the correct question Q_true) included in that learning data.
  • the parameter update unit 1110 also calculates the second error between the answer A to the input question Q included in each piece of learning data in the mini-batch and the correct answer A_true included in that learning data.
  • the answer A is obtained as the answer in question answering by inputting the input question Q (and the related document X) to the question answer execution unit 910.
  • cross-entropy may be used as the error function used for calculating the first error and the second error.
  • the error function is appropriately determined according to the revised question generation model.
  • Step S408 The parameter update unit 1110 of the revised question generation model learning unit 1100 updates the parameters of the revised question generation model using the first error and the second error calculated in step S407. That is, for example, the parameter update unit 1110 calculates the partial differential values of the error function by the error backpropagation method (backpropagation) using the first error and the second error calculated in step S407, and updates the parameters of the revised question generation model. Thereby, the revised question generation model is trained.
  • in the second embodiment, correct answer data is available for both the machine reading (that is, the question answer execution unit 910) and the revised question generation (that is, the question generation unit 920). That is, an error function is defined for the correct question Q_true with respect to the revised question RQ and for the correct answer A_true with respect to the answer to the correct question Q_true; the sum of these error function values (that is, the first error and the second error) is treated as the error of the entire neural network, and the parameters are updated so that this error decreases (that is, the parameters are updated by multitask learning). A sketch follows below.
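The following sketch illustrates the multitask parameter update of steps S407 and S408: cross-entropy for the revised question words (first error) plus cross-entropy for the start and end points (second error), backpropagated together. All variable names and the choice of Adam are assumptions.

```python
import torch
import torch.nn.functional as F

def multitask_loss(word_logits, q_true_ids, start_logits, end_logits, a_true):
    # first error: cross-entropy over the generated revised question words
    loss_q = F.cross_entropy(word_logits.view(-1, word_logits.size(-1)),
                             q_true_ids.view(-1))
    # second error: cross-entropy over the answer start and end points
    t_start, t_end = a_true
    loss_a = F.cross_entropy(start_logits, t_start) + \
             F.cross_entropy(end_logits, t_end)
    return loss_q + loss_a   # treated as the error of the entire network

# typical update per mini-batch:
# optimizer = torch.optim.Adam(model.parameters())
# loss = multitask_loss(...); optimizer.zero_grad(); loss.backward(); optimizer.step()
```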
  • as described above, the question generation device 100 in the second embodiment generates the revised question RQ for the input question Q using the revised question generation model realized by, for example, a neural network.
  • at this time, question answering is performed, and a revised question RQ corresponding to each answer candidate obtained by this question answering is generated.
  • since a revised question RQ is generated for each answer candidate, using these revised questions RQ in the question answering task makes it possible to achieve high answer accuracy.

Abstract

This invention is characterized by comprising a generation means for receiving input of question text and a related document including an answer to the question text and using a pre-learned machine learning model to generate revised question text wherein a potentially missing portion of the question text is supplemented with words which are included in a prescribed vocabulary set.

Description

Question generation device, question generation method, and program
The present invention relates to a question generation device, a question generation method, and a program.
In recent years, question answering technology, in which a computer automatically answers questions input by a user in natural language on devices such as smartphones and smart speakers, has attracted attention. As such a question answering technology, a machine reading comprehension type question answering technology is known that, for a question input in natural language, extracts the part serving as the answer from within a document likewise written in natural language (see, for example, Non-Patent Document 1).
Machine reading comprehension type question answering technology uses a neural network to collate a question with the answer part described in a document such as a manual, and is known to be able to achieve answer accuracy equal to or better than that of humans.
Here, in order to achieve high answer accuracy with machine reading comprehension type question answering technology, the question content must be clear and the question must contain, without shortage, the information necessary for the answer. However, in actual services using machine reading comprehension type question answering technology, the question content may be ambiguous or the question sentence may be too short. In such cases, the answer to the question may not be uniquely determined, or the answer content may be wrong, and high answer accuracy may not be achieved.
An embodiment of the present invention has been made in view of the above points, and aims to achieve high answer accuracy for questions.
To achieve the above object, a question generation device in an embodiment of the present invention is characterized by having generation means that receives, as input, a question sentence and a related document containing the answer to the question sentence, and uses a machine learning model trained in advance to generate a revised question sentence in which a potentially missing part of the question sentence is supplemented with words included in a predetermined vocabulary set.
High answer accuracy for questions can thereby be realized.
FIG. 1 is a diagram showing an example of the functional configuration of the question generation device at the time of revised question generation in the first embodiment of the present invention.
FIG. 2 is a diagram showing an example of the functional configuration of the question generation device at the time of learning in the first embodiment of the present invention.
FIG. 3 is a diagram showing an example of the hardware configuration of the question generation device in the first embodiment of the present invention.
FIG. 4 is a flowchart showing an example of the revised question generation process in the first embodiment of the present invention.
FIG. 5 is a diagram showing an example in which the revised question generation model in the first embodiment of the present invention is realized by a neural network.
FIG. 6 is a flowchart showing an example of the learning process of the revised question generation model in the first embodiment of the present invention.
FIG. 7 is a diagram showing a modification (part 1) in which the revised question generation model in the first embodiment of the present invention is realized by a neural network.
FIG. 8 is a diagram showing a modification (part 2) in which the revised question generation model in the first embodiment of the present invention is realized by a neural network.
FIG. 9 is a diagram showing a modification of the functional configuration of the question generation device at the time of revised question generation in the first embodiment of the present invention.
FIG. 10 is a diagram showing an application example (part 1) to a chatbot.
FIG. 11 is a diagram showing an application example (part 2) to a chatbot.
FIG. 12 is a diagram for explaining an example of the revised question in the second embodiment of the present invention.
FIG. 13 is a diagram showing an example of the functional configuration of the question generation device at the time of revised question generation in the second embodiment of the present invention.
FIG. 14 is a diagram showing an example of the functional configuration of the question generation device at the time of learning in the second embodiment of the present invention.
FIG. 15 is a flowchart showing an example of the revised question generation process in the second embodiment of the present invention.
FIG. 16 is a diagram showing an example in which the revised question generation model in the second embodiment of the present invention is realized by a neural network.
FIG. 17 is a flowchart showing an example of the learning process of the revised question generation model in the second embodiment of the present invention.
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In the following, a question generation device 100 is described that generates a revised question (RQ: Revised Question) for an input question (hereinafter also simply referred to as an "input question"), for the purpose of improving the answer accuracy of question answering that uses machine reading comprehension type question answering technology. A revised question is a question sentence with more specific content that reinforces the question content of the input question. That is, a revised question is a question whose content is clear and which contains, without shortage, the information necessary for the answer.
By generating a revised question of a question before the task of generating and responding with an answer to the question (the question answering task), and then performing the question answering task using the revised question, the answer accuracy of question answering can be improved.
Each embodiment described below is merely an example, and the forms to which the present invention can be applied are not limited to the following embodiments. The technology according to each embodiment of the present invention can be used, for example, for a service that provides answers to questions input by a user in natural language, but the usage target is not limited to this, and the technology can be used for various other purposes.
[First Embodiment]
First, the first embodiment of the present invention will be described.
(Overview)
In the first embodiment of the present invention, when an input question and a document related to the input question (hereinafter also referred to as a "related document") are given, the question generation device 100 generates a revised question for the input question using a machine learning model that generates revised questions (hereinafter also referred to as a "revised question generation model").
More specifically, in the first embodiment of the present invention, the revised question generation model matches the input question against the related document and supplements potentially missing parts (character strings such as words and phrases) of the input question, thereby generating the revised question. Thus, for example, when an input question whose content is ambiguous or whose question sentence is too short is given, a revised question that is more detailed and specific than the input question is generated. Moreover, since the revised question is generated using the related document, it is possible, for example, to generate revised questions that a system performing the question answering task can answer (in other words, revised questions that such a system cannot answer can be prevented from being generated).
In the first embodiment of the present invention, the revised question generation model is trained using an input question used as correct answer data, a question in which a part of the input question is deleted (also referred to as a "missing question"), and a related document. In this training, the parameters of the revised question generation model are updated so that the natural sentence obtained using the missing question and the related document approaches the input question, which is the correct answer data. A missing question is a question sentence in which some of the necessary information (character strings such as words and phrases) is missing as a question sentence about the input related document. A natural sentence is a sentence written in natural language.
Here, in the first embodiment of the present invention, the input question is a sentence written in natural language (that is, a natural sentence) and, for example by performing morphological analysis or the like, is represented as a set of J word tokens Q = {q_0, q_1, ..., q_{J-1}}. The sentence serving as the input question is not limited to a natural sentence; for example, it may simply be a sentence listing keywords, or a sentence obtained as a speech recognition result.
The related document is a sentence composed of, for example, several hundred words, and is represented as a set of T word tokens X = {x_0, x_1, ..., x_{T-1}}. Here, the related document is assumed to contain information serving as the answer to the input question. An example of a related document is a manual in which the answer to the input question is described. In the first embodiment of the present invention, the related document is also referred to as a passage.
The revised question is a sentence in which the input question is made more detailed and specific, and is represented as a set of S word tokens RQ = {y_0, y_1, ..., y_{S-1}}.
(Functional configuration of the question generation device 100)
First, the functional configuration of the question generation device 100 at the time of revised question generation in the first embodiment of the present invention will be described with reference to FIG. 1. FIG. 1 is a diagram showing an example of the functional configuration of the question generation device 100 at the time of revised question generation in the first embodiment of the present invention.
As shown in FIG. 1, the question generation device 100 at the time of revised question generation in the first embodiment of the present invention has a revised question generation unit 200. The revised question generation unit 200 is realized by a trained revised question generation model (that is, a revised question generation model using the parameters updated by a revised question generation model learning unit 400 described later).
The revised question generation unit 200 receives a question (input question) and a related document as input, and generates and outputs a revised question. More specifically, the revised question generation unit 200 regards the input question as a missing question and restores the question sentence before the deletion using the related document, thereby generating the revised question.
Here, the revised question generation unit 200 includes a matching unit 210 and a question restoration unit 220. The matching unit 210 generates matching information between the input question and the related document. The matching information is information representing the correspondence between each word included in the input question and each word included in the related document. The question restoration unit 220 uses the matching information generated by the matching unit 210, the input question, and the related document to generate (restore) a natural sentence such that the input question becomes the question sentence before the deletion. The natural sentence generated by the question restoration unit 220 becomes the revised question.
Next, the functional configuration of the question generation device 100 at the time of learning in the first embodiment of the present invention will be described with reference to FIG. 2. FIG. 2 is a diagram showing an example of the functional configuration of the question generation device 100 at the time of learning in the first embodiment of the present invention.
As shown in FIG. 2, the question generation device 100 at the time of learning in the first embodiment of the present invention has a missing question creation unit 300 and the revised question generation model learning unit 400.
The missing question creation unit 300 receives a question (input question) as input and creates a missing question by deleting a part of the input question.
The revised question generation model learning unit 400 trains the revised question generation model using the missing question created by the missing question creation unit 300, the input question, and the related document. The revised question generation model learning unit 400 then outputs the parameters of the trained revised question generation model.
Here, the revised question generation model learning unit 400 includes the matching unit 210, the question restoration unit 220, and a parameter update unit 410. The matching unit 210 and the question restoration unit 220 are as described above. The parameter update unit 410 calculates the error between the natural sentence (revised question) generated by the question restoration unit 220 and the input question, and then uses this error to update the parameters of the revised question generation model (revised question generation model parameters that have not yet been trained) by an arbitrary optimization method. The revised question generation model is trained through the parameter updates performed by the parameter update unit 410.
In the first embodiment of the present invention, the revised question generation model is a machine learning model realized by a neural network. However, all or part of the revised question generation model may be realized by a machine learning model other than a neural network. For example, at least one of the matching unit 210 and the question restoration unit 220 may be realized by a machine learning model other than a neural network.
(Hardware configuration of the question generation device 100)
Next, the hardware configuration of the question generation device 100 in the first embodiment of the present invention will be described with reference to FIG. 3. FIG. 3 is a diagram showing an example of the hardware configuration of the question generation device 100 in the first embodiment of the present invention.
As shown in FIG. 3, the question generation device 100 in the first embodiment of the present invention has an input device 501, a display device 502, an external I/F 503, a RAM (Random Access Memory) 504, a ROM (Read Only Memory) 505, an arithmetic device 506, a communication I/F 507, and an auxiliary storage device 508. These pieces of hardware are communicably connected to one another via a bus B.
The input device 501 is, for example, a keyboard, a mouse, or a touch panel, and is used by the user to input various operations. The display device 502 is, for example, a display, and displays the processing results of the question generation device 100 (for example, revised questions). The question generation device 100 need not have at least one of the input device 501 and the display device 502.
The external I/F 503 is an interface with external devices, which include a recording medium 503a and the like. The question generation device 100 can read from and write to the recording medium 503a and the like via the external I/F 503. The recording medium 503a may store one or more programs that realize the functional units of the question generation device 100.
Examples of the recording medium 503a include a flexible disk, a CD (Compact Disc), a DVD (Digital Versatile Disk), an SD memory card (Secure Digital memory card), and a USB (Universal Serial Bus) memory card.
The RAM 504 is a volatile semiconductor memory that temporarily holds programs and data. The ROM 505 is a nonvolatile semiconductor memory that can hold programs and data even when the power is turned off. The ROM 505 stores, for example, settings related to the OS (Operating System) and settings related to the communication network.
The arithmetic device 506 is, for example, a CPU (Central Processing Unit) or a GPU (Graphics Processing Unit), and executes processing by reading programs and data from the ROM 505, the auxiliary storage device 508, or the like onto the RAM 504. Each functional unit of the question generation device 100 is realized, for example, by processing that one or more programs stored in the auxiliary storage device 508 cause the arithmetic device 506 to execute. The question generation device 100 may have both a CPU and a GPU as the arithmetic device 506, or may have only one of them.
The communication I/F 507 is an interface for connecting the question generation device 100 to a communication network. One or more programs that realize the functional units of the question generation device 100 may be acquired (downloaded) from a predetermined server device or the like via the communication I/F 507.
The auxiliary storage device 508 is, for example, an HDD or an SSD (Solid State Drive), and is a nonvolatile storage device that stores programs and data. The programs and data stored in the auxiliary storage device 508 include, for example, the OS and one or more programs that realize the functional units of the question generation device 100.
With the hardware configuration shown in FIG. 3, the question generation device 100 in the first embodiment of the present invention can realize the various processes described later. In the example shown in FIG. 3, the question generation device 100 in the first embodiment of the present invention is realized by one device (computer), but the present invention is not limited to this. The question generation device 100 in the first embodiment of the present invention may be realized by a plurality of devices (computers), or by a device (computer) having a plurality of arithmetic devices 506 and a plurality of memories (RAM 504, ROM 505, auxiliary storage device 508, and the like).
(Revised question generation process)
Next, the revised question generation process in the first embodiment of the present invention will be described with reference to FIG. 4. FIG. 4 is a flowchart showing an example of the revised question generation process in the first embodiment of the present invention. In the revised question generation process, it is assumed that the revised question generation model realizing the revised question generation unit 200 has already been trained.
Here, FIG. 5 shows an example of the revised question generation model realizing the revised question generation unit 200 in the first embodiment of the present invention. As shown in FIG. 5, in the first embodiment of the present invention, the revised question generation model is a neural network composed of three layers: an Encode Layer, a Matching Layer, and a Decode Layer. Of these layers, the matching unit 210 is realized by the Encode Layer and the Matching Layer, and the question restoration unit 220 is realized by the Decode Layer. In the following description of the revised question generation process, the detailed processing of each layer is also explained with reference to the revised question generation model shown in FIG. 5.
The Encode Layer and the Decode Layer are layers based on Seq2Seq, a language generation model. The Matching Layer, on the other hand, is a layer based on the Attention Flow Layer and the Modeling Layer used in machine reading comprehension tasks. For details of Seq2Seq, see, for example, References 1 and 2 below. For details of the reading comprehension task, see, for example, Reference 3 below.
[Reference 1]
I. Sutskever, O. Vinyals, and Q. V. Le. Sequence to sequence learning with neural networks. Proc. of the 27th International Conference on Neural Information Processing Systems (NIPS2014), pp. 3104-3112, 2014.
[Reference 2]
O. Vinyals and Q. V. Le. A neural conversational model. Proc. of the ICML Deep Learning Workshop 2015, 2015.
[Reference 3]
M. J. Seo, A. Kembhavi, A. Farhadi, and H. Hajishirzi. Bidirectional attention flow for machine comprehension. Proc. of the 5th International Conference on Learning Representations (ICLR2017), 2017.
Step S101: The revised question generation unit 200 inputs a question (input question) Q and a related document X.
Step S102: The matching unit 210 of the revised question generation unit 200 generates, through the following steps S102-1 to S102-4, the hidden state vector h_d0 used as the initial state of the Decoder and the matching matrix M, a matching model used in machine reading comprehension tasks, as the matching information.
Step S102-1: First, as the Word Embedding processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 converts the related document X and the input question Q into d-dimensional word vector sequences. That is, the matching unit 210 creates word vector sequences by vectorizing each word token constituting the related document X and the input question Q.
Denoting the word vector sequence of the related document X also by X, the word vector sequence X of the related document X is expressed as

$$X \in \mathbb{R}^{d \times T}.$$

Similarly, denoting the word vector sequence of the input question Q also by Q, the word vector sequence Q of the input question Q is expressed as

$$Q \in \mathbb{R}^{d \times J}.$$

In the first embodiment of the present invention, the word vector sequences X and Q are created from the input question Q and the related document X that are input; however, the present invention is not limited to this, and, for example, the word vector sequences X and Q may themselves be input in step S101 above.
Step S102-2: Next, as the Passage Context processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 encodes the word vector sequence X with an RNN (Recurrent Neural Network) to obtain the context matrix H ∈ R^{2d×T} of the related document X. The column vector composed of the elements of the t-th column of the context matrix H is denoted by the context vector H_t.
Similarly, as the Question Context processing in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 encodes the word vector sequence Q with an RNN to obtain the context matrix U ∈ R^{2d×J} of the input question Q. The column vector composed of the elements of the j-th column of the context matrix U is denoted by the context vector U_j.
Here, the RNN used for the Passage Context and Question Context processing may be, for example, a bi-RNN, an LSTM (Long Short Term Memory), or a bi-LSTM. However, the RNN used for the Passage Context processing and the RNN used for the Question Context processing share the same parameters.
Step S102-3: Next, as the processing of the Matching Layer of the revised question generation model shown in FIG. 5, the matching unit 210 generates the hidden state vector h_d0 used as the initial state of the Decoder, as follows.
First, using an attention mechanism (attention), the matching unit 210 computes the attention vector $\hat{H}_U \in \mathbb{R}^{2d}$ with the related document X from the context vector U_{J-1} and the context matrix H by the following equations (1) and (2):

$$a_t = \mathrm{softmax}_t\!\left(U_{J-1}^{\tau} H\right) \quad (1)$$

$$\hat{H}_U = \sum_{t=0}^{T-1} a_t H_t \quad (2)$$

For convenience of notation, a letter X with the accent "^" above it is written as $\hat{X}$. Here, τ denotes transposition, and softmax_t denotes the t-th output of the softmax function. Note that the subscript "U" in $\hat{H}_U$ of equation (2) is not an index.
Similarly, using the attention mechanism, the matching unit 210 computes the attention vector $\hat{U}_U \in \mathbb{R}^{2d}$ with the input question Q from the context vector U_{J-1} and the context matrix U by the following equations (3) and (4):

$$b_j = \mathrm{softmax}_j\!\left(U_{J-1}^{\tau} U\right) \quad (3)$$

$$\hat{U}_U = \sum_{j=0}^{J-1} b_j U_j \quad (4)$$

Here, softmax_j denotes the j-th output of the softmax function. Note that the subscript "U" in $\hat{U}_U$ of equation (4) is not an index.
This takes attention with the context of the input question Q itself, in order to take important words in the input question Q into account.
Then, using the two attention vectors $\hat{H}_U$ and $\hat{U}_U$ computed by equations (2) and (4), the matching unit 210 computes the hidden state vector h_d0 used as the initial state of the Decoder by the following equation (5):

$$h_{d0} = f\!\left(W_m^{\tau}\,[\hat{H}_U; \hat{U}_U] + b_m\right) \quad (5)$$

Here, $W_m \in \mathbb{R}^{4d \times 2d}$ and $b_m \in \mathbb{R}^{2d}$ are parameters, f is an activation function (for example, Leaky ReLU), and [;] denotes concatenation.
Step S102-4: Next, as the processing of the Matching Layer of the revised question generation model shown in FIG. 5, the matching unit 210 generates the matching matrix M as follows.
First, the matching unit 210 inputs the context matrix H, whose sequence length is T, and the context matrix U, whose sequence length is J, into the Attention layer. Then, as the processing of the Attention layer, the matching unit 210 computes the word similarity matrix S between the related document X and the input question Q.
The similarity between the t-th word of the related document X and the j-th word of the input question Q is defined as

$$S_{tj} = w_s^{\tau}\,[H_t; U_j; H_t \circ U_j] \quad (6)$$

where $w_s^{\tau} \in \mathbb{R}^{6d}$ is a parameter and ∘ denotes the element-wise product.
Thus, the similarity matrix $S = (S_{tj}) \in \mathbb{R}^{T \times J}$ is created.
Next, using the similarity matrix S, the matching unit 210 computes attention in two directions: attention from the related document X to the input question Q, and attention from the input question Q to the related document X.
In the attention from the related document X to the input question Q, the matching unit 210 computes, for each word of the related document X, an attention vector weighted by the words of the input question Q. That is, the matching unit 210 computes the attention vector $\tilde{U}_t$ corresponding to the t-th word of the related document X by the following equations (7) and (8):

$$\alpha_t = \mathrm{softmax}(S_t) \quad (7)$$

$$\tilde{U}_t = \sum_{j=0}^{J-1} \alpha_{tj}\, U_j \quad (8)$$

Here, $S_t$ denotes the t-th row of the similarity matrix S.
In the attention from the input question Q to the related document X, the matching unit 210 computes an attention vector weighted toward the document words that are strongly related to some word of the input question Q, and then creates a matrix in which this attention vector is arranged for the sequence length T of the related document X. That is, the matching unit 210 first computes the attention vector
Figure JPOXMLDOC01-appb-M000010
by equations (9) and (10) below.
Figure JPOXMLDOC01-appb-M000011
Here, max_j(S) is the T-dimensional vector whose t-th element (t = 1, ..., T-1) is the maximum element max_j S_tj of the t-th row vector S_t of S (note that the vector γ having each γ_t as an element is likewise a T-dimensional vector).
Subsequently, the matching unit 210 creates the matrix
Figure JPOXMLDOC01-appb-M000012
in which T copies of the attention vector computed by equation (10) above are arranged.
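Continuing the sketch, the two attention directions may be realized as follows; equations (7) to (10) are only available as images, so the softmax placements shown are assumptions modeled on standard bidirectional attention:

    # X -> Q attention (assumed eqs. (7)-(8)): for each document word t,
    # average the question vectors with weights softmax(S_t).
    a = F.softmax(S, dim=1)                    # (T, J)
    U_tilde = a @ U.t()                        # (T, 2d)

    # Q -> X attention (assumed eqs. (9)-(10)): gamma = max_j(S), then weight
    # the document words by softmax(gamma) and tile the result T times.
    gamma = S.max(dim=1).values                # (T,)
    b = F.softmax(gamma, dim=0)                # (T,)
    h_tilde = H @ b                            # (2d,)
    H_tilde = h_tilde.unsqueeze(0).expand(T, 2 * d)   # tiled for length T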
Thereafter, using the attention vector H^_H ∈ R^{2d×T} obtained by taking the self-attention between the context vector H_{T-1} and the context matrix H, the matching unit 210 computes the attention matrix G by equation (11) below.
Figure JPOXMLDOC01-appb-M000013
For details of self-attention, see, for example, Reference 4 below.
[Reference 4]
W. Wang, N. Yang, F. Wei, B. Chang, and M. Zhou. Gated self-matching networks for reading comprehension and question answering. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017), pp. 189-198, 2017.
Note that the matching unit 210 may compute the attention matrix G without using the attention vector H^_H ∈ R^{2d} (that is, without concatenating the attention vector H^_H in equation (11) above). In this case, the attention matrix G satisfies G ∈ R^{8d×T}.
Then, as the processing of the Matching Model in the Encode Layer of the revised question generation model shown in FIG. 5, the matching unit 210 inputs the attention matrix G computed by equation (11) above into an RNN to obtain the matching matrix M ∈ R^{2d×T}.
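Equation (11) is also an image; the statement that dropping H^_H reduces G to R^{8d×T} suggests a BiDAF-style concatenation of five 2d-dimensional blocks. The sketch below assumes that construction, uses a simplified self-attention stand-in for H^_H (the construction in Reference 4 is richer), and encodes G with a bidirectional LSTM to obtain M:

    # Simplified stand-in for H^_H: attend H against its final context vector.
    w = F.softmax(H.t() @ H[:, -1], dim=0)            # (T,)
    H_hat_H = (H @ w).unsqueeze(1).expand(2 * d, T)   # broadcast to (2d, T)

    # Attention matrix G (assumed concatenation; dropping H_hat_H gives 8d×T).
    G = torch.cat([H, U_tilde.t(), H * U_tilde.t(),
                   H * H_tilde.t(), H_hat_H], dim=0)  # (10d, T)

    # Matching matrix M ∈ R^{2d×T}: encode G with a bidirectional LSTM.
    rnn = nn.LSTM(input_size=10 * d, hidden_size=d, bidirectional=True)
    out, _ = rnn(G.t().unsqueeze(1))                  # (T, 1, 2d)
    M = out.squeeze(1).t()                            # (2d, T)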
Through step S102 described above, the hidden state vector h_d0 that serves as the initial state of the decoder and the matching matrix M, a matching model of the kind used in machine reading comprehension tasks, are generated as the matching information.
Note that any method other than the above may be used to generate the matching information, and any representation such as a vector, a matrix, or a tensor may be used as its format. For example, a bag-of-words vector may be used in which the elements for words shared by the input question Q and the related document X are 1 and the elements for all other words are 0, or information that also takes into account the positions at which the words appear in the related document X may be used. However, if the matching information is expressed only as a scalar value such as a similarity score, the information about where the input question Q and the related document X match is lost; it is therefore preferable that the matching information not be a scalar value.
Step S103: The question restoration unit 220 of the revised question generation unit 200 uses the matching information generated by the matching unit 210 (the hidden state vector h_d0 and the matching matrix M), the input question Q, and the related document X to generate a natural sentence that becomes the revised question RQ, through steps S103-1 to S103-7 below.
Here, the natural sentence that becomes the revised question RQ is assumed to be composed of words y_s (s = 0, 1, ...), where the word y_0 is the token <BOS> indicating the beginning of a sentence. The question restoration unit 220 generates the revised question RQ by repeatedly generating the words y_s, in order from s = 1, until, for example, the token <EOS> indicating the end of a sentence is generated. Steps S103-1 to S103-7 below describe the generation of the word y_s for a given s. The RNN serving as the decoder is assumed to be an LSTM; its hidden state is denoted h_ds, and its initial value (that is, the hidden state h_ds when s = 0) is the hidden state vector h_d0 computed by the matching unit 210.
Step S103-1: First, as the Word Embedding processing in the Decode Layer of the revised question generation model shown in FIG. 5, the question restoration unit 220 converts the word y_{s-1} generated in the previous iteration into a word vector e_{y_{s-1}}. As described above, when s = 1 (that is, in the first iteration), y_{s-1} = y_0, so the token <BOS> indicating the beginning of a sentence is converted into the word vector e_{y_0}.
Step S103-2: Next, as the processing of the Decode Layer of the revised question generation model shown in FIG. 5, the question restoration unit 220 uses an attention mechanism to compute the input z^_s ∈ R^{3d} to the LSTM serving as the decoder, by equations (12) to (15) below.
Figure JPOXMLDOC01-appb-M000014
Here, W_d ∈ R^{2d×3d} and b_d ∈ R^{2d} are parameters, and f is an activation function. M_t ∈ R^{2d} is the column vector formed by the elements of the t-th column of the matching matrix M.
Step S103-3: Next, the question restoration unit 220 updates the hidden state h_ds of the decoder by equation (16) below.
Figure JPOXMLDOC01-appb-M000015
Step S103-4: Next, as the processing of the decoder in the Decode Layer, the question restoration unit 220 inputs z^_s obtained by equation (15) above into the LSTM and computes a softmax function. As the output of the softmax function, the generation probability distribution P_G(y_s | y_<s, X, Q) is obtained. The generation probability distribution P_G(y_s | y_<s, X, Q) is the distribution of the conditional probability that, given that the words up to the (s-1)-th have been generated, a word contained in a predetermined specific vocabulary set is generated as the s-th word y_s. The specific vocabulary set is, for example, a set composed of words that appear frequently in general documents.
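A single decoder step consistent with steps S103-1 to S103-4 may be sketched as follows, continuing the running sketch; equations (12) to (16) are images, so the attention form, the tanh standing in for the activation f, and all layer names are assumptions:

    vocab_size = 5000                       # illustrative vocabulary size
    embed = nn.Embedding(vocab_size, d)     # Word Embedding of the Decode Layer
    decoder = nn.LSTMCell(3 * d, 2 * d)     # decoder LSTM (input z^_s ∈ R^{3d})
    out_proj = nn.Linear(2 * d, vocab_size)

    def decode_step(y_prev, h_ds, c_ds):
        """One decoder step: embed y_{s-1}, attend over M, update h_ds,
        and return the generation distribution P_G(y_s | y_<s, X, Q)."""
        e_prev = embed(y_prev)                             # (1, d)
        eps = F.softmax(h_ds @ M, dim=-1)                  # (1, T): weights ε_st
        ctx = eps @ M.t()                                  # (1, 2d)
        z = torch.tanh(torch.cat([e_prev, ctx], dim=-1))   # (1, 3d): z^_s
        h_ds, c_ds = decoder(z, (h_ds, c_ds))              # eq. (16)
        P_G = F.softmax(out_proj(h_ds), dim=-1)            # (1, vocab)
        return P_G, eps, h_ds, c_ds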
Step S103-5: Next, as processing in the Decode Layer, the question restoration unit 220 uses the weights ε_st obtained by equation (13) above and a softmax function to compute the generation probability P_C(y_s | y_<s, X, Q) by equation (17) below.
Figure JPOXMLDOC01-appb-M000016
Here, I(y_s = x_t) is a function that returns 1 if the generated word y_s matches the t-th word x_t of the related document X, and 0 otherwise.
The generation probability P_C(y_s | y_<s, X, Q) above applies the idea of CopyNet. CopyNet is a neural network model that makes it easier to generate (copy) words on the encoder side as they are, by supplying word generation probabilities from outside the LSTM output. In the first embodiment of the present invention, introducing this generation probability P_C(y_s | y_<s, X, Q) makes it easier for words contained in the related document X to be generated (copied) as the s-th word y_s. Therefore, by introducing P_C(y_s | y_<s, X, Q), an input question Q regarded as a missing question can be supplemented with words contained in the related document X. For details of CopyNet, see, for example, References 5 and 6 below.
[Reference 5]
Z. Cao, C. Luo, W. Li, and S. Li. Joint copying and restricted generation for paraphrase. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI 2017), pp. 3152-3158, 2017.
[Reference 6]
J. Gu, Z. Lu, H. Li, and V. O. Li. Incorporating copying mechanism in sequence-to-sequence learning. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL 2016), pp. 1631-1640, 2016.
Step S103-6: Next, the question restoration unit 220 uses the weight λ_s to compute the final generation probability P(y_s | y_<s, X, Q) of the word y_s by equation (18) below.
Figure JPOXMLDOC01-appb-M000017
Here, the weight λ_s is computed by equation (19) below.
Figure JPOXMLDOC01-appb-M000018
Here, W_λ ∈ R^{1×2d} and b_λ ∈ R^1 are parameters, and σ denotes the sigmoid function.
The generation probability P(y_s | y_<s, X, Q) above is the weighted average, with weight λ_s, of P_G(y_s | y_<s, X, Q) and P_C(y_s | y_<s, X, Q). The weight λ_s thus determines whether a word contained in the related document X is copied as y_s.
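The copy distribution and the mixing gate of equations (17) to (19) may then be sketched as follows, assuming the document words have been mapped to vocabulary ids (doc_token_ids is an illustrative input, not a name from the original):

    copy_gate = nn.Linear(2 * d, 1)   # W_λ ∈ R^{1×2d}, b_λ ∈ R^1 (eq. (19))

    def final_distribution(P_G, eps, h_ds, doc_token_ids):
        """Mix P_G with the copy distribution P_C (assumed eqs. (17)-(18))."""
        # P_C: accumulate the attention weight ε_st of every document position
        # t whose word equals a vocabulary entry (the indicator I(y_s = x_t)).
        P_C = torch.zeros_like(P_G)                        # (1, vocab)
        P_C.scatter_add_(1, doc_token_ids.unsqueeze(0), eps)
        lam = torch.sigmoid(copy_gate(h_ds))               # λ_s
        return lam * P_C + (1.0 - lam) * P_G               # eq. (18)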
Step S103-7: Next, the question restoration unit 220 generates the word y_s according to the final generation probability P(y_s | y_<s, X, Q) computed by equation (18) above. That is, the question restoration unit 220 generates as y_s, for example, the word with the largest P(y_s | y_<s, X, Q) among the words contained in the related document X and the input question Q.
By repeating steps S103-1 to S103-7 above until <EOS> is generated as the word y_s, the revised question RQ composed of the words y_s (s = 0, 1, ...) is generated. The revised question generation unit 200 outputs this revised question RQ to a predetermined output destination, such as the display device 502, the auxiliary storage device 508, or another program (for example, a program that executes a question answering task).
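Steps S103-1 to S103-7 then amount to the following greedy generation loop (a sketch using the helpers above; beam search, discussed below, would keep the top-B words at the argmax instead):

    def generate(doc_token_ids, bos_id, eos_id, max_len=30):
        """Repeat steps S103-1 to S103-7 until <EOS> is generated."""
        h_ds = h_d0.unsqueeze(0)            # initial state from step S102
        c_ds = torch.zeros_like(h_ds)
        y, words = torch.tensor([bos_id]), []
        for _ in range(max_len):
            P_G, eps, h_ds, c_ds = decode_step(y, h_ds, c_ds)
            P = final_distribution(P_G, eps, h_ds, doc_token_ids)
            y = P.argmax(dim=-1)            # word with the largest P(y_s | ·)
            if y.item() == eos_id:
                break
            words.append(y.item())
        return words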
Here, the revised question RQ is created by adding information from the related document X to the input question Q. If the revised question RQ were generated from the matching information alone by a generation model such as an Encoder-Decoder model, a revised question RQ with little relation to the related document X or the input question Q could be generated. Therefore, in the first embodiment of the present invention, a technique applying the idea of CopyNet uses not only the matching information but also the information of the related document X itself, so that, for an input question Q regarded as a missing question, a revised question RQ related to the related document X can be generated.
In step S103-7 above, one word y_s is generated for each s; however, the present invention is not limited to this, and a plurality of words y_s may be generated for a given s (or for every s). By generating a plurality of words y_s, a plurality of revised questions RQ can be generated using, for example, beam search. Beam search is a kind of search algorithm, similar to breadth-first search of a graph. When beam search is used, the question restoration unit 220 generates, for example, B words y_s (one per beam) for each s. If the word length of the finally generated revised question RQ is L, up to B^L candidate revised questions RQ are thus generated. The question restoration unit 220 can then generate a plurality of variations of the revised question RQ by sorting these candidates by generation score using beam search and outputting the top q candidates.
In steps S103-1 to S103-7 above, the case where the word y_0 is <BOS> and the revised question RQ is generated in order from the word at the beginning of the sentence has been described; however, the present invention is not limited to this. For example, the word y_0 may be <EOS>, and the revised question RQ may be generated in order from the word at the end of the sentence.
(Partial Generation and Full Generation)
Here, in the revised question generation processing in the first embodiment of the present invention, a revised question RQ that compensates for part of the deficiency of an input question Q regarded as a missing question may be generated, or a revised question RQ that compensates for all of the deficiency of the input question Q may be generated. Hereinafter, generating a revised question RQ that compensates for part of the deficiency of the input question Q is referred to as "partial generation", and generating a revised question RQ that compensates for all of the deficiency of the input question Q is referred to as "full generation".
Specifically, suppose, for example, that a question whose content is clear and which lacks no information necessary for answering (hereinafter, such a question is referred to as a "full question") is "What is the fee for canceling plan A partway through?", and that the input question Q is "What is the fee?".
In this case, partial generation produces, for example, "What is the fee for canceling partway through?" as the revised question RQ. Full generation, on the other hand, produces the full question "What is the fee for canceling plan A partway through?" as the revised question RQ.
Therefore, in this case, to obtain the full question by partial generation, the revised question generation processing must be performed again with "What is the fee for canceling partway through?", obtained as the revised question RQ, as the input question Q. This yields the full question "What is the fee for canceling plan A partway through?" as the final revised question RQ.
As described above, when partial generation is used, the revised question generation processing must be executed repeatedly to obtain the full question; in general, however, partial generation can restore the full question with higher accuracy than full generation.
Whether the revised question generation processing performs partial generation or full generation is determined by the training data set used in the learning processing of the revised question generation model. Whether to use partial generation or full generation is decided according to the question answering task in which the revised questions are used.
Here, the training data set is a set of training data, each represented by a pair of an input question Q used as correct answer data and a related document X. In addition, each word constituting the input question Q used as correct answer data is given a label that is 1 if the word is contained in the related document X and 0 otherwise. Hereinafter, for convenience, the input question Q used as correct answer data is denoted as the "correct question Q_true".
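The 0/1 labels described above can be constructed directly from the token sequences; a minimal sketch:

    def copy_labels(question_tokens, document_tokens):
        """Label each word of the correct question Q_true with 1 if it is
        contained in the related document X and 0 otherwise (the supervision
        signal for the copy weight λ_s)."""
        doc_vocab = set(document_tokens)
        return [1 if w in doc_vocab else 0 for w in question_tokens]

    # e.g. copy_labels(["the", "fee", "for", "plan", "A"],
    #                  ["plan", "A", "costs", "500", "yen"]) -> [0, 0, 0, 1, 1]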
(Learning Processing of the Revised Question Generation Model)
Next, the learning processing of the revised question generation model in the first embodiment of the present invention will be described with reference to FIG. 6. FIG. 6 is a flowchart showing an example of the learning processing of the revised question generation model in the first embodiment of the present invention. In this learning processing, for example, the training data set is divided into a predetermined number of mini-batches, and the parameters of the revised question generation model are updated for each mini-batch.
Steps S201 to S204 below are executed repeatedly for each piece of training data contained in a mini-batch, whereas steps S205 to S206 below are executed after steps S201 to S204 have been executed for all training data contained in the mini-batch.
Step S201: The missing question creation unit 300 inputs the correct question Q_true contained in the training data. The revised question generation model learning unit 400 inputs the correct question Q_true and the related document X contained in the training data.
Step S202: Next, the missing question creation unit 300 creates a question Q (missing question Q) in which part of the correct question Q_true is missing. In general, there are a plurality of variations of the missing question Q for a given correct question Q_true; the missing question creation unit 300 may create all of these missing questions Q or only some of them (including just one).
For example, suppose that the correct question Q_true is "Tell me the fee for plan A". In this case, "Tell me the fee" and "Tell me" exist as variations of the missing question Q. The missing question creation unit 300 may therefore create both missing questions Q, "Tell me the fee" and "Tell me", or only one of them.
When learning a revised question generation model that realizes partial generation, a full question identical to the correct question Q_true may be used as the missing question Q, with the token <BOS> indicating the beginning of a sentence set as the correct question Q_true. In this way, when revised question generation processing by partial generation is performed, the generation of <BOS> as the word y_1 indicates that the full question has been generated as the revised question RQ.
For example, suppose that the full question is "What is the fee for canceling plan A partway through?". In this case, the first partial generation produces the revised question RQ "What is the fee for canceling partway through?" from the input question Q "What is the fee?". The second partial generation then produces the revised question RQ "What is the fee for canceling plan A partway through?" from the input question Q "What is the fee for canceling partway through?". The third partial generation produces the revised question RQ "<BOS>" from the input question Q "What is the fee for canceling plan A partway through?". The generation of <BOS> indicates that there is no further clause that can be added (generated). It can therefore be determined that the second revised question RQ, "What is the fee for canceling plan A partway through?", is the full question.
Here, any method can be used to create the missing question Q. For example, the missing question Q can be created using the result of syntactic analysis of the correct question Q_true, such as dependency parsing or phrase structure parsing. The granularity of the portion removed from the correct question Q_true can also be set arbitrarily.
One example of a method for creating the missing question Q is to remove clauses in order from the beginning of the sentence. For example, suppose that the correct question Q_true is "What is the fee for canceling plan A partway through?". This correct question Q_true consists of three clauses: "plan A", "for canceling partway through", and "what is the fee?". In this case, the missing question creation unit 300 creates as missing questions Q, for example, "What is the fee for canceling partway through?", in which the first clause of the correct question Q_true has been removed, and "What is the fee?", in which the first two clauses of the correct question Q_true have been removed.
As another example of a method for creating the missing question Q, two arbitrary clauses in a dependency relation may be extracted from the correct question Q_true, and the sentence formed by joining the two extracted clauses according to their dependency relation may be used as the missing question Q. In this case, if a clause in a dependency relation with the obtained missing question Q exists in the correct question Q_true, the sentence formed by joining that missing question Q and that clause may further be used as a new missing question Q.
When the correct question Q_true is in a language such as English, phrase structure parsing, dependency tree analysis, or the like may be performed, and the missing question Q may be created by removing material in units of phrases or words based on the analysis result. For example, when the correct question Q_true is in English, a missing question Q may be created by removing the phrase structure below a noun phrase (NP).
Note that it is preferable that the missing question creation unit 300 not create a missing question Q in which the syntactic information of the correct question Q_true is destroyed. For example, if the correct question Q_true is "Tell me the fee for plan A" and the result of dependency parsing is used, it is preferable not to create the missing question Q "Tell me plan A", which does not respect the dependency relations.
The missing question creation unit 300 may also create the missing question Q by pattern matching, for example by using a predetermined expression as a marker to determine the deletion position in the correct question Q_true. Specifically, for example, the expression "in the case of" may be used as a marker. In this case, if the correct question Q_true is "What is the penalty in the case of a contract of less than two years?", the missing question Q "What is the penalty?" can be created by removing the portion of the sentence up to the marker.
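As one concrete illustration of the clause-dropping method described above (clause segmentation itself, e.g. by a dependency parser, is assumed to be available):

    def make_missing_questions(clauses):
        """Create missing questions Q by dropping leading clauses one at a
        time, e.g. ["plan A", "for canceling partway through",
        "what is the fee?"] yields "for canceling partway through what is
        the fee?" and "what is the fee?"."""
        return [" ".join(clauses[i:]) for i in range(1, len(clauses))]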
Step S203: The matching unit 210 of the revised question generation model learning unit 400 generates the matching information. This step S203 is the same as step S102 in FIG. 4 with the input question Q read as the missing question Q, so its description is omitted.
Step S204: The question restoration unit 220 of the revised question generation model learning unit 400 generates the revised question RQ. This step S204 is the same as step S103 in FIG. 4 with the input question Q read as the missing question Q, so its description is omitted.
Step S205: The parameter update unit 410 of the revised question generation model learning unit 400 computes the error between each revised question RQ generated from the training data contained in the mini-batch and the correct question Q_true contained in that training data. As the error function used for computing the error, for example, cross-entropy may be used. The error function is determined as appropriate according to the revised question generation model.
Step S206: The parameter update unit 410 of the revised question generation model learning unit 400 updates the parameters of the revised question generation model using the error computed in step S205 above. That is, the parameter update unit 410 updates the parameters by, for example, computing the partial derivatives of the error function by the error backpropagation method using the error computed in step S205. The revised question generation model is thereby learned.
Here, the error function used when updating the parameters of the revised question generation model shown in FIG. 5 will be described.
In the revised question generation model shown in FIG. 5, the parameters (hereinafter, the parameters to be learned are denoted θ) must be learned so that each word y_s generated with the generation probability P matches the correct question Q_true. Here, as shown in equation (18) above, the generation probability P of the word y_s requires an appropriate λ_s. Therefore, in the first embodiment of the present invention, the revised question generation model is learned by multi-task learning in which the generation probability P of the word y_s and λ_s are learned simultaneously, and the error function is the sum L(θ) = L_g + L_λ of the error L_g concerning the generation probability P of the word y_s and the error L_λ concerning λ_s. The parameter θ is updated so as to minimize this error function L.
Here, the closer λ_s is to 1, the higher the probability that a word contained in the related document X is copied as y_s. As described above, at the time of learning, each word constituting the input question Q used as correct answer data is given a label that is 1 if the word is contained in the related document X and 0 otherwise. By training the neural network that produces λ_s with these labels as the correct answers, λ_s becomes a probability predicting whether the word y_s generated from c^_s is a word contained in the related document X. Through this learning, when the revised question RQ is generated, the closer λ_s is to 1, the more likely it is judged that the word to be generated is in the related document X, and the more strongly the generation probability P_C is taken into account.
The errors L_λ and L_g in the error function L(θ) = L_g + L_λ above may be computed by standard methods used in training neural networks. For example, the error L_λ can be computed using binary cross-entropy, and the error L_g using the negative log-likelihood.
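A sketch of the multi-task objective L(θ) = L_g + L_λ, assuming the per-step outputs have been stacked into tensors (all shapes are illustrative):

    bce = nn.BCELoss()

    def total_loss(P_seq, lam_seq, target_ids, copy_label_seq):
        """L(θ) = L_g + L_λ: negative log-likelihood of the words of Q_true
        plus binary cross-entropy between λ_s and the 0/1 copy labels.
        P_seq: (S, vocab); lam_seq, copy_label_seq: (S,) floats;
        target_ids: (S,) word ids of Q_true."""
        L_g = -torch.log(P_seq.gather(1, target_ids.unsqueeze(1))).mean()
        L_lam = bce(lam_seq, copy_label_seq)
        return L_g + L_lam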
(Variations of the Revised Question Generation Model)
Here, in the first embodiment of the present invention, the case where the revised question generation unit 200 is realized by the revised question generation model shown in FIG. 5 has been described; however, the revised question generation unit 200 may instead be realized by, for example, the revised question generation model shown in FIG. 7 or the revised question generation model shown in FIG. 8.
The revised question generation model shown in FIG. 7 is a model that does not have a mechanism for computing the generation probability P_C(y_s | y_<s, X, Q) in the Decode Layer. In this case, the final generation probability of the word y_s is P(y_s | y_<s, X, Q) = P_G(y_s | y_<s, X, Q).
The revised question generation model shown in FIG. 8 is a model that, in addition to the differences of the revised question generation model shown in FIG. 7, does not have a Matching Layer. In this case, as the processing of the Decode Layer, the attention mechanism computes the input z^_s to the decoder using the context matrix H instead of the matching matrix M.
(Variation of the Functional Configuration of the Question Generation Device 100)
Here, when the revised question RQ is generated, the related document X related to the input question Q may not be known explicitly, and only a document set assumed to contain the related document X may be available. In such a case, if the revised question generation processing were performed using every document contained in the document set, the processing time would increase. It is therefore conceivable to perform, as preprocessing of the revised question generation processing, a process of retrieving the related document X from the document set.
FIG. 9 shows the functional configuration of the question generation device 100 that performs the above preprocessing. FIG. 9 is a diagram showing a variation of the functional configuration of the question generation device 100 at the time of revised question generation in the first embodiment of the present invention.
As shown in FIG. 9, the question generation device 100 at the time of revised question generation may further include a related document search unit 600. The related document search unit 600 inputs the input question Q and a document set Y, and retrieves from the document set Y a document (related document) X related to the input question Q. The related document search unit 600 then outputs the retrieved related document X to the revised question generation unit 200. Thereby, even when only a document set assumed to contain the related document X is available, the revised question RQ can be obtained easily.
Any retrieval method can be used by the related document search unit 600. For example, a score may be computed between the input question Q and each document contained in the document set Y, and the N' documents with the highest scores may be used as the related documents X. The value of N' can be set arbitrarily; for example, a value of about 1 to 10 is conceivable.
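The scoring function is left open above; as one possible realization, TF-IDF cosine similarity over the document set Y can be used to pick the top N' documents (a sketch using scikit-learn):

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.metrics.pairwise import cosine_similarity

    def retrieve_related(question, documents, n_prime=3):
        """Score each document of Y against the input question Q and return
        the N' highest-scoring documents as the related documents X."""
        vec = TfidfVectorizer()
        doc_mat = vec.fit_transform(documents)     # document set Y
        q_vec = vec.transform([question])          # input question Q
        scores = cosine_similarity(q_vec, doc_mat)[0]
        top = scores.argsort()[::-1][:n_prime]
        return [documents[i] for i in top]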
Here, it is also conceivable to present the related document X retrieved by the related document search unit 600, together with the revised question RQ generated from this related document X and the input question Q, to the questioner (user) who made the input question Q. Therefore, as shown in FIG. 9, the question generation device 100 at the time of revised question generation may further include a display control unit 700. The display control unit 700 displays the related document X retrieved by the related document search unit 600 and the revised question RQ generated by the revised question generation unit 200 from this related document X and the input question Q.
(Application Examples)
Here, as described above, a plurality of related documents X may be obtained from the document set Y, for example when a value of 2 or more is set as N'. In this case, a revised question RQ can be generated using each of the plurality of related documents X.
For example, when two related documents X_1 and X_2 are obtained from the document set Y, the revised question generation unit 200 produces a revised question RQ_1 using the input question Q and the related document X_1, and a revised question RQ_2 using the input question Q and the related document X_2.
As an application example of such a question generation device 100, a chatbot is conceivable that, when some question (input question Q) is made by a user, presents the user with a plurality of revised questions RQ and the related documents X used to generate those revised questions RQ.
For example, as shown in FIG. 10, when the input question Q "I want to know the fee" is input by the user (S11), the related document search unit 600 of the question generation device 100 retrieves a plurality of related documents X (related documents X_1 and X_2) from the document set Y. The display control unit 700 of the question generation device 100 then displays to the user the revised question RQ_1 "I want to know the fee for plan A", generated by the revised question generation unit 200 from the related document X_1 and the input question Q, together with a link to the related document X_1, and the revised question RQ_2 "I want to know the fee when the special discount is applied", generated by the revised question generation unit 200 from the related document X_2 and the input question Q, together with a link to the related document X_2 (S12). Thereby, even when the user makes an ambiguous question (input question Q), the question generation device 100 can present the user with a plurality of revised questions RQ and links to the related documents X respectively related to those revised questions RQ.
As another application example to a chatbot, a plurality of revised questions RQ and related documents X may be presented in order. For example, as shown in FIG. 11, when the input question Q "I want to know the fee" is input by the user (S21), the related document search unit 600 of the question generation device 100 retrieves a plurality of related documents X (related documents X_1 and X_2) from the document set Y. The display control unit 700 of the question generation device 100 then displays, for example, a sentence for confirming with the user whether the revised question RQ_1 "I want to know the fee for plan A" is what the user intends (S22).
When a response indicating denial, such as "No", is input by the user in response to this confirmation sentence (S23), the display control unit 700 of the question generation device 100 displays, for example, a sentence for confirming with the user whether the revised question RQ_2 "I want to know the fee when the special discount is applied" is what the user intends (S24).
When a response indicating affirmation, such as "That's right", is input by the user in response to this confirmation sentence (S25), the display control unit 700 of the question generation device 100 presents, for example, a link to the related document X_2 to the user (S26).
Thereby, even when the user makes an ambiguous question (input question Q), the question generation device 100 can interactively present the user with a revised question RQ and a link to the related document X related to that revised question RQ.
(Summary)
As described above, the question generation device 100 in the first embodiment of the present invention can use a revised question generation model realized by, for example, a neural network to generate, from an input question Q that may contain a potential deficiency, a revised question RQ that contains no deficiency. Thereby, for example, when a question answering task or the like is performed using the revised question RQ, the answer accuracy of that question answering task can be improved.
In the question generation device 100 in the first embodiment of the present invention, when the revised question RQ is generated using the revised question generation model, a revised question RQ is generated that copies words contained in the related document X related to the input question Q. This further improves the answer accuracy of the question answering task described above, and also allows the user to know from which part of the related document X the revised question RQ was generated.
The question generation device 100 in the first embodiment of the present invention can also generate a plurality of variations of the revised question RQ for a single input question Q. For example, for the single input question Q "I want to know the fee", the question generation device 100 can generate variations of the revised question RQ such as "I want to know the fee for plan A" and "I want to know the fee when the special discount is applied". Thereby, for example, the user can be allowed to select, from among the plurality of variations of the revised question RQ, the revised question RQ closest to the intention of the question.
Furthermore, by generating a plurality of variations of the revised question RQ for a single input question Q, the question generation device 100 in the first embodiment of the present invention can also be applied, for example, to automatically creating or expanding a list of frequently asked questions (FAQ).
[Second Embodiment]
Next, a second embodiment of the present invention will be described.
(Overview)
In the first embodiment described above, the case where, given an input question and a related document, the question generation device 100 generates a revised question for the input question using the revised question generation model has been described. However, when the input question is short or ambiguous, for example, the answer to the input question cannot always be uniquely identified, and multiple possible answers may exist in the related document. In such a case, if the question is refined and made more specific without considering the answer, a revised question that cannot be answered may be generated. Even when multiple refined and more specific variants are produced, the answers to all the revised questions may turn out to be the same. Furthermore, question answering techniques such as machine reading comprehension can often return only one answer (that is, one answer per question) and cannot fully handle a question for which multiple answers are expected.
Therefore, in the second embodiment of the present invention, given an input question and a related document, the question generation device 100 performs question answering before generating revised questions, and generates N answers (N is an integer of 1 or more) to the input question. The question generation device 100 then generates a revised question for each of these N answers. Thereby, even when there are multiple answers to the input question, revised questions can be generated for obtaining each of these answers uniquely by machine reading comprehension or the like, and high answer accuracy can be achieved even for short or ambiguous questions. Since the N answers generated by question answering are candidates for the final answer to the input question (that is, the answer that the questioner truly needs), they are also referred to as "answer candidates".
The generation of revised questions in the second embodiment of the present invention will be described more specifically with reference to FIG. 12. For example, suppose that the related document shown in FIG. 12 and the input question "What happened to the yen exchange rate as of 5 p.m.?" are given. In this case, multiple answer candidates for the input question exist in the related document (that is, the related document describes, as answer candidates for the input question, both the yen rate against the dollar and the yen rate against the euro). At this point, therefore, it cannot be determined which of these answer candidates is the answer that the questioner truly needs.
Therefore, in the second embodiment of the present invention, two answer candidates are first generated: answer 1, "one dollar = 109.74 to 109.75 yen, with the yen 26 sen stronger against the dollar than at the end of last week", and answer 2, "one euro = 129.57 to 129.61 yen, with the yen 64 sen weaker against the euro than at the end of last week". Then, using these answers, the input question is refined and made more specific so that each question uniquely determines its answer, thereby generating a revised question for each answer. In the example shown in FIG. 12, "against the dollar" and "against the euro" are respectively added to the input question to generate revised question 1, "What happened to the yen exchange rate against the dollar as of 5 p.m.?", and revised question 2, "What happened to the yen exchange rate against the euro as of 5 p.m.?".
Thus, in the second embodiment of the present invention, revised questions are generated by the following (1) and (2).
(1) Question answering is performed on the input question, and N answers (answer candidates) to the input question are generated.
(2) For each of the N answers, a revised question for obtaining that answer is generated (that is, N revised questions corresponding to the N answers are generated).
Here, (1) and (2) above can be executed simultaneously, end-to-end, by a revised question generation model realized by a neural network. However, the revised question generation model does not necessarily have to be realized by a neural network; all or part of the revised question generation model may be realized by a machine learning model other than a neural network. Alternatively, a model that performs the question answering of (1) above and a model that generates the revised questions of (2) above may be prepared separately and used individually or in combination.
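When the two stages are prepared as separate models, the overall flow of (1) and (2) reduces to the following driver (qa_model and qg_model are hypothetical stand-ins with the interfaces shown, not components defined in this document):

    def revise_with_answers(question, document, qa_model, qg_model, n=2):
        """(1) generate N answer candidates, then (2) generate one revised
        question per answer candidate."""
        answers = qa_model.top_answers(question, document, n=n)      # step (1)
        return [(a, qg_model.generate(question, document, answer=a))
                for a in answers]                                    # step (2)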
In the question answering of (1) above, information that is highly likely to be an answer (answer candidate) is found from the related document, and an answer is produced based on the found information. Various methods exist for obtaining an answer (answer candidate): for example, a method that takes as the answer a passage extracted verbatim from the related document, and a method that generates an answer sentence with reference to the description in the related document. In the second embodiment of the present invention, as an example, the method of extracting a passage verbatim from the related document is mainly used as the method of obtaining the answers (answer candidates) in (1) above.
Here, in the learning of the revised question generation model, as in the first embodiment, an input question used as correct answer data, a question in which part of that input question is missing (that is, a missing question), and a related document are input, and the parameters of the revised question generation model are updated so that the natural sentence obtained using the missing question and the related document approaches the input question serving as the correct answer data. At this time, inside the revised question generation model, as in the first embodiment, the missing question is matched against the related document, and the missing portion is found in and compensated for from the related document. By learning such a revised question generation model, as in the first embodiment, when, for example, a short natural-sentence input question and a related document are input, the potentially missing portion of the input question is found in and compensated for from the related document, and a revised question sentence that is more detailed and more specific than the input question is generated.
In the second embodiment, in the learning of the revised question generation model, the correct answer to the input question is additionally used as correct answer data, and the parameters of the revised question generation model are updated so that the answer to the input question approaches that correct answer data.
(Functional Configuration of the Question Generation Device 100)
First, the functional configuration of the question generation device at the time of revised question generation in the second embodiment of the present invention will be described with reference to FIG. 13. FIG. 13 is a diagram showing an example of the functional configuration of the question generation device 100 at the time of revised question generation in the second embodiment of the present invention.
As shown in FIG. 13, the question generation device 100 in the second embodiment of the present invention includes a text processing unit 800, a revised question generation unit 900, and an output unit 1000.
The text processing unit 800 receives an input question and a related document, each written in natural language, and performs preprocessing so that they can be passed to the revised question generation unit 900. Specifically, the text processing unit 800 converts the input question and the related document into sets of word tokens (word sequences), for example by performing morphological analysis. Note that at least one of the input question and the related document may be a sentence obtained as a speech recognition result. The related document input to the text processing unit 800 may also be one or more documents (that is, a set of related documents). In the second embodiment of the present invention, the term "related document" also covers a set of related documents.
As in the first embodiment, the input question is hereinafter assumed to be converted into a set (word sequence) of J word tokens Q = {q_0, q_1, ..., q_J}, and this word sequence Q is also denoted the input question Q. Similarly, the related document is assumed to be converted into a set (word sequence) of T word tokens X = {x_0, x_1, ..., x_T}, and this word sequence X is also denoted the related document X.
Note that, when the input question Q and the related document X are input to the question generation device 100 already expressed as word sequences, the question generation device 100 need not include the text processing unit 800.
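As an illustration of this preprocessing, the following is a minimal Python sketch, not part of the disclosed embodiment. It assumes MeCab (a common Japanese morphological analyzer, via the mecab-python3 package) as the tokenizer; the function name and example sentences are illustrative only.

```python
import MeCab  # assumes the mecab-python3 package is installed

# A MeCab tagger in "wakati" (word-segmentation) mode splits a Japanese
# sentence into surface-form word tokens separated by spaces.
_tagger = MeCab.Tagger("-Owakati")

def to_word_sequence(text: str) -> list[str]:
    """Convert a natural-language sentence into a word-token sequence."""
    return _tagger.parse(text).strip().split()

# Example: turn an input question Q and a related document X into the
# word sequences expected by the revised question generation unit 900.
Q = to_word_sequence("パスワードを忘れた場合はどうすればいいですか")
X = to_word_sequence("パスワードを忘れた場合は、再発行窓口に連絡してください。")
print(Q, X)
```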
The revised question generation unit 900 performs question answering on the input question and generates a revised question corresponding to each answer (answer candidate) obtained by the question answering. The revised question generation unit 900 is realized by a learned revised question generation model (that is, a revised question generation model using the parameters updated by the revised question generation model learning unit 1100 described later).
Here, the revised question generation unit 900 includes a question answering execution unit 910 and a question generation unit 920.
The question answering execution unit 910 receives the input question Q and the related document X, performs question answering, and generates answer candidates for the input question Q from the related document X. As described above, the number of answer candidates generated here need not be one; N answer candidates are generated, where N is an integer of 1 or more. The second embodiment of the present invention uses the method that extracts descriptions from the related document as-is as answer candidates, but the method is not limited to this; any method can be used as long as it takes a natural-language question and an arbitrary document (related document) as input and obtains a natural-language answer.
The question generation unit 920 receives the input question Q, the related document X, and the N answer candidates, and generates revised questions RQ that are more detailed and more specific than the input question Q. At this time, the question generation unit 920 generates a revised question RQ for each of the N answer candidates (that is, it generates N revised questions RQ, one corresponding to each of the N answer candidates).
In the second embodiment of the present invention, the question generation unit 920 generates a revised question RQ by adding to the input question Q information that makes each answer candidate uniquely identifiable. For example, in the related document X, information about conditions such as "in the case of ..." or "when ..." may be written near the information that becomes an answer candidate. Therefore, by adding information about such a condition to the input question Q, a revised question RQ can be generated whose answer (answer candidate) is uniquely determined when the condition is met. Besides this, named entities such as person names and place names can also be useful information for narrowing down the answer candidates, so a revised question RQ to which these have been added may be generated.
Any technique can be adopted for the method of generating the revised question RQ, the method of finding the information to add to the input question Q, the method of adding that information to the input question Q, and so on, as long as it "generates a revised question RQ by adding to the input question Q information that makes each answer candidate uniquely identifiable" as described above. For example, a technique may be used in which the "in the case of ..." information described above is found and extracted by pattern matching, and then, among the extracted pieces of information, the one located closest to the answer (answer candidate) is prepended to the input question Q to generate the revised question RQ. Alternatively, the revised question RQ may be generated using, for example, a sentence generation technique based on a neural network.
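As an illustration of the pattern-matching variant just described, the following is a minimal Python sketch; the condition pattern (matching phrases ending in "場合", such as "~の場合") and the helper names are assumptions for illustration and are not defined in the patent.

```python
import re

# Hypothetical pattern for condition phrases ending in "場合"
# ("in the case of ...") within a related document.
CONDITION_PATTERN = re.compile(r"[^、。]*?場合")

def revise_question(question: str, document: str, answer_start: int) -> str:
    """Prepend to the question the condition phrase closest to the answer span."""
    conditions = [(m.start(), m.group()) for m in CONDITION_PATTERN.finditer(document)]
    if not conditions:
        return question
    # Choose the condition whose position is closest to the answer's start offset.
    pos, phrase = min(conditions, key=lambda c: abs(c[0] - answer_start))
    return phrase + "、" + question

doc = "パスワードを忘れた場合は、再発行窓口に連絡してください。"
print(revise_question("どうすればいいですか", doc, doc.index("再発行")))
```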
The output unit 1000 outputs the N answers (answer candidates) and the N revised questions RQ corresponding respectively to these N answers. At this time, the output unit 1000 outputs, for example, one or more pairs each consisting of an answer candidate and the revised question RQ corresponding to that answer candidate. Any method of outputting the pairs of an answer candidate and a revised question RQ may be adopted in accordance with the user interface of the question generation device 100.
For example, when the question generation device 100 has a user interface that displays answers on a screen, as in a search system, a method may be adopted in which, for the input question Q entered by the user (questioner), candidates for the revised question RQ are displayed like a search suggestion ("Did you mean ...?"), and when the user selects a revised question RQ, the answer (answer candidate) corresponding to that revised question RQ is displayed.
Further, for example, when the question generation device 100 has a spoken-dialogue user interface, a method may be adopted in which, when the user enters the input question Q, the device utters a confirmation such as "Do you perhaps mean XX?" (where XX is the content of the revised question RQ corresponding to the answer (answer candidate) with the highest likelihood), and, when the user agrees, utters the answer (answer candidate) corresponding to that revised question RQ. At this time, for example, when the user does not agree with the confirmation utterance, the device may utter a confirmation for the revised question RQ corresponding to the answer (answer candidate) with the next highest likelihood, repeating this until the user agrees. As for the likelihood of an answer (answer candidate), for example, the question generation device 100 may have a function of computing the likelihood, or the question answering execution unit 910 may compute the likelihood of an answer candidate when generating it.
The output destination of the output unit 1000 is not limited to those described above, and may be, for example, the auxiliary storage device 508, the recording medium 503a, another device connected via a network, or the like.
Next, the functional configuration of the question generation device 100 during learning in the second embodiment of the present invention will be described with reference to FIG. 14. FIG. 14 is a diagram illustrating an example of the functional configuration of the question generation device 100 during learning in the second embodiment of the present invention.
As shown in FIG. 14, the question generation device 100 during learning in the second embodiment of the present invention includes a missing question creation unit 300 and a revised question generation model learning unit 1100.
As in the first embodiment, the missing question creation unit 300 receives the input question Q and creates a missing question by deleting part of the input question Q.
The revised question generation model learning unit 1100 learns the revised question generation model using the missing question created by the missing question creation unit 300, the input question Q, the correct answer A_true to the input question Q, and the related document X. The revised question generation model learning unit 1100 then outputs the parameters of the learned revised question generation model.
Here, the revised question generation model learning unit 1100 includes the question answering execution unit 910, the question generation unit 920, and a parameter update unit 1110. The question answering execution unit 910 and the question generation unit 920 are as described above. The parameter update unit 1110 computes the error between the natural sentence (revised question RQ) generated by the question generation unit 920 and the input question Q, and also computes the error between the answer to the input question Q produced by the question answering execution unit 910 and the correct answer to the input question Q. Using these errors, the parameters of the revised question generation model (the not-yet-learned revised question generation model parameters) are updated by an arbitrary optimization method. The revised question generation model is learned by having its parameters updated by the parameter update unit 1110.
(Hardware configuration of the question generation device 100)
The hardware configuration of the question generation device 100 in the second embodiment of the present invention may be the same as in the first embodiment, so its description is omitted.
(Revised question generation process)
Next, the revised question generation process in the second embodiment of the present invention will be described with reference to FIG. 15. FIG. 15 is a flowchart illustrating an example of the revised question generation process in the second embodiment of the present invention. In the revised question generation process, it is assumed that the revised question generation model realizing the revised question generation unit 900 is implemented as a neural network and has already been learned.
Here, FIG. 16 shows an example of the revised question generation model that realizes the revised question generation unit 900 in the second embodiment of the present invention. As shown in FIG. 16, in the second embodiment of the present invention, the revised question generation model is a neural network composed of a document encoding layer, a question encoding layer, a document-question matching layer, a machine reading comprehension modeling layer, a machine reading comprehension output layer, an answer vector generation layer, a decode layer, and a revised question word generation layer. Of these layers, the document encoding layer, the question encoding layer, the document-question matching layer, the machine reading comprehension modeling layer, and the machine reading comprehension output layer realize the question answering execution unit 910. The answer vector generation layer, the decode layer, and the revised question word generation layer realize the question generation unit 920.
Note that the document encoding layer, the question encoding layer, the document-question matching layer, and the machine reading comprehension modeling layer correspond to the matching unit 210 in the first embodiment. The decode layer and the revised question word generation layer correspond to the question restoration unit 220 in the first embodiment.
The neural network realizing the revised question generation model in the second embodiment of the present invention is based on the Encoder-Decoder model, a technique for generating natural sentences with a neural network, and on a machine reading comprehension model, which generates answers for question answering with a neural network. The machine reading comprehension model generates answer candidates by extracting the descriptions that become answer candidates directly from the related document X (that is, by estimating the positions of the start point and end point of the extracted description). This machine reading comprehension model is composed of the document-question matching layer, the machine reading comprehension modeling layer, and the machine reading comprehension output layer. For details of the Encoder-Decoder model, see, for example, Reference 1 above. For details of the machine reading comprehension model, see, for example, Non-Patent Document 1 above.
In the description of the revised question generation process below, the detailed processing of each layer is also explained with reference to the revised question generation model shown in FIG. 16.
Step S301: The text processing unit 800 receives an input question and a related document written in natural language.
Step S302: The text processing unit 800 converts the received input question and related document into word sequences. As described above, it is hereinafter assumed that the input question has been converted into the word sequence Q of J word tokens and the related document into the word sequence X of T word tokens, denoted "input question Q" and "related document X", respectively.
Note that, when the input question Q and the related document X are input to the question generation device 100 already expressed as word sequences, step S302 above need not be performed.
Step S303: The revised question generation unit 900 generates, as matching information, the state vectors h_q0 and h_M0 that serve as the initial states of the decode layer, through the following steps S303-1 to S303-3.
Step S303-1: First, the question answering execution unit 910 of the revised question generation unit 900 receives the related document X and the input question Q and, as the processing of the document encoding layer and the question encoding layer of the revised question generation model shown in FIG. 16, converts (encodes) the related document X and the input question Q into d-dimensional word vector sequences. That is, the question answering execution unit 910 creates word vector sequences by converting each word token constituting the related document X and the input question Q into a d-dimensional real vector.
The question answering execution unit 910 also outputs the state vector h_q0 obtained when encoding the input question Q into the d-dimensional word vector sequence.
In the second embodiment of the present invention, the word vector sequence of the related document X is denoted H and called the "document vector sequence H". The word vector sequence of the input question Q is denoted U and called the "question vector sequence U". Here, the document vector sequence satisfies H ∈ R^{d×T} and the question vector sequence satisfies U ∈ R^{d×J}.
Any technique can be adopted for encoding the related document X and the input question Q into d-dimensional word vector sequences, as long as the document vector sequence and the question vector sequence can be generated. For example, a technique can be used in which the related document X and the input question Q are each input to a word embedding layer to convert each word token into a d-dimensional real vector, and the result is then converted into a word vector sequence by an RNN. Alternatively, encoding using an attention mechanism may be performed, for example. Note, however, that since the decode layer uses the state vector h_q0 output from the question encoding layer as its initial state, the state vector h_q0 must be generated by some method.
Although the second embodiment of the present invention describes the case where a state vector is generated only in the question encoding layer, a state vector h_x0 may be generated only in the document encoding layer, or in the document encoding layer as well. When the state vector h_x0 is generated only in the document encoding layer, the decode layer may use the state vector h_x0 as its initial state. On the other hand, when the state vectors h_q0 and h_x0 are generated in the question encoding layer and the document encoding layer, respectively, the decode layer can use one or both of these state vectors as its initial state.
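As an illustration of the embedding-plus-RNN encoding just described, the following is a minimal PyTorch sketch, not part of the disclosed embodiment; the layer sizes, the use of a GRU, and the sharing of one encoder between document and question are assumptions for illustration.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Encode a word-token sequence into a d-dimensional vector sequence
    and a final state vector (h_q0 for the question, h_x0 for the document)."""
    def __init__(self, vocab_size: int, d: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d)   # word embedding layer
        self.rnn = nn.GRU(d, d, batch_first=True)  # RNN over the embeddings

    def forward(self, token_ids: torch.Tensor):
        e = self.embed(token_ids)        # (batch, length, d)
        outputs, h_n = self.rnn(e)       # outputs: (batch, length, d)
        return outputs, h_n.squeeze(0)   # vector sequence and state vector

# Example: encode a question Q of J tokens and a document X of T tokens.
enc = Encoder(vocab_size=10000)
U, h_q0 = enc(torch.randint(0, 10000, (1, 8)))   # question vector sequence U
H, h_x0 = enc(torch.randint(0, 10000, (1, 40)))  # document vector sequence H
```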
Step S303-2: Next, as the processing of the document-question matching layer of the revised question generation model shown in FIG. 16, the question answering execution unit 910 of the revised question generation unit 900 uses the document vector sequence H and the question vector sequence U to find and extract, in the related document X, the information related to the input question Q that is needed for machine reading comprehension. This finding and extraction is performed by matching the related document X against the input question Q.
Any technique can be adopted as the method of matching the related document X against the input question Q. For example, BiDAF, which uses an attention mechanism, can be adopted. Also, for example, QANet, which uses a CNN (Convolutional Neural Network), can be adopted. For details of BiDAF using an attention mechanism, see, for example, Non-Patent Document 1 above. For details of QANet using a CNN, see, for example, Reference 7 below.
[Reference 7]
Adams Wei Yu, David Dohan, Minh-Thang Luong, Rui Zhao, Kai Chen, Mohammad Norouzi, Quoc V. Le. QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension. ICLR 2018.
As a result, a matching vector sequence G ∈ R^{r×T}, an r-dimensional real vector sequence, is output as the result of matching the related document X against the input question Q. Here, r depends on the technique used for matching the related document X against the input question Q. Note that this matching vector sequence G corresponds to the attention matrix G in the first embodiment.
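As an illustration of attention-based matching, the following is a minimal PyTorch sketch assuming a simple dot-product attention rather than the full BiDAF formulation; the function and variable names are illustrative only. The shapes follow the notation above, with batch-first tensors.

```python
import torch

def match(H: torch.Tensor, U: torch.Tensor) -> torch.Tensor:
    """Compute a matching vector sequence G from a document vector
    sequence H (batch, T, d) and a question vector sequence U (batch, J, d)."""
    # Similarity between every document position and every question position.
    S = torch.bmm(H, U.transpose(1, 2))   # (batch, T, J)
    # Document-to-question attention: for each document word, a weighted
    # summary of the question words that match it.
    A = torch.softmax(S, dim=2)           # (batch, T, J)
    U_tilde = torch.bmm(A, U)             # (batch, T, d)
    # Concatenate the document vectors with their question summaries,
    # giving an r-dimensional matching vector per document position (r = 2d).
    return torch.cat([H, U_tilde], dim=2) # (batch, T, 2d)

G = match(torch.randn(1, 40, 128), torch.randn(1, 8, 128))
```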
Step S303-3: As the processing of the machine reading comprehension modeling layer of the revised question generation model shown in FIG. 16, the question answering execution unit 910 of the revised question generation unit 900 uses the matching vector sequence G to create a machine reading comprehension modeling vector sequence M ∈ R^{d×T}. The machine reading comprehension modeling vector sequence M is created, for example, by applying an RNN-based technique to the matching vector sequence G, as in the document encoding layer and the question encoding layer. At this time, the question answering execution unit 910 also generates a hidden state vector h_M0, as in the question encoding layer. This hidden state vector h_M0 is used as an initial state of the decode layer. Note that the machine reading comprehension modeling vector sequence M corresponds to the matching matrix M in the first embodiment.
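A minimal sketch of this modeling layer, assuming a GRU over the matching vectors (dimensions are illustrative):

```python
import torch
import torch.nn as nn

# Machine reading comprehension modeling layer: an RNN over the matching
# vector sequence G (batch, T, r) producing M (batch, T, d) and the hidden
# state h_M0 used as an initial state of the decode layer.
r, d = 256, 128
modeling_rnn = nn.GRU(r, d, batch_first=True)
G = torch.randn(1, 40, r)
M, h_M0 = modeling_rnn(G)
h_M0 = h_M0.squeeze(0)   # (batch, d)
```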
Step S304: Next, as the processing of the machine reading comprehension output layer of the revised question generation model shown in FIG. 16, the question answering execution unit 910 of the revised question generation unit 900 uses the machine reading comprehension modeling vector sequence M to generate answer candidates. The answer candidates are generated by extracting, from the related document X, the start point and end point of a description that becomes an answer candidate.
As for the start point, as the processing of the answer start point output layer included in the machine reading comprehension output layer of the revised question generation model shown in FIG. 16, the machine reading comprehension modeling vector sequence M is linearly transformed with a weight W_0 ∈ R^{1×d} to create a start point vector O_start ∈ R^T, and the softmax function is applied to this start point vector O_start over the sequence length T to convert it into a probability distribution P_start. Then, using this probability distribution P_start, the t_start-th element (0 ≤ t_start ≤ T) with the highest probability is extracted from the related document X and used as the start-point word.
As for the end point, as the processing of the answer end point output layer included in the machine reading comprehension output layer of the revised question generation model shown in FIG. 16, first the start point vector O_start and the machine reading comprehension modeling vector sequence M are input to an RNN to create a new machine reading comprehension modeling vector sequence M'. A probability distribution P_end is then obtained from the new machine reading comprehension modeling vector sequence M' by the same method as for the start point, and, using this probability distribution P_end, the t_end-th element (t_start ≤ t_end ≤ T) with the highest probability is extracted from the related document X and used as the end-point word.
As a result, the span from the t_start-th word (start point) to the t_end-th word (end point) in the related document X is extracted as an answer (answer candidate).
To extract N answers (answer candidates), first, P(i, k) = P_start(i) × P_end(k) is computed using P_start and P_end, where 0 ≤ i ≤ T and i ≤ k ≤ T. Then, the N combinations of i and k with the highest P(i, k) are taken as start points and end points. As a result, the spans corresponding to the top N combinations of i and k are extracted as the N answers (answer candidates).
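The following is a minimal sketch of this top-N span selection; the function name and the random scores in the example are illustrative only.

```python
import torch

def top_n_spans(P_start: torch.Tensor, P_end: torch.Tensor, n: int):
    """Return the n spans (i, k), i <= k, maximizing P(i, k) = P_start[i] * P_end[k].
    Positions are 0-indexed over a document of length T."""
    T = P_start.shape[0]
    # Outer product gives P(i, k) for all pairs; mask out pairs with k < i.
    P = P_start.unsqueeze(1) * P_end.unsqueeze(0)   # (T, T)
    P = torch.triu(P)                               # keep only i <= k
    flat = P.flatten().topk(n).indices
    return [(int(idx // T), int(idx % T)) for idx in flat]

# Example with softmax-normalized scores over a document of length T = 40.
T = 40
P_start = torch.softmax(torch.randn(T), dim=0)
P_end = torch.softmax(torch.randn(T), dim=0)
print(top_n_spans(P_start, P_end, n=3))   # e.g. [(5, 12), (5, 13), (20, 20)]
```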
Note that the question answering execution unit 910 may output the start point and end point of each of the N answers (answer candidates), may output the N answers (answer candidates) themselves, or may output the start-point word and end-point word of each of the N answers (answer candidates). In the second embodiment of the present invention, the start point and end point of each of the N answers (answer candidates) are assumed to be output. The subsequent step S305 is executed for each of the N pairs of a start point and an end point; in the following, step S305 is described for one pair of a start point t_start and an end point t_end, referred to as "answer candidate A".
Step S305: The revised question generation unit 900 generates the revised question corresponding to the answer candidate A through the following steps S305-1 to S305-3.
Step S305-1: The question generation unit 920 of the revised question generation unit 900 receives the answer candidate A (that is, the start point t_start and the end point t_end) and, as the processing of the answer vector generation layer of the revised question generation model shown in FIG. 16, creates the answer vector a ∈ R^{d_a} corresponding to the answer candidate A, where d_a denotes the number of dimensions of the answer vector.
Any method can be adopted for creating the answer vector a, as long as it can create the answer vector a from the answer candidate A (that is, the start point t_start and the end point t_end) as input. For example, the description of the span from the start point t_start to the end point t_end may first be converted into a word sequence, and this word sequence may then be converted into a vector by the document encoding layer to obtain the answer vector a. Alternatively, the span H(t_start, t_end) ∈ R^{d×l} determined by the start point t_start and the end point t_end (where l is the sequence length of the answer candidate A) may be extracted from the document vector sequence, and the answer vector a may be created, for example, by applying an RNN to the vector sequence corresponding to the extracted span or by computing its centroid vector.
Note that, for example, when the method of generating a sentence to be the answer (answer candidate A) with reference to the descriptions in the related document X is used instead of extracting a description from the related document X as-is, the generated sentence (the sentence that becomes the answer) may be taken as input and the answer vector a created as the processing of the answer vector generation layer.
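As an illustration of the centroid-vector variant above, the following is a minimal sketch; mean-pooling is one of the options the text allows, not the only one.

```python
import torch

def answer_vector(H: torch.Tensor, t_start: int, t_end: int) -> torch.Tensor:
    """Create an answer vector a from the document vector sequence H (T, d)
    by mean-pooling the span [t_start, t_end] (the centroid vector)."""
    span = H[t_start:t_end + 1]   # (l, d), l = span length
    return span.mean(dim=0)       # a in R^{d_a} with d_a = d

H = torch.randn(40, 128)          # document vector sequence for T = 40 words
a = answer_vector(H, t_start=5, t_end=12)
```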
Step S305-2: As the processing of the decode layer of the revised question generation model shown in FIG. 16, the question generation unit 920 of the revised question generation unit 900 uses an RNN together with the answer vector a to create the vectors from which the words constituting the revised question are output. In this RNN, the state vectors h_q0 and h_M0 output from the question answering execution unit 910 are used as the initial values (initial states) of the state vector.
Any method can be adopted for using the state vectors h_q0 and h_M0. For example, the RNN may have two layers, with h_q0 as the initial state of the first-layer RNN and h_M0 as the initial state of the second-layer RNN. Alternatively, for example, when a single-layer RNN is used, a linear transformation may be applied to match the numbers of dimensions and the average vector of the two state vectors h_q0 and h_M0 may be used as the initial state, or only one of the two state vectors h_q0 and h_M0 may be used as the initial state.
Further, instead of the state vector h_M0, the state vector h_x0 of the document encoding layer may be used, so that the initial state of the decode layer is determined from the state vectors h_q0 and h_x0. This can be expected to improve answer accuracy, for example, when there are multiple different answer candidates with comparable P(i, k) (that is, when the question content is ambiguous).
Here, in the Encoder-Decoder model, the embedding vector e_{s-1} ∈ R^{d_e} of the word generated one step earlier is input to the decode layer, where d_e denotes the number of dimensions of the word embedding vector. In contrast, in the second embodiment of the present invention, the vector [e_{s-1}; a] ∈ R^{d_e + d_a}, obtained by concatenating the answer vector with the word embedding vector, is input to the decode layer. Except for the initial value of the state vector and the input vector, the decode layer is the same as the decode layer of the Encoder-Decoder model. Therefore, any technique used in the decode layer of the Encoder-Decoder model, such as an attention mechanism or a copy mechanism, may be applied to the decode layer of the revised question generation model shown in FIG. 16.
Step S305-3: As in the Encoder-Decoder model, the question generation unit 920 of the revised question generation unit 900 generates the s-th word y_s constituting the revised question from the output of the decode layer. That is, for example, after linearly transforming the output of the decode layer, the generation probability of each word is produced by the softmax function. Then, for example, the word with the highest generation probability is generated as the s-th word y_s. By repeating this until <EOS> is generated as the word y_s, the words constituting the revised question corresponding to the answer candidate A are generated. Note that y_0 is <BOS>.
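The following is a minimal sketch of the decode layer and greedy word generation in steps S305-2 and S305-3; the single-layer GRU cell, the vocabulary projection, the token ids, and the zero initial state standing in for the state built from h_q0 and h_M0 are all assumptions for illustration.

```python
import torch
import torch.nn as nn

BOS, EOS, VOCAB, D_E, D_A, D_H = 0, 1, 10000, 128, 128, 128

embed = nn.Embedding(VOCAB, D_E)
# The decoder RNN consumes the previous word embedding concatenated
# with the answer vector a, as described in step S305-2.
cell = nn.GRUCell(D_E + D_A, D_H)
out_proj = nn.Linear(D_H, VOCAB)   # revised question word generation layer

def decode(a: torch.Tensor, h0: torch.Tensor, max_len: int = 30) -> list[int]:
    """Greedily generate revised-question word ids until <EOS>."""
    h, y = h0, torch.tensor([BOS])
    words = []
    for _ in range(max_len):
        x = torch.cat([embed(y), a.unsqueeze(0)], dim=1)  # [e_{s-1}; a]
        h = cell(x, h)
        y = out_proj(h).softmax(dim=1).argmax(dim=1)      # most probable word
        if y.item() == EOS:
            break
        words.append(y.item())
    return words

# h0 here stands in for the initial state built from h_q0 and h_M0.
print(decode(a=torch.randn(D_A), h0=torch.zeros(1, D_H)))
```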
Step S306: Finally, the output unit 1000 outputs the N answers (answer candidates) and the N revised questions RQ corresponding respectively to these N answers.
(Learning process of the revised question generation model)
Next, the learning process of the revised question generation model in the second embodiment of the present invention will be described with reference to FIG. 17. FIG. 17 is a flowchart illustrating an example of the learning process of the revised question generation model in the second embodiment of the present invention. In the second embodiment of the present invention, a machine reading comprehension corpus is used to learn the revised question generation model. A machine reading comprehension corpus contains multiple triples of a "question", a "document to be questioned", and an "answer span in the document to be questioned (or the character string of that answer span)". The "document to be questioned" contained in the corpus is used as the related document X, the "question" contained in the corpus as the input question Q, and the correct answer A_true to the input question Q is the "answer span in the document to be questioned (or the character string of that answer span)" in the corpus, used as-is. The input question Q and the correct answer A_true to the input question Q are then used as learning data for the machine reading comprehension processing in the question answering execution unit 910. In the second embodiment of the present invention, the correct answer A_true is assumed to be represented by a pair of a start point and an end point.
Step S401: The text processing unit 800 receives multiple pieces of learning data (that is, a learning data set) and related documents.
Step S402: The text processing unit 800 converts the multiple input questions contained in the received pieces of learning data and the related documents into multiple input questions Q and related documents X expressed as word sequences. However, when a machine reading comprehension corpus is used, the input questions and related documents are often already expressed as word sequences, in which case step S402 need not be performed.
In the learning process of the revised question generation model, for example, the learning data set is divided into a predetermined number of mini-batches, and the parameters of the revised question generation model are updated for each mini-batch.
The following steps S403 to S406 are executed repeatedly, using each piece of learning data contained in the mini-batch. The following steps S407 to S409, on the other hand, are executed after steps S403 to S406 have been executed for all the learning data contained in the mini-batch.
Step S403: The missing question creation unit 300 creates a question (the missing question Q) by deleting part of the input question Q, which is the learning data. Since the input question Q serves as the correct-answer data for the missing question Q, the input question Q is hereinafter denoted the correct question Q_true.
Any technique can be used to create the missing question Q. For example, the missing question Q may be created statistically using a trained Encoder-Decoder model, or it may be created by cutting off clauses and phrases using syntactic information such as sentence dependencies. Alternatively, the missing question Q may be created using a sentence compression technique, one of the tasks of natural language processing.
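As a simple illustration of creating a missing question, the following sketch randomly deletes a span of tokens; this is only a crude stand-in for the dependency-based or sentence-compression approaches the text mentions, and the function name is illustrative.

```python
import random

def make_missing_question(tokens: list[str], max_drop: int = 3) -> list[str]:
    """Create a missing question by deleting a random span of word tokens."""
    if len(tokens) < 3:
        return tokens
    n_drop = random.randint(1, min(max_drop, len(tokens) - 2))
    start = random.randrange(len(tokens) - n_drop)
    return tokens[:start] + tokens[start + n_drop:]

Q_true = ["パスワード", "を", "忘れた", "場合", "は", "どう", "すれば", "いい", "か"]
print(make_missing_question(Q_true))  # e.g. ['パスワード', 'を', 'どう', 'すれば', 'いい', 'か']
```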
Step S404: The question answering execution unit 910 of the revised question generation model learning unit 1100 generates matching information. Step S404 is the same as step S303 in FIG. 15 with the input question Q read as the missing question Q, so its description is omitted.
Step S405: The question answering execution unit 910 of the revised question generation model learning unit 1100 generates answer candidates for the missing question Q. Step S405 is the same as step S304 in FIG. 15 with the input question Q read as the missing question Q, so its description is omitted.
Step S406: The question generation unit 920 of the revised question generation model learning unit 1100 generates the revised question RQ corresponding to each answer candidate of the missing question Q. Step S406 is the same as step S305 in FIG. 15 with the input question Q read as the missing question Q, so its description is omitted.
Step S407: The parameter update unit 1110 of the revised question generation model learning unit 1100 computes the first error, between each revised question RQ generated using a piece of learning data contained in the mini-batch and the input question Q (that is, the correct question Q_true) contained in that learning data. The parameter update unit 1110 also computes the second error, between the answer A to the input question Q contained in each piece of learning data in the mini-batch and the correct answer A_true contained in that learning data. Here, the answer A is obtained as the answer in question answering by inputting the input question Q (and the related document X) to the question answering execution unit 910.
As the error function used to compute the first error and the second error, for example, cross entropy may be used. The error function is determined as appropriate according to the revised question generation model.
Step S408: The parameter update unit 1110 of the revised question generation model learning unit 1100 updates the parameters of the revised question generation model using the first error and the second error computed in step S407 above. That is, the parameter update unit 1110 updates the parameters of the revised question generation model by, for example, computing the partial derivatives of the error function by backpropagation using the first error and the second error computed in step S407 above. The revised question generation model is thereby learned.
Here, as shown in FIG. 16, when the revised question generation model is a neural network, an error function is defined with respect to the correct-answer data (that is, the correct question Q_true for the revised question RQ and the correct answer A_true for the correct question Q_true) for each of machine reading comprehension (that is, the question answering execution unit 910) and revised question generation (that is, the question generation unit 920). The sum of these error function values (that is, the sum of the first error and the second error) is treated as the error of the entire neural network, and the parameters are updated so that this error becomes small (that is, the parameters are updated by multi-task learning).
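The following is a minimal sketch of the multi-task parameter update in steps S407 and S408, assuming cross-entropy losses and an Adam optimizer; `model`, its output keys, and the batch tensors are hypothetical stand-ins for the full revised question generation model and are not defined in the patent.

```python
import torch
import torch.nn.functional as F

# `model` and the batch/output names below are hypothetical stand-ins
# for the revised question generation model and its outputs.
optimizer = torch.optim.Adam(model.parameters())

def training_step(batch):
    out = model(batch["missing_question"], batch["related_document"])
    # First error: revised-question words vs. the correct question Q_true.
    loss_q = F.cross_entropy(
        out["word_logits"].flatten(0, 1),   # (steps*batch, vocab)
        batch["q_true_ids"].flatten())      # gold word ids
    # Second error: predicted answer span vs. the correct answer A_true.
    loss_a = (F.cross_entropy(out["start_logits"], batch["t_start_true"]) +
              F.cross_entropy(out["end_logits"], batch["t_end_true"]))
    loss = loss_q + loss_a                  # multi-task loss
    optimizer.zero_grad()
    loss.backward()                         # backpropagation
    optimizer.step()                        # parameter update
    return loss.item()
```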
(Summary)
As described above, the question generation device 100 in the second embodiment of the present invention uses a revised question generation model realized, for example, by a neural network to perform question answering on the input question Q before generating revised questions RQ, and generates a revised question RQ corresponding to each answer candidate obtained by this question answering. As a result, even when, for example, the answer to the input question Q cannot be uniquely determined, a revised question RQ is generated for each answer candidate, so that high answer accuracy can be achieved in question answering tasks by using these revised questions RQ.
The present invention is not limited to the specifically disclosed embodiments above, and various modifications and changes are possible without departing from the scope of the claims.
100 Question generation device
200 Revised question generation unit
210 Matching unit
220 Question restoration unit
300 Missing question creation unit
400 Revised question generation model learning unit

Claims (12)

1. A question generation device comprising: generation means for receiving as input a question sentence and a related document containing an answer to the question sentence and generating, using a machine learning model trained in advance, a revised question sentence in which a potentially missing part of the question sentence is supplemented with words contained in a predetermined vocabulary set.
2. The question generation device according to claim 1, wherein the generation means comprises: matching means for generating matching information representing the correspondence between each word contained in the question sentence and each word contained in the related document; and question restoration means for generating the revised question sentence by generating, using the matching information generated by the matching means, each word constituting the revised question sentence from the vocabulary set.
3. The question generation device according to claim 2, wherein the question restoration means generates each word constituting the revised question sentence according to a third probability represented by a weighted average of a first probability of generating the word from among the words contained in the vocabulary set and a second probability of generating the word from among the words contained in the related document.
4. The question generation device according to any one of claims 1 to 3, wherein the revised question sentence is a sentence in which a potentially missing part of the question sentence is supplemented with words contained in the vocabulary set and words contained in the related document.
5. The question generation device according to any one of claims 1 to 4, wherein, when the question sentence is input, the generation means generates, based on the question sentence and a set of related documents containing answers to the question sentence, the revised question sentences corresponding to the respective related documents contained in the set and correspondence information between the related documents and the revised question sentences.
6. The question generation device according to any one of claims 1 to 4, wherein the generation means repeatedly executes, taking a generated revised question sentence as input, generation of a revised question sentence in which a potentially missing part of that revised question sentence is supplemented.
7. The question generation device according to claim 1, wherein the generation means generates answer candidates for the question sentence and, for each answer candidate, the revised question sentence for which that answer candidate serves as the answer.
8. The question generation device according to claim 7, wherein the generation means comprises: matching means for generating matching information representing the correspondence between each word contained in the question sentence and each word contained in the related document; machine reading comprehension means for generating the answer candidates using the matching information; and revised question generation means for generating the revised question sentence by generating, using the answer candidates and the matching information, each word constituting the revised question sentence from the vocabulary set.
9. A question generation device comprising: first generation means for receiving as input a question sentence and a related document containing an answer to the question sentence and generating a missing question sentence in which part of the question sentence is deleted; second generation means for generating, using a neural network model, a restored question sentence in which the missing question sentence is restored with words contained in a predetermined vocabulary set; and learning means for updating parameters of the neural network model using an error between the restored question sentence generated by the second generation means and the question sentence.
10. The question generation device according to claim 9, wherein the learning means updates the parameters of the neural network model further using an error between a correct answer to the question sentence and the answer to the question sentence.
11. A question generation method in which a computer executes a generation procedure of receiving as input a question sentence and a related document containing an answer to the question sentence and generating, using a machine learning model trained in advance, a revised question sentence in which a potentially missing part of the question sentence is supplemented with words contained in a predetermined vocabulary set.
12. A program for causing a computer to function as each means in the question generation device according to any one of claims 1 to 10.
PCT/JP2019/017805 2018-06-07 2019-04-25 Question generation device, question generation method, and program WO2019235103A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/972,187 US11972365B2 (en) 2018-06-07 2019-04-25 Question responding apparatus, question responding method and program

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2018109765 2018-06-07
JP2018-109765 2018-06-07
JP2018214187A JP7087938B2 (en) 2018-06-07 2018-11-14 Question generator, question generation method and program
JP2018-214187 2018-11-14

Publications (1)

Publication Number Publication Date
WO2019235103A1 true WO2019235103A1 (en) 2019-12-12

Family

ID=68770823

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2019/017805 WO2019235103A1 (en) 2018-06-07 2019-04-25 Question generation device, question generation method, and program

Country Status (2)

Country Link
JP (1) JP7315065B2 (en)
WO (1) WO2019235103A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111241250A (en) * 2020-01-22 2020-06-05 中国人民大学 Emotional dialogue generation system and method
CN112749539A (en) * 2020-01-20 2021-05-04 腾讯科技(深圳)有限公司 Text matching method and device, computer readable storage medium and computer equipment
WO2022003762A1 (en) * 2020-06-29 2022-01-06 日本電信電話株式会社 Question answering device, question answering method, and question answering program
JP2022544428A (en) * 2020-06-28 2022-10-19 ベイジン バイドゥ ネットコム サイエンス テクノロジー カンパニー リミテッド Search item rewriting method, device, device and storage medium
US20220383330A1 (en) * 2021-05-27 2022-12-01 EMC IP Holding Company LLC System and method for identifying and remediating unanswered queries in application resolution reports
US11699038B2 (en) 2020-09-18 2023-07-11 Fujifilm Business Innovation Corp. Information processing apparatus

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008078670A1 (en) * 2006-12-22 2008-07-03 Nec Corporation Sentence rephrasing method, program, and system
US20160062980A1 (en) * 2014-08-29 2016-03-03 International Business Machine Corporation Question Correction and Evaluation Mechanism for a Question Answering System
JP2016045652A (en) * 2014-08-21 2016-04-04 国立研究開発法人情報通信研究機構 Enquiry sentence generation device and computer program
JP2017049681A (en) * 2015-08-31 2017-03-09 国立研究開発法人情報通信研究機構 Training device for question answering system and computer program therefor
WO2018097091A1 (en) * 2016-11-25 2018-05-31 日本電信電話株式会社 Model creation device, text search device, model creation method, text search method, data structure, and program

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3946115B2 (en) * 2002-09-19 2007-07-18 Nippon Telegraph and Telephone Corporation Response dialogue generation method, response dialogue creation device, response dialogue creation program, and recording medium recording this program
JP4461738B2 (en) * 2003-08-13 2010-05-12 Fuji Xerox Co., Ltd. Question answering apparatus and method
JP2006323670A (en) * 2005-05-19 2006-11-30 Nippon Telegraph and Telephone Corporation Question answering method, question answering apparatus, and program
JP6007088B2 (en) * 2012-12-05 2016-10-12 KDDI Corporation Question answering program, server and method using a large amount of comment text
CA2932401A1 (en) * 2013-12-02 2015-06-11 Qbase, LLC Systems and methods for in-memory database search

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008078670A1 (en) * 2006-12-22 2008-07-03 NEC Corporation Sentence rephrasing method, program, and system
JP2016045652A (en) * 2014-08-21 2016-04-04 National Institute of Information and Communications Technology Enquiry sentence generation device and computer program
US20160062980A1 (en) * 2014-08-29 2016-03-03 International Business Machines Corporation Question Correction and Evaluation Mechanism for a Question Answering System
JP2017049681A (en) * 2015-08-31 2017-03-09 National Institute of Information and Communications Technology Training device for question answering system and computer program therefor
WO2018097091A1 (en) * 2016-11-25 2018-05-31 Nippon Telegraph and Telephone Corporation Model creation device, text search device, model creation method, text search method, data structure, and program

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112749539A (en) * 2020-01-20 2021-05-04 Tencent Technology (Shenzhen) Co., Ltd. Text matching method and device, computer readable storage medium and computer equipment
CN112749539B (en) * 2020-01-20 2023-09-15 Tencent Technology (Shenzhen) Co., Ltd. Text matching method, text matching device, computer readable storage medium and computer equipment
CN111241250A (en) * 2020-01-22 2020-06-05 Renmin University of China Emotional dialogue generation system and method
CN111241250B (en) * 2020-01-22 2023-10-24 Renmin University of China Emotional dialogue generation system and method
JP2022544428A (en) * 2020-06-28 2022-10-19 Beijing Baidu Netcom Science Technology Co., Ltd. Search item rewriting method, apparatus, device, and storage medium
JP7352640B2 2020-06-28 2023-09-28 Beijing Baidu Netcom Science Technology Co., Ltd. Search item rewriting method, apparatus, device, and storage medium
WO2022003762A1 (en) * 2020-06-29 2022-01-06 Nippon Telegraph and Telephone Corporation Question answering device, question answering method, and question answering program
JP7468654B2 2020-06-29 2024-04-16 Nippon Telegraph and Telephone Corporation Question answering device, question answering method, and question answering program
US11699038B2 (en) 2020-09-18 2023-07-11 Fujifilm Business Innovation Corp. Information processing apparatus
US20220383330A1 (en) * 2021-05-27 2022-12-01 EMC IP Holding Company LLC System and method for identifying and remediating unanswered queries in application resolution reports

Also Published As

Publication number Publication date
JP2022111261A (en) 2022-07-29
JP7315065B2 (en) 2023-07-26

Similar Documents

Publication Publication Date Title
JP7087938B2 (en) Question generator, question generation method and program
WO2019235103A1 (en) Question generation device, question generation method, and program
Kamath et al. Deep learning for NLP and speech recognition
US20210124878A1 (en) On-Device Projection Neural Networks for Natural Language Understanding
Yao et al. An improved LSTM structure for natural language processing
CN110377903B (en) Sentence-level joint entity and relation extraction method
Jeong et al. Triangular-chain conditional random fields
CN110866401A (en) Chinese electronic medical record named entity identification method and system based on attention mechanism
CN107870902A (en) Neural machine translation system
EP3411835B1 (en) Augmenting neural networks with hierarchical external memory
CN110990555B (en) End-to-end retrieval-based dialogue method and system, and computer equipment
CN106202010A (en) Method and apparatus for building legal-text syntax trees based on deep neural networks
CN112925516A (en) Slot filling with context information
CN108959482A (en) Single-turn dialogue data classification method, device and electronic equipment based on deep learning
CN113254610B (en) Multi-round conversation generation method for patent consultation
CN114676234A (en) Model training method and related equipment
JP7229345B2 (en) Sentence processing method, sentence decoding method, apparatus, program, and device
WO2020240709A1 (en) Dialog processing device, learning device, dialog processing method, learning method, and program
Dai et al. A survey on dialog management: Recent advances and challenges
JP7342971B2 (en) Dialogue processing device, learning device, dialogue processing method, learning method and program
CN111145914B (en) Method and device for determining text entities in a lung cancer clinical disease database
CN112183062B (en) Spoken language understanding method based on alternate decoding, electronic equipment and storage medium
CN112364659B (en) Automatic identification method and device for unsupervised semantic representation
Goutsu et al. Linguistic descriptions of human motion with generative adversarial seq2seq learning
Wang et al. Improving relation extraction by multi-task learning

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 19814938

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 EP: PCT application non-entry in European phase

Ref document number: 19814938

Country of ref document: EP

Kind code of ref document: A1