JP6769405B2

JP6769405B2 - Dialogue system and dialogue method

Info

Publication number: JP6769405B2
Application number: JP2017135625A
Authority: JP
Inventors: 拓磨峰村; 整加藤
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2017-07-11
Filing date: 2017-07-11
Publication date: 2020-10-14
Anticipated expiration: 2037-07-11
Also published as: JP2019020774A

Description

本発明は、対話システムに関し、特に、ユーザからの質問に対して回答を生成する質問応答システムに関連する。 The present invention relates to a dialogue system, and more particularly to a question answering system that generates an answer to a question from a user.

質問応答システムでは、ユーザからの自然文による質問に対して、適切な回答を生成することが望まれている。 In a question answering system, it is desired to generate an appropriate answer to a question written by a user in a natural sentence.

特許文献１では、ユーザ入力された自然文を解析した結果と、それぞれの状況において要求される機能を判別するように機械学習された識別機を用いる。そして、これらを比較することで、ユーザがシステムに要求している機能を抽出している。 In Patent Document 1, a discriminator that has been machine-learned to discriminate between a result of analyzing a natural sentence input by a user and a function required in each situation is used. Then, by comparing these, the functions required by the user from the system are extracted.

特許文献２では、ユーザ入力された質問文中の単語や文節の曖昧度を求めて、曖昧度の高い単語や文節の意味を特定するための問いかけ文を生成する。これを繰り返し行い、適切な回答を生成している。 In Patent Document 2, the degree of ambiguity of a word or phrase in a user-input question sentence is obtained, and a question sentence for specifying the meaning of the word or phrase with a high degree of ambiguity is generated. This is repeated to generate the appropriate answer.

特開２０１６−１５１９２８号公報Japanese Unexamined Patent Publication No. 2016-151928 特開２００４−１１０５２４号公報Japanese Unexamined Patent Publication No. 2004-110524

しかしながら、特許文献１ではユーザの入力が曖昧である場合に、適切な回答を生成することができない。また、特許文献２は、自明の事柄が省略された場合も絞込み対話が発生してしまうため、ユーザとシステムとの間の冗長な対話が発生してしまう。 However, in Patent Document 1, when the user's input is ambiguous, an appropriate answer cannot be generated. Further, in Patent Document 2, since a narrowing-down dialogue occurs even when a self-evident matter is omitted, a redundant dialogue between the user and the system occurs.

本発明は、対話システムにおいて、ユーザからの自然文による質問が曖昧である場合でも、質問を特定し、適切な回答を生成することを目的とする。 An object of the present invention is to identify a question and generate an appropriate answer even when the question in natural sentences from the user is ambiguous in the dialogue system.

本発明の第一の態様は、
ユーザの質問文のテキストを取得する取得手段と、
あらかじめ定められた複数の質問と、前記複数の質問の属性を表す１つ以上の属性項目と、前記複数の質問ごとの前記属性項目の内容を表す属性値を対応付けて保持する記憶手段と、
前記テキストから、前記属性項目もしくは前記属性値またはこれらの類義語に関連するキーワードを抽出する抽出手段と、
前記キーワードと前記記憶手段を用いて、前記複数の質問の内、前記テキストと合致する質問の候補を求める第一意図理解手段と、
前記複数の質問のそれぞれと前記テキストの一致度を表すスコアを前記複数の質問ごとに求める第二意図理解手段と、
前記第一意図理解手段または前記第二意図理解手段の処理結果から、質問を特定する質問特定手段と、
を備え、
前記質問特定手段は、
前記第二意図理解手段の処理結果から、質問が一意に定まる場合は、前記第二意図理
解手段の処理結果からユーザの質問を特定し、
前記第二意図理解手段の処理結果から、質問が一意に定まらない場合は、前記第一意図理解手段および前記第二意図理解手段の処理結果からユーザの質問を特定する、
対話システムである。 The first aspect of the present invention is
How to get the text of the user's question,
A storage means for associating and holding a plurality of predetermined questions, one or more attribute items representing the attributes of the plurality of questions, and an attribute value representing the contents of the attribute items for each of the plurality of questions.
An extraction means for extracting the attribute item or the attribute value or a keyword related to these synonyms from the text.
Using the keyword and the storage means, a first intent understanding means for finding a candidate for a question that matches the text among the plurality of questions.
A second means of understanding the intention to obtain a score indicating the degree of agreement between each of the plurality of questions and the text for each of the plurality of questions.
A question specifying means for identifying a question from the processing result of the first intention understanding means or the second intention understanding means,
With
The question identification means
If the question is uniquely determined from the processing result of the second intention understanding means, the user's question is specified from the processing result of the second intention understanding means.
If the question is not uniquely determined from the processing result of the second intention understanding means, the user's question is specified from the processing results of the first intention understanding means and the second intention understanding means.
It is a dialogue system.

このような構成によれば、前記質問特定手段は、ユーザの質問文が曖昧である場合でも、第一意図理解手段によって得られる質問の候補または第二意図理解手段によって得られるスコアから、質問を特定することができる。よって、適切な回答を返すことができる。 According to such a configuration, the question identifying means asks a question from a question candidate obtained by the first intention understanding means or a score obtained by the second intention understanding means even if the question sentence of the user is ambiguous. Can be identified. Therefore, an appropriate answer can be returned.

本発明において、記憶手段は、あらかじめ定められた複数の質問を保持する。また、記憶手段は、前記複数の質問を特定するための項目として、前記複数の質問の属性（特徴）を表す「属性項目」を保持する。さらに、記憶手段は、各属性が実際にとる値（内容）である「属性値」を保持する。 In the present invention, the storage means holds a plurality of predetermined questions. Further, the storage means holds "attribute items" representing the attributes (features) of the plurality of questions as items for identifying the plurality of questions. Further, the storage means holds an "attribute value" which is a value (content) actually taken by each attribute.

例えば、「車のトラブルに関する質問応答システム」の場合、「属性項目」は、車のトラブルに関する任意の質問を特定可能とする項目であることが好ましい。例えば、「属性項目」は、［＜パーツ＞，＜症状＞，＜質問形式＞］等とすることができる。また、上記の例において、あらかじめ定められた質問「エンジンから異音がする場合どうすればよいか？」が設けられているとすると、この質問に対応する「属性値」は、［「エンジン」，「異音」，「ｈｏｗ」］等とすることができる。 For example, in the case of a "question answering system for car troubles", it is preferable that the "attribute item" is an item that enables any question regarding car troubles to be specified. For example, the "attribute item" can be [<part>, <symptom>, <question format>] or the like. Further, in the above example, if a predetermined question "What should I do if the engine makes an abnormal noise?" Is provided, the "attribute value" corresponding to this question is ["engine", " It can be "abnormal noise", "how"], etc.

本発明において、前記質問特定手段は、前記スコアを正規化した値が所定の閾値以上となる質問が１つとなる場合に、質問が一意に定まると判断する、ことが好ましい。 In the present invention, it is preferable that the question identifying means determines that a question is uniquely determined when there is one question in which the normalized value of the score is equal to or greater than a predetermined threshold value.

また、本発明において、前記質問特定手段における、前記第二意図理解手段を用いる質問の特定は、前記複数の質問すべてを対象に前記スコアを正規化した値が所定の閾値以上となる質問を、ユーザの質問であると特定することにより行う、ことが好ましい。さらに、前記質問特定手段における、前記第一意図理解手段および前記第二意図理解手段を用いる質問の特定は、前記第一意図理解手段によって得られる質問の候補を対象に前記スコアを正規化した値が所定の閾値以上となる質問を、ユーザの質問であると特定することにより行う、ことが好ましい。 Further, in the present invention, in the question specifying means, the question using the second intention understanding means is specified by asking a question for which the normalized value of the score is equal to or more than a predetermined threshold value for all the plurality of questions. It is preferably done by identifying it as a user question. Further, in the question identifying means, the identification of the question using the first intention understanding means and the second intention understanding means is a value obtained by normalizing the score for the question candidates obtained by the first intention understanding means. It is preferable to ask a question in which is equal to or greater than a predetermined threshold value by identifying it as a user's question.

このように、第二意図理解手段の処理結果のみで質問を一意に特定することができない場合であっても、第一意図理解手段の処理結果を用いることで、質問特定手段は質問を特定することができる。 In this way, even if the question cannot be uniquely specified only by the processing result of the second intention understanding means, the question identifying means specifies the question by using the processing result of the first intention understanding means. be able to.

本発明において、前記質問特定手段によって質問が一意に定まらない場合に、前記抽出手段によりキーワードが抽出されていない前記属性項目についてユーザへの問いかけ文を生成する問いかけ文生成手段をさらに備える、ことが好ましい。 In the present invention, it is possible to further provide a question sentence generation means for generating a question sentence to the user for the attribute item for which the keyword is not extracted by the extraction means when the question is not uniquely determined by the question identification means. preferable.

これにより、最初にユーザ入力された質問からキーワードが抽出されていない属性項目等について問いかけることができるため、ユーザの質問が曖昧である場合でも対処することができる。具体的な例として、上述の「車のトラブルに関する質問応答システム」において、ユーザから「エンジンがなんかおかしいな」といった質問が入力されたとする。この場合、抽出手段によってキーワード「エンジン」が抽出されるため、属性項目＜パーツ＞については「エンジン」に関する質問であると判断できる。ここで、属性項目＜症状＞については不明であるため、キーワードが抽出されていない属性項目＜症状＞を用いて「どのような症状でしょうか？」と問いかけ文を生成することができる。 As a result, it is possible to ask about attribute items and the like for which keywords are not extracted from the question first input by the user, so that even if the user's question is ambiguous, it can be dealt with. As a specific example, it is assumed that a question such as "something is wrong with the engine" is input by the user in the above-mentioned "question answering system for car trouble". In this case, since the keyword "engine" is extracted by the extraction means, it can be determined that the attribute item <part> is a question related to "engine". Here, since the attribute item <symptom> is unknown, it is possible to generate a question sentence asking "what kind of symptom is it?" Using the attribute item <symptom> for which the keyword has not been extracted.

本発明において、前記問いかけ文生成手段は、前記抽出手段によりキーワードが抽出さ
れていない前記属性項目の内、前記属性値が複数種類存在する前記属性項目の前記属性値を用いて問いかけ文を生成する、ことができる。 In the present invention, the question sentence generating means generates a question sentence by using the attribute value of the attribute item in which a plurality of types of the attribute values exist among the attribute items whose keywords have not been extracted by the extraction means. ,be able to.

これにより、問いかけ文生成手段は、具体的な属性値の内容について問いかけ文を生成することができるため、ユーザとの冗長な対話を回避することができる。具体的には、上述の質問において、属性項目＜パーツ＞が「エンジン」に絞り込まれた結果、例えば、問いかけ文生成手段は、キーワードが抽出されていない属性項目＜症状＞の属性値について「Ａですか？またはＢですか？」といった問いかけ文を生成することができる。また、問いかけ文生成手段は、属性値が複数種類存在する属性項目の属性値を用いることで、例えば、属性項目＜症状＞の属性値が「ｘｘｘ」の１種類しか存在しない場合に、「ｘｘｘですか？」といった冗長な問いかけ文の生成を回避することができる。 As a result, the question sentence generation means can generate a question sentence about the content of a specific attribute value, so that redundant dialogue with the user can be avoided. Specifically, as a result of narrowing down the attribute item <parts> to the "engine" in the above question, for example, the question sentence generation means has "A" for the attribute value of the attribute item <symptom> for which the keyword has not been extracted. Is it? Or is it B? ”Can be generated. Further, the question sentence generation means uses the attribute value of the attribute item having a plurality of types of attribute values. For example, when the attribute value of the attribute item <symptom> is only one type of "xxx", "xxx" is used. It is possible to avoid the generation of redundant question sentences such as "Is it?".

本発明において、前記問いかけ文生成手段は、前記複数の質問の内、前記スコアを正規化した値が所定値以上の質問のみを対象とし、かつ、平均情報量（エントロピー）が最大の前記属性項目の前記属性値を用いて問いかけ文を生成する、ことができる。 In the present invention, the question sentence generating means targets only the questions whose normalized value of the score is equal to or more than a predetermined value among the plurality of questions, and the attribute item having the maximum average amount of information (entropy). It is possible to generate a question sentence using the attribute value of.

このように、問いかけ文生成手段は、ユーザの質問により近いと想定される質問に絞り込み、かつ、平均情報量を基にして、どの属性項目について問いかけをするかを決定することができるので、質問の特定に要する対話回数を最小限に留めることができる。 In this way, the question sentence generation means can narrow down the questions that are expected to be closer to the user's question, and can determine which attribute item to ask based on the average amount of information. It is possible to minimize the number of dialogues required to identify.

また、本発明において、前記問いかけ文生成手段は、前記属性項目の内、複数の属性項目について、前記属性値を問いかけ文として用いる、こともできる。 Further, in the present invention, the question sentence generating means can also use the attribute value as a question sentence for a plurality of attribute items among the attribute items.

さらに、本発明において、前記質問特定手段において質問が一意に定まるまで、前記問いかけ文生成手段、前記第一意図理解手段、前記第二意図理解手段および前記質問特定手段の処理を繰り返す、こともできる。 Further, in the present invention, the processing of the question sentence generating means, the first intention understanding means, the second intention understanding means, and the question specifying means can be repeated until the question is uniquely determined by the question specifying means. ..

また本発明において、前記第二意図理解手段は、機械学習により生成した分類器を用いて前記スコアを求める、ことが好ましい。 Further, in the present invention, it is preferable that the second intention understanding means obtains the score by using a classifier generated by machine learning.

なお、本発明は、上記手段の少なくとも一部を備える対話システムとして捉えることもできる。本発明は、また、上記処理の少なくとも一部を実行する対話方法として捉えることができる。また、本発明は、この方法をコンピュータに実行させるためのコンピュータプログラム、あるいはこのコンピュータプログラムを非一時的に記憶したコンピュータ可読記憶媒体として捉えることもできる。上記手段および処理の各々は可能な限り互いに組み合わせて本発明を構成することができる。 The present invention can also be regarded as a dialogue system including at least a part of the above means. The present invention can also be regarded as an interactive method for executing at least a part of the above processing. The present invention can also be regarded as a computer program for causing a computer to execute this method, or as a computer-readable storage medium in which the computer program is stored non-temporarily. Each of the above means and treatments can be combined with each other as much as possible to constitute the present invention.

例えば、本発明の一態様は、
あらかじめ定められた複数の質問と、前記複数の質問の属性を表す１つ以上の属性項目と、前記複数の質問ごとの前記属性項目の内容を表す属性値を対応付ける記憶手段を備えるコンピュータが実行する対話方法であって、
ユーザの質問文のテキストを取得する取得ステップと、
前記テキストから、前記属性項目もしくは前記属性値またはこれらの類義語に関連するキーワードを抽出する抽出ステップと、
前記キーワードと前記記憶手段を用いて、前記複数の質問の内、前記テキストと合致する質問の候補を求める第一意図理解ステップと、
前記複数の質問のそれぞれと前記テキストの一致度を表すスコアを前記複数の質問ごとに求める第二意図理解ステップと、
前記第一意図理解ステップまたは前記第二意図理解ステップの処理結果から、質問を特定する質問特定ステップと、
を含み、
前記質問特定ステップでは、
前記第二意図理解ステップの処理結果から、質問が一意に定まる場合は、前記第二意図理解ステップの処理結果からユーザの質問を特定し、
前記第二意図理解ステップの処理結果から、質問が一意に定まらない場合は、前記第一意図理解ステップおよび前記第二意図理解ステップの処理結果からユーザの質問を特定する、
対話方法である。 For example, one aspect of the present invention is
Executed by a computer having a storage means for associating a plurality of predetermined questions, one or more attribute items representing the attributes of the plurality of questions, and an attribute value representing the contents of the attribute items for each of the plurality of questions. It ’s a dialogue method,
Get step to get the text of the user's question, and
An extraction step of extracting a keyword related to the attribute item or the attribute value or a synonym thereof from the text.
Using the keyword and the storage means, a first intent understanding step of finding a candidate for a question that matches the text among the plurality of questions.
A second intent understanding step of obtaining a score representing the degree of agreement between each of the plurality of questions and the text for each of the plurality of questions.
A question identification step that identifies a question from the processing result of the first intention understanding step or the second intention understanding step,
Including
In the question identification step
If the question is uniquely determined from the processing result of the second intention understanding step, the user's question is specified from the processing result of the second intention understanding step.
If the question is not uniquely determined from the processing result of the second intention understanding step, the user's question is specified from the processing results of the first intention understanding step and the second intention understanding step.
It is a dialogue method.

本発明によれば、対話システムにおいて、ユーザからの自然文による質問が曖昧である場合でも、質問を特定し、適切な回答を生成することが可能となる。 According to the present invention, in the dialogue system, even when a question in a natural sentence from a user is ambiguous, it is possible to identify the question and generate an appropriate answer.

図１は、実施形態に係る音声対話システムの構成を示す図である。FIG. 1 is a diagram showing a configuration of a voice dialogue system according to an embodiment. 図２は、実施形態に係る記憶部の構成の例を示す図である。FIG. 2 is a diagram showing an example of the configuration of the storage unit according to the embodiment. 図３は、実施形態に係る機械学習型分類部の処理の例を示す図である。FIG. 3 is a diagram showing an example of processing of the machine learning type classification unit according to the embodiment. 図４は、実施形態に係る質問特定部の処理の例を示す図である。FIG. 4 is a diagram showing an example of processing of the question specifying unit according to the embodiment. 図５は、実施形態に係る音声対話処理の流れの例を示すフローチャートである。FIG. 5 is a flowchart showing an example of the flow of voice dialogue processing according to the embodiment. 図６は、実施形態に係る問いかけ文生成処理の流れの例を示すフローチャートである。FIG. 6 is a flowchart showing an example of the flow of the question sentence generation process according to the embodiment. 図７は、実施形態に係る問いかけ文生成処理の質問の絞り込み方法の例を示す図である。FIG. 7 is a diagram showing an example of a method of narrowing down the questions in the question sentence generation process according to the embodiment. 図８は、実施形態に係る問いかけ文生成処理の属性項目の選択方法の例を示す図である。FIG. 8 is a diagram showing an example of a method of selecting an attribute item in the question sentence generation process according to the embodiment.

以下、図面を参照して、本発明の実施の形態を例示的に詳しく説明する。なお、以下の説明は本発明を例示的に説明するものであり、本発明は以下の実施形態に限定されるものではない。 Hereinafter, embodiments of the present invention will be described in detail exemplarily with reference to the drawings. The following description is an exemplary description of the present invention, and the present invention is not limited to the following embodiments.

＜システム構成＞
図１は、本実施形態に係る音声対話システムの構成を示す図である。本実施形態に係る音声対話システムは、音声対話装置１００およびスマートフォン２００を備える。 <System configuration>
FIG. 1 is a diagram showing a configuration of a voice dialogue system according to the present embodiment. The voice dialogue system according to the present embodiment includes a voice dialogue device 100 and a smartphone 200.

本実施形態に係る音声対話装置１００は、演算装置、記憶装置、入出力装置などを含む情報処理装置（コンピュータ）である。記憶装置に格納されたプログラムを演算装置が実行することで、音声対話装置１００の、取得部１１０、記憶部１２０、抽出部１３０、ルールベース型分類部１４０、機械学習型分類部１５０、質問特定部１６０、問いかけ文生成部１７０、回答生成部１８０等の機能が提供される。 The voice dialogue device 100 according to the present embodiment is an information processing device (computer) including an arithmetic device, a storage device, an input / output device, and the like. When the arithmetic unit executes the program stored in the storage device, the acquisition unit 110, the storage unit 120, the extraction unit 130, the rule-based classification unit 140, the machine learning type classification unit 150, and the question identification unit of the voice dialogue device 100 are executed. Functions such as unit 160, question sentence generation unit 170, and answer generation unit 180 are provided.

また、本実施形態に係るスマートフォン２００は、マイクやスピーカー等を含む情報処理装置（コンピュータ）を備える。スマートフォン２００は、ユーザの発話内容（質問）を取得し、発話内容をテキストに変換した後、音声対話装置１００に出力する。また、スマートフォン２００は、音声対話装置１００より問いかけ文や回答文を取得し、音声等でユーザに出力する。 Further, the smartphone 200 according to the present embodiment includes an information processing device (computer) including a microphone, a speaker, and the like. The smartphone 200 acquires the user's utterance content (question), converts the utterance content into text, and then outputs the utterance content to the voice dialogue device 100. Further, the smartphone 200 acquires a question sentence and an answer sentence from the voice dialogue device 100 and outputs the question sentence and the answer sentence to the user by voice or the like.

取得部１１０は、本実施形態では、スマートフォン２００を介してユーザの質問文のテキストを取得する。 In the present embodiment, the acquisition unit 110 acquires the text of the user's question text via the smartphone 200.

記憶部１２０は、あらかじめ定められた複数の質問と、複数の質問の属性を表す１つ以上の属性項目と、複数の質問それぞれに対して属性項目が実際とる値（内容）である属性値を保持する。 The storage unit 120 stores a plurality of predetermined questions, one or more attribute items representing the attributes of the plurality of questions, and an attribute value which is a value (content) actually taken by the attribute item for each of the plurality of questions. Hold.

図２は、本実施形態に係る「車のトラブルに関する質問応答システム」における記憶部１２０の例を示す。図２の例では、記憶部１２０に複数の質問（ＩＤ）１２１ならびに属性項目１２２（パーツ）、属性項目１２３（症状）および属性項目１２４（質問形式）を記憶している。また、図２の例では、記憶部１２０に＜パーツ＞に該当する属性値「エンジン」および「タイヤ」、＜症状＞に該当する属性値「ｘｘｘ」および「ｙｙｙ」、＜質問形式＞に該当する属性値「ｈｏｗ」および「ｗｈａｔ」を記憶している。 FIG. 2 shows an example of the storage unit 120 in the “question answering system for car trouble” according to the present embodiment. In the example of FIG. 2, a plurality of questions (ID) 121, attribute item 122 (parts), attribute item 123 (symptom), and attribute item 124 (question format) are stored in the storage unit 120. Further, in the example of FIG. 2, the storage unit 120 corresponds to the attribute values "engine" and "tire" corresponding to <parts>, the attribute values "xxx" and "yyy" corresponding to <symptom>, and <question format>. The attribute values "how" and "what" to be used are stored.

なお、図２の例では属性項目が３つの場合を示しているが、属性項目の数はいくつであっても構わない。また、記憶部１２０の属性項目および属性値の内容は特に限定されない。また、図２では、記憶部１２０の構成として表（テーブル）を用いているが、データ構造は特に限定されない。記憶部１２０は、あらかじめ定められた質問を特定できるように構成されていればよい。 Although the example of FIG. 2 shows the case where there are three attribute items, the number of attribute items may be any number. Further, the content of the attribute item and the attribute value of the storage unit 120 is not particularly limited. Further, in FIG. 2, a table is used as the configuration of the storage unit 120, but the data structure is not particularly limited. The storage unit 120 may be configured so that a predetermined question can be specified.

なお、記憶部１２０のすべての属性値の内容が一致する質問が複数あってもよい。例えば、図２の属性項目［＜パーツ＞，＜症状＞，＜質問形式＞］の属性値が、［「エンジン」，「異音」，「ｈｏｗ」］である質問が２つ以上あってもよい。このような構成とした場合でも、後述するように、記憶部１２０を用いて質問の候補を求めることができる。また、記憶部１２０は、手動で作成してもよく、機械学習等を用いて作成してもよい。さらに、記憶部１２０の内容は固定でもよいし、追加、更新および削除等を行ってもよい。 In addition, there may be a plurality of questions in which the contents of all the attribute values of the storage unit 120 match. For example, even if there are two or more questions whose attribute values of the attribute items [<parts>, <symptoms>, <question format>] in FIG. 2 are ["engine", "abnormal noise", "how"]. Good. Even in such a configuration, as will be described later, the storage unit 120 can be used to obtain question candidates. Further, the storage unit 120 may be created manually, or may be created by using machine learning or the like. Further, the contents of the storage unit 120 may be fixed, or may be added, updated, deleted, or the like.

抽出部１３０は、入力されたテキストから、記憶部１２０の属性項目もしくは属性値またはこれらの類義語に関連するキーワードを抽出する。ここで、ユーザが以下のような曖昧な質問をした場合、抽出部１３０は、図２に示す記憶部１２０を参照して、質問文中の属性値に関連するキーワード「エンジン」を抽出する。
「エンジンがなんかおかしいな」 The extraction unit 130 extracts an attribute item or attribute value of the storage unit 120 or a keyword related to these synonyms from the input text. Here, when the user asks the following ambiguous question, the extraction unit 130 extracts the keyword "engine" related to the attribute value in the question text by referring to the storage unit 120 shown in FIG.
"Samething strange about the engine"

なお、抽出方法は特に限定されない。たとえば、抽出部１３０は、形態素解析等を用いてテキストに含まれるすべての単語等を抽出してもよい。 The extraction method is not particularly limited. For example, the extraction unit 130 may extract all words and the like included in the text by using morphological analysis and the like.

ルールベース型分類部１４０（第一意図理解手段）は、記憶部１２０および抽出部１３０の抽出結果から、ユーザの質問と合致する質問の候補を求める。まず、ルールベース型分類部１４０は、抽出部１３０の抽出したキーワードと、記憶部１２０の属性値を参照する。そして、ルールベース型分類部１４０は、キーワードに関連する属性値を有する質問を、ユーザの質問の候補として出力する。ここで、キーワードが抽出された属性項目を属性項目群Ａ、キーワードが抽出されていない属性項目を属性項目群Ｂと称する。 The rule-based classification unit 140 (first intention understanding means) seeks a question candidate that matches the user's question from the extraction results of the storage unit 120 and the extraction unit 130. First, the rule-based classification unit 140 refers to the keywords extracted by the extraction unit 130 and the attribute values of the storage unit 120. Then, the rule-based classification unit 140 outputs a question having an attribute value related to the keyword as a candidate for the user's question. Here, the attribute item from which the keyword is extracted is referred to as the attribute item group A, and the attribute item from which the keyword is not extracted is referred to as the attribute item group B.

例えば、ユーザが上述のような曖昧な質問をした場合、まず、ルールベース型分類部１４０は抽出部１３０が抽出した質問文のキーワード「エンジン」および記憶部１２０の属性値を参照する。そして、ルールベース型分類部１４０は、キーワード「エンジン」に関連する属性値を有する質問を候補として求める。記憶部１２０が図２に示す構成の場合、＜パーツ＞の属性値に「エンジン」が設定されている質問１から５が質問の候補として出力される。この場合、属性項目＜パーツ＞は属性項目群Ａ、属性項目＜症状＞および＜質問形式＞は属性項目群Ｂに含まれる。 For example, when the user asks an ambiguous question as described above, first, the rule-based classification unit 140 refers to the keyword "engine" of the question text extracted by the extraction unit 130 and the attribute value of the storage unit 120. Then, the rule-based classification unit 140 asks for a question having an attribute value related to the keyword "engine" as a candidate. When the storage unit 120 has the configuration shown in FIG. 2, questions 1 to 5 in which "engine" is set in the attribute value of <part> are output as question candidates. In this case, the attribute item <part> is included in the attribute item group A, and the attribute items <symptom> and <question format> are included in the attribute item group B.

機械学習型分類部１５０（第二意図理解手段）は、あらかじめ定められた質問のそれぞ
れと、取得部１１０より出力されたテキストとの一致度（スコア）を求める。本実施形態では、機械学習型分類部１５０は、機械学習を用いて生成された分類器を用いてスコアの算出を行う。機械学習を用いた分類器の作成方法に関しては、既存の手法を用いる。分類器によって求められたスコアの例を図３（Ａ）に示す。 The machine learning type classification unit 150 (second intention understanding means) obtains a degree of agreement (score) between each of the predetermined questions and the text output from the acquisition unit 110. In the present embodiment, the machine learning type classification unit 150 calculates the score using a classifier generated by using machine learning. As for the method of creating a classifier using machine learning, an existing method is used. An example of the score obtained by the classifier is shown in FIG. 3 (A).

質問特定部１６０は、ルールベース型分類部１４０または機械学習型分類部１５０の処理結果を用いてユーザの質問を特定する。本実施形態では、機械学習型分類部１５０の処理結果から質問が一意に定まる場合は、質問特定部１６０は、当該質問をユーザの質問であると特定する。また、機械学習型分類部１５０の処理結果からユーザの質問が一意に定まらない場合は、質問特定部１６０は、ルールベース型分類部１４０および機械学習型分類部１５０を用いて判断を行う。質問特定部１６０の詳細については、図５のフローチャートを用いた処理説明において行う。 The question identification unit 160 identifies a user's question by using the processing result of the rule-based classification unit 140 or the machine learning type classification unit 150. In the present embodiment, when a question is uniquely determined from the processing result of the machine learning type classification unit 150, the question identification unit 160 identifies the question as a user's question. When the user's question is not uniquely determined from the processing result of the machine learning type classification unit 150, the question identification unit 160 makes a determination using the rule-based classification unit 140 and the machine learning type classification unit 150. The details of the question specifying unit 160 will be described in the processing explanation using the flowchart of FIG.

問いかけ文生成部１７０は、質問特定部１６０において質問が一意に定まらない場合に、ユーザへの問いかけ文を生成する。そして、生成された問いかけ文はスマートフォン２００を介してユーザに出力される。本実施形態では、問いかけ文生成部１７０は、ルールベース型分類部１４０の算出結果から得られる質問の候補を対象に、属性項目群Ｂの属性値を用いて問いかけ文を生成する。問いかけ文生成部１７０の詳細については、図５および６のフローチャートを用いた処理説明において行う。 The question sentence generation unit 170 generates a question sentence to the user when the question is not uniquely determined in the question identification unit 160. Then, the generated question sentence is output to the user via the smartphone 200. In the present embodiment, the question sentence generation unit 170 generates a question sentence by using the attribute value of the attribute item group B for the question candidates obtained from the calculation result of the rule-based classification unit 140. The details of the question sentence generation unit 170 will be described in the processing description using the flowcharts of FIGS. 5 and 6.

回答生成部１８０は、質問特定部１６０によって特定された質問に対する回答を生成する。そして、生成された回答はスマートフォン２００を介してユーザに出力される。本実施形態では、回答生成部１８０は、質問と回答が１対１に対応付けられる第２記憶手段（不図示）を参照して回答文を生成する。 The answer generation unit 180 generates an answer to the question specified by the question identification unit 160. Then, the generated answer is output to the user via the smartphone 200. In the present embodiment, the answer generation unit 180 generates an answer sentence with reference to a second storage means (not shown) in which the question and the answer are associated with each other on a one-to-one basis.

＜処理内容＞
本実施形態に係る音声対話システムにおける処理について、上述と同様、ユーザから「エンジンがなんかおかしいな」といった質問が入力された例について図５を参照して説明する。 <Processing content>
Similar to the above, the processing in the voice dialogue system according to the present embodiment will be described with reference to FIG. 5 with reference to an example in which a question such as "something is wrong with the engine" is input from the user.

ステップＳ１０２において、取得部１１０は、スマートフォン２００を介してユーザの質問のテキストを取得する。 In step S102, the acquisition unit 110 acquires the text of the user's question via the smartphone 200.

ステップＳ１０４において、機械学習型分類部１５０は、あらかじめ定められた複数の質問ぞれぞれと、取得されたテキストとの一致度（スコア）を算出する。 In step S104, the machine learning type classification unit 150 calculates the degree of coincidence (score) between each of the plurality of predetermined questions and the acquired text.

ステップＳ１０６において、質問特定部１６０は、ステップＳ１０４の処理結果を用いて質問が一意に定まるか否かを判断する。具体的には、質問特定部１６０は、ステップＳ１０４において複数の質問すべてに対して算出されたスコアを合計値が１となるように正規化し、正規化後の値（確信度）が所定の閾値以上となる質問が一意に定まるか否かを判断する。ここで、所定の閾値とは、それ以上であればユーザの意図した質問であるとみなせるような確信度の値である。図３（Ｂ）は、図３（Ａ）のすべての質問を対象に、スコアを正規化して得られる確信度の例である。質問特定部１６０は、質問が一意に定まる場合には、当該質問をユーザの質問であると特定し、ステップＳ１１８に進む。質問が一意に定まらない場合は、ステップＳ１１０に進む。 In step S106, the question identification unit 160 determines whether or not the question is uniquely determined by using the processing result of step S104. Specifically, the question identification unit 160 normalizes the scores calculated for all of the plurality of questions in step S104 so that the total value is 1, and the normalized value (confidence) is a predetermined threshold value. Determine whether the above questions are uniquely determined. Here, the predetermined threshold value is a value of certainty that can be regarded as a question intended by the user if it is higher than that. FIG. 3 (B) is an example of the certainty obtained by normalizing the scores for all the questions of FIG. 3 (A). When the question is uniquely determined, the question specifying unit 160 identifies the question as a user's question and proceeds to step S118. If the question is not uniquely determined, the process proceeds to step S110.

ステップＳ１０８において、ステップＳ１０６の処理の結果、質問が一意に定まらないと判断された場合（Ｓ１０６−ＮＯ）、抽出部１３０は、ステップＳ１０２において取得したテキストから、記憶部１２０の属性項目もしくは属性値またはこれらの類義語に関連するキーワードを抽出する。 In step S108, when it is determined that the question is not uniquely determined as a result of the processing in step S106 (S106-NO), the extraction unit 130 extracts the attribute item or attribute value of the storage unit 120 from the text acquired in step S102. Or extract keywords related to these synonyms.

ステップＳ１１０において、ルールベース型分類部１４０は、抽出されたキーワードと記憶部１２０を参照して、ユーザの質問の候補を算出する。 In step S110, the rule-based classification unit 140 calculates the candidate of the user's question by referring to the extracted keyword and the storage unit 120.

ステップＳ１１２において、質問特定部１６０は、ステップＳ１１０の処理結果から質問が一意に定まるか否かを判断する。具体的には、候補として算出された質問のみを対象に、再度、スコアの正規化を行う。そして、質問特定部１６０は、得られた値（確信度）が所定の閾値以上となる質問が一意に定まるか否かを判断する。本実施形態では、ステップＳ１０６とステップＳ１１２において同じ閾値を用いるが、異なる値を用いてもよい。質問が一意に定まる場合には、質問特定部１６０は当該質問をユーザの質問であると特定して、ステップＳ１１８に進む。質問が一意に定まらない場合は、ステップＳ１１４に進む。図４（Ａ）は、あらかじめ定められた質問すべてを対象にスコアを正規化した例を示す。図４（Ｂ）は、ルールベース型分類部１４０が算出した質問の候補のみを対象にスコアを正規化した例を示す。 In step S112, the question identification unit 160 determines whether or not the question is uniquely determined from the processing result of step S110. Specifically, the score is normalized again only for the questions calculated as candidates. Then, the question specifying unit 160 determines whether or not a question whose obtained value (certainty) is equal to or higher than a predetermined threshold value is uniquely determined. In this embodiment, the same threshold value is used in step S106 and step S112, but different values may be used. When the question is uniquely determined, the question identification unit 160 identifies the question as a user's question and proceeds to step S118. If the question is not uniquely determined, the process proceeds to step S114. FIG. 4 (A) shows an example in which the scores are normalized for all the predetermined questions. FIG. 4B shows an example in which the score is normalized only for the question candidates calculated by the rule-based classification unit 140.

ステップＳ１１４では、ステップＳ１１２において質問が一意に定まらないと判断された場合（Ｓ１１２−ＮＯ）、問いかけ文生成部１７０がユーザへの問いかけ文を生成する。本実施形態では、問いかけ文生成部１７０は、属性項目群Ｂに含まれる属性項目の属性値を用いて問いかけ文を生成する。そして、生成された問いかけ文は、スマートフォン２００を介してユーザに出力される。ステップＳ１１４の詳細については、後述の図６のフローチャートを用いた処理説明において行う。 In step S114, when it is determined in step S112 that the question is not uniquely determined (S112-NO), the question sentence generation unit 170 generates a question sentence to the user. In the present embodiment, the question sentence generation unit 170 generates a question sentence using the attribute values of the attribute items included in the attribute item group B. Then, the generated question sentence is output to the user via the smartphone 200. The details of step S114 will be described in the processing description using the flowchart of FIG. 6 described later.

ステップＳ１１６において、システムからユーザに対して問いかけがされた後に、ユーザからの応答が検出された場合は、ルールベース型分類部１４０は、再度、質問の候補の算出を行い、ステップＳ１１２に進む。そして、質問が一意に定まるまでステップＳ１１２から１１６の処理を繰り返す。 If a response from the user is detected after the system asks the user in step S116, the rule-based classification unit 140 calculates the question candidates again and proceeds to step S112. Then, the processes of steps S112 to 116 are repeated until the question is uniquely determined.

ステップＳ１１８では、上記ステップにおいて質問が一意に定まる場合（Ｓ１０６−ＹＥＳ、Ｓ１１２−ＹＥＳ）、回答生成部１８０は、質問に適した回答を行う。回答は、スマートフォン２００を介してユーザに出力される。 In step S118, when the question is uniquely determined in the above step (S106-YES, S112-YES), the answer generation unit 180 gives an answer suitable for the question. The answer is output to the user via the smartphone 200.

（問いかけ文生成処理）
ステップＳ１１４の処理の詳細を、図６および７を参照して説明する。 (Question sentence generation process)
Details of the process in step S114 will be described with reference to FIGS. 6 and 7.

≪ステップＳ１１４１≫
ステップＳ１１４１において、問いかけ文生成部１７０は質問の絞り込みを行う。具体的には、問いかけ文生成部１７０は確信度が所定値以上の質問を選択する。ここで、所定値とは、それ未満であればユーザの意図した質問ではないとみなせるような確信度の値である。本実施形態では、所定値は上述の所定の閾値未満の値を用いるが、所定の閾値以上でもよい。図７は、問いかけ文生成部１７０が、所定値０．１０で質問を絞り込む例である。図７の例では、問いかけ文生成部１７０は、ステップＳ１１０において算出された５つの質問を、質問１から４の４つに絞り込んでいる。 << Step S1141 >>
In step S1141, the question sentence generation unit 170 narrows down the questions. Specifically, the question sentence generation unit 170 selects a question whose certainty is equal to or higher than a predetermined value. Here, the predetermined value is a value of certainty that if it is less than that, it can be regarded as not a question intended by the user. In the present embodiment, the predetermined value uses a value less than the above-mentioned predetermined threshold value, but may be equal to or more than the predetermined threshold value. FIG. 7 is an example in which the question sentence generation unit 170 narrows down the questions with a predetermined value of 0.10. In the example of FIG. 7, the question sentence generation unit 170 narrows down the five questions calculated in step S110 to four questions 1 to 4.

≪ステップＳ１１４２≫
ステップＳ１１４２において、問いかけ文生成部１７０は、問いかけ文に用いる属性項目を選択する。まず、問いかけ文生成部１７０は、キーワードが抽出されていない属性項目群Ｂに着目する。そして、問いかけ文生成部１７０は、属性項目群Ｂに含まれる属性項目の内、ステップＳ１１４１の処理で絞り込まれた質問に対応する属性値が複数種類存在する属性項目を選択する。図７の例では、問いかけ文生成部１７０は、まず、属性項目群Ｂに含まれる属性項目＜症状＞、＜質問形式＞に着目する。そして、属性項目ごとに絞り
込まれた質問１から４に対応する属性値が複数種類存在する属性項目を選択する。具体的には、属性項目＜症状＞に該当する属性値は「ｘｘｘ」、「ｙｙｙ」、「ｚｚｚ」および「ｗｗｗ」の４種類、属性項目＜質問形式＞に該当する属性値は「ｈｏｗ」の１種類のみである。よって、図７の例では、問いかけ文生成部１７０は、複数の属性値が存在する属性項目＜症状＞を選択する。 << Step S1142 >>
In step S1142, the question sentence generation unit 170 selects an attribute item to be used for the question sentence. First, the question sentence generation unit 170 pays attention to the attribute item group B in which the keyword is not extracted. Then, the question sentence generation unit 170 selects, among the attribute items included in the attribute item group B, an attribute item having a plurality of types of attribute values corresponding to the questions narrowed down by the process of step S1141. In the example of FIG. 7, the question sentence generation unit 170 first pays attention to the attribute items <symptom> and <question format> included in the attribute item group B. Then, an attribute item having a plurality of types of attribute values corresponding to questions 1 to 4 narrowed down for each attribute item is selected. Specifically, there are four types of attribute values corresponding to the attribute item <symptom>: "xxx", "yyy", "zzz" and "www", and the attribute value corresponding to the attribute item <question format> is "how". There is only one type of. Therefore, in the example of FIG. 7, the question sentence generation unit 170 selects the attribute item <symptom> in which a plurality of attribute values exist.

≪ステップＳ１１４３≫
ステップＳ１１４３において、ステップＳ１１４２の処理結果から選択された属性項目が１つか否かを判断する。選択された属性項目が１つの場合には、問いかけ文生成部１７０は、当該属性項目を問いかけ文に用いる属性項目とし、ステップＳ１１４５に進む。選択された属性項目が２つ以上の場合は、ステップＳ１１４４に進む。 << Step S1143 >>
In step S1143, it is determined whether or not there is one attribute item selected from the processing result of step S1142. When there is only one selected attribute item, the question sentence generation unit 170 sets the attribute item as the attribute item used in the question sentence, and proceeds to step S1145. If there are two or more selected attribute items, the process proceeds to step S1144.

≪ステップＳ１１４４≫
ステップＳ１１４４では、ステップＳ１１４３において属性項目が１つに決定されない場合（Ｓ１１４３−ＮＯ）に、問いかけ文生成部１７０は、ステップＳ１１４２において選択された複数の属性項目から問いかけ文に用いる属性項目を決定する。 << Step S1144 >>
In step S1144, when the attribute item is not determined to be one in step S1143 (S1143-NO), the question sentence generation unit 170 determines the attribute item to be used for the question sentence from the plurality of attribute items selected in step S1142. ..

ここで、後述するステップＳ１１４５では、問いかけ文生成部１７０は、ステップＳ１１４４において決定する属性項目について、２種類の属性値を用いて問いかけ文を生成する。例えば、属性項目＜症状＞について「Ａですか？またはＢですか？」（属性値が３種類以上存在する場合は「Ａですか？またはＢですか？それとも他の症状ですか？」）といった問いかけ文を生成する。 Here, in step S1145, which will be described later, the question sentence generation unit 170 generates a question sentence using two types of attribute values for the attribute item determined in step S1144. For example, for the attribute item <symptom>, "A or B?" (If there are three or more attribute values, "A or B? Or another symptom?") Generate a question sentence.

よって、ステップＳ１１４４における属性項目の決定方法として、問いかけ文生成部１７０は、例えば、上記の問いかけを行う場合の平均情報量（エントロピー）を考慮して属性項目を決定することができる。具体的には、問いかけ文生成部１７０は、属性項目ごとに各属性値の確信度を用いて平均情報量の算出を行い、求まる値が最大の属性項目を用いて問いかけ文を生成する。以下、図８を用いて、平均情報量を考慮した属性項目の決定方法について説明する。 Therefore, as a method of determining the attribute item in step S1144, the question sentence generation unit 170 can determine the attribute item in consideration of, for example, the average amount of information (entropy) when asking the above question. Specifically, the question sentence generation unit 170 calculates the average amount of information using the certainty of each attribute value for each attribute item, and generates a question sentence using the attribute item having the maximum obtained value. Hereinafter, a method of determining an attribute item in consideration of the average amount of information will be described with reference to FIG.

図８は、ステップＳ１１４２において複数の属性項目が選択された例である。このような場合、問いかけ文生成部１７０は、ステップＳ１１４１において絞り込まれた質問を対象に再度正規化を行った確信度から、以下の式１を用いて平均情報量を求める。

Ｅは属性項目ごとの平均情報量、Ｐ_１からＰ_３は所定の属性値で集約した確信度の合計値である。 FIG. 8 is an example in which a plurality of attribute items are selected in step S1142. In such a case, the question sentence generation unit 170 obtains the average amount of information using the following equation 1 from the certainty that the questions narrowed down in step S1141 are normalized again.

E is the average amount of information for each attribute item, P ₃ from P ₁ is the total value of the confidence that aggregates a predetermined attribute value.

ここで、問いかけ文生成部１７０は、２種類の属性値を選ぶ。例えば、図８に示す記憶部３２０の属性項目＜症状＞の場合、問いかけ文生成部１７０は、１つ目の属性値は「ｘｘｘ」、２つ目の属性値は「ｙｙｙ」を選ぶ。なお、本実施形態では、属性値が３種類以上存在する場合は、後述するステップＳ１１４５の処理と同様に、スコアの降順で上位２種類の属性値を用いるが、属性値の選択方法については特に限定されない。例えば、問いかけ文生成部１７０は、属性値をランダムに２種類選択してもよい。 Here, the question sentence generation unit 170 selects two types of attribute values. For example, in the case of the attribute item <symptom> of the storage unit 320 shown in FIG. 8, the question sentence generation unit 170 selects “xxx” for the first attribute value and “yyy” for the second attribute value. In the present embodiment, when there are three or more types of attribute values, the top two types of attribute values are used in descending order of the score, as in the process of step S1145 described later, but the method of selecting the attribute values is particularly high. Not limited. For example, the question sentence generation unit 170 may randomly select two types of attribute values.

そして、１つ目の属性値で集約した確信度の合計値をＰ_１、２つ目の属性値で集約した確信度の合計値をＰ_２とする。また、属性値が３種類以上ある場合は、３つ目以降のすべての属性値の確信度の合計値をＰ_３（Ｐ_３＝１−Ｐ_１−Ｐ_２）とする。 Then, let P _{1 be} the total value of the certainty levels aggregated by the _first attribute value, and P _{2 be} the total value of the certainty levels aggregated by the second attribute value. If there are three or more types of attribute values, the total value of the certainty of all the attribute values after the _third is P ₃ (P ₃ = 1-P _1- P ₂ ).

図８に示す記憶部３２０の例では、属性項目＜症状＞の平均情報量は１．５７（Ｐ_１＝０．３１，Ｐ_２＝０．２８，Ｐ_３＝０．４１）、属性項目＜質問形式＞の平均情報量は０．９９（Ｐ_１＝０．４５，Ｐ_２＝０．５５，Ｐ_３＝０．００）である。よって、問いかけ文生成部１７０は、問いかけ文生成に用いる属性項目を、平均情報量が最大の属性項目＜症状＞に決定する。 In the example of the storage unit 320 shown in FIG. 8, the average amount of information of the attribute item <symptom> is 1.57 (P ₁ = 0.31, P ₂ = 0.28, P ₃ = 0.41), and the attribute item < The average amount of information in the question format> is 0.99 (P ₁ = 0.45, P ₂ = 0.55, P ₃ = 0.00). Therefore, the question sentence generation unit 170 determines the attribute item used for question sentence generation as the attribute item <symptom> having the maximum average amount of information.

なお、平均情報量の算出に用いる属性値の選択方法は特に限定されず、例えば、問いかけ文生成部１７０は属性値をランダムに選んでもよい。また、平均情報量の算出に用いる属性値の種類は２種類に限定されず、例えば、問いかけ文生成部１７０は、１種類または３種類以上の属性値を用いて平均情報量を算出してもよい。 The method of selecting the attribute value used for calculating the average amount of information is not particularly limited. For example, the question sentence generation unit 170 may randomly select the attribute value. Further, the types of attribute values used for calculating the average information amount are not limited to two types. For example, the question sentence generation unit 170 may calculate the average information amount using one type or three or more types of attribute values. Good.

なお、ステップＳ１１４４における属性項目の決定方法として、上述の平均情報量を考慮する方法以外に、例えば、問いかけ文生成部１７０は、ランダムに属性項目を選んでもよい。また、あらかじめ属性項目に優先順位を設定して、問いかけ文生成部１７０は優先順位に従って優先度の高い属性項目を選んでもよい。さらに、問いかけ文生成部１７０は、属性値の種類に応じて、例えば、種類が最も多い属性項目を選んでもよい。 In addition to the above-mentioned method of considering the average amount of information, as the method of determining the attribute item in step S1144, for example, the question sentence generation unit 170 may randomly select the attribute item. Further, the priority may be set in advance for the attribute items, and the question sentence generation unit 170 may select the attribute items having high priority according to the priority. Further, the question sentence generation unit 170 may select, for example, the attribute item having the most types according to the type of the attribute value.

≪ステップＳ１１４５≫
ステップＳ１１４５では、上述の通り、ステップＳ１１４３（Ｓ１１４３−ＹＥＳ）またはステップＳ１１４４において決定された属性項目の属性値を２種類用いて問いかけ文を生成する。例えば、属性項目＜症状＞の属性値「ｘｘｘ」および「ｙｙｙ」を用いる場合、問いかけ文生成部１７０は、以下の問いかけ文を生成する。
「ｘｘｘですか？またはｙｙｙですか？それとも他の症状ですか？」
ここで、本実施形態では、問いかけ文生成部１７０は、属性値が３種類以上存在する場合は、スコアの降順で上位２種類の属性値を用いるが、属性値の選択方法については特に限定されない。例えば、問いかけ文生成部１７０は、属性値をランダムに２種類選択してもよい。 << Step S1145 >>
In step S1145, as described above, a question sentence is generated using two types of attribute values of the attribute items determined in step S1143 (S1143-YES) or step S1144. For example, when the attribute values "xxx" and "yyy" of the attribute item <symptom> are used, the question sentence generation unit 170 generates the following question sentence.
"Is it xxx or yyy? Or is it some other symptom?"
Here, in the present embodiment, when the question sentence generation unit 170 has three or more types of attribute values, the top two types of attribute values are used in descending order of the score, but the method of selecting the attribute values is not particularly limited. .. For example, the question sentence generation unit 170 may randomly select two types of attribute values.

なお、本実施形態では、問いかけ文に２種類の属性値を用いる例について説明したが、数は特に限定されない。例えば、問いかけ文生成部１７０は、該当する属性項目の全ての属性値を用いて問いかけ文を生成してもよい。 In this embodiment, an example in which two types of attribute values are used in the question sentence has been described, but the number is not particularly limited. For example, the question sentence generation unit 170 may generate a question sentence using all the attribute values of the corresponding attribute items.

＜本実施形態の有利な効果＞
本実施形態によれば、ユーザの質問が曖昧なため問い返しが必要な場合でも、得られる情報量を考慮して問いかけを行うことで、質問の特定に要する対話回数を最小限に留めることができる。 <Advantageous effect of this embodiment>
According to the present embodiment, even when the user's question is ambiguous and it is necessary to answer the question, the number of dialogues required to identify the question can be minimized by asking the question in consideration of the amount of information obtained. ..

＜変形例＞
取得部１１０が取得するデータはテキストに限らない。例えば、取得部１１０はユーザの発話の音声等を取得してもよい。また、取得部１１０は、通信等によりデータ取得してもよい。 <Modification example>
The data acquired by the acquisition unit 110 is not limited to text. For example, the acquisition unit 110 may acquire the voice of the user's utterance or the like. Further, the acquisition unit 110 may acquire data by communication or the like.

また、本実施形態では、音声対話装置システムが、２つの分類部（意図理解部）を用いる例について説明したが、それぞれの構成は特に限定されない。第一の分類部については、複数の質問とそれらの特徴等を保持するテーブル等を用いる構成であればよく、第二の分類部については、複数の質問のそれぞれとユーザの質問とのスコア（一致度）が求まる構成であればよい。例えば、音声対話装置１００は、スコアの算出に、ルールベース型分類部１４０で用いるものとは異なるルールベース型の分類器を用いてスコアを求めてもよい。 Further, in the present embodiment, an example in which the voice dialogue device system uses two classification units (intention understanding units) has been described, but the configuration of each is not particularly limited. The first classification section may be configured to use a plurality of questions and a table or the like that holds their characteristics, etc., and the second classification section may have scores for each of the plurality of questions and the user's question ( Any configuration can be used as long as the degree of coincidence) can be obtained. For example, the voice dialogue device 100 may obtain the score by using a rule-based classifier different from that used by the rule-based classification unit 140 in calculating the score.

また、音声対話装置１００は、回答生成部１８０で用いる第２記憶手段（不図示）を有していなくてもよい。例えば、回答は記憶部１２０に設けられていてもよい。さらに、音声対話装置１００は、回答生成部１８０を有していなくてもよく、この場合、質問特定部１６０が質問の特定および回答の生成を行ってもよい。 Further, the voice dialogue device 100 does not have to have a second storage means (not shown) used in the answer generation unit 180. For example, the answer may be provided in the storage unit 120. Further, the voice dialogue device 100 does not have to have the answer generation unit 180, and in this case, the question identification unit 160 may specify the question and generate the answer.

また、本実施形態では、ステップＳ１１４において、問いかけ文生成部１７０は、いずれか１つの属性項目（の属性値）を用いてユーザへの問いかけ文を生成しているが、問いかけ文に用いる属性項目は１つに限らない。例えば、問いかけ文生成部は＜症状＞について問いかけるとともに、＜質問形式＞について同時に問いかけてもよい。このように複数の属性項目について問いかけ文を生成することで、問いかけに要する対話回数をさらに抑えることができる。 Further, in the present embodiment, in step S114, the question sentence generation unit 170 generates a question sentence to the user by using (the attribute value of) any one of the attribute items, but the attribute item used for the question sentence. Is not limited to one. For example, the question sentence generation unit may ask about <symptom> and at the same time about <question format>. By generating question sentences for a plurality of attribute items in this way, the number of dialogues required for the question can be further reduced.

さらに、本実施形態では、ステップＳ１１４において、問いかけ文生成部１７０は属性項目群Ｂに含まれるいずれかの属性値を用いて問いかけ文を生成する例について説明したが、問いかけ文の生成方法はこれに限らない。ユーザの質問を特定できるような問いかけ文であればよい。例えば、上述の例において、問いかけ文生成部１７０は、ユーザの質問から自明な内容（属性項目群Ａ）である「エンジン」を用いて「エンジンのどのようなトラブルでしょうか？」といった問いかけ文を生成してもよい。さらに、記憶部１２０に設けられた単語等を含まない問いかけ文を生成してもよく、例えば、上述の例において質問中のキーワード「エンジン」以外を用いて、「なんかおかしい、とは具体的にどのようなトラブルでしょうか」といった問いかけ文を生成してもよい。 Further, in the present embodiment, in step S114, the question sentence generation unit 170 has described an example of generating a question sentence using any of the attribute values included in the attribute item group B, but the method of generating the question sentence is this. Not limited to. Any question text that can identify the user's question will do. For example, in the above example, the question sentence generation unit 170 uses the "engine" which is a trivial content (attribute item group A) from the user's question to ask a question sentence such as "what kind of trouble is the engine?" It may be generated. Further, a question sentence that does not include a word or the like provided in the storage unit 120 may be generated. For example, in the above example, using a keyword other than the keyword "engine" in the question, "something is wrong" specifically. You may generate a question sentence such as "What kind of trouble is it?"

なお、スマートフォン２００への入力は音声に限らず、例えば、キーボードで入力されたテキスト等でもよい。また、スマートフォン２００からユーザへの出力はテキスト等であってもよい。さらに、本実施形態に係るスマートフォン２００は、音声をテキストに変換する役割を果たしていればよく、例えば、ＰＣ等に置き換えることもできる。また、質問応答システムは、スマートフォン２００を用いなくてもよい。例えば、音声対話装置１００に、マイク、スピーカーおよびディスプレイ等を設けて、ユーザの質問を直接取得してもよく、ユーザに直接問いかけおよび回答を行ってもよい。 The input to the smartphone 200 is not limited to voice, and may be, for example, text input by the keyboard. Further, the output from the smartphone 200 to the user may be text or the like. Further, the smartphone 200 according to the present embodiment may play a role of converting voice into text, and may be replaced with, for example, a PC or the like. Further, the question answering system does not have to use the smartphone 200. For example, the voice dialogue device 100 may be provided with a microphone, a speaker, a display, or the like to directly acquire the user's question, or may directly ask and answer the user's question.

また、本実施形態では、正規化後の確信度を用いた例について説明したが、これに限らない。例えば、機械学習型分類部１５０によって求められたスコアそのものを用いて質問の特定等を行ってもよい。この場合、上記の所定の閾値等は、スコアに対して設ければよい。 Further, in the present embodiment, an example using the conviction after normalization has been described, but the present invention is not limited to this. For example, the question may be specified using the score itself obtained by the machine learning type classification unit 150. In this case, the above-mentioned predetermined threshold value and the like may be provided for the score.

また、本実施形態では、「車のトラブルに関する質問応答システム」を例に説明をしたが、質問の対象はこれに限定されない。 Further, in the present embodiment, the explanation has been given by taking the "question answering system for car trouble" as an example, but the subject of the question is not limited to this.

＜その他＞
上記の実施形態および変形例の構成は、本発明の技術的思想を逸脱しない範囲内で、適宜組み合わせて利用することができる。また、本発明は、その技術的思想を逸脱しない範囲で適宜変更を加えて実現しても構わない。 <Others>
The configurations of the above-described embodiments and modifications can be appropriately combined and used within a range that does not deviate from the technical idea of the present invention. Further, the present invention may be realized by appropriately modifying it without departing from the technical idea.

１００：音声対話装置
１１０：取得部
１２０，２２０，３２０：記憶部
１３０：抽出部
１４０：ルールベース型分類部
１５０：機械学習型分類部
１６０：質問特定部
１７０：問いかけ文生成部
１８０：回答生成部
２００：スマートフォン 100: Voice dialogue device 110: Acquisition unit 120, 220, 320: Storage unit 130: Extraction unit 140: Rule-based classification unit 150: Machine learning type classification unit 160: Question identification unit 170: Question generation unit 180: Answer generation Department 200: Smartphone

Claims

How to get the text of the user's question,
A storage means for associating and holding a plurality of predetermined questions, one or more attribute items representing the attributes of the plurality of questions, and an attribute value representing the contents of the attribute items for each of the plurality of questions.
An extraction means for extracting the attribute item or the attribute value or a keyword related to these synonyms from the text.
Using the keyword and the storage means, a first intent understanding means for finding a candidate for a question that matches the text among the plurality of questions.
A second means of understanding the intention to obtain a score indicating the degree of agreement between each of the plurality of questions and the text for each of the plurality of questions.
A question identifying means for identifying a question from the processing result of the first intention understanding means or the second intention understanding means,
With
The question identification means
If the question is uniquely determined from the processing result of the second intention understanding means, the user's question is specified from the processing result of the second intention understanding means.
If the question is not uniquely determined from the processing result of the second intention understanding means, the user's question is specified from the processing results of the first intention understanding means and the second intention understanding means.
Dialogue system.

The question identifying means determines that a question is uniquely determined when there is one question in which the normalized value of the score is equal to or greater than a predetermined threshold value.
The dialogue system according to claim 1.

In the question specifying means, the question using the second intention understanding means is specified.
A question in which the normalized value of the score is equal to or higher than a predetermined threshold for all of the plurality of questions is asked by identifying it as a user's question.
In the question identifying means, the identification of the question using the first intention understanding means and the second intention understanding means is performed.
A question for which the value obtained by normalizing the score is equal to or greater than a predetermined threshold value is specified as a user's question for the question candidates obtained by the first intention understanding means.
The dialogue system according to claim 1 or 2.

A question sentence generation means for generating a question sentence to a user for the attribute item for which a keyword has not been extracted by the extraction means when the question is not uniquely determined by the question identification means.
The dialogue system according to any one of claims 1 to 3, further comprising.

The question sentence generation means generates a question sentence by using the attribute value of the attribute item in which a plurality of types of the attribute values exist among the attribute items for which keywords have not been extracted by the extraction means.
The dialogue system according to claim 4.

The question sentence generating means targets only the questions whose normalized value of the score is equal to or greater than a predetermined value among the plurality of questions, and the attribute value of the attribute item having the maximum average amount of information (entropy). Generate a question sentence using
The dialogue system according to claim 5.

The question sentence generation means uses the attribute value as a question sentence for a plurality of attribute items among the attribute items.
The dialogue system according to claim 6.

The processing of the question sentence generating means, the first intention understanding means, the second intention understanding means, and the question specifying means is repeated until the question is uniquely determined by the question specifying means.
The dialogue system according to any one of claims 4 to 7.

The second intention understanding means obtains a score using a classifier generated by machine learning.
The dialogue system according to any one of claims 1 to 8.

Executed by a computer having a storage means for associating a plurality of predetermined questions, one or more attribute items representing the attributes of the plurality of questions, and an attribute value representing the contents of the attribute items for each of the plurality of questions. It ’s a dialogue method,
Get step to get the text of the user's question, and
An extraction step of extracting a keyword related to the attribute item or the attribute value or a synonym thereof from the text.
Using the keyword and the storage means, a first intent understanding step of finding a candidate for a question that matches the text among the plurality of questions.
A second intent understanding step of obtaining a score representing the degree of agreement between each of the plurality of questions and the text for each of the plurality of questions.
A question identification step that identifies a question from the processing result of the first intention understanding step or the second intention understanding step,
Including
In the question identification step
If the question is uniquely determined from the processing result of the second intention understanding step, the user's question is specified from the processing result of the second intention understanding step.
If the question is not uniquely determined from the processing result of the second intention understanding step, the user's question is specified from the processing results of the first intention understanding step and the second intention understanding step.
How to interact.

A program for causing a computer to perform each step of the method according to claim 10.