JP6730090B2

JP6730090B2 - Dialog processing device

Info

Publication number: JP6730090B2
Application number: JP2016100653A
Authority: JP
Inventors: 健太郎乾; 直也井之上; 祥多佐藤; 生聖渡部; 佐和樋口
Original assignee: Tohoku University NUC; Toyota Motor Corp
Current assignee: Tohoku University NUC; Toyota Motor Corp
Priority date: 2016-05-19
Filing date: 2016-05-19
Publication date: 2020-07-29
Anticipated expiration: 2036-05-19
Also published as: JP2017207663A

Description

本発明は対話処理装置に関し、特に連想知識を用いる対話処理装置に関する。 The present invention relates to a dialog processing device, and more particularly to a dialog processing device using associative knowledge.

近年、ユーザとの会話を実現する対話処理装置の開発が求められている。例えば、特許文献１では、ユーザの発話を認識し、認識された発話と応答知識とに基づいて、ユーザの発話に対する応答用の発話を生成し、生成した応答用の発話を音声合成して出力する対話処理装置について開示している。 In recent years, there has been a demand for the development of a dialogue processing device that realizes a conversation with a user. For example, in Patent Document 1, a user's utterance is recognized, a response utterance for the user's utterance is generated based on the recognized utterance and response knowledge, and the generated response utterance is speech-synthesized and output. There is disclosed a dialogue processing device.

特開２００８−１８６０２７号公報JP, 2008-186027, A

しかしながら、一般的に、対話処理装置の応答文は、発話文に対して短絡的となりがちである。このため、話題が広がり難く、ユーザとの間で会話を継続させるという点において改善の余地がある。 However, in general, the response sentence of the dialogue processing device tends to be short-circuited with respect to the utterance sentence. Therefore, there is room for improvement in that it is difficult for the topic to spread and the conversation with the user is continued.

上記目的を達成するための本発明の一態様は、事象と、該事象に対する連想知識と、該事象と該連想知識との連想関係の種別とを対応付けて記憶する第１のデータベースと、前記連想関係の種別毎に、少なくとも１つの応答の種別及び該応答の種別に対応したテンプレートを前記連想関係の種別に対応付けて記憶する第２のデータベースと、入力された発話文から事象を抽出する事象抽出手段と、前記第１のデータベースを参照して、前記事象抽出手段により抽出された事象に対応する前記連想知識及び前記連想関係の種別を特定する知識照合手段と、前記知識照合手段により特定された前記連想関係の種別に対応する前記応答の種別のうち、いずれか１つを選択する応答分類決定手段と、前記知識照合手段により特定された前記連想知識を、前記応答分類決定手段により選択された前記応答の種別に対応した前記テンプレートに適用することにより、前記発話文に対する応答文を生成する応答文生成手段とを有する対話処理装置である。
この対話処理装置では、抽出された事象に対応する連想知識を用いて応答文が生成される。また、その際、連想関係の種別に応じた応答種別のうち選択されたものに対応するテンプレートが用いられて応答文が生成される。このため、この対話処理装置によれば、多様な応答を行うことが可能となり、話題が広がりやすくなる。したがって、会話を継続しやすくなる。 One aspect of the present invention for achieving the above object is a first database that stores an event, associative knowledge for the event, and a type of an associative relationship between the event and the associative knowledge in association with each other, For each associative relation type, a second database that stores at least one response type and a template corresponding to the response type in association with the associative relation type and an event from the input utterance sentence An event extraction unit, a knowledge collation unit that refers to the first database and specifies the type of the associative knowledge and the association relation corresponding to the event extracted by the event extraction unit, and the knowledge collation unit. The response classification determining unit that selects any one of the types of the responses corresponding to the identified type of the associative relationship, and the association classification determining unit that determines the associative knowledge specified by the knowledge collating unit It is an interaction processing device comprising: a response sentence generation unit that generates a response sentence for the utterance sentence by applying the template corresponding to the selected type of the response.
In this dialogue processing device, a response sentence is generated using the associative knowledge corresponding to the extracted event. At that time, the response sentence is generated by using the template corresponding to the selected one of the response types corresponding to the type of associative relationship. Therefore, according to this dialogue processing device, it becomes possible to make various responses, and the topic easily spreads. Therefore, it becomes easy to continue the conversation.

本発明によれば、本発明の構成を用いない場合に比較して、話題が広がりやすく、ユーザとの間で会話を継続しやすい対話処理装置を提供することができる。 According to the present invention, it is possible to provide a dialogue processing device in which a topic is more likely to spread and a conversation with a user is easier to continue, as compared with a case where the configuration of the present invention is not used.

実施の形態１にかかる対話処理装置の構成の一例を示すブロック図である。FIG. 1 is a block diagram showing an example of a configuration of a dialogue processing device according to a first exemplary embodiment. 実施の形態１にかかる連想知識データベースに記憶される対応関係の一例を示す表である。5 is a table showing an example of correspondence relationships stored in the associative knowledge database according to the first exemplary embodiment. 実施の形態１にかかる応答知識データベースに記憶される対応関係の一例を示す表である。5 is a table showing an example of correspondence relationships stored in the response knowledge database according to the first exemplary embodiment. 実施の形態１にかかる対話処理装置の動作の一例を示すフローチャートである。5 is a flowchart showing an example of an operation of the dialogue processing device according to the first exemplary embodiment. 実施の形態２にかかる応答知識データベースに記憶される対応関係の一例を示す表である。9 is a table showing an example of correspondences stored in a response knowledge database according to the second exemplary embodiment.

＜実施の形態１＞
以下、図面を参照して本発明の実施の形態について説明する。図１は、実施の形態１にかかる対話処理装置１の構成の一例を示すブロック図である。図１に示されるように、対話処理装置１は、連想知識データベース１０と、応答知識データベース２０と、事象抽出部３０と、知識照合部４０と、応答分類決定部５０と、応答テンプレート結合部６０とを有する。連想知識データベース１０及び応答知識データベース２０は、それぞれ、例えば図示しない記憶装置により実現される。また、事象抽出部３０、知識照合部４０、応答分類決定部５０、及び、応答テンプレート結合部６０は、それぞれ、例えば、対話処理装置１が有する図示しないＣＰＵが、プログラムを実行することにより実現される。 <Embodiment 1>
Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing an example of the configuration of the dialogue processing apparatus 1 according to the first exemplary embodiment. As shown in FIG. 1, the dialogue processing device 1 includes an associative knowledge database 10, a response knowledge database 20, an event extraction unit 30, a knowledge collation unit 40, a response classification determination unit 50, and a response template combination unit 60. Have and. The associative knowledge database 10 and the response knowledge database 20 are each realized by, for example, a storage device (not shown). Further, the event extraction unit 30, the knowledge collation unit 40, the response classification determination unit 50, and the response template combination unit 60 are respectively realized by the CPU (not shown) included in the dialogue processing device 1 executing a program. It

連想知識データベース１０は、事象と、この事象に対する連想知識と、この事象とこの連想知識との連想関係（因果関係）の種別とを対応付けて記憶するデータベースである。なお、「事象」には、例えば、事物の状態若しくは意味、又は人若しくは動物の行為などが含まれる。図２は、連想知識データベース１０に記憶される対応関係の一例を示す表である。図２に示すように、連想知識データベース１０は、予め定められた複数の事象毎に、当該事象から連想される知識である連想知識と、連想関係の種別である知識分類とを対応付けて記憶している。例えば、事象「ゴルフを練習する」に対し、連想知識「ゴルフが上達する」が対応付けられている。また、ゴルフの練習の目的がゴルフの上達であることから、知識分類として「目的」が対応付けられている。 The associative knowledge database 10 is a database that stores an event, an associative knowledge about the event, and a type of an associative relationship (causal relationship) between the event and the associative knowledge in association with each other. Note that the “event” includes, for example, the state or meaning of a thing or the act of a person or animal. FIG. 2 is a table showing an example of correspondence relationships stored in the associative knowledge database 10. As shown in FIG. 2, the associative knowledge database 10 stores, for each of a plurality of predetermined events, the associative knowledge that is the knowledge associated with the event and the knowledge classification that is the type of the associative relationship in association with each other. doing. For example, associative knowledge “golf improves” is associated with the phenomenon “practice golf”. Further, since the purpose of golf practice is to improve golf, “purpose” is associated with the knowledge classification.

図２に示した例では、連想関係として「目的」以外に、「利用」、「共起」、「効果」、「条件」、「習慣」、「用途」が挙げられているが、連想関係はこれらに限られず、他の連想関係が定義されてもよい。なお、本実施の形態では、例えば、事象の内容が、連想知識の内容を発生させる目的で行うものである場合、知識分類として「目的」が用いられる。また、例えば、事象の内容が行われている際に、連想知識の内容の物が利用される場合、知識分類として「利用」が用いられる。また、例えば、事象の内容が行われている際に、連想知識の内容が行われやすい場合、知識分類として「共起」が用いられる。また、例えば、事象の内容の結果、連想知識の内容が効果として現れる場合、知識分類として「効果」が用いられる。また、例えば、連想知識の内容が、事象の内容を発生させるために満たすべき条件である場合、知識分類として「条件」が用いられる。また、例えば、事象の内容が発生する場合、連想知識の内容が習慣的に発生するならば、知識分類として「習慣」が用いられる。また、例えば、事象の内容の用途として、連想知識の内容が発生する場合、知識分類として「用途」が用いられる。 In the example shown in FIG. 2, in addition to "purpose", "use", "co-occurrence", "effect", "condition", "habit", and "use" are listed as an associative relationship. Are not limited to these, and other associative relations may be defined. In the present embodiment, for example, when the content of the event is to generate the content of the associative knowledge, “purpose” is used as the knowledge classification. Further, for example, when the content of the associative knowledge is used while the content of the event is being performed, “use” is used as the knowledge classification. Further, for example, when the content of the associative knowledge is likely to be performed while the content of the event is being performed, “co-occurrence” is used as the knowledge classification. Further, for example, when the content of the associative knowledge appears as an effect as a result of the content of the event, “effect” is used as the knowledge classification. In addition, for example, when the content of the associative knowledge is a condition to be satisfied in order to generate the content of the event, “condition” is used as the knowledge classification. Further, for example, if the content of the event occurs, or if the content of the associative knowledge occurs habitually, “habit” is used as the knowledge classification. Further, for example, when the content of the associative knowledge is generated as the usage of the content of the event, the “usage” is used as the knowledge classification.

応答知識データベース２０は、連想関係の種別毎に、少なくとも１つの応答の種別及びこの応答の種別に対応したテンプレートを、連想関係の種別に対応付けて記憶するデータベースである。図３は、実施の形態１にかかる応答知識データベース２０に記憶される対応関係の一例を示す表である。図３に示すように、応答知識データベース２０は、知識分類毎に、応答の種別である応答分類と、応答に用いられるテンプレートとを対応付けて記憶している。各知識分類において、応答分類のそれぞれに１つのテンプレートが対応付けられている。ここで、応答知識データベース２０では、各知識分類に対し、１以上の応答分類が対応付けられている。つまり、応答知識データベース２０では、各知識分類に対し、応答分類に応じた１以上のテンプレートが対応付けられている。例えば、図３に示した例では、知識分類「習慣」に対し、４つの応答分類が対応付けられている。また、この４つの応答分類のそれぞれに、応答分類に応じたテンプレートが対応付けられている。 The response knowledge database 20 is a database that stores at least one response type and a template corresponding to this response type in association with each association type for each association type. FIG. 3 is a table showing an example of correspondences stored in the response knowledge database 20 according to the first embodiment. As shown in FIG. 3, the response knowledge database 20 stores, for each knowledge category, the response category, which is the type of response, and the template used for the response in association with each other. In each knowledge classification, one template is associated with each response classification. Here, in the response knowledge database 20, one or more response classifications are associated with each knowledge classification. That is, in the response knowledge database 20, each knowledge classification is associated with one or more templates corresponding to the response classification. For example, in the example shown in FIG. 3, four response classifications are associated with the knowledge classification “habit”. A template corresponding to the response classification is associated with each of the four response classifications.

図３に示した例では、応答の種別として「確認」、「感想」、「提案」、「習慣」が挙げられているが、応答分類はこれらに限られず、他の応答分類が定義されてもよい。なお、本実施の形態では、例えば、応答の種別「確認」に対して、連想知識の内容について対話相手に確認するための応答文のテンプレートが対応付けられている。また、例えば、応答の種別「感想」に対して、連想知識の内容で示される感想を対話相手に伝えるための応答文のテンプレートが対応付けられている。また、例えば、応答の種別「提案」に対して、連想知識の内容を対話相手に提案するための応答文のテンプレートが対応付けられている。また、例えば、応答の種別「習慣」に対して、連想知識の内容で示される習慣が対話相手にあるのかを聞くための応答文のテンプレートが対応付けられている。 In the example shown in FIG. 3, “confirmation”, “impression”, “suggestion”, and “habit” are listed as the types of responses, but the response classification is not limited to these, and other response classifications are defined. Good. In the present embodiment, for example, the response type “confirm” is associated with a response sentence template for confirming the content of the associative knowledge with the conversation partner. Further, for example, the response type “impression” is associated with a response sentence template for transmitting the impression indicated by the content of the associative knowledge to the conversation partner. Further, for example, a response sentence template for proposing the content of the associative knowledge to the conversation partner is associated with the response type “proposal”. Further, for example, the response type “habit” is associated with the template of the response sentence for asking whether the conversation partner has the habit indicated by the content of the associative knowledge.

テンプレートは、言葉を埋め込むためのスロットを有している。なお、図３のテンプレートにおいて、＜…＞で示される部分がスロットを示している。テンプレートのこのスロットに、連想知識に基づく言葉が埋め込まれることで応答文が完成することとなる。なお、スロットに挿入される言葉は、図３に示されるように、例えば、名詞、サ変名詞、述語、全文、述語項などのパターンのうちのいずれかであり、各スロットには、挿入すべきパターンが定義づけられている。このように、テンプレートのスロットには、例えば、連想知識データベース１０に示される連想知識の語句から抽出される名詞、連想知識データベース１０に示される連想知識の語句から抽出されるサ変名詞、連想知識データベース１０に示される連想知識の語句から抽出される述語、連想知識データベース１０に示される連想知識の語句の全文、連想知識データベース１０に示される連想知識の語句から抽出される述語項などが挿入される。 The template has slots for embedding words. In addition, in the template of FIG. 3, the portion indicated by <...> indicates a slot. The response sentence is completed by embedding words based on associative knowledge in this slot of the template. As shown in FIG. 3, the word inserted in the slot is, for example, one of the patterns of noun, sahen noun, predicate, full sentence, predicate term, etc., and should be inserted in each slot. The pattern is defined. Thus, for example, in the slot of the template, the noun extracted from the word of the associative knowledge shown in the associative knowledge database 10, the sahen noun extracted from the word of the associative knowledge shown in the associative knowledge database 10, the associative knowledge database The predicate extracted from the word of the associative knowledge shown in 10, the whole sentence of the word of the associative knowledge shown in the associative knowledge database 10, the predicate term extracted from the word of the associative knowledge shown in the associative knowledge database 10, etc. are inserted. ..

事象抽出部３０（事象抽出手段）は、入力された発話文から事象を抽出する。事象抽出部３０には、例えば、対話相手であるユーザの発話音声に音声認識等の予め定められた処理を施すことにより言語化された発話文が入力される。事象抽出部３０は、入力された発話文に対し、予め定められた構文解析処理を施し、単語又は述語項などにより特定される事象を抽出する。構文解析処理は、例えば、公知の形態素解析エンジン及び公知の係り受け解析エンジンを用いることで実行される。なお、事象抽出部３０は、入力された発話文からの事象の抽出のみならず、さらに、例えば、図示しないカメラなどから得られた画像情報から画像情報に示される事象を抽出してもよい。事象抽出部３０は、抽出した事象を、知識照合部４０に通知する。 The event extracting unit 30 (event extracting means) extracts an event from the input utterance sentence. The event extraction unit 30 receives a verbalized utterance sentence, for example, by performing a predetermined process such as voice recognition on a voice uttered by a user who is a dialogue partner. The event extraction unit 30 performs a predetermined syntax analysis process on the input utterance sentence and extracts an event specified by a word or a predicate term. The syntactic analysis process is executed by using, for example, a known morphological analysis engine and a known dependency analysis engine. The event extraction unit 30 may not only extract the event from the input utterance sentence, but also extract the event indicated by the image information from the image information obtained from a camera (not shown). The event extracting unit 30 notifies the knowledge collating unit 40 of the extracted event.

知識照合部４０（知識照合手段）は、連想知識データベース１０を参照して、事象抽出部３０により抽出された事象に対応する連想知識及び知識分類（連想関係の種別）を特定する。なお、事象抽出部３０により抽出された事象が複数存在する場合には、知識照合部４０は、予め定められた選択規則（例えばランダムな選択）に従って複数の事象の中から１つを選択した上で、連想知識及び知識分類を特定する。知識照合部４０は、特定した知識分類を応答分類決定部５０に通知し、特定した連想知識を応答テンプレート結合部６０に通知する。 The knowledge collating unit 40 (knowledge collating means) refers to the associative knowledge database 10 to identify the associative knowledge and the knowledge classification (type of associative relationship) corresponding to the event extracted by the event extracting unit 30. When there are a plurality of events extracted by the event extraction unit 30, the knowledge matching unit 40 selects one from the plurality of events according to a predetermined selection rule (for example, random selection). Then, the associative knowledge and the knowledge classification are specified. The knowledge collating unit 40 notifies the response classification determining unit 50 of the specified knowledge classification and notifies the response template combining unit 60 of the specified associative knowledge.

応答分類決定部５０（応答分類決定手段）は、知識照合部４０により特定された知識分類（連想関係の種別）に対応する応答分類（応答の種別）のうち、いずれか１つを選択する。応答分類決定部５０は、応答知識データベース２０を参照し、知識照合部４０により特定された知識分類に対応する応答分類を抽出する。抽出された応答分類が１つである場合、応答分類決定部５０は、この抽出された応答分類を選択する。抽出された応答分類が複数である場合、応答分類決定部５０は、予め定められた選択規則に従って、この抽出された複数の応答分類の中から１つの応答分類を選択する。例えば、応答分類決定部５０は、抽出された複数の応答分類の中からランダムに１つの応答分類を選択する。応答分類決定部５０は、選択した応答分類に応答知識データベース２０において対応付けられているテンプレートを、応答テンプレート結合部６０に通知する。 The response classification determination unit 50 (response classification determination means) selects any one of the response classifications (response types) corresponding to the knowledge classification (association type) identified by the knowledge matching unit 40. The response classification determining unit 50 refers to the response knowledge database 20 and extracts a response classification corresponding to the knowledge classification specified by the knowledge matching unit 40. When the number of extracted response classifications is one, the response classification determination part 50 selects this extracted response classification. When there are a plurality of extracted response classifications, the response classification determination unit 50 selects one response classification from the plurality of extracted response classifications according to a predetermined selection rule. For example, the response classification determination unit 50 randomly selects one response classification from the plurality of extracted response classifications. The response classification determination unit 50 notifies the response template combination unit 60 of the template associated with the selected response classification in the response knowledge database 20.

応答テンプレート結合部６０（応答文生成手段）は、知識照合部４０により特定された連想知識を、応答分類決定部５０により選択された応答の種別に対応したテンプレートに適用することにより、発話文に対する応答文を生成する。より具体的には、応答テンプレート結合部６０は、知識照合部４０により特定された知識分類に対応付けられたテンプレートのうち、応答分類決定部５０により選択された応答の種別に対応したテンプレートに、知識照合部４０により特定された連想知識を適用することにより、発話文に対する応答文を生成する。応答テンプレート結合部６０は、応答分類決定部５０から通知されたテンプレートにおけるスロットに、知識照合部４０により特定された連想知識を当てはめることにより、応答文を生成する。応答テンプレート結合部６０は、テンプレートにおけるスロットに定義づけられたパターンに沿うよう、知識照合部４０により特定された連想知識から言葉を抽出する。なお、この抽出は、例えば、連想知識の構文構造を解析することで可能である。また、応答テンプレート結合部６０は、パターンが、述語、全文、述語項などの場合には、活用語変換（例えば、「行く」から「行き」への変換）を行い、テンプレートのスロットに言葉を挿入する。応答テンプレート結合部６０は、このようにして、事象抽出部３０に入力された発話文に対する応答文を生成する。 The response template combination unit 60 (response sentence generation means) applies the associative knowledge specified by the knowledge collation unit 40 to the template corresponding to the type of the response selected by the response classification determination unit 50, thereby responding to the utterance sentence. Generate a response sentence. More specifically, the response template combination unit 60, among the templates associated with the knowledge classification specified by the knowledge matching unit 40, the template corresponding to the type of the response selected by the response classification determination unit 50, By applying the associative knowledge specified by the knowledge matching unit 40, a response sentence to the utterance sentence is generated. The response template combination unit 60 generates a response sentence by applying the associative knowledge specified by the knowledge collation unit 40 to the slot in the template notified from the response classification determination unit 50. The response template combination unit 60 extracts words from the associative knowledge specified by the knowledge matching unit 40 so as to follow the pattern defined in the slots of the template. Note that this extraction can be performed by, for example, analyzing the syntactic structure of the associative knowledge. When the pattern is a predicate, a full sentence, a predicate term, or the like, the response template combination unit 60 performs inflectional word conversion (for example, conversion from “go” to “go”) and puts words in the template slot. insert. The response template combination unit 60 thus generates a response sentence to the utterance sentence input to the event extraction unit 30.

このように、対話処理装置１は、ユーザの発話文に関連する連想知識を、応答分類に応じた加工方法により加工して、応答文を生成する。
次に、対話処理装置１の動作の一例について説明する。図４は、対話処理装置１の動作の一例を示すフローチャートである。以下、図４に示すフローチャートに沿って、対話処理装置１の動作について説明する。 In this way, the dialogue processing apparatus 1 processes the associative knowledge related to the user's utterance sentence by the processing method according to the response classification to generate the response sentence.
Next, an example of the operation of the dialogue processing device 1 will be described. FIG. 4 is a flowchart showing an example of the operation of the dialogue processing device 1. The operation of the dialogue processing device 1 will be described below with reference to the flowchart shown in FIG.

ステップ１０（Ｓ１０）において、ユーザの発話（例えば、「歌を歌うことが好きなんだ」）が検出される。この検出は、例えば、図示しないマイクによりユーザの発話音声を取得することにより行われる。ステップ１１（Ｓ１１）において、ステップ１０において検出された発話音声に対する音声認識が、図示しない音声認識部により行われる。音声認識部は、例えば、公知の音声認識ソフトウェアにより実現される。ステップ１１による音声認識により、音声がテキストに変換され、ユーザの発話文が取得される。 In step 10 (S10), the user's utterance (for example, "I like to sing a song") is detected. This detection is performed, for example, by acquiring the voice uttered by the user with a microphone (not shown). In step 11 (S11), the voice recognition of the uttered voice detected in step 10 is performed by a voice recognition unit (not shown). The voice recognition unit is realized by known voice recognition software, for example. By the voice recognition in step 11, the voice is converted into text and the utterance sentence of the user is acquired.

事象抽出部３０は、ステップ１２（Ｓ１２）において、ステップ１１で取得された発話文に対し構文解析処理を行い、ステップ１３（Ｓ１３）において、発話文から事象を抽出する。例えば、事象抽出部３０は、構文解析処理を行うことで、「歌うことが好き」などの述語項、及び、「歌」などの品詞別単語を事象として抽出する。そして、ステップ１４（Ｓ１４）において、知識照合部４０は、連想知識データベース１０を参照して、事象抽出部３０により抽出された事象に対応する連想知識及び知識分類を特定する。ステップ１４では、例えば、事象「歌うことが好き」に対して、連想知識「カラオケに行く」及び知識分類「習慣」が特定される。 In step 12 (S12), the event extraction unit 30 performs a syntax analysis process on the utterance sentence acquired in step 11, and in step 13 (S13), extracts an event from the utterance sentence. For example, the event extraction unit 30 performs a syntactic analysis process to extract a predicate term such as “I like to sing” and a part-of-speech word such as “song” as an event. Then, in step 14 (S14), the knowledge collating unit 40 refers to the associative knowledge database 10 and specifies the associative knowledge and the knowledge classification corresponding to the event extracted by the event extracting unit 30. In step 14, for example, the associated knowledge “go to karaoke” and the knowledge classification “habit” are specified for the event “I like singing”.

ステップ１５（Ｓ１５）において、応答分類決定部５０は、応答知識データベース２０を参照して、ステップ１４で特定された知識分類に対応する応答分類を１つ選択する。ステップ１５では、例えば、知識分類「習慣」に対して、４つの応答分類のうち、「感想」が選択される。ステップ１６（Ｓ１６）において、応答テンプレート結合部６０は、ステップ１４で特定された連想知識を、ステップ１５で選択された応答分類に対応したテンプレートに適用することにより、発話文に対する応答文を生成する。例えば、ステップ１５において応答分類「感想」が選択された場合、応答テンプレート結合部６０は、知識分類「習慣」に対応付けられているテンプレートのうち、応答分類「感想」に対応したテンプレート「＜述語項＞たいね」を用いて、応答文を生成する。このとき、応答テンプレート結合部６０は、連想知識「カラオケに行く」の述語項である「カラオケに行く」を、テンプレートのスロットに挿入することで応答文を生成する。なお、応答テンプレート結合部６０は、挿入に際し、述語項の接続部を活用（変形）した上でスロットへの挿入を行う。これにより、応答文「カラオケに行きたいね」が生成されることとなる。 In step 15 (S15), the response classification determination unit 50 refers to the response knowledge database 20 and selects one response classification corresponding to the knowledge classification specified in step 14. In step 15, for example, “impression” is selected from the four response classifications for the knowledge classification “habit”. In step 16 (S16), the response template combining unit 60 applies the associative knowledge specified in step 14 to the template corresponding to the response classification selected in step 15 to generate a response sentence to the utterance sentence. .. For example, when the response classification “impression” is selected in step 15, the response template combination unit 60 selects the template “<predicate” corresponding to the response classification “impression” among the templates associated with the knowledge classification “habit”. A response sentence is generated by using "term>tai". At this time, the response template combination unit 60 generates a response sentence by inserting the predicate term “go to karaoke” of the associative knowledge “go to karaoke” into the slot of the template. It should be noted that the response template combination unit 60 uses (transforms) the connection part of the predicate term and then inserts it into the slot. As a result, the response sentence "I want to go to karaoke" is generated.

ステップ１７（Ｓ１７）において、ステップ１６で生成された応答文を音声として出力するための音声合成処理が行われ、図示しないスピーカ等により応答文が音声出力される。 In step 17 (S17), a voice synthesis process for outputting the response sentence generated in step 16 as voice is performed, and the response sentence is voice output from a speaker or the like not shown.

以上、対話処理装置１について説明した。対話処理装置１によれば、抽出された事象に対応する連想知識を用いて応答文が生成される。そして、その際、連想関係の種別に応じた応答種別のうち選択されたものに対応するテンプレートが用いられて応答文が生成される。このため、対話処理装置１によれば、ユーザの発話の語彙から飛躍した応答が可能となる。つまり、ユーザの発話に対し、多様な応答を行うことが可能となり、話題が広がりやすくなる。したがって、対話処理装置１によれば、会話を継続しやすくなる。また、上述の通り、連想知識データベース１０及び応答知識データベース２０によって、応答文生成に必要とする知識を２段構成で汎化している。このため、多様な応答を容易に生成することが可能となっている。また、データベースから選ばれた連想知識の知識分類に複数の応答分類が対応している場合には、選択する応答分類を変更することで種々の応答パターンを実現できるので、データベースに予め格納しておく文の数を抑制することができる。 The dialog processing device 1 has been described above. According to the dialogue processing device 1, the response sentence is generated using the associative knowledge corresponding to the extracted event. Then, at that time, the response sentence is generated by using the template corresponding to the selected one of the response types according to the type of the associative relationship. Therefore, according to the dialogue processing device 1, it is possible to make a leap response from the vocabulary of the user's utterance. That is, it becomes possible to make various responses to the user's utterance, and the topic easily spreads. Therefore, according to the dialogue processing device 1, it becomes easy to continue the conversation. Further, as described above, the knowledge required for response sentence generation is generalized by the two-stage configuration by the associative knowledge database 10 and the response knowledge database 20. Therefore, it is possible to easily generate various responses. Further, when a plurality of response classifications correspond to the knowledge classification of the associative knowledge selected from the database, various response patterns can be realized by changing the response classification to be selected. The number of sentences to be set can be suppressed.

＜実施の形態２＞
実施の形態２にかかる対話処理装置１では、応答分類決定部５０が知識照合部４０により特定された知識分類に対応する応答分類のうちいずれか１つを選択する際、応答分類決定部５０が発生頻度及び履歴頻度に基づいて選択を行う点で実施の形態１にかかる対話処理装置１と異なる。以下、実施の形態２にかかる対話処理装置１について、実施の形態１にかかる対話処理装置１と異なる構成及び動作について説明し、重複する説明については省略する。 <Second Embodiment>
In the dialogue processing device 1 according to the second exemplary embodiment, when the response classification determining unit 50 selects any one of the response classifications corresponding to the knowledge classification specified by the knowledge matching unit 40, the response classification determining unit 50 It differs from the interactive processing apparatus 1 according to the first exemplary embodiment in that selection is performed based on the occurrence frequency and the history frequency. Hereinafter, the configuration and operation of the interactive processing device 1 according to the second embodiment different from those of the interactive processing device 1 according to the first embodiment will be described, and redundant description will be omitted.

図５は、実施の形態２にかかる応答知識データベース２０に記憶される対応関係の一例を示す表である。図５に示されるように、実施の形態２にかかる応答知識データベース２０は、各知識分類における応答分類それぞれに対し、さらに、発生頻度及び履歴頻度を対応付けて記憶している。ここで、発生頻度とは、対話処理装置１以外の所定の環境下における、所定の応答分類に属する表現の発生頻度をいう。この発生頻度は、応答分類ごとの応答の一般的な利用頻度に相当する。発生頻度は、例えば、ある応答分類に分類される表現が、ウェブ上にどれだけ存在するかを検索することにより得られる。また、履歴頻度とは、所定の応答分類に属する表現についての、対話処理装置１がユーザと対話した際の実際の使用頻度である。 FIG. 5 is a table showing an example of correspondences stored in the response knowledge database 20 according to the second embodiment. As shown in FIG. 5, the response knowledge database 20 according to the second embodiment further stores the occurrence frequency and the history frequency in association with each response classification in each knowledge classification. Here, the frequency of occurrence means the frequency of occurrence of expressions belonging to a predetermined response classification under a predetermined environment other than the dialogue processing device 1. This frequency of occurrence corresponds to the general frequency of use of responses for each response classification. The frequency of occurrence can be obtained, for example, by searching how many expressions classified into a certain response classification exist on the web. Further, the history frequency is an actual usage frequency when the dialogue processing device 1 interacts with a user for an expression belonging to a predetermined response classification.

本実施の形態にかかる応答分類決定部５０は、応答知識データベース２０に記憶された発生頻度及び履歴頻度に基づいて、知識照合部４０により特定された知識分類に対応する応答分類のうち、いずれか１つを選択する。ここで、本実施の形態にかかる応答分類決定部５０は、発生頻度が大きいほどより選択されやすく、かつ、履歴頻度が小さいほどより選択されやすくなるよう予め定められた選択ルールに従って、応答分類を選択する。本実施の形態にかかる応答分類決定部５０は、知識照合部４０により特定された知識分類に対応する応答分類のうち、例えば以下の算出式（式（１））による値ｙ_ｉが最大となるｉを選択する。なお、ｉは、知識照合部４０により特定された知識分類に対応する各応答分類を識別するためのインデックスである。 The response classification determination unit 50 according to the present embodiment is any one of the response classifications corresponding to the knowledge classification specified by the knowledge matching unit 40 based on the occurrence frequency and the history frequency stored in the response knowledge database 20. Select one. Here, the response classification determining unit 50 according to the present embodiment performs the response classification according to a predetermined selection rule such that the higher the occurrence frequency, the easier the selection, and the smaller the history frequency, the easier the selection. select. The response classification determination unit 50 according to the present embodiment has a maximum value y _{i of} the response classification corresponding to the knowledge classification specified by the knowledge matching unit 40, for example, according to the following calculation formula (Formula (1)). Select i. In addition, i is an index for identifying each response classification corresponding to the knowledge classification specified by the knowledge matching unit 40.

ここで、ａ_１及びａ_２は、重み係数であり、ａ_１＋ａ_２＝１である。また、ｘ_１ｉは、発生頻度についての評価値であり、以下の式（２）により定義される。なお、式（２）において、ｒ_ｉは、インデックスがｉである応答分類の発生頻度である。ただし、式（２）において、分母の値が０となる場合には、ｘ_１ｉの算出が不能とならぬよう、分母を例えば０．１とするものとする。 Here, a ₁ and a ₂ are weighting factors, and a ₁ +a ₂ =1. Further, x _1i is an evaluation value for the occurrence frequency and is defined by the following formula (2). In the formula (2), r _i is the frequency of occurrence of the response classification having the index i. However, in Expression (2), when the value of the denominator is 0, the denominator is set to 0.1, for example, so that the calculation of x _1i is not impossible.

さらに、ｘ_２ｉは、履歴頻度についての評価値であり、以下の式（３）により定義される。なお、式（３）において、ｓ_ｉは、インデックスがｉである応答分類の履歴頻度である。ただし、式（３）において、分母の値が０となる場合には、ｘ_２ｉの算出が不能とならぬよう、分母を例えば０．１とするものとする。 Further, x _2i is an evaluation value for the history frequency, and is defined by the following expression (3). In Expression (3), s _i is the history frequency of the response classification whose index is i. However, in Expression (3), when the value of the denominator is 0, the denominator is set to 0.1, for example, so that the calculation of x _2i is not impossible.

例えば、上述のステップ１４において、知識照合部４０によって知識分類「習慣」が特定された場合、本実施の形態にかかる応答分類決定部５０は、次のように応答分類を決定する。なお、ここでは、ａ_１＝０．１であり、ａ_２＝０．９であるものとする。図５に示されるように、知識分類「習慣」には、応答分類「確認」、「習慣」、「感想」、「提案」が対応していることがわかる。ここで、応答分類「確認」の発生頻度は３であり、履歴頻度は０である。また、応答分類「習慣」の発生頻度は１０であり、履歴頻度は１である。また、応答分類「感想」の発生頻度は５であり、履歴頻度は０である。また、応答分類「提案」の発生頻度は１であり、履歴頻度は０である。 For example, when the knowledge collation unit 40 specifies the knowledge classification “habit” in step 14 described above, the response classification determination unit 50 according to the present embodiment determines the response classification as follows. Here, it is assumed that a ₁ =0.1 and a ₂ =0.9. As shown in FIG. 5, the knowledge classification “habit” corresponds to the response classifications “confirmation”, “habit”, “impression”, and “suggestion”. Here, the occurrence frequency of the response classification “confirmation” is 3 and the history frequency is 0. Further, the frequency of occurrence of the response classification “habit” is 10 and the history frequency is 1. Further, the frequency of occurrence of the response classification “impression” is 5 and the history frequency is 0. Further, the frequency of occurrence of the response classification “proposal” is 1 and the frequency of history is 0.

上記式（１）〜（３）によれば、各応答分類に対する評価値（ｙ_ｉの値）は、次のようになる。すなわち、応答分類「確認」についての評価値は０．９１５であり、応答分類「習慣」についての評価値は０．０５２であり、応答分類「感想」についての評価値は０．９２６であり、応答分類「提案」についての評価値は０．９０５である。本実施の形態にかかる応答分類決定部５０は、このように、各応答分類について評価値を算出した上で、評価値が最大である応答分類「感想」を選択する。このように、所定の環境下において応答分類「習慣」に分類される応答が比較的多く使用されている場合であっても、そのような応答が既にユーザとの対話で利用されている場合には他の応答種別が選択されうる。 According to the above equations (1) to (3), the evaluation value (value of y _i ) for each response classification is as follows. That is, the evaluation value for the response classification “confirmation” is 0.915, the evaluation value for the response classification “habit” is 0.052, the evaluation value for the response classification “impression” is 0.926, The evaluation value for the response classification “proposal” is 0.905. The response classification determination unit 50 according to the present embodiment thus calculates the evaluation value for each response classification, and then selects the response classification “impression” having the maximum evaluation value. In this way, even when the responses classified into the response classification “habit” are used relatively frequently in the predetermined environment, when such responses are already used in the dialog with the user, Other response types may be selected.

本実施の形態にかかる対話処理装置１によれば、発生頻度に基づいて応答分類が選択されるため、一般的な利用頻度に応じた応答を行うことが可能となる。また、履歴頻度に基づいて応答分類が選択されるため、変化に富んだ多様な応答を行うことが可能となる。なお、本実施の形態では、応答分類決定部５０が、発生頻度及び履歴頻度に基づいて応答分類を選択する構成について説明したが、応答分類決定部５０は、発生頻度又は履歴頻度のいずれか一方に基づいて応答分類を選択してもよい。 According to the interaction processing device 1 according to the present embodiment, the response classification is selected based on the occurrence frequency, and therefore it is possible to make the response according to the general usage frequency. Further, since the response classification is selected based on the history frequency, it is possible to make various responses rich in changes. In addition, in this Embodiment, although the response classification determination part 50 demonstrated the structure which selects a response classification based on the occurrence frequency and the history frequency, the response classification determination part 50 is either occurrence frequency or history frequency. The response classification may be selected based on

なお、本発明は上記実施の形態に限られたものではなく、趣旨を逸脱しない範囲で適宜変更することが可能である。例えば、ユーザの発話に対する意図解析処理結果を利用して応答文が生成されてもよい。 The present invention is not limited to the above-mentioned embodiment, and can be modified as appropriate without departing from the spirit of the present invention. For example, the response sentence may be generated by using the result of the intention analysis processing for the user's utterance.

１対話処理装置
１０連想知識データベース
２０応答知識データベース
３０事象抽出部
４０知識照合部
５０応答分類決定部
６０応答テンプレート結合部 1 Dialog Processing Device 10 Associative Knowledge Database 20 Response Knowledge Database 30 Event Extraction Section 40 Knowledge Matching Section 50 Response Classification Determining Section 60 Response Template Combining Section

Claims

A first database that stores an event, associative knowledge about the event, and a type of an associative relationship between the event and the associative knowledge in association with each other;
A second database that stores at least one response type and a template corresponding to the response type in association with each association type for each association type;
Event extraction means for extracting an event from the input utterance sentence,
Knowledge collating means for referring to the first database and specifying the type of the associative knowledge and the associative relationship corresponding to the event extracted by the event extracting means;
Response classification determining means for selecting any one of the types of responses corresponding to the type of associative relationship specified by the knowledge collating means,
Response sentence generating means for generating a response sentence to the utterance sentence by applying the associative knowledge specified by the knowledge collating means to the template corresponding to the type of the response selected by the response classification determining means. It has a door,
The response classification determining means makes a selection based on an occurrence frequency of an expression belonging to the type of the response under a predetermined environment.
Dialog processing device.

The response classification determining means is configured to generate an expression frequency that belongs to the type of the response under a predetermined environment, and an actual usage frequency of the expression that belongs to the type of the response when the dialogue processing device interacts with a user. And select based on
The dialogue processing device according to claim 1.

The response classification determining means makes a selection according to a predetermined selection rule such that the higher the occurrence frequency is, the easier it is to select, and the lower the usage frequency is, the easier it is to select.
The dialogue processing device according to claim 2.

The type of the associative relation is a type of the relation between the event and the associative knowledge, which includes at least one of purpose, use, co-occurrence, effect, condition, habit, or use.
The dialog processing device according to claim 1.