JP7224278B2

JP7224278B2 - Apparatus, program and method for estimating response sentence to user's utterance sentence

Info

Publication number: JP7224278B2
Application number: JP2019221898A
Authority: JP
Inventors: 博楊; 亜楠王; 剣明呉; 啓一郎帆足
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2019-12-09
Filing date: 2019-12-09
Publication date: 2023-02-17
Anticipated expiration: 2039-12-09
Also published as: JP2021092617A

Description

本発明は、ユーザと自然な対話を実現する対話装置の技術に関する。 TECHNICAL FIELD The present invention relates to technology of a dialogue device that realizes natural dialogue with a user.

近年、ユーザとの対話装置として、スマートフォン又はタブレットのみならず、「Google Home（登録商標）」「Amazon Echo（登録商標）」のようなスマートスピーカや、「SOTA（登録商標）」「ユニボー（登録商標）」のようなロボット（以下「端末」と称す）が用いられてきている。端末は、ユーザインタフェース機能し、ユーザの発話音声を、サーバとしての対話装置へ送信する。
対話装置は、その発話文に対して自然な対話となる応答文を推定し、その応答文を端末へ返信する。
そして、端末は、その応答文を音声又はテキストによって、ユーザへ返答する。
このような対話システムとしては、例えば「Siri（登録商標）」や「しゃべってコンシェル（登録商標）」がある。 In recent years, not only smartphones and tablets but also smart speakers such as "Google Home (registered trademark)" and "Amazon Echo (registered trademark)" have been used as interactive devices with users, as well as "SOTA (registered trademark)" and "Unibo (registered trademark)". (trademark)” (hereinafter referred to as “terminal”) have been used. The terminal functions as a user interface and transmits the user's uttered voice to the interactive device as a server.
The dialogue device estimates a response sentence that will be a natural dialogue for the utterance sentence, and returns the response sentence to the terminal.
Then, the terminal replies the response sentence to the user by voice or text.
Examples of such dialogue systems include "Siri (registered trademark)" and "Talking Concier (registered trademark)."

従来、発話文に対して因果関係を持つ応答文を生成し、自然な対話を実現する技術がある（例えば非特許文献１参照）。この技術によれば、因果関係を持つ単語ペア辞書を予め構築し、発話文と因果関係を持つ応答文を優先的に選択する（リランキング応答生成）ことができる。具体的には、ユーザの発話文の単語と応答文の単語とをペアとして、単語ペア辞書を照合する。照合一致（単語ペア辞書に当該単語ペアが存在）した際に、因果関係があると判定し、この応答文を優先的に選択する。
非特許文献１によれば、例えば以下の文章に対して、因果関係ペアを抽出して学習することができる。
「円安になったため、貿易の視点から見ると日本の景気が上昇することが期待できる」
：因果関係ペア｛（円安になる）->（景気が上昇）｝ Conventionally, there is a technique of generating a response sentence having a causal relationship with an uttered sentence and realizing a natural dialogue (for example, see Non-Patent Document 1). According to this technique, a word pair dictionary having a causal relationship can be constructed in advance, and a response sentence having a causal relationship with an utterance sentence can be preferentially selected (reranking response generation). Specifically, a word in the user's utterance sentence and a word in the response sentence are paired with a word pair dictionary. When matching is achieved (the word pair exists in the word pair dictionary), it is determined that there is a causal relationship, and this response sentence is preferentially selected.
According to Non-Patent Document 1, for example, causality pairs can be extracted and learned for the following sentences.
“Since the yen has weakened, we can expect the Japanese economy to rise from a trade perspective.”
: Causality pair {(yen depreciation) -> (economy rise)}

また、大規模な対話コーパスから因果関係を持つ対話データ（発話文及び応答文）のみを用いて、学習モデルを作成する技術もある（例えば非特許文献２参照）。 There is also a technique for creating a learning model using only dialogue data (utterance sentences and response sentences) having a causal relationship from a large-scale dialogue corpus (see, for example, Non-Patent Document 2).

Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, and Satoshi Nakamura. "Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding" 1st Workshop NLP for Conversational AI, ACL 2019 Workshop (ConvAI).Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, and Satoshi Nakamura. "Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding" 1st Workshop NLP for Conversational AI, ACL 2019 Workshop (ConvAI). 佐藤翔多, 乾健太郎. “因果関係に基づいてデータサンプリングを利用した雑談応答学習”言語処理大会第２４回年次大会発表論文集（２０１８年３月）Shota Sato, Kentaro Inui. “Chat Response Learning Using Data Sampling Based on Causal Relationships” Proceedings of the 24th Annual Conference of the Language Processing Conference (March 2018)

非特許文献１に記載の技術によれば、学習時には、単語ペアのみを照合するために、その単語ペア以外の文脈の特徴量を全く考慮してない。前述の例によれば、因果関係ペア｛（円安になる）->（景気が上昇）｝には、「貿易の視点」や「日本」のような制限となる特徴量が、全く含まれないこととなる。
また、運用時には、実際のユーザの発話文に対して、予め学習された因果関係ペアが完全一致で照合しないと、リランキング（最適な応答結果の再選定）を実現できないという問題もある。 According to the technique described in Non-Patent Literature 1, since only word pairs are matched during learning, context feature amounts other than the word pairs are not considered at all. According to the above example, the causality pair {(the yen depreciates) -> (the economy rises)} does not include any restrictive features such as "perspective of trade" or "Japan." There will be no
In addition, during operation, there is also the problem that reranking (reselection of the optimum response result) cannot be realized unless causality pairs that have been learned in advance are completely matched against actual user utterances.

非特許文献２に記載の技術によれば、因果関係を持つ対話データ（発話文及び応答文）のみを用いて学習モデルを作成するために、教師データとなる対話データに依存しすぎてしまう。これは、教師データの量のみならず、ユーザの発話文に対する応答文としての多様性や汎用性が乏しいという問題もある。 According to the technique described in Non-Patent Document 2, in order to create a learning model using only dialogue data (utterance sentences and response sentences) that have a causal relationship, it relies too much on dialogue data serving as teacher data. This is not only because of the amount of training data, but also because it lacks diversity and versatility as a response sentence to a user's utterance sentence.

そこで、本発明は、教師データとしての対話コーパスに依存しすぎることなく、ユーザの発話文に対して、できる限り因果関係を考慮した応答文を推定する装置、プログラム及び方法を提供することを目的とする。 SUMMARY OF THE INVENTION Accordingly, it is an object of the present invention to provide an apparatus, a program, and a method for estimating a response sentence that considers causal relationships as much as possible with respect to a user's utterance sentence without relying too much on a dialogue corpus as training data. and

本発明によれば、ユーザの発話文に対する応答文を生成する対話装置に搭載されたコンピュータを機能させるプログラムにおいて、
学習時に、
文章中の前後を因果関係で接続する接続助詞を登録した接続助詞テーブルと、
第１の教師データとなる学習文章の群から、接続助詞テーブルに登録された接続助詞を含む学習文章を選別する学習文章選別手段と、
接続助詞を含む学習文章を因果関係有りとして学習する分類型の第１の機械学習エンジンと、
第２の教師データとなる対話コーパスの対となる発話文及び応答文を、第１の機械学習エンジンへ入力し、因果関係有りと推定された発話文及び応答文を選別する発話応答文章選別手段と、
因果関係有りと推定された発話文をエンコーダ側に入力し、応答文をデコーダ側に入力して学習する第２の機械学習エンジンと
して機能させ、
運用時に、
ユーザの発話文を第２の機械学習エンジンへ入力し、応答文を取得する対話手段と
してコンピュータを機能させることを特徴とする。 According to the present invention, in a program that causes a computer installed in a dialogue device to generate a response sentence to a user's utterance sentence,
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
learning sentence selection means for selecting learning sentences containing the conjunction particles registered in the conjunction particle table from the group of learning sentences serving as the first training data;
a classification-type first machine learning engine that learns learning sentences containing conjunctive particles as having a causal relationship;
Utterance response sentence selection means for inputting an utterance sentence and a response sentence paired in a dialogue corpus as second training data to a first machine learning engine and selecting an utterance sentence and a response sentence presumed to have a causal relationship. and,
Functioning as a second machine learning engine that inputs an utterance sentence estimated to have a causal relationship to the encoder side and inputs a response sentence to the decoder side for learning,
During operation,
It is characterized by inputting a user's uttered sentence to the second machine learning engine and causing the computer to function as interactive means for obtaining a response sentence.

本発明のプログラムにおける他の実施形態によれば、
運用時に、
対話手段から入力されたユーザの発話文と、当該発話文を第２の機械学習エンジンへ入力して取得された応答文と、を接続した発話応答文章を生成する発話応答文章生成手段と、
生成された発話応答文章を、第１の機械学習エンジンへ入力することによって因果関係の有無を推定し、因果関係有りと推定された当該発話応答文章の応答文を対話手段へ出力する因果関係推定手段と
して更にコンピュータを機能させることも好ましい。 According to another embodiment of the program of the present invention,
During operation,
an utterance response sentence generation means for generating an utterance response sentence by connecting a user utterance sentence input from the dialogue means and a response sentence obtained by inputting the utterance sentence to a second machine learning engine;
Presence or absence of a causal relationship is estimated by inputting the generated utterance response text into a first machine learning engine, and a causal relationship in which a response sentence of the utterance response text presumed to have a causal relationship is output to the dialogue means. It is also preferable to have a computer further function as the estimation means.

本発明のプログラムにおける他の実施形態によれば、
第１の機械学習エンジンは、因果関係有りとなる文章全体の表現の特徴を抽出可能な分類型のニューラルネットワークであり、
第２の機械学習エンジンは、因果関係有りとなる発話文及び応答文の間の特徴を抽出可能なSeq2Seqである
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the program of the present invention,
The first machine learning engine is a categorical neural network capable of extracting the characteristics of the entire sentence that has a causal relationship,
The second machine learning engine also preferably functions as a Seq2Seq capable of extracting features between the causal utterance sentence and the response sentence.

本発明のプログラムにおける他の実施形態によれば、
学習時に、
第２の教師データとなる対話コーパスの対となる発話文及び応答文について、発話文をエンコーダ側に入力し、応答文をデコーダ側に入力して学習する第３の機械学習エンジンと
して機能させ、
運用時に、
因果関係推定手段は、第１の機械学習エンジンによって因果関係無しと推定された際に、発話応答文章の発話文を第３の機械学習エンジンへ入力して取得した応答文を、対話手段へ出力する
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the program of the present invention,
when learning,
Functioning as a third machine learning engine for learning by inputting the utterance sentence to the encoder side and inputting the response sentence to the decoder side for the utterance sentence and the response sentence that are paired in the dialogue corpus as the second training data,
During operation,
When the first machine learning engine estimates that there is no causal relationship, the causality estimating means inputs the utterance sentence of the utterance response sentence to the third machine learning engine and outputs the obtained response sentence to the dialogue means. It is also preferred to have the computer function to do so.

本発明のプログラムにおける他の実施形態によれば、
第３の機械学習エンジンは、汎用的な発話文及び応答文の間の特徴を抽出可能なSeq2Seqである
ようにコンピュータを機能させることも好ましい。 According to another embodiment of the program of the present invention,
The third machine learning engine also preferably functions as a Seq2Seq capable of extracting features between generic utterance sentences and response sentences.

本発明によれば、ユーザの発話文に対する応答文を生成する対話装置において、
学習時に、
文章中の前後を因果関係で接続する接続助詞を登録した接続助詞テーブルと、
第１の教師データとなる学習文章の群から、接続助詞テーブルに登録された接続助詞を含む学習文章を選別する学習文章選別手段と、
接続助詞を含む学習文章を因果関係有りとして学習する分類型の第１の機械学習エンジンと、
第２の教師データとなる対話コーパスの対となる発話文及び応答文を、第１の機械学習エンジンへ入力し、因果関係有りと推定された発話文及び応答文を選別する発話応答文章選別手段と、
因果関係有りと推定された発話文をエンコーダ側に入力し、応答文をデコーダ側に入力して学習する第２の機械学習エンジンと
して機能させ、
運用時に、
ユーザの発話文を第２の機械学習エンジンへ入力し、応答文を取得する対話手段と
を有することを特徴とする。 According to the present invention, in a dialogue device that generates a response sentence to a user's utterance sentence,
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
learning sentence selection means for selecting learning sentences containing the conjunction particles registered in the conjunction particle table from the group of learning sentences serving as the first training data;
a classification-type first machine learning engine that learns learning sentences containing conjunctive particles as having a causal relationship;
Utterance response sentence selection means for inputting an utterance sentence and a response sentence paired in a dialogue corpus as second training data to a first machine learning engine and selecting an utterance sentence and a response sentence presumed to have a causal relationship. and,
Functioning as a second machine learning engine that inputs an utterance sentence estimated to have a causal relationship to the encoder side and inputs a response sentence to the decoder side for learning,
During operation,
and dialogue means for inputting a user's utterance sentence to a second machine learning engine and acquiring a response sentence.

本発明によれば、ユーザの発話文に対する応答文を生成する装置の対話方法において、
装置は、
学習時に、
文章中の前後を因果関係で接続する接続助詞を登録した接続助詞テーブルと、
第１の教師データとなる学習文章の群から、接続助詞テーブルに登録された接続助詞を含む学習文章を選別し、接続助詞を含む学習文章を因果関係有りとして学習する分類型の第１の機械学習エンジンと、
第２の教師データとなる対話コーパスの対となる発話文及び応答文を、第１の機械学習エンジンへ入力し、因果関係有りと推定された発話文及び応答文を選別し、因果関係有りと推定された発話文をエンコーダ側に入力し、応答文をデコーダ側に入力して学習する第２の機械学習エンジンと
を有し、
運用時に、
ユーザの発話文を第２の機械学習エンジンへ入力し、応答文を取得する
ように実行することを特徴とする。 According to the present invention, in the dialogue method for a device that generates a response sentence to a user's utterance sentence,
The device
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
A classification-type first machine that selects learning sentences containing a conjunction particle registered in a conjunction particle table from a group of learning sentences serving as first teacher data, and learns the learning sentences containing the conjunction particles as having a causal relationship. a learning engine;
A pair of utterance sentences and response sentences in the dialogue corpus, which serves as second training data, is input to the first machine learning engine, and utterance sentences and response sentences that are presumed to have a causal relationship are selected, and judged to have a causal relationship. a second machine learning engine that inputs the estimated utterance sentence to the encoder side and inputs the response sentence to the decoder side for learning;
During operation,
It is characterized by inputting a user's uttered sentence to a second machine learning engine and executing so as to obtain a response sentence.

本発明の対話装置、プログラム及び方法によれば、教師データとしての対話コーパスに依存しすぎることなく、ユーザの発話文に対して、できる限り因果関係を考慮した応答文を推定することができる。 According to the dialogue apparatus, program, and method of the present invention, it is possible to estimate a response sentence that considers causal relationships as much as possible with respect to a user's utterance sentence without relying too much on a dialogue corpus as teacher data.

本発明における対話装置の機能構成図である。1 is a functional configuration diagram of an interactive device according to the present invention; FIG. 第１の機械学習エンジン及び第２の機械学習エンジンの学習を表す説明図である。FIG. 4 is an explanatory diagram showing learning of a first machine learning engine and a second machine learning engine; 第１の機械学習エンジンの概略的な機能構成図である。1 is a schematic functional configuration diagram of a first machine learning engine; FIG. 第２の機械学習エンジン及び第３の機械学習エンジンの概略的な機能構成図である。FIG. 4 is a schematic functional configuration diagram of a second machine learning engine and a third machine learning engine; 第２の実施形態における運用時の対話装置の第２の機能構成図である。FIG. 11 is a second functional configuration diagram of the interactive device during operation in the second embodiment; 第２の実施形態における運用時の対話を表す説明図である。FIG. 11 is an explanatory diagram showing dialogue during operation in the second embodiment; 第３の実施形態における運用時の対話装置の第２の機能構成図である。FIG. 11 is a second functional configuration diagram of the interactive device during operation in the third embodiment; 第３の実施形態における運用時の対話を表す説明図である。FIG. 11 is an explanatory diagram showing dialogue during operation in the third embodiment;

以下、本発明の実施の形態について、図面を用いて詳細に説明する。 BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.

図１は、本発明における対話装置の機能構成図である。 FIG. 1 is a functional configuration diagram of a dialogue device according to the present invention.

対話装置１は、ユーザとの間で、自然な対話を実現するものであり、ユーザの発話文に対する応答文を生成する。
図１によれば、サーバ機能を有する対話装置１は、ユーザインタフェース機能を有する端末２と通信する。端末２は、ユーザに対する入出力デバイスとして、マイクによってユーザの音声を取得し、スピーカによってユーザへ発声するものであってもよいし、ユーザからテキストベースの発話文を入力し、応答文を表示するものであってもよい。
尚、音声認識機能は、対話装置１に搭載されたものであってもよいし、端末２に搭載されていてもよい。 The dialogue device 1 realizes natural dialogue with a user, and generates a response sentence to a user's utterance sentence.
According to FIG. 1, a dialogue device 1 with server functionality communicates with a terminal 2 with user interface functionality. The terminal 2, as an input/output device for the user, may acquire the user's voice with a microphone and utter it to the user with a speaker, or input a text-based utterance from the user and display a response. can be anything.
The speech recognition function may be installed in the interactive device 1 or may be installed in the terminal 2. FIG.

本発明によれば、対話装置１は、複数の機械学習エンジンを搭載しており、＜学習時＞及び＜運用時＞に分けられる。また、対話装置１は、機械学習エンジンに対して、学習時に、複数の＜教師データ＞によって学習モデルを構築させる。 According to the present invention, the interactive device 1 is equipped with a plurality of machine learning engines, and is divided into <during learning> and <during operation>. Further, the interactive device 1 causes the machine learning engine to construct a learning model using a plurality of <teacher data> during learning.

＜教師データ＞
図１によれば、対話装置１は、教師データとして、接続助詞テーブル１００と、第１の教師データ１０１と、第２の教師データ１０２とを有する。これら教師データは、対話装置１自らが記憶しておく必要はないが、学習時に外部から入力する必要がある。 <Teacher data>
According to FIG. 1, the dialogue device 1 has a conjunctive particle table 100, first teacher data 101, and second teacher data 102 as teacher data. These teacher data need not be stored by the interactive device 1 themselves, but must be input from the outside during learning.

［接続助詞テーブル１００］
接続助詞テーブル１００は、文章中の前後を因果関係で接続する接続助詞を登録したものである。「接続助詞」とは、前文と後文との間に因果関係を構築する助詞であり、因果関係の手がかりとなるものである。
例えば、以下のような助詞がある。
「～ため、～」「～から、～」「～により、～」「～によって、～」
「～を背景に、～」「～を受け、～」「～の結果、～」「～をきっかけに、～」
「～の影響、～」「～の原因、～」「～を行うと、～」「～すれば、～」
「～しないと、～」「～に伴い、～」「～を反映し、～」 [Conjunctive particle table 100]
The conjunctive particle table 100 registers conjunctive particles that connect sentences before and after in a causal relationship. A "conjunctive particle" is a particle that builds a causal relationship between a preamble and a postscript, and serves as a clue to the causal relationship.
For example, there are adjectives such as:
"Because of,""From,""Becauseof,""Becauseof,"
``In the background of ``, ``In response to ``, ``As a result of ``, ``In the wake of ``,
"Influence of""Causeof""If you do""If you do"
"If you don't,""In line with,""Reflecting,"

［第１の教師データ１０１］
第１の教師データ１０１は、インターネット上で、自然言語の文章を構造化して大規模に集積した大量のコーパスである。これは、例えばウィキペディア(Wikipedia)（登録商標）のような百科事典であって、自然言語として正当な文章群である。
例えば、以下のような文章があるとする。
「子供が熱を出したため、学校を休んだ」 [First teacher data 101]
The first training data 101 is a large corpus of structured natural language sentences accumulated on the Internet on a large scale. This is, for example, an encyclopedia such as Wikipedia (registered trademark), which is a valid set of sentences as a natural language.
For example, suppose we have the following sentences:
"I was absent from school because my child had a fever."

［第２の教師データ１０２］
第２の教師データ１０２は、発話文と応答文とからなる対話ログの群である。これは、人の対話認識として正当な文章群である。
例えば以下のような文章があるとする。
発話文「子供が熱を出した」
応答文「学校を休んでください」 [Second teacher data 102]
The second training data 102 is a group of dialogue logs consisting of utterance sentences and response sentences. This is a valid sentence group for human dialogue recognition.
For example, suppose we have the following sentences:
Utterance sentence "My child has a fever."
Response sentence "Please take a break from school"

＜学習時＞
対話装置１は、学習時の機能部として、学習文章選別部１１と、第１の機械学習エンジン１２と、発話応答文章選別部１３と、第２の機械学習エンジン１４と、第３の機械学習エンジン１５とを有する。これら機能構成部は、装置に搭載されたコンピュータを機能させるプログラムを実行することによって実現される。また、これら機能構成部の処理の流れは、対話装置の学習方法としても理解できる。 <When learning>
The dialogue device 1 includes, as functional units during learning, a learning sentence selection unit 11, a first machine learning engine 12, an utterance response sentence selection unit 13, a second machine learning engine 14, and a third machine learning engine. and an engine 15 . These functional components are implemented by executing a program that causes a computer installed in the device to function. In addition, the flow of processing of these functional components can also be understood as a learning method of the interactive device.

図２は、第１の機械学習エンジン及び第２の機械学習エンジンの学習を表す説明図である。 FIG. 2 is an explanatory diagram showing learning of the first machine learning engine and the second machine learning engine.

［学習文章選別部１１］
学習文章選別部１１は、第１の教師データ１０１となる学習文章の群から、接続助詞テーブル１００に登録された「接続助詞」を含む学習文章を選別する。これによって、接続助詞を含む学習文章から、接続助詞を削除して連結した学習文章のみが、第１の機械学習エンジン１２へ入力される。
図２によれば、以下の学習文章が、第１の機械学習エンジン１２へ入力される。
第１の教師データ「子供が熱を出した｛ため｝、学校を休んだ」
入力される学習文章「子供が熱を出した、学校を休んだ」 [Learning text selection unit 11]
The learning sentence selection unit 11 selects learning sentences containing the “conjunctive particles” registered in the conjunctional particle table 100 from the group of learning sentences to be the first teacher data 101 . As a result, only the learning sentences obtained by deleting the connecting particles from the learning sentences containing the connecting particles and connecting them are input to the first machine learning engine 12 .
According to FIG. 2, the following training text is input to the first machine learning engine 12:
1st teacher data: "I was absent from school because my child had a fever."
Learning sentence to be input "My child had a fever and was absent from school"

［第１の機械学習エンジン１２］
第１の機械学習エンジン１２は、学習文章選別部１１から入力された学習文章に対して、形態素分析によって形態素に区分した上で、「因果関係有り」として学習する。 [First Machine Learning Engine 12]
The first machine learning engine 12 classifies the learning sentences input from the learning sentence selection unit 11 into morphemes by morphological analysis, and then learns them as “having causal relationships”.

図３は、第１の機械学習エンジンの概略的な機能構成図である。
図３によれば、第１の機械学習エンジン１２は、分類型のニューラルネットワークである。
第１の機械学習エンジン１２は、学習時には、因果関係有りとなる文章全体の表現の特徴を網羅的に抽出した深層学習モデルを構築する。
そして、第１の機械学習エンジン１２は、運用時には、因果関係が不明な発話応答文章を入力すると、形態素分析によって形態素に区分した上で、因果関係有り又は無しに分類した結果を出力する。 FIG. 3 is a schematic functional configuration diagram of the first machine learning engine.
According to FIG. 3, the first machine learning engine 12 is a categorical neural network.
At the time of learning, the first machine learning engine 12 constructs a deep learning model that comprehensively extracts the features of the expressions of the entire sentence that have a causal relationship.
During operation, the first machine learning engine 12, when inputting an utterance response sentence with unknown causality, classifies it into morphemes by morphological analysis, and then outputs the result of classification into whether there is causality or not.

［発話応答文章選別部１３］
発話応答文章選別部１３は、第２の教師データ１０２となる対話コーパスの対となる発話文及び応答文を、第１の機械学習エンジン１２へ入力する。対となる発話文及び応答文は、そのまま接続された発話応答文章として、第１の機械学習エンジン１２へ入力される。 [Utterance Response Sentence Selection Unit 13]
The utterance response sentence selection unit 13 inputs the utterance sentence and the response sentence that form a pair of the dialogue corpus as the second teacher data 102 to the first machine learning engine 12 . The paired utterance sentence and response sentence are input to the first machine learning engine 12 as an utterance response sentence connected as they are.

図２によれば、発話応答文章選別部１３は、以下の発話応答文章を、第１の機械学習エンジン１２へ出力している。
発話応答文章「子供が熱を出した、仕事を休んでください」
これに対し、第１の機械学習エンジン１２は、その発話応答文章に対して、因果関係有り又は無しの結果を、発話応答文章選別部１３へ返答する。
そして、発話応答文章選別部１３は、「因果関係有り」と推定された発話文及び応答文のみを、第２の機械学習エンジン１４へ出力する。
発話文「子供が熱を出した」
応答文「学校を休んでください」 According to FIG. 2, the speech response text selection unit 13 outputs the following speech response texts to the first machine learning engine 12 .
Spoken Response Sentence "My child has a fever, please take a day off from work."
In response to this, the first machine learning engine 12 replies to the speech response text selection unit 13 with a result indicating that there is or is not a causal relationship with respect to the speech response text.
Then, the utterance response text selection unit 13 outputs only the utterance texts and response texts estimated to have a causal relationship to the second machine learning engine 14 .
Utterance sentence "My child has a fever."
Response sentence "Please take a break from school"

［第２の機械学習エンジン１４］
第２の機械学習エンジン１４は、因果関係有りと推定された発話文を、形態素分析によって形態素に区分した上でエンコーダ側に入力し、応答文を、形態素分析によって形態素に区分した上でデコーダ側に入力して学習する。 [Second machine learning engine 14]
The second machine learning engine 14 divides the speech sentence estimated to have a causal relationship into morphemes by morphological analysis and inputs it to the encoder side. Learn by typing.

図４は、第２の機械学習エンジン及び第３の機械学習エンジンの概略的な機能構成図である。
図４によれば、第２の機械学習エンジン１４は、具体的には、因果関係有りとなる発話文及び応答文の間の特徴を抽出可能なSeq2Seqであってもよいし、seq2seq+attentionやtransformのような改良モデルであってもよい。
seq2seqは、形態素文字列を入力して、別の形態素文字列を出力する置き換えルールを学習するニューラルネットワークである。これによって、発話文に対して１セットの応答文を学習していく。勿論、文字列の依存関係を学習可能なＲＮＮ(Recurrent Neural Network)の一種である例えばＬＳＴＭ(Long Short-Term Memory)であってもよい。
これによって、第２の機械学習エンジン１４は、エンコーダ側に発話文が入力されると、デコーダ側から応答文が出力されるようになる。 FIG. 4 is a schematic functional configuration diagram of the second machine learning engine and the third machine learning engine.
According to FIG. 4, the second machine learning engine 14 may be specifically Seq2Seq capable of extracting features between an utterance sentence and a response sentence that have a causal relationship, seq2seq+attention, or It may be an improved model such as transform.
seq2seq is a neural network that learns replacement rules for inputting a morpheme string and outputting another morpheme string. In this way, one set of response sentences is learned for each utterance sentence. Of course, for example, LSTM (Long Short-Term Memory), which is a type of RNN (Recurrent Neural Network) capable of learning the dependency relationship of character strings, may be used.
As a result, the second machine learning engine 14 outputs a response sentence from the decoder side when the utterance sentence is input to the encoder side.

［第３の機械学習エンジン１５］
第３の機械学習エンジン１５は、第２の教師データ１０２となる対話コーパスの対となる発話文及び応答文について、発話文を、形態素分析によって形態素に区分した上でエンコーダ側に入力し、応答文を、形態素分析によって形態素に区分した上でデコーダ側に入力して学習する。
即ち、第３の機械学習エンジン１５は、第２の機械学習エンジン１４と異なって、因果関係に限らず、全ての第２の教師データ１０２について、汎用的に発話文及び応答文の関係の特徴を学習する。
一方で、第３の機械学習エンジン１５は、第２の機械学習エンジン１４と同様に、発話文及び応答文の間の特徴を抽出可能なSeq2Seqであってもよいし、seq2seq+attentionやtransformのような改良モデルであってもよい。
これによって、第３の機械学習エンジン１５は、エンコーダ側に発話文が入力されると、デコーダ側から応答文が出力されるようになる。 [Third Machine Learning Engine 15]
The third machine learning engine 15 divides the spoken sentences into morphemes by morphological analysis and inputs them to the encoder side for the spoken sentences and the response sentences that are pairs of the dialogue corpus, which is the second teacher data 102, and inputs them to the encoder side. A sentence is segmented into morphemes by morphological analysis and input to the decoder for learning.
In other words, unlike the second machine learning engine 14, the third machine learning engine 15 is not limited to the causal relationship, but for all of the second teacher data 102, general purpose characteristics of the relationship between the utterance sentence and the response sentence. to learn.
On the other hand, the third machine learning engine 15, like the second machine learning engine 14, may be Seq2Seq capable of extracting features between an utterance sentence and a response sentence, or may be a seq2seq+attention or transform engine. It may be an improved model such as
As a result, when the speech sentence is input to the encoder side, the third machine learning engine 15 outputs a response sentence from the decoder side.

＜運用時＞
［対話部１６］
対話装置１は、運用時の機能部として、対話部１６を有する。 <During operation>
[Dialogue part 16]
The dialog device 1 has a dialog section 16 as a functional section during operation.

対話部１６は、ユーザの発話文を端末２から受信し、その発話文を第２の機械学習エンジン１４へ出力する。そして、第２の機械学習エンジン１４から応答文を取得し、その応答文を端末２へ送信する。端末２のユーザインタフェース機能が、キー入力可能であってディスプレイ表示可能であれば、対話部１６は、発話文及び応答文はテキストベースで、端末２との間で送受信することができる。 The dialogue unit 16 receives the user's utterance sentence from the terminal 2 and outputs the utterance sentence to the second machine learning engine 14 . Then, it acquires a response sentence from the second machine learning engine 14 and transmits the response sentence to the terminal 2 . If the user interface function of the terminal 2 allows key input and display, the dialogue unit 16 can transmit and receive text-based utterance sentences and response sentences to and from the terminal 2 .

対話部１６は、端末２のユーザインタフェース機能に応じて、ユーザの発話音声の音声認識機能、及び、ユーザへの応答文の音声変換機能を有するものであってもよい。音声認識機能は、端末２のマイクによって取得されたユーザの発話音声を、テキストベースの発話文に変換する。音声変換機能は、第２の機械学習エンジン１４から取得した応答文を、音声信号に変換する。
この場合、対話部１６は、ユーザの発話音声データを端末２から受信し、その発話音声データをテキストの発話文に変換し、その発話文を第２の機械学習エンジン１４へ入力する。また、第２の機械学習エンジン１４から応答文を入力し、その応答文を音声データに変換し、その音声データを端末２へ送信する。 The dialogue unit 16 may have a function of recognizing a user's uttered voice and a function of converting a response sentence to a user's voice according to the user interface function of the terminal 2 . The voice recognition function converts the user's uttered voice acquired by the microphone of the terminal 2 into a text-based uttered sentence. The speech conversion function converts the response sentence obtained from the second machine learning engine 14 into a speech signal.
In this case, the dialogue unit 16 receives the user's speech data from the terminal 2 , converts the speech data into a text utterance, and inputs the utterance to the second machine learning engine 14 . It also receives a response sentence from the second machine learning engine 14 , converts the response sentence into voice data, and transmits the voice data to the terminal 2 .

図５は、第２の実施形態における運用時の対話装置の第２の機能構成図である。 FIG. 5 is a second functional configuration diagram of the interactive device during operation according to the second embodiment.

前述した図２によれば、対話部１６は、第２の機械学習エンジン１４のみを用いて、発話文に対する応答文を取得する。
これに対し、図５によれば、対話装置１は、第１の機械学習エンジン１２及び第２の機械学習エンジン１４を用いて、運用時の機能部として、対話部１６に加えて、発話応答文章生成部１７と、因果関係推定部１８とを有する。これら機能構成部も、装置に搭載されたコンピュータを機能させるプログラムを実行することによって実現される。また、これら機能構成部の処理の流れは、対話装置の運用方法としても理解できる。 According to FIG. 2 described above, the dialogue unit 16 uses only the second machine learning engine 14 to acquire a response sentence to the utterance sentence.
On the other hand, according to FIG. 5, the dialogue device 1 uses the first machine learning engine 12 and the second machine learning engine 14 to function as functional units during operation in addition to the dialogue unit 16. It has a sentence generator 17 and a causal relationship estimator 18 . These functional components are also implemented by executing a program that causes a computer installed in the device to function. In addition, the processing flow of these functional components can also be understood as an operation method of the interactive device.

図６は、第２の実施形態における運用時の対話を表す説明図である。 FIG. 6 is an explanatory diagram showing dialogue during operation in the second embodiment.

［発話応答文章生成部１７］
発話応答文章生成部１７は、対話部１６から入力されたユーザの発話文と、当該発話文を第２の機械学習エンジン１４へ入力して取得された応答文と、を接続した発話応答文章を生成する。 [Utterance Response Sentence Generation Unit 17]
The utterance response sentence generation unit 17 generates an utterance response sentence by connecting the user's utterance sentence input from the dialogue unit 16 and the response sentence obtained by inputting the utterance sentence to the second machine learning engine 14. Generate.

図６によれば、対話部１６は、端末２から以下の発話文を受信したとする。
発話文「子供が熱を出した」
対話部１６は、この発話文を、発話応答文章生成部１７へ入力する。
発話応答文章生成部１７は、入力された以下の発話文を、第２の機械学習エンジン１４へ入力する。
発話文「子供が熱を出した」
これに対し、第２の機械学習エンジン１４は、発話文に対する応答文を推定し、以下の応答文を返答する。
応答文「仕事を休んでください」
これに対し、発話応答文章生成部１７は、発話文と応答文とを接続して、以下の発話応答文章を生成する。
発話応答文章「子供が熱を出した、仕事を休んでください」
この発話応答文章は、因果関係推定部１８へ入力される。 According to FIG. 6, it is assumed that the dialogue unit 16 receives the following utterance sentence from the terminal 2 .
Utterance sentence "My child has a fever."
The dialogue unit 16 inputs this utterance sentence to the utterance response sentence generation unit 17 .
The utterance response text generation unit 17 inputs the following input utterance text to the second machine learning engine 14 .
Utterance sentence "My child has a fever."
In response to this, the second machine learning engine 14 estimates a response sentence to the utterance sentence and returns the following response sentence.
Response sentence "Please take a break from work"
On the other hand, the utterance response text generation unit 17 connects the utterance text and the response text to generate the following utterance response text.
Spoken Response Sentence "My child has a fever, please take a day off from work."
This speech response text is input to the causality estimation unit 18 .

［因果関係推定部１８］
因果関係推定部１８は、生成された発話応答文章を、第１の機械学習エンジン１２へ入力することによって因果関係の有無を推定し、因果関係有りと推定された当該発話応答文章の応答文を、対話部１６へ出力する。
[Causal relationship estimation unit 18]
The causal relationship estimation unit 18 estimates whether or not there is a causal relationship by inputting the generated utterance response text into the first machine learning engine 12, and extracts the response sentence of the utterance response text that is estimated to have a causal relationship. is output to the dialogue unit 16 .

図６によれば、因果関係推定部１８は、発話応答文章生成部１７から入力された発話応答文章を、第１の機械学習エンジン１２へ入力する。
発話応答文章「子供が熱を出した、仕事を休んでください」
これに対し、第１の機械学習エンジン１２は、因果関係の有無を推定し、ここでは、因果関係有りが、因果関係推定部１８へ出力される。
このとき、因果関係推定部１８は、因果関係有りであるので、発話応答文章の応答文を、対話部１６へ出力する。
応答文「仕事を休んでください」
これによって、対話部１６は、その応答文を端末２へ送信する。 According to FIG. 6 , the causal relationship estimating unit 18 inputs the speech response text input from the speech response text generating unit 17 to the first machine learning engine 12 .
Spoken Response Sentence "My child has a fever, please take a day off from work."
On the other hand, the first machine learning engine 12 estimates whether or not there is a causal relationship, and here, the existence of a causal relationship is output to the causal relationship estimating section 18 .
At this time, since there is a causal relationship, the causal relationship estimating unit 18 outputs the response sentence of the speech response sentence to the dialogue unit 16 .
Response sentence "Please take a break from work"
By this, the dialog part 16 transmits the response sentence to the terminal 2. FIG.

図７は、第３の実施形態における運用時の対話装置の第２の機能構成図である。 FIG. 7 is a second functional block diagram of the interactive device during operation according to the third embodiment.

図７は、図５と比較して、第３の機械学習エンジン１５を更に用いる。
因果関係推定部１８は、第１の機械学習エンジン１２によって「因果関係無し」と推定された発話応答文章における発話文を、第３の機械学習エンジン１５へ入力する。
これに対し、第３の機械学習エンジン１５は、その発話文に対して応答文を推定し、その応答文を、因果関係推定部１８へ出力する。
そして、因果関係推定部１８は、第３の機械学習エンジン１５から出力された応答文を、対話部１６へ出力する。
このように、因果関係推定部１８は、発話文に対する応答文として、因果関係無しの場合であっても、汎用モデルとしての第３の機械学習エンジン１５を用いて、多様性を持つ応答文を出力することができる。 FIG. 7 further uses a third machine learning engine 15 compared to FIG.
The causality estimation unit 18 inputs the utterance sentences in the utterance response sentences estimated as “no causality” by the first machine learning engine 12 to the third machine learning engine 15 .
On the other hand, the third machine learning engine 15 estimates a response sentence for the utterance sentence and outputs the response sentence to the causality estimation unit 18 .
The causality estimation unit 18 then outputs the response sentence output from the third machine learning engine 15 to the dialogue unit 16 .
In this way, the causal relationship estimation unit 18 uses the third machine learning engine 15 as a general-purpose model to generate diverse response sentences even when there is no causal relationship as a response sentence to an utterance sentence. can be output.

図８は、第３の実施形態における運用時の対話を表す説明図である。 FIG. 8 is an explanatory diagram showing dialogue during operation in the third embodiment.

図８によれば、対話部１６から発話応答文章生成部１７へ、以下の発話文が入力されたとする。
発話文「子供が熱を出した」
これに対して、発話応答文章生成部１７は、第２の機械学習エンジン１４へ、その発話文を入力し、以下の応答文を得る。
応答文「仕事場で昼食をとった」
そして、発話応答文章生成部１７は、発話文と応答文とを接続した、以下の発話応答文章を、因果関係推部１８へ出力する。
発話応答文章「子供が熱を出した、仕事場で昼食をとった」
これに対して、因果関係推定部１８は、第１の機械学習エンジン１２へ、その発話応答文章を入力し、「因果関係無し」と推定されている。
これに対して、因果関係推定部１８は、因果関係無しと推定された発話応答文章における以下の発話文を、第３の機械学習エンジン１５へ入力する。
発話文「子供が熱を出した」
これに対し、因果関係推定部１８は、第３の機械学習エンジン１５から、以下の応答文を得る。
応答文「残業はしますか」
そして、因果関係推定部１８は、その応答文を、対話部１６へ出力する。 According to FIG. 8, it is assumed that the following utterance sentence is input from the dialogue unit 16 to the utterance response sentence generation unit 17.
Utterance sentence "My child has a fever."
In response to this, the utterance response sentence generation unit 17 inputs the utterance sentence to the second machine learning engine 14 and obtains the following response sentence.
Response sentence "I had lunch at work"
Then, the utterance response text generation unit 17 outputs the following utterance response text connecting the utterance text and the response text to the causal relationship inferring unit 18 .
Spoken Response Sentence "My child has a fever, I had lunch at work."
On the other hand, the causality estimator 18 inputs the speech response text to the first machine learning engine 12, and it is estimated that there is no causality.
On the other hand, the causal relationship estimation unit 18 inputs the following utterance sentence in the utterance response sentence estimated to have no causal relationship to the third machine learning engine 15 .
Utterance sentence "My child has a fever."
In response to this, the causality estimation unit 18 obtains the following response sentence from the third machine learning engine 15 .
Response sentence "Do you work overtime?"
The causality estimation unit 18 then outputs the response sentence to the dialogue unit 16 .

以上、詳細に説明したように、本発明の対話装置、プログラム及び方法によれば、ユーザの発話文に対して、できる限り因果関係を考慮した応答文を推定することができる。
特に、本発明によれば、因果関係を持つ単語ペアのみならず、その周辺の単語群も含む文脈の全体表現の特徴を網羅的に学習しているために、発話文に対する応答文が、ユーザにとってできる限り自然に感じられるようになる。また、本発明によれば、因果関係の辞書に無い発話文であっても、因果関係が無い文についても網羅的に学習しているので、汎用的に応答文を返答することができる。
即ち、本発明によれば、文の全体表現の特徴を網羅的に学習することによって、発話文に対して因果関係有りであっても無しであっても、多様的且つ汎用的な応答文を返答することができる。 As described in detail above, according to the interactive device, program, and method of the present invention, it is possible to estimate a response sentence that considers causal relationships as much as possible with respect to a user's uttered sentence.
In particular, according to the present invention, since the characteristics of the overall expression of the context including not only word pairs with causal relationships but also word groups around them are comprehensively learned, the response sentence to the utterance sentence is the user's sentence. feels as natural as possible for In addition, according to the present invention, even sentences that do not have a causal relationship, even if they are not in the causal dictionary, are comprehensively learned, so that a response sentence can be returned in a general purpose manner.
That is, according to the present invention, by comprehensively learning the characteristics of the overall expression of a sentence, various and versatile response sentences can be generated regardless of whether or not there is a causal relationship with the utterance sentence. can reply.

最後に、従来技術の非特許文献１及び非特許文献２と、本発明との相違を表す、発話文及び応答文のサンプル例を説明する。
＜発話文に対して因果関係が有る応答文＞
（非特許文献１の場合）
発話文「円安になったね」
｛（円安になる）->（景気が上昇）｝という因果関係ペアの辞書を参照
応答文「景気が上昇するよ」
（本発明の場合）
｛因果関係文脈全体の特徴を網羅的に学習済み｝
発話文「円安になったね」
応答文「貿易の視点から見ると日本の景気が上昇することが期待できるよ」 Finally, sample examples of utterance sentences and response sentences showing the difference between the non-patent document 1 and non-patent document 2 of the prior art and the present invention will be described.
<Response sentence with causal relationship to utterance sentence>
(In the case of Non-Patent Document 1)
Spoken text: "The yen has depreciated."
Refer to a dictionary of causality pairs {(the yen will depreciate) -> (the economy will rise)} Response sentence "The economy will rise."
(in the case of the present invention)
{Comprehensive learning of the features of the entire causal context}
Spoken text: "The yen has depreciated."
Response: "From the perspective of trade, we can expect the Japanese economy to rise."

＜発話文に対して因果関係が無い応答文＞
（非特許文献２の場合）
発話文「レストランへ行きたい」
｛因果関係の辞書が無いために、相槌のみ｝
応答文「そうですか」
（本発明の場合）
｛因果関係文脈全体の特徴を網羅的に学習済み｝
発話文「円安になったね」
応答文「いつも外食ですか？」 <Response sentence with no causal relationship to the utterance sentence>
(In the case of Non-Patent Document 2)
Utterance sentence "I want to go to a restaurant"
{only backtracking due to lack of causality dictionary}
Response sentence "Is that so?"
(in the case of the present invention)
{Comprehensive learning of the features of the entire causal context}
Spoken text: "The yen has depreciated."
Response sentence "Do you always eat out?"

前述した本発明の種々の実施形態について、本発明の技術思想及び見地の範囲の種々の変更、修正及び省略は、当業者によれば容易に行うことができる。前述の説明はあくまで例であって、何ら制約しようとするものではない。本発明は、特許請求の範囲及びその均等物として限定するものにのみ制約される。 For the various embodiments of the present invention described above, various changes, modifications and omissions within the spirit and scope of the present invention can be easily made by those skilled in the art. The foregoing description is exemplary only and is not intended to be limiting. The invention is to be limited only as limited by the claims and the equivalents thereof.

１対話装置
１００接続助詞テーブル
１０１第１の教師データ
１０２第２の教師データ
１１学習文章選別部
１２第１の機械学習エンジン
１３発話応答文章選別部
１４第２の機械学習エンジン
１５第３の機械学習エンジン
１６対話部
１７発話応答文章生成部
１８因果関係推定部
２端末
1 Dialogue Device 100 Conjunctive Particle Table 101 First Teacher Data 102 Second Teacher Data 11 Learning Sentence Selector 12 First Machine Learning Engine 13 Utterance Response Sentence Selector 14 Second Machine Learning Engine 15 Third Machine Learning Engine 16 Dialog Unit 17 Utterance Response Sentence Generation Unit 18 Causal Relationship Estimation Unit 2 Terminal

Claims

In a program that operates a computer installed in a dialogue device that generates a response sentence to a user's utterance sentence,
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
learning sentence selection means for selecting learning sentences containing the conjunction particles registered in the conjunction particle table from the group of learning sentences serving as the first training data;
a classification-type first machine learning engine that learns learning sentences containing conjunctive particles as having a causal relationship;
Utterance response sentence selection means for inputting an utterance sentence and a response sentence paired in a dialogue corpus as second training data to a first machine learning engine and selecting an utterance sentence and a response sentence presumed to have a causal relationship. and,
Functioning as a second machine learning engine that inputs an utterance sentence estimated to have a causal relationship to the encoder side and inputs a response sentence to the decoder side for learning,
During operation,
A program characterized by inputting a user's utterance sentence to a second machine learning engine and causing a computer to function as dialogue means for obtaining a response sentence.

During operation,
an utterance response sentence generation means for generating an utterance response sentence by connecting a user utterance sentence input from the dialogue means and a response sentence obtained by inputting the utterance sentence to a second machine learning engine;
Presence or absence of a causal relationship is estimated by inputting the generated utterance response text into a first machine learning engine, and a causal relationship in which a response sentence of the utterance response text presumed to have a causal relationship is output to the dialogue means. 2. The program according to claim 1, further causing a computer to function as estimation means.

The first machine learning engine is a categorical neural network capable of extracting the characteristics of the entire sentence that has a causal relationship,
3. The program according to claim 1, wherein the second machine learning engine causes the computer to function as a Seq2Seq capable of extracting features between an utterance sentence and a response sentence that have a causal relationship.

when learning,
Functioning as a third machine learning engine for learning by inputting the utterance sentence to the encoder side and inputting the response sentence to the decoder side for the utterance sentence and the response sentence that are paired in the dialogue corpus as the second training data,
During operation,
When the first machine learning engine estimates that there is no causal relationship, the causality estimating means inputs the utterance sentence of the utterance response sentence to the third machine learning engine and outputs the obtained response sentence to the dialogue means. 3. A program according to claim 2 , which causes a computer to function to do.

The first machine learning engine is a categorical neural network capable of extracting the characteristics of the entire sentence that has a causal relationship,
The second machine learning engine is Seq2Seq that can extract features between utterance sentences and response sentences that have a causal relationship,
5. The program according to claim 4, wherein the third machine learning engine causes the computer to function as a Seq2Seq capable of extracting features between generic utterance sentences and response sentences.

In a dialogue device that generates a response sentence to a user's utterance sentence,
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
learning sentence selection means for selecting learning sentences containing the conjunction particles registered in the conjunction particle table from the group of learning sentences serving as the first training data;
a classification-type first machine learning engine that learns learning sentences containing conjunctive particles as having a causal relationship;
Utterance response sentence selection means for inputting an utterance sentence and a response sentence paired in a dialogue corpus as second training data to a first machine learning engine and selecting an utterance sentence and a response sentence presumed to have a causal relationship. and,
Functioning as a second machine learning engine that inputs an utterance sentence estimated to have a causal relationship to the encoder side and inputs a response sentence to the decoder side for learning,
During operation,
1. A dialogue device, comprising dialogue means for inputting a user's uttered sentence to a second machine learning engine and obtaining a response sentence.

In the interaction method for a device that generates a response sentence to a user's utterance sentence,
The device
when learning,
a conjunctive particle table in which conjunctive particles that connect the preceding and following sentences in a causal relationship are registered;
A classification-type first machine that selects learning sentences containing a conjunction particle registered in a conjunction particle table from a group of learning sentences serving as first teacher data, and learns the learning sentences containing the conjunction particles as having a causal relationship. a learning engine;
A pair of utterance sentences and response sentences in the dialogue corpus, which serves as second training data, is input to the first machine learning engine, and utterance sentences and response sentences that are presumed to have a causal relationship are selected, and judged to have a causal relationship. a second machine learning engine that inputs the estimated utterance sentence to the encoder side and inputs the response sentence to the decoder side for learning;
During operation,
A device interaction method, characterized by inputting a user's utterance sentence into a second machine learning engine and executing it so as to obtain a response sentence.