JP6849964B2

JP6849964B2 - Dialogue control device, dialogue engine, management terminal, dialogue device, dialogue control method, dialogue method, and program

Info

Publication number: JP6849964B2
Application number: JP2016172468A
Authority: JP
Inventors: 敬典興梠
Original assignee: Nextremer Co Ltd
Current assignee: Nextremer Co Ltd
Priority date: 2016-09-05
Filing date: 2016-09-05
Publication date: 2021-03-31
Anticipated expiration: 2036-09-05
Also published as: JP2018041124A

Description

本発明は、対話を制御する対話制御装置等に関するものである。 The present invention relates to a dialogue control device or the like that controls dialogue.

従来、ユーザとのコミュニケーションの柔軟性を向上させることを目的とした音声対話制御装置が存在した。かかる音声対話装置の制御は、音声を取得してから応答を生成するまでの待機時間に、応答の修正要否を判定するための修正条件が満たされたと判定したとき、生成した応答を修正する修正実行部を備える（特許文献１参照）。 Conventionally, there has been a voice dialogue control device for the purpose of improving the flexibility of communication with a user. The control of the voice dialogue device corrects the generated response when it is determined that the correction condition for determining whether or not the response needs to be corrected is satisfied in the waiting time from the acquisition of the voice to the generation of the response. It is provided with a correction execution unit (see Patent Document 1).

また、従来、実際のユーザの発話に適した分類カテゴリの設定が比較的容易に行えるようなメンテナンス性の高い対話システムがあった（特許文献２参照）。かかる対話システムは、対話の状態遷移情報を用いて対話を進行していく。 Further, conventionally, there has been a highly maintainable dialogue system in which a classification category suitable for an actual user's utterance can be set relatively easily (see Patent Document 2). Such a dialogue system advances the dialogue by using the state transition information of the dialogue.

特開２０１５−２５６９号公報Japanese Unexamined Patent Publication No. 2015-2569 特開２０１１−２１５７４２号公報Japanese Unexamined Patent Publication No. 2011-215742

しかしながら、従来の装置においては、一の対話エンジンのみを使用していたので、適切な対話が継続できない場合があった。 However, in the conventional device, since only one dialogue engine is used, there are cases where an appropriate dialogue cannot be continued.

また、例えば、ユーザと対話エンジンとの対話が破綻しているにも関わらず、対話を継続するか、対話を終了するかしかなかった。 Also, for example, even though the dialogue between the user and the dialogue engine is broken, there is no choice but to continue the dialogue or end the dialogue.

本第一の発明の対話制御装置は、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話制御装置であって、ユーザの発話に対応するユーザ発話情報をユーザ端末から受信する受信部と、ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、発話情報の出力に応じて、１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報をユーザ端末に送信する応答送信部とを具備する対話制御装置である。 The dialogue control device of the first invention is a dialogue control device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines, and obtains user utterance information corresponding to the user's utterance. A receiving unit received from the user terminal, an utterance information output unit capable of outputting utterance information corresponding to the user utterance information to one or more dialogue engines, and one or more utterance information depending on the output of the utterance information. A response that transmits to the user terminal an automatic response reception unit that receives automatic response information from each dialogue engine and one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit. It is a dialogue control device including a transmission unit.

かかる構成により、複数の対話エンジンを効果的に用いて、適切な対話を継続させることができる。 With such a configuration, a plurality of dialogue engines can be effectively used to continue an appropriate dialogue.

また、本第二の発明の対話制御装置は、第一の発明に対して、発話情報出力部は、発話情報を、２以上の対話エンジンに出力し、自動応答受付部は、発話情報の出力に応じて、２以上の各対話エンジンから自動応答情報を受け付け、２以上の自動応答情報から一の自動応答情報を選択する選択部をさらに具備し、応答送信部は、選択部が選択した自動応答情報に対応する一の自動情報をユーザ端末に送信する対話制御装置である。 Further, in the dialogue control device of the second invention, the utterance information output unit outputs the utterance information to two or more dialogue engines, and the automatic response reception unit outputs the utterance information with respect to the first invention. The response transmission unit further includes a selection unit that receives automatic response information from two or more dialogue engines and selects one automatic response information from two or more automatic response information, and the response transmission unit is an automatic unit selected by the selection unit. It is an interactive control device that transmits one automatic information corresponding to the response information to the user terminal.

また、本第三の発明の対話制御装置は、第二の発明に対して、選択部は、２以上の自動応答情報から、予め決められた条件に合致する一の自動応答情報を選択する対話制御装置である。 Further, in the dialogue control device of the third invention, with respect to the second invention, the selection unit selects one automatic response information that matches a predetermined condition from two or more automatic response information. It is a control device.

また、本第四の発明の対話制御装置は、第二または第三の発明に対して、選択部は、２以上の自動応答情報のうちの予め決められた条件を満たす自動応答情報の中で、優先度情報に基づく優先度が最も高い対話エンジンから出力された自動応答情報を選択する対話制御装置である。 Further, in the dialogue control device of the fourth invention, with respect to the second or third invention, the selection unit is among the automatic response information satisfying the predetermined conditions among the two or more automatic response information. , A dialogue control device that selects the automatic response information output from the dialogue engine with the highest priority based on the priority information.

かかる構成により、複数の対話エンジンから一の応答を適切に選択し、適切な対話を継続させることができる。 With such a configuration, one response can be appropriately selected from a plurality of dialogue engines, and an appropriate dialogue can be continued.

また、本第五の発明の対話制御装置は、第一の発明に対して、２以上の対話エンジンから一の対話エンジンを選択する選択部をさらに具備し、発話情報出力部は、発話情報を、選択部が選択した一の対話エンジンに出力し、自動応答受付部は、発話情報の出力に応じて、一の対話エンジンから自動応答情報を受け付け、応答送信部は、自動応答受付部が受け付けた自動応答情報に対応する一の自動情報をユーザ端末に送信する対話制御装置である。 Further, the dialogue control device of the fifth invention further includes a selection unit for selecting one dialogue engine from two or more dialogue engines with respect to the first invention, and the utterance information output unit outputs utterance information. , The selection unit outputs to one dialogue engine selected, the automatic response reception unit receives the automatic response information from one dialogue engine according to the output of the utterance information, and the response transmission unit receives the automatic response reception unit. It is a dialogue control device that transmits one automatic information corresponding to the automatic response information to the user terminal.

かかる構成により、複数の対話エンジンから一の対話エンジンを適切に選択し、適切な対話を継続させることができる。 With such a configuration, one dialogue engine can be appropriately selected from a plurality of dialogue engines, and an appropriate dialogue can be continued.

また、本第六の発明の対話制御装置は、第五の発明に対して、選択部は、ユーザ発話情報または発話情報を用いて、一の対話エンジンを選択する対話制御装置である。 Further, the dialogue control device of the sixth invention is a dialogue control device for selecting one dialogue engine by using the user utterance information or the utterance information with respect to the fifth invention.

かかる構成により、ユーザの発話を用いて、複数の対話エンジンから一の対話エンジンを適切に選択し、適切な対話を継続させることができる。 With such a configuration, one dialogue engine can be appropriately selected from a plurality of dialogue engines by using the user's utterance, and an appropriate dialogue can be continued.

また、本第七の発明の対話制御装置は、第一から第六いずれか１つの発明に対して、発話情報出力部は、発話情報を管理端末に対しても出力可能であり、発話情報の管理端末への出力に応じて、管理端末からの応答である手動応答情報を受け付ける手動応答受付部をさらに具備し、応答送信部は、一の自動応答情報に対応する情報である一の自動情報または手動応答情報に対応する情報である手動情報をユーザ端末に送信する対話制御装置である。 Further, the dialogue control device of the seventh invention can output the utterance information to the management terminal for any one of the first to sixth inventions, and the utterance information output unit can output the utterance information to the management terminal. It further includes a manual response receiving unit that receives manual response information that is a response from the management terminal in response to output to the management terminal, and the response transmitting unit is one automatic information that is information corresponding to one automatic response information. Alternatively, it is an interactive control device that transmits manual information, which is information corresponding to manual response information, to a user terminal.

かかる構成により、対話エンジンからの応答と管理者からの応答とを使い分けて、ユーザ端末に出力することにより、適切な対話を継続できる。 With such a configuration, an appropriate dialogue can be continued by properly using the response from the dialogue engine and the response from the administrator and outputting them to the user terminal.

また、本第八の発明の対話制御装置は、対話エンジンを採用するか、管理端末を採用するかを決定する決定部をさらに具備し、応答送信部は、決定が対話エンジンを採用するとの決定の場合は、一の自動情報をユーザ端末に送信し、決定が管理端末を採用するとの決定の場合は、手動情報をユーザ端末に送信する請求項７載の対話制御装置である。 Further, the dialogue control device of the eighth invention further includes a decision unit for deciding whether to adopt a dialogue engine or a management terminal, and the response transmission unit determines that the decision adopts the dialogue engine. In the case of, the interactive control device according to claim 7, wherein one automatic information is transmitted to the user terminal, and in the case of the decision to adopt the management terminal, the manual information is transmitted to the user terminal.

また、本第九の発明の対話制御装置は、第八の発明に対して、対話エンジンを採用する自動モード、または管理端末を採用する手動モードのいずれかを、少なくとも採り得るモード情報が格納されるモード格納部と、モード情報を変更するための予め決められた条件を満たすか否かを判断する判断部と、判断部が予め決められた条件を満たすと判断した場合に、モード情報を変更するモード変更部とをさらに具備し、決定部は、モード格納部のモード情報を取得する対話制御装置である。 Further, the dialogue control device of the ninth invention stores mode information that can at least adopt either an automatic mode that employs a dialogue engine or a manual mode that employs a management terminal for the eighth invention. Mode storage unit, a judgment unit that determines whether or not a predetermined condition for changing the mode information is satisfied, and a judgment unit that determines whether or not a predetermined condition is satisfied, the mode information is changed. The mode changing unit is further provided, and the determining unit is an interactive control device that acquires the mode information of the mode storage unit.

また、本第十の発明の対話制御装置は、第八の発明に対して、対話エンジンを採用する自動モード、または管理端末を採用する手動モードのいずれかを、少なくとも採り得るモード情報が格納されるモード格納部と、管理端末からモードの変更の指示である変更指示を受け付ける変更指示受付部と、変更指示に応じて、モード格納部のモード情報を変更するモード変更部とをさらに具備し、決定部は、モード格納部のモード情報に従って、対話エンジンを採用するか、管理端末を採用するかを決定する対話制御装置である。 Further, the dialogue control device of the tenth invention stores mode information that can at least take either an automatic mode adopting a dialogue engine or a manual mode adopting a management terminal for the eighth invention. It further includes a mode storage unit, a change instruction reception unit that receives a change instruction that is a mode change instruction from a management terminal, and a mode change unit that changes the mode information of the mode storage unit in response to the change instruction. The determination unit is a dialogue control device that determines whether to adopt the dialogue engine or the management terminal according to the mode information of the mode storage unit.

また、本第十一の発明の対話制御装置は、第七から第十いずれか１つの発明に対して、一の自動情報を管理端末に出力する自動情報出力部をさらに具備する対話制御装置である。 Further, the dialogue control device of the eleventh invention is a dialogue control device further comprising an automatic information output unit for outputting one automatic information to a management terminal for any one of the seventh to tenth inventions. is there.

また、本第十二の発明の対話制御装置は、第七から第十一いずれか１つの発明に対して、手動情報を対話エンジンに出力する手動情報出力部をさらに具備する対話制御装置である。 Further, the dialogue control device of the twelfth invention is a dialogue control device further provided with a manual information output unit for outputting manual information to the dialogue engine for any one of the seventh to eleventh inventions. ..

かかる構成により、対話エンジンに管理者からの応答を渡すことにより、対話エンジンの能力の向上に寄与でき得る。 With such a configuration, it is possible to contribute to the improvement of the capacity of the dialogue engine by passing the response from the administrator to the dialogue engine.

また、本第十三の発明の対話エンジンは、第一から第十二いずれか１つの発明に対して、発話情報に対応する自動応答情報を取得するための情報である学習情報が格納される学習情報格納部と、対話制御装置から発話情報を受け付ける発話受付部と、発話受付部が受け付けた発話情報に対応する応答である自動応答情報を、学習情報を用いて取得する応答取得部と、応答取得部が取得した自動応答情報を対話制御装置に出力する応答出力部と、他の対話エンジンからの応答である他自動応答情報を受け付ける学習受付部と、学習受付部が受け付けた他自動応答情報を用いて、学習情報に情報を付加する学習部とを具備する対話エンジンである。 Further, the dialogue engine of the thirteenth invention stores learning information which is information for acquiring automatic response information corresponding to speech information for any one of the first to twelfth inventions. A learning information storage unit, a utterance reception unit that receives utterance information from the dialogue control device, a response acquisition unit that acquires automatic response information that is a response corresponding to the utterance information received by the utterance reception unit, and a response acquisition unit that uses the learning information. A response output unit that outputs the automatic response information acquired by the response acquisition unit to the dialogue control device, a learning reception unit that receives other automatic response information that is a response from another dialogue engine, and another automatic response received by the learning reception unit. It is a dialogue engine including a learning unit that adds information to learning information using information.

かかる構成により、他の対話エンジンからの応答を学習できる対話エンジンを提供できる。 With such a configuration, it is possible to provide a dialogue engine capable of learning the response from another dialogue engine.

また、本第十四の発明の対話装置は、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話装置であって、ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、発話情報の出力に応じて、１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報をユーザ端末に送信する応答出力部とを具備する対話装置である。 Further, the dialogue device of the fourteenth invention is a dialogue device capable of receiving automatic response information which is a response corresponding to a user's utterance from two or more dialogue engines, and is information corresponding to the user's utterance information. An utterance information output unit that can output utterance information to one or more dialogue engines, an automatic response reception unit that receives automatic response information from one or more dialogue engines according to the output of utterance information, and an automatic response reception unit. It is an interactive device including a response output unit that transmits one automatic information corresponding to one of the one or more automatic response information received by the unit to the user terminal.

また、本第十五の発明の対話装置は、第十四の発明に対して、発話情報出力部は、ユーザ発話情報に関する発話情報を、１以上の対話エンジンまたは画面のうちの少なくともどちらか一方に出力し、発話情報の前記画面への出力に応じて、管理者からの応答である手動応答情報を受け付ける手動応答受付部をさらに具備し、応答出力部は、自動応答情報に関する自動情報または手動応答情報に関する手動情報を出力する対話装置である。 Further, in the dialogue device of the fifteenth invention, with respect to the fourteenth invention, the utterance information output unit outputs the utterance information regarding the user utterance information to at least one of one or more dialogue engines or screens. The response output unit is further provided with a manual response reception unit that outputs the utterance information to the screen and receives the manual response information that is the response from the administrator, and the response output unit is the automatic information or the manual about the automatic response information. It is an interactive device that outputs manual information about response information.

本発明による対話制御装置によれば、複数の対話エンジンを効果的に用いて、適切な対話を継続させることができる。 According to the dialogue control device according to the present invention, a plurality of dialogue engines can be effectively used to continue an appropriate dialogue.

実施の形態１における対話システムＡの概念図Conceptual diagram of the dialogue system A in the first embodiment 同対話システムＡのブロック図Block diagram of the dialogue system A 同対話制御装置２のブロック図Block diagram of the dialogue control device 2 同対話制御装置２の動作について説明するフローチャートA flowchart for explaining the operation of the dialogue control device 2. 同対話エンジン３の動作について説明するフローチャートFlow chart explaining the operation of the dialogue engine 3 同優先度情報管理表を示す図Diagram showing the same priority information management table 同学習情報管理表を示す図Diagram showing the learning information management table 同Ｆｏｏｄｔａｂｌｅを示す図Diagram showing the same Foodtable 同ユーザ端末１の出力例を示す図The figure which shows the output example of the user terminal 1. 実施の形態２における対話制御装置４のブロック図Block diagram of the dialogue control device 4 according to the second embodiment 同対話制御装置４の動作について説明するフローチャートA flowchart for explaining the operation of the dialogue control device 4. 同選択条件管理表を示す図The figure which shows the selection condition management table 実施の形態３における対話システムＣの概念図Conceptual diagram of the dialogue system C in the third embodiment 同対話システムＣのブロック図Block diagram of the dialogue system C 同対話制御装置５のブロック図Block diagram of the dialogue control device 5 同対話制御装置５の動作について説明するフローチャートA flowchart for explaining the operation of the dialogue control device 5. 同対話エンジン６の動作について説明するフローチャートA flowchart explaining the operation of the dialogue engine 6. 同管理端末７の動作について説明するフローチャートFlow chart explaining the operation of the management terminal 7. 同破綻判断条件管理表を示す図Diagram showing the bankruptcy judgment condition management table 同出力例を示す図Diagram showing the same output example 同手動応答情報管理表を示す図The figure which shows the manual response information management table 実施の形態４における対話装置８のブロック図Block diagram of the dialogue device 8 according to the fourth embodiment 同対話装置８の動作について説明するフローチャートA flowchart illustrating the operation of the dialogue device 8. 上記実施の形態におけるコンピュータシステムの概観図Overview of the computer system according to the above embodiment 同コンピュータシステムのブロック図Block diagram of the computer system

以下、対話制御装置等の実施形態について図面を参照して説明する。なお、実施の形態において同じ符号を付した構成要素は同様の動作を行うので、再度の説明を省略する場合がある。 Hereinafter, embodiments of the dialogue control device and the like will be described with reference to the drawings. In addition, since the components with the same reference numerals perform the same operation in the embodiment, the description may be omitted again.

（実施の形態１）
本実施の形態において、複数の対話エンジンにリクエストを送信し、複数の応答から一の応答を採用する対話システムについて説明する。 (Embodiment 1)
In the present embodiment, a dialogue system that sends a request to a plurality of dialogue engines and adopts one response from the plurality of responses will be described.

本実施の形態において、予め決められた条件に合致する対話エンジンの応答を採用する対話システムについて説明する。 In the present embodiment, a dialogue system that employs the response of the dialogue engine that meets the predetermined conditions will be described.

本実施の形態において、予め決められた条件を満たす応答の中で、対話エンジンに対する優先度が最も高い対話エンジンの応答を採用する対話システムについて説明する。 In the present embodiment, a dialogue system that adopts the response of the dialogue engine having the highest priority to the dialogue engine among the responses satisfying the predetermined conditions will be described.

本実施の形態における対話システムを構成する対話制御装置は、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る装置である。 The dialogue control device constituting the dialogue system in the present embodiment is a device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines.

図１は、本実施の形態における対話システムＡの概念図である。対話システムＡは、１または２以上のユーザ端末１、対話制御装置２、および２以上の対話エンジン３を備える。ユーザ端末１は、例えば、いわゆるパーソナルコンピュータ、タブレット端末、スマートフォン等であり、その種類は問わない。なお、ユーザ端末１は、ユーザが使用する端末である。ユーザは、対話エンジンとの対話を行う人である。対話制御装置２は、対話を制御する装置である。対話制御装置２は、例えば、ＡＳＰのサーバ、クラウドサーバ等である。但し、対話制御装置２の種類は問わない。２以上の各対話エンジン３は、ユーザからの発話の情報を受け付け、応答を返す装置である。対話エンジン３は、例えば、ＡＳＰのサーバ、クラウドサーバ等である。但し、対話エンジン３の種類は問わない。また、対話エンジン３は、発話情報を受け付け、自動応答情報を出力する。対話エンジン３は、公知の対話エンジンでも良い。 FIG. 1 is a conceptual diagram of the dialogue system A according to the present embodiment. The dialogue system A includes one or more user terminals 1, a dialogue control device 2, and two or more dialogue engines 3. The user terminal 1 is, for example, a so-called personal computer, a tablet terminal, a smartphone, or the like, and the type thereof does not matter. The user terminal 1 is a terminal used by the user. A user is a person who interacts with a dialogue engine. The dialogue control device 2 is a device that controls dialogue. The dialogue control device 2 is, for example, an ASP server, a cloud server, or the like. However, the type of the dialogue control device 2 does not matter. Each of the two or more dialogue engines 3 is a device that receives information on utterances from the user and returns a response. The dialogue engine 3 is, for example, an ASP server, a cloud server, or the like. However, the type of the dialogue engine 3 does not matter. Further, the dialogue engine 3 receives the utterance information and outputs the automatic response information. The dialogue engine 3 may be a known dialogue engine.

また、通常、１以上の各ユーザ端末１と対話制御装置２とはインターネット、ＬＡＮ等のネットワークにより通信可能である。また、通常、対話制御装置２と１以上の各対話エンジン３とはインターネット、ＬＡＮ等のネットワークにより通信可能である。ただし、対話制御装置２は、１以上のユーザ端末１、１以上の対話エンジン３のうちの１以上の装置を内部に含んでいても良い。つまり、ユーザ端末１と対話制御装置２と対話エンジン３のうちの２以上の装置は、物理的に一の装置で実現されても良い。 Further, usually, one or more user terminals 1 and the dialogue control device 2 can communicate with each other via a network such as the Internet or LAN. Further, usually, the dialogue control device 2 and each of the dialogue engines 3 of 1 or more can communicate with each other via a network such as the Internet or LAN. However, the dialogue control device 2 may include one or more devices of one or more user terminals 1 and one or more dialogue engines 3 inside. That is, two or more of the user terminal 1, the dialogue control device 2, and the dialogue engine 3 may be physically realized by one device.

図２は、本実施の形態における対話システムＡのブロック図である。図３は、対話制御装置２のブロック図である。 FIG. 2 is a block diagram of the dialogue system A according to the present embodiment. FIG. 3 is a block diagram of the dialogue control device 2.

ユーザ端末１は、ユーザ受付部１１、ユーザ処理部１２、ユーザ送信部１３、ユーザ受信部１４、およびユーザ出力部１５を備える。 The user terminal 1 includes a user reception unit 11, a user processing unit 12, a user transmission unit 13, a user reception unit 14, and a user output unit 15.

対話制御装置２は、格納部２１、受信部２２、処理部２３、出力部２４、受付部２５、および送信部２６を備える。処理部２３は、選択部２３１を備える。出力部２４は、発話情報出力部２４１を備える。受付部２５は、自動応答受付部２５１を備える。送信部２６は、応答送信部２６１を備える。 The dialogue control device 2 includes a storage unit 21, a reception unit 22, a processing unit 23, an output unit 24, a reception unit 25, and a transmission unit 26. The processing unit 23 includes a selection unit 231. The output unit 24 includes an utterance information output unit 241. The reception unit 25 includes an automatic response reception unit 251. The transmission unit 26 includes a response transmission unit 261.

対話エンジン３は、学習情報格納部３１、発話受付部３２、応答取得部３３、応答出力部３４、学習受付部３５、および学習部３６を備える。 The dialogue engine 3 includes a learning information storage unit 31, an utterance reception unit 32, a response acquisition unit 33, a response output unit 34, a learning reception unit 35, and a learning unit 36.

ユーザ端末１を構成するユーザ受付部１１は、ユーザからの発話を受け付ける。発話の受け付けは、音声での受け付けでも良いし、テキストでの受け付けでも良い。発話は、ユーザが発した情報、ユーザが入力した情報であれば良い。発話の入力手段は、マイク、キーボード、マウス、メニュー画面によるもの等、何でも良い。ユーザ受付部１１は、マイクやキーボード等の入力手段のデバイスドライバーや、メニュー画面の制御ソフトウェア等で実現され得る。 The user reception unit 11 constituting the user terminal 1 receives an utterance from the user. The utterance may be accepted by voice or by text. The utterance may be information uttered by the user or information input by the user. The utterance input means may be any, such as a microphone, a keyboard, a mouse, and a menu screen. The user reception unit 11 can be realized by a device driver for input means such as a microphone or a keyboard, control software for a menu screen, or the like.

ユーザ処理部１２は、ユーザ受付部１１が受け付けた発話から、当該発話に対応するユーザ発話情報を取得する。ユーザ処理部１２は、例えば、音声である発話を音声認識し、テキストであるユーザ発話情報を取得する。なお、ユーザ発話情報は、ユーザが入力した発話と同じでも良い。 The user processing unit 12 acquires the user utterance information corresponding to the utterance from the utterance received by the user reception unit 11. For example, the user processing unit 12 voice-recognizes utterances that are voices and acquires user utterance information that is text. The user utterance information may be the same as the utterance input by the user.

また、ユーザ処理部１２は、ユーザ受信部１４が受信した自動情報がテキストの場合、当該を自動情報から音声データを生成しても良い。 Further, when the automatic information received by the user receiving unit 14 is text, the user processing unit 12 may generate voice data from the automatic information.

ユーザ送信部１３は、ユーザ処理部１２が取得したユーザ発話情報を対話制御装置２に送信する。 The user transmission unit 13 transmits the user utterance information acquired by the user processing unit 12 to the dialogue control device 2.

ユーザ受信部１４は、後述する自動情報を受信する。自動情報は、通常、対話制御装置２から送信される。 The user receiving unit 14 receives the automatic information described later. The automatic information is usually transmitted from the dialogue control device 2.

ユーザ出力部１５は、ユーザ受信部１４が受信した自動情報を出力する。ユーザ出力部１５は、ユーザ処理部１２が自動情報から取得した音声データを音声出力しても良い。なお、かかる場合も、自動情報を出力したものとする。また、ここでの出力とは、通常、ディスプレイへの表示、またはスピーカーへの音声出力であるが、プロジェクターを用いた投影、プリンタでの印字、外部の装置への送信、記録媒体への蓄積、他の処理装置や他のプログラムなどへの処理結果の引渡しなどを含む概念である、と考えても良い。 The user output unit 15 outputs the automatic information received by the user reception unit 14. The user output unit 15 may output the voice data acquired from the automatic information by the user processing unit 12. Even in such a case, it is assumed that the automatic information is output. Further, the output here is usually a display on a display or an audio output to a speaker, but projection using a projector, printing by a printer, transmission to an external device, storage on a recording medium, It may be considered that the concept includes delivery of the processing result to another processing device or another program.

対話制御装置２を構成する格納部２１は、各種の情報が格納される。各種の情報とは、例えば、後述する自動応答情報の選択条件、後述する優先度情報、対話の履歴、２以上の各対話エンジン３と通信を行うための情報（例えば、対話エンジン３のＩＰアドレス、ＵＲＬ、ＭＡＣアドレス等）である。対話の履歴は、例えば、ユーザ発話情報、発話情報、自動応答情報、自動情報等である。 Various types of information are stored in the storage unit 21 that constitutes the dialogue control device 2. The various types of information include, for example, selection conditions for automatic response information described later, priority information described later, dialogue history, and information for communicating with each of two or more dialogue engines 3 (for example, the IP address of the dialogue engine 3). , URL, MAC address, etc.). The dialogue history is, for example, user utterance information, utterance information, automatic response information, automatic information, and the like.

受信部２２は、ユーザ発話情報をユーザ端末１から受信する。また、ユーザ発話情報は、通常、テキストであるが、音声データでも良い。ユーザ発話情報のデータ構造やータタイプは問わない。 The receiving unit 22 receives the user utterance information from the user terminal 1. Further, the user utterance information is usually text, but voice data may also be used. The data structure and data type of user utterance information do not matter.

処理部２３は、各種の処理を行う。各種の処理とは、選択部２３１が行う処理等である。 The processing unit 23 performs various processes. The various processes are processes performed by the selection unit 231 and the like.

選択部２３１は、２以上の自動応答情報から一の自動応答情報を選択する。かかる選択方法は、ランダムでも良い。ランダムとは、２以上の自動応答情報から任意に一の自動応答情報を選択することである。 The selection unit 231 selects one automatic response information from two or more automatic response information. Such a selection method may be random. Random is to arbitrarily select one automatic response information from two or more automatic response information.

また、選択部２３１は、２以上の自動応答情報から、予め決められた条件に合致する一の自動応答情報を選択する。予め決められた条件は、例えば、スコアに基づく条件、自動応答情報の内容に基づく条件、時間に基づく条件、優先度に基づく条件等、またはこれらの組み合わせである。スコアに基づく条件は、例えば、自動応答情報に対応付いているスコアが最も高い、という条件である。また、自動応答情報の内容に基づく条件とは、例えば、最も長い文字列の自動応答情報、または文の数が最も多い自動応答情報、という条件である。また、時間に基づく条件とは、例えば、最も早く受け付けられた自動応答情報である、という条件である。また、優先度に基づく条件とは、例えば、閾値の時間内に受け付けられた自動応答情報の中で、優先度情報に基づく優先度が最も高い対話エンジン３から出力された自動応答情報である、という条件である。なお、かかる条件は問わない。優先度に基づく場合、通常、格納部２１は優先度情報を格納している。優先度情報とは、２以上の対話エンジン３の優先度を示す情報である。優先度情報は、例えば、優先度の高い順に並べられた対話エンジン３の識別子である。優先度情報は、例えば、優先度（例えば、数値）と対話エンジン３の識別子との組を、２組以上有する情報である。 Further, the selection unit 231 selects one automatic response information that matches a predetermined condition from two or more automatic response information. The predetermined conditions are, for example, a score-based condition, a content-based condition of the automatic response information, a time-based condition, a priority-based condition, and the like, or a combination thereof. The condition based on the score is, for example, the condition that the score corresponding to the automatic response information is the highest. Further, the condition based on the content of the automatic response information is, for example, the automatic response information of the longest character string or the automatic response information having the largest number of sentences. The time-based condition is, for example, the condition that the automatic response information is received earliest. Further, the priority-based condition is, for example, the automatic response information output from the dialogue engine 3 having the highest priority based on the priority information among the automatic response information received within the threshold time. It is a condition. The conditions are not limited. When based on the priority, the storage unit 21 usually stores the priority information. The priority information is information indicating the priority of two or more dialogue engines 3. The priority information is, for example, an identifier of the dialogue engine 3 arranged in descending order of priority. The priority information is, for example, information having two or more sets of a priority (for example, a numerical value) and an identifier of the dialogue engine 3.

選択部２３１は、２以上の自動応答情報のうちの予め決められた条件を満たす自動応答情報の中で、優先度情報に基づく優先度が最も高い対話エンジン３から出力された自動応答情報を選択する。ここで、予め決められた条件を満たす自動応答情報とは、例えば、対話の破綻を示す情報（例えば、「理解できない」「理解不能」など）を有しない自動応答情報、スコアが閾値以上の自動応答情報等である。 The selection unit 231 selects the automatic response information output from the dialogue engine 3 having the highest priority based on the priority information among the automatic response information satisfying the predetermined conditions among the two or more automatic response information. To do. Here, the automatic response information satisfying the predetermined conditions is, for example, automatic response information having no information indicating the failure of the dialogue (for example, "incomprehensible" or "incomprehensible"), and automatic response with a score equal to or higher than the threshold value. Response information, etc.

なお、選択部２３１が２以上の自動応答情報の中から一の自動応答情報を選択するアルゴリズム（上記の予め決められた条件）は問わない。 The algorithm (predetermined condition described above) in which the selection unit 231 selects one automatic response information from two or more automatic response information does not matter.

出力部２４は、各種の情報を出力する。ここでの出力とは、通常、送信である。ただし、出力とは、情報が出力先に渡れば良い。 The output unit 24 outputs various information. The output here is usually a transmission. However, the output means that the information should be passed to the output destination.

発話情報出力部２４１は、ユーザ発話情報に対応する情報である発話情報を、１以上の各対話エンジン３に出力する。ここでは、主に、発話情報出力部２４１が、発話情報を、２以上の各対話エンジン３に出力する場合について説明する。ここで、発話情報は、ユーザ発話情報と同じ情報でも良い。また、発話情報は、ユーザ発話情報を音声認識し、得られたテキストでも良い。発話情報は、発話識別子が付加された情報であることは好適である。なお、発話識別子は、発話を識別する情報である。 The utterance information output unit 241 outputs utterance information, which is information corresponding to the user utterance information, to each of one or more dialogue engines 3. Here, a case where the utterance information output unit 241 mainly outputs the utterance information to each of two or more dialogue engines 3 will be described. Here, the utterance information may be the same information as the user utterance information. Further, the utterance information may be a text obtained by voice-recognizing the user utterance information. It is preferable that the utterance information is information to which the utterance identifier is added. The utterance identifier is information that identifies the utterance.

自動応答情報出力部２４２は、選択部２３１が選択した一の自動応答情報を、当該一の自動応答情報を出力した対話エンジン３以外の１以上の対話エンジン３に出力する。なお、自動応答情報出力部２４２は、自動応答受付部２５１が受け付けた１以上の自動応答情報を、自身が出力した自動応答情報を除いて、他の全ての対話エンジン３に出力しても良いし、他の対話エンジン３の中の一部の対話エンジン３に出力しても良い。また、自動応答情報出力部２４２が自動応答情報を対話エンジン３に出力するのは、対話エンジン３の学習のためである。 The automatic response information output unit 242 outputs one automatic response information selected by the selection unit 231 to one or more dialogue engines 3 other than the dialogue engine 3 that outputs the one automatic response information. The automatic response information output unit 242 may output one or more automatic response information received by the automatic response reception unit 251 to all other dialogue engines 3 except for the automatic response information output by itself. However, it may be output to some of the dialogue engines 3 among the other dialogue engines 3. Further, the automatic response information output unit 242 outputs the automatic response information to the dialogue engine 3 for learning of the dialogue engine 3.

受付部２５は、各種の情報や指示等を受け付ける。ここでの受け付けとは、通常、受信である。但し、受け付けは、情報等を受け取れれば良い。 The reception unit 25 receives various information, instructions, and the like. Acceptance here is usually reception. However, the reception may be as long as the information etc. is received.

自動応答受付部２５１は、発話情報の対話エンジン３への出力に応じて、対話エンジン３からの応答である１または２以上の自動応答情報を受け付ける。通常、自動応答受付部２５１は、発話情報の出力に応じて、２以上の各対話エンジンから自動応答情報を受け付ける。なお、ここでの受け付けとは、例えば、受信である。 The automatic response reception unit 251 receives one or two or more automatic response information which is a response from the dialogue engine 3 according to the output of the utterance information to the dialogue engine 3. Normally, the automatic response reception unit 251 receives automatic response information from two or more dialogue engines according to the output of the utterance information. The reception here is, for example, reception.

送信部２６は、各種の情報を送信する。各種の情報とは、例えば、自動情報等である。 The transmission unit 26 transmits various types of information. The various types of information are, for example, automatic information and the like.

応答送信部２６１は、自動応答受付部２５１が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報をユーザ端末１に送信する。 The response transmission unit 261 transmits one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit 251 to the user terminal 1.

応答送信部２６１は、通常、選択部２３１が選択した自動応答情報に対応する一の自動情報をユーザ端末１に送信する。 The response transmission unit 261 normally transmits one automatic information corresponding to the automatic response information selected by the selection unit 231 to the user terminal 1.

対話エンジン３を構成する学習情報格納部３１は、１または２以上の学習情報を格納している。なお、単に「学習情報」と言うときは、２以上の学習情報でも良い。また、学習情報とは、発話情報に対応する自動応答情報を取得するための情報である。学習情報は、例えば、発話情報と自動応答情報との組である。また、学習情報は、例えば、発話情報の文字列パターンと自動応答情報の文字列パターンとの組である。また、学習情報は、例えば、会話の状態遷移を示す情報（例えば、特許文献２参照）等でも良い。また、学習情報は、単語の辞書等を含んでも良い。なお、学習情報のデータ構造や内容は問わないことは言うまでもない。 The learning information storage unit 31 constituting the dialogue engine 3 stores one or more learning information. When the term "learning information" is simply used, two or more learning information may be used. Further, the learning information is information for acquiring the automatic response information corresponding to the utterance information. The learning information is, for example, a set of utterance information and automatic response information. Further, the learning information is, for example, a set of a character string pattern of utterance information and a character string pattern of automatic response information. Further, the learning information may be, for example, information indicating a state transition of conversation (see, for example, Patent Document 2). Further, the learning information may include a dictionary of words and the like. Needless to say, the data structure and contents of the learning information do not matter.

発話受付部３２は、対話制御装置２から発話情報を受け付ける。発話受付部３２は、通常、対話制御装置２から発話情報を受信する。 The utterance reception unit 32 receives utterance information from the dialogue control device 2. The utterance reception unit 32 usually receives utterance information from the dialogue control device 2.

応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。なお、かかる応答取得部３３の処理は公知技術である。例えば、応答取得部３３は、発話受付部３２が受け付けた発話情報と対になる自動応答情報を学習情報格納部３１から検索する。また、例えば、応答取得部３３は、発話受付部３２が受け付けた発話情報に合致する発話情報の文字列パターンを検知し、当該発話情報の文字列パターンと対になる自動応答情報の文字列パターンを取得し、当該自動応答情報の文字列パターンの変数部分に発話情報から抽出した単語または句を代入して、自動応答情報を構成する。 The response acquisition unit 33 acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit 32, by using the learning information of the learning information storage unit 31. The processing of the response acquisition unit 33 is a known technique. For example, the response acquisition unit 33 searches the learning information storage unit 31 for automatic response information paired with the utterance information received by the utterance reception unit 32. Further, for example, the response acquisition unit 33 detects a character string pattern of the utterance information that matches the utterance information received by the utterance reception unit 32, and the character string pattern of the automatic response information paired with the character string pattern of the utterance information. Is acquired, and the word or phrase extracted from the utterance information is substituted into the variable part of the character string pattern of the automatic response information to construct the automatic response information.

応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置２に出力する。ここでの出力は、通常、送信である。ただし、出力とは、対話制御装置２に自動応答情報が渡されれば良い。 The response output unit 34 outputs the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2. The output here is usually a transmission. However, as for the output, the automatic response information may be passed to the dialogue control device 2.

学習受付部３５は、他の対話エンジン３からの応答である自動応答情報を受け付ける。ここでの自動応答情報の受け付けは、自動情報の受け付けでも良い趣旨である。 The learning reception unit 35 receives automatic response information which is a response from another dialogue engine 3. The acceptance of the automatic response information here is intended to accept the automatic information.

学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、学習処理を行う。学習処理は、例えば、自動応答情報を用いて、学習情報格納部３１の学習情報に情報を付加することである。学習処理は、例えば、自動応答情報を用いて、学習情報格納部３１の学習情報を変更することである。学習情報に情報を付加する処理は、学習情報格納部３１に新しい学習情報を追記する処理を含む。 The learning unit 36 performs learning processing using the automatic response information received by the learning reception unit 35. The learning process is, for example, adding information to the learning information of the learning information storage unit 31 by using the automatic response information. The learning process is, for example, changing the learning information of the learning information storage unit 31 by using the automatic response information. The process of adding information to the learning information includes a process of adding new learning information to the learning information storage unit 31.

学習部３６は、例えば、自動応答情報と当該自動応答情報に対応する発話情報の組である学習情報を学習情報格納部３１に追記する。 The learning unit 36 adds, for example, learning information, which is a set of automatic response information and utterance information corresponding to the automatic response information, to the learning information storage unit 31.

学習部３６は、例えば、他の対話エンジン３からの自動応答情報と当該自動応答情報に対応する発話情報の組の情報から、状態遷移情報である学習情報に、当該発話情報の前の自動応答情報のノード（状態）の先の状態として、発話情報のノードを付加し、当該発話情報のノードに、自動応答情報のノードを付加する。 For example, the learning unit 36 converts the information of the set of the automatic response information from the other dialogue engine 3 and the utterance information corresponding to the automatic response information into the learning information which is the state transition information, and automatically responds before the utterance information. A utterance information node is added as a state ahead of the information node (state), and an automatic response information node is added to the utterance information node.

学習部３６は、例えば、他の対話エンジン３からの自動応答情報から自動応答情報のパターンを構成し、かつ当該自動応答情報に対応する発話情報から発話情報のパターンを構成し、発話情報のパターンと応答のパターンとの組である学習情報を学習情報格納部３１に追記する。 For example, the learning unit 36 constitutes an automatic response information pattern from the automatic response information from the other dialogue engine 3, and constitutes an utterance information pattern from the utterance information corresponding to the automatic response information, and forms an utterance information pattern. The learning information, which is a set of the response pattern and the response pattern, is added to the learning information storage unit 31.

学習部３６は、例えば、他の対話エンジン３からの自動応答情報から自動応答情報のパターンを構成し、かつ当該自動応答情報に対応する発話情報から発話情報のパターンを構成し、当該パターンの組に合致する、発話情報のパターンと自動応答情報のパターンの組の出現頻度または出現確率を増加させるように、学習情報格納部３１の学習情報を更新する。かかる場合、学習情報は、発話情報のパターンと自動応答情報のパターンの組、および出現頻度または出現確率を有する。 For example, the learning unit 36 configures a pattern of automatic response information from automatic response information from another dialogue engine 3, and configures a pattern of utterance information from utterance information corresponding to the automatic response information, and sets the pattern. The learning information of the learning information storage unit 31 is updated so as to increase the appearance frequency or appearance probability of the pair of the utterance information pattern and the automatic response information pattern that matches. In such a case, the learning information has a set of a pattern of utterance information and a pattern of automatic response information, and an appearance frequency or appearance probability.

なお、学習部３６は、例えば、発話情報に対応する発話識別子と自動応答情報に対応する発話識別子とが同一の、発話情報と自動応答情報とを用いて、上記の学習処理を行う。 The learning unit 36 performs the above learning process using, for example, the utterance information and the automatic response information in which the utterance identifier corresponding to the utterance information and the utterance identifier corresponding to the automatic response information are the same.

また、他の対話エンジン３からの自動応答情報を用いた学習処理は種々あり得る。他の対話エンジン３からの自動応答情報を用いる点を除いて、学習部３６が行う学習処理は他の公知の学習処理のアルゴリズムを用いても良い。 In addition, there may be various learning processes using the automatic response information from the other dialogue engine 3. The learning process performed by the learning unit 36 may use another known learning processing algorithm, except that the automatic response information from the other dialogue engine 3 is used.

ユーザ処理部１２、処理部２３、選択部２３１、応答取得部３３、および学習部３６は、通常、ＭＰＵやメモリ等から実現され得る。ユーザ処理部１２の処理手順は、通常、ソフトウェアで実現され、当該ソフトウェアはＲＯＭ等の記録媒体に記録されている。但し、ハードウェア（専用回路）で実現しても良い。 The user processing unit 12, the processing unit 23, the selection unit 231 and the response acquisition unit 33, and the learning unit 36 can usually be realized from an MPU, a memory, or the like. The processing procedure of the user processing unit 12 is usually realized by software, and the software is recorded in a recording medium such as a ROM. However, it may be realized by hardware (dedicated circuit).

ユーザ送信部１３、出力部２４、発話情報出力部２４１、送信部２６、応答送信部２６１、および応答出力部３４は、通常、無線または有線の通信手段で実現されるが、放送手段で実現されても良い。 The user transmission unit 13, the output unit 24, the utterance information output unit 241, the transmission unit 26, the response transmission unit 261 and the response output unit 34 are usually realized by wireless or wired communication means, but are realized by broadcasting means. You may.

なお、出力部２４、発話情報出力部２４１、自動情報出力部２４２、および応答出力部３４は、情報を出力する出力手段で実現されても良い。ここでの情報の出力とは、例えば、プログラムによる情報の引き渡しである。 The output unit 24, the utterance information output unit 241, the automatic information output unit 242, and the response output unit 34 may be realized by output means for outputting information. The output of information here is, for example, the delivery of information by a program.

ユーザ受信部１４、受信部２２、受付部２５、自動応答受付部２５１、発話受付部３２、および学習受付部３５は、通常、無線または有線の通信手段で実現されるが、放送を受信する手段で実現されても良い。 The user receiving unit 14, the receiving unit 22, the receiving unit 25, the automatic response receiving unit 251, the utterance receiving unit 32, and the learning receiving unit 35 are usually realized by wireless or wired communication means, but are means for receiving broadcasts. It may be realized by.

なお、受付部２５、自動応答受付部２５１、発話受付部３２、および学習受付部３５は、情報を受け付ける受付手段で実現されても良い。ここでの情報の受け付けとは、例えば、プログラムによる情報の引き渡しである。 The reception unit 25, the automatic response reception unit 251 and the utterance reception unit 32, and the learning reception unit 35 may be realized by the reception means for receiving information. The acceptance of information here is, for example, the delivery of information by a program.

ユーザ出力部１５は、ディスプレイやスピーカー等の出力デバイスを含むと考えても含まないと考えても良い。ユーザ出力部１５は、出力デバイスのドライバーソフトまたは、出力デバイスのドライバーソフトと出力デバイス等で実現され得る。 The user output unit 15 may or may not include an output device such as a display or a speaker. The user output unit 15 can be realized by the driver software of the output device, the driver software of the output device, the output device, or the like.

格納部２１、および学習情報格納部３１は、不揮発性の記録媒体が好適であるが、揮発性の記録媒体でも実現可能である。 The storage unit 21 and the learning information storage unit 31 are preferably non-volatile recording media, but can also be realized by volatile recording media.

格納部２１等に情報が記憶される過程は問わない。例えば、記録媒体を介して情報が格納部２１等で記憶されるようになってもよく、通信回線等を介して送信された情報が格納部２１等で記憶されるようになってもよく、あるいは、入力デバイスを介して入力された情報が格納部２１等で記憶されるようになってもよい。 The process of storing information in the storage unit 21 or the like does not matter. For example, the information may be stored in the storage unit 21 or the like via the recording medium, or the information transmitted via the communication line or the like may be stored in the storage unit 21 or the like. Alternatively, the information input via the input device may be stored in the storage unit 21 or the like.

次に、対話システムＡの動作について説明する。まず、ユーザ端末１の動作例について説明する。 Next, the operation of the dialogue system A will be described. First, an operation example of the user terminal 1 will be described.

ユーザ端末１のユーザ受付部１１は、ユーザからの発話を受け付ける。次に、ユーザ処理部１２は、ユーザ受付部１１が受け付けた発話から、当該発話に対応するユーザ発話情報を取得する。ここで、ユーザ処理部１２は、例えば、音声である発話を音声認識し、テキストであるユーザ発話情報を取得する。次に、ユーザ送信部１３は、ユーザ処理部１２が取得したユーザ発話情報を対話制御装置２に送信する。次に、ユーザ発話情報の送信に応じて、ユーザ受信部１４は、自動情報を受信する。なお、自動情報は、ユーザの発話に対する応答の情報である。次に、ユーザ出力部１５は、ユーザ受信部１４が受信した自動情報を出力する。以上の処理を繰り返し、ユーザと対話エンジン３との対話が継続される。なお、自動情報がテキストの場合、ユーザ出力部１５は、当該テキストを音声合成し、音声出力しても良い。また、ユーザの指示等により、対話は終了する。 The user reception unit 11 of the user terminal 1 receives an utterance from the user. Next, the user processing unit 12 acquires the user utterance information corresponding to the utterance from the utterance received by the user reception unit 11. Here, the user processing unit 12 recognizes a voice utterance, for example, and acquires the text user utterance information. Next, the user transmission unit 13 transmits the user utterance information acquired by the user processing unit 12 to the dialogue control device 2. Next, in response to the transmission of the user utterance information, the user receiving unit 14 receives the automatic information. The automatic information is information on the response to the user's utterance. Next, the user output unit 15 outputs the automatic information received by the user reception unit 14. By repeating the above process, the dialogue between the user and the dialogue engine 3 is continued. When the automatic information is text, the user output unit 15 may voice-synthesize the text and output it as voice. In addition, the dialogue ends according to the user's instruction or the like.

次に、対話制御装置２の動作について、図４のフローチャートを用いて説明する。 Next, the operation of the dialogue control device 2 will be described with reference to the flowchart of FIG.

（ステップＳ４０１）受信部２２は、ユーザ発話情報をユーザ端末１から受信したか否かを判断する。ユーザ発話情報を受信した場合はステップＳ４０２に行き、受信しない場合はステップＳ４０１に戻る。 (Step S401) The receiving unit 22 determines whether or not the user utterance information has been received from the user terminal 1. If the user utterance information is received, the process proceeds to step S402, and if not received, the process returns to step S401.

（ステップＳ４０２）処理部２３は、ユーザ発話情報から発話情報を取得する。 (Step S402) The processing unit 23 acquires the utterance information from the user utterance information.

（ステップＳ４０３）出力部２４は、発話情報の２以上の出力先を取得する。かかる出力先は、２以上の対話エンジン３であり、例えば、格納部２１に格納されている。出力先は、例えば、後述する通信先識別子である。 (Step S403) The output unit 24 acquires two or more output destinations of the utterance information. Such an output destination is two or more dialogue engines 3, and is stored in, for example, a storage unit 21. The output destination is, for example, a communication destination identifier described later.

（ステップＳ４０４）発話情報出力部２４１は、ステップＳ４０３で取得された２以上の出力先に、ステップＳ４０２で取得された発話情報を出力する。 (Step S404) The utterance information output unit 241 outputs the utterance information acquired in step S402 to two or more output destinations acquired in step S403.

（ステップＳ４０５）受付部２５は、２以上の対話エンジン３への発話情報の出力に応じて、１以上の自動応答情報を受け付ける。なお、受付部２５は、発話情報の出力から閾値以内の時間のみ、自動応答情報を受け付けることは好適である。 (Step S405) The reception unit 25 receives one or more automatic response information in response to the output of the utterance information to the two or more dialogue engines 3. It is preferable that the reception unit 25 receives the automatic response information only for a time within the threshold value from the output of the utterance information.

（ステップＳ４０６）選択部２３１は、ステップＳ４０５で受け付けられた１以上の自動応答情報から一の自動応答情報を選択する。なお、ステップＳ４０５で受け付けられた自動応答情報が一つの場合、本ステップは省略し得る。 (Step S406) The selection unit 231 selects one automatic response information from one or more automatic response information received in step S405. If there is only one automatic response information received in step S405, this step may be omitted.

（ステップＳ４０７）処理部２３は、ステップＳ４０６で選択された自動応答情報から自動情報を取得する。 (Step S407) The processing unit 23 acquires automatic information from the automatic response information selected in step S406.

（ステップＳ４０８）応答送信部２６１は、ステップＳ４０７で取得された自動情報をユーザ端末１に送信する。 (Step S408) The response transmission unit 261 transmits the automatic information acquired in step S407 to the user terminal 1.

（ステップＳ４０９）自動応答情報出力部２４２は、ステップＳ４０６で選択された一の自動応答情報を、当該一の自動応答情報を出力した対話エンジン３以外の対話エンジン３に出力する。ステップＳ４０１に戻る。 (Step S409) The automatic response information output unit 242 outputs the one automatic response information selected in step S406 to the dialogue engine 3 other than the dialogue engine 3 that outputs the one automatic response information. Return to step S401.

なお、図４のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 4, the process ends when the power is turned off or an interrupt for the end of the process occurs.

次に、対話エンジン３の動作について、図５のフローチャートを用いて説明する。 Next, the operation of the dialogue engine 3 will be described with reference to the flowchart of FIG.

（ステップＳ５０１）発話受付部３２は、対話制御装置２から発話情報を受け付けたか否かを判断する。発話情報を受け付けた場合はステップＳ５０２に行き、発話情報を受け付けない場合はステップＳ５０４に行く。 (Step S501) The utterance reception unit 32 determines whether or not the utterance information has been received from the dialogue control device 2. If the utterance information is accepted, the process goes to step S502, and if the utterance information is not accepted, the process goes to step S504.

（ステップＳ５０２）応答取得部３３は、ステップＳ５０１で受け付けられた発話情報に対応する自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。 (Step S502) The response acquisition unit 33 acquires the automatic response information corresponding to the utterance information received in step S501 by using the learning information of the learning information storage unit 31.

（ステップＳ５０３）応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置２に出力する。 (Step S503) The response output unit 34 outputs the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2.

（ステップＳ５０４）学習受付部３５は、学習対象の自動応答情報を受け付けたか否かを判断する。自動応答情報を受け付けた場合はステップＳ５０５に行き、自動応答情報を受け付けない場合はステップＳ５０１に戻る。 (Step S504) The learning reception unit 35 determines whether or not the automatic response information of the learning target has been received. If the automatic response information is accepted, the process proceeds to step S505, and if the automatic response information is not accepted, the process returns to step S501.

（ステップＳ５０５）学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、学習情報に、追記する情報を構成する。 (Step S505) The learning unit 36 configures information to be added to the learning information by using the automatic response information received by the learning reception unit 35.

（ステップＳ５０６）学習部３６は、ステップＳ５０５で構成した情報を学習情報格納部３１に追記する。ステップＳ５０１に戻る。 (Step S506) The learning unit 36 adds the information configured in step S505 to the learning information storage unit 31. Return to step S501.

なお、図５のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 5, the process ends when the power is turned off or an interrupt for the end of the process occurs.

以下、本実施の形態における対話システムＡの具体的な動作について説明する。本具体例において、対話システムＡは、３つの対話エンジンを具備する、とする。なお、対話システムＡの概念図は図１である。 Hereinafter, the specific operation of the dialogue system A in the present embodiment will be described. In this specific example, it is assumed that the dialogue system A includes three dialogue engines. The conceptual diagram of the dialogue system A is FIG.

今、対話制御装置２の格納部２１は、図６に示す優先度情報管理表を格納している。優先度情報管理表は、各対話エンジン３の優先度等を管理する表である。優先度情報管理表は、「優先度」「対話エンジン識別子」「通知先識別子」を有する３つのレコードを有する。「優先度」は小さい数値ほど、優先度が高いことを示す。「通知先識別子」は、ここではＩＰアドレスであるが、対話制御装置２が対話エンジン３と通信するための情報であれば何でも良い。 Now, the storage unit 21 of the dialogue control device 2 stores the priority information management table shown in FIG. The priority information management table is a table that manages the priority and the like of each dialogue engine 3. The priority information management table has three records having "priority", "interaction engine identifier", and "notification destination identifier". The smaller the "priority" is, the higher the priority is. The "notification destination identifier" is an IP address here, but may be any information as long as it is information for the dialogue control device 2 to communicate with the dialogue engine 3.

また、３つの対話エンジン３の中の一の対話エンジン３（例えば、「ｅ１」で識別される対話エンジン３。なお、以後、かかる対話エンジンを対話エンジン「ｅ１」とする。）を構成する学習情報格納部３１は、図７に示す学習情報管理表を格納している、とする。学習情報管理表は、「ＩＤ」「学習情報」を有する１以上のレコードを管理している。「ＩＤ」はレコードを識別する情報である。「学習情報」は「発話情報」と「自動応答情報」との対である。「発話情報」「自動応答情報」は、定数（固定的な文字列）でも良いし、変数を有する情報でも良い。また、「自動応答情報」は、プログラムや検索式（ここでは、例えば、ＳＱＬ文であるが、他の形式でも構わない）などの実行可能な情報でも良い。学習情報の中の｛｝は変数を示し、学習情報の中の＊は０文字以上の文字を示す。また、自動応答情報の中の＄は変数であり、学習情報格納部３１に値が保持されている。なお、「＄営業日」の値は、ここでは「日曜日、火曜日、水曜日、木曜日、金曜日、土曜日」である、とする。また、自動応答情報の中の「ｉｎｃｌｕｄｅ」は、含まれるか否かの論理演算子である、とする。つまり、「｛ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ｝ｉｎｃｌｕｄｅ＄営業日」は、変数「ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ」の情報が変数「営業日」の中に含まれるか否かを判断することを示す。そして、含まれる場合は真「１」を、含まれない場合は偽「０」を返す、とする。 Further, learning that constitutes one of the three dialogue engines 3 (for example, the dialogue engine 3 identified by "e1"; hereinafter, such a dialogue engine is referred to as the dialogue engine "e1"). It is assumed that the information storage unit 31 stores the learning information management table shown in FIG. 7. The learning information management table manages one or more records having "ID" and "learning information". The "ID" is information that identifies a record. "Learning information" is a pair of "utterance information" and "automatic response information". The "utterance information" and "automatic response information" may be constants (fixed character strings) or information having variables. Further, the "automatic response information" may be executable information such as a program or a search expression (here, for example, it is an SQL statement, but other formats may be used). {} In the learning information indicates a variable, and * in the learning information indicates 0 or more characters. Further, $ in the automatic response information is a variable, and the value is held in the learning information storage unit 31. The value of "$ business day" is assumed to be "Sunday, Tuesday, Wednesday, Thursday, Friday, Saturday" here. Further, it is assumed that "include" in the automatic response information is a logical operator of whether or not it is included. That is, "{timeExpress} include $ business day" indicates whether or not the information of the variable "timeExpression" is included in the variable "business day". Then, if it is included, a true "1" is returned, and if it is not included, a false "0" is returned.

また、対話エンジン「ｅ１」の学習情報格納部３１は、図８に示すＦｏｏｄｔａｂｌｅを格納している、とする。Ｆｏｏｄｔａｂｌｅは、「ＩＤ」「料理」「作り方」を有するレコードを１以上格納している。 Further, it is assumed that the learning information storage unit 31 of the dialogue engine "e1" stores the Foodtable shown in FIG. Foodtable stores one or more records having "ID", "cooking", and "how to make".

さらに、対話エンジン「ｅ１」の学習情報格納部３１は、料理名を格納しているｆｏｏｄ辞書、作物名を格納しているｃｒｏｐｓ辞書を有する、とする。ｆｏｏｄ辞書には、「カツ丼、牛丼、親子丼、・・・」等の多数の料理名が格納されている。また、ｃｒｏｐｓ辞書には、「お米、ナス、にんじん、・・・」等の多数の作物名が格納されている。 Further, it is assumed that the learning information storage unit 31 of the dialogue engine "e1" has a food dictionary for storing the dish name and a crops dictionary for storing the crop name. The food dictionary stores a large number of dish names such as "Katsudon, Gyudon, Oyakodon, ...". In addition, a large number of crop names such as "rice, eggplant, carrot, ..." are stored in the crops dictionary.

かかる状況において、以下の３つの具体例について説明する。具体例１は、優先度に基づいて自動応答情報を選択する場合である。さらに具体的には、具体例１は、対話制御装置２が、予め決められた条件を満たす自動応答情報の中で、優先度情報に基づく優先度が最も高い対話エンジン３から出力された自動応答情報を選択する場合である。また、ここでの予め決められた条件は、例えば、対話エンジン３からの応答である自動応答情報に予め決められた用語（例えば、「理解できなかった」「理解不能」）を含まないことである。 In such a situation, the following three specific examples will be described. Specific example 1 is a case where the automatic response information is selected based on the priority. More specifically, in the specific example 1, the dialogue control device 2 outputs an automatic response output from the dialogue engine 3 having the highest priority based on the priority information among the automatic response information satisfying the predetermined conditions. When selecting information. Further, the predetermined condition here is that, for example, the automatic response information, which is the response from the dialogue engine 3, does not include predetermined terms (for example, "did not understand" or "incomprehensible"). is there.

具体例２は、時間条件に基づく場合である。時間条件は、自動応答情報を受け付ける早さに基づく場合と言っても良い。さらに具体的には、具体例２は、対話制御装置２が、予め決められた条件を満たす、最初に受け付けられた自動応答情報を選択する場合である。ここでの予め決められた条件も、例えば、対話エンジン３からの応答である自動応答情報に予め決められた用語（例えば、「理解できなかった」「理解不能」）を含まないこととする。 Specific example 2 is a case based on a time condition. It may be said that the time condition is based on the speed at which the automatic response information is received. More specifically, the second embodiment is a case where the dialogue control device 2 selects the first received automatic response information that satisfies a predetermined condition. The predetermined conditions here also assume that, for example, the automatic response information that is the response from the dialogue engine 3 does not include predetermined terms (for example, "did not understand" or "incomprehensible").

具体例３は、内容条件に基づく場合である。内容条件は、自動応答情報（応答）の内容に基づく条件である。さらに具体的には、具体例３は、対話制御装置２が、予め決められた条件を満たす自動応答情報の中で、最も文長が長い自動応答情報または自立語の数が多い自動応答情報または文の数が最も多い自動応答情報を選択する場合である。ここでの予め決められた条件も、例えば、対話エンジン３からの応答である自動応答情報に予め決められた用語（例えば、「理解できなかった」「理解不能」）を含まないこととする。 Specific example 3 is a case based on the content condition. The content condition is a condition based on the content of the automatic response information (response). More specifically, in the specific example 3, the dialogue control device 2 has the longest sentence length or the large number of independent words among the automatic response information satisfying the predetermined conditions. This is the case when the automatic response information with the largest number of sentences is selected. The predetermined conditions here also assume that, for example, the automatic response information that is the response from the dialogue engine 3 does not include predetermined terms (for example, "did not understand" or "incomprehensible").

（具体例１）
今、ユーザは、ユーザ端末１に対して、「牛丼の作り方を教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「牛丼の作り方を教えて」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「牛丼の作り方を教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置２に送信する。 (Specific example 1)
Now, it is assumed that the user voice-inputs "Tell me how to make beef bowl" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Tell me how to make beef bowl" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user's utterance information "Tell me how to make beef bowl". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

次に、対話制御装置２の受信部２２は、ユーザ発話情報「牛丼の作り方を教えて」をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 2 receives the user utterance information "Tell me how to make beef bowl" from the user terminal 1.

そして、処理部２３は、当該ユーザ発話情報から発話情報「１：牛丼の作り方を教えて」を取得する。発話情報の「１」は発話識別子である。 Then, the processing unit 23 acquires the utterance information "1: Tell me how to make beef bowl" from the user's utterance information. The "1" in the utterance information is the utterance identifier.

次に、発話情報出力部２４１は、図６の優先度情報管理表から、発話情報の出力先（３つの通信先識別子）を取得する。そして、発話情報出力部２４１は、３つの各通信先識別子で識別される３つの対話エンジン３に、発話情報「１：牛丼の作り方を教えて」を送信する。 Next, the utterance information output unit 241 acquires the output destinations (three communication destination identifiers) of the utterance information from the priority information management table of FIG. Then, the utterance information output unit 241 transmits the utterance information "1: Tell me how to make beef bowl" to the three dialogue engines 3 identified by each of the three communication destination identifiers.

次に、３つの各対対話エンジン３の発話受付部３２は、対話制御装置２から発話情報を受け付ける。そして、３つの各対話エンジン３の応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。例えば、対話エンジン識別子「ｅ１」で識別される対話エンジン３の応答取得部３３は、発話情報が有する「牛丼の作り方を教えて」を形態素解析する。そして、応答取得部３３は、取得した形態素「牛丼」をキーとして、辞書を検索し、「牛丼」は｛ｆｏｏｄ｝に含まれることを検出する。次に、応答取得部３３は、「牛丼の作り方を教えて」が図７の学習情報管理表の「ＩＤ＝４」の発話情報に合致することを検出する。そして、応答取得部３３は、「ＩＤ＝４」の自動応答情報を取得し、当該自動応答情報を実行し、図８のＦｏｏｄｔａｂｌｅから料理「牛丼」に対応する作り方「４人前の材料をご紹介します。・・・」を取得する。そして、応答出力部３４は、応答取得部３３が取得した自動応答情報「１：４人前の材料をご紹介します。・・・」を対話制御装置２に送信する。 Next, the utterance receiving unit 32 of each of the three dialogue engines 3 receives the utterance information from the dialogue control device 2. Then, the response acquisition unit 33 of each of the three dialogue engines 3 acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit 32, using the learning information of the learning information storage unit 31. For example, the response acquisition unit 33 of the dialogue engine 3 identified by the dialogue engine identifier "e1" morphologically analyzes "tell me how to make beef bowl" contained in the utterance information. Then, the response acquisition unit 33 searches the dictionary using the acquired morpheme "gyudon" as a key, and detects that "gyudon" is included in {food}. Next, the response acquisition unit 33 detects that "Tell me how to make beef bowl" matches the utterance information of "ID = 4" in the learning information management table of FIG. 7. Then, the response acquisition unit 33 acquires the automatic response information of "ID = 4", executes the automatic response information, and prepares "Gyudon" corresponding to the dish "Gyudon" from the Foodtable of FIG. Introducing ... "is acquired. Then, the response output unit 34 transmits the automatic response information "1: 4 servings of material ...." acquired by the response acquisition unit 33 to the dialogue control device 2.

次に、対話制御装置２の受付部２５は、閾値の時間以内に受信された対話エンジン３から自動応答情報を受信する。ここでは、受付部２５は、閾値の時間以内に３つの対話エンジン３から自動応答情報を受信した、とする。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from the dialogue engine 3 received within the threshold time. Here, it is assumed that the reception unit 25 has received the automatic response information from the three dialogue engines 3 within the threshold time.

次に、選択部２３１は、３つの自動応答情報から一の自動応答情報を選択する。つまり、選択部２３１は、自動応答情報に予め決められた用語（「理解できなかった」または「理解不能」）を含む「ｅ２」の対話エンジン３から送信された自動応答情報を選択しない、と決定する。次に、「ｅ１」「ｅ３」の対話エンジン３の優先度情報「１」「３」を取得し、優先度が高い（優先度情報「１」の）「ｅ１」の対話エンジン３の自動応答情報「１：４人前の材料をご紹介します。・・・」を選択する。 Next, the selection unit 231 selects one automatic response information from the three automatic response information. That is, the selection unit 231 does not select the automatic response information transmitted from the dialogue engine 3 of "e2" that includes a predetermined term ("unintelligible" or "incomprehensible") in the automatic response information. decide. Next, the priority information "1" and "3" of the dialogue engine 3 of "e1" and "e3" are acquired, and the automatic response of the dialogue engine 3 of "e1" having a high priority (priority information "1") is obtained. Select the information "1: 4 servings of ingredients ...".

次に、処理部２３は、選択された自動応答情報から自動情報「４人前の材料をご紹介します。・・・」を取得する。 Next, the processing unit 23 acquires the automatic information "Introducing the material for 4 servings ..." from the selected automatic response information.

次に、応答送信部２６１は、当該自動情報をユーザ端末１に送信する。 Next, the response transmission unit 261 transmits the automatic information to the user terminal 1.

次に、ユーザ端末１のユーザ受信部１４は、当該自動情報を受信する。そして、ユーザ出力部１５は、当該自動情報を出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the automatic information. Then, the user output unit 15 outputs the automatic information.

また、自動応答情報出力部２４２は、選択された一の自動応答情報「１：４人前の材料をご紹介します。・・・」を、当該一の自動応答情報を出力した対話エンジン３以外の対話エンジン３（「ｅ２」「ｅ３」の対話エンジン３）に送信する。 In addition, the automatic response information output unit 242 displays the selected automatic response information "1: 4 servings of material ...." other than the dialogue engine 3 that outputs the one automatic response information. Is transmitted to the dialogue engine 3 (dialogue engine 3 of "e2" and "e3").

そして、「ｅ２」「ｅ３」の対話エンジン３の学習受付部３５は、当該自動応答情報を受信する。 Then, the learning reception unit 35 of the dialogue engine 3 of "e2" and "e3" receives the automatic response information.

次に、「ｅ２」「ｅ３」の対話エンジン３の学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、学習情報に、追記する情報を構成する。つまり、学習部３６は、自動応答情報「１：４人前の材料をご紹介します。・・・」が有する発話識別子「１」に対応する発話情報を取得する。そして、発話識別子「１」に対応する発話情報の内容「牛丼の作り方を教えて」と自応答情報の内容「４人前の材料をご紹介します。・・・」との組を、図８の学習情報管理表に追記する。 Next, the learning unit 36 of the dialogue engine 3 of the “e2” and “e3” configures information to be added to the learning information by using the automatic response information received by the learning reception unit 35. That is, the learning unit 36 acquires the utterance information corresponding to the utterance identifier "1" possessed by the automatic response information "I will introduce the material for 1: 4 servings ....". Then, a set of the content of the utterance information corresponding to the utterance identifier "1" "Tell me how to make beef bowl" and the content of the self-response information "Introducing the ingredients for 4 servings ..." is shown in the figure. Add to the learning information management table of 8.

なお、学習部３６は、発話情報の内容「牛丼の作り方を教えて」のうち、名詞または名詞句「牛丼」「作り方」を変数に置き換え、「｛変数１｝の{変数２}を教えて」というパターンに変更し、「４人前の材料をご紹介します。・・・」のうち、名詞または名詞句「４人前」「材料」を変数に置き換え、「｛変数３｝の{変数４}をご紹介します。・・・」というパターンに変更し、２つのパターンの組を、図８の学習情報管理表に追記するなどしても良い。 The learning unit 36 replaces the nouns or noun phrases "beef bowl" and "how to make" with variables in the content of the utterance information "tell me how to make beef bowl", and replaces "{variable 2} of {variable 1}" with variables. Change to the pattern "Tell me" and replace the noun or noun phrase "4 servings" and "material" with variables in "Introducing materials for 4 servings ...." You may change the pattern to "Introducing variable 4} ...." and add a set of two patterns to the learning information management table in Fig. 8.

つまり、本具体例において、他の対話エンジン３からの入力である自動応答情報を用いて、学習情報をよくすれば良く、学習情報の構造は問わない。 That is, in this specific example, the learning information may be improved by using the automatic response information input from the other dialogue engine 3, and the structure of the learning information does not matter.

次に、ユーザは、ユーザ端末１に対して、「粉末うどんスープはどちらのスープがお勧めですか？」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、当該ユーザの音声入力を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「粉末うどんスープはどちらのスープがお勧めですか？」を取得する。次に、ユーザ送信部１３は、当該ユーザ発話情報を対話制御装置２に送信する。 Next, it is assumed that the user voice-inputs "Which soup is recommended for powdered udon soup?" To the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the voice input of the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user utterance information "Which soup is recommended for powdered udon soup?". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

次に、対話制御装置２の受信部２２は、当該ユーザ発話情報をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 2 receives the user utterance information from the user terminal 1.

そして、処理部２３は、当該ユーザ発話情報から発話情報「２：粉末うどんスープはどちらのスープがお勧めですか？」を取得する。 Then, the processing unit 23 acquires the utterance information "2: Which soup is recommended for the powdered udon soup?" From the user's utterance information.

次に、発話情報出力部２４１は、図６の優先度情報管理表から３つの通信先識別子を取得する。そして、発話情報出力部２４１は、３つの各通信先識別子で識別される３つの対話エンジン３に、当該話情報を送信する。 Next, the utterance information output unit 241 acquires three communication destination identifiers from the priority information management table of FIG. Then, the utterance information output unit 241 transmits the utterance information to the three dialogue engines 3 identified by the three communication destination identifiers.

次に、３つの各対話エンジン３の発話受付部３２は、対話制御装置２から発話情報を受け付ける。そして、３つの各対話エンジン３の応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。そして、３つの各対話エンジン３の応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置２に送信する。 Next, the utterance receiving unit 32 of each of the three dialogue engines 3 receives the utterance information from the dialogue control device 2. Then, the response acquisition unit 33 of each of the three dialogue engines 3 acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit 32, using the learning information of the learning information storage unit 31. Then, the response output unit 34 of each of the three dialogue engines 3 transmits the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2.

次に、対話制御装置２の受付部２５は、閾値の時間以内に受信された対話エンジン３から自動応答情報を受信する。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from the dialogue engine 3 received within the threshold time.

次に、選択部２３１は、上述した優先度情報を用いるアルゴリズムにより、受信された自動応答情報から一の自動応答情報を選択する。そして、処理部２３は、当該一の自動応答情報から自動情報を取得する。 Next, the selection unit 231 selects one automatic response information from the received automatic response information by the algorithm using the priority information described above. Then, the processing unit 23 acquires the automatic information from the one automatic response information.

次に、ユーザ端末１のユーザ受信部１４は、自動情報「４人前の材料をご紹介します。・・・」を受信する。そして、ユーザ出力部１５は、当該自動情報を出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the automatic information "Introducing the material for four servings ....". Then, the user output unit 15 outputs the automatic information.

また、自動応答情報出力部２４２は、選択された一の自動応答情報を、当該一の自動応答情報を出力した対話エンジン３以外の対話エンジン３に送信する。 Further, the automatic response information output unit 242 transmits the selected automatic response information to the dialogue engine 3 other than the dialogue engine 3 that outputs the one automatic response information.

そして、当該一の自動応答情報を送信していない対話エンジン３の学習受付部３５は、当該一の自動応答情報を受信する。学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、上述した学習処理を行う。 Then, the learning reception unit 35 of the dialogue engine 3 that has not transmitted the one automatic response information receives the one automatic response information. The learning unit 36 performs the above-mentioned learning process using the automatic response information received by the learning reception unit 35.

以上の処理を繰り返すことにより、適切な対話を継続させることができる。また、ユーザは、適切な応答を得ることができる。さらに、対話エンジン３は、他の対話エンジン３の能力（知識といっても良い）を学習できる。 By repeating the above processing, an appropriate dialogue can be continued. In addition, the user can obtain an appropriate response. Further, the dialogue engine 3 can learn the abilities (which may be called knowledge) of other dialogue engines 3.

（具体例２）
今、ユーザは、ユーザ端末１に対して、「牛丼の作り方を教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「牛丼の作り方を教えて」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「牛丼の作り方を教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置２に送信する。 (Specific example 2)
Now, it is assumed that the user voice-inputs "Tell me how to make beef bowl" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Tell me how to make beef bowl" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user's utterance information "Tell me how to make beef bowl". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

そして、処理部２３は、当該ユーザ発話情報から発話情報「１：牛丼の作り方を教えて」を取得する。 Then, the processing unit 23 acquires the utterance information "1: Tell me how to make beef bowl" from the user's utterance information.

次に、発話情報出力部２４１は、３つの通信先識別子を取得する。そして、発話情報出力部２４１は、３つの各通信先識別子で識別される３つの対話エンジン３に、当該発話情報を送信する。なお、３つの通信先識別子は、格納部２１に格納されている、とする。 Next, the utterance information output unit 241 acquires three communication destination identifiers. Then, the utterance information output unit 241 transmits the utterance information to the three dialogue engines 3 identified by the three communication destination identifiers. It is assumed that the three communication destination identifiers are stored in the storage unit 21.

次に、３つの各対話エンジン３の発話受付部３２は、対話制御装置２から発話情報を受け付ける。そして、３つの各対話エンジン３の応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。 Next, the utterance receiving unit 32 of each of the three dialogue engines 3 receives the utterance information from the dialogue control device 2. Then, the response acquisition unit 33 of each of the three dialogue engines 3 acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit 32, using the learning information of the learning information storage unit 31.

次に、対話制御装置２の受付部２５は、３つの各対話エンジン３から自動応答情報を受信する。受付部２５は、最初に対話エンジン「ｅ２」からの自動応答情報「１：理解できなかった。・・・」を受信した、とする。次に、受付部２５は、対話エンジン「ｅ１」からの自動応答情報「１：４人前の材料をご紹介します。・・・」を受信した、とする。さらに次に、受付部２５は、対話エンジン「ｅ３」からの自動応答情報「１：まず、水、粉末うどんスープ、・・・を用意します。次に、・・・」を受信した、とする。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from each of the three dialogue engines 3. It is assumed that the reception unit 25 first receives the automatic response information "1: I could not understand ...." from the dialogue engine "e2". Next, it is assumed that the reception unit 25 receives the automatic response information "I will introduce the material for 1: 4 servings ..." from the dialogue engine "e1". Next, the reception unit 25 received the automatic response information "1: First, water, powdered udon soup, ..." from the dialogue engine "e3". Then ... " To do.

次に、選択部２３１は、３つの自動応答情報から一の自動応答情報を選択する。つまり、選択部２３１は、予め決められた用語（「理解できなかった」または「理解不能」）を含まない自動応答情報の中で、最初に受信された「ｅ１」の対話エンジン３の自動応答情報「１：４人前の材料をご紹介します。・・・」を選択する。 Next, the selection unit 231 selects one automatic response information from the three automatic response information. That is, the selection unit 231 automatically responds to the dialogue engine 3 of "e1" that is first received in the automatic response information that does not include a predetermined term ("unintelligible" or "incomprehensible"). Select the information "1: 4 servings of ingredients ...".

次に、「ｅ２」「ｅ３」の対話エンジン３の学習部３６は、当該自動応答情報を用いて、上述した学習処理を行う。 Next, the learning unit 36 of the dialogue engine 3 of "e2" and "e3" performs the above-mentioned learning process using the automatic response information.

以上の処理を繰り返すことにより、適切な対話を継続させることができる。また、ユーザは、適切な応答を得ることができる。さらに、対話エンジン３は、他の対話エンジン３の能力（知識といっても良い）を得ることができる。 By repeating the above processing, an appropriate dialogue can be continued. In addition, the user can obtain an appropriate response. Further, the dialogue engine 3 can acquire the ability (which may be called knowledge) of another dialogue engine 3.

（具体例３）
今、ユーザは、ユーザ端末１に対して、「牛丼の作り方を教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「牛丼の作り方を教えて」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「牛丼の作り方を教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置２に送信する。 (Specific example 3)
Now, it is assumed that the user voice-inputs "Tell me how to make beef bowl" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Tell me how to make beef bowl" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user's utterance information "Tell me how to make beef bowl". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

次に、発話情報出力部２４１は、３つの通信先識別子を取得する。そして、発話情報出力部２４１は、３つの各通信先識別子で識別される３つの対話エンジン３に、当該発話情報を送信する。 Next, the utterance information output unit 241 acquires three communication destination identifiers. Then, the utterance information output unit 241 transmits the utterance information to the three dialogue engines 3 identified by the three communication destination identifiers.

次に、対話制御装置２の受付部２５は、３つの各対話エンジン３から自動応答情報を受信する。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from each of the three dialogue engines 3.

次に、選択部２３１は、３つの自動応答情報から一の自動応答情報を選択する。つまり、選択部２３１は、予め決められた用語（「理解できなかった」または「理解不能」）を含まない自動応答情報の中で、文の長さが最も長い「ｅ１」の対話エンジン３の自動応答情報「１：４人前の材料をご紹介します。・・・」を選択する。 Next, the selection unit 231 selects one automatic response information from the three automatic response information. That is, the selection unit 231 of the dialogue engine 3 of "e1" having the longest sentence length among the automatic response information that does not include a predetermined term ("unintelligible" or "incomprehensible"). Select the automatic response information "1: Introducing materials for 4 servings ....".

なお、上記具体例で、ユーザ端末１において、図９に示すようにキャラクタがお話しているように、自動情報を音声出力することは好適である。 In the above specific example, it is preferable to output the automatic information by voice on the user terminal 1 as the character speaks as shown in FIG.

以上、本実施の形態によれば、複数の対話エンジンを効果的に用いて、適切な対話を継続させることができる。 As described above, according to the present embodiment, it is possible to effectively use a plurality of dialogue engines to continue an appropriate dialogue.

特に、本実施の形態によれば、対話ごとに、複数の対話エンジンから送信された複数の自動応答情報から適切な自動応答情報を選択し、ユーザに応答できる。かかることにより、ユーザに極めて適切な応答を行える。 In particular, according to the present embodiment, it is possible to select appropriate automatic response information from a plurality of automatic response information transmitted from a plurality of dialogue engines and respond to the user for each dialogue. As a result, a very appropriate response can be made to the user.

なお、本実施の形態における処理は、ソフトウェアで実現しても良い。そして、このソフトウェアをソフトウェアダウンロード等により配布しても良い。また、このソフトウェアをＣＤ−ＲＯＭなどの記録媒体に記録して流布しても良い。なお、このことは、本明細書における他の実施の形態においても該当する。なお、本実施の形態における対話制御装置２を実現するソフトウェアは、以下のようなプログラムである。つまり、このプログラムは、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話制御装置におけるプログラムであって、コンピュータを、ユーザの発話に対応するユーザ発話情報をユーザ端末から受信する受信部と、前記ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、前記発話情報の出力に応じて、前記１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、前記自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報を前記ユーザ端末に送信する応答送信部として機能させるためのプログラムである。 The processing in this embodiment may be realized by software. Then, this software may be distributed by software download or the like. Further, this software may be recorded on a recording medium such as a CD-ROM and disseminated. This also applies to other embodiments herein. The software that realizes the dialogue control device 2 in the present embodiment is the following program. That is, this program is a program in a dialogue control device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines, and uses a computer to display user utterance information corresponding to the user's utterance. The receiving unit received from the user terminal, the utterance information output unit capable of outputting the utterance information which is the information corresponding to the user utterance information to one or more dialogue engines, and the utterance information output unit according to the output of the utterance information. The user receives an automatic response reception unit that receives automatic response information from one or more dialogue engines, and one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit. It is a program for functioning as a response transmitter to be transmitted to a terminal.

また、上記プログラムにおいて、前記発話情報出力部は、前記発話情報を、２以上の対話エンジンに出力し、前記自動応答受付部は、前記発話情報の出力に応じて、前記２以上の各対話エンジンから自動応答情報を受け付け、コンピュータを、前記２以上の自動応答情報から一の自動応答情報を選択する選択部としてさらに機能させ、前記応答送信部は、前記選択部が選択した自動応答情報に対応する一の自動情報を前記ユーザ端末に送信するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, the utterance information output unit outputs the utterance information to two or more dialogue engines, and the automatic response reception unit outputs the utterance information to each of the two or more dialogue engines according to the output of the utterance information. The computer further functions as a selection unit for selecting one automatic response information from the two or more automatic response information, and the response transmission unit corresponds to the automatic response information selected by the selection unit. It is preferable that the program functions the computer so as to transmit one automatic information to the user terminal.

また、上記プログラムにおいて、前記選択部は、前記２以上の自動応答情報から、予め決められた条件に合致する一の自動応答情報を選択するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, it is preferable that the selection unit is a program that causes the computer to function by selecting one automatic response information that matches a predetermined condition from the two or more automatic response information. is there.

また、上記プログラムにおいて、前記選択部は、前記２以上の自動応答情報のうちの予め決められた条件を満たす自動応答情報の中で、前記優先度情報に基づく優先度が最も高い対話エンジンから出力された自動応答情報を選択するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, the selection unit outputs from the dialogue engine having the highest priority based on the priority information among the automatic response information satisfying the predetermined conditions among the two or more automatic response information. It is preferable that the program functions the computer as the one that selects the automatic response information.

また、本実施の形態における対話エンジン３を実現するソフトウェアは、以下のようなプログラムである。つまり、このプログラムは、コンピュータがアクセス可能な記録媒体は、発話情報に対応する自動応答情報を取得するための情報である学習情報が格納される学習情報格納部を具備し、コンピュータを、対話制御装置２から発話情報を受け付ける発話受付部と、前記発話受付部が受け付けた発話情報に対応する応答である自動応答情報を、前記学習情報を用いて取得する応答取得部と、前記応答取得部が取得した自動応答情報を前記対話制御装置に出力する応答出力部と、他の対話エンジンからの応答である他自動応答情報を受け付ける学習受付部と、前記学習受付部が受け付けた他自動応答情報を用いて、前記学習情報に情報を付加する学習部として機能させるプログラムである。 Further, the software that realizes the dialogue engine 3 in the present embodiment is the following program. That is, in this program, the recording medium accessible to the computer is provided with a learning information storage unit for storing learning information which is information for acquiring automatic response information corresponding to the spoken information, and the computer is interactively controlled. The utterance reception unit that receives utterance information from the device 2, the response acquisition unit that acquires automatic response information that is a response corresponding to the utterance information received by the utterance reception unit, and the response acquisition unit that use the learning information. A response output unit that outputs the acquired automatic response information to the dialogue control device, a learning reception unit that receives other automatic response information that is a response from another dialogue engine, and other automatic response information received by the learning reception unit. It is a program that is used to function as a learning unit that adds information to the learning information.

（実施の形態２）
本実施の形態において、複数の対話エンジンから一部の対話エンジンを選択し、当該対話エンジンと対話を行える対話システムについて説明する。なお、一部の対話エンジンは、一つであることは好適であるが、２以上でも良い。 (Embodiment 2)
In the present embodiment, a dialogue system capable of selecting a part of dialogue engines from a plurality of dialogue engines and having a dialogue with the dialogue engines will be described. It is preferable that some dialogue engines are one, but two or more may be used.

本実施の形態における対話システムＢの概念図は、符号以外、図１と同様である。対話システムＢは、１または２以上のユーザ端末１、対話制御装置４、および２以上の対話エンジン３を備える。対話制御装置４は、対話を制御する装置である。対話制御装置４は、例えば、ＡＳＰのサーバ、クラウドサーバ等である。但し、対話制御装置４の種類は問わない。 The conceptual diagram of the dialogue system B in the present embodiment is the same as that of FIG. 1 except for the reference numerals. The dialogue system B includes one or more user terminals 1, a dialogue control device 4, and two or more dialogue engines 3. The dialogue control device 4 is a device that controls dialogue. The dialogue control device 4 is, for example, an ASP server, a cloud server, or the like. However, the type of the dialogue control device 4 does not matter.

また、通常、１以上の各ユーザ端末１と対話制御装置４とはインターネット、ＬＡＮ等のネットワークにより通信可能である。また、通常、対話制御装置４と１以上の各対話エンジン３とはインターネット、ＬＡＮ等のネットワークにより通信可能である。ただし、対話制御装置４は、１以上のユーザ端末１、１以上の対話エンジン３のうちの１以上の装置を内部に含んでいても良い。つまり、ユーザ端末１と対話制御装置４と対話エンジン３のうちの２以上の装置は、物理的に一の装置で実現されても良い。 Further, usually, one or more user terminals 1 and the dialogue control device 4 can communicate with each other via a network such as the Internet or LAN. Further, usually, the dialogue control device 4 and each of the dialogue engines 3 of 1 or more can communicate with each other via a network such as the Internet or LAN. However, the dialogue control device 4 may include one or more of one or more user terminals 1 and one or more dialogue engines 3 inside. That is, two or more of the user terminal 1, the dialogue control device 4, and the dialogue engine 3 may be physically realized by one device.

本実施の形態における対話システムＢのブロック図は、対話制御装置の符号以外は図２と同様である。図１０は、対話制御装置４のブロック図である。 The block diagram of the dialogue system B in the present embodiment is the same as that of FIG. 2 except for the reference numerals of the dialogue control device. FIG. 10 is a block diagram of the dialogue control device 4.

対話制御装置４は、格納部４１、受信部２２、処理部４３、出力部２４、受付部２５、および送信部２６を備える。処理部４３は、選択部４３１を備える。出力部２４は、発話情報出力部２４１、および自動情報出力部２４２を備える。受付部２５は、自動応答受付部２５１を備える。 The dialogue control device 4 includes a storage unit 41, a reception unit 22, a processing unit 43, an output unit 24, a reception unit 25, and a transmission unit 26. The processing unit 43 includes a selection unit 431. The output unit 24 includes an utterance information output unit 241 and an automatic information output unit 242. The reception unit 25 includes an automatic response reception unit 251.

対話制御装置４を構成する格納部４１は、各種の情報が格納される。各種の情報とは、例えば、後述する対話エンジン３の選択条件、優先度情報、対話の履歴、２以上の各対話エンジン３と通信を行うための情報（例えば、対話エンジン３のＩＰアドレス、ＵＲＬ、ＭＡＣアドレス等）、である。 Various types of information are stored in the storage unit 41 that constitutes the dialogue control device 4. The various information includes, for example, selection conditions of the dialogue engine 3, priority information, dialogue history, and information for communicating with each of two or more dialogue engines 3 (for example, the IP address and URL of the dialogue engine 3). , MAC address, etc.).

処理部４３は、各種の処理を行う。各種の処理とは、選択部４３１が行う処理等である。 The processing unit 43 performs various processes. The various processes are processes performed by the selection unit 431 and the like.

選択部４３１は、２以上の対話エンジン３から一部の対話エンジン３を選択する。選択部４３１は、Ｎ（Ｎは２以上の自然数）の対話エンジン３からＭ（Ｎは１以上の自然数，Ｍ＜Ｎ）の対話エンジン３を選択する。一部の対話エンジン３とは、２以上の対話エンジン３のうちの一部である。なお、一部の対話エンジン３とは、一の対話エンジン３であることは好適である。 The selection unit 431 selects a part of the dialogue engines 3 from the two or more dialogue engines 3. The selection unit 431 selects the dialogue engine 3 of N (N is a natural number of 2 or more) from the dialogue engine 3 of M (N is a natural number of 1 or more, M <N). The part dialogue engine 3 is a part of two or more dialogue engines 3. It is preferable that the part of the dialogue engine 3 is one dialogue engine 3.

選択部４３１は、ユーザ発話情報または発話情報を用いて、一部の対話エンジン３を選択することは好適である。 It is preferable that the selection unit 431 selects a part of the dialogue engine 3 by using the user utterance information or the utterance information.

選択部４３１は、例えば、選択条件に基づいて、一部の対話エンジン３を選択する。選択条件とは、例えば、対話（発話）の主題が合致することである。主題が合致するという選択条件は、例えば、発話から取得されるジャンルと、対話エンジン３に対応付けて格納されているジャンルが一致する対話エンジン３を選択することである。選択条件は、例えば、予め決められた条件を満たすほど負荷が小さい（例えば、最も負荷が小さい、ログインユーザ数が閾値以下）ことである。選択条件は、例えば、予め決められた条件を満たすほどレスポンスが早い（例えば、最もレスポンスが早い）ことである。 The selection unit 431 selects a part of the dialogue engine 3 based on the selection condition, for example. The selection condition is, for example, that the subject of dialogue (utterance) matches. The selection condition that the subject matches is, for example, to select the dialogue engine 3 in which the genre acquired from the utterance and the genre stored in association with the dialogue engine 3 match. The selection condition is, for example, that the load is small enough to satisfy a predetermined condition (for example, the load is the smallest, the number of logged-in users is equal to or less than the threshold value). The selection condition is, for example, that the response is faster (for example, the fastest response) as the predetermined condition is satisfied.

選択部４３１が対話エンジン３を選択するタイミングは問わない。選択部４３１は、ユーザ発話情報が受け付けられる毎（対話毎）に、対話エンジン３を選択しても良い。また、選択部４３１は、一連の対話の始まりの際に、対話エンジン３を選択しても良い。選択部４３１は、例えば、一連の対話の始まりの際に、主題が合致する対話エンジン３を選択したり、負荷が最小の対話エンジン３を選択したり、レスポンスが早い対話エンジン３を選択したりしても良い。なお、レスポンスを検知するために、事前に命令を対話エンジン３に送信し、応答を受信し、レスポンスの早さを検知する等しても良い。 The timing at which the selection unit 431 selects the dialogue engine 3 does not matter. The selection unit 431 may select the dialogue engine 3 each time the user utterance information is received (for each dialogue). Further, the selection unit 431 may select the dialogue engine 3 at the beginning of a series of dialogues. For example, at the beginning of a series of dialogues, the selection unit 431 selects a dialogue engine 3 that matches the subject, selects a dialogue engine 3 that has the least load, or selects a dialogue engine 3 that has a fast response. You may. In order to detect the response, a command may be transmitted to the dialogue engine 3 in advance, the response may be received, and the speed of the response may be detected.

複数の対話エンジン３から自動応答情報が受け付けられた場合、選択部４３１は、当該自動応答情報から一の自動応答情報を選択する。なお、かかる選択は、選択部２３１の処理と同様である。 When the automatic response information is received from the plurality of dialogue engines 3, the selection unit 431 selects one automatic response information from the automatic response information. The selection is the same as the processing of the selection unit 231.

ここでの自動応答受付部２５１は、通常、一の対話エンジン３から自動応答情報を受け付ける。ただし、自動応答受付部２５１は、複数の各対話エンジンから自動応答情報を受け付けても良い。 The automatic response reception unit 251 here normally receives automatic response information from one dialogue engine 3. However, the automatic response reception unit 251 may receive automatic response information from each of a plurality of dialogue engines.

格納部４１は、不揮発性の記録媒体が好適であるが、揮発性の記録媒体でも実現可能である。 The storage unit 41 is preferably a non-volatile recording medium, but can also be realized by a volatile recording medium.

格納部４１に情報が記憶される過程は問わない。例えば、記録媒体を介して情報が格納部４１で記憶されるようになってもよく、通信回線等を介して送信された情報が格納部４１で記憶されるようになってもよく、あるいは、入力デバイスを介して入力された情報が格納部４１で記憶されるようになってもよい。 The process in which the information is stored in the storage unit 41 does not matter. For example, the information may be stored in the storage unit 41 via the recording medium, the information transmitted via the communication line or the like may be stored in the storage unit 41, or The information input via the input device may be stored in the storage unit 41.

処理部４３、および選択部４３１は、通常、ＭＰＵやメモリ等から実現され得る。処理部４３等の処理手順は、通常、ソフトウェアで実現され、当該ソフトウェアはＲＯＭ等の記録媒体に記録されている。但し、ハードウェア（専用回路）で実現しても良い。 The processing unit 43 and the selection unit 431 can usually be realized from an MPU, a memory, or the like. The processing procedure of the processing unit 43 and the like is usually realized by software, and the software is recorded on a recording medium such as a ROM. However, it may be realized by hardware (dedicated circuit).

次に、対話制御装置４の動作について、図１１のフローチャートを用いて説明する。図１１のフローチャートにおいて、図４のフローチャートと同一のステップについて説明を省略する。 Next, the operation of the dialogue control device 4 will be described with reference to the flowchart of FIG. In the flowchart of FIG. 11, the same steps as the flowchart of FIG. 4 will be omitted.

（ステップＳ１１０１）選択部４３１は、２以上の対話エンジン３から一部の対話エンジン３（出力先）を選択する。 (Step S1101) The selection unit 431 selects a part of the dialogue engines 3 (output destinations) from two or more dialogue engines 3.

（ステップＳ１１０２）選択部４３１は、複数の自動応答情報から一の自動応答情報を選択するか否かを判断する。一の自動応答情報を選択する場合はステップＳ４０６に行き、一の自動応答情報を選択しない場合はステップＳ４０７に行く。ここで、選択部４３１は、例えば、ステップＳ１１０１で２以上の対話エンジン３を選択したか否かを判断しても良い。選択部４３１は、例えば、２以上の自動応答情報が受け付けられたか否かを判断しても良い。 (Step S1102) The selection unit 431 determines whether or not to select one automatic response information from the plurality of automatic response information. If one automatic response information is selected, the process goes to step S406. If one automatic response information is not selected, the process goes to step S407. Here, the selection unit 431 may determine, for example, whether or not two or more dialogue engines 3 have been selected in step S1101. The selection unit 431 may determine, for example, whether or not two or more automatic response information has been accepted.

なお、図１１のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 11, the process ends when the power is turned off or an interrupt for the end of the process occurs.

以下、本実施の形態における対話システムＢの具体的な動作について説明する。本具体例において、対話システムＢは、３つの対話エンジンを具備する、とする。なお、対話システムＡの概念図は図１である。 Hereinafter, the specific operation of the dialogue system B in the present embodiment will be described. In this specific example, it is assumed that the dialogue system B includes three dialogue engines. The conceptual diagram of the dialogue system A is FIG.

今、３つの対話エンジン３の中の一の対話エンジン３（例えば、対話エンジン「ｅ１」）を構成する学習情報格納部３１は、図７に示す学習情報管理表を格納している、とする。また、対話エンジン「ｅ１」の学習情報格納部３１は、図８に示すＦｏｏｄｔａｂｌｅを格納している、とする。さらに、対話エンジン「ｅ１」の学習情報格納部３１は、料理名を格納しているｆｏｏｄ辞書、作物名を格納しているｃｒｏｐｓ辞書を有する、とする。ｆｏｏｄ辞書には、「カツ丼、牛丼、親子丼、・・・」等の多数の料理名が格納されている。また、ｃｒｏｐｓ辞書には、「お米、ナス、にんじん、・・・」等の多数の作物名が格納されている。 It is assumed that the learning information storage unit 31 constituting one of the three dialogue engines 3 (for example, the dialogue engine "e1") stores the learning information management table shown in FIG. 7. .. Further, it is assumed that the learning information storage unit 31 of the dialogue engine "e1" stores the Foodtable shown in FIG. Further, it is assumed that the learning information storage unit 31 of the dialogue engine "e1" has a food dictionary for storing the dish name and a crops dictionary for storing the crop name. The food dictionary stores a large number of dish names such as "Katsudon, Gyudon, Oyakodon, ...". In addition, a large number of crop names such as "rice, eggplant, carrot, ..." are stored in the crops dictionary.

かかる状況において、以下の２つの具体例について説明する。具体例１は、一の対話エンジンを選択する場合である。具体例２は、２以上の対話エンジンを選択し、かつ２以上の自動応答情報を選択する場合である。２以上の自動応答情報を選択する方法は、実施の形態１で説明したように種々のアルゴリズムがあり得る。 In such a situation, the following two specific examples will be described. Specific example 1 is a case where one dialogue engine is selected. Specific example 2 is a case where two or more dialogue engines are selected and two or more automatic response information is selected. As a method for selecting two or more automatic response information, various algorithms may be used as described in the first embodiment.

（具体例１）
今、対話制御装置２の格納部２１は、図１２に示す選択条件管理表を格納している、とする。選択条件管理表は、対話エンジンを選択するための条件を管理する表である。選択条件管理表は、「対話エンジン識別子」「ジャンル」「通知先識別子」を有する３つのレコードを有する。「ジャンル」は、各対話エンジンが応答可能なジャンルである。 (Specific example 1)
Now, it is assumed that the storage unit 21 of the dialogue control device 2 stores the selection condition management table shown in FIG. The selection condition management table is a table that manages the conditions for selecting the dialogue engine. The selection condition management table has three records having "interaction engine identifier", "genre", and "notification destination identifier". A "genre" is a genre that each dialogue engine can respond to.

かかる状況において、ユーザは、ユーザ端末１に対して、「牛丼の作り方を教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「牛丼の作り方を教えて」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「牛丼の作り方を教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置２に送信する。 In such a situation, it is assumed that the user voice-inputs "Tell me how to make beef bowl" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Tell me how to make beef bowl" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user's utterance information "Tell me how to make beef bowl". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

次に、選択部４３１は、２以上の対話エンジン３から一の対話エンジン３を選択する。ここでは、選択部４３１は、「牛丼の作り方を教えて」に対して自然言語処理を行い、ジャンル「料理」を取得する。そして、選択部４３１は、ジャンル「料理」と対になる対話エンジン「ｅ２」を選択する。なお、選択部４３１は、例えば、「牛丼の作り方を教えて」を形態素解析し、自立語「牛丼」「作り方」「教えて」を取得し、当該自立語から機械学習（例えば、深層学習、決定木、ＳＶＭ）により、ジャンル「料理」を取得する。なお、機械学習により文を分類し、当該文のジャンル（クラスと言っても良い）を決定する技術は公知技術であるので、詳細な説明を省略する。また、選択部４３１は、例えば、「牛丼の作り方を教えて」を形態素解析し、自立語「牛丼」「作り方」「教えて」を取得し、当該自立語をキーとしてＤＢ検索を行い、ＤＢからジャンル「料理」を取得する。かかる場合、自立語とジャンルとを対応付けるレコードがＤＢに格納されている、とする。なお、ユーザが入力したジャンルがユーザ端末１から受信され、受信部２２が当該ジャンルを受信し、選択部４３１が当該ジャンルを用いて、ジャンル「料理」を取得しても良い。 Next, the selection unit 431 selects one dialogue engine 3 from two or more dialogue engines 3. Here, the selection unit 431 performs natural language processing on "Tell me how to make beef bowl" and acquires the genre "cooking". Then, the selection unit 431 selects the dialogue engine "e2" that is paired with the genre "cooking". In addition, the selection unit 431 performs morphological analysis of, for example, "teach me how to make beef bowl", acquires the independent words "gyudon", "how to make", and "teach", and machine learning (for example, deep layer) from the independent words. Acquire the genre "cooking" by learning, decision tree, SVM). Since the technique of classifying sentences by machine learning and determining the genre (which may be called a class) of the sentence is a known technique, detailed description thereof will be omitted. In addition, the selection unit 431 performs morphological analysis of, for example, "tell me how to make beef bowl", acquires the independent words "gyudon", "how to make", and "teach", and performs a DB search using the independent word as a key. , Get the genre "cooking" from the DB. In such a case, it is assumed that the record corresponding to the independent word and the genre is stored in the DB. The genre input by the user may be received from the user terminal 1, the receiving unit 22 may receive the genre, and the selection unit 431 may acquire the genre "cooking" using the genre.

そして、発話情報出力部２４１は、対話エンジン「ｅ２」の通信先識別子を図１２の表から取得し、当該通信先識別子で識別される対話エンジン３に、発話情報「１：牛丼の作り方を教えて」を送信する。 Then, the utterance information output unit 241 acquires the communication destination identifier of the dialogue engine "e2" from the table of FIG. 12, and informs the dialogue engine 3 identified by the communication destination identifier of the utterance information "1: How to make beef bowl". Tell me "is sent.

次に、対話エンジン「ｅ２」の発話受付部３２は、対話制御装置２から当該発話情報を受け付ける。そして、応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報「１：４人前の材料をご紹介します。・・・」を、学習情報格納部３１の学習情報を用いて取得する。そして、応答出力部３４は、応答取得部３３が取得した当該自動応答情報を対話制御装置２に送信する。 Next, the utterance receiving unit 32 of the dialogue engine "e2" receives the utterance information from the dialogue control device 2. Then, the response acquisition unit 33 provides the automatic response information "1: 4 servings of material ...", which is a response corresponding to the utterance information received by the utterance reception unit 32, to the learning information storage unit 31. Acquired using the learning information of. Then, the response output unit 34 transmits the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2.

次に、対話制御装置２の受付部２５は、対話エンジン「ｅ２」から当該自動応答情報を受信する。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from the dialogue engine "e2".

また、自動応答情報出力部２４２は、選択された一の自動応答情報「１：４人前の材料をご紹介します。・・・」および発話情報「１：牛丼の作り方を教えて」を、当該一の自動応答情報を出力した対話エンジン３以外の対話エンジン３（「ｅ１」「ｅ３」の対話エンジン３）に送信する。 In addition, the automatic response information output unit 242 provides the selected automatic response information "1: Introduce the ingredients for 4 servings ..." and the utterance information "1: Tell me how to make beef bowl". , The automatic response information of the one is transmitted to the dialogue engine 3 (the dialogue engine 3 of "e1" and "e3") other than the dialogue engine 3 that outputs the information.

そして、「ｅ１」「ｅ３」の対話エンジン３の学習受付部３５は、当該自動応答情報等受信する。 Then, the learning reception unit 35 of the dialogue engine 3 of "e1" and "e3" receives the automatic response information and the like.

次に、「ｅ１」「ｅ３」の対話エンジン３の学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、学習情報に、追記する情報を構成する。つまり、学習部３６は、発話情報の内容「牛丼の作り方を教えて」と自応答情報の内容「４人前の材料をご紹介します。・・・」との組を、図８の学習情報管理表に追記する。 Next, the learning unit 36 of the dialogue engine 3 of "e1" and "e3" configures information to be added to the learning information by using the automatic response information received by the learning reception unit 35. In other words, the learning department 36 learns the combination of the content of the utterance information "Tell me how to make beef bowl" and the content of the self-response information "Introducing the ingredients for 4 servings ...." Add to the information management table.

つまり、本具体例において、他の対話エンジン３からの入力である自動応答情報を用いて、学習情報が充実すれば良く、学習情報の構造は問わない。 That is, in this specific example, it is sufficient that the learning information is enriched by using the automatic response information input from the other dialogue engine 3, and the structure of the learning information does not matter.

（具体例２）
今、対話制御装置２の格納部２１は、図１２に示す選択条件管理表を格納している、とする。また、格納部２１は、図６に示す優先度情報管理表を格納している。 (Specific example 2)
Now, it is assumed that the storage unit 21 of the dialogue control device 2 stores the selection condition management table shown in FIG. Further, the storage unit 21 stores the priority information management table shown in FIG.

かかる状況において、ユーザは、ユーザ端末１に対して、「京都でフットサルができるホテルを教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、当該話を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「京都でフットサルができるホテルを教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置２に送信する。 In such a situation, it is assumed that the user voice-inputs "Tell me a hotel where futsal can be played in Kyoto" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the story. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user utterance information "Tell me a hotel where futsal can be played in Kyoto". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

そして、処理部２３は、当該ユーザ発話情報から発話情報「１：京都でフットサルができるホテルを教えて」を取得する。 Then, the processing unit 23 acquires the utterance information "1: Tell me the hotel where futsal can be played in Kyoto" from the user's utterance information.

次に、選択部４３１は、２以上の対話エンジン３から１または２以上の対話エンジン３を選択する。ここでは、選択部４３１は、「京都でフットサルができるホテルを教えて」に対して自然言語処理を行い、ジャンル「旅行、スポーツ」を取得する。そして、選択部４３１は、ジャンル「旅行」または「スポーツ」と対になる対話エンジン「ｅ１」「ｅ３」を選択する。 Next, the selection unit 431 selects one or two or more dialogue engines 3 from two or more dialogue engines 3. Here, the selection unit 431 performs natural language processing on "Tell me a hotel where you can play futsal in Kyoto" and acquires the genre "Travel, sports". Then, the selection unit 431 selects the dialogue engines "e1" and "e3" that are paired with the genre "travel" or "sports".

そして、発話情報出力部２４１は、対話エンジン「ｅ１」「ｅ３」の通信先識別子を図１２の表から取得し、当該通信先識別子で識別される２つの対話エンジン３に、発話情報「１：京都でフットサルができるホテルを教えて」を送信する。 Then, the utterance information output unit 241 acquires the communication destination identifiers of the dialogue engines "e1" and "e3" from the table of FIG. 12, and supplies the utterance information "1:" to the two dialogue engines 3 identified by the communication destination identifiers. Tell me which hotels you can play futsal in Kyoto. "

次に、「ｅ１」「ｅ３」の各対話エンジンの発話受付部３２は、対話制御装置２から発話情報を受け付ける。そして、２つの各対話エンジン３の応答取得部３３は、発話受付部３２が受け付けた発話情報に対応する応答である自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。次に、２つの各対話エンジン３の応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置２に送信する。 Next, the utterance receiving unit 32 of each of the dialogue engines of "e1" and "e3" receives the utterance information from the dialogue control device 2. Then, the response acquisition unit 33 of each of the two dialogue engines 3 acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit 32, using the learning information of the learning information storage unit 31. Next, the response output unit 34 of each of the two dialogue engines 3 transmits the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2.

次に、対話制御装置２の受付部２５は、２つの各対話エンジン３から自動応答情報を受信する。 Next, the reception unit 25 of the dialogue control device 2 receives the automatic response information from each of the two dialogue engines 3.

次に、選択部４３１は、２つの自動応答情報から一の自動応答情報を選択する。かかる選択方法は、実施の形態１で説明した。また、ここで、選択部４３１は、対話エンジン「ｅ１」からの自動応答情報を選択した、とする。 Next, the selection unit 431 selects one automatic response information from the two automatic response information. Such a selection method has been described in the first embodiment. Further, it is assumed that the selection unit 431 has selected the automatic response information from the dialogue engine "e1".

次に、処理部２３は、選択された自動応答情報から自動情報を取得する。そして、応答送信部２６１は、当該自動情報をユーザ端末１に送信する。 Next, the processing unit 23 acquires automatic information from the selected automatic response information. Then, the response transmission unit 261 transmits the automatic information to the user terminal 1.

次に、ユーザ端末１のユーザ受信部１４は、当該動情報を受信する。そして、ユーザ出力部１５は、当該自動情報を出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the dynamic information. Then, the user output unit 15 outputs the automatic information.

また、自動応答情報出力部２４２は、選択された一の自動応答情報を、当該一の自動応答情報を出力した対話エンジン３以外の対話エンジン３（「ｅ２」「ｅ３」の対話エンジン３）に送信する。 Further, the automatic response information output unit 242 transfers the selected automatic response information to the dialogue engine 3 (dialogue engine 3 of "e2" and "e3") other than the dialogue engine 3 that outputs the one automatic response information. Send.

次に、「ｅ２」「ｅ３」の対話エンジン３の学習部３６は、学習受付部３５が受け付けた自動応答情報を用いて、上述した学習処理を行う。 Next, the learning unit 36 of the dialogue engine 3 of the “e2” and “e3” performs the above-mentioned learning process using the automatic response information received by the learning reception unit 35.

以上、本実施の形態によれば、複数の対話エンジンを効果的に用いて、適切な対話を継続させることができる。特に、本実施の形態によれば、複数の対話エンジンの中から、対話エンジンを選択することにより、少ない負荷で、適切な応答をユーザに提供できる。 As described above, according to the present embodiment, it is possible to effectively use a plurality of dialogue engines to continue an appropriate dialogue. In particular, according to the present embodiment, by selecting a dialogue engine from a plurality of dialogue engines, it is possible to provide an appropriate response to the user with a small load.

なお、本実施の形態における対話制御装置４を実現するソフトウェアは、以下のようなプログラムである。つまり、このプログラムは、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話制御装置におけるプログラムであって、コンピュータを、ユーザの発話に対応するユーザ発話情報をユーザ端末から受信する受信部と、前記ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、前記発話情報の出力に応じて、前記１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、前記自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報を前記ユーザ端末に送信する応答送信部として機能させるためのプログラムである。 The software that realizes the dialogue control device 4 in this embodiment is the following program. That is, this program is a program in a dialogue control device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines, and uses a computer to display user utterance information corresponding to the user's utterance. The receiving unit received from the user terminal, the utterance information output unit capable of outputting the utterance information which is the information corresponding to the user utterance information to one or more dialogue engines, and the utterance information output unit according to the output of the utterance information. The user receives an automatic response reception unit that receives automatic response information from one or more dialogue engines, and one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit. It is a program for functioning as a response transmitter to be transmitted to a terminal.

また、上記プログラムにおいて、コンピュータを、前記２以上の対話エンジンから一部の対話エンジンを選択する選択部としてさらに機能させ、前記発話情報出力部は、前記発話情報を、前記選択部が選択した一部の対話エンジンに出力し、前記自動応答受付部は、前記発話情報の出力に応じて、前記一部の対話エンジンから自動応答情報を受け付け、前記応答送信部は、前記自動応答受付部が受け付けた自動応答情報に対応する一の自動情報を前記ユーザ端末に送信するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, the computer is further functioned as a selection unit for selecting a part of the dialogue engines from the two or more dialogue engines, and the speech information output unit selects the speech information. Output to the dialogue engine of the unit, the automatic response receiving unit receives automatic response information from some of the dialogue engines in response to the output of the spoken information, and the response transmitting unit receives the automatic response receiving unit. It is preferable that the program functions the computer so as to transmit one automatic information corresponding to the automatic response information to the user terminal.

また、上記プログラムにおいて、前記選択部は、前記ユーザ発話情報または前記発話情報を用いて、一の対話エンジンを選択するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, it is preferable that the selection unit is a program for operating a computer by using the user utterance information or the utterance information to select one dialogue engine.

（実施の形態３）
本実施の形態において、複数の対話エンジンの中のいずれかの応答を採用する自動モード、管理者の応答を採用する手動モードが存在する対話制御装置を具備する対話システムについて説明する。 (Embodiment 3)
In the present embodiment, a dialogue system including a dialogue control device having an automatic mode that adopts a response from a plurality of dialogue engines and a manual mode that adopts a response of an administrator will be described.

また、本実施の形態において、モードを自動決定、または自動変更する対話システムについて説明する。なお、本実施の形態において、対話の破綻等を自動検知し、モードを自動変更する。 Further, in the present embodiment, a dialogue system that automatically determines or automatically changes the mode will be described. In the present embodiment, the failure of the dialogue or the like is automatically detected, and the mode is automatically changed.

また、本実施の形態において、管理者からの指示によりモードを変更する対話システムについて説明する。 Further, in the present embodiment, a dialogue system that changes the mode according to an instruction from the administrator will be described.

また、本実施の形態において、自動モードの場合に、対話エンジンからの応答を管理端末に出力する対話システムについて説明する。 Further, in the present embodiment, a dialogue system that outputs a response from the dialogue engine to the management terminal in the case of the automatic mode will be described.

さらに、本実施の形態において、手動モードの場合に、管理者からの応答を対話エンジンに提供する対話システムについて説明する。 Further, in the present embodiment, a dialogue system that provides a response from the administrator to the dialogue engine in the case of the manual mode will be described.

図１３は、本実施の形態における対話システムＣの概念図である。対話システムＡは、１または２以上のユーザ端末１、対話制御装置５、２以上の対話エンジン６、および１または２以上の管理端末７を備える。対話制御装置５は、対話を制御する装置である。対話制御装置５は、例えば、ＡＳＰのサーバ、クラウドサーバ等である。但し、対話制御装置５の種類は問わない。対話エンジン６は、ユーザからの発話の情報を受け付け、応答を返す装置である。対話エンジン６は、例えば、ＡＳＰのサーバ、クラウドサーバ等である。但し、対話エンジン６の種類は問わない。また、対話エンジン６は、発話情報を受け付け、自動応答情報を出力する。対話エンジン６は、公知の対話エンジンでも良い。管理端末７は、例えば、いわゆるパーソナルコンピュータ、タブレット端末、スマートフォン等であり、その種類は問わない。管理端末７は、管理者が使用する端末である。管理者は、対話の管理者であり、手動モードの場合に、ユーザと対話を行う人である。管理者は、対話を支援する人である、と言っても良い。 FIG. 13 is a conceptual diagram of the dialogue system C in the present embodiment. The dialogue system A includes one or more user terminals 1, a dialogue control device 5, two or more dialogue engines 6, and one or more management terminals 7. The dialogue control device 5 is a device that controls dialogue. The dialogue control device 5 is, for example, an ASP server, a cloud server, or the like. However, the type of the dialogue control device 5 does not matter. The dialogue engine 6 is a device that receives information on utterances from the user and returns a response. The dialogue engine 6 is, for example, an ASP server, a cloud server, or the like. However, the type of the dialogue engine 6 does not matter. Further, the dialogue engine 6 receives the utterance information and outputs the automatic response information. The dialogue engine 6 may be a known dialogue engine. The management terminal 7 is, for example, a so-called personal computer, a tablet terminal, a smartphone, or the like, and the type thereof does not matter. The management terminal 7 is a terminal used by the administrator. An administrator is the administrator of a conversation, who interacts with the user in manual mode. It can be said that the manager is a person who supports the dialogue.

また、通常、１以上のユーザ端末１と対話制御装置５とは、インターネット、ＬＡＮ等のネットワークにより通信可能である。また、通常、対話制御装置５と対話エンジン６とは、インターネット、ＬＡＮ等のネットワークにより通信可能である。また、通常、対話制御装置５と１以上の管理端末７とは、インターネット、ＬＡＮ等のネットワークにより通信可能である。ただし、対話制御装置５は、対話エンジン６、管理端末７のうちの１以上の装置を内部に含んでいても良い。つまり、対話制御装置５と対話エンジン６と管理端末７のうちの２以上の装置は、物理的に一の装置で実現されても良い。また、対話制御装置５は、ユーザ端末１、対話エンジン６のうちの１以上の装置を内部に含んでいても良い。つまり、対話制御装置５とユーザ端末１と対話エンジン６のうちの２以上の装置は、物理的に一の装置で実現されても良い。 Further, usually, one or more user terminals 1 and the dialogue control device 5 can communicate with each other via a network such as the Internet or a LAN. Further, normally, the dialogue control device 5 and the dialogue engine 6 can communicate with each other via a network such as the Internet or a LAN. Further, normally, the dialogue control device 5 and one or more management terminals 7 can communicate with each other via a network such as the Internet or a LAN. However, the dialogue control device 5 may include one or more of the dialogue engine 6 and the management terminal 7 inside. That is, two or more of the dialogue control device 5, the dialogue engine 6, and the management terminal 7 may be physically realized by one device. Further, the dialogue control device 5 may include one or more of the user terminal 1 and the dialogue engine 6 inside. That is, two or more of the dialogue control device 5, the user terminal 1, and the dialogue engine 6 may be physically realized by one device.

図１４は、本実施の形態における対話システムＣのブロック図である。また、図１５は、対話制御装置５のブロック図である。 FIG. 14 is a block diagram of the dialogue system C according to the present embodiment. Further, FIG. 15 is a block diagram of the dialogue control device 5.

対話制御装置５は、格納部５１、受信部２２、処理部５３、出力部５４、受付部５５、および送信部５６を備える。格納部５１は、モード格納部５１１を備える。処理部５３は、決定部５３１、判断部５３２、モード変更部５３３、および選択部５３４を備える。出力部５４は、発話情報出力部５４１、自動情報出力部５４２、および手動情報出力部５４３を備える。受付部５５は、自動応答受付部５５１、手動応答受付部５５２、および変更指示受付部５５３を備える。送信部５６は、応答送信部５６１を備える。 The dialogue control device 5 includes a storage unit 51, a reception unit 22, a processing unit 53, an output unit 54, a reception unit 55, and a transmission unit 56. The storage unit 51 includes a mode storage unit 511. The processing unit 53 includes a determination unit 531, a determination unit 532, a mode change unit 533, and a selection unit 534. The output unit 54 includes an utterance information output unit 541, an automatic information output unit 542, and a manual information output unit 543. The reception unit 55 includes an automatic response reception unit 551, a manual response reception unit 552, and a change instruction reception unit 553. The transmission unit 56 includes a response transmission unit 561.

対話エンジン６は、学習情報格納部３１、発話受付部３２、応答取得部３３、応答出力部３４、学習受付部６５、および学習部６６を備える。 The dialogue engine 6 includes a learning information storage unit 31, an utterance reception unit 32, a response acquisition unit 33, a response output unit 34, a learning reception unit 65, and a learning unit 66.

管理端末７は、管理情報受付部７１、管理処理部７２、管理者出力部７３、管理指示受付部７４、および管理出力部７５を備える。 The management terminal 7 includes a management information reception unit 71, a management processing unit 72, an administrator output unit 73, a management instruction reception unit 74, and a management output unit 75.

対話制御装置５を構成する格納部５１は、各種の情報が格納される。各種の情報とは、例えば、後述するモード情報、自動応答情報の選択条件、対話エンジン３の選択条件、優先度情報、対話の履歴である。 Various types of information are stored in the storage unit 51 that constitutes the dialogue control device 5. The various types of information are, for example, mode information described later, selection conditions for automatic response information, selection conditions for the dialogue engine 3, priority information, and dialogue history.

モード格納部５１１は、モード情報が格納される。モード情報は、ユーザとの対話が自動であるか手動であるかを示す情報である。モード情報には、通常、いずれかの対話エンジン６を採用する自動モード、または管理端末７を採用する手動モードがある。つまり、モード情報は、自動モード、手動モードのいずれかを、少なくとも採り得る。ただし、モード情報は、自動モード、手動モード以外のモードを採り得ても良い。自動モード、手動モード以外のモードとは、例えば、動作しない停止モード等である。また、対話エンジン６を採用することは、対話エンジン６からの応答がユーザ端末１に送信されることである。また、管理端末７を採用することは、管理端末７からの応答がユーザ端末１に送信されることである。 Mode information is stored in the mode storage unit 511. The mode information is information indicating whether the interaction with the user is automatic or manual. The mode information usually includes an automatic mode in which any of the dialogue engines 6 is adopted, or a manual mode in which the management terminal 7 is adopted. That is, the mode information can be at least either an automatic mode or a manual mode. However, the mode information may be a mode other than the automatic mode and the manual mode. The modes other than the automatic mode and the manual mode are, for example, a stop mode that does not operate. Further, adopting the dialogue engine 6 means that the response from the dialogue engine 6 is transmitted to the user terminal 1. Further, adopting the management terminal 7 means that the response from the management terminal 7 is transmitted to the user terminal 1.

処理部５３は、各種の処理を行う。各種の処理とは、決定部５３１、判断部５３２、モード変更部５３３、選択部５３４等が行う処理である。 The processing unit 53 performs various processes. The various processes are processes performed by the determination unit 531, the determination unit 532, the mode change unit 533, the selection unit 534, and the like.

各種の処理とは、例えば、ユーザ発話情報から発話情報を取得する処理である。処理部５３は、例えば、ユーザ発話情報の受信に応じて、発話識別子を生成する。そして、処理部５３は、例えば、ユーザ発話情報に発話識別子を付加し、発話情報を構成する。なお、発話識別子は、ユーザ発話情報の識別子であり、ユニークな情報である。また、処理部５３は、例えば、最近に生成した発話識別子を１インクリメントし、ユニークな発話識別子を生成する。 The various processes are, for example, processes for acquiring utterance information from user utterance information. The processing unit 53 generates an utterance identifier in response to receiving user utterance information, for example. Then, the processing unit 53 adds a utterance identifier to the user utterance information, for example, to form the utterance information. The utterance identifier is an identifier of user utterance information and is unique information. Further, the processing unit 53 increments, for example, the recently generated utterance identifier by 1, and generates a unique utterance identifier.

また、処理部５３は、例えば、自動応答情報または手動応答情報から自動情報または手動情報を取得しても良い。処理部５３は、例えば、テキストの自動応答情報からその音声データである自動情報を生成しても良い。処理部５３は、例えば、テキストの手動応答情報からその音声データである手動情報を生成しても良い。また、処理部５３は、例えば、自動応答情報または手動応答情報から、送信するデータ構造の自動情報または送信するデータ構造の手動情報を取得しても良い。かかる場合、処理部５３は、例えば、自動応答情報または手動応答情報のデータ構造を変更し、自動情報または手動情報を取得する。なお、処理部５３は、例えば、自動応答情報と自動情報が同じ情報である場合、単に、受付部５５が受け付けた自動応答情報を取得する。また、処理部５３は、例えば、手動応答情報と手動情報が同じ情報である場合、単に、受付部５５が受け付けた手動応答情報を取得する。 Further, the processing unit 53 may acquire the automatic information or the manual information from the automatic response information or the manual response information, for example. For example, the processing unit 53 may generate automatic information which is the voice data from the automatic response information of the text. The processing unit 53 may generate the manual information which is the voice data from the manual response information of the text, for example. Further, the processing unit 53 may acquire the automatic information of the data structure to be transmitted or the manual information of the data structure to be transmitted from, for example, the automatic response information or the manual response information. In such a case, the processing unit 53 changes the data structure of the automatic response information or the manual response information, for example, and acquires the automatic information or the manual information. In addition, for example, when the automatic response information and the automatic information are the same information, the processing unit 53 simply acquires the automatic response information received by the reception unit 55. Further, for example, when the manual response information and the manual information are the same information, the processing unit 53 simply acquires the manual response information received by the reception unit 55.

決定部５３１は、対話エンジン６を採用するか、管理端末７を採用するかを決定する。対話エンジン６の採用とは、対話エンジン６の応答の採用であり、対話エンジン６の応答である自動応答情報に対する情報である自動情報をユーザ端末１に送信することである。また、管理端末７の採用とは、管理端末７からの応答の採用であり、管理端末７の応答である手動応答情報に対する情報である手動情報をユーザ端末１に送信することである。 The determination unit 531 determines whether to adopt the dialogue engine 6 or the management terminal 7. The adoption of the dialogue engine 6 is the adoption of the response of the dialogue engine 6 and is to transmit the automatic information which is the information for the automatic response information which is the response of the dialogue engine 6 to the user terminal 1. Further, the adoption of the management terminal 7 is the adoption of the response from the management terminal 7, and the manual information which is the information for the manual response information which is the response of the management terminal 7 is transmitted to the user terminal 1.

決定部５３１は、モード格納部５１１のモード情報を取得する。そして、決定部５３１は、モード格納部５１１のモード情報に従って、対話エンジン６を採用するか、管理端末７を採用するかを決定する。このモード情報は、例えば、対話エンジン６を採用することを示す自動モード、または管理端末７を採用することを示す手動モードである。 The determination unit 531 acquires the mode information of the mode storage unit 511. Then, the determination unit 531 determines whether to adopt the dialogue engine 6 or the management terminal 7 according to the mode information of the mode storage unit 511. This mode information is, for example, an automatic mode indicating that the dialogue engine 6 is adopted, or a manual mode indicating that the management terminal 7 is adopted.

なお、決定部５３１の上記決定の処理は、モード格納部５１１のモード情報の読み出しでも良い。 The determination process of the determination unit 531 may be performed by reading the mode information of the mode storage unit 511.

また、決定部５３１は、発話情報の出力先を決定しても良い。決定部５３１は、例えば、モード格納部５１１のモード情報に従って、発話情報の出力先を対話エンジン６とするか、管理端末７とするかを決定する。また、決定部５３１は、発話情報の出力先を、常に、対話エンジン６と管理端末７との両方にする、としても良い。 Further, the determination unit 531 may determine the output destination of the utterance information. The determination unit 531 determines, for example, whether the output destination of the utterance information is the dialogue engine 6 or the management terminal 7 according to the mode information of the mode storage unit 511. Further, the determination unit 531 may always set the output destination of the utterance information to both the dialogue engine 6 and the management terminal 7.

判断部５３２は、予め決められた条件を満たすか否かを判断する。ここで、予め決められた条件とは、モード情報を変更するための条件である。モード情報を変更するための条件は、自動モードから手動モードに変更するための第一の条件、手動モードから自動モードに変更するための第二の条件がある。 The determination unit 532 determines whether or not a predetermined condition is satisfied. Here, the predetermined condition is a condition for changing the mode information. The conditions for changing the mode information include the first condition for changing from the automatic mode to the manual mode and the second condition for changing from the manual mode to the automatic mode.

第一の条件は、例えば、対話が破綻していることを特定するための条件、管理端末７からの指示を受け付けたこと等である。対話が破綻していることを特定するための条件は、例えば、対話エンジン６からの応答である１または２以上の自動応答情報を用いて対話の破綻を検出するための条件、ユーザ端末１からの１または２以上のユーザ発話情報を用いて対話の破綻を検出するための条件、管理端末７からの入力に基づいて対話の破綻を検出するための条件である。対話が破綻していることを特定するための条件は、例えば、対話エンジン６からの応答である自動応答情報に予め決められた用語（例えば、「理解できなかった」「理解不能」）を含むこと、予め決められた用語を含む自動応答情報が予め決められた期間内に閾値以上または閾値より多い回数受け付けられたこと、受信された２以上の連続するユーザ発話情報が同一または閾値以上の類似度であること、ユーザ発話情報に予め決められた用語（例えば、「何言っているのか」「駄目だな」）を含むこと等である。また、管理端末７からの指示とは、例えば、モード情報を変更する指示、モード情報を指定する指示である。なお、対話の破綻とは、ユーザと対話エンジン６との対話の破綻である。また、予め決められた条件は、例えば、格納部５１に格納されている。 The first condition is, for example, a condition for identifying that the dialogue is broken, an instruction from the management terminal 7 is accepted, and the like. The condition for identifying that the dialogue is broken is, for example, the condition for detecting the breakdown of the dialogue by using one or more automatic response information which is the response from the dialogue engine 6, from the user terminal 1. It is a condition for detecting the failure of the dialogue using one or more user speech information of the above, and a condition for detecting the failure of the dialogue based on the input from the management terminal 7. The conditions for identifying that the dialogue is broken include, for example, predetermined terms (for example, "incomprehensible" and "incomprehensible") in the automatic response information which is the response from the dialogue engine 6. That, the automatic response information including the predetermined term is received more than the threshold or more times than the threshold within the predetermined period, and the received two or more consecutive user utterance information are the same or similar to the threshold or more. It means that the user's utterance information includes predetermined terms (for example, "what are you saying" and "no good"). Further, the instructions from the management terminal 7 are, for example, an instruction to change the mode information and an instruction to specify the mode information. The failure of the dialogue is the failure of the dialogue between the user and the dialogue engine 6. Further, the predetermined conditions are stored in, for example, the storage unit 51.

第二の条件は、例えば、管理端末７からの応答を閾値以上の回数受け付けたこと、管理端末７からの指示を受け付けたこと等である。 The second condition is, for example, that the response from the management terminal 7 has been received a number of times equal to or greater than the threshold value, that the instruction from the management terminal 7 has been received, and the like.

モード変更部５３３は、判断部５３２が予め決められた条件を満たすと判断した場合に、モード情報を変更する。モード情報の変更とは、通常、モード情報をモード格納部２１１に書き込むことである。モード変更部５３３は、変更指示に応じて、モード格納部５１１のモード情報を変更する。 The mode change unit 533 changes the mode information when the determination unit 532 determines that the predetermined condition is satisfied. Changing the mode information is usually writing the mode information to the mode storage unit 211. The mode change unit 533 changes the mode information of the mode storage unit 511 in response to the change instruction.

選択部５３４は、例えば、２以上の自動応答情報から一の自動応答情報を選択する。選択部５３４は、２以上の自動応答情報から、予め決められた条件に合致する一の自動応答情報を選択する。選択部５３４は、２以上の自動応答情報のうちの予め決められた条件を満たす自動応答情報の中で、優先度情報に基づく優先度が最も高い対話エンジン６から出力された自動応答情報を選択する。選択部５３４は、例えば、２以上の対話エンジン６から一の対話エンジン６を選択する。選択部５３４は、ユーザ発話情報または発話情報を用いて、一の対話エンジン６を選択する。選択部５３４は、選択部２３１または選択部４３１と同じ処理でも良い。 The selection unit 534 selects one automatic response information from, for example, two or more automatic response information. The selection unit 534 selects one automatic response information that matches a predetermined condition from two or more automatic response information. The selection unit 534 selects the automatic response information output from the dialogue engine 6 having the highest priority based on the priority information among the automatic response information satisfying the predetermined conditions among the two or more automatic response information. To do. The selection unit 534 selects, for example, one dialogue engine 6 from two or more dialogue engines 6. The selection unit 534 selects one dialogue engine 6 by using the user utterance information or the utterance information. The selection unit 534 may perform the same processing as the selection unit 231 or the selection unit 431.

出力部５４は、各種の情報を出力する。ここでの出力とは、通常、送信である。ただし、出力とは、情報が出力先に渡れば良い。 The output unit 54 outputs various information. The output here is usually a transmission. However, the output means that the information should be passed to the output destination.

発話情報出力部５４１は、ユーザ発話情報に対応する情報である発話情報を、対話エンジン６または管理端末７のうちの少なくともどちらか一方に出力する。発話情報出力部５４１は、発話情報を、モード情報に関係なく、対話エンジン６および管理端末７に出力しても良い。ここで、発話情報は、ユーザ発話情報と同じ情報でも良い。また、発話情報は、ユーザ発話情報を音声認識し、得られたテキストでも良い。発話情報は、発話識別子が付加された情報であることは好適である。 The utterance information output unit 541 outputs utterance information, which is information corresponding to the user utterance information, to at least one of the dialogue engine 6 and the management terminal 7. The utterance information output unit 541 may output the utterance information to the dialogue engine 6 and the management terminal 7 regardless of the mode information. Here, the utterance information may be the same information as the user utterance information. Further, the utterance information may be a text obtained by voice-recognizing the user utterance information. It is preferable that the utterance information is information to which the utterance identifier is added.

発話情報出力部５４１は、上記のように、発話情報を管理端末７に対して出力し得る。決定部５３１が管理端末７を採用すると決定した場合、発話情報出力部５４１は、発話情報を管理端末７に対して出力する。なお、決定部５３１が管理端末７を採用すると決定した場合でも、１以上の対話エンジン６に発話情報を出力しても良い。 The utterance information output unit 541 can output the utterance information to the management terminal 7 as described above. When the determination unit 531 decides to adopt the management terminal 7, the utterance information output unit 541 outputs the utterance information to the management terminal 7. Even when the determination unit 531 decides to adopt the management terminal 7, the utterance information may be output to one or more dialogue engines 6.

発話情報出力部５４１は、ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジン６に対して出力し得る。発話情報出力部５４１は、発話情報を、２以上の対話エンジン６に出力しても良い。発話情報出力部５４１は、発話情報を、選択部５３４が選択した一の対話エンジン６に出力しても良い。 The utterance information output unit 541 can output utterance information, which is information corresponding to the user utterance information, to one or more dialogue engines 6. The utterance information output unit 541 may output the utterance information to two or more dialogue engines 6. The utterance information output unit 541 may output the utterance information to one dialogue engine 6 selected by the selection unit 534.

自動情報出力部５４２は、一の自動情報を管理端末７に出力する。自動情報出力部５４２は、自動応答受付部５５１が受け付けた自動応答情報に対応する自動情報を管理端末７に出力する。 The automatic information output unit 542 outputs one automatic information to the management terminal 7. The automatic information output unit 542 outputs the automatic information corresponding to the automatic response information received by the automatic response reception unit 551 to the management terminal 7.

手動情報出力部５４３は、手動情報を１または２以上の対話エンジン６に出力する。手動情報出力部５４３は、手動応答受付部５５２が受け付けた手動応答情報に対応する手動情報を１以上の対話エンジン６に出力する。 The manual information output unit 543 outputs the manual information to one or more dialogue engines 6. The manual information output unit 543 outputs the manual information corresponding to the manual response information received by the manual response reception unit 552 to one or more dialogue engines 6.

受付部５５は、各種の情報や指示等を受け付ける。ここでの受け付けとは、通常、受信である。但し、受け付けは、情報等を受け取れれば良い。 The reception unit 55 receives various information, instructions, and the like. Acceptance here is usually reception. However, the reception may be as long as the information etc. is received.

自動応答受付部５５１は、発話情報の１以上の対話エンジン６への出力に応じて、１以上の各対話エンジン６からの応答である自動応答情報を受け付ける。かかる受け付けは、例えば、受信である。 The automatic response reception unit 551 receives automatic response information which is a response from each of the one or more dialogue engines 6 in response to the output of the utterance information to one or more dialogue engines 6. Such reception is, for example, reception.

手動応答受付部５５２は、発話情報の管理端末７への出力に応じて、管理端末７からの応答である手動応答情報を受け付ける。かかる受け付けは、例えば、受信である。 The manual response reception unit 552 receives the manual response information, which is a response from the management terminal 7, in response to the output of the utterance information to the management terminal 7. Such reception is, for example, reception.

変更指示受付部５５３は、管理端末７からモードの変更の指示である変更指示を受け付ける。かかる受け付けは、例えば、受信である。また、この変更指示は、自動モードから手動モードへの変更でも良いし、手動モードから自動モードへの変更でも良い。 The change instruction receiving unit 553 receives a change instruction, which is an instruction for changing the mode, from the management terminal 7. Such reception is, for example, reception. Further, this change instruction may be a change from the automatic mode to the manual mode, or may be a change from the manual mode to the automatic mode.

送信部５６は、各種の情報を送信する。各種の情報とは、例えば、手動情報、自動情報等である。 The transmission unit 56 transmits various types of information. The various types of information include, for example, manual information, automatic information, and the like.

応答送信部５６１は、自動応答情報に対応する情報である自動情報または手動応答情報に対応する情報である手動情報をユーザ端末１に送信する。 The response transmission unit 561 transmits the automatic information which is the information corresponding to the automatic response information or the manual information which is the information corresponding to the manual response information to the user terminal 1.

応答送信部５６１は、決定部５３１の決定が対話エンジン６を採用するとの決定の場合は、自動情報をユーザ端末１に送信しても良い。また、応答送信部５６１は、決定部５３１の決定が管理端末７を採用するとの決定の場合は、手動情報をユーザ端末１に送信しても良い。ここで、自動情報は、自動応答情報と同じ情報でも良い。自動情報と自動応答情報の内容は、通常、同じである。自動情報は、例えば、自動応答情報を音声に変換した音声データである。手動情報は、手動応答情報と同じ情報でも良い。手動情報と手動応答情報の内容は、通常、同じである。手動情報は、例えば、手動応答情報を音声に変換した音声データである。 The response transmission unit 561 may transmit automatic information to the user terminal 1 when the decision of the determination unit 531 determines that the dialogue engine 6 is adopted. Further, the response transmission unit 561 may transmit manual information to the user terminal 1 when the decision of the determination unit 531 determines that the management terminal 7 is adopted. Here, the automatic information may be the same information as the automatic response information. The contents of the automatic information and the automatic response information are usually the same. The automatic information is, for example, voice data obtained by converting automatic response information into voice. The manual information may be the same information as the manual response information. The contents of the manual information and the manual response information are usually the same. The manual information is, for example, voice data obtained by converting manual response information into voice.

応答送信部５６１は、自動応答受付部５５１が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報をユーザ端末１に送信しても良い。応答送信部５６１は、選択部５３４が選択した自動応答情報に対応する一の自動情報をユーザ端末１に送信しても良い。応答送信部５６１は、自動応答受付部５５１が受け付けた自動応答情報に対応する一の自動情報をユーザ端末１に送信しても良い。 The response transmission unit 561 may transmit one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit 551 to the user terminal 1. The response transmission unit 561 may transmit one automatic information corresponding to the automatic response information selected by the selection unit 534 to the user terminal 1. The response transmission unit 561 may transmit one automatic information corresponding to the automatic response information received by the automatic response reception unit 551 to the user terminal 1.

対話エンジン６を構成する学習受付部６５は、管理端末７からの応答である手動応答情報を受け付ける。また、学習受付部６５は、他の対話エンジン６からの応答である自動応答情報を受け付ける。ここでの手動応答情報または自動応答情報の受け付けは、手動情報または自動情報の受け付けでも良い趣旨である。 The learning reception unit 65 constituting the dialogue engine 6 receives the manual response information which is the response from the management terminal 7. Further, the learning reception unit 65 receives automatic response information which is a response from another dialogue engine 6. The acceptance of the manual response information or the automatic response information here is intended to accept the manual information or the automatic information.

学習部６６は、学習受付部３５が受け付けた手動応答情報を用いて、学習処理を行う。また、学習部６６は、学習受付部３５が受け付けた自動応答情報を用いて、学習処理を行う。自動応答情報を用いた学習処理については、上述した。 The learning unit 66 performs learning processing using the manual response information received by the learning reception unit 35. In addition, the learning unit 66 performs learning processing using the automatic response information received by the learning reception unit 35. The learning process using the automatic response information has been described above.

学習処理は、例えば、手動応答情報を用いて、学習情報格納部３１の学習情報に情報を付加することである。学習処理は、例えば、手動応答情報を用いて、学習情報格納部３１の学習情報を変更することである。学習情報に情報を付加する処理は、学習情報格納部３１に新しい学習情報を追記する処理を含む。 The learning process is, for example, adding information to the learning information of the learning information storage unit 31 by using the manual response information. The learning process is, for example, changing the learning information of the learning information storage unit 31 by using the manual response information. The process of adding information to the learning information includes a process of adding new learning information to the learning information storage unit 31.

学習部６６は、例えば、手動応答情報と当該手動応答情報に対応する発話情報の組である学習情報を学習情報格納部３１に追記する。 The learning unit 66 adds, for example, learning information, which is a set of manual response information and utterance information corresponding to the manual response information, to the learning information storage unit 31.

学習部６６は、例えば、手動応答情報と当該手動応答情報に対応する発話情報の組の情報から、状態遷移情報である学習情報に、当該発話情報の前の自動応答情報のノード（状態）の先の状態として、発話情報のノードを付加し、当該発話情報のノードに、手動応答情報のノードを付加する。 For example, the learning unit 66 converts the information of the set of the manual response information and the utterance information corresponding to the manual response information into the learning information which is the state transition information, and the node (state) of the automatic response information before the utterance information. As the previous state, the utterance information node is added, and the manual response information node is added to the utterance information node.

学習部６６は、例えば、手動応答情報から手動応答情報のパターンを構成し、かつ当該手動応答情報に対応する発話情報から発話情報のパターンを構成し、発話情報のパターンと応答のパターンとの組である学習情報を学習情報格納部３１に追記する。なお、応答のパターンは、自動応答情報のパターンとなる。 For example, the learning unit 66 constitutes a pattern of manual response information from manual response information, and constitutes a pattern of utterance information from utterance information corresponding to the manual response information, and sets a pattern of utterance information and a pattern of response. The learning information is added to the learning information storage unit 31. The response pattern is the pattern of automatic response information.

学習部６６は、例えば、手動応答情報から手動応答情報のパターンを構成し、かつ当該手動応答情報に対応する発話情報から発話情報のパターンを構成し、当該パターンの組に合致する、発話情報のパターンと自動応答情報のパターンの組の出現頻度または出現確率を増加させるように、学習情報格納部３１の学習情報を更新する。かかる場合、学習情報は、発話情報のパターンと自動応答情報のパターンの組、および出現頻度または出現確率を有する。 The learning unit 66, for example, constructs a pattern of manual response information from manual response information, and configures a pattern of utterance information from utterance information corresponding to the manual response information, and constitutes a pattern of utterance information that matches the set of the patterns. The learning information of the learning information storage unit 31 is updated so as to increase the appearance frequency or appearance probability of the pattern and the pattern set of the automatic response information. In such a case, the learning information has a set of a pattern of utterance information and a pattern of automatic response information, and an appearance frequency or appearance probability.

なお、学習部６６は、例えば、発話情報に対応する発話識別子と手動応答情報に対応する発話識別子とが同一の、発話情報と手動応答情報とを用いて、上記の学習処理を行う。 The learning unit 66 performs the above learning process using, for example, the utterance information and the manual response information in which the utterance identifier corresponding to the utterance information and the utterance identifier corresponding to the manual response information are the same.

また、学習部６６が行う手動応答情報を用いた学習処理は種々あり得る。手動応答情報または他の対話エンジン６からの自動応答情報を用いる点を除いて、学習部６６が行う学習処理は他の公知の学習処理のアルゴリズムを用いても良い。 In addition, there may be various learning processes using the manual response information performed by the learning unit 66. The learning process performed by the learning unit 66 may use another known learning processing algorithm, except that the manual response information or the automatic response information from another dialogue engine 6 is used.

管理端末７を構成する管理格納部７０は、各種の情報が格納される。各種の情報は、例えば、モード情報、発話情報、自動情報、手動応答情報等である。 Various information is stored in the management storage unit 70 that constitutes the management terminal 7. Various types of information include, for example, mode information, utterance information, automatic information, manual response information, and the like.

管理情報受付部７１は、管理者からの手動応答情報を受け付ける。ここでの受け付けは、音声による受け付けでも良いし、キーボードやマウス等の入力手段からの受け付けでも良い。 The management information reception unit 71 receives manual response information from the administrator. The reception here may be reception by voice or reception from an input means such as a keyboard or a mouse.

管理情報受付部７１は、管理者からの手動応答情報を音声で受け付けることは好適である。また、管理情報受付部７１は、管理者からの手動応答情報をキーボード、マウス、タッチパネル、メニュー画面によるもの等から受け付けても良い。管理情報受付部７１は、一の手動応答情報を、音声で受け付けたり、キーボード等の入力手段により受け付けたりすることは好適である。つまり、管理情報受付部７１は、音声による受け付け手段と、キーボード、マウス、タッチパネル、またはメニュー画面等の入力手段との両方を具備し、一の手動応答情報をいずれかの手段により受け付けることは好適である。 It is preferable that the management information receiving unit 71 receives the manual response information from the administrator by voice. Further, the management information reception unit 71 may receive manual response information from the administrator from a keyboard, a mouse, a touch panel, a menu screen, or the like. It is preferable that the management information reception unit 71 receives one manual response information by voice or by an input means such as a keyboard. That is, it is preferable that the management information receiving unit 71 includes both a voice receiving means and an input means such as a keyboard, a mouse, a touch panel, or a menu screen, and receives one manual response information by any means. Is.

管理情報受付部７１は、自動情報を対話制御装置５から受け付ける。また、管理情報受付部７１は、発話情報を対話制御装置５から受け付ける。なお、ここでの受け付けは、通常、受信であるが、情報が管理端末７に渡る処理であれば良い。 The management information reception unit 71 receives automatic information from the dialogue control device 5. Further, the management information reception unit 71 receives the utterance information from the dialogue control device 5. The reception here is usually reception, but any process may be used as long as the information is passed to the management terminal 7.

管理処理部７２は、各種の処理を行う。各種の処理とは、例えば、管理情報受付部７１が受け付けた自動情報または発話情報を出力するデータ構造にしたり、かかる情報の出力画面を構成したりする処理である。 The management processing unit 72 performs various processes. The various processes are, for example, a process of forming a data structure for outputting automatic information or utterance information received by the management information receiving unit 71, or a process of configuring an output screen of such information.

管理者出力部７３は、各種の処理を行う。各種の処理とは、例えば、管理情報受付部７１が受け付けた自動情報または発話情報を出力するデータ構造にしたり、かかる情報の出力画面を構成したりする処理である。 The administrator output unit 73 performs various processes. The various processes are, for example, a process of forming a data structure for outputting automatic information or utterance information received by the management information receiving unit 71, or a process of configuring an output screen of such information.

管理指示受付部７４は、管理者から各種の指示を受け付ける。各種の指示とは、例えば、モードの変更の指示である変更指示等である。各種の指示の入力手段は、キーボードやタッチパネルやマウスやメニュー画面によるもの等、何でも良い。管理指示受付部７４は、キーボード等の入力手段のデバイスドライバーや、メニュー画面の制御ソフトウェア等で実現され得る。 The management instruction receiving unit 74 receives various instructions from the administrator. The various instructions are, for example, change instructions that are instructions for changing the mode. The input means for various instructions may be any means such as a keyboard, a touch panel, a mouse, and a menu screen. The management instruction receiving unit 74 can be realized by a device driver of an input means such as a keyboard, control software of a menu screen, or the like.

管理出力部７５は、各種の情報や指示を対話制御装置５に出力する。各種の情報や指示は、例えば、変更指示、モード情報等である。なお、ここでの出力は、通常、送信であるが、情報等が対話制御装置２に渡れば良い。 The management output unit 75 outputs various information and instructions to the dialogue control device 5. Various types of information and instructions are, for example, change instructions, mode information, and the like. The output here is usually transmission, but information or the like may be passed to the dialogue control device 2.

格納部５１、モード格納部５１１、および管理格納部７０は、不揮発性の記録媒体が好適であるが、揮発性の記録媒体でも実現可能である。 The storage unit 51, the mode storage unit 511, and the management storage unit 70 are preferably non-volatile recording media, but can also be realized by volatile recording media.

格納部５１等に情報が記憶される過程は問わない。例えば、記録媒体を介して情報が格納部５１等で記憶されるようになってもよく、通信回線等を介して送信された情報が格納部５１等で記憶されるようになってもよく、あるいは、入力デバイスを介して入力された情報が格納部５１で記憶されるようになってもよい。 The process of storing information in the storage unit 51 or the like does not matter. For example, the information may be stored in the storage unit 51 or the like via the recording medium, or the information transmitted via the communication line or the like may be stored in the storage unit 51 or the like. Alternatively, the information input via the input device may be stored in the storage unit 51.

処理部５３、決定部５３１、判断部５３２、モード変更部５３３、選択部５３４、応答取得部６３、学習部６６、および管理処理部７２は、通常、ＭＰＵやメモリ等から実現され得る。処理部５３の処理手順は、通常、ソフトウェアで実現され、当該ソフトウェアはＲＯＭ等の記録媒体に記録されている。但し、ハードウェア（専用回路）で実現しても良い。 The processing unit 53, the determination unit 531, the determination unit 532, the mode change unit 533, the selection unit 534, the response acquisition unit 63, the learning unit 66, and the management processing unit 72 can usually be realized from an MPU, a memory, or the like. The processing procedure of the processing unit 53 is usually realized by software, and the software is recorded in a recording medium such as a ROM. However, it may be realized by hardware (dedicated circuit).

出力部５４、発話情報出力部５４１、自動情報出力部５４２、手動情報出力部５４３、送信部５６、応答送信部５６１、応答出力部６４、および管理出力部７５は、通常、無線または有線の通信手段で実現されるが、放送手段等で実現されても良い。 The output unit 54, the utterance information output unit 541, the automatic information output unit 542, the manual information output unit 543, the transmission unit 56, the response transmission unit 561, the response output unit 64, and the management output unit 75 are usually wireless or wired communication. It is realized by means, but it may be realized by broadcasting means or the like.

なお、出力部５４、発話情報出力部５４１、自動情報出力部５４２、手動情報出力部５４３、および応答出力部６４は、情報を出力する出力手段で実現されても良い。ここでの情報の出力とは、例えば、プログラムによる情報の引き渡しである。 The output unit 54, the utterance information output unit 541, the automatic information output unit 542, the manual information output unit 543, and the response output unit 64 may be realized by output means for outputting information. The output of information here is, for example, the delivery of information by a program.

受付部５５、自動応答受付部５５１、手動応答受付部５５２、変更指示受付部５５３、発話受付部６２、学習受付部６５、および管理情報受付部７１は、通常、無線または有線の通信手段で実現されるが、放送受信手段等で実現されても良い。 The reception unit 55, the automatic response reception unit 551, the manual response reception unit 552, the change instruction reception unit 553, the utterance reception unit 62, the learning reception unit 65, and the management information reception unit 71 are usually realized by wireless or wired communication means. However, it may be realized by a broadcast receiving means or the like.

なお、受付部５５、自動応答受付部５５１、手動応答受付部５５２、変更指示受付部５５３、発話受付部６２、学習受付部６５、および管理情報受付部７１は、情報を受け付ける受付手段で実現されても良い。ここでの情報の受け付けとは、例えば、プログラムによる情報の引き渡しである。 The reception unit 55, the automatic response reception unit 551, the manual response reception unit 552, the change instruction reception unit 553, the utterance reception unit 62, the learning reception unit 65, and the management information reception unit 71 are realized by the reception means for receiving information. You may. The acceptance of information here is, for example, the delivery of information by a program.

管理者出力部７３は、ディスプレイやスピーカー等の出力デバイスを含むと考えても含まないと考えても良い。管理者出力部７３は、出力デバイスのドライバーソフトまたは、出力デバイスのドライバーソフトと出力デバイス等で実現され得る。 The administrator output unit 73 may or may not include an output device such as a display or a speaker. The administrator output unit 73 can be realized by the driver software of the output device, the driver software of the output device, the output device, or the like.

次に、対話システムＣの動作について説明する。まず、対話制御装置５の動作について、図１６のフローチャートを用いて説明する。図１６のフローチャートにおいて、図４のフローチャートと同一のステップについて、その説明を省略する。 Next, the operation of the dialogue system C will be described. First, the operation of the dialogue control device 5 will be described with reference to the flowchart of FIG. In the flowchart of FIG. 16, the description of the same steps as the flowchart of FIG. 4 will be omitted.

（ステップＳ１６０１）決定部５３１または／および選択部５３４は、発話情報の出力先を決定する。決定部５３１は、例えば、モード格納部２１１のモード情報に従って、出力先を対話エンジン３とするか、管理端末７とするかを決定する。なお、決定部５３１は、発話情報の出力先を、常に、１以上の対話エンジン６と管理端末７との両方にする、としても良い。また、モード情報が自動モードの場合、選択部５３４は、２以上の対話エンジン６のうちの１以上の対話エンジン６を選択する。かかる選択方法については、実施の形態１、２で記載したいずれかの方法が使用できる。なお、選択部５３４は、２以上のすべての対話エンジン６を発話情報の出力先として決定しても良い。 (Step S1601) The determination unit 531 and / and the selection unit 534 determine the output destination of the utterance information. The determination unit 531 determines, for example, whether the output destination is the dialogue engine 3 or the management terminal 7 according to the mode information of the mode storage unit 211. The determination unit 531 may always set the output destination of the utterance information to both one or more dialogue engines 6 and the management terminal 7. When the mode information is the automatic mode, the selection unit 534 selects one or more dialogue engines 6 out of two or more dialogue engines 6. As such a selection method, any of the methods described in the first and second embodiments can be used. The selection unit 534 may determine all two or more dialogue engines 6 as output destinations of utterance information.

（ステップＳ１６０２）発話情報出力部２４１は、ステップＳ１６０１で決定された出力先に、ステップＳ４０２で取得された発話情報を出力する。 (Step S1602) The utterance information output unit 241 outputs the utterance information acquired in step S402 to the output destination determined in step S1601.

（ステップＳ１６０３）受付部５５は、発話情報の出力に応じて、１以上の自動応答情報または手動応答情報を受け付ける。なお、受付部５５は自動応答情報または手動応答情報を受け付けるまでウエイトする。 (Step S1603) The reception unit 55 receives one or more automatic response information or manual response information according to the output of the utterance information. The reception unit 55 waits until it receives the automatic response information or the manual response information.

（ステップＳ１６０４）選択部５３４は、複数の自動応答情報から一の自動応答情報を選択するか否かを判断する。一の自動応答情報を選択する場合はステップＳ１６０５に行き、一の自動応答情報を選択しない場合はステップＳ１６０６に行く。ここで、選択部５３４は、例えば、ステップＳ１６０１で２以上の対話エンジン３を選択したか否かを判断しても良い。選択部５３４は、例えば、２以上の自動応答情報が受け付けられたか否かを判断しても良い。 (Step S1604) The selection unit 534 determines whether or not to select one automatic response information from the plurality of automatic response information. If one automatic response information is selected, the process goes to step S1605. If one automatic response information is not selected, the process goes to step S1606. Here, the selection unit 534 may determine, for example, whether or not two or more dialogue engines 3 have been selected in step S1601. The selection unit 534 may determine, for example, whether or not two or more automatic response information has been accepted.

（ステップＳ１６０５）選択部５３４は、２以上の自動応答情報から一の自動応答情報を選択する。 (Step S1605) The selection unit 534 selects one automatic response information from two or more automatic response information.

（ステップＳ１６０６）処理部５３は、受け付けられた自動応答情報または手動応答情報から、自動情報または手動情報を取得する。なお、自動応答情報および手動応答情報の両方が受け付けられた場合、処理部２３は、自動応答情報および動応答情報から、自動情報および手動情報を取得する。 (Step S1606) The processing unit 53 acquires the automatic information or the manual information from the received automatic response information or the manual response information. When both the automatic response information and the manual response information are received, the processing unit 23 acquires the automatic information and the manual information from the automatic response information and the dynamic response information.

（ステップＳ１６０７）処理部５３は、自動情報を管理端末７に出力するか否かを判断する。管理端末７に出力する場合はステップＳ１６０８に行き、出力しない場合はステップＳ１６０９に行く。なお、例えば、自動情報を管理端末７に出力するか否かを示すフラグが格納部５１で管理されており、処理部５３は、かかるフラグを格納部５１から読み出し、判断する。また、処理部５３は、常に、自動情報を管理端末７に出力しても良いし、常に、自動情報を管理端末７に出力しなくても良い。 (Step S1607) The processing unit 53 determines whether or not to output the automatic information to the management terminal 7. If it is output to the management terminal 7, it goes to step S1608, and if it is not output, it goes to step S1609. For example, a flag indicating whether or not to output automatic information to the management terminal 7 is managed by the storage unit 51, and the processing unit 53 reads the flag from the storage unit 51 and determines. Further, the processing unit 53 may always output the automatic information to the management terminal 7, or may not always output the automatic information to the management terminal 7.

（ステップＳ１６０８）自動情報出力部２４２は、ステップＳ１６０６で取得された自動情報を管理端末７に出力する。 (Step S1608) The automatic information output unit 242 outputs the automatic information acquired in step S1606 to the management terminal 7.

（ステップＳ１６０９）処理部５３は、手動情報を対話エンジン６に出力するか否かを判断する。対話エンジン６に出力する場合はステップＳ１６１０に行き、出力しない場合はステップＳ１６１１に行く。なお、例えば、手動情報を対話エンジン６に出力するか否かを示すフラグが格納部５１で管理されており、処理部５３は、かかるフラグを格納部５１から読み出し、判断する。また、処理部５３は、常に、手動情報を対話エンジン６出力しても良いし、常に、手動情報を対話エンジン６に出力しなくても良い。 (Step S1609) The processing unit 53 determines whether or not to output the manual information to the dialogue engine 6. If it is output to the dialogue engine 6, it goes to step S1610, and if it is not output, it goes to step S1611. For example, a flag indicating whether or not to output manual information to the dialogue engine 6 is managed by the storage unit 51, and the processing unit 53 reads such a flag from the storage unit 51 and determines. Further, the processing unit 53 may always output the manual information to the dialogue engine 6, or may not always output the manual information to the dialogue engine 6.

（ステップＳ１６１０）手動情報出力部５４３は、ステップＳ１６０６で取得された手動情報を１以上の対話エンジン６に出力する。なお、手動情報出力部５４３は、対話制御装置５が利用可能な複数のすべての対話エンジン６に手動情報を出力しても良いし、一部の対話エンジン６にのみ手動情報を出力しても良い。なお、手動情報の出力先となる対話エンジン６は学習機能を有する対話エンジンであることは好適である。 (Step S1610) The manual information output unit 543 outputs the manual information acquired in step S1606 to one or more dialogue engines 6. The manual information output unit 543 may output the manual information to all the plurality of dialogue engines 6 in which the dialogue control device 5 can be used, or may output the manual information to only some of the dialogue engines 6. good. It is preferable that the dialogue engine 6 to which the manual information is output is a dialogue engine having a learning function.

（ステップＳ１６１１）決定部５３１は、対話エンジン６を採用するか、管理端末７を採用するかを決定する。そして、決定部５３１は、対話エンジン６を採用するとの判断である場合は自動情報を取得し、管理端末７を採用するとの判断である場合は手動情報を取得する。 (Step S1611) The determination unit 531 determines whether to adopt the dialogue engine 6 or the management terminal 7. Then, the determination unit 531 acquires the automatic information when it is determined that the dialogue engine 6 is adopted, and acquires the manual information when it is determined that the management terminal 7 is adopted.

（ステップＳ１６１２）応答送信部５６１は、ステップＳ１６１１で取得された自動情報または手動情報をユーザ端末１に送信する。 (Step S1612) The response transmission unit 561 transmits the automatic information or manual information acquired in step S1611 to the user terminal 1.

（ステップＳ１６１３）判断部５３２は、モード情報を変更するための条件を満たすか否かを判断する。 (Step S1613) The determination unit 532 determines whether or not the condition for changing the mode information is satisfied.

（ステップＳ１６１４）ステップＳ１６１３における判断結果が、条件を満たすとの判断結果である場合はステップＳ１６１５に行き、条件を満たさないとの判断結果である場合はステップＳ１６０１に戻る。 (Step S1614) If the determination result in step S1613 is the determination result that the condition is satisfied, the process goes to step S1615, and if the determination result is that the condition is not satisfied, the process returns to step S1601.

（ステップＳ１６１５）モード変更部５３３は、モード格納部５１１のモード情報を変更する。ステップＳ１６０１に戻る。 (Step S1615) The mode change unit 533 changes the mode information of the mode storage unit 511. Return to step S1601.

（ステップＳ１６１６）処理部５３は、ステップＳ１６１５における変更が、手動モードへの変更であるか否かを判断する。手動モードへの変更である場合はステップＳ１６１７に行き、手動モードへの変更でない場合はステップＳ１６１８に行く。 (Step S1616) The processing unit 53 determines whether or not the change in step S1615 is a change to the manual mode. If it is a change to the manual mode, it goes to step S1617, and if it is not a change to the manual mode, it goes to step S1618.

（ステップＳ１６１７）出力部５４は、最新の発話情報を管理端末７に出力する。ステップＳ１６０５に戻る。 (Step S1617) The output unit 54 outputs the latest utterance information to the management terminal 7. Return to step S1605.

（ステップＳ１６１８）出力部５４は、最新の発話情報を対話エンジン６に出力する。ステップＳ１６０５に戻る。 (Step S1618) The output unit 54 outputs the latest utterance information to the dialogue engine 6. Return to step S1605.

（ステップＳ１６１９）変更指示受付部５５３は、管理端末７から変更指示を受け付けたか否かを判断する。変更指示を受け付けた場合はステップＳ１６２０に行き、変更指示を受け付けない場合はステップＳ１６０１に戻る。 (Step S1619) The change instruction receiving unit 553 determines whether or not the change instruction has been received from the management terminal 7. If the change instruction is accepted, the process proceeds to step S1620, and if the change instruction is not accepted, the process returns to step S1601.

（ステップＳ１６２０）モード変更部５３３は、モード格納部５１１のモード情報を変更する。ステップＳ１６０１に戻る。 (Step S1620) The mode change unit 533 changes the mode information of the mode storage unit 511. Return to step S1601.

なお、図１６のフローチャートにおいて、ステップＳ１６１３の判断部５３２の判断処理のタイミングは問わない。 In the flowchart of FIG. 16, the timing of the determination process of the determination unit 532 in step S1613 does not matter.

また、図１６のフローチャートにおいて、自動情報や手動情報の送信の順序や、送信するか否かの判断の順序等の各種の処理の順序は問わないことは言うまでもない。 Further, it goes without saying that in the flowchart of FIG. 16, the order of various processes such as the order of transmission of automatic information and manual information and the order of determination of whether or not to transmit information does not matter.

また、図１６のフローチャートにおいて、ステップＳ１６０５で自動応答情報を受け付けない場合、自動情報は出力されない。また、ステップＳ１６０５で手動応答情報を受け付けない場合、手動情報は出力されない。 Further, in the flowchart of FIG. 16, if the automatic response information is not received in step S1605, the automatic information is not output. If the manual response information is not accepted in step S1605, the manual information is not output.

さらに、図１６のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 Further, in the flowchart of FIG. 16, the process ends when the power is turned off or an interrupt for the end of the process occurs.

次に、対話エンジン６の動作について、図１７のフローチャートを用いて説明する。 Next, the operation of the dialogue engine 6 will be described with reference to the flowchart of FIG.

（ステップＳ１７０１）発話受付部３２は、対話制御装置２から発話情報を受け付けたか否かを判断する。発話情報を受け付けた場合はステップＳ１７０２に行き、発話情報を受け付けない場合はステップＳ１７０４に行く。 (Step S1701) The utterance reception unit 32 determines whether or not the utterance information has been received from the dialogue control device 2. If the utterance information is accepted, the process goes to step S1702, and if the utterance information is not accepted, the process goes to step S1704.

（ステップＳ１７０２）応答取得部３３は、ステップＳ１７０１で受け付けられた発話情報に対応する自動応答情報を、学習情報格納部３１の学習情報を用いて取得する。 (Step S1702) The response acquisition unit 33 acquires the automatic response information corresponding to the utterance information received in step S1701 by using the learning information of the learning information storage unit 31.

（ステップＳ１７０３）応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置２に出力する。 (Step S1703) The response output unit 34 outputs the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 2.

（ステップＳ１７０４）学習受付部６５は、学習対象を受け付けたか否かを判断する。学習対象を受け付けた場合はステップＳ１７０５に行き、学習対象を受け付けない場合はステップＳ１７０１に戻る。学習対象とは、手動応答情報または、他の対話エンジン６の出力である自動応答情報である。 (Step S1704) The learning reception unit 65 determines whether or not the learning target has been accepted. If the learning target is accepted, the process proceeds to step S1705, and if the learning target is not accepted, the process returns to step S1701. The learning target is manual response information or automatic response information that is the output of another dialogue engine 6.

（ステップＳ１７０５）学習部６６は、学習受付部６５が受け付けた学習対象を用いて、学習情報に、追記する情報を構成する。 (Step S1705) The learning unit 66 configures information to be added to the learning information by using the learning target received by the learning reception unit 65.

（ステップＳ１７０６）学習部６６は、ステップＳ１７０５で構成した情報を学習情報格納部３１に追記する。ステップＳ１７０１に戻る。 (Step S1706) The learning unit 66 adds the information configured in step S1705 to the learning information storage unit 31. Return to step S1701.

（ステップＳ１７０１）発話受付部３２は、対話制御装置５から発話情報を受け付けたか否かを判断する。発話情報を受け付けた場合はステップＳ１７０２に行き、発話情報を受け付けない場合はステップＳ１７０４に行く。 (Step S1701) The utterance reception unit 32 determines whether or not the utterance information has been received from the dialogue control device 5. If the utterance information is accepted, the process goes to step S1702, and if the utterance information is not accepted, the process goes to step S1704.

（ステップＳ１７０３）応答出力部３４は、応答取得部３３が取得した自動応答情報を対話制御装置５に出力する。 (Step S1703) The response output unit 34 outputs the automatic response information acquired by the response acquisition unit 33 to the dialogue control device 5.

（ステップＳ１７０４）学習受付部６５は、学習対象の自動応答情報または手動応答情報を受け付けたか否かを判断する。自動応答情報または手動応答情報を受け付けた場合はステップＳ１７０５に行き、自動応答情報を受け付けない場合はステップＳ１７０１に戻る。 (Step S1704) The learning reception unit 65 determines whether or not the automatic response information or the manual response information to be learned has been received. If the automatic response information or the manual response information is accepted, the process proceeds to step S1705, and if the automatic response information is not accepted, the process returns to step S1701.

（ステップＳ１７０５）学習部６６は、学習受付部６５が受け付けた自動応答情報または手動応答情報を用いて、学習情報に、追記する情報を構成する。 (Step S1705) The learning unit 66 configures information to be added to the learning information by using the automatic response information or the manual response information received by the learning reception unit 65.

なお、図１７のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 17, the process ends when the power is turned off or an interrupt for the end of the process occurs.

また、図１７のフローチャートにおいて、学習処理自体は上記した学習のアルゴリズム、公知のアルゴリズムが採用可能である。 Further, in the flowchart of FIG. 17, the above-mentioned learning algorithm and known algorithm can be adopted for the learning process itself.

次に、管理端末７の動作について、図１８のフローチャートを用いて説明する。 Next, the operation of the management terminal 7 will be described with reference to the flowchart of FIG.

（ステップＳ１８０１）管理情報受付部７１は、発話情報を対話制御装置５から受け付けたか否かを判断する。発話情報を受け付けた場合はステップＳ１８０２に行き、受け付けない場合はステップＳ１８０６に行く。 (Step S1801) The management information reception unit 71 determines whether or not the utterance information has been received from the dialogue control device 5. If the utterance information is accepted, the process goes to step S1802, and if the utterance information is not accepted, the process goes to step S1806.

（ステップＳ１８０２）管理者出力部７３は、ステップＳ１８０１で受け付けられた発話情報を、または管理処理部７２により出力用に構成された発話情報を出力する。 (Step S1802) The administrator output unit 73 outputs the utterance information received in step S1801 or the utterance information configured for output by the management processing unit 72.

（ステップＳ１８０３）管理処理部７２は、発話情報の出力に対応した手動応答情報を出力するか否かを判断する。手動応答情報を出力する場合はステップＳ１８０４に行き、手動応答情報を出力しない場合はステップＳ１８０１に戻る。なお、手動応答情報を出力する場合は、モード情報が手動モードの場合である。なお、例えば、管理格納部７０のモード情報が「手動モード」である場合、管理処理部７２は、手動応答情報を出力すると判断する。 (Step S1803) The management processing unit 72 determines whether or not to output the manual response information corresponding to the output of the utterance information. If the manual response information is output, the process proceeds to step S1804, and if the manual response information is not output, the process returns to step S1801. When the manual response information is output, the mode information is the manual mode. For example, when the mode information of the management storage unit 70 is "manual mode", the management processing unit 72 determines that the manual response information is output.

（ステップＳ１８０４）管理情報受付部７１は、手動応答情報を受け付けたか否かを判断する。手動応答情報を受け付けた場合はステップＳ１８０５に行き、手動応答情報を受け付けない場合はステップＳ１８０４に戻る。 (Step S1804) The management information reception unit 71 determines whether or not the manual response information has been received. If the manual response information is accepted, the process proceeds to step S1805, and if the manual response information is not accepted, the process returns to step S1804.

（ステップＳ１８０５）管理者出力部７３は、ステップＳ１８０４で受け付けられた手動応答情報を対話制御装置５に出力する。ステップＳ１８０１に戻る。 (Step S1805) The administrator output unit 73 outputs the manual response information received in step S1804 to the dialogue control device 5. Return to step S1801.

（ステップＳ１８０６）管理情報受付部７１は、自動情報を対話制御装置６から受け付けたか否かを判断する。自動情報を受け付けた場合はステップＳ１８０７に行き、受け付けない場合はステップＳ１８０８に行く。 (Step S1806) The management information receiving unit 71 determines whether or not the automatic information has been received from the dialogue control device 6. If the automatic information is accepted, the process goes to step S1807, and if not, the process goes to step S1808.

（ステップＳ１８０７）管理者出力部７３は、ステップＳ１８０６で受け付けられた自動情報を、または管理処理部７２により出力用に構成された自動情報を出力する。ステップＳ１８０１に戻る。 (Step S1807) The administrator output unit 73 outputs the automatic information received in step S1806 or the automatic information configured for output by the management processing unit 72. Return to step S1801.

（ステップＳ１８０８）管理指示受付部７４は、管理者からモードの変更指示を受け付けたか否かを判断する。変更指示を受け付けた場合はステップＳ１８０９に行き、変更指示を受け付けない場合はステップＳ１８０１に戻る。 (Step S1808) The management instruction receiving unit 74 determines whether or not the mode change instruction has been received from the administrator. If the change instruction is accepted, the process proceeds to step S1809, and if the change instruction is not accepted, the process returns to step S1801.

（ステップＳ１８０９）管理者出力部７５は、ステップＳ１８０８で受け付けられた変更指示を対話制御装置５に出力する。ステップＳ１８０１に戻る。 (Step S1809) The administrator output unit 75 outputs the change instruction received in step S1808 to the dialogue control device 5. Return to step S1801.

なお、図６のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 6, the process ends when the power is turned off or an interrupt for the end of the process occurs.

以下、本実施の形態における対話システムＣの具体的な動作について説明する。対話システムＣの概念図は図１３である。 Hereinafter, the specific operation of the dialogue system C in the present embodiment will be described. The conceptual diagram of the dialogue system C is FIG.

今、対話制御装置５の格納部５１は、対話が破綻していることを特定するための条件を示す情報である破綻判断条件管理表が格納されている、とする（図１９参照）。破綻判断条件管理表は、「ＩＤ」「破綻条件」を有するレコードを１以上管理している。「破綻条件」は、「情報」「条件」を有する。「ＩＤ」は、レコードを識別する情報である。「情報」は、対話の破綻を判断するために使用する情報である。「情報」は、ユーザ発話情報または自動応答情報である。なお、ユーザ発話情報の代わりに発話情報であっても良いし、自動応答情報の代わりに自動情報でも良い。また、「条件」は、対話の破綻を判断するための条件である。「ＩＤ＝１」の条件は、類似度が９０％以上のユーザ発話情報が連続して受け付けられることである。なお、ユーザ発話情報の類似度の算出の処理は、例えば、文の類似度の算出と同一の処理であり、公知技術であるので詳細な説明を省略する。「ＩＤ＝２」の条件は、類似度が９０％以上のユーザ発話情報が、連続して受け付けられる４つのユーザ発話情報の中で、３以上存在することである。「ＩＤ＝３」の条件は、連続して受け付けられる４つの自動応答情報の中で、破綻用語を含む自動応答情報が２以上存在することである。「ＩＤ＝４」の条件は、会話不成立を示す情報を含む自動応答情報が受け付けられたことである。なお、破綻用語は、ここでは、例えば、「理解できなかった」「理解不能」である、とする。また、会話不成立を示す情報は、例えば、自動応答情報に含まれ得るスコア「０」である、とする。さらに、例えば、モード格納部２１１のデフォルトのモード情報は「自動モード」である、とする。 Now, it is assumed that the storage unit 51 of the dialogue control device 5 stores a failure determination condition management table, which is information indicating a condition for identifying that the dialogue has failed (see FIG. 19). The bankruptcy judgment condition management table manages one or more records having "ID" and "bankruptcy condition". The "bankruptcy condition" has "information" and "condition". The "ID" is information that identifies a record. "Information" is information used to determine the failure of a dialogue. "Information" is user utterance information or automatic response information. The utterance information may be used instead of the user utterance information, or the automatic information may be used instead of the automatic response information. In addition, the "condition" is a condition for determining the failure of the dialogue. The condition of "ID = 1" is that user utterance information having a similarity of 90% or more is continuously accepted. The process of calculating the similarity of user utterance information is, for example, the same as the process of calculating the similarity of sentences, and is a known technique, so detailed description thereof will be omitted. The condition of "ID = 2" is that there are 3 or more user utterance information having a similarity of 90% or more among the four user utterance information that are continuously accepted. The condition of "ID = 3" is that there are two or more automatic response information including the broken term among the four automatic response information that are continuously accepted. The condition of "ID = 4" is that the automatic response information including the information indicating that the conversation has not been established has been accepted. In addition, the broken term is assumed here, for example, to be "incomprehensible" or "incomprehensible". Further, it is assumed that the information indicating that the conversation is not established is, for example, a score "0" that can be included in the automatic response information. Further, for example, it is assumed that the default mode information of the mode storage unit 211 is "automatic mode".

また、格納部５１は、例えば、図６に示す優先度情報管理表を格納している、とする。さらに、格納部５１は、例えば、図１２に示す選択条件管理表を格納している、とする。 Further, it is assumed that the storage unit 51 stores, for example, the priority information management table shown in FIG. Further, it is assumed that the storage unit 51 stores, for example, the selection condition management table shown in FIG.

また、一の対話エンジン６（例えば、「ｅ１」）を構成する学習情報格納部６１は、図７に示す学習情報管理表を格納している、とする。また、その学習情報格納部６１は、図８に示すＦｏｏｄｔａｂｌｅを格納している、とする。Ｆｏｏｄｔａｂｌｅは、「ＩＤ」「料理」「作り方」を有するレコードを１以上格納している。さらに、その学習情報格納部６１は、料理名を格納しているｆｏｏｄ辞書、作物名を格納しているｃｒｏｐｓ辞書を有する、とする。ｆｏｏｄ辞書には、「カツ丼、牛丼、親子丼、・・・」等の多数の料理名が格納されている。また、ｃｒｏｐｓ辞書には、「お米、ナス、にんじん、・・・」等の多数の作物名が格納されている。 Further, it is assumed that the learning information storage unit 61 constituting one dialogue engine 6 (for example, “e1”) stores the learning information management table shown in FIG. 7. Further, it is assumed that the learning information storage unit 61 stores the Foodtable shown in FIG. Foodtable stores one or more records having "ID", "cooking", and "how to make". Further, it is assumed that the learning information storage unit 61 has a food dictionary for storing the dish name and a crops dictionary for storing the crop name. The food dictionary stores a large number of dish names such as "Katsudon, Gyudon, Oyakodon, ...". In addition, a large number of crop names such as "rice, eggplant, carrot, ..." are stored in the crops dictionary.

かかる状況において、以下の３つの具体例について説明する。具体例１は、対話制御装置２が対話の破綻を自動検知して、モード情報が自動モードから手動モードに自動的に切り替わる場合である。また、具体例１において、自動で手動モードから自動モードに切り替える処理についても説明する。また、具体例２は、管理者が手動で、モード情報を自動モードから手動モードに切り替える場合である。また、具体例２において、管理者が手動で、モード情報を手動モードから自動モードに切り替える場合についても説明する。さらに、具体例３は、管理者が入力した手動応答情報を用いて、対話エンジン３が学習を行う場合である。なお、３つの具体例において、例えば、対話システムＣが具備する対話エンジン６は３つである。 In such a situation, the following three specific examples will be described. A specific example 1 is a case where the dialogue control device 2 automatically detects the failure of the dialogue and the mode information is automatically switched from the automatic mode to the manual mode. Further, in Specific Example 1, a process of automatically switching from the manual mode to the automatic mode will also be described. Further, a specific example 2 is a case where the administrator manually switches the mode information from the automatic mode to the manual mode. Further, in the specific example 2, a case where the administrator manually switches the mode information from the manual mode to the automatic mode will also be described. Further, the specific example 3 is a case where the dialogue engine 3 performs learning using the manual response information input by the administrator. In the three specific examples, for example, the dialogue engine 6 included in the dialogue system C is three.

（具体例１）
今、ユーザは、ユーザ端末１に対して、「おはよう」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「おはよう」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「おはよう」を取得する。次に、ユーザ送信部１３は、ユーザ発話情報「おはよう」を対話制御装置２に送信する。 (Specific example 1)
Now, it is assumed that the user has input the voice "Good morning" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 receives the utterance "good morning" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user utterance information "good morning". Next, the user transmission unit 13 transmits the user utterance information "good morning" to the dialogue control device 2.

次に、対話制御装置５の受信部２２は、ユーザ発話情報「おはよう」をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 5 receives the user utterance information "good morning" from the user terminal 1.

そして、処理部５３は、ユーザ発話情報「おはよう」から発話情報「１：おはよう」を取得する。つまり、処理部５３は、格納されている発話識別子「０」（発話識別子の初期値である、とする）をインクリメントし、ユニークな発話識別子「１」を生成する。そして、処理部２３は、ユーザ発話情報に発話識別子「１」を付加し、発話情報「１：おはよう」を構成する。 Then, the processing unit 53 acquires the utterance information "1: Good morning" from the user utterance information "Good morning". That is, the processing unit 53 increments the stored utterance identifier "0" (assuming that it is the initial value of the utterance identifier) to generate a unique utterance identifier "1". Then, the processing unit 23 adds the utterance identifier "1" to the user utterance information to form the utterance information "1: Good morning".

次に、決定部５３１は、モード情報「自動モード」をモード格納部５１１から取得し、発話情報の出力先を対話エンジン６、と決定する。 Next, the determination unit 531 acquires the mode information "automatic mode" from the mode storage unit 511, and determines that the output destination of the utterance information is the dialogue engine 6.

次に、選択部３４は、３つの対話エンジン６の通知先情報を図６の表から取得する。 Next, the selection unit 34 acquires the notification destination information of the three dialogue engines 6 from the table of FIG.

次に、発話情報出力部２４１は、３つの対話エンジン６に発話情報「１：おはよう」を送信する。 Next, the utterance information output unit 241 transmits the utterance information "1: Good morning" to the three dialogue engines 6.

次に、３つの各対話エンジン６の発話受付部３２は、対話制御装置５から発話情報「１：おはよう」を受信する。 Next, the utterance reception unit 32 of each of the three dialogue engines 6 receives the utterance information "1: Good morning" from the dialogue control device 5.

次に、例えば、対話エンジン「ｅ１」の応答取得部３３は、発話情報が有する「おはよう」と対になる自動応答情報「おはようございます」を図７の学習情報管理表から取得する。なお、ここで、応答取得部３３は、発話識別子「１」を付加した自動応答情報「１：おはようございます」を取得することは好適である。 Next, for example, the response acquisition unit 33 of the dialogue engine "e1" acquires the automatic response information "good morning" paired with the "good morning" of the utterance information from the learning information management table of FIG. Here, it is preferable that the response acquisition unit 33 acquires the automatic response information "1: Good morning" to which the utterance identifier "1" is added.

次に、応答出力部３４は、応答取得部３３が取得した自動応答情報「１：おはようございます」を対話制御装置５に送信する。なお、送信される自動応答情報は、「おはようございます」でも良い。 Next, the response output unit 34 transmits the automatic response information “1: Good morning” acquired by the response acquisition unit 33 to the dialogue control device 5. The automatic response information to be sent may be "Good morning".

次に、対話制御装置５の自動応答受付部２５１は、３つの対話エンジン６から自動応答情報を受信する。なお、対話エンジン「ｅ１」からは、自動応答受付部２５１は、自動応答情報「１：おはようございます」を受信する。 Next, the automatic response reception unit 251 of the dialogue control device 5 receives the automatic response information from the three dialogue engines 6. From the dialogue engine "e1", the automatic response reception unit 251 receives the automatic response information "1: Good morning".

次に、選択部５３４は、実施の形態１で説明した選択のアルゴリズムにより、優先度が高い対話エンジン「ｅ１」の自動応答情報を選択する。 Next, the selection unit 534 selects the automatic response information of the dialogue engine "e1" having a high priority by the selection algorithm described in the first embodiment.

次に、処理部５３は、「ｅ１」の自動応答情報から、自動情報「おはようございます」を取得する。次に、応答送信部２６１は、自動情報「おはようございます」をユーザ端末１に送信する。なお、処理部５３は、自動情報を管理端末７に出力しない、と判断する。 Next, the processing unit 53 acquires the automatic information "Good morning" from the automatic response information of "e1". Next, the response transmission unit 261 transmits the automatic information "Good morning" to the user terminal 1. The processing unit 53 determines that the automatic information is not output to the management terminal 7.

次に、ユーザ端末１のユーザ受信部１４は、自動情報「おはようございます」を受信する。そして、ユーザ出力部１５は、自動情報「おはようございます」を音声出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the automatic information "Good morning". Then, the user output unit 15 outputs the automatic information "Good morning" by voice.

次に、ユーザは、ユーザ端末１に対して、「来週の木曜日は営業していますか？」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「来週の木曜日は営業していますか？」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「来週の木曜日は営業していますか？」を取得する。次に、ユーザ送信部１３は、当該ユーザ発話情報を対話制御装置２に送信する。 Next, it is assumed that the user voice-inputs "Is it open next Thursday?" To the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Is it open next Thursday?" From the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user utterance information "Is it open next Thursday?". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 2.

次に、対話制御装置５の受信部２２は、ユーザ発話情報「来週の木曜日は営業していますか？」をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 5 receives the user utterance information "Is it open next Thursday?" From the user terminal 1.

そして、処理部５３は、当該ユーザ発話情報から発話情報「２：来週の木曜日は営業していますか？」を取得する。つまり、処理部２３は、格納されている発話識別子「１」をインクリメントし、ユニークな発話識別子「２」を生成する。そして、処理部２３は、ユーザ発話情報に発話識別子「２」を付加し、発話情報「２：来週の木曜日は営業していますか？」を構成する。 Then, the processing unit 53 acquires the utterance information "2: Is it open next Thursday?" From the user utterance information. That is, the processing unit 23 increments the stored utterance identifier "1" to generate a unique utterance identifier "2". Then, the processing unit 23 adds the utterance identifier "2" to the user utterance information, and constitutes the utterance information "2: Is it open next Thursday?".

次に、決定部５３１は、モード情報から発話情報の出力先を「対話エンジン６」と決定する。 Next, the determination unit 531 determines the output destination of the utterance information from the mode information as the "dialogue engine 6".

次に、発話情報出力部２４１は、３つの各対話エンジン３に発話情報「２：来週の木曜日は営業していますか？」を送信する。 Next, the utterance information output unit 241 transmits the utterance information "2: Is it open next Thursday?" To each of the three dialogue engines 3.

次に、３つの各対話エンジン３の発話受付部３２は、対話制御装置２から発話情報「２：来週の木曜日は営業していますか？」を受信する。 Next, the utterance reception unit 32 of each of the three dialogue engines 3 receives the utterance information "2: Is it open next Thursday?" From the dialogue control device 2.

次に、３つの各対話エンジン３の応答取得部３３は、発話情報が有する「来週の木曜日は営業していますか？」に対応する自動応答情報を取得する。 Next, the response acquisition unit 33 of each of the three dialogue engines 3 acquires the automatic response information corresponding to the utterance information "Is it open next Thursday?".

なお、対話エンジン「ｅ１」の応答取得部３３は、「来週の木曜日は営業していますか？」が図８の学習情報管理表の中の発話情報のパターン「｛ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ｝＊営業していますか？＊」に合致することを検出する。なお、かかる発話情報のパターンは、図８の学習情報管理表の「ＩＤ＝３」の情報である。そして、応答取得部３３は、「ＩＤ＝３」の自動応答情報を取得する。そして、応答取得部３３は、当該自動応答情報の｛ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ｝に対して「来週の木曜日」を代入し、「＄営業日」の値「日曜日、火曜日、水曜日、木曜日、金曜日、土曜日」から、来週の木曜日が営業日に含まれる（つまり、自動応答情報の実行結果「真」）、と判断する。なお、応答取得部３３は、「来週の木曜日」から曜日を示す「木曜日」を抽出し、「＄営業日」の値に含まれる、と判断する。次に、対話エンジン「ｅ１」の応答取得部３３は、自動応答情報のパターン"はい。｛ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ｝は営業しております"を取得する。次に、応答取得部３３は、自動応答情報のパターンの｛ｔｉｍｅＥｘｐｒｅｓｓｉｏｎ｝に対して「来週の木曜日」を代入し、自動応答情報「来週の木曜日は営業しております」を取得する。そして、応答送信部２６１は、自動応答情報「２：来週の木曜日は営業しております」を対話制御装置５に送信する。 In addition, in the response acquisition unit 33 of the dialogue engine "e1", "Is it open next Thursday?" Is open as the pattern of utterance information in the learning information management table in Fig. 8 "{timeExpress} *. Detects that it matches "? *". The pattern of the utterance information is the information of "ID = 3" in the learning information management table of FIG. Then, the response acquisition unit 33 acquires the automatic response information of "ID = 3". Then, the response acquisition unit 33 substitutes "next Thursday" for the {timeExpression} of the automatic response information, and starts from the value of "$ business day" "Sunday, Tuesday, Wednesday, Thursday, Friday, Saturday". It is determined that next Thursday is included in the business day (that is, the execution result of the automatic response information is "true"). The response acquisition unit 33 extracts "Thursday" indicating the day of the week from "Next Thursday" and determines that it is included in the value of "$ business day". Next, the response acquisition unit 33 of the dialogue engine "e1" acquires the pattern of the automatic response information "Yes. {TimeExpression} is open". Next, the response acquisition unit 33 substitutes "next Thursday" for the {timeExpression} of the automatic response information pattern, and acquires the automatic response information "next Thursday is open". Then, the response transmission unit 261 transmits the automatic response information "2: Open next Thursday" to the dialogue control device 5.

次に、対話制御装置５の自動応答受付部５５１は、自動応答情報を３つの各エンジン３から受信する。 Next, the automatic response reception unit 551 of the dialogue control device 5 receives the automatic response information from each of the three engines 3.

次に、選択部５３４は、上述した処理により、３つの自動応答情報の中から、対話エンジン「ｅ１」の自動応答情報「２：来週の木曜日は営業しております」を選択する。 Next, the selection unit 534 selects the automatic response information "2: next Thursday is open" from the three automatic response information by the above-mentioned processing.

次に、処理部５３は、自動応答情報から、自動情報「来週の木曜日は営業しております」を取得する。次に、決定部５３１は、モード情報「自動モード」から対話エンジン６を採用すると決定する。そして、応答送信部５６１は、自動情報「来週の木曜日は営業しております」をユーザ端末１に送信する。 Next, the processing unit 53 acquires the automatic information "Open next Thursday" from the automatic response information. Next, the determination unit 531 determines to adopt the dialogue engine 6 from the mode information "automatic mode". Then, the response transmission unit 561 transmits the automatic information "Open next Thursday" to the user terminal 1.

次に、ユーザ端末１のユーザ受信部１４は、自動情報「来週の木曜日は営業しております」を受信する。そして、ユーザ出力部１５は、自動情報「来週の木曜日は営業しております」を音声出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the automatic information "Open next Thursday". Then, the user output unit 15 outputs the automatic information "Open next Thursday" by voice.

以上のように、ユーザと３つのうちのいずれかの対話エンジン６との対話の動作が繰り返し行われた、とする。そして、３つの対話エンジン６から送信されたすべての自動応答情報の中に「理解不能」を含む情報が、２回連続して、対話制御装置２に送信された、とする。なお、３つの各対話エンジン６は、受け付けた発話情報に対応する自動応答情報を取得することができなかった場合、「理解不能」を含む自動応答情報を構成し、送信する、とする。また、２回目の「理解不能」を含む自動応答情報に対応するユーザ発話情報は「来週の木曜日のお席の予約は可能ですか？」であった、とする。 As described above, it is assumed that the operation of the dialogue between the user and one of the three dialogue engines 6 is repeatedly performed. Then, it is assumed that the information including "incomprehensible" in all the automatic response information transmitted from the three dialogue engines 6 is transmitted to the dialogue control device 2 twice in succession. It is assumed that each of the three dialogue engines 6 configures and transmits automatic response information including "incomprehensible" when the automatic response information corresponding to the received utterance information cannot be acquired. Further, it is assumed that the user utterance information corresponding to the second automatic response information including "incomprehensible" was "Is it possible to reserve a seat next Thursday?".

そして、対話制御装置５の自動応答受付部５５１が、最終的に選択された自動応答情報が、２度目の「理解不能」を含む自動応答情報であった、とする。なお、処理部５３は、選択された自動応答情報やユーザ発話情報や手動応答情報等のログを格納部５１に蓄積することは好適である。または、処理部５３は、１度目の「理解不能」を含む自動応答情報が選択された場合に、当該自動応答情報を格納部５１に蓄積しても良い。 Then, it is assumed that the automatic response reception unit 551 of the dialogue control device 5 has automatically selected the automatic response information including the second "incomprehensible". It is preferable that the processing unit 53 stores logs such as selected automatic response information, user utterance information, and manual response information in the storage unit 51. Alternatively, the processing unit 53 may store the automatic response information in the storage unit 51 when the automatic response information including the first "incomprehensible" is selected.

次に、判断部５３２は、２回の連続した「理解不能」を含む自動応答情報が、図１９の「ＩＤ＝３」の破綻条件に合致する、と判断５。 Next, the determination unit 532 determines that the automatic response information including two consecutive "incomprehensible" matches the failure condition of "ID = 3" in FIG.

次に、モード変更部５３３は、モード格納部２１１のモード情報「自動モード」から「手動モード」に変更する。 Next, the mode changing unit 533 changes the mode information "automatic mode" of the mode storage unit 211 to "manual mode".

次に、処理部５３は、手動モードへの変更である、と判断する。そして、出力部５４は、最新の発話情報「１２：来週の木曜日のお席の予約は可能ですか？」を管理端末７に出力する。なお、「１２」は発話識別子である。 Next, the processing unit 53 determines that the change is to the manual mode. Then, the output unit 54 outputs the latest utterance information "12: Is it possible to reserve a seat next Thursday?" To the management terminal 7. In addition, "12" is an utterance identifier.

次に、管理端末７の管理情報受付部７１は、発話情報「１２：来週の木曜日のお席の予約は可能ですか？」を対話制御装置２から受信する。 Next, the management information reception unit 71 of the management terminal 7 receives the utterance information "12: Is it possible to reserve a seat next Thursday?" From the dialogue control device 2.

次に、管理者出力部７３は、発話情報のうちの発話内容「来週の木曜日のお席の予約は可能ですか？」を出力する。かかる出力例は、図２０である。図２０において、２００１に発話情報のうちの発話内容が表示されている。 Next, the administrator output unit 73 outputs the utterance content "Is it possible to reserve a seat next Thursday?" In the utterance information. An example of such an output is shown in FIG. In FIG. 20, the utterance content of the utterance information is displayed in 2001.

次に、管理者は、図２０の２００２の「Ｙｅｓ」ボタンを押下した、とする。すると、管理情報受付部７１は、「Ｙｅｓ」ボタンの押下を検知し、「Ｙｅｓ」ボタンに対応づけて格納されている手動応答情報「はい。」を、図２１の手動応答情報管理表から取得する。なお、管理格納部７０には、図２０の画面上のボタンに対応する手動応答情報が管理されている。かかる手動応答情報管理表は、図２１である。図２１は、「ＩＤ」「ボタン」「手動応答情報」を有するレコードが管理されている。「ボタン」の各属性値は、図１０のボタンに対応する。次に、管理者出力部７３は、手動応答情報「はい。」を対話制御装置５に送信する。 Next, it is assumed that the administrator presses the "Yes" button of 2002 in FIG. 20. Then, the management information reception unit 71 detects that the "Yes" button is pressed, and acquires the manual response information "Yes." Stored in association with the "Yes" button from the manual response information management table of FIG. To do. The management storage unit 70 manages manual response information corresponding to the buttons on the screen of FIG. 20. The manual response information management table is shown in FIG. In FIG. 21, records having "ID", "button", and "manual response information" are managed. Each attribute value of the "button" corresponds to the button of FIG. Next, the administrator output unit 73 transmits the manual response information “Yes.” To the dialogue control device 5.

なお、ここで、管理者は、図２０の画面内のボタンを押下せずに、または図２０の２００３の「音声入力」ボタンを押下した後、「はい。来週の木曜日は、４名様までご予約可能です。」などと、音声入力しても良い。かかる場合、対話制御装置２の管理情報受付部７１は、音声データ「はい。来週の木曜日は、４名様までご予約可能です。」を受け付け、当該音声データを音声認識し、テキストの手動応答情報「はい。来週の木曜日は、４名様までご予約可能です。」を得る。そして、管理者出力部７３は、当該手動応答情報を対話制御装置５に送信する。なお、管理情報受付部７１は、デフォルトで、音声を受け付けても良い。 Here, the administrator does not press the button in the screen of FIG. 20, or after pressing the "voice input" button of 2003 of FIG. 20, "Yes. Next Thursday, up to 4 people. You can make a reservation by voice input. In such a case, the management information reception unit 71 of the dialogue control device 2 receives the voice data "Yes. Up to 4 people can be reserved next Thursday.", Recognizes the voice data, and manually responds to the text. Get the information "Yes. Next Thursday, up to 4 people can be booked." Then, the administrator output unit 73 transmits the manual response information to the dialogue control device 5. The management information reception unit 71 may receive voice by default.

次に、対話制御装置５の手動情報受付部５５２は、手動応答情報「はい。来週の木曜日は、４名様までご予約可能です。」を受け付ける。そして、処理部５３は、当該手動応答情報を手動情報として取得する。 Next, the manual information reception unit 552 of the dialogue control device 5 receives the manual response information "Yes. Up to 4 people can be reserved next Thursday." Then, the processing unit 53 acquires the manual response information as manual information.

そして、モード情報が「手動モード」なので、応答送信部５６１は手動情報「はい。来週の木曜日は、４名様までご予約可能です。」をユーザ端末１に送信する。 Then, since the mode information is "manual mode", the response transmission unit 561 transmits the manual information "Yes. Up to 4 people can be reserved next Thursday." To the user terminal 1.

次に、ユーザ端末１のユーザ受信部１４は、当該手動情報を受信する。次に、ユーザ出力部１５は、手動情報「はい。来週の木曜日は、４名様までご予約可能です。」を音声出力する。 Next, the user receiving unit 14 of the user terminal 1 receives the manual information. Next, the user output unit 15 outputs the manual information "Yes. Next Thursday, up to 4 people can be reserved" by voice.

以上の処理により、対話の破綻を自動検知し、自動モードから手動モードの自動切り替えを行うことにより、対話の継続が可能となる。 By the above processing, the failure of the dialogue is automatically detected, and the dialogue can be continued by automatically switching from the automatic mode to the manual mode.

また、手動モードの状態で、閾値以上の回数の対話（手動応答情報の受信）が行われた場合、判断部５３２は、「手動モード」から「自動モード」にモード情報を変更する、と判断するとする。そして、モード変更部５３３は、モード格納部５１１のモード情報を「手動モード」から「自動モード」に変更する。 Further, in the state of the manual mode, when the dialogue (reception of the manual response information) is performed more than the threshold value, the determination unit 532 determines that the mode information is changed from the "manual mode" to the "automatic mode". Then. Then, the mode change unit 533 changes the mode information of the mode storage unit 511 from the "manual mode" to the "automatic mode".

以後、上記で説明したような「自動モード」での対話処理が継続される。なお、ユーザ端末１において、図９に示すようにキャラクタがお話しているように、自動情報または手動情報を音声出力することは好適である。 After that, the dialogue processing in the "automatic mode" as described above is continued. It is preferable that the user terminal 1 outputs automatic information or manual information by voice as the character speaks as shown in FIG.

（具体例２）
今、ユーザは、ユーザ端末１に対して、「牛丼の作り方を教えて」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、ユーザからの発話「牛丼の作り方を教えて」を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「牛丼の作り方を教えて」を取得する。次に、ユーザ送信部１３は当該ユーザ発話情報を対話制御装置５に送信する。 (Specific example 2)
Now, it is assumed that the user voice-inputs "Tell me how to make beef bowl" to the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the utterance "Tell me how to make beef bowl" from the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user's utterance information "Tell me how to make beef bowl". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 5.

次に、対話制御装置５の受信部２２は、ユーザ発話情報「牛丼の作り方を教えて」をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 5 receives the user utterance information "Tell me how to make beef bowl" from the user terminal 1.

そして、処理部５３は、当該ユーザ発話情報から発話情報「１：牛丼の作り方を教えて」を取得する。発話情報の「１」は発話識別子である。 Then, the processing unit 53 acquires the utterance information "1: Tell me how to make beef bowl" from the user's utterance information. The "1" in the utterance information is the utterance identifier.

次に、決定部５３１は、発話情報の出力先を対話エンジン６および管理端末７と決定する。なお、本具体例において、発話情報の出力先は、常に、対話エンジン６および管理端末７である、とする。 Next, the determination unit 531 determines the output destination of the utterance information to the dialogue engine 6 and the management terminal 7. In this specific example, it is assumed that the output destination of the utterance information is always the dialogue engine 6 and the management terminal 7.

次に、選択部５３４は、３つの対話エンジン６から一の対話エンジン６を選択する。ここでは、選択部５３４は、「牛丼の作り方を教えて」に対して自然言語処理を行い、ジャンル「料理」を取得する。そして、選択部５３４は、図１２を参照し、ジャンル「料理」と対になる対話エンジン「ｅ２」を選択する。なお、選択部５３４は、例えば、「牛丼の作り方を教えて」を形態素解析し、自立語「牛丼」「作り方」「教えて」を取得し、当該自立語から機械学習（例えば、深層学習、決定木、ＳＶＭ）により、ジャンル「料理」を取得する。 Next, the selection unit 534 selects one dialogue engine 6 from the three dialogue engines 6. Here, the selection unit 534 performs natural language processing on "Tell me how to make beef bowl" and acquires the genre "cooking". Then, the selection unit 534 selects the dialogue engine "e2" that is paired with the genre "cooking" with reference to FIG. In addition, the selection unit 534 performs morphological analysis of, for example, "teach me how to make beef bowl", acquires the independent words "gyudon", "how to make", and "teach", and machine learning (for example, deep layer) from the independent words. Acquire the genre "cooking" by learning, decision tree, SVM).

次に、発話情報出力部５４１は、発話情報「１：牛丼の作り方を教えて」を、対話エンジン「ｅ２」および管理端末７に送信する。 Next, the utterance information output unit 541 transmits the utterance information "1: Tell me how to make beef bowl" to the dialogue engine "e2" and the management terminal 7.

そして、管理端末７は、発話情報「１：牛丼の作り方を教えて」を受信し、発話情報に含まれる「牛丼の作り方を教えて」を出力する。 Then, the management terminal 7 receives the utterance information "1: Tell me how to make beef bowl" and outputs "Tell me how to make beef bowl" included in the utterance information.

次に、対話エンジン「ｅ２」の発話受付部３２は、対話制御装置５から発話情報「１：牛丼の作り方を教えて」を受信する。 Next, the utterance reception unit 32 of the dialogue engine "e2" receives the utterance information "1: Tell me how to make beef bowl" from the dialogue control device 5.

次に、対話エンジン「ｅ２」の応答取得部３３は、発話情報が有する「牛丼の作り方を教えて」を形態素解析する。そして、応答取得部３３は、取得した形態素「牛丼」をキーとして、辞書を検索し、「牛丼」は｛ｆｏｏｄ｝に含まれることを検出する。次に、応答取得部３３は、「牛丼の作り方を教えて」が図７の学習情報管理表の「ＩＤ＝４」の発話情報に合致することを検出する。そして、応答取得部３３は、「ＩＤ＝４」の自動応答情報を取得し、当該自動応答情報を実行し、図８のＦｏｏｄｔａｂｌｅから料理「牛丼」に対応する作り方「４人前の材料をご紹介します。・・・」を取得する。そして、応答出力部３４は、応答取得部３３が取得した自動応答情報「１：４人前の材料をご紹介します。・・・」を対話制御装置５に送信する。 Next, the response acquisition unit 33 of the dialogue engine "e2" morphologically analyzes "tell me how to make beef bowl" that the utterance information has. Then, the response acquisition unit 33 searches the dictionary using the acquired morpheme "gyudon" as a key, and detects that "gyudon" is included in {food}. Next, the response acquisition unit 33 detects that "Tell me how to make beef bowl" matches the utterance information of "ID = 4" in the learning information management table of FIG. 7. Then, the response acquisition unit 33 acquires the automatic response information of "ID = 4", executes the automatic response information, and prepares "Gyudon" corresponding to the dish "Gyudon" from the Foodtable of FIG. Introducing ... "is acquired. Then, the response output unit 34 transmits the automatic response information "1: 4 servings of material ...." acquired by the response acquisition unit 33 to the dialogue control device 5.

次に、対話制御装置５の受付部５５は、当該自動応答情報を受け付ける。そして、処理部５３は、当該答情報から、自動情報「４人前の材料をご紹介します。・・・」を取得する。なお、ここでは、自動応答情報と自動情報が同じでも良い。 Next, the reception unit 55 of the dialogue control device 5 receives the automatic response information. Then, the processing unit 53 acquires the automatic information "Introducing the material for 4 servings ..." from the answer information. Here, the automatic response information and the automatic information may be the same.

次に、処理部５３は、自動情報を管理端末７に出力する、と判断する。そして、自動情報出力部５４２は、自動情報「４人前の材料をご紹介します。・・・」を管理端末７に出力する。 Next, the processing unit 53 determines that the automatic information is output to the management terminal 7. Then, the automatic information output unit 542 outputs the automatic information "Introducing the material for 4 servings ..." to the management terminal 7.

次に、決定部５３１は、モード情報「自動モード」から対話エンジン６を採用する、と判断する。そして、応答送信部５６１は、対話エンジン「ｅ２」からの自動情報「４人前の材料をご紹介します。・・・」をユーザ端末１に送信する。 Next, the determination unit 531 determines that the dialogue engine 6 is adopted from the mode information "automatic mode". Then, the response transmission unit 561 transmits the automatic information "Introducing the material for four servings ..." from the dialogue engine "e2" to the user terminal 1.

次に、ユーザは、ユーザ端末１に対して、「粉末うどんスープはどちらのスープがお勧めですか？」と音声入力した、とする。すると、ユーザ端末１のユーザ受付部１１は、当該ユーザの音声入力を受け付ける。次に、ユーザ処理部１２は、ユーザの音声入力に対して音声認識し、ユーザ発話情報「粉末うどんスープはどちらのスープがお勧めですか？」を取得する。次に、ユーザ送信部１３は、当該ユーザ発話情報を対話制御装置５に送信する。 Next, it is assumed that the user voice-inputs "Which soup is recommended for powdered udon soup?" To the user terminal 1. Then, the user reception unit 11 of the user terminal 1 accepts the voice input of the user. Next, the user processing unit 12 voice-recognizes the user's voice input and acquires the user utterance information "Which soup is recommended for powdered udon soup?". Next, the user transmission unit 13 transmits the user utterance information to the dialogue control device 5.

次に、対話制御装置５の受信部２２は、当該ユーザ発話情報をユーザ端末１から受信する。 Next, the receiving unit 22 of the dialogue control device 5 receives the user utterance information from the user terminal 1.

そして、処理部５３は、当該ユーザ発話情報から発話情報「２：粉末うどんスープはどちらのスープがお勧めですか？」を取得する。 Then, the processing unit 53 acquires the utterance information "2: Which soup is recommended for the powdered udon soup?" From the user's utterance information.

次に、決定部５３１は、発話情報の出力先を対話エンジン６および管理端末７と決定する。そして、発話情報出力部５４１は、当該発話情報を、選択されている対話エンジン「ｅ２」および管理端末７に送信する。 Next, the determination unit 531 determines the output destination of the utterance information to the dialogue engine 6 and the management terminal 7. Then, the utterance information output unit 541 transmits the utterance information to the selected dialogue engine "e2" and the management terminal 7.

次に、対話エンジン「ｅ２」は、当該発話情報を受信し、当該発話情報に対応する自動応答情報を取得し、対話制御装置５に送信する。なお、ここでの自動応答情報は、例えば、「理解できない」旨の情報を含む、とする。 Next, the dialogue engine "e2" receives the utterance information, acquires the automatic response information corresponding to the utterance information, and transmits it to the dialogue control device 5. It should be noted that the automatic response information here includes, for example, information to the effect that "it cannot be understood".

また、管理端末７は、当該発話情報を受信し、発話情報に含まれる「粉末うどんスープはどちらのスープがお勧めですか？」を出力する。 Further, the management terminal 7 receives the utterance information and outputs "Which soup is recommended for the powdered udon soup?" Included in the utterance information.

そして、この出力を見た管理者は、どの対話エンジン６にも答えられないと判断し、管理端末７にモード情報の変更指示を入力した、とする。そして、管理者は、「粉末うどんスープはＡ社のスープがお勧めです。」と音声入力した、とする。 Then, the administrator who sees this output determines that he / she cannot answer any of the dialogue engines 6, and inputs the mode information change instruction to the management terminal 7. Then, the administrator supposes that "A company's soup is recommended for powdered udon soup."

次に、管理指示受付部７４は、管理者からモードの変更指示（自動モードから手動モードへの変更指示）を受け付ける。そして、管理出力部７５は、かかる変更指示を対話制御装置５に出力する。 Next, the management instruction receiving unit 74 receives a mode change instruction (change instruction from the automatic mode to the manual mode) from the administrator. Then, the management output unit 75 outputs such a change instruction to the dialogue control device 5.

また、管理情報受付部７１は、音声データ「粉末うどんスープはＡ社のスープがお勧めです。」を受け付け、当該音声データを音声認識し、テキストに変更する。次に、管理者出力部７３は、テキストの手動応答情報「２：粉末うどんスープはＡ社のスープがお勧めです。」を対話制御装置５に送信する。 In addition, the management information reception unit 71 receives the voice data "Powdered udon soup is recommended from company A.", Recognizes the voice data, and changes it into a text. Next, the administrator output unit 73 transmits the text manual response information "2: The soup of company A is recommended for the powdered udon soup" to the dialogue control device 5.

次に、対話制御装置５の変更指示受付部５５３は、管理端末７から変更指示を受信する。モード変更部５３３は、モード格納部５１のモード情報を、手動モードに変更する。 Next, the change instruction receiving unit 553 of the dialogue control device 5 receives the change instruction from the management terminal 7. The mode change unit 533 changes the mode information of the mode storage unit 51 to the manual mode.

また、手動応答受付部５５２は、管理端末７から手動応答情報を受信する。処理部５３は、受信された自動応答情報および手動応答情報から、自動情報および手動情報を取得する。 Further, the manual response reception unit 552 receives the manual response information from the management terminal 7. The processing unit 53 acquires the automatic information and the manual information from the received automatic response information and the manual response information.

次に、決定部５３１は、モード情報「手動モード」から管理端末７を採用する、と決定する。 Next, the determination unit 531 determines that the management terminal 7 is adopted from the mode information "manual mode".

そして、応答送信部５６１は、取得された手動情報「粉末うどんスープはＡ社のスープがお勧めです。」をユーザ端末１に送信する。 Then, the response transmission unit 561 transmits the acquired manual information "Powdered udon soup is recommended from company A's soup" to the user terminal 1.

次に、ユーザ端末１は、手動情報「粉末うどんスープはＡ社のスープがお勧めです。」を受信し、出力する。 Next, the user terminal 1 receives and outputs the manual information "Powdered udon soup is recommended from company A."

以上の処理により、管理者が適切に対話に介在し、複数の対話エンジンも用いて、効果的で継続的な対話を行える。なお、本具体例において、管理者が「自動モード」にモード情報を変更する指示を入力した後、「自動モード」で対話を行うことができる。 Through the above processing, the administrator can appropriately intervene in the dialogue, and an effective and continuous dialogue can be performed by using a plurality of dialogue engines. In this specific example, after the administrator inputs an instruction to change the mode information in the "automatic mode", the dialogue can be performed in the "automatic mode".

（具体例３）
具体例２において、対話制御装置５の手動応答受付部５５２が管理端末７から手動応答情報「２：粉末うどんスープはＡ社のスープがお勧めです。」を受信した後、処理部５３は、手動応答情報を３つの各対話エンジン６に送信する、と判断する。 (Specific example 3)
In Specific Example 2, after the manual response reception unit 552 of the dialogue control device 5 receives the manual response information "2: Powdered udon soup is recommended from company A" from the management terminal 7, the processing unit 53 receives the manual response information. It is determined that the manual response information is transmitted to each of the three dialogue engines 6.

そして、手動情報出力部５４３は、かかる手動応答情報を３つの各対話エンジン６に送信する。なお、ここで、手動情報出力部５４３は、手動情報を３つの各対話エンジン６に送信しても良い。また、手動情報出力部５４３は、通常、発話情報またはユーザ発話情報を、手動応答情報または手動情報と対にして、３つの各対話エンジン６に送信する。なお、手動情報出力部５４３は既に発話情報が送信されている対話エンジン６には、発話情報またはユーザ発話情報を送信しないことは好適である。 Then, the manual information output unit 543 transmits such manual response information to each of the three dialogue engines 6. Here, the manual information output unit 543 may transmit the manual information to each of the three dialogue engines 6. Further, the manual information output unit 543 normally transmits the utterance information or the user utterance information to each of the three dialogue engines 6 in pairs with the manual response information or the manual information. It is preferable that the manual information output unit 543 does not transmit the utterance information or the user utterance information to the dialogue engine 6 to which the utterance information has already been transmitted.

次に、対話エンジン６の学習受付部６５は、手動応答情報「２：粉末うどんスープはＡ社のスープがお勧めです。」を受信する。 Next, the learning reception unit 65 of the dialogue engine 6 receives the manual response information "2: The soup of company A is recommended for the powdered udon soup."

次に、学習部６６は、学習受付部６５が受け付けた手動応答情報を用いて、学習情報に、追記する情報を構成する。つまり、学習部６６は、手動応答情報「２：粉末うどんスープはＡ社のスープがお勧めです。」が有する発話識別子「２」に対応する発話情報を取得する。そして、発話識別子「２」に対応する発話情報の内容「粉末うどんスープはどちらのスープがお勧めですか？」と手動応答情報の内容「粉末うどんスープはＡ社のスープがお勧めです。」との組を、図７の学習情報管理表に追記する。 Next, the learning unit 66 configures information to be added to the learning information by using the manual response information received by the learning reception unit 65. That is, the learning unit 66 acquires the utterance information corresponding to the utterance identifier "2" of the manual response information "2: The soup of company A is recommended for the powdered udon soup." Then, the content of the utterance information corresponding to the utterance identifier "2" "Which soup is recommended for the powdered udon soup?" And the content of the manual response information "The soup of company A is recommended for the powdered udon soup." The set with is added to the learning information management table of FIG.

なお、学習部６６は、発話情報の内容「粉末うどんスープはどちらのスープがお勧めですか？」のうち、名詞または名詞句「粉末うどんスープ」「スープ」を変数に置き換え、「｛変数１｝はどちらの{変数２}がお勧めですか？」というパターンに変更し、「粉末うどんスープはＡ社のスープがお勧めです。」のうち、名詞または名詞句「粉末うどんスープ」「Ａ社」「スープ」を変数に置き換え、「｛変数３｝は{変数４}の｛変数５｝がお勧めです。」というパターンに変更し、２つのパターンの組を、図７の学習情報管理表に追記するなどしても良い。 In addition, the learning unit 66 replaces the nomenclature or the nomenclature "powdered udon soup" and "soup" with variables in the content of the utterance information "Which soup is recommended for powdered udon soup?" } Is changed to the pattern of "Which {variable 2} is recommended?", And among "Powdered udon soup is recommended from company A.", Nomenclature or utterances "Powdered udon soup" "A" Replace "company" and "soup" with variables, change to the pattern "{variable 3} is recommended for {variable 4} {variable 5}", and change the set of the two patterns to the learning information management shown in Fig. 7. You may add it to the table.

つまり、本具体例において、管理者からの入力である手動応答情報を用いて、学習情報をよくすれば良く、学習情報の構造は問わない。 That is, in this specific example, the learning information may be improved by using the manual response information input from the administrator, and the structure of the learning information does not matter.

また、本具体例において、各対話エンジン６は、他の対話エンジン６が出力した自動応答情報を用いて、学習処理を行うことが好適である。 Further, in this specific example, it is preferable that each dialogue engine 6 performs learning processing using the automatic response information output by the other dialogue engine 6.

以上、本実施の形態によれば、対話エンジン６からの応答と管理者からの応答とを使い分けて、ユーザ端末に出力することにより、適切な対話を継続できる。 As described above, according to the present embodiment, an appropriate dialogue can be continued by properly using the response from the dialogue engine 6 and the response from the administrator and outputting them to the user terminal.

また、本実施の形態によれば、管理者からの応答を学習できる対話エンジン６を提供できる。 Further, according to the present embodiment, it is possible to provide a dialogue engine 6 capable of learning a response from an administrator.

また、本実施の形態によれば、例えば、対話が破綻した等の場合に、対話を支援できる管理端末７を提供できる。 Further, according to the present embodiment, it is possible to provide a management terminal 7 capable of supporting the dialogue, for example, when the dialogue breaks down.

なお、本実施の形態における対話制御装置５を実現するソフトウェアは、以下のようなプログラムである。つまり、このプログラムは、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話制御装置におけるプログラムであって、コンピュータを、ユーザの発話に対応するユーザ発話情報をユーザ端末から受信する受信部と、前記ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、前記発話情報の出力に応じて、前記１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、前記自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報を前記ユーザ端末に送信する応答送信部として機能させるためのプログラムである。 The software that realizes the dialogue control device 5 in this embodiment is the following program. That is, this program is a program in a dialogue control device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines, and uses a computer to display user utterance information corresponding to the user's utterance. The receiving unit received from the user terminal, the utterance information output unit capable of outputting the utterance information which is the information corresponding to the user utterance information to one or more dialogue engines, and the utterance information output unit according to the output of the utterance information. The user receives an automatic response reception unit that receives automatic response information from one or more dialogue engines, and one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit. It is a program for functioning as a response transmitter to be transmitted to a terminal.

また、上記プログラムにおいて、前記発話情報出力部は、前記発話情報を管理端末に対しても出力可能であり、コンピュータを、前記発話情報の前記管理端末への出力に応じて、前記管理端末からの応答である手動応答情報を受け付ける手動応答受付部としてさらに機能させ、前記応答送信部は、前記一の自動応答情報に対応する情報である一の自動情報または前記手動応答情報に対応する情報である手動情報を前記ユーザ端末に送信するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, the utterance information output unit can also output the utterance information to the management terminal, and the computer can output the utterance information from the management terminal in response to the output of the utterance information to the management terminal. Further functioning as a manual response receiving unit that receives manual response information that is a response, the response transmitting unit is information corresponding to the one automatic response information, one automatic information, or information corresponding to the manual response information. It is preferable that the program operates the computer to transmit the manual information to the user terminal.

また、上記プログラムにおいて、コンピュータを、対話エンジンを採用するか、管理端末を採用するかを決定する決定部としてさらに機能させ、前記応答送信部は、前記決定が対話エンジンを採用するとの決定の場合は、前記一の自動情報を前記ユーザ端末に送信し、前記決定が管理端末を採用するとの決定の場合は、前記手動情報を前記ユーザ端末に送信するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, when the computer further functions as a decision unit for deciding whether to adopt the dialogue engine or the management terminal, and the response transmission unit determines that the decision adopts the dialogue engine. Is a program that causes the computer to function by transmitting the above-mentioned automatic information to the user terminal and, in the case of the decision to adopt the management terminal, transmitting the manual information to the user terminal. Is suitable.

また、上記プログラムにおいて、コンピュータを、対話エンジンを採用する自動モード、または管理端末を採用する手動モードのいずれかを、少なくとも採り得るモード情報が格納されるモード格納部と、モード情報を変更するための予め決められた条件を満たすか否かを判断する判断部と、前記判断部が予め決められた条件を満たすと判断した場合に、前記モード情報を変更するモード変更部としてさらに機能させ、前記決定部は、前記モード格納部のモード情報を取得するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, in order to change the mode information and the mode storage unit in which at least the mode information that can be taken is changed between the automatic mode that employs the dialogue engine and the manual mode that employs the management terminal. Further functions as a determination unit for determining whether or not the predetermined condition is satisfied, and a mode change unit for changing the mode information when the determination unit determines that the predetermined condition is satisfied. It is preferable that the determination unit is a program that causes the computer to function as a device for acquiring the mode information of the mode storage unit.

また、上記プログラムにおいて、コンピュータを、対話エンジンを採用する自動モード、または管理端末を採用する手動モードのいずれかを、少なくとも採り得るモード情報が格納されるモード格納部と、管理端末からモードの変更の指示である変更指示を受け付ける変更指示受付部と、前記変更指示に応じて、前記モード格納部のモード情報を変更するモード変更部としてさらに機能させ、前記決定部は、前記モード格納部のモード情報に従って、対話エンジンを採用するか、管理端末を採用するかを決定するものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, the computer has either an automatic mode that employs a dialogue engine or a manual mode that employs a management terminal, at least a mode storage unit that stores mode information that can be taken, and a mode change from the management terminal. The change instruction receiving unit that receives the change instruction, which is the instruction of the above, and the mode changing unit that changes the mode information of the mode storage unit in response to the change instruction are further functioned, and the determining unit is the mode of the mode storage unit. It is preferable that the program functions the computer as a decision to adopt the dialogue engine or the management terminal according to the information.

また、上記プログラムにおいて、コンピュータを、前記一の自動情報を前記管理端末に出力する自動情報出力部としてさらに機能させ、ものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, it is preferable that the computer further functions as an automatic information output unit that outputs the above-mentioned one automatic information to the management terminal, and as a program, the computer functions.

また、上記プログラムにおいて、コンピュータを、前記手動情報を前記対話エンジンに出力する手動情報出力部としてさらに機能させ、ものとして、コンピュータを機能させるプログラムであることは好適である。 Further, in the above program, it is preferable that the computer further functions as a manual information output unit that outputs the manual information to the dialogue engine, and as a program, the computer functions.

（実施の形態４）
本実施の形態において、スタンドアロンの対話装置８について説明する。ここでは、対話装置８は、通常、２以上の対話エンジン３を有する。また、対話装置８は、例えば、対話システムＡのユーザ端末１、対話制御装置２、対話エンジン３を有する。ただし、ここでの対話装置８は、管理端末７は含まず、管理端末７と通信を行う。 (Embodiment 4)
In the present embodiment, the stand-alone dialogue device 8 will be described. Here, the dialogue device 8 usually has two or more dialogue engines 3. Further, the dialogue device 8 includes, for example, a user terminal 1 of the dialogue system A, a dialogue control device 2, and a dialogue engine 3. However, the dialogue device 8 here does not include the management terminal 7 and communicates with the management terminal 7.

図２２は、本実施の形態における対話装置８のブロック図である。 FIG. 22 is a block diagram of the dialogue device 8 according to the present embodiment.

対話装置８は、受付部８１、発話情報出力部８２、自動応答受付部８３、手動応答受付部８４、応答出力部８５、および１以上の対話エンジン３を備える。 The dialogue device 8 includes a reception unit 81, an utterance information output unit 82, an automatic response reception unit 83, a manual response reception unit 84, a response output unit 85, and one or more dialogue engines 3.

受付部８１は、ユーザが入力した発話に対応するユーザ発話情報を受け付ける。なお、受付部８１は、発話を受け付け、当該発話からユーザ発話情報を構成しても良い。また、ここでの受け付けは、音声データであるユーザ発話情報の受け付けでも良いし、テキストのユーザ発話情報の受け付けでも良い。ここで、受け付けとは、キーボードやマウス、タッチパネルなどの入力デバイスから入力された情報の受け付け、有線もしくは無線の通信回線を介して送信された情報の受信、光ディスクや磁気ディスク、半導体メモリなどの記録媒体から読み出された情報の受け付けなどを含む概念である。 The reception unit 81 receives the user utterance information corresponding to the utterance input by the user. The reception unit 81 may accept utterances and configure user utterance information from the utterances. Further, the reception here may be reception of user utterance information which is voice data, or reception of text user utterance information. Here, acceptance means accepting information input from an input device such as a keyboard, mouse, or touch panel, receiving information transmitted via a wired or wireless communication line, and recording an optical disk, magnetic disk, semiconductor memory, or the like. It is a concept including acceptance of information read from a medium.

ユーザ発話情報の入力手段は、キーボードやマウスやメニュー画面によるもの等、何でも良い。受付部８１は、テンキーやキーボード等の入力手段のデバイスドライバーや、メニュー画面の制御ソフトウェア等で実現され得る。 The input means of the user utterance information may be any means such as a keyboard, a mouse, and a menu screen. The reception unit 81 can be realized by a device driver for input means such as a numeric keypad or a keyboard, control software for a menu screen, or the like.

発話情報出力部８２は、ユーザ発話情報に関する発話情報を、２以上の対話エンジン３または画面のうちの少なくともどちらか一方に出力する。ここでの出力とは、発話情報の引き渡しを行われれば良い。なお、画面への出力とは、管理端末７の画面への出力でも良い。 The utterance information output unit 82 outputs utterance information related to user utterance information to at least one of two or more dialogue engines 3 or a screen. The output here may be the delivery of utterance information. The output to the screen may be the output to the screen of the management terminal 7.

発話情報出力部８２は、通常、ＭＰＵやメモリ等から実現され得る。発話情報出力部８２の処理手順は、通常、ソフトウェアで実現され、当該ソフトウェアはＲＯＭ等の記録媒体に記録されている。但し、ハードウェア（専用回路）で実現しても良い。発話情報出力部８２は、無線または有線の通信手段で実現されても良い。 The utterance information output unit 82 can usually be realized from an MPU, a memory, or the like. The processing procedure of the utterance information output unit 82 is usually realized by software, and the software is recorded in a recording medium such as ROM. However, it may be realized by hardware (dedicated circuit). The utterance information output unit 82 may be realized by a wireless or wired communication means.

自動応答受付部８３は、発話情報の１以上の対話エンジン３への出力に応じて、１以上の対話エンジン３からの応答である自動応答情報を受け付ける。自動応答受付部８３は、複数の自動応答情報から一の自動応答情報を選択する選択部２３１、または複数の対話エンジンから一の対話エンジンを選択する選択部４３１の機能を有する。 The automatic response reception unit 83 receives automatic response information which is a response from one or more dialogue engines 3 in response to the output of the utterance information to one or more dialogue engines 3. The automatic response reception unit 83 has a function of a selection unit 231 that selects one automatic response information from a plurality of automatic response information, or a selection unit 431 that selects one dialogue engine from a plurality of dialogue engines.

手動応答受付部８４は、発話情報の画面への出力に応じて、管理者からの応答である手動応答情報を受け付ける。ここでの受け付けは、管理端末７からの受信でも良い。 The manual response reception unit 84 receives the manual response information, which is a response from the administrator, in response to the output of the utterance information to the screen. The reception here may be reception from the management terminal 7.

手動応答受付部８４は、例えば、無線または有線のネットワークの受信手段等で実現され得る。 The manual response receiving unit 84 can be realized by, for example, a receiving means of a wireless or wired network.

応答出力部８５は、自動応答情報に関する自動情報または手動応答情報に関する手動情報を出力する。ここでの出力は、ユーザに対する出力であり、例えば、音声出力、またはディスプレイへの表示である。 The response output unit 85 outputs automatic information regarding the automatic response information or manual information regarding the manual response information. The output here is an output to the user, for example, an audio output or a display on a display.

応答出力部８５は、ディスプレイやスピーカー等の出力デバイスを含むと考えても含まないと考えても良い。応答出力部８５は、出力デバイスのドライバーソフトまたは、出力デバイスのドライバーソフトと出力デバイス等で実現され得る。 The response output unit 85 may or may not include an output device such as a display or a speaker. The response output unit 85 can be realized by the driver software of the output device, the driver software of the output device, the output device, or the like.

次に、対話装置８の動作について、図２３のフローチャートを用いて説明する。図２３のフローチャートにおいて、図４、図１１のフローチャートと同一のステップについて、説明を省略する。 Next, the operation of the dialogue device 8 will be described with reference to the flowchart of FIG. In the flowchart of FIG. 23, the same steps as those of the flowcharts of FIGS. 4 and 11 will be omitted.

（ステップＳ２３０１）受付部８１は、ユーザ発話情報を受け付けたか否かを判断する。ユーザ発話情報を受け付けた場合はステップＳ４０２に行き、受け付けない場合はステップＳ４１９に行く。なお、本ステップにおいて、受付部８１は、ユーザからの発話を受け付け、当該発話からユーザ発話情報を構成しても良い。かかる処理も、ユーザ発話情報の受け付けである。 (Step S2301) The reception unit 81 determines whether or not the user utterance information has been received. If the user utterance information is accepted, the process goes to step S402, and if the user utterance information is not accepted, the process goes to step S419. In this step, the reception unit 81 may accept utterances from the user and configure user utterance information from the utterances. Such processing is also the reception of user utterance information.

（ステップＳ２３０２）自動応答受付部８３は、対話エンジン３を動作させるか否かを判断する。対話エンジン３を動作させる場合はステップＳ２３０３に行き、対話エンジン３を動作させない場合はステップＳ２３０４に行く。なお、自動応答受付部５３は、例えば、格納部５１のモード情報が自動モードである場合、対話エンジン３を動作させると判断する。また、対話エンジン３を動作させる場合、自動応答受付部８３は、複数の対話エンジン３を動作させても良いし、選択した一の対話エンジン３を動作させても良い。対話エンジン３を選択する処理は上述した。 (Step S2302) The automatic response reception unit 83 determines whether or not to operate the dialogue engine 3. If the dialogue engine 3 is operated, the process goes to step S2303. If the dialogue engine 3 is not operated, the process goes to step S2304. The automatic response reception unit 53 determines that the dialogue engine 3 is operated, for example, when the mode information of the storage unit 51 is the automatic mode. Further, when operating the dialogue engine 3, the automatic response reception unit 83 may operate a plurality of dialogue engines 3 or may operate one selected dialogue engine 3. The process of selecting the dialogue engine 3 has been described above.

（ステップＳ２３０３）自動応答受付部５３は、一の対話エンジン３の出力である自動応答情報を取得する。なお、自動応答受付部８３が複数の対話エンジン３を動作させた場合、通常、２以上の自動応答情報から一の自動応答情報を選択する。一の自動応答情報を選択する処理は上述した。 (Step S2303) The automatic response receiving unit 53 acquires the automatic response information which is the output of one dialogue engine 3. When the automatic response receiving unit 83 operates a plurality of dialogue engines 3, usually, one automatic response information is selected from two or more automatic response information. The process of selecting one automatic response information has been described above.

（ステップＳ２３０４）手動応答受付部８４は、管理者の入力に対応する手動応答情報を取得する。なお、例えば、モード情報が手動モードである場合、手動応答受付部８４は、手動応答情報を取得できるまでウエイトする。 (Step S2304) The manual response reception unit 84 acquires the manual response information corresponding to the input of the administrator. For example, when the mode information is the manual mode, the manual response receiving unit 84 waits until the manual response information can be acquired.

（ステップＳ２３０５）応答出力部８５は、自動情報を画面に出力するか否かを判断する。画面に出力する場合はステップＳ２３０６に行き、出力しない場合はステップＳ２３０７に行く。なお、例えば、自動情報を画面に出力するか否かを示すフラグが格納部５１で管理されており、応答出力部８５は、かかるフラグを格納部８０から読み出し、判断する。また、応答出力部８５は、常に、自動情報を画面に出力しても良いし、常に、自動情報を画面に出力しなくても良い。また、本ステップの処理は、応答出力部８５が行ったが、図示しない判断手段が行っても良い。 (Step S2305) The response output unit 85 determines whether or not to output the automatic information to the screen. If it is output to the screen, it goes to step S2306, and if it is not output, it goes to step S2307. For example, a flag indicating whether or not to output automatic information to the screen is managed by the storage unit 51, and the response output unit 85 reads such a flag from the storage unit 80 and determines. Further, the response output unit 85 may always output the automatic information to the screen, or may not always output the automatic information to the screen. Further, although the processing of this step is performed by the response output unit 85, a determination means (not shown) may be used.

（ステップＳ２３０６）応答出力部８５は、自動情報を画面に出力する。 (Step S2306) The response output unit 85 outputs automatic information to the screen.

（ステップＳ２３０７）図示しない処理部は、手動情報を対話エンジン３に出力するか否かを判断する。対話エンジン３に出力する場合はステップＳ２３０８に行き、出力しない場合はステップＳ２３０９に行く。なお、例えば、手動情報を対話エンジン３に出力するか否かを示すフラグが格納部８０で管理されており、処理部２３は、かかるフラグを格納部２１から読み出し、判断する。また、処理部２３は、常に、手動情報を対話エンジン３に出力しても良いし、常に、手動情報を対話エンジン３に出力しなくても良い。 (Step S2307) A processing unit (not shown) determines whether or not to output manual information to the dialogue engine 3. If it is output to the dialogue engine 3, it goes to step S2308, and if it is not output, it goes to step S2309. For example, a flag indicating whether or not to output manual information to the dialogue engine 3 is managed by the storage unit 80, and the processing unit 23 reads such a flag from the storage unit 21 and determines. Further, the processing unit 23 may always output the manual information to the dialogue engine 3, or may not always output the manual information to the dialogue engine 3.

（ステップＳ２３０８）図示しない処理部は、取得された手動情報を対話エンジン３に出力する。 (Step S2308) A processing unit (not shown) outputs the acquired manual information to the dialogue engine 3.

（ステップＳ２３０９）応答出力部８は、ユーザに出力する情報を取得する。かかる情報は、自動情報または手動情報である。また、応答出力部８５は、例えば、モード情報が自動モードの場合は自動情報を取得し、手動モードの場合は手動情報を取得する。 (Step S2309) The response output unit 8 acquires information to be output to the user. Such information may be automatic or manual information. Further, the response output unit 85 acquires, for example, automatic information when the mode information is in the automatic mode, and acquires manual information when the mode information is in the manual mode.

（ステップＳ２３１０）応答出力部８５は、ステップＳ２３０９で取得した情報を出力する。 (Step S2310) The response output unit 85 outputs the information acquired in step S2309.

（ステップＳ２３１１）図示しない判断部は、モード情報を変更するための条件を満たすか否かを判断する。 (Step S2311) A determination unit (not shown) determines whether or not the condition for changing the mode information is satisfied.

（ステップＳ２３１２）ステップＳ２３１１における判断結果が、条件を満たすとの判断結果である場合はステップＳ２３１２に行き、条件を満たさないとの判断結果である場合はステップＳ２３１４に行く。 (Step S2312) If the determination result in step S2311 is a determination result that the condition is satisfied, the process goes to step S2312, and if the determination result is that the condition is not satisfied, the process goes to step S2314.

（ステップＳ２３１３）図示しないモード変更部は、格納部８０のモード情報を変更する。 (Step S2313) The mode changing unit (not shown) changes the mode information of the storage unit 80.

（ステップＳ２３１４）図示しない処理部は、ステップＳ２３１３における変更が、手動モードへの変更であるか否かを判断する。手動モードへの変更である場合はステップＳ２３１５に行き、手動モードへの変更でない場合はステップＳ２３１６に行く。 (Step S2314) A processing unit (not shown) determines whether or not the change in step S2313 is a change to the manual mode. If it is a change to the manual mode, it goes to step S2315, and if it is not a change to the manual mode, it goes to step S2316.

（ステップＳ２３１５）発話情報出力部８２は、最新の発話情報を画面に出力する。ステップＳ２３０２に戻る。 (Step S2315) The utterance information output unit 82 outputs the latest utterance information to the screen. Return to step S2302.

（ステップＳ２３１６）発話情報出力部８２は、最新の発話情報を各対話エンジン３に出力する。ステップＳ２３０２に戻る。 (Step S2316) The utterance information output unit 82 outputs the latest utterance information to each dialogue engine 3. Return to step S2302.

なお、図２３のフローチャートにおいて、電源オフや処理終了の割り込みにより処理は終了する。 In the flowchart of FIG. 23, the process ends when the power is turned off or an interrupt for the end of the process occurs.

以上、本実施の形態によれば、１または２以上の対話エンジン３からの応答と管理者からの応答とを使い分けて、ユーザに対して出力することにより、適切な対話を継続できる。 As described above, according to the present embodiment, an appropriate dialogue can be continued by properly using the response from one or two or more dialogue engines 3 and the response from the administrator and outputting them to the user.

なお、本実施の形態において、対話装置８は、管理端末７と通信した。しかし、対話装置８は、ユーザ端末１と通信しても良い。かかる場合、対話装置８は、管理端末７を具備しても良い。 In the present embodiment, the dialogue device 8 communicates with the management terminal 7. However, the dialogue device 8 may communicate with the user terminal 1. In such a case, the dialogue device 8 may include a management terminal 7.

さらに、本実施の形態における対話装置８を実現するソフトウェアは、以下のようなプログラムである。つまり、このプログラムは、２以上の対話エンジンから、ユーザの発話に対応する応答である自動応答情報を受け付け得る対話装置を実現するプログラムであって、コンピュータを、前記ユーザ発話情報に対応する情報である発話情報を、１以上の対話エンジンに対して出力し得る発話情報出力部と、前記発話情報の出力に応じて、前記１以上の各対話エンジンから自動応答情報を受け付ける自動応答受付部と、前記自動応答受付部が受け付けた１以上の自動応答情報のうちの一の自動応答情報に対応する一の自動情報を前記ユーザ端末に出力する応答出力部として機能させるためのプログラムである。 Further, the software that realizes the dialogue device 8 in the present embodiment is the following program. That is, this program is a program that realizes a dialogue device that can receive automatic response information that is a response corresponding to a user's utterance from two or more dialogue engines, and uses a computer as information corresponding to the user's utterance information. An utterance information output unit that can output certain utterance information to one or more dialogue engines, an automatic response reception unit that receives automatic response information from each of the one or more dialogue engines in response to the output of the utterance information, and an automatic response reception unit. This is a program for functioning as a response output unit that outputs one automatic information corresponding to one of the one or more automatic response information received by the automatic response reception unit to the user terminal.

上記プログラムにおいて、前記発話情報出力部は、前記ユーザ発話情報に関する発話情報を、１以上の対話エンジンまたは画面のうちの少なくともどちらか一方に出力し、前記発話情報の前記画面への出力に応じて、管理者からの応答である手動応答情報を受け付ける手動応答受付部をさらに具備し、前記応答出力部は、前記自動応答情報に関する自動情報または前記手動応答情報に関する手動情報を出力するものとして、コンピュータを機能させるプログラムであることは好適である。 In the above program, the utterance information output unit outputs utterance information related to the user utterance information to at least one of one or more dialogue engines or screens, and responds to the output of the utterance information to the screen. The computer further includes a manual response receiving unit that receives manual response information that is a response from the administrator, and the response output unit outputs automatic information about the automatic response information or manual information about the manual response information. It is preferable that the program is to function.

また、図２４は、本明細書で述べたプログラムを実行して、上述した種々の実施の形態の対話制御装置等を実現するコンピュータの外観を示す。上述の実施の形態は、コンピュータハードウェア及びその上で実行されるコンピュータプログラムで実現され得る。図２４は、このコンピュータシステム３００の概観図であり、図２５は、システム３００のブロック図である。 Further, FIG. 24 shows the appearance of a computer that executes the program described in the present specification to realize the dialogue control device and the like of the various embodiments described above. The embodiments described above may be implemented in computer hardware and computer programs running on it. FIG. 24 is an overview view of the computer system 300, and FIG. 25 is a block diagram of the system 300.

図２４において、コンピュータシステム３００は、ＣＤ−ＲＯＭドライブを含むコンピュータ３０１と、キーボード３０２と、マウス３０３と、モニタ３０４とを含む。 In FIG. 24, the computer system 300 includes a computer 301 including a CD-ROM drive, a keyboard 302, a mouse 303, and a monitor 304.

図２５において、コンピュータ３０１は、ＣＤ−ＲＯＭドライブ３０１２に加えて、ＭＰＵ３０１３と、ＣＤ−ＲＯＭドライブ３０１２等に接続されたバス３０１４と、ブートアッププログラム等のプログラムを記憶するためのＲＯＭ３０１５と、ＭＰＵ３０１３に接続され、アプリケーションプログラムの命令を一時的に記憶するとともに一時記憶空間を提供するためのＲＡＭ３０１６と、アプリケーションプログラム、システムプログラム、及びデータを記憶するためのハードディスク３０１７とを含む。ここでは、図示しないが、コンピュータ３０１は、さらに、ＬＡＮへの接続を提供するネットワークカードを含んでも良い。 In FIG. 25, in addition to the CD-ROM drive 3012, the computer 301 is connected to the MPU 3013, the bus 3014 connected to the CD-ROM drive 3012, the ROM 3015 for storing a program such as a bootup program, and the MPU 3013. It includes a RAM 3016 that is connected and for temporarily storing instructions of an application program and providing a temporary storage space, and a hard disk 3017 for storing an application program, a system program, and data. Although not shown here, the computer 301 may further include a network card that provides a connection to the LAN.

コンピュータシステム３００に、上述した実施の形態の対話制御装置等の機能を実行させるプログラムは、ＣＤ−ＲＯＭ３１０１に記憶されて、ＣＤ−ＲＯＭドライブ３０１２に挿入され、さらにハードディスク３０１７に転送されても良い。これに代えて、プログラムは、図示しないネットワークを介してコンピュータ３０１に送信され、ハードディスク３０１７に記憶されても良い。プログラムは実行の際にＲＡＭ３０１６にロードされる。プログラムは、ＣＤ−ＲＯＭ３１０１またはネットワークから直接、ロードされても良い。 A program for causing the computer system 300 to execute a function such as the dialogue control device of the above-described embodiment may be stored in the CD-ROM 3101, inserted into the CD-ROM drive 3012, and further transferred to the hard disk 3017. Alternatively, the program may be transmitted to the computer 301 via a network (not shown) and stored on the hard disk 3017. The program is loaded into RAM 3016 at run time. The program may be loaded directly from the CD-ROM3101 or network.

プログラムは、コンピュータ３０１に、上述した実施の形態の対話制御装置等の機能を実行させるオペレーティングシステム（ＯＳ）、またはサードパーティープログラム等は、必ずしも含まなくても良い。プログラムは、制御された態様で適切な機能（モジュール）を呼び出し、所望の結果が得られるようにする命令の部分のみを含んでいれば良い。コンピュータシステム３００がどのように動作するかは周知であり、詳細な説明は省略する。 The program does not necessarily include an operating system (OS) that causes the computer 301 to execute functions such as the dialogue control device of the above-described embodiment, or a third-party program or the like. The program need only include a portion of the instruction that calls the appropriate function (module) in a controlled manner to obtain the desired result. It is well known how the computer system 300 works, and detailed description thereof will be omitted.

なお、上記プログラムにおいて、情報を送信するステップや、情報を受信するステップなどでは、ハードウェアによって行われる処理、例えば、送信ステップにおけるモデムやインターフェースカードなどで行われる処理（ハードウェアでしか行われない処理）は含まれない。 In the above program, in the step of transmitting information and the step of receiving information, processing performed by hardware, for example, processing performed by a modem or interface card in the transmission step (performed only by hardware). Processing) is not included.

また、上記プログラムを実行するコンピュータは、単数であってもよく、複数であってもよい。すなわち、集中処理を行ってもよく、あるいは分散処理を行ってもよい。 Further, the number of computers that execute the above program may be singular or plural. That is, centralized processing may be performed, or distributed processing may be performed.

また、上記各実施の形態において、一の装置に存在する２以上の通信手段は、物理的に一の媒体で実現されても良いことは言うまでもない。 Further, it goes without saying that in each of the above embodiments, the two or more communication means existing in one device may be physically realized by one medium.

また、上記各実施の形態において、各処理は、単一の装置によって集中処理されることによって実現されてもよく、あるいは、複数の装置によって分散処理されることによって実現されてもよい。 Further, in each of the above-described embodiments, each process may be realized by centralized processing by a single device, or may be realized by distributed processing by a plurality of devices.

本発明は、以上の実施の形態に限定されることなく、種々の変更が可能であり、それらも本発明の範囲内に包含されるものであることは言うまでもない。 It goes without saying that the present invention is not limited to the above embodiments, and various modifications can be made, and these are also included in the scope of the present invention.

以上のように、本発明にかかる対話制御装置は、複数の対話エンジンを効果的に用いて、適切な対話を継続させることができるという効果を有し、対話制御装置等として有用である。 As described above, the dialogue control device according to the present invention has an effect that an appropriate dialogue can be continued by effectively using a plurality of dialogue engines, and is useful as a dialogue control device or the like.

１ユーザ端末
２、４、５、６対話制御装置
３、６対話エンジン
７管理端末
８対話装置
１１ユーザ受付部
１２ユーザ処理部
１３ユーザ送信部
１４ユーザ受信部
１５ユーザ出力部
２１、４１、５１、８０格納部
２２受信部
２３、４３、５３処理部
２４、５４出力部
２５、５５、８１受付部
２６、５６送信部
３１、６１学習情報格納部
３２、６２発話受付部
３３、６３応答取得部
３４、６４、８５応答出力部
３５、６５学習受付部
３６、６６学習部
５１、２１１、５１１モード格納部
５３、８３、２５１、５５１自動応答受付部
７０管理格納部
７１管理情報受付部
７２管理処理部
７３管理者出力部
７４管理指示受付部
７５管理出力部
８２、２４１、５４１発話情報出力部
８４、５５２手動応答受付部
２３１、４３１、５３４選択部
２４２、５４２自動情報出力部
２４２自動応答情報出力部
２６１、５６１応答送信部
５３１決定部
５３２判断部
５３３モード変更部
５４３手動情報出力部
５５２手動情報受付部
５５３変更指示受付部 1 User terminal 2, 4, 5, 6 Dialogue control device 3, 6 Dialogue engine 7 Management terminal 8 Dialogue device 11 User reception unit 12 User processing unit 13 User transmission unit 14 User reception unit 15 User output unit 21, 41, 51, 80 Storage unit 22 Reception unit 23, 43, 53 Processing unit 24, 54 Output unit 25, 55, 81 Reception unit 26, 56 Transmission unit 31, 61 Learning information storage unit 32, 62 Speech reception unit 33, 63 Response acquisition unit 34 , 64, 85 Response output unit 35, 65 Learning reception unit 36, 66 Learning unit 51, 211,511 Mode storage unit 53, 83, 251, 551 Automatic response reception unit 70 Management storage unit 71 Management information reception unit 72 Management processing unit 73 Administrator output unit 74 Management instruction reception unit 75 Management output unit 82, 241, 541 Speech information output unit 84, 552 Manual response reception unit 231, 431, 534 Selection unit 242, 542 Automatic information output unit 242 Automatic response information output unit 261 and 561 Response transmission unit 531 Decision unit 532 Judgment unit 533 Mode change unit 543 Manual information output unit 552 Manual information reception unit 553 Change instruction reception unit

Claims

A receiver that receives user utterance information corresponding to the user's utterance from the user terminal,
An utterance information output unit capable of outputting utterance information, which is information corresponding to the user utterance information, to two or more dialogue engines, and an utterance information output unit.
An automatic response reception unit that receives automatic response information from each of the two or more dialogue engines according to the output of the utterance information, and an automatic response reception unit.
A selection unit that selects one of the two or more automatic response information received by the automatic response reception unit, and a selection unit.
It includes a response transmission unit that transmits one automatic information corresponding to the one automatic response information selected by the selection unit to the user terminal.
By repeating the processes of the receiving unit, the utterance information output unit, the automatic response receiving unit, the selecting unit, and the response transmitting unit two or more times, dialogue control for continuing the dialogue with the user is performed. Dialogue control device.

The selection unit
The dialogue control device according to claim 1, wherein one automatic response information that matches a predetermined condition is selected from the two or more automatic response information.

Further, a storage unit for storing priority information indicating the priority of the two or more dialogue engines is provided.
The selection unit
Claim 1 for selecting the automatic response information output from the dialogue engine having the highest priority based on the priority information, among the automatic response information satisfying a predetermined condition among the two or more automatic response information. Alternatively, the dialogue control device according to claim 2.

A receiver that receives user utterance information corresponding to the user's utterance from the user terminal,
A selection unit that selects one dialogue engine from two or more dialogue engines using the user utterance information, and
An utterance information output unit that outputs utterance information, which is information corresponding to the user utterance information, to one dialogue engine selected by the selection unit.
An automatic response reception unit that receives automatic response information from the one dialogue engine in response to the output of the utterance information, and an automatic response reception unit.
It is provided with a response transmission unit that transmits one automatic information corresponding to the automatic response information received by the automatic response reception unit to the user terminal.
By repeating the processes of the receiving unit, the selection unit, the utterance information output unit, the utterance information output unit, and the response transmission unit two or more times, dialogue control for continuing the dialogue with the user is performed. Dialogue control device.

The utterance information output unit
The utterance information can also be output to the management terminal,
A manual response receiving unit that receives manual response information, which is a response from the management terminal, is further provided in response to the output of the utterance information to the management terminal.
The response transmitter is
The item according to any one of claims 1 to 4, wherein the one automatic information which is the information corresponding to the one automatic response information or the manual information which is the information corresponding to the manual response information is transmitted to the user terminal. Dialogue control device.

It also has a decision-making unit that decides whether to adopt a dialogue engine or a management terminal.
The response transmitter is
If the decision decides to adopt the dialogue engine, the one automatic information is transmitted to the user terminal, and if the decision decides to adopt the management terminal, the manual information is transmitted to the user terminal. 5. The dialogue control device according to claim 5.

A mode storage unit that stores at least possible mode information, either an automatic mode that employs a dialogue engine or a manual mode that employs a management terminal.
A judgment unit that determines whether or not a predetermined condition for changing the mode information is satisfied, and
When the determination unit determines that a predetermined condition is satisfied, the mode change unit for changing the mode information is further provided.
The decision unit
The dialogue control device according to claim 6, wherein the mode information of the mode storage unit is acquired.

A mode storage unit that stores at least possible mode information, either an automatic mode that employs a dialogue engine or a manual mode that employs a management terminal.
A change instruction reception unit that accepts change instructions that are mode change instructions from the management terminal,
Further provided with a mode change unit that changes the mode information of the mode storage unit in response to the change instruction.
The decision unit
The dialogue control device according to claim 6, wherein the dialogue control device determines whether to adopt a dialogue engine or a management terminal according to the mode information of the mode storage unit.

The dialogue control device according to any one of claims 5 to 8, further comprising an automatic information output unit that outputs the above-mentioned one automatic information to the management terminal.

The dialogue control device according to any one of claims 5 to 9, further comprising a manual information output unit that outputs the manual information to the dialogue engine.

A learning information storage unit that stores learning information, which is information for acquiring automatic response information corresponding to utterance information, and
An utterance receiving unit that receives utterance information from the dialogue control device according to any one of claims 1 to 10.
A response acquisition unit that acquires automatic response information, which is a response corresponding to the utterance information received by the utterance reception unit, using the learning information.
A response output unit that outputs the automatic response information acquired by the response acquisition unit to the dialogue control device, and a response output unit.
A learning reception unit that accepts other automatic response information that is a response from another dialogue engine,
A dialogue engine including a learning unit that adds information to the learning information by using other automatic response information received by the learning reception unit.

A reception unit that accepts user utterance information corresponding to the user's utterance,
An utterance information output unit capable of outputting utterance information, which is information corresponding to the user utterance information, to two or more dialogue engines, and an utterance information output unit.
An automatic response reception unit that receives automatic response information from each of the two or more dialogue engines according to the output of the utterance information, and an automatic response reception unit.
A selection unit that selects one of the two or more automatic response information received by the automatic response reception unit, and a selection unit.
It is provided with a response output unit that outputs one automatic information corresponding to the one automatic information corresponding to the one automatic response information selected by the selection unit.
A dialogue device that interacts with the user by repeating the processes of the reception unit, the utterance information output unit, the automatic response reception unit, the selection unit, and the response output unit two or more times.

A reception unit that accepts user utterance information corresponding to the user's utterance,
Using the user utterance information, a selection unit for selecting one of the dialog engine from two or more dialog engine,
The information der Ru speech information corresponding to the user utterance information, the utterance information output unit that outputs one of the dialog engine, wherein the selecting unit selects,
An automatic response reception unit that receives automatic response information from the one dialogue engine in response to the output of the utterance information, and an automatic response reception unit.
It is provided with a response output unit that outputs one automatic information corresponding to the automatic response information received by the automatic response reception unit.
A dialogue device that interacts with the user by repeating the processes of the reception unit, the selection unit, the utterance information output unit, the automatic response reception unit, and the response output unit two or more times.

It is a dialogue control method realized by a receiving unit, an utterance information output unit, an automatic response receiving unit, a selection unit, and a response transmitting unit.
A reception step in which the receiving unit receives user utterance information corresponding to the user's utterance from the user terminal, and
An utterance information output step in which the utterance information output unit can output utterance information, which is information corresponding to the user utterance information, to two or more dialogue engines.
An automatic response reception step in which the automatic response reception unit receives automatic response information from each of the two or more dialogue engines in response to the output of the utterance information.
A selection step in which the selection unit selects one of the two or more automatic response information received in the automatic response reception step.
The response transmission unit includes a response transmission step of transmitting one automatic information corresponding to the one automatic response information selected in the selection step to the user terminal.
Dialogue control for continuing dialogue with the user by repeating the reception step, the utterance information output step, the automatic response reception step, the selection step, and the response transmission step two or more times. Method.

It is a dialogue control method realized by a receiving unit, a selection unit, an utterance information output unit, an automatic response receiving unit, and a response transmitting unit.
A reception step in which the receiving unit receives user utterance information corresponding to the user's utterance from the user terminal, and
The selection unit includes a selection step of using said user's utterance information, selects one interactive engine from two or more dialog engine,
The utterance information output unit, the information der Ru speech information corresponding to the user utterance information, the utterance information output step of outputting to an interactive engine in which the selecting unit selects,
An automatic response reception step in which the automatic response reception unit receives automatic response information from one dialogue engine selected in the selection step in response to the output of the utterance information.
The response transmission unit includes a response transmission step of transmitting one automatic information corresponding to the automatic response information received in the automatic response reception step to the user terminal.
Dialogue control for continuing dialogue with the user by repeating the reception step, the selection step, the utterance information output step, the automatic response reception step, and the response transmission step two or more times. Method.

Computer,
A receiver that receives user utterance information corresponding to the user's utterance from the user terminal,
An utterance information output unit capable of outputting utterance information, which is information corresponding to the user utterance information, to two or more dialogue engines, and an utterance information output unit.
An automatic response reception unit that receives automatic response information from each of the two or more dialogue engines according to the output of the utterance information, and an automatic response reception unit.
A selection unit that selects one of the two or more automatic response information received by the automatic response reception unit, and a selection unit.
A program for functioning as a response transmission unit that transmits one automatic information corresponding to the one automatic response information selected by the selection unit to the user terminal.
By repeating the processes of the receiving unit, the utterance information output unit, the automatic response receiving unit, the selecting unit, and the response transmitting unit two or more times, dialogue control for continuing the dialogue with the user is performed. program.

Computer,
A receiver that receives user utterance information corresponding to the user's utterance from the user terminal,
Using the user utterance information, a selection unit for selecting one of the dialog engine from two or more dialog engine,
The information der Ru speech information corresponding to the user utterance information, the utterance information output unit that outputs one of the dialog engine, wherein the selecting unit selects,
An automatic response reception unit that receives automatic response information from the one dialogue engine in response to the output of the utterance information, and an automatic response reception unit.
It functions as a response transmission unit that transmits one automatic information corresponding to the automatic response information received by the automatic response reception unit to the user terminal.
By repeating the processes of the receiving unit, the selecting unit, the utterance information output unit, the automatic response receiving unit, and the response transmitting unit two or more times, dialogue control for continuing the dialogue with the user is performed. program.

Computer,
A reception unit that accepts user utterance information corresponding to the user's utterance,
An utterance information output unit capable of outputting utterance information, which is information corresponding to the user utterance information, to two or more dialogue engines, and an utterance information output unit.
An automatic response reception unit that receives automatic response information from each of the two or more dialogue engines according to the output of the utterance information, and an automatic response reception unit.
A selection unit that selects one of the two or more automatic response information received by the automatic response reception unit, and a selection unit.
It is made to function as a response output unit that outputs one automatic information corresponding to one automatic information corresponding to the one automatic response information selected by the selection unit.

A program that interacts with the user by repeating the processes of the reception unit, the utterance information output unit, the automatic response reception unit, the selection unit, and the response output unit two or more times.

Computer,
A reception unit that accepts user utterance information corresponding to the user's utterance,
Using the user utterance information, a selection unit for selecting one of the dialog engine from two or more dialog engine,
The information der Ru speech information corresponding to the user utterance information, the utterance information output unit that outputs one of the dialog engine, wherein the selecting unit selects,
An automatic response reception unit that receives automatic response information from the one dialogue engine in response to the output of the utterance information, and an automatic response reception unit.
It functions as a response output unit that outputs one automatic information corresponding to the automatic response information received by the automatic response reception unit.
A program that interacts with the user by repeating the processes of the reception unit, the selection unit, the utterance information output unit, the automatic response reception unit, and the response output unit two or more times.