JP2001034289A

JP2001034289A - Interactive system using natural language

Info

Publication number: JP2001034289A
Application number: JP11202538A
Authority: JP
Inventors: Seiya Osada; 誠也長田; Shinichi Doi; 伸一土井
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-07-16
Filing date: 1999-07-16
Publication date: 2001-02-09
Anticipated expiration: 2019-07-16
Also published as: JP3662780B2

Abstract

PROBLEM TO BE SOLVED: To provide an interactive system capable of reducing the load of a user by allowing the system to interpret the vague speaking of the user and changing the response of the system while using the degree of assurance of the interpreted result. SOLUTION: This system is provided with a natural language analyzing means 21 which receives a character string from an input device 1 and converts it into plural task commands by referring to a dictionary storage part 31, an ambiguity dissolving means 22 which narrows down the plural task commands to one by referring to an ambiguity dissolving information storage part 32 and also obtains the degree of assurance from the method of the narrowing down of the commands and a response sentence generating means 23 which generates a response sentence in accordance with the degree of assurance by referring to the narrowed down task command and the obtained degree of assurance and the dictionary storage part 31 and outputs it to an output device 4.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、自然言語を用いた
対話システムに関し、特に、ユーザの曖昧な発話を解釈
しユーザへの応答を行なうシステム及び装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a dialogue system using a natural language, and more particularly to a system and apparatus for interpreting a vague utterance of a user and responding to the user.

【０００２】[0002]

【従来の技術】タスクに対するユーザの発話は一般的に
は曖昧なものであり、曖昧な発話を解釈する従来のシス
テムとして以下のようなものがあげられる。2. Description of the Related Art Generally speaking, a user's utterance for a task is ambiguous, and the following systems are known as conventional systems for interpreting ambiguous utterance.

【０００３】第１のシステムは、曖昧な発話に対してシ
ステムが確認の応答を行い、それに対してユーザに肯定
か否定かを答えさせることでユーザの発話を解釈するも
のである。例えば特開平9-171394号公報には、システム
がユーザの曖昧な発話から生成できるタスクを列挙し、
列挙したタスクの中から１つずつユーザの意図に合うか
どうか質問し、ユーザが肯定を意味する語を入力する
と、システムはそのタスクが正しい解釈であると決定す
る。[0003] In the first system, the system responds to a vague utterance to confirm and interprets the utterance of the user by allowing the user to answer affirmative or negative. For example, in Japanese Patent Application Laid-Open No. 9-171394, tasks that the system can generate from ambiguous utterances of the user are listed,
The system asks each of the listed tasks one by one to see if it meets the user's intention, and if the user enters a word that means affirmation, the system determines that the task is the correct interpretation.

【０００４】第２のシステムは、ユーザの曖昧な発話を
システムはさまざまな手がかりを用いて一意に解釈し
て、確認なしにタスクを実行するものである。[0004] In the second system, a system interprets a user's ambiguous utterance uniquely using various clues and executes a task without confirmation.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記し
た従来のシステムは下記記載の問題点を有している。However, the above-mentioned conventional system has the following problems.

【０００６】上記第１のシステムの問題点は、ユーザが
曖昧な発話するたびにシステムが確認の応答文を出力す
ることから、ユーザはそのたびに応答文に答えなくては
ならず、煩雑であり、負担がかかる、ということであ
る。A problem with the first system is that the system outputs a response sentence for confirmation every time the user makes an ambiguous utterance, so that the user must answer the response sentence each time, which is complicated. Yes, it is burdensome.

【０００７】一方、上記第２のシステムの問題点は、ユ
ーザの曖昧性のある発話を一意に解釈して必ず実行して
しまうと、システムの解釈が間違っていた時にユーザは
システムが行ってしまったタスクに対して修正を行わな
くてはならず、結果としてユーザに負担がかかる、とい
うことである。[0007] On the other hand, the problem of the second system is that if a user's ambiguous utterance is uniquely interpreted and executed without fail, the user will execute the system when the system interpretation is incorrect. This means that the user has to make corrections for the tasks that have been performed, which results in a burden on the user.

【０００８】したがって、本発明は、上記問題点に鑑み
てなされたものであって、その目的は、ユーザが発話し
た後、ユーザにできるだけ負荷のかけることなく、タス
クの実行と応答文の出力を行なうシステム及び装置を提
供することにある。これ以外の本発明の目的、特徴、利
点等は以下の説明から、当業者には直ちに明らかとされ
るであろう。Therefore, the present invention has been made in view of the above problems, and has as its object to execute task execution and output of a response sentence after a user speaks without imposing as much load on the user as possible. It is an object of the present invention to provide a system and apparatus for performing the above. Other objects, features, advantages and the like of the present invention will be immediately apparent to those skilled in the art from the following description.

【０００９】[0009]

【課題を解決するための手段】前記目的を達成する本発
明は、入力手段から入力された自然言語を受け取り、語
彙辞書を参照して１又は複数のタスクコマンドに変換す
る自然言語解析手段と、曖昧性解消情報を参照して、複
数のタスクコマンドを１つに絞り込むとともに、絞り込
みの仕方から、確信度を求める曖昧性解消手段と、絞り
込んだタスクコマンドと求められた確信度と語彙辞書を
参照して確信度に応じた応答文を生成し、出力手段に出
力する応答文生成手段とを備える。より詳細には、本発
明は、入力された自然言語を解釈するために必要な語彙
と、該自然言語の解釈結果から応答文を生成するために
必要な語彙を予め記憶する辞書記憶部と、前記辞書記憶
部を参照して、入力された自然言語の解釈を行う自然言
語解析手段と、前記自然言語解析手段で求められた解釈
結果が複数存在した時に、１つの解釈結果に絞り込むた
めの手がかりとして用いられる情報を記憶する曖昧性解
消情報記憶部と、前記自然言語解析手段で求められた解
釈結果が複数存在した時に、前記曖昧性解消情報記憶部
を参照して、前記複数の解釈結果を１つに絞り込むとと
もに、前記１つに絞り込んだ解釈結果に対する確信度を
求める曖昧性解消手段と、前記曖昧性解消手段で絞り込
んだ解釈結果を応答文として出力し、その際、前記曖昧
性解消手段で求められた前記確信度に応じて応答文を変
化させる応答文生成手段と、を備える。According to the present invention, there is provided a natural language analyzing means for receiving a natural language input from an input means and converting the natural language into one or a plurality of task commands with reference to a vocabulary dictionary. By referring to the disambiguation information, narrow down a plurality of task commands to one, and refer to the disambiguation removing means for obtaining the certainty from the narrowing down method, and the narrowed task command, the certainty determined and the vocabulary dictionary. And a response sentence generating means for generating a response sentence according to the certainty factor and outputting the response sentence to the output means. More specifically, the present invention provides a vocabulary necessary for interpreting an input natural language, a dictionary storage unit for storing in advance a vocabulary necessary for generating a response sentence from the interpretation result of the natural language, A natural language analyzing unit that refers to the dictionary storage unit to interpret an input natural language; and a clue for narrowing down to one interpretation result when there are a plurality of interpretation results obtained by the natural language analyzing unit. An ambiguity resolution information storage unit that stores information used as the information, and when there are a plurality of interpretation results obtained by the natural language analysis unit, refer to the ambiguity resolution information storage unit to convert the plurality of interpretation results. An ambiguity resolving means for narrowing down to one, and obtaining a certainty factor for the interpretation result narrowed to one, and outputting the interpretation result narrowed down by the ambiguity resolving means as a response sentence; And a response sentence generation means for changing the response sentence according to the confidence obtained by the sexual eliminating means.

【００１０】本発明は、前記曖昧性解消手段で求められ
た解釈を受けて、該解釈をタスクコマンドに変換し、前
記タスクコマンドからタスクを実行するタスク実行手段
をさらに備える。[0010] The present invention further comprises a task execution means for receiving the interpretation determined by the ambiguity resolution means, converting the interpretation into a task command, and executing a task from the task command.

【００１１】本発明においては、前記応答文生成手段
が、前記曖昧性解消手段で求められた確信度に応じて、
前記タスク実行手段でタスクを実行した後にタスクの実
行を報告する応答文を生成するか、もしくは、前記タス
ク実行手段でタスクを実行する前にタスクを実行してと
よいか否かを確認するための応答文を生成するか、を決
定する。[0011] In the present invention, the response sentence generation means may determine the response sentence according to the certainty factor obtained by the ambiguity resolution means.
To generate a response statement reporting the execution of the task after executing the task by the task execution unit, or to confirm whether the task execution unit should execute the task before executing the task Is generated or not.

【００１２】本発明においては、実行を行ったタスクコ
マンドに対する逆操作のタスクコマンドを記憶するキャ
ンセルコマンド記憶部と、キャンセルの意味を表す入力
があったときに、前記タスク実行手段でキャンセルコマ
ンドを実行するとともに、前記応答文生成手段におい
て、キャンセルコマンドを実行したことを報告する応答
文を生成する。In the present invention, a cancel command storage unit for storing a task command of an inverse operation to the executed task command, and executing the cancel command by the task execution means when there is an input indicating the meaning of cancellation. At the same time, the response sentence generating means generates a response sentence reporting that the cancel command has been executed.

【００１３】本発明においては、システムがタスクコマ
ンドを実行してようかどうかの確認をとる応答文を出力
した後、時間の経過を測定する時間管理手段を備え、前
記時間管理手段で計時を開始後、あらかじめ定められた
所定の時間が経過しても、ユーザからの応答がない時に
は、肯定を表す入力が前記入力装置に入力されたものと
解釈する。According to the present invention, there is provided time management means for measuring the passage of time after outputting a response sentence for confirming whether or not the system executes the task command, and the time management means starts time measurement. Thereafter, if there is no response from the user even after a predetermined time has elapsed, it is interpreted that an input indicating affirmation has been input to the input device.

【００１４】[0014]

【発明の実施の形態】本発明の好ましい実施の形態につ
いて以下に説明する。本発明の自然言語を用いた対話シ
ステムは、入力された自然言語を解釈する自然言語解析
手段（図１の２１）と、解釈結果が複数存在する時に解
釈を１つに絞り込むとともに、その絞り込み方を用い
て、確信度を求める曖昧性解消手段（図１の２２）と、
１つに絞り込まれたタスクコマンドから確信度に応じた
応答文を生成する応答文生成手段（図１の２３）と、を
有する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Preferred embodiments of the present invention will be described below. The dialogue system using a natural language according to the present invention includes a natural language analyzing means (21 in FIG. 1) for interpreting an input natural language, and a method for narrowing down the interpretation to one when there are a plurality of interpretation results. Ambiguity resolution means (22 in FIG. 1) for obtaining a certainty factor using
Response sentence generating means (23 in FIG. 1) for generating a response sentence corresponding to the degree of certainty from the task commands narrowed down to one.

【００１５】本発明においては、ユーザの曖昧な発話を
確信度というパラメータに変換し、確信度に応じた応答
文を生成することにより、自然な対話を行うことができ
る。In the present invention, a natural conversation can be performed by converting a vague utterance of a user into a parameter called certainty factor and generating a response sentence according to the certainty factor.

【００１６】本発明はその好ましい第１の実施の形態に
おいて、図１を参照すると、入力手段（１）から入力さ
れた自然言語に対して辞書記憶部（３１）を参照して解
釈を行う自然言語解析手段（２１）と、自然言語解析手
段（２１）で求められた解釈結果が複数存在した時に、
自然言語解析手段（２１）で求められた解釈結果が複数
存在した時に、解釈結果に絞り込むための手がかりとし
て用いられる情報を記憶する曖昧性解消情報記憶部（３
２）の情報を参照して、前記複数の解釈結果を１つに絞
り込むとともに、前記１つに絞り込んだ解釈結果に対す
る確信度を求める曖昧性解消手段（２２）と、曖昧性解
消手段（２２）で絞り込んだ解釈結果を応答文として出
力手段（４）から出力し、その際、前記曖昧性解消手段
で求められた前記確信度に応じて応答文を変化させる応
答文生成手段（２３）と、を備える。In the first preferred embodiment of the present invention, referring to FIG. 1, a natural language for interpreting a natural language input from an input means (1) by referring to a dictionary storage section (31). When there are a plurality of interpretation results obtained by the language analysis means (21) and the natural language analysis means (21),
When there are a plurality of interpretation results obtained by the natural language analysis means (21), the disambiguation information storage unit (3) stores information used as a clue for narrowing down the interpretation results.
With reference to the information of 2), the plurality of interpretation results are narrowed down to one, and the ambiguity resolving means (22) for obtaining a certainty factor for the narrowed down interpretation result, and an ambiguity resolving means (22) A response sentence generating means (23) for outputting the interpretation result narrowed down by the output means (4) as a response sentence, and at this time, changing the response sentence according to the certainty factor obtained by the ambiguity eliminating means; Is provided.

【００１７】自然言語解析手段（２１）で求められた複
数の解釈として複数の処理要求（タスクコマンド）があ
る場合、曖昧性解消手段（２２）は、曖昧性解消情報記
憶部（３２）に記憶された情報（例えば処理対象の物理
的な条件、入力された自然言語の文脈、ユーザの操作履
歴等）に基づき、複数の処理要求のうち妥当な処理要求
を一つ選択するとともに確信度を設定する。その際、自
然言語解析手段（２１）で求められた解釈が一つの場
合、曖昧性がないため、確信度最大とする。When there are a plurality of processing requests (task commands) as a plurality of interpretations obtained by the natural language analysis means (21), the ambiguity resolution means (22) stores the ambiguity resolution information storage section (32). Based on the received information (eg, the physical conditions of the processing target, the context of the input natural language, the operation history of the user, etc.), select one appropriate processing request from a plurality of processing requests and set the certainty factor I do. At this time, if the interpretation obtained by the natural language analysis means (21) is one, there is no ambiguity.

【００１８】本発明は、その好ましい第２の実施の形態
において、図３を参照すると、曖昧性解消手段（２２）
で求められた解釈を受けて、該解釈をタスクコマンドに
変換し、前記タスクコマンドからタスクを実行するタス
ク実行手段（２４）をさらに備える。In the second preferred embodiment of the present invention, referring to FIG. 3, the disambiguation means (22)
Receiving the interpretation determined in (1), converts the interpretation into a task command, and further executes a task executing means (24) for executing a task from the task command.

【００１９】応答文生成手段（２３）は、前記曖昧性解
消手段（２２）で求められた確信度に応じて、前記タス
ク実行手段（２４）でタスクを実行した後にタスクの実
行を報告する応答文を生成するか、もしくは、タスク実
行手段（２４）でタスクを実行する前にタスクを実行し
てよいか否かを確認するための応答文を生成するかを決
定する。The response sentence generation means (23) responds to the task execution means (24) according to the certainty factor obtained by the ambiguity resolution means (22), after the task has been executed by the task execution means (24). It is determined whether a statement is to be generated or a response statement for confirming whether the task can be executed before the task is executed by the task execution means (24).

【００２０】すなわち、自然言語解析手段（２１）で
は、入力された自然言語を解釈し、該入力された自然言
語から解釈可能な処理要求（タスクコマンド）が複数あ
る場合、前記曖昧性解消手段（２２）で複数のタスクコ
マンドを一つに絞り込み、該一つの絞り込まれた確信度
が小の場合、タスクコマンドを直ちに実行せずに、未実
行タスクコマンド記憶部（３３）に一旦記憶しておき、
タスクを実行する前にタスクを実行してよいか否かを確
認するための応答文を出力する。該応答文に対してユー
ザが肯定を意味する入力を行なった場合、未実行タスク
コマンド記憶部（３３）からタスクコマンドを取り出し
て、タスク実行手段でタスクコマンドを実行し、タスク
実行完了の応答文を出力し、一方否定入力の場合、未実
行タスク取り消しの応答文を作成して出力する。That is, the natural language analysis means (21) interprets the input natural language, and when there are a plurality of processing requests (task commands) interpretable from the input natural language, the ambiguity resolution means (21) At 22), a plurality of task commands are narrowed down to one, and when the one certainty factor is small, the task command is not immediately executed, but is temporarily stored in the non-executed task command storage unit (33). ,
Outputs a response statement to confirm whether the task can be executed before executing the task. When the user inputs an affirmative answer to the response statement, the task command is extracted from the unexecuted task command storage unit (33), the task command is executed by the task execution means, and the response statement of task execution completion is obtained. Is output. On the other hand, in the case of a negative input, a response statement for canceling an unexecuted task is created and output.

【００２１】応答文生成手段（２３）が、曖昧性解消手
段（２２）で求められた確信度に応じて、応答文を生成
するかしないかを決定するようにしてもよい。The response sentence generating means (23) may determine whether or not to generate a response sentence according to the degree of certainty obtained by the ambiguity eliminating means (22).

【００２２】応答文生成手段（２３）は、曖昧性解消手
段（２２）で求められた確信度に応じて、曖昧性解消手
段（２２）が解釈した結果を、報告、確認、質問のうち
のいずれかを表す応答文に変更する。The response sentence generating means (23) outputs the result interpreted by the disambiguation means (22) in accordance with the degree of certainty obtained by the disambiguation means (22). Change to a response statement that indicates one of them.

【００２３】本発明は、その好ましい第３の実施の形態
において、図５を参照すると、タスク実行手段で実行を
行ったタスクコマンドに対する逆操作のタスクコマンド
を記憶するキャンセルコマンド記憶部（３４）を備え、
ユーザからキャンセルの意味を表す入力があったとき
に、タスク実行手段でキャンセルコマンドを実行すると
ともに、応答文生成手段（２２）では、キャンセルコマ
ンドを実行したことを報告する応答文を生成する。In the third preferred embodiment of the present invention, referring to FIG. 5, a cancel command storage unit (34) for storing a task command of an inverse operation to the task command executed by the task execution means is provided. Prepared,
When there is an input indicating the meaning of cancellation from the user, the task execution means executes the cancel command, and the response sentence generation means (22) generates a response sentence reporting that the cancel command has been executed.

【００２４】本発明は、その好ましい第４の実施の形態
において、図７を参照すると、絞り込まれたタスクコマ
ンドの確信度が低いときに、応答文生成手段（２２）で
は、タスクコマンドを実行していいかどうかの確認をと
る応答文を出力し、該応答文を出力した後、時間の経過
を測定するタイマ等の時間管理手段（３５）を備え、計
時を開始後、あらかじめ定められた所定の時間が経過し
ても、ユーザからの応答がない時には、肯定を表す入力
が入力手段（１）に入力されたものと解釈する。According to the fourth embodiment of the present invention, referring to FIG. 7, when the certainty degree of the narrowed task command is low, the response sentence generation means (22) executes the task command. A response sentence for confirming whether or not to proceed is output, and after outputting the response sentence, time management means (35) such as a timer for measuring the passage of time is provided. If there is no response from the user even after a lapse of time, it is interpreted that an input indicating affirmation has been input to the input means (1).

【００２５】時間管理手段（３５）によって前記所定の
時間が経過した後に、肯定を表す入力が入力手段（１）
に入力されたか否かを、曖昧性解消手段（２２）で求め
た確信度に応じて決定するようにしてもよい。After the predetermined time has passed by the time management means (35), an input indicating affirmative is input to the input means (1).
May be determined according to the certainty factor obtained by the disambiguation means (22).

【００２６】本発明の実施の形態において、自然言語解
析手段、曖昧性解消手段、応答文生成手段、及びタスク
実行手段はコンピュータ等データ処理装置上で実行され
るプログラムにより各々の処理・機能が実現される。こ
の場合、該プログラムを記録した記録媒体又は通信媒体
より該プログラムをデータ処理装置に読み出して実行す
ることにより本発明を実施することができる。In the embodiment of the present invention, the natural language analyzing means, the ambiguity eliminating means, the response sentence generating means, and the task executing means realize their respective processing and functions by a program executed on a data processing device such as a computer. Is done. In this case, the present invention can be implemented by reading the program from a recording medium or a communication medium on which the program is recorded to a data processing device and executing the program.

【００２７】[0027]

【実施例】本発明の実施例について図面を参照して以下
に説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be described below with reference to the drawings.

【００２８】本発明の第１の実施例について説明する。
図１は、本発明の第１の実施例の構成を示す図である。
図１を参照すると、本発明の第１の実施例は、キーボー
ドや音声認識装置等の入力装置１と、プログラム制御に
より動作するデータ処理装置２と、情報を記憶する記憶
装置３と、ディスプレイ装置や音声合成装置等の出力装
置４とを含む。A first embodiment of the present invention will be described.
FIG. 1 is a diagram showing the configuration of the first exemplary embodiment of the present invention.
Referring to FIG. 1, a first embodiment of the present invention includes an input device 1 such as a keyboard and a voice recognition device, a data processing device 2 operated by program control, a storage device 3 for storing information, and a display device. And an output device 4 such as a voice synthesizer.

【００２９】記憶装置３は、辞書記憶部３１と、曖昧性
解消情報記憶部３２とを備えている。辞書記憶部３１に
は、タスクを実行するためにユーザが使う語彙、及び、
ユーザへの応答文を生成するために使う語彙があらかじ
め記憶されている。The storage device 3 includes a dictionary storage unit 31 and an ambiguity resolution information storage unit 32. The vocabulary used by the user to execute the task is stored in the dictionary storage unit 31, and
The vocabulary used to generate the response sentence to the user is stored in advance.

【００３０】曖昧性解消情報記憶部３２には、タスクが
どのような構成になっているのか、タスク先の現在の状
態、ユーザが発話した文脈等、ユーザの発話の曖昧性を
解消するために必要な情報が記憶されている。The ambiguity resolving information storage unit 32 is used for resolving the ambiguity of the user's utterance, such as the configuration of the task, the current state of the task destination, the context in which the user uttered, and the like. Necessary information is stored.

【００３１】データ処理装置２は、自然言語解析手段２
１と、曖昧性解消手段２２と、応答文生成手段２３と、
を備えている。The data processing device 2 includes a natural language analysis unit 2
1, a disambiguation unit 22, a response sentence generation unit 23,
It has.

【００３２】自然言語解析手段２１は、入力装置１で得
た入力文字列を辞書記憶部３１に記憶された語彙を用い
て解釈する。The natural language analysis means 21 interprets the input character string obtained by the input device 1 using the vocabulary stored in the dictionary storage 31.

【００３３】曖昧性解消手段２２は、自然言語解析手段
２１で複数の解釈結果が得られた時に、曖昧性解消情報
記憶部３２の情報を参照して、解釈を１つに絞る。When a plurality of interpretation results are obtained by the natural language analysis means 21, the ambiguity resolution means 22 refers to the information in the ambiguity resolution information storage unit 32 and narrows down the interpretation to one.

【００３４】また、解釈の絞り方に応じてシステムが曖
昧性を解消した結果の確信度も求める。In addition, the degree of certainty as a result of the system eliminating ambiguity is also determined according to the method of narrowing down the interpretation.

【００３５】確信度は、どの手段を用いて曖昧性が解消
されたか、また曖昧性解消情報記憶部に記憶されている
どの情報を用いて曖昧性が解消されたかを用いて、求め
られる。The certainty factor is obtained by using which means has been used to resolve the ambiguity and which information stored in the ambiguity resolution information storage unit has been used to resolve the ambiguity.

【００３６】以下、タスクとして、家庭内の機器操作を
例に挙げて、確信度の定義の一例について説明する。Hereinafter, as an example of the task, an example of the definition of the certainty factor will be described with reference to the operation of home appliances.

【００３７】（１）．自然言語解析手段２１で解釈が１
つに絞られていた時は確信度最大とする。(1). Interpretation 1 by natural language analysis means 21
When it is narrowed down to one, the maximum confidence is set.

【００３８】（２）．タスクを実行の可否によって解釈
を１つに絞った時、例えば「つけて」という入力に対し
て、・「テレビをつける」というタスクと、・「ビデオをつける」というタスクと、・「明かりをつける」というタスクと、の３つ解釈が自然言語解析手段２１で行われたときに、
例えば、曖昧性解消情報記憶部３２の情報から、テレビ
と明かりの電源はすでについており（電源オンとされて
おり）、ビデオの電源はついていない場合、「ビデオを
つける」という解釈に、曖昧性解消手段２２で解釈を１
つに絞った時には、確信度大とする。(2). When the interpretation is narrowed down to one depending on whether the task can be executed or not, for example, for the input of “turn on”, the task of “turn on the TV”, the task of turning on the video, and the “turn on the light” When the three interpretations of “attach” and “
For example, from the information in the disambiguation information storage unit 32, if the power supply of the television and the light is already on (power is turned on) and the power supply of the video is not turned on, the interpretation that "video is turned on" is ambiguous. Interpretation 1
When it is narrowed down to one, the degree of certainty is high.

【００３９】（３）．文脈によって解釈を１つに絞った
時、例えば「１チャンネルにして」という入力に対し
て、自然言語解析手段２１で、・「テレビのチャンネルを１にする」というタスクと、・「ビデオのチャンネルを１にする」というタスクと、の二つの解釈ができたときに、前回の入力が「テレビを
つけて」であったため、「テレビ」が話題になっている
ものと判断して、「テレビのチャンネルを１にする」と
曖昧性解消手段２２で１に絞った時には「確信度中」と
する。(3). When the interpretation is narrowed down to one according to the context, for example, in response to an input of “to one channel”, the natural language analysis means 21 performs a task of “to set the television channel to 1”; When the two tasks, "turn the TV to 1" and "interpretation of the TV," were completed, the previous input was "Turn on the TV," and it was determined that "TV was a topic." Channel is set to 1 ", and when the ambiguity eliminating means 22 narrows down to 1, the degree of confidence is determined to be" medium ".

【００４０】（４）．ユーザの操作の頻度によって解釈
を１つに絞った時、例えば「つけて」という入力に対し
て、自然言語解析手段２１で、・「テレビをつける」というタスク、・「ビデオをつける」というタスク、・「明かりをつける」というタスクの三つに解釈できた時に、今までの過去の履歴を見る
と、「明かりをつける」ことが多いので、「明かりをつ
ける」という解釈に、曖昧性解消手段２２で１つに絞っ
た時には、「確信度小」とする。(4). When the number of interpretations is reduced to one according to the frequency of the user's operation, for example, in response to an input of “turn on”, the natural language analysis unit 21 performs a task of “turning on a television”;・ When the task of "turn on the light" can be interpreted into three, when looking at the past history up to now, there are many cases where "turn on the light", so the ambiguity in the interpretation of "turn on the light" is resolved. When the number is reduced to one by the means 22, “small confidence” is set.

【００４１】（５）．曖昧性を解消する手がかりが何も
ないため、自然言語解析手段２１で見つけた順番を用い
て曖昧性解消手段２２で解釈を１つに絞った時には、
「確信度最小」とする。(5). Since there is no clue to disambiguate, when the interpretation is narrowed down to one by the disambiguation unit 22 using the order found by the natural language analysis unit 21,
"Minimum confidence".

【００４２】応答文生成手段２３は、曖昧性解消手段２
２で１つに絞った解釈と、その確信度、さらに辞書記憶
部３を使って、ユーザにできるだけ負担にならないよう
な応答文を生成する。The response sentence generation means 23 is provided with the ambiguity resolution means 2
By using the interpretation limited to one in step 2, its certainty factor, and the dictionary storage unit 3, a response sentence that does not burden the user as much as possible is generated.

【００４３】例えば、確信度が高い時には、タスクを実
行してしまい、実行した結果を伝える応答文を生成す
る。For example, when the degree of certainty is high, the task is executed, and a response sentence conveying the execution result is generated.

【００４４】一方、確信度が低い時には、タスクを実行
せずに、実行していいかどうかの応答文を生成する。On the other hand, when the degree of certainty is low, a response sentence as to whether or not to execute the task is generated without executing the task.

【００４５】図２は、本発明の一実施例の処理手順を示
す流れ図である。図１及び図２を参照して、本発明の一
実施例の動作について詳細に説明する。FIG. 2 is a flowchart showing a processing procedure according to one embodiment of the present invention. The operation of the embodiment of the present invention will be described in detail with reference to FIGS.

【００４６】入力装置１で得た入力文字列は、自然言語
解析手段２１に供給される（図２のステップ１０１）。The input character string obtained by the input device 1 is supplied to the natural language analyzing means 21 (step 101 in FIG. 2).

【００４７】自然言語解析手段２１は、辞書記憶部３１
に記憶されている語彙を参照して、この入力文字列を解
釈する（図２のステップ１０２）。The natural language analyzing means 21 includes a dictionary storage 31
The input character string is interpreted with reference to the vocabulary stored in (2) (step 102 in FIG. 2).

【００４８】曖昧性解消手段２２は、曖昧性解消情報記
憶部３２に記憶されている情報を参照して、解釈が複数
存在している時に解釈を１つに絞り込む（図２のステッ
プ１０３）。The disambiguation means 22 refers to the information stored in the disambiguation information storage unit 32 and narrows down the interpretation to one when there are a plurality of interpretations (step 103 in FIG. 2).

【００４９】また、どのように解釈を１つに絞ったかを
用いてシステムが曖昧性を解消した結果の確信度を求め
る（図２のステップ１０４）。Further, using the method of narrowing down the interpretation to one, a certainty factor of the result of the system eliminating the ambiguity is obtained (step 104 in FIG. 2).

【００５０】応答文生成手段２３は、辞書記憶部３１に
記憶されている語彙を参照して、確信度の大きさに応じ
て異なった応答文を生成し、出力装置４に出力する（図
２のステップ１０５）。The response sentence generating means 23 refers to the vocabulary stored in the dictionary storage unit 31, generates a different response sentence according to the degree of certainty, and outputs it to the output device 4 (FIG. 2). Step 105).

【００５１】本発明の第１の実施例について具体例に即
して以下に説明する。The first embodiment of the present invention will be described below with reference to a specific example.

【００５２】入力装置１として、マイクを備え、該マイ
クから入力された信号をディジタル信号に変換して音声
認識を行ない認識結果をテキストコードとして出力する
音声認識システムからなるものとする。データ処理装置
２はパーソナル・コンピュータよりなり、磁気ディスク
記憶装置等よりなる記憶装置３を備える。また出力装置
４は、スピーカを備えた音声合成システムよりなり、応
答文生成手段２３から出力されたテキスト情報を音声合
成して音声出力する。It is assumed that the input device 1 comprises a voice recognition system that includes a microphone, converts a signal input from the microphone into a digital signal, performs voice recognition, and outputs a recognition result as a text code. The data processing device 2 is composed of a personal computer and has a storage device 3 such as a magnetic disk storage device. The output device 4 is composed of a voice synthesizing system having a speaker.

【００５３】データ処理装置２（パーソナル・コンピュ
ータ）は、自然言語解析手段２１、曖昧性解消手段２
２、応答文生成手段２３を備え、これらの各手段は、Ｃ
ＰＵ（中央演算装置）で実行されるプログラムによりそ
の処理・機能が実現される。すなわち、該プログラムを
記録した記録媒体、もしくは通信媒体から該プログラム
を読み出してデータ処理装置２で実行することにより、
本発明を実施することができる。記憶装置３には、辞書
３１及び曖昧性解消情報記憶部３２を備える。The data processing device 2 (personal computer) includes a natural language analyzing means 21 and an ambiguity eliminating means 2
2. A response sentence generating means 23 is provided.
The processing and functions are realized by a program executed by a PU (central processing unit). That is, by reading the program from a recording medium or a communication medium on which the program is recorded and executing the program in the data processing device 2,
The present invention can be implemented. The storage device 3 includes a dictionary 31 and an ambiguity resolution information storage unit 32.

【００５４】以下は本発明を適用したシステムとして、
テレビとビデオの電源等の制御を行なう構成を例に説明
する。なお、記憶装置１の曖昧性解消情報記憶部３２に
は、テレビ及びビデオの各機器の電源のオン・オフ状態
が記憶管理されるものとする。The following is a system to which the present invention is applied.
A configuration for controlling a television and a video power supply will be described as an example. In the ambiguity resolution information storage unit 32 of the storage device 1, the on / off state of the power of each device such as a television and a video is stored and managed.

【００５５】ユーザが、テレビとビデオがある家の中
で、テレビを操作するために、「電源つけて」と、入力
装置１のマイクに向かって話したとする。Assume that the user speaks to the microphone of the input device 1 "turn on the power" to operate the television in a house where the television and the video are present.

【００５６】入力装置１をなす音声認識システムは、
「電源つけて」と音声認識して、テキスト文字に変換す
る。The speech recognition system constituting the input device 1 is as follows.
Recognize "turn on power" and convert to text characters.

【００５７】このテキスト文字と、辞書記憶部３１を参
照して自然言語解析手段３１は、「テレビの電源をつけ
る」と「ビデオの電源をつける」という複数の解釈結果
を出力する。Referring to the text characters and the dictionary storage unit 31, the natural language analysis means 31 outputs a plurality of interpretation results of "turn on the television" and "turn on the video".

【００５８】曖昧性解消手段３２は、曖昧性解消情報と
して、テレビ及びビデオの各機器の現在状態を見ると、
「テレビの電源はオフ」、「ビデオの電源はオン」であ
ることから、「テレビの電源をつける」という解釈（タ
スク）に絞り込む。The disambiguation means 32 looks at the current state of each TV and video device as disambiguation information.
Since "TV power is off" and "video power is on", the interpretation (task) of "turn on the TV" is narrowed down.

【００５９】また機器の現在状態から、曖昧性を解消す
ることができたことから、確信度での最大〜最小の５段
階評価のうち「確信度大」であるとする。Since the ambiguity can be resolved from the current state of the device, the degree of certainty is determined to be “high certainty” among the five levels of maximum to minimum levels of certainty.

【００６０】応答文生成手段２３では、確信度大と「テ
レビの電源をつける」という解釈結果と辞書記憶部３１
を参照して、「テレビの電源をつけます」という応答文
を生成し、出力装置４をなす音声合成システムでスピー
カに出力する。The response sentence generating means 23 outputs the high certainty factor, the interpretation result of "turn on the television", and the dictionary storage unit 31.
, A response sentence “Turn on the television” is generated, and is output to the speaker by the voice synthesis system constituting the output device 4.

【００６１】次に本発明の第２の実施例について図面を
参照して詳細に説明する。Next, a second embodiment of the present invention will be described in detail with reference to the drawings.

【００６２】図３は、本発明の第２の実施例の構成を示
す図である。図３において、図１と同等の要素には同一
の参照符号が付されている。図３を参照すると、本発明
の第２の実施例は、図１を参照して説明した前記第１の
実施例に対して、タスク実行手段２４がデータ処理装置
２に追加されており、未実行タスクコマンド記憶部３３
が記憶装置３にさらに付加されて構成されている。FIG. 3 is a diagram showing the configuration of the second embodiment of the present invention. In FIG. 3, the same elements as those in FIG. 1 are denoted by the same reference numerals. Referring to FIG. 3, the second embodiment of the present invention differs from the first embodiment described with reference to FIG. Execution task command storage unit 33
Is further added to the storage device 3.

【００６３】タスク実行手段２４は、曖昧性解消手段２
２で生成された解釈を、応答文生成手段２３で実行すべ
きタスクであるか否かを判断し、実行すべきタスクであ
ると判断された時に、タスクを実行する。The task executing means 24 includes the ambiguity eliminating means 2
The response sentence generating means 23 determines whether the interpretation generated in step 2 is a task to be executed, and executes the task when it is determined that the task is to be executed.

【００６４】未実行タスクコマンド記憶部３３は、応答
文生成手段２３で応答文を生成した中で、確信度が低
く、応答文が質問型「〜しますか」などになったとき
に、そのタスクコマンドを記憶している。When the response sentence is generated by the response sentence generating means 23, the unexecuted task command storage unit 33, when the confidence is low and the response sentence becomes a question type " Remembers task commands.

【００６５】図４は、本発明の第２の実施例の動作を説
明するための流れ図である。図３及び図４を参照して、
本発明の第２の実施例の動作について説明する。入力装
置１で得た入力文字列は自然言語解析手段２１に供給さ
れ（図４のステップ２０１）、自然言語解析手段２１
は、辞書記憶部３１に記憶されている語彙を参照して、
この入力文字列を解釈し、解釈可能な複数のタスクコマ
ンドに変換し（図４のステップ２０２）、曖昧性解消手
段２２は、曖昧性解消情報記憶部３２に記憶されている
情報を参照して、タスクコマンドが複数存在している時
に１つに絞り込み（図４のステップ２０３）、また、ど
のようにタスクコマンドを１つに絞ったかを用いてシス
テムが曖昧性を解消した結果の確信度を求める（図４の
ステップ２０４）。FIG. 4 is a flowchart for explaining the operation of the second embodiment of the present invention. Referring to FIG. 3 and FIG.
The operation of the second embodiment of the present invention will be described. The input character string obtained by the input device 1 is supplied to the natural language analyzing means 21 (step 201 in FIG. 4), and the natural language analyzing means 21
Refers to the vocabulary stored in the dictionary storage unit 31,
This input character string is interpreted and converted into a plurality of interpretable task commands (step 202 in FIG. 4), and the disambiguation means 22 refers to the information stored in the disambiguation information storage unit 32. When there are a plurality of task commands, the command is narrowed down to one (step 203 in FIG. 4), and the degree of certainty as a result of the system resolving the ambiguity by using how the task command is narrowed down to one is used. (Step 204 in FIG. 4).

【００６６】前記第１の実施例では、確信度の大きさに
かかわらず、応答文生成手段２３で応答文を生成し、出
力装置４で出力していた。In the first embodiment, the response sentence is generated by the response sentence generating means 23 and output by the output device 4 irrespective of the degree of certainty.

【００６７】本発明の第２の実施例では、曖昧性解析手
段２２で求められた確信度に応じて、システムがタスク
コマンドを実行してよいかどうか判断する（図４のステ
ップ２０５）。In the second embodiment of the present invention, it is determined whether the system can execute the task command according to the certainty factor obtained by the ambiguity analysis means 22 (step 205 in FIG. 4).

【００６８】ステップ２０５で、確信度が低く、タスク
コマンドを実行してはいけないと判断したときには、未
実行タスクコマンド記憶部３３に、タスクコマンドが記
憶される（図４のステップ２０８）。When it is determined in step 205 that the confidence is low and the task command should not be executed, the task command is stored in the unexecuted task command storage unit 33 (step 208 in FIG. 4).

【００６９】そして、タスクコマンドを実行してよいか
どうかを表す「〜していいですか？」というような応答
文を応答文生成手段２３で生成する（図４のステップ２
０９）。Then, a response sentence such as "Is it OK?" Indicating whether the task command may be executed is generated by the response sentence generation means 23 (step 2 in FIG. 4).
09).

【００７０】該応答文に対する、ユーザからの返答入力
を入力装置１を介してを得ると（ステップ２０１）、自
然言語解析手段２１は、入力の自然言語と未実行タスク
コマンド記憶部３３で記憶されている未実行タスクコマ
ンドを参照して、解釈可能な複数のタスクコマンドに変
換する（図４のステップ２０２）。When a response input from the user to the response sentence is obtained via the input device 1 (step 201), the natural language analysis unit 21 stores the input natural language and the unexecuted task command storage unit 33. The unexecuted task command is referred to and converted into a plurality of interpretable task commands (step 202 in FIG. 4).

【００７１】図４のステップ２０３では、複数のタスク
コマンドの絞り込みを行ない、ステップ２０４では、絞
り込まれたタスクコマンドの確信度を求め、確信度によ
りタスクコマンドを実行してよいかどうかを判断する。In step 203 of FIG. 4, a plurality of task commands are narrowed down. In step 204, the certainty of the narrowed task command is obtained, and it is determined whether the task command can be executed based on the certainty.

【００７２】「〜していいですか？」のような応答に対
して、図４のステップ２０４で求めた確信度が高いとき
には、入力が「はい」のような肯定を意味する入力か、
「いいえ」のような否定を意味する入力かどうかを判断
する（図４のステップ２０６）。In response to a response such as “Do you want to do?”, If the certainty factor obtained in step 204 in FIG. 4 is high, is the input an affirmative input such as “Yes”,
It is determined whether the input means a negative such as "No" (step 206 in FIG. 4).

【００７３】ユーザから入力が、肯定を意味するもので
ある時には、応答文生成手段２３はタスク実行手段２４
にタスクコマンドを送り、タスク実行手段２４でタスク
を実行する。When the input from the user means affirmative, the response sentence generating means 23
And the task execution means 24 executes the task.

【００７４】また応答文生成手段２３は、タスクの実行
完了を表す「〜しました。」のような応答文を生成する
（図４のステップ２０７）。The response sentence generating means 23 generates a response sentence such as "I've done it" indicating the completion of execution of the task (step 207 in FIG. 4).

【００７５】また、否定を意味する入力の時には、タス
クコマンドを破棄し、未実行タスクの取り消しを表す
「〜を取り消します」のような応答文を生成する（図４
のステップ２１０）。When the input is negative, the task command is discarded, and a response statement such as "cancel ~" indicating cancellation of the unexecuted task is generated (FIG. 4).
Step 210).

【００７６】本発明の第２の実施例について具体例に即
して説明する。以下の例でも、前記第１の実施例と同様
にテレビとビデオがある家の中での状況を想定して説明
する。A second embodiment of the present invention will be described with reference to a specific example. Also in the following example, description will be made assuming a situation in a house where a television and a video are present, as in the first embodiment.

【００７７】今、ユーザがテレビの電源をつけるため
に、「テレビ」とマイクに向かって話したとする。Now, suppose that the user has spoken to the microphone "TV" in order to turn on the television.

【００７８】入力装置１をなす音声認識システムは、
「テレビ」と音声認識し、テキスト文字に変換する。The voice recognition system constituting the input device 1
Recognizes "TV" and converts it to text characters.

【００７９】このテキスト文字と辞書記憶部３１とを参
照して、自然言語解析手段２１は、・「テレビの電源をつける」、・「テレビのチャンネルを１にする」、・「テレビのチャンネルを２にする」、… といった複数のタスクコマンドに変換する。With reference to the text characters and the dictionary storage unit 31, the natural language analysis means 21 can be used to: "turn on the television";"set the television channel to 1"; Is converted into a plurality of task commands.

【００８０】曖昧性解消手段２２は、曖昧性解消情報記
憶部３２からテレビの現在状況を見て、入力された「テ
レビ」に対するタスクとしては、「テレビの電源はオ
フ」から、「テレビの電源をつける」くらいの操作で大
丈夫だろうと判断し、絞り込む。また確信度は前記確信
度での最大〜最小の５段階評価で「確信度最小」と求め
たとする。The disambiguation means 22 looks at the current status of the television from the disambiguation information storage unit 32, and as a task for the input “TV”, from “TV power off” to “TV power supply”. It is determined that the operation of “apply” is okay and narrow down. It is also assumed that the certainty is determined to be “minimum certainty” by five-level evaluation from the maximum to the minimum of the certainty.

【００８１】応答文生成手段２３は、確信度最小と「テ
レビの電源をつける」というタスクコマンドと辞書記憶
部３１を参照して、「テレビをつけますか？」という応
答文を生成し、出力装置４をなす音声合成システムのス
ピーカーに出力する。The response sentence generating means 23 generates a response sentence "Do you turn on the TV?" With reference to the minimum certainty factor and the task command "Turn on the TV" and the dictionary storage unit 31, and outputs the response sentence. Output to the speaker of the speech synthesis system constituting the device 4.

【００８２】また「テレビの電源をつける」というタス
クコマンドを未実行タスクコマンド記憶部３３に記憶す
る。The task command “turn on the television” is stored in the unexecuted task command storage unit 33.

【００８３】このシステムからの応答後に、ユーザが
「はい」と入力すると、自然言語解析手段３１は、辞書
記憶部３１を参照して、この発話が肯定を意味する語だ
と認識し、また未実行タスクコマンド記憶部３３に、
「テレビの電源をつける」というコマンドが記憶されて
いるため、「テレビの電源をつける」というタスクコマ
ンドを生成する。When the user inputs "Yes" after the response from the system, the natural language analysis means 31 refers to the dictionary storage section 31 and recognizes that the utterance is a word meaning affirmative. In the execution task command storage unit 33,
Since the command of "turn on the television" is stored, a task command of "turn on the television" is generated.

【００８４】曖昧性解消手段２２では、すでにタスクコ
マンドが一意に決まっているため、確信度を最大〜最小
の５段階評価で、「確信度最大」とする。In the ambiguity eliminating means 22, since the task command has already been uniquely determined, the certainty is determined to be “maximum certainty” by a maximum to minimum five-level evaluation.

【００８５】応答文生成手段２３では、確信度最大と
「テレビの電源をつける」というタスクコマンドと辞書
記憶部３１を参照して、「テレビの電源をつけました」
という応答文を生成し、音声合成システムでスピーカに
出力する。The response sentence generating means 23 refers to the maximum certainty factor, the task command “Turn on the TV”, and the dictionary storage unit 31 and “turns on the TV”.
Is generated and output to the speaker by the speech synthesis system.

【００８６】次に本発明の第３の実施例について図面を
参照して詳細に説明する。Next, a third embodiment of the present invention will be described in detail with reference to the drawings.

【００８７】図５は、本発明の第３の実施例の構成を示
す図である。図５において、図３に示した要素と同等の
要素には同一の参照符号が付されている。FIG. 5 is a diagram showing the configuration of the third embodiment of the present invention. In FIG. 5, the same elements as those shown in FIG. 3 are denoted by the same reference numerals.

【００８８】図５を参照すると、本発明の第３の実施例
は、データ処理装置２は、図３に示した前記第２の実施
例と同様に、自然言語解析手段２１、曖昧製解消手段２
２、応答文生成手段２３、タスク実行手段２４を備えて
いる。記憶装置３は、辞書記憶部３１、曖昧性解消情報
記憶部３２、キャンセルコマンド記憶部３４を備えてい
る。Referring to FIG. 5, according to a third embodiment of the present invention, the data processing apparatus 2 includes a natural language analyzing unit 21 and an ambiguity elimination unit, similarly to the second embodiment shown in FIG. 2
2, a response sentence generating means 23 and a task executing means 24 are provided. The storage device 3 includes a dictionary storage unit 31, an ambiguity resolution information storage unit 32, and a cancel command storage unit 34.

【００８９】キャンセルコマンド記憶部３４には、ユー
ザの意図に反したタスクを実行してしまったときに、元
に復旧するためのタスクコマンドが予め記憶されてい
る。The cancel command storage unit 34 previously stores a task command for restoring the original task when a task contrary to the user's intention has been executed.

【００９０】図６は、本発明の第３の実施例の動作を説
明するための流れ図である。図５及び図６を参照して、
本発明の第３の実施例の動作について詳細に説明する。
なお、図６のステップ３０１〜３０４は、図２に示した
ステップ２０１〜２０４と同様とされる。FIG. 6 is a flow chart for explaining the operation of the third embodiment of the present invention. Referring to FIGS. 5 and 6,
The operation of the third embodiment of the present invention will be described in detail.
Steps 301 to 304 in FIG. 6 are the same as steps 201 to 204 shown in FIG.

【００９１】前記第１の実施例では、曖昧性解消手段２
２で確信度が高く求められてしまうと、ユーザの意図に
反したものでも、応答文生成部２３で実行完了の応答文
を生成して出力装置４に出力してしまう。In the first embodiment, the disambiguation means 2
If a high degree of certainty is required in 2, the response sentence generation unit 23 generates a response sentence of execution completion and outputs it to the output device 4 even if the response sentence is contrary to the user's intention.

【００９２】本発明の第３の実施例においては、システ
ムがタスク実行完了の応答文を応答文生成手段２３で生
成し、出力装置４に出力した時に、キャンセルや取り消
しを意味する入力を受け付けると、図６のステップ３０
１−３０４の一連の処理において、図４のステップ２０
１−２０４の前記第２の実施例の処理と同様に、タスク
コマンドを１つに絞り込む。その際、キャンセルや取り
消しを意味する入力を受け付けた時には、キャンセルコ
マンドが絞り込まれる。In the third embodiment of the present invention, when the system generates a response sentence of task execution completion by the response sentence generating means 23 and outputs the response sentence to the output device 4, the system accepts an input meaning cancellation or cancellation. Step 30 in FIG.
In a series of processing of 1-304, step 20 in FIG.
Similarly to the processing of the second embodiment 1-204, the task commands are narrowed down to one. At this time, when an input indicating cancellation or cancellation is received, the cancellation command is narrowed down.

【００９３】絞り込まれたタスクコマンドがキャンセル
コマンドであるか否か判定する（図６のステップ３０
５）。It is determined whether the narrowed task command is a cancel command (step 30 in FIG. 6).
5).

【００９４】キャンセルコマンドでなければ、前記第１
の実施例と同様に、確信度に応じて応答文生成部２３で
応答文を生成し、出力装置４に出力する（図６のステッ
プ３０６）。If it is not a cancel command, the first
Similarly to the embodiment, the response sentence generation unit 23 generates a response sentence according to the certainty factor and outputs the response sentence to the output device 4 (step 306 in FIG. 6).

【００９５】キャンセルコマンドである場合、キャンセ
ルコマンド記憶部３４を参照して応答文生成部２３で応
答文を生成し、出力装置４に出力する（図６のステップ
３０７）。If the command is a cancel command, the response sentence generation unit 23 generates a response sentence with reference to the cancel command storage unit 34 and outputs it to the output device 4 (step 307 in FIG. 6).

【００９６】例えば、システムが「テレビをつけまし
た」という応答文を返した時に、ユーザが、キャンセル
や取り消しを意味する入力として、「違う」と答えた時
に、システムは、「テレビを消しました」という応答文
を生成する。For example, when the system returns a response sentence "Turn on TV" and the user answers "No" as an input to indicate cancellation or cancellation, the system returns "Turn off TV". Is generated.

【００９７】本発明の第３の実施例について具体例に即
して説明する。以下でも、前記第１の実施例と同様にテ
レビとビデオがある家の中での状況を想定する。The third embodiment of the present invention will be described with reference to a specific example. In the following, as in the first embodiment, a situation in a house where a television and a video are present is assumed.

【００９８】今、ユーザがテレビの電源をつけるため
に、「テレビをつけて」とマイクに向かって話したとす
る。Now, it is assumed that the user speaks to the microphone "turn on the television" in order to turn on the television.

【００９９】入力装置１をなす音声認識システムは、あ
やまって「ビデオをつけて」と誤認識し、テキスト文字
に変換する。このテキスト文字と辞書記憶部３１を参照
して自然言語解析手段２１は、「ビデオの電源をつけ
る」というタスクコマンドに変換する。The voice recognition system constituting the input device 1 erroneously recognizes "attach video" and converts it to text characters. With reference to the text characters and the dictionary storage unit 31, the natural language analysis unit 21 converts it into a task command of "turn on video power".

【０１００】曖昧性解消手段２２では、すでにタスクコ
マンドが一意に決まっているので、「確信度最大」とす
る。In the ambiguity eliminating means 22, since the task command has already been uniquely determined, it is set to "maximum confidence".

【０１０１】応答文生成手段２３は、「確信度最大」
と、「ビデオの電源をつける」というタスクコマンドと
辞書記憶部３１を参照して、「ビデオの電源をつけまし
た」という応答文を生成し、出力装置４をなす音声合成
システムでスピーカに出力する。The response sentence generating means 23 outputs "maximum confidence".
With reference to the task command “Turn on video power” and the dictionary storage unit 31, a response sentence “Turn on video power” is generated and output to the speaker by the voice synthesis system that forms the output device 4. I do.

【０１０２】また、キャンセルコマンド記憶部３４に、
「ビデオの電源をつける」という操作の逆操作である
「ビデオの電源を消す」というコマンドが記憶される。The cancel command storage unit 34 stores
A command “turn off video power” which is the reverse operation of “turn on video power” is stored.

【０１０３】システムはユーザの意図とは異なる応答を
したために、マイクからユーザは「違う」と入力した場
合、自然言語解析手段３１は辞書記憶部３１を参照して
この発話がキャンセルを意味する語だと認識し、またキ
ャンセルコマンド記憶部３４に「ビデオの電源を消す」
というタスクコマンドが記憶されていることを認識し
て、「ビデオの電源を消す」というタスクコマンドを生
成する。Since the system responded differently from the user's intention, if the user inputs “no” from the microphone, the natural language analysis means 31 refers to the dictionary storage unit 31 and the utterance indicates a word indicating cancellation. "Cancel the video power" in the cancel command storage unit 34.
It recognizes that the task command is stored, and generates a task command of “turn off video power”.

【０１０４】曖昧性解消手段２２では、すでにタスクコ
マンドが一意に決まっているので、確信度最大とする。In the ambiguity eliminating means 22, since the task command has already been uniquely determined, the certainty factor is set to the maximum.

【０１０５】応答文生成手段２３では、「確信度最大」
と「ビデオの電源を消す」というタスクコマンドと辞書
記憶部３１を参照して、「ビデオの電源を消しました」
という応答文を生成し、該応答文を出力装置４をなす音
声合成システムよりスピーカに出力する。In the response sentence generating means 23, "maximum confidence"
And the task command “Turn off video” and the dictionary storage unit 31 to refer to “Turn off video”
Is generated, and the response sentence is output to the speaker from the speech synthesis system constituting the output device 4.

【０１０６】次に本発明の第４の実施例について説明す
る。図７は、本発明の第４の実施例の構成を示す図であ
る。図７を参照すると、本発明の第４の実施例は、図３
に示した前記第２の実施例に、時間管理部３５を記憶装
置３に備えたものである。Next, a fourth embodiment of the present invention will be described. FIG. 7 is a diagram showing the configuration of the fourth embodiment of the present invention. Referring to FIG. 7, a fourth embodiment of the present invention is shown in FIG.
The time management unit 35 is provided in the storage device 3 in the second embodiment shown in FIG.

【０１０７】図８は、本発明の第４の実施例の動作を説
明するための流れ図である。図７及び図８を参照して、
本発明の第４の実施例の動作について説明する。FIG. 8 is a flowchart for explaining the operation of the fourth embodiment of the present invention. Referring to FIGS. 7 and 8,
The operation of the fourth embodiment of the present invention will be described.

【０１０８】図８のステップ４０２−４０５、ステップ
４０８−４０９は、その流れ図を図４に示した前記第２
の実施例における２０２−２０５、ステップ２０８−２
０９と同一とされ、また、図８のステップ４０６、４０
７、４１０は、前記第２の実施例におけるステップ２０
６、２０７、２１０と同一の処理とされている。前記第
２の実施例と同様な手順で、図８のステップ４０９まで
処理が終わったとする。前記第２の実施例では、図４に
示すように、ステップ２０９で応答文を生成して、入力
装置１でユーザからの返答入力を待ち、返答が入力され
たときにステップ２０１からの処理を行っている。Steps 402-405 and 408-409 of FIG. 8 correspond to the second step shown in FIG.
202-205, step 208-2 in the embodiment of
09, and steps 406 and 40 in FIG.
Steps 7 and 410 correspond to step 20 in the second embodiment.
6, 207, and 210. It is assumed that the processing has been completed up to step 409 in FIG. 8 in the same procedure as in the second embodiment. In the second embodiment, as shown in FIG. 4, a response sentence is generated in step 209, and the input device 1 waits for a response input from the user. When a response is input, the process from step 201 is performed. Is going.

【０１０９】本発明の第４の実施例では、システムがユ
ーザの入力を待つような応答をしたときに、時間管理部
３５に現在時刻を設定し、入力装置１では、不図示のタ
イマ（計時手段）により予め定められた一定時間だけ待
つことにする（図８のステップ４０１）。In the fourth embodiment of the present invention, when the system responds to wait for user input, the current time is set in the time management unit 35, and the input device 1 uses a timer (not shown). Means) and waits for a predetermined time (step 401 in FIG. 8).

【０１１０】一定時間内に、すなわち、タイマでのタイ
ムアウト発生前に、応答文に対するユーザからの入力が
あれば、以下の処理を前記第２の実施例と同様に行う
（図８のステップ４０２以降）。If there is an input from the user to the response sentence within a predetermined time, that is, before the time-out of the timer occurs, the following processing is performed in the same manner as in the second embodiment (from step 402 in FIG. 8). ).

【０１１１】一定時間の間にユーザからの入力がなけれ
ば、すんわち、タイマでのタイムアウトが発生した際に
は、ユーザは、応答の内を了解したものとして、入力装
置１にユーザが肯定を意味する入力を行なったものとみ
なす。If there is no input from the user for a certain period of time, that is, when a timeout occurs in the timer, the user acknowledges that the response has been acknowledged by the user on the input device 1. Is assumed to have been entered.

【０１１２】ステップ４０９で生成された「〜していい
ですか？」のような応答に対して、図８のステップ４０
４で求めた確信度が高いときには、入力が「はい」のよ
うな肯定を意味する入力か、「いいえ」のような否定を
意味する入力かどうかを判断する（図８のステップ４０
６）。In response to the response such as “Do you want to do?” Generated in step 409, step 40 in FIG.
When the certainty factor obtained in step 4 is high, it is determined whether the input is an input meaning affirmation such as "yes" or an input meaning negation such as "no" (step 40 in FIG. 8).
6).

【０１１３】ユーザから入力が、肯定を意味するもので
ある時には、応答文生成手段２３はタスク実行手段２４
にタスクコマンドを送り、タスク実行手段２４でタスク
を実行する。When the input from the user means affirmative, the response sentence generating means 23 is
And the task execution means 24 executes the task.

【０１１４】また応答文生成手段２３は、タスクの実行
完了を表す「〜しました。」のような応答文を生成する
（図８のステップ４０７）。Further, the response sentence generating means 23 generates a response sentence such as "I've done it" indicating the completion of the execution of the task (step 407 in FIG. 8).

【０１１５】また、否定を意味する入力の時には、タス
クコマンドを破棄し、未実行タスクの取り消しを表す
「〜を取り消します」のような応答文を生成する（図８
のステップ４１０）。If the input is negative, the task command is discarded, and a response statement such as "cancel ~" indicating cancellation of the unexecuted task is generated (FIG. 8).
Step 410).

【０１１６】本発明の第４の実施例について具体例に即
して以下に説明する。以下でも、前記第２の実施例と同
様にテレビとビデオがある家の中での状況を想定して説
明する。The fourth embodiment of the present invention will be described below with reference to a specific example. In the following, description will be made assuming a situation in a house where a television and a video are provided, as in the second embodiment.

【０１１７】今、ユーザがテレビの電源をつけた後に、
テレビのチャンネルを１にするために「１チャンネルに
して」とマイクに向かって話したものとする。音声認識
システムは「１チャンネルにして」と認識し、テキスト
文字に変換する。Now, after the user turns on the television,
In order to set the channel of the television to 1, it is assumed that the user has spoken to the microphone, "change to one channel". The voice recognition system recognizes "on one channel" and converts it to text characters.

【０１１８】このテキスト文字と辞書を参照して自然言
語解析手段３１は、「テレビのチャンネルを１にす
る」、「ビデオのチャンネルを１にする」という複数の
タスクコマンドに変換する。With reference to the text characters and the dictionary, the natural language analysis means 31 converts the command into a plurality of task commands such as "set the television channel to 1" and "set the video channel to 1".

【０１１９】曖昧性解消手段２２では、文脈を用いて
「テレビの電源をつける」を実行した後であることか
ら、「テレビのチャンネルを１にする」であるものと判
断する。また、文脈を用いて曖昧性を解消したので、確
信度を中とする。The disambiguation means 22 determines that the "channel of the television is set to 1" since "turn on the television" has been executed using the context. In addition, since the ambiguity has been eliminated using the context, the certainty factor is set to medium.

【０１２０】応答文生成手段２３では確信度中と「テレ
ビのチャンネルを１にする」というタスクコマンドと辞
書記憶部３１を参照して、「テレビのチャンネルを１に
しますね」という応答文を生成し、出力装置４をなす音
声合成システムでスピーカに出力する。The response sentence generation means 23 generates a response sentence "Set the television channel to 1" by referring to the certainty factor, the task command "Set the television channel to 1", and the dictionary storage unit 31. Then, the voice is output to the speaker by the voice synthesis system constituting the output device 4.

【０１２１】また、未実行タスクコマンド記憶部３３に
「テレビのチャンネルを１にする」というタスクコマン
ドを記憶する。Further, a task command “set the channel of the television to 1” is stored in the unexecuted task command storage unit 33.

【０１２２】またスピーカで応答文を出力するのと同時
に、時間管理部３５に、現在時刻を記憶させる。At the same time that the response sentence is output from the speaker, the current time is stored in the time management unit 35.

【０１２３】出力装置４をなすスピーカからの応答文を
出力した後、ユーザから一定時間入力がないときには、
入力装置１から肯定を表す語が入力されたものとして、
前記第２の実施例と同様な処理を行い、応答文生成手段
２３において、「テレビのチャンネル１を１にしまし
た」という応答文を生成し、該応答文を、出力装置４を
なす音声合成システムよりスピーカに出力する。After outputting the response sentence from the speaker constituting the output device 4, if there is no input from the user for a certain period of time,
Assuming that a word indicating affirmation has been input from the input device 1,
By performing the same processing as in the second embodiment, the response sentence generation means 23 generates a response sentence "TV channel 1 has been set to 1", and converts the response sentence into a speech synthesis which forms the output device 4. Output from the system to the speaker.

【０１２４】[0124]

【発明の効果】以上説明したように、本発明によれば下
記記載の効果を奏する。As described above, according to the present invention, the following effects can be obtained.

【０１２５】本発明の第１の効果は、にユーザはその応
答に必ずしも返答する必要がなくなるので、ユーザにか
かる負荷を削減する、ということである。The first effect of the present invention is that the user does not always need to reply to the response, so that the load on the user is reduced.

【０１２６】その理由は次の通りである。本発明におい
ては、システムが求めた確信度を用いて応答文を変化さ
せることで、システムは必ずしも確認を求める応答文を
出力するわけではない。このため、ユーザはシステムか
らの応答に必ずしも返答する必要がないためである。The reason is as follows. In the present invention, by changing the response sentence using the certainty factor obtained by the system, the system does not necessarily output the response sentence requesting confirmation. For this reason, the user need not always reply to the response from the system.

【０１２７】本発明の第２の効果は、システムが操作を
行う前に、ユーザとシステムとの対話により曖昧性を解
消することを可能とし、システムがユーザの本来の意図
とは異なった解釈をする可能性を低減し、その結果、ユ
ーザは修正取り消し作業を行う可能性を特段に低減して
いる、ということである。A second effect of the present invention is that it is possible to eliminate ambiguity by interaction between the user and the system before the system performs an operation, and the system interprets the interpretation different from the original intention of the user. That the user has a particularly reduced chance of performing the undoing operation.

【０１２８】その理由は、本発明においては、システム
が求めた確信度が低いときには、システムは、システム
で解釈したタスクを実行してよいかどうかをユーザに質
問する構成とされているためである。The reason is that, in the present invention, when the certainty degree obtained by the system is low, the system is configured to ask the user whether the task interpreted by the system may be executed. .

【０１２９】本発明の第３の効果は、システムがユーザ
の意図と異なった解釈をしてしまう場合でも、修正取り
消し可能とし、ユーザにかかる負荷を削減する、という
ことである。A third effect of the present invention is that, even when the system interprets the data differently from the user's intention, the correction can be canceled and the load on the user can be reduced.

【０１３０】本発明の第４の効果は、システムがユーザ
に質問をしたときに、ユーザから一定時間応答がないと
システムに肯定を表す語が入力されたときと同様な処理
がおこなわれ、このため、ユーザはシステムの確認を表
す応答に必ずしも答える必要がないので、ユーザの負荷
を削減する、ということである。A fourth effect of the present invention is that, when the system asks a question to the user, the same processing as when a positive word is input to the system if the user does not respond for a certain period of time is performed. Therefore, the user does not need to answer the response indicating the confirmation of the system, so that the load on the user is reduced.

[Brief description of the drawings]

【図１】本発明の第１の実施例の構成を示す図である。FIG. 1 is a diagram showing a configuration of a first exemplary embodiment of the present invention.

【図２】本発明の第１の実施例の動作を説明するための
流れ図である。FIG. 2 is a flowchart for explaining the operation of the first exemplary embodiment of the present invention.

【図３】本発明の第２の実施例の構成を示すブロック図
である。FIG. 3 is a block diagram showing a configuration of a second exemplary embodiment of the present invention.

【図４】本発明の第２の実施例の動作を説明するための
流れ図である。FIG. 4 is a flowchart for explaining the operation of the second exemplary embodiment of the present invention.

【図５】本発明の第３の実施例の構成を示す図である。FIG. 5 is a diagram showing a configuration of a third exemplary embodiment of the present invention.

【図６】本発明の第３の実施例の動作を説明するための
流れ図である。FIG. 6 is a flowchart for explaining the operation of the third embodiment of the present invention.

【図７】本発明の第４の実施例の構成を示す図である。FIG. 7 is a diagram showing a configuration of a fourth embodiment of the present invention.

【図８】本発明の第４の実施例の動作を説明するための
流れ図である。FIG. 8 is a flowchart for explaining the operation of the fourth embodiment of the present invention.

[Explanation of symbols]

１入力装置２データ処理装置３記憶装置４出力装置２１自然言語解析手段２２曖昧性解消手段２３応答文生成手段２４タスク実行手段３１辞書記憶部３２曖昧性解消情報記憶部３３未実行タスクコマンド記憶部３４キャンセルコマンド記憶部３５時間管理部 REFERENCE SIGNS LIST 1 input device 2 data processing device 3 storage device 4 output device 21 natural language analysis means 22 ambiguity resolution means 23 response sentence generation means 24 task execution means 31 dictionary storage unit 32 ambiguity resolution information storage unit 33 unexecuted task command storage unit 34 Cancel command storage unit 35 Time management unit

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 3/00 ５３７Ｚ５６１ＣＦターム(参考） 5B091 AA15 CA12 CA14 CB12 CD03 DA09 5D015 AA05 HH23 KK02 KK04 LL04 LL05 5D045 AB01 ──────────────────────────────────────────────────続き Continued on the front page (51) Int.Cl. ⁷ Identification symbol FI Theme coat ゛ (Reference) G10L 3/00 537Z 561C F-term (Reference) 5B091 AA15 CA12 CA14 CB12 CD03 DA09 5D015 AA05 HH23 KK02 KK04 LL04 LL05 5D045 AB01

Claims

[Claims]

1. A dictionary storage unit for storing in advance a vocabulary necessary for interpreting an input natural language and a vocabulary necessary for generating a response sentence from an interpretation result of the natural language; A natural language analysis unit for interpreting a natural language input from an input unit with reference to a unit; and a clue for narrowing down to one interpretation result when there are a plurality of interpretation results obtained by the natural language analysis unit. An ambiguity resolution information storage unit that stores information used as information, and when there are a plurality of interpretation results obtained by the natural language analysis unit, refer to the ambiguity resolution information storage unit to obtain the plurality of interpretation results. And an ambiguity eliminating means for obtaining a certainty factor for the interpretation result narrowed down to one, and outputting the interpretation result narrowed down by the ambiguity eliminating means as a response sentence. A response sentence generating means for changing a response sentence according to the certainty factor obtained by the ambiguity eliminating means; and a dialogue system using a natural language.

2. The natural language according to claim 1, further comprising: a task execution unit that receives an interpretation obtained by the disambiguation unit and executes a task command corresponding to the interpretation. Dialogue system used.

3. The method according to claim 2, wherein the response sentence generating means generates a response sentence for reporting execution of the task after executing the task by the task executing means, according to the certainty factor obtained by the disambiguation means. Or determining whether to generate a response sentence for confirming whether or not to execute the task before executing the task by the task execution unit. Dialogue system using natural language.

4. The method according to claim 2, wherein said response sentence generation means determines whether or not to generate a response sentence according to the degree of certainty obtained by said disambiguation means. A dialogue system using the described natural language.

5. The method according to claim 1, wherein said response sentence generating means determines a result interpreted by said disambiguation means in accordance with a degree of certainty obtained by said disambiguation means, by reporting, confirming, or asking a question. The interactive system using a natural language according to any one of claims 2, 3, and 4, wherein the response system changes the response sentence.

6. A cancel command storage unit for storing a task command of an inverse operation to the executed task command, wherein the task execution means executes the cancel command when an input indicating the meaning of cancel is made, The interactive system according to any one of claims 2, 3, 4, and 5, wherein the response sentence generating means generates a response sentence reporting that the cancel command has been executed. .

7. A time management means for measuring a lapse of time after outputting a response sentence for confirming whether or not the system executes a task command. 7. The method according to claim 2, wherein if there is no response from the user even after a predetermined period of time has elapsed, an input indicating affirmation is interpreted as being input from the input unit. A dialogue system using the natural language according to any one of the above.

8. A certainty factor obtained by said ambiguity eliminating means as to whether or not an input indicating affirmativeness has been inputted from said input means after said predetermined time has elapsed after starting time measurement by said time management means. 8. The method according to claim 7, wherein the determination is made in accordance with:
Dialogue system using the natural language described in.

9. A natural language analysis means for receiving a natural language input from an input means and converting it into one or more task commands with reference to a vocabulary dictionary, and a plurality of task commands with reference to disambiguation information. 1
A response sentence corresponding to the certainty factor is generated by referring to the ambiguity eliminating means for obtaining the certainty factor from the narrowing method and the narrowed task command, the determined certainty factor and the vocabulary dictionary. ,
A natural language dialogue processing device, comprising: a response sentence generating unit that outputs the response sentence to an output unit.

10. A natural language analyzing means for interpreting a natural language inputted from an input means with reference to a vocabulary dictionary and outputting one or a plurality of task commands corresponding to the natural language as candidates as an interpretation result. When there are a plurality of task commands obtained by the natural language analysis means, the vagueness includes any one of the interpreted natural language context information, task history information, and status information of the processing object of the task candidate. An ambiguity eliminating means for narrowing down to one task command based on predetermined information for eliminating ambiguity and obtaining a certainty factor for the task command narrowed down to one, and a task command obtained by the ambiguity eliminating means A response sentence generating means for automatically creating a response sentence according to the certainty factor and outputting the response sentence from an output means; and a task execution for executing the task command Natural language dialog processing apparatus characterized by comprising a stage.

11. When the certainty degree of the one narrowed task command is low, the task command is not immediately executed, but is temporarily stored in an unexecuted command storage unit, and the response sentence generation unit is When a response sentence for confirming whether or not the task command may be executed is generated and output to the output unit, and when the user inputs an affirmative answer to the response sentence from the input unit, The task command is taken out from the unexecuted command storage means, the task command is executed by the task execution means, and the response sentence generation means outputs a response sentence of task execution completion. 11. The natural language dialogue processing according to claim 10, wherein when executed by a means, the response sentence generating means creates and outputs a response sentence for canceling an unexecuted task. apparatus.

12. A system according to claim 11, further comprising a cancel command storing means for storing a cancel command for canceling the operation of the task command executed by said task executing means, wherein the input representing the meaning of the cancellation of the task after the execution of the task command is inputted to said input means. When executed, the cancel command is read from the cancel command storage means, the task execution means executes the cancel command, and the response statement generation means reports a response that the cancel command has been executed. 12. The system according to claim 10, wherein a sentence is generated and output.
Natural language dialogue processing device as described.

13. A time management means for measuring a lapse of time after the response sentence generating means generates and outputs a response sentence for confirming whether or not the task command may be executed, After starting the timing by the time management means, even if a predetermined time has elapsed, if there is no response from the user, it is interpreted that an input indicating affirmation has been input to the input means, The natural language dialogue processing device according to claim 11, wherein:

14. The input means comprises voice recognition means for recognizing the input voice and converting and outputting the text to text, wherein the output means inputs text information of the response sentence created by the response sentence generation means. 14. The natural language dialogue processing device according to claim 10, further comprising a voice synthesis unit that performs voice synthesis and outputs the voice from a voice output unit.

15. The natural language dialogue processing device according to claim 14, wherein a predetermined operation of one or a plurality of devices is performed by a voice input from a user who instructs the predetermined operation. Processing equipment.

16. A natural language analysis process for interpreting a natural language input from an input means by referring to a vocabulary dictionary storage unit; and b. A plurality of interpretation results obtained in the natural language analysis process. When present, the plurality of interpretation results are narrowed down to one by referring to the disambiguation information storage unit that stores information used as a clue for narrowing down the plurality of interpretation results to one interpretation result. (C) automatically generating, as a response sentence, the interpretation result narrowed down by the ambiguity resolution processing as a response sentence by referring to the vocabulary dictionary storage unit and outputting the result from the output means; At this time, a response sentence generation process of changing a response sentence according to the certainty factor obtained by the ambiguity eliminating means, and the processes (a) to (c) of the following are executed by a computer. Recording medium recording the eye of the program.

17. The recording medium according to claim 16, wherein: (d) receiving the interpretation determined in the disambiguation processing (b), and executing a task command corresponding to the interpretation by the computer. A recording medium on which a program to be executed is recorded.

18. The recording medium according to claim 17, wherein the response sentence generation processing (c) executes the task after executing the task according to the degree of certainty obtained in the ambiguity resolution processing (b). Generating a response sentence that reports, or includes a process of determining whether to generate a response sentence for confirming whether or not to execute the task before executing the task, A recording medium that stores a program to be executed by a computer.

19. The recording medium according to claim 17, further comprising: a cancel command storage unit for storing a task command of an inverse operation to the executed task command, wherein: (f) an input representing the meaning of cancel is transmitted from said input means. When executed, a program for executing a cancel command and causing the computer to execute a process of generating a response sentence reporting that the cancel command has been executed in the response sentence generation process (c) is recorded. Recording medium.