JP3963698B2

JP3963698B2 - Spoken dialogue system

Info

Publication number: JP3963698B2
Application number: JP2001325122A
Authority: JP
Inventors: 英樹北尾; 英樹中村; 一義荒井; 佳子武田
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 2001-10-23
Filing date: 2001-10-23
Publication date: 2007-08-22
Anticipated expiration: 2021-10-23
Also published as: JP2003131691A

Description

【０００１】
【発明の属する技術分野】
本発明は、車両電装品、例えばエアコン、パワーウインド、さらにはカーナビゲーションシステム、オーディオ機器、ＰＤＡなどの車載機器の制御を、ユーザとの対話に基づいて自動的に遂行する音声対話システムに関する。
【０００２】
【従来の技術】
カーナビゲーションシステム、オーディオ機器など操作が複雑な機器では、音声対話システムを利用することによって機器の操作を簡単化している。この音声対話システムでは、予め決められたステップに基づいて作成される質問をシステムが発話し、それに対するユーザの回答を音声認識し、認識結果に基づいて新たな発話を作成すると言う手順によって、機器の操作に必要な情報をユーザより得るものである。
【０００３】
【発明が解決しようとする課題】
しかしながら、このような従来の音声対話システムでは、音声認識装置を起動させるためにユーザ側で音声認識のタイミングを取る必要があった。実際、従来の装置では、ユーザが「認識開始」あるいは「発声中」を示すスイッチあるいはボタンをあらかじめ押す、もしくは、押しつづけて、音声認識装置を起動する構成である。このような構成は、操作が煩雑であるばかりではなく、運転動作以外にもユーザに負担を与えることとなり、運転ミスを誘発する危険がある。
【０００４】
本発明は、従来装置の上記ような欠点を解決する目的でなされたもので、ユーザ側から対話システムに認識開始のための働きかけをするのではなく、機器が対話システムの開始を自動的に認識することが可能な音声対話システムを提供することを課題とする。
【０００５】
【課題を解決するための手段】
本発明は、上記課題を解決するために、音声対話によりナビゲーションシステムの制御を行うための、車両における音声対話システムであって、パーキングブレーキがオンとなったことを検出するセンサと、前記センサからの出力信号を受信しかつ前記ナビゲーションシステムから車両が駐車場に入場したことを示す信号を受信すると、駐車場の課金情報の登録を促す言葉を出力する音声出力装置と、を備える車両における音声対話システムを提供する。
【０００６】
前記センサは、パーキングブレーキがオンとなったことを検出するセンサであっても良い。
【０００７】
これらのセンサからの出力信号が本システムに入力されると、音声認識装置が自動的に起動され、さらに入力された信号に対応して予め決められた言葉が音声出力装置より出力される。この言葉は、ユーザに対してシステム側から音声対話のきっかけを与える言葉であり、例えば、センサがエンジンのオン状態を検出するものである場合、「何をしますか」と言うような言葉である。したがってユーザがこの言葉に対して、例えば「エアコンをつけたい。」と答えると、音声認識装置がこの発声を認識し、音声出力装置はこの認識に基づいて所定の対話ルーチンを開始することができる。
【０００８】
また、センサがエンジンのオン状態を検出し、さらに運転者が乗車したことを検出すると、音声認識装置が起動され、さらに音声出力装置は「何処へ行きますか」と言うような対話のきっかけとなる言葉を出力する。この問いかけにユーザが例えば、「＊＊＊に行きたい」と答えると、音声認識装置がこれを認識し、音声出力装置はこの認識に基づいて所定の対話ルーチンを開始することができる。
【０００９】
このように、本発明では、車両に設けた種々のセンサからの出力信号によって音声認識装置を起動すると共に、対話システム側から対話のきっかけとなる言葉を自動的に出力するようにしている。したがって、ユーザはスタートボタン等を操作して自ら音声認識装置を起動する必要はない。そのため、音声対話をスムースに開始する事ができる。
【００１０】
また、音声出力装置から出力する言葉の先頭に、「○×さん」、「もしもし」等の特に意味をなさない言葉を付加する事によって、ユーザはシステムからの質問をより聞き取りやすくなる。
【００１１】
【発明の実施の形態】
以下に、図面を参照して本発明の実施形態を説明する。
図１は、本発明の一実施形態にかかる音声対話システムの構造を示すブロック図である。図において１は音声認識装置、２は音声出力装置、３は車両の各部に設けたセンサ、４は本対話システムによって制御すべき車両電装品を示す。音声認識装置１は、ユーザの発声を言葉として認識するための装置であって、一般に音声認識エンジンと認識辞書で構成されている。音声出力装置２は音声認識エンジンと、対話用データベースおよび音声合成エンジンを備えており、音声認識装置１で言葉として認識されたユーザの発声に基づいて予め決められたプログラムにしたがってユーザへの質問を作成し、これを合成音声として出力することで、所定の対話ルーチンを実行する。
【００１２】
音声認識装置１によって以上のようにして獲得された情報は、車両電装品４に送られ、該機器をユーザの意向に沿うように操作する。車両電装品４は、例えば、エアコン、パワーウインドウ、ナビゲーションシステム、オーディオ・ビデオ機器、情報端末機器などである。
【００１３】
本装置では、車両の各部に設けたセンサ３から、車両の種々の状態を示す信号が音声認識装置１および音声出力装置２に対して出力される。このセンサ３は、例えば、エンジンがオンとなったことを検出するセンサ、エンジンがオンとなりかつ運転者が乗車したことを検出するセンサ、燃料が一定量以下となったことを検出するセンサ、ＶＩＣＳの渋滞情報を検出するセンサ、メールの着信を検出するセンサ、パーキングブレーキがオンとなったことを検出するセンサなどである。
【００１４】
センサ３からの出力信号は、同時に音声出力装置２に入力される。この入力によって音声出力装置２は、入力信号に対応して予め決められた言葉を合成音声として出力する。
【００１５】
図２は、センサ３から出力される情報の種類と、それに対応して予め決められている音声出力との関係を示している。図２の（ａ）では、センサ３からエンジンがオン状態となったことを示す信号が出力されると、音声出力装置２から「何をしますか？」と言う質問が出力されることを示している。これに対してユーザが、例えば「エアコンをつけたい」と答えると、この発声を音声認識装置１が認識して、音声出力装置２においてエアコンの制御のための対話ルーチンを起動する。
【００１６】
もし、ユーザが「必要ない」等の言葉を発声すると音声認識装置１がこれを認識して対話を自動的に終了する。あるいは、音声認識装置１で一定時間音声の入力が無いことを検出すると、対話を自動終了するようにしておいても良い。また、音声出力装置２で「必要ありませんね」、「案内を終了します」等の言葉を発声して対話を自動終了するようにしておいても良い。
【００１７】
図２の（ｂ）に示す例では、車載機器としてカーナビゲーションシステムが組み込まれており、センサ３からエンジンがオン状態となったことおよび運転者が乗車したことを示す情報が出力される例を示している。この場合は、運転者がどこかへ移動する確率が非常に高いので、音声出力装置２から「どちらまで案内しましょうか？」等の問いかけを行う。ユーザがこれに対して、「＊＊＊まで行きたい」などと答えると、音声認識装置１がこれを認識して、自動的にナビゲーションシステムの目的地設定の対話を開始する。
【００１８】
なお、音声案内の必要が無い場合は、（ａ）の事例と同様に、ユーザの音声が一定時間入力されないこと、あるいはユーザの「必要ない」と言うような発声によって、対話を自動的に終了する。
【００１９】
これによって、ユーザが乗車時にルーチンワーク的に目的地を設定できる様になり、ユーザが目的地設定のコマンドを覚える必要は無くなる。
【００２０】
図２の（ｃ）に示す例では、図（ｂ）と同様に車載機器としてカーナビゲーションシステムが組み込まれており、センサ３でガソリンの残量を検出している。この場合、ガソリン残量が一定値以下となると、音声出力装置２から「ガソリンが少なくなっています。近くのガソリンスタンドまで案内しましょうか」と言うような問いかけを行うことで、ナビゲーションシステムのための対話を開始する。これにより、ユーザは、燃料の残量を気にしながら運転する必要が無くなる。
【００２１】
図２の（ｄ）に示す例は、（ｃ）の変形であって、センサ３でガソリン残量を検出すると共に、目的地までの距離と燃費を考慮して目的地までの距離＋数十ｋｍ以内になった時に、「現在の燃料では目的地にたどり着けません。次のスタンドで必ず給油して下さい。」等の警告を発し、さらに「スタンドまで案内しましょうか？」と言うような問いかけを行うことで、ナビゲーションのための対話を開始する。
【００２２】
これにより、ユーザは長距離ドライブであっても燃料および燃料の補給先を気にしながら運転する必要がなくなる。
【００２３】
図２の（ｅ）に示す例では、ＶＩＣＳ（道路交通情報通信システム）の情報を入手可能なナビゲーションシステムを搭載している車両において、ＶＩＣＳからの渋滞情報を対話のきっかけとする。即ち、ＶＩＣＳから渋滞情報が入力されると、渋滞が発生している道に差し掛かる前にシステム側で「この先渋滞が発生しています。迂回しますか？」と言うような問いかけを行って、ナビゲーションのための対話を開始する。
【００２４】
これにより、ユーザは一々渋滞の状況を確認しながら運転する必要がなくなるので、運転に専念することができる。
【００２５】
図２の（ｆ）に示す例では、例えばメールの受信を検出する機能を有する情報端末を搭載している車両において、メールを受信するとシステム側から「メールが到着しています。読み上げますか？」と言うような問いかけを行うことで、メールの読み上げのための対話システムを開始する。
【００２６】
図３は、電子メールの着信を対話のきっかけとし、対話を進める場合のフローを示す図面である。まず、センサがメールの着信情報を受信すると、対話システム側から「メールが△件届いています。読み上げますか？」と言うような問いかけを行い、メール読み上げの対話の開始タイミングを形成する（ステップＳ１０）。
【００２７】
この場合、例えばユーザの反応「はい」、「あとで」、「題名は」、「誰から？」に応じて図示するようなメールの読み上げ機能が実行される。例えば、ユーザが「はい」と返事すれば、メール処理のための対話ルーチンに入り、合成された音声によってメールの内容を読み上げる（ステップＳ１１）。ユーザが「あとで」と返事をすれば、システム側では音声対話を一旦終了する（ステップＳ１２）。ユーザが「題名は？」と返事をすれば、合成された音声によってメールの題名を読み上げる（ステップＳ１３）。ユーザが「誰から？」と返事をすれば、合成された音声によって差出人の名前を読み上げる（ステップＳ１４）。なお、メールの差出人に対し、予めニックネーム等が登録できるようにされていることが好ましい。
【００２８】
ステップＳ１３において、システムがメールの題名を読み上げた場合、ユーザがさらに「読んで」と答えれば、システムは合成された音声によりメールの内容を読み上げる（ステップＳ１５）。ステップＳ１３に対するユーザの応答が、「次のは？」である場合、ステップＳ１６で次の題名を合成された音声によって読み上げる。ステップＳ１３に対するユーザの応答が、「あとでいいや」であると、ステップＳ１７において一旦、対話システムを終了する。
【００２９】
ステップＳ１４においてメールの差出人が読み上げられ、ユーザがそれに対して「読んで」と応答すると、ステップＳ１８でシステムはそのメールの内容を合成された音声にて読み上げる。ユーザの応答が、「次は？」である場合は、ステップＳ１９において、システムは次の差出人の名前を合成された音声にて読み上げる。ユーザの応答が、「あとでいいや」である場合には、ステップＳ２０においてシステムは音声対話を一旦終了する。
【００３０】
以上によって、ユーザはメールの着信を自動的に知ることが出来るだけではなく、題名や送信者などで読み上げるべきメールを選択できるので、非常に便利である。
【００３１】
図４は、本発明のさらに他の実施形態を示す図であって、駐車場の課金情報の登録が可能な機能とナビゲーション機能を備えた車両のための音声対話のフローを示す。課金情報を登録する機能としては、予め課金情報を登録しておくことも出来るが、図示する実施形態では、車両が入庫しパーキングブレーキが設定されたことを検出すると、課金情報の登録を促す問いかけをシステム側から発する構成を有している。
【００３２】
まず、ステップＳ３０で車両が駐車場に入庫したことをナビゲーションシステムが認識すると、そのときの時間、即ち入庫時間がシステムにおいて保存される。次にステップＳ３１でパーキングブレーキがオンとされると、システム側から「駐車場の料金システムを登録しますか？」と言うような問いかけを行って、課金情報登録案内のための対話をユーザに促す。
【００３３】
音声対話システムの案内に従って駐車場の料金システムを登録し、エンジンをオフし（ステップＳ３２）、その後エンジンを再びオンとする（ステップＳ３３）と、システムはこの状態を出庫であると判断し、その時間を基に計算された駐車時間と料金案内を出力する。この出力は、例えば、「駐車時間は＊＊＊分ですので、○○○円になります。」等の合成音声による案内である。
【００３４】
これによってユーザは、駐車料金を出庫前に予め知ることが出来るので、非常に便利である。
【００３５】
図５は、本発明のさらに他の実施形態を示す図であって、有料道路の料金情報を備えるナビゲーションシステム、あるいは有料道路の料金情報を登録可能なナビゲーションシステムに本発明の音声対話システムを適用した場合の実施形態を示す。
【００３６】
本システムでは、走行中の車両が有料道路の入り口を通過した場合、ステップＳ４０に示す様にその入り口情報を保存しておき、ステップＳ４１に示す様に料金所が近づいてくるとシステム側から「＊＊＊円になります。」等の案内の言葉を出力し、これを音声認識開始のタイミングとする。これによってユーザは、音声認識開始のボタンを押すことなく「次の料金所は？」、「次はいくら？」と言うような質問をシステムに対して行う事ができる。
【００３７】
車両の走行スピードが０となった時点でシステムは料金が支払われたことを認識し（ステップＳ４２）、音声認識を停止する。車両が走行を再開すると、システムは料金支払いが終了したことを認識し（ステップＳ４３）、音声認識を開始する。
【００３８】
これによって、ユーザは、有料道路の料金所手前で予め通行料金を知ることが出来るので、支払いの準備がスムースに行え、料金所を通過する時間が短縮できる。
【００３９】
以上、図１から図５を参照して本発明の種々の実施形態を説明したが、上記各実施形態は種々の変更が可能である。例えば、上記全ての機能を備えている音声対話システムにおいて、複数の条件が重なった場合の優先度や、要、不要を任意に設定できるようにしても良い。例えば、メールの着信と料金案内が同時に発生する場合には、料金案内を優先し、一連の対話が終了した時点で、メールの案内を行うように設定したり、料金案内は行わずにメールだけを読ませるように設定したりすることも可能である。
【００４０】
さらに、上記全ての機能を備えている音声対話システムおいて、案内や問いかけを行う機能を選択できるようにすることもできる。例えば料金案内の要否や、燃料警告の要否などを選択できるようにする。
【００４１】
また、音声対話システムの欠点として、ユーザがシステム側からの案内のはじめの部分を聞き逃す確率が高いことがある。そこで、対話のきっかけとして、「○○さん」、「もしもし」、「ちょっと失礼します」等の、対話の中身とは無意味な言葉を文頭に発するようにしても良い。発する無意味語は、毎回同じ言葉でも良いし、複数のバリエーションを設けておいても良い。
【００４２】
さらに、案内する情報に応じて、対話のきっかけとなる言葉を変化させることも可能である。例えば、ＶＩＣＳ渋滞情報などで渋滞状況が変化したときに、「速報です」等の言葉を文頭につけることも可能である。
【００４３】
さらに、情報の緊急度に応じて、対話のきっかけとなる文章を変化させても良い。例えば、有料道路の料金案内において、ゆっくり走っている時には、走行区間や料金の案内を詳細に行い、高速で走っている場合には、料金の案内のみで済ますこともできる。一例として、渋滞や低速走行時には、システム側から「まもなく料金所です。○○インターチェンジから××インターチェンジまでの料金は△△円です」と発話し、高速走行時には「料金所です。料金は△△円です」と発話する。
【００４４】
さらに、上記無意味な呼びかけの直後に、ユーザが「ちょっと待って」、「待って」などと応答すると、これをシステム側で認識し、認識状態を維持したままシステムを待機状態へ移行させ、対話を保留することも可能である。この場合、ユーザの次の発声、「はい」、「何？」を認識して対話を継続する。あるいは、対話を保留した状態で一定時間経過した後、例えば「案内してよろしいでしょうか」、「まだでしょうか」などの言葉を再度問いかけることも有効である。また、一定時間経過後に、例えば「案内を終了します。○○情報でした」などと、何の情報を提供しようとしたかを示すことも有効である。またさらに、対話の保留中に情報が不要となったと判断した場合に、対話の保留状態を解除することも可能である。例えば、駐車料金案内を保留したまま駐車場を出てしまった場合などでは、料金案内は不要であるため、「保留された料金案内を解除します」などの通知を行ってその場の案内を中止するようにしても良い。
【００４５】
【発明の効果】
以上、種々の実施形態を示して説明したように、本発明の音声対話システムでは、車両の各部に設けたセンサからの出力信号に応じて、音声出力装置によって対話のきっかけの言葉を出力することが出来る。したがって、ユーザは音声対話システムに対して対話のきっかけを自身から作る必要がないため、運転に専念することができる。また、必要な音声案内のタイミングを逃すことがない。
【図面の簡単な説明】
【図１】本発明の一実施形態にかかる音声対話システムを示すブロック図。
【図２】図１に示すシステムの動作説明に供する図。
【図３】本発明の一実施形態のシステムによってメール読み上げの機能を実行する場合のフローを示す図。
【図４】本発明の一実施形態のシステムによって駐車場の料金案内の機能を実行する場合のフローを示す図。
【図５】本発明の一実施形態のシステムによって有料道路の料金案内を実行する場合のフローを示す図。
【符号の説明】
１…音声対話システム
２…車載機器
１１…音声認識装置
１２…音声出力装置
１３…車両各部のセンサ

[0001]
[Technical Field of the Invention]
The present invention relates to a voice dialogue system that automatically controls vehicle electrical components, such as an air conditioner and power windows, as well as in-vehicle devices such as a car navigation system, audio equipment, and a PDA, based on a dialogue with the user.
[0002]
[Prior art]
In devices that are complicated to operate, such as car navigation systems and audio equipment, operation is simplified by using a voice dialogue system. In such a system, the system utters a question created according to predetermined steps, recognizes the user's spoken answer, and creates a new utterance based on the recognition result. Through this procedure the system obtains from the user the information needed to operate the device.
[0003]
[Problems to be solved by the invention]
However, in such a conventional voice dialogue system, the user had to time the start of voice recognition in order to activate the voice recognition device. In practice, conventional devices are configured so that the user activates the voice recognition device by pressing in advance, or holding down, a switch or button indicating “start recognition” or “speaking”. Such a configuration is not only cumbersome to operate but also places a burden on the user beyond the driving task itself, and risks inducing driving errors.
[0004]
The present invention has been made to solve the above drawbacks of conventional devices. Its object is to provide a voice dialogue system in which, rather than the user prompting the dialogue system to start recognition, the equipment can automatically recognize when the dialogue should start.
[0005]
[Means for Solving the Problems]
To solve the above problem, the present invention provides a voice dialogue system in a vehicle for controlling a navigation system by voice dialogue, comprising: a sensor that detects that the parking brake has been turned on; and a voice output device that outputs words prompting registration of parking lot billing information upon receiving the output signal from the sensor and receiving, from the navigation system, a signal indicating that the vehicle has entered a parking lot.
[0006]
The sensor may be a sensor that detects that the parking brake has been turned on.
[0007]
When output signals from these sensors are input to the system, the voice recognition device is automatically activated, and predetermined words corresponding to the input signal are output from the voice output device. These words give the user a cue for voice dialogue from the system side. For example, if the sensor detects that the engine is on, the words are something like “What would you like to do?” When the user answers, for example, “I want to turn on the air conditioner,” the voice recognition device recognizes this utterance, and the voice output device can start a predetermined dialogue routine based on this recognition.
[0008]
In addition, when the sensor detects that the engine is on and further detects that the driver has gotten in, the voice recognition device is activated, and the voice output device outputs words that serve as a cue for dialogue, such as “Where are you going?” When the user answers this question with, for example, “I want to go to ***,” the voice recognition device recognizes this, and the voice output device can start a predetermined dialogue routine based on this recognition.
[0009]
As described above, in the present invention, the voice recognition device is activated by output signals from various sensors provided in the vehicle, and words that serve as a cue for dialogue are automatically output from the dialogue system side. The user therefore does not need to activate the voice recognition device by operating a start button or the like, and the voice dialogue can begin smoothly.
[0010]
In addition, by adding words that carry no particular meaning, such as “Mr./Ms. ○×” or “Hello” (“moshi-moshi”), to the beginning of the words output from the voice output device, the user can hear questions from the system more easily.
[0011]
[Embodiments of the Invention]
Embodiments of the present invention will be described below with reference to the drawings.
FIG. 1 is a block diagram showing the structure of a voice dialogue system according to an embodiment of the present invention. In the figure, 1 is a voice recognition device, 2 is a voice output device, 3 is a sensor provided in each part of the vehicle, and 4 is a vehicle electrical component to be controlled by this dialogue system. The voice recognition device 1 is a device for recognizing the user's utterance as words, and generally consists of a voice recognition engine and a recognition dictionary. The voice output device 2 includes a voice recognition engine, a dialogue database, and a voice synthesis engine. Based on the user's utterance recognized as words by the voice recognition device 1, it creates questions for the user according to a predetermined program and outputs them as synthesized speech, thereby executing a predetermined dialogue routine.
[0012]
The information acquired by the voice recognition device 1 as described above is sent to the vehicle electrical component 4, which is then operated in accordance with the user's intention. The vehicle electrical component 4 is, for example, an air conditioner, a power window, a navigation system, an audio/video device, or an information terminal device.
[0013]
In this apparatus, signals indicating various states of the vehicle are output from the sensors 3 provided in each part of the vehicle to the voice recognition device 1 and the voice output device 2. The sensors 3 are, for example, a sensor that detects that the engine has been turned on, a sensor that detects that the engine has been turned on and the driver has gotten in, a sensor that detects that the fuel has fallen to or below a certain amount, a sensor that detects VICS traffic congestion information, a sensor that detects incoming mail, and a sensor that detects that the parking brake has been turned on.
[0014]
Output signals from the sensor 3 are simultaneously input to the voice output device 2. On receiving this input, the voice output device 2 outputs, as synthesized speech, predetermined words corresponding to the input signal.
[0015]
FIG. 2 shows the relationship between the type of information output from the sensor 3 and the voice output predetermined for it. FIG. 2(a) shows that when a signal indicating that the engine has been turned on is output from the sensor 3, the question “What would you like to do?” is output from the voice output device 2. When the user answers, for example, “I want to turn on the air conditioner,” the voice recognition device 1 recognizes this utterance, and a dialogue routine for controlling the air conditioner is started in the voice output device 2.
[0016]
If the user utters words such as “Not necessary,” the voice recognition device 1 recognizes this and automatically ends the dialogue. Alternatively, the dialogue may be ended automatically when the voice recognition device 1 detects that there has been no voice input for a certain period of time. The dialogue may also be ended automatically after the voice output device 2 utters words such as “I see you don't need it” or “Ending guidance.”
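The two user-side termination conditions described above (an explicit refusal, or silence for a certain period) can be sketched as follows. This is only an illustrative sketch: the function name, the set of refusal phrases beyond “not necessary”, and the 10-second timeout are assumptions, since the description gives no concrete values.

```python
from typing import Optional

# Refusal phrases: only "not necessary" comes from the description;
# "no thanks" is a hypothetical addition for illustration.
DECLINE_PHRASES = {"not necessary", "no thanks"}

# Assumed value; the description only says "a certain period of time".
SILENCE_TIMEOUT_S = 10.0


def should_end_dialogue(utterance: Optional[str], silence_s: float) -> bool:
    """Return True when the dialogue should be ended automatically.

    utterance is the recognized user speech, or None if nothing was heard.
    silence_s is the time in seconds since the system's last prompt.
    """
    if utterance is None:
        # No voice input: end once the silence timeout has elapsed.
        return silence_s >= SILENCE_TIMEOUT_S
    # Voice input: end only if the user declined.
    return utterance.strip().lower() in DECLINE_PHRASES
```

A caller would typically evaluate this after each recognition attempt and, when it returns True, let the voice output device speak a closing phrase before stopping recognition.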
[0017]
FIG. 2(b) shows an example in which a car navigation system is incorporated as an in-vehicle device and the sensor 3 outputs information indicating that the engine has been turned on and that the driver has gotten in. In this case, since the driver is very likely to be going somewhere, the voice output device 2 asks a question such as “Where shall I guide you to?” When the user answers, for example, “I want to go to ***,” the voice recognition device 1 recognizes this and automatically starts the dialogue for setting the destination of the navigation system.
[0018]
If voice guidance is not needed, the dialogue is ended automatically, as in case (a), when no user voice is input for a certain period of time or when the user says something like “Not necessary.”
[0019]
As a result, the user can set the destination as a matter of routine when getting into the vehicle, and no longer needs to memorize the destination-setting command.
[0020]
In the example shown in FIG. 2(c), a car navigation system is incorporated as an in-vehicle device as in (b), and the remaining amount of gasoline is detected by the sensor 3. In this case, when the remaining gasoline falls to or below a certain value, the voice output device 2 asks a question such as “Gasoline is running low. Shall I guide you to a nearby gas station?” and thereby starts a dialogue for the navigation system. This eliminates the need for the user to drive while worrying about the remaining fuel.
[0021]
The example shown in FIG. 2(d) is a modification of (c). The remaining amount of gasoline is detected by the sensor 3 and, taking into account the distance to the destination and the fuel economy, when the remaining fuel is only enough for the distance to the destination plus several tens of kilometers or less, the system issues a warning such as “You cannot reach your destination on the current fuel. Be sure to refuel at the next gas station.” It then asks a question such as “Shall I guide you to a gas station?” and thereby starts a dialogue for navigation.
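One possible reading of the condition in example (d) is a comparison between the range available on the remaining fuel and the distance to the destination plus a margin. The sketch below assumes that reading. The function name, the parameter names, and the 30 km default margin (standing in for “several tens of kilometers”) are hypothetical and are not taken from the description.

```python
def fuel_warning_needed(fuel_l: float,
                        km_per_l: float,
                        distance_to_destination_km: float,
                        margin_km: float = 30.0) -> bool:
    """Return True when the fuel warning of example (d) should be issued.

    The estimated range is the remaining fuel multiplied by the fuel economy.
    The warning fires when that range is no more than the distance to the
    destination plus an assumed safety margin.
    """
    range_km = fuel_l * km_per_l
    return range_km <= distance_to_destination_km + margin_km
```

For instance, with 5 L left at 12 km/L and a destination 50 km away, the estimated range of 60 km is inside the 80 km threshold, so the warning would be issued.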
[0022]
This eliminates the need for the user to drive while paying attention to the fuel and the fuel replenishment destination even in long-distance driving.
[0023]
In the example shown in FIG. 2(e), in a vehicle equipped with a navigation system capable of obtaining VICS (Vehicle Information and Communication System) information, congestion information from VICS is used as the trigger for the dialogue. That is, when congestion information is input from VICS, the system asks a question such as “There is a traffic jam ahead. Do you want to take a detour?” before the vehicle reaches the congested road, and thereby starts a dialogue for navigation.
[0024]
This eliminates the need for the user to keep checking congestion conditions while driving, so the user can concentrate on driving.
[0025]
In the example shown in FIG. 2(f), in a vehicle equipped with, for example, an information terminal that can detect the reception of e-mail, when mail is received the system asks a question such as “You have mail. Shall I read it aloud?” and thereby starts a dialogue for reading the mail aloud.
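The correspondence in FIG. 2 between sensor information and the opening words can be pictured as a simple lookup table. The following is a minimal sketch under that assumption: the enum and function names are hypothetical, and the prompt strings are paraphrases of examples (a) to (f), not text fixed by the description.

```python
from enum import Enum, auto


class Trigger(Enum):
    """Hypothetical names for the sensor events of FIG. 2 (a) to (f)."""
    ENGINE_ON = auto()                    # (a)
    ENGINE_ON_DRIVER_IN = auto()          # (b)
    FUEL_LOW = auto()                     # (c)
    FUEL_INSUFFICIENT_FOR_ROUTE = auto()  # (d)
    VICS_CONGESTION = auto()              # (e)
    MAIL_RECEIVED = auto()                # (f)


# Opening words per trigger, paraphrased from the examples in the text.
OPENING_PROMPTS = {
    Trigger.ENGINE_ON: "What would you like to do?",
    Trigger.ENGINE_ON_DRIVER_IN: "Where shall I guide you to?",
    Trigger.FUEL_LOW: ("Gasoline is running low. "
                       "Shall I guide you to a nearby gas station?"),
    Trigger.FUEL_INSUFFICIENT_FOR_ROUTE: (
        "You cannot reach your destination on the current fuel. "
        "Shall I guide you to a gas station?"),
    Trigger.VICS_CONGESTION: ("There is a traffic jam ahead. "
                              "Do you want to take a detour?"),
    Trigger.MAIL_RECEIVED: "You have mail. Shall I read it aloud?",
}


def opening_prompt(trigger: Trigger) -> str:
    """Return the words the voice output device speaks for a sensor event."""
    return OPENING_PROMPTS[trigger]
```

Keeping the mapping in data rather than in branching code would make it straightforward to add the parking and toll triggers of the later embodiments.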
[0026]
FIG. 3 is a diagram showing the flow when an incoming e-mail is used as the trigger for a dialogue and the dialogue then proceeds. First, when the sensor receives incoming-mail information, the dialogue system asks a question such as “You have △ messages. Shall I read them aloud?” and thereby establishes the start timing for the mail-reading dialogue (step S10).
[0027]
In this case, the mail-reading functions shown in the figure are executed according to the user's response, for example “Yes,” “Later,” “What is the title?” or “Who is it from?” For example, if the user replies “Yes,” the system enters the dialogue routine for mail processing and reads the content of the mail aloud in synthesized speech (step S11). If the user replies “Later,” the system ends the voice dialogue for the time being (step S12). If the user replies “What is the title?”, the title of the mail is read aloud in synthesized speech (step S13). If the user replies “Who is it from?”, the name of the sender is read aloud in synthesized speech (step S14). It is preferable that a nickname or the like can be registered in advance for each mail sender.
[0028]
After the system has read out the title of the mail in step S13, if the user further answers “Read it,” the system reads the content of the mail aloud in synthesized speech (step S15). If the user's response to step S13 is “What is the next one?”, the next title is read aloud in synthesized speech in step S16. If the user's response to step S13 is “Later is fine,” the dialogue is ended for the time being in step S17.
[0029]
After the sender of the mail has been read out in step S14, if the user responds “Read it,” the system reads the content of that mail aloud in synthesized speech in step S18. If the user's response is “What is next?”, the system reads the name of the next sender aloud in synthesized speech in step S19. If the user's response is “Later is fine,” the system ends the voice dialogue for the time being in step S20.
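The branching of FIG. 3 (steps S10 to S20) can be sketched as a small transition table keyed on the current dialogue state and the recognized reply. This is an illustrative sketch only. The state names, the normalized reply keys, and the choice to remain in the same state after “next” (steps S16 and S19) and to finish after reading a mail are assumptions that the description does not spell out.

```python
from typing import Dict, Optional, Tuple

# (state, normalized reply) -> (flowchart step, next state).
# "ask" is the state after the opening question of step S10.
TRANSITIONS: Dict[Tuple[str, str], Tuple[str, str]] = {
    ("ask", "yes"): ("S11", "done"),          # read the mail content
    ("ask", "later"): ("S12", "done"),        # end the dialogue for now
    ("ask", "title"): ("S13", "title"),       # read the title
    ("ask", "from whom"): ("S14", "sender"),  # read the sender's name
    ("title", "read it"): ("S15", "done"),    # read the mail content
    ("title", "next"): ("S16", "title"),      # read the next title
    ("title", "later"): ("S17", "done"),      # end the dialogue for now
    ("sender", "read it"): ("S18", "done"),   # read the mail content
    ("sender", "next"): ("S19", "sender"),    # read the next sender
    ("sender", "later"): ("S20", "done"),     # end the dialogue for now
}


def mail_step(state: str, reply: str) -> Tuple[Optional[str], str]:
    """Return (flowchart step, next state) for a recognized reply.

    An unrecognized reply yields (None, state), leaving the dialogue
    where it was.
    """
    return TRANSITIONS.get((state, reply), (None, state))
```

A real system would map free-form utterances such as “Who is it from?” onto the normalized keys before calling `mail_step`.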
[0030]
As described above, the user is not only notified of incoming mail automatically but can also select which mail to have read aloud by title or sender, which is very convenient.
[0031]
FIG. 4 is a diagram showing still another embodiment of the present invention, and shows the flow of voice dialogue for a vehicle that has a navigation function and a function for registering parking lot billing information. The billing information can also be registered in advance, but in the illustrated embodiment, when it is detected that the vehicle has entered the parking lot and the parking brake has been set, the system issues a question prompting the user to register the billing information.
[0032]
First, when the navigation system recognizes in step S30 that the vehicle has entered a parking lot, the time at that moment, that is, the entry time, is stored in the system. Next, when the parking brake is turned on in step S31, the system asks a question such as “Do you want to register the parking lot's fee system?” to prompt the user to begin the dialogue for billing information registration guidance.
[0033]
When the parking lot's fee system has been registered according to the guidance of the voice dialogue system, the engine has been turned off (step S32), and the engine is then turned on again (step S33), the system judges this state to be the vehicle leaving the parking lot, and outputs the parking time calculated from that time together with fee guidance. This output is, for example, guidance in synthesized speech such as “Your parking time is *** minutes, so the fee is ○○○ yen.”
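The fee guidance of FIG. 4 amounts to computing the elapsed time between the stored entry time (step S30) and the engine restart (step S33), then applying the registered fee system. The sketch below assumes a simple fee system that charges a fixed amount per started time unit. That fee model, the function name, and the parameter names are hypothetical, since the description does not say what form the registered fee system takes.

```python
import math
from datetime import datetime


def parking_guidance(entry_time: datetime,
                     exit_time: datetime,
                     yen_per_unit: int,
                     unit_minutes: int) -> str:
    """Build the spoken fee guidance for leaving a parking lot.

    Assumed fee model: yen_per_unit is charged for every started block
    of unit_minutes minutes.
    """
    # Whole minutes parked between entry (S30) and engine restart (S33).
    minutes = int((exit_time - entry_time).total_seconds() // 60)
    units = math.ceil(minutes / unit_minutes)
    fee = units * yen_per_unit
    return f"Your parking time is {minutes} minutes, so the fee is {fee} yen."
```

With an entry at 10:00, a restart at 11:10, and a hypothetical rate of 200 yen per 30 minutes, three blocks are charged and the guidance announces 70 minutes and 600 yen.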
[0034]
As a result, the user can know the parking fee in advance before leaving, which is very convenient.
[0035]
FIG. 5 is a diagram showing still another embodiment of the present invention, in which the voice dialogue system of the present invention is applied to a navigation system that holds toll road fee information or a navigation system in which toll road fee information can be registered.
[0036]
In this system, when the traveling vehicle passes a toll road entrance, the entrance information is stored as shown in step S40. When a toll gate approaches, as shown in step S41, the system outputs guidance words such as “The toll will be *** yen,” and this is used as the timing for starting voice recognition. As a result, the user can ask the system questions such as “Where is the next toll gate?” or “How much is the next one?” without pressing a button to start voice recognition.
[0037]
When the traveling speed of the vehicle becomes 0, the system recognizes that the toll has been paid (step S42) and stops voice recognition. When the vehicle resumes traveling, the system recognizes that the toll payment has been completed (step S43) and starts voice recognition.
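The way voice recognition is switched on and off around a toll gate (steps S41 to S43) can be sketched as a small state holder driven by two events: approaching the gate and speed updates. The class name, method names, and attribute names below are hypothetical, and how the toll amount is looked up from the stored entrance information is left outside the sketch.

```python
class TollGateRecognition:
    """Tracks whether voice recognition is active around a toll gate."""

    def __init__(self) -> None:
        self.recognizing = False      # is voice recognition running?
        self.stopped_at_gate = False  # has the vehicle stopped to pay?

    def on_approach_gate(self, toll_yen: int) -> str:
        """Step S41: announce the toll and start voice recognition."""
        self.recognizing = True
        return f"The toll will be {toll_yen} yen."

    def on_speed(self, speed_kmh: float) -> None:
        """Steps S42 and S43: pause recognition while stopped, then resume."""
        if speed_kmh == 0 and self.recognizing:
            # S42: speed 0 is taken to mean the toll is being handled.
            self.recognizing = False
            self.stopped_at_gate = True
        elif speed_kmh > 0 and self.stopped_at_gate:
            # S43: the vehicle is moving again, so payment has finished.
            self.stopped_at_gate = False
            self.recognizing = True
```

Speed updates that arrive before any toll announcement leave both flags unchanged, so ordinary stops elsewhere on the route do not toggle recognition in this sketch.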
[0038]
As a result, the user can know the toll in advance before the toll gate on the toll road, so that preparation for payment can be made smoothly and the time for passing through the toll gate can be shortened.
[0039]
While various embodiments of the present invention have been described above with reference to FIGS. 1 to 5, various modifications can be made to these embodiments. For example, in a voice dialogue system having all of the above functions, the priority applied when multiple conditions coincide, and whether each function is needed at all, may be made freely configurable. For example, when an incoming mail and fee guidance occur at the same time, the system can be set to give priority to the fee guidance and to announce the mail once that series of dialogue has finished, or it can be set to skip the fee guidance and only read the mail.
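The configurable priority and per-function on/off settings described above can be sketched as a filter followed by a sort. The guidance names, the numeric priorities, and the function name below are hypothetical. The description only gives the example of fee guidance taking precedence over mail, or being switched off entirely.

```python
from typing import Dict, List


def order_guidance(pending: List[str],
                   priority: Dict[str, int],
                   enabled: Dict[str, bool]) -> List[str]:
    """Return the pending guidance items in the order they should be spoken.

    Items whose function is disabled are dropped. The rest are sorted by
    priority, where a lower number is spoken first. Python's sort is
    stable, so items with equal priority keep their arrival order.
    """
    active = [name for name in pending if enabled.get(name, False)]
    return sorted(active, key=lambda name: priority[name])
```

For example, with priorities `{"fee_guidance": 0, "mail": 1}` and both functions enabled, a mail that arrives just before fee guidance is still announced after it. Disabling `fee_guidance` leaves only the mail.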
[0040]
Furthermore, in a voice dialogue system having all of the above functions, the user may be allowed to select which guidance and question functions are active. For example, the user can choose whether fee guidance is needed and whether fuel warnings are needed.
[0041]
A further drawback of voice dialogue systems is that the user is likely to miss the first part of the guidance from the system. Therefore, as a cue for the dialogue, words unrelated to the content of the dialogue, such as “Mr./Ms. ○○,” “Hello,” or “Excuse me for a moment,” may be uttered at the beginning of the sentence. These filler words may be the same every time, or several variations may be provided.
[0042]
It is also possible to vary the words that cue the dialogue according to the information being announced. For example, when congestion conditions change according to VICS congestion information or the like, words such as “Breaking news” can be placed at the beginning of the sentence.
[0043]
Furthermore, the sentence that cues the dialogue may be varied according to the urgency of the information. For example, in toll road fee guidance, the travel section and the toll may be announced in detail when the vehicle is traveling slowly, while only the toll is announced when it is traveling at high speed. As an example, in congestion or at low speed the system says, “You will soon reach the toll gate. The toll from ○○ interchange to ×× interchange is △△ yen,” and at high speed it says, “Toll gate. The toll is △△ yen.”
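The speed-dependent wording of the toll guidance can be sketched as a single branch on vehicle speed. The function name, the parameter names, and the 40 km/h threshold are assumptions made for illustration, because the description distinguishes only “slow” and “high-speed” travel without giving a figure.

```python
def toll_announcement(speed_kmh: float,
                      entry_ic: str,
                      exit_ic: str,
                      toll_yen: int,
                      slow_threshold_kmh: float = 40.0) -> str:
    """Choose detailed or brief toll guidance according to vehicle speed."""
    if speed_kmh <= slow_threshold_kmh:
        # Congestion or low speed: there is time for the detailed version.
        return (f"You will soon reach the toll gate. The toll from "
                f"{entry_ic} interchange to {exit_ic} interchange "
                f"is {toll_yen} yen.")
    # High speed: announce only the amount.
    return f"Toll gate. The toll is {toll_yen} yen."
```

The same pattern could be extended with more speed bands or with an urgency score, but the description itself only contrasts these two cases.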
[0044]
Furthermore, if the user responds with “Wait a moment,” “Wait,” or the like immediately after the filler call described above, the system can recognize this, move to a standby state while keeping recognition active, and put the dialogue on hold. In this case, the dialogue continues once the system recognizes the user's next utterance, such as “Yes” or “What?” Alternatively, after a certain time has passed with the dialogue on hold, it is also effective for the system to ask again with words such as “May I give you the guidance now?” or “Are you ready yet?” It is also effective, after a certain time has passed, to indicate what information the system was going to provide, for example “Ending guidance. That was ○○ information.” Furthermore, if the system judges during the hold that the information is no longer needed, the hold can be released. For example, if the vehicle leaves the parking lot while the parking fee guidance is on hold, the fee guidance is no longer needed, so the system may give a notice such as “The fee guidance that was on hold is cancelled” and abandon that guidance.
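The hold behaviour described above can be sketched as a small state machine with the states “calling” (the filler call has just been spoken), “on_hold”, “guiding”, and “ended”. The state names, event names, and exact system phrases are assumptions. The description lists the behaviours but does not define a concrete state model.

```python
from typing import Optional, Tuple

HOLD_WORDS = {"wait", "wait a moment"}
RESUME_WORDS = {"yes", "what?"}


def hold_transition(state: str,
                    event: str,
                    utterance: str = "") -> Tuple[str, Optional[str]]:
    """Return (new state, system utterance or None) for one event.

    event is "user" (with the recognized utterance), "timeout" (a certain
    time has passed on hold), or "obsolete" (the information is no longer
    needed, for example the vehicle already left the parking lot).
    """
    if state == "calling" and event == "user" and utterance in HOLD_WORDS:
        return "on_hold", None  # keep recognition active, but wait
    if state == "on_hold" and event == "user" and utterance in RESUME_WORDS:
        return "guiding", None  # continue with the held guidance
    if state == "on_hold" and event == "timeout":
        return "on_hold", "May I give you the guidance now?"
    if state == "on_hold" and event == "obsolete":
        return "ended", "The guidance that was on hold is cancelled."
    return state, None          # anything else leaves the state unchanged
```

The alternative timeout behaviour mentioned in the text, ending the guidance while naming the information that was on offer, could be added as a second timeout event in the same table.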
[0045]
[Effects of the Invention]
As described above with reference to the various embodiments, in the voice dialogue system of the present invention the voice output device can output words that cue a dialogue in response to output signals from sensors provided in each part of the vehicle. The user therefore does not need to initiate the dialogue with the voice dialogue system and can concentrate on driving. In addition, the timing for necessary voice guidance is not missed.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a voice interaction system according to an embodiment of the present invention.
FIG. 2 is a diagram for explaining the operation of the system shown in FIG. 1.
FIG. 3 is a diagram showing a flow when a mail reading function is executed by the system according to the embodiment of the present invention.
FIG. 4 is a diagram showing a flow in the case of executing a parking fee guide function by the system according to the embodiment of the present invention.
FIG. 5 is a diagram showing a flow in a case where toll road fee guidance is executed by the system of one embodiment of the present invention.
[Explanation of symbols]
1 ... Voice dialogue system
2 ... In-vehicle device
11 ... Voice recognition device
12 ... Voice output device
13 ... Sensors in each part of the vehicle

Claims

A voice dialogue system in a vehicle for controlling a navigation system by voice dialogue, comprising:
a sensor that detects that the parking brake has been turned on; and
a voice output device that, upon receiving an output signal from the sensor and receiving from the navigation system a signal indicating that the vehicle has entered a parking lot, outputs words prompting registration of billing information for the parking lot.