JP4984708B2

JP4984708B2 - Information processing apparatus having voice dialogue function

Info

Publication number: JP4984708B2
Application number: JP2006199376A
Authority: JP
Inventors: 英志北川; 俊之福岡; 鏡子奥山; 拓郎池田; 智則池谷
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2006-07-21
Filing date: 2006-07-21
Publication date: 2012-07-25
Anticipated expiration: 2026-07-21
Also published as: JP2008026621A

Description

本発明は、音声対話機能を有する情報処理装置または電子機器に関し、特に、カーナビゲーション・システムおよび音声サービス機能を有する情報処理装置において複数のサービスの音声処理の優先度に応じて低い優先度の音声処理を中断させることなく適正なタイミングで異なる優先度の音声処理を実行する技術に関する。 The present invention relates to an information processing apparatus or electronic device having a voice interaction function, and in particular, in a car navigation system and an information processing apparatus having a voice service function, a low priority voice according to a priority of voice processing of a plurality of services. The present invention relates to a technique for executing audio processing with different priorities at an appropriate timing without interrupting the processing.

カーナビゲーション・システムおよび車両用のテレマティクス端末においては、ラジオのようにユーザに一方的な情報提供をするだけでなく、ユーザからの音声入力やボタン入力などの入力操作を処理して、対話形式で情報を提供することができる。それによって、ユーザは、経路案内、天気予報およびニュース等の様々な情報提供サービスを利用できる。また、複数のサービスが提供される場合でも、ユーザはサービスメニューの階層をたどってサービスを指定したり、直接サービスを呼び出したりすることによって、応答するサービスを切り替えながら利用することができる。 Car navigation systems and telematics terminals for vehicles not only provide unilateral information to the user like a radio, but also process input operations such as voice input and button input from the user in an interactive format. Information can be provided. Thereby, the user can use various information providing services such as route guidance, weather forecast, and news. Further, even when a plurality of services are provided, the user can use the services to be switched while specifying the services by following the hierarchy of the service menu or by directly calling the services.

特に車内でのサービス利用においては、運転を阻害しないという観点から音声インタフェースの利用が重視されている。このために、ユーザへの情報提供およびサービスの操作などを、音声を利用して対話的に行うことができるカーナビゲーション・システムやテレマティクス端末が既に提供されている。 In particular, in the use of services in the vehicle, the use of voice interfaces is emphasized from the viewpoint of not hindering driving. For this reason, a car navigation system and a telematics terminal capable of interactively using voice to provide information to users and service operations have been already provided.

通常の対話型の情報処理システムでは、ユーザの所望のサービスを提供するためのそのシステムの動作が記述されたサービス・シナリオを処理することによって、ユーザに様々なサービスを提供する。一般に、サービス・シナリオは状態遷移モデルで表現され、その各状態にはサービスの出力情報が割り当てられ、各遷移にはユーザの入力情報の候補が割り当てられている。対話型の情報処理システムは、状態遷移モデルにおける現在の状態に割り当てられたサービスの出力情報の内容を出力し、それに応答して入力されたユーザの入力情報に応じて、次の状態へと遷移し、これを繰り返すことによって対話を進行させる。 In an ordinary interactive information processing system, various services are provided to a user by processing a service scenario in which an operation of the system for providing a user's desired service is described. In general, a service scenario is expressed by a state transition model, service output information is assigned to each state, and user input information candidates are assigned to each transition. The interactive information processing system outputs the contents of the output information of the service assigned to the current state in the state transition model, and transitions to the next state according to the user input information input in response. Then, the dialogue is advanced by repeating this.

サービス・シナリオは、例えば、ＶｏｉｃｅＸＭＬのような音声対話記述言語によって記述される。ユーザの入力情報の項目の候補はＳＲＧＳ（Speech Recognition Grammar Specification）のような文法記述言語によって記述される。ＶｏｉｃｅＸＭＬインタプリタはサービス・シナリオを解釈して、ユーザが入力した情報に従って、情報提供、予約および機器操作などを対話的に実行する。 The service scenario is described by a voice interaction description language such as VoiceXML. Candidates for user input information items are described in a grammar description language such as SRGS (Speech Recognition Grammar Specification). The VoiceXML interpreter interprets a service scenario and interactively executes information provision, reservation, device operation, and the like according to information input by the user.

対話型の情報処理システムでは、様々な種類のユーザ入力を扱うことができる。例えば、ユーザの音声入力を受け付けるために音声認識技術が用いられる。音声認識技術では、音声認識モジュール（ＡＳＲ、Automatic Speech Recognition）によって、所定の入力期間において入力されたユーザの入力音声をテキスト情報に変換する。音声認識モジュールは、指定された入力項目の候補を参照して、ユーザ入力が入力候補のいずれかと一致していると推定された場合に、それを認識結果とする。また、システムからユーザに対する音声を出力するために音声合成技術が用いられる。音声合成技術では、音声合成モジュール（ＴＴＳ、Text To Speech）によって、所定の出力期間においてテキスト情報を自動的に出力音声に変換して出力する。ユーザは、対話型システムに直接的に結合された入力装置および出力装置だけでなく、固定電話機、携帯電話機、ＰＤＡ等の移動体通信機器、およびカーナビゲーション装置のような情報処理機器を用いて、通信ネットワークを介して対話型システムに接続して、情報処理サービスを利用することができる。 An interactive information processing system can handle various types of user input. For example, voice recognition technology is used to accept a user's voice input. In the speech recognition technology, a speech recognition module (ASR, Automatic Speech Recognition) converts user input speech input during a predetermined input period into text information. The speech recognition module refers to the designated input item candidate, and when it is estimated that the user input matches any of the input candidates, the speech recognition module regards it as a recognition result. In addition, a voice synthesis technique is used to output a voice to the user from the system. In the speech synthesis technology, text information is automatically converted into output speech and output in a predetermined output period by a speech synthesis module (TTS, Text To Speech). The user uses not only input devices and output devices directly coupled to the interactive system, but also mobile communication devices such as fixed phones, mobile phones, PDAs, and information processing devices such as car navigation devices, An information processing service can be used by connecting to an interactive system via a communication network.

特開平１０−４７９８４号公報（特許文献１）には、カーナビゲーション装置が記載されている。そのカーナビゲーション装置の経路案内タイミング制御装置は、基準時間Ｔｓを設定するレジスタと、自車が曲がるべき交差点への到達時間Ｔｒを算出する時間間隔Ｔｗを設定するレジスタと、この時間間隔Ｔｗを計算する場合の最小時間単位Ｔｉを設定するレジスタとから構成される。カーナビゲーション装置の経路案内は、自車の走行状態情報と自車の位置情報と案内すべき交差点の位置情報を判断要素として演算を行い、自車が案内すべき交差点に到達する時間を計算し経路案内を出力するタイミングを変化させることにより運転者に経路案内を伝える。それによって、曲がるべき交差点における道路の種類、道路の混雑状態に対応した経路誘導の確実性を向上させる。この方法では、単一のサービスが単一の音声通知だけを行う場合にはうまくいくが、複数のサービスを利用して複数の音声通知が発生する場合は考慮されていない。
特開平１０−４７９８４号公報 Japanese Unexamined Patent Publication No. 10-47984 (Patent Document 1) describes a car navigation device. The route guidance timing control device of the car navigation device calculates the time interval Tw, a register for setting the reference time Ts, a register for setting the time interval Tw for calculating the arrival time Tr to the intersection where the vehicle should turn, and the time interval Tw. And a register for setting the minimum time unit Ti. The route guidance of the car navigation device is calculated using the running state information of the own vehicle, the position information of the own vehicle, and the position information of the intersection to be guided as judgment elements, and calculates the time for the own vehicle to reach the intersection to be guided. The route guidance is transmitted to the driver by changing the timing of outputting the route guidance. Thereby, the reliability of route guidance corresponding to the type of road at the intersection to bend and the congestion state of the road is improved. This method works well when a single service performs only a single voice notification, but does not take into account the case where a plurality of voice notifications are generated using a plurality of services.
Japanese Patent Laid-Open No. 10-47984

特開２０００−３０４５５３号公報（特許文献２）には、路側用通信装置が記載されている。車載機としてのナビゲーション装置からサービス要求の指示入力がなされると、サービス要求が路側へ向けて出力され、路上機としての情報センタから遅延指示の応答があると、ナビゲーション装置は遅延時間内に応答時間が収まる他のサービスを検索し、出力し、その結果を受信する。遅延時間経過後に先に行ったサービス要求に対する応答を受信する。従って、路上機側で処理に時間を必要とするサービスが存在する場合であっても、その遅延時間内に他のサービスや情報の授受が可能となる。それによって、路側と移動体側との間において、より迅速に効率よく情報授受が行われる。この方法では複数のサービスを並列に利用することが可能になるが、システムで時間がかかる処理が発生するたびに別なサービスが動作してしまうので、サービスの中断が頻発してしまう。
特開２０００−３０４５５３号公報 Japanese Patent Laying-Open No. 2000-304553 (Patent Document 2) describes a roadside communication device. When a service request instruction is input from a navigation device as an in-vehicle device, the service request is output toward the roadside. When there is a delay instruction response from the information center as a road device, the navigation device responds within the delay time. Search for and output other services that will save time, and receive the results. A response to a service request made earlier after the delay time has elapsed is received. Therefore, even when there is a service that requires time for processing on the roadside device side, other services and information can be exchanged within the delay time. As a result, information is exchanged more quickly and efficiently between the road side and the moving body side. With this method, it is possible to use a plurality of services in parallel, but each time a process that takes time occurs in the system, another service operates, so service interruptions frequently occur.
JP 2000-304553 A

特開２００３−２０２２３３号公報（特許文献３）には、車両に搭載され、テキスト文の内容に基づいて音声合成を行う情報再生方法が記載されている。その情報再生方法は、車両速度および車両位置の状態情報を取得し、この状態情報に基づいて、制限時間を算出し、発声すべき内容のテキスト文のうち、当該制限時間以内で発声可能なテキスト文を、複数の発声内容のテキスト文を格納した所定の格納手段から選択し、その選択されたテキスト文から音声合成パラメータを生成し音声出力する。この方法では、制限時間内におさまる出力が選択されるので、一つの音声出力が中断されることはない。しかし、対話処理は必ずしも一つの通知だけで終了せず、ユーザとの複数回の応答によって構成される場合がある。この方法は、そのような対話のまとまりが中断されずに実行されることを保証するものではない。
特開２００３−２０２２３３号公報 Japanese Patent Laying-Open No. 2003-202233 (Patent Document 3) describes an information reproduction method that is mounted on a vehicle and performs speech synthesis based on the content of a text sentence. The information reproduction method acquires state information of the vehicle speed and the vehicle position, calculates a time limit based on the state information, and text that can be uttered within the time limit of the text sentence to be uttered. A sentence is selected from a predetermined storage means that stores a plurality of text sentences with utterance contents, and a speech synthesis parameter is generated from the selected text sentence and output as a voice. In this method, since an output that falls within the time limit is selected, one audio output is not interrupted. However, the dialogue process does not necessarily end with only one notification, and may be configured by multiple responses with the user. This method does not guarantee that such a batch of dialogs is executed without interruption.
JP 2003-202233 A

カーナビゲーション・システムまたは車両用のテレマティクス端末において、複数のサービスを切り替えながら並行して利用し、または必要に応じて複数のサービスが自動的に音声通知を行う場合、複数のサービスの音声出力の間に競合が生じることがある。即ち、ユーザが或る音声サービスを利用している間に、他のサービスが音声通知を行おうとして、双方のサービスが音声出力の権利を奪いあうことがある。例えば、ユーザが運転しながらレストラン検索を行っている間に、経路案内サービスから右折すべき交差点の音声案内が割り込んで通知されることがある。 In a car navigation system or a telematics terminal for vehicles, when multiple services are used in parallel while switching, or when multiple services automatically give voice notifications as necessary, during the audio output of multiple services May cause contention. That is, while a user is using a certain voice service, both services may take away the right of voice output when another service tries to make a voice notification. For example, while the user is searching for a restaurant while driving, the route guidance service may interrupt and notify the voice guidance of an intersection to turn right.

このような場合、ユーザに対してレストラン検索サービスが、検索条件作成のために検索対象を絞り込むための質問をした直後に、右左折案内を通知し、再び同じ質問を出して再度ユーザに音声入力を依頼する、といった手順で対話が進められる。従って、ユーザは、質問に対する回答を音声入力しようとしているときに、右左折案内の音声通知によって思考や回答入力を妨げられるので、レストラン検索サービスとの間で円滑な対話ができなくなる。割り込む音声通知を遅延させることができれば、問題を解決できることもある。しかし、高い優先度の右折や左折の案内のタイミングを遅らせると、経路案内として役に立たない。従って、高い優先度の音声案内を優先的に通知する必要がある。 In such a case, immediately after the restaurant search service asks the user a question for narrowing down the search target for creating the search condition, the right / left turn guidance is notified, and the same question is given again and voice input is made to the user again. Dialogue proceeds in the order of requesting. Therefore, when the user tries to input the answer to the question by voice, the voice notification of the left / right turn guidance prevents the user from thinking and inputting the answer, so that the user cannot have a smooth dialogue with the restaurant search service. If the interrupted voice notification can be delayed, the problem may be solved. However, if the timing of high-priority right or left turn guidance is delayed, it will not be useful as route guidance. Therefore, it is necessary to give priority to voice guidance with high priority.

発明者たちは、複数のサービスの音声処理（音声出力および／または音声入力）の優先度と、複数の音声処理に要する各時間長さとに従って、それぞれの音声処理のタイミングを決定すれば、適正なタイミングで音声処理が行われ、ユーザは円滑に複数のサービスを利用できる、と認識した。 If the inventors determine the timing of each voice processing according to the priority of voice processing (voice output and / or voice input) of a plurality of services and each time length required for the plurality of voice processing, Voice processing was performed at the timing, and the user recognized that multiple services could be used smoothly.

本発明の目的は、複数の音声処理の間の競合を防止することである。 An object of the present invention is to prevent conflicts between the plurality of audio processing.

本発明の別の目的は、ユーザに対して複数の音声処理をそれぞれの適正なタイミングで円滑に行うことである。 Another object of the present invention is smoothly that multiple voice processing at each proper timing for the user.

本発明の特徴によれば、音声対話機能を有する情報処理装置は、分類および優先度を有する複数のサービス・シナリオを格納するサービス・シナリオ格納手段と、そのサービス・シナリオ格納手段における高い優先度のサービス・シナリオに従って次の第１の音声処理のタイミングを推定する通知時間推定手段と、現時点からその第１の音声処理のタイミングまでの空き時間を推定する空き時間推定手段と、そのサービス・シナリオ格納手段における低い優先度のサービス・シナリオに従って音声出力とユーザによる入力を受け付ける処理とを含む第２の音声処理に要する時間長さを推定する対話時間推定手段と、その第２の音声処理のその推定された時間長さがその推定された空き時間未満である場合に、その空き時間にその低い優先度のサービス・シナリオに従ってその第２の音声処理を行う対話制御手段と、を具えている。 According to the features of the present invention, an information processing apparatus having a voice interaction function includes service scenario storage means for storing a plurality of service scenarios having classifications and priorities, and high priority in the service scenario storage means. Notification time estimating means for estimating the timing of the next first voice processing according to the service scenario, free time estimating means for estimating the free time from the present time to the timing of the first voice processing, and storing the service scenario Dialog time estimation means for estimating the time length required for the second voice processing including voice output and processing for accepting user input according to a low priority service scenario in the means, and the estimation of the second voice processing If the estimated duration is less than the estimated free time, the free time Dialogue control means for performing the second speech processing according-bis scenario, which comprises a.

本発明によれば、複数の音声処理の間の競合を防止することができ、ユーザに対して複数の音声処理をそれぞれの適正なタイミングで円滑に行うことができる。 According to the present invention, it is possible to prevent contention between a plurality of audio processing, it is smoothly that multiple voice processing at each proper timing for the user.

本発明の実施形態を、図面を参照して説明する。図面において、同様の構成要素には同じ参照番号が付されている。 Embodiments of the present invention will be described with reference to the drawings. In the drawings, similar components are given the same reference numerals.

図１は、本発明による、複数の音声サービス４００の音声処理即ち音声出力および音声入力を行う音声対話システム１２を有する情報処理装置または電子機器１０の構成を示している。図１のブロック図は、プロセッサ１００によって実行される音声対話処理のためのフロー図と見ることもできる。情報処理装置または電子機器１０は、例えばカーナビゲーション装置、カーナビゲーション機能を有する情報処理装置またはテレマティクス端末であってもよい。情報処理装置または電子機器１０は、プロセッサ１００、メモリ１０２、ディスプレイ１０４、スピーカおよびマイクロホンを含む音響装置１０６、キーおよびボタンを含む入力装置１０８を有する。典型的には、プロセッサ１００およびメモリ１０２上に音声対話システム１２が実装される。情報処理装置または電子機器１０は、例えば、カーナビゲーション、車両メンテナンス、レストラン検索、施設検索、電子メール、ニュース、オーディオ再生、等のサービス機能を実装している。これらのサービス機能はそれぞれの音声サービス４００を含んでいる。 FIG. 1 shows a configuration of an information processing apparatus or electronic device 10 having a voice interaction system 12 that performs voice processing, that is, voice output and voice input, of a plurality of voice services 400 according to the present invention. The block diagram of FIG. 1 can also be viewed as a flow diagram for voice interaction processing performed by the processor 100. The information processing apparatus or the electronic device 10 may be, for example, a car navigation apparatus, an information processing apparatus having a car navigation function, or a telematics terminal. The information processing apparatus or electronic device 10 includes a processor 100, a memory 102, a display 104, an acoustic device 106 including a speaker and a microphone, and an input device 108 including keys and buttons. Typically, a voice interaction system 12 is implemented on the processor 100 and the memory 102. The information processing apparatus or the electronic device 10 implements service functions such as car navigation, vehicle maintenance, restaurant search, facility search, e-mail, news, audio playback, and the like. These service functions include a respective voice service 400.

音声対話システム１２は、複数の音声サービス４００別に設けられていて音声通知を行うべき時刻および時間長さを推定する通知時刻推定部２０、或る時刻に高い優先度の音声通知を行う場合における現在からその音声通知までの残りの空き時間ｔ１を推定しさらに２つの高い優先度の音声通知の間の空き時間ｔ１を推定する空き時間推定部３０、複数の音声サービス４００における複数のサービス・シナリオのそれぞれの対話分類（例えば、通知、回答を要求する質問、回答を要求する質問および回答に対する確認通知）および優先度（例えば、０、１、２、．．．）を含むサービス・シナリオを格納するサービス・シナリオ格納部４０、音声を出力しようとしているサービス・シナリオの対話に要する時間ｔ２を推定する対話時間推定部５０、対話履歴情報、ユーザ応答時間履歴情報および受信テキストを格納する情報格納部６０、実行すべき音声処理を決定する対話制御部７０、入力処理部７２および出力処理部７４を含んでいる。対話時間推定部５０は、入力時間推定部５２および文字数カウンタ５４を含んでいる。構成要素２０、３０、５０、７０、７２および７４の機能は、メモリ１０２に格納されたプログラムに従ってプロセッサ１００上に実装することができる。 The voice dialogue system 12 is provided for each of a plurality of voice services 400, and a notification time estimation unit 20 that estimates a time and a length of time for voice notification. The voice dialogue system 12 presents a voice notification with a high priority at a certain time. A free time estimation unit 30 that estimates a remaining free time t1 from the voice notification to the voice notification and further estimates a free time t1 between two high priority voice notifications, and a plurality of service scenarios in the plurality of voice services 400. Stores service scenarios including each interaction classification (eg, notification, question requesting answer, question requesting answer and confirmation notice for answer) and priority (eg, 0, 1, 2,...) Service scenario storage unit 40, dialogue time estimation unit for estimating time t2 required for dialogue of a service scenario about to output voice 0, dialogue history information, information storage unit 60 that stores the user response time history data and incoming text, the dialogue control unit 70 which determines the sound processing to be executed, and includes an input processing unit 72 and the output processing section 74. The dialogue time estimation unit 50 includes an input time estimation unit 52 and a character number counter 54. The functions of the components 20, 30, 50, 70, 72, and 74 can be implemented on the processor 100 according to a program stored in the memory 102.

通知時刻推定部２０は、空き時間推定部３０とサービス・シナリオ格納部４０とに接続されている。対話時間推定部５０は、サービス・シナリオ格納部４０に接続されている。対話制御部７０は、サービス・シナリオ格納部４０、空き時間推定部３０および対話時間推定部５０に接続され、さらに入力処理部７２および出力処理部７４に接続されている。対話制御部７０は、空き時間推定部３０からの空き時間ｔ１および対話時間推定部５０からの対話時間ｔ２に基づいて、現在実行中の音声サービス４００のいずれのサービス・シナリオを実行すべきかを決定し、その決定されたサービス・シナリオに従って入力処理部７２および出力処理部７４を制御する。 The notification time estimation unit 20 is connected to the free time estimation unit 30 and the service scenario storage unit 40. The dialogue time estimation unit 50 is connected to the service scenario storage unit 40. The dialogue control unit 70 is connected to the service scenario storage unit 40, the free time estimation unit 30 and the dialogue time estimation unit 50, and is further connected to an input processing unit 72 and an output processing unit 74. The dialogue control unit 70 determines which service scenario of the voice service 400 currently being executed is to be executed based on the idle time t1 from the idle time estimation unit 30 and the dialogue time t2 from the dialogue time estimation unit 50. Then, the input processing unit 72 and the output processing unit 74 are controlled according to the determined service scenario.

一例として、並行して２つの音声サービスＡおよびＢが動作中であり、その２つの音声サービスの音声処理ａ、ｂは時間的に競合し得るものとする。サービス別の通知時刻推定部２０は、サービス・シナリオ格納部４０の音声サービスＡおよびＢのサービス・シナリオに基づいて、音声サービスＡの時間的に次に発声すべき高い優先度の音声通知ａを行うべき予測時刻ｔｐを推定する。空き時間推定部３０は、予測時刻ｔｐから現在の時刻ｔｃを減算して現在の空き時間ｔ１＝ｔｐ−ｔｃを算出する。対話時間推定部５０は、音声サービスＢにおける次に発声すべき低い優先度の音声処理ｂ（音声通知および入力）に要する時間ｔ２を算出する。その音声処理ｂは、優先度が低く、予測時刻ｔｐより早くまたは遅く実行してもよいものとする。その音声処理ｂは、サービス・シナリオに設定された対話分類（例えば、通知、回答を要求する質問、回答を要求する質問およびその回答に対する確認通知）に基づいて、対話処理単位で行われる。 As an example, it is assumed that two voice services A and B are operating in parallel, and the voice processes a and b of the two voice services can compete in time. The notification time estimation unit 20 for each service generates a high priority voice notification a to be uttered next in time for the voice service A based on the service scenarios of the voice services A and B in the service scenario storage unit 40. A predicted time tp to be performed is estimated. The available time estimation unit 30 calculates the current available time t1 = tp−tc by subtracting the current time tc from the predicted time tp. The dialogue time estimation unit 50 calculates a time t2 required for the voice processing b (voice notification and input) with the lower priority to be uttered next in the voice service B. The voice processing b has a low priority and may be executed earlier or later than the predicted time tp. The voice processing b is performed in units of dialogue processing based on dialogue classification (for example, notification, question for requesting an answer, question for requesting an answer, and confirmation notification for the answer) set in the service scenario.

例えば、音声サービスＢのサービス・シナリオの対話分類が「通知」の場合は、対話処理単位（まとまり）は、通知のテキストの音声合成出力処理である。例えば、音声サービスＢのサービス・シナリオの対話分類が「回答を要求する質問」の場合は、対話処理単位は、質問のテキスト・メッセージの音声合成出力処理と、それに対するユーザの回答メッセージの音声認識処理である。例えば、音声サービスＢのサービス・シナリオの対話分類が「回答を要求する質問および回答に対する確認通知」の場合は、対話処理単位は、質問のテキスト・メッセージの音声合成出力処理と、それに対するユーザの回答メッセージの音声認識処理と、その回答に対する確認テキスト・メッセージの音声合成出力処理とである。 For example, when the dialogue classification of the service scenario of the voice service B is “notification”, the dialogue processing unit (group) is voice synthesis output processing of the text of the notification. For example, when the dialogue classification of the service scenario of the voice service B is “question that requires an answer”, the dialogue processing unit is a voice synthesis output process of a question text message and a voice recognition of the answer message of the user corresponding thereto. It is processing. For example, when the dialogue classification of the service scenario of the voice service B is “question for requesting an answer and a confirmation notification for the answer”, the dialogue processing unit includes the voice synthesis output processing of the text message of the question and the user's A voice recognition process for the answer message and a voice synthesis output process for the confirmation text message for the answer.

テキスト音声の発声に要する時間は、その文字数またはモーラ数に基づいて計算することができる。回答メッセージに要する時間は、ユーザの応答時間履歴情報、特定の質問に対する一般的なユーザの応答時間データ、サービス・シナリオに予め設定した予測のユーザ応答時間、または可能性ある最長の応答テキスト・メッセージの文字数またはモーラ数に基づいて計算することができる。 The time required for uttering the text voice can be calculated based on the number of characters or the number of mora. The time required for the response message can be the user response time history information, general user response time data for a particular question, the predicted user response time preset in the service scenario, or the longest possible response text message Can be calculated based on the number of characters or the number of mora.

対話制御部７０は、サービス・シナリオ格納部４０からのサービス・シナリオに基づいて音声サービスＡにおける高い優先度の音声通知ａまでの空き時間ｔ１と、音声サービスＢにおける低い優先度の音声処理ｂに要する音声処理時間ｔ２とを比較する。音声処理時間ｔ２＜空き時間ｔ１の関係であれば、対話制御部７０は、入力部１２および／または出力部１４を用いて、直ちに音声サービスＢの低い優先度の音声処理ｂを行う。一方、音声処理時間ｔ２≧空き時間ｔ１であれば、対話制御部７０は、音声サービスＢの低い優先度の音声処理ｂを抑止して、直ちにまたは予定された時刻に先に音声サービスＡにおける音声通知ａを行わせる。その後、音声対話システム１２は、音声処理時間ｔ２＜空き時間ｔ１となって音声サービスＢの低い優先度の音声処理ｂが実行されるまで、同様の動作を繰り返す。 The dialogue control unit 70 sets the idle time t1 until the voice notification a with high priority in the voice service A based on the service scenario from the service scenario storage unit 40 and the voice processing b with low priority in the voice service B. The required audio processing time t2 is compared. If the relation of voice processing time t2 <free time t1 is satisfied, the dialogue control unit 70 immediately performs voice processing b with a low priority of the voice service B using the input unit 12 and / or the output unit 14. On the other hand, if the voice processing time t2 ≧ the idle time t1, the dialogue control unit 70 suppresses the voice processing b with a low priority of the voice service B, and immediately or at the scheduled time, the voice in the voice service A first. Notification a is performed. Thereafter, the voice interaction system 12 repeats the same operation until the voice processing time t2 <the idle time t1 and the voice processing b with a lower priority of the voice service B is executed.

並行して３つ以上の音声サービスＡ、Ｂ、Ｃ、．．．を実行する場合にも、同様にそれぞれの音声処理の優先度に従って、音声処理ａ、ｂ、ｃ、．．．を実行することができる。音声サービスＡの音声処理ａが最も高い優先度を有し、音声サービスＢの音声処理ｂが次の高い優先度を有し、音声サービスＣの音声処理ｃが最も低い優先度を有するものとする。この場合、音声サービスＡの音声処理ａは優先的に実行される。 In parallel, three or more voice services A, B, C,. . . In the same manner, the voice processing a, b, c,. . . Can be executed. Assume that voice processing a of voice service A has the highest priority, voice processing b of voice service B has the next highest priority, and voice processing c of voice service C has the lowest priority. . In this case, the voice processing a of the voice service A is executed with priority.

ユーザが音声サービスＢを使用している場合、対話制御部７０は、音声サービスＢの音声処理ｂに要する時間ｔ２を、音声サービスＡの最初の音声処理ａの予測時刻ｔｐの前の空き時間ｔ１と比較し、音声処理時間ｔ２＜空き時間ｔ１の場合に、その空き時間ｔ１に音声サービスＢの音声処理ｂを実行させる。対話制御部７０は、音声処理時間ｔ２≧空き時間ｔ１の場合には、音声サービスＢの音声処理ｂを抑止（禁止）し、先に音声サービスＡの音声処理ａを実行させる。その後、音声処理時間ｔ２＜空き時間ｔ１となって音声サービスＢの音声処理ｂが実行されるまで、その手順が繰り返される。 When the user uses the voice service B, the dialogue control unit 70 sets the time t2 required for the voice processing b of the voice service B to the free time t1 before the predicted time tp of the first voice processing a of the voice service A. If the voice processing time t2 <the idle time t1, the voice processing b of the voice service B is executed during the idle time t1. When the voice processing time t2 ≧ the idle time t1, the dialogue control unit 70 inhibits (prohibits) the voice processing b of the voice service B and first executes the voice processing a of the voice service A. Thereafter, the procedure is repeated until the voice processing b of the voice service B is executed because the voice processing time t2 <the idle time t1.

ユーザが音声サービスＣを使用している場合、対話制御部７０は、音声サービスＣの音声処理ｃに要する時間ｔ２を、音声サービスＡおよびＢの最初の音声処理ａまたはｂの予測時刻ｔｐの前の空き時間ｔ１と比較し、音声処理時間ｔ２＜空き時間ｔ１の場合に、その空き時間ｔ１に音声サービスＣの音声処理ｃを実行させる。対話制御部７０は、音声処理時間ｔ２≧空き時間ｔ１の場合には、音声サービスＣの音声処理ｃを抑止し、先に音声サービスＡまたはＢの最初の音声処理ａまたはｂを実行させる。その後、音声処理時間ｔ２＜空き時間ｔ１となって音声サービスＣの音声処理ｃが実行されるまで、その手順が繰り返される。 When the user uses the voice service C, the dialogue control unit 70 sets the time t2 required for the voice processing c of the voice service C before the predicted time tp of the first voice processing a or b of the voice services A and B. When the voice processing time t2 <the idle time t1, the voice processing c of the voice service C is executed during the idle time t1. When the voice processing time t2 ≧ the idle time t1, the dialogue control unit 70 suppresses the voice processing c of the voice service C and first executes the first voice processing a or b of the voice service A or B. Thereafter, the procedure is repeated until the voice processing c of the voice service C is executed because the voice processing time t2 <the idle time t1.

図２は、本発明の実施形態による、カーナビゲーション・システムの経路案内サービス４０２および電子メール・サービス４０４の音声処理すなわち音声出力および入力を行う音声対話システム１２を有する情報処理装置または電子機器１０の構成を示している。この場合、車両の位置と、地図データベース、ウェブ・ページまたはテキスト放送の交通情報とによって得られた周囲の環境とに基づいて、経路案内サービス４０２による高い優先度の音声通知が予測される。 FIG. 2 illustrates an information processing apparatus or electronic device 10 having a voice interaction system 12 that performs voice processing, that is, voice output and input, of a route guidance service 402 and an e-mail service 404 of a car navigation system according to an embodiment of the present invention. The configuration is shown. In this case, high-priority voice notification by the route guidance service 402 is predicted based on the position of the vehicle and the surrounding environment obtained from the map database, web page, or text broadcast traffic information.

情報処理装置または電子機器１０は、カーナビゲーション・システム情報格納部８０を有する。カーナビゲーション・システム情報格納部８０は、経路情報８２、現在位置８４および地図データベース８６を格納している。通知時刻推定部２０は次案内時刻推定部２２を含んでいる。次案内時刻推定部２２は、カーナビゲーション・システム情報格納部８０および車両速度センサ１１２に接続されている。情報処理装置または電子機器１０のその他の構成は図１と同様である。 The information processing apparatus or electronic device 10 includes a car navigation system information storage unit 80. The car navigation system information storage unit 80 stores route information 82, a current position 84, and a map database 86. The notification time estimation unit 20 includes a next guidance time estimation unit 22. The next guidance time estimation unit 22 is connected to the car navigation system information storage unit 80 and the vehicle speed sensor 112. Other configurations of the information processing apparatus or the electronic device 10 are the same as those in FIG.

経路案内サービス４０２は、優先的に音声通知をすべき事項として、渋滞警告、カーブ進入時の速度警告、交通事故多発区間の警告、速度取締区間の警告、路面凍結警告、落下物警告、等を含んでいてもよい。 The route guidance service 402 gives priority to voice notification, such as a traffic jam warning, a speed warning when entering a curve, a warning of a traffic accident frequent section, a warning of a speed control section, a road surface freezing warning, a falling object warning, etc. May be included.

次案内時刻推定部２２は、経路案内サービス４０２について、例えば、車両速度センサ１１２によって検出された車両速度と、カーナビゲーション・システム情報格納部８０内の経路情報８２とから経路中の通過点（例えば交差点）への到達時刻を推定する。次案内時刻推定部２２は、現在予測される運転状況に基づいて、高い優先度の経路案内サービス４０２の音声通知のイベントを発生すべき予測時刻ｔｐを推定する。空き時間推定部３０は、予測時刻ｔｐから現在の時刻ｔｃを減算して現在の空き時間ｔ１＝ｔｐ−ｔｃを算出する。対話時間推定部５０は、低い優先度の電子メール・サービス４０４の音声処理のイベントを発生するのに要する時間ｔ２を推定する。 For the route guidance service 402, the next guidance time estimation unit 22 uses, for example, a passing point in the route (for example, from the vehicle speed detected by the vehicle speed sensor 112 and the route information 82 in the car navigation system information storage unit 80). Estimate the arrival time to the intersection). The next guidance time estimation unit 22 estimates a predicted time tp at which a voice notification event of the route guidance service 402 with a high priority should be generated based on the currently predicted driving situation. The available time estimation unit 30 calculates the current available time t1 = tp−tc by subtracting the current time tc from the predicted time tp. The dialogue time estimation unit 50 estimates a time t2 required to generate a voice processing event of the low priority electronic mail service 404.

通知時刻推定は、計算負荷を減少させるために、所定の条件を満たす場合にだけ実行されるようにしてもよい。例えば、経路案内サービスについての通知時刻推定は、カーナビゲーション・システムにおいて経路が設定されている場合または走行中にだけ実行されるようにすることができる。 The notification time estimation may be executed only when a predetermined condition is satisfied in order to reduce the calculation load. For example, the notification time estimation for the route guidance service may be executed only when a route is set in the car navigation system or during traveling.

図３の（Ａ）、（Ｂ）および（Ｃ）は、図２の情報処理装置または電子機器１０において同時にカーナビゲーション・システムの経路案内サービス４０２および電子メール・サービス４０４を利用する場合における運転状況の予測および経路案内の予想時刻を示している。音声対話システム１２は、その運転状況の予測に基づいて、高い優先度の経路案内サービス４０２による音声通知と、低い優先度の電子メール・サービス４０４による音声案内および入力指示とを行う。 FIGS. 3A, 3B, and 3C show driving situations when the route guidance service 402 and the e-mail service 404 of the car navigation system are simultaneously used in the information processing apparatus or the electronic apparatus 10 of FIG. The forecast time of route guidance and route guidance is shown. The voice interaction system 12 performs voice notification by the route guidance service 402 with high priority and voice guidance and input instruction by the low priority electronic mail service 404 based on the prediction of the driving situation.

図３（Ａ）上側の運転状況において、経路案内サービス４０２または通知時刻推定部２０は、カーナビゲーション・システム情報格納部８０から取得した目的地までの経路情報（８２）、地図上の周囲の状況（８６）および現在の位置（８４）、および車両速度センサ１１２から取得した車両速度に従って、時間軸に対して、運転状況として、長い時間の直進走行の後、停止して右折し、直進走行することを予測する。通知時刻推定部２０は、経路案内サービス４０２が、走行中、停車の直前にサービス・シナリオに従って「右方向です」という高い優先度の音声通知を発声させ、それを予告するためにその数秒前にサービス・シナリオに従って「この先○○を右です」という高い優先度の音声通知を発声させることを、予測する。経路案内サービス４０２の各音声通知の音声通知のサービス・シナリオには、対話分類と優先度とが付加されている。図３（Ａ）下側の経路案内予想時刻において、空き時間推定部３０は、現時点ｔｃにおける、斜線の陰影で示した「この先○○を右です」という最初の音声通知の予測時刻の前に存在する空き時間ｔ１を算出する。 In the driving situation on the upper side of FIG. 3A, the route guidance service 402 or the notification time estimation unit 20 obtains the route information (82) to the destination obtained from the car navigation system information storage unit 80, and the surrounding conditions on the map. According to (86), the current position (84), and the vehicle speed acquired from the vehicle speed sensor 112, as a driving situation with respect to the time axis, after traveling straight ahead for a long time, stop, turn right, and travel straight ahead Predict that. The notification time estimation unit 20 causes the route guidance service 402 to utter a high-priority voice notification “right” according to the service scenario immediately before stopping while driving, and a few seconds before that According to the service scenario, it is predicted that a high-priority voice notification “This is the right of XX is right” will be uttered. Dialog classification and priority are added to the voice notification service scenario of each voice notification of the route guidance service 402. In the estimated route guidance time on the lower side of FIG. 3 (A), the free time estimation unit 30 before the predicted time of the first voice notification of “This is the right of XX” indicated by the shaded area at the current time tc. An existing free time t1 is calculated.

図３（Ｂ）上側の運転状況において、予測とは違って、車両の走行中に横断歩道の信号機が赤信号になって、ドライバは停止線で車両を停止させ、それと同時に電子メール・サービス４０４を起動する。経路案内サービス４０２または通知時刻推定部２０は、その車両の停止の発生に従って運転状況の予測を修正する。電子メール・サービス４０４の電子メール送信のサービス・シナリオには、対話分類と優先度とが付加されている。この時、空き時間推定部３０は、現時点ｔｃにおける、経路案内サービス４０２による高い優先度の「この先○○を右です」という最初の音声通知の予測時刻ｔｐまでの空き時間ｔ１を算出する。対話時間推定部５０は、その文字数カウンタ５４と入力時間推定部５２を用いて、電子メール・サービス４０４の低い優先度の「送信先はどこですか？」という音声案内のモーラ数に対応する予測所要時間と、その回答としてのユーザによるキー入力または音声入力（「．．．」）およびその音声認識を行う予測所要時間との合計の予測音声処理時間ｔ２を算出する。入力時間推定部５２は、情報格納部６０内の対応するシナリオの対話履歴情報から想定される音声入力の最大文字数に対応する時間に付加時間を加えてその時間を推定してもよい。 In the driving situation on the upper side of FIG. 3B, unlike the prediction, the traffic light at the pedestrian crossing turns red while the vehicle is running, and the driver stops the vehicle at the stop line and at the same time, the e-mail service 404. Start up. The route guidance service 402 or the notification time estimation unit 20 corrects the prediction of the driving situation according to the occurrence of the stop of the vehicle. Dialogue classification and priority are added to the email transmission service scenario of the email service 404. At this time, the vacant time estimation unit 30 calculates a vacant time t1 until the predicted time tp of the first voice notification “This is the right of XX” on the route guidance service 402 at the current time tc. The dialogue time estimation unit 50 uses the character number counter 54 and the input time estimation unit 52 to predict the number of voice guidance mora of “Where is the destination?” With a low priority of the e-mail service 404. The total predicted speech processing time t2 of the time, the key input or voice input (“...”) By the user as the answer and the estimated required time for performing the speech recognition is calculated. The input time estimation unit 52 may estimate the time by adding an additional time to the time corresponding to the maximum number of characters of voice input assumed from the dialogue history information of the corresponding scenario in the information storage unit 60.

対話制御部７０は、低い優先度の音声処理の予測時間ｔ２を、高い優先度の音声処理の空き時間ｔ１と比較する。予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、電子メール・サービス４０４による「送信先はどこですか？」という低い優先度の音声案内を発声させ、その後で確保された所定時間だけ入力処理部１４を介したユーザによる音響装置１０６を介した音声入力または入力装置１０８を介したキー操作を待つ。図３（Ｂ）の場合、予測時間ｔ２が空き時間ｔ１未満なので、対話制御部７０は、入力処理部１２および出力処理部１４を用いてその音声案内と、ユーザ音声入力およびその音声認識を行わせる。ユーザによる音声入力またはキー操作があれば、入力処理部７２はそれに従って音声認識を含む入力処理を行い、対話制御部７０は次の対話制御を行う。一方、電子メール・サービスの予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）場合には、経路案内サービス４０２によって、先に「この先○○を右です」という高い優先度の音声通知を発声させる。その後、予測時間ｔ２が次の空き時間ｔ１未満となるまで、上述の処理を繰り返す。 The dialogue control unit 70 compares the predicted time t2 of the low priority voice processing with the idle time t1 of the high priority voice processing. When the predicted time t2 is less than the free time t1 (t2 <t1), the dialogue control unit 70 utters a low-priority voice guidance “Where is the destination?” By the e-mail service 404, and then The user waits for a voice input via the acoustic device 106 or a key operation via the input device 108 by the user via the input processing unit 14 for a predetermined time secured in step (b). In the case of FIG. 3B, since the predicted time t2 is less than the idle time t1, the dialogue control unit 70 performs voice guidance, user voice input, and voice recognition using the input processing unit 12 and the output processing unit 14. Make it. If there is a voice input or key operation by the user, the input processing unit 72 performs input processing including voice recognition in accordance with it, and the dialog control unit 70 performs the next dialog control. On the other hand, when the estimated time t2 of the e-mail service is equal to or greater than the free time t1 (t2 ≧ t1), the route guidance service 402 first sends a voice notification with a high priority of “This is the right to the right”. Speak. Thereafter, the above-described processing is repeated until the predicted time t2 becomes less than the next free time t1.

図３（Ｃ）上側の運転状況において、ドライバは、上述の停止の後、走行を再開する。経路案内サービス４０２または通知時刻推定部２０は、その走行の再開の発生に従って運転状況の予測を修正する。車両の走行中に、ドライバは、電子メール・サービス４０４において、低い優先度の受信電子メールの音声による読み上げを行うよう入力装置１０８を操作する。電子メール・サービス４０４の電子メール読み上げのサービス・シナリオには、対話分類と優先度とが付加されている。この時、空き時間推定部３０は、「この先○○を右です」という高い優先度の音声通知までの空き時間ｔ１を予測する。対話時間推定部５０は、文字カウンタ５４を用いて、電子メールの文字数（モーラ数）に基づいて電子メールの音声による読み上げに要する予測時間ｔ２を算出する。 In the driving situation on the upper side of FIG. 3C, the driver resumes running after the above-mentioned stop. The route guidance service 402 or the notification time estimation unit 20 corrects the prediction of the driving situation according to the occurrence of the restart of the travel. While the vehicle is traveling, the driver operates the input device 108 in the e-mail service 404 so as to read out a low-priority received e-mail by voice. Dialogue classification and priority are added to the e-mail reading service scenario of the e-mail service 404. At this time, the vacant time estimation unit 30 predicts the vacant time t1 until the voice notification with the high priority “This is the right of XX is right”. The dialogue time estimation unit 50 uses the character counter 54 to calculate an estimated time t2 required for reading out the voice of the email based on the number of characters (number of mora) of the email.

対話制御部７０は、低い優先度の音声処理の予測時間ｔ２を、高い優先度の音声処理の空き時間ｔ１と比較する。電子メール・サービスの予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、出力処理部１４を用いて電子メールの読み上げを行わせる。電子メールの読み上げの予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）の場合には、対話制御部７０は、電子メール・サービス４０４による電子メールの読み上げ動作を抑止し、直ちにまたは予定された所定の時刻に経路案内サービス４０２によって次の「この先○○を右です」という高い優先度の音声通知を発声させる。 The dialogue control unit 70 compares the predicted time t2 of the low priority voice processing with the idle time t1 of the high priority voice processing. When the predicted time t2 of the electronic mail service is less than the free time t1 (t2 <t1), the dialogue control unit 70 causes the output processing unit 14 to read out the electronic mail. When the predicted reading time t2 of the e-mail is equal to or longer than the free time t1 (t2 ≧ t1), the dialogue control unit 70 suppresses the reading operation of the e-mail by the e-mail service 404 and is immediately or scheduled. At the predetermined time, the route guidance service 402 causes the next high-priority voice notification “This is the right of XX” to be uttered.

その発声の時点において、経路案内サービス４０２の次の「右方向です」という通知までの新たな空き時間ｔ１は図３のｔ１’の値となる。対話制御部７０は、電子メールの音声による読み上げに要する予測時間ｔ２を空き時間ｔ１と比較する。電子メール・サービスの予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、電子メール・サービス４０４に電子メールの読み上げを行わせる。電子メールの読み上げの予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）の場合には、対話制御部７０は、電子メール・サービス４０４による電子メールの読み上げ動作を抑止し、直ちにまたは予定された所定の時刻に経路案内サービス４０２によって次の「右方向です」という高い優先度の音声通知を発声させる。その後、予測時間ｔ２が次の空き時間ｔ１未満となるまで、上述と同様の処理を繰り返す。 At the time of the utterance, the new idle time t1 until the next “right direction” notification of the route guidance service 402 becomes the value of t1 ′ in FIG. The dialogue control unit 70 compares the estimated time t2 required for reading out the voice of the e-mail with the idle time t1. When the estimated time t2 of the electronic mail service is less than the free time t1 (t2 <t1), the dialogue control unit 70 causes the electronic mail service 404 to read out the electronic mail. When the predicted reading time t2 of the e-mail is equal to or longer than the free time t1 (t2 ≧ t1), the dialogue control unit 70 suppresses the reading operation of the e-mail by the e-mail service 404 and is immediately or scheduled. At a predetermined time, the route guidance service 402 causes the next high-priority voice notification “to the right” to be uttered. Thereafter, the same processing as described above is repeated until the predicted time t2 becomes less than the next free time t1.

図４は、本発明の実施形態による、燃料警告サービス４０６および近所の施設検索サービス４０８の音声出力および音声入力を行う音声対話システム１２を有する情報処理装置または電子機器１０の構成を示している。この場合、車両の各センサの検知情報に基づいて、燃料警告サービス４０６等による高い優先度の警告の発生が予測される。 FIG. 4 shows a configuration of the information processing apparatus or electronic device 10 having the voice interaction system 12 that performs voice output and voice input of the fuel warning service 406 and the nearby facility search service 408 according to the embodiment of the present invention. In this case, the occurrence of a high priority warning by the fuel warning service 406 or the like is predicted based on the detection information of each sensor of the vehicle.

通知時刻推定部２０は、燃料センサ１１４に接続された燃料減少率推定部２３と、燃料減少率推定部２３に接続された燃料警告時刻推定部２４とを含んでいる。燃料警告サービス４０６は、燃料センサ１１４を周期的にモニタして、燃料の残量が所定の閾値より少なくなった場合に、サービス・シナリオに従って「燃料を補給して下さい」という高い優先度の音声通知を行う。燃料警告サービス４０６の音声警告のサービス・シナリオには、対話分類と優先度とが付加されている。燃料センサ１１４は、燃料の残量を燃料減少率推定部２３に供給する。燃料減少率推定部２３は、周期的に燃料の残量とその時刻とをメモリに格納し、周期的な各時間における燃料残量の変化に基づいて、燃料の残量が或る所定の閾値より低くなった場合に、燃料減少率を算出して燃料警告時刻推定部２４に供給する。燃料警告時刻推定部２４は、現在の燃料の残量とその燃料減少率とに基づいて、燃料が別の所定の閾値より少なくなって燃料の減少または燃料補給の必要性を警告すべき時刻ｔｐを推定する。近所の施設検索サービス４０８は、入力装置１０８を用いて入力されたユーザの要求に従って、地図データベース８６またはインターネット上で近くの数件のガソリン・スタンドを検索する。 The notification time estimation unit 20 includes a fuel reduction rate estimation unit 23 connected to the fuel sensor 114 and a fuel warning time estimation unit 24 connected to the fuel reduction rate estimation unit 23. The fuel warning service 406 periodically monitors the fuel sensor 114, and when the remaining amount of fuel falls below a predetermined threshold, a high priority voice “please refuel” according to the service scenario. Make a notification. Dialogue classification and priority are added to the voice warning service scenario of the fuel warning service 406. The fuel sensor 114 supplies the remaining amount of fuel to the fuel decrease rate estimation unit 23. The fuel reduction rate estimation unit 23 periodically stores the remaining amount of fuel and its time in a memory, and based on the change in the remaining amount of fuel at each periodic time, the remaining amount of fuel is a predetermined threshold value. When it becomes lower, the fuel decrease rate is calculated and supplied to the fuel warning time estimation unit 24. Based on the current remaining fuel amount and the fuel reduction rate, the fuel warning time estimation unit 24 warns the time when the fuel becomes less than another predetermined threshold and warns the necessity of fuel reduction or fuel replenishment. Is estimated. The neighborhood facility search service 408 searches the map database 86 or several nearby gas stations according to the user's request entered using the input device 108.

燃料警告サービス４０６の代わりにまたはそれに加えて、各種のセンサに基づく、オイル交換時期の警告、タイヤ交換時期の警告、前方車間距離の警告、タイヤ空気圧の警告、緊急車両接近の警告、歩行者警告、二輪車接近の警告、等のための音声サービスを設けてもよい。 Instead of or in addition to the fuel warning service 406, based on various sensors, oil change time warning, tire change time warning, front inter-vehicle distance warning, tire air pressure warning, emergency vehicle approach warning, pedestrian warning Voice services for motorcycle approach warnings, etc. may be provided.

車両の走行中に、ドライバは、近所の施設検索サービス４０８において、低い優先度の近所の施設の検索を行うよう入力装置１０８を操作する。近所の施設検索サービス４０８のガソリン・スタンド選択のサービス・シナリオには、対話分類と優先度とが付加されている。空き時間推定部３０は、現時点ｔｃにおける、燃料警告サービス４０６による「燃料を補給して下さい」という音声通知の予測時刻ｔｐまでの空き時間ｔ１を算出する。近所の施設検索サービス４０８は、検索結果としての近所のガソリン・スタンドのリストを表示し、ユーザにその１つを選択させる。そのために、近所の施設検索サービス４０８は、「ガソリン・スタンドを選択して下さい」という低い優先度の音声案内のモーラ数に対応する予測所要時間と、ユーザによる音声入力およびその音声認識を行う予測所要時間との合計の予測音声処理時間ｔ２を算出する。 While the vehicle is traveling, the driver operates the input device 108 to search for a nearby facility with a low priority in the nearby facility search service 408. Dialog classification and priority are added to the service scenario for selecting a gas station in the neighborhood facility search service 408. The free time estimation unit 30 calculates the free time t1 until the predicted time tp of the voice notification “please replenish fuel” by the fuel warning service 406 at the current time tc. The neighborhood facility search service 408 displays a list of neighborhood gas stations as a search result and allows the user to select one. To that end, the neighborhood facility search service 408 predicts the estimated time required for the number of mora of low-priority voice guidance “Please select a gas station”, voice input by the user, and voice recognition thereof. The total predicted speech processing time t2 with the required time is calculated.

対話制御部７０は、予測時間ｔ２を空き時間ｔ１と比較する。予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、近所の施設検索サービス４０８による「ガソリン・スタンドを選択して下さい」という音声案内を発声させ、確保された所定時間だけ入力処理部１４を介したユーザによる音声入力またはキー入力を待つ。ユーザによる入力があれば、対話制御部７０は、それに従って処理を行う。近所の施設検索サービス４０８の予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）場合には、対話制御部７０は、「ガソリン・スタンドを選択して下さい」という音声案内の発声およびユーザの入力の双方を抑止し、直ちにまたは警告すべきタイミングで燃料警告サービス４０６によって、先に「燃料を補給して下さい」という高い優先度の音声通知を発声させる。その後、予測時間ｔ２が次の空き時間ｔ１未満となるまで、上述の処理を繰り返す。 The dialogue control unit 70 compares the predicted time t2 with the idle time t1. When the predicted time t2 is less than the free time t1 (t2 <t1), the dialogue control unit 70 utters a voice guidance “Please select a gas station” by the facility search service 408 in the neighborhood and is secured. The user waits for voice input or key input through the input processing unit 14 for a predetermined time. If there is an input by the user, the dialogue control unit 70 performs processing according to the input. When the estimated time t2 of the nearby facility search service 408 is equal to or greater than the free time t1 (t2 ≧ t1), the dialogue control unit 70 utters a voice guidance “Please select a gas station” and the user's Both of the inputs are suppressed, and a high-priority voice notification “please refuel” is uttered by the fuel warning service 406 immediately or at a timing to be warned. Thereafter, the above-described processing is repeated until the predicted time t2 becomes less than the next free time t1.

通知時刻推定部２０は、さらに、例えば車間距離警告サービスについて、ミリ波レーダなどによって検出された自己の車両と前方または後方の他の車両との間の距離とその減少率に基づいて危険車間距離に到達する時刻を推定してもよい。通知時刻推定部２０は、さらに、例えばオイル交換警告サービスおよびタイヤ摩擦警告サービスについて、前回のオイル交換およびタイヤ交換の日付から現在までの走行距離と、車両速度とに基づいてオイル交換警告およびタイヤ摩擦警告の時刻を推定してもよい。 The notification time estimation unit 20 further uses, for example, an inter-vehicle distance warning service based on the distance between the own vehicle detected by a millimeter wave radar or the like and another vehicle ahead or behind and the rate of decrease thereof. You may estimate the time to arrive at. The notification time estimation unit 20 further includes, for example, an oil change warning service and a tire friction warning service based on the travel distance from the date of the previous oil change and tire change to the current time and the vehicle speed, and the oil change warning and tire friction. The warning time may be estimated.

通知時刻推定は、計算負荷を減少させるために、所定の条件を満たす場合にだけ実行されるようにしてもよい。例えば、車間距離警告サービスについての通知時刻推定は、車間距離が１００ｍ以内の場合にだけ実行されるようにすることができる。 The notification time estimation may be executed only when a predetermined condition is satisfied in order to reduce the calculation load. For example, the notification time estimation for the inter-vehicle distance warning service can be executed only when the inter-vehicle distance is within 100 m.

図５は、本発明のさらに別の実施形態による、運転時間警告サービス４１０およびレストラン検索サービス４１２の音声出力および音声入力を行う音声対話システム１２を有する情報処理装置または電子機器１０の構成を示している。この場合、推定または検出されたドライバ情報に基づいて、運転時間警告サービス４１０等による高い優先度の警告の発生が予測される。 FIG. 5 shows a configuration of an information processing apparatus or electronic device 10 having a voice dialogue system 12 that performs voice output and voice input of a driving time warning service 410 and a restaurant search service 412 according to still another embodiment of the present invention. Yes. In this case, based on the estimated or detected driver information, the occurrence of a high priority warning by the driving time warning service 410 or the like is predicted.

通知時刻推定部２０は警告時刻推定部２６を含んでいる。警告時刻推定部２６は、運転時間を測定する運転時間タイマ１１６に接続されている。運転時間警告サービス４１０は、例えば１０分未満の連続的エンジン停止時間を除外して連続的に行われている運転時間ｔｄを検出して閾値ｔｈｄを超えたときにサービス・シナリオに従って「ｔｄ時間、連続運転をしています。休憩をとって下さい。」という高い優先度の警告を発声する。運転時間警告サービス４１０の音声警告のサービス・シナリオには、対話分類と優先度とが付加されている。警告時刻推定部２６は、運転時間タイマ１１から受信した連続運転時間ｔｄを閾値ｔｈｄ秒と比較して、それが閾値ｔｈｄを超えた場合に、連続運転を警告すべき時刻ｔｐを推定する。レストラン検索サービス４１２は、入力装置１０８を用いて入力されたユーザの要求に従って、地図データベース８６またはインターネット上で近くの数件のレストランを検索する。 The notification time estimation unit 20 includes a warning time estimation unit 26. The warning time estimation unit 26 is connected to an operation time timer 116 that measures the operation time. The operation time warning service 410 detects the operation time td that is continuously performed excluding the continuous engine stop time of, for example, less than 10 minutes, and when the threshold value thd is exceeded, “td time, “I am driving continuously. Please take a break.” Dialog classification and priority are added to the voice warning service scenario of the driving time warning service 410. The warning time estimation unit 26 compares the continuous operation time td received from the operation time timer 11 with the threshold thd seconds, and estimates the time tp at which continuous operation should be warned when it exceeds the threshold thd. The restaurant search service 412 searches several nearby restaurants on the map database 86 or the Internet in accordance with a user request input using the input device 108.

運転時間警告サービス４１０の代わりにまたはそれに加えて、ドライバ居眠り運転警告サービス、等のための音声サービスを設けてもよい。通知時刻推定部２０は、例えば居眠り警告サービスについて、ドライバ用の生体センサで生理的状態を検出し、居眠りの予兆から入眠の時刻を推定する。 Instead of or in addition to the driving time warning service 410, a voice service for a driver doze driving warning service, etc. may be provided. For example, for the dozing warning service, the notification time estimating unit 20 detects a physiological state with a driver biosensor, and estimates the time of falling asleep from a sign of dozing.

車両の走行中に、ドライバは、レストラン検索サービス４１２において、低い優先度のレストラン検索を行うよう入力装置１０８を操作する。レストラン検索サービス４１２のレストラン選択のサービス・シナリオには、対話分類と優先度とが付加されている。空き時間推定部３０は、運転時間警告サービス４１０による「ｔｄ時間、連続運転をしています。休憩をとって下さい。」という音声通知を行う予測時刻ｔｐまでの空き時間ｔ１を算出する。近所の施設検索サービス４０８は、検索結果として近所のレストランのリストを表示し、ユーザにその１つを選択させる。そのために、レストラン検索サービス４１２は、文字数カウンタ５４によって文字数（モーラ数）に基づいて決定された「レストランの種類を選択して下さい」または「レストランを選択して下さい」という音声案内の予測所要時間と、入力時間推定部５２によって情報格納部６０におけるユーザ応答履歴情報に基づいて決定されたユーザによる音声入力およびその音声認識を行う予測所要時間との合計の予測音声処理時間ｔ２を算出する。 While the vehicle is traveling, the driver operates the input device 108 to perform a restaurant search with a low priority in the restaurant search service 412. Dialogue classification and priority are added to the restaurant selection service scenario of the restaurant search service 412. The vacant time estimation unit 30 calculates the vacant time t1 up to the predicted time tp at which the voice notification “td time, continuous driving. Take a break.” By the driving time warning service 410 is given. The neighborhood facility search service 408 displays a list of nearby restaurants as a search result, and allows the user to select one. For this purpose, the restaurant search service 412 uses the character counter 54 based on the number of characters (number of mora) to determine the estimated time required for voice guidance “Please select a restaurant type” or “Please select a restaurant”. And a predicted speech processing time t2 that is the sum of the required time for speech input by the user and speech recognition determined by the input time estimation unit 52 based on the user response history information in the information storage unit 60 is calculated.

対話制御部７０は、予測時間ｔ２を空き時間ｔ１と比較する。予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、レストラン検索サービス４１２による「レストランの種類を選択して下さい」という音声案内を発声させ、確保された所定時間だけ入力処理部１４を介したユーザによる音声入力またはキー入力を待つ。ユーザによる入力があれば、対話制御部７０は、それに従って処理を行う。レストラン検索サービス４１２の予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）場合には、対話制御部７０は、レストラン検索サービス４１２の音声処理を抑止し、直ちにまたは警告すべきタイミングで運転時間警告サービス４１０によって先に「ｔｄ時間、連続運転をしています。休憩をとって下さい。」という高い優先度の音声通知を発声させる。その後、予測時間ｔ２が次の空き時間ｔ１未満となるまで、上述の処理を繰り返す。 The dialogue control unit 70 compares the predicted time t2 with the idle time t1. When the predicted time t2 is less than the vacant time t1 (t2 <t1), the dialogue control unit 70 utters a voice guidance “Please select the type of restaurant” by the restaurant search service 412 and the predetermined predetermined time is secured. Waiting for voice input or key input by the user via the input processing unit 14 for the time. If there is an input by the user, the dialogue control unit 70 performs processing according to the input. When the predicted time t2 of the restaurant search service 412 is equal to or greater than the vacant time t1 (t2 ≧ t1), the dialogue control unit 70 suppresses the voice processing of the restaurant search service 412 and immediately or at the timing to warn The warning service 410 first utters a high-priority voice notification “Td time, continuous operation. Take a break.” Thereafter, the above-described processing is repeated until the predicted time t2 becomes less than the next free time t1.

上述の実施形態によれば、例えば、図２の経路案内サービス４０２と図５のレストラン検索サービス４１２を組み合わせた場合、図２の経路案内サービス４０２の音声通知「この先、○○交差点です。」の前に短い空き時間しかない（ｔ２≧ｔ１）場合に、図５のレストラン検索サービス４１２の音声質問「希望するレストランの種類はなんですか？」を発声させようとするとき、それが抑止されて、次のように通知および音声処理が進行する。
−（抑止された質問：希望するレストランの種類はなんですか？）
−（経路案内）この先、○○交差点です。
−（遅延された質問）希望するレストランの種類はなんですか？
−（ユーザ回答）中華
−（確認通知）中華レストランですね。検索します。
従って、音声対話システム１２によって、レストラン検索に関する１つの音声処理単位（質問−回答−確認通知）の音声処理がが、中断されることがなく実行される。 According to the above-described embodiment, for example, when the route guidance service 402 in FIG. 2 and the restaurant search service 412 in FIG. 5 are combined, the voice notification of the route guidance service 402 in FIG. When there is only a short free time before (t2 ≧ t1), when the voice query “What type of restaurant is desired?” Of the restaurant search service 412 in FIG. Notification and voice processing proceed as follows.
-(Deterred question: What type of restaurant would you like?)
-(Route guidance) This is the XX intersection.
-(Delayed Question) What kind of restaurant would you like?
-(User response) Chinese-(Confirmation) Chinese restaurant. Search
Accordingly, the voice processing of one voice processing unit (question-answer-confirmation notification) related to restaurant search is executed by the voice dialogue system 12 without interruption.

図６は、本発明のさらに別の実施形態による、経路案内サービス４０２、オイル交換警告サービス４１４およびニュース・サービス４１６の音声出力および音声入力を行う音声対話システム１２を有する情報処理装置または電子機器１０の構成を示している。この場合、車両の位置と、地図データベース、ウェブ・ページまたはテキスト放送の交通情報とによって得られた周囲の環境とに基づいて、経路案内サービス４０２による最も高い優先度の音声通知の予測が行われ、さらに、車両の各センサの検知情報に基づいて、オイル交換警告サービス４１４等による次に高い優先度の警告の発生が予測される。 FIG. 6 shows an information processing apparatus or electronic device 10 having a voice interaction system 12 that performs voice output and voice input of a route guidance service 402, an oil change warning service 414, and a news service 416 according to still another embodiment of the present invention. The structure of is shown. In this case, the route guidance service 402 predicts the highest priority voice notification based on the position of the vehicle and the surrounding environment obtained from the map database, web page or text traffic information. In addition, based on the detection information of each sensor of the vehicle, the occurrence of the next highest priority warning by the oil change warning service 414 or the like is predicted.

オイル交換警告サービス４１４は、オイル・センサ１１８を周期的にモニタして、オイル劣化率が所定の閾値より高くなった場合に、サービス・シナリオに従って「オイルを交換して下さい」という音声通知を行う。オイル交換警告サービス４１４の音声警告のサービス・シナリオには、対話分類と優先度とが付加されている。通知時刻推定部２０は、図１の次案内時刻推定部２２と、オイル・センサ１１８に接続されたオイル劣化率推定部２７と、オイル劣化率推定部２７に接続されたオイル警告時刻推定部２８とを含んでいる。オイル・センサ１１８は、オイルの劣化率をオイル劣化率推定部２７に供給する。 The oil change warning service 414 periodically monitors the oil sensor 118 and, when the oil deterioration rate becomes higher than a predetermined threshold value, makes a voice notification “please change oil” according to the service scenario. . Dialogue classification and priority are added to the voice warning service scenario of the oil change warning service 414. The notification time estimation unit 20 includes a next guidance time estimation unit 22 shown in FIG. 1, an oil deterioration rate estimation unit 27 connected to the oil sensor 118, and an oil warning time estimation unit 28 connected to the oil deterioration rate estimation unit 27. Including. The oil sensor 118 supplies the oil deterioration rate to the oil deterioration rate estimation unit 27.

オイル劣化率推定部２７は、周期的にオイルの劣化率とその時刻とをメモリに格納し、周期的な各時間におけるオイル劣化率の変化に基づいて、オイルの劣化率が或る所定の閾値より高くなった場合に、オイル劣化率をオイル警告時刻推定部２８に供給する。 The oil deterioration rate estimation unit 27 periodically stores the oil deterioration rate and its time in a memory, and the oil deterioration rate is a predetermined threshold value based on the change of the oil deterioration rate at each periodic time. When it becomes higher, the oil deterioration rate is supplied to the oil warning time estimation unit 28.

車両の走行中に、ドライバは、ニュース・サービス４１６において、低い優先度のニュースの音声による読み上げを行うよう入力装置１０８を操作する。ニュース・サービス４１６は、入力されたユーザの要求に従って、受信したテキスト形式のニュースを読み上げる。ニュース・サービス４１６の音声による読み上げのサービス・シナリオには、対話分類と優先度とが付加されている。オイル警告時刻推定部２８は、現在のオイルの劣化率に基づいて、オイルの交換の必要性を警告すべき時刻ｔｐ１を推定する。経路案内サービス４０２については、図２に関して説明したのと同様に、次案内時刻推定部２２は、予測される運転状況に基づいて、音声通知すべき予測時刻ｔｐ２を推定する。空き時間推定部３０は、予測時刻ｔｐ１およびｔｐ２のうちのいずれか早い時刻ｔｐ＝ｔｐ１またはｔｐ２から現在の時刻ｔｃを減算して現在の空き時間ｔ１＝ｔｐ−ｔｃを算出する。対話時間推定部５０は、文字カウンタ５４を用いて、ニュースの文字数（モーラ数）に基づいてニュースの音声による読み上げに要する予測時間ｔ２を算出する。 While the vehicle is traveling, the driver operates the input device 108 in the news service 416 so as to read out the low priority news by voice. The news service 416 reads out the received text-format news in accordance with the input user request. Dialogue classification and priority are added to the voice service scenario of the news service 416. The oil warning time estimation unit 28 estimates a time tp1 at which warning of the necessity for oil replacement is to be made based on the current oil deterioration rate. For the route guidance service 402, as described with reference to FIG. 2, the next guidance time estimation unit 22 estimates the predicted time tp2 to be notified by voice based on the predicted driving situation. The available time estimation unit 30 calculates the current available time t1 = tp−tp by subtracting the current time tc from the earlier time tp = tp1 or tp2 of the predicted times tp1 and tp2. The dialogue time estimation unit 50 uses the character counter 54 to calculate a predicted time t2 required for reading the news by voice based on the number of news characters (number of mora).

対話制御部７０は、ニュースの読み上げの予測時間ｔ２を空き時間ｔ１と比較する。予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、出力処理部１４を用いてニュースの読み上げを行わせる。 The dialogue control unit 70 compares the predicted reading time t2 of the news with the free time t1. When the predicted time t2 is less than the free time t1 (t2 <t1), the dialogue control unit 70 uses the output processing unit 14 to read the news.

予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）場合には、対話制御部７０は、ニュース・サービス４１６によるニュースの読み上げを抑止し、直ちにまたは警告すべきタイミングで先に経路案内サービス４２の最も高い優先度の音声通知またはオイル交換警告サービス４１４の次に高い優先度の音声警告を発生させる。その後、予測時間ｔ２が次の空き時間ｔ１未満となるまで、上述の処理を繰り返す。 When the predicted time t2 is equal to or longer than the free time t1 (t2 ≧ t1), the dialogue control unit 70 suppresses reading of news by the news service 416 and immediately precedes the route guidance service 42 at a timing to be warned. The highest priority voice notification or the oil change warning service 414 next to the highest priority voice alert. Thereafter, the above-described processing is repeated until the predicted time t2 becomes less than the next free time t1.

さらに、経路案内サービス４２の最も高い優先度の音声通知の予測時刻における所要の時間期間と、オイル交換警告サービス４１４の次に高い優先度の音声警告の予測時刻における時間期間とに重なりが生じた場合に、対話制御部７０は、オイル交換警告サービス４１４の次に高い優先度の音声警告の予測時間ｔ２を、経路案内サービス４２の最も高い優先度の音声通知の空き時間ｔ１と比較する。予測時間ｔ２が空き時間ｔ１未満（ｔ２＜ｔ１）の場合には、対話制御部７０は、オイル交換警告サービス４１４による警告を行わせる。予測時間ｔ２が空き時間ｔ１以上である（ｔ２≧ｔ１）場合には、対話制御部７０は、オイル交換警告サービス４１４による警告を抑止し、直ちにまたは予定された所定の時刻に先に経路案内サービス４２の最も高い優先度の音声通知を行わせる。 Further, there is an overlap between the required time period at the predicted time of the highest priority voice notification of the route guidance service 42 and the time period at the predicted time of the next highest priority voice warning after the oil change warning service 414. In this case, the dialogue control unit 70 compares the predicted time t2 of the second highest priority voice warning after the oil change warning service 414 with the highest priority voice notification idle time t1 of the route guidance service 42. When the predicted time t2 is less than the free time t1 (t2 <t1), the dialogue control unit 70 makes a warning by the oil change warning service 414. When the predicted time t2 is equal to or longer than the free time t1 (t2 ≧ t1), the dialogue control unit 70 suppresses the warning by the oil change warning service 414 and immediately or at the predetermined time before the route guidance service. The voice notification with the highest priority of 42 is performed.

このように、本発明の実施形態によれば、実行中の複数の音声サービスに関して、それぞれの音声サービスの複数の音声入出力処理の間の競合を防止することができ、複数の音声入出力処理を、不適正なタイミングで中断されることなく、ユーザに対してそれぞれの適正なタイミングで行うことができ、ユーザの思考や音声入力または入力操作を妨げることがない。 As described above, according to the embodiment of the present invention, it is possible to prevent contention between a plurality of voice input / output processes of each voice service with respect to a plurality of voice services being executed. Can be performed at an appropriate timing for the user without being interrupted at an inappropriate timing, and the user's thoughts and voice input or input operation are not hindered.

以上説明した実施形態では音声サービスについて説明したが、本発明は、ユーザに対する異なる優先度の表示出力および操作入出処理の間の競合を防止するのに適用することもできる。 Although the voice service has been described in the above-described embodiment, the present invention can also be applied to prevent contention between display output and operation input / output processing of different priorities for the user.

以上説明した実施形態は典型例として挙げたに過ぎず、その各実施形態の構成要素を組み合わせること、その変形およびバリエーションは当業者にとって明らかであり、当業者であれば本発明の原理および請求の範囲に記載した発明の範囲を逸脱することなく上述の実施形態の種々の変形を行えることは明らかである。 The embodiments described above are merely given as typical examples, and it is obvious to those skilled in the art to combine the components of each embodiment, and variations and variations thereof will be apparent to those skilled in the art. Obviously, various modifications may be made to the above-described embodiments without departing from the scope of the invention as set forth in the scope.

以上の実施例を含む実施形態に関して、さらに以下の付記を開示する。
（付記１）音声対話機能を有する情報処理装置であって、
優先度を有する複数のサービス・シナリオを格納するサービス・シナリオ格納手段と、
前記サービス・シナリオ格納手段における高い優先度のサービス・シナリオに従って次の第１の音声処理のタイミングを推定する通知時間推定手段と、
現時点から前記第１の音声処理のタイミングまでの空き時間を推定する空き時間推定手段と、
前記サービス・シナリオ格納手段における低い優先度のサービス・シナリオに従って音声出力を含む第２の音声処理に要する時間長さを推定する対話時間推定手段と、
前記第２の音声処理の前記推定された時間長さが前記推定された空き時間未満である場合に、前記空き時間に前記低い優先度のサービス・シナリオに従って前記第２の音声処理を行う対話制御手段と、
を具えることを特徴とする、情報処理装置。
（付記２）前記第２の音声処理の前記推定された時間長さが前記推定された空き時間以上である場合に、前記対話制御手段は、前記低い優先度のサービス・シナリオに従った前記第２の音声処理を抑止して、前記高い優先度のサービス・シナリオに従って前記第１の音声処理を行うものであることを特徴とする、付記１に記載の情報処理装置。
（付記３）さらに、前記対話時間推定手段は、ユーザによる入力に要する時間長さを推定する入力時間推定手段と、テキストの文字数に基づいて音声出力する時間を推定する手段とを含むものであることを特徴とする、付記１に記載の情報処理装置。
（付記４）さらに情報格納部を具え、
前記対話時間推定手段は、前記情報格納部に格納された対話履歴情報、ユーザ応答時間履歴情報またはテキスト・データに基づいて、前記低い優先度のサービス・シナリオによる前記第２の音声処理に要する時間を推定するものであることを特徴とする、付記１に記載の情報処理装置。
（付記５）前記高い優先度のサービス・シナリオがカーナビゲーション・システムの経路案内サービスであり、
前記通知時間推定手段は、前記カーナビゲーション・システムの経路情報、現在位置情報および地図データベースに基づいて前記第１の音声処理のタイミングを推定するものであることを特徴とする、付記１に記載の情報処理装置。
（付記６）前記通知時間推定手段は、センサに接続されていて前記センサからの検出情報に基づいて前記第１の音声処理のタイミングを推定するものであることを特徴とする、付記１に記載の情報処理装置。
（付記７）前記第１の音声処理は音声通知を含み、前記第２の音声処理は、音声質問、ユーザの回答入力音声の音声認識および回答の音声確認を含むことを特徴とする、付記１に記載の情報処理装置。
（付記８）分類および優先度を有する複数のサービス・シナリオを格納するサービス・シナリオ格納手段を有し、音声対話機能を有する情報処理装置用の音声対話を行うプログラムであって、
前記サービス・シナリオ格納手段における高い優先度のサービス・シナリオに従って次の第１の音声処理のタイミングを推定するステップと、
現時点から前記第１の音声処理のタイミングまでの空き時間を推定するステップと、
前記サービス・シナリオ格納手段における低い優先度のサービス・シナリオに従って音声出力を含む第２の音声処理に要する時間長さを推定するステップと、
前記第２の音声処理の前記推定された時間長さが前記推定された空き時間未満である場合に、前記空き時間に前記低い優先度のサービス・シナリオに従って前記第２の音声処理を行うステップと、
を実行させるよう動作可能なプログラム。 Regarding the embodiment including the above examples, the following additional notes are further disclosed.
(Appendix 1) An information processing apparatus having a voice interaction function,
Service scenario storage means for storing a plurality of service scenarios having priorities;
Notification time estimating means for estimating the timing of the next first voice processing according to a service scenario with high priority in the service scenario storage means;
Free time estimation means for estimating free time from the current time to the timing of the first audio processing;
Dialogue time estimation means for estimating a time length required for the second voice processing including voice output in accordance with a low priority service scenario in the service scenario storage means;
Dialog control for performing the second voice processing in the idle time according to the low priority service scenario when the estimated duration of the second voice processing is less than the estimated idle time Means,
An information processing apparatus comprising:
(Supplementary note 2) When the estimated time length of the second voice processing is equal to or greater than the estimated idle time, the dialogue control means is configured to perform the first operation according to the low priority service scenario. 2. The information processing apparatus according to appendix 1, wherein the first voice processing is performed in accordance with the service scenario having a high priority while suppressing the second voice processing.
(Additional remark 3) Furthermore, the said dialog time estimation means includes the input time estimation means which estimates the time length required for a user's input, and the means which estimates the time to output voice based on the number of characters of a text. The information processing apparatus according to attachment 1, wherein the information processing apparatus is characterized.
(Supplementary Note 4) Further, an information storage unit is provided,
The dialogue time estimation means is a time required for the second voice processing by the low priority service scenario based on the dialogue history information, user response time history information or text data stored in the information storage unit. The information processing apparatus according to appendix 1, characterized in that
(Supplementary Note 5) The high priority service scenario is a route guidance service for a car navigation system,
The notification time estimation means estimates the timing of the first voice processing based on route information, current position information and a map database of the car navigation system. Information processing device.
(Supplementary Note 6) The supplementary note 1 is characterized in that the notification time estimation means is connected to a sensor and estimates the timing of the first voice processing based on detection information from the sensor. Information processing device.
(Supplementary note 7) The first voice processing includes voice notification, and the second voice processing includes voice question, voice recognition of a user's answer input voice, and voice confirmation of a reply. The information processing apparatus described in 1.
(Supplementary Note 8) A program for performing voice dialogue for an information processing apparatus having a service dialogue storage means for storing a plurality of service scenarios having classification and priority, and having a voice dialogue function,
Estimating the timing of the next first voice processing in accordance with a high priority service scenario in the service scenario storage means;
Estimating a free time from the current time to the timing of the first audio processing;
Estimating a length of time required for second audio processing including audio output according to a low priority service scenario in the service scenario storage means;
Performing the second audio processing in the idle time according to the low priority service scenario when the estimated duration of the second audio processing is less than the estimated idle time; ,
A program that can run to run.

図１は、本発明による、複数の音声サービスの音声出力および音声入力を行う音声対話システムを有する情報処理装置または電子機器の構成を示している。FIG. 1 shows a configuration of an information processing apparatus or an electronic apparatus having a voice dialogue system that performs voice output and voice input of a plurality of voice services according to the present invention. 図２は、本発明の実施形態による、カーナビゲーション・システムの経路案内サービスおよび電子メール・サービスの音声処理すなわち音声出力および音声入力を行う音声対話システムを有する情報処理装置または電子機器の構成を示している。FIG. 2 shows a configuration of an information processing apparatus or electronic device having a voice dialogue system for performing voice processing, that is, voice output and voice input of a route guidance service and an e-mail service of a car navigation system according to an embodiment of the present invention. ing. 図３の（Ａ）、（Ｂ）および（Ｃ）は、図２の情報処理装置または電子機器において同時にカーナビゲーション・システムの経路案内サービスおよび電子メール・サービスを利用する場合における運転状況の予測および経路案内の予想時刻を示している。(A), (B), and (C) of FIG. 3 are predictions of driving situations when the route guidance service and the e-mail service of the car navigation system are simultaneously used in the information processing apparatus or electronic device of FIG. The estimated time of route guidance is shown. 図４は、本発明の実施形態による、燃料警告サービスおよび近所の施設検索サービスの音声出力および音声入力を行う音声対話システムを有する情報処理装置または電子機器の構成を示している。FIG. 4 shows a configuration of an information processing apparatus or an electronic apparatus having a voice dialogue system that performs voice output and voice input of a fuel warning service and a nearby facility search service according to an embodiment of the present invention. 図５は、本発明のさらに別の実施形態による、運転時間警告サービスおよびレストラン検索サービスの音声出力および音声入力を行う音声対話システムを有する情報処理装置または電子機器の構成を示している。FIG. 5 shows a configuration of an information processing apparatus or electronic device having a voice interaction system for performing voice output and voice input of a driving time warning service and a restaurant search service according to still another embodiment of the present invention. 図６は、本発明のさらに別の実施形態による、経路案内サービス、オイル交換警告サービスおよびニュース・サービスの音声出力および音声入力を行う音声対話システムを有する情報処理装置または電子機器の構成を示している。FIG. 6 shows a configuration of an information processing apparatus or an electronic apparatus having a voice dialogue system that performs voice output and voice input of a route guidance service, an oil change warning service, and a news service according to still another embodiment of the present invention. Yes.

Explanation of symbols

１０情報処理装置または電子機器
１２音声対話システム
２０通知時刻推定部
３０空き時間推定部
４０サービス・シナリオ格納部
５０対話時間推定部
６０情報格納部
７０対話制御部
７２入力処理部
７４出力処理部 DESCRIPTION OF SYMBOLS 10 Information processing apparatus or electronic device 12 Voice dialog system 20 Notification time estimation part 30 Free time estimation part 40 Service scenario storage part 50 Dialog time estimation part 60 Information storage part 70 Dialog control part 72 Input processing part 74 Output processing part

Claims

An information processing apparatus having a voice interaction function,
Service scenario storage means for storing a plurality of service scenarios having priorities;
Notification time estimating means for estimating the timing of the next first voice processing according to a service scenario with high priority in the service scenario storage means;
Free time estimation means for estimating free time from the current time to the timing of the first audio processing;
Conversation time estimation means for estimating a time length required for second voice processing including voice output processing and processing for accepting input by a user in accordance with a low priority service scenario in the service scenario storage means;
Dialog control for performing the second voice processing in the idle time according to the low priority service scenario when the estimated duration of the second voice processing is less than the estimated idle time Means,
An information processing apparatus comprising:

When the estimated time length of the second voice processing is equal to or greater than the estimated idle time, the dialogue control means is configured to enable the second voice processing according to the low priority service scenario. The information processing apparatus according to claim 1, wherein the first voice processing is performed according to the high priority service scenario.

Further, the dialogue time estimation means includes an input time estimation means for estimating a time length required for input by a user, and a means for estimating a time for voice output based on the number of characters of text. The information processing apparatus according to claim 1 or 2 .

It also has an information storage unit,
The dialogue time estimation means is a time required for the second voice processing by the low priority service scenario based on the dialogue history information, user response time history information or text data stored in the information storage unit. and characterized in that to estimate the information processing apparatus according to any one of claims 1 to 3.

The high priority service scenario is a route guidance service for a car navigation system,
The notification time estimating means, the route information of the car navigation system, and characterized in that to estimate the timing of the first speech processing based on the current position information and the map database, according to claim 1 to 4 The information processing apparatus according to any one of the above.

6. The first voice processing includes voice notification, and the second voice processing includes voice question, voice recognition of a user's answer input voice, and voice confirmation of a reply. The information processing apparatus according to any one of the above.