JP3704925B2 - Mobile terminal device and medium recording voice output program thereof

Info

Publication number: JP3704925B2
Application number: JP33283897A
Authority: JP
Inventors: 崇柳澤; 雅信山下
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 1997-04-22
Filing date: 1997-12-03
Publication date: 2005-10-12
Anticipated expiration: 2017-12-03
Also published as: JPH116743A

Description

【０００１】
【発明の属する技術分野】
本発明は、ナビゲーション装置や電子メールの読み上げなどの音声出力を行う移動端末装置、その動作プログラムを記録した媒体に関する。
【０００２】
【従来の技術】
従来より、経路案内用ナビゲーション装置が知られており、この装置は車両の現在地を検出するＧＰＳ装置等の現在地検出装置と、検出した車両の現在地を地図上の位置として認識する地図データベースと、を有している。そして、これらを利用して、ディスプレイに地図及び現在地を表示して運転者の走行を案内する。また、ナビゲーション装置では、目的地を入力することで、目的地までの最適経路を探索する機能を有している。そして、目的地までの経路を設定した走行においては、右左折する交差点等の案内が必要な交差点（案内交差点）に接近した場合には、右左折についての案内を行う。ここで、この案内は、ディスプレイ上に交差点の拡大図を表示し、この拡大図において進行方向を示すことで行われると共に、音声による案内も行われる。運転者は、車両走行中は前方を注視しているため、音声による案内の方がわかりやすい。このため、音声による案内が普及してきている。このように、ナビゲーション装置などの移動端末において、音声出力が行われている。
【０００３】
なお、音声により進行方向についての案内を行うナビゲーション装置としては、例えば特開平２−２４９１００号公報に記載されたものがある。
【０００４】
このようなナビゲーション装置においては、音声案内をわかりやすく行うために交差点名称等を読み上げることが好ましい。例えば、案内交差点であるＡ交差点の５００ｍ手前において、「５００ｍ先で、Ａ交差点を左方向です。」等の音声案内が行われる。
【０００５】
この音声読み上げを行うためには、各交差点に関する音声データが必要となる。通常の地図データベースでは、全国の地図を記憶すると共に、案内に利用されるような交差点名称は全てテキストデータとして記憶されている。このため、このテキストデータから音声合成を行うことにより、交差点名称の音声データを作成し、音声案内を出力することが行われている。
【０００６】
しかし、このような音声合成は予め装置内に持っている５０音等の単位音声の組み合わせで読み上げ用の音声データを作成する。このため、交差点名称を普通に読み上げた場合と、そのイントネーション、アクセント、発音等が異なり、聞き取りにくいものとなってしまう。特に、運転中の運転者に対する案内では、上記の音声合成による案内では聞き取りやすさという点で十分でなく、わかりやすい案内音声が望まれる。
【０００７】
そこで、各交差点名称についての音声データを全て記憶しておくことが行われている。即ち、各交差点名称等について、実際に読み上げたデータを所定の符号化処理等を行い音声データとして地図データベースに記憶しておく。これにより、自然でわかりやすい音声案内を行うことができる。なお、音声データは、例えばＰＣＭ（パルス・コード・モジュレーション）などの符号化処理されたデータである。
【０００８】
【発明が解決しようとする課題】
しかしながら、従来の音声案内ナビゲーション装置には、以下に示す問題点がある。全ての音声データを記憶すると、そのデータ量が膨大なものになり、記憶媒体の容量が大きくなる。通常、地図データベースには、ＣＤ−ＲＯＭが利用されるが、通常の地図データの数倍の容量が音声データの記憶のために必要となる。
【０００９】
従って、音声の読み上げを自然でわかりやすく行えると共に、記憶媒体等の記憶容量を小さく抑えることができるナビゲーション装置及びそのシステムが要望されていた。
【００１０】
また、車載の端末装置により、電子メール等のやりとりを行ったり、センターから各種のデータの提供を受けたりするサービスも開始されている。このようなサービスにより得た情報も、音声で読み上げた方がよい場合も多い。このような場合の読み上げ音声についても自然でわかりやすい音声出力が望まれる。
【００１１】
本発明は上記問題点を解決することを課題としてなされたものであり、自然でわかりやすい音声出力が行えると共に、そのための記憶容量を小さく抑えることができる移動端末装置などを提供することを目的とする。
【００１２】
【課題を解決するための手段】
本発明は、音声出力のための音声データを外部データベースからネットワークを介して取得するデータ取得手段と、前記音声データに対応する言葉を含めて音声出力を行う音声出力手段と、テキストデータにより表現された単語について、テキストデータから音声合成して音声出力する音声合成手段と、を有し、前記音声合成手段により音声合成して出力した単語の使用頻度が所定以上になったときに、前記データ取得手段によりその単語について音声データを外部から取得することを特徴とする。このように、本発明では、外部から取得した音声データにより、各種音声を出力することができる。従って、移動端末装置において、大容量の音声データ記憶用のメモリを用意する必要がなくなる。また、音声合成により音声出力を行うのに比べ、自然な発声による音声出力が行える。そして、よく使用する単語についての音声データを移動端末内に補充することができる。
【００１３】
また、音声出力のための処理対象となる言葉のうち、よく使用される言葉についての音声データを蓄積した蓄積手段と、テキストデータで表現された単語について、テキスト合成で音声出力するテキスト合成手段と、を有し、蓄積手段からの音声データと、テキスト合成手段からの音声データとを合成して音声出力を行うことが好適である。これによれば、よく使用される単語についての音声データを取得して、この単語についてはテキスト合成でなく自然な発音で出力でき、全体としてわかりやすい音声出力ができる。また、音声データの記憶容量を十分少ないものにできる。
【００１５】
また、取得した音声データのうち、使用頻度が低いものについて、データ消去処理を行うことが好適である。このようにして、必要な音声データのみの記憶とすることができ、メモリの記憶容量を有効に使用することができる。
【００１６】
また、表示用のテキストデータの読み上げ音声を出力することが好適である。表示用のテキストデータを音声出力することで、観光案内や、電子メールなどの読み上げ出力を聞き易いものにできる。
【００１７】
また、ナビゲーション装置の案内音声を出力することが好適である。案内音声には、地名などたくさんの単語があり、これについて外部から提供を受けることによって、移動端末装置において必要なメモリ容量を減少することができる。そして、自然な発音を維持することができる。
【００１８】
また、地理的名称に関する音声データを外部から取得し、前記音声出力手段は、取得した音声データに基づき、前記地理的名称の読み上げ音声を含む音声案内を行うことが好適である。従って、経路案内において必要な地理的名称についてのを音声データを予め記憶しておくことが不要であり、データを記憶する記憶媒体等の容量を必要最小限に低減することができる。また、車両外部から受信した音声データに基づいて音声出力を得るので、読み上げ音声は自然なものとなる。このため、運転者にとって、自然で且つわかりやすい音声経路案内を行うことができる。
【００１９】
また、取得した地理的名称を、音声案内の進行方向案内文に加えて読み上げることが好適である。運転者は、進行方向案内について、地理的名称を含むメッセージで聞くことができ、進行方向を容易に認識することができる。
【００２０】
また、目的地に関するデータを車両外部に送信する送信手段を有し、前記データ取得手段は、現在地と前記目的地との間の経路に関する情報を音声データを車両外部から取得することが好適である。この場合においては、目的地に関するデータを車両外部に送信するので、現在地と目的地との間の地理的名称についての音声データを受信することができる。従って、受信する音声データについて、無駄な部分をなくすことができる。
【００２１】
また、取得する音声データは現在地及び進行方向に基づいて決定される所定範囲内にある情報であり、前記所定範囲は変更可能であることが好適である。例えば、送受信される音声データの容量に対応して前記所定範囲を決定することができる。
【００２２】
また、音声データを取得する範囲である所定範囲は、自車の走行履歴または過去の交通流を考慮して決められることを特徴とする。これらの場合には、目的地までの距離だけでなく、過去の交通状況や過去の自車の走行履歴を考慮して、走行経路を予測して必要な地理的名称を決定できる。そこで、必要な音声データを確実に得ることができる。
【００２３】
また、外部情報を受信する外部情報受信手段と、前記受信した外部情報中に含まれているコード化された音声データをデコードするデコード手段と、を有し、前記音声出力手段は、デコード手段で得られた音声データを用いて音声読み上げすることが好適である。このようにして、受信したデータの中の音声データをデコードしてそのまますぐに音声出力することができる。
【００２４】
また、前記音声データ受信手段と前記音声案内手段とは、携帯端末装置に搭載されていることが好適である。このように、携帯端末装置に搭載されていると、この携帯端末を使用して任意の場所で経路の設定等が行える。そして、音声データの記憶が不要であるので、携帯端末装置を小型、軽量化できる。さらに例えば、センターから地図データをもらう構成にすれば、地図データベース自体が不要になり、装置の小型化を一層図ることができる。
【００２５】
また、外部から取得される音声データは、サンプリング音声データであることが好適である。人の発声から得たサンプリング音声データを取得することで、人間の生の発声と同様の自然な音声を再生できる。
【００２６】
本発明は、移動体端末の動作を達成するためのプログラムを記憶した媒体に関する。なお、動作プログラムは、移動端末装置内のＲＯＭ、ＣＤ−ＲＯＭやハードディスクなどに記憶しておくことが好適である。また、ユーザはＣＤ−ＲＯＭを購入することで、新たなプログラムをナビＥＣＵにロードすることもできる。さらに、媒体は、ＣＤ−ＲＯＭに限らず、ＤＶＤやＦＤなど音声出力プログラムを記憶できるものであればどのような形式のものでもよく、通信で提供することも好適である。特に、センターが動作プログラムを移動端末装置に通信で提供することが好ましい。
【００２７】
【発明の実施の形態】
以下、本発明に好適な実施形態について、図面に基づいて説明する。
【００２８】
「基本形態１」
図１は、本発明に係るナビゲーション装置（移動端末）の基本的な形態を示すブロック図である。ナビゲーション装置２においては、ナビゲーションＥＣＵ（以下、ナビＥＣＵという）４に、モデム６、音声信号出力装置８、地図データベース１０、ＧＰＳ装置１２、音声データ記憶部１４、操作部１６及び表示部１８が接続されている。
【００２９】
モデム６は、自動車電話等の無線通信機２０に接続されており、車両外部から送られてくる音声データの受信や車両外部への情報の送信等に必要な変復調処理を行う。音声信号出力装置８には、スピーカ２２が接続されており、ナビＥＣＵ４からの信号に従い音声をスピーカ２２から出力して、運転者に対して経路案内を行う。なお、ナビＥＣＵ４は、その内部のＲＯＭに所定のプログラムを記憶しており、このプログラムを実行することによって、各種動作を達成する。このＲＯＭは、マスクＲＯＭでもよいが、ＥＥＰＲＯＭ等書き替え可能なものとすることが好ましい。この場合、地図データベース１０として利用されるＣＤ−ＲＯＭに動作プログラム（音声出力プログラム）を記憶しておき、このプログラムをナビＥＣＵ４内のＥＥＰＲＯＭ等にロードすることも好適である。これによって、ユーザはＣＤ−ＲＯＭを購入することで、新たなプログラムをナビＥＣＵ４にロードすることができ、ナビＥＣＵ４を新しいプログラムに基づいて動作させることができる。なお、媒体は、ＣＤ−ＲＯＭに限らず、ＤＶＤやＦＤなど音声出力プログラムを記憶できるものであればどのような形式のものでもよい。
【００３０】
ＧＰＳ装置１２は、複数の人工衛星からの電波を受信することで、現在地（緯度及び経度等）を検出する。そして、地図データベース１０を利用して、ＧＰＳ装置１２により検出された現在地が地図上の位置として認識される。音声データ記憶部１４では、車両外部から送られてくる音声データがナビＥＣＵ４を介して記憶される。なお、この音声データ記憶部１４としては、通常ＲＡＭが利用されるが、ＥＥＰＲＯＭ等でもよい。操作部１６は、ナビＥＣＵ４に対して各種データの入力処理等を行うために使用される。表示部１８は、操作部１６による各種データ入力のための表示、地図データベース１０及びＧＰＳ装置１２とによる地図上の現在位置情報の表示、メッセージの表示等を行う。
【００３１】
図２は、このナビゲーション装置２を利用したナビゲーションシステムを示す模式的概念図である。ナビゲーション装置２より経路データをセンター３０に送信すると、センター３０がこの経路を走行する際の経路案内に必要な音声データを作成し、ナビゲーション装置２に提供する。従って、ナビゲーション装置２において、音声データに基づく経路案内が行える。
【００３２】
上記のナビゲーション装置２及びそのシステムを使用して、音声データによる経路案内方法を以下に説明する。図４は、ナビゲーションシステムにおける動作を示すフローチャートであり、図３は経路案内の一例を説明するための図である。なお、このような処理は、ナビＥＣＵ４がその内部に記憶されているプログラムを実行することによって達成される。
【００３３】
図３において、最初に車両は位置Ｘにいる。この場合、図４のフローチャートで示すように、先ずＧＰＳ装置１２により位置Ｘを絶対位置（緯度及び経度）として検出すると共に、地域毎に設置されているセンター３０のうち、自車位置に最も近いセンター３０を地図データベース１０により探索する（Ｓ１０２）。また、地図データベース１０により地図上には、自車位置Ｘが地図上の位置として認識される。なお、現在地をセンター３０に送信し、現在地周辺の地図データを受け取り、地図上の自車位置Ｘを認識してもよい。
【００３４】
次に、操作部１６により目的地のデータを入力する。すると、ナビＥＣＵ４が地図データベース１０を利用して、現在地から目的地までの経路を探索し、経路データが作成される（Ｓ１０４）。そして、現在地から目的地までの経路上のこれから走行する予定の距離ａ（ｋｍ）分のリンク及びノードからなる経路データを無線通信機２０等を介してセンター３０に送信する（Ｓ１０６）。
【００３５】
なお、リンクとは、交差点毎に区切られた道路の１単位をいい、これらのリンク間の区切りである交差点をノードという。また、距離ａ（ｋｍ）は、センター３０側の処理速度、音声データ記憶部１４の容量、受信する音声データの容量等を考慮して決定される。例えば、１０ｋｍ等の固定距離でもよいが条件に応じて変更することが好適である。
【００３６】
続いて、センター３０において、経路上の走行予定距離ａ（ｋｍ）分のリンク及びノードからなる経路データを受信すると、この経路データに対応する交差点名称群を探索し、車両へ送信する（Ｓ１０８）。
【００３７】
この交差点名称群の探索について以下に詳細に説明する。交差点名称群Ｄ（Ｕ）はこの先、走行予定距離ａ（ｋｍ）にあるリンク及びノードについての経路データＵにより定められる抽出関数ｆ（Ｕ）により決定される。図５（ａ），（ｂ）は、それぞれ、各ノードに対して抽出関数ｆ（Ｕ）により決められた交差点名称を抽出する範囲を示す図である。例えば、図５（ａ）に示すように経路５２上の各ノード５０から半径ｂ（ｋｍ）以内の範囲にある交差点５４を全て抽出して、これらの交差点名称の集合を交差点名称群Ｄ（Ｕ）とする。また、他の例として、図５（ｂ）に示すように、経路５６上の各ノード５０から距離ｂ（ｋｍ）以内の交差点５４を全て抽出して、これらの交差点名称を交差点名称群Ｄ（Ｕ）とすることもできる。なお、上記距離ｂ（ｋｍ）は、センター３０側の処理速度、音声データ記憶部１４の容量、受信する音声データの容量等により変更可能である。
【００３８】
このように経路上の交差点名称だけでなく、ある範囲の交差点を取得することで、経路はずれの際の案内や方面案内等が可能になる。
【００３９】
そして、センター３０は、交差点名称の探索後、これらの交差点名称群Ｄ（Ｕ）をセンター３０から音声データとしてナビゲーション装置２へ送信する。
【００４０】
このように、音声データを受信した場合には、車両の音声データ記憶部１４において、受信した交差点名称群Ｄ（Ｕ）の音声データを記憶する（Ｓ１１０）。その後、経路案内が開始される（Ｓ１１２）。ここで、図３に示すように、車両は現在地Ｘから経路案内に基づき、走行を行う。そして、Ａ交差点に接近したとき、例えば、「３００ｍ先、Ａ交差点を左方向です。」等の音声が読み上げられて、経路案内が行われる。そして、Ｂ交差点に接近したときは、「３００ｍ先、Ｂ交差点を右方向です。」等の音声が読み上げられる。
【００４１】
ここで、車両が距離ａ（ｋｍ）だけ走行する間の案内に必要な交差点名称などの音声データは、車両において、センター３０から受信している。そこで、上述のような案内における交差点名Ａ、Ｂの音声データが、音声データ記憶部１４に記憶されている。そこで、案内における交差点名称を自然な発音で出力することができる。なお、その他の定型の案内音声は、地図データベース１０に、音声データが記憶されており、これを読み出して使用する。
【００４２】
そして、所定設定距離の走行毎に、自車が距離ａ（ｋｍ）から１ｋｍ手前の地点に達したか否かをステップＳ１１４にて判断する。ステップＳ１１４にて自車位置が所定距離ａ−１（ｋｍ）の地点に達しない場合には、ステップＳ１１２に戻り、音声案内を継続する。なお、この１ｋｍについては、この距離に限定せず、センター３０側の処理速度等により変更可能である。そして、距離ａ（ｋｍ）から１ｋｍ手前の地点に達した場合は、ステップＳ１１６にて、自車位置から目的地まで、１ｋｍ以内であるか否かを判定する。このステップＳ１１６にて、自車位置が目的地まで１ｋｍ以内に達していない場合には、ステップＳ１０２に戻り、上述の工程を繰り返し、現在地から目的地までの経路データを送信し、必要な音声データを受信して経路案内を行う。一方、目的地まで１ｋｍ以内である場合には、音声経路案内は終了し、車両はそのまま目的地まで走行する。
【００４３】
なお、センター３０から取得する音声データは、交差点名称の代わりに例えば、Ｆ市１２番地等の地理的名称や施設名称でもよく、この地理的名称と交差点名称とを合わせて音声データとしてもよい。更に、音声データは各自車状況を考慮して、経路案内に必要でない交差点名称群を含んでもよい。
【００４４】
このように、本実施の形態においては、交差点名称等に関する音声データをセンター３０から取得することができる。このため、音声データを予め記憶しておくことが不要である。そして、走行中の経路案内に必要な分に関する音声データだけを音声データ記憶部１４に記憶する。このため、音声データ記憶部１４の容量は小さなものでよい。また、センター３０から受信音声データを読み上げるので、音声データは自然なものとなる。このため、運転者にとって、自然で且つわかりやすい音声経路案内を行うことができる。
【００４５】
なお、本実施の形態においては、目的地の設定とこの目的地に関する経路の探索を車両側のナビゲーション装置２で行っていたが、本実施の形態においては、これに限定されない。即ち、車両側で、目的地を設定した後、この目的地に関するデータをセンター３０に送信し、センター３０側で現在地から目的地までの経路を探索して経路データを作成し、この経路データに基づいて交差点名称等の音声データを用意しても良い。
【００４６】
また、センター３０側が有する過去の交通状況データベースを使用して、経路案内に必要な交差点名称等を決定することもできる。通常、走行曜日、走行時間、天気等に応じて、道路上を走行する車両数は変化する。このため、これらの走行時間等による車両数の情報を経路毎に交通状況データベースに保存しておく。
そして、走行時間等の過去の交通状況データベースに基づいて、例えば経路案内時での走行時間において経路として選択する車両が多いと判定される経路についても、交差点名称等の音声データを用意する。過去の交通状況を考慮した経路の例を図６に示す。この図に示すように、経路ＸＸは最初に決定された経路であるが、この経路上のノードＧ，Ｈ等だけでなく、上記の過去の交通量等を考慮して考えられる経路ＹＹ上の交差点Ｍ，Ｎ等に関する交差点名称群等の音声データもセンター３０が送信対象とする。このため、走行状況に応じて、経路ＹＹを選択して走行した場合、この経路に関する音声案内を行うことも可能である。
【００４７】
さらに、自車の走行履歴、例えば、所定区間における異なる経路毎の自車の走行回数等を走行履歴データベースに記憶しておき、この走行履歴データベースに基づいて、対象とする交差点名称を決定することも可能である。この場合においては、車両側で、目的地までの経路を設定した後、過去の走行履歴を参考にして、走行頻度の高い経路も経路データに加える。従って、これらの複数の経路に基づく交差点名称についての音声データを得ることができる。
【００４８】
このようにして、過去の交通状況や過去の自車の走行履歴を考慮して音声データを入手することで、走行経路の変更に対応して、音声案内を行うことができる。
【００４９】
また、ナビゲーション装置２を、携帯端末装置に搭載することも好適である。
この場合、携帯端末装置を利用して経路の設定を行い、上述の場合と同様に、携帯電話やＰＨＳなどの電話を利用して経路及び最小限の音声データを得ておく。そして、走行中は、適宜詳細なデータをもらい、これを利用して、経路案内を行うことができる。図７は携帯端末装置を利用した音声案内を示す模式図である。車両の走行前に、例えば自宅において、目的地を入力する。そして、現在地をＧＰＳ装置１２により検出し、目的地及び現在地をセンター３０へ送信する。すると、センター３０では、送信された現在地及び目的地に基づき、最適経路の探索を行い、最適経路データ並びにこの経路上のノード及びリンクに対応する交差点名称群等の音声データ及び地図データを携帯情報端末に送信する。この際の音声データは最小限のものにしておくとよい。なお、現在地（走行開始位置）を入力するようにすれば、ＧＰＳ装置はなくても良い。また、上述の操作は、車両走行開始時に行ってもよい。特に、経路中に都市部等のＤＳＲＣ、ＰＨＳ等のアンテナが整備されている地域がある場合、この地域のデータは最小限に抑えるとよい。なお、ＤＳＲＣ（Dedicated Short Range Communication）では、光ビーコン等を利用する場合が多い。
【００５０】
そして、経路設定の終わった携帯端末装置を車両へ持ち込み、この携帯端末装置からの経路案内を受けながら、目的地に向けての走行が行われる。図７に示すように、都市部等のＤＳＲＣ、ＰＨＳ等のアンテナが整備されている地域を走行する場合には、ノード５０に接近する毎に、適時ＰＨＳ等によりセンター３０に電話をして、交差点から所定範囲内（例えば、円７０の範囲内）の交差点名称群Ｄ（Ｕ）を受信して、音声による経路案内を行う。なお、センター３０に電話をして、同時に詳細な地図データを随時もらうことで、予め記憶しておくデータ量を非常に少なくして、所望の経路案内を行うことができる。また、地図データを随時もらえるので、携帯端末装置に予め必要な地図データ量はさらに少なくてもよい。
【００５１】
このように、音声データを記憶する必要がなく、センター３０から地図データをもらう構成にすれば、地図データベース自体が不要になる。従って、装置の小型化を図ることができ、携帯端末装置をまとめることが容易である。さらに、携帯端末装置を利用すれば、任意の場所において、経路設定が行えるため、友人などとドライブの計画を話しながら、経路の設定などを行うこともできる。
【００５２】
さらに、携帯端末装置と車載のナビゲーション装置を組み合わせることも好適である。特に、車載のナビゲーション装置において、道路側のビーコンとの通信機器を設けておけば、走行中において必要なデータをこの通信機器により入手することもできる。
【００５３】
「実施形態」
図８に実施形態の構成を示す。この実施形態では、音声合成装置４０を有している。この音声合成装置４０は、ナビＥＣＵ４から供給されるテキストデータから音声合成し、スピーカ２２から合成音声を出力させる。従って、ナビＥＣＵ４は、音声データを出力する場合には、これに基づき音声信号出力装置８を介し、スピーカ２２から音声出力し、テキストデータを出力する場合には、音声合成装置４０を介し、スピーカ２２から音声出力する。また、固定メモリ４２は、音声データ記憶部１４に代えて設けられたものであり、ＥＥＰＲＯＭ等で構成されよく使用する単語（主要キーワード）についての音声データの供給を受け、これを固定的に記憶する。すなわち、通常の単語は音声合成によって出力するが、よく使う単語については、その音声データの提供を受けこれを固定メモリ４２に記憶しておく。従って、音声出力の際によく使われる単語については、供給を受けた音声データを利用して音声出力が行われるため、全体として理解しやすい音声出力が行える。なお、従来の地図データベース１０と同様に、経路案内において、通常使用する単語について、その音声データが記憶しておくことも好適である。これによって、これら単語については取得の必要がなくなる。
【００５４】
このように、本実施形態においては、よく使用する単語やフレーズについて音声データを取得し、これを固定メモリ４２に記憶する。この動作について、図９に基づいて説明する。
【００５５】
まず、テキストデータを受信したら、そのテキストデータの読み上げ処理を実行する（Ｓ２０２）。すなわち、受信したテキストデータについて、固定メモリ４２に記憶されている単語については、ここから読み出した音声データにより音声を出力し、固定メモリ４２に記憶されていない単語については、音声合成装置４０により音声合成を行う。次に、固定メモリ４２に記憶されていなかった単語があるかを判定する（Ｓ２０４）。この判定において、ＹＥＳ、すなわち記憶されていない単語があった場合には、該当する単語をテキストデータで記憶する（Ｓ２０６）。ここで、当該単語が、すでに記憶されていた場合には、その単語についてのカウント値を１インクリメントする。また、初めての単語については、カウント値１とともに、その単語を記憶する。なお、このデータは、ナビＥＣＵ４内のＲＡＭに記憶すればよい。
【００５６】
次に、記憶された単語について、そのカウント値が所定値（例えば、５回）を超えたものがあるかを判定する（Ｓ２０８）。そして、該当する単語については、センター３０に音声データの提供を要求し（Ｓ２１０）、センター３０から音声データの提供を受け、これを固定メモリ４２に記憶する（Ｓ２１２）。次に、固定メモリ４２の中で、過去所定期間（例えば、１年間）使用していない単語があるかを判定する（Ｓ２１４）。これは、単語毎に適当なタイムスタンプ（例えば、年、月を示すデータ）を記憶しておき、これをチェックすることで達成できる。そして、この判定でＹＥＳの場合には、当該単語を固定メモリ４２から削除することをアドバイスする（Ｓ２１６）。Ｓ２１６の処理を終了した場合、及びＳ２０４、Ｓ２０８、Ｓ２１４でＮＯであった場合には、処理を終了する。
【００５７】
なお、Ｓ２１４の単語削除のアドバイスの際には、「単語○○は、１年間使用されていません。削除をしますか」という表示を表示部１８に行い、「はい」または「いいえ」の入力を待ち、削除を行うか否かを決定するなどの方法が採用される。また、Ｓ２１０の音声データの要求の際にも、「単語○○について音声データを要求しますか」等という問い合わせをすることも好適である。
【００５８】
このようにして、テキストデータを記憶することで、使用頻度を検出し、使用頻度の高い単語について、自動的に固定メモリ４２に音声データを記憶し、使用頻度の低い単語については音声データを削除することができる。そこで、不要な音声データにより、固定メモリ４２が占められてしまうことを防止することができる。
【００５９】
なお、基本形態と同様にして、センター３０から所定の音声データの提供を常に受けておき、使用頻度の高いものについて、その音声データを固定メモリ４２に記憶することも好適である。この場合、出力する内容によっては、地図データベース１０、固定メモリ４２、及び音声データ記憶部１４からの音声データと、音声合成装置４０からの出力に基づいて音声出力が行われることになる。
【００６０】
さらに、上述のようなシステムにおいて、センター３０において、ユーザからの各単語についての音声データ要求回数をカウントしておき、所定回数に達した場合に、各ユーザに自動配信することもできる。すなわち、図１０に示すように、センター３０において情報を配信する際に、各単語Ｔｉについてユーザからの音声データ要求が５０回に達したかを判定する（Ｓ３０２）。そして、この判定において、ＹＥＳであれば、その単語について、ユーザ端末（移動端末）に自動配信する（Ｓ３０４）。このような処理は、所定地域に限定して行うことも好適である。すなわち、ある地域に存在する移動端末からの要求をカウントして、その地域に存在する移動端末に当該単語の音声データを自動配信することができる。
【００６１】
「音声読み上げデータの例」
次に、センター３０が交通情報を提供する際に、音声データを添付して、車両（移動端末）に付与する例について説明する。この場合、音声データは、符号化されて添付される。そこで、移動端末装置においては、受信した音声データをデコードすることで、音声出力を得ることができる。
【００６２】
所定の地域の道路や設定された経路について渋滞情報を提供する際に、センター３０は表１に示すようなデータを提供する。すなわち、各リンクに対応した渋滞レベルデータに追加して、渋滞情報読み上げデータ、道路名称の読み上げデータを移動端末に提供する。例えば、リンク１〜７について、渋滞情報読み上げデータとして、「渋滞はありません」「少し渋滞」「２キロ渋滞」「かなり渋滞」「車線減少」「工事箇所」「通行止め」などを提供し、また道路名称の読み上げデータとして、「国道１号線」「丸山公園通り北行き」「西大津バイパス堅田方面」「高雄パークウェイ嵐山方面」などを提供する。
【００６３】
【表１】

そこで、移動端末は経路に応じて、提供された音声データを利用して案内を行う。例えば、自宅出発時において経路が定まっており、その経路についての渋滞情報を取得していた場合、「国道１号線から西大津バイパスを通るルートです。国道１号線は＊少し渋滞＊しています。西大津バイパス堅田方面は＊車線減少＊箇所があるので注意して走行して下さい。」、また走行中の交差点手前では、「５００ｍ先、丸山公園前を左方向です。その先丸山公園通り北行きは＊２キロの渋滞＊です。」等という音声案内を提供された音声データ（読み上げデータ）を利用して行うことができる。
【００６４】
なお、経路を車両側で計算する場合には、所定範囲の交通データを移動端末に提供するが、センター３０側で経路を計算する場合には、センター３０において、経路がわかっている。従って、移動端末に提供するデータは、案内に必要な最小限のデータにすることができる。
【００６５】
また、駐車場の利用状況についての情報（満室情報）をセンター３０が提供する場合には、表２に示すような音声データを提供する。
【００６６】
【表２】

このように、駐車場を特定するＮｏ．に対応して、駐車場名称のテキストデータ、駐車場名称の読み上げデータ、満室レベルデータ、満室状況の読み上げデータが送信される。従って、移動端末装置において、案内を行うときに、駐車場名称や、満室レベルを受け取った読み上げデータを利用して行うことができる。
【００６７】
さらに、経路案内においては、特徴的な建物など目印となるもの（ＰＯＩ：Point of Intent）を知らせることが好適である。そこで、これらＰＯＩについての音声データを移動端末装置に提供することが好適である。表３は、このようなＰＯＩ及びその属性データの音声読み上げデータの提供例を示すものである。
【００６８】
【表３】

この例では、例えば、ノードＮｏ．１７１について、ＰＯＩとして「ＴＶ塔」、その音声読み上げデータとして「テレビトウ」というデータが提供され、またＰＯＩの属性データとして「赤く、一番高い」というデータと共に、その音声読み上げデータとして「アカク、イチバンタカイ」というデータが提供される。
【００６９】
従って、音声案内において、「＊赤く一番高いテレビ塔＊前を右折すると、県庁前通りです。」等という音声案内をすることができる。また、「＊いちょう並木＊に沿って、＊茶色い３４階建て＊の＊県庁ビル＊を通り過ぎたら、５００ｍで左方向です。」「左折後、右前方に＊富士山が見え＊てきます。」「３００ｍ先、＊市営地下駐車場＊です。左折で進入できます。」等という音声案内も行うことができる。
【００７０】
また、図１１に、移動端末装置を車両に実際に搭載したイメージを示す。このように、ＧＰＳ装置１２を構成するＧＰＳアンテナ１２ａは、車室内のインパネの上方に設けられ、ナビゲーションのためのＥＣＵ４ａ（ナビＥＣＵ４の一部）及び地図データベース１０を構成するＣＤ−ＲＯＭ１０ａは、後部トランク内に設けられている。また、表示部１８及び情報制御のためのＥＣＵ（ナビＥＣＵ４の一部）は、一体的に形成され、ワイドマルチステーション６０として、ドライバ席と助手席に間のスペースに配置されている。そして、このワイドマルチステーション６０には、ケーブル６２を介し、無線通信機２０を構成する移動体電話をハンズフリー電話機として動作させるクレードル８０が接続されている。
【００７１】
すなわち、この例では、図１２に示すように、移動体電話３２を構成する携帯電話機８２は、クレードル８０に載置される。そして、携帯電話機８２のコネクタ接続用ターミナル８２ａに、クレードル８０のコネクタ８０ａを接続することで、携帯電話機８２とクレードル８０が接続される。このクレードル８０には、ハンズフリーで通話をするためのマイクロフォン、スピーカ、ワンタッチダイヤルボタンなどの各種の機器が接続されており、携帯電話機８２をこのクレードル８０にセットすることによって、携帯電話機８２を利用してハンズフリー電話機として使用することになる。
【００７２】
また、各種操作は、ワイドマルチステーション６０の入力操作部を利用して行われる。なお、無線通信機２０は、この構成に限らず、専用の車載電話システムを設けることも好適である。
【００７３】
「その他の構成」
無線通信機２０において、センター３０との間で、電子メールなどのやりとりも行うことが好適である。この場合取得された電子メールは、通常テキストデータであり、これがナビＥＣＵ４内のＲＡＭに記憶される。そして、表示部１８に表示されるが、運転中などは音声出力される。すなわち、ナビＥＣＵ４が、受信した電子メールについてのテキストデータを音声合成装置４０に供給し、電子メールの読み上げ音声がスピーカ２２から出力される。この場合においても、必要な言葉やフレーズについて、適宜音声データを取得することが好適である。また、流行語なども所定回数以上の使用に対し、その音声データを取得しておくことができる。
【００７４】
さらに、各種の音声データについて、ＩＤ番号などのコードを予め決定しておき、音声データをこのＩＤ番号に対応させて移動端末装置に記憶させておけば、通信するデータは、このＩＤ番号のみでよくなる。従って、通信データ量を大幅に削減することができる。
【００７５】
さらに、移動端末装置と、センターの間の通信は、通常の携帯電話回線や、ＰＨＳ、ＦＭ多重放送、ＴＶ多重放送、地上波デジタル通信、光ビーコン、電波ビーコン等を利用したものが利用可能である。
【００７６】
【発明の効果】
以上説明したように、本発明によれば、音声データを記憶するのに必要な記憶媒体等の容量を必要最小限に低減することができる。また、車両外部から受信した音声データを読み上げるので、読み上げ音声は明瞭なものとなる。このため、運転者にとって、自然で且つわかりやすい音声経路案内などの音声出力を行うことができる。
【図面の簡単な説明】
【図１】本発明の基本形態の移動端末装置の構成を示すブロック図である。
【図２】基本形態を示す模式的概念図である。
【図３】基本形態における音声案内を示す経路図である。
【図４】基本形態における音声案内を行うことを示すフローチャートである。
【図５】基本形態における経路及びその交差点名称群の範囲を示す図である。
【図６】交通状況データベースも考慮して、決定された経路及びその交差点名称群の範囲を示す図である。
【図７】携帯端末装置を使用した時の音声案内における経路及びその交差点名称群の範囲を示す図である。
【図８】実施形態における移動端末装置の構成を示すブロック図である。
【図９】実施形態における音声データ取得の動作を示すフローチャートである。
【図１０】実施形態における単語削除の動作を示すフローチャートである。
【図１１】移動端末装置を車両に実際に搭載したイメージを示す図である。
【図１２】無線通信機の構成を示す図である。
【符号の説明】
２ナビゲーション装置、４ナビＥＣＵ、６モデム、８音声信号出力装置、１０地図データベース、１２ＧＰＳ装置、１４音声データ記憶部、１６操作部、１８表示部、２０無線通信機、２２スピーカー、３０センター、５０ノード、５２経路、５４交差点、７０交差点名称抽出範囲。
[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a mobile terminal device that performs voice output, such as navigation guidance or reading e-mail aloud, and to a medium on which its operation program is recorded.
[0002]
[Prior art]
Conventionally, navigation devices for route guidance are known. Such a device has a current location detection device, such as a GPS device, that detects the current location of the vehicle, and a map database used to recognize the detected location as a position on a map. Using these, the device shows a map and the current location on a display to guide the driver. The navigation device also has a function of searching for an optimum route to a destination once the destination is input. When traveling along a route set to the destination, the device provides guidance for a right or left turn as the vehicle approaches an intersection where guidance is needed (a guidance intersection), such as one where the vehicle turns right or left. This guidance is given by displaying an enlarged view of the intersection on the display and indicating the traveling direction in that view, and also by voice. Because the driver watches the road ahead while the vehicle is moving, voice guidance is easier to follow, and it has therefore become widespread. Thus, voice output is performed in mobile terminals such as navigation devices.
[0003]
In addition, as a navigation apparatus which guides the traveling direction by voice, there is one described in, for example, Japanese Patent Laid-Open No. 2-249100.
[0004]
In such a navigation device, it is preferable to read out intersection names and the like so that the voice guidance is easy to understand. For example, 500 m before intersection A, which is a guidance intersection, voice guidance such as “In 500 m, turn left at intersection A.” is given.
[0005]
In order to perform this voice reading, voice data regarding each intersection is required. In a normal map database, a map of the whole country is stored, and all intersection names used for guidance are stored as text data. For this reason, by performing speech synthesis from this text data, speech data of intersection names is created and voice guidance is output.
[0006]
However, such speech synthesis creates the voice data for reading by combining unit sounds held in the device in advance, such as the 50 sounds of the Japanese syllabary. As a result, the intonation, accent, pronunciation, and so on differ from those of the intersection name as it is normally read out, and the output is hard to hear. In particular, for a driver who is driving, guidance by such speech synthesis is not sufficiently easy to hear, and an easy-to-understand guidance voice is desired.
[0007]
Therefore, voice data for all intersection names has been stored in advance. That is, for each intersection name or the like, a recording of the name actually read aloud is subjected to a predetermined encoding process and stored in the map database as voice data. Natural and easy-to-understand voice guidance can thereby be performed. The voice data is data that has been encoded by, for example, PCM (pulse code modulation).
[0008]
[Problems to be solved by the invention]
However, the conventional voice guidance navigation apparatus has the following problems. When all audio data is stored, the amount of data becomes enormous and the capacity of the storage medium increases. Normally, a CD-ROM is used for the map database, but a capacity several times that of normal map data is required for storing audio data.
[0009]
Therefore, there has been a demand for a navigation apparatus and system that can read out speech naturally and in an easy-to-understand manner and can reduce the storage capacity of a storage medium or the like.
[0010]
In addition, services for exchanging e-mails and receiving various data from the center using an in-vehicle terminal device have been started. In many cases, it is better to read out information obtained by such a service by voice. A natural and easy-to-understand voice output is desired for the reading voice in such a case.
[0011]
The present invention has been made to solve the above problems, and an object of the present invention is to provide a mobile terminal device and the like that can perform natural and easy-to-understand voice output while keeping the storage capacity required for it small.
[0012]
[Means for Solving the Problems]
The present invention has data acquisition means for acquiring voice data for voice output from an external database via a network; voice output means for performing voice output that includes the words corresponding to the voice data; and speech synthesis means for synthesizing speech from text data and outputting it for words expressed as text data. It is characterized in that, when the usage frequency of a word synthesized and output by the speech synthesis means reaches or exceeds a predetermined level, the data acquisition means acquires voice data for that word from outside. Thus, in the present invention, various sounds can be output using voice data acquired from outside. The mobile terminal device therefore does not need a large-capacity memory for storing voice data. In addition, compared with output by speech synthesis, voice output with natural utterance is possible. Voice data for frequently used words can also be added to the mobile terminal.
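The frequency-triggered acquisition described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the class name `VoiceCache`, the callable `fetch_voice_data` (standing in for the request to the external database), and the threshold value are all hypothetical.

```python
from collections import Counter


class VoiceCache:
    """Toy model: synthesize unknown words, fetch recordings for frequent ones."""

    def __init__(self, fetch_voice_data, threshold=5):
        # fetch_voice_data is a hypothetical callable: word -> recorded voice data.
        self.fetch_voice_data = fetch_voice_data
        # threshold stands in for the "predetermined" usage frequency.
        self.threshold = threshold
        self.synth_counts = Counter()  # times each word was output by synthesis
        self.recorded = {}             # word -> voice data acquired from outside

    def speak(self, word):
        """Return which output path was used for this word."""
        if word in self.recorded:
            return "recorded"
        self.synth_counts[word] += 1
        if self.synth_counts[word] >= self.threshold:
            # Frequency reached the threshold: acquire voice data for later use.
            self.recorded[word] = self.fetch_voice_data(word)
            del self.synth_counts[word]
        return "synthesized"


cache = VoiceCache(lambda w: b"pcm:" + w.encode("utf-8"), threshold=3)
paths = [cache.speak("Kyoto") for _ in range(4)]
print(paths)
```

In this sketch the word is still synthesized on the call that triggers the fetch, and the recording is used from the next call onward; a real terminal could equally wait for the download before speaking.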
[0013]
It is also preferable to have storage means that stores voice data for frequently used words among the words to be processed for voice output, and text synthesis means that outputs speech by text synthesis for words expressed as text data, and to perform voice output by combining the voice data from the storage means with the voice data from the text synthesis means. In this way, voice data is acquired for frequently used words, so these words are output with natural pronunciation rather than by text synthesis, and the voice output as a whole is easy to understand. The storage capacity needed for voice data can also be kept sufficiently small.
[0015]
It is also preferable to erase acquired voice data that is used infrequently. In this way, only the necessary voice data is stored, and the storage capacity of the memory can be used effectively.
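One way to pick the erasure candidates is to keep a last-used date per word and compare it with an idle limit; the description gives one year as an example period and a per-word time stamp as the mechanism. The sketch below assumes that scheme. The function name `stale_words` and the dictionary layout are hypothetical.

```python
from datetime import date, timedelta


def stale_words(last_used, today, max_idle=timedelta(days=365)):
    """Return the words whose last use is older than max_idle, in sorted order."""
    return sorted(
        word for word, used_on in last_used.items()
        if today - used_on > max_idle
    )


# Hypothetical time stamps for two stored words.
last_used = {
    "Maruyama Park": date(1997, 1, 10),
    "Route 1": date(1998, 3, 1),
}
print(stale_words(last_used, today=date(1998, 4, 1)))
```

The function only lists candidates. Whether to delete them automatically or ask the user first is left to the caller.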
[0016]
It is also preferable to output a read-aloud voice for text data intended for display. Outputting display text data as voice makes read-aloud output such as tourist information or e-mail easy to listen to.
[0017]
It is also preferable to output the guidance voice of a navigation device. Guidance voice contains many words, such as place names, and obtaining these from outside reduces the memory capacity required in the mobile terminal device while maintaining natural pronunciation.
[0018]
It is also preferable that voice data relating to geographical names be acquired from outside, and that the voice output means perform voice guidance including a read-aloud of the geographical name based on the acquired voice data. It is then unnecessary to store in advance the voice data for the geographical names needed for route guidance, and the capacity of the storage medium that holds the data can be reduced to the minimum necessary. Moreover, because the voice output is based on voice data received from outside the vehicle, the read-aloud voice is natural. The driver can therefore be given natural and easy-to-understand voice route guidance.
[0019]
It is also preferable to read out the acquired geographical name in addition to the direction-of-travel sentence of the voice guidance. The driver hears the direction guidance as a message that includes the geographical name and can easily recognize the direction of travel.
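As a rough illustration of adding an acquired name to a fixed direction sentence, an announcement can be treated as an ordered list of audio clips: fixed phrases held on the terminal and a name clip acquired from outside. The sketch below is only an assumption about how such clips might be combined; the function name, the phrase keys, and the use of strings as stand-ins for audio data are all hypothetical.

```python
def build_announcement(distance_m, name, direction, fixed_phrases, name_clips):
    """Return the clips to play, in order, for one guidance announcement."""
    return [
        fixed_phrases[f"in_{distance_m}m"],   # fixed phrase, e.g. "In 300 m,"
        fixed_phrases[f"turn_{direction}"],   # fixed phrase, e.g. "turn left at"
        name_clips[name],                     # recorded name acquired from outside
    ]


# Strings stand in for audio clips in this example.
fixed_phrases = {"in_300m": "In 300 m,", "turn_left": "turn left at"}
name_clips = {"A": "intersection A"}
print(" ".join(build_announcement(300, "A", "left", fixed_phrases, name_clips)))
```

The clip order here follows English word order; the Japanese guidance sentences in the description place the intersection name before the direction.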
[0020]
It is also preferable to have transmission means for transmitting data relating to the destination to the outside of the vehicle, and that the data acquisition means acquire, from outside the vehicle, voice data for information on the route between the current location and the destination. In this case, because the data relating to the destination is transmitted to the outside of the vehicle, voice data for the geographical names between the current location and the destination can be received. Unneeded portions of the received voice data can thus be eliminated.
[0021]
It is also preferable that the voice data to be acquired be information within a predetermined range determined from the current location and the traveling direction, and that the predetermined range be changeable. For example, the predetermined range can be determined according to the volume of voice data to be transmitted and received.
[0022]
Also, the predetermined range, which is the range over which voice data is acquired, is characterized by being determined in consideration of the travel history of the own vehicle or past traffic flow. In these cases, the necessary geographical names can be determined by predicting the travel route in consideration of not only the distance to the destination but also past traffic conditions and the past travel history of the own vehicle. The necessary voice data can therefore be obtained reliably.
[0023]
It is also preferable to have external information receiving means for receiving external information and decoding means for decoding encoded voice data contained in the received external information, and that the voice output means read aloud using the voice data obtained by the decoding means. In this way, the voice data in the received data can be decoded and output as voice immediately.
[0024]
It is also preferable that the voice data receiving means and the voice guidance means be mounted on a portable terminal device. When they are mounted on a portable terminal device in this way, a route can be set at any location using that portable terminal. Since no voice data needs to be stored, the portable terminal device can be made smaller and lighter. Furthermore, if, for example, map data is received from the center, the map database itself becomes unnecessary and the device can be made smaller still.
[0025]
It is also preferable that the voice data acquired from outside be sampled voice data. By acquiring sampled voice data obtained from human utterance, natural voice similar to a live human voice can be reproduced.
[0026]
The present invention also relates to a medium storing a program for achieving the operation of the mobile terminal. The operation program is preferably stored in a ROM, CD-ROM, hard disk, or the like in the mobile terminal device. The user can also load a new program into the navigation ECU by purchasing a CD-ROM. Furthermore, the medium is not limited to a CD-ROM and may be of any form that can store the voice output program, such as a DVD or FD, and the program may also suitably be provided by communication. In particular, it is preferable that the center provide the operation program to the mobile terminal device by communication.
[0027]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, preferred embodiments of the present invention will be described with reference to the drawings.
[0028]
"Basic Form 1"
FIG. 1 is a block diagram showing the basic form of a navigation device (mobile terminal) according to the present invention. In the navigation device 2, a modem 6, a voice signal output device 8, a map database 10, a GPS device 12, a voice data storage unit 14, an operation unit 16, and a display unit 18 are connected to a navigation ECU 4 (hereinafter referred to as the navigation ECU).
[0029]
The modem 6 is connected to a radio communication device 20 such as an automobile phone, and performs modulation / demodulation processing necessary for receiving voice data transmitted from the outside of the vehicle and transmitting information to the outside of the vehicle. A speaker 22 is connected to the voice signal output device 8, and a voice is output from the speaker 22 in accordance with a signal from the navigation ECU 4 to provide route guidance to the driver. The navigation ECU 4 stores a predetermined program in its internal ROM, and achieves various operations by executing this program. The ROM may be a mask ROM, but is preferably rewritable such as an EEPROM. In this case, it is also preferable to store an operation program (audio output program) in a CD-ROM used as the map database 10 and load this program into an EEPROM or the like in the navigation ECU 4. Thereby, the user can load a new program into the navigation ECU 4 by purchasing the CD-ROM, and can operate the navigation ECU 4 based on the new program. The medium is not limited to a CD-ROM, and may be of any format as long as it can store an audio output program such as a DVD or FD.
[0030]
The GPS device 12 detects the current location (latitude, longitude, etc.) by receiving radio waves from a plurality of artificial satellites. The current location detected by the GPS device 12 is then recognized as a position on the map using the map database 10. The voice data storage unit 14 stores voice data sent from outside the vehicle via the navigation ECU 4. RAM is normally used as the voice data storage unit 14, but an EEPROM or the like may also be used. The operation unit 16 is used to input various data to the navigation ECU 4. The display unit 18 shows the screens for data input through the operation unit 16, the current position on the map obtained from the map database 10 and the GPS device 12, messages, and the like.
[0031]
FIG. 2 is a schematic conceptual diagram showing a navigation system using the navigation device 2. When the navigation device 2 transmits route data to the center 30, the center 30 creates the voice data needed for route guidance along this route and provides it to the navigation device 2. The navigation device 2 can therefore perform route guidance based on the voice data.
[0032]
A route guidance method using voice data using the navigation device 2 and its system will be described below. FIG. 4 is a flowchart showing an operation in the navigation system, and FIG. 3 is a diagram for explaining an example of route guidance. Note that such processing is achieved by the navigation ECU 4 executing a program stored therein.
[0033]
In FIG. 3, the vehicle is initially at position X. In this case, as shown in the flowchart of FIG. 4, the GPS device 12 first detects position X as an absolute position (latitude and longitude), and the center 30 closest to the vehicle position, among the centers 30 installed in each region, is searched for using the map database 10 (S102). The map database 10 is also used to recognize the vehicle position X as a position on the map. Alternatively, the current location may be transmitted to the center 30, map data around the current location received, and the vehicle position X on the map recognized from that data.
[0034]
Next, the destination data is input by the operation unit 16. Then, the navigation ECU 4 searches for a route from the current location to the destination using the map database 10, and route data is created (S104). Then, route data consisting of links and nodes for the distance a (km) scheduled to travel on the route from the current location to the destination is transmitted to the center 30 via the wireless communication device 20 or the like (S106).
[0035]
A link means one unit of a road divided at each intersection, and an intersection that is a division between these links is called a node. The distance a (km) is determined in consideration of the processing speed on the center 30 side, the capacity of the audio data storage unit 14, the capacity of received audio data, and the like. For example, a fixed distance such as 10 km may be used, but it is preferable to change the distance according to conditions.
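As a rough illustration of the route data described above, the sketch below models a route as a list of links and cuts off the portion covering the planned travel distance a (km). The `Link` type, the field names, and the rule that the link crossing the a-km mark is still included are assumptions made for this example.

```python
from dataclasses import dataclass

@dataclass
class Link:
    start_node: str    # intersection (node) where the link begins
    end_node: str      # intersection (node) where the link ends
    length_km: float   # length of this road unit

def route_prefix(links, a_km):
    """Return the leading links of the route until a_km is covered."""
    selected, travelled = [], 0.0
    for link in links:
        if travelled >= a_km:
            break
        selected.append(link)
        travelled += link.length_km
    return selected

def nodes_of(links):
    """Return the ordered node names along a chain of links."""
    if not links:
        return []
    return [links[0].start_node] + [link.end_node for link in links]

# Hypothetical route used only for illustration.
ROUTE = [Link("A", "B", 4.0), Link("B", "C", 5.0), Link("C", "D", 3.0), Link("D", "E", 6.0)]
```

With a = 10 km, `route_prefix(ROUTE, 10)` keeps the first three links, and `nodes_of` lists the nodes that would be sent to the center 30 in S106.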
[0036]
Subsequently, when the center 30 receives the route data composed of links and nodes for the planned travel distance a (km) on the route, it searches for the intersection name group corresponding to the route data and transmits it to the vehicle (S108).
[0037]
The search for the intersection name group will be described in detail below. The intersection name group D(U) is determined by the extraction function f(U), which depends on the route data U for the links and nodes within the planned travel distance a (km). FIGS. 5A and 5B are diagrams showing the ranges from which intersection names are extracted by the extraction function f(U) for each node. For example, as shown in FIG. 5A, all intersections 54 within a radius b (km) of each node 50 on the route 52 are extracted, and the set of these intersection names is taken as the intersection name group D(U). As another example, as shown in FIG. 5B, all the intersections 54 within a distance b (km) of each node 50 on the route 56 are extracted, and these intersection names are taken as the intersection name group D(U). The distance b (km) can be changed according to the processing speed on the center 30 side, the capacity of the audio data storage unit 14, the size of the received audio data, and the like.
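The radius-based variant of the extraction function (FIG. 5A) might be sketched as follows. For simplicity the example assumes planar coordinates already expressed in km; the function name, the data layout, and the sample coordinates are hypothetical.

```python
import math

def extract_intersection_names(route_nodes, intersections, b_km):
    """Sketch of f(U): names of all intersections within b_km of any route node.

    route_nodes   -- list of (x_km, y_km) positions of the nodes on the route
    intersections -- dict mapping intersection name to (x_km, y_km)
    """
    names = set()
    for nx, ny in route_nodes:
        for name, (ix, iy) in intersections.items():
            if math.hypot(ix - nx, iy - ny) <= b_km:
                names.add(name)
    return names

# Hypothetical data for illustration.
NODES = [(0.0, 0.0), (5.0, 0.0)]
INTERSECTIONS = {"A": (0.0, 0.0), "B": (5.0, 0.0), "C": (5.0, 1.0), "D": (20.0, 20.0)}
```

Here intersection C lies off the route but within b = 2 km of a node, so it is included in D(U), while D is not. A larger b gives more coverage for off-route guidance at the cost of more voice data.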
[0038]
Thus, by acquiring the names not only of intersections on the route but also of intersections within a certain range around it, guidance back to the route or direction guidance can be provided when the vehicle leaves the route.
[0039]
Then, after searching for the intersection names, the center 30 transmits the intersection name group D(U) to the navigation device 2 as voice data.
[0040]
As described above, when the voice data is received, the received voice data of the intersection name group D (U) is stored in the voice data storage unit 14 of the vehicle (S110). Thereafter, route guidance is started (S112). Here, as shown in FIG. 3, the vehicle travels from the current location X based on route guidance. Then, when approaching the A intersection, for example, “300 meters ahead, the A intersection is on the left” is read out and route guidance is performed. When approaching the B intersection, a voice such as “300m ahead, B intersection is in the right direction” is read out.
[0041]
Here, voice data such as an intersection name necessary for guidance while the vehicle travels a distance a (km) is received from the center 30 in the vehicle. Therefore, the voice data of the intersection names A and B in the guidance as described above is stored in the voice data storage unit 14. Therefore, the intersection name in the guidance can be output with natural pronunciation. Note that other standard guidance voices are stored as voice data in the map database 10 and are read out and used.
[0042]
Then, each time the vehicle travels a predetermined set distance, it is determined in step S114 whether the vehicle has reached a point 1 km before the end of the distance a (km). If the vehicle has not reached the point at distance a-1 (km) in step S114, the process returns to step S112 and voice guidance continues. The 1 km margin is not limited to this value and can be changed according to, for example, the processing speed on the center 30 side. If the vehicle has reached the point 1 km before the end of the distance a (km), it is determined in step S116 whether the vehicle is within 1 km of the destination. If not, the process returns to step S102 and the above processing is repeated: route data from the current location toward the destination is transmitted, the necessary voice data is received, and route guidance continues. If the distance to the destination is within 1 km, voice route guidance ends and the vehicle proceeds to the destination.
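The loop through S102 to S116 can be illustrated by computing where along the trip a new request to the center 30 would be made. This is a simplified sketch that assumes travel exactly along the route and measures position as distance from the start; the function and parameter names are hypothetical.

```python
def plan_refetch_points(total_km, a_km, margin_km=1.0):
    """Return distances from the start (km) at which route data is sent to the center.

    total_km  -- length of the whole route to the destination
    a_km      -- planned travel distance covered by one batch of voice data
    margin_km -- how far before the end of a batch the next request is made (1 km in the text)
    """
    if a_km <= margin_km:
        raise ValueError("a_km must be larger than margin_km")
    points = [0.0]   # the first request is made at the starting position X
    pos = 0.0
    while True:
        pos += a_km - margin_km            # S114: point a-1 km into the current batch
        if total_km - pos <= margin_km:    # S116: within the margin of the destination
            break
        points.append(pos)                 # back to S102: request the next batch
    return points
```

For a 25 km trip with a = 10 km, requests are made at 0, 9, and 18 km; the third batch already covers the rest of the route, so no further request follows.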
[0043]
The voice data acquired from the center 30 may be, for example, a geographical name such as 12 city F or a facility name instead of the intersection name, and the geographical name and the intersection name may be combined as voice data. Furthermore, the voice data may include intersection name groups that are not necessary for route guidance in consideration of each vehicle situation.
[0044]
Thus, in the present embodiment, voice data relating to intersection names and the like can be acquired from the center 30. For this reason, it is not necessary to store voice data in advance. Only the voice data related to the route guidance required for the current travel is stored in the voice data storage unit 14, so the capacity of the voice data storage unit 14 can be small. In addition, since the voice data received from the center 30 is read out, the voice sounds natural. The driver can therefore be given natural and easy-to-understand voice route guidance.
[0045]
In the present embodiment, the destination is set and the route to the destination is searched for by the navigation device 2 on the vehicle side. However, the present embodiment is not limited to this. That is, after the destination is set on the vehicle side, data on the destination may be transmitted to the center 30, and the center 30 may search for the route from the current location to the destination, create the route data, and prepare voice data such as intersection names on that basis.
[0046]
In addition, it is possible to determine an intersection name or the like necessary for route guidance using the past traffic situation database on the center 30 side. Usually, the number of vehicles traveling on the road varies depending on the day of the week, the traveling time, the weather, and the like. For this reason, information on the number of vehicles based on the travel time and the like is stored in the traffic situation database for each route.
Then, based on this past traffic situation database, voice data such as intersection names is also prepared for, for example, a route that many vehicles are judged likely to select at the time of day when route guidance is performed. FIG. 6 shows an example of routes determined in consideration of past traffic conditions. As shown in this figure, the route XX is the route determined first, but the center 30 transmits voice data such as intersection names not only for the nodes G and H on this route but also for the intersections M and N on the route YY, which is selected in consideration of the past traffic volume and the like. Therefore, when the vehicle selects and travels the route YY according to the traveling situation, voice guidance can also be given for this route.
[0047]
Furthermore, it is also possible to store the travel history of the host vehicle, for example the number of times the host vehicle has traveled each of the different routes in a predetermined section, in a travel history database, and to determine the target intersection names based on this database. In this case, after the route to the destination is set on the vehicle side, routes with a high travel frequency are also added to the route data with reference to the past travel history. Voice data for intersection names can therefore be obtained for these plural routes.
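One way to picture the use of a travel history database is the sketch below. It assumes that each past trip through the section is recorded as a tuple of node names and that "high travel frequency" means at least `min_trips` past trips; both the representation and the threshold are assumptions for illustration.

```python
from collections import Counter

def routes_to_request(planned_route, travel_history, min_trips=3):
    """Return the planned route plus frequently traveled alternative routes.

    planned_route  -- tuple of node names for the route set on the vehicle side
    travel_history -- list of past routes (tuples of node names) through the same section
    min_trips      -- hypothetical threshold for "high travel frequency"
    """
    counts = Counter(travel_history)
    extra = [route for route, n in counts.most_common()
             if n >= min_trips and route != planned_route]
    return [planned_route] + extra

# Hypothetical history: the route via M and N was used four times.
HISTORY = [("G", "H")] * 2 + [("M", "N")] * 4 + [("P", "Q")]
```

Voice data would then be requested for the intersection names on every route in the returned list, so that guidance remains available if the driver switches to a familiar alternative.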
[0048]
In this way, by obtaining voice data in consideration of past traffic conditions and past travel histories of the vehicle, voice guidance can be performed in response to changes in the travel route.
[0049]
It is also preferable to mount the navigation device 2 on a portable terminal device.
In this case, the route is set using the mobile terminal device, and the route and the minimum necessary voice data are obtained over a telephone such as a mobile phone or a PHS, as in the case described above. Detailed data can then be obtained as needed while driving and used for route guidance. FIG. 7 is a schematic diagram showing voice guidance using a portable terminal device. Before the vehicle travels, the destination is input, for example, at home. The current location is detected by the GPS device 12, and the destination and the current location are transmitted to the center 30. The center 30 searches for the optimum route based on the transmitted current location and destination, and sends the optimum route data, together with voice data such as intersection name groups and map data corresponding to the nodes and links on the route, to the terminal. At this time, the voice data should be kept to a minimum. If the current location (travel start position) is entered, the GPS device need not be provided. The above operation may also be performed at the start of vehicle travel. In particular, when the route includes an urban area where antennas such as DSRC or PHS antennas are installed, the data for this area should be minimized. In DSRC (Dedicated Short Range Communication), an optical beacon or the like is often used.
[0050]
Then, the mobile terminal device for which route setting has been completed is brought into the vehicle, and the vehicle travels toward the destination while receiving route guidance from the mobile terminal device. As shown in FIG. 7, when traveling in an urban area where antennas such as DSRC or PHS antennas are installed, each time the vehicle approaches a node 50, the center 30 is called via the PHS or the like, the intersection name group D(U) within a predetermined range of that intersection (for example, within the circle 70) is received, and route guidance is given by voice. In addition, by calling the center 30 and receiving detailed map data as needed, the amount of data stored in advance can be greatly reduced while the desired route guidance is still performed. Moreover, since map data can be obtained at any time, the amount of map data that the mobile terminal device needs in advance can be reduced further.
[0051]
Thus, there is no need to store voice data, and if the map data is obtained from the center 30, the map database itself becomes unnecessary. Therefore, it is possible to reduce the size of the device, and to easily assemble the portable terminal devices. Furthermore, since a route can be set at an arbitrary place by using a mobile terminal device, it is possible to set a route while talking about a drive plan with a friend or the like.
[0052]
Furthermore, it is also preferable to combine a mobile terminal device and an in-vehicle navigation device. In particular, in a vehicle-mounted navigation device, if a communication device with a beacon on the road side is provided, necessary data can be obtained from the communication device while traveling.
[0053]
"Embodiment"
FIG. 8 shows the configuration of the embodiment. In this embodiment, a speech synthesizer 40 is provided. The speech synthesizer 40 synthesizes speech from text data supplied from the navigation ECU 4 and outputs the synthesized speech from the speaker 22. Thus, when outputting voice data, the navigation ECU 4 outputs it from the speaker 22 through the voice signal output device 8, and when outputting text data, it outputs the speech from the speaker 22 via the speech synthesizer 40. A fixed memory 42 is provided in place of the voice data storage unit 14. It is configured by an EEPROM or the like, receives voice data for frequently used words (main keywords), and stores them persistently. That is, ordinary words are output by speech synthesis, while for frequently used words voice data is obtained and stored in the fixed memory 42. Words that often appear in voice output are therefore output using the supplied voice data, so that the voice output as a whole is easy to understand. As in the conventional case, it is also preferable to store in the map database 10 the voice data of words normally used in route guidance, which eliminates the need to acquire these words.
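The choice between stored voice data and speech synthesis can be sketched as a per-word lookup. The sketch assumes the text has already been split into words and represents the fixed memory 42 as a dictionary; the labels "recorded" and "synthesized" and the sample data are hypothetical.

```python
def plan_output(words, fixed_memory):
    """Decide, word by word, which output path would be used.

    words        -- list of words to be spoken
    fixed_memory -- dict mapping word to stored voice data (stand-in for fixed memory 42)
    Returns (source, word) pairs, where source is "recorded" when voice data is
    stored and "synthesized" when the speech synthesizer 40 would be used.
    """
    return [("recorded" if word in fixed_memory else "synthesized", word)
            for word in words]

# Hypothetical contents of the fixed memory; the bytes are placeholders.
FIXED_MEMORY = {"Katata": b"\x00\x01", "bypass": b"\x02\x03"}
```

A real device would hand the recorded data to the voice signal output device 8 and the remaining text to the speech synthesizer 40; the sketch only shows the decision itself.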
[0054]
Thus, in the present embodiment, voice data is acquired for frequently used words and phrases and stored in the fixed memory 42. This operation will be described with reference to FIG.
[0055]
First, when text data is received, the text data is read out (S202). That is, for the received text data, words stored in the fixed memory 42 are output using the voice data read from it, and words not stored in the fixed memory 42 are synthesized by the speech synthesizer 40. Next, it is determined whether there is a word that is not stored in the fixed memory 42 (S204). If YES, that is, if there is an unstored word, the word is stored as text data (S206). If the word has already been stored as text data, its count value is incremented by one; a word appearing for the first time is stored together with a count value of 1. This data may be stored in the RAM in the navigation ECU 4.
[0056]
Next, it is determined whether there is a stored word whose count value exceeds a predetermined value (for example, 5 times) (S208). For the corresponding word, the center 30 is requested to provide voice data (S210), the voice data is received from the center 30, and stored in the fixed memory 42 (S212). Next, it is determined whether there is a word that has not been used in the past predetermined period (for example, one year) in the fixed memory 42 (S214). This can be achieved by storing an appropriate time stamp (for example, data indicating the year and month) for each word and checking it. If the determination is YES, it is advised to delete the word from the fixed memory 42 (S216). If the process of S216 is completed, or if NO in S204, S208, and S214, the process is terminated.
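The counting, request, and deletion checks in S204 to S216 might look like the following sketch. It assumes time stamps are integer month indexes and uses the example values from the text (more than 5 uses, one year unused) as defaults; the class and method names are hypothetical.

```python
class WordUsageTracker:
    """Tracks synthesized-word usage and the contents of the fixed memory 42."""

    def __init__(self, request_threshold=5, stale_months=12):
        self.request_threshold = request_threshold
        self.stale_months = stale_months
        self.counts = {}        # word -> number of times output by speech synthesis
        self.fixed_memory = {}  # word -> (voice_data, month of last use)

    def record_use(self, word, month):
        """S202-S208: return True when voice data should be requested (S210)."""
        if word in self.fixed_memory:
            data, _ = self.fixed_memory[word]
            self.fixed_memory[word] = (data, month)  # refresh the time stamp
            return False
        self.counts[word] = self.counts.get(word, 0) + 1
        return self.counts[word] > self.request_threshold

    def store_voice_data(self, word, data, month):
        """S212: store voice data received from the center."""
        self.fixed_memory[word] = (data, month)
        self.counts.pop(word, None)

    def deletion_candidates(self, month):
        """S214: words not used for at least stale_months months."""
        return [word for word, (_, last) in self.fixed_memory.items()
                if month - last >= self.stale_months]
```

In use, the sixth synthesized output of a word makes `record_use` return True; after `store_voice_data` the word is served from the fixed memory, and `deletion_candidates` lists it once it has gone unused for a year, at which point the user would be asked whether to delete it (S216).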
[0057]
In the word deletion advice in S216, the display unit 18 displays, for example, “Word XX has not been used for one year. Do you want to delete it?” together with “Yes” and “No”, waits for input, and decides whether to delete the word accordingly. It is also preferable to make an inquiry such as “Do you want to request voice data for the word XX?” when requesting the voice data in S210.
[0058]
In this way, by storing text data, the frequency of use is detected, voice data is automatically stored in the fixed memory 42 for frequently used words, and voice data can be deleted for words that are not frequently used. The fixed memory 42 is thereby prevented from being occupied by unnecessary voice data.
[0059]
In addition, as in the basic form, it is also preferable to receive predetermined voice data from the center 30 as needed, while storing voice data for frequently used words in the fixed memory 42. In this case, depending on the content to be output, voice output is performed based on the voice data from the map database 10, the fixed memory 42, and the voice data storage unit 14, and on the output from the speech synthesizer 40.
[0060]
Furthermore, in the system as described above, the center 30 counts the number of times voice data is requested for each word from the user, and when it reaches the predetermined number, it can be automatically distributed to each user. That is, as shown in FIG. 10, when distributing information in the center 30, it is determined whether or not the voice data request from the user has reached 50 times for each word Ti (S302). If YES in this determination, the word is automatically distributed to the user terminal (mobile terminal) (S304). It is also preferable to perform such processing only in a predetermined area. That is, it is possible to count requests from mobile terminals that exist in a certain area and automatically distribute the voice data of the word to mobile terminals that exist in that area.
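The center-side counting in S302 and S304 can be sketched as follows. The example assumes a single counter per word across all users and distributes a word once, when its request count reaches the threshold (50 in the text); the class name and return convention are hypothetical.

```python
from collections import Counter

class RequestCounter:
    """Counts voice data requests per word on the center side."""

    def __init__(self, distribute_at=50):
        self.distribute_at = distribute_at
        self.requests = Counter()   # word -> number of requests received
        self.distributed = set()    # words already distributed automatically

    def handle_request(self, word):
        """Count one request; return True if the word should now be distributed (S304)."""
        self.requests[word] += 1
        if self.requests[word] >= self.distribute_at and word not in self.distributed:
            self.distributed.add(word)
            return True
        return False
```

To restrict the processing to a predetermined area, one such counter could be kept per area and fed only with requests from mobile terminals located in that area.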
[0061]
"Example of speech-to-speech data"
Next, an example will be described in which voice data is attached and given to a vehicle (mobile terminal) when the center 30 provides traffic information. In this case, the audio data is encoded and attached. Therefore, the mobile terminal device can obtain audio output by decoding the received audio data.
[0062]
The center 30 provides data as shown in Table 1 when providing traffic information on roads in a predetermined area or on a set route. That is, in addition to traffic congestion level data corresponding to each link, reading data for the congestion information and reading data for the road name are provided to the mobile terminal. For example, for links 1 to 7, “No traffic jam”, “Slight traffic jam”, “2km traffic jam”, “Significant traffic jam”, “Decrease lanes”, “Construction points”, “Closed”, etc. are provided as the reading data of the congestion information, and “National Route 1”, “To Maruyama Koen-dori Tou”, “To Nishiotsu Bypass Katata”, “To Kaohsiung Parkway Arashiyama”, etc. are provided as the reading data of the road name.
[0063]
[Table 1]

Therefore, the mobile terminal provides guidance along the route using the provided voice data. For example, if the route is fixed at the time of departure from home and traffic jam information about the route has been acquired, voice guidance such as “This route goes from National Route 1 through the Nishiotsu Bypass. National Route 1 is *slightly congested*. Please drive carefully, as there is a *lane reduction* near Katata on the Nishiotsu Bypass.” can be given. Before an intersection while driving, voice guidance such as “Turn left 500 meters ahead, in front of Maruyama Park. There is *2 km of traffic congestion*.” can be given using the provided voice data (reading data).
[0064]
When the route is calculated on the vehicle side, a predetermined range of traffic data is provided to the mobile terminal. However, when the route is calculated on the center 30 side, the center 30 knows the route. Therefore, the data provided to the mobile terminal can be the minimum data necessary for guidance.
[0065]
In addition, when the center 30 provides information about the usage status of parking lots (full/vacant information), voice data as shown in Table 2 is provided.
[0066]
[Table 2]

In this way, for each parking lot No., text data of the parking lot name, reading data of the parking lot name, full/vacant level data, and reading data of the full/vacant status are transmitted. Therefore, the mobile terminal device can perform guidance using the received reading data for the parking lot name and the full/vacant level.
[0067]
Further, in route guidance, it is preferable to announce a landmark (POI: Point of Interest) such as a distinctive building. Therefore, it is preferable to provide voice data regarding these POIs to the mobile terminal device. Table 3 shows an example of providing voice reading data for such POIs and their attribute data.
[0068]
[Table 3]

In this example, for node No. 171, “TV tower” is provided as the POI with “TV toe” as its voice reading data, and “Red, highest” is provided as the POI attribute data with “Aku, Ichibantakai” as the corresponding voice reading data.
[0069]
Therefore, voice guidance such as “Turn right in front of the *red, highest TV tower*.” can be given. Voice guidance such as “After passing the *row of ginkgo trees* and the *brown, 34-story* *prefectural office building*, turn left in 500 m.”, “After turning left, you will see *Mount Fuji* on the right.”, and “The *municipal parking lot* is 300 m ahead.” can also be given.
[0070]
FIG. 11 shows an image in which the mobile terminal device is actually mounted on the vehicle. As shown, the GPS antenna 12a that constitutes the GPS device 12 is provided above the instrument panel in the vehicle interior, and the ECU 4a for navigation (a part of the navigation ECU 4) and the CD-ROM 10a that constitutes the map database 10 are provided in the trunk. In addition, the display unit 18 and an ECU for information control (a part of the navigation ECU 4) are integrally formed and arranged as a wide multi-station 60 in the space between the driver seat and the passenger seat. The wide multi-station 60 is connected via a cable 62 to a cradle 80 that operates a mobile phone constituting the wireless communication device 20 as a hands-free phone.
[0071]
That is, in this example, as shown in FIG. 12, the mobile phone 82 constituting the wireless communication device 20 is placed on the cradle 80. The mobile phone 82 and the cradle 80 are connected by connecting the connector 80a of the cradle 80 to the connector connection terminal 82a of the mobile phone 82. Various devices for making hands-free calls, such as a microphone, a speaker, and a one-touch dial button, are connected to the cradle 80, so that the mobile phone 82 can be used as a hands-free phone when it is set in the cradle 80.
[0072]
Various operations are performed using the input operation unit of the wide multi-station 60. The radio communication device 20 is not limited to this configuration, and it is also preferable to provide a dedicated in-vehicle phone system.
[0073]
"Other configurations"
It is also preferable to exchange electronic mail with the center 30 through the wireless communication device 20. In this case, the acquired e-mail is ordinary text data and is stored in the RAM in the navigation ECU 4. The e-mail can be displayed on the display unit 18, but it is output by voice while driving. That is, the navigation ECU 4 supplies the text data of the received e-mail to the speech synthesizer 40, and the e-mail is read aloud from the speaker 22. In this case as well, it is preferable to acquire voice data for necessary words and phrases as appropriate. Voice data for buzzwords and the like can also be acquired once they have been used more than a predetermined number of times.
[0074]
Furthermore, if a code such as an ID number is determined in advance for each item of voice data, and the voice data is stored in the mobile terminal device in association with this ID number, only the ID number needs to be communicated. The amount of communication data can therefore be greatly reduced.
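The ID-number scheme can be illustrated with a small lookup sketch. It assumes the terminal keeps a dictionary from ID number to stored voice data and that the center sends only a list of IDs; the function name, the IDs, and the placeholder bytes are hypothetical.

```python
def resolve_ids(id_list, local_store):
    """Resolve received ID numbers against voice data already held by the terminal.

    id_list     -- ID numbers received from the center, in output order
    local_store -- dict mapping ID number to stored voice data
    Returns (found, missing): voice data for known IDs, and IDs that would
    still have to be obtained as full voice data.
    """
    found = [local_store[i] for i in id_list if i in local_store]
    missing = [i for i in id_list if i not in local_store]
    return found, missing

# Hypothetical terminal-side store; the bytes are placeholders for encoded voice data.
LOCAL_STORE = {101: b"slightly congested", 102: b"lane reduction"}
```

Only the IDs reported as missing would require a transfer of actual voice data, which is where the reduction in communication volume comes from.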
[0075]
Furthermore, communication between the mobile terminal device and the center can be performed using an ordinary mobile phone line, PHS, FM multiplex broadcast, TV multiplex broadcast, terrestrial digital communication, optical beacon, radio beacon, or the like.
[0076]
[Effect of the invention]
As described above, according to the present invention, the capacity of a storage medium or the like needed for storing voice data can be reduced to the minimum necessary. In addition, since voice data received from outside the vehicle is read out, the read-out voice is clear. The driver can therefore be given natural and easy-to-understand voice output, such as voice route guidance.
[Brief description of the drawings]
FIG. 1 is a block diagram showing the configuration of the mobile terminal device in the basic form of the present invention.
FIG. 2 is a schematic conceptual diagram showing the basic form.
FIG. 3 is a route diagram showing voice guidance in the basic form.
FIG. 4 is a flowchart showing how voice guidance is performed in the basic form.
FIG. 5 is a diagram showing a route and the range of the intersection name group in the basic form.
FIG. 6 is a diagram showing a determined route and a range of intersection name groups in consideration of a traffic situation database.
FIG. 7 is a diagram showing a route and a range of intersection name groups in voice guidance when a mobile terminal device is used.
FIG. 8 is a block diagram showing the configuration of the mobile terminal device in the embodiment.
FIG. 9 is a flowchart showing the operation of voice data acquisition in the embodiment.
FIG. 10 is a flowchart showing the operation of word deletion in the embodiment.
FIG. 11 is a diagram illustrating an image in which a mobile terminal device is actually mounted on a vehicle.
FIG. 12 is a diagram illustrating a configuration of a wireless communication device.
[Explanation of symbols]
2 navigation device, 4 navigation ECU, 6 modem, 8 audio signal output device, 10 map database, 12 GPS device, 14 audio data storage unit, 16 operation unit, 18 display unit, 20 wireless communication device, 22 speaker, 30 center, 50 nodes, 52 routes, 54 intersections, 70 intersection name extraction ranges.

Claims

A mobile terminal device comprising:
data acquisition means for acquiring voice data for voice output from an external database via a network;
voice output means for outputting voice including words corresponding to the voice data; and
speech synthesis means for synthesizing speech from text data and outputting speech for words expressed by text data,
wherein, when the frequency of use of a word output as speech synthesized by the speech synthesis means exceeds a predetermined value, voice data for the word is acquired from the outside by the data acquisition means.

The device of claim 1,
further comprising storage means for storing voice data for frequently used words among the words to be processed for voice output.

The device of claim 2,
wherein the storage means stores the voice data for a word acquired by the data acquisition means when the usage frequency of the word becomes equal to or greater than a predetermined frequency.

The device of claim 3,
wherein, for a word to be processed for voice output whose voice data is stored in the storage means, the voice output means outputs voice using the stored voice data, and for a word whose voice data is not stored, speech is synthesized from the text data by the speech synthesis means and output as voice.

A medium on which a voice output program for voice output by a mobile terminal device is recorded,
the voice output program causing the mobile terminal device to:
acquire voice data for voice output from an external database via a network;
output voice including words corresponding to the voice data; and
synthesize speech from text data for words expressed by text data,
wherein, when the use frequency of a word output by speech synthesis from the text data exceeds a predetermined value, voice data for the word is acquired from the outside.

The medium of claim 5,
wherein the voice output program causes the mobile terminal device to
store voice data for frequently used words among the words to be processed for voice output.

The medium of claim 6,
wherein the voice output program causes the mobile terminal device to
accumulate voice data for an acquired word when the use frequency of the word becomes equal to or greater than a predetermined frequency.

The medium of claim 7,
wherein the voice output program causes the mobile terminal device to:
output voice using the accumulated voice data for a word, among the words to be processed for voice output, whose voice data is accumulated; and
synthesize speech from the text data and output it as voice for a word whose voice data is not accumulated.