JPH07210194A

JPH07210194A - Device for outputting sound

Info

Publication number: JPH07210194A
Application number: JP6003398A
Authority: JP
Inventors: Yuki Inoue; 由紀井上; Shunichi Yajima; 俊一矢島; Takashi Endo; 隆遠藤; Nobuo Hataoka; 信夫畑岡; Shigeru Kakumoto; 繁角本; Tomoko Hatakeyama; 朋子畠山
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1994-01-18
Filing date: 1994-01-18
Publication date: 1995-08-11

Abstract

PURPOSE:To accelerate a process time until an sound output of recorded and reproduced sound and to perform smoothly the timing control of the sound output between the sound recorded and reproduced and a rule synthesis sound. CONSTITUTION:A sound output device is provided with an output response part generating a required output response code from output information from a position senser and a path search part, and is connected to a device 101 receiving required information and an output device 102 such as a speaker, etc., and the main body is constituted of a control part 103 controlling the whole device, a large scale sound recording file 104, a high speed access memory 105 beforehand loading the sound data of important words from the large scale sound recording file 104, a high speed memory control part 106, a sound recording/reproducing part 107, a rule synthesis part 108, a phoneme element memory 109 and a connection output part 110 outputting the sound.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、各種の音声ガイダンス
等に用いられる、音声合成による音声出力装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice output device by voice synthesis, which is used for various voice guidances and the like.

【０００２】[0002]

【従来の技術】入力装置から文字列を入力すると、その
文字列が主要な文字列であれば、音声データが格納され
ているメモリより文字列に対応する音声データを読み出
し、その音声データをもとにＰＣＭ方式によって音声合
成する。音声データがあらかじめメモリに格納されてい
ない文字列が入力された場合は、言語解析をしてその文
字列の読み等の音韻情報と単語間のポーズ及びアクセン
ト等の韻律情報を得て、それらの韻律情報に基づいて規
則合成を行い、入力された文字列に対応する音声を出力
するという音声出力装置がある。2. Description of the Related Art When a character string is input from an input device, if the character string is a main character string, the voice data corresponding to the character string is read from a memory storing the voice data, and the voice data is also stored. And PCM are used for voice synthesis. When a character string whose voice data is not stored in the memory in advance is input, linguistic analysis is performed to obtain phonological information such as reading of the character string and prosodic information such as pauses and accents between words, and There is a voice output device that performs rule synthesis based on prosody information and outputs a voice corresponding to an input character string.

【０００３】[0003]

【発明が解決しようとする課題】しかし、音声出力すべ
き情報に対して、各々音声データの検索が行われるが、
規則合成音の音声出力までにかかる時間と録音再生音の
検索・音声出力までにかかる時間を比較すると差がで
き、音声出力のタイミング制御に不具合が生じるという
問題点があった。However, although the voice data is searched for the information to be voice output,
There is a problem in that there is a difference between the time required to output the voice of the regular synthesized voice and the time required to search and output the voice of the recording / playback sound, which causes a problem in the timing control of the voice output.

【０００４】さらに、規則合成によって生成した音声と
録音再生音とでは声の高低に関する情報、いわゆる、ピ
ッチ周波数に差があり、両方式によって生成した音声を
接続し文章音声として出力する場合には、音声の品質が
劣化するという問題点があった。Further, there is a difference in pitch information between the voice generated by rule synthesis and the recorded and reproduced sound, that is, the pitch frequency, and when the voices generated by both formulas are connected and output as a sentence voice, There is a problem that the quality of voice deteriorates.

【０００５】[0005]

【課題を解決するための手段】本発明は、音声出力指令
に対し音声出力を速やかに行うことを第一の目的とし、
その具体的手段として大規模音声データの他に高速アク
セスメモリを用意し、音声出力に使用する可能性の高い
重要音声データを高速アクセスメモリ中に転送し保持す
る手段を設け、高速アクセスメモリ中にある音声は録音
再生で出力し、その他の音声は規則合成で出力する。SUMMARY OF THE INVENTION The first object of the present invention is to quickly output a voice in response to a voice output command.
As a concrete means, a high-speed access memory is prepared in addition to large-scale audio data, and a means for transferring and holding important audio data that is highly likely to be used for audio output in the high-speed access memory is provided. Some voices are output by recording and playback, and other voices are output by rule synthesis.

【０００６】さらに、高品質な音声を出力することを第
二の目的とし、ピッチ，パワー，音韻継続時間長等の調
整手段を設け、自然な文章音声を出力する。Further, the second purpose is to output a high quality voice, and a means for adjusting pitch, power, phoneme duration etc. is provided to output a natural sentence voice.

【０００７】[0007]

【作用】上記の音声出力装置において、地図上の位置等
の情報が入力されると高速アクセスメモリ中で、入力情
報に対応する音声データの有無を検索し、音声データが
ある場合には、録音再生部において音声データの再生を
行い、高速アクセスメモリ中に音声データがない場合に
は、規則合成部において、音素となる素片データを音素
片メモリより読み出し規則合成を行う。さらに、接続出
力部において、以上のようにして生成された各区分音声
を読み出しかつ接続して、出力装置より音声出力する。In the above voice output device, when the information such as the position on the map is input, the high speed access memory is searched for the voice data corresponding to the input information, and if the voice data is present, the recording is performed. The reproduction unit reproduces the voice data, and when there is no voice data in the high-speed access memory, the rule synthesis unit reads out the phoneme unit data from the phoneme unit memory and performs rule composition. Further, the connection output unit reads out and connects each of the divided sounds generated as described above, and outputs the sound from the output device.

【０００８】[0008]

【実施例】図１は本発明の一実施例を示す車載ナビゲー
ションシステム用録音再生・規則合成併用型音声出力装
置のシステムのブロック図である。この音声出力装置
は、車載ナビゲーション装置中の位置センサと経路探索
部との出力情報から、必要な出力応答コードを発生する
出力応答部があり、出力応答部より出力した文字コード
からなる音声ナビゲーションに必要な情報を受信する装
置１０１と、本体と、音声を出力するためのスピーカ等
の出力装置１０２が接続されている。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a block diagram of a system of an audio output apparatus for recording / playback / rule synthesis combined use for an in-vehicle navigation system showing an embodiment of the present invention. This voice output device has an output response unit that generates a necessary output response code from the output information of the position sensor and the route search unit in the vehicle-mounted navigation device, and the voice navigation is composed of the character code output from the output response unit. A device 101 for receiving necessary information, a main body, and an output device 102 such as a speaker for outputting voice are connected.

【０００９】本体は、この装置全体の制御を行う制御部
１０３と、地名・交差点名を含む音声ナビゲーションに
必要な文章音声を複数区分に分割して記憶しておく大規
模録音ファイル１０４と、大規模録音ファイル１０４の
中から重要語の音声データをあらかじめロードする高速
アクセスメモリ１０５と、大規模録音ファイルから高速
アクセスメモリへ音声データの転送を随時行う高速メモ
リ制御部１０６と、高速アクセスメモリ中の音声を再生
する録音再生部１０７と、高速アクセスメモリ中に存在
しない音声を生成出力する規則合成部１０８と、規則合
成音生成時に必要な音声の素片データを格納する音素片
メモリ１０９と、各区分音声を読み出し、接続して音声
を出力する接続出力部１１０からなる。The main body includes a control unit 103 for controlling the entire apparatus, a large-scale recording file 104 for storing sentence voices including a place name and an intersection name necessary for voice navigation in a plurality of divisions, and a large-scale recording file 104. A high-speed access memory 105 that pre-loads voice data of important words from the large-scale recording file 104, a high-speed memory control unit 106 that transfers voice data from the large-scale recording file to the high-speed access memory at any time, and a high-speed access memory A recording / reproducing unit 107 for reproducing a voice, a rule synthesizing unit 108 for generating and outputting a voice that does not exist in the high-speed access memory, a phoneme unit memory 109 for storing voice unit data necessary for generating a ruled synthesized sound, It is composed of a connection output unit 110 that reads out segmented voices, connects them, and outputs voices.

【００１０】図２に示すフローチャートによって、図１
に示した本実施例の車載ナビゲーションシステム用録音
再生・規則合成併用型音声出力装置の動作のあらましを
説明する。According to the flow chart shown in FIG.
The outline of the operation of the recording / playback / rule synthesis combined type voice output device for the vehicle-mounted navigation system shown in FIG.

【００１１】車載ナビゲーション装置において音声出力
指令が出されると、車載ナビゲーション装置中の位置セ
ンサ等から、音声によるナビゲーションを行うために必
要な位置情報等の文字コードを１０１の文字コード受信
装置に送信する。文字コード受信装置１０１から制御部
１０３に文字コードが出力され、文字コードに対応する
音声が高速アクセスメモリ１０５に格納されているかど
うかを検索する。When a voice output command is issued in the vehicle-mounted navigation device, a position sensor or the like in the vehicle-mounted navigation device transmits a character code such as position information necessary for performing navigation by voice to the character code receiving device 101. . A character code is output from the character code receiving device 101 to the control unit 103, and it is searched whether a voice corresponding to the character code is stored in the high speed access memory 105.

【００１２】文字コードに対応する音声が高速アクセス
メモリ１０５に格納されていれば、その音声を録音再生
部１０７で再生する。一方、出力された文字コードに対
応する音声が高速アクセスメモリ１０５に格納されてい
なければ、まず、制御部103で文字コードが音韻・アク
セント情報に変換され、規則合成部１０８においてそれ
らの情報に基づき音素となる素片データを音素片メモリ
１０９より読み出し規則合成を行う。なお、すべての文
字コードに対応する文字情報・アクセント情報は高速ア
クセスメモリ１０５中に記憶されている。さらに、接続
出力部１１０において、以上のようにして生成された各
区分音声を読み出しかつ接続して、出力装置より音声出
力する。If the voice corresponding to the character code is stored in the high speed access memory 105, the voice is reproduced by the recording / reproducing unit 107. On the other hand, if the voice corresponding to the output character code is not stored in the high-speed access memory 105, first, the control unit 103 converts the character code into phoneme / accent information, and the rule synthesizing unit 108 based on the information. The phoneme unit data that is a phoneme is read from the phoneme unit memory 109 and rule synthesis is performed. Character information / accent information corresponding to all character codes is stored in the high-speed access memory 105. Further, the connection output unit 110 reads out and connects the respective divided sounds generated as described above, and outputs the sounds from the output device.

【００１３】次に、大規模録音ファイル１０４から高速
アクセスメモリ１０５へ音声データを転送する制御につ
いて説明する。図３に大規模録音ファイルにおけるデー
タ構造の一実施例を示す。車載ナビゲーションにおける
音声ガイダンスの文章には、交差点名・地名等の地理情
報を多く含むため、このような音声データは、地図上の
位置等を示す文字コードとともに格納する。さらに、こ
の大規模録音ファイル中の音声データについて、地図上
の位置等を示す文字コードをもとに音声データが数個ず
つ含まれるようにエリアを分割し、各ブロックごとに音
声データを格納しておく。大規模録音ファイルから高速
アクセスメモリへの音声データの転送は、このブロック
ごとに行う。Next, control for transferring audio data from the large-scale recording file 104 to the high-speed access memory 105 will be described. FIG. 3 shows an example of the data structure of a large-scale recording file. Since the text of the voice guidance in the vehicle-mounted navigation includes a lot of geographical information such as intersection names and place names, such voice data is stored together with the character code indicating the position on the map. Furthermore, regarding the audio data in this large-scale recording file, the area is divided so that several audio data are included based on the character code indicating the position on the map, etc., and the audio data is stored for each block. Keep it. Transfer of audio data from a large-scale recording file to the high-speed access memory is performed for each block.

【００１４】次に、大規模録音ファイル１０４から高速
アクセスメモリ１０５へ転送する重要な音声データを選
択する制御について説明する。図４は、車載ナビゲーシ
ョンで用いる地図と大規模録音ファイル中の音声データ
のブロックとの対応を示している。地図上の点線で区切
られた各々のエリアは大規模録音ファイルで音声データ
を分割・格納したブロックに相当する。転送するブロッ
クの選択は、現在地，目的地，走行中の道路の方向等の
条件から決定する。原則として、現在地を含むブロック
を囲む八つのブロックの音声データは高速アクセスメモ
リヘ転送する。図４の場合を例として、転送するブロッ
クの選択例を図５のフローチャートに従って説明する。
現在地のあるブロックはＣ４である。つまり、Ｃ４を囲
む八つのブロックの音声データは高速アクセスメモリ中
に転送が行われている。次に、道路の方向，目的地等の
条件から、現在地点から移動可能なブロックは、Ｃ３，
Ｄ３と限定でき、例えば、ナビゲーションシステムの推
奨経路にしたがい車が走行すると、転送するブロックは
Ｃ２，Ｄ２，Ｅ２，Ｅ３，Ｅ４の五つが選択される。Next, control for selecting important voice data to be transferred from the large-scale recording file 104 to the high speed access memory 105 will be described. FIG. 4 shows the correspondence between a map used for in-vehicle navigation and blocks of audio data in a large-scale recording file. Each area separated by a dotted line on the map corresponds to a block in which audio data is divided and stored in a large-scale recording file. The selection of blocks to be transferred is determined based on conditions such as the current location, the destination, and the direction of the road on which the vehicle is running. In principle, the audio data of eight blocks surrounding the block including the current location are transferred to the high speed access memory. Taking the case of FIG. 4 as an example, an example of selecting blocks to be transferred will be described with reference to the flowchart of FIG.
The block where the current position is is C4. That is, the audio data of the eight blocks surrounding C4 are transferred to the high-speed access memory. Next, based on conditions such as the direction of the road and the destination, the block that can be moved from the current position is C3.
It can be limited to D3. For example, when the vehicle travels according to the recommended route of the navigation system, five blocks C2, D2, E2, E3 and E4 are selected as blocks to be transferred.

【００１５】また、車が走行中の道路をそのまま直進し
た場合は、Ｂ２，Ｃ２，Ｄ２の三つのブロックを転送す
る。When the vehicle goes straight on the road on which the vehicle is running, three blocks B2, C2 and D2 are transferred.

【００１６】この処理と並行して、高速アクセスメモリ
中の不要な音声データの選択もブロックごとに行われ
る。不要な音声データのブロックは現在地，目的地，走
行中の道路の方向等の条件から判断される。In parallel with this processing, unnecessary voice data in the high speed access memory is also selected for each block. Blocks of unnecessary audio data are judged based on conditions such as the current location, the destination, and the direction of the road on which the vehicle is running.

【００１７】次に高速アクセスメモリ中のデータ構造の
一実施例を図６に示す。高速アクセスメモリ中には、地
図上の位置を示す文字コードに対応する交差点名，地名
が音韻アクセント情報等とともに格納されている。大規
模録音ファイルから音声データが転送されると、該当す
る位置コードの音声データエリアにそれぞれ格納され
る。また、車載ナビゲーションシステムにおいて、頻度
高く用いられる定型文、例えば、「次の・・・・交差点
を右折して下さい。」などは、録音音声を複数区分に分
割して、あらかじめ高速アクセスメモリ中に格納する。
このような定型文の音声データに関しては、各区分音声
ごとにピッチ，パワー，音韻継続時間長の値が格納され
ている。図７に高速アクセスメモリ中に格納された頻度
の高い音声データに関するデータ構造の一実施例を示
す。Next, an embodiment of the data structure in the high speed access memory is shown in FIG. In the high-speed access memory, intersection names and place names corresponding to character codes indicating positions on the map are stored together with phonological accent information and the like. When the voice data is transferred from the large-scale recording file, it is stored in the voice data area of the corresponding position code. In addition, for fixed phrases that are frequently used in in-vehicle navigation systems, such as "Next ... Turn right at the intersection.", The recorded voice is divided into multiple sections and stored in the high-speed access memory beforehand. Store.
Regarding the voice data of such a fixed sentence, the values of pitch, power, and phoneme duration are stored for each segmented voice. FIG. 7 shows an embodiment of a data structure relating to audio data that is frequently stored in the high speed access memory.

【００１８】次に、各区分音声を滑らかに接続する制御
について説明する。出力する文章に定型文を用いる場
合、上記の例で破線で示した区分音声（以下、可変部音
声と呼ぶ）と可変部音声に隣接する定型文の区分音声と
の接続を滑らかにすると、自然な文章音声が出力され
る。そこで、接続出力部では隣接する定型文の区分音声
のピッチ，パワー，音韻継続時間長等の情報をもとに可
変部音声のピッチ，パワー，音韻継続時間長等を調節
し、接続処理を行い音声出力する。図８は、上記接続処
理の処理概念を表す図である。Next, control for smoothly connecting the divided voices will be described. When using a standard sentence as the output sentence, smoothing the connection between the segmented voice (hereinafter referred to as the variable part voice) indicated by the broken line in the above example and the segmental voice of the fixed phrase adjacent to the variable part voice is natural. The sentence sound is output. Therefore, the connection output unit adjusts the pitch, power, phoneme duration, etc. of the variable part voice based on the information such as the pitch, power, phoneme duration, etc. of the segmented speech of the adjacent fixed sentence, and performs connection processing. Output audio. FIG. 8 is a diagram showing a processing concept of the connection processing.

【００１９】また、可変部音声を規則合成部で生成する
場合には、音韻アクセント情報と隣接する定型文の区分
音声のピッチ，パワー，音韻継続時間長等の情報をもと
にあらかじめ規則合成してもよい。When the variable part speech is generated by the rule synthesizing part, the rule synthesizing is carried out beforehand based on the information such as the phoneme accent information and the pitch, power, and phoneme duration of the segmented speech of the adjacent fixed sentence. May be.

【００２０】[0020]

【発明の効果】本発明によれば、出力すべき録音データ
は高速アクセスメモリ中に格納するため、録音再生音の
音声出力までの処理時間は高速化され、録音再生音と規
則合成音との音声出力のタイミング制御が円滑に行える
ようになる。録音再生用の音声データは大量の記憶容量
を必要とするため、本発明による処理の高速化は著し
い。さらに、音声データに対してピッチ，パワー，音韻
継続時間長の値の調整を行い、音声データを接続するた
め、高品質な文章音声が提供される。According to the present invention, since the recording data to be output is stored in the high speed access memory, the processing time until the voice output of the recording / reproducing sound is accelerated, and the recording / reproducing sound and the regular synthesized sound are combined. The audio output timing control can be smoothly performed. Since the voice data for recording and reproduction requires a large storage capacity, the speed of the processing according to the present invention is remarkable. Furthermore, since the pitch, power, and phoneme duration length values are adjusted for the voice data and the voice data is connected, a high-quality text voice is provided.

[Brief description of drawings]

【図１】本発明の一実施例を示す車載ナビゲーションシ
ステム用録音再生・規則合成併用型音声出力装置のシス
テムのブロック図。FIG. 1 is a block diagram of a system of an audio output device for recording / playback / rule synthesis combined use for an in-vehicle navigation system showing an embodiment of the present invention.

【図２】本発明の一実施例を示す車載ナビゲーションシ
ステム用録音再生・規則合成併用型音声出力装置の動作
のフローチャート。FIG. 2 is a flow chart of the operation of the recording / playback / rule synthesis combined type voice output device for an in-vehicle navigation system showing an embodiment of the present invention.

【図３】本発明の一実施例を示す車載ナビゲーションシ
ステム用大規模録音ファイルのデータ構造の説明図。FIG. 3 is an explanatory diagram of a data structure of a large-scale recording file for an in-vehicle navigation system showing an embodiment of the present invention.

【図４】本発明の一実施例を示す車載ナビゲーション用
地図と大規模録音ファイルの音声データのブロックとの
対応を示す説明図。FIG. 4 is an explanatory diagram showing the correspondence between a vehicle-mounted navigation map and a block of audio data of a large-scale recording file showing an embodiment of the present invention.

【図５】本発明の大規模録音ファイルから高速アクセス
メモリへ転送する音声データのブロックの選択の一実施
例のフローチャート。FIG. 5 is a flowchart of an embodiment of selecting a block of audio data to be transferred from a large-scale recording file to a high-speed access memory according to the present invention.

【図６】本発明の車載ナビゲーション用高速アクセスメ
モリ中のデータ構造の一実施例を示す説明図。FIG. 6 is an explanatory diagram showing an example of a data structure in a vehicle-mounted navigation high-speed access memory according to the present invention.

【図７】本発明の車載ナビゲーション用高速アクセスメ
モリ中に格納された頻度の高い音声データに関するデー
タ構造の一実施例を示す説明図。FIG. 7 is an explanatory diagram showing an embodiment of a data structure relating to frequently-used voice data stored in a vehicle-mounted navigation high-speed access memory according to the present invention.

【図８】本発明の一実施例を示す車載ナビゲーション装
置用音声出力装置の接続処理の処理概念を表す説明図。FIG. 8 is an explanatory diagram showing a processing concept of a connection processing of an audio output device for a vehicle-mounted navigation device showing an embodiment of the present invention.

[Explanation of symbols]

１０１…文字コード受信装置、１０２…音声出力装置、
１０３…制御部、104…大規模録音ファイル、１０５…
高速アクセスメモリ、１０６…高速アクセスメモリ制御
部、１０７…録音再生部、１０８…規則合成部、１０９
…音素片メモリ、１１０…接続出力部。101 ... Character code receiving device, 102 ... Voice output device,
103 ... Control unit, 104 ... Large-scale recording file, 105 ...
High-speed access memory, 106 ... High-speed access memory control unit, 107 ... Recording / playback unit, 108 ... Rule synthesis unit, 109
... phoneme unit memory, 110 ... connection output unit.

フロントページの続き (72)発明者畑岡信夫東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者角本繁東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内 (72)発明者畠山朋子東京都国分寺市東恋ケ窪１丁目280番地株式会社日立製作所中央研究所内Front page continuation (72) Inventor Nobuo Hataoka 1-280 Higashi Koigokubo, Kokubunji, Tokyo Inside Hitachi Central Research Laboratory (72) Inventor Shigeru Kakumoto 1-280 Higashi Koikeku, Kokubunji, Tokyo Hitachi Central Research Co., Ltd. In-house (72) Inventor Tomoko Hatakeyama 1-280 Higashi-Kengikubo, Kokubunji-shi, Tokyo Inside Central Research Laboratory, Hitachi, Ltd.

Claims

[Claims]

1. A large amount of text voice is divided into a plurality of sections in units of phrases, and waveform data storage means for storing waveform data corresponding to the text voice, and section voice waveform data is selected from the waveform data storage means. And a high-speed access memory for storing the waveform data of the segmented voice selected from the waveform data storage unit, a unit for reading out each segmented voice from the high-speed access memory and reproducing and outputting the recorded voice, and a character code string. In the device which outputs the sentence voice by connecting the respective segmented voices generated by the means for performing the regular voice synthesis for generating the voice waveform signal from the sentence voice, the segmented waveform data of the sentence voice to be output exists in the high-speed access memory. Means for determining whether or not the section waveform data does not exist in the high-speed access memory, and a sentence corresponding to the section A voice output device comprising means for performing rule synthesis from a character code string to obtain segmented voice waveform data.

2. The means for transferring the segmented speech waveform data selected from the waveform data storage means to the high speed access memory, and the segmented speech waveform data transferred from the waveform data storage means at the high speed according to claim 1. Means for accumulating in the access memory, and means for storing the number of times of reading of the divided voice waveform data and the storage time in the high speed access memory as the usage frequency information of each divided voice waveform data transferred to the high speed access memory. The means for determining the necessity / non-necessity of the segmented voice waveform data in the high-speed access memory based on the usage frequency information of the respective segmented voice waveform data transferred to the high-speed access memory, and the voice waveform data determined to be unnecessary are And a means for deleting from the high speed access memory.

3. A large amount of text speech is divided into a plurality of sections in units such as clauses, and characteristic parameter storage means for storing characteristic parameters of speech of LPC parameters corresponding to the text speech, and classification by the characteristic parameter storage means. And a high-speed access memory for storing the feature parameters of the segmented voice selected from the feature parameter storage unit, each segmented voice is read from the high-speed access memory, and the recorded voice is reproduced and output. In the device which outputs the sentence voice by connecting the respective segmented voices generated by the means and the means for performing the regular voice synthesis for generating the voice waveform signal from the character code string, the characteristic parameter of the segmented voice of the sentence voice to be output is The means for determining whether or not it exists in the high speed access memory, and A voice output device comprising means for obtaining a feature parameter of a segmented voice by performing rule synthesis from a character code string corresponding to the segment when it does not exist in the high speed access memory.

4. The means for transferring the feature parameter of the segmented voice selected from the voice feature parameter storage means to the high speed access memory, and the feature parameter of the segmented voice transferred from the waveform data storage means according to claim 3. Means for accumulating in the high-speed access memory, and the frequency of reading the characteristic parameters of the segmental voice and the storage time in the high-speed access memory as usage frequency information of the characteristic parameters of each segmental voice transferred to the high-speed access memory. Means for storing, means for judging the necessity / non-necessity of the characteristic parameter of the divided voice in the high-speed access memory based on the use frequency information of the characteristic parameter of each divided voice transferred to the high-speed access memory And a means for deleting from the high speed access memory. An audio output device, wherein the door.

5. The voice segment according to claim 1, 2, 3 or 4, which is frequently used, is a pitch value together with voice data.
Means for storing prosody information such as power value and phoneme duration in the high-speed access memory;
A voice output device comprising means for adjusting the values of pitch, power, and phoneme time length, and connecting the voice generated by rule synthesis or recording / playback to output voice.

6. The vehicle-mounted navigation device according to claim 5, which has a means for detecting a vehicle position, a means for searching for a route, and a means for guiding a route, and uses voice as information presenting means such as route information.

7. An area on a map is divided into several parts and defined as a block, and a position on the map where voice data is output is defined as a position code, and a position code in the same block is defined in claim 6. An in-vehicle voice navigation device that stores voice data corresponding to the position code and the block code as one set.

8. The vehicle-mounted voice navigation device according to claim 7, wherein the means for storing in the high-speed access memory is a block unit based on the information of the current position.