JPH09134195A

JPH09134195A - Speech synthesizer

Info

Publication number: JPH09134195A
Application number: JP7290098A
Authority: JP
Inventors: Hiroyuki Fujimoto; 博之藤本; Toshitaka Yamato; 俊孝大和; Osamu Ishikawa; 修石川
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 1995-11-08
Filing date: 1995-11-08
Publication date: 1997-05-20
Anticipated expiration: 2015-11-08
Also published as: JP3171775B2

Abstract

PROBLEM TO BE SOLVED: To obtain a speech synthesizer capable of decreasing a processing amt. and reducing a cost by forming the character strings of reading with accent and phoneme symbols from the sentence examples selected from a sentence example section and the character code strings of the proper nouns outputted from a KANA (Japanese syllabary) reading conversion section. SOLUTION: The sentences of guide information are outputted dividedly to the fixed plural guide commands and the character code strings of the variable proper nouns and figures to the speech synthesizer from a navigation ECU 11. The character table section 2 in the speech synthesizer stores the plural fixed sentence examples in response to the guide commands from the navigation ECU 11 and outputs the same. The KANA (Japanese syllabary) reading conversion section 3 forms variable reading sentence examples in accordance with the proper nouns, figures and code strings. A section 4 for forming the reading characteristic strings with the accent and phonemes imparts the accents and phonemes to the variable reading sentence examples from the reading conversion section 3 and synthesizes these sentence examples with the fixed sentence examples from the sentence example table section 2. As a result, the speech guide including the arbitrary place-names and figures is embodied with a simple constitution.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、テキスト音声合成
装置に関し、特に限られた文例を用い、固有名詞、数字
等のみを可変とすることで音声情報を提供することがで
きる音声合成装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a text-to-speech synthesizer, and more particularly to a voice-synthesizer capable of providing voice information by making only proper nouns and numbers variable by using limited sentence examples.

【０００２】[0002]

【従来の技術】従来のテキスト音声合成装置では、言語
処理部を設け、入力テキストの文法的解釈、意味理解等
を行い、アクセント、イントネーションを付与してい
る。2. Description of the Related Art In a conventional text-to-speech synthesizer, a language processing unit is provided to perform grammatical interpretation of input text, understand meaning, etc., and add accents and intonations.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、従来の
テキスト音声合成装置では、言語処理部のアルゴリズム
が複雑で処理量が多いため、パーソナルコンピュータ上
で動作するシステム等で採用されているが、コストの低
廉なシステムへの搭載は、現実的には困難であるとの問
題点がある。However, in the conventional text-to-speech synthesizer, since the algorithm of the language processing unit is complicated and the processing amount is large, it is adopted in a system or the like operating on a personal computer. There is a problem that it is practically difficult to mount the system on an inexpensive system.

【０００４】したがって、本発明は、上記問題点に鑑
み、処理量を少なくして低廉にできる音声合成装置を提
供することを目的とする。Therefore, in view of the above problems, it is an object of the present invention to provide a speech synthesizer which can reduce the amount of processing and can be manufactured at low cost.

【０００５】[0005]

【課題を解決するための手段】本発明は、前記問題点を
解決するために、次の構成を有する音声合成装置を提供
する。すなわち、車両に搭載されるナビゲーションから
の文字案内情報を音声合成する音声合成装置に、前記文
字案内情報を、固有名詞、数字コード列とする可変情報
と、案内指令として固定文例（番号）情報とに分けて出
力するナビゲーションＥＣＵと、複数の前記固定情報を
文例としてテーブル化して記憶して、前記案内指令に対
応する文例を出力する文例テーブル部と、前記可変情報
の固有名詞の文字コード列をかな読み文字列に変換する
かな読み変換部と、前記文例テーブル部から選択された
文例と、かな読み変化部から出力される固有名詞の文字
コード列からアクセント、韻律記号付の読み文字列を生
成するアクセント韻律付読み文字列生成部とが設けられ
る。この装置により、任意の地名を含む音声案内が簡単
な構成で実現できる。In order to solve the above problems, the present invention provides a speech synthesizer having the following configuration. That is, in a voice synthesizing device for voice synthesizing character guidance information from a navigation installed in a vehicle, variable information having the character guidance information as a proper noun and a numerical code string and fixed sentence example (number) information as a guidance command are provided. A navigation ECU which outputs the fixed information as a sentence example, and a sentence example table section which outputs a sentence example corresponding to the guidance command and a character code string of a proper noun of the variable information. A kana-yomi conversion unit for converting into a kana-yomi character string, a sentence example selected from the sentence example table part, and a character string of a proper noun output from the kana-yomi changing unit to generate a reading character string with accents and prosodic symbols. And a reading character string generating section with accent prosody. With this device, voice guidance including an arbitrary place name can be realized with a simple configuration.

【０００６】前記かな読み変換部は、さらに数字の文字
コード列を読み文字列に変換し、前記アクセント韻律付
読み文字列生成部は、さらに前記文例テーブル部から選
択された文例と、かな読み変換部から出力される数字の
文字コード列から、アクセント、韻律記号付読み文字列
を生成してもよい。この装置により、任意の数字を含む
音声案内が簡単な構成で実現できる。The Kana-yomi conversion unit further converts a numerical character code string into a reading character string, and the accent-prosodic reading character string generation unit further converts the sentence example selected from the sentence example table unit and the Kana-Yomi conversion unit. A reading character string with accent and prosody symbols may be generated from a character code string of numbers output from the department. With this device, voice guidance including an arbitrary number can be realized with a simple configuration.

【０００７】前記アクセント韻律付読み文字列生成部
は、前記かな読み変換部から出力される固有名詞のモー
ラ長に依存して固有名詞と前記文例テーブル部から選択
された文例との間のポーズ長を変化させて合成を行うよ
うにしてもよい。この手段により、聞きやすい合成音の
提供が可能になる。前記アクセント韻律付読み文字列生
成部は、前記かな読み変換部から出力される固有名詞に
対して後続する前記文例テーブル部から選択された文例
が、「付近」のように固有名詞のアクセント側をＯ型に
する作用のある接尾語で始まる場合に、固有名詞のアク
セント型をＯ型にするようにしてもよい。この手段によ
り自然な合成音を生成できる。The accented prosodic reading character string generation unit depends on the mora length of the proper noun output from the Kana-yomi conversion unit, and the pose length between the proper noun and the sentence example selected from the sentence example table unit. May be changed to perform composition. By this means, it is possible to provide a synthetic sound that is easy to hear. The accented prosodic reading character string generation unit selects a sentence example selected from the sentence example table unit that follows the proper noun output from the Kana-yomi conversion unit, with the accent side of the proper noun such as “near”. The accent type of a proper noun may be set to O type when it starts with a suffix that has the effect of making it O type. By this means, a natural synthetic sound can be generated.

【０００８】前記アクセント韻律付読み文字列生成部
は、前記かな読み変換部から出力される固有名詞に対し
て後続する前記文例テーブル部から選択された文例が、
「付近」のように固有名詞のアクセント側をＯ型にする
作用のある接尾語で始まらない場合に、固有名詞に対し
て語尾から３モーラ目にアクセント核を置くようにして
もよい。この手段により自然な合成音を生成できる。In the accented prosodic reading character string generation unit, the sentence example selected from the sentence example table unit following the proper noun output from the Kana-Yomi conversion unit is
When it does not start with a suffix such as “near” that has the effect of making the accent side of a proper noun into an O type, the accent nucleus may be placed at the third mora from the end of the proper noun. By this means, a natural synthetic sound can be generated.

【０００９】前記固有名詞の語尾から３モーラ目が「長
音」及び「促音」のいずれかの場合、アクセント核の位
置をずらして語尾から４モーラ目にアクセント核を置く
ようにしてもよい。この手段により自然な合成音を生成
できる。前記アクセント韻律付読み文字列生成部は、前
記かな読み変換部から出力される固有名詞のモーラ長が
大きい場合、固有名詞中の所定の切れ目にポーズ長を挿
入するようにしてもよい。When the third mora from the ending of the proper noun is either "long sound" or "prompt", the accent nucleus may be shifted and the accent nucleus may be placed at the fourth mora from the ending. By this means, a natural synthetic sound can be generated. The accented prosodic reading character string generation unit may insert a pause length at a predetermined break in the proper noun when the mora length of the proper noun output from the Kana-yomi conversion unit is large.

【００１０】前記アクセント韻律付読み文字列生成部
は、前記かな読み変換部から出力される数字のコード列
のモーラ長に依存して数字と前記文例テーブル部から選
択された文例と間のポーズ長を変化させて合成を行うよ
うにしてもよい。この手段により自然な合成音を生成で
きる。前記アクセント韻律付読み文字列生成部は、前記
かな読み変換部から出力される数字のコード列に対して
後続する前記文例テーブル部から選択された文例が、
「程度」、「先」のように数字のアクセント側をＯ型に
する作用のある接尾語で始まる場合に、数字列のアクセ
ント型をＯ型にするようにしてもよい。この手段により
自然な合成音を生成できる。The accented prosodic reading character string generation unit depends on the mora length of the code string of the number output from the Kana reading conversion unit, and the pause length between the number and the sentence example selected from the sentence example table unit. May be changed to perform composition. By this means, a natural synthetic sound can be generated. The accented prosodic reading character string generation unit is a sentence example selected from the sentence example table unit following the numerical code string output from the Kana reading conversion unit,
When starting with a suffix that acts to make the accent side of the numbers O-type, such as “degree” and “first”, the accent type of the number string may be O-type. By this means, a natural synthetic sound can be generated.

【００１１】前記アクセント韻律付読み文字列生成部
は、前記かな読み変換部から出力される数字のコード列
に対して後続する前記文例テーブル部から選択された文
例が、「程度」、「先」のように数字のアクセント側を
Ｏ型にする作用のある接尾語で始まらない場合に、数字
のコード列に対して語尾から３モーラ目にアクセント核
を置くようにしてもよい。この手段により自然な合成音
を生成できる。In the accented prosodic reading character string generation unit, the sentence examples selected from the sentence example table unit subsequent to the numerical code string output from the Kana reading conversion unit are "degree" and "first". When the suffix does not start with a suffix that acts to make the accent side of the number O-shaped, the accent nucleus may be placed at the third mora from the end of the code sequence of the number. By this means, a natural synthetic sound can be generated.

【００１２】前記数字のコード列の語尾から３モーラ目
が「長音」及び「促音」のいずれかの場合、アクセント
核の位置をずらして語尾から４モーラ目にアクセント核
を置くようにしてもよい。この手段により自然な合成音
を生成できる。前記数字のコード列が時刻の場合、「０
０時△△分」で示される時刻の「００時」の語尾から３
モーラ目にアクセント核を置き、語尾から３モーラ目が
「長音」及び「促音」のいずれかの場合アクセント核位
置を語尾から４モーラ目にずらすようにしてもよい。こ
の手段により自然な合成音を生成できる。When the third mora from the end of the numerical code sequence is either "long sound" or "consonant", the position of the accent nucleus may be shifted to place the accent nucleus at the fourth mora from the end. . By this means, a natural synthetic sound can be generated. When the code string of the numerical value is time, "0
3 from the ending of "00" at the time indicated by "0:00"
The accent nucleus may be placed in the mora, and the accent nucleus position may be shifted to the fourth mora from the end when the third mora from the end is either "long sound" or "prompt". By this means, a natural synthetic sound can be generated.

【００１３】前記数字のコード列が時刻の場合、「００
時間△△分」で示される時刻の「００時」の語尾から３
モーラ目の「ジ」にアクセント核を置くようにしてもよ
い。この手段により自然な合成音を生成できる。前記数
字のコード列が小数点を含む場合、「００．△△」で示
される「００．」の語尾から３モーラ目にアクセント核
を置き、語尾から３モーラ目が「長音」及び「促音」の
いずれかの場合アクセント核位置を語尾から４モーラ目
にずらすようにしてもよい。この手段により自然な合成
音を生成できる。When the numerical code string is the time, "00"
3 from the ending of "00 hour" at the time indicated by "hour △ minute"
The accent nucleus may be placed on the "ji" of the mora eyes. By this means, a natural synthetic sound can be generated. When the code sequence of the numeral includes a decimal point, an accent nucleus is placed in the third moor from the ending of “00.” indicated by “00.ΔΔ”, and the third mora from the ending is “long sound” and “consonant”. In either case, the accent nucleus position may be shifted to the fourth mora from the end of the word. By this means, a natural synthetic sound can be generated.

【００１４】前記かな読み変換部から出力される固有名
詞、数字のコード列に「が」行音が含まれる場合、これ
を鼻濁化「が」行音にするようにしてもよい。この手段
により自然な合成音を生成できる。前記かな読み変換部
から出力される固有名詞、数字のコード列に同一母音の
連続が含まれる場合、この母音連続を長音化するように
してもよい。この手段により自然な合成音を生成でき
る。When the proper noun / numeral code string output from the kana-yomi conversion unit includes a "ga" line sound, it may be converted to a nasalized "ga" line sound. By this means, a natural synthetic sound can be generated. When the code string of proper nouns and numbers output from the kana-yomi conversion unit includes a continuation of the same vowel, the continuation of the vowel may be extended. By this means, a natural synthetic sound can be generated.

【００１５】前記かな読み変換部から出力される固有名
詞、数字のコード列に「ＥＩ」の母音連続が含まれる場
合、この母音連続を「ＥＥ」の同一母音連続と同様に長
音化するようにしてもよい。前記かな読み変換部から出
力される固有名詞、数字のコード列に「ＯＵ」の母音連
続が含まれる場合、この母音連続を「ＯＯ」の母音連続
と同様に長音化するようにしてもよい。この手段により
自然な合成音を生成できる。このようにして、簡易なア
クセント処理を実施することにより、外部からアクセン
ト、韻律情報をもらわなくともイントネーション、アク
セントを付与した合成音を提供することが可能になる。When the proper noun and numeric code string output from the Kana-yomi conversion unit includes a vowel sequence of "EI", the vowel sequence is made to be long like the same vowel sequence of "EE". May be. When the proper noun / numeric code string output from the kana-yomi conversion unit includes a vowel continuation of "OU", the vowel continuation may be extended as in the case of the vowel continuation of "OO". By this means, a natural synthetic sound can be generated. By performing the simple accent processing in this manner, it is possible to provide a synthetic sound with intonation and accent added without receiving accent or prosody information from the outside.

【００１６】[0016]

【発明の実施の形態】以下本発明の実施の形態について
図面を参照して説明する。図１は本発明に係る音声合成
装置を示す図である。本発明に係る音声合成装置は、車
両に搭載されるナビゲーションシステムに使用されるも
のであり、ナビゲーションシステムは、限られた文例を
用い固有名詞、数字等のみを可変とすることで各種情報
を提供する。本図に示すように、ナビゲーションＥＣＵ
（Electronic Control Unit ）１１からは音声合成装置
に案内情報の文章が、固定の複数の案内指令と、可変の
固有名詞、数字の文字コード列（ＪＩＳ、Ｓ−ＪＩＳ
等）とに分けて出力される。音声合成装置は、ナビゲー
ションＥＣＵ（Electronic Control Unit ）１１からの
案内指令に対応して複数の固定文例を記憶して出力する
文例テーブル２と、ナビゲーションＥＣＵ（Electronic
Control Unit ）１１からの固有名詞、数字、コード列
（ＪＩＳ、Ｓ−ＪＩＳ等）を基に可変読み文例を形成す
るかな読み変換部３と、読み変換部３の可変読み文例に
アクセント、韻律を付与して文例テーブル２からの固定
文例と合成するアクセント韻律付読み文字列生成部４
と、アクセント韻律付読み文字列生成部４からの合成文
例を音響処理する音響処理部５と、音響処理部５へ音素
を提供するための音素辞書部６と、音響処理部５へアク
セントルールを提供するアクセントルール部７と、音響
処理部５へ韻律ルールを提供する韻律ルール部８と、音
響処理部５から処理データをＰＣＭデータに変換するＰ
ＣＭデータ変換部９を具備する。このようにして、任意
の地名、数字を含む音声案内が簡単な構成で実現でき
る。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a diagram showing a speech synthesizer according to the present invention. The speech synthesizer according to the present invention is used for a navigation system mounted on a vehicle, and the navigation system provides various information by changing only proper nouns, numbers, etc. using limited sentence examples. To do. As shown in this figure, the navigation ECU
From (Electronic Control Unit) 11 to the voice synthesizer, a sentence of guidance information is provided with a plurality of fixed guidance commands, a variable proper noun, and a numeric character code string (JIS, S-JIS).
Etc.) and are output separately. The voice synthesizer stores a sentence example table 2 for storing and outputting a plurality of fixed sentence examples in response to a guidance command from a navigation ECU (Electronic Control Unit) 11, and a navigation ECU (Electronic Electronic).
Kana Yomi conversion unit 3 that forms a variable Yomi sentence example based on proper nouns, numbers, and code strings (JIS, S-JIS, etc.) from Control Unit 11 and accents and prosody on the variable Yomi sentence example of Yomi conversion unit 3. An accented prosodic reading character string generation unit 4 that adds and synthesizes a fixed sentence example from the sentence example table 2
, An acoustic processing unit 5 that acoustically processes the synthesized sentence example from the accented prosodic reading character string generation unit 4, a phoneme dictionary unit 6 for providing phonemes to the acoustic processing unit 5, and an accent rule to the acoustic processing unit 5. An accent rule unit 7 to be provided, a prosody rule unit 8 to provide a prosody rule to the sound processing unit 5, and a P to convert processing data from the sound processing unit 5 into PCM data.
A CM data converter 9 is provided. In this way, voice guidance including arbitrary place names and numbers can be realized with a simple configuration.

【００１７】ここに、音声合成装置の文例テーブル部２
は、ナビゲーションＥＣＵ１から入力した案内指令に対
して文例番号をパラメータとして、下表に示すように、
読み文字列にアクセント・韻律記号を付した複数の固定
された読み文字例を有する。Here, the sentence example table section 2 of the speech synthesizer is used.
Uses the sentence number as a parameter for the guidance command input from the navigation ECU 1, as shown in the table below.
It has a plurality of fixed reading character examples in which a reading character string is provided with accent and prosody symbols.

【００１８】[0018]

【表１】 [Table 1]

【００１９】なお、「／」はモーラ境界位置を示し、
「ａ（丸）」はアクセント核位置を示し、「，」はフレ
ーズ終端位置を「．」は文末を示す。これらのそれぞれ
の位置にはポーズ長が設けられ、「／」、「ａ
（丸）」、「，」、「．」の順に大きくなる。さらに、
かな読み変換部３は、ナビゲーションＥＣＵ１から送ら
れた、可変のかなの固有名詞、数字コード列に対して、
下表に示すように、読み文字列に変換する。"/" Indicates the mora boundary position,
"A (circle)" indicates the accent nucleus position, "," indicates the phrase end position, and "." Indicates the end of the sentence. A pose length is provided at each of these positions, and "/", "a"
(Circle) ”,“, ”and“. ”Increase in that order. further,
The kana-reading conversion section 3 responds to the variable kana proper nouns and numeric code strings sent from the navigation ECU 1.
Convert it to a reading string as shown in the table below.

【００２０】[0020]

【表２】 [Table 2]

【００２１】また、アクセント韻律付読み文字列生成部
４は、文字列テーブル部２から選択される固定の文例
と、かな変換部３から出力される可変読み文字列とを、
下表に示すように、合成する。Further, the accented prosodic reading character string generation unit 4 stores the fixed sentence example selected from the character string table unit 2 and the variable reading character string output from the kana conversion unit 3.
Synthesize as shown in the table below.

【００２２】[0022]

【表３】 [Table 3]

【００２３】この場合、かな変換部３からの可変読み文
字列は１文字〜２０数モーラの範囲にある。「案内地点
は○○付近です」という付近を合成する場合、アクセン
ト韻律付読み文字列生成部４は、「○○」に該当する固
有名詞の小さなモーラ長に対して「○○」と「付近」と
の間に大きなポーズ長を入れてもよい。具体的には、
「津（１モーラ）」のように短い単語ほどポーズ長を大
きくして聞きやすくする。In this case, the variable reading character string from the kana conversion unit 3 is in the range of 1 character to 20 or more mora. When synthesizing the neighborhood "The guidance point is near XX", the accented prosodic reading character string generation unit 4 uses "XX" and "near" for the small mora length of the proper noun corresponding to "XX". A large pause length may be inserted between "". In particular,
The shorter the word, such as "tsu (1 mora)", the larger the pose length to make it easier to hear.

【００２４】なお、時間に関して、「あと、○○分で
す」に対して、「３」と「分」の間に大きなポーズ長を
入れてもよい。さらに、距離に関して、「あと、○○ｋ
ｍです」に対して、「１」と「ｋｍ」との間に大きなポ
ーズ長を入れてもよい。次に、かな読み変換部３から送
られる固有名詞の可変読み文字列にはアクセント情報が
なく、この文字列のモーラ長が大きい場合にはアクセン
トが無いと聞きにくい。そのため、一般的には単語の語
尾から３モーラ目にアクセント核がくることが多いため
ここにアクセントを置くようにしてもよい。例えば、
「イイダバシ（飯田橋）の場合、語尾から３モーラ目の
「ダ」にアクセント核を置くことにより自然な合成音を
生成することが可能になる。さらに、数字コード列に対
して、後続する文例（予め合成装置内の保持したアクセ
ント・韻律記号付文字列）が「程度」、「先」等の数字
列のアクセント型を「Ｏ型」にする作用のある接尾語で
始まらない場合、固有名詞に対して語尾から３モーラ目
にアクセント核を置くようにしてもよい。同様に自然な
合成音を得ることが可能になるためである。Regarding the time, a large pause length may be inserted between "3" and "minutes" for "there is XX minutes". Furthermore, regarding the distance,
A large pause length may be inserted between "1" and "km" for "m." Next, the variable reading character string of the proper noun sent from the kana-yomi conversion unit 3 has no accent information, and when the mora length of this character string is large, it is difficult to hear without accent. Therefore, in general, the accent nucleus is often located at the 3rd mora from the end of the word, so the accent may be placed here. For example,
In the case of "Iidabashi (Iidabashi)", it is possible to generate a natural synthesized sound by placing an accent nucleus at "da" which is the third mora from the ending. Further, the accent type of the numerical string such as “degree” or “preceding” is set to “O type” for the sentence example (character string with accent / prosodic symbol held in advance in the synthesizing device) subsequent to the numerical code string. If it does not start with a suffix having an action, an accent nucleus may be placed at the third mora from the end of the proper noun. This is because it becomes possible to obtain a natural synthesized sound as well.

【００２５】しかし、固定文例の「付近」のような接尾
語が付いた場合、アクセント核の位置が「付近」に置か
れ、「飯田橋」は平坦形（Ｏ型）のアクセント型とな
る。なお、数字コード列に対して、後続する文例（予め
合成装置内の保持したアクセント・韻律記号付文字列）
が「程度」、「先」等の数字列のアクセント型を「Ｏ
型」にする作用のある接尾語で始まる場合、数字列のア
クセント型を「Ｏ型に平坦型」にすることにより自然な
合成音を生成することができる。However, when a suffix such as "near" in the fixed sentence example is added, the position of the accent nucleus is placed in "near", and "Iidabashi" becomes a flat (O type) accent type. An example of a sentence following the numeric code string (character string with accent / prosodic symbol stored in the synthesizer in advance)
Indicates that the accent type of a number string such as "degree" or "destination" is "O"
When starting with a suffix that has the effect of making "type", a natural synthetic sound can be generated by changing the accent type of the number string to "O type and flat type".

【００２６】また、「ハンシンコーソク（阪神高速）」
のように、語尾から３モーラ目が「長音」、「促音」の
場合はアクセント核の位置を語頭方向に１つずらし語尾
から４モーラ目に置く。数字コード列の語尾から３モー
ラ目が「長音」又は「促音」の場合、アクセント核の位
置をずらして語尾から４モーラ目にアクセント核を置く
ようにしてもよい。同様に自然な合成音を得ることが可
能になるからである。以上をまとめると、下表のように
なる。[Hanshin Kosoku (Hanshin Expressway)]
As described above, when the third mora from the end is “long sound” and the “consonant”, the position of the accent nucleus is shifted by one in the beginning direction and placed at the fourth mora from the end. When the third mora from the end of the numerical code string is “long sound” or “consonant”, the position of the accent nucleus may be shifted to place the accent nucleus at the fourth mora from the end. This is because it is possible to obtain a natural synthesized sound as well. The above table is summarized below.

【００２７】[0027]

【表４】 [Table 4]

【００２８】次に、かな変換部３からの可変読み文字列
の固有名詞が「国道２号線」、「銀座４丁目」等のよう
に、モーラ長が長く、固有名詞内の意味に切れ目がある
単語に対しては、この切れ目にポーズ長を、下表のよう
に、挿入し、自然な合成音を生成する。Next, the proper noun of the variable reading character string from the kana conversion unit 3 has a long mora length such as "National Highway No. 2" and "Ginza 4-chome", and the meaning in the proper noun has a break. For words, the pause length is inserted at this break as shown in the table below, and a natural synthesized voice is generated.

【００２９】[0029]

【表５】 [Table 5]

【００３０】次に、アクセント韻律付読み文字列生成部
４では、以下に示すような数字コード列をかな読み変換
３から入力した場合には、語尾から３モーラ目ではな
く、所定の場所にアクセント核を置く。時刻について、
「１０：１０」の場合、『ジュ’ージジュップン』のよ
うに、時間を表す『ジュージ』の語尾から３モーラ目に
アクセント核を置くようにしてもよい。Next, in the accented prosodic reading character string generation unit 4, when the following numerical code string is input from the Kana-Yomi conversion 3, the accent is not given at the third mora from the end but at a predetermined place. Put a nucleus. About the time
In the case of “10:10”, an accent nucleus may be placed at the third mora from the end of “Juji”, which represents time, as in “Ju'uji Jupun”.

【００３１】時間について、「１時間１５分」の場合、
『イチジ’カンジューゴフン』のように、時間を表す
『イチジ’カン』の語尾から３モーラ目にアクセント核
を置くようにしてもよい。少数について、「４．５」の
場合、『ヨ’ンテンゴ』のように、語頭にアクセント核
を置くようにしてもよい。Regarding time, if "1 hour 15 minutes",
An accent nucleus may be placed at the third mora from the end of "Ijijikan", which represents time, such as "Ijijikanjugofun". In the case of "4.5" for a small number, an accent nucleus may be placed at the beginning of the word, as in "Yontengo".

【００３２】次に、アクセント韻律付読み文字列４で
は、かな読み変換３から「が」行音を入力した場合に
は、「が」行音は鼻濁化する確率が非常に高いので、送
られてきた文字列のすべて「が」行音を鼻濁音として処
理する。次に、アクセント韻律付読み文字列４では、か
な読み変換３から送られてきた文字列に同一母音の連続
が含まれる場合には、これを長音化する。例えば、「お
おさか」→『おーさか』のように長音化してもよい。Next, in the accented prosodic reading character string 4, when a "ga" line sound is input from the kana-yomi conversion 3, the "ga" line sound has a very high probability of becoming nasal. All the "ga" line sounds of the received character string are processed as nasal noise. Next, in the accented prosodic reading character string 4, if the character string sent from the Kana-yomi conversion 3 contains a sequence of the same vowels, it is lengthened. For example, it may be possible to lengthen the sound such as “Osaka” → “Osaka”.

【００３３】次に、アクセント韻律付読み文字列４で
は、かな読み変換３から送られてきた文字列に「Ｅ
Ｉ」、「ＯＵ」の連続母音が含まれる場合には、これら
を長音化する。例えば、「めいじ（ＭＥＩＪＩ）」→『めーじ』「どうろ（ＤＯＵＲＯ）」→『どーろ』このようにして、簡易なアクセント処理を実施すること
により、外部からアクセント、韻律情報をもらわなくと
も、イントネーション、アクセントを付与した合成音を
提供することができる。Next, in the accented prosodic reading character string 4, "E" is added to the character string sent from the Kana reading conversion 3.
When the continuous vowels of "I" and "OU" are included, they are lengthened. For example, “Meiji” → “Meiji” “DOURO” → “Doro” In this way, by performing simple accent processing, accent and prosody information is not received from outside. Together, it is possible to provide a synthetic sound with intonation and accent.

[Brief description of the drawings]

【図１】図１は本発明に係る音声合成装置を示す図であ
る。FIG. 1 is a diagram showing a speech synthesizer according to the present invention.

[Explanation of symbols]

１…ナビゲーションＥＣＵ２…文例テーブル部３…かな読み変換部４…アクセント韻律付読み文字列 1 ... Navigation ECU 2 ... Sentence table section 3 ... Kana-reading conversion section 4 ... Accent prosodic reading character string

Claims

[Claims]

1. A voice synthesizing device for synthesizing voice guidance information from a navigation mounted on a vehicle, wherein the text guidance information is variable character string information including proper nouns and numeric code strings, and fixed sentence examples as guidance commands. A navigation ECU (1) that outputs (number) information separately, and a sentence example table unit (2) that outputs a sentence example corresponding to the guidance command by storing a plurality of the fixed information as a sentence example into a table and storing the table. A kana-yomi conversion unit (3) for converting a character code string of the proper noun of the variable information into a kana-yomi character string, a sentence example selected from the sentence example table unit (2), and an output from the kana-reading changing unit (3). And a accented prosodic reading character string generation unit (4) for generating a reading character string with accent and prosody symbols from a character code string of a proper noun. Apparatus.

2. The kana-yomi conversion unit (3) further converts a numeric character code string into a reading character string, and the accented prosodic reading character string generation unit (4) further includes the sentence example table unit (2). 2. The speech according to claim 1, wherein a reading character string with accents and prosodic symbols is generated from a sentence example selected from) and a character code string of numbers output from the kana reading changing unit (3). Synthesizer.

3. The accented prosodic reading character string generation unit (4) depends on the proper noun and the sentence example table unit (2) depending on the mora length of the proper noun output from the Kana reading conversion unit (3). 2. The speech synthesis apparatus according to claim 1, wherein the speech synthesis is performed by changing the pause length between the sentence example selected from and the sentence example.

4. The accented prosodic reading character string generation unit (4) is a sentence example selected from the sentence example table unit (2) following the proper noun output from the Kana reading conversion unit (3). , Begins with a suffix that acts to make the accented side of the proper noun into an O type, such as "nearby",
The speech synthesizer according to claim 1, wherein the accent type of the proper noun is O type.

5. The accented prosodic reading character string generation unit (4) selects a sentence example selected from the sentence example table unit (2) following the proper noun output from the Kana reading conversion unit (3). Is characterized by placing an accent nucleus at the third mora from the end of the proper noun when it does not start with a suffix that acts to make the accent side of the proper noun into an O type, such as "nearby". The speech synthesizer according to claim 1.

6. When the third mora from the ending of the proper noun is either “long sound” or “grandphon”, the accent nucleus is shifted and the accent nucleus is placed at the fourth mora from the ending. The speech synthesizer according to claim 5.

7. The accented prosodic reading character string generation section (4), when the mora length of the proper noun output from the kana-reading conversion section (3) is large, the pause length at a predetermined break in the proper noun. 1 is inserted.
The speech synthesizer according to.

8. The reading character string with accent prosody (4)
Is composed by changing the pause length between the number and the sentence example selected from the sentence example table section (2) depending on the mora length of the numeric code string output from the Kana-yomi conversion unit (3). The speech synthesizing apparatus according to claim 2, wherein

9. The accented prosodic reading character string (4)
Means that the sentence examples selected from the sentence example table unit (2) following the numerical code string output from the Kana-yomi conversion unit (3) are number accents such as “degree” and “preceding”. 3. The speech synthesizer according to claim 2, wherein the accent type of the numeral string is set to O type when it starts with a suffix having an action of making side to O type.

10. The sentence example selected from the sentence example table section (2) of the accented prosodic reading character string (4) following the numeric code string output from the Kana-yomi conversion section (3). Is not beginning with a suffix that has the effect of making the accent side of a number an O-type, such as "degree" or "destination", the 3
The speech synthesis apparatus according to claim 1, wherein an accent nucleus is placed on the mora eyes.

11. When the third mora from the end of the code sequence of the numeral is either “long sound” or “consonant”, the accent nucleus is shifted and the accent nucleus is placed at the fourth mora from the end. The speech synthesizer according to claim 10.

12. When the numerical code string is time,
The accent nucleus is placed at the third mora from the ending of “00” at the time indicated by “00 hour ΔΔ minute”, and the accent nucleus position is set when the third mora from the ending is either “long sound” or “consonant”. 11. The speech synthesis apparatus according to claim 10, wherein the speech is shifted from the end of the word to the fourth mora.

13. When the numerical code string is time,
11. The speech synthesis apparatus according to claim 10, wherein an accent nucleus is placed at "ji", which is the third mora from the ending of "00 hour" at the time indicated by "00 hour ΔΔ minute".

14. When the code string of the numeral includes a decimal point, 3 from the ending of “00.” indicated by “00.ΔΔ”.
11. The speech according to claim 10, wherein an accent nucleus is placed in the mora, and the accent nucleus position is shifted to the fourth mora from the end when the third mora from the end is either “long sound” or “consonant”. Synthesizer.

15. When the code string of proper nouns and numbers output from the kana-yomi conversion unit (3) includes a “ga” line sound, it is converted to a nasalized “ga” line sound. The speech synthesizer according to claim 1 or 2.

16. The vowel sequence is lengthened when the code sequence of proper nouns and numbers output from the kana-yomi conversion unit (3) includes the same vowel sequence. Or the speech synthesizer according to 2.

17. When a vowel continuation of “EI” is included in a code sequence of proper nouns and numbers output from the kana-yomi conversion unit (3), this vowel continuation is the same as the same vowel continuation of “EE”. The speech synthesizer according to claim 1 or 2, characterized in that the sound is made longer.

18. When the vowel continuation of “OU” is included in the code sequence of proper nouns and numbers output from the kana-yomi conversion unit (3), this vowel continuation is treated as the same vowel continuation of “OO”. The speech synthesizer according to claim 1 or 2, wherein the speech synthesizer produces a long sound.