JPH0679235B2

JPH0679235B2 - Speech synthesizer

Info

Publication number: JPH0679235B2
Application number: JP62248381A
Authority: JP
Inventors: 延佳海木
Original assignee: Sharp Corp
Current assignee: Sharp Corp
Priority date: 1987-09-30
Filing date: 1987-09-30
Publication date: 1994-10-05
Anticipated expiration: 2009-10-05
Also published as: JPS6490500A

Description

[Detailed Description of the Invention] (Field of Industrial Application) The present invention relates to a speech synthesizer that synthesizes speech, and in particular to a speech synthesizer having a rule synthesis unit that connects two adjacent phonemes in a phoneme sequence, based on a rule in a set of phoneme connection rules defining how two phonemes are connected, and thereby generates speech synthesis parameters.

(Prior Art) FIG. 4 shows the configuration of one example of a conventional speech synthesizer of the above type. This speech synthesizer consists of an input unit 1 to which an arbitrary character string or musical notation string (hereinafter abbreviated as "character string or the like") is input, a rule synthesis unit 102 that generates speech synthesis parameters for the input character string or the like, and a synthesis unit 3 that outputs rule-synthesized speech based on those speech synthesis parameters.

The rule synthesis unit 102 is described below.

The character string or the like input from the input unit 1 is parsed by the character string analysis unit 21. In this parsing, the words in the character string or the like are first identified with reference to the word dictionary 22. This determines the phoneme symbol string corresponding to the character string or the like, and accent information is attached to those phonemes in the phoneme symbol string that correspond to the accent positions of the words. Further parsing determines the intonation of the sentence or of the text as a whole. The phoneme symbol string is input to the speech synthesis parameter generation unit 123, where it is processed for speech synthesis.

The processing in the speech synthesis parameter generation unit 123 is described with reference to the flowchart of FIG. 5. The speech synthesis parameter generation unit 123 first reads in the phoneme symbol string generated by the character string analysis unit 21 (step). The phoneme feature parameters that characterize each phoneme of the input phoneme symbol string are extracted from the feature parameter file 24 (step). A phoneme connection rule for connecting each pair of adjacent phonemes in the phoneme symbol string is extracted from the rule file 125 (step). Each rule contains the control structure and control information for connecting the phoneme feature parameters of the two phonemes. Applying these rules generates, for each pair, the phoneme connection feature parameters that connect the two adjacent phonemes (step). Speech synthesis parameters are then generated from the phoneme feature parameters and phoneme connection feature parameters thus obtained, together with the accent and intonation information obtained earlier by the character string analysis unit 21 (step).

(Problems to Be Solved by the Invention) In the conventional speech synthesizer described above, a phoneme connection rule had to be created for every pair of adjacent phonemes that can occur, and all of these rules had to be held in the rule file 125. Phoneme connection rules fall broadly into three types: (i) rules for connecting a vowel to a vowel, (ii) rules for connecting a vowel to a preceding consonant, and (iii) rules for connecting a consonant to a preceding vowel (that is, for making the connection between syllables).

Holding so many rules required a rule file with a very large storage capacity. The rule file therefore had the drawback of taking up considerable space and cost.

The present invention has been made in view of the above problem, and its object is to provide a speech synthesizer that can store phoneme connection rules in a small space and at low cost.

(Means for Solving the Problems) The speech synthesizer of the present invention has a rule synthesis unit that connects two adjacent phonemes in a phoneme sequence, based on a rule in a set of phoneme connection rules defining how two phonemes are connected, and generates speech synthesis parameters. The rule synthesis unit comprises: means for determining whether the set contains a phoneme connection rule for connecting, to a given phoneme in the phoneme sequence (the first phoneme), the phoneme immediately following it (the second phoneme); and means for connecting the second phoneme to the first phoneme, when the determining means finds no rule for connecting the second phoneme to the first phoneme, by applying in the temporally reverse direction the rule in the set for connecting the first phoneme to the second phoneme. The above object is thereby achieved.

FIG. 3 shows the speech waveform of the natural utterance /ama/ by a male speaker and the formant trajectories obtained by analyzing that waveform. The analysis was carried out as follows. The speech signal was sampled at 10 kHz, a Hamming window was applied with a frame length of 20 ms, and pre-emphasis with a coefficient of 0.98 was applied. Linear prediction analysis was performed by the 12th-order autocorrelation method, the roots of the polynomial equation whose coefficients are the resulting linear prediction coefficients were found, and from those roots the formant trajectories were obtained and plotted with McCandless's formant trajectory estimation algorithm ("An Algorithm for Automatic Formant Extraction Using Linear Prediction Spectra", IEEE ASSP, Vol. 22, No. 2, April 1974).
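As a rough illustration of the analysis settings described above (10 kHz sampling, 20 ms Hamming-windowed frames, pre-emphasis of 0.98, 12th-order autocorrelation LPC, root solving), the following Python sketch computes formant candidates for a single frame. It is not the McCandless trajectory algorithm itself: it stops at the raw pole frequencies and bandwidths. The function names `lpc_autocorr` and `formant_candidates`, the use of NumPy, and the choice to pre-emphasize before windowing are assumptions made for the example and do not come from the patent.

```python
import numpy as np

def lpc_autocorr(frame, order):
    """LPC coefficients a[0..order] (a[0] = 1) via the autocorrelation
    method, solved with the Levinson-Durbin recursion."""
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for stage i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        a_prev = a.copy()
        a[1:i] = a_prev[1:i] + k * a_prev[i - 1:0:-1]
        a[i] = k
        err *= 1.0 - k * k
    return a

def formant_candidates(signal, fs=10_000, frame_ms=20, order=12, preemph=0.98):
    """Return (frequencies_hz, bandwidths_hz) of the LPC poles of one frame.

    Assumption: pre-emphasis is applied before the Hamming window.
    """
    frame_len = int(fs * frame_ms / 1000)
    frame = np.asarray(signal[:frame_len], dtype=float)
    # Pre-emphasis: y[n] = x[n] - preemph * x[n-1]
    frame = np.append(frame[0], frame[1:] - preemph * frame[:-1])
    frame = frame * np.hamming(len(frame))
    a = lpc_autocorr(frame, order)
    # Roots of the prediction polynomial; keep one of each conjugate pair.
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]
    freqs = np.angle(roots) * fs / (2 * np.pi)
    bws = -np.log(np.abs(roots)) * fs / np.pi
    idx = np.argsort(freqs)
    return freqs[idx], bws[idx]
```

Calling `formant_candidates(samples)` on successive 20 ms frames gives per-frame candidates; assigning those candidates to formant slots and smoothing them over time is the part that a trajectory algorithm such as McCandless's would handle.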

The formant transition from the vowel /a/ into the nasalized portion of the consonant /m/ and the transition from the nasalized portion of /m/ into the vowel /a/ differ in detail but can be regarded as broadly symmetric. The same phenomenon is seen when /m/ is combined with other vowels, and when consonants such as /s, n, h, r, w/ are combined with vowels. For these combinations it is therefore enough to prepare only the phoneme connection rule for connecting a vowel to the consonant; the connection of the consonant to a vowel can be made by applying that rule in the temporally reverse direction, with little loss in the quality of the synthesized speech. The same holds when partial autocorrelation (PARCOR) coefficients, linear prediction coefficients, line spectrum pair (LSP) parameters, vocal tract area ratios, or the like are used as feature parameters in place of formants. In an actual synthesis of the above utterance /ama/ with formants as the feature parameters, the transition from the nasalized portion of /m/ to /a/ was reversed in time and used for the transition from /a/ to the nasalized portion of /m/, and fairly clear, natural synthesized speech was obtained.
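The time-reversal idea can be sketched in a few lines of Python. Here a hypothetical consonant-to-vowel rule produces a formant trajectory from /m/ to /a/, and the /a/ to /m/ transition is obtained simply by reversing the frame order. The formant targets, the square-root interpolation shape, and the function names are illustrative assumptions; the patent does not specify the form of its rules.

```python
import numpy as np

def transition(start, end, n_frames):
    """Hypothetical rule: move formant targets from `start` to `end`
    along a square-root curve (fast at first, then slower)."""
    start = np.asarray(start, dtype=float)
    end = np.asarray(end, dtype=float)
    w = np.linspace(0.0, 1.0, n_frames)[:, None] ** 0.5
    return (1.0 - w) * start + w * end   # shape: (n_frames, n_formants)

def reverse_in_time(trajectory):
    """Apply a stored transition backwards by reversing the frame order."""
    return trajectory[::-1].copy()

# Illustrative formant targets in Hz (F1, F2, F3); not measured values.
M_NASAL = [250.0, 1000.0, 2200.0]
A_VOWEL = [750.0, 1200.0, 2500.0]

m_to_a = transition(M_NASAL, A_VOWEL, n_frames=5)   # stored rule: /m/ -> /a/
a_to_m = reverse_in_time(m_to_a)                    # derived: /a/ -> /m/
```

Because the interpolation curve is asymmetric, `a_to_m` differs from what a separately written /a/ to /m/ rule with the same curve would produce; the reversal reuses the stored shape mirrored in time, which is the approximation the text argues is acceptable.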

(Embodiment) The present invention is described below by way of an embodiment.

FIG. 1 is a block diagram of one embodiment of the speech synthesizer of the present invention. Parts shared with the conventional speech synthesizer of FIG. 4 carry the same reference numerals. The speech synthesizer of this embodiment is basically similar in configuration to the conventional speech synthesizer of FIG. 4: it has an input unit 1 to which a character string or the like is input, a rule synthesis unit 2 that generates speech synthesis parameters for the character string or the like, and a synthesis unit 3 that outputs rule-synthesized speech based on those speech synthesis parameters.

The character string analysis unit 21 of the rule synthesis unit 2 parses the character string or the like sent from the input unit 1, converts it into a phoneme symbol string, and also extracts accent and intonation information. The word dictionary 22 is consulted during parsing. The phoneme symbol string and the accent and intonation information are sent to the speech synthesis parameter generation unit 23.

The speech synthesis parameter generation unit 23 comprises: feature parameter extraction means 230, which extracts from the feature parameter file 24 the phoneme feature parameters characterizing each phoneme of the phoneme symbol string received from the character string analysis unit 21; phoneme connection means 231, which applies the corresponding phoneme connection rule in the rule file 25 to two adjacent phonemes in the phoneme symbol string and thereby generates the phoneme connection feature parameters connecting them; and synthesis parameter generation means 235, which generates speech synthesis parameters from these feature parameters. The phoneme connection means 231 in turn comprises: determination means 232, which determines whether the rule file 25 contains a phoneme connection rule for the pair of phonemes to be connected (written "phoneme A-phoneme B"); forward phoneme connection means 233, which, when the determination means 232 finds that the rule is present, extracts it from the rule file 25 and applies it to generate the phoneme connection feature parameters connecting phoneme B to phoneme A; and reverse phoneme connection means 234, which, when the determination means 232 finds that the rule is absent, extracts from the rule file 25 the rule for connecting phoneme A to phoneme B and applies it in the temporally reverse direction to connect phoneme B to phoneme A.

Formant frequencies and formant bandwidths are used as the phoneme feature parameters held in the feature parameter file 24, but partial autocorrelation (PARCOR) coefficients, linear prediction coefficients, line spectrum pair (LSP) parameters, vocal tract area ratios, or the like may be used instead.

The rule file 25 holds phoneme connection rules for connecting two adjacent phonemes in a phoneme symbol string, but it does not hold a rule for every phoneme pair. Rules are provided for all vowel-vowel and consonant-vowel pairs. For vowel-consonant pairs, however, no rule is provided when the initial phoneme of the consonant is /s, n, h, m, r, w/. In addition, for vowel-consonant pairs in which the consonant is one of the voiceless plosives /p, t, k/, there is silence immediately before the consonant begins, so the vowel-silence rule is used regardless of which consonant follows. As a result, the storage capacity of the rule file 25 is considerably smaller than that of the rule file 125 of the conventional speech synthesizer.
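To get a feel for the size of the saving, the rule counts can be compared for a small inventory. The patent gives no phoneme inventory or rule counts, so the 5 vowels and 14 consonants below are purely hypothetical, and the sketch assumes one rule per ordered phoneme pair plus one vowel-silence rule per vowel.

```python
def rule_counts(vowels, consonants, mirrored, silence_like):
    """Return (full, reduced) rule counts for a given phoneme inventory.

    full:    vowel-vowel + consonant-vowel + vowel-consonant rules.
    reduced: vowel-consonant rules are dropped for `mirrored` consonants
             (served by the reversed consonant-vowel rule) and for
             `silence_like` consonants (served by vowel-silence rules).
    """
    v, c = len(vowels), len(consonants)
    full = v * v + c * v + v * c
    kept_vc = [x for x in consonants
               if x not in mirrored and x not in silence_like]
    # Assumption: one vowel-silence rule per vowel is added once.
    vowel_silence = v if silence_like else 0
    reduced = v * v + c * v + v * len(kept_vc) + vowel_silence
    return full, reduced

# Hypothetical inventory, for illustration only.
VOWELS = ["a", "i", "u", "e", "o"]
CONSONANTS = ["k", "s", "t", "n", "h", "m", "y",
              "r", "w", "g", "z", "d", "b", "p"]
MIRRORED = {"s", "n", "h", "m", "r", "w"}   # consonants named in the text
SILENCE_LIKE = {"p", "t", "k"}              # voiceless plosives

print(rule_counts(VOWELS, CONSONANTS, MIRRORED, SILENCE_LIKE))  # (165, 125)
```

Under these assumptions roughly a quarter of the rules disappear; the actual saving depends on the real inventory and on how much storage each rule needs.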

The processing in the speech synthesis parameter generation unit 23 is described with reference to the flowchart of FIG. 2. In terms of FIG. 1, steps ~ represent the processing by the phoneme connection means 231; of these, steps ~ correspond to the determination means 232, steps ~ to the reverse phoneme connection means 234, and step to the forward phoneme connection means 233.

(1) The phoneme symbol string generated by the character string analysis unit 21 is read in (step), and the phoneme feature parameters corresponding to each phoneme in the string are extracted from the feature parameter file 24 (step).

(2) A pair of two adjacent phonemes is cut out starting from one end of the phoneme symbol string (step). For the purposes of explanation, this pair is written phoneme A-phoneme B.

(3) In steps ~, it is determined whether the rule file 25 contains a phoneme connection rule for phoneme A-phoneme B. If phoneme A-phoneme B is a vowel-consonant pair ("Yes" in step) and the rule file contains no rule for that pair ("No" in step), processing proceeds to step. If phoneme A-phoneme B is not a vowel-consonant pair ("No" in step), or is a vowel-consonant pair for which a rule exists ("Yes" in step), processing proceeds to step.

(4) If there is no phoneme connection rule for phoneme A-phoneme B, the pair is swapped to give phoneme B-phoneme A (step); the rule for phoneme B-phoneme A is extracted from the rule file 25 and used to generate the phoneme connection feature parameters connecting phoneme A to phoneme B (step); and the generated parameters are rearranged into reverse order along the time axis to give the phoneme connection feature parameters connecting phoneme B to phoneme A (step).

(5) If the rule file contains a phoneme connection rule for phoneme A-phoneme B, that rule is extracted from the rule file 25 and used to generate the phoneme connection feature parameters connecting phoneme B to phoneme A (step).

(6) If not all pairs of adjacent phonemes in the phoneme symbol string have been connected ("No" in step), the next pair of adjacent phonemes is cut out (step) and processing returns to step. Once all adjacent pairs have been connected ("Yes" in step), speech synthesis parameters are generated from the phoneme feature parameters and phoneme connection feature parameters obtained above, together with the accent and intonation information obtained by the character string analysis unit 21 (step).
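The pair-by-pair procedure in (2) to (6) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: a rule is modelled as a Python function that maps the feature parameters of two phonemes to a list of transition frames, the rule file is a dictionary keyed by phoneme pair, and the names `connect_pair`, `connect_all`, `ease_rule`, `FEATURES` and `RULES` as well as the numeric values are hypothetical. The vowel-silence handling for /p, t, k/ and the final generation of synthesis parameters are left out.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Params = Tuple[float, ...]                       # feature parameters of one frame
Rule = Callable[[Params, Params], List[Params]]  # (first, second) -> transition frames

VOWELS = frozenset("aiueo")  # assumed vowel inventory

def connect_pair(a: str, b: str,
                 features: Dict[str, Params],
                 rules: Dict[Tuple[str, str], Rule]) -> List[Params]:
    """Connect phoneme b to phoneme a, reversing the b-a rule if needed."""
    is_vowel_consonant = a in VOWELS and b not in VOWELS
    if is_vowel_consonant and (a, b) not in rules:
        # Reverse branch: apply the rule for B-A, then flip the frame order.
        frames = rules[(b, a)](features[b], features[a])
        return frames[::-1]
    # Forward branch: the rule for A-B is applied as stored.
    return rules[(a, b)](features[a], features[b])

def connect_all(phonemes: Sequence[str],
                features: Dict[str, Params],
                rules: Dict[Tuple[str, str], Rule]) -> List[List[Params]]:
    """Walk the phoneme string and connect every adjacent pair."""
    return [connect_pair(a, b, features, rules)
            for a, b in zip(phonemes, phonemes[1:])]

def ease_rule(n_frames: int) -> Rule:
    """Hypothetical rule shape: quadratic easing between two targets."""
    def rule(src: Params, dst: Params) -> List[Params]:
        out = []
        for i in range(n_frames):
            w = (i / (n_frames - 1)) ** 2
            out.append(tuple((1 - w) * s + w * d for s, d in zip(src, dst)))
        return out
    return rule

# Illustrative data: only the consonant-vowel rule m-a is stored.
FEATURES = {"a": (750.0, 1200.0), "m": (250.0, 1000.0)}
RULES = {("m", "a"): ease_rule(4)}

transitions = connect_all(["a", "m", "a"], FEATURES, RULES)
```

For the string /ama/, the first transition (a-m) is produced by the reverse branch and the second (m-a) by the forward branch, so `transitions[0]` is `transitions[1]` with its frames in reverse order.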

In the embodiment described above, the rule file provides the rules for connecting a vowel to a consonant and omits the rules for connecting a consonant to a vowel. Other configurations are also possible, such as the converse, in which the rules for connecting a consonant to a vowel are provided and the rules for connecting a vowel to a consonant are omitted.

(Effects of the Invention) The present invention keeps the storage capacity needed to hold the phoneme connection rules small without degrading the quality of the synthesized speech, and thus provides a speech synthesizer with reduced storage space and cost.

[Brief description of drawings]

FIG. 1 is a block diagram showing the configuration of one embodiment of the speech synthesizer of the present invention; FIG. 2 is a flowchart showing an example of the processing procedure in the speech synthesis parameter generation unit of that speech synthesizer; FIG. 3 is a graph showing an example of the speech waveform of the natural utterance /ama/ and an example of its analysis results; FIG. 4 is a block diagram showing the configuration of one example of a conventional speech synthesizer; and FIG. 5 is a flowchart showing an example of the processing procedure in the speech synthesis parameter generation unit of the conventional speech synthesizer.

1: input unit, 2: rule synthesis unit, 3: synthesis unit, 21: character string analysis unit, 22: word dictionary, 23: speech synthesis parameter generation unit, 24: feature parameter file, 25: rule file, 230: feature parameter extraction means, 231: phoneme connection means, 232: determination means, 233: forward phoneme connection means, 234: reverse phoneme connection means, 235: synthesis parameter generation means.

Claims

1. A speech synthesizer having a rule synthesis unit that connects two adjacent phonemes in a phoneme sequence, based on a rule in a set of phoneme connection rules defining how two phonemes are connected, and generates speech synthesis parameters, wherein the rule synthesis unit comprises: means for determining whether the set contains a phoneme connection rule for connecting, to a given phoneme in the phoneme sequence (first phoneme), the phoneme immediately following it (second phoneme); and means for connecting the second phoneme to the first phoneme, when the determining means determines that no phoneme connection rule for connecting the second phoneme to the first phoneme exists, by applying in the temporally reverse direction the phoneme connection rule existing in the set for connecting the first phoneme to the second phoneme.