JPH01237700A

JPH01237700A - Basic frequency pattern forming device

Info

Publication number: JPH01237700A
Application number: JP6607988A
Authority: JP
Inventors: Masanori Miyatake; 正典宮武; Yoshinori Kosaka; 勾坂　芳典
Original assignee: A T R JIDO HONYAKU DENWA KENKYUSHO KK
Current assignee: A T R JIDO HONYAKU DENWA KENKYUSHO KK
Priority date: 1988-03-18
Filing date: 1988-03-18
Publication date: 1989-09-22

Abstract

PURPOSE:To eliminate the influence by the fluctuation in pronounced speeches by providing a parameter forming means which can control the rise and fall of a basic frequency pattern and the amplitude thereof and a basic frequency forming means which forms the basic frequency signals in the vowel central part of respective moras in accordance with a prescribed mathematical expression. CONSTITUTION:A parameter control section 11 forms the parameter signal which can control the rise and fall of the basic frequency pattern and the amplitude thereof. A basic frequency calculating section 12 receives the parameter signal and forms the basic frequency signals of the vowel central parts for each of the moras in accordance with the prescribed mathematical expression. Since the basic frequency signals are time-normalized with the respective moras and are, therefore, not affected by the fluctuation in the pronounced speeches. A basic frequency interpolation section 13 receives the discrete basic frequency signals and makes interpolation between the respective basic frequency signals. The frequency pattern forming device which can control the rise and fall of the basic frequency pattern and the amplitude thereof and is not affected by the fluctuation in the pronounced speeches is thereby obtd.

Description

【発明の詳細な説明】［産業上の利用分野］この発明は、音声合成に使用される基本周波数パターン
生成装置に関し、特に、合成音声の自然性を改善するた
めの基本周波数パターン生成装置に関する。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a fundamental frequency pattern generation device used for speech synthesis, and particularly to a fundamental frequency pattern generation device for improving the naturalness of synthesized speech.

［従来の技術および発明が解決しようとする課題］従来
、日本語の普通発声における基本周波数パターンを記述
する代表的なモデルとして、点ピッチ・パターン・モデ
ルが挙げられる。これは各母音中心の基本周波数値だけ
に限定された離散的モデルである。[Prior Art and Problems to be Solved by the Invention] Conventionally, a point pitch pattern model has been cited as a typical model for describing the fundamental frequency pattern in normal Japanese speech. This is a discrete model limited only to the fundamental frequency value of each vowel center.

第２図は、−例として、５モーラ３型アクセントの単語
を示す点ピッチ・パターンを示すグラフである。このグ
ラフは、たとえば、１９７３年１１月の電子通信学会論
文誌別冊（Ｔｒａｎｓ、ＩＥＣＥ　’７Ｂ／１１　　Ｖ
ｏｌ、５６−Ｄ　　Ｎｏ。FIG. 2 is a graph showing, by way of example, a dot pitch pattern showing a word with a 5-mora type 3 accent. This graph, for example, was published in the November 1973 IEICE Transactions Special Issue (Trans, IECE '7B/11 V
ol, 56-D No.

１１）に記載されているようなものである。11).

第２図を参照して、横軸の１ないし５はこの単語の各モ
ーラの位置を示し、縦軸は周波数（Ｈｚ）を示す。各点
（１０１）ないしく１０５）は各ノードの各母音中心に
おける基本周波数の値を示す。Referring to FIG. 2, 1 to 5 on the horizontal axis indicate the position of each mora of this word, and the vertical axis indicates frequency (Hz). Each point (101) to 105) indicates the value of the fundamental frequency at each vowel center of each node.

このような点ピッチ・パターンは、単語の内容によらず
、モーラ長およびアクセント型に依存してほぼ一定の形
状を示す。すなわち、５モーラ３型アクセントの単語は
、一般に、第２図に示されるような点ピッチ争パターン
を有する。このモデルでは、各母音中心における基本周
波数の値は（１０１）ないしく１０５）の５点しか得ら
れないが、これらを線形補間することによって、このモ
デルを使用して良好な合成音声を得ることができる。Such a dot pitch pattern exhibits a substantially constant shape depending on the mora length and accent type, regardless of the content of the word. That is, words with a 5-mora type 3 accent generally have a dot-pitch conflict pattern as shown in FIG. In this model, only 5 fundamental frequency values (101) to 105) are obtained at the center of each vowel, but by linearly interpolating these values, it is possible to obtain good synthesized speech using this model. Can be done.

基本周波数パターンは、モーラ長やアクセント型によっ
て異なるため、それぞれに対応した種々の基本周波数パ
ターンが必要とされ、そのためのモデル化も提案されて
いる。Since the fundamental frequency pattern differs depending on the mora length and accent type, various fundamental frequency patterns corresponding to each are required, and modeling for this purpose has also been proposed.

しかし、このような従来のモデルは、普通に発声された
音声に対するものであり、基本周波数パターンの振幅な
どを制御することができなかった。However, such conventional models are for normally uttered voices, and cannot control the amplitude of the fundamental frequency pattern.

実際のところ、点ピッチ・パターンは、発声速度の変動
の影響をほとんど受けないが、発声の高さだけではなく
、発声の大きさによっても大きな影響を受けることが知
られている。したがって、より自然な音声を合成するた
めに、基本周波数パターンの上昇、下降および振幅など
を統一的に、かつ、数理的に制御することのできるモデ
ルが望まれている。In fact, it is known that the dot pitch pattern is hardly affected by variations in vocalization rate, but is significantly affected not only by the pitch of the vocalization but also by the loudness of the vocalization. Therefore, in order to synthesize more natural speech, a model is desired that can uniformly and mathematically control the rise, fall, amplitude, etc. of the fundamental frequency pattern.

このような基本周波数パターンを数理的に制御するもの
として、臨界制動２次系のモデルが挙げられる。これは
、アクセント指令時点および振幅の大きさなどの指定に
よりパターン全体を生成する連続的モデルである。これ
により、基本周波数パターンを精度好く近似することが
でき、パラメータを用いた定量的な扱いがより容易とな
る。An example of a model for mathematically controlling such a fundamental frequency pattern is a critical damping second-order system model. This is a continuous model that generates the entire pattern based on specifications such as the accent command time point and amplitude size. Thereby, the fundamental frequency pattern can be approximated with high accuracy, and quantitative handling using parameters becomes easier.

近似式は、たとえば、「日本語単語アクセントの基本周
波数パターンとその生成機構のモデル」と題された、藤
崎ほかによる、日本音響学会誌（２７巻９号４４５ない
し４５３頁、１９７１年）に、以下のように示されてい
る。The approximation formula can be found, for example, in the Journal of the Acoustical Society of Japan (Vol. 27, No. 9, pp. 445-453, 1971) by Fujisaki et al. entitled "Model of fundamental frequency patterns of Japanese word accents and their generation mechanism". It is shown below.

ＰＯ（ｔ）　−ＦＩＩｉｎ−ｅｘｐ［Ｇｖ（ｔ−ｔｏ）
−Ｇｖ（ｔ−ｔ３）＋Ｇａ（ｔ−ｔｌ）−Ｇａ（ｔ−ｔ
２）−ｇ２］　　　　　　　　・”式（１）Ｇｙ（ｔ）
−Ａｖαｔ　−ｅｘｐ　（−αｔ）　　　　　−式（２
）Ｇａ（ｉ）　−Ａ　ａ［１−（１＋βｔ）　・ｅｘｐ
（−β１）］−・・式（３）ここで、ＦＯ（ｔ）は対数
基本周波数パターンであり、Ｆｍ１ｎは発振可能な最低
周波数、時間ｔｏおよびｔ３はそれぞれ話調指令の始点
および終点、時間ｔ１およびｔ２はそれぞれアクセント
指令の始点および終点、ｇ２は発振停止に相当する入力
信号の値である。また、Ａｖは話調成分の応答の大きさ
を示す振幅パラメータ、Ａａはアクセント成分の応答の
大きさを示す振幅パラメータ、αおよびβはそれぞれの
成分の応答速度を示す時間成分である。PO(t) −FIIin-exp[Gv(t-to)
-Gv(t-t3)+Ga(t-tl)-Ga(t-t
2)-g2] ・”Formula (1) Gy(t)
−Avαt −exp (−αt) −Formula (2
)Ga(i) -A a[1-(1+βt) ・exp
(-β1)] - Equation (3) where FO(t) is the logarithmic fundamental frequency pattern, Fm1n is the lowest frequency that can oscillate, time to and t3 are the start and end points of the tone command, respectively, and the time t1 and t2 are the start and end points of the accent command, respectively, and g2 is the value of the input signal corresponding to stopping oscillation. Further, Av is an amplitude parameter indicating the magnitude of the response of the tone component, Aa is an amplitude parameter indicating the magnitude of the response of the accent component, and α and β are time components indicating the response speed of each component.

第３図は、式（１）ないしく３）によって基本周波数パ
ターンを得るための、基本周波数のコントロール機構を
示すブロック図である。このブロック図は前述の文献に
記載されている。FIG. 3 is a block diagram showing a fundamental frequency control mechanism for obtaining a fundamental frequency pattern using equations (1) to 3). This block diagram is described in the aforementioned document.

第３図を参照して、話調成分発生部６は話調指令信号Ｓ
９を受けて話調成分Ｇｖ　（ｔ）を出力する。話調指令
信号Ｓ９は、時間ｔｏで立上がり、時間ｔ３で立下がる
、単位パルスである。一方、アクセント成分発生部７は
、アクセント指令信号ＳＩＯを受けて、アクセント成分
Ｇａ　（ｔ）を出力する。アクセント指令信号ＳＩＯは
時間ｔ１で立上がり、時間ｔ２で立下がる、単位パルス
である。（ここで、ｔｏ＜ｔｌ＜ｔ２＜ｔ３の関係があ
る）これら話調成分発生部６およびアクセント成分発生
部７はともに前述の式（１）ないしく３）に従って動作
する。Referring to FIG. 3, the tone component generator 6 generates a tone command signal S.
9 and outputs tone component Gv (t). The speech tone command signal S9 is a unit pulse that rises at time to and falls at time t3. On the other hand, the accent component generating section 7 receives the accent command signal SIO and outputs the accent component Ga (t). The accent command signal SIO is a unit pulse that rises at time t1 and falls at time t2. (Here, there is a relationship of to<tl<t2<t3.) Both tone component generating section 6 and accent component generating section 7 operate according to the above-mentioned equations (1) to 3).

基本周波数パターン合成部８は、基本周波数の話調成分
Ｇｖ　（ｔ）およびアクセント成分Ｇａ（１）を加え、
連続したモデルである基本周波数パターンＦＯ（ｔ）を
合成する。The fundamental frequency pattern synthesis unit 8 adds the tone component Gv(t) and the accent component Ga(1) of the fundamental frequency,
A fundamental frequency pattern FO(t), which is a continuous model, is synthesized.

前述の式（１）ないしく３）の各パラメータを決定する
ために、Ａｎａｌｙｓｉｓ　　ｂｙ　　５ｙｎｔｈｅｓ
Ｌｓ法（以下、ＡｂＳ法と略称する）と呼ばれる方法が
使用される。これは、前述の各パラメータに適当な初期
値を与え、これらのパラメータを徐々に変化させること
によって得られる基本周波数パターンを、実際に観測さ
れた基本周波数パターンと繰返し比較することにより、
最適なパラメータを推定するものである。In order to determine each parameter of the above equations (1) to 3), Analysis by 5ynthes
A method called the Ls method (hereinafter abbreviated as AbS method) is used. This is done by giving appropriate initial values to each of the parameters mentioned above and repeatedly comparing the fundamental frequency pattern obtained by gradually changing these parameters with the actually observed fundamental frequency pattern.
This is to estimate the optimal parameters.

このようにして、第３図に示されたような方法により、
精度の高い基本周波数パターンを得ることができる。In this way, by the method shown in FIG.
A highly accurate fundamental frequency pattern can be obtained.

しかし、連続的なパターンを対象としているため、Ａｂ
Ｓ法の繰返し演算に非常に多くの時間を要するという課
題、および、実際に観測される基本周波数の局所的な変
動（たとえば、単語の中に無声音が含まれる場合など）
がパラメータ推定に悪影響を及ぼすという課題がある。However, since the target is a continuous pattern, Ab
The problem is that iterative calculations of the S-method take a very long time, and the local fluctuations in the fundamental frequency that are actually observed (for example, when a word contains unvoiced sounds)
There is a problem in that this has a negative effect on parameter estimation.

また、従来ではアクセント指令などのタイミングは母音
の開始点から一定時間後とされているので、発声速度の
全体的または部分的な変動がパラメータ推定に悪影響を
及ぼすという課題がある。Furthermore, conventionally, the timing of accent commands and the like is set to be a certain period of time after the start point of a vowel, so there is a problem in that overall or partial variations in speech rate have a negative effect on parameter estimation.

この発明は、上記のような課題を解決するためになされ
たもので、基本周波数パターンの上昇および下降ならび
に振幅を統一的に、かつ、高精度に制御することが可能
で、発声音声の変動による影響を受けない周波数パター
ン生成装置を得ることを目的とする。This invention was made to solve the above-mentioned problems, and it is possible to uniformly and precisely control the rise and fall of the fundamental frequency pattern as well as the amplitude, and it is possible to control the rise and fall of the fundamental frequency pattern as well as the amplitude with high precision. The purpose of this invention is to obtain a frequency pattern generation device that is not affected.

［課題を解決するための手段］この発明に係る基本周波数パターン生成装置は、基本周
波数パターンの上昇および下降ならびに振幅を制御する
ためのパラメータ信号を生成するパラメータ生成手段と
、パラメータ信号を受け、所定の数理式に基づき、各モ
ーラについて時間正規化された、母音中心部の基本周波
数信号を生成する基本周波数生成手段と、基本周波数信
号を受け、各基本周波数信号間を補間する補間手段とを
含む。[Means for Solving the Problems] A fundamental frequency pattern generation device according to the present invention includes a parameter generation device that generates a parameter signal for controlling the rise and fall and amplitude of a fundamental frequency pattern, and a parameter generation device that receives the parameter signal and generates a predetermined a fundamental frequency generating means for generating a fundamental frequency signal at the center of a vowel that is time-normalized for each mora based on a mathematical formula; and an interpolating means for receiving the fundamental frequency signal and interpolating between the fundamental frequency signals. .

［作用］この発明における基本周波数パターン生成装置は、パラ
メータ生成手段が、基本周波数パターンの上昇および下
降ならびに振幅を制御することのできるパラメータ信号
を生成する。基本周波数生成手段は、このパラメータ信
号を受け、所定の数理式に基づいて各モーラごとに母音
中心部の基本周波数信号を生成する。この基本周波数信
号は各モーラについて時間正規化されているので、発声
音声の変動による影響を受けない。補間手段はこの離散
的な基本周波数信号を受けて、各基本周波数信号間を補
間する。[Operation] In the basic frequency pattern generation device according to the present invention, the parameter generation means generates a parameter signal that can control the rise and fall and amplitude of the basic frequency pattern. The fundamental frequency generating means receives this parameter signal and generates a fundamental frequency signal of the vowel center for each mora based on a predetermined mathematical formula. Since this fundamental frequency signal is time-normalized for each mora, it is not affected by fluctuations in the uttered voice. The interpolation means receives the discrete fundamental frequency signals and interpolates between the respective fundamental frequency signals.

［発明の実施例］第１図は、この発明の一実施例を示す、基本周波数パタ
ーン生成装置およびそれを利用した音声合成装置の概略
を示すブロック図である。[Embodiment of the Invention] FIG. 1 is a block diagram schematically showing a fundamental frequency pattern generation device and a speech synthesis device using the same, showing an embodiment of the present invention.

第１図を参照して、基本周波数パターン生成装置１は、
基本周波数パターン制御情報信号Ｓ１を受けてパラメー
タセット信号Ｓ２を出力するパラメータ制御部１１と、
信号Ｓ２を受けて各モーラごとに母音中心部の基本周波
数値の点ピッチ・パターン信号Ｓ３を出力する基本周波
数算定部１２と、信号Ｓ３を受けてこれを補間し基本周
波数パターン信号Ｓ４を出力する基本周波数パターン補
間部１３とを含む。Referring to FIG. 1, the basic frequency pattern generation device 1 includes:
a parameter control unit 11 that receives a basic frequency pattern control information signal S1 and outputs a parameter set signal S2;
A fundamental frequency calculation unit 12 receives the signal S2 and outputs a point pitch pattern signal S3 of the fundamental frequency value of the vowel center for each mora, and receives the signal S3 and interpolates it to output a fundamental frequency pattern signal S4. A basic frequency pattern interpolation unit 13 is included.

この音声合成装置は、基本周波数パターン生成装置１か
ら基本周波数パターン信号Ｓ４を、また、パワーパター
ン発声部２、音韻継続長決定部３およびスペクトルパタ
ーン生成部４から、それぞれ信号Ｓ５、Ｓ６およびＳ７
を受けるように接続された合成音声生成部５を含む。合
成音声生成部５から合成された音声信号Ｓ８が出力され
る。This speech synthesis device receives the fundamental frequency pattern signal S4 from the fundamental frequency pattern generation device 1, and the signals S5, S6, and S7 from the power pattern generation section 2, phoneme duration determination section 3, and spectral pattern generation section 4, respectively.
It includes a synthesized speech generating section 5 connected to receive the synthesized speech. A synthesized speech signal S8 is output from the synthesized speech generation section 5.

次に、基本周波数パターン生成装置１の動作について説
明する。Next, the operation of the basic frequency pattern generation device 1 will be explained.

パラメータ制御部１１は、外部から制御情報信号Ｓ１を
受ける。制御情報信号Ｓ１は、たとえば、基本周波数パ
ターンが生成される単語の、モーラおよび音節の数、平
均周波数などを含む。パラメータ制御部１１は、これら
の制御情報を用いて基本周波数パターン生成のためのパ
ラメータセット信号Ｓ２を出力する。このパラメータセ
ット信号Ｓ２には、基本周波数パターンの上昇および下
降ならびに振幅を制御することができるパラメータが含
まれている。The parameter control unit 11 receives a control information signal S1 from the outside. The control information signal S1 includes, for example, the mora, the number of syllables, the average frequency, etc. of the word for which the fundamental frequency pattern is generated. The parameter control unit 11 uses these control information to output a parameter set signal S2 for generating a fundamental frequency pattern. This parameter set signal S2 includes parameters that can control the rise and fall as well as the amplitude of the fundamental frequency pattern.

ここで基本周波数値を求めるのに用いられる数理式とし
て、たとえば、前述の臨界制動２次系の応答を用いたも
のや、多項式展開を用いたものなど、種々考えられるが
、ここでは−例として、以下に臨界制動２次系の応答を
利用したものについて説明する。Various mathematical formulas can be used to find the fundamental frequency value, such as one using the response of the critical damping quadratic system mentioned above, or one using polynomial expansion. Hereinafter, a method using the response of the critical braking secondary system will be explained.

これは、前述の式（１）ないしく３）によるもので、た
とえば、５モーラ３型アクセントを有する単語の場合、
アクセント成分の終了指令をｔ２−３（モーラ）とする
。一般に、ｎモーラｉ型アクセントの単語の場合、ｔ　
２＝　ｉ　（モーラ）（但し、ｉ−〇のときはｔ２−ｎ
）とすることにより、種々のアクセント型に対応するこ
とができる。その他のパラメータについても同様に決定
される。This is based on the above-mentioned formulas (1) to 3). For example, in the case of a word with a 5-mora 3-type accent,
The end command of the accent component is assumed to be t2-3 (mora). Generally, for words with n-mora i-type accent, t
2= i (mora) (however, when i-〇, t2-n
), it is possible to correspond to various accent types. Other parameters are similarly determined.

基本周波数算定部１２はパラメータセット信号Ｓ２を受
けて、各モーラごとに母音中心部における基本周波数値
を求め、点ピッチ・パターンを生成する。The fundamental frequency calculating section 12 receives the parameter set signal S2, calculates the fundamental frequency value at the center of the vowel for each mora, and generates a point pitch pattern.

基本周波数パターン補間部１３は、点ピッチ・パターン
信号Ｓ３を受け、各点ピッチ・パターン間を線形補間し
て基本周波数パターン全体の形状を決定し、基本周波数
パターン信号Ｓ４を出力する。The basic frequency pattern interpolation unit 13 receives the point pitch pattern signal S3, performs linear interpolation between the point pitch patterns, determines the shape of the entire basic frequency pattern, and outputs the basic frequency pattern signal S4.

合成音声生成部５は、基本周波数パターン信号Ｓ４の他
に、パワーパターン発生部２からのパワー・パターン信
号Ｓ５、音韻継続長決定部３からの音韻継続長信号Ｓ６
、およびスペクトルパターン生成部４からのスペクトル
・パターン信号Ｓ７を受け、合成音を生成し合成音声信
号Ｓ８を出力する。In addition to the fundamental frequency pattern signal S4, the synthesized speech generation section 5 receives a power pattern signal S5 from the power pattern generation section 2 and a phoneme duration signal S6 from the phoneme duration determination section 3.
, and the spectral pattern signal S7 from the spectral pattern generation section 4, generates a synthesized sound, and outputs a synthesized speech signal S8.

これらパワー・パターン信号Ｓ５、音韻継続長信号Ｓ６
、およびスペクトル・パターン信号Ｓ７の生成方法また
は合成方法については、従来より多くの手法があるが、
いずれをも用いることができる。These power pattern signal S5, phoneme duration signal S6
There are many conventional methods for generating or synthesizing the spectral pattern signal S7.
Either can be used.

なお、基本周波数算定部１２において使用される数理式
や、基本周波数パターン補間部１３において行なわれる
線形補間などについて、前述の説明はあくまでも一例で
あって、これによってこの発明の及ぶ範囲が何ら制限さ
れるものではない。Note that the above description of the mathematical formulas used in the fundamental frequency calculation section 12, the linear interpolation performed in the fundamental frequency pattern interpolation section 13, etc. is merely an example, and the scope of the present invention is not limited thereby. It's not something you can do.

したがって、この発明によると、各モーラにおける母音
中心部分の基本周波数によって得られる基本周波数パタ
ーンを用いているので、局所的な変動を除去した安定な
制御パラメータを得ることができ、かつ、モーラ長によ
って時間軸を正規化しているので発声速度の変動の影響
を除去することができる。加えて、基本周波数パターン
の上昇および下降ならびに振幅を制御することのできる
パラメータを含む数理式により制御しているので、モー
ラ長およびアクセント型だけではなく様々な口調に対応
できる精度の高い定量的な制御を行なうことが可能とな
り、より自然な合成音声を得ることができる。Therefore, according to the present invention, since the fundamental frequency pattern obtained by the fundamental frequency of the vowel center part in each mora is used, it is possible to obtain stable control parameters with local fluctuations removed. Since the time axis is normalized, the influence of variations in speaking speed can be removed. In addition, it is controlled by a mathematical formula that includes parameters that can control the rise and fall of the fundamental frequency pattern as well as the amplitude, so it is possible to use highly accurate quantitative expressions that can accommodate not only mora length and accent type, but also a variety of tones. It becomes possible to perform control and obtain more natural synthesized speech.

さらに、制御パラメータを決定するときに扱うデータ量
は、従来の連続的モデルと比較して非常に少なくて済む
ため、〜これらのパラメータを短時間で推定することが
できる。また、複数の基本周波数パターンをＡｂＳ法を
用いて同時に分析することも可能で、これにより、基本
周波数パターンの音声ごとまたは単語ごとのばらつきを
除去することも容易となる。Furthermore, since the amount of data handled when determining control parameters is very small compared to conventional continuous models, these parameters can be estimated in a short time. It is also possible to simultaneously analyze a plurality of fundamental frequency patterns using the AbS method, thereby making it easy to remove variations in the fundamental frequency patterns from voice to voice or from word to word.

［発明の効果］以上のように、この発明によれば、基本周波数パターン
の上昇および下降ならびに振幅を制御できるパラメータ
生成手段と、所定の数理式に基づいて各モーラの母音中
心部の基本周波数信号を生成する基本周波数生成手段と
を含むので、基本周波数パターンの上昇および下降なら
びに振幅を統一的に、かつ、高精度に制御することが可
能で、分析時の発声音声の変動による影響を受けない周
波数パターン生成装置がもたらされる。[Effects of the Invention] As described above, according to the present invention, there is provided a parameter generation means that can control the rise and fall as well as the amplitude of a fundamental frequency pattern, and a fundamental frequency signal of the vowel center of each mora based on a predetermined mathematical formula. Since the basic frequency pattern includes a fundamental frequency generating means that generates a fundamental frequency pattern, it is possible to uniformly and highly accurately control the rise and fall of the fundamental frequency pattern as well as the amplitude, and it is not affected by fluctuations in vocalizations during analysis. A frequency pattern generation device is provided.

[Brief explanation of the drawing]

第１図は、この発明の一実施例を示す、基本周波数パタ
ーン生成装置およびそれを利用した音声合成装置の概略
を示すブロック図である。第２図は、従来の点ピッチ・
パターン・モデルを示すグラフである。第３図は、従来
の基本周波数のコントロール機構を示すブロック図であ
ある。図において、１は基本周波数パターン生成装置、２はパ
ワーパターン発生部、３は音韻継続長決定部、４はスペ
クトルパターン生成部、５は合成音声生成部、６は話調
成分発生部、７はアクセント成分発生部、８は合成部、
１゛１はパラメータ制御部、１２は基本周波数算定部、
１３は基本周波数パターン補間部を示す。特許出願人　株式会社エイ・ティー・アール第２図＋　　　２　　３　　４　　５（モー２イｆｒ、ｙ、）FIG. 1 is a block diagram schematically showing a fundamental frequency pattern generation device and a speech synthesis device using the same, showing an embodiment of the present invention. Figure 2 shows the conventional point pitch
It is a graph showing a pattern model. FIG. 3 is a block diagram showing a conventional fundamental frequency control mechanism. In the figure, 1 is a fundamental frequency pattern generation device, 2 is a power pattern generation section, 3 is a phoneme duration determination section, 4 is a spectral pattern generation section, 5 is a synthesized speech generation section, 6 is a tone component generation section, and 7 is a speech tone component generation section. Accent component generation part, 8 is synthesis part,
1.1 is a parameter control section, 12 is a fundamental frequency calculation section,
Reference numeral 13 indicates a basic frequency pattern interpolation section. Patent applicant A.T.R. Co., Ltd. Figure 2 + 2 3 4 5 (Mo2i fr, y,)

Claims

[Scope of Claims] A fundamental frequency pattern generation device for generating a fundamental frequency pattern for speech synthesis, comprising parameter generation means for generating a parameter signal for controlling the rise and fall and amplitude of the fundamental frequency pattern; Discrete fundamental frequency generating means receives a parameter signal and generates a discrete fundamental frequency signal at the center of a vowel that is time-normalized for each mora based on a predetermined mathematical formula; , and interpolation means for interpolating between each fundamental frequency signal.