JPS61269200A

JPS61269200A - Voice sythesization filter

Info

Publication number: JPS61269200A
Application number: JP60110263A
Authority: JP
Inventors: 安永　智
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1985-05-24
Filing date: 1985-05-24
Publication date: 1986-11-28

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は音声合成フィルタに関し、特に、線形予測法を
用いて音声信号から発声者の声道の状態を表わす予測係
数またはパーフール係数等のスペクトルパラメータ、声
道の駆動源の大きさを表わす振幅情報、駆動源の種類を
示す有声無声情報および有声音の声帯振動数を表わすピ
ッチ情報等を分析部で分析抽出し量子化した後に合成部
に送シ。[Detailed Description of the Invention] [Field of Industrial Application] The present invention relates to a speech synthesis filter, and in particular, the present invention relates to a speech synthesis filter that uses a linear prediction method to extract spectra such as prediction coefficients or perfour coefficients representing the state of the vocal tract of a speaker from a speech signal. Parameters, amplitude information representing the size of the driving source of the vocal tract, voiced/unvoiced information representing the type of driving source, pitch information representing the vocal cord vibration frequency of voiced sounds, etc. are analyzed and extracted by the analysis unit, quantized, and then sent to the synthesis unit. Send it.

合成部で前記情報に基づいて原音声を合成によって再現
させる場合に使用する音声合成フィルタに関する。The present invention relates to a speech synthesis filter used when a synthesis unit reproduces original speech by synthesis based on the information.

[Conventional technology]

従来、この種の音声合成フィルタでは、有声時には、ピ
ッチ周期で駆動され大きさが振幅情報に比例したインパ
ルスが駆動信号として入力され。Conventionally, in this type of speech synthesis filter, when voicing occurs, an impulse driven at a pitch period and whose magnitude is proportional to amplitude information is input as a drive signal.

無声時には白色雑音が駆動入力信号とされる。そして２
例えば分析部から送られた予測係数等（スペクトルパラ
メータ）をα１．α２．・・・・・・α、とし、前記駆
動入力信号をＳｎとして、１タイムクロツク前の出力信
号をＤｎ、、２タイムクロツク前の出力信号をＤ　　同
様にｐタイムクロック前の出力信号をｎ−２゜Ｄ　　とすれば、出力Ｄｎは線形予測法に従って。When there is no voice, white noise is used as the drive input signal. And 2
For example, the prediction coefficients etc. (spectral parameters) sent from the analysis section are α1. α2. ......α, the drive input signal is Sn, the output signal one time clock ago is Dn, the output signal two time clocks ago is D, and similarly the output signal p time clocks ago is n-2°. D, the output Dn follows the linear prediction method.

−ｐＤｎ＝Ｓｎ＋αＩ　Ｄｉ−１＋α２Ｄｎ−２＋・・・・
・・＋αｐＤｎ−ｐによって求められる。ここに、１タ
イムクロツクは分析部において音声分析したときのサン
プリング周期（例えば０．１２５ｍ５　）である。換言
すれば音声合成フィルタの出力信号を１タイムクａツク
の遅延時間を持つ遅延回路が縦続接続された回路に印加
し。-p Dn=Sn+αI Di-1+α2Dn-2+...
...+αpDn-p. Here, one time clock is a sampling period (for example, 0.125 m5) when audio is analyzed in the analysis section. In other words, the output signal of the speech synthesis filter is applied to a circuit in which delay circuits each having a delay time of one time clock are connected in cascade.

各段階の遅延回路の出力に前記スペクトルパラメータを
それぞれ乗じた信号を駆動信号に加算して出力させるよ
うに構成されている。スペクトルパラメータには予測係
数でなくパーコール係数が使用されることもある。It is configured to add a signal obtained by multiplying the output of the delay circuit of each stage by the spectrum parameter to the drive signal and output the resultant signal. Percoll coefficients are sometimes used instead of prediction coefficients for spectral parameters.

[Problem that the invention seeks to solve]

しかし、音声合成部に送られてくるスペクトルパラメー
タは通常量子化されているため２分析時にスペクトルパ
ラメータの系が安定でありても。However, since the spectral parameters sent to the speech synthesis section are usually quantized, even if the system of spectral parameters is stable during the second analysis.

合成部における逆量子化後のスペクトルパラメータの系
の安定性は保障されず、不安定になる事が１、　　ある
。このような場合に従来の音声合成フィルタ回路で前述
の合成を行うと、フィルタの共振特性が異常に鋭くなっ
て原音声を正確に再現することが困難になる場合がある
。また、著しいときけ発振現象をおこすという欠点があ
る。The stability of the spectral parameter system after dequantization in the synthesis section is not guaranteed, and it may become unstable. In such a case, if the above-described synthesis is performed using a conventional speech synthesis filter circuit, the resonance characteristics of the filter may become abnormally sharp, making it difficult to accurately reproduce the original speech. Furthermore, it has the disadvantage of causing a significant oscillation phenomenon.

本発明の目的は、上述のような、スペクトル情報に誤差
があるような場合においても、異常共振等を起こさず安
定した音声合成が行われ、従来のものに比べ、よシ原音
声波形に近い振幅特性の音声を出力することができる音
声合成フィルタを提供することにある。The purpose of the present invention is to perform stable speech synthesis without causing abnormal resonance etc. even in the case where there is an error in the spectrum information as described above, and to achieve a sound waveform that is closer to the original speech waveform than conventional methods. An object of the present invention is to provide a speech synthesis filter capable of outputting speech having amplitude characteristics.

[Means for solving problems]

本発明は、有声時にはピッチ周期でインパルスが、無声
時には白色雑音がそれぞれ駆動入力信号として入力され
、出力信号を遅延させた信号に予測係数、パーコール係
数等のスペクトルパラメータを乗じた信号を前記駆動入
力信号に加えて出力するととによって音声を合成出力す
る音声合成フィルタにおいて前記スペクトルパラメータ
の予測利得によって損失が制御される損失付加回路を備
え、予測利得が大きくなった時には該損失付加回路を介
して合成するようにしたことを特徴とする。In the present invention, an impulse with a pitch period is inputted as a driving input signal when voiced, and white noise is inputted when unvoiced as a driving input signal, and a signal obtained by multiplying a signal obtained by delaying an output signal by a spectral parameter such as a prediction coefficient or a Percoll coefficient is inputted to the driving input signal. A speech synthesis filter that synthesizes and outputs speech by adding and outputting a signal includes a loss adding circuit whose loss is controlled by the predicted gain of the spectral parameter, and when the predicted gain becomes large, synthesis is performed via the loss adding circuit. It is characterized by being made to do.

〔Example〕

次に２本発明の実施例について２図面を参照して詳細に
説明する。Next, two embodiments of the present invention will be described in detail with reference to two drawings.

先ず２本発明の原理について説明する。本発明は２合成
フィルタの共振特性が、フィルタ係数であるところのス
ペクトルパラメータの量子化によシネ安定になることを
考慮し、予め合成フィルタの予測利得を求める事によっ
て合成フィルタの共振特性を制御するようにしたもので
ある。そして。First, two principles of the present invention will be explained. The present invention takes into account that the resonance characteristics of two synthesis filters become stable due to quantization of the spectral parameters, which are filter coefficients, and controls the resonance characteristics of the synthesis filter by determining the predicted gain of the synthesis filter in advance. It was designed to do so. and.

損失パラメータａｌを用いて次式によって音声合成を行
う。すなわち、出力信号Ｄｎを次式によって求める。Speech synthesis is performed using the following equation using the loss parameter al. That is, the output signal Dn is determined by the following equation.

Ｄｎ＝Ｓｎ＋ａα、Ｄｎ−１＋ａ２α２Ｄｎ−２＋・・
・、、＋Ｂ　ＰαｐＤｎ−ｐ上記損失パラメータａは０
≦ａ≦１の変数であシ。Dn=Sn+aα, Dn-1+a2α2Dn-2+...
・,,+B PαpDn-p The above loss parameter a is 0
Must be a variable with ≦a≦1.

合成フィルタの予測利得によって制御される。そして、
予測利得が低い場合はａ’＝１として従来と同様の合成
を行うが、予測利得が高くなると、予測利得に応じてａ
を１以下として合成フィルタの損失を大きくすることに
よって、フィルタの共振特性の先鋭化を防ぐようにする
。予測利得に応じて損失パラメータａの値をいくらに設
定するかは。Controlled by the prediction gain of the synthesis filter. and,
When the prediction gain is low, the same synthesis as before is performed with a'=1, but when the prediction gain is high, a' is set according to the prediction gain.
By setting 1 or less to increase the loss of the synthesis filter, the resonance characteristics of the filter are prevented from becoming sharper. What value should be set for the loss parameter a depending on the predicted gain?

例えば高い予測利得に対してａ　＝　０．９とし、予測
利得が低くなるに従って１に近ずけるようにする等適宜
な函数とすることができるが、細部については実験的に
原音声に近い合成出力が得られるように定めればよい。For example, an appropriate function can be used, such as setting a = 0.9 for a high prediction gain and approaching 1 as the prediction gain decreases, but for details, it is experimentally possible to synthesize a signal close to the original speech. Just set it so that the output can be obtained.

第１図は２本発明の一実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of the present invention.

入力端子１に駆動信号Ｓｎを与え、加算回路２の出力信
号を遅延回路４．係数付加回路５および損失付加回路６
を介して再び加算回路２に与える。すなわち、出力信号
を一定時間遅延させて係数付加回路および損失付加回路
を介して駆動信号に加算して合成出力を得る。係数付加
回路５は遅延回路４の出力信号に前記スペクトルパラメ
ータ（分析部から例えばデジタル値で与えられる）を乗
する回路である。また、損失付加回路６は損失制御回路
７によって制御される前述の損失パラメータａｉを乗す
る回路である。予測利得算出回路８は入力端子９に入力
されたスペクトルパラメータよシ予測利得を算出する。A drive signal Sn is applied to the input terminal 1, and the output signal of the adder circuit 2 is sent to the delay circuit 4. Coefficient addition circuit 5 and loss addition circuit 6
It is again applied to the adder circuit 2 via the . That is, the output signal is delayed for a certain period of time and added to the drive signal via a coefficient addition circuit and a loss addition circuit to obtain a composite output. The coefficient addition circuit 5 is a circuit that multiplies the output signal of the delay circuit 4 by the spectrum parameter (given as a digital value, for example, from the analysis section). Further, the loss addition circuit 6 is a circuit that multiplies the aforementioned loss parameter ai controlled by the loss control circuit 7. The prediction gain calculation circuit 8 calculates a prediction gain based on the spectrum parameters input to the input terminal 9.

損失制御回路７は前記予測利得に応じて損失パラメータ
ａｉを定め損失付加回路６を制御する。以上の構成によ
シ、予測利得が高いときは損失付加回路６の損失が大と
なるため駆動信号に加算される信号が小となシ、フィル
タ全体の損失が大きくなるから異常共振を防ぐことが可
能である。The loss control circuit 7 determines the loss parameter ai according to the predicted gain and controls the loss adding circuit 6. With the above configuration, when the predicted gain is high, the loss of the loss addition circuit 6 becomes large, so the signal added to the drive signal becomes small, and the loss of the entire filter increases, so abnormal resonance can be prevented. is possible.

第２図は、第１図の実施例の具体的な回路構成の一例を
示した図であり、ｐ個の遅延回路を縦続接続した場合を
示す。すなわち、加算回路２の出力信号を遅延回路４３
２乗算器６３．遅延回路４２゜乗算器６２．・・・・・
・、遅延回路４１．および乗算器６１の縦続接続回路（
ｐ段）に与える。そして２乗算器６３の出力信号は乗算
器５３によって入力端子９３から入力したスペクトルパ
ラメータα１と乗算されて加算回路２に与えられる。乗
算器６２の出力信号は乗算器５２によって同様に端子９
２から入力するスペクトルパラメータα２と乗算されて
加算回路２に与えられる。同様に、最終段（ｐ番目）Ｉ
　　　の乗算器６１の出力信号は乗算器５１によってス
ペクトルパラメータα、と乗算されて加算回路２に与え
られる。FIG. 2 is a diagram showing an example of a specific circuit configuration of the embodiment shown in FIG. 1, and shows a case where p delay circuits are connected in cascade. That is, the output signal of the adder circuit 2 is sent to the delay circuit 43.
Square multiplier 63. Delay circuit 42° multiplier 62.・・・・・・
, delay circuit 41. and a cascade connection circuit of multiplier 61 (
p stage). Then, the output signal of the square multiplier 63 is multiplied by the spectrum parameter α1 input from the input terminal 93 by the multiplier 53, and the resultant signal is applied to the addition circuit 2. The output signal of multiplier 62 is also output to terminal 9 by multiplier 52.
2 is multiplied by the spectral parameter α2 inputted from 2 and is applied to the adder circuit 2. Similarly, the final stage (pth) I
The output signal of the multiplier 61 is multiplied by the spectrum parameter α by the multiplier 51, and the resultant signal is applied to the addition circuit 2.

加算回路２はこれらの信号を入力端子１から入力した駆
動信号に加算して出力し、出力端子３に合成された音声
合成出力を出す。乗算器６３　、６２　。The adder circuit 2 adds these signals to the drive signal input from the input terminal 1 and outputs the resultant signal, and outputs a synthesized speech output from the output terminal 3. Multipliers 63 , 62 .

・・・、６１は損失制御回路７によって制御され損失パ
ラメータａを乗算する回路である。すなわち、損失付加
回路６（第１図）を構成している。損失制御回路７は予
測利得算出回路８の出力に応じて損失パラメータａを定
め、各乗算器６３，６２．・・・、６１の乗算値を制御
する。ここで、予測利得算出回路８の入力としては、入
力端子９３，９２．・・・９１から入力されたスペクト
ルパラメータが用いられる。損失パラメータａは。. . , 61 is a circuit that is controlled by the loss control circuit 7 and multiplies the loss parameter a. That is, it constitutes a loss adding circuit 6 (FIG. 1). The loss control circuit 7 determines a loss parameter a according to the output of the predicted gain calculation circuit 8, and controls each multiplier 63, 62 . . . . controls the multiplication value of 61. Here, input terminals 93, 92 . ...91 is used. The loss parameter a is.

予測利得が低いときは１であシ、予測利得が高くなるに
従って１よシ小さい値に設定される。細部については実
験的に適宜定めてもよい。遅延回路の段数ｐはスペクト
ルパラメータαの次数だけ設けられる。When the prediction gain is low, it is set to 1, and as the prediction gain increases, it is set to a value smaller than 1. Details may be determined experimentally as appropriate. The number of stages p of the delay circuit is equal to the order of the spectrum parameter α.

以上の構成により、予測利得が高い場合には大きい損失
を与えた信号が駆動信号に加えられるから。With the above configuration, when the predicted gain is high, a signal with a large loss is added to the drive signal.

異常共振を防ぐことができる。また、損失パラメータａ
の値を適宜設定するととによシ、原音声信号に近い合成
音声を出力することが可能である。Abnormal resonance can be prevented. Also, the loss parameter a
By appropriately setting the value of , it is possible to output synthesized speech that is close to the original speech signal.

第３図は、従来のパーコール格子型の音声合成フィルタ
の各遅延回路４４．・・・、４５の出力側にそれぞれ乗
算器６４．・・・、６５を挿入した場合を示す。この場
合も乗算器６４．６５の乗算値ａは損失制御回路７によ
って制御される。入力端子９４、・・・、９５へはパー
コール係数に１．・・・、ｋｐが入力されている。この
場合も入力端子９４．・・・。FIG. 3 shows each delay circuit 44 of a conventional Percoll lattice type speech synthesis filter. . . , 45 are provided with multipliers 64 . ..., 65 is inserted. In this case as well, the multiplication value a of the multipliers 64 and 65 is controlled by the loss control circuit 7. The input terminals 94, . . . , 95 are supplied with a Percoll coefficient of 1. ..., kp is input. In this case as well, the input terminal 94. ....

９５から入力されるパーコール係数に１．・・・ｋ、よ
シ予測利得算出回路８において算出された予測利得に応
じて損失制御回路７で求められる損失パラメータａによ
って乗算値を制御し、損失を与えることは前述の実施例
と同様であシ、異常共振の防止等については前述と同様
の効果を奏する。1 to the Percoll coefficient input from 95. . . . The multiplication value is controlled by the loss parameter a determined by the loss control circuit 7 in accordance with the predicted gain calculated by the predicted gain calculation circuit 8, and the loss is given in the same manner as in the previous embodiment. The same effects as described above are achieved in terms of prevention of reeds and abnormal resonance.

〔Effect of the invention〕

以上のように９本発明においては、出力信号を遅延回路
を介して駆動入力信号に加算する場合に。As described above, in the present invention, when an output signal is added to a drive input signal via a delay circuit.

量子化後の予測利得に応じて損失を付加するように構成
したから、異常共振の発生を防ぎ安定な動作を行うこと
ができる。また、損失パラメータを適当に設定すること
によシ、原音声波形に近い品質のよい合成音声を出力す
ることが可能である。Since the loss is added in accordance with the predicted gain after quantization, it is possible to prevent abnormal resonance from occurring and perform stable operation. Furthermore, by appropriately setting the loss parameters, it is possible to output synthesized speech with good quality that is close to the original speech waveform.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示すブロック図。第２図はｐ個の遅延回路が縦続接続された場合に適用し
た実施例を示すブロック図、第３図は本発明をパーコー
ル格子型フィルタに適用した実施例を示すブロック図で
ある。図において、２．２１〜２４・・・加算回路、４゜４１
〜４５・・・遅延回路、５・・・係数付加回路、６・・
・損失付加回路、７・・・損失制御回路、５１〜５７゜
６１〜６５・・・乗算器、８・・・予測利得算出回路。FIG. 1 is a block diagram showing one embodiment of the present invention. FIG. 2 is a block diagram showing an embodiment in which p delay circuits are connected in cascade, and FIG. 3 is a block diagram showing an embodiment in which the present invention is applied to a Percoll grid filter. In the figure, 2.21 to 24...addition circuit, 4°41
~45...Delay circuit, 5...Coefficient addition circuit, 6...
- Loss addition circuit, 7... Loss control circuit, 51-57° 61-65... Multiplier, 8... Predicted gain calculation circuit.

Claims

[Claims]

1. When voiced, an impulse with a pitch period is input, and when unvoiced, white noise is input as a driving input signal, and a signal obtained by multiplying a signal obtained by delaying the output signal by a spectral parameter such as a prediction coefficient or a Percoll coefficient is used as the driving input signal. 1. A speech synthesis filter that synthesizes and outputs speech by additionally outputting it, comprising a loss adding circuit whose loss is controlled by a predicted gain of the spectral parameter.