JPH03174597A - Voice synthesizer - Google Patents

Voice synthesizer

Info

Publication number
JPH03174597A
Authority
JP
Japan
Prior art keywords
amplitude
fluctuation
voice
natural
fluctuations
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1314750A
Other languages
Japanese (ja)
Inventor
Nobuhide Yamazaki
山崎 信英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd
Priority to JP1314750A
Publication of JPH03174597A

Abstract

PURPOSE: To generate a more natural synthesized voice by multiplying a synthesized voice signal by the output of a fluctuation time-series memory in a multiplier, thereby adding amplitude fluctuations to the synthesized voice.

CONSTITUTION: Amplitude fluctuations obtained from a natural voice by an amplitude fluctuation extracting means 1 are stored in advance in the fluctuation time-series memory 2a. For example, a steadily uttered vowel is used as the natural voice; the amplitude fluctuation extracting means 1 finds the maximum amplitude value in each pitch period of the natural voice and divides these maxima by their mean value for normalization to obtain the amplitude fluctuations. The voice signal output by a synthesizing filter 4 is multiplied by the amplitude fluctuations from a fluctuation generating circuit 2 to obtain a voice signal with amplitude fluctuations. Consequently, the amplitude fluctuations of the natural voice are reflected faithfully and a more natural synthesized voice is obtained.

Description

DETAILED DESCRIPTION OF THE INVENTION

Technical Field

The present invention relates to a speech synthesis device and, more particularly, to the configuration of a device for imparting naturalness to synthesized speech in a rule-based speech synthesis device.

Prior Art

Conventional rule-based speech synthesis devices control the amplitude of synthesized speech according to predetermined rules. For example, there is a method that obtains an amplitude pattern in the same way that a pitch pattern is obtained from prosodic information.

Because human speech is produced by a living body, it contains various fluctuations, one of which is amplitude fluctuation. It has therefore been conventional practice to synthesize an approximation of this amplitude fluctuation and superimpose it on the amplitude pattern. Various methods have been proposed for this approximate synthesis, for example:

(a) Japanese Examined Patent Publication No. 62-49639 uses the output of a normal random number generating means as the fluctuation sequence (a method that adds fluctuation data to the speech signal).
(b) Japanese Unexamined Patent Publication No. 63-229499 obtains a fluctuation sequence by applying an integrating filter to the output of a random number generating means (a method that multiplies the speech signal by the fluctuation sequence plus a constant).
(c) Japanese Unexamined Patent Publication No. 58-186800 gives 1/f fluctuation to the amplitude pattern (how the fluctuation is applied is not described at all).
(d) Japanese Unexamined Patent Publication No. 55-133099 adds fluctuation obtained from an amplifier to the reference voltage of a D/A converter.

In prior art (a), the output of a random sequence oscillator is converted into a normal random number sequence and used as the amplitude fluctuation. In (b), the amplitude fluctuation is obtained by passing a random sequence through an integrator. In (c), 1/f fluctuation is approximated by passing a random sequence through a quantizer that has a table of 1/f characteristics. In (d), the output fluctuation of an amplifier with a very high gain is used. However, because all of these conventional methods only approximate amplitude fluctuation, some unnaturalness remained.

Object

The present invention was made in view of the circumstances described above. In particular, it aims to obtain more natural synthesized speech by reflecting the amplitude fluctuations of a living body.

Constitution

To achieve the above object, the present invention provides a rule-based speech synthesis device that determines the amplitude of synthesized speech based on fixed rules or stored data, characterized in that it has a fluctuation time-series memory and a multiplier, and imparts amplitude fluctuation to the synthesized speech by multiplying the synthesized speech signal by the output of the fluctuation time-series memory in the multiplier. The invention is described below on the basis of embodiments.

FIG. 3 is a circuit configuration diagram for explaining an example of the amplitude fluctuation generation circuit used in the present invention. In the figure, 1 is an amplitude fluctuation extraction circuit and 2 is an amplitude fluctuation generation circuit. The amplitude fluctuation generation circuit 2 has a fluctuation time-series memory 2a and an address counter 2b, and the fluctuation time-series memory 2a stores the fluctuation digitally. The fluctuation time-series memory 2a is addressed, for example, by a D-bit address counter 2b. The address counter 2b is incremented once per pitch period by a pitch period signal. In another embodiment, the address counter may instead be driven by a clock oscillator that is controlled by the pitch period signal.
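The memory and counter arrangement of FIG. 3 can be illustrated with a short Python sketch. It is only a software analogy under stated assumptions: the class name `FluctuationGenerator`, the method `on_pitch_period`, and the wrap-around of the counter after 2**D entries are not taken from the specification, and the table values are made up for the example.

```python
class FluctuationGenerator:
    """Software analogy of circuit 2: memory 2a read through counter 2b."""

    def __init__(self, fluctuation_table, address_bits):
        size = 1 << address_bits  # a D-bit counter addresses 2**D entries
        if len(fluctuation_table) != size:
            raise ValueError("table must hold exactly 2**address_bits entries")
        self.memory = list(fluctuation_table)  # fluctuation time-series memory 2a
        self.mask = size - 1
        self.address = 0                       # address counter 2b

    def on_pitch_period(self):
        """Called once per pitch period signal: read, then step the counter."""
        value = self.memory[self.address]
        # Assumption: the counter wraps to 0 after the last address.
        self.address = (self.address + 1) & self.mask
        return value


# Hypothetical 2-bit example with made-up fluctuation values.
gen = FluctuationGenerator([1.0, 1.1, 0.9, 1.0], address_bits=2)
values = [gen.on_pitch_period() for _ in range(6)]
```

Stepping the counter on the pitch period signal, rather than on the sample clock, keeps each stored value aligned with one pitch period of the synthesized speech.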

The amplitude fluctuation obtained from natural speech by the amplitude fluctuation extraction means 1 is stored in advance in the fluctuation time-series memory 2a. For example, a steadily sustained vowel is used as the natural speech. The amplitude fluctuation extraction means 1 finds the maximum amplitude value in each pitch period of the natural speech, and these maxima, normalized by dividing them by their mean value, are taken as the amplitude fluctuation.
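The extraction procedure described above can be sketched in Python as follows. This is a minimal illustration rather than the patent's circuit: the function name `extract_amplitude_fluctuation`, the list-based signal, and the `pitch_marks` argument (sample indices of the pitch period boundaries, assumed to be known already) are all assumptions, as is the use of the absolute value when taking the per-period maximum.

```python
def extract_amplitude_fluctuation(samples, pitch_marks):
    """Return one normalized peak value per pitch period.

    samples     -- natural-speech samples of a steadily sustained vowel
    pitch_marks -- hypothetical input: increasing sample indices of the
                   pitch period boundaries (period k spans marks k..k+1)
    """
    if len(pitch_marks) < 2:
        raise ValueError("need at least two pitch marks to form one period")

    # Maximum amplitude within each pitch period
    # (taking the absolute value is an assumption).
    peaks = [
        max(abs(x) for x in samples[start:end])
        for start, end in zip(pitch_marks, pitch_marks[1:])
    ]

    # Normalize by the mean of the peaks so the sequence averages to 1.
    mean_peak = sum(peaks) / len(peaks)
    if mean_peak == 0:
        raise ValueError("signal is silent; cannot normalize")
    return [p / mean_peak for p in peaks]


# Toy signal with two pitch periods of four samples each.
fluctuation = extract_amplitude_fluctuation([0, 1, 0, -1, 0, 3, 0, -2], [0, 4, 8])
```

Because the peaks are divided by their mean, the stored sequence averages to 1, so multiplying a signal by it changes the level from period to period without shifting the overall level.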

FIGS. 1 and 2 are block diagrams for explaining embodiments of the speech synthesis device according to the present invention. In the figures, 2 is the amplitude fluctuation generation circuit, 3 is a sound source generation unit, 4 is a synthesis filter, and 5 is a multiplier. In the embodiment shown in FIG. 1, the speech signal output from the synthesis filter 4 is multiplied by the amplitude fluctuation from the fluctuation generation circuit 2 in the multiplier 5, yielding a speech signal with amplitude fluctuation. As another embodiment, as shown in FIG. 2, the same effect is obtained by applying the amplitude fluctuation to the sound source signal instead. Likewise, in a waveform-editing type speech synthesis device, the same effect is obtained by combining the synthesized speech waveform with the amplitude fluctuation in a multiplier.
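The multiplication step of FIG. 1 can be sketched in Python as below. The sketch rests on assumptions that are not in the specification: the function name `apply_amplitude_fluctuation`, the representation of the signal as a list of samples, the `pitch_marks` list of pitch period boundaries, and the choice to hold one stored value constant over a whole pitch period and to wrap around at the end of the memory.

```python
def apply_amplitude_fluctuation(signal, pitch_marks, fluctuation_memory):
    """Multiply each pitch period of `signal` by the next stored value.

    signal             -- samples from the synthesis filter (FIG. 1 case)
    pitch_marks        -- hypothetical: sample indices of period boundaries
    fluctuation_memory -- normalized fluctuation values, one per pitch period
    """
    out = list(signal)  # samples outside the marked periods pass unchanged
    address = 0
    for start, end in zip(pitch_marks, pitch_marks[1:]):
        gain = fluctuation_memory[address]
        for i in range(start, end):
            out[i] = signal[i] * gain  # the multiplication itself
        # Step once per pitch period; wrapping is an assumption.
        address = (address + 1) % len(fluctuation_memory)
    return out


# Three pitch periods of two samples each, two stored fluctuation values.
shaped = apply_amplitude_fluctuation([1.0] * 6, [0, 2, 4, 6], [0.5, 2.0])
```

Passing the sound source signal instead of the filter output corresponds to the arrangement of FIG. 2, and passing a concatenated waveform corresponds to the waveform-editing case.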

It is also possible to store a plurality of different fluctuation sequences in the fluctuation time-series memory and to switch between them, for example for each phoneme being synthesized, so as to give more natural fluctuation.
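One way to picture the per-phoneme switching is a lookup keyed by phoneme, sketched here in Python. Everything concrete in it is hypothetical: the function name `fluctuation_for`, the dictionary layout, the phoneme labels, the fallback rule for phonemes without their own sequence, and the numeric values.

```python
def fluctuation_for(phoneme, period_index, banks, fallback):
    """Pick the fluctuation value for one pitch period of a given phoneme.

    banks    -- hypothetical mapping: phoneme label -> stored fluctuation sequence
    fallback -- label of the sequence used when a phoneme has no entry
                (this fallback rule is an assumption)
    """
    sequence = banks.get(phoneme, banks[fallback])
    # Wrap around within the selected sequence (also an assumption).
    return sequence[period_index % len(sequence)]


# Made-up sequences for two vowels.
banks = {"a": [1.0, 1.1], "i": [0.9, 1.0]}
value_a = fluctuation_for("a", 3, banks, fallback="a")
value_k = fluctuation_for("k", 0, banks, fallback="a")
```

Keeping the sequences in one mapping mirrors the idea of a single memory holding several fluctuation series, with the phoneme selecting which series is read.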

Effect

As is clear from the above description, the fluctuation generation circuit of the speech synthesis device according to the present invention can faithfully reflect the amplitude fluctuations contained in natural speech, so that more natural-sounding synthesized speech can be obtained.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1 and 2 are block diagrams for explaining embodiments of the present invention, and FIG. 3 is a block diagram for explaining an example of the amplitude fluctuation generation circuit used to implement the present invention.

1: amplitude fluctuation extraction circuit; 2: amplitude fluctuation generation circuit; 2a: fluctuation time-series memory; 2b: address counter; 3: sound source generation unit; 4: synthesis filter; 5: multiplier.

Claims (1)

1. A rule-based speech synthesis device that determines the amplitude of synthesized speech based on fixed rules or stored data, the device comprising a fluctuation time-series memory and a multiplier, characterized in that the synthesized speech signal and the output of the fluctuation time-series memory are multiplied together by the multiplier to impart amplitude fluctuation to the synthesized speech.
JP1314750A 1989-12-04 1989-12-04 Voice synthesizer Pending JPH03174597A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1314750A JPH03174597A (en) 1989-12-04 1989-12-04 Voice synthesizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1314750A JPH03174597A (en) 1989-12-04 1989-12-04 Voice synthesizer

Publications (1)

Publication Number Publication Date
JPH03174597A 1991-07-29

Family

ID=18057135

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1314750A Pending JPH03174597A (en) 1989-12-04 1989-12-04 Voice synthesizer

Country Status (1)

Country Link
JP (1) JPH03174597A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8898062B2 (en) 2007-02-19 2014-11-25 Panasonic Intellectual Property Corporation Of America Strained-rough-voice conversion device, voice conversion device, voice synthesis device, voice conversion method, voice synthesis method, and program

Similar Documents

Publication Publication Date Title
EP0805433A3 (en) Method and system of runtime acoustic unit selection for speech synthesis
JP3287230B2 (en) Chorus effect imparting device
US5381514A (en) Speech synthesizer and method for synthesizing speech for superposing and adding a waveform onto a waveform obtained by delaying a previously obtained waveform
JP2564641B2 (en) Speech synthesizer
US20010029454A1 (en) Speech synthesizing method and apparatus
JPH03174597A (en) Voice synthesizer
JP3197975B2 (en) Pitch control method and device
US4905562A (en) Method for deriving and replicating complex musical tones
JP2005539261A (en) Method for controlling time width in speech synthesis
US4984496A (en) Apparatus for deriving and replicating complex musical tones
JP3130305B2 (en) Speech synthesizer
JP3394281B2 (en) Speech synthesis method and rule synthesizer
JP2573586B2 (en) Rule-based speech synthesizer
JPS587197A (en) Singing voice generator
JPS5880699A (en) Voice synthesizing system
JPS58168097A (en) Voice synthesizer
JP2573585B2 (en) Speech spectrum pattern generator
JP3284634B2 (en) Rule speech synthesizer
JP2573587B2 (en) Pitch pattern generator
JP2586040B2 (en) Voice editing and synthesis device
JP3967571B2 (en) Sound source waveform generation device, speech synthesizer, sound source waveform generation method and program
Woodward The synthesis of music and speech
JPH0695696A (en) Speech synthesis system
JPS6032720Y2 (en) speech synthesizer
JPH0553595A (en) Speech synthesizing device