JP2560277B2 - Speech synthesis method - Google Patents

Speech synthesis method

Info

Publication number
JP2560277B2
Authority
JP
Japan
Prior art keywords
waveform
representative
segment
envelope
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
JP60235724A
Other languages
Japanese (ja)
Other versions
JPS6294900A (en)
Inventor
順子 栗林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Electric Co Ltd
Priority to JP60235724A
Publication of JPS6294900A
Application granted
Publication of JP2560277B2
Anticipated expiration
Expired - Lifetime

Description

[Detailed Description of the Invention]

[Field of Industrial Application]
The present invention relates to a speech synthesis method.

[Prior Art]
Speech synthesis methods are broadly divided into parametric methods and waveform coding methods.

Typical examples of the former are the formant, PARCOR, and LPC methods. Methods that use such vocal tract parameters have the advantage of a high data compression ratio, but the sound quality varies greatly with the conditions of the analysis process, and the synthesized speech lacks naturalness.

The latter includes the DM, ADM, and ADPCM methods. These methods yield high-quality synthesized sound, but compared with parametric methods they require a large amount of data to synthesize one second of speech. For this reason, they are combined with the waveform segment synthesis method to compress the amount of data.

A speech signal is a complex waveform containing various frequency components, but in its vowel portions similar waveforms appear repeatedly at every fundamental period. The waveform segment synthesis method exploits this property: one of the repeatedly appearing similar waveforms (hereinafter, segments) is extracted as a representative (hereinafter, the representative segment), and the original run of similar waveforms is reproduced by repeating that segment. It is therefore unnecessary to store the data of the entire waveform, and the amount of data can be reduced.
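The repetition step described above can be sketched as follows. This is a minimal illustration, not the patent's implementation; the function name `repeat_segments`, the `(samples, repeat_count)` data layout, and the toy sample values are all assumptions made for the example.

```python
def repeat_segments(segments):
    """Rebuild a waveform by repeating each representative segment.

    `segments` is a list of (samples, repeat_count) pairs. Both the name
    and the layout are hypothetical; the patent only states that a
    representative segment is stored once and repeated.
    """
    out = []
    for samples, repeat_count in segments:
        for _ in range(repeat_count):
            out.extend(samples)
    return out


# Toy data: segment a repeated 3 times, segment b repeated 2 times,
# mirroring the arrangement described for FIG. 2.
seg_a = [0, 4, -2, 0]
seg_b = [0, 6, -4, 0]
wave = repeat_segments([(seg_a, 3), (seg_b, 2)])
```

Only two short segments and two repeat counts are stored here, while the output holds five periods, which is the data reduction the text refers to.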

In an actual speech signal, the envelope of the amplitude over successive fundamental periods changes continuously. In the waveform segment synthesis method, however, the representative segment is used repeatedly, so the envelope stays constant over the repeated portion. This is shown in FIG. 2.

FIG. 2 is a waveform diagram in which the representative segment shown in range a (hereinafter, representative segment a) is repeated three times and the representative segment shown in range b (hereinafter, representative segment b) is repeated twice. In this waveform, the envelope does not change during the repetitions of representative segment a or of representative segment b.

For this reason, a method has been adopted in which an envelope change value ΔE, which indicates how much the amplitude increases or decreases relative to the preceding repetition, is held separately from the representative segment data and applied to both the positive-side and negative-side waveform data each time the representative segment is repeated.

The envelope change value ΔE is applied by multiplying the amplitude values of the representative segment by envelope data E, which is computed with ΔE as its parameter.

That is, for the first repetition the amplitude values of the representative segment are always multiplied by envelope data E = 1, so they remain unchanged. For the second repetition they are multiplied by E = 1 + ΔE. For the third and subsequent repetitions they are likewise multiplied by E = 1 + 2ΔE (third waveform), E = 1 + 3ΔE (fourth waveform), and so on.
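The conventional scheme, with one ΔE per representative segment, might look like the sketch below. The function name `apply_single_envelope` and the sample values are assumptions for illustration; only the gain sequence E = 1, 1 + ΔE, 1 + 2ΔE, ... comes from the text.

```python
def apply_single_envelope(samples, repeat_count, delta_e):
    """Repeat one representative segment with a linearly changing gain.

    Repetition k (counting from 0) is scaled by E = 1 + k * delta_e,
    and the same gain is applied to positive and negative samples.
    """
    out = []
    for k in range(repeat_count):
        gain = 1 + k * delta_e
        out.extend(s * gain for s in samples)
    return out


# Three repetitions with a hypothetical delta_e of 0.5.
conv = apply_single_envelope([0, 4, -2, 0], 3, 0.5)
```

With these values the positive peak grows 4, 6, 8 and the negative peak grows -2, -3, -4, so both sides are forced to change at the same relative rate.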

FIG. 3 shows the waveform of FIG. 2 after envelope change values have been applied. Specifically, FIG. 3 shows the case in which an envelope change value ΔEa is applied while representative segment a is repeated three times and an envelope change value ΔEb is applied while representative segment b is repeated twice.

One way to calculate the envelope change value is to derive it from the difference in maximum amplitude between a representative segment and the next representative segment. Equation (1) gives the envelope change value ΔE in terms of that difference.
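A form of equation (1) that is consistent with the symbol definitions given for it and with the gain sequence E = 1, 1 + ΔE, 1 + 2ΔE, ... is shown here. It is a reconstruction under the assumption that the ramp should reach the next segment's maximum after RPn steps, not a copy of the printed formula.

```latex
\Delta E = \frac{M_{n+1} - M_n}{M_n \cdot RP_n} \tag{1}
```

Under this assumption, a hypothetical (RPn + 1)-th repetition would have gain 1 + RPn·ΔE = Mn+1 / Mn, so the first repetition of the next representative segment continues the same linear ramp.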

RPn: number of repetitions of the representative segment
Mn: maximum amplitude value (absolute value) of the representative segment
Mn+1: maximum amplitude value (absolute value) of the next representative segment

Conventionally, the waveform segment synthesis method has been used mainly for sound effects, melodies, and similar signals that consist of sine waves, cosine waves, rectangular waves, and the like, and whose positive and negative amplitudes are nearly symmetric, and a single envelope change value ΔE was given to each representative segment.

[Problems to Be Solved by the Invention]

In the conventional waveform segment synthesis method described above, a single envelope change value ΔE is given to each representative segment. When the positive and negative amplitudes of the representative segment are asymmetric, as in a speech signal, applying that single ΔE to both the positive-side and the negative-side waveform data produces an abrupt change in amplitude between the last repetition of one segment and the first repetition of the next. This distortion produces a rough, unpleasant sound.

That is, as shown in FIG. 3, when the envelope change value ΔE of representative segment a is applied to both the positive-side and the negative-side waveform data, an abrupt change in amplitude occurs between the third repetition of representative segment a and the first repetition of the next representative segment b, and an unpleasant sound is produced.

An object of the present invention is to provide a speech synthesis method that yields speech with little distortion.

[Means for Solving the Problems]

The speech synthesis method of the present invention uses a waveform segment synthesis method in which envelope information concerning the increase or decrease of the amplitude value of waveform data is assigned to a plurality of representative segments, each consisting of a repeated similar waveform. The rate of increase or decrease between the positive-side (and negative-side) maximum value of a first representative segment of the input speech and the positive-side (and negative-side) maximum value of the following second representative segment is obtained, and this rate divided by the number of repetitions of the first representative segment is used as the envelope information. At synthesis time, the envelope information so obtained is given separately to the positive-side waveform data and the negative-side waveform data.

[Embodiment]

Next, an embodiment of the present invention will be described with reference to the drawings.

Equations (2) and (3) below are examples of expressions for obtaining, respectively, the envelope change value ΔEPn applied to the positive-side waveform data and the envelope change value ΔEMn applied to the negative-side waveform data.
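Forms of equations (2) and (3) that agree with the claim wording (the rate of increase or decrease between successive maxima, divided by the number of repetitions) and with the symbols MAXn, MINn, and RPn can be written as follows. They are a reconstruction under that reading, offered as an assumption rather than as the printed formulas.

```latex
\Delta EP_n = \frac{MAX_{n+1} - MAX_n}{MAX_n \cdot RP_n} \tag{2}
\qquad
\Delta EM_n = \frac{MIN_{n+1} - MIN_n}{MIN_n \cdot RP_n} \tag{3}
```

Because MINn is negative, a negative peak that grows in magnitude (for example from -2 to -4) gives a positive ΔEMn, matching the sign convention of ΔEPn.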

MAXn: positive-side maximum value of the representative segment data
MAXn+1: positive-side maximum value of the next representative segment data
MINn: negative-side minimum value of the representative segment data
MINn+1: negative-side minimum value of the next representative segment data
RPn: number of repetitions

In this case, the parameters needed at synthesis time are the representative segment data, the number of repetitions, and ΔEPn and ΔEMn as the envelope data.
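The analysis step that produces ΔEPn and ΔEMn could be sketched as below. The function name `envelope_deltas` and the sample data are hypothetical, and the formulas are an assumed reading of the claim (rate of change between successive maxima, divided by the repetition count), not a transcription of equations (2) and (3).

```python
def envelope_deltas(current, following, repeat_count):
    """Compute separate positive-side and negative-side envelope change values.

    Assumes `current` has at least one positive and one negative sample,
    so that neither divisor is zero.
    """
    max_n, max_next = max(current), max(following)   # MAXn, MAXn+1
    min_n, min_next = min(current), min(following)   # MINn, MINn+1
    delta_ep = (max_next - max_n) / (max_n * repeat_count)
    delta_em = (min_next - min_n) / (min_n * repeat_count)
    return delta_ep, delta_em


# Asymmetric toy segments: the positive peak grows 4 -> 6,
# the negative peak grows -2 -> -4, over 2 repetitions.
seg_a = [0, 4, -2, 0]
seg_b = [0, 6, -4, 0]
delta_ep, delta_em = envelope_deltas(seg_a, seg_b, 2)
```

Here the two sides change at different relative rates (0.25 versus 0.5), which a single ΔE cannot represent.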

FIG. 1 is a synthesized waveform diagram for explaining an embodiment of the present invention, with amplitude on the vertical axis and time on the horizontal axis.

The waveform of FIG. 1 is obtained from the waveform of FIG. 2 by applying envelope change values independently to the positive-side waveform data and the negative-side waveform data.

That is, representative segments a and b are given envelope change values ΔEPa and ΔEPb, respectively, for the positive-side waveform data and envelope change values ΔEMa and ΔEMb, respectively, for the negative-side waveform data, and these values are applied as each representative segment is repeated.

That is, on the first repetition of representative segment a, its amplitude values are multiplied by 1. On the second repetition, an amplitude value of representative segment a is multiplied by 1 + ΔEPn if it is positive and by 1 + ΔEMn if it is negative. Likewise, on the third repetition it is multiplied by 1 + 2ΔEPn if positive and by 1 + 2ΔEMn if negative.
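The per-sample selection between the two gains can be sketched as follows. The function name `apply_split_envelope` and the numeric values are assumptions for illustration; the text does not say how a sample of exactly zero is treated, so this sketch groups it with the positive side, which leaves it at zero either way.

```python
def apply_split_envelope(samples, repeat_count, delta_ep, delta_em):
    """Repeat a representative segment with separate positive and negative gains.

    Repetition k (counting from 0) scales positive samples by
    1 + k * delta_ep and negative samples by 1 + k * delta_em.
    """
    out = []
    for k in range(repeat_count):
        gain_pos = 1 + k * delta_ep
        gain_neg = 1 + k * delta_em
        out.extend(s * (gain_pos if s >= 0 else gain_neg) for s in samples)
    return out


# Hypothetical values: positive peak 4 should ramp toward 6,
# negative peak -2 should ramp toward -4, over 2 repetitions.
split = apply_split_envelope([0, 4, -2, 0], 2, 0.25, 0.5)
```

In this example the peaks run 4, 5 on the positive side and -2, -3 on the negative side, so a following segment with peaks 6 and -4 continues both ramps without a jump.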

By applying separate envelope change values to the positive-side and negative-side waveform data in this way, the change in amplitude between different representative segments is reduced, and speech with little distortion can be synthesized.

[Effect of the Invention]

As described above, according to the present invention, in the waveform segment synthesis method the envelope change values to be applied to the positive-side and negative-side waveform data are held as envelope data, and when a representative segment is repeated the envelope data is applied independently to the positive-side waveform data and the negative-side waveform data. The envelope can thus be varied continuously while the representative segment is repeated, and speech with little distortion is obtained.

[Brief Description of the Drawings]

FIG. 1 is a synthesized waveform diagram for explaining an embodiment of the present invention.
FIG. 2 is a synthesized waveform diagram for explaining the waveform segment synthesis method.
FIG. 3 is a synthesized waveform diagram for explaining the conventional speech synthesis method.

Claims (1)

(57) [Claims]
1. A speech synthesis method using a waveform segment synthesis method in which envelope information concerning the increase or decrease of the amplitude value of waveform data is assigned to a plurality of representative segments, each consisting of a repeated similar waveform, characterized in that the rate of increase or decrease between the positive-side (and negative-side) maximum value of a first representative segment of the input speech and the positive-side (and negative-side) maximum value of the following second representative segment is obtained, the value obtained by dividing this rate by the number of repetitions of the first representative segment is used as the envelope information, and at the time of speech synthesis the envelope information so obtained is given separately to the positive-side waveform data and the negative-side waveform data.
JP60235724A 1985-10-21 1985-10-21 Speech synthesis method Expired - Lifetime JP2560277B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP60235724A JP2560277B2 (en) 1985-10-21 1985-10-21 Speech synthesis method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP60235724A JP2560277B2 (en) 1985-10-21 1985-10-21 Speech synthesis method

Publications (2)

Publication Number Publication Date
JPS6294900A JPS6294900A (en) 1987-05-01
JP2560277B2 true JP2560277B2 (en) 1996-12-04

Family

ID=16990285

Family Applications (1)

Application Number Title Priority Date Filing Date
JP60235724A Expired - Lifetime JP2560277B2 (en) 1985-10-21 1985-10-21 Speech synthesis method

Country Status (1)

Country Link
JP (1) JP2560277B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003108178A (en) 2001-09-27 2003-04-11 Nec Corp Voice synthesizing device and element piece generating device for voice synthesis

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5182502A (en) * 1975-01-08 1976-07-20 Hitachi Ltd

Also Published As

Publication number Publication date
JPS6294900A (en) 1987-05-01

Similar Documents

Publication Publication Date Title
JP3294604B2 (en) Processor for speech synthesis by adding and superimposing waveforms
JP3078205B2 (en) Speech synthesis method by connecting and partially overlapping waveforms
JP6024191B2 (en) Speech synthesis apparatus and speech synthesis method
JPH086592A (en) Method and device for voice synthesis
RU2296377C2 (en) Method for analysis and synthesis of speech
US5381514A (en) Speech synthesizer and method for synthesizing speech for superposing and adding a waveform onto a waveform obtained by delaying a previously obtained waveform
US5452398A (en) Speech analysis method and device for suppyling data to synthesize speech with diminished spectral distortion at the time of pitch change
EP0191531B1 (en) A method and an arrangement for the segmentation of speech
US5321794A (en) Voice synthesizing apparatus and method and apparatus and method used as part of a voice synthesizing apparatus and method
US5369730A (en) Speech synthesizer
JP2560277B2 (en) Speech synthesis method
US5163110A (en) Pitch control in artificial speech
JP2600384B2 (en) Voice synthesis method
JPH09244693A (en) Method and device for speech synthesis
US4075424A (en) Speech synthesizing apparatus
JP5175422B2 (en) Method for controlling time width in speech synthesis
JP2612867B2 (en) Voice pitch conversion method
JPH04116700A (en) Voice analyzing and synthesizing device
JPS62102294A (en) Voice coding system
JPH0772897A (en) Method and device for synthesizing speech
JPH0514280B2 (en)
JPH0693200B2 (en) Speech synthesis method
JPS61128299A (en) Voice analysis/analytic synthesization system
JP3263136B2 (en) Signal pitch synchronous position extraction method and signal synthesis method
JPS628199A (en) Voice synthesization

Legal Events

Date Code Title Description
EXPY Cancellation because of completion of term