JPS5848920B2

JPS5848920B2 - Speech synthesizer sound source creation device

Info

Publication number: JPS5848920B2
Application number: JP4669278A
Authority: JP
Inventors: 啓山本; 一成入江
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1978-04-21
Filing date: 1978-04-21
Publication date: 1983-10-31
Also published as: JPS54139407A

Description

【発明の詳細な説明】本発明は、音声合成器の音源に対し、短時間振幅の緩や
かな変化と連続的な有声無声切換え動作を行わせる音声
合成器の音源作成装置に関するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a sound source creation device for a speech synthesizer that performs gradual changes in short-term amplitude and continuous voiced/unvoiced switching operations on the sound source of the speech synthesizer.

従来の音声合或装置は第１図に示したように構成され、
又音源作成回路は第１図の点線で示した部分のように構
戒されている。A conventional voice synthesis device is constructed as shown in FIG.
Also, the sound source creation circuit is structured as shown by the dotted line in FIG.

第１図において、１は音声情報人力端子、２は音声出力
端子、３は音声情報分離回路、４はスペクトル情報の出
力端子、５は有声無声切替情報の出力端子、６はピッチ
情報出力端子、７は振幅情報の出力端子であり、入力端
子１から直列信号として入力された信号は音声情報分離
回路３の端子４，５，６，７からの出力情報に分離され
、これらの出力はフレーム周期（約１０ｍｓｅｃ〜
２５ｍｓｅｃ）毎に更新される。In FIG. 1, 1 is an audio information manual terminal, 2 is an audio output terminal, 3 is an audio information separation circuit, 4 is an output terminal for spectrum information, 5 is an output terminal for voiced/unvoiced switching information, 6 is a pitch information output terminal, 7 is an output terminal for amplitude information, and the signal inputted as a serial signal from input terminal 1 is separated into output information from terminals 4, 5, 6, and 7 of audio information separation circuit 3, and these outputs have a frame period. (about 10 msec~
It is updated every 25 msec).

８は計数器であり、この計数値はピッチ周期の値から１
２５μ秒毎に１だけ減じて行き、０となった時にまたピ
ッチ周期の値とするもので、計数値がＯの時のみこのカ
ウンタ出力は１となり、それ以外の場合はＯの出力とす
る。8 is a counter, and this count value is 1 from the pitch period value.
It is decremented by 1 every 25 microseconds, and when it reaches 0, it is taken as the value of the pitch period again. Only when the count value is O, the counter output becomes 1, and in other cases, the output is O.

９は乗算器であり、前記計数器８の出力と端子Ｔの振幅
値を乗算器９で乗じることにより、ピッチ周期毎に端子
７で指定された値のパルスを得る。9 is a multiplier, and by multiplying the output of the counter 8 and the amplitude value of the terminal T by the multiplier 9, a pulse of the value specified at the terminal 7 is obtained for each pitch period.

即ち、乗算器９の出力で有声音時の音源が得られる。That is, the output of the multiplier 9 provides a sound source for voiced sound.

１０は１と−１を１２５μ秒毎に不規則に得るランダム
パルス発生回路、１１は乗算器で、この乗算器１１の出
力は端子７で指定された振幅の値によって極性が不規則
なパルスを得る。10 is a random pulse generation circuit that randomly generates 1 and -1 every 125 μs, 11 is a multiplier, and the output of this multiplier 11 generates pulses with irregular polarity depending on the amplitude value specified at terminal 7. obtain.

即ち、乗算器１１の出力で無声音時の音源が得られる。That is, the output of the multiplier 11 provides a sound source for unvoiced sound.

１２は切替スイッチ回路で、乗算器９と１１で得た情報
を端子５で指定される有声無声切替情報により選択する
回路である。Reference numeral 12 denotes a changeover switch circuit which selects the information obtained by the multipliers 9 and 11 based on the voiced/unvoiced switching information designated by the terminal 5.

この切替スイッチ回路１２で得られた音源情報は端子４
のスペクトル情報と共に用いられ、合成デイジタルフィ
ルタ回路１３によりデイジタル音声出力が得られ、デイ
ジタル・アナログ変換回路１４を介してスピーカ１５か
ら合成音声が得られる。The sound source information obtained by this changeover switch circuit 12 is transmitted to the terminal 4.
A digital audio output is obtained by the synthesis digital filter circuit 13, and a synthesized speech is obtained from the speaker 15 via the digital-to-analog conversion circuit 14.

この従来例の動作は、第２図に示したように、フレーム
とフレームの境界において振幅が不連続になり、無声音
から有声音への滑らかな移行が行われず、実際の音声と
は異なる合成音が得られるという欠点があった。As shown in Figure 2, the operation of this conventional example results in amplitude discontinuity at the boundaries between frames, a smooth transition from unvoiced sound to voiced sound, and a synthesized sound that differs from actual speech. It had the disadvantage that it could be obtained.

又、第３図に示したように、スペクトル情報については
、補間して用いればよい合戒音が得られることが既に提
案されているが、この場合、良品質の合成音を得るには
、音源情報として１／２フレームだけずらして使用する
必要があった。Furthermore, as shown in Figure 3, it has already been proposed that spectral information can be used by interpolation to obtain a combined sound, but in this case, in order to obtain a high-quality synthesized sound, it is necessary to It was necessary to use it as information by shifting it by 1/2 frame.

本発明は、上記従来例の欠点を解決するために、パルス
列、ランダム雑音の振幅に対して補間を行い、振幅の急
激な変化を防ぐと同時に、有声無声の切替わり時点にお
いて、パルス列とランダム雑音の混在区間を置くことに
より、滑らかな有声無声の切換え動作を可能にし、合成
音声の品質向上を計る音声合成器の音源作成装置を提供
するものである。In order to solve the above-mentioned drawbacks of the conventional example, the present invention performs interpolation on the amplitude of the pulse train and random noise to prevent sudden changes in the amplitude, and at the same time, at the time of switching between voiced and unvoiced, the pulse train and random noise are interpolated. The present invention provides a sound source creation device for a speech synthesizer that enables smooth voiced/unvoiced switching operations and improves the quality of synthesized speech by providing mixed sections.

以下、図面により実施例を詳細に説明する。Hereinafter, embodiments will be described in detail with reference to the drawings.

第４図は、本発明の１実施例を示したもので、第１図と
同一符号のものは同一のものを示しており、又１６は分
岐回路であり、有声時には端子１７に端子７からの振幅
情報を出力し、端子１８にＯレベルの振幅情報を出力し
、無声時には、端子１７にＯレベルの振幅情報を出力し
、端子１８に端子７からの振幅情報を出力する。FIG. 4 shows one embodiment of the present invention, in which the same reference numerals as in FIG. , and outputs O-level amplitude information to terminal 18 . During silence, O-level amplitude information is output to terminal 17 , and amplitude information from terminal 7 is output to terminal 18 .

ここで、端子１７，１８に得られる情報はそれぞれ音声
成分の振幅情報、無声成分の振幅情報である。Here, the information obtained at the terminals 17 and 18 is the amplitude information of the voice component and the amplitude information of the unvoiced component, respectively.

又端子４のスペクトル情報、端子５のピッチ周期、端子
１７．１８からの有声成分の振幅情報、無声成分の振幅
情報は補間回路１９，２０，２１，２２でそれぞれ補間
され、滑らかな変化をする情報として使用される。In addition, the spectrum information of terminal 4, the pitch period of terminal 5, the amplitude information of voiced components, and the amplitude information of unvoiced components from terminals 17 and 18 are interpolated by interpolation circuits 19, 20, 21, and 22, respectively, and change smoothly. Used for information.

又これらの補間回路１９，２０，２１，２２は同様の回
路であり、兼用した回路構成とすることが可能であり、
回路規模を大きく増加させることはない。Furthermore, these interpolation circuits 19, 20, 21, and 22 are similar circuits, and can have a common circuit configuration.
It does not significantly increase the circuit scale.

又計数器８と補間回路２７の出力が乗算器９で乗算され
た有声成分の振幅情報と、ランダム雑音発生器１０の出
力と補間回路２２の出力が乗算器１１で乗算された無声
成分は加算回路２３で合成され、音源情報が得られる。Also, the amplitude information of the voiced component obtained by multiplying the outputs of the counter 8 and the interpolation circuit 27 by the multiplier 9 and the unvoiced component obtained by multiplying the output of the random noise generator 10 and the output of the interpolation circuit 22 by the multiplier 11 are added together. The signals are synthesized in a circuit 23 and sound source information is obtained.

この音源情報は合成デイジタルフィルタ回路１３で端子
４からのスペクトル情報と共に用いられてデイジタル音
声出力が得られ、デイジタル・アナログ変換回路１４を
介してスピーカ１５から合成音声が得られる。This sound source information is used in the synthesis digital filter circuit 13 together with the spectrum information from the terminal 4 to obtain a digital audio output, and a synthesized audio is obtained from the speaker 15 via the digital-to-analog conversion circuit 14.

なお、点線で示した部分２４は音源作或回路である。Note that a portion 24 indicated by a dotted line is a sound source production circuit.

第５図は、上記実施例の動作を説明するために、第４図
の各部の情報変化を示したもので、このように滑らかに
変化する有声成分と無声成分の振幅情報を得ることがで
きる。In order to explain the operation of the above embodiment, FIG. 5 shows information changes in each part of FIG. 4. In this way, smoothly changing amplitude information of voiced and unvoiced components can be obtained. .

この振幅情報は、第５図の４で示した振幅情報よりも１
／２フレ一本だけ統計的にずれているが、本実施例のよ
うにスペクトル情報およびピッチ情報も補間することに
より、振幅情報と同じ位相となり、各情報の位相が一致
したシステムが構成される。This amplitude information is 1
Although there is a statistical deviation by one /2 frame, by interpolating the spectrum information and pitch information as in this example, the phase becomes the same as that of the amplitude information, and a system in which the phases of each information match is constructed. .

以上説明したように、本発明によれば、回路規模の大き
な増大を招かずに連続的な有声無声切換え動作と振幅の
滑らかな変化を実現することが可能であり、音声合成器
の音源として使用した場合、合成音声の品質向上を計る
ことができ、又、スペクトル情報およびピッチ情報も補
間することにより、各情報の位相が一致したシステムを
構成できるという利点がある。As explained above, according to the present invention, it is possible to realize continuous voiced/unvoiced switching operation and smooth change in amplitude without causing a large increase in circuit scale, and it can be used as a sound source for a speech synthesizer. In this case, it is possible to improve the quality of the synthesized speech, and by also interpolating spectrum information and pitch information, there is an advantage that a system in which the phases of each information are matched can be constructed.

[Brief explanation of drawings]

第１図は、従来方式による音源作成装置の構成図、第２
図は、従来方式における各情報の時間変化の一例を示し
た図、第３図は、スペクトル情報に補間を行った例を示
した図、第４図は、本発明の１実施例のブロック図、第
５図は、第４図の各情報の時間変化を示した図である。１・・・・・・音声情報入力端子、２・・・・・・音声
出力端子、３・・・・・・音声情報分離回路、４・・・
・・・スペクトル情報の出力端子、５・・・・・・有声
無声切替情報の出力端子、６・・・・・・ピッチ情報出
力端子，７・・・・・・振幅情報の出力端子、８・・・
・・・計数器、９，１１・・・・・・乗算器、１０・・
・・・・ランダムパルス発生器、１３・・・・・・合成
デイジタルフィルタ回路、１４・・・・・・デイジタル
・アナログ変換回路、１５・・・・・・スピーカ、１６
・・・・・・分岐回路、１７，１８・・・・・・端子、
１９，２０，２１．２２・・・・・・補間回路、２３・
・・・・・加算回路。Figure 1 is a configuration diagram of a conventional sound source creation device;
The figure shows an example of how each information changes over time in the conventional method, Figure 3 shows an example of interpolation of spectrum information, and Figure 4 is a block diagram of one embodiment of the present invention. , FIG. 5 is a diagram showing temporal changes in each piece of information in FIG. 4. 1...Audio information input terminal, 2...Audio output terminal, 3...Audio information separation circuit, 4...
... Spectral information output terminal, 5 ... Voiced/unvoiced switching information output terminal, 6 ... Pitch information output terminal, 7 ... Amplitude information output terminal, 8 ...
... Counter, 9, 11 ... Multiplier, 10...
... Random pulse generator, 13 ... Synthesis digital filter circuit, 14 ... Digital-to-analog conversion circuit, 15 ... Speaker, 16
...branch circuit, 17,18...terminal,
19, 20, 21. 22... Interpolation circuit, 23.
...Addition circuit.

Claims

[Claims]

1 Voiced/unvoiced judgment is performed for each analysis frame, and a pulse train is generated for voiced sounds, and random noise is generated for unvoiced sounds.The amplitudes of voiced sounds and unvoiced sounds are interpolated for each sound source signal. 1. A sound source creation device for a speech synthesizer, characterized in that a mixed section of a pulse train and random noise is set up.