JPH02239296A

JPH02239296A - Method and device for synthesizing voice

Info

Publication number: JPH02239296A
Application number: JP6015589A
Authority: JP
Inventors: Shoji Ogata; 緒方　正二
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1989-03-13
Filing date: 1989-03-13
Publication date: 1990-09-21

Abstract

PURPOSE:To prevent the voice synthesis output from being unstable regardless of rough quantization of voice data by constituting a circuit, which generates plural sine waves different in frequency and amplitude as element signals of voice synthesis, of a digital circuit which reads out data from a digital storage device. CONSTITUTION:The read speed is determined by the output of a frequency value code read out from a voice data storage device 14 to control the read, and numerical data of sine waveforms is read out from a waveform storage device 16. Read-out numerical data is converted to an analog signal by a D/A converter 17. The converted analog output is applied to a variable gain amplifier 18. The output of an amplitude value code read out from a voice data storage device 14 is applied to the variable gain amplifier 18 whose gain is controlled, and respective sine waves outputted from the variable gain amplifier 18 are synthesized by a synthesizer 19 to obtain the voice synthesis output. Thus, the voice synthesis output is not unstable regardless of rough quantization of voice data.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明は、音声合成方法および装置に関し，特に、簡単
なデジタル回路で装置構成が可能な複合正弦波方式で音
声合成を行う音声合成方法および装置に関するものであ
る。[Detailed Description of the Invention] [Industrial Application Field] The present invention relates to a speech synthesis method and device, and in particular to a speech synthesis method and device that performs speech synthesis using a composite sine wave method that can be configured with a simple digital circuit. It is related to the device.

[Conventional technology]

従来から開発されている音声合成方式は、音声波形その
ものを符号化する波形符号化方式と，音声波形の特徴を
利用してパラメータ化するパラメータ合成方式に大別で
きる．波形符号化方式にはＰＣＭ　（パルス符号変調），ＤＰ
ＣＭ　（差分ＰＣＭ），ＤＭ　（デルタ変調），ＡＰＣ
Ｍ　（適応ＰＣＭ），ＡＤＰＣＭ　（適応ＤＰＣＭ），
ＡＤＭ　（適応ＤＭ）等の方式があり、また、パラメー
タ合成方式には．ＬＰＧ　（線形予測符号化），ＰＡＲ
ＣＯＲ　（偏自己相関）等の方式がある。Speech synthesis methods that have been developed can be broadly divided into waveform encoding methods that encode the speech waveform itself, and parameter synthesis methods that use the characteristics of the speech waveform to create parameters. Waveform encoding methods include PCM (pulse code modulation) and DP.
CM (differential PCM), DM (delta modulation), APC
M (adaptive PCM), ADPCM (adaptive DPCM),
There are methods such as ADM (adaptive DM), and parameter synthesis methods. LPG (linear predictive coding), PAR
There are methods such as COR (partial autocorrelation).

ところで、波形符号化方式は，音声合成を行う合成回路
が簡単になるが、データの圧縮度が小さいため，データ
メモリのメモリ容量を大きくする必要がある．また，パ
ラメータ合成方式は，データ圧縮度が大きいため、デー
タメモリのメモリ容量を小さくできるが、逆に、音声合
成を行う合成回路が複雑になる６近年の大規模集積回路
（ＬＳＩ）技術の進歩により複雑な合成回路も容易に製
作できるようになっているので、パラメータ合成方式が
主流になりつつあるが、パラメータ合成方式では、合成
される音声の基本周波数はサンプリングパルスの周波数
で決定される離散値しか取りえず、デジタルフィルタの
演算を行って波形合成するため、デジタルフィルタの特
性がパラメータ伝送時のエラー等により、安定性が損な
われる，このため、音程ずれが生じ、また、めりはりの
ない合成音となってしまうという問題がある。更に構成
を簡易なものとするため、量子化を粗くすると，フィル
タの特性が不安定となるという問題がある。By the way, the waveform encoding method simplifies the synthesis circuit that performs speech synthesis, but because the degree of data compression is low, it is necessary to increase the memory capacity of the data memory. In addition, since the parameter synthesis method has a high degree of data compression, it is possible to reduce the memory capacity of the data memory, but on the other hand, the synthesis circuit that performs speech synthesis becomes complicated. 6 Recent advances in large-scale integrated circuit (LSI) technology As it has become easier to create complex synthesis circuits, the parameter synthesis method is becoming mainstream. Since only values can be taken and the waveform is synthesized by performing digital filter calculations, the characteristics of the digital filter may become unstable due to errors during parameter transmission, etc., resulting in pitch deviations and sharpness. There is a problem in that the result is a synthesized sound with no sound. Furthermore, if the quantization is coarsened in order to simplify the configuration, there is a problem that the characteristics of the filter become unstable.

これに対して、デジタルフィルタを使わないで、簡単な
アナログ回路で構成できる複合正弦波モデルによる音声
合成法がある。これは、音声を周波数ｆエと振幅ｍＬの
異なるｎ個の正弦波の和でモデル化する．そして、標本
音声の自己相関とモデルの自己相関が等しくなるように
周波数ｆエと振幅ｍＬ（１≦ｉ≦ｎ）を定めて，音声合
成の要素信号となる各々の正弦波を規定する。On the other hand, there is a speech synthesis method using a complex sine wave model that can be configured with a simple analog circuit without using a digital filter. This models the voice as a sum of n sine waves with different frequencies f and amplitudes mL. Then, the frequency f and the amplitude mL (1≦i≦n) are determined so that the autocorrelation of the sample voice is equal to the autocorrelation of the model, and each sine wave that becomes an element signal for speech synthesis is defined.

このような複合正弦波モデルによる音声合成法によって
、音声合成を行う音声合成装置は、例えば，周波数ｆｌ
＃Ｌｔｆ３ｔ・・・・ｌ　ｆｆｌの正弦波を発生するｎ
個の可変周波数発振器と，電圧で利得可変のｎ個の増幅
器とを組み合せて構成する。この音声合成装置において
、大事なことは音声ピッチを付与することで、このため
，例えば、ピッチ周期ごとに正弦波の位相をＯに初期化
する位相リセット回路等が備えられる，この種の複合正弦波モデルによる音声合成法によって、
音声合成を行う音声合成装置に関する公知文献としては
，例えば、特開昭５８−１４７７９８号公報，特開昭６
１−２１０００号公報等が挙げられる．〔発明が解決しようとする課題〕ところで、上述のように、音声を周波数と振幅の異なる
複数個の正弦波の和でモデル化する複合正弦波音声合成
方式による音声合成装置は、要求される合成音声の品質
に応じて，音声の量子化を粗らくしても、デジタルフィ
ルタを用いないので、回路が安定に動作するという特長
を有するが、音声合成装置は複数個の可変周波数発振器
と利得可変な複数個の増幅器を組合せたアナログ回路で
構成することになるため、回路要素としてはアナログ回
路が多くなり、集積回路化が困難であるという問題があ
る。A speech synthesis device that performs speech synthesis using such a speech synthesis method using a composite sine wave model, for example, uses a frequency fl
#Ltf3t...l n that generates a sine wave of ffl
It is constructed by combining n variable frequency oscillators and n amplifiers whose gain is variable with voltage. In this speech synthesis device, the important thing is to give the speech pitch, and for this purpose, for example, this type of composite sine synthesizer is equipped with a phase reset circuit that initializes the phase of the sine wave to O for each pitch period. By using the wave model-based speech synthesis method,
Publicly known documents related to speech synthesis devices that perform speech synthesis include, for example, Japanese Patent Application Laid-open No. 147798/1983,
1-21000, etc. [Problems to be Solved by the Invention] As described above, a speech synthesizer using a composite sine wave speech synthesis method that models speech as a sum of multiple sine waves with different frequencies and amplitudes cannot achieve the required synthesis. It has the advantage that the circuit operates stably even if the audio quantization is coarsened depending on the quality of the audio because no digital filter is used.However, the audio synthesizer uses multiple variable frequency oscillators and variable gain. Since it is constructed from an analog circuit that is a combination of a plurality of amplifiers, there is a problem that there are many analog circuits as circuit elements, and it is difficult to integrate the circuit.

本発明は、上記問題点を解決するためになされたもので
ある。The present invention has been made to solve the above problems.

本発明の目的は、集積回路化が容易な簡単なデジタル回
路を用いて音声合成ができる複合正弦波方式の音声合成
方法を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a composite sine wave speech synthesis method that can synthesize speech using a simple digital circuit that can be easily integrated into an integrated circuit.

本発明の他の目的は、簡単なデジタル回路を用いて装置
構成が可能な音声合成装置を提供することにある．本発明の前記ならびにその他の目的と新規な特徴は、本
明細書の記述及び添付図面によって明らかになるであろ
う。Another object of the present invention is to provide a speech synthesis device that can be configured using a simple digital circuit. The above and other objects and novel features of the present invention will become apparent from the description of this specification and the accompanying drawings.

ｃｉ＊題を解決するための手段〕上記目的を達成するため，本発明においては、音声を複
数の異なる正弦波の組合せでモデル化し、モデル化した
各々の正弦波を周波数値コードと振幅値コードとの組合
せでコード化して記憶する音声データ記憶装置と、正弦
波の波形を数値化して記憶する波形記憶装置とを備え、
音声データ記憶装置から読み出した周波数値コードの出
力により続出速度を定めて，波形記憶装置から正弦波波
形の数値データを読出し、読出した数値データをアナロ
グ信号に変換し、音声データ記憶装置から読み出した振
幅値コードの出力により利得を制御した増幅器を介して
前記アナログ信号を出力して、周波数と振幅の異なる正
弦波を生成し．生成した正弦波の複数個の和により音声
を合成することを特徴とする．また、音声成分の各々の正弦波を周波数値コードと振幅
値コードとの組合せでコード化して記憶する音声データ
記憶装置と、正弦波の波形を数値化して記憶する波形記
憶装置と、音声データ記憶装置から読み出した周波数値
コードの出力によリ読出速度を定めて読出し制御を行う
読出し制御手段と、波形記憶装置からの読出し出カをア
ナログ信号に変換するデジタル／アナログ変換器と、音
声データ記憶装置から読み出した振幅値コードの出力に
より利得を制御する可変利得増幅器とを備えることを特
徴とする。Means for Solving the ci* Problem] To achieve the above object, in the present invention, speech is modeled by a combination of a plurality of different sine waves, and each modeled sine wave is divided into a frequency value code and an amplitude value code. an audio data storage device that encodes and stores the sine wave waveform in combination with the sine wave, and a waveform storage device that digitizes and stores the waveform of the sine wave;
The successive output speed is determined by the output of the frequency value code read from the audio data storage device, the numerical data of the sine wave waveform is read from the waveform storage device, the read numerical data is converted into an analog signal, and the converted signal is read from the audio data storage device. The analog signal is outputted through an amplifier whose gain is controlled by the output of an amplitude value code to generate a sine wave having a different frequency and amplitude. The feature is that speech is synthesized by the sum of multiple generated sine waves. The present invention also includes an audio data storage device that encodes and stores each sine wave of the audio component using a combination of a frequency value code and an amplitude value code, a waveform storage device that digitizes and stores the waveform of the sine wave, and an audio data storage device. A readout control means that controls the readout by determining the readout speed based on the output of the frequency value code read from the device, a digital/analog converter that converts the readout output from the waveform storage device into an analog signal, and an audio data storage. The apparatus is characterized by comprising a variable gain amplifier whose gain is controlled by the output of an amplitude value code read from the apparatus.

正弦波の波形を数値化して記憶する波形記憶装置は、正
弦波の位相角の値に対応するアドレスコードのアドレス
に、当該位相角の正弦波の波高値の数値データを記憶し
ている。A waveform storage device that digitizes and stores the waveform of a sine wave stores numerical data of the peak value of the sine wave at the phase angle at an address of an address code corresponding to the value of the phase angle of the sine wave.

[Effect]

前記手段によれば、音声合成装置には、音声データ記憶
装置と．波形記憶装置と，読出し制御手段と、デジタル
／アナログ変換器と、可変利得増幅器とが備えられる。According to the above means, the speech synthesis device includes a speech data storage device. A waveform storage device, a readout control means, a digital/analog converter, and a variable gain amplifier are provided.

音声データ記憶装置は、音声を複数の異なる正弦波の組
合せでモデル化し、モデル化した各々の正弦波を周波数
値コードと振幅値コードとの組合せでコード化して記憶
する。また、波形記憶装置は、正弦波の波形を数値化し
て記憶する。読出し制御手段が，音声データ記憶装置か
ら読み出した周波数値コードの出力により読出速度を定
めて読出し制御を行い，波形記憶装置から正弦波波形の
数値化データを読出す。読出された数値化データは、デ
ジタル／アナログ変換器によりアナログ信号に変換され
る。変換されたアナログ出力は可変利得増幅器に加えら
れる．また，音声データ記憶装置から読み出された振幅
値コードの出力が，利得を制御する可変利得増幅器に加
えられており、この可変利得増幅器からの出力の各々の
正弦波のを合成して、音声合成出力を得る。The audio data storage device models audio using a combination of a plurality of different sine waves, codes each modeled sine wave using a combination of a frequency value code and an amplitude value code, and stores the code. Further, the waveform storage device converts the waveform of the sine wave into numerical values and stores it. The read control means determines the read speed based on the output of the frequency value code read from the audio data storage device, performs read control, and reads digitized data of the sine wave waveform from the waveform storage device. The read numerical data is converted into an analog signal by a digital/analog converter. The converted analog output is applied to a variable gain amplifier. In addition, the output of the amplitude value code read from the audio data storage device is applied to a variable gain amplifier that controls the gain, and each sine wave output from the variable gain amplifier is synthesized to generate an audio signal. Get the composite output.

このように、本発明の音成合成装置においては、音声合
成の要素信号である異なる周波数と振幅の複数の正弦波
を発生する回路は、デジタル記憶装置からデータ読み出
しを行う簡単なデジタル回路で構成されるので、合成す
る音声信号に対応してデータ読み呂しの制御データを与
えることにより、任意の周波数の任意の振幅の複数の正
弦波を発生させることができる．これにより、音成合成装置は主要部が簡単なデジタル回
路で構成可能となり、集積回路化が容易となる。また、
音声合成は複合正弦波法で行うので、音声合成のための
音声データの量子化を粗くしても、音声合成出力が不安
定にならない音声合成装置とすることができる。In this way, in the sound synthesis device of the present invention, the circuit that generates a plurality of sine waves of different frequencies and amplitudes, which are the element signals of speech synthesis, is constructed of a simple digital circuit that reads data from a digital storage device. Therefore, multiple sine waves of any frequency and any amplitude can be generated by providing control data for data readout corresponding to the audio signal to be synthesized. As a result, the main part of the sound synthesis device can be configured with a simple digital circuit, and it can be easily integrated into an integrated circuit. Also,
Since speech synthesis is performed using the composite sine wave method, it is possible to provide a speech synthesis apparatus in which the speech synthesis output does not become unstable even if the quantization of speech data for speech synthesis is made coarse.

〔Example〕

以下、本発明の一実施例を図面を用いて具体的に説明す
る。Hereinafter, one embodiment of the present invention will be specifically described using the drawings.

なお、実施例を説明するための全図において、同一要素
のものは同一符号を付け、その繰り返しの説明は省略す
る。In all the figures for explaining the embodiments, the same elements are given the same reference numerals, and repeated explanations thereof will be omitted.

第１図は、本発明の一実施例にかかるデジタル回路で構
成した音声合成装置の要部のブロック図である。第１図
において、１０は音声合成を行う主要部の音声合成回路
である．１１は発振器、ｌ２は力ウンタ、１３アドレス
カウンタ、１９は合成した音声を呂力する合成器である
。主要部の音声合成回路１０には、音声合成の要素成分
信号となる各々の正弦波の周波数と振幅の値をそれぞれ
周波数値コードと振幅値コードとの組合せでコード化さ
れた音声データが記憶されている音声データ記憶装置を
含む記憶装置のＲＯＭ群１４，分周器群１５，正弦波波
形の数値データを記憶している波形記憶装置を含む正弦
波発生器群１６，デジタル／アナログ変換器群１７，お
よび減衰器群１８が設けられている。音声合成回路１０
において、ＲＯＭ群１４の各々のＲＯＭに対応して，分
局器群１５，正弦波発生器群１６，デジタル／アナログ
変換器群１７，および減衰器群１８における各々の回路
要素が対応して設けられている。正弦波発生器群１６の
回路要素の各々の正弦波発生器は、正弦波波形の数値デ
ータから正弦波を発生する。すなわち、音声データの周
波数値コードのコードデータにより分周器群１５の各々
の回路要素を制御し、各々の正弦波波形の数値データの
読み出し速度が制御される．これにより、正弦波発生器
群ｌ６の回路要素の各々の正弦波発生器は個別に読み出
し速度が制御されて，各々の正弦波発生器から周波数が
異なる複数の正弦波の数値データのデジタルデータが読
み出される。読み出された各々の正弦波のデジタルデー
タは、デジタル／アナログ変換器群１７の各々の回路要
素の個別のデジタル／アナログ変換器に加えられて、ア
ナログ信号の正弦波に変換される。個々のアナログ信号
の正弦波は、更に減衰器群１８の各々の回路要素の減衰
器に加えられる。減衰器群１８の各々の回路要素の減衰
器は、ＲＯＭ群１４の各々のＲＯＭがら読み出された音
声データの振幅値コードのコードデータにより減衰度が
制御されて、増幅器で増幅する利得が可変され、所定の
振幅値の正弦波として出力される。減衰器群１８の各々
の回路要素の減衰器から出力された周波数と振幅の異な
る複数の正弦波は、合成器１９により合成されて、合成
音声出力として出力される。FIG. 1 is a block diagram of the main parts of a speech synthesis device constructed from digital circuits according to an embodiment of the present invention. In FIG. 1, 10 is a speech synthesis circuit which is the main part that performs speech synthesis. 11 is an oscillator, 12 is a force counter, 13 is an address counter, and 19 is a synthesizer for outputting the synthesized voice. The main part of the speech synthesis circuit 10 stores speech data in which the frequency and amplitude values of each sine wave, which are elemental component signals for speech synthesis, are coded by a combination of a frequency value code and an amplitude value code. A ROM group 14 of storage devices including an audio data storage device, a frequency divider group 15, a sine wave generator group 16 including a waveform storage device storing numerical data of a sine wave waveform, and a digital/analog converter group. 17, and an attenuator group 18 are provided. Speech synthesis circuit 10
In the ROM group 14, each circuit element in the divider group 15, sine wave generator group 16, digital/analog converter group 17, and attenuator group 18 is provided correspondingly to each ROM in the ROM group 14. ing. The sine wave generator of each circuit element of the sine wave generator group 16 generates a sine wave from numerical data of a sine wave waveform. That is, each circuit element of the frequency divider group 15 is controlled by the code data of the frequency value code of the audio data, and the reading speed of numerical data of each sine wave waveform is controlled. As a result, the reading speed of each sine wave generator of the circuit element of the sine wave generator group l6 is individually controlled, and digital data of numerical data of a plurality of sine waves having different frequencies is output from each sine wave generator. Read out. Each read sine wave digital data is applied to an individual digital/analog converter of each circuit element of the digital/analog converter group 17 and converted into a sine wave analog signal. The individual analog signal sine waves are further applied to the attenuator of each circuit element of attenuator group 18. The attenuation degree of the attenuator of each circuit element of the attenuator group 18 is controlled by the code data of the amplitude value code of the audio data read from each ROM of the ROM group 14, and the gain amplified by the amplifier is variable. and output as a sine wave with a predetermined amplitude value. A plurality of sine waves having different frequencies and amplitudes output from the attenuators of each circuit element of the attenuator group 18 are synthesized by a synthesizer 19 and output as a synthesized audio output.

また、ここでの発振器１１は、音声合成を行う音声合成
回路１０の基本タイミングＣＬＫＩを発生して、回路全
体を制御する。発振器１１がらの基本タイミングＣＬＫ
Ｉはカウンタ１２に供給され，音声のピッチ周期のピッ
チ周波数タイミングＣＬＫ２を発生させる。カウンタ１
２からのピッチ周波数タイミングＣＬＫ２は、分局器群
１５と正弦波発生器群１６に供給されて、合成音声のピ
ッチ周期の位相を制御する。また、ピッチ周波数タイミ
ングＣＬＫ２は、アドレスカウンタ１３に供給される。Further, the oscillator 11 here generates a basic timing CLKI for the speech synthesis circuit 10 that performs speech synthesis, and controls the entire circuit. Basic timing CLK from oscillator 11
I is supplied to a counter 12, which generates a pitch frequency timing CLK2 of the pitch period of the voice. counter 1
The pitch frequency timing CLK2 from 2 is supplied to the divider group 15 and the sine wave generator group 16 to control the phase of the pitch period of the synthesized speech. Further, the pitch frequency timing CLK2 is supplied to the address counter 13.

アドレスカウンタ１３のカウント値は、ＲＯＭ群１４に
対する読み出しアドレスを与える．次に、上述した音声合成装置の要素の個々の回路構成を
説明する。The count value of the address counter 13 provides a read address for the ROM group 14. Next, the individual circuit configurations of the elements of the above-mentioned speech synthesis device will be explained.

複合正弦波法で音声を合成する要素成分信号の個々の正
弦波を発生させるため、音声データとして、ｎ個の周波
数値を与える周波数値コードとｎ個の振幅値を与える振
幅値コードとを組合せたコードデータが予め設定される
。この組合せコードのコードデータ（音声データ）は、
ＲＯＭ群１４のＲＯＭデータとして予め記憶される。In order to generate individual sine waves of elemental component signals for synthesizing audio using the composite sine wave method, a frequency value code giving n frequency values and an amplitude value code giving n amplitude values are combined as audio data. The code data is set in advance. The code data (audio data) of this combination code is
The data is stored in advance as ROM data in the ROM group 14.

第２図は、音声データの周波数値コードと振幅値コード
化して組合せたコードデータの一例を示す図である．第
２図のコードデータは、音声成分の正弦波の周波数値と
振幅値とをコード化して組合せたデータの例であり，合
成する音声に応じてピッチ周期Ｔの１　／　ｎ時間単位
で、各時間単位における音声データがコードデータとし
て時系列的に記憶されている例である。第２図において
，周波数値コードＦ０。〜Ｆ．Ｉｌは、前述したＲＯＭ
群１４から読み出され、正弦波波形の数値データの読出
し速度を与える。また，振幅値コードＭ０。〜Ｍ０は、
前述したＲＯＭ群１４から読み出され、減衰器群１８の
利得を制御する減衰度の値を与える。このように、周波
数値コードＦ１Ｊと振幅値コードＭ．Ｊとを組合せたコ
ードデータＦＬ，・Ｍエ，が１つの正弦波を規定してい
る。FIG. 2 is a diagram showing an example of code data obtained by converting audio data into frequency value code and amplitude value code and combining them. The code data in Fig. 2 is an example of data in which the frequency value and amplitude value of the sine wave of the audio component are coded and combined. This is an example in which audio data in units of time is stored chronologically as code data. In FIG. 2, frequency value code F0. ~F. Il is the aforementioned ROM
It is read out from the group 14 and provides the readout speed of numerical data in a sinusoidal waveform. Also, amplitude value code M0. ~M0 is
It is read from the aforementioned ROM group 14 and provides an attenuation value for controlling the gain of the attenuator group 18. In this way, the frequency value code F1J and the amplitude value code M. The code data FL, ·M, which is a combination of J and J, defines one sine wave.

第３図は、正弦波波形を数値データとして記憶している
波形記憶装置の読出し速度制御を音声データの周波数値
コードＦエ，により行う速度制御回路の一例を示すブロ
ック回路図である。第３図において、ＣＬＫＩは基本タ
イミング、ＣＬＫ２はピッチ周波数タイミング、Ｄ０〜
Ｄ０は周波数値コードＦ　ＬＪのビットデータである。FIG. 3 is a block circuit diagram showing an example of a speed control circuit that controls the read speed of a waveform storage device that stores a sine wave waveform as numerical data using a frequency value code F of audio data. In Figure 3, CLKI is the basic timing, CLK2 is the pitch frequency timing, D0~
D0 is bit data of the frequency value code FLJ.

この速度制御回路は、カウンタを構成するフリップフロ
ップ群と制御のための論理ゲート群から構成されており
、例えば，第１図のブロック図においては、分局器群１
５の要素回路の分周器に含まれる回路である。This speed control circuit is composed of a group of flip-flops constituting a counter and a group of logic gates for control. For example, in the block diagram of FIG.
This circuit is included in the frequency divider of the 5th element circuit.

この速度制御回路は、例えば、周波数値コードＦＬＪの
コード値のビットデータＤ．〜Ｄｎをフリツプフロップ
ＣＮＴ２°〜ＣＮＴ２”にピッチ周波数タイミングＣＬ
Ｋ２または出力信号ＣＮＴＯＵＴによってセットし、基
本タイミングＣＬＫＩによりカウントアップさせる回路
である。いわゆるブリセットカウンタである。セットす
る周波数値コードのビットコードＤ０〜Ｄ０の値により
、出力信号ＣＮＴＯＵＴの出力周期が変化する。This speed control circuit uses, for example, bit data D. of the code value of the frequency value code FLJ. ~Dn to flip-flop CNT2°~CNT2'' with pitch frequency timing CL
This circuit is set by K2 or output signal CNTOUT and counts up by basic timing CLKI. This is a so-called brisset counter. The output cycle of the output signal CNTOUT changes depending on the values of the bit codes D0 to D0 of the frequency value code to be set.

第４図は，第３図の速度制御回路の動作例を説明するタ
イミングチャートである。この例では、ｎ＝３の場合に
おけるタイミングチャートを示している。図示するよう
に，周波数値コードのビットデータＤ。−Ｄ，の入力が
、ピッチ周波数タイミングＣ　Ｌ　Ｋ　２の発生前に確
定され，ピッチ周波数タイミングＣＬＫ２の発生タイミ
ングでフリツプフロップＣＮＴ　２°−ＣＮＴ２３にセ
ットされ、基本タイミングＣＬＫＩによりカウントアッ
プが行われる。図示するように、周波数値コードのビツ
トデータＤ０〜Ｄ３として（０００１），が与えられた
場合、出力信号ＣＮＴＯＵＴの周期は基本タイミングＣ
ＬＫＩの８周期単位毎に出力される。セットされる周波
数値コードのビットデータ値により出力信号ＣＮＴＯＵ
Ｔの周期が変化する。FIG. 4 is a timing chart illustrating an example of the operation of the speed control circuit shown in FIG. This example shows a timing chart for the case where n=3. As shown in the figure, bit data D of the frequency value code. The input of -D is determined before the pitch frequency timing CLK2 occurs, is set in the flip-flop CNT2°-CNT23 at the pitch frequency timing CLK2, and is counted up at the basic timing CLKI. As shown in the figure, when (0001) is given as bit data D0 to D3 of the frequency value code, the period of the output signal CNTOUT is the basic timing C
It is output every 8 cycles of LKI. The output signal CNTOU is output according to the bit data value of the frequency value code that is set.
The period of T changes.

第５図は，波形記憶装置から数値化データを読出す読出
しアドレスを与えるアドレス制御回路の要部の構成を示
すブロック図である。第５図において、５１はアドレス
カウンタ，５２は正弦波波形の数値化データを記憶して
いるＲＯＭである。ＲＯＭ５２に与えるアドレスを発生
するアドレスカウンタ５１はｎ個のフリップフロップ５
１ａ〜５１ｎから構成されている。アドレスカウンタ５
１から発生するアドレスデータＡ０〜Ａ，は，ＲＯＭ５
２に順次に与えることにより、正弦波波形の数値化デー
タが順次に読み出される。アドレスカウンタ５１のｎ個
のフリップフロップ５１ａ〜５１ｎは、ピッチ周波数タ
イミングＣＬＫ２によってリセットされ、ＣＮＴＯＵＴ
信号をトリガーにして，順次にカウントアップ動作する
。正弦波波形の数値化データを記憶しているＲＯＭ５２
は，正弦波の位相角の値に対応するアドレスコードのア
ドレスに、当該位相角の正弦波の波高値の数値データを
記憶しており、アドレスカウンタ５１が．ＲＯＭの読出
しアドレスを順次アップさせ、読み出し制御を行うこと
により，位相角の進行に応じて、ＲＯＭの内容の正弦波
波形の数値化データが順次に読み出される。FIG. 5 is a block diagram showing the configuration of a main part of an address control circuit that provides a read address for reading digitized data from the waveform storage device. In FIG. 5, 51 is an address counter, and 52 is a ROM that stores numerical data of a sine wave waveform. An address counter 51 that generates an address to be given to the ROM 52 has n flip-flops 5.
It is composed of 1a to 51n. address counter 5
Address data A0 to A, generated from ROM 5
2, the numerical data of the sine wave waveform is sequentially read out. The n flip-flops 51a to 51n of the address counter 51 are reset by the pitch frequency timing CLK2, and the CNTOUT
The signal is used as a trigger to count up sequentially. ROM52 that stores numerical data of the sine wave waveform
stores the numerical data of the peak value of the sine wave at the phase angle in the address of the address code corresponding to the value of the phase angle of the sine wave, and the address counter 51 stores the value of the peak value of the sine wave at the phase angle. By sequentially increasing the read address of the ROM and performing read control, numerical data of the sinusoidal waveform of the contents of the ROM is sequentially read out as the phase angle progresses.

第６図は、入力アドレスと読出し出力データ値の対応関
係により数値データ化した正弦波波形のデータを記憶し
ているＲＯＭデータの一例を示す図である。第６図に示
すように、この正弦波波形のデータを記憶しているＲＯ
Ｍは，入力アドレス八〇〜Ａ４の２進コードの値を，正
弦波の位相角に対応させ、正弦波の位相角に対応したア
ドレスコードの記憶域に，当該位相角の正弦波の波高．
値のレベル値を数値データとして記憶する。これにより
、入力アドレスＡ０〜Ａ４のアドレスデータを順次カウ
ントアップしてＲＯＭから数値データを読み出すと、順
次に正弦波の位相角の進行に対応した波高値の数値デー
タが読み出され、読み出しデータとして、正弦波形の波
形データが出力されることになる．このように、本実施例の音声合成装置の主要部を構成す
る音声合成回路１０（第１図）の各々の要素の回路は，
簡単なデジタル回路のＲＯＭの読み出し回路で構成され
るため、音成合成装置は主要部を簡単なデジタル回路と
して、集積回路化が容易となる。また，音声合成は複合
正弦波法で行うので、音声合成のための音声データの量
子化を粗くしても、音声合成出力が不安定にならない音
声合成装置とすることができる。FIG. 6 is a diagram showing an example of ROM data storing sine wave waveform data converted into numerical data based on the correspondence between input addresses and read output data values. As shown in Figure 6, the RO which stores this sine wave waveform data
M associates the binary code values of input addresses 80 to A4 with the phase angle of the sine wave, and stores the wave height of the sine wave at the phase angle in the storage area of the address code corresponding to the phase angle of the sine wave.
Store the level value of the value as numerical data. As a result, when the address data of input addresses A0 to A4 are sequentially counted up and the numerical data is read from the ROM, the numerical data of the peak value corresponding to the progression of the phase angle of the sine wave is sequentially read out, and the numerical data is read out as the read data. , the waveform data of a sine waveform will be output. As described above, the circuits of each element of the speech synthesis circuit 10 (FIG. 1) that constitutes the main part of the speech synthesis device of this embodiment are as follows.
Since it is composed of a ROM readout circuit which is a simple digital circuit, the main part of the tone synthesis device can be made into a simple digital circuit, and it can be easily integrated into an integrated circuit. Furthermore, since speech synthesis is performed using the composite sine wave method, it is possible to provide a speech synthesis apparatus in which the speech synthesis output does not become unstable even if the quantization of speech data for speech synthesis is made coarse.

次に，本発明の他の実施例を説明する。以上に説明した
実施例においては，正弦波波形の数値化して記憶してい
る波形記憶装置は、正弦波発生器群１６の要素回路の各
々の正弦波発生器に含ませているが，これはＲＯＭ群１
５における共用データとして有するような構成でもよい
。Next, another embodiment of the present invention will be described. In the embodiment described above, the waveform storage device that stores the numeric value of the sine wave waveform is included in each sine wave generator of the element circuit of the sine wave generator group 16. ROM group 1
The configuration may be such that it is held as shared data in 5.

第７図は、本発明の他の一実施例にかかるデジタル回路
で構成した音声合成装置の要部のブロック図である．第
７図において、１０は音声合成回路、１．１は発振器、
１２はカウンタ、１３はアドレスカウンタ、１４はＲＯ
Ｍ群、１５は分局器群、１６は正弦波発生器群，１７は
デジタル／アナログ変換器群，１８は減衰器群，１９は
合成器である．これらは、第１図で説明したものと同様
なものある．また，７１はＲＡＭ部、７２はアドレスセ
レクタである。この実施例では．ＲＯＭ群１４に対する
制御データを与えるためにＲＡＭ部７１と、このＲＡＭ
部７１の与えるアドレスを選択するアドレスセレクタと
を、更に備える構成となっている。ＲＡＭ部７１には、
ＲＯＭ群１４と記憶されると音声データと同様な形式の
周波数値コードと振幅値コードとを組合せたコードデー
タが音声データが記憶され、ＲＯＭデータを変更する場
合の一部の代替データとして用いられる。また，これは
．ＲＯＭ群１４の要素の各々のＲＯＭを選択する選択デ
ータとして用いられる。アドレスセレクタ７２は、Ｒ．
　Ａ　Ｍ部７１に与えるアドレスの供給先を外部からの
アドレスとするか，また、アドレスカウンタ１３からの
アドレスとするかを選択する．このように、周波数値と
振幅値をコード化して記憶する記憶装置としても利用可
能なＲＡＭ部７１と、ＲＡＭ部７１に与えるアドレスを
制御するアドレスセレクタ７２を追加した構成とするこ
とにより、より広い応用が可能な音声合成装置となる。FIG. 7 is a block diagram of the main parts of a speech synthesizer constructed from digital circuits according to another embodiment of the present invention. In FIG. 7, 10 is a speech synthesis circuit, 1.1 is an oscillator,
12 is a counter, 13 is an address counter, 14 is RO
M group, 15 is a splitter group, 16 is a sine wave generator group, 17 is a digital/analog converter group, 18 is an attenuator group, and 19 is a combiner. These are similar to those explained in Figure 1. Further, 71 is a RAM section, and 72 is an address selector. In this example. In order to provide control data to the ROM group 14, a RAM section 71 and this RAM are provided.
The configuration further includes an address selector for selecting an address provided by the section 71. In the RAM section 71,
When stored in the ROM group 14, the audio data is stored as code data that is a combination of a frequency value code and an amplitude value code in the same format as the audio data, and is used as some alternative data when changing the ROM data. . Also, this. It is used as selection data for selecting each ROM of the elements of the ROM group 14. Address selector 72 is R.
Select whether the address to be supplied to the AM section 71 is an external address or an address from the address counter 13. In this way, by adding the RAM section 71, which can also be used as a storage device for encoding and storing frequency values and amplitude values, and the address selector 72, which controls the address given to the RAM section 71, it is possible to use a wider area. It becomes a speech synthesis device that can be applied.

以上、本発明を実施例にもとづき具体的に説明したが、
本発明は、前記実施例に限定されるものではなく、その
要旨を逸脱しない範囲において種々変更可能であること
は言うまでもない。The present invention has been specifically explained above based on examples, but
It goes without saying that the present invention is not limited to the embodiments described above, and can be modified in various ways without departing from the spirit thereof.

〔Effect of the invention〕

以上、説明したように、本発明によれば、音成合成装置
は主要部が簡単なデジタル回路で構成可能となり、集積
回路化が容易となる。また、音声合成は複合正弦波法で
行うので，音声合成のための音声データの量子化を粗く
しても、音声合成呂力が不安定にならない音声合成装置
とすることができる。As described above, according to the present invention, the main part of the sound synthesis device can be configured with a simple digital circuit, and it can be easily integrated into an integrated circuit. Furthermore, since speech synthesis is performed using the composite sine wave method, it is possible to provide a speech synthesis apparatus that does not become unstable in its speech synthesis performance even if the quantization of speech data for speech synthesis is made coarse.

[Brief explanation of drawings]

第１図は、本発明の一実施例にががるデジタル回路で構
成した音声合成装置の要部のブロック図、第２図は，音
声データの周波数値コードと振幅値コード化して組合せ
たコードデータの一例を示す図、第３図は，正弦波波形を数値データとして記憶している
波形記憶装置の読出し速度制御を音声データの周波数値
コードＦ。により行う速度制御回路の一例を示すブロッ
ク回路図、第４図は、第３図の速度制御回路の動作例を説明するタ
イミングチャート，第５図は、波形記憶装置から数値化データを読出す読出
しアドレスを与えるアドレス制御回路の要部の構成を示
すブロック図，第６図は、入力アドレスと読出し出力データ値の対応関
係により数値データ化した正弦波波形のデータを記憶し
ているＲＯＭデータの一例を示す図、第７図は、本発明の他の一実施例にかかるデジタル回路
で構成した音声合成装置の要部のブロック図である．図中．　１０・・・音声合成回路、１１・・・発振器，
１２・・・カウンタ、１３・・・アドレスカウンタ、１
４・・・Ｒ．　Ｏ　Ｍ群、１５・・・分周器群、１６・
・・正弦波発生器群、１７・・・デジタル／アナログ変
換器群、１８・・・減衰器群，１９・・・合成器、７１
・・・Ｒ　Ａ．　Ｍ部、７２・・・アドレスセレクタ。Fig. 1 is a block diagram of the main parts of a speech synthesis device configured with a digital circuit according to an embodiment of the present invention, and Fig. 2 is a code obtained by converting audio data into frequency value code and amplitude value code and combining them. A diagram showing an example of data. FIG. 3 shows a frequency value code F of audio data for controlling the read speed of a waveform storage device that stores a sine wave waveform as numerical data. FIG. 4 is a timing chart illustrating an example of the operation of the speed control circuit shown in FIG. 3. FIG. A block diagram showing the configuration of the main part of an address control circuit that provides an address. Figure 6 is an example of ROM data that stores sine wave waveform data converted into numerical data based on the correspondence between input addresses and read output data values. FIG. 7 is a block diagram of the main parts of a speech synthesis device configured with digital circuits according to another embodiment of the present invention. In the figure. 10... Speech synthesis circuit, 11... Oscillator,
12...Counter, 13...Address counter, 1
4...R. OM group, 15... Frequency divider group, 16...
...Sine wave generator group, 17...Digital/analog converter group, 18...Attenuator group, 19...Synthesizer, 71
...RA. M section, 72...address selector.

Claims

[Claims] 1. Modeling the voice using a combination of a plurality of different sine waves,
The audio data storage device includes an audio data storage device that encodes and stores each modeled sine wave using a combination of a frequency value code and an amplitude value code, and a waveform storage device that digitizes and stores the waveform of the sine wave. The reading speed is determined by the output of the frequency value code read from the device, the numerical data of the sine wave waveform is read out from the waveform storage device, the read numerical data is converted into an analog signal, and the amplitude value code is read out from the audio data storage device. The analog signal is outputted through an amplifier whose gain is controlled by the output of the sine wave to generate sine waves having different frequencies and amplitudes, and the sound is synthesized by the sum of the plurality of generated sine waves. Synthesis method. 2. An audio data storage device that encodes and stores each sine wave of the audio component using a combination of a frequency value code and an amplitude value code, a waveform storage device that digitizes and stores the waveform of the sine wave, and an audio data storage device. a readout control means that determines the readout speed and controls the readout based on the output of the frequency value code read out from the device; a digital/analog converter that converts the readout output from the waveform storage device into an analog signal; and a digital/analog converter that converts the readout output from the waveform storage device into an analog signal; and a variable gain amplifier that controls gain by outputting an amplitude value code. 3. A waveform storage device that digitizes and stores the waveform of a sine wave stores numerical data of the peak value of the sine wave at the phase angle at the address of the address code corresponding to the value of the phase angle of the sine wave. The speech synthesis device according to claim 2, characterized in that: