JPS5949600B2

JPS5949600B2 - Speech synthesis device for melody sound synthesis

Info

Publication number: JPS5949600B2
Application number: JP55177437A
Authority: JP
Inventors: 稔黒田; 博糸山
Original assignee: Matsushita Electric Works Ltd
Current assignee: Panasonic Electric Works Co Ltd
Priority date: 1980-12-15
Filing date: 1980-12-15
Publication date: 1984-12-04
Also published as: JPS57100499A

Description

【発明の詳細な説明】本発明は、時計用、警報用、インタブオン用等の各種用
途に応じた音声メツセージを記憶している交換可能なコ
ントロールＩＣに接続して使用され、各種の音声メツセ
ージのほかにインタブオン用のチヤイム音や時計用のオ
ルゴール音のような各種のメロデイ音を人間の音声メツ
セージと同時に再生したり、あるいは音階の異なる複数
個の単音階を同時に再生して和音の合成を行なつたりす
ることのできるメロデイ音合成兼用の音声合成装置に関
するものである。DETAILED DESCRIPTION OF THE INVENTION The present invention is used by being connected to a replaceable control IC that stores voice messages for various purposes such as for clocks, alarms, and in-tab-on applications. In addition, it is possible to play various melodic sounds such as chime sounds for intab-ons and music box sounds for clocks at the same time as human voice messages, or to synthesize chords by playing multiple monotone scales with different scales at the same time. The present invention relates to a speech synthesizer that can also be used to synthesize melody sounds.

一般に音声の特徴を表わすパラメータには、音の大小を
表わす振幅パラメータと、音の高低すなわち基本周期を
表わすピツチパラメータと、音の音色、すなわちスペク
トル分布を表わすスベクトルパラメータとがある。In general, parameters representing the characteristics of a sound include an amplitude parameter representing the magnitude of the sound, a pitch parameter representing the pitch of the sound, that is, the fundamental period, and a vector parameter representing the timbre of the sound, that is, the spectral distribution.

このような各種パラメータは音声の特徴を表わすもので
あるために特徴パラメータと総称されるが、通常音声信
号は１０ｍｓｅｃ乃至３０ｍｓｅｃの短期間においてほ
ぼ定常信号とみなすことができるので、従来、この期間
を１フレーム（データ更新間隔）として１フレームから
１組の特徴パラメータを抽出し、１フレーム毎にデータ
を更新するようにした音声合成装置が開発されている。
ところでかかる音声合成装置においては、元の音声信号
から１組の特徴パラメータを抽出する際には、まずスペ
クトルに関する特徴パラメータを抽出した後、残つた波
形を規格化して振幅に関する特徴パラメータを抽出し、
最後に残つた残差波形を自己相関器に入力して基本周期
に関する特徴パラメータを抽出するという過程を採つて
いる。この原理から明らかなように、音声分析系におい
ては１つのフレームから同時に複数個の基本周期を抽出
することはできず、したがつて１フレーム内において抽
出できるのは１つの基本周期と、その基本周期に対応す
る１組の振幅およびスベクトルパラメータのみであり、
何種類もの音が重なり合つた複合音を分析することはで
きない。このため音声分析系において分析抽出した特徴
パラメータを再合成する音声合成系においても１フレー
ム内においては１種類の音しか再生できないという欠点
があつた。勿論このような音声合成装置を人間の音声メ
ツセージの合成にのみ用いるのであれば、上述のように
１フレーム内において再生し得る音が１種類だけであつ
たとしても何ら差し支えないものであるが、音声以外の
擬音、例えばインタブオン用のチヤイム音や時計用のオ
ルゴール音のような各種のメロデイ音をも合成するよう
な場合には、音階の異なる何種類もの単音を組み合わせ
て和音を構成したい場合があり、また音声認識機能をも
有する高度な家電用の機器においては人間の音声メツセ
ージとメロデイ音とを同時に再生したいような場合もあ
り、１フレーム内において１種類の音しか合成できない
従来の音声合成装置ではこのようなメロデイ音の合成を
も得なう用途には不光分であるという欠点があつた。These various parameters are collectively referred to as feature parameters because they represent the characteristics of the voice, but since a voice signal can normally be regarded as a nearly stationary signal over a short period of 10 msec to 30 msec, conventionally this period has been A speech synthesis device has been developed that extracts one set of feature parameters from one frame (data update interval) and updates the data for each frame.
By the way, in such a speech synthesis device, when extracting a set of feature parameters from an original speech signal, first, the feature parameters related to the spectrum are extracted, and then the remaining waveform is normalized to extract the feature parameters related to the amplitude.
A process is adopted in which the residual waveform that remains at the end is input to an autocorrelator to extract characteristic parameters related to the fundamental period. As is clear from this principle, in a speech analysis system, it is not possible to extract multiple fundamental periods from one frame at the same time, and therefore only one fundamental period and its fundamental period can be extracted within one frame. only one set of amplitude and svector parameters corresponding to the period,
It is not possible to analyze complex sounds that are made up of many different sounds. Therefore, even in a speech synthesis system that resynthesizes the feature parameters analyzed and extracted in a speech analysis system, there is a drawback that only one type of sound can be reproduced within one frame. Of course, if such a speech synthesis device is used only for synthesizing human speech messages, there is no problem even if only one type of sound can be reproduced within one frame as described above. When synthesizing onomatopoeic sounds other than vocal sounds, such as chime sounds for intab-on and various melody sounds such as music box sounds for clocks, you may want to compose a chord by combining several types of single notes with different scales. In addition, in advanced home appliances that also have voice recognition functions, there are cases where it is desired to simultaneously play a human voice message and a melody sound, so conventional voice synthesis, which can only synthesize one type of sound in one frame, The device had the disadvantage of being non-luminous, which is not suitable for applications that require the synthesis of such melodic sounds.

そこで本発明者等は従来、擬似的に和音の合成を行なう
ために、音声分析系において別々に抽出した複数個の単
音の特徴パラメータを音声合成系の１フレーム内におい
て交互に切り換えながら再生することを試みたものであ
るが、このような方法を用いた場合、再生される音階は
和音というよりはむしろ各単音の中間的な音階を有する
１つの単音のように聞こえてしまい、本来の和音の持つ
美しい響きを得ることはできなかつた。Therefore, in order to perform pseudo-chord synthesis, the present inventors have conventionally developed a method in which feature parameters of a plurality of single notes extracted separately in a speech analysis system are played back while being alternately switched within one frame of the speech synthesis system. However, when such a method is used, the reproduced scale sounds less like a chord and more like a single note with intermediate scales between each note, making it sound less like the original chord. I couldn't get the beautiful sound that it had.

また同じような方法を用いて人間の音声メツセージとメ
ロデイ音とを交互に切り換えながら再生した場合には一
応はメロデイ音と音声メツセージとが同時に聞きとれは
するものの再生音の音質は非常に聞き苦しく、到底実用
には供し得ないという欠点があつた。本発明は従来例の
このような欠点を解消するために為されたものであり、
音声分析系において別別に分析抽出した何種類もの音を
音声合成系において全く同時に再生することにより、純
粋な和音の持つ美しい響きを再現したり、あるいは人間
の音声メツセージをメロデイ音と同時に報知することの
できるメロデイ音合成兼用の音声合成装置を提供するこ
とを目的とするものである。If a similar method is used to alternately switch between human voice messages and melody sounds, the melody sounds and voice messages can be heard at the same time, but the quality of the reproduced sound is extremely difficult to hear. However, it had the drawback that it could not be put to practical use. The present invention has been made in order to eliminate such drawbacks of the conventional example,
By reproducing multiple types of sounds that have been separately analyzed and extracted in the speech analysis system at the same time in the speech synthesis system, it is possible to reproduce the beautiful sound of pure chords, or to broadcast human voice messages at the same time as melody sounds. It is an object of the present invention to provide a speech synthesis device which can also be used for melody speech synthesis.

以下本発明の構成を図示実施例について説明すると、第
３図及び第４図に示すように音声またはメロデイ音のよ
うな可聴音信号を構成する振幅、基本周期、およびスペ
クトルに関する各特徴パラメータのうち、基本周期に関
する特徴パラメータを一時記憶しておくラツチメモリ１
３ａ，１３ｂを複数個設け、各ラツチメモリ１３ａ，１
３ｂの出力を順次音源形成手段３０に切換接続するマル
チプレクサ３４を設け、上記音源形成手段３０の出力を
振幅およびスペクトルに関する各特徴パラメータにて夫
々制御されるデジタルフイルタ２３に通過せしめること
により上記可聴音信号を再合成するようにして成るメロ
デイ音合成兼用の音声合成装置において、デジタルフイ
ルタ２３の出力に上記マルチプレクサ３４と同期して動
作するデマルチプレクサ３５を接続し、デマルチプレク
サ３５の各出力にデータラツチメモリ３６ａ，３６ｂを
介して複数個のＤＡコンバータ３７ａ，３７ｂを接続し
、各ＤＡコンバータ３７ａ，３７ｂのうちデマルチプレ
クサ３５によつて最後にデジタルフイルタ２３に接続さ
れるＤＡコンバータ３７ｂの出力をミキシング回路３８
に直接入力し、他のＤＡコンバータ３７ａの出力を夫々
遅延回路３９を介してミキシング回路３８に入力し、各
遅延回路３９の遅延時間を全ＤＡコンバータ３６ａ，３
７ｂの出力が同時にミキシング回路３８に入力されるよ
うに設定したものである。The configuration of the present invention will be described below with reference to the illustrated embodiment. As shown in FIGS. , a latch memory 1 that temporarily stores characteristic parameters related to the fundamental period.
A plurality of latch memories 13a, 13b are provided, and each latch memory 13a, 1
A multiplexer 34 is provided to sequentially switch and connect the output of the sound source forming means 30 to the sound source forming means 30, and the output of the sound source forming means 30 is passed through the digital filter 23 which is controlled by each characteristic parameter related to amplitude and spectrum, thereby producing the audible sound. In a speech synthesis device that also serves as melody sound synthesis and is configured to resynthesize signals, a demultiplexer 35 that operates in synchronization with the multiplexer 34 is connected to the output of the digital filter 23, and a data latch is connected to each output of the demultiplexer 35. A mixing circuit connects a plurality of DA converters 37a, 37b via memories 36a, 36b, and outputs the output of the DA converter 37b, which is connected to the digital filter 23 at the end by a demultiplexer 35 among the DA converters 37a, 37b. 38
The outputs of the other DA converters 37a are input to the mixing circuit 38 via the respective delay circuits 39, and the delay time of each delay circuit 39 is adjusted to all the DA converters 36a, 3.
7b is set to be input to the mixing circuit 38 at the same time.

第３図の実施例は和音合成用のものであり、再生音の基
本周期はラツチメモリ１３ａとラツチメモリ１３ｂとに
記憶されたピツチパラメータに従つて変えることができ
るが、振幅やスペクトル包絡を決定するためのパラメー
タはパラメータスタツク２２ａに１組記憶されているだ
けであるので、音色や音量は全く等しく音階のみが異な
る２つの単音を同時に再生して和音を合成することがで
きるようになつている。一方第４図に示す実施例は人間
の音声メツセージとメロデイ音とを同時に再生できるよ
うにした実施例を示しており、同実施例においては基本
周期に関するピツチパラメータが切り換わるとそれに対
応して振幅およびスペクトルに関するパラメータもパラ
メータスタツク２２ａ，２２ｂの切り換えにより変更し
得るようになつており、したがつて全く異種類の音を同
時に再生できるようになつているものである。第４図の
実施例は和音の合成用としても用いることができるもの
であり、この場合には基本周期を記憶するラツチメモリ
１３ａ，１３ｂの数が２個であるから、第３図の実施例
と同様に相異なる２個の単音よりなる和音を合成するこ
とになる。もちろん基本周期記憶用のラツチメモリ１３
をさらに多数個設ければ、ラツチメモリ１３の数だけ音
程の異なるメロデイ音を同時に再生して多数の単音の集
合よりなる変化に富んだ和音を得ることができるもので
ある。第３図および第４図の実施例においてはデジタル
フイルタ２３の出力側に基本周期切換用のマルチプレク
サ３４と同期して動作するデマルチプレクサ３５が接続
されており、デマルチプレクサ３５の各出力にはデータ
ラツチメモリ３６ａ，３６ｂが夫々接続されており、第
５図ａに示すようにデマルチプレクサ３５の一方の出力
３５ａからデータ１が得られるとデータラツチメモリ３
６ａにより次のデータ１が得られるまでそのデータが保
持される。The embodiment shown in FIG. 3 is for chord synthesis, and the fundamental period of the reproduced sound can be changed according to pitch parameters stored in latch memory 13a and latch memory 13b. Since only one set of parameters is stored in the parameter stack 22a, it is possible to synthesize a chord by simultaneously playing back two single notes that have exactly the same timbre and volume but differ only in scale. On the other hand, the embodiment shown in FIG. 4 shows an embodiment in which a human voice message and a melody sound can be played simultaneously. In this embodiment, when the pitch parameter related to the fundamental period is switched, the amplitude is changed accordingly. Parameters related to spectra can also be changed by switching the parameter stacks 22a and 22b, making it possible to simultaneously reproduce completely different types of sounds. The embodiment shown in FIG. 4 can also be used for chord synthesis; in this case, the number of latch memories 13a and 13b for storing the fundamental period is two, so it is different from the embodiment shown in FIG. Similarly, a chord consisting of two different single notes is synthesized. Of course, the latch memory 13 for basic period storage.
If a larger number of melody tones are provided, it is possible to simultaneously reproduce melody tones with different pitches equal to the number of latch memories 13, thereby obtaining a richly varied chord consisting of a set of a large number of single tones. In the embodiments shown in FIGS. 3 and 4, a demultiplexer 35 that operates in synchronization with a multiplexer 34 for switching the fundamental period is connected to the output side of the digital filter 23, and each output of the demultiplexer 35 has data. Latch memories 36a and 36b are connected to each other, and when data 1 is obtained from one output 35a of the demultiplexer 35 as shown in FIG.
6a, the data is held until the next data 1 is obtained.

したがつてデマルチプレクサ３５がデータ１を出力する
と直ちに次の音源信号が音源形成手段３０から出力され
、デジタルフイルタ２３における演算によつてデータ２
が計算され、デマルチプレクサ３５の他方の出力３５ｂ
からデータラツチメモリ３６ｂにデータ２が入力される
。この場合にもデータ２はデータラツチメモリ３６ｂに
より次のデータ２が得られるまで保持されるから、デー
タ２が保持されると直ちに次の音源信号が音源形成手段
３０から出力され、デジタルフイルタ２３による演算を
経て再びデータ１が得られる。以上の過程によつて第５
図ａに示すようにデータ１とデータ２とが得られるもの
であるが、これらのタイミングはデマルチプレクサ３５
の半サイクル分だけずれているので、第５図ｂに示すよ
うにデータ１を半サイクルだけ遅延せしめる。このよう
にすればミキシング回路３８において合成される２種類
の音は従来のように擬似的に高速度で切り換わつて再生
されるのではなくて完全に連続的な２種類の音が全く同
時に再生されることになり、純粋な和音を合成したり、
あるいは人間の音声メツセージとメロデイ音とを全く同
時に再生したりすることができるようになつている。第
３図および第４図に示す実施例はまた、人間の音声メツ
セージの合成のみを行なつたり、あるいは単音のみから
なるメロデイ音の合成のみを行なつたりすることももち
ろんできるようになつており、この場合には本来のラツ
チメモリ１３ａと本来のパラメータスタツク２２ａのみ
を動作させ、他の補助的なラツチメモリ１３ｂやパラメ
ータスタツク２２ｂは使用しないようにする。Therefore, as soon as the demultiplexer 35 outputs data 1, the next sound source signal is output from the sound source forming means 30, and the digital filter 23 calculates data 2.
is calculated, and the other output 35b of the demultiplexer 35
Data 2 is input to the data latch memory 36b from the data latch memory 36b. In this case as well, data 2 is held by the data latch memory 36b until the next data 2 is obtained, so as soon as data 2 is held, the next sound source signal is output from the sound source forming means 30, and the digital filter 23 outputs the next sound source signal. After the calculation, data 1 is obtained again. Through the above process, the fifth
As shown in Figure a, data 1 and data 2 are obtained, but these timings are determined by the demultiplexer 35.
Since there is a shift of half a cycle, data 1 is delayed by a half cycle as shown in FIG. 5b. In this way, the two types of sounds synthesized in the mixing circuit 38 are not reproduced by pseudo-switching at high speed as in the past, but are completely continuous two types of sounds at the same time. It will be played, synthesize pure chords,
Or, it has become possible to reproduce a human voice message and a melody sound at exactly the same time. The embodiments shown in FIGS. 3 and 4 are of course also capable of synthesizing only human voice messages, or only synthesizing melody sounds consisting of only single tones. In this case, only the original latch memory 13a and the original parameter stack 22a are operated, and the other auxiliary latch memory 13b and parameter stack 22b are not used.

このような制御は音声合成用ＬＳＩの入力端子１に接続
されたパラメータコード検出回路２８にて行なわれるも
のである。第３図の実施例において３ａはデータ読込用
のシフトレジスタであり、音声合成用ＬＳＩと組合せて
使用される交換可能な時計用、インタブオン用、警報器
用などの各種の制御用ＬＳＩ３ｌと接続され、制御用Ｌ
ＳＩ３ｌのデータ記憶部３２から音声またはメロデイ音
の振幅、基本周期、ならびにスペクトルに関する１組の
特徴パラメータを１フレーム毎に直列に入力し、その後
データをラツチメモリ１３ａ，１３ｂ等に転送するとい
う構成を採ることにより、音声合成用ＬＳＩとその制御
用ＬＳＩ３ｌとの接続を容易にしているものである。Such control is performed by the parameter code detection circuit 28 connected to the input terminal 1 of the speech synthesis LSI. In the embodiment shown in FIG. 3, 3a is a shift register for reading data, and is connected to various control LSIs 3l for use in combination with the voice synthesis LSI, such as for clocks, interrupt-ons, alarms, etc. Control L
A configuration is adopted in which a set of characteristic parameters relating to the amplitude, fundamental period, and spectrum of a voice or melody sound are serially input for each frame from the data storage unit 32 of the SI 3l, and then the data is transferred to the latch memories 13a, 13b, etc. This facilitates the connection between the speech synthesis LSI and its control LSI 3l.

本実施例においてはかかるデータ読込用のシフトレジス
タ３ａにレジスタ切換回路２を介してサブシフトレジス
タ３ｂを並列に接続しており、複合音の合成時には入力
端子１に接続されたパラメータコード検出回路２８がレ
ジスタ切換回路２を動作させ、本来のピツチパラメータ
の他に複合音の合成時に必要とされるサブピツチパラメ
ータを、本来のシフトレジスタ３ａと並列に接続された
サブシフトレジスタ３ｂに直列に入力するようにしてい
るものである。したがつて人間の音声メツセージの合成
や単音階のみよりなるメロデイ音の合成のみを行なつて
いる場合には、サブシフトレジスタ３ｂも補助的なラツ
チメモリ１３ｂも共に作動しておらず、本来のシフトレ
ジスタ３ａのみが作動して１組の特徴パラメータを読み
込み、これをラツチメモリ１３ａに転送しているもので
あるが、和音のような複合音を合成する際には基本周期
に関するピツチパラメータだけは複数個読み込むように
しているものである。以下本発明の音声合成装置の全体
構成について更に詳述する。In this embodiment, a sub-shift register 3b is connected in parallel to the data reading shift register 3a via a register switching circuit 2, and a parameter code detection circuit 28 connected to the input terminal 1 when synthesizing a complex sound. operates the register switching circuit 2, and in addition to the original pitch parameter, sub-pitch parameters required for compound sound synthesis are input in series to the sub-shift register 3b connected in parallel to the original shift register 3a. That's what I do. Therefore, when synthesizing human voice messages or synthesizing melody sounds consisting only of monophonic scales, neither the sub-shift register 3b nor the auxiliary latch memory 13b is in operation, and the original shift is not performed. Only the register 3a operates to read a set of feature parameters and transfer them to the latch memory 13a, but when synthesizing a complex sound such as a chord, only the pitch parameters related to the fundamental period need to be set in multiple numbers. This is what I am trying to load. The overall configuration of the speech synthesis device of the present invention will be described in further detail below.

第３図において１２は補間計算回路であり、１フレーム
毎にデータの更新を行なう際に各フレーム間の接続点に
おいて特徴パラメータが不連続的に変化すると音声信号
に歪みを生じて明瞭度が低下しやすいのでデータ更新の
際に特徴パラメータがスムーズに変化するように１フレ
ーム内の８点において近似的に直線的補間を行なつてい
るものである。もつともメロデイ音を合成する際には、
合成音にアクセントをつけて歯切れの良い音を出すため
に、パラメータコード検出回路２８と補間制御信号発生
回路３３の動作によりかかる補間計算を停止するように
している。音声およびメロデイ音の特徴パラメータは入
力端子１に接続された制御用ＬＳＩ３ｌのデータ記憶部
３２からレジスタ切換回路２を介してシフトレジスタ３
ａに直列に記憶されるものである。ところで、このよう
にしてシフトレジスタ３ａに読み込まれたデータは特徴
パラメータを表わすものではあるが、特徴パラメータそ
のものではなく、特徴パラメータを記憶している再生用
ＲＯＭｌＯのアドレス信号である。In Fig. 3, 12 is an interpolation calculation circuit, and when the data is updated every frame, if the characteristic parameters change discontinuously at the connection point between each frame, the audio signal will be distorted and the clarity will deteriorate. Since it is easy to do so, approximately linear interpolation is performed at eight points within one frame so that the feature parameters change smoothly when updating data. Of course, when synthesizing melody sounds,
In order to accentuate the synthesized sound and produce a crisp sound, the interpolation calculation is stopped by the operation of the parameter code detection circuit 28 and the interpolation control signal generation circuit 33. The characteristic parameters of the voice and melody sound are transferred from the data storage section 32 of the control LSI 3l connected to the input terminal 1 to the shift register 3 via the register switching circuit 2.
A is stored in series. By the way, although the data read into the shift register 3a in this manner represents the characteristic parameters, they are not the characteristic parameters themselves, but are address signals of the reproduction ROM10 that stores the characteristic parameters.

しかもそのアドレス信号は再生用ＲＯＭＩＯの中の相対
アドレスを示すにすぎない。したがつて、読み込んだデ
ータから実際の特徴パラメータを再生するためにはイン
デツクスＲＯＭ５に記憶された先頭アドレスをアドレス
カウンタ４の働きによつて引き出して、この先頭アドレ
スを再生制御回路６から送出されるシフトクロツク４１
に従つてシフトレジスタ３ａから取り出される相対アド
レスに加算して絶対アドレスを作成し、この絶対アドレ
スによつて再生用ＲＯＭｌＯをアクセスし、再生用ＲＯ
ＭｌＯ内に記憶されている特徴パラメータを取り出す必
要がある。図中８は上記絶対アドレス計算用の加算回路
であり、７，９，１１はシリアルパラレル交換装置であ
る。ところで人間の音声あるいはメロデイ音のような可
聴音信号から、振幅およびスペクトルに関する特徴パラ
メータを抽出した後に残る残差波形は、白色雑音または
所定の基本周期を有するインパルス列となるものである
が、上記インパルス列を構成する個々のインパルス波形
は人間の音声を合成する場合と、メロデイ音を合成する
場合とでは若干異なつている。Moreover, the address signal merely indicates a relative address within the reproduction ROMIO. Therefore, in order to reproduce the actual feature parameters from the read data, the first address stored in the index ROM 5 is extracted by the function of the address counter 4, and this first address is sent out from the reproduction control circuit 6. shift clock 41
Accordingly, an absolute address is created by adding it to the relative address taken out from the shift register 3a, and the reproducing ROM1O is accessed using this absolute address, and the reproducing ROM1O is accessed.
It is necessary to retrieve the feature parameters stored in MIO. In the figure, 8 is an adder circuit for calculating the above-mentioned absolute address, and 7, 9, and 11 are serial-parallel switching devices. By the way, the residual waveform that remains after extracting characteristic parameters related to amplitude and spectrum from an audible sound signal such as a human voice or a melody sound is white noise or an impulse train having a predetermined fundamental period. The individual impulse waveforms constituting an impulse train are slightly different when synthesizing human speech and when synthesizing melody sounds.

そこで本発明の音声合成装置においては人間の音声を合
成する際に用いるインパルス波形を記憶せる音源ＲＯＭ
ｌ６ａと、メロデイ音を合成する際に用いるインパルス
波形を記憶せる音源ＲＯＭｌ６ｂとを別々に設けている
。かかる音源ＲＯＭｌ６ａ，ｌ６ｂは同一の音源ＲＯＭ
ｌ６の異なるエリアを用いて形成してもかまわない。音
源ＲＯＭｌ６にはアドレス順にインパルスの波形変化が
記憶されており、したがつて音源ＲＯＭｌ６のアドレス
カウンタが順次インクリメントされると、上記インパル
ス波形が再生されるようになつている。アドレスカウン
タ１５のデータがラツチメモリ１３に記憶されたピツチ
パラメータと一致したときには、一致回路１４が動作し
てアドレスカウンタ１５にりセツトパルスを送出する。
したがつてアドレスカウンタ１５のデータはＯから順次
ピツチパラメータの値までインクリメントされて行き、
ピツチパラメータの値に達すると再びＯに戻つて同じ動
作を繰り返す。このためかかるアドレスカウンタ１５に
て音源ＲＯＭｌ６をアクセントすると、所望の基本周期
を有するインパルス列が再生されるものである。本発明
においては原音をサンプリングするときに用いたサンプ
リング周期毎に１個ずつのデータを音源ＲＯＭｌ６のア
ドレス順に記憶せしめており、したがつて一方のアドレ
スカウンタ１５ａのインクリメントはサンプリング周期
毎に行ない、他方のアドレスカウンタ１５ｂのインクリ
メントはこれより１／２サンプリング周期だけ遅れたタ
イミングで同じくサンプリング周期毎に行なわれるもの
である。すなわち本実施例においては２つのアドレスカ
ウンタ１５ａ，１５ｂが全く独立に作動しており、この
ためにデジタルフイルタ２３を全く独立に時分割的に使
用できるようになつているものである。デジタルフイル
タ２３の入力側に接続された切換回路２０は音源制御回
路１８の制御の下に、有声音源１７と無声音源１９とを
切換えるものである。Therefore, in the speech synthesis device of the present invention, a sound source ROM that stores impulse waveforms used when synthesizing human speech is used.
16a and a sound source ROM 16b that can store impulse waveforms used when synthesizing melody sounds are provided separately. These sound source ROMs l6a and l6b are the same sound source ROM
It may be formed using different areas of l6. The sound source ROM 16 stores impulse waveform changes in address order, so that when the address counter of the sound source ROM 16 is sequentially incremented, the impulse waveform is reproduced. When the data in the address counter 15 matches the pitch parameter stored in the latch memory 13, the match circuit 14 operates and sends a set pulse to the address counter 15.
Therefore, the data in the address counter 15 is sequentially incremented from O to the value of the pitch parameter.
When the value of the pitch parameter is reached, it returns to O again and repeats the same operation. Therefore, when the address counter 15 accents the sound source ROM 16, an impulse train having a desired fundamental period is reproduced. In the present invention, one piece of data is stored in the address order of the sound source ROM 16 for each sampling period used when sampling the original sound, so one address counter 15a is incremented every sampling period, and the other The address counter 15b is incremented every sampling period at a timing delayed by 1/2 sampling period. That is, in this embodiment, the two address counters 15a and 15b operate completely independently, and therefore the digital filter 23 can be used completely independently in a time-sharing manner. A switching circuit 20 connected to the input side of the digital filter 23 switches between the voiced sound source 17 and the unvoiced sound source 19 under the control of the sound source control circuit 18.

有声音源１７は人間の声帯振動を模擬するものであり、
基本周期毎に繰り返すインパルス列を発生せしめるもの
である。また無声音源１９は声道中の乱気流によつて生
じる摩擦音を模擬するものであり、略一様なスペクトル
分布を有する白色雑音を発生せしめるものである。しか
して母音のように声帯の振動を伴う有声音を合成する際
には有声音源１７を、また子音のように声帯の振動を伴
わない無声音を合成する際には無声音源１９を夫々切換
回路２０を介してデジタルフイルタ２３に切換接続し、
該デジタルフイルタ２３にて振幅およびスペクトルに関
する情報を付加し、合成音をアンプ２４にて増幅し、ス
ピーカ２５より再生するものである。ところで本発明の
音声合成装置においては振幅を表わす振幅パラメータ（
Ａパラメータ）として５ビツト、基本周期を表わすピツ
チパラメータ（Ｐパラメータ）として６ビツトをそれぞ
れ割り当てており、またスペクトルを表わすスペクトル
パラメータとしては第１図に示すように音声信号の標本
値Ｘｔと、これよりＰ個離れた標本値Ｘｔ一ｐとの部分
自己相関係数（ＰＡＲＣＯＲ係数）Ｋｐを用いており、
Ｋ，，Ｋ２，・・・・・・，Ｋ９，ＫｌＯの各Ｋパラメ
ータにそれぞれ７，５，４，４，４，３，３，３，３の
ように量子化ピツトを割り当てている。The voiced sound source 17 simulates human vocal cord vibration,
This generates an impulse train that repeats every fundamental period. The unvoiced sound source 19 simulates fricative sounds caused by turbulence in the vocal tract, and generates white noise having a substantially uniform spectral distribution. The switching circuit 20 switches between the voiced sound source 17 when synthesizing a voiced sound that involves vocal cord vibration, such as a vowel, and the unvoiced sound source 19 when synthesizing an unvoiced sound that does not involve vocal cord vibration, such as a consonant. Switching connection to the digital filter 23 via
The digital filter 23 adds amplitude and spectrum information, the amplifier 24 amplifies the synthesized sound, and the speaker 25 reproduces the sound. By the way, in the speech synthesis device of the present invention, the amplitude parameter (
5 bits are assigned to the pitch parameter (P parameter) representing the fundamental period, and 6 bits are assigned to the pitch parameter (P parameter) representing the fundamental period.As shown in Figure 1, the spectral parameter representing the spectrum is the sample value Xt of the audio signal. The partial autocorrelation coefficient (PARCOR coefficient) Kp with the sample value Xt1p which is more than P points apart is used,
Quantization pits such as 7, 5, 4, 4, 4, 3, 3, 3, 3 are assigned to each K parameter of K, K2, .

このような各パラメータについてのビツト配分数はイン
デツクスＲＯＭ５に記憶されており、再生制御回路６は
該ビツト配分数に対応する数のシフトクロツク４１をシ
フトレジスタ３ａに送出し、シフトレジスタ３ａ内にＡ
，Ｐ，ＫｌＯ，Ｋ，，Ｋ８，・・・・・・，Ｋ２，Ｋ，
の順に直列に入力された各特徴パラメータをラツチメモ
リ１３ａやパラメータスタツク２２ａに送出するように
しているものである。ところで人間の音声を合成する際
には基本周期を表わすピツチパラメータは少数でよく、
むしろ振幅やスペクトルを表わすパラメータが多数必要
となるものであるが、メロデイ音を合成する際には基本
周期を表わすピツチパラメータの方が多数必要となる。
そこでこのような各種の用途に応じて各特徴パラメータ
のビツト配分数を変更せしめるためには第３図において
シフトレジスタ１３ａと並列に接続せるピツチパラメー
タ入力専用のサブシフトレジスタ３ｂをシフトレジスタ
３ａと同じ構成のものとして、すべての特徴パラメータ
を読み込めるような容量を予め持たせておく。このよう
にすれば、インデツクスＲＯＭ５内に記憶せるビツト配
分数を書き換えるだけでピツチパラメータのビツト配分
数を自由に変更することができるものである。なお本発
明においてはＡパラメータ、Ｐパラメータ、ならびにＫ
，Ｏ〜Ｋ，の各パラメータについて各種の演算をスムー
ズに行なうために、発振回路２Ｔとタイミング制御回路
２６を設けて第２図に示すような時間割り当て図に基づ
いてデータの読み込みや演算動作を行なつている。The number of bit allocations for each parameter is stored in the index ROM 5, and the reproduction control circuit 6 sends the number of shift clocks 41 corresponding to the number of bit allocations to the shift register 3a.
,P,KlO,K,,K8,...,K2,K,
Each characteristic parameter input in series in the order of 1 is sent to the latch memory 13a and the parameter stack 22a. By the way, when synthesizing human speech, only a small number of pitch parameters representing the fundamental period are required.
Rather, a large number of parameters representing amplitude and spectrum are required, but when synthesizing melody sounds, a larger number of pitch parameters representing the fundamental period are required.
Therefore, in order to change the number of bits allocated to each characteristic parameter according to such various uses, in FIG. The configuration has a capacity in advance that allows reading all feature parameters. In this way, the bit allocation number of the pitch parameter can be freely changed by simply rewriting the bit allocation number stored in the index ROM 5. In addition, in the present invention, A parameter, P parameter, and K
, OK to K, and perform various calculations smoothly, an oscillation circuit 2T and a timing control circuit 26 are provided to read data and perform calculation operations based on a time allocation diagram as shown in FIG. I am doing it.

データの読み込みは第２図に示すように各フレームを８
等分した補間区間Ｄｌ，Ｄ２・・・・・・Ｄ８のうち最
初の補間区間Ｄ１において行なわれるものである。各区
間Ｄ，〜Ｄ８は２５等分されてそれぞれＰ１〜Ｐ２５に
分割されている。Ａ，Ｐ，Ｋ，Ｏ，Ｋ９・・・・・・，
Ｋ，の各パラメータはすべて奇数番目のＰｌ，Ｐ３，Ｐ
５・・・・・・Ｐ２３において直列に配列されており、
Ｐ２，は予備のブランクである。また偶数番目のＰ２，
Ｐ４，Ｐ６・・・・・・，Ｐ２，は補間計算を行なうた
めのタイミングである。さらにＰ１〜Ｐ２５の各領域２
２等分されてＴ，，Ｔ２・・・・・・，Ｔ２２となる。
このうちＴ１〜Ｔ，はパラメータコード検出回路２８を
作動せしめるための制御信号区間であり、実際のデータ
はＴ６以串こ読み込まれる。次に第６図は併合発明の一
実施例を示すものである。Data is read by 8 times each frame as shown in Figure 2.
This is performed in the first interpolation section D1 of the equally divided interpolation sections Dl, D2, . . . D8. Each section D, to D8 is divided into 25 equal parts, and divided into P1 to P25, respectively. A, P, K, O, K9...
The parameters of K, are all odd-numbered Pl, P3, P
5...They are arranged in series at P23,
P2 is a spare blank. Also, the even number P2,
P4, P6..., P2 are timings for performing interpolation calculations. Furthermore, each area 2 of P1 to P25
It is divided into two equal parts and becomes T,, T2..., T22.
Of these, T1 to T are control signal sections for activating the parameter code detection circuit 28, and actual data is read from T6 onwards. Next, FIG. 6 shows an embodiment of the combined invention.

第６図の実施例にあつてはデジタルフイルタ２３の出力
がデマルチプレクサ３５を介さずにそのままＤＡコンバ
ータ３Ｔに入力されており、ＤＡコンバータ３？の出力
側にデマルチプレクサ３５が接続されている。すなわち
第３図および第４図に示す実施例にあつてはデマルチプ
レクサ３５をＤＡコンバータ３Ｔａ，３１ｂの前段に入
れているためにＤＡコンバータ３１ａ，３７ｂが２個必
要であり、またさらに多くの単音からなる和音を合成す
る際にはその数だけＤＡコンバータ３Ｔを増設する必要
があるが、デジタルフイルタ２３から出力されるデジタ
ルデータは元々時分割的に交互に出力されてくるもので
あるから、本併合発明においてはデジタルフイルタ２３
の出力に直接ＤＡコンバータ３Ｔを接続することにより
、複数種の音について１つのＤＡコンバータ３７を用い
て一括してＤＡ変換を行なつている。ＤＡコンバータ３
Ｔにてデジタル信号からアナログ信号に変換された時分
割的なデータはデマルチプレクサ３５にて別々の音に分
離され、遅延回路３９にて互いの音のタイミングを整合
されたのち、ミキシング回路３８によりミキシングされ
るものである。この場合第３図および第４図においてデ
マルチプレクサ３５の出力に接続されていたデータラツ
チメモリ３６ａ，３６ｂの代わりに、アナログ的なデー
タをホールドするデータ保持回路４０ａ，４０ｂが必要
となる。もつとも上述のようにデジタルフイルタ２３の
切換を行なうデマルチプレクサ３５は、原音のサンプリ
ング周期に等しい極めて短い時間を一周期として動作し
ているので、アナログデータをホールドするデータ保持
回路４０ａ，４０ｂは原音のサンプリング周期に等しい
時間だけデータを保持できるものであればよい。本併合
発明においても第３図および第４図に示す本発明の実施
例と同様にデマルチプレクサ３５から時間的に先に出て
くるデータについては遅延回路３９によつてミキシング
回路３８に入るタイミングを遅らせ、デマルチプレクサ
３５から後から出てくるデータはそのままミキシング回
路３８に入力することにより、２つ以上の音がスピーカ
２５から全く同時に再生されるようにしているものであ
る。なお第４図および第６図において主に人間の音声メ
ツセージの合成に用いられるラツチメモリ１３ａとパラ
メータスタツク２２ａとには補間計算回路１２が接続さ
れているが、その他のラツチメモリ１３ｂとパラメータ
スタツク２２ｂについては補間計算を行なう必要がない
ので補間計算回路１２は接続されていない。以上のよう
に本発明においては、基本周期に関する特徴パラメータ
を一時記憶しておくラツチメモリを複数個設け、各ラツ
チメモリの出力を順次音源形成手段に切換接続するマル
チプレクサを設け、上記音源形成手段の出力を振幅およ
びスペクトルに関する各特徴パラメータにて夫々制御さ
れるデジタルフイルタに通過せしめるようにした音声合
成装置において、デジタルフイルタの出力に上記マルチ
プレクサと同期して動作するデマルチプレクサを接続し
、デマルチプレクサの各出力にデータラツチメモリを介
して複数個のＤＡコンバータを接続し、各ＤＡコンバー
タのうちデマルチプレクサによつて最後にデジタルフイ
ルタに接続されるＤＡコンバータの出力をミキシング回
路に直接入力し、他のＤＡコンバータの出力を夫々遅延
回路を介してミキシング回路に入力し、各遅延回路の遅
延時間を全ＤＡコンバータの出力が同時にミキシング回
路に入力されるように設定したものであるので、従来例
のように単純に基本周期を交互に切り換えて擬似的に和
音を合成していた場合と比較すると、基本周期の異なる
複数個の箸が全く同時に、しかも連続的に再生されると
いう利点があり、純粋な和音の持つ美しい響きを再現す
ることができるという利点がある。In the embodiment shown in FIG. 6, the output of the digital filter 23 is directly input to the DA converter 3T without going through the demultiplexer 35, and the DA converter 3? A demultiplexer 35 is connected to the output side of the . That is, in the embodiment shown in FIGS. 3 and 4, since the demultiplexer 35 is placed before the DA converters 3Ta and 31b, two DA converters 31a and 37b are required, and more single tones are required. When synthesizing a chord consisting of In the merged invention, digital filter 23
By directly connecting the DA converter 3T to the output of the DA converter 37, one DA converter 37 is used to perform DA conversion on multiple types of sounds at once. DA converter 3
The time-division data converted from a digital signal to an analog signal at T is separated into separate sounds by a demultiplexer 35, and the timings of the sounds are matched by a delay circuit 39, and then by a mixing circuit 38. It is something that is mixed. In this case, data holding circuits 40a and 40b for holding analog data are required in place of the data latch memories 36a and 36b connected to the output of the demultiplexer 35 in FIGS. 3 and 4. Of course, as mentioned above, the demultiplexer 35 that switches the digital filter 23 operates with one cycle having an extremely short time equal to the sampling cycle of the original sound, so the data holding circuits 40a and 40b that hold analog data are Any device that can hold data for a period of time equal to the sampling period may be used. In the present combined invention, as in the embodiments of the present invention shown in FIGS. 3 and 4, the timing at which data that comes out earlier from the demultiplexer 35 enters the mixing circuit 38 is controlled by the delay circuit 39. By delaying the delay and inputting the data that comes out later from the demultiplexer 35 as it is to the mixing circuit 38, two or more sounds can be reproduced from the speakers 25 at exactly the same time. In FIGS. 4 and 6, the interpolation calculation circuit 12 is connected to the latch memory 13a and parameter stack 22a, which are mainly used for synthesizing human voice messages, but the other latch memories 13b and parameter stack 22b are The interpolation calculation circuit 12 is not connected because there is no need to perform interpolation calculations for these. As described above, in the present invention, a plurality of latch memories for temporarily storing characteristic parameters related to the fundamental period are provided, a multiplexer is provided to sequentially switch and connect the output of each latch memory to the sound source forming means, and the output of the sound source forming means is connected to the sound source forming means. In a speech synthesis device in which the signal passes through a digital filter that is controlled by characteristic parameters related to amplitude and spectrum, a demultiplexer that operates in synchronization with the multiplexer is connected to the output of the digital filter, and each output of the demultiplexer is connected to the output of the digital filter. A plurality of DA converters are connected to the DA converter via a data latch memory, and the output of the DA converter that is connected to the digital filter last by a demultiplexer is directly input to the mixing circuit, and the output of the DA converter is directly input to the mixing circuit, and the output of the DA converter that is connected to the digital filter at the end is inputted directly to the mixing circuit. The output of each DA converter is input to the mixing circuit via a delay circuit, and the delay time of each delay circuit is set so that the outputs of all DA converters are simultaneously input to the mixing circuit, so it is not as simple as the conventional example. Compared to pseudo-synthesizing chords by alternating the fundamental period, this has the advantage that multiple chopsticks with different fundamental periods can be played back at exactly the same time and continuously, making it possible to create pure chords. It has the advantage of being able to reproduce the beautiful sound that it has.

すなわち本発明においては、デマルチプレクサの出力に
データラツチメモリを接続することにより、順次時分割
的に切り換えながら出力される断続的な音を連続的な音
に変換することができ、しかも上記連続的な音に変換さ
れた複数個の音がミキシング回路に入るタイミングを遅
延回路によつて整合したから、音声分析系において別々
に分析抽出した複数個の音を完全に同時に再生すること
ができるという利点を有するものである。なお第４図の
実施例に示すように合成音の基本周期のみならず、合成
音の振幅やスペクトルをも基本周期の切換と同期して切
り換え得るように構成すれば、全く異種類の音、例えば
人間の音声メツセージとメロデイ音とを全く同時に再生
することができ、各種の家電用機器に応用すれば単なる
音声メツセージのみの報知よりも変化に富んだ再生音を
得ることができるものである。That is, in the present invention, by connecting a data latch memory to the output of the demultiplexer, it is possible to convert the intermittent sound that is output while sequentially switching in a time-division manner into a continuous sound. The advantage of this is that the timing at which multiple sounds that have been converted into sounds enter the mixing circuit is matched by a delay circuit, so multiple sounds that have been analyzed and extracted separately can be played back completely simultaneously in the audio analysis system. It has the following. As shown in the embodiment of FIG. 4, if the structure is configured so that not only the fundamental period of the synthesized sound but also the amplitude and spectrum of the synthesized sound can be switched in synchronization with the switching of the fundamental period, completely different kinds of sounds can be obtained. For example, it is possible to reproduce a human voice message and a melody sound at the same time, and if applied to various home appliances, it is possible to obtain a richer variety of reproduced sounds than when only voice messages are notified.

また本発明に対する併合発明においては、デジタルフイ
ルタの出力をデマルチプレクサを介さずにそのままＤＡ
コンバータに入力し、ＤＡコンバータの出力側にデマル
チプレクサを接続し、デマルチプレクサの各出力のうち
、最後にデジタルフイルタに接続される出力をデータ保
持回路を介してミキシング回路に直接入力し、デマルチ
プレクサの他の出力を夫々データ保持回路と遅延回路と
を介してミキシング回路に入力し、各遅延回路の遅延時
間をデマルチプレクサの全出力が同時にミキシング回路
に入力されるように設定したから、デジタルフイルタか
ら時分割的に出力される複数種の音について１つのＤＡ
コンバータを用いて一括してＤＡ変換を行なうことがで
き、したがつて特別に多数の音を同時に出力するような
場合にもＤＡコンバータの数が増えることがなく、回路
構成が著しく簡単になるという利点がある。In addition, in a combined invention of the present invention, the output of the digital filter is directly converted to DA without going through a demultiplexer.
input to the converter, connect a demultiplexer to the output side of the DA converter, and among the outputs of the demultiplexer, the output that is last connected to the digital filter is directly input to the mixing circuit via the data holding circuit, and the demultiplexer The other outputs of the demultiplexer are input to the mixing circuit via a data holding circuit and a delay circuit, respectively, and the delay time of each delay circuit is set so that all outputs of the demultiplexer are simultaneously input to the mixing circuit. One DA for multiple types of sounds output in a time-sharing manner from
It is possible to perform DA conversion all at once using a converter, so even if a large number of sounds are to be output at the same time, the number of DA converters will not increase, and the circuit configuration will be significantly simplified. There are advantages.

すなわち本併合発明はデジタルフイルタから出力される
データが元々時分割的に出力されてくるということを利
用して、デジタルフイルタの出力をデマルチプレクサに
通す前にＤＡコンバータに通すことにより、ＤＡコンバ
ータの数を節減し、もつて回路構成の単純化を図つたも
のである。なお本発明およびその併合発明は、いずれも
デジタルフイルタそのものについて言及するものではな
く、複合音の出力を得るためにデジタルフイルタの共用
化を図つたものである。In other words, the present combined invention takes advantage of the fact that the data output from the digital filter is originally output in a time-division manner, and passes the output of the digital filter through the DA converter before passing it through the demultiplexer. This is intended to reduce the number of circuits and simplify the circuit configuration. It should be noted that the present invention and its combined inventions do not refer to the digital filter itself, but are intended to share the digital filter in order to obtain a composite sound output.

したがつてデジタルフイルタとしてはどのような構成の
ものを用いてもかまわないが、通常のデジタルフイルタ
は単音声の合成を前提としているために１回のフイルタ
演算は１サンプリング周期で終了し、この期間内に１音
声データを出力するように設計されている。このため本
発明およびその併合発明のように１サンプリング周期内
で時分割的に２乃至それ以上の音声データを出力する場
合にはそれだけデジタルフイルタの演算動作に高速性を
要求されるものである。また、デジタルフイルタとして
いわゆるＰＡＲＣＯＲ型（部分自己相関係数型）のフイ
ルタを用いる場合には、１つ前のサンプリング周期にお
けるデータを記憶しておく必要があるので、２乃至それ
以上の音声データを得ようとする場合には次のフイルタ
演算が行なわれるまでの間データを遅延記憶させておく
遅延記憶手段を複合音を構成する各単音の数だけ設ける
ことが必要となるものである。Therefore, any configuration may be used as a digital filter, but since a normal digital filter is based on the synthesis of a single voice, one filter operation is completed in one sampling period, and this It is designed to output one piece of audio data within a period. Therefore, when two or more pieces of audio data are time-divisionally output within one sampling period as in the present invention and its combined invention, the digital filter is required to operate at a higher speed. Furthermore, when using a so-called PARCOR type (partial autocorrelation coefficient type) filter as a digital filter, it is necessary to store data in the previous sampling period, so two or more pieces of audio data can be stored. If this is desired, it is necessary to provide delay storage means for storing data in a delayed manner until the next filter operation is performed, corresponding to the number of individual sounds constituting the complex sound.

[Brief explanation of the drawing]

第１図は本発明に利用せるＰＡＲＣＯＲ型音声合成方式
の原理図、第２図は同上の時間割り当て図、第３図は本
発明の一実施例の全体プロツク図、第４図は同上の要部
プロツク図、第５図Ａ，ｂは同上の動作を示す動作説明
図、第６図は併．合発明の一実施例の要部プロツク図で
ある。１３はラツチメモリ、２３はデジタルフイルタ、３０は
音源形成手段、３４はマルチプレクサ、５はデマルチプ
レクサ、３６はデータラツチメリ、３ＴはＤＡコンバー
タ、３８はミキシング路、３９は遅延回路、４０はデー
タ保持回路でる。Fig. 1 is a principle diagram of the PARCOR type speech synthesis method used in the present invention, Fig. 2 is a time allocation diagram of the same as above, Fig. 3 is an overall block diagram of an embodiment of the present invention, and Fig. 4 is a diagram of the same as above. 5A and 5B are operation explanatory diagrams showing the same operation as above, and FIG. 6 is also a partial block diagram. FIG. 2 is a block diagram of a main part of an embodiment of the invention. 13 is a latch memory, 23 is a digital filter, 30 is a sound source forming means, 34 is a multiplexer, 5 is a demultiplexer, 36 is a data latch, 3T is a DA converter, 38 is a mixing path, 39 is a delay circuit, 40 is a data holding circuit Out.

Claims

[Claims] 1. A plurality of latch memories for temporarily storing characteristic parameters related to the fundamental period among characteristic parameters related to the amplitude, fundamental period, and spectrum constituting an audible sound signal such as voice or melody sound. A multiplexer is provided to sequentially switch and connect the output of each latch memory to the sound source forming means, and the output of the sound source forming means is passed through a digital filter controlled by each characteristic parameter related to amplitude and spectrum, thereby producing the audible sound. In a speech synthesis device that also serves as melody sound synthesis and is configured to resynthesize signals, a demultiplexer that operates in synchronization with the multiplexer is connected to the output of the digital filter, and a data latch memory is connected to each output of the demultiplexer. A plurality of DA converters are connected, and the output of the DA converter that is last connected to the digital filter by a demultiplexer is directly input to the mixing circuit, and the output of the other DA converters is input to the delay circuit, respectively. 1. A speech synthesis device for melody tone synthesis, characterized in that the delay time of each delay circuit is set so that the outputs of all DA converters are simultaneously input to the mixing circuit. 2. Provide a shift register into which a set of characteristic parameters relating to amplitude, fundamental period, and spectrum are input in series at each data update interval, and a plurality of sub-shift registers into which only characteristic parameters relating to the fundamental period are input in series, A transfer means that connects the input and output terminals of each shift register in parallel with each other via a register switching circuit, and transfers characteristic parameters related to different fundamental periods input to each shift register to each latch memory via the register switching circuit. 2. A speech synthesis device for both melody sound synthesis and melody sound synthesis as claimed in claim 1. 3. A shift register into which a set of feature parameters related to amplitude, fundamental period, and spectrum are input in series at each data update interval, and bit allocation of feature parameters related to at least the fundamental period among each feature parameter input to the shift register. An index ROM stores a number in advance, and a number of shift clocks corresponding to the number of bit allocations stored in the index ROM are generated to read the characteristic parameters related to the fundamental period from among the characteristic parameters in the shift register. 2. A voice synthesis device for melody tone synthesis and melody tone synthesis as claimed in claim 1, further comprising a playback control circuit for transferring data to a latch memory. 4 Parameter stacks for temporarily storing characteristic parameters related to amplitude and spectrum are provided in the same number as a plurality of latch memories temporarily storing characteristic parameters related to fundamental period, and a parameter stack is provided between each latch memory and the sound source forming means. The melody sound synthesis system as claimed in claim 1, characterized in that another multiplexer operating in synchronization with the multiplexer interposed in the digital filter is interposed between each parameter stack and the digital filter. Speech synthesizer. 5. A plurality of latch memories are provided to temporarily store characteristic parameters related to the fundamental period among the characteristic parameters related to the amplitude, fundamental period, and spectrum constituting an audible sound signal such as voice or melody sound, and each latch memory is A multiplexer is provided that sequentially switches and connects the output to the sound source forming means, and the output of the sound source forming means is passed through digital filters each controlled by characteristic parameters regarding amplitude and spectrum, thereby resynthesizing the audible sound signal. A demultiplexer that operates in synchronization with the multiplexer is connected to the output of a DA converter connected to the output of the digital filter, and the last of each output of the demultiplexer is The output connected to the digital filter is directly input to the mixing circuit via the data holding circuit, the other outputs of the demultiplexer are input to the mixing circuit via the data holding circuit and the delay circuit, respectively, and the delay of each delay circuit is 1. A speech synthesis device for melody sound synthesis, characterized in that the time is set so that all outputs of a demultiplexer are simultaneously input to a mixing circuit. 6 Parameter stacks for temporarily storing characteristic parameters related to amplitude and spectrum are provided in the same number as a plurality of latch memories temporarily storing characteristic parameters related to fundamental period, and a stack is provided between each latch memory and the sound source forming means. A melody sound synthesis system according to claim 5, characterized in that another multiplexer operating in synchronization with the multiplexer installed in the digital filter is interposed between each parameter stack and the digital filter. Speech synthesizer.