JPH0664477B2

JPH0664477B2 - Speech synthesizer

Info

Publication number: JPH0664477B2
Application number: JP55106856A
Authority: JP
Inventors: 豊安井; 雄二郎笹原; 文忠板倉; 茂樹嵯峨山; 昇管村
Original assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Current assignee: Fujitsu Ltd; Nippon Telegraph and Telephone Corp
Priority date: 1980-08-04
Filing date: 1980-08-04
Publication date: 1994-08-22
Anticipated expiration: 2009-08-22
Also published as: JPS5732499A

Description

[Detailed description of the invention] The present invention relates to a speech synthesizer that synthesizes speech data and outputs a speech signal.

Known methods for speech synthesizers parameterize the characteristics of speech: the PARCOR (partial autocorrelation) method, the LPC (linear predictive coding) method, the LSP (line spectrum pair) method, and others. Because these methods achieve a higher degree of data compression than waveform coding methods, which encode the speech waveform as PCM or the like, they reduce the capacity of the memory needed to store the speech data.

In speech synthesis, for example, one set of speech data (for example, 6 bytes) is converted by a predetermined procedure in every frame period; the converted values are interpolated, by linear interpolation or the like, to produce an output for every sample period, and that output is supplied to a digital filter as its coefficients; and a pulse train or white noise from a sound source unit is applied to the digital filter to synthesize speech. To synthesize speech that contains the same sound over a plurality of frames, that is, a so-called long sound or a signal tone, speech data covering all of those frames had to be prepared. In other words, synthesizing a long sound or a continuous signal tone required storing a correspondingly large amount of speech data.

An object of the present invention is to make it possible to synthesize speech containing a continuous, identical sound from one or more sets of speech data. Here, speech data means the characteristic portions of speech (pitch period, amplitude, characteristic parameters, and so on) converted into data. To make the configuration of the invention clear, it is described below with the reference numerals used in the drawings of the embodiment.

The invention is a speech synthesizer in which characteristic components of speech such as the pitch period, amplitude, and characteristic parameters are extracted and stored in an external memory (2) as speech data of a fixed byte length, and in which the speech data are read in at every specific period and synthesized into speech. The synthesizer comprises: conversion means (16) that receives the amplitude and characteristic-parameter data in the speech data read in at every specific period and converts them into digital filter coefficients; interpolation means (17) that, from the digital filter coefficients obtained by the conversion means (16) and the digital filter coefficients obtained in the immediately preceding period, generates interpolated digital filter coefficients at predetermined intervals within the specific period; digital filter means (19) that synthesizes speech by performing a predetermined filter operation using a pulse train generated from the pitch-period data in the speech data and the digital filter coefficients obtained from the interpolation means (17); and storage means (12) that stores control information, such as the start, stop, and repeat operation of speech synthesis, supplied from an external device (1). When information instructing the repeat operation within the control information is stored in the storage means (12), the conversion of speech data by the conversion means (16) and the interpolation by the interpolation means (17) are stopped, and the digital filter coefficients obtained by the interpolation means (17) immediately beforehand are supplied to the digital filter means (19), so that the same sound is synthesized. An embodiment is described in detail below.

FIG. 1 is a block diagram of speech synthesis performed under the control of a microprocessor or the like. 1 is a microprocessor; 2 is a memory storing the speech data; 3 is a speech synthesis circuit that sends status information to the microprocessor 1, operates according to control information from the microprocessor 1, and synthesizes speech by processing such as conversion and interpolation of the speech data from the memory 2; 4 is a crystal resonator for the master oscillation used to generate timing signals and the like; 5 is a DA converter; 6 is a low-pass filter; 7 is an amplifier; 8 and 10 are speakers; and 9 is a transformer.

Speech data are read from the memory 2 in response to an address signal from the microprocessor 1 and supplied to the speech synthesis circuit 3. Control information from the microprocessor 1 starts and stops speech synthesis, selects the frame period, and so on, and speech synthesis is executed on the basis of the speech data. When the digital serial output is used, the synthesized output is converted into an analog speech signal by the DA converter 5, passed through the low-pass filter 6, and amplified by the amplifier 7 to drive the speaker 8. Because the speech synthesis circuit 3 also contains a simplified DA converter, it can alternatively drive the speaker 10 through, for example, the transformer 9. The speech quality in that case is somewhat lower, because the conversion to an analog signal is done by the simplified DA converter.

In the present invention, a repeat-mode designation bit is provided in the control information supplied to the speech synthesis circuit 3. When this bit is set to, for example, "0", normal speech synthesis is performed; when it is set to "1", conversion, interpolation, and so on of the speech data are stopped and a speech signal based on the same data is output. That is, the same sound is output continuously, and the speech data needed for it amount to only, for example, 6 bytes.

FIG. 2 is a block diagram of the speech synthesis circuit. 11 is a data buffer stack, a group of registers into which the speech data (D0 to D7) are set by the data load pulse and held until they are converted; 12 is an interface section; 13 is a control register into which the control signals ST, RPT, T0, T1, MODE, and SBE are set by a signal; 14 is a status register that outputs the status-information signals in response to the signal SE; 15 is a speech synthesis section; 16 is a conversion unit, made up of a read-only memory (ROM) or the like, that converts speech data into filter coefficients; 17 is an interpolation unit that interpolates the filter coefficients supplied in every frame period across one frame and outputs them in every sample period; 18 is a sound source unit; 19 is a digital filter unit; 20 is a simplified DA converter; and 21 is a control unit that generates timing signals and controls each section.

The speech synthesis circuit 3 shown here is the integrated-circuit version. V_DD, V_SS, and V_SA are power supply voltages, and the symbols at the other terminals denote signals. For example, ST is the start signal that indicates the start of speech synthesis; RPT is the repeat-mode designation signal; FP is the frame pulse; SOUT is the digital speech output signal; POUT and NOUT are the analog speech output signals; XOUT is the basic clock for internal logic operations; WSYN is the synchronization signal for SOUT; MR resets the internal flip-flops and counters; CKST controls the crystal oscillation circuit; and CLK and XTAL are the crystal resonator connection terminals. The terminals also include a speech-data request signal and an alarm signal.

In the case of the LSP method, one set of speech data is, for example, 6 bytes: a 1-bit start bit and 7 bits of pitch-period data; a 2-bit frame-length designation and 6 bits of amplitude data; and LSP parameters of 4 bits each. The frame length can be specified as 5, 10, 20, or 40 ms.
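The 6-byte set described above can be unpacked as in the following Python sketch. The field widths come from the description; everything else is an assumption made for illustration: the bit order within each byte (start bit and frame-length code in the most significant bits), the high-nibble-first order of the LSP parameters, the count of eight parameters (inferred from four remaining bytes at 4 bits each), and the mapping of the 2-bit code to 5, 10, 20, and 40 ms. The names `SpeechFrame` and `parse_frame` are hypothetical.

```python
from dataclasses import dataclass
from typing import List

# Assumed mapping of the 2-bit frame-length code to milliseconds.
FRAME_LENGTHS_MS = (5, 10, 20, 40)


@dataclass
class SpeechFrame:
    start: int          # 1-bit start bit
    pitch: int          # 7-bit pitch-period data
    frame_ms: int       # frame length selected by the 2-bit code
    amplitude: int      # 6-bit amplitude data
    lsp: List[int]      # 4-bit LSP parameters


def parse_frame(data: bytes) -> SpeechFrame:
    """Unpack one 6-byte set of speech data (bit layout is assumed)."""
    if len(data) != 6:
        raise ValueError("expected exactly 6 bytes")
    start = data[0] >> 7                        # top bit of byte 0
    pitch = data[0] & 0x7F                      # low 7 bits of byte 0
    frame_ms = FRAME_LENGTHS_MS[data[1] >> 6]   # top 2 bits of byte 1
    amplitude = data[1] & 0x3F                  # low 6 bits of byte 1
    lsp: List[int] = []
    for byte in data[2:]:                       # bytes 2..5: two nibbles each
        lsp.append(byte >> 4)
        lsp.append(byte & 0x0F)
    return SpeechFrame(start, pitch, frame_ms, amplitude, lsp)
```

For example, `parse_frame(bytes([0xA8, 0xA1, 0x12, 0x34, 0x56, 0x78]))` yields a pitch value of 40, a 20 ms frame, an amplitude of 33, and the parameters 1 through 8 under these assumptions.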

Speech data of this kind are supplied from the memory to the data buffer stack 11 and set there by the data load pulse. The conversion unit 16 converts the LSP parameters and amplitude data into filter coefficients, which are transferred to the interpolation unit 17. The interpolation unit 17 performs linear interpolation in every sample period and outputs the filter coefficients to the digital filter unit 19. The sound source unit 18 generates impulses according to the pitch-period data in the speech data, or white noise, and supplies them to the digital filter unit 19.
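The two kinds of excitation that the sound source unit supplies can be sketched in Python as follows. This is an illustration under assumptions, not the circuit of the embodiment: the pitch period is taken to be a count of samples, the noise is uniform in the range -1 to 1, and the function names `impulse_train` and `white_noise` are hypothetical.

```python
import random
from typing import List


def impulse_train(pitch_period: int, n_samples: int,
                  amplitude: float = 1.0) -> List[float]:
    """One impulse every `pitch_period` samples, zeros elsewhere."""
    if pitch_period <= 0:
        raise ValueError("pitch_period must be positive")
    return [amplitude if i % pitch_period == 0 else 0.0
            for i in range(n_samples)]


def white_noise(n_samples: int, seed: int = 0) -> List[float]:
    """Uniform noise in [-1, 1]; seeded so runs are repeatable."""
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(n_samples)]
```

Either sequence would then be fed, sample by sample, to the synthesis filter together with the interpolated coefficients.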

The digital filter unit 19 includes adder/subtractor circuits, multiplier circuits, and the like. It operates as a speech synthesis filter by computing on the pulse train from the sound source unit 18 with the filter coefficients from the interpolation unit 17, and outputs the synthesized speech signal as, for example, a 16-bit serial stream.

The operation described above applies when the repeat-mode designation signal RPT is "0". To perform the repeat operation, RPT is set to "1"; the conversion and interpolation operations then stop as described earlier, and the pulse train from the sound source unit 18 is processed by the digital filter unit 19 with constant filter coefficients, so that the same sound is output.

FIG. 3 is a block diagram of the essential part of the embodiment of the present invention. 31, 32, 33, 39, and 43 are inhibit gates; 40 is an AND gate; 46 and 47 are OR gates; 34 is a flip-flop; 35 is a timing adjustment circuit; 36 is a register that holds the pitch-period data; 37 is a sound source generation circuit; 38 and 45 are adder circuits; 41 is a difference value register; 44 is an interpolation value register; and 42 is a 1/2^n circuit.

The speech data D0 to D7 read from the memory are set in the data buffer stack 11 by the data load pulse DL. When the repeat-mode designation signal RPT is "0", "0" is applied to the data terminal D of the flip-flop 34, so the flip-flop 34 is not set even when the write timing pulse CRL is applied to its clock terminal C, and the output terminal Q remains "0". Each output of the timing adjustment circuit 35 is therefore also "0". The conversion read timing pulse t1 is then applied to the data buffer stack 11 through the inhibit gate 31 and the OR gate 46, so that the amplitude data and LSP parameters in the speech data are supplied to the conversion unit 16 and the pitch-period data are supplied to the register 36 of the sound source unit 18. The pitch-period data update timing pulse t2 is applied to the register 36 through the inhibit gate 32, and the pitch-period data are updated at that timing. The sound source generation circuit 37 outputs a pulse train according to the pitch-period data held in the register 36 and supplies it to the digital filter unit 19.

The converted output of the conversion unit 16 is supplied to the adder circuit 38 of the interpolation unit 17. In response to the difference calculation timing signal t3, applied through the inhibit gate 33, the output of the adder circuit 38 is supplied to the difference value register 41 through the AND gate 40 and the OR gate 47. The adder circuit 38 thus sets in the difference value register 41 the difference between the converted output and the filter coefficient output by the adder circuit 45 to the digital filter unit 19. The content of the difference value register 41 is divided by the 1/2^n circuit 42 into per-sample-period increments, and the divided output is supplied to the adder circuit 45. The interpolation value register 44 holds the filter coefficient most recently output to the digital filter unit 19. The adder circuit 45 adds this previously output filter coefficient to the divided output, outputs the resulting filter coefficient to the digital filter unit 19, and stores it in the interpolation value register 44. On the next sample, the adder circuit 45 takes the newly stored filter coefficient from the interpolation value register 44 and adds the divided output to it again. In other words, in every frame period the difference between the coefficient input from the conversion unit 16 and the coefficient of the previous frame is computed and divided by the number of samples in one frame to give a per-sample increment; interpolation is performed by adding this increment, sample by sample, to the coefficient of the previous frame, and a filter coefficient is supplied to the digital filter unit 19 at every sample.
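The per-sample accumulation described above can be sketched in Python for a single coefficient. This is a minimal illustration that uses floating-point arithmetic instead of the fixed-point registers of the embodiment; the function name `interpolate_frame` and its parameters are hypothetical, and the comments map each variable to the register or circuit it stands for.

```python
from typing import List


def interpolate_frame(prev: float, target: float,
                      samples_per_frame: int) -> List[float]:
    """Return one interpolated coefficient per sample, ending at `target`."""
    if samples_per_frame <= 0:
        raise ValueError("samples_per_frame must be positive")
    # Difference (register 41) divided by the samples in a frame (circuit 42).
    step = (target - prev) / samples_per_frame
    value = prev                      # interpolation value register 44
    out: List[float] = []
    for _ in range(samples_per_frame):
        value += step                 # adder 45: previous output plus increment
        out.append(value)             # coefficient sent to the digital filter
    return out
```

When the number of samples per frame is a power of two, the division reduces to a bit shift, which is consistent with a 1/2^n divider in hardware.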

When the repeat-mode designation signal RPT becomes "1", the flip-flop 34 is set when the write timing pulse CRL becomes "1". Its output is timed appropriately by the timing adjustment circuit 35 and applied to the inhibit gates 31, 32, 33, and 43, which stops reading from the data buffer stack 11 and stops updating of the register 36 in the sound source unit 18. In addition, the divided output that the adder circuit 45 would add at every sample is blocked by the inhibit gate 43, so the interpolation operation stops. The sound source generation circuit 37 therefore outputs a pulse train according to the pitch-period data of the previous frame, and the content of the interpolation value register 44 no longer changes from frame to frame but stays fixed at the filter coefficient in effect immediately before interpolation stopped. This filter coefficient is supplied repeatedly to the digital filter unit 19, and the same sound is output continuously.
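The effect of the repeat-mode gating on one coefficient can be modeled with a small Python class. This is a simplified sketch under assumptions: it uses floating point, it treats RPT as a flag passed in at each call, and it does not reproduce the exact resume timing of the embodiment (the completion of the pending interpolation step when RPT returns to "0"). The class name `CoefficientInterpolator` and its methods are hypothetical.

```python
class CoefficientInterpolator:
    """Model of one interpolated filter coefficient with a repeat hold."""

    def __init__(self, initial: float, samples_per_frame: int) -> None:
        if samples_per_frame <= 0:
            raise ValueError("samples_per_frame must be positive")
        self.value = initial          # interpolation value register 44
        self.step = 0.0               # divided output of difference register 41
        self.n = samples_per_frame

    def load_frame(self, target: float, rpt: bool) -> None:
        """Frame boundary: compute a new per-sample increment unless RPT is set."""
        if rpt:
            return                    # conversion and difference calculation inhibited
        self.step = (target - self.value) / self.n

    def next_sample(self, rpt: bool) -> float:
        """Sample clock: add the increment unless RPT blocks it (gate 43)."""
        if not rpt:
            self.value += self.step
        return self.value
```

With `rpt` held true, `next_sample` keeps returning the last coefficient for as many samples as the caller asks for, which is how a single set of speech data can sustain a tone of any length.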

FIG. 4 illustrates the operation. Part (A) shows the case where the repeat-mode designation signal RPT is set to "1" while the request signal is "1". When the frame pulse FP goes from "1" to "0", the request signal goes to "0" to request the speech data DATA, and the data #m are read in. In period T1 of the speech synthesis operation M, conversion of the data from #m-2 to #m-1 and the filter operation in the digital filter unit are carried out, and speech is synthesized.

Once the repeat-mode designation signal RPT becomes "1", the request signal does not go to "0" after the next frame pulse FP, so no new speech data DATA are read in. In period T2, the filter operation is performed with the interpolated value in effect immediately before the rise of the frame pulse FP, and no conversion, difference calculation, or interpolation is performed.

When the repeat-mode designation signal RPT returns to "0", in period T3 of the next frame pulse FP the final step of the interpolation from data #m-2 to data #m-1 and the filter operation are performed, followed by conversion of data #m and calculation of the difference between data #m-1 and #m. The request signal then goes to "0" and the next data, #m+1, are read in.

Part (B) of FIG. 4 shows the case where the repeat-mode designation signal RPT is set to "1" while the request signal is "0". The alarm signal is in the "1" state, the request signal goes to "1" after the frame pulse FP that follows the change of RPT from "1" to "0", and operation is otherwise normal. Speech synthesis while RPT is "1" proceeds as in the case described above.

Although the embodiment above has been described mainly for the LSP method, the invention is also applicable to the PARCOR and LPC methods. The speech data may be the analysis results of relatively long words or sentences, or the analysis results of speech segments such as those used in the VCV method, which combines a vowel V and a consonant C.

As described above, the present invention concerns an apparatus that performs speech synthesis by reading in speech data a fixed number of bytes at a time at every specific period, such as a designated frame period. The apparatus comprises storage means, such as the control register 13, that stores control information; conversion means, such as the conversion unit 16, that converts speech data into filter coefficients; interpolation means, such as the interpolation unit 17, that interpolates the filter coefficients for every sample period; and the digital filter unit 19, which performs operations such as addition, subtraction, and multiplication according to the filter coefficients. When the repeat-mode designation signal RPT provided in the control information is stored in the storage means, such as the control register 13, the conversion operation of the conversion means and the interpolation operation of the interpolation means are stopped, and speech is synthesized using the result obtained by the conversion means immediately beforehand. When the same sound is to be synthesized and output continuously, there is therefore no need to read in speech data at every specific period: a single set of speech data can produce the same sound for any length of time. This reduces the capacity required of the speech data memory and makes it easy to synthesize signal tones and the like.
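The memory saving can be made concrete with a short Python calculation. The 6-byte set size and the 5, 10, 20, and 40 ms frame lengths come from the description; the 2-second tone used in the example and the function name `bytes_needed` are assumptions chosen for illustration.

```python
def bytes_needed(duration_ms: int, frame_ms: int,
                 bytes_per_frame: int = 6, repeat: bool = False) -> int:
    """Speech-data bytes required to sustain one sound for `duration_ms`."""
    if frame_ms not in (5, 10, 20, 40):
        raise ValueError("frame_ms must be 5, 10, 20, or 40")
    if repeat:
        # Repeat mode: one set is read once and its coefficients are held.
        return bytes_per_frame
    frames = -(-duration_ms // frame_ms)   # ceiling division
    return frames * bytes_per_frame
```

Under these assumptions a 2-second tone at a 20 ms frame length needs 100 frames, or 600 bytes, without repeat mode, and 6 bytes with it.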

[Brief description of drawings]

FIG. 1 is a block diagram of a system that performs speech synthesis under the control of a microprocessor; FIG. 2 is a block diagram of the speech synthesis circuit of an embodiment of the present invention; FIG. 3 is a block diagram of the essential part of the embodiment of the present invention; and FIG. 4 illustrates the operation. 1 is a microprocessor, 2 is a memory, 3 is a speech synthesis circuit, 11 is a data buffer stack, 12 is an interface section, 13 is a control register, 14 is a status register, 15 is a speech synthesis section, 16 is a conversion unit, 17 is an interpolation unit, 18 is a sound source unit, 19 is a digital filter unit, 20 is a DA converter, and 21 is a control unit.

Continuation of front page
(72) Inventor: Yujiro Sasahara, c/o Fujitsu Limited, 1015 Kamikodanaka, Nakahara-ku, Kawasaki-shi, Kanagawa
(72) Inventor: Fumitada Itakura, c/o Musashino Electrical Communication Laboratory, Nippon Telegraph and Telephone Public Corporation, 3-9-11 Midori-cho, Musashino-shi, Tokyo
(72) Inventor: Shigeki Sagayama, c/o Musashino Electrical Communication Laboratory, Nippon Telegraph and Telephone Public Corporation, 3-9-11 Midori-cho, Musashino-shi, Tokyo
(72) Inventor: Noboru Sugamura, c/o Musashino Electrical Communication Laboratory, Nippon Telegraph and Telephone Public Corporation, 3-9-11 Midori-cho, Musashino-shi, Tokyo
(56) References: JP-A-55-2234 (JP, A); JP-A-51-123003 (JP, A)

Claims

[Claims]

1. A speech synthesizer in which characteristic components of speech such as a pitch period, an amplitude, and characteristic parameters are extracted and stored in an external memory (2) as speech data of a fixed byte length, and in which the speech data are read in at every specific period and synthesized into speech, the speech synthesizer comprising: conversion means (16) that receives the amplitude and characteristic-parameter data in the speech data read in at every specific period and converts them into digital filter coefficients; interpolation means (17) that, from the digital filter coefficients obtained by the conversion means (16) and the digital filter coefficients obtained in the immediately preceding period, generates interpolated digital filter coefficients at predetermined intervals within the specific period; digital filter means (19) that synthesizes speech by performing a predetermined filter operation using a pulse train generated on the basis of the pitch-period data in the speech data and the digital filter coefficients obtained from the interpolation means (17); and storage means (12) that stores control information, such as start, stop, and repeat operation of speech synthesis, supplied from an external device (1), wherein, when information instructing the repeat operation within the control information is stored in the storage means (12), the conversion of the speech data by the conversion means (16) and the interpolation by the interpolation means (17) are stopped, and the digital filter coefficients obtained by the interpolation means (17) immediately beforehand are supplied to the digital filter means (19), whereby the same sound is synthesized.