JPS6059398A

JPS6059398A - Voice synthesizer

Info

Publication number: JPS6059398A
Application number: JP58167534A
Authority: JP
Inventors: 森戸　誠; 隆矢頭; 三木　敬
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1983-09-13
Filing date: 1983-09-13
Publication date: 1985-04-05
Also published as: JPH0449956B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】（技術分野）本発明は記憶領域から音素波形の波形領域での情報を読
み出し音声を合成する音声合成器に関し、特に複数のＡ
ＤＰＣＭ符号化された対称音素波形を重畳する加算を実
行し、音声出力を得る音声合成器に関する。Detailed Description of the Invention (Technical Field) The present invention relates to a speech synthesizer that reads information in the waveform region of a phoneme waveform from a storage area and synthesizes speech.
The present invention relates to a speech synthesizer that performs addition to superimpose DPCM-encoded symmetric phoneme waveforms and obtains speech output.

（従来技術）人間の声は有声音の場合、肺からの空気流が声帯によっ
て■周期的なインパルス流となり声道と呼ばれる空洞共
鳴体と共鳴することによって発せられる。(Prior Art) When a human voice is a voiced sound, it is emitted when airflow from the lungs becomes a periodic impulse flow through the vocal cords and resonates with a hollow resonator called the vocal tract.

人間の声の゛波形を第１図に示す。Figure 1 shows the waveform of a human voice.

第１図に示されるごとく人間の声は゛ピッチ１と呼ばれ
る周期ことにほとんど同じ波形がくりかえされている。As shown in Figure 1, the human voice has almost the same waveform repeated in a period called ``pitch 1''.

このことは前にも述べたように人間の声は準周期的なイ
ンパルス流によって発せられることから起因しておシピ
ッチ周期はこのイン・ぐルス流の間隔に等しい。This is because, as mentioned earlier, the human voice is emitted by a quasi-periodic impulse flow, and the pitch period is equal to the interval of this impulse flow.

このような音声を合成しようとしたとき音声の情−：し
をとのような形で格納しておくかにより各種方式があげ
られる。When attempting to synthesize such speech, various methods can be used depending on whether the information of the speech is stored in a format such as "shioto".

その１つの方式として音声の１ピッチ周期の波形（これ
を音素波形と称する）をいろいろな音声について記憶領
域に格納し制御情報にしたかいこ力、ら音素波形をつな
ぎ合わぜることによって音声を出力する方式がある。One method is to store the waveform of one pitch period of voice (this is called a phoneme waveform) in a storage area for various voices and use it as control information, and output voice by connecting the phoneme waveforms. There is a method to do this.

第２図にこの方式による構成を示す。Figure 2 shows a configuration using this method.

１は各種音声に対して１ピ、チ周期の音素波形を格納し
ておく記憶領域で、２は音素波形をつなき合わせるだめ
のピッチならびに振幅倍などの制御情報用記憶領域で、
３は音声素片をつなぎ合わぜ、′３合成部である。1 is a storage area for storing phoneme waveforms of 1 period and 1 period for various voices, 2 is a storage area for control information such as pitch and amplitude multiplication for connecting phoneme waveforms,
3 is a '3 synthesis section that connects speech segments.

記憶領域ｌに格納された音声を記憶領域２に格納された
制御情報によってつなぎ合わせることによって音声を合
成部３で合成するが、自然性の高い合成音を作るために
は声の高さ、声の大きさを連節に制御しなければならな
い。声の大きさは記憶領域内の振幅倍情報によって記憶
領域内の音素を一定倍することによシ制御される。また
声の高さは記憶領域２内のピッチ情報によって制御され
る。しかし記憶領域１の中の音素波形は音素波形を抽出
した際の音声のピッチ周期の長さをもってお９制御情報
によって与えられるピッチとは必ずしも一致してい々い
。Speech is synthesized by the synthesizer 3 by connecting the voices stored in the storage area 1 using the control information stored in the storage area 2. However, in order to create a highly natural synthesized sound, the pitch of the voice, The size of the must be controlled in an articulated manner. The loudness of the voice is controlled by multiplying the phonemes in the storage area by a certain amount using amplitude multiplication information in the storage area. Furthermore, the pitch of the voice is controlled by pitch information in the storage area 2. However, the phoneme waveform in the storage area 1 does not necessarily match the pitch given by the control information 9 based on the length of the pitch period of the voice when the phoneme waveform is extracted.

したがって制御情報のピッチが音声素片長より短い場合
には音声素片の後端を切シ、制御情報のピッチが音声素
片長より長い場合には音声素片の最後の値を延長して制
御情報と同一ピッチ長をもった音素をする。（第３図参
照）しかし記憶領域１に格納された音素波形を途中で切
った場合には音素波形が十分に減衰していないときには
接続する音素波形との間に不連続を生じ合成音に悪影き
ょうをおよほずという欠点があり、また逆に音素波形を
長くし′に場合には音素波形のスぜクトラムが変形して
し７１い音質劣化をまねくという欠点があった。Therefore, if the pitch of the control information is shorter than the speech segment length, the rear end of the speech segment is cut off, and if the pitch of the control information is longer than the speech segment length, the last value of the speech segment is extended and the control information is A phoneme with the same pitch length as . (See Figure 3) However, if the phoneme waveform stored in storage area 1 is cut in the middle, if the phoneme waveform is not sufficiently attenuated, discontinuity will occur between the phoneme waveform and the connected phoneme waveform, which will affect the synthesized sound. It has the disadvantage of causing shadows, and conversely, when the phoneme waveform is lengthened, the spectrum of the phoneme waveform is deformed, resulting in severe deterioration of sound quality.

（発明の目的及び概要）本発明の目的はこれらの欠点を解決することにあシ、ピ
ッチ周期ずつずれる複数チャンネルのＡＤ　＋）ＣＭ符
号化された対称音素波形を重畳する音声合成器において
、ＡＤＰＣＭ符号再生を時間多重処理することにより、
また対称音素波形の重畳演算に用いる加算器をＡＤＰＣ
Ｍ符号再生器内の加算と共有することによシ簡単な回路
構成で自然性のある良質、午音声を合成する音声合成器
を実現させたもので、以下詳細に説明する。(Objective and Summary of the Invention) The object of the present invention is to solve these drawbacks, and to solve the above problems, it is an object of the present invention to solve the above problems. By time-multiplexing code reproduction,
In addition, the adder used for superposition calculation of symmetric phoneme waveforms is ADPC.
By sharing this with the addition in the M-code regenerator, a speech synthesizer for synthesizing natural and high-quality speech is realized with a simple circuit configuration, and will be described in detail below.

（発明の前提）前にも述べたように音声は声帯によるインパルス流が共
鳴することによシ発せられておシこれは電気回路に置き
換えることができる。(Premise of the Invention) As mentioned before, speech is produced by the resonance of the impulse flow of the vocal cords, and this can be replaced by an electric circuit.

すなわち音声は声帯に相当する励振回路の発する１つの
インパルスに対応した共振フィルタの出力波形（以下こ
れを音素波形と呼ぶ）の重なシ合っゾヒものと考えられ
る。このことを第４図を用いて説明する。In other words, speech is considered to be a combination of overlapping output waveforms of a resonant filter (hereinafter referred to as phoneme waveforms) corresponding to one impulse emitted by an excitation circuit corresponding to the vocal cords. This will be explained using FIG. 4.

第４図のｌＯは、励振回路の発するインパルス列である
。ここで、各インパルス間の時間間隔はピッチ周期間隔
である。１１は、励振回路の発するイン・ぞルス列で共
振フィルタを駆動した合成音声出力波形である。１２は
励振回路の発するイン・ぐルス列のうち、インパルスＰ
１によって共振フィルタを駆動した場合の音素波形であ
る。以下同様に１３〜１７はイン・やルスＰ２〜Ｐ６に
よって共振フィルタ４を駆動した場合の音素波形である
。励振回路よシ発せられる駆動イン・♀ルス列ｌＯはイ
ンパルスＰ１からＰ６の加算であるから重畳の定理によ
れば合成音声出力波形１１は各音素波形１２から１７４
での加算によって得られる。IO in FIG. 4 is an impulse train generated by the excitation circuit. Here, the time interval between each impulse is the pitch period interval. 11 is a synthesized audio output waveform obtained by driving a resonant filter with the in-sense train generated by the excitation circuit. 12 is an impulse P of the impulse train generated by the excitation circuit.
This is a phoneme waveform when the resonant filter is driven by 1. Similarly, 13 to 17 are phoneme waveforms when the resonant filter 4 is driven by in/ya ruses P2 to P6. Since the drive impulse train lO emitted by the excitation circuit is the addition of impulses P1 to P6, according to the superposition theorem, the synthesized speech output waveform 11 is the sum of each phoneme waveform 12 to 174.
It is obtained by adding .

第４図に示される音素波形１２は時間点ｔ１以前はＯで
ある。時間点ｔ１以後では時間経過とともに減衰し無限
大時間点では０となる性質を有する。The phoneme waveform 12 shown in FIG. 4 is O before time point t1. After the time point t1, it has a property of attenuating with the passage of time and becoming 0 at an infinite time point.

実際、音声波形の場合励振点（１，）から１６ミリ秒を
経過した時点での音素波形はほとんどＯと考えられる。In fact, in the case of a speech waveform, the phoneme waveform is considered to be almost O when 16 milliseconds have passed from the excitation point (1,).

したがって音素波形１２の再生処理は励振時点（ｔｌ　
／から１６　ミ’）秒間で十分である。Therefore, the reproduction process of the phoneme waveform 12 is performed at the excitation time point (tl
/ to 16 m') seconds is sufficient.

しかし、音素波形１３の再生処理を励振時間点ｔ２から
開始するには、音素波形１２の再生処理が終了していな
いために多重的な再生処理が必要となる。必要となる波
形再生多重度ｎは次の第（１）式、て与えられる。通常
音声の場合ｎは４程度で十分である。そこでｎ　＝　４
として以後説明する。However, in order to start the reproduction process of the phoneme waveform 13 from the excitation time point t2, multiple reproduction processes are required because the reproduction process of the phoneme waveform 12 has not yet been completed. The required waveform reproduction multiplicity n is given by the following equation (1). In the case of normal voice, n of about 4 is sufficient. So n = 4
This will be explained below.

次に音素波形を再生するだめの符号として、符−づ化効
率のよいＡＤＰＣＭ符号を用い、さらに音素波形として
半分の情報量ですむ偶対称波形を用いる。Next, as a code for reproducing the phoneme waveform, an ADPCM code with good encoding efficiency is used, and an even symmetric waveform that requires half the amount of information is used as the phoneme waveform.

一連のＡＤＰＣＭ符号から対称な音素波形を再生する再
生器については特願昭５６−１．８５４８９に提案され
ているので詳しい説明は省略する。A regenerator for reproducing symmetrical phoneme waveforms from a series of ADPCM codes has been proposed in Japanese Patent Application No. 56-1.85489, so a detailed explanation will be omitted.

、ＡｌＴｌＰＣＭ符号の再生処理のような差分復号処理
において１６ミリ秒（１２８標本周期）で再生処理を打
ち切った場合には最終出力値が保持される。In differential decoding processing, such as reproduction processing of AlTlPCM codes, when the reproduction processing is terminated after 16 milliseconds (128 sample periods), the final output value is held.

しだがって音素波形の条件（条件］）　励振時間点以前は０である（条件２）　無
限大時間点では０であるを）１にだすためには再生初期
値がＯである事と、再生最終出力値がＯである事が必要
である。Therefore, the phoneme waveform conditions (conditions) are 0 before the excitation time point (condition 2). In order to make it 1 (0 at the infinite time point), the initial reproduction value must be O. It is necessary that the reproduction final output value is O.

本発明では偶対称波形を扱っているため再生初期値さえ
Ｏにすれば再生最終値は０となシ条件を満足する。Since the present invention deals with even symmetrical waveforms, if the initial reproduction value is O, the final reproduction value will be 0, which satisfies the condition.

（発明の実施例）第５図に本発明における１実施例を示す。(Example of the invention) FIG. 5 shows one embodiment of the present invention.

ここで１標本周期時間内に時分割に処理される処理に対
して番号付けのだめ「チャネル」又はｒ　ｃｈ　Ｊとい
う言葉を用いる。Here, the term "channel" or r ch J is used for numbering purposes for processes that are time-divisionally processed within one sample period.

第５図における各部は次のとおりである。１００は第１
Ｃｈ入カレノスタ、１０１は第２ｃｈ入力レジスタ、１
０２は第３ｃｈ入カレソスタ、１０３け第４ｃｈ入力レ
ジスタ、１０４，１０５はセレクタ、１０６はＡＤＰＣ
Ｍ　ｉ号データＬｎを格納するレジスタ、１０７はレジ
スタ１０６の値をアドレスとしてポインタ移動ｆ３．Ｄ
ｎを出力するポインタ移動量メモリ、１０８は加減算器
、１０９はセレクタ、１１０はセレクタ、１１１は第１
ｃｈポインタレジスタ、ｌ１２は第２ｃｈポインタレジ
スタ、１１３　ハ第３ｃｈポインタレノスタ、１１４は
第４ｃｈ、］？インクレノスタ、１１５はセレクタ、１
１６はポインタ値Ｐｎを特定の範囲に限定してポインタ
リミッタ値〜Ｐｎ’として出力するポインタリミッタ、１１７は量子
化ステ、プ値Ｘを格納している量子化メモリ、１１Ｂは
シフトレジスタ、１１９は加減算器、１２０はレジスタ
、１２１は合成音声を出力するレジスタ、１２２は出力
端子、１３５は第１ｃｈダウンカウンク、１３６は第２
ｃｈダウンカウンク、１３７は第３ｃｈダウンカウンタ
、１３８は第４．ｃｈグウンカウンク、１３９は第１ｃ
ｂデコーダ、１４０は第２ｃｈデコーダ、１４ノは第３
　ｃｈデコーダ、１４２は第４．ｃｈデコーダ、１４３
は各チャネルのＢＵＳＹ信号、１４４は対称波形の再生
のうち左半分の再生を示すＬＥＦＴ信号、１４５は波形
再生のだめの起動信号、１５０けコントローラである。Each part in FIG. 5 is as follows. 100 is the first
Ch input careno star, 101 is the second channel input register, 1
02 is the 3rd channel input register, 103 digits is the 4th channel input register, 104 and 105 are the selectors, and 106 is the ADPC.
The register 107 that stores the M i data Ln moves the pointer f3 using the value of the register 106 as an address. D
108 is an adder/subtractor, 109 is a selector, 110 is a selector, 111 is a first
ch pointer register, l12 is the second channel pointer register, 113 c is the third channel pointer register, 114 is the fourth channel, ]? Increnostar, 115 is selector, 1
16 is a pointer limiter that limits the pointer value Pn to a specific range and outputs it as a pointer limiter value ~ Pn'; 117 is a quantization step; quantization memory stores the step value X; 11B is a shift register; 119 is a Adder/subtractor, 120 is a register, 121 is a register for outputting synthesized speech, 122 is an output terminal, 135 is a first channel down count, 136 is a second channel
137 is the 3rd channel down counter, 138 is the 4th channel down counter, and 137 is the 3rd channel down counter. ch gounkaunk, 139 is the 1st c
b decoder, 140 is the second channel decoder, 14 is the third
ch decoder 142 is the fourth .ch decoder. ch decoder, 143
144 is a BUSY signal of each channel, 144 is a LEFT signal indicating reproduction of the left half of the symmetrical waveform reproduction, 145 is a start signal for waveform reproduction, and 150 is a controller.

コントコーラ１５０は前述の第１ｃｈ入力レノスタ１０
０、第２ｃｈの入力レジスタ１０１．・・・、第３ｃｈ
デコーダ１４ノ、第４ｃｈデコーダ１４２等の各回路と
接続さハ、その動作の制御を行なっているが第５図は図
が複雑になるのをさけるためにその接続の様子は省略し
ている。The controller 150 is the first channel input renostar 10 described above.
0, second channel input register 101. ..., 3rd ch
It is connected to each circuit such as the decoder 14 and the fourth channel decoder 142 to control their operation, but the connections are omitted in FIG. 5 to avoid complicating the diagram.

第６図は偶対称な音素波形を再生するために本発明にお
いて用いるデータでポインタ初期値ＤＰｎ。FIG. 6 shows data used in the present invention to reproduce an even symmetrical phoneme waveform, and the pointer initial value DPn.

６４個のＡＤＰＣＭ符号Ｌｎ＋　、　Ｌｎ２．　Ｌｎ３
．−　＋　Ｌｎ６４から成り立っている。特願５６−１
．８５４８９にも述べているように１６ミリ秒（１２８
標本周期）の偶対称波形を再生するためには６４個のＡ
ＤＰＣＭ符号が必要でちる。この６４個のＡＤＰＣＭ符
号を順方向及び逆方向に再生して１２８個の音素波形を
再生する。64 ADPCM codes Ln+, Ln2. Ln3
．． - + Consists of Ln64. Patent application 56-1
．． As stated in 85489, 16 milliseconds (128
In order to reproduce an even symmetrical waveform with sample period), 64 A
DPCM code is required. These 64 ADPCM codes are reproduced in forward and reverse directions to reproduce 128 phoneme waveforms.

寸たポインタの初期値データＴ）　Ｐは対称波形再生開
始時点にポインタに格納されるデータである。Initial value data T) of the pointer P is data stored in the pointer at the start of symmetrical waveform reproduction.

また各チャンネルに割り当てられる再生処理と時間との
関係を第７図に示す。Further, FIG. 7 shows the relationship between the playback processing and time assigned to each channel.

第７図において２００は合成出力波形、２０ＩはチーＶ
ネル１に割り当てられた再生処理によって再生される音
素波形、２０２はチャネル１用起動信号Ｓ　Ｔ　］　、
　２０．９はチャネル１用ＢＵＳＹ信号ＢＵＳＹＩ　、
　２０４はチャネル１用ＬＥＦＴ信月ＬＥＦＴＩ、以下
２０５，２０９，２１３はチャネル２，３．４に割り当
てられた再生処理によって再生される音素波形、２０６
．２１０，２１４はチャネル２．３．４の起動信号ＳＴ
２．ＳＴ３．ＳＴ４．２０７，２１１．２１５はチャネ
ル２゜３．４用ＢＵＳＹ信号ＢＵＳＹ　２　、　ＢＵＳ
Ｙ　３　、　ＢＵＳＹ　４　。In Fig. 7, 200 is the composite output waveform, 20I is the chi V
The phoneme waveform reproduced by the reproduction process assigned to channel 1, 202 is the activation signal S T for channel 1,
20.9 is the BUSY signal BUSYI for channel 1,
204 is the LEFT Shingetsu LEFTI for channel 1, 205, 209, 213 are the phoneme waveforms reproduced by the reproduction processing assigned to channels 2, 3.4, 206
．． 210, 214 are activation signals ST of channel 2.3.4
2. ST3. ST4.207, 211.215 are BUSY signals for channel 2°3.4 BUSY 2, BUS
Y3, BUSY4.

：！０８．２１２，２１６はチャネル２，３．４用ＬＥ
ＴＦ信号ｉ、ＥＦＴ　２　、　ＬＥＦＴ　３　、　ＬＥ
ＦＴ、４信号である。:! 08.212, 216 is LE for channel 2, 3.4
TF signal i, EFT 2, LEFT 3, LE
FT, 4 signals.

第５図〜第７図を用いて本発明の実施例を詳しく説明す
る。Embodiments of the present invention will be described in detail with reference to FIGS. 5 to 7.

時間点ｔ１において外部から起動信号８１゛１がコント
ローラ１５０にくわえられチャネル］によって第４図１
２に相当する音素波形の再生処理が開始される。At time point t1, a starting signal 81'1 is applied to the controller 150 from the outside and the channel] is activated as shown in FIG.
The reproduction process of the phoneme waveform corresponding to 2 is started.

？−のときコントローラ１５０はダウンカウンタ１３５
に値］２８をセットする。この値は前記１６ミＩＪ秒１
で相当する値である。（出力波形の標本化周期を１２５
マイクロ秒とすると、１６ミリ秒は１：己８標本化周期
となる）デコーダ１３９はダウンカウンタ１３５の出力がＯ，Ｌ
、Ｉ外の場合は′°１′″をＢＵＳＹ　１信号として出
力する。まだダウンカウンタ１３５の出力が６５以上な
らば１″をＬＢＦＴ　１信号として出力する。（第７図
参照）以下、標本化周期（１２５マイクロ秒）ごとに入
力レジスタ１０θから波形再生用のデータを読み取ｐ　
ＡＤＰＣＭ符号による対称波形再生処理ンタ１３５は１
ずつ減じられる。６４個のＡＤＰＣＭ符号を最初から順
番に順方向に再生し、終わると、すなわち６５回目の再
生処理以降はＬＥＦＴ　１信号が″０″となり、６４個
のＡＤＰＣＭ符号を最後から順番に逆方向にＡＤＰＣＭ
逆再生処理を行なう。このようにして１２８２８回目生
処理が終了した時点でダウンカウンタ１３５はＯとなり
　、ＢＵＳＹ　］信号が“０″となる。? -, the controller 150 starts the down counter 135
Set the value to 28. This value is 16 mIJ seconds 1
is the corresponding value. (Sampling period of output waveform is 125
(If it is a microsecond, 16 milliseconds is a 1:8 sampling period.) The decoder 139 outputs the output of the down counter 135 from
, outside I, ``°1'' is output as the BUSY 1 signal. If the output of the down counter 135 is still 65 or more, 1'' is output as the LBFT 1 signal. (Refer to Figure 7) Hereafter, data for waveform reproduction is read from the input register 10θ every sampling period (125 microseconds).
The symmetrical waveform reproduction processing unit 135 based on the ADPCM code is 1
It is reduced by The 64 ADPCM codes are played back in order from the beginning in the forward direction, and when it is finished, that is, after the 65th playback processing, the LEFT 1 signal becomes "0", and the 64 ADPCM codes are played back in order from the end.
Perform reverse playback processing. In this way, when the 12828th raw processing is completed, the down counter 135 becomes O, and the BUSY ] signal becomes "0".

第１表に、コントローラ１５０が、セレクタ１０４を介
して入力される各チャンネルの入力レジスタ（１００，
１０１，ＩＯ２，１０３）に格納されたＡＤＰＣＭ　符
号Ｌｎと、シフトレジスタ１１８によって力）と、ＬＥ
ＦＴ信号１４４とを用いてし・ゾスタ１２０と加減算器
１１９において行なうＡＤＰＣＭ波形再生演算処理（ポ
インタの移動演算は除く）を示す。Table 1 shows that the controller 150 inputs input registers (100, 100,
ADPCM code Ln stored in 101, IO2, 103), output by shift register 118), and LE
This figure shows the ADPCM waveform reproduction calculation processing (excluding pointer movement calculation) performed in the digital camera 120 and the adder/subtractor 119 using the FT signal 144.

同様に時間点ｔ２において外部から起動信号ＳＴ２がコ
ントローラ１５０に加えられ、チャネル２（でよって第
４図１３に相当する音素波形の再生処理が開始される。Similarly, at time point t2, an activation signal ST2 is externally applied to the controller 150, and the reproduction process of the phoneme waveform of channel 2 (corresponding to FIG. 4, 13) is started.

このときダウンカウンタ１　、？　６に値１２８がセッ
トされ、ＢＵＳＹ　２信号が”　１　”　となる。以下
チャネル１と同様な再生処理を行ない１２８標本化周期
後にダウンカウンタ１３６は０となりＢＵＳＹ　２偕号
は”　ｏ　”　となる。At this time, the down counter is 1,? 6 is set to the value 128, and the BUSY 2 signal becomes "1". Thereafter, the same reproduction process as in channel 1 is performed, and after 128 sampling periods, the down counter 136 becomes 0 and the BUSY 2 signal becomes "o".

以下時間点ｔ３からはチャネル３において、時間点ｔ４
からはチャネル４において、時間点ｔ。From time point t3 onwards, in channel 3, time point t4
From channel 4, time point t.

から４−ｉチャネル」において同様な制御が行なわれる
。Similar control is performed for the 4-i channels.

このように４つのチャネルによって再生処理を行ない合
成音声出力波形を標本化周期ごとに出力するが、各チャ
ンネルの再生処理は１標本時間点の間で時分割に行なわ
れる。ここで、１標本化周期内での処理の時間関係を第
８図（ａ）に、そのフローチャートを第８図（ｂ）　、
　（ｃ）に示す。In this way, reproduction processing is performed using the four channels and a synthesized audio output waveform is output at each sampling period, but the reproduction processing of each channel is performed in a time-division manner between one sampling time point. Here, the time relationship of processing within one sampling period is shown in Fig. 8(a), and its flowchart is shown in Fig. 8(b).
Shown in (c).

１襟本化周期内での処理は５つのサイクルに分かれてお
り、順次処理される。尚、それぞれのサイクルにおいて
処理されるＡＤＦ’ＣＭ再生処理に必要な回路はほとん
ど共有化されており第６図の各構成要素のうちその名称
にチャネル番号の付加されていない構成要素はすべて各
サイクルに共通に用いられるものであり、これらを総称
して「共有部」と称する。The processing within one embedding period is divided into five cycles, which are sequentially processed. It should be noted that most of the circuits required for the ADF'CM playback process processed in each cycle are shared, and among the components shown in Figure 6, all components that do not have a channel number added to their name are used in each cycle. These are commonly used in common areas, and these are collectively referred to as "shared parts."

（サイクル１）チャネル１に割り当てられた波形再生処理が入力レジス
タ１００、ポインタレジスタ１１ノ、ダウンカウンタ１
３５、デコーダ１３９と共有部を用いて行なわれ、結果
はレジスタ１２０に格納される。(Cycle 1) The waveform reproduction processing assigned to channel 1 is performed by input register 100, pointer register 11, and down counter 1.
35, is performed using the decoder 139 and the shared part, and the result is stored in the register 120.

（サイクル２）チャネル２に割り当てられた波形再生処理が入カレノス
タ１０１、ポインタレジスタ１１２、ダウンカウンタ１
３６、デコーダ１４０と共有部を用いて行なわれ、結果
はレジスタ１２０に格納される。(Cycle 2) The waveform reproduction process assigned to channel 2 is input to the current register 101, pointer register 112, and down counter 1.
36, is performed using the decoder 140 and the shared part, and the result is stored in the register 120.

（サイクル３）天ヤイ、ル３に割り当てられた波形再生処理が入カレノ
スク１０２、ポインタレノスフ１１３、ダウンカウンタ
Ｊ３７、デコーダ１４１と共有部を川゛、＾て行なわれ
、結果はレジスタ１２０に格納されろ。(Cycle 3) The waveform reproduction process assigned to the top and bottom 3 is carried out through the shared part of the input screen 102, pointer screen 113, down counter J37, and decoder 141, and the result is stored in the register 120. Be it.

（サイクル４）チャネル４に割探当てられた波形再生処理が入カレソス
タ１０３、ポインタレジスタ１１４、ダウンカウンタ１
３８、デコーダ１４２と共有部を用いて行なわれ、結果
はレジスタ１２０に格納される。(Cycle 4) The waveform reproduction processing assigned to channel 4 is performed by the input register register 103, pointer register 114, and down counter 1.
38, is performed using the decoder 142 and the shared part, and the result is stored in the register 120.

（サイクル５）し／ノスタ１２０の値をレジスタ１２１に格納して合成
音声出力のためのＰＣＭデーデーする。(Cycle 5) The value of the /nostar 120 is stored in the register 121 and used as PCM data for outputting synthesized speech.

以上の５つのサイクルにおける制御を第８図（ｂ）。The control in the above five cycles is shown in FIG. 8(b).

＜ｃ）・′Ｄフローチャートに示す。<c)・'D It is shown in the flowchart.

このように本実施例では、各チャンネルに割シ肖てられ
だＡＤＰＣＭ符号による対称音素波形再生処理・計、各
チャンネルごとの構成要素と、共通的に用いる共有部と
で、時分割処理によって実施している。また各チ、ヤン
ネルの波形再生に用いる逐次加Ｘ手段（レジスタ１２０
と加減算器１１９）は音素波形を重ね合わせるだめの加
算手段も兼ねており小さな回路構成により実現される特
長を有すイ名る。又、各チャンネルの起動作置の間隔の変更により合
成音声出力のピッチ周期を容易に変化させることができ
る。さらに１つの音素波形もＯから始まＩ）Ｏで終わる
ためそれぞれの音素の加算による波形の不連続性等の音
質劣化も発生しない特長を合わせ持っている。As described above, in this embodiment, the symmetrical phoneme waveform reproduction processing using the ADPCM code applied to each channel is performed by time-division processing using the constituent elements for each channel and the commonly used shared part. are doing. Also, a sequential addition X means (register 120) used for waveform reproduction of each channel and channel.
The adder/subtracter 119) also serves as an addition means for superimposing the phoneme waveforms, and has the advantage of being realized by a small circuit configuration. Furthermore, by changing the activation interval of each channel, the pitch period of the synthesized audio output can be easily changed. Furthermore, since one phoneme waveform starts from O and ends at I)O, it also has the advantage that no deterioration in sound quality such as waveform discontinuity due to addition of each phoneme occurs.

（発明の効果）以上説明したように本発明によれば、簡単な回路構成に
よりピッチ周期のコントロールが可能で良質な合成音を
出力する音声合成回路が構成できる。(Effects of the Invention) As described above, according to the present invention, it is possible to configure a speech synthesis circuit that can control the pitch period and outputs high-quality synthesized speech with a simple circuit configuration.

[Brief explanation of the drawing]

第１図は音声波形を示しだ図、第２図は従来の音声合成
器の構成図、第３図はピッチを変化させる場合に用いる
波形を示す図、第４−図は音素波形の重ね合わせの説明
図、第５図は本発明の１実施例計示した図、第６図は音
素波形のｆ−夕形式を示した図、第７図は音素波形とＳ
Ｔ倍信号ＢＵＳＹ信号、ＬＥＦＴ信号の時間関係を示し
だ図、第８図（ａ）は１標本周期時間内での処理の時間
関係を表わした図、第８図（ｂ）　、　（ｃ）は１標本
周期時間内での処理を表わしたフローチャートである。１００・・第１．ｃｈ入カレノスタ、１０１・・・第２
ｃｈ人力レジスタ、１０２・・・第３ｃｈ入力レノスタ
、１θ３　第４．ｃｈ入力レジスタ、１０４・・セレク
タ、１０５・・セレクタ、１０６・レジスタ、１０７・
月９インク移動量メモリ、１０８・・・加減算器、１０
９セレクタ、１１０・・セレクタ、ｔ７１１・・第１ｃ
ｈポインタレノスタ、１１２・・第２ｃｈポインタレソ
スタ、１１３・第３ｃｈ７］？インクレノスタ、１１４
第４　ｃｈポインタレジスタ、１１５　・セレクタ、１
１６・・ポインタリミ、り、１１７・・量子化メモリ、
１１８・・・シフトレジスタ、１１９・・加減算器、ｊ
　２０　・レジスタ、１２１・・・レジスタ、１２２・
・出力端子、１３５・・第１ｃｈダウンカウンタ、１３
６・・第２ｃｈダウンカウンタ、１３７・・・第３ｃｈ
ダウンカウンタ、１３８・・・第４．ｃｈダウンカウン
タ、１３９・・・第１ｃｈデコーダ、１４０・・第２ｃ
ｈデコーダ、１４ノ　第３ｃｈデコーダ、１４２・・・
第４ｃｈデコーダ、１４３・ＢＵＳＹ信号、１．４４・
・・ＬＥＦＴ信号、１４５・・起動信号、１５０・・・
コントローラ。特許　出　願人　沖電気工業株式会社第６図図（ｃ）１サイフ、ν　３ ′リーイクル　４一−１−一（〕、、Ｈ，に件の表示昭和５８年　特　許　願第１６７５３４号２ヅこ明の名
称音声合成器６　補正の内容別紙のとおシ〉補正の内容（１）明細書第１０頁第６行目に「１２８個の」とある
のを１１２８標本周期の」と補正する。（２）　同書第１０頁第７行目に「初期値データＤ　Ｉ
）　Ｊとあるのを「初期値データＤＰｎ」と補正する。Figure 1 shows the speech waveform, Figure 2 is a block diagram of a conventional speech synthesizer, Figure 3 shows the waveform used to change the pitch, and Figure 4 shows the superposition of phoneme waveforms. FIG. 5 is a diagram showing one embodiment of the present invention, FIG. 6 is a diagram showing the f-event format of the phoneme waveform, and FIG. 7 is a diagram showing the phoneme waveform and S
Figure 8(a) shows the time relationship between the T times signal BUSY signal and LEFT signal. Figure 8(a) shows the time relationship of processing within one sample period time. Figures 8(b) and (c) show the time relationship of the T times signal BUSY signal and LEFT signal. 3 is a flowchart showing processing within one sample cycle time. 100... 1st. Channel entry Karen Star, 101...2nd
ch human power register, 102...3rd ch input renostar, 1θ3 4th. channel input register, 104... selector, 105... selector, 106... register, 107...
Monthly 9 Ink movement amount memory, 108... Addition/subtraction device, 10
9 selector, 110... selector, t711... 1st c
h pointerless star, 112...2nd ch pointerless star, 113, 3rd ch7]? Increnosta, 114
4th channel pointer register, 115 ・Selector, 1
16...Pointer limit, 117...Quantization memory,
118...Shift register, 119...Addition/subtraction unit, j
20・Register, 121...Register, 122・
・Output terminal, 135...1st channel down counter, 13
6... 2nd channel down counter, 137... 3rd channel
Down counter, 138...4th. Channel down counter, 139...1st channel decoder, 140...2nd c
h decoder, 14th 3rd channel decoder, 142...
4th channel decoder, 143・BUSY signal, 1.44・
...LEFT signal, 145...Start signal, 150...
controller. Patent Applicant: Oki Electric Industry Co., Ltd. Figure 6 (c) 1 Wallet, ν 3' Leakle 4 1-1-1 ( ), H, 1982 Patent Application No. 167534 2ヅKomei's Name Speech Synthesizer 6 Contents of Correction Details of the Attachment Contents of Correction (1) In the 6th line of page 10 of the specification, ``128 pieces'' is corrected to ``1128 sampling periods''. (2) In the same book, page 10, line 7, “Initial value data DI
) Correct "J" to "Initial value data DPn".

Claims

[Claims]

Corresponding to a plurality of channels, unit data including an I ink initial value and a plurality of PCM codes are manually inputted into a reproducing means, and an even symmetrical phoneme waveform is reproduced for each channel with a pitch cycle shift. In a speech synthesizer that adds the phoneme waveforms of each channel using an adding means and outputs the addition result as a synthesized sound, the phoneme waveform of each channel has a certain time length from the start of playback, and the initial value and final value are O. The waveform is evenly j-symmetrical, and the first-stage reproduction means is used for each of the first and second waves. 2 times for each channel within the sample time.
1. A speech synthesis device which performs reproduction processing in a time-division manner, and wherein said addition means shares a renostar and an adder/subtractor used in the Ail recording reproduction means and accumulates phoneme waveforms. vessel.