JPS6059397A - Voice synthesizer

Info

Publication number: JPS6059397A
Application number: JP58167533A
Authority: JP
Inventors: 森戸　誠; 隆矢頭; 三木　敬
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1983-09-13
Filing date: 1983-09-13
Publication date: 1985-04-05

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

Detailed Description of the Invention (Technical Field) The present invention relates to a speech synthesizer that reads waveform-domain information on phoneme waveforms from a storage area and synthesizes speech, and in particular to a speech synthesizer that obtains a speech output by performing an addition that superimposes a plurality of DPCM-encoded phoneme waveforms.

(Prior Art) For voiced sounds, the human voice is produced when the airflow from the lungs is turned by the vocal cords into a quasi-periodic impulse stream, which then resonates with a cavity resonator called the vocal tract.

FIG. 1 shows the waveform of a human voice.

As FIG. 1 shows, the human voice repeats almost the same waveform in every period, and this period is called the "pitch".

As noted above, this is because the human voice is produced by a quasi-periodic impulse stream; the pitch period equals the interval between impulses in that stream.

When such speech is to be synthesized, various methods are available depending on the form in which the speech information is stored.

One such method stores, for a variety of speech sounds, the waveform of one pitch period of speech (called a phoneme waveform) in a storage area, and outputs speech by joining these phoneme waveforms together according to control information.

FIG. 2 shows a configuration based on this method.

Reference numeral 1 denotes a storage area holding one-pitch-period phoneme waveforms for various speech sounds, 2 denotes a storage area for control information, such as pitch and amplitude multiplier, used to join the phoneme waveforms, and 3 denotes a synthesis section that joins the speech segments together.

The synthesis section 3 synthesizes speech by joining the speech stored in storage area 1 according to the control information stored in storage area 2. To produce natural-sounding synthesized speech, however, the pitch and the loudness of the voice must be controlled continuously. Loudness is controlled by multiplying the phoneme waveforms in the storage area by a constant factor given by the amplitude-multiplier information in the storage area. Pitch is controlled by the pitch information in storage area 2. However, each phoneme waveform in storage area 1 has the length of the pitch period of the speech from which it was extracted, and this does not necessarily match the pitch given by the control information.

Therefore, when the pitch given by the control information is shorter than the speech-segment length, the trailing end of the segment is cut off; when it is longer, the last value of the segment is extended. Either way, the result is used as a phoneme waveform with the same pitch length as the control information (see FIG. 3). However, if a phoneme waveform stored in storage area 1 is cut partway and has not yet decayed sufficiently, a discontinuity arises between it and the phoneme waveform joined to it, which adversely affects the synthesized speech. Conversely, if the phoneme waveform is lengthened, its spectrum is distorted, which degrades sound quality.
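The prior-art length adjustment described above can be sketched as follows. This is a minimal illustration only: the function name `fit_segment_to_pitch` and the sample values are hypothetical and do not come from the patent.

```python
def fit_segment_to_pitch(segment, pitch_len):
    """Prior-art adjustment: cut the tail, or hold the last value (illustrative)."""
    if pitch_len <= len(segment):
        # Control pitch is shorter than the segment: cut off the trailing end.
        return list(segment[:pitch_len])
    # Control pitch is longer than the segment: extend with the last value.
    padding = [segment[-1]] * (pitch_len - len(segment))
    return list(segment) + padding


segment = [0, 5, 9, 4, -3, -6, -2, 1]   # hypothetical one-pitch waveform
shortened = fit_segment_to_pitch(segment, 5)
lengthened = fit_segment_to_pitch(segment, 10)
```

In the shortened case the segment ends at -3 while the next segment starts at 0, which is the kind of joint discontinuity the text identifies as a drawback.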

(Object and Summary of the Invention) The object of the present invention is to overcome these drawbacks. In a speech synthesizer that superimposes DPCM-encoded phoneme waveforms of a plurality of channels, each offset from the next by one pitch period, the DPCM code reproduction is time-multiplexed, and the adder used to superimpose the phoneme waveforms is shared with the addition inside the DPCM code regenerator. The result is a speech synthesizer that synthesizes natural, high-quality speech with a simple circuit configuration. The invention is described in detail below.

(Premise of the Invention) As stated above, speech is produced when the impulse stream from the vocal cords resonates, and this process can be modeled by an electric circuit.

That is, speech can be regarded as a superposition of resonant-filter output waveforms, each of which is the response to a single impulse emitted by an excitation circuit corresponding to the vocal cords (such a response is hereinafter called a phoneme waveform). This is explained with reference to FIG. 4.

In FIG. 4, 10 is the impulse train emitted by the excitation circuit. The time interval between successive impulses is the pitch period. 11 is the synthesized speech output waveform obtained by driving the resonant filter with the impulse train emitted by the excitation circuit.

12 is the phoneme waveform obtained when the resonant filter is driven by impulse P1 of the impulse train emitted by the excitation circuit.

Likewise, 13 to 17 are the phoneme waveforms obtained when the resonant filter is driven by impulses P2 to P6, respectively.

Since the driving impulse train 10 emitted by the excitation circuit is the sum of impulses P1 to P6, the superposition principle implies that the synthesized speech output waveform 11 is obtained by adding the phoneme waveforms 12 to 17.
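The superposition argument can be illustrated with a short overlap-add sketch. It is only a sketch under simplifying assumptions: a single phoneme waveform is reused for every impulse, and `overlap_add` and the sample values are hypothetical names and data, not taken from the patent.

```python
def overlap_add(phoneme, start_times, total_len):
    """Sum copies of one phoneme waveform, each shifted to an impulse time."""
    out = [0] * total_len
    for t0 in start_times:
        for k, value in enumerate(phoneme):
            if t0 + k < total_len:
                out[t0 + k] += value
    return out


phoneme = [0, 8, 4, 2, 1, 0]        # hypothetical decaying response to one impulse
impulse_times = [0, 3, 6]           # impulses spaced one pitch period (3 samples) apart
output = overlap_add(phoneme, impulse_times, 12)
```

Changing the spacing of `impulse_times` changes the pitch of `output` without altering the stored phoneme waveform itself.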

The phoneme waveform 12 shown in FIG. 4 is 0 before time point t1. After time point t1 it decays with time and becomes 0 at infinite time. In practice, for speech waveforms, the phoneme waveform can be regarded as almost 0 once 16 milliseconds have elapsed from the excitation point (t1). Reproduction of phoneme waveform 12 therefore only needs to cover the 16 milliseconds following the excitation time point (t1).

However, starting reproduction of phoneme waveform 13 at excitation time point t2 requires multiplexed reproduction, because reproduction of phoneme waveform 12 has not yet finished. The required waveform-reproduction multiplicity n is given by equation (1). For ordinary speech, n of about 4 is sufficient, so the following description assumes n = 4.
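Equation (1) itself does not survive in this text. One plausible reading, offered here purely as an assumption, is that n is the number of phoneme waveforms that can overlap at once, that is, the reproduction duration divided by the shortest pitch period, rounded up. The function name and the pitch values below are hypothetical.

```python
import math


def required_multiplicity(reproduction_ms, min_pitch_ms):
    """Assumed form of the multiplicity: how many waveforms can overlap at once."""
    return math.ceil(reproduction_ms / min_pitch_ms)


# With the 16 ms reproduction window from the text and an assumed shortest
# pitch period of 4 ms (250 Hz), four overlapping reproductions are needed.
n = required_multiplicity(16, 4)
```

Under this assumed form, n = 4 covers pitch periods down to 4 milliseconds, which is consistent with the text's remark that about 4 is sufficient for ordinary speech.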

Next, the DPCM code, which has good coding efficiency, is used as the code for reproducing the phoneme waveforms. A regenerator that reproduces a waveform from a sequence of DPCM codes is obtained from the DPCM decoder proposed in Japanese Patent Application No. 55-109800 by holding constant the pointer value that addresses the quantization memory supplying the quantization step size; a detailed description is therefore omitted.

In differential decoding such as DPCM code reproduction, if reproduction is stopped after 16 milliseconds (128 sample periods), the final output value is held. Therefore, to satisfy the phoneme-waveform conditions, namely (Condition 1) the waveform is 0 before the excitation time point and (Condition 2) the waveform is 0 at infinite time, the initial reproduction value must be 0 and the final reproduction output value must be 0. Even if the initial value is 0, the final reproduced value is not normally 0. The DPCM code sequence must therefore be chosen so that the final reproduced value is 0.
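The requirement on the final value follows directly from how differential decoding accumulates. The sketch below assumes, for simplicity, that the codes have already been converted to signed difference values; the quantization-step lookup of the actual decoder is not modeled, and the function names are hypothetical.

```python
def dpcm_decode(diffs, initial=0):
    """Rebuild a waveform by accumulating signed differences (simplified DPCM)."""
    value = initial
    out = []
    for d in diffs:
        value += d
        out.append(value)
    return out


def ends_at_zero(diffs, initial=0):
    """The held final value is 0 only if the differences cancel the initial value."""
    return initial + sum(diffs) == 0


diffs = [3, 4, -2, -4, -1]          # hypothetical difference sequence
waveform = dpcm_decode(diffs)
```

A sequence whose differences do not sum to zero leaves a nonzero value held after reproduction stops, which is why the code sequence has to be prepared so that the final value is 0.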

(Embodiment of the Invention) FIG. 5 shows one embodiment of the present invention.

Here, the term "channel" or "ch" is used to number the processes that are executed by time division within one sample period.

The parts in FIG. 5 are as follows: 100, first-channel input register; 101, second-channel input register; 102, third-channel input register; 103, fourth-channel input register; 104 and 105, selectors; 106, register holding the DPCM code data Ln; 109, selector; 110, selector; 111, first-channel pointer register; 112, second-channel pointer register; 113, third-channel pointer register; 114, fourth-channel pointer register; 115, selector; 117, quantization memory holding the quantization step values X; 118, shift register; 119, adder/subtractor; 120, register; 121, register that outputs the synthesized speech; 122, output terminal; 135, first-channel down counter; 136, second-channel down counter; 137, third-channel down counter; 138, fourth-channel down counter; 139, first-channel decoder; 140, second-channel decoder; 141, third-channel decoder; 142, fourth-channel decoder; 143, BUSY signal of each channel; 145, start signal for waveform reproduction; 150, controller. The controller 150 is connected to each of these circuits, from the first-channel input register 100 and the second-channel input register 101 through the third-channel decoder 141 and the fourth-channel decoder 142, and controls their operation; these connections are omitted from FIG. 5 to keep the drawing from becoming cluttered.

FIG. 6 shows the data used in the present invention to reproduce a phoneme waveform. It consists of a pointer value DPn and 128 DPCM codes Ln1, Ln2, ..., Ln64. The pointer value data DPn is the data loaded into the pointer when waveform reproduction starts.

FIG. 7 shows the relationship between time and the reproduction processing assigned to each channel.

In FIG. 7, 200 is the synthesized output waveform, 201 is the phoneme waveform reproduced by the reproduction processing assigned to channel 1, 202 is the start signal ST1 for channel 1, and 203 is the BUSY signal BUSY1 for channel 1.

Likewise, 205, 209 and 213 are the phoneme waveforms reproduced by the reproduction processing assigned to channels 2, 3 and 4; 206, 210 and 214 are the start signals ST2, ST3 and ST4 for channels 2, 3 and 4; and 207, 211 and 215 are the BUSY signals BUSY2, BUSY3 and BUSY4 for channels 2, 3 and 4.

The embodiment of the present invention is described in detail below with reference to FIGS. 5 to 7.

At time point t1, a start signal ST1 is applied to the controller 150 from outside, and channel 1 begins reproducing the phoneme waveform corresponding to 12 in FIG. 4. At this time the controller 150 sets the down counter 135 to 128, the value corresponding to the 16 milliseconds mentioned above (with an output sampling period of 125 microseconds, 16 milliseconds is 128 sampling periods). The decoder 139 outputs "1" as the BUSY1 signal whenever the output of the down counter 135 is nonzero.

Thereafter, in every sampling period (125 microseconds), data for waveform reproduction is read from the input register 100 and DPCM reproduction is performed. Each time a reproduction step finishes, the down counter 135 is decremented by 1. When the 128th reproduction step finishes, the down counter 135 reaches 0 and the BUSY1 signal becomes "0".

Table 1 shows the DPCM waveform-reproduction arithmetic (excluding the pointer-update arithmetic) that the controller 150 carries out in the register 120 and the adder/subtractor 119, using the DPCM code Ln stored in each channel's input register (100, 101, 102, 103) and supplied through the selector 104, together with the values shifted down by the shift register 118 ([X] and its shifted-down versions, where [X] is the output of the quantization memory 117).

Table 1

Similarly, at time point t2, a start signal ST2 is applied to the controller 150 from outside, and channel 2 begins reproducing the phoneme waveform corresponding to 13 in FIG. 4. At this time the value 128 is set in the down counter 136 and the BUSY2 signal becomes "1". The same reproduction processing as for channel 1 then follows; after 128 sampling periods the down counter 136 reaches 0 and the BUSY2 signal becomes "0".

Thereafter the same control is performed in channel 3 from time point t3, in channel 4 from time point t4, and again in channel 1 from time point t5.

In this way reproduction is carried out by four channels and the synthesized speech output waveform is output every sampling period, with the reproduction processing of the channels performed by time division within one sampling period. The timing of the processing within one sampling period is shown in FIG. 8(a), and the corresponding flowcharts are shown in FIGS. 8(b) and 8(c).

Processing within one sampling period is divided into five cycles that are executed in sequence. Most of the circuitry needed for the DPCM reproduction performed in each cycle is shared: every component in FIG. 5 whose name does not include a channel number is used in common by all cycles. These components are collectively called the "shared section".

(Cycle 1) The waveform reproduction processing assigned to channel 1 is performed using the input register 100, the pointer register 111, the down counter 135, the decoder 139 and the shared section, and the result is stored in the register 120.

(Cycle 2) The waveform reproduction processing assigned to channel 2 is performed using the input register 101, the pointer register 112, the down counter 136, the decoder 140 and the shared section, and the result is stored in the register 120.

(Cycle 3) The waveform reproduction processing assigned to channel 3 is performed using the input register 102, the pointer register 113, the down counter 137, the decoder 141 and the shared section, and the result is stored in the register 120.

(Cycle 4) The waveform reproduction processing assigned to channel 4 is performed using the input register 103, the pointer register 114, the down counter 138, the decoder 142 and the shared section, and the result is stored in the register 120.

(Cycle 5) The value of the register 120 is stored in the register 121 as the PCM data for the synthesized speech output.
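The five cycles can be sketched in software as a loop over channels that all add into one accumulator. The sketch rests on one reading of the text, stated here as an assumption: because differential decoding is a running sum, a single register (120) that is never cleared can accumulate the decoded differences of every busy channel, and its value is then the sum of all channel waveforms. The codes are again treated as ready-made signed differences, every channel replays the same sequence, and `run_synth` is a hypothetical name.

```python
def run_synth(diffs, start_times, n_samples, n_channels=4):
    """Time-division sketch: one shared accumulator serves all channels."""
    positions = [None] * n_channels   # per-channel read position; None means BUSY = 0
    reg120 = 0                        # shared accumulator, assumed never cleared
    next_channel = 0
    output = []
    for t in range(n_samples):
        if t in start_times:
            # Start signal: hand the new phoneme to the next channel in rotation.
            positions[next_channel] = 0
            next_channel = (next_channel + 1) % n_channels
        for ch in range(n_channels):  # cycles 1 to 4, one per channel
            pos = positions[ch]
            if pos is not None:
                reg120 += diffs[pos]
                pos += 1
                positions[ch] = pos if pos < len(diffs) else None
        output.append(reg120)         # cycle 5: copy to the output register (121)
    return output


diffs = [3, 4, -2, -4, -1]            # decodes to [3, 7, 5, 1, 0]
out = run_synth(diffs, start_times={0, 2}, n_samples=8)
```

The result equals the sample-by-sample sum of two copies of the decoded waveform, the second delayed by two samples, and it returns to 0 only because the difference sequence sums to zero.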

The control in these five cycles is shown in the flowcharts of FIGS. 8(b) and 8(c).

Thus, in this embodiment, the phoneme-waveform reproduction from the DPCM codes assigned to each channel is carried out by time division, using the per-channel components together with the commonly used shared section.

Moreover, the successive-addition means (register 120 and adder/subtractor 119) used to reproduce each channel's waveform also serves as the addition means that superimposes the phoneme waveforms, so the synthesizer can be realized with a small circuit.

In addition, the pitch period of the synthesized speech output can easily be changed by changing the interval between the start signals of the channels.

Furthermore, because every phoneme waveform starts at 0 and ends at 0, adding the phoneme waveforms does not cause sound-quality degradation such as waveform discontinuities.

(Effects of the Invention) As described above, the present invention makes it possible to build, with a simple circuit configuration, a speech synthesis circuit whose pitch period can be controlled and which outputs high-quality synthesized speech.

[Brief Description of the Drawings]

FIG. 1 shows a speech waveform. FIG. 2 shows the configuration of a conventional speech synthesizer. FIG. 3 shows the waveforms used when the pitch is changed. FIG. 4 illustrates the superposition of phoneme waveforms. FIG. 5 shows one embodiment of the present invention. FIG. 6 shows the data format of a phoneme waveform. FIG. 7 shows the timing relationship between the phoneme waveforms, the ST signals and the BUSY signals. FIG. 8(a) shows the timing of the processing within one sample period, and FIGS. 8(b) and 8(c) are flowcharts of the processing within one sample period.

100: first-channel input register; 101: second-channel input register; 102: third-channel input register; 103: fourth-channel input register; 104: selector; 105: selector; 106: register; 109: selector; 110: selector; 111: first-channel pointer register; 112: second-channel pointer register; 113: third-channel pointer register; 114: fourth-channel pointer register; 115: selector; 117: quantization memory; 118: shift register; 119: adder/subtractor; 120: register; 121: register; 122: output terminal; 135: first-channel down counter; 136: second-channel down counter; 137: third-channel down counter; 138: fourth-channel down counter; 139: first-channel decoder; 140: second-channel decoder; 141: third-channel decoder; 142: fourth-channel decoder; 143: BUSY signal; 145: start signal; 150: controller.

Patent applicant: Oki Electric Industry Co., Ltd.

Procedural Amendment (voluntary)

1. Indication of the case: Patent Application No. 167533 of 1983 (Showa 58)
2. Title of the invention: Speech synthesizer
3. Person making the amendment: Relationship to the case: patent applicant. Address: (105) 1-7-12 Toranomon, Minato-ku, Tokyo
4. Agent: Address: (105) 1-7-12 Toranomon, Minato-ku, Tokyo
6. Contents of the amendment: as per the attached sheet

6. Contents of the amendment
(1) On page 4, line 10 of the specification, "音素をする。" is corrected to "音素とする。" (a particle correction, so that the phrase reads "is used as the phoneme").
(2) On page 7, line 10 of the specification, "DPCM decoder" is corrected to "ADPCM decoder".
(3) On page 9, line 18 of the specification, "Ln64" is corrected to "Ln128".

Claims

[Claims]

1. A speech synthesizer in which, for each of a plurality of channels, unit data comprising a pointer initial value and a plurality of DPCM codes is input to reproduction means to reproduce the phoneme waveform of that channel, the phoneme waveforms of the channels, each shifted by a pitch period, are added by adding means, and the addition result is output as synthesized speech, characterized in that: the DPCM codes are set so that the initial value and the final value of one phoneme waveform coincide; the phoneme waveform of each channel is a waveform that has a fixed time length from the start of reproduction and whose initial value and final value are 0; the reproduction means is shared by the channels, and the reproduction processing for each channel is performed by time division within the sample time; and the adding means shares the register and the adder/subtractor used in the reproduction means and accumulates the phoneme waveforms.