JPS59128600A

JPS59128600A - Voice synthesizer

Info

Publication number: JPS59128600A
Application number: JP58004894A
Authority: JP
Inventors: 稔黒田; 糸山　博
Original assignee: Matsushita Electric Works Ltd
Current assignee: Panasonic Electric Works Co Ltd
Priority date: 1983-01-14
Filing date: 1983-01-14
Publication date: 1984-07-24
Also published as: JPH0325799B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔技術分野〕本発明は１チツプＬＳＩよりなる音声合成装置に関する
ものであり、マツサージ椅子や音声目覚時計、音声警報
器、音声時報装置などの各種の電気製品に組み込まれて
音声メツセージの出力を行なうような用途に使用される
ものである。[Detailed Description of the Invention] [Technical Field] The present invention relates to a speech synthesizer made of a one-chip LSI, which can be incorporated into various electrical products such as a Matusage chair, an audio alarm clock, an audio alarm, and an audio time signal device. It is used for purposes such as outputting voice messages.

[Background technology]

一般に音声の特徴を表わす特徴パラメータには、音の大
小を表わす振巾パラメータ（以下Ａパラメータと略称す
る）と、音の高低すなわち基本周期を表わすピッチパラ
メータ（以下Ｐパラメータと略称する）と、音の音色す
なわちスペクトル分布を表わすスペクトルパラメータ（
以下Ｓパラメータと略称する）とがある。したがって音
声を合成するには音声信号を音声周波数よりも十分高い
周波数を有するサンプリングパルスでサンプリングし、
各％徴パラメータを抽出して予めデータメ℃りに記憶さ
せ、データメ亡りから読み出された特徴パラメータに基
いて音源を駆動して音声を合成すれば良いことになる。In general, the characteristic parameters that represent the characteristics of speech include the amplitude parameter (hereinafter referred to as the A parameter) that represents the magnitude of the sound, the pitch parameter (hereinafter referred to as the P parameter) that represents the pitch of the sound, or the fundamental period, and the The spectral parameters that represent the timbre or spectral distribution of
(hereinafter abbreviated as S-parameter). Therefore, to synthesize speech, the speech signal is sampled with a sampling pulse having a frequency sufficiently higher than the speech frequency.
It is sufficient to extract each characteristic parameter and store it in advance in a data memory, and to synthesize speech by driving a sound source based on the characteristic parameters read from the data memory.

この種の音声合成装置では音声信号のサンづリング数を
多くすればするほど忠実な音声を合成できることＫなる
が、反面サンプリング数が多くなると音声合成データの
ヒツト数が増大して大きな容量のデータメモリが必要に
なるとともにデータ処理の回路構成が複雑になり、コス
トが高くなるという間唾がある。In this type of speech synthesis device, the greater the number of samplings of the speech signal, the more faithful the speech can be synthesized.However, on the other hand, as the number of samplings increases, the number of hits in the speech synthesis data increases, resulting in a large amount of data. This requires more memory, complicates the data processing circuitry, and increases costs.

従って従来の音声合成装置にあってはサンプリングパル
ス周波数（以下サンづリング周波数と略称スル）は人間
の声を忠実に再生するために最低必要な周波数に設定さ
れており、通常・サンづリング周波数は８または１０Ｋ
Ｈｚ（サンプリング周期１２５μｓまたは１００μｓ）
に設定する。ところで、サンプリングパルスにて音声信
号をサンづリングしてＡ、Ｐ、Ｓパラメータよりなる特
徴パラメータを抽出してメ七りに記憶させ、メモリに記
憶させた特徴パラメータをサンづリンジバルスに等しい
周期の同期Ｊ＼ルスにて読み出して音声を合成する場合
、Ｐパラメータに基いて再生される音声の基本周期はサ
ンプリング周波数によって決められる離散値しかとり得
ない。すなわち、サンづリング周期を１００μＳ、Ｐパ
ラメータをＰｉ（整数値）とすれば再生される基本周期
【はｔ　＝　１００Ｐｉ　Ｘ　１０　（ｓｅｃ）（但しＰ１
玉１．２．３・・・・・・）となって再生し得る音声周
波数は離散値となる。Therefore, in conventional speech synthesizers, the sampling pulse frequency (hereinafter referred to as sampling frequency) is set to the minimum frequency required to faithfully reproduce the human voice, and the sampling pulse frequency (hereinafter referred to as sampling frequency) is set to the minimum frequency required to faithfully reproduce the human voice. is 8 or 10K
Hz (sampling period 125μs or 100μs)
Set to . By the way, the audio signal is sampled using a sampling pulse to extract feature parameters consisting of A, P, and S parameters and stored in a memory. When reading and synthesizing audio using synchronous J\rus, the fundamental period of the audio reproduced based on the P parameter can only take discrete values determined by the sampling frequency. In other words, if the sampling period is 100 μS and the P parameter is Pi (an integer value), then the fundamental period to be reproduced [is t = 100Pi x 10 (sec) (where P1
The audio frequencies that can be reproduced as 1.2.3...) are discrete values.

このような離散的な音声周波数しか発生できなくとも人
間の声などは比較的忠実に再生できる。しかしながら音
階周波数で構成されたメロディ音を再生する場合、各音
階（ド・し・三−−−−−−）の音階、周波数は　　　
−上記離散値に含まれていないものが多く、メロディ音
をこのような離散的な音声周波数を用いて再生すれば著
しく音程のずれたメロディ音が再生されるという問題が
あった。Even if only such discrete audio frequencies can be generated, human voices and the like can be reproduced with relative fidelity. However, when playing a melody sound composed of scale frequencies, the scale and frequency of each scale (do, shi, 3) are
- There are many things that are not included in the above-mentioned discrete values, and there is a problem that if a melody sound is reproduced using such discrete audio frequencies, a melody sound with a significantly shifted pitch will be reproduced.

そこで従来、特願昭５６−６９９５０号に示されている
ように、メ０：ｐイ音や歌唱のように音階周波数に応じ
て音の高低が変化する場合には、別に設けた音階パルス
発生回路によって音源の基本周期を決定し、話し言葉の
ように均一に連続的に音の高低が変化するような音声を
合成する場合には、上述の数式で定まる離散的な基本周
期ｔを音源の基本周期とするようにしだ音声合成装置を
開発したものであるが、かかる従来例にあつτ１よメロ
ディ音と音声とを区別するた、めに入力パラメータの中
からメロディコードを検出するメロディコード検出回路
を音声合成装置の内に設けることが必要となり〜チップ
面積が余分に必要になり、データメモリにもメロダイコ
ードを余分に記憶させる必要が生じるという問題があっ
た。またかかるメロディコード検出回路を省略しようと
すると、メロディ音と音声とを区別するための入力ピン
が１個余分に必要になるという問題があった。Therefore, as shown in Japanese Patent Application No. 56-69950, when the pitch of the sound changes depending on the scale frequency, such as in me0:p and singing, a separate scale pulse generator is used. When synthesizing speech in which the fundamental period of the sound source is determined by a circuit and the pitch of the sound changes uniformly and continuously, such as spoken words, the discrete fundamental period t determined by the above formula is determined as the fundamental period of the sound source. A melody code detection circuit for detecting a melody code from input parameters has been developed to distinguish between melody sounds and voices. It is necessary to provide this in the speech synthesizer, requiring an extra chip area, and there is a problem in that it is necessary to store an extra melody code in the data memory. Furthermore, if such a melody code detection circuit is omitted, there is a problem in that one additional input pin is required to distinguish between melody sounds and voices.

また全く別の従来例として、特開昭５２−２８２１１号
公報に開示されているように、男声奮の音源データを記
憶した音源ＲＯＭと、女声前の音源データを記憶した音
源ＲＯＭとを切り換えて使用するようにした音声合成装
置が開発され゛ているが、かかる音源ＲＯＭを切り換え
るためには音源ＲＯＭ選択用の音源選択入力ピンを１個
余分に必要とする。ところでかかる従来例において男声
音の音源データと女声前の音源データの他に、メロディ
音の音源データをも他の音源ＲＯＭに記憶させるように
して９．切り換えて使用できるようにすれば合成される
メロディ音を美しい音にすることができるはずであるが
、この場合メロディ音と音声とを区別するためにメロデ
ィコード検出回路を設けるか・あるいはメロディ音と音
声とを区別する入力ピンを１個余分に設ける必要があっ
た。As a completely different conventional example, as disclosed in Japanese Patent Application Laid-Open No. 52-28211, a sound source ROM storing sound source data for a male voice and a sound source ROM storing sound source data for a female voice are switched. A speech synthesizer has been developed for this purpose, but in order to switch between such sound source ROMs, one additional sound source selection input pin for selecting the sound source ROM is required. By the way, in this conventional example, in addition to the sound source data of the male voice and the sound source data before the female voice, the sound source data of the melody sound is also stored in another sound source ROM.9. If the melody sound can be switched and used, it should be possible to make the synthesized melody sound a beautiful sound, but in this case, a melody code detection circuit should be provided to distinguish between the melody sound and the voice, or a melody code detection circuit should be provided to distinguish between the melody sound and the voice. It was necessary to provide one extra input pin to distinguish it from audio.

[Purpose of the invention]

本発明は上述のような問題点を解決するために為された
ものであり、メロディコード検出回路を必要とせず・　
１個の入力ピンによりて男声前と女声音とメロディ音と
のうち、いずれか２つを選択的に使用できるようにした
音声合成゛装置を提供することを目的とするものである
。The present invention was made to solve the above-mentioned problems, and does not require a melody code detection circuit.
It is an object of the present invention to provide a speech synthesis device that can selectively use any two of pre-male voice, female voice, and melody sound using one input pin.

[Disclosure of the invention]

（構　成）第１図は本発明の特許請求の範囲に記載された構成を示
すいわゆるクレーム対応図である。(Structure) FIG. 1 is a so-called claim correspondence diagram showing the structure recited in the claims of the present invention.

本発明は同図に示すように、音声信号を音声周波波数よ
りも高い周波数のサンづリンジパルスにてサンプリンク
し、振巾パラメータＡ・ピッチパラメータＰおよびスペ
クトルパラメータＳｉ：すなる特徴パラメータを抽出し
てデーモノ七りｆｔｌ　（２１ｔａｌに記憶させ〜デー
タメ七りｆｌ）　＋２＋　＋３）から読み出された特徴
パラメータに基いて音源を制御して音声を合成するよう
にした１チツプＬＳＩよりなる音声合成装置において、
サンづリンジパルスと等しい周期の同期パルスＣＫをカ
ウントして第１および第２の音源ＲＯＭ　（４）（５）
から音源データを読み出すアドレスカウンタ（６）と、
アドレスカウンタ（６）の値がピッチパラメータＰに一
致したとき一致信号を出力する一致回ｗＫｒ（７）と、
ピッチパラメータＰに対応する音階周波数のパルスを発
生する音階パルス発生回路（８）と、音階パルス発生回
路（８）からのパルス出力が得られた直後の同期パルス
ＣＫを出力するりセットパルス発生回路（９）と、アド
レスカウンタ（６）のリセ・ソト信号をリセットパ１し
ス発生回路（９）の出力あるいは一致回路（７）の出力
とに切換える切換回路（ｌＯ）と、音源選択入力ピン（
１１）に接続されて、第１および簗２の音源ＲＯＭ　＋
４）＋５）を切換える音源切換回路α匂とを設けて、第
１および第２の音源ＲＯＭ　＋４）　（６＋にメロディ
音および音声の各音源データを夫々マスク記憶せしめた
ときＫは、ｊ￥！１の音源ＲＯＭ　［４）の選択時にの
み切換回路（ｌＯ）をリセットパルス発生回路（９）の
側に切換えるように、音源選択入力ピン（ｌｔ）を切換
回路（ｌＯ）の切換入力に接続し、第１および第２の音
源ＲＯＭ（４）（６）に共に音声の音源データをマスク
記憶せしめたときには、切換回路（１０）を常時一致回
路（７）の出力の側に接続するよう罠、切換回路（１０
）の切換入力のマスクパターンを選択可能としたもので
ある。As shown in the figure, the present invention samples and links an audio signal with a sampling pulse having a frequency higher than the audio frequency, and extracts the amplitude parameter A, the pitch parameter P, and the spectral parameter Si: In a speech synthesis device consisting of a 1-chip LSI that controls a sound source and synthesizes speech based on characteristic parameters read out from a data memory (stored in 21tal ~ data memory fl) +2+ +3). ,
The first and second sound source ROMs are counted by counting synchronizing pulses CK with the same period as the ringing pulse (4) (5)
an address counter (6) for reading sound source data from the
a matching time wKr (7) that outputs a matching signal when the value of the address counter (6) matches the pitch parameter P;
A scale pulse generation circuit (8) that generates a pulse with a scale frequency corresponding to the pitch parameter P, and a set pulse generation circuit that outputs a synchronization pulse CK immediately after the pulse output from the scale pulse generation circuit (8) is obtained. (9), a switching circuit (lO) that switches the reset/sort signal of the address counter (6) to the output of the reset pass generation circuit (9) or the output of the matching circuit (7), and the sound source selection input pin (
11), the first and second sound source ROM +
4)+5) is provided, and when the first and second sound source ROMs are mask-stored in each sound source data of the melody sound and the voice in the 6+, K is j\! Connect the sound source selection input pin (lt) to the switching input of the switching circuit (lO) so that the switching circuit (lO) is switched to the reset pulse generation circuit (9) only when sound source ROM [4] is selected. , when both the first and second sound source ROMs (4) and (6) have mask-stored audio sound source data, a trap is provided so that the switching circuit (10) is always connected to the output side of the matching circuit (7); Switching circuit (10
), the mask pattern of the switching input can be selected.

かかる構成を有する本発明の音声合成装置においては、
次の（Ｉ）〜Ｑｌの３つのマスクパターンのうち、いず
れか１つを選択して使用するようにしている。In the speech synthesis device of the present invention having such a configuration,
One of the following three mask patterns (I) to Ql is selected and used.

（＋）第１の音源Ｒｏ　Ｍ　Ｉ＋）にメロディ音の音源
データをマスク記憶させ、第２の音源ＲＯＭ　（５１に
男声前の音源データを記憶させた場合。この場合には、
音源選択入力ピン（１１）を切換回路叫の切換入力に接
続して、第１の音源ＲＯＭ（４）の選択時にはりセット
パルス発生回路（９）の出力によってアドレスカウンタ
（６）をリセットし、まだ篇２の音源ＲＯＭ（５）の選
択時には一致回路（７）の一致検出出力にょうてアドレ
スカウンタ（６）をリセットするものである。これによ
って、メロディ音の合成時には音階周波数に合った高さ
の音を出力できるようにし、また男声前の合成時には音
階周波数に関係しない離散的な基本周期の音を出力でき
るようにするものである。(+) When the first sound source Ro M I+) stores the sound source data of the melody sound as a mask, and the second sound source ROM (51) stores the sound source data before the male voice. In this case,
Connect the sound source selection input pin (11) to the switching input of the switching circuit, reset the address counter (6) by the output of the set pulse generation circuit (9) when the first sound source ROM (4) is selected, When the second sound source ROM (5) is selected, the address counter (6) is reset based on the coincidence detection output of the coincidence circuit (7). As a result, when synthesizing melody sounds, it is possible to output a sound with a pitch that matches the scale frequency, and when synthesizing a male voice, it is possible to output sounds with a discrete fundamental period that is not related to the scale frequency. .

（ＩＩ）ＩＧｌの音源ＲＯＭ　［４）にメロディ音の音
源データをマスク記憶させ、第２の音源ＲＯＭ　（ａ）
に女声音の音源データをマスク記憶させた場合０この場
合にも、音源選択入力ピン（１１）を切換回路（１０）
の切換入力に接続するものであり、第１の音源ＲＯＭ（
４）の選択時には、リセットパルス発生回路（９）の出
力によってアドレスカウンタ（６）をリセットし、また
第２の音源ＲＯＭ　＋５）の選択時には一致回路（７）
の一致検出出力によってアドレスカウンタ（６）をリセ
ットするものである。(II) The sound source data of the melody sound is mask-stored in the IGl sound source ROM [4], and the second sound source ROM (a)
In this case, the sound source selection input pin (11) is connected to the switching circuit (10).
is connected to the switching input of the first sound source ROM (
When selecting 4), the address counter (6) is reset by the output of the reset pulse generating circuit (9), and when selecting the second sound source ROM +5), the matching circuit (7) is reset.
The address counter (6) is reset by the match detection output.

（ｌｉｔ）　　第１の音源ＲＯＭ　ｆ＋）に男声前の音
源データをマスク記憶させ、第２の音源ＲＯＭ（５１に
女声音の音源データをマスク記憶させた場合。この場合
には、切換回路（ｌＯ）の切換入力と音源選択ピン（！
ｌ）との接続は遮断し、上記切換入力を接地しておくも
のである。そしてこれによって切換回路（ｌＯ）を常時
−数回路（７）の出力の側に接続しておくものである。(lit) When the first sound source ROM (f+) stores the sound source data before the male voice as a mask, and the second sound source ROM (51) stores the sound source data for the female voice as a mask. In this case, the switching circuit (lO ) switching input and sound source selection pin (!
1) is cut off, and the switching input is grounded. As a result, the switching circuit (lO) is always connected to the output side of the minus number circuit (7).

したがってこの場合には、音階パルス発生回路（８）と
リセットパルス発生回路（９）とは使用されないことに
なる。Therefore, in this case, the scale pulse generation circuit (8) and the reset pulse generation circuit (9) are not used.

本発明にあっては以上の（１）〜（ＩＩＩ）の３つのマ
スクパターンのうち、いずれか１つを選択して使用する
ことＫより、（１）男声前とメロディ音とを入力ピンに
て切換可能な音声合１ｉＬＳＩ、（ＩＩ）女声音とメロ
ディ音とを入力ピンにて切換可能な音声合成ＬＳＩ、お
よび（ＩＩＩ）男声前と女声音とを入力ピンにて切換可
能な音声合成ＬＳＩのうちのいずれか１つを選択できる
ものであり、しかもメロディ音の合成時には自動的にア
ドレスカウンタ（６）のリセツ、ト信号を切シ換えて音
階周波数に合致したメロディ音が出力されるようにする
ことができるものである。In the present invention, by selecting and using any one of the three mask patterns (1) to (III) above, (1) the male voice front and the melody sound are input to the input pin. (II) A voice synthesis LSI that can switch between female and melody sounds using an input pin, and (III) A voice synthesis LSI that can switch between pre-male and female voices using an input pin. One of these can be selected, and when melody sounds are synthesized, the address counter (6) is automatically reset and the G signal is switched to output a melody sound that matches the scale frequency. It is something that can be done.

（実施例）第２図は本発明の一実施例に係るＰＡＲＣＯＲ型の音声
合成装置のブロック図である。ＰＡＲＣＯＲ型を声合成
方式は第３図に示すように音声信号（Ｖｓ）をサンプリ
ンタＪ＼ルスにより適当周期（ｔｏ）でサンプリングし
・サンプリングされたサンプリング匝ＸｉとＸｌ−ｐの
間にある（Ｐ−１＞個のサンづリンク値による相関関係
を除外し、Ｘｔとｘｔ−ｐとの相関関係のみを抽出した
ＰＡＲＣＯＲ係数（部分自己相関係数：以下にパラメー
タと略称する）をＳパラメータとして音声を合成するも
のであり、Ｋパラメータは音声がほぼ定常状態とみなせ
る１フレーム（５〜２０ｍ５ｅｃ）において、適当周期
（ｔｏ）　（約１００μｓｅｃ　）毎に音声信号ｆｆ５
）のサンづリンクを行ない、隣り合うサンプル値間の相
関係数をに１とし、複数間隔離れたサンＪＪｒＬＩ値間
では、その間に挾まれたサンプル値による影響を最小２
乗誤差による線形予測によって求め、それらを差引いて
できる相関係数をに！〜に＋ｏとしたものである。この
にパラメータはに１、Ｋｘ−ＫｓのようにＸｔＫ近い点
との部分自己相関関係を表わす係数にはスペクトル分布
に関する情報が豊富に含まれているが、Ｋ８、Ｋｓｓ　
ＫｓｏのようなＸｔから遠い点との部分自己相関関係忙
はスペクトル分布に関する情報があまり含まれていない
ので、低次のにパラメータに多数の量子化ピットを割シ
当て、高次のにパラメータには少数の量子化ピットを割
シ当てることＫよシピット数を節減して冗長度を小さく
するほうが効果的である。したがってＰＡＲＣＯＲ方式
はＳパラメータとして自己相関係数を用いて各係数に同
一ピット数を割り当てるようにした自己相関係数方式に
比べて帯域圧縮率がすぐれているものである。(Embodiment) FIG. 2 is a block diagram of a PARCOR type speech synthesis device according to an embodiment of the present invention. As shown in Figure 3, the PARCOR type voice synthesis method samples the voice signal (Vs) with a sampler J\Rus at an appropriate period (to), and the sampled signal is between Xi and Xl-p ( The PARCOR coefficient (partial autocorrelation coefficient: hereinafter abbreviated as parameter) obtained by excluding the correlation due to the P-1> sun link values and extracting only the correlation between Xt and xt-p is used as the S parameter. It synthesizes audio, and the K parameter is the audio signal ff5 every appropriate period (to) (approximately 100 μsec) in one frame (5 to 20 m5 ec) where the audio can be considered to be in an almost steady state.
), the correlation coefficient between adjacent sample values is set to 1, and the influence of sample values sandwiched between them is set to a minimum of 2 between sample values that are separated by multiple intervals.
Find the correlation coefficient by linear prediction using the multiplicative error and subtract them! +o is added to ~. In this case, the parameter is 1, and the coefficient representing the partial autocorrelation with points near XtK, such as Kx - Ks, contains a wealth of information regarding the spectral distribution,
Partial autocorrelation with points far from Xt, such as Kso, does not contain much information about the spectral distribution, so a large number of quantization pits are assigned to the low-order parameters, and a large number of quantization pits are assigned to the high-order parameters. It is more effective to allocate a small number of quantization pits to reduce the number of pits and reduce redundancy. Therefore, the PARCOR method has a better band compression rate than the autocorrelation coefficient method, which uses an autocorrelation coefficient as an S parameter and allocates the same number of pits to each coefficient.

通常各Ａ、Ｐ、にパラメータは圧縮されて記憶あるいは
伝送され、Ａパラメータに対して５ピツト・Ｐパラメー
タに対して６じ〜リド、Ｋパラメータの各係数ＫＩＳＫ
ｚ・・・Ｋ１゜に対して７．６．５．４．４．４．３．
３．３．３ピツト等のように割り当てる。Normally, the parameters for each A, P, and P are compressed and stored or transmitted, and each coefficient KISK is 5 pits for the A parameter, 6 pits for the P parameter, and 6 coefficients for the K parameter.
7.6.5.4.4.4.3 for z...K1°
3.3.3 Assign as Pitt et al.

第２図に示すｐＡＲｃＯＲ型の音声合成装置は音声・メ
ロディを圧縮された特徴パラメータとして記憶するデー
タメモリ（財）を具備した制御用ＩＣ（Ａ）と、音声合
成用ＩＣ（点線部Ａ、Ｂを除いた部分）とで構成され、
両ＩＣ間でピットシリアルにデータの受渡しを行なうよ
うにしたものである。ところで、音声の特徴パラメータ
はすべて再生用ＲＯＭＨ内に１０ピツトのデータとして
記憶されておシ、各特徴パラメータに割り当てられるデ
ータの個数は、その特徴パラメータが音質に寄与する度
合に応じて最適に配分されている。例えばＡ　ｌｌラメ
ータの場合１０ピツトで表現されるデータが３２個記憶
されている。したがってＡパラメータの任意のデータを
アクセスするときに必要とされる相対アドレスのヒツト
数は５ヒツトである。この相対アドレスは特徴パラメー
タを必要最小限に圧縮して表現したものであるので圧縮
ノ＼ラメータと呼ばれる。これに対して再生用ＲｏＭＨ
内に記憶されている実際の特徴パラメータは再生パラメ
ータと呼ばれる。上述した所から明らかなように再生パ
ラメータのヒツト数はＡ−Ｐ、、Ｋｔ〜Ｋｘｏの各特徴
パラメータについてすべて共通に１０ピツトであるが、
圧縮パラメータのピット数はＡ、Ｐ・Ｋ１〜ＫＩＯの各
パラメータについて異なるものであり、たとえばそれぞ
れ５．６．３．３Ｓ３．３．４．４．４．５．６．７ピ
ツト（合計５３ピツト）である。かかる圧縮パラメータ
は音声信号がほぼ定常状態とみなし得る５〜２９　ｍ５
ｅｃ　（ｌフレーム）ごとに１組（−５３ヒツト）抽出
されたものであるから、高々２６５０ヒツト／秒でデー
タを処理することにより音声信号を再生することができ
、無音区間やリヒート区間をも考慮に入れると実際には
１６００ピット／秒程度で音声信号を再生することがで
きるものである。ところで、実施例にあっては話し言葉
のように均一に連続的に音の高低が変化する音声を合成
する場合とメロディ音や歌唱のように離散的に続く音声
を合成する場合とにおける基本周期発生方式を変更する
ようなっており、メロディ音を再生する場合、各音階音
の基本周期に等しい基本周期で音源を駆動してメロディ
音を合成するように構成されている。The pARcOR type speech synthesis device shown in Fig. 2 includes a control IC (A) equipped with a data memory for storing voices and melodies as compressed feature parameters, and a speech synthesis IC (dotted line areas A and B). (excluding the part) and
Data is transferred between both ICs in a pit serial manner. By the way, all voice characteristic parameters are stored in the playback ROMH as 10 pit data, and the number of data allocated to each characteristic parameter is optimally distributed according to the degree to which that characteristic parameter contributes to sound quality. has been done. For example, in the case of All parameters, 32 pieces of data expressed by 10 pits are stored. Therefore, the number of relative address hits required when accessing arbitrary data of the A parameter is 5 hits. This relative address is called a compressed parameter because it represents the characteristic parameter compressed to the minimum necessary size. On the other hand, RoMH for playback
The actual feature parameters stored within are called playback parameters. As is clear from the above, the number of hits of the playback parameters is 10 pits in common for each characteristic parameter of A-P, , Kt to Kxo.
The number of pits of the compression parameter is different for each parameter of A, P, K1 to KIO, for example, 5.6.3.3S3.3.4.4.4.5.6.7 pits (53 pits in total). ). Such a compression parameter is 5 to 29 m5, which allows the audio signal to be considered to be in an approximately steady state.
Since one set (-53 hits) is extracted for every ec (l frame), the audio signal can be reproduced by processing the data at a rate of at most 2650 hits/second, and even silent sections and reheat sections can be reproduced. Taking this into consideration, it is actually possible to reproduce audio signals at about 1600 pits/second. By the way, in the embodiment, the fundamental period generation is explained in the case of synthesizing speech in which the pitch changes evenly and continuously, such as spoken words, and in the case of synthesizing speech that continues discretely, such as melody sounds and singing. The method has been changed, and when reproducing melody sounds, the melody sound is synthesized by driving the sound source with a basic cycle equal to the basic cycle of each scale note.

以下、実施例の基本構成および動作（人間の声などを合
成する通常の音声合成動作）について説明する。The basic configuration and operation (normal speech synthesis operation for synthesizing human voice, etc.) of the embodiment will be described below.

いま、圧縮パラメータ（すなわち再生用ＲＯＭθ３）の
相対アドレス）は１フレームごとにデータ入力端子θ荀
から切換回路θ→を介してリングレジスタ０鴫にピット
シリアルに記憶されるが、このような相対アドレスだけ
では再生用ＲＯＭ　（＋３）には各パラメータの再生デ
ータが連続して記憶しであるので・特定のデータを取シ
出すことができない。そこでインデックスＲＯＭ（１７
）の中に記憶されている再生用ＲＯＭ（１３ｉ中の各パ
ラメータの先頭アドレスをアドレスカウンタ（１８）の
制御の下に順次取り出して、上記相対アドレスと加算回
路０９）によって加算することにより再生用ＲＯＭ　（
＋３）の絶対アドレス（９ヒツト）を計算し、この絶対
アドレスによって再生用ＲＯＭ　（１３）をアクセスす
るようにしている。インデックスＲＯＭ　Ｑ７）には圧
縮パラメータのピット配分数を３ピツトの２進数で記憶
させており、この圧縮パラメータのピット配分数に関す
るデータは再生制御回路馨０）に送られ、再生器（財）
回路！２０）は、ピット配分数だけシフトクロックをリ
ングレジスタ（＋６）に送出する。したがってリングレ
ジスタ（１→→・らは、上記ピット配分数に応じて例え
ばＡパラメータの場合には５ピツト、Ｐパラメータの場
合には６ヒツト％に１０　　パラメータの場合には３ヒ
ツト、・・・・・・Ｋ＋パラメータの場合には７ヒツト
という具合に圧縮パラメータ（相対アドレス）をそれぞ
れ加算回路θ９）にシリアルに送出するものである。ま
たインデックスＲ’ＯＭ（ｌη内に記憶されている各特
徴パラメータの再生用ＲＯＭ　（＋３）内における先頭
アドレスは１パラレルシリアル変換回路伐１）を介して
１ピツトづつ順次加算回路（１９）に送出されるので、
順次１ヒツトづつ加算されて絶対アドレスが計算される
ものである。こうして計算されたシリアルな絶対アドレ
スはシリアルパラレル変換回路−を介してパラレルデー
タに変換され・再生用ＲＯＭ（１３）をアクセスするア
ドレスに変換される。Now, the compression parameters (i.e., the relative address of the playback ROM θ3) are stored pit-serially from the data input terminal θ through the switching circuit θ→ in the ring register 0 for each frame. However, since the reproduction data of each parameter is stored continuously in the reproduction ROM (+3), it is not possible to extract specific data. Therefore, the index ROM (17
) The starting address of each parameter in the playback ROM (13i) stored in the playback ROM (13i) is sequentially taken out under the control of the address counter (18) and added to the above relative address by the adder circuit 09. ROM (
+3) absolute address (9 hits) is calculated, and the playback ROM (13) is accessed using this absolute address. The index ROM Q7) stores the number of pits allocated to the compression parameter as a 3-pit binary number, and data regarding the number of pits allocated to the compression parameter is sent to the reproduction control circuit Kaoru0), which controls the regenerator.
circuit! 20) sends shift clocks as many as the allocated number of pits to the ring register (+6). Therefore, the ring register (1→→...) corresponds to the above pit distribution number, for example, 5 pits for the A parameter, 6 hits for the P parameter, 3 hits for the 10% parameter, etc. . . . In the case of K+ parameters, compressed parameters (relative addresses) such as 7 hits are sent serially to the adder circuit θ9). In addition, the index R'OM (the first address in the reproduction ROM (+3) of each characteristic parameter stored in lη is sent out one pit at a time to the addition circuit (19) via the 1-parallel-serial converter circuit 1). Because it is done,
The absolute address is calculated by sequentially adding each hit one by one. The serial absolute address thus calculated is converted into parallel data via a serial-parallel conversion circuit and converted into an address for accessing the reproduction ROM (13).

この再生用ＲＯＭＯＳ）から出力される特徴パラメータ
は１フレームごとに更新されるものであるが・データを
更新する際に各フレーム間の接続点において特徴パラメ
ータが不連続的に変化すると音声信号に歪みを生じて明
瞭度が低下するおそれがあるので、データ更新の際に特
徴パラメータがスムーズに変化し得るように補間計算回
路（２３）を設けて１フレーム内の８点において近似的
な直線的補間を行なうようにしている。このため、タイ
ニング制御回路（２蜀では第４図に示すように１フレー
ム（２０ｍｓｅｃ）中に８個の補間用Ｄクロ・ツク（２
，５ｍ５ｅｃ）を発生し、１個のＤクロック中に２５個
のパラメータ読込用Ｐり０ツク（１００μｓｅｃ　）、
さらに１個のＰクロック中に２２個のじット読込用Ｔり
０ツク（４，５μｓｅｃ周期）を作成する。なおＰクロ
ックはサンプリングパルスに相当する同期パルスである
。８個のＤクロックのうら１最初のＤｌにおいてデータ
入力端子０→からリングレジスタ（＋６）にデータが読
み込まれる。各圧縮パラメータＡ、　Ｐ、ＫＬ。・・・
、Ｋ１は奇数番目のＰクロックで順次読みみ込まれるも
のであり、例えばＡパラメータはＰ１区間のＴ６〜Ｔｌ
ｏの５個のＴり０ツクで読み込まれる０偶数番目のＰり
Ｏツクあるいは上記以外のＴりＤツクは補間計算回路（
支）、音源ＲＯＭ　＋４１　＋５）、デジタルフィルタ
（社）などのタイミングとして使用されるものである。The feature parameters output from this playback ROMOS are updated every frame, but if the feature parameters change discontinuously at the connection point between each frame when updating data, the audio signal will be distorted. Therefore, an interpolation calculation circuit (23) is provided to perform approximate linear interpolation at 8 points within one frame so that feature parameters can change smoothly when updating data. I try to do this. For this reason, the tinning control circuit (in the case of 2 Shu, 8 interpolation D clocks (2
, 5m5ec), and 25 parameter read P resets (100μsec) during one D clock.
Furthermore, 22 bit reading T clocks (4.5 μsec period) are created during one P clock. Note that the P clock is a synchronization pulse corresponding to a sampling pulse. Data is read into the ring register (+6) from the data input terminal 0→ at the first Dl of the eight D clocks. Each compression parameter A, P, KL. ...
, K1 are read sequentially at odd-numbered P clocks. For example, the A parameter is read from T6 to Tl in the P1 interval.
The 0th even-numbered PO which is read by the 5 T-tricks of o or the T-tricks other than the above are processed by the interpolation calculation circuit (
This is used as the timing for the sound source ROM +41 +5), digital filter, etc.

この補間計算回路（四はメロディ音の合成時にはその動
作を停止する。Ｍはパラレルシリアル変換回路である。This interpolation calculation circuit (4 stops its operation when melody sounds are synthesized; M is a parallel-to-serial conversion circuit).

上記補間計算回路内によって２．５ｍ５ｅｃごとに新し
い値に更新された各特徴パラメータは、それぞれＰラッ
チ（２７）、ＡＫラッチ瞥に一時的に蓄えられる。ただ
し、補間計算に差し当り必要のないパラメータはすべて
ＡＫパラメータスタックＩ２９）に転送してデジタルフ
ィルタレ（へ）の音声合成用データとして蓄積している
。Each feature parameter updated to a new value every 2.5 m5ec by the interpolation calculation circuit is temporarily stored in the P latch (27) and the AK latch, respectively. However, all parameters that are not required for the time being for interpolation calculations are transferred to the AK parameter stack I29) and stored as data for speech synthesis in the digital filter.

ところでＰラッチ國に蓄えられたＰパラメータは音源を
駆動してＰパラメータに対応する基本周期を有するイン
パルス信号を発生するだめのデータであり、人間の話し
言葉のような音声を合成する場合・サンプリンタパルス
に等しいＰクロックをカウントしている音源ＲＯＭ　＋
４＋　＋６１のアドレスカウンタ（６）のリセット信号
はアドレスカウンタ（６）の出力とＰラッチ罰に蓄えら
れたＰパラメータの一致を検出する一致回路（７）の出
力となり、アドレスカウンタ（６）はＰクロック周期の
整数倍（Ｐパラメータ）の周期でリセットされるように
なっているｏしたがって音源ＲＯＭ　＋４＋　＋５）か
らＰパラメータに基いた有声音源制御データが出力され
、音源切換回路（１２）を介して切換回路（至）に入力
される。音声に基本周期がない場合には、音源制御回路
のりにて切換回路例を駆動し、無声音源（ハ）に切換え
るようになっており、無声音源の力は基本周期を持たな
いホワイトノイズ（白色雑音）を発生させるものである
。次ＫＡパラメータおよびにパラメータはデジタルフィ
ルターに供給され、音源より供給された信号に振巾の大
小およびスペクトル分布に関する情報を付は加えること
により音声を再生するものである。図中色３）は再生さ
れた音声信号を増巾する低周波アンプ、−はスヒーカ、
側は水晶発振回路である。By the way, the P parameters stored in the P latch are data used to drive a sound source to generate an impulse signal having a fundamental period corresponding to the P parameters. Sound source ROM that counts P clocks equal to pulses +
The reset signal of the address counter (6) at 4+ +61 becomes the output of the matching circuit (7) that detects the match between the output of the address counter (6) and the P parameter stored in the P latch, and the address counter (6) The voiced sound source control data based on the P parameter is output from the sound source ROM +4+ +5), which is reset at a cycle that is an integral multiple of the clock cycle (P parameter), and is sent via the sound source switching circuit (12). Input to the switching circuit (to). When the sound has no fundamental period, the switching circuit is driven by the sound source control circuit to switch to the unvoiced sound source (c), and the power of the unvoiced sound source is the white noise (white noise). The next KA parameter and the second parameter are supplied to a digital filter, which reproduces the sound by adding information regarding amplitude and spectral distribution to the signal supplied from the sound source. Color 3) in the figure is a low frequency amplifier that amplifies the reproduced audio signal, - is a speaker,
The side is a crystal oscillation circuit.

以下用５図及び第６図に示す音階パルス発生回路（８）
、リセットパルス発生回路（９）の構成およびメロディ
音を合成する音声合成動作について説明する。音階パル
ス発生回路（８）はＰパラメータに対応するデータすな
わち制御用ＩＣｍがら出力される圧縮Ｐパラメータをリ
クエスト信号（ＶＲＥ）によりとシこむようにしたシフ
トレジスタ（３６）と、圧縮Ｐパラメータをアドレスデ
ータとして圧縮Ｐパラメータに対応する音階データを読
み出すようにＬ７だ音階ＲＯＭ（３ηと、音階ＲＯＭｅ
ηから読み出された欲階データをプリセット入力としＰ
り０ツクよりも周波数の高いりＯツクパルス例えばＴク
ロック−をカウントするプリセットカウンタ（喝と、プ
リセットカウンタ盃のｔ！ｏ検出信号を反転するインバ
ータの９）とで構成され、クロックパルスの周期の整数
倍（音階データ）の周期を有するせ口検出信号を音階パ
ルス（ＰＭ）として出力する。この場合・音階パルス発
生回路（８）から出力される音階パルス（ＰＭ）の周波
数は離散的な値をとるが離散間隔はり０ツクパルスの周
波数に応じて小さくなる。したがって音階ＲＯＭのηに
適当な音階データを記憶させておくことにより音階パル
ス発生回路（８）にて各音階の周波数に一致するような
音階パルス（ＰＭ）が形成できることになる。例えばり
０ツクパルスをＴりＯツク（周期４．５　ｐ　ｓｅｃ　
）とし・Ｐパラメータ「１２」に対応する圧縮Ｐパラメ
ータにて音階ＲＯＭ（３ηから音階データ「２８４」が
読み出されるようＫすれば、プリセットカウンタ（へ）
から４．５　Ｘ　２８４μｓｅｃの周期でゼロ検出信号
が得られ、この音階パルス（ＰＭ）はＰパラメータのｒ
１２．８Ｊに相当する基本周期となシ、Ｐパラメータに
対応する離散的な基本周期を補間できることになる。リ
セットパルス発生回路（９）はインバータ鴎（４１）、
コンデンサは乃、ナントゲート（至）、Ｄフリップフロ
ラづ（伺およびアンドゲート（４句にて形成されており
、第７図のタイムチャートに示すようにプリセットカウ
ンタ（ハ）から出力される音階信号（ＰＭ）が得られた
直後のＰり０ツクをアドレスカウンタ（６）のリセット
パルス（ＶＲ）として出力するようになっている。なお
図中（イ）はＰパラメータが「１２」のときの一致回路
（７）の出力、（ロ）は音階信号（ＰＭ）、０幼はリセ
ットパルス（ＶＲ）を示すものである。Scale pulse generation circuit (8) shown in Figures 5 and 6 below
, the configuration of the reset pulse generation circuit (9) and the voice synthesis operation for synthesizing melody sounds will be explained. The scale pulse generation circuit (8) is connected to a shift register (36) in which the data corresponding to the P parameter, that is, the compressed P parameter outputted from the control ICm, is input by a request signal (VRE), and the compressed P parameter is input to the address data. The L7 scale ROM (3η and the scale ROMe
The desire level data read from η is used as a preset input and P
It consists of a preset counter (9) that counts the 0 clock pulse, for example, the T clock, which has a higher frequency than the 0 clock pulse, and an inverter 9 that inverts the t!o detection signal of the preset counter cup. A gap detection signal having a cycle that is an integral multiple (scale data) is output as a scale pulse (PM). In this case, the frequency of the scale pulse (PM) output from the scale pulse generation circuit (8) takes discrete values, but the discrete interval becomes smaller in accordance with the frequency of the zero pulse. Therefore, by storing appropriate scale data in η of the scale ROM, the scale pulse generation circuit (8) can generate a scale pulse (PM) that matches the frequency of each scale. For example, if the 0 pulse is set to 0 (period: 4.5 p sec)
) and set the compressed P parameter corresponding to the P parameter "12" so that the scale data "284" is read from the scale ROM (3η), the preset counter (to)
A zero detection signal is obtained with a period of 4.5 × 284 μsec, and this scale pulse (PM) is
This means that a discrete fundamental period corresponding to the P parameter can be interpolated with a fundamental period corresponding to 12.8J. The reset pulse generation circuit (9) is an inverter AOI (41),
The capacitor is formed by four lines: a Nant gate (to), a D flip Florazu (to), and an and gate (to), and the scale signal output from the preset counter (c) as shown in the time chart in Figure 7. The P pulse immediately after obtaining (PM) is output as the reset pulse (VR) of the address counter (6). In the figure, (a) shows the pulse when the P parameter is "12". The output of the matching circuit (7), (b) indicates a scale signal (PM), and 0 indicates a reset pulse (VR).

いま制御用ＩＣ（Ａ）から音源選択入力ピン（１１）に
メロディ音の記憶されている第１の音源ＲＯＭ　ｆ４）
を選択するように信号が与えられている場合、切換回路
（ｌＯ）はりセットパルス発生回路（９）の側に切換え
られ）アトしスカウンタ（６）はリセットパルス発生回
路（９）から出力されるリセットパルス（ＶＲ）にてリ
セットされ）アドレスカウンタ（６）はＰり０ツクを１
３個カウントしてリセットされる場合とが、４：１の割
合で起きることになる。したがって等測的ニＰｔ”＋ラ
メータｒ１２．８ｊに相当する基本周期で音源ＲＯＭ　
＋４）がアクセスされ、音階音「ソ」が正確に再生され
ることになる。同様にして各音階音が正確に再生され、
メロディが正しい音程で再生される。The first sound source ROM f4) in which the melody sound is currently stored in the sound source selection input pin (11) from the control IC (A)
When a signal is given to select , the switching circuit (lO) is switched to the set pulse generating circuit (9) side) and the counter (6) is output from the reset pulse generating circuit (9). The address counter (6) is reset by the reset pulse (VR)
The number of cases in which the number is counted and then reset occurs at a ratio of 4:1. Therefore, the sound source ROM is
+4) is accessed, and the scale note "G" is accurately reproduced. In the same way, each scale note is played accurately,
The melody is played at the correct pitch.

第８図に示すタイムチャートは音階パルス（ＰＭ）とリ
セットパルス（ＶＲ）の関係をさらに分かシ易く説明す
るもので、例として３．７５ＫＨｚ（２６７μｓｅｃ）
の音階パルス（ＰＭ）に対応するリセットパルス（ＶＲ
）を示したものである。図から明らかなようにリセット
パルス（ＶＲ）としてＰパルスの３Ｓ６．８−１１．１
４．１６・・・番目のパルスが出力される。このりセッ
トパルス（ＶＲ）でリセットされるアドレスカウンタ（
６）により音源ＲＯＭ（４）５％アドレスされるの００で、音源ＲＯＭ　（４）から等測的に３．７５　Ｋ　Ｈ
ｚ　（３μｓｅｃ　）とみなせる周期で音源データが読
み出されるととＫなシ、音源が正しい音階周波数で駆動
されてメロディ音や歌唱などの音声が正確な音程で再生
されることになる。The time chart shown in Figure 8 explains the relationship between the scale pulse (PM) and the reset pulse (VR) more easily.
The reset pulse (VR
). As is clear from the figure, 3S6.8-11.1 of the P pulse is used as the reset pulse (VR).
4.16th pulse is output. The address counter (
00 of the sound source ROM (4) 5% addressed by 6), isometrically 3.75 K H from the sound source ROM (4)
If the sound source data is read out at a period that can be considered as z (3 μsec), the sound source will be driven at the correct scale frequency, and sounds such as melody sounds and singing will be reproduced at accurate pitches.

しかして第２図のづ０９９図においては・音源選択入力
ピン（１１）を切換回路（ｌＯ）の切換入力に接続し、
第１の音源ＲＯＭ　＋４）の選択時には切換回路（１０
）をリセットパルス発生回路（９）の側に切換えるよう
にし、第２の音源ＲＯＭ　＋５＋の選択時には切換回路
ｆｌｏ）を一致回路θηの側に切換えるようにしてあり
、第１の音源ＲＯＭ　＋４）にはメロディ音の音源デー
タをマスク記憶させ、第２の音源Ｒ□Ｍ（５ＮＣは男声
音または女声音の音源ヂ〜りをマスク記憶させるように
しているもめであるが、第２図の点線Ｃで囲まれる範囲
のマスクパターンを変更して切換回路（１０）の切換入
力を接地することもでき、この場合には切換回路（１０
）は一致回路（７）の側にのみ接続され、音階パルス発
生回路（８）およびリセットパルス発生回路（９）は使
用されないものである。そしてこの場合には、第１およ
び第２の音源ＲＯＭ　＋４＋　＋５）にはそれぞれ男声
音および女声音の音源データがマスク記憶されるもので
ある。なお、かかる音源データは、男声音や女声音、メ
ロディ音などをそれぞれＰＡＲＣＯＲＪＩの音声分析フ
ィルタに入力して、Ａパラメータおよびにパラメータを
抽出した後に得られる残差信号の波形を示すデータであ
って、かかる残差波形を記憶せる音源ＲＯＭ（４）（５
）を一定の周期でリセットされるアドレスカウンタ（６
）によってくり返しアクセスすることにより、基本周期
を有する残差信号が再生されるようになっているもので
ある。However, in Figure 2-099, the sound source selection input pin (11) is connected to the switching input of the switching circuit (lO),
When selecting the first sound source ROM (+4), the switching circuit (10
) is switched to the reset pulse generation circuit (9) side, and when the second sound source ROM +5+ is selected, the switching circuit flo) is switched to the matching circuit θη side, and the first sound source ROM +4) is switched to the matching circuit θη side. The problem is that the sound source data of the melody sound is stored as a mask, and the second sound source R It is also possible to ground the switching input of the switching circuit (10) by changing the mask pattern in the area surrounded by
) is connected only to the coincidence circuit (7) side, and the scale pulse generation circuit (8) and reset pulse generation circuit (9) are not used. In this case, the first and second sound source ROMs +4+ +5) mask the sound source data of male and female voices, respectively. Note that such sound source data is data indicating the waveform of a residual signal obtained after inputting a male voice sound, a female voice sound, a melody sound, etc., respectively to the PARCORJI voice analysis filter and extracting the A parameter and the 2 parameter. , sound source ROM (4) (5) that can store such residual waveforms.
) is reset at regular intervals (6
), a residual signal having a fundamental period can be reproduced by repeatedly accessing it.

〔Effect of the invention〕

本発明は板上のように、第１および第２の音源ＲＯＭに
メロディ音および音声の各音源データを夫々マスク記憶
せしめたときには、第１の音源ＲＯＭの選択時にのみ切
換回路をリセットパルス発生回路の側に切換えるように
、音源選択入力ピンを切換回路の切換入力に接続し、第
１および第２の音源ＲＯＭに共に音声の音源データをマ
スク記憶せしめたときには、切換回路を常時一致回路用
力の側に接続するように、切換回路の切換入力のマスク
パターンを選択可能としたものであるから、メロディ音
と音声とを選択可能な音声合成ＬＳＩとするときには音
源選択入力ピンを切換回路の切換入力に接続して、音源
ＲＯＭを選択すると同時に切換回路によってアドレスカ
ウンタのリセット信号を切換えることができ、これによ
ってメロディ音の合成時における音源の基本周期と音声
の合成時における音源の基本周期とを音源ＲＯＭの選択
と連動して切り換えることができ、したがってメロディ
音の合成時には音程の正確な再生音を合成できるという
効果があり、まだ男声音と女声音とを選択可能な音声合
成ＬＳＩとするときには音源選択入力ピンを切換回路の
切換入力から切シ離して、音階パルス発生回路等を使用
しないようだすることができ、しかもかかる選択を従来
のようにメロディコード検出回路を用いないで行なうこ
とができるという効果がある。As described above, when each sound source data of melody sound and voice is mask-stored in the first and second sound source ROMs, the switching circuit is reset only when the first sound source ROM is selected. When the sound source selection input pin is connected to the switching input of the switching circuit so as to switch to the side of Since the mask pattern of the switching input of the switching circuit can be selected as shown in FIG. When the sound source ROM is selected, the reset signal of the address counter can be switched by the switching circuit. It can be switched in conjunction with the ROM selection, and therefore has the effect of synthesizing playback sounds with accurate pitches when synthesizing melody sounds, and when creating a voice synthesis LSI that can select between male and female voices, the sound source By separating the selection input pin from the switching input of the switching circuit, it is possible to avoid using a scale pulse generation circuit, etc., and furthermore, such selection can be made without using a melody code detection circuit as in the past. There is an effect.

[Brief explanation of the drawing]

第１図は本発明の特許請求の範囲に示された構成を端的
に示すいわゆるクレーム対応ブロック図、簗２図は本発
明の一実施例のプＯｔツク図、第３図は同上の原理説明
用の波形図、第４図は同上に用いるり０ツクパルスのタ
イムチャート、第５図７図および第８図は同上の動作説
明図である。ｔｘ）　＋２１　＋３）はデータメそり、＋４）＋５）
は音源ＲＯＭ、（６）はアドレスカウンタ・（７）は−
数回路、（８）は音階Ｉ＼ルス発生回路、（９）はリセ
ットパルス発生回路、（ｌＯ）は切換回路、（ＩＩ）は
音源選択入力ピン、０２は音源切換回路、ＣＫは同期パ
ルスである。代理人　弁理士　　石　１）長　七Fig. 1 is a so-called claim-corresponding block diagram concisely showing the configuration shown in the claims of the present invention, Fig. 2 is a block diagram of an embodiment of the present invention, and Fig. 3 is an explanation of the principle of the same. FIG. 4 is a time chart of the 0-pass pulse used in the above, and FIG. 5, FIG. 7, and FIG. 8 are operation explanatory diagrams of the same. tx) +21 +3) is data memory, +4) +5)
is the sound source ROM, (6) is the address counter, and (7) is -
Several circuits, (8) is the scale I\rus generation circuit, (9) is the reset pulse generation circuit, (lO) is the switching circuit, (II) is the sound source selection input pin, 02 is the sound source switching circuit, and CK is the synchronization pulse. be. Agent Patent Attorney Ishi 1) Choshichi

Claims

[Claims]

+11 Sampling the audio signal with a sampler pulse of a higher frequency than the audio frequency, extracting characteristic parameters such as amplitude parameters, pitch parameters, and spectral parameters and storing them in data storage,
A 1-chip LS that controls the sound source and synthesizes speech based on the characteristic parameters read from the data memory.
In a speech synthesis device consisting of I, a matching device outputs a matching signal when the value of an address counter that reads out sound source data from the first and second sound source ROMs by counting synchronizing pulses with a period equal to the sampling pulse and matches the pitch parameter. circuit, a scale pulse generation circuit that generates a pulse with a scale frequency corresponding to the pitch parameter, a set pulse generation circuit that outputs a synchronization pulse immediately after the pulse output from the scale pulse generation circuit is obtained, and an address counter. a switching circuit that switches the reset signal of the RO to either the reset pulse generation circuit output or the matching circuit output; and the switching circuit connected to the sound source selection input pin,
A sound source switching circuit for switching the first and second sound sources is provided.
When each sound source data for melody and voice is mask-stored in the sound source ROM, the sound source selection input pin is connected to a switching circuit so that the switching circuit is switched to the reset pulse generation circuit only when the first sound source ROM is selected. When connecting the switching input of the switching circuit to the switching input of the switching circuit and masking the audio sound source data in both the first and second sound source ROMs, connect the switching circuit to the matching circuit output side at all times. A speech synthesis device characterized in that patterns can be selected.