JPS58158694A

JPS58158694A - Voice generation system

Info

Publication number: JPS58158694A
Application number: JP57040776A
Authority: JP
Inventors: 茂大島
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1982-03-17
Filing date: 1982-03-17
Publication date: 1983-09-20

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】発明の対象本発明はディジタル情報により所定の音声を生成するこ
とのできる音声発生機構を具備した機器における、音声
発生方式に係わる。よシ具体的にはＰＣＭ−ｐＰＡＲｃ
ＯＲ符号を用いた音声合成機構を有する機器における音
声修飾方式に係わる。DETAILED DESCRIPTION OF THE INVENTION Object of the Invention The present invention relates to a sound generation method in a device equipped with a sound generation mechanism capable of generating a predetermined sound based on digital information. Specifically, PCM-pPARc
It relates to a voice modification method in a device that has a voice synthesis mechanism using OR codes.

従来技術所望の音声を生成する手法として従来のアナログ録音（
記憶）・再生（読み出し）編集方式に対しディジタル技
術を基本とした方式が考案゛され実用化されつつあるが
、アナログ記ｉ意方式に較ベデイジタル化された機器へ
の相性の良さや制御のし易さといった利点を持つ反面、
アナログ状態のまま記憶する場合に比して情報の記゛憶
効率が無く、為に生成可能な語葉数に難があシ、特に同
−語葉に対し発声形態（音量やイントネーシ宵ンあるい
は高低など）別に音声生成パラメータを用意することは
経済的に不可能であった。Prior Art Conventional analog recording (
Methods based on digital technology have been devised and put into practical use for memory) and playback (readout) editing methods, but compared to analog recording methods, they are less compatible with digital equipment and have less controllability. While it has the advantage of being easy to use,
Compared to storing information in an analog state, the information storage efficiency is lower, and as a result, the number of words that can be generated is difficult. It was economically impossible to prepare separate voice generation parameters (high, low, etc.).

発明の目的本発明の目的は、このようなディジタル技術を基本とし
た音生発生機構を具備し音声ケ発生することができる機
器において、有声発生機構の本来有する音声に１６飾を
加えることによシ等価的に多数の音声が収容された音声
発生機構を有するが如き機器を経済的にかつ付加的に提
供することにある。OBJECT OF THE INVENTION The object of the present invention is to provide a device that is equipped with a sound generation mechanism based on digital technology and capable of generating voices, by adding 16 decorations to the original voice of the voiced generation mechanism. Another object of the present invention is to economically and additionally provide such a device having a sound generation mechanism that accommodates equivalently a large number of sounds.

ディジタル技術を用いた音声合成方式は、いずれも、基
となる音声を、アナログ→ディジタ。All voice synthesis methods using digital technology convert the base voice from analog to digital.

層変換した後所定の圧縮方式（差分パルス符号゛化、ｆ
ＬＭ形予測符号化など）にてデータ量を削減゛した形で
表現する（これを該音声のパラメータと呼ぶことにする
）。η声合成時にハ該パラメータを符号化した時とは逆
のアルゴリズムで復号したのちデジタル→アナログ変換
し、所望の音声信号を得る。該パラメータは音韻期間に
対。After layer conversion, a predetermined compression method (differential pulse encoding, f
(LM-type predictive coding, etc.) to reduce the amount of data (this will be referred to as the parameter of the voice). η At the time of voice synthesis, the parameter is decoded using an algorithm reverse to that used for encoding, and then converted from digital to analog to obtain a desired audio signal. The parameter corresponds to the phonological period.

し可変長のデータ量を有しかつ断えまなく供給されねば
ならない。従って音声発生機構は一般的には、あらかじ
め所望の長さの音声区間に対応するパラメータ列を専用
の記憶機構に格納しておき５専用の制御機構により、該
パラメータ機構からパラメータ列を読み出し、パラメー
タを復号し音声信号を得る音声合成機構に供給する構成
をとる。従って、表音的に同義であっても（たとえは皮
と川など）ニーアンスの異なる音声を区別して発生させ
る為には従来はそれぞ゛れに対して符号化した結果を別
々に音韻パラメータとして記憶機構に格納する必要があ
シ、経済的に発音形態の異なる音声を得ることは困難゛
であった。（毎秒１２００〜９６００ビツトのデータが
必要）本発明は、このような発音形態が異なるのみ□で表音的
に同値である音声の発生は、符号化の゛アルゴリズムと
は独立（関係あってもなくとも。It must have a variable length data amount and be continuously supplied. Therefore, the sound generation mechanism generally stores in advance a parameter string corresponding to a sound section of a desired length in a dedicated storage mechanism, and reads out the parameter string from the parameter mechanism using a dedicated control mechanism 5, and The system is configured to decode the data and supply it to a speech synthesis mechanism that obtains a speech signal. Therefore, in order to distinguish and generate sounds with different nuances even if they are phonetically synonymous (for example, skin and river), conventionally the results of encoding each sound are used as phonological parameters separately. It was difficult to economically obtain sounds with different pronunciation forms because they needed to be stored in a memory device. (Data of 1,200 to 9,600 bits per second is required) In the present invention, the generation of sounds that are phonetically equivalent only with different pronunciation forms is independent of the encoding algorithm (even if there is a relationship). At least.

良い）に、復号績あるいは復号過程の音声に操作を加え
ることにより容易に達成できることに着目し、なされた
ものである。This was done by focusing on the fact that this can be easily achieved by adding operations to the decoding results or the audio during the decoding process.

つまシ基本となる音韻に対応するパラメータ列の納めら
れた記憶機構のアドレスを指定すると共に、発音形態を
併せ指定することにより、ニーアンスの異なる音韻を得
ようとするものである。By specifying the address of the storage mechanism containing the parameter string corresponding to the basic phoneme and also specifying the pronunciation form, it is possible to obtain phonemes with different nuances.

更にこのようにして得られるニーアンスの異なる音韻を
連けいすることにより有意の文章を発生することを考慮
して、パラメータ列のアドレスとその修飾情報を収める
有限サイズの第２゛の記憶機構を持ち、この第２の記憶
機構からパラメータアドレスと修飾情報を読み取るよう
に構成する。Furthermore, in consideration of generating a meaningful sentence by linking the phonemes with different nuances obtained in this way, it has a second storage mechanism of finite size that stores the address of the parameter string and its modification information, It is configured to read parameter addresses and modification information from this second storage mechanism.

第２の記憶機構は機器の目的によシ様々な形。The secondary storage mechanism may take various forms depending on the purpose of the device.

態をとることができる。たとえば、音声発生の′みな目
的とした機器の場合はＦＩＦＯ形式のパイプラインレジ
スタ（メモリ）とすることができるし、文字表示装置で
あって部分的に音声発生を゛行う機器の場合は、文字表
示用のりフレノシーバッファをこれに当てることができ
る。いずれの場合においてもバッファに格納されたデー
・りは該データがパラメータアドレスとそのイじ軸情報
に関するものであることを示す特殊な符号で区切られる
かまたはバッファの一文字を構成する特定の情報ビット
により識別されるよう構成される。can take a position. For example, in the case of a device whose sole purpose is to generate audio, it can be a FIFO-format pipeline register (memory), and in the case of a device that is a character display device that partially generates audio, it can be used as a character display device. You can apply a display glue frenosie buffer to this. In either case, the data stored in the buffer is delimited by a special code indicating that the data pertains to the parameter address and its axis information, or by specific information bits that make up one character of the buffer. configured to be identified by

発明の実施例以下、本発明の実施例を第１図によシ説明する。同図は
文字表示装置にパラメータ合成方式による音声発生機構
を付加し、異常状態や、特゛に操作者の注意を換起すべ
きメツセージを、文字表示とともに音声出力するよう構
成された装置に於けるバッファ構造と、関連する音電合
成。Embodiments of the Invention Hereinafter, embodiments of the present invention will be explained with reference to FIG. The figure shows a device in which a voice generation mechanism based on a parameter synthesis method is added to a character display device, and the device is configured to output sounds along with the character display to indicate abnormal conditions or messages that should particularly call the operator's attention. Buffer structure and related sound-electric synthesis.

機構の間の接続を示したものである。同図でプ゛ロック
１は文字表示装置のりフレッシー用バッファと、音声パ
ラメータおよびその修飾情報を格納する共用バッファ機
構である。ブロック２゛は、パラメータ合成方式による
音声生成部、プ・ロック３は音声修飾部である。バッフ
ァ４は１語が表示文字または、音声パラメータアドレス
とその修飾情報を表現するに充分なるビット長を有する
。更に、制御ビット５を有し、表示用文字コードまたは
音声パラメータアドレスとの識別に使用される。It shows the connections between the mechanisms. In the figure, block 1 is a shared buffer mechanism for storing a character display device buffer, voice parameters and their modification information. Block 2' is a voice generation section using a parameter synthesis method, and block 3 is a voice modification section. The buffer 4 has a bit length sufficient for one word to represent a display character or a voice parameter address and its modification information. Furthermore, it has a control bit 5, which is used for identification with a display character code or audio parameter address.

バッファ４の読み出し時、制御ビット５はレジスタ乙に
置かれ、語の情報部をそれぞれパラメータアドレス＆修
飾情報レジスタ７あるいは文字コード＆表示属性情報レ
ジスタ８にふり分ける。レジスタ８に格納されたデータ
はキャラクタゼネレータ９に与えられ表示器１０に文字
と。When reading the buffer 4, the control bit 5 is placed in register B, and the information part of the word is allocated to the parameter address & modification information register 7 or the character code & display attribute information register 8, respectively. The data stored in register 8 is given to character generator 9 and displayed as characters on display 10.

して、表示される。一方レジスタフに格納された音声パ
ラメータアドレスとその修飾情報はそれぞれ、パラメー
タ記憶部１１と音声修飾部１５に送られる。パラメータ
記憶部１１に送られたアドレスは該アドレスの内容ンバ
ラメータレジスタ。and it will be displayed. On the other hand, the voice parameter address and its modification information stored in the register are sent to the parameter storage section 11 and the voice modification section 15, respectively. The address sent to the parameter storage unit 11 is a parameter register containing the contents of the address.

１２に読み出し、パラメータの定義に従い基準ピ１゛ッ
チ発生器１３と多段ディジタルフィルタ１４を制。12 and controls the reference pitch generator 13 and multistage digital filter 14 according to the parameter definition.

御し、対応する音声に相当するディジタル値を得る。音
声修飾部１５は、このディジタルフィルタ１４からの出
力？受け、これに、レジスタ７よシ与えられている修飾
指示に従い修飾を施した後Ｄ／Ａ変換器＆低域沖波器１
６を経てスピーカ１７より修飾済音声を出力する。control and obtain a digital value corresponding to the corresponding voice. The voice modification section 15 uses the output from this digital filter 14? After receiving and modifying it according to the modification instruction given in register 7, the D/A converter & low-frequency wave transducer 1
6, the modified voice is output from the speaker 17.

音声修飾部は修飾内容によっては基準ピッチ発生器１５
、多段ディジタルフィルタ部１４に組込まれ基本パラメ
ータに修飾を加えるよう構成するとともできる。Depending on the content of the modification, the voice modification section may be used as a reference pitch generator 15.
It can also be configured to be incorporated into the multi-stage digital filter section 14 and to modify the basic parameters.

第２図は音声修飾部の一構成例である。同図は音量を修
飾する構成である。FIG. 2 shows an example of the configuration of the voice modification section. The figure shows a configuration for modifying the volume.

２１は修飾情報として与えられる音量情報を蓄゛えるシ
フトレジスタである。２２は同様に音声合成部２より出
力される直列音声情報を蓄えるシフトレジスタである。21 is a shift register that stores volume information given as modification information. 22 is a shift register that similarly stores serial audio information output from the audio synthesizer 2.

２１にはレジスタ中ただ１゜ビットが儀１〃であるよう
蓬きこまれる。またその最右端のセル位置には％１１１
検出器２５が設けられる。シフトし′ラスタ２２に所定
のサンプル値を格納したのちシフトレジスタ２１および
２２ヲ共通りロックＣＬＫによりシフトレジスタ２０Ｍ
５Ｂ方向（図では右）にシフトする。このシフトは一１
１検出器２５によシ凧１〃が検出されるまで継続され。21 is programmed so that only 1° bit in the register is 1. Also, in the rightmost cell position, %111
A detector 25 is provided. After storing a predetermined sample value in the raster 22, the shift register 20M is transferred to the shift register 20M by a common lock CLK for the shift registers 21 and 22.
Shift in the 5B direction (to the right in the figure). This shift is 11
This continues until the kite 1 is detected by the kite 1 detector 25.

る。シフトによりあふれた２２のデータはあふれレジス
タ２４に格納される。検出器２３に１１〃が検出される
とＣＬＫを停止しシフト動作は終了する。Ru. The 22 data overflowed by the shift are stored in the overflow register 24. When 11 is detected by the detector 23, the CLK is stopped and the shift operation is completed.

シフトレジスタ２２およびあふれレジスタ２４の保持す
る値をまとめてＤ／Ａ変換器２５によυアナログ化する
。この構成によれば１シフトあたり２倍つまυ音圧にし
て６ｄＢ単位の音量調節が可能。The values held in the shift register 22 and overflow register 24 are collectively converted into an analog signal by a D/A converter 25. With this configuration, the sound pressure is doubled per shift, making it possible to adjust the volume in 6 dB increments.

である。It is.

発明の効果本発明によれば、限られた音素をもとに、あ。Effect of the invention According to the present invention, based on limited phonemes, A.

たかも多種の音素か収容されているが如き効果を経済的
に得ることか可能である。It is possible to economically obtain the effect of accommodating a wide variety of phonemes.

[Brief explanation of drawings]

第１図は本発明の一実施例を示す音声発生機構を具備し
た装置のブロック図、第２図は第１′図の音声修飾部の
詳細を示すブロック図である゛み符号の説明１・・・共用バッファ機構、２・・・音声生成部、　　　３・・・音声修飾部。４・・・バッファ、７・・・パラメータアドレス＆修飾情報レジスタ、８・
・・文字コード＆表示属性情報レジスタ、９・・・キャ
ラクタゼネレータ、１０・・表示器、１１・・・パラメータ記憶部、１２・・・パラメータレジスタ、１３・・・基準ピンチ発生器、１４・・・多段ディジタルフィルタ、１５・・・音声修飾部、１６・・・Ｄ／Ａ変換器＆低域沖波器、１７・・・スピ
ーカ。Fig. 1 is a block diagram of a device equipped with a sound generation mechanism showing an embodiment of the present invention, and Fig. 2 is a block diagram showing details of the sound modification section shown in Fig. 1'. ...Shared buffer mechanism, 2..Speech generation section, 3..Speech modification section. 4... Buffer, 7... Parameter address & modification information register, 8...
...Character code & display attribute information register, 9...Character generator, 10...Display device, 11...Parameter storage section, 12...Parameter register, 13...Reference pinch generator, 14...・Multi-stage digital filter, 15...Speech modification unit, 16...D/A converter & low-frequency wave transducer, 17...Speaker.

Claims

[Claims]

A storage mechanism that stores generation parameters (for example, a PCM code string or a pARcOR code string) of a voice generated according to a predetermined procedure for synthesizing a predetermined voice, and a parameter output from the storage mechanism to synthesize a predetermined voice. Speech synthesis mechanism (e.g. PCM decoder or PARCOR
A second storage mechanism is provided for storing the address of the storage mechanism in which the audio parameters to be generated are stored, and the second storage mechanism stores the parameters of the storage mechanism. A voice generation method characterized in that voice modification information is stored together with an address, and when voice is generated, the generated voice is modified based on the modification information.