JP2970254B2

JP2970254B2 - Speech synthesis method and apparatus

Info

Publication number: JP2970254B2
Application number: JP4255899A
Authority: JP
Inventors: 浩志磯野
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1992-08-31
Filing date: 1992-08-31
Publication date: 1999-11-02
Anticipated expiration: 2014-11-02
Also published as: JPH0683398A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、波形符号化方式の一種
であるＡＤＰＣＭ（ＡｄａｐｔｉｖｅＤｉｆｆｅｒｅ
ｎｔｉａｌＰｕｌｓｅＣｏｄｅＭｏｄｕｌａｔｉ
ｏｎ）方式を用いた音声合成装置に関し、差分値データ
を読み出し専用メモリ（ＲＯＭ）として全て持つものの
差分値データの冗長性削減を行い、差分値データメモリ
を削減する方法とその装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an ADPCM (Adaptive Differe) which is a kind of waveform encoding system.
neutral Pulse Code Modulati
The present invention relates to a method and an apparatus for reducing the difference value data memory by reducing the redundancy of the difference value data even if the difference value data is entirely provided as a read-only memory (ROM).

【０００２】[0002]

【従来の技術】まず初めに、従来の音声合成装置におい
て行われていたＡＤＰＣＭ（ＡｄａｐｔｉｖｅＤｉｆ
ｆｅｒｅｎｔｉａｌＰｕｌｓｅＣｏｄｅＭｏｄｕ
ｌａｔｉｏｎ）方式と乗算器を用いないで合成処理をす
るテーブル参照法について図面を参照しながら説明す
る。2. Description of the Related Art First, an ADPCM (Adaptive Dif) performed in a conventional speech synthesizer.
Fermental Pulse Code Modu
With reference to the drawings, a description will be given of a table referencing method for performing a combining process without using a multiplier.

【０００３】図５は従来のテーブル参照法によるＡＤＰ
ＣＭ音声合成装置のブロック図、図６は量子化テーブル
説明図、図７は３ビットＰＣＭ波形の説明図、図８
（ａ）は４ビットＰＣＭ波形の説明図、図８（ｂ）は、
５ビットＰＣＭ波形の説明図、図９はＤＰＣＭ方式の説
明する、図１０はＡＤＰＣＭ波形の説明図、図１１は３
ビットのＡＤＰＣＭ方式の量子化係数ｋの説明図、図１
２は２〜５ビット時における量子化係数ｋの説明図であ
る。FIG. 5 shows an ADP by a conventional table reference method.
FIG. 6 is a block diagram of a CM speech synthesizer, FIG. 6 is an explanatory diagram of a quantization table, FIG. 7 is an explanatory diagram of a 3-bit PCM waveform, FIG.
FIG. 8A is an explanatory diagram of a 4-bit PCM waveform, and FIG.
FIG. 9 is an explanatory diagram of a 5-bit PCM waveform, FIG. 9 is an explanatory diagram of a DPCM system, FIG. 10 is an explanatory diagram of an ADPCM waveform, and FIG.
FIG. 1 is an explanatory diagram of a quantization coefficient k of a bit ADPCM method,
2 is an explanatory diagram of the quantization coefficient k at the time of 2 to 5 bits.

【０００４】ディジタルコードで音声信号を扱う場合、
まず図７に示したようにある周期でサンプリングして、
各サンプリングした値に対して量子化し、量子化値に対
応した符号に置き換えたものをＰＣＭ（ＰｕｌｓｅＣ
ｏｄｅＭｏｄｕｌａｔｉｏｎ）という。一般的にはＡ
／Ｄコンバータの出力値がＰＣＭコードである。図７の
斜線で示した部分が量子化による誤差であり、量子化誤
差と呼ばれている。この符号化方式は、アナログ信号に
対して忠実に符号化を試みようとすると、図８（ａ），
（ｂ）のようにビット数が多く必要となり情報量が増え
る。When a voice signal is handled by a digital code,
First, as shown in FIG. 7, sampling is performed at a certain cycle,
A value obtained by quantizing each sampled value and replacing it with a code corresponding to the quantized value is PCM (Pulse C
Ode Modulation). Generally A
The output value of the / D converter is a PCM code. The hatched portion in FIG. 7 indicates an error due to quantization, which is called a quantization error. This encoding method attempts to faithfully attempt to encode an analog signal, as shown in FIG.
As shown in (b), a large number of bits are required and the amount of information increases.

【０００５】ＰＣＭ方式の音質改善並びに情報量削減と
して考え出されたのが、次に説明するＤＰＣＭ（Ｄｉｆ
ｆｅｒｅｎｔｉａｌＰｕｌｓｅＣｏｄｅＭｏｄｕ
ｌａｔｉｏｎ）方式である。[0005] The DPCM (Dif) described below has been devised to improve the sound quality and reduce the amount of information in the PCM system.
Fermental Pulse Code Modu
lation) method.

【０００６】ＰＣＭ方式では、ビットすべてを割り当て
て、音声波形の符号化を行っていた。この方法では、良
質な音声を合成するには、符号ビット数を多くしなけれ
ばならなかった。そこで音声信号の相関性を利用し、図
９に示したように前の音声信号との差分値に対してＰＣ
Ｍ符号化を行ったものがＤＰＣＭである。In the PCM system, audio bits are encoded by allocating all bits. In this method, the number of code bits must be increased in order to synthesize high quality speech. Therefore, by utilizing the correlation of the audio signal, the difference between the previous audio signal and the PC is calculated as shown in FIG.
The result of the M encoding is the DPCM.

【０００７】つまり、同じビット数であれば、差分値に
対して割り当てた方がより良質な音声を合成することが
できる。また逆に同程度の音質で良ければＤＰＣＭ符号
ビットを少なくでき音声データの情報圧縮を行うことも
できる。In other words, if the number of bits is the same, higher quality speech can be synthesized by assigning the difference value. Conversely, if the sound quality is the same, the DPCM code bits can be reduced and the information of the audio data can be compressed.

【０００８】ＤＰＣＭ方式においては差分値に対しての
ＰＣＭであるため、量子化のための量子化幅が一定であ
り、急激な音声信号の振幅変化や、あまりにも緩慢な音
声信号の場合はひずみや雑音となって現れる。[0008] In the DPCM system, since the PCM is used for the difference value, the quantization width for the quantization is constant, and the amplitude of the audio signal changes suddenly. And appear as noise.

【０００９】さらにＤＰＣＭ方式に対して適応符号化の
考え方を導入してより高能率に音声信号を符号化したも
のが、次に説明するＡＤＰＣＭ（ＡｄａｐｔｉｖｅＤ
ｉｆｆｅｒｅｎｔｉａｌＰｕｌｓｅＣｏｄｅＭｏ
ｄｕｌａｔｉｏｎ）方式である。Further, a speech signal which is more efficiently coded by introducing the concept of adaptive coding into the DPCM system is referred to as an ADPCM (Adaptive D) described below.
iferential Pulse Code Mo
(duration) method.

【００１０】ＡＤＰＣＭ方式は、差分信号値の量子化幅
Δ_i を前のサンプル値の結果から定める方法である。す
なわち、前のサンプルの量子化幅Δ_i-1 とすると次のサ
ンプル量子化幅は、次式で定められ、係数ｋの値は前の
サンプルの振幅値の関数として定める。[0010] ADPCM scheme is a method of determining the quantization width delta _i of the difference signal value from the results of the previous sample value. That is, assuming that the quantization width of the previous sample is Δ _i−1 , the quantization width of the next sample is determined by the following equation, and the value of the coefficient k is determined as a function of the amplitude value of the previous sample.

【００１１】Δ_i ＝ｋ・Δ_i-1 Δ _i = k · Δ _i-1

【００１２】ここでも簡単な例を示す。図１１に３ビッ
トで量子化を行う場合の係数ｋの与え方を示す。３ビッ
トで量子化する場合、最上位の１桁は符号ビットとして
割り当てられるので振幅の絶対値を２ビットが担当す
る。そしてこの２ビットの振幅値の中で、図１１に矢印
で示す値の半値に相当する振幅値以上のコードに符号化
された場合は、次のサンプルの量子化幅が大きくなるよ
うにｋの値を１より大きくする。Here, a simple example is shown. FIG. 11 shows how to give a coefficient k when performing quantization with three bits. When quantizing with three bits, the most significant digit is assigned as a sign bit, so that two bits are responsible for the absolute value of the amplitude. In the case where a code having an amplitude equal to or more than the half value of the value indicated by the arrow in FIG. 11 is encoded in the 2-bit amplitude value, k is set so that the quantization width of the next sample becomes large. Increase the value to greater than 1.

【００１３】逆に、半値に相当する振幅値よりも小さな
振幅値である場合、前の量子化幅よりも減少させるた
め、ｋを１より小さくする。もちろん量子化幅の最小値
と最大値を定めておき、必要以上に小さくなったり、逆
に大きくなったりすることがないようにする必要があ
る。Conversely, if the amplitude value is smaller than the half value, k is made smaller than 1 in order to reduce the quantization value from the previous quantization width. Of course, it is necessary to determine the minimum value and the maximum value of the quantization width so that the quantization width does not become unnecessarily small or vice versa.

【００１４】基本的には、適応化された量子化幅Δによ
る量子化結果が、絶対値的に常に許容レベルの中央付近
にあるようにして、オーバーロードひずみの状態（許容
レベルの上限に近づく状態）にも、粒状ノイズ（許容レ
ベルの下限に近づく状態）にも陥らないようにする方式
である。参照までに、いろいろなビットの時の量子化係
数を図１２に示す。以上、音声符号化について説明した
が、音声合成においては前述した符号化の逆の動作によ
り復号することで実現する。量子化係数に関しては、符
号化に用いた値と同じ値を用いることで復号化する。Basically, the quantization result based on the adapted quantization width Δ is always absolutely in the vicinity of the center of the allowable level, and the state of overload distortion (approaching the upper limit of the allowable level). State) and granular noise (a state approaching the lower limit of the allowable level). For reference, FIG. 12 shows quantization coefficients at various bits. The speech encoding has been described above. Speech synthesis is realized by decoding by the reverse operation of the above-described encoding. The quantization coefficient is decoded by using the same value as the value used for encoding.

【００１５】次にテーブル参照法のアルゴリズムについ
て図５と図６を参照しながら述べる。本従来例は４ビッ
トＡＤＰＣＭ方式の音声合成を示したものである。Next, the algorithm of the table reference method will be described with reference to FIGS. This conventional example shows speech synthesis of the 4-bit ADPCM system.

【００１６】図５において、１０３はＡＤＰＣＭ復号に
用いる加算器、１０５は適応制御器、１０９は音声符号
データを格納しておく音声符号データＲＯＭ、１０８は
音声符号データＲＯＭ１０９から読み出した値を保持す
る符号レジスタ、１１０，１１１，１１２はそれぞれ復
号作業に用いるレジスタ、５０１は復号に用いる差分値
をあらかじめ格納しておく量子化テーブル、１０４は量
子化テーブル５０１を指し示す量子化幅ポインタであ
る。In FIG. 5, reference numeral 103 denotes an adder used for ADPCM decoding; 105, an adaptive controller; 109, a voice code data ROM for storing voice code data; and 108, a value read from the voice code data ROM 109. Code registers, 110, 111, and 112 are registers used for decoding, 501 is a quantization table that stores difference values used for decoding in advance, and 104 is a quantization width pointer that indicates the quantization table 501.

【００１７】テーブル参照法は、量子化係数ｋの乗算処
理を行わなくて済む方法として知られている（例えば、
特願平３−１１５２１０号）。The table reference method is known as a method that does not require the multiplication process of the quantization coefficient k (for example,
Japanese Patent Application No. 3-115210).

【００１８】従来のテーブル参照アルゴリズムについて
述べる。基本原理としては１．２５という値は次式に示
したように、べき乗の値が近似的に量子化係数に近いこ
とを利用する方法である。（１．２５）^-1＝０．８約０．９（１．２５）¹ ＝１．２５約１．２（１．２５）² ＝１．５６２５約１．６（１．２５）³ ＝１．９５３１２５約２．０（１．２５）⁴ ＝２．４４１４０６２５約２．４A conventional table reference algorithm will be described. As a basic principle, a value of 1.25 is a method utilizing the fact that the value of the power is approximately close to the quantization coefficient as shown in the following equation. (1.25) ^-1 = 0.8 about 0.9 (1.25) ¹ = 1.25 about 1.2 (1.25) ² = 1.5625 about 1.6 (1.25) ³ = 1.953125 about 2.0 (1.25) ⁴ = 2.4441625 about 2.4

【００１９】つまり、最大の差分値データに対して１．
２５倍づつ掛け合わされた値を、表としてＲＯＭ（Ｒｅ
ａｄＯｎｌｙＭｅｍｏｒｙ）のような記憶媒体にす
べて書き込んでおくことにより（図６参照）、これを読
み出し累算することで音声合成を行う。That is, for the maximum difference value data, 1.
The values multiplied by 25 times are stored in the ROM (Re
By writing all the data into a storage medium such as an ad only memory (see FIG. 6), and reading and accumulating the data, voice synthesis is performed.

【００２０】動作について図５を用いて説明する。音声
符号データＲＯＭ１０９から読み出された符号化データ
は、符号レジスタ１０８に格納される。符号化レジスタ
１０８に格納された値は振幅情報としてそのまま量子化
テーブル５０１の差分値検索に利用される。さらに、符
号化データは適応制御器１０５に入力され、所定の適応
アルゴリズムにより量子化ポインタ１０４の制御がなさ
れる。制御としては振幅情報の絶対値が大きい場合はポ
インタを右へ２〜４程度ずらすことで量子化係数ｋの所
定の近似倍率を掛けたことに相当する。逆に振幅情報の
絶対値が小さい場合は、ポインタを左へ２〜４程度ずら
すことで量子化係数ｋの所定の近似倍率で割ったことに
相当する。The operation will be described with reference to FIG. The encoded data read from the audio encoded data ROM 109 is stored in the encoded register 108. The value stored in the encoding register 108 is used as it is as amplitude information in a difference value search of the quantization table 501. Further, the encoded data is input to the adaptive controller 105, and the quantization pointer 104 is controlled by a predetermined adaptive algorithm. When the absolute value of the amplitude information is large, the control is equivalent to multiplying the quantization coefficient k by a predetermined approximate magnification by shifting the pointer to the right by about 2 to 4. Conversely, when the absolute value of the amplitude information is small, it is equivalent to dividing the quantization coefficient k by a predetermined approximate magnification by shifting the pointer by about 2 to 4 to the left.

【００２１】量子化幅ポインタ１０４と符号レジスタ１
０８により示された量子化テーブル５０１の差分値デー
タは読み出され、レジスタ１１２に格納される。レジス
タ１１２に格納された差分値データは、あらかじめ計算
されている１つ前の合成波形データが格納されているレ
ジスタ１１１の内容と加算し、新たな合成波形をレジス
タ１１０に格納し出力する。出力と同時に次の合成のた
めにレジスタ１１１にも格納する。Quantization width pointer 104 and code register 1
The difference value data of the quantization table 501 indicated by 08 is read out and stored in the register 112. The difference value data stored in the register 112 is added to the content of the register 111 in which the immediately preceding synthesized waveform data is stored, and a new synthesized waveform is stored in the register 110 and output. At the same time as the output, it is also stored in the register 111 for the next synthesis.

【００２２】以降、音声符号データＲＯＭ１０９から新
たな符号データを読み出し、上記動作を繰り返す。Thereafter, new code data is read from the voice code data ROM 109, and the above operation is repeated.

【００２３】[0023]

【発明が解決しようとする課題】上述したように従来の
ＡＤＰＣＭ方式を用いた音声合成装置は、量子化係数の
乗算処理のための乗算回路削減のため乗算結果つまり差
分値データをすべて持つことで解決していたため、差分
値を記憶する大規模な記憶媒体の容量を有する。As described above, the conventional speech synthesizer using the ADPCM method has all the multiplication results, that is, difference value data, in order to reduce the number of multiplication circuits for multiplying quantization coefficients. Since this has been solved, it has a large-capacity storage medium for storing difference values.

【００２４】本発明の目的は、メモリを削減し、ハード
ウェア資源の削減を図る音声合成方法及びその装置を提
供することにある。An object of the present invention is to provide a speech synthesizing method and apparatus for reducing memory and hardware resources.

【００２５】[0025]

【課題を解決するための手段】前記目的を達成するた
め、本発明に係る音声合成方法は、第１の量子化幅にお
ける第１の差分値データと、第１の量子化幅の１．２５
倍に当たる第２の量子化幅における第２の差分値データ
と、第２の量子化幅の１．２５倍に当たる第３の量子化
幅における第３の差分値データから成る量子化テーブル
を用いてＡＤＰＣＭ符号の復号動作を行う音声合成方法
であって、第１の量子化幅、第２の量子化幅及び第３の
量子化幅と異なる量子化幅における差分値データは量子
化テーブルの第１の差分値データ又は第２の差分値デー
タ又は第３の差分値データをビットシフトさせて求める
ようにしたものである。 Means for Solving the Problems] To achieve the above object, a method speech synthesis according to the present invention, contact with the first quantization width
First difference value data and a first quantization width of 1.25
Second difference value data in a second quantization width corresponding to the double
And a third quantization corresponding to 1.25 times the second quantization width.
Quantization table comprising third difference value data in width
Speech Synthesis Method for Decoding ADPCM Code Using GSM
Wherein the first quantization width, the second quantization width, and the third
The difference value data at the quantization width different from the quantization width is
First difference value data or second difference value data of the conversion table
Data or third difference value data by bit shifting
It is like that.

【００２６】また、本発明に係る音声合成方法を実施す
る音声合成装置は、ＡＤＰＣＭ符号から量子化幅ポイン
タを決める適応制御器と、適応制御器からの量子化幅ポ
インタに応じて巡回する３進アップダウンカウンタと、
３進アップダウンカウンタの巡回数をカウントするアッ
プダウンカウンタと、第１の量子化幅における第１の差
分値データ、第１の量子化幅の１．２５倍に当たる第２
の量子化幅における第２の差分値データ及び第２の量子
化幅の１．２５倍に当たる第３の量子化幅における第３
の差分値データから成る量子化テーブルと、３進アップ
ダウンカウンタによるカウント値に応じて量子化テーブ
ルの第１の差分値データ、第２の差分値データ及び第３
の差分値データの何れかの差分値データに対し、アップ
ダウンカウンタが示す巡回数に応じたビット数分シフト
演算を行うシフターと、シフターから出力される差分値
データと１つ前の合成波形データとを加算し合成波形デ
ータを出力する加算器とを有するものである。Further, the speech synthesis apparatus for carrying out the speech synthesis method according to the present invention, the quantization width point from ADPCM code
And the quantization width from the adaptive controller.
A ternary up / down counter that circulates according to the interchange,
Updates the number of ternary up / down counter cycles
And a first difference in a first quantization width.
Separated value data, the second which corresponds to 1.25 times the first quantization width
The second difference value data and the second quantum in the quantization width of
The third in the third quantization width corresponding to 1.25 times the quantization width
Quantization table consisting of differential value data
Quantization table according to count value by down counter
First difference value data, second difference value data,
Of any of the difference value data
Shift by the number of bits according to the number of rounds indicated by the down counter
The shifter that performs the operation and the difference value output from the shifter
The data and the previous synthesized waveform data are added and the synthesized waveform data is added.
And an adder for outputting data .

【００２７】[0027]

【作用】ＡＤＰＣＭ符号を復号する過程で用いる量子化
テーブルの冗長性を削減することにより、記憶容量を減
らしハードウェア資源の削減を実現する。By reducing the redundancy of the quantization table used in the process of decoding the ADPCM code, the storage capacity is reduced and the hardware resources are reduced.

【００２８】[0028]

【実施例】以下、本発明の実施例について図面を参照し
て説明する。Embodiments of the present invention will be described below with reference to the drawings.

【００２９】（実施例１）図１は、本発明の実施例１を
示すブロック図、図２は、本実施例１の量子化テーブル
を示す図である。(First Embodiment) FIG. 1 is a block diagram showing a first embodiment of the present invention, and FIG. 2 is a diagram showing a quantization table of the first embodiment.

【００３０】本実施例は４ビットＡＤＰＣＭ方式の音声
合成を示したものである。図１において、本発明は、Ａ
ＤＰＣＭ音声合成で用いる差分値データを格納してある
量子化テーブル１０１と、シフタ１０２と、差分値デー
タの累算に用いる加算器１０３と、量子化幅ポインタ１
０４と、適応制御器１０５と、３進アップダウンカウン
タ１０６と、アップダウンカウンタ１０７と、ＡＤＰＣ
Ｍ符号を格納する符号レジスタ１０８と、音声符号デー
タＲＯＭ１０９と、差分値の累算処理に用いるレジスタ
１１０と、レジスタ１１１と、レジスタ１１２とからな
る。The present embodiment shows 4-bit ADPCM speech synthesis. In FIG. 1, the present invention
A quantization table 101 storing difference value data used in DPCM speech synthesis, a shifter 102, an adder 103 used for accumulating difference value data, and a quantization width pointer 1
04, adaptive controller 105, ternary up / down counter 106, up / down counter 107, ADPC
It comprises a code register 108 for storing the M code, a voice code data ROM 109, a register 110 used for accumulating difference values, a register 111, and a register 112.

【００３１】まず初めに図２に示す量子化テーブルにつ
いてその構成と本発明の原理について述べる。First, the configuration of the quantization table shown in FIG. 2 and the principle of the present invention will be described.

【００３２】図２に示した量子化テーブルは縦方向には
ＡＤＰＣＭ符号に対応した差分値の振幅レベルの異なる
ものを格納しておく。振幅レベルは図２において、最も
上側は正の最大差分値振幅を格納し順次下側に向かうに
したがって差分値の振幅レベルの小さいものとなるよう
に配置しておく。中央付近が最も振幅レベルの小さな正
の差分値振幅レベルの値になるようにする。中央より下
の値は負の差分値振幅レベルを格納しておく。最も下側
は負の最大差分値振幅を格納し順次上側に向かうにした
がって差分値の振幅レベルの小さいものとなるように配
置しておく。中央付近が最も振幅レベルの小さな負の差
分値振幅レベルの値になるように配置しておく。The quantization table shown in FIG. 2 stores, in the vertical direction, those having different amplitude levels of difference values corresponding to ADPCM codes. In FIG. 2, the uppermost level stores the positive maximum differential value amplitude in FIG. 2 and is arranged so that the amplitude level of the differential value becomes smaller as going downward. The positive difference value having the smallest amplitude level near the center is set to the value of the amplitude level. The value below the center stores the negative difference amplitude level. The lowermost part stores the amplitude of the maximum negative differential value, and is arranged so that the amplitude level of the differential value becomes smaller as going upward. The negative difference value having the smallest amplitude level near the center is arranged to be the value of the amplitude level.

【００３３】図２に示した量子化テーブルの横方向に
は、ある適当な量子化幅の振幅セットを基準にその１．
２５倍、さらに１．２５倍（最大の量子化幅からみて約
１．６倍）、さらに１．２５倍（最大の量子化幅からみ
て約２倍）といった具合に１．２５倍づつべき乗倍され
てゆく構成になっている。ところで、そのある適当な量
子化幅からみて約２倍となるということは、最大の量子
化幅の値を呼び出し１ビットディジタルデータとしてシ
フト演算してやれば２倍したデータを読み出したのと同
じこととなり、全ての差分値を保有する必要がなくな
る。In the horizontal direction of the quantization table shown in FIG.
25 times, further 1.25 times (approximately 1.6 times from the maximum quantization width), and further 1.25 times (approximately 2 times from the maximum quantization width), such as a power multiplier of 1.25 times It is configured to be done. By the way, when the value is approximately doubled from the viewpoint of a certain appropriate quantization width, the value of the maximum quantization width is called, and if a shift operation is performed as 1-bit digital data, the doubled data is read out. , There is no need to hold all the difference values.

【００３４】さらに、進歩させて逆に考えて、逆方向シ
フト演算で１／２にすることも可能である。むしろ、基
の量子化幅セットから除算する方が、精度良く音声合成
することができる。つまり、そのある適当な量子化幅セ
ットを最大振幅セットの量子化幅とその近傍を選択して
置けば良質な音声合成が実現できることを意味する。Further, it is also possible to make a half by performing a backward shift operation by making progress and thinking in reverse. Rather, the voice synthesis can be performed more accurately by dividing the original quantization width set. In other words, it means that high quality speech synthesis can be realized if the certain appropriate quantization width set is set by selecting the quantization width of the maximum amplitude set and its vicinity.

【００３５】故に、３回１／１．２５倍すると約１／２
倍の量子化幅となるため、最大の量子化幅の値とその１
／１．２５倍の値と１／（１．２５）² 倍の３種類の値
を有するだけで簡単なシフト演算だけで図２に示した実
際には実装していない仮想量子化テーブル１に相当する
部分や仮想量子化テーブル２に相当する部分が量子化テ
ーブルを読み出すと同時にシフト演算で容易に算出でき
るのが原理である。Therefore, when 1/25 times is performed three times, about 1/2 is obtained.
Since the quantization width is doubled, the maximum quantization width value and its 1
The virtual quantization table 1 shown in FIG. 2 which has only three types of values, /1.25 times and 1 / (1.25) ² times, is simply implemented by a simple shift operation. The principle is that a corresponding part or a part corresponding to the virtual quantization table 2 can be easily calculated by a shift operation at the same time as reading out the quantization table.

【００３６】続いて動作について図１を参照しながら述
べる。音声合成処理を行うときの初期状態としてはシフ
タ１０２、３進アップダウンカウンタ１０６、アップダ
ウンカウンタ１０７は、最小量子化幅の振幅セットを指
し示すようにシフト演算されている状態から始まるもの
とする。Next, the operation will be described with reference to FIG. It is assumed that the shifter 102, the ternary up / down counter 106, and the up / down counter 107 are shifted from the initial state when performing the voice synthesis processing to indicate the amplitude set of the minimum quantization width.

【００３７】音声符号データＲＯＭ１０９から読み出さ
れた符号化データは、符号レジスタ１０８に格納され
る。符号化レジスタ１０８に格納された値は振幅情報と
してそのまま量子化テーブル１０１の差分値検索に利用
される。さらに、符号化データは適応制御器１０５に入
力され、所定の適応アルゴリズムにより量子化幅ポイン
タ１０４の制御がなされる。制御としては振幅情報の絶
対値が大きい場合はポインタを右へ２〜４程度ずらすこ
とで量子化係数ｋの所定の近似倍率を掛けたことに相当
する。逆に振幅情報の絶対値が小さい場合は、ポインタ
を左へ２〜４程度ずらすことで量子化係数ｋの所定の近
似倍率で割ったことに相当する。The coded data read from the voice coded data ROM 109 is stored in the code register 108. The value stored in the encoding register 108 is used as it is as amplitude information in a difference value search of the quantization table 101. Further, the encoded data is input to the adaptive controller 105, and the quantization width pointer 104 is controlled by a predetermined adaptive algorithm. When the absolute value of the amplitude information is large, the control is equivalent to multiplying the quantization coefficient k by a predetermined approximate magnification by shifting the pointer to the right by about 2 to 4. Conversely, when the absolute value of the amplitude information is small, it is equivalent to dividing the quantization coefficient k by a predetermined approximate magnification by shifting the pointer by about 2 to 4 to the left.

【００３８】適応制御器１０５は３進アップダウンカウ
ンタ１０６に対してアップまたはダウンの制御とその量
を与える。The adaptive controller 105 gives up or down control and the amount to the ternary up / down counter 106.

【００３９】３進アップダウンカウンタ１０６は０，
１，２，０，１，２，…といったようにアップし、逆に
２，１，０，２，１，０，…といったようにダウンす
る。３進アップダウンカウンタ１０６は、２から０に変
化するときにアップダウンカウンタ１０７にアップ要求
をし、逆に０から２に変化するときにアップダウンカウ
ンタ１０７にダウンの要求をする。アップダウンカウン
タ１０７は３進アップダウンカウンタ１０６からの要求
に応じてアップダウンを行う。The ternary up / down counter 106 has 0,
Up, such as 1, 2, 0, 1, 2, ..., and conversely, down, such as 2, 1, 0, 2, 1, 0, .... The ternary up / down counter 106 issues an up request to the up / down counter 107 when the value changes from 2 to 0, and requests a down to the up / down counter 107 when the value changes from 0 to 2. The up / down counter 107 performs up / down in response to a request from the ternary up / down counter 106.

【００４０】量子化幅ポインタ１０４と符号レジスタ１
０８により示された量子化テーブル５０１の差分値デー
タは読み出され、アップダウンカウンタ１０７と３進ア
ップダウンカウンタ１０６の指示によりシフタ１０２が
読み出した差分値データに対してシフト演算処理を施し
た後にレジスタ１１２に格納する。レジスタ１１２に格
納されたシフト演算された差分値データは、あらかじめ
計算されている１つ前の合成波形データが格納されてい
るレジスタ１１１の内容と加算器１０３により加算し、
新たな合成波形をレジスタ１１０に格納し出力する。出
力と同時に次の合成のためにレジスタ１１１にも格納す
る。Quantization width pointer 104 and code register 1
The difference value data of the quantization table 501 indicated by 08 is read out, and after the shift value processing is performed on the difference value data read out by the shifter 102 according to the instruction of the up / down counter 107 and the ternary up / down counter 106. Stored in register 112. The adder 103 adds the shift-calculated difference value data stored in the register 112 to the content of the register 111 in which the immediately preceding synthesized waveform data is stored, and
The new synthesized waveform is stored in the register 110 and output. At the same time as the output, it is also stored in the register 111 for the next synthesis.

【００４１】以降、音声符号データＲＯＭ１０９から新
たな符号データを読み出し、上記動作を繰り返す。Thereafter, new code data is read from the voice code data ROM 109, and the above operation is repeated.

【００４２】（実施例２）図３は、本発明の実施例２を
示すブロック図、図４は、本実施例２の量子化テーブル
を示す図である。(Embodiment 2) FIG. 3 is a block diagram showing Embodiment 2 of the present invention, and FIG. 4 is a diagram showing a quantization table of Embodiment 2 of the present invention.

【００４３】本実施例は４ビットＡＤＰＣＭ方式の音声
合成を示したものである。本実施例２は、実施例１にお
いて量子化テーブルの値を正側と負側の値として正か負
かの違いだけのために同じ絶対値レベルの差分値の値を
２つ持っていたが、符号ビットを持つことにより、差分
値の振幅レベルを正／負兼用し、その符号により加減算
することで実施例１に比べさらに半分の量子化テーブル
容量でＡＤＰＣＭ音声合成を実現した例である。This embodiment shows a 4-bit ADPCM speech synthesis. In the second embodiment, the values of the quantization table in the first embodiment are set to the positive side and the negative side, and two values of the difference value of the same absolute value level are provided only for the difference between positive and negative. In this example, ADPCM speech synthesis is realized with a quantization table capacity which is half that of the first embodiment by adding and subtracting the amplitude level of the difference value by using the sign bit.

【００４４】図３において、本実施例はＡＤＰＣＭ音声
合成で用いる差分値データを格納してある量子化テーブ
ル３０１と、シフタ１０２と、差分値データの累算に用
いる加減算器３０３と、量子化幅ポインタ１０４と、適
応制御器１０５と、３進アップダウンカウンタ１０６
と、アップダウンカウンタ１０７と、ＡＤＰＣＭ符号を
格納する符号レジスタ３０８と、音声データＲＯＭ１０
９と、差分値の累算処理に用いるレジスタ１１０と、レ
ジスタ１１１と、レジスタ１１２とからなる。Referring to FIG. 3, in this embodiment, a quantization table 301 storing difference value data used in ADPCM speech synthesis, a shifter 102, an adder / subtractor 303 used for accumulating difference value data, a quantization width Pointer 104, adaptive controller 105, ternary up / down counter 106
, An up / down counter 107, a code register 308 for storing an ADPCM code, and a voice data ROM 10
9, a register 110 used for accumulating the difference value, a register 111, and a register 112.

【００４５】まず初めに図４に示す量子化テーブルにつ
いてその構成と本発明の原理について述べる。図４に示
した量子化テーブルは縦方向にはＡＤＰＣＭ符号に対応
した差分値の振幅レベルの異なるものを格納しておく。
振幅レベルは図４において、最も上側は正／負兼用の最
大差分値振幅を格納し順次下側に向かうにしたがって差
分値の振幅レベルの小さいものとなるように配置してお
く。最も下側が最も振幅レベルの小さな正／負兼用の差
分値振幅レベルの値になるようにする。First, the configuration of the quantization table shown in FIG. 4 and the principle of the present invention will be described. The quantization table shown in FIG. 4 stores, in the vertical direction, those having different amplitude levels of difference values corresponding to ADPCM codes.
In FIG. 4, the amplitude level in FIG. 4 is arranged such that the maximum difference value amplitude for both positive / negative values is stored and the amplitude level of the difference value decreases gradually toward the lower side. The lowermost side is set to the positive / negative differential value amplitude level value having the smallest amplitude level.

【００４６】図４に示した量子化テーブルの横方向に
は、ある適当な量子化幅の振幅セットを基準にその１．
２５倍、さらに１．２５倍（最大の量子化幅からみて約
１．６倍）、さらに１．２５倍（最大の量子化幅からみ
て約２倍）といった具合に１．２５倍づつべき乗倍され
てゆく構成になっている。ところで、そのある適当な量
子化幅からみて約２倍となるということは、最大の量子
化幅の値を呼び出し１ビットディジタルデータとしてシ
フト演算してやれば２倍したデータを読み出したのと同
じこととなり、全ての差分値を保有する必要がなくな
る。In the horizontal direction of the quantization table shown in FIG. 4, based on an amplitude set having an appropriate quantization width as a reference.
25 times, further 1.25 times (approximately 1.6 times from the maximum quantization width), and further 1.25 times (approximately 2 times from the maximum quantization width), such as a power multiplier of 1.25 times It is configured to be done. By the way, when the value is approximately doubled from the viewpoint of a certain appropriate quantization width, the value of the maximum quantization width is called, and if a shift operation is performed as 1-bit digital data, the doubled data is read out. , There is no need to hold all the difference values.

【００４７】さらに、進歩させて逆に考えて、逆方向シ
フト演算で１／２にすることも可能である。むしろ、基
の量子化幅セットから除算する方が、精度良く音声合成
することができる。つまり、そのある適当な量子化幅セ
ットを最大振幅セットの量子化幅とその近傍を選択して
置けば良質な音声合成が実現できることを意味する。Further, it is also possible to make progress and think in the opposite way, and to halve it by a backward shift operation. Rather, the voice synthesis can be performed more accurately by dividing the original quantization width set. In other words, it means that high quality speech synthesis can be realized if the certain appropriate quantization width set is set by selecting the quantization width of the maximum amplitude set and its vicinity.

【００４８】故に、３回１／１．２５倍すると約１／２
倍の量子化幅となるため、最大の量子化幅の値とその１
／１．２５倍の値と１／（１．２５）² 倍の３種類の値
を有するだけで簡単なシフト演算だけで図４に示した実
際には実装していない仮想量子化テーブル１に相当する
部分や仮想量子化テーブル２に相当する部分が量子化テ
ーブルを読み出すと同時にシフト演算で容易に算出でき
るのが原理である。Therefore, when 1/25 times is performed three times, about 1/2 is obtained.
Since the quantization width is doubled, the maximum quantization width value and its 1
The virtual quantization table 1 shown in FIG. 4 which has only three types of values of /1.25 times and 1 / (1.25) ² times is simply implemented by a simple shift operation and is not actually implemented. The principle is that a corresponding part or a part corresponding to the virtual quantization table 2 can be easily calculated by a shift operation at the same time as reading out the quantization table.

【００４９】続いて動作について図３を参照しながら述
べる。音声合成処理を行うときの初期状態としてはシフ
タ１０２、３進アップダウンカウンタ１０６、アップダ
ウンカウンタ１０７は、最小量子化幅の振幅セットを指
し示すようにシフト演算されている状態から始まるもの
とする。Next, the operation will be described with reference to FIG. It is assumed that the shifter 102, the ternary up / down counter 106, and the up / down counter 107 are shifted from the initial state when performing the voice synthesis processing to indicate the amplitude set of the minimum quantization width.

【００５０】音声符号データＲＯＭ１０９から読み出さ
れた符号化データは、符号レジスタ３０８に格納され
る。符号化レジスタ３０８に格納された値は振幅情報部
分と符号部分に分けられ、振幅情報部分はそのまま量子
化テーブル１０１の差分値検索に利用される。さらに、
符号化データの振幅部分は適応制御器１０５に入力さ
れ、所定の適応アルゴリズムにより量子化幅ポインタ１
０４の制御がなされる。制御としては振幅情報の絶対値
が大きい場合はポインタを右へ２〜４程度ずらすことで
量子化係数ｋの所定の近似倍率を掛けたことに相当す
る。逆に振幅情報の絶対値が小さい場合は、ポインタを
左へ２〜４程度ずらすことで量子化係数ｋの所定の近似
倍率で割ったことに相当する。The coded data read from the voice coded data ROM 109 is stored in the code register 308. The value stored in the encoding register 308 is divided into an amplitude information part and a code part, and the amplitude information part is used as it is for a difference value search of the quantization table 101. further,
The amplitude part of the encoded data is input to the adaptive controller 105, and the quantization width pointer 1 is calculated by a predetermined adaptive algorithm.
04 is performed. When the absolute value of the amplitude information is large, the control is equivalent to multiplying the quantization coefficient k by a predetermined approximate magnification by shifting the pointer to the right by about 2 to 4. Conversely, when the absolute value of the amplitude information is small, it is equivalent to dividing the quantization coefficient k by a predetermined approximate magnification by shifting the pointer by about 2 to 4 to the left.

【００５１】適応制御器１０５は３進アップダウンカウ
ンタ１０６に対してアップまたはダウンの制御とその量
を与える。The adaptive controller 105 gives up / down control and its amount to the ternary up / down counter 106.

【００５２】３進アップダウンカウンタ１０６は０，
１，２，０，１，２，…といったようにアップし、逆に
２，１，０，２，１，０，…といったようにダウンす
る。３進アップダウンカウンタ１０６は、２から０に変
化するときにアップダウンカウンタ１０７にアップ要求
をし、逆に０から２に変化するときにアップダウンカウ
ンタ１０７にダウンの要求をする。アップダウンカウン
タ１０７は３進アップダウンカウンタ１０６からの要求
に応じてアップダウンを行う。The ternary up / down counter 106 has 0,
Up, such as 1, 2, 0, 1, 2, ..., and conversely, down, such as 2, 1, 0, 2, 1, 0, .... The ternary up / down counter 106 issues an up request to the up / down counter 107 when the value changes from 2 to 0, and requests a down to the up / down counter 107 when the value changes from 0 to 2. The up / down counter 107 performs up / down in response to a request from the ternary up / down counter 106.

【００５３】量子化幅ポインタ１０４と符号レジスタ１
０８により示された量子化テーブル５０１の差分値デー
タは読み出され、アップダウンカウンタ１０７と３進ア
ップダウンカウンタ１０６の指示によりシフタ１０２が
読み出した差分値データに対してシフト演算処理を施し
た後にレジスタ１１２に格納する。レジスタ１１２に格
納されたシフト演算された差分値データは、あらかじめ
計算されている１つ前の合成波形データが格納されてい
るレジスタ１１１の内容と加減算器３０３により加減算
し、新たな合成波形をレジスタ１１０に格納し出力す
る。The quantization width pointer 104 and the sign register 1
The difference value data of the quantization table 501 indicated by 08 is read out, and after the shift value processing is performed on the difference value data read out by the shifter 102 according to the instruction of the up / down counter 107 and the ternary up / down counter 106. Stored in register 112. The difference value data subjected to the shift operation stored in the register 112 is added and subtracted by the adder / subtractor 303 with the contents of the register 111 in which the immediately preceding synthesized waveform data is stored, and a new synthesized waveform is registered in the register. Stored in 110 and output.

【００５４】加算か減算かの区別は符号レジスタ３０８
の符号部分の値により区別される。出力と同時に次の合
成のためにレジスタ１１１にも格納する。The sign register 308 distinguishes between addition and subtraction.
Are distinguished by the value of the sign part of. At the same time as the output, it is also stored in the register 111 for the next synthesis.

【００５５】以降、音声符号データＲＯＭ１０９から新
たな符号データを読み出し、上記動作を繰り返す。Thereafter, new code data is read from the voice code data ROM 109, and the above operation is repeated.

【００５６】[0056]

【発明の効果】以上説明したように本発明は、ＡＤＰＣ
Ｍ符号を復号する過程で用いる量子化テーブルの冗長性
を削減することにより、記憶容量を減らすことでハード
ウェア資源の削減を行うことができる効果がある。As described above, the present invention provides an ADPC
There is an effect that hardware resources can be reduced by reducing storage capacity by reducing the redundancy of the quantization table used in the process of decoding the M code.

【００５７】また、最大値を用いることにより、差分値
を記憶する量子化テーブルのビット精度を最良に実現す
ることができ、高音質の音声合成ができる効果もある。Further, by using the maximum value, the bit precision of the quantization table for storing the difference value can be realized at the best, and there is an effect that high-quality sound can be synthesized.

[Brief description of the drawings]

【図１】本発明の実施例１を示すブロック図である。FIG. 1 is a block diagram showing a first embodiment of the present invention.

【図２】本発明の実施例１の量子化テーブルを示す図で
ある。FIG. 2 is a diagram illustrating a quantization table according to the first embodiment of the present invention.

【図３】本発明の実施例２を示すブロック図である。FIG. 3 is a block diagram showing a second embodiment of the present invention.

【図４】本発明の実施例２の量子化テーブルを示す図で
ある。FIG. 4 is a diagram illustrating a quantization table according to the second embodiment of the present invention.

【図５】従来のテーブル参照法によるＡＤＰＣＭ音声合
成装置のブロック図である。FIG. 5 is a block diagram of a conventional ADPCM speech synthesizer using a table reference method.

【図６】量子化テーブル説明図である。FIG. 6 is an explanatory diagram of a quantization table.

【図７】３ビットＰＣＭ波形の説明図である。FIG. 7 is an explanatory diagram of a 3-bit PCM waveform.

【図８】（ａ）は４ビットＰＣＭ波形の説明図、（ｂ）
は５ビットＰＣＭ波形の説明図である。FIG. 8A is an explanatory diagram of a 4-bit PCM waveform, and FIG.
FIG. 4 is an explanatory diagram of a 5-bit PCM waveform.

【図９】ＤＰＣＭ方式の説明図である。FIG. 9 is an explanatory diagram of a DPCM method.

【図１０】ＡＤＰＣＭ波形の説明図である。FIG. 10 is an explanatory diagram of an ADPCM waveform.

【図１１】３ビットのＡＤＰＣＭ方式の量子化係数ｋの
説明図である。FIG. 11 is an explanatory diagram of a 3-bit ADPCM quantization coefficient k;

【図１２】２〜５ビット時における量子化係数ｋの説明
図である。FIG. 12 is an explanatory diagram of a quantization coefficient k at the time of 2 to 5 bits.

【符号の説明】１０１量子化テーブル１０２シフタ１０３加算器１０４量子化幅ポインタ１０５適応制御器１０６３進アップダウンカウンタ１０７アップダウンカウンタ１０８符号レジスタ１０９音声符号データＲＯＭ１１０レジスタ１１１レジスタ１１２レジスタ３０１量子化テーブル３０３加減算器５０１量子化テーブル[Description of Code] 101 Quantization Table 102 Shifter 103 Adder 104 Quantization Width Pointer 105 Adaptive Controller 106 Binary Up / Down Counter 107 Up / Down Counter 108 Code Register 109 Voice Code Data ROM 110 Register 111 Register 112 Register 301 Quantization Table 303 Adder / Subtractor 501 Quantization Table

Claims

(57) [Claims]

1. A first difference value data in a first quantization width.
Data and a second quantization width 1.25 times the first quantization width.
Second difference value data at a quantization width of
In a third quantization width corresponding to 1.25 times the quantization width
Using a quantization table composed of third difference value data, A
A speech synthesis method for performing a DPCM code decoding operation,
The first quantization width, the second quantization width, and the third
The difference value data at a quantization width different from that of
The first difference value data or the second
Bit of the second difference value data or the third difference value data
A voice synthesizing method characterized by being obtained by shifting .

2. A quantization width pointer is obtained from an ADPCM code.
An adaptive controller to determine and the quantization width from the adaptive controller
A ternary up / down counter that cycles according to the pointer
And count the number of rounds of the ternary up / down counter.
An up-down counter, and a first
1 difference value data, 1.25 times the first quantization width.
Second difference value data in a corresponding second quantization width;
Third quantization corresponding to 1.25 times the second quantization width
Quantization table comprising third difference value data in width
And the count value of the ternary up / down counter
The first difference value data of the quantization table,
Of the second difference value data and the third difference value data
For any of the difference value data,
A shift operation by the number of bits corresponding to the number of rounds indicated by the
Shifter and difference value data output from the shifter
And the previous synthesized waveform data to add the synthesized waveform data
A speech synthesizer comprising: an adder for outputting .