JPH0815261B2

JPH0815261B2 - Adaptive transform vector quantization coding method

Info

Publication number: JPH0815261B2
Application number: JP3134831A
Authority: JP
Inventors: 正利新保
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1991-06-06
Filing date: 1991-06-06
Publication date: 1996-02-14
Anticipated expiration: 2011-02-14
Also published as: JPH06291674A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、発生頻度に偏りのある
標本化されたオ−ディオ信号の情報圧縮法に関するもの
である。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a method for compressing the information of sampled audio signals whose frequency of occurrence is biased.

【０００２】[0002]

【従来の技術】音声やオ−ディオ信号を高能率符号化し
て転送あるいは記録することは重要な意味があり、多く
の例が報告されている。その中で、ベクトル量子化や適
応変換符号化、あるいはこれらを組み合わせたものの例
として以下の様なものがある。伝送や蓄積用途向けの音
声の中帯域（４．８〜９．６ｋｂｐｓ）符号化方式とし
て、線形予測残差信号を周波数領域で適応的にベクトル
量子化するものと、スカラ−量子化するものとを組み合
わせた適応変換ベクトル符号化法（電子通信学会論文
誌’８４／１０Ｖｏｌ．Ｊ６７−ＡＮｏ．１０、守谷、
誉田）、音声の低ビットレ−ト（２．４〜１．７Ｋｂｐ
ｓ）符号化方式として、ファジ−ベクトル量子化を簡素
化し、２つのベクトルとそれらの線形結合係数を用いた
相補型ベクトル量子化方式（電子情報通信学会情報理論
研究会’９１／２ＩＴ９０−１０３、浅川他）などが
ある。2. Description of the Related Art It is important to highly efficiently encode a voice or audio signal for transmission or recording, and many examples have been reported. Among them, the following are examples of vector quantization, adaptive transform coding, or a combination thereof. As a mid-band (4.8 to 9.6 kbps) audio coding method for transmission and storage applications, there are adaptively vector-quantized linear prediction residual signals in the frequency domain and scalar-quantized ones. A method of adaptive transform vector coding that combines the following: (The Institute of Electronics and Communication Engineers, Journal of Japan, '84 / 10 Vol. J67-ANo. 10, Moriya,
HONDA), Low bit rate of voice (2.4 to 1.7 Kbp)
s) As a coding method, a fuzzy-vector quantization is simplified, and a complementary vector quantization method using two vectors and their linear combination coefficients (IEICE Information Theory Research Group '91 / 2 IT90-103) , Asakawa and others).

【０００３】これらの方式は、いずれもベクトル量子化
を適用したもので、一番目のものは、線形予測残差信号
をＤＦＴ（離散フ−リエ変換）し、その出力スペクトル
を複数帯域に分割し、各帯域へビットを適応的に割り当
て、ベクトル量子化したものであり、二番目のものは、
音声信号をＰＳＥ分析し、これから得られるインパルス
応答波形をベクトル量子化し、入力ベクトルに最も近い
コ−ドベクトルと、予め選んでおいたこのコ−ドベクト
ルに近い複数個の候補ベクトルのうちの一つのベクトル
の線形結合から歪が最も小さくなるベクトルを１個選び
だすものである。All of these methods apply vector quantization. The first method is to perform DFT (discrete Fourier transform) on a linear prediction residual signal and divide its output spectrum into a plurality of bands. , Adaptively allocating bits to each band and vector quantizing, the second one is
PSE analysis of the voice signal, vector quantization of the impulse response waveform obtained from this, and one vector of the code vector closest to the input vector and a plurality of previously selected candidate vectors close to this code vector One vector with the smallest distortion is selected from the linear combination of.

【０００４】第一の方法は、予測残差スペクトルの分散
がほぼ平坦であると見なせることから、帯域を均等分割
し、各スペクトルの平均電力から、各帯域に割り当てら
れるビット数を適応的に計算し、各帯域に割り当てられ
たビット数ｂが予め決められたビット当たりの転送レ−
トと帯域内のスペクトル数のかけ算より割り出されたビ
ット数ｈ以下であればそのままベクトル量子化し、それ
以上であれば第一段階として、入力ベクトルに対して歪
の小さい順にｎ個のベクトルの候補を残し、第二段階と
して、各重心を中心に（ｂ−ｈ）ビットのスカラ−量子
化器を設け、歪が最小となる符号を決定する。しかし、
この方式では、予測残差を用いており、オ−ディオの様
に変化の激しい信号にこのまま適用することは難しい。In the first method, since the variance of the prediction residual spectrum can be regarded as being substantially flat, the bands are equally divided, and the number of bits assigned to each band is adaptively calculated from the average power of each spectrum. However, the number of bits b allocated to each band is determined by a predetermined transfer rate per bit.
If the number of bits is less than or equal to the number of bits h calculated from the multiplication of the spectrum number in the band and the number of spectrums in the band, vector quantization is performed as it is. With the candidates left, as a second step, a (b-h) -bit scalar-quantizer is provided around each centroid to determine the code with the minimum distortion. But,
In this method, the prediction residual is used, and it is difficult to apply it as it is to a signal with a large change such as audio.

【０００５】第二の方法は、ファジ−ベクトル量子化で
あると、コ−ドベクトルの次元は小さくとも、結果的に
多くコ−ドベクトルを合成できるが、多くのコ−ドベク
トルと結合係数を転送する必要があるので、ファジ−ベ
クトル量子化の究極の形態である２つのベクトルとその
間を結ぶ線形結合係数のみで表している。（図８）を用
いて従来例を説明する。入力端子９０に印加された音声
信号は、ＰＳＥ分析器９１およびピッチ抽出器９２に入
力され、ピッチに関連してパワ−スペクトルのエンベロ
−プが求められる。このスペクトルエンベロ−プから変
換されるインパルス応答は振幅を最大値で正規化されて
ベクトルの次元だけのサンプルがとりだされ、ベクトル
化器９３でベクトル化され、相補型ベクトル量子化器９
４、コ−ドブック９５で２つのベクトルと一つの線形結
合係数としてベクトル量子化される。これらのデ−タは
時分割多重器９６により多重化されて転送される。再生
時にはこれらのデ−タは分配器９７によりそれぞれ分配
され、相補型ベクトル量子化器９８、コ−ドブック９９
で復号され、ＬＰＣ外挿器１００で符号化でのインパル
波形の打ち切りの影響が除去される様に線形予測により
外挿される。最後に波形の合成はＰＳＥ合成器１０１で
インパルス応答のピッチ周期間隔の重ねあわせにより合
成され、出力端子１０２より復号音声信号が得られる。
この方式は、基本的に音声信号を対象とし、ＰＳＥ分析
から得られるピッチのインパルス応答をベクトル量子化
しているが、スペクトル構造の異なるオ−ディオ信号へ
はそのまま適用できない。The second method is fuzzy-vector quantization, in which many code vectors can be combined as a result even though the code vector has a small dimension, but many code vectors and coupling coefficients are transferred. Since it is necessary, two vectors, which are the ultimate form of fuzzy-vector quantization, and the linear combination coefficient connecting them are used. A conventional example will be described with reference to FIG. The audio signal applied to the input terminal 90 is input to the PSE analyzer 91 and the pitch extractor 92, and the envelope of the power spectrum is calculated in relation to the pitch. The impulse response converted from this spectral envelope has its amplitude normalized by the maximum value, samples of only the dimensions of the vector are taken out, vectorized by the vectorizer 93, and the complementary vector quantizer 9
4. In the code book 95, two vectors and one vector combination are quantized as a linear combination coefficient. These data are multiplexed by the time division multiplexer 96 and transferred. During reproduction, these data are respectively distributed by the distributor 97, and the complementary vector quantizer 98 and the codebook 99 are distributed.
And is extrapolated by linear prediction so that the LPC extrapolator 100 removes the effect of truncation of the impulse waveform in encoding. Finally, the PSE synthesizer 101 synthesizes the waveforms by superimposing the pitch period intervals of the impulse response, and a decoded speech signal is obtained from the output terminal 102.
This method basically targets speech signals and vector-quantizes the pitch impulse response obtained from PSE analysis, but cannot be applied to audio signals having different spectral structures.

【０００６】[0006]

【発明が解決しようとする課題】この様に、ベクトル量
子化はデ−タの統計的分布の偏りを利用する能率のよい
量子化法であり、コ−ドブックサイズを大きくすれば、
量子化歪を低減できるが、コ−ドブックの記憶容量や探
索処理量等に問題がでる。また、ファジ−ベクトル量子
化は同一のコ−ドブックを用いて量子化歪を通常のベク
トル量子化よりも小さくできるが、級関数等の情報量が
増大するため符号化への適用はこのままでは困難であ
る。従って、ファジ−ベクトルの究極の形態として２つ
のベクトルと一つの線形結合係数のみで表わすことも試
みられているが、帯域の広いオ−ディオ信号を低い転送
レ−トで符号化すると、スペクトルの平均符号化レ−ト
の低いところでは、ベクトルと線形結合係数に充分なビ
ット割り当てができず、かえって、歪を増大させてしま
うという問題がある。As described above, the vector quantization is an efficient quantization method utilizing the bias of the statistical distribution of data, and if the codebook size is increased,
Quantization distortion can be reduced, but problems occur in the storage capacity of the codebook, the amount of search processing, and the like. Further, the fuzzy-vector quantization can reduce the quantization distortion by using the same codebook as compared with the normal vector quantization, but it is difficult to apply it to the encoding as it is because the amount of information such as a class function increases. Is. Therefore, it has been attempted to represent only two vectors and one linear combination coefficient as the ultimate form of a fuzzy vector, but when a wide band audio signal is encoded at a low transfer rate, the spectrum At a low average coding rate, there is a problem that sufficient bits cannot be assigned to the vector and the linear combination coefficient, which rather increases the distortion.

【０００７】本発明は上記の欠点を解消し、ファジ−ベ
クトル量子化と従来のベクトル量子化を組み合わせるこ
とにより量子化効率を高め、歪の少ない符号化方式を提
供することを目的とする。It is an object of the present invention to solve the above-mentioned drawbacks and to provide a coding method with less distortion by increasing the quantization efficiency by combining fuzzy-vector quantization and conventional vector quantization.

【０００８】[0008]

【課題を解決するための手段】この目的を達成するため
本発明は、入力信号波形のレベルを分散により正規化し
たのち時間領域から周波数領域に変換し、変換スペクト
ルを複数の帯域に分割し、各出力の平均電力を求め、こ
の平均電力により前記各変換スペクトルを正規化し、前
記平均電力を基に補間により各個別のスペクトルの大き
さを推定し、これにより各スペクトルの量子化幅と割り
当てる量子化ビット数を算出し、各帯域のスペクトル数
をベクトルの次元数とし、これに各帯域の平均スペクト
ル符号化レ−トを乗じた数をレベル数としてベクトル量
子化、および、２つのベクトルとそれらを結ぶ一つの線
形結合係数で表されるファジ−ベクトル量子化を行い、
両者の歪の小さい方を選択するものである。According to the present invention, in order to achieve this object, the level of an input signal waveform is normalized by dispersion and then converted from a time domain to a frequency domain, and a transformed spectrum is divided into a plurality of bands. Obtaining the average power of each output, normalizing each of the conversion spectra by this average power, estimating the size of each individual spectrum by interpolation based on the average power, thereby the quantization width of each spectrum and the quantum to be assigned. The number of coded bits is calculated, the number of spectra in each band is used as the number of dimensions of the vector, and the number of levels obtained by multiplying this by the average spectrum coding rate of each band is used as the vector quantization, and two vectors and Fuzzy-vector quantization represented by one linear combination coefficient connecting
The one with smaller distortion is selected.

【０００９】また、上記のファジ−ベクトル量子化にお
いて、２つのベクトルに充分なビット数が割り当てられ
ないときのみ従来のベクトル量子化を適用するものであ
る。Also, in the above fuzzy-vector quantization, the conventional vector quantization is applied only when a sufficient number of bits are not allocated to two vectors.

【００１０】また、各帯域の平均スペクトル符号化レ−
トの大きい帯域ではベクトルの次元数が小さくなる様に
帯域を再分割するものである。Further, the average spectrum coding rate of each band
In the large band, the band is subdivided so that the dimension of the vector becomes smaller.

【００１１】また、周波数の低い帯域ではベクトルの次
元数が小さくなる様に分割するものである。In the low frequency band, the vector is divided so that the dimension of the vector becomes small.

【００１２】[0012]

【作用】転送レ−トを低くするために、ベクトルをファ
ジ−ベクトル量子化の究極の形態である２つのベクトル
とその間を結ぶ線形結合係数のみで表し、帯域の広いオ
−ディオ信号を低い転送レ−トで符号化するときに、転
送レ−トの制限からくるスペクトルの平均符号化レ−ト
の低い帯域では、ベクトルと線形結合係数に充分なビッ
ト割り当てができず、かえって歪を増大させてしまうと
いう問題を、通常のベクトル量子化と２つのベクトルと
線形結合係数で表すファジ−ベクトル量子化を併用し、
歪の小さい方を選択することで、歪を小さくするもので
ある。In order to lower the transfer rate, a vector is represented by only two vectors, which are the ultimate form of fuzzy-vector quantization, and a linear coupling coefficient connecting the two vectors, and a wide band audio signal is transferred at a low rate. When encoding with a rate, in the low band of the average coding rate of the spectrum due to the limitation of the transfer rate, it is not possible to allocate sufficient bits to the vector and the linear combination coefficient, rather increasing the distortion. The problem of being a normal vector quantization and two vectors and fuzzy-vector quantization represented by a linear combination coefficient are used together,
The distortion is reduced by selecting the one with the smaller distortion.

【００１３】また、演算量を削減するために、上記ファ
ジ−ベクトル量子化で転送レ−トから逆算される各帯域
の平均スペクトル符号化レ−トとスペクトル数を乗じた
値に対して、２つのベクトルに充分なビット数が割り当
てられないときにのみ、通常のベクトル量子化を適用し
歪を小さくするものである。Further, in order to reduce the amount of calculation, the value obtained by multiplying the average spectrum coding rate of each band, which is back-calculated from the transfer rate in the fuzzy-vector quantization, by the number of spectra is 2 Only when a sufficient number of bits are not allocated to one vector, normal vector quantization is applied to reduce distortion.

【００１４】また、一旦、ブロックのスペクトル帯域を
均等に分割し、各帯域に割り当てられたビット数が予め
決められたビット数より大きい場合には、帯域をさらに
２等分し、帯域の平均電力と割り当てビット数を計算し
直すことにより、平均電力の大きい帯域ではベクトルの
次元数を抑えて各帯域のコ−ドブックサイズをそろえ、
かつ、コ−ドブックサイズの大きくなるのを避け、ま
た、平均電力の計算精度の向上を図り各スペクトルの符
号化精度の向上を図るものである。Further, once the spectral band of the block is evenly divided, and when the number of bits assigned to each band is larger than the predetermined number of bits, the band is further divided into two parts, and the average power of the band is divided. By recalculating the number of allocated bits, the number of vector dimensions is suppressed and the codebook size of each band is adjusted in the band with high average power.
At the same time, the codebook size is prevented from becoming large, the average power calculation accuracy is improved, and the coding accuracy of each spectrum is improved.

【００１５】また、通常のオ−ディオ信号では、周波数
帯域の低いところが高いところに比べて平均電力が高い
ので、予め決められた低周波数領域を予め決めた分割数
で再分割することにより、符号化精度の向上を図るもの
である。Further, in a normal audio signal, the average power is higher at a low frequency band than at a high frequency band, so that a predetermined low frequency region is redivided by a predetermined number of divisions to obtain a code. It is intended to improve the conversion accuracy.

【００１６】[0016]

【実施例】以下、具体例について詳細に述べる。（図
１）は本発明の適応変換ベクトル量子化符号化法を適用
しうる第１の実施例に係る符号化器のブロック図であ
る。（図１）に従って符号化の手順を説明する。EXAMPLES Specific examples will be described in detail below. FIG. 1 is a block diagram of an encoder according to a first embodiment to which the adaptive transform vector quantization encoding method of the present invention can be applied. The encoding procedure will be described with reference to FIG.

【００１７】広帯域のオ−ディオ信号は入力端子１０に
印加され、一旦バッファ１１に蓄えられる。バッファ１
１内の信号から複数サンプルで構成されるブロックが切
り出され、分散計算器１２で（数１）に基づいて分散が
求められ、量子化器１３で量子化され多重化回路３０へ
入力される。A broadband audio signal is applied to the input terminal 10 and temporarily stored in the buffer 11. Buffer 1
A block composed of a plurality of samples is cut out from the signal in 1, the variance is obtained by the variance calculator 12 based on (Equation 1), and the quantizer 13 quantizes and inputs the variance to the multiplexing circuit 30.

【００１８】[0018]

【数１】 [Equation 1]

【００１９】切り出されたブロックの信号は、先ほど量
子化された分散の値が再生側と同じ操作の逆量子化器１
４で復号化され、それにより可変利得増幅器１５で正規
化される。正規化された信号は直交変換回路１６、例え
ばＤＣＴにより時間領域から（図５（ａ））に示される
様な周波数領域へ変換される。ここで、ｘ（ｋ）は変換
係数である。ＤＣＴ係数（スペクトル）は複数の帯域に
分けられ、その帯域内の平均電力が（数２）により、帯
域内平均電力計算器１７で計算され、（図５（ｂ））に
示される様になる。ここで、ｐ（ｌ）は各帯域の平均電
力である。The signal of the clipped block is the inverse quantizer 1 in which the quantized dispersion value is the same as that on the reproducing side.
4 and thereby normalized by the variable gain amplifier 15. The normalized signal is transformed from the time domain to the frequency domain as shown in FIG. 5A by the orthogonal transformation circuit 16, for example, DCT. Here, x (k) is a conversion coefficient. The DCT coefficient (spectrum) is divided into a plurality of bands, and the average power in that band is calculated by the in-band average power calculator 17 according to (Equation 2), as shown in (FIG. 5B). . Here, p (l) is the average power of each band.

【００２０】[0020]

【数２】 [Equation 2]

【００２１】このようにして求められた平均電力は量子
化器１８で符号化され多重化回路３０に入力される。こ
の量子化された平均電力は逆量子化器１９で復号され、
補間器２０で線形補間される。それをもとに係数の大き
さσ_K（分散）が推定され、これにより正規化利得計算
器２１、ビット割当器２２でＤＣＴ係数に対して適応ビ
ット割り当てが行われる。The average power thus obtained is coded by the quantizer 18 and input to the multiplexing circuit 30. This quantized average power is decoded by the inverse quantizer 19,
Linear interpolation is performed by the interpolator 20. The coefficient size σ _K (variance) is estimated based on this, and the normalized gain calculator 21 and the bit allocator 22 allocate adaptive bits to the DCT coefficients.

【００２２】ここで各帯域への適応ビット割り当ては
（数３）に基づいて計算される。Here, the adaptive bit allocation to each band is calculated based on (Equation 3).

【００２３】[0023]

【数３】 (Equation 3)

【００２４】この後、正規化された係数はベクトル量子
化器２６、ファジ−ベクトル量子化器２４に入力され
る。ファジ−ベクトル量子化器２４では、次の手順に従
って、２つのベクトルとそれらを結ぶ一つの線形結合係
数で、ある帯域のスペクトルに最も近いベクトルが表現
される。まず、ある帯域のスペクトルに最も近いコ−ド
ベクトルが選択され、次に、（数４）に基づいて、予め
選択されているこのコ−ドベクトルに近い複数個の候補
コ−ドベクトルグル−プから一つのベクトルが取り出さ
れる。Thereafter, the normalized coefficient is input to the vector quantizer 26 and the fuzzy-vector quantizer 24. In the fuzzy-vector quantizer 24, the vector closest to the spectrum of a certain band is represented by two vectors and one linear coupling coefficient connecting them according to the following procedure. First, the code vector closest to the spectrum of a certain band is selected, and then based on (Equation 4), a plurality of candidate code vector groups close to this preselected code vector are selected. One vector is fetched.

【００２５】[0025]

【数４】 [Equation 4]

【００２６】ここで、ｙ＾はファジ−ベクトル、ｖ₁は
最も近いコ−ドベクトル、ｖ₂は候補ベクトル、ｍは結
合係数である。ここで、ｖ₁，ｖ₂を線形結合する結合係
数ｍが変化させられ、歪計算器２５で歪が計算される。
歪が単一のベクトルｖ₁よりも増大する場合はその候補
ベクトルは捨てられる。歪が小さくなる場合は、歪が記
憶され、第２の候補ベクトルについて、同様にして歪が
計算される。この様にして歪の最も小さくなる一つのコ
−ドベクトルｖ₂が選択される。これらの候補コ−ドベ
クトルグル−プ内のベクトルは先の最も近いコ−ドベク
トルとの位置関係によっては必ずしも歪が小さくなると
は限らず、かえって増大することもある。この様子を
（図６）に示す。（図６）において、（ａ）は歪が減る
場合、（ｂ）は歪が増える場合を示している。[0026] Here, y ^ is fuzzy - vector, v ₁ is the closest co - a Dobekutoru, v ₂ candidate vectors, m is the coupling coefficient. Here, the coupling coefficient m for linearly coupling v ₁ and v ₂ is changed, and the distortion is calculated by the distortion calculator 25.
If the distortion increases beyond the single vector v ₁ , the candidate vector is discarded. When the distortion is small, the distortion is stored, and the distortion is similarly calculated for the second candidate vector. In this way, one code vector v ₂ with the smallest distortion is selected. The distortion of the vectors in these candidate code vector groups does not always decrease depending on the positional relationship with the nearest code vector, but may increase on the contrary. This is shown in FIG. In FIG. 6, (a) shows the case where the strain decreases, and (b) shows the case where the strain increases.

【００２７】従来のベクトル量子化器２６では、その帯
域に割り当てられたビット数で表現されるコ−ドブック
の中から、その帯域のスペクトルに最も近いコ−ドベク
トルが選択される。ベクトル量子化器２６の出力は歪計
算器２７で歪が計算され、ファジ−ベクトル量子化器２
４の歪の最も小さいものと歪比較器２８で比較される。
これらのうち歪の小さいものが選択回路２９で選択さ
れ、多重化回路３０に入力される。多重化回路３０では
最終的に選択されたベクトルおよび結合係数（場合によ
ってはベクトルのみ）、通常ベクトルかファジ−ベクト
ルかの選択信号、量子化された帯域内平均電力、量子化
された分散情報が時分割多重化される。圧縮されたデ−
タは出力端子３１に送られる。ここで、各ベクトル量子
化器では各帯域の平均スペクトル符号化レ−トとスペク
トル数を乗じた数がレベル数計算器３２で計算され、そ
れに基づいてベクトル化される。In the conventional vector quantizer 26, the code vector closest to the spectrum of the band is selected from the code book represented by the number of bits allocated to the band. The distortion of the output of the vector quantizer 26 is calculated by the distortion calculator 27 and the fuzzy-vector quantizer 2
The smallest distortion of 4 is compared with the distortion comparator 28.
Of these, the one with less distortion is selected by the selection circuit 29 and input to the multiplexing circuit 30. In the multiplexing circuit 30, the finally selected vector and the coupling coefficient (in some cases only vector), the selection signal of the normal vector or the fuzzy-vector, the quantized in-band average power, and the quantized dispersion information are output. Time division multiplexed. Compressed data
Data is sent to the output terminal 31. Here, in each vector quantizer, a number obtained by multiplying the average spectrum coding rate of each band by the number of spectra is calculated by the level number calculator 32, and vectorized based on this.

【００２８】次に本発明の第２の実施例について（図
２）を用いて説明する。（図２）において、（図１）と
同じものは同一の符号を付している。以下では第１の実
施例と異なる点についてのみ説明する。第１の実施例で
は、一つのブロック内のすべての帯域において通常ベク
トルとファジ−ベクトルの歪を計算し、それらの歪の小
さい方を求めていたが、第２の実施例ではファジ−ベク
トル量子化に重点をおいて、（図２）に示される様に、
各帯域の平均電力から線形補間によって求められた各係
数への割り当てビット数を基に、その帯域の平均スペク
トル符号化レ−トとその帯域のスペクトル数を掛け合わ
せたレベル数をレベル数計算器３２で求め、ブロック内
の各帯域に割り当てられるレベル数（ビット数）から、
ビット数判定器３３により、予め決められた精度を有す
る結合係数ｍのビット数を差し引いた残りの（ｖ₁＋
ｖ₂）のビット数が、予め決められた２つのベクトルの
ビット数に満たない場合に限って、通常のベクトル量子
化を採用するものである。簡単化のためにはブロック内
の各帯域に割り当てられる（ｖ₁＋ｖ₂）のビット数が、
予め決められた結合係数ｍのビット数の２倍以下の場合
に限って通常のベクトル量子化が採用される。Next, a second embodiment of the present invention will be described with reference to FIG. In (FIG. 2), the same components as those in (FIG. 1) are designated by the same reference numerals. Only the points different from the first embodiment will be described below. In the first embodiment, the distortions of the normal vector and the fuzzy-vector are calculated in all the bands in one block, and the smaller one is calculated, but in the second embodiment, the fuzzy-vector quantum is calculated. As shown in (Fig. 2),
Based on the number of bits assigned to each coefficient obtained by linear interpolation from the average power of each band, the number of levels calculated by multiplying the average spectrum coding rate of that band by the number of spectra of that band 32, and from the number of levels (number of bits) assigned to each band in the block,
The bit number determiner 33 subtracts the number of bits of the coupling coefficient m having a predetermined precision from the remaining (v ₁ +
The normal vector quantization is adopted only when the number of bits of v ₂ ) is less than the predetermined number of bits of two vectors. For simplification, the number of bits of (v ₁ + v ₂ ) allocated to each band in the block is
Normal vector quantization is adopted only when the number of bits of the predetermined coupling coefficient m is not more than twice.

【００２９】また、本発明の第３の実施例を（図３）を
用いて説明する。なお、（図２）と同一ブロックには同
一符号を付しており詳細な説明は省略する。A third embodiment of the present invention will be described with reference to FIG. The same blocks as those in (FIG. 2) are designated by the same reference numerals, and detailed description thereof will be omitted.

【００３０】はじめにブロックのスペクトル帯域が均等
に分割され、通常の適応変換符号化と同じ様に適応ビッ
ト割り当てされる。このときのビット数がレベル数計算
器３２で計算され、ビット数判定器３３で予め決められ
たビット数より大きいと判定された場合には、ビット数
判定器３３より再分割指示信号が直交変換回路１６に直
結された不均等分割器３４に送られ、それによりその帯
域がさらに２等分され、その帯域の平均電力が計算し直
され、それに基づいて再度割り当てビット数も計算し直
される。それ以後の処理は第２の実施例と同じ処理が実
行されるものである。これは各帯域ごとの平均転送レ−
トを計算し、これが大きい場合には再度分割し直し、平
均電力の大きい帯域ではベクトルの次元数を抑えて各帯
域のコ−ドブックサイズをそろえ、かつ、コ−ドブック
サイズの大きくなるのを避け、また、平均電力の計算精
度の向上により各スペクトルの推定精度の向上を図るも
のである。ここでは、平均電力の大きい順での帯域の入
れ替えはこの入れ替え情報の転送情報が増えるので行わ
ない。この処理を第１の実施例に適用することも可能で
ある。First, the spectral band of the block is equally divided, and adaptive bits are allocated in the same manner as in normal adaptive transform coding. If the number of bits at this time is calculated by the level number calculator 32 and the number of bits determiner 33 determines that the number of bits is larger than the predetermined number of bits, the number of bits determiner 33 performs the orthogonal transformation on the re-division instruction signal. It is sent to the unequal divider 34 which is connected directly to the circuit 16, whereby the band is further divided into two equal parts, the average power of the band is recalculated, and the number of allocated bits is again calculated based on it. The subsequent processing is the same as that of the second embodiment. This is the average transfer rate for each band.
, And if the average power is large, re-divide again, reduce the number of vector dimensions in the band with high average power to make the codebook size of each band uniform, and avoid increasing the codebook size. Further, the accuracy of estimation of each spectrum is improved by improving the accuracy of calculation of average power. Here, the switching of the bands in the descending order of average power is not performed because the transfer information of this switching information increases. It is also possible to apply this processing to the first embodiment.

【００３１】次に本発明の第４の実施例について（図
４）を用いて説明する。なお、（図３）と同じブロック
には同一の符号を付している。通常のオ−ディオ信号で
は、周波数帯域の低いところが高いところに比べて平均
電力が高いので、予め決められた低周波数領域が不均等
帯域分割器３４で、予め決められた分割数で再分割され
る。これも第３の実施例と同じ目的である。Next, a fourth embodiment of the present invention will be described with reference to FIG. The same blocks as those in (FIG. 3) are designated by the same reference numerals. Since a normal audio signal has a higher average power in a low frequency band than in a high frequency band, a predetermined low frequency region is subdivided by the unequal band divider 34 by a predetermined division number. It This is also the same purpose as the third embodiment.

【００３２】各実施例の復号化は基本的に符号化の逆の
処理が実行される。例として、第１の実施例の復号化処
理ブロック図を（図７）に示す。動作および内容は明ら
かであるので説明は省略する。（図２）乃至（図４）の
符号化処理に対する復号化では、（図７）にビット数判
定器が加えられ、選択信号の転送は不要となる。In the decoding of each embodiment, basically the reverse processing of the encoding is executed. As an example, a block diagram of the decoding process of the first embodiment is shown in FIG. Since the operation and contents are clear, the description is omitted. In the decoding for the encoding process of (FIG. 2) to (FIG. 4), the bit number determiner is added to (FIG. 7), and the transfer of the selection signal becomes unnecessary.

【００３３】上記の様な符号化方法により、広い帯域を
有するオ−ディオ信号を低い、例えば、１２８ｋｂｐｓ
以下の転送レ−トで高品位に符号化することができる。By the encoding method as described above, an audio signal having a wide band is low, for example, 128 kbps.
High-quality coding can be performed with the following transfer rate.

【００３４】[0034]

【発明の効果】以上の様に、本発明は、直交変換された
係数に適応量子化幅、適応ビット割り当てを行い、それ
に通常のベクトル量子化と、２つのベクトルとそれらを
線形結合する結合係数で表されるファジ−ベクトル量子
化を適応的に組み合わせることにより、歪を抑え、か
つ、低転送レ−トで符号化できるものである。また、直
交変換された係数のスペクトル帯域を適応的に分割し、
各帯域のコ−ドブックサイズをそろえ、かつ、コ−ドブ
ックの記憶容量を小さく抑えながら、高品位にオ−ディ
オ信号を符号化できるものである。As described above, according to the present invention, adaptive quantization width and adaptive bit allocation are performed on orthogonally transformed coefficients, normal vector quantization is performed, and two vectors and a combination coefficient for linearly combining them. By adaptively combining the fuzzy-vector quantization represented by, the distortion can be suppressed and the encoding can be performed at a low transfer rate. Also, the spectral band of the orthogonally transformed coefficients is adaptively divided,
The audio signal can be encoded with high quality while keeping the codebook size of each band uniform and suppressing the storage capacity of the codebook small.

[Brief description of drawings]

【図１】本発明の第１の実施例の符号化処理ブロック図
である。FIG. 1 is a block diagram of an encoding process according to a first embodiment of this invention.

【図２】本発明の第２の実施例の符号化処理ブロック図
である。FIG. 2 is a block diagram of a coding process according to a second embodiment of the present invention.

【図３】本発明の第３の実施例の符号化処理ブロック図
である。FIG. 3 is a block diagram of an encoding process according to a third embodiment of the present invention.

【図４】本発明の第４の実施例の符号化処理ブロック図
である。FIG. 4 is a block diagram of an encoding process according to a fourth embodiment of the present invention.

【図５】（ａ）従来例あるいは本発明の実施例で処理さ
れる直交変換（ＤＣＴ）による時間領域から周波数領域
に変換された係数（スペクトラム）の説明図である。（ｂ）符号化や復号化において、直交変換（ＤＣＴ）に
より変換された係数を量子化したり逆量子化するための
補助情報としての各帯域の平均電力の説明図である。（ｃ）平均電力から推定された各スペクトルの大きさの
説明図である。5A is an explanatory diagram of coefficients (spectrum) converted from a time domain to a frequency domain by orthogonal transform (DCT) processed in a conventional example or an embodiment of the present invention. FIG. (B) In encoding and decoding, it is explanatory drawing of the average power of each band as auxiliary information for quantizing or dequantizing the coefficient transformed by orthogonal transformation (DCT). (C) It is explanatory drawing of the magnitude | size of each spectrum estimated from the average electric power.

【図６】（ａ）２つのベクトルとそれらを線形結合した
ファジ−ベクトルにおける２つのベクトルにおいて歪が
減る場合のベクトル配置と歪との関係の説明図である。（ｂ）２つのベクトルとそれらを線形結合したファジ−
ベクトルにおける２つのベクトルにおいて歪が増える場
合のベクトル配置と歪との関係の説明図である。FIG. 6A is an explanatory diagram of a relation between vector arrangement and distortion when distortion is reduced in two vectors in a fuzzy vector in which two vectors are linearly combined. (B) Two vectors and fuzzy linear combination of them
It is an explanatory view of the relation between vector arrangement and distortion when distortion increases in two vectors in a vector.

【図７】本発明の第１の実施例の復号化処理ブロック図
である。FIG. 7 is a block diagram of a decoding process according to the first embodiment of this invention.

【図８】従来例の適応変換符号化の符号化処理ブロック
図である。FIG. 8 is a block diagram of a coding process of a conventional adaptive transform coding.

[Explanation of symbols]

１０入力端子１１バッファ１２分散計算器１３，１８量子化器１４，１９逆量子化器１５，２３可変利得増幅器１６直交変換回路１７帯域内平均電力計算器２０補間器２１正規化利得計算器２２ビット割当器２４ファジ−ベクトル量子化器２５，２７歪計算器２６ベクトル量子化器２８歪比較器２９選択回路３０多重化回路３１出力端子３２レベル数計算器３３ビット数判定器３４不均等分割器７０入力端子７１分配器７２ファジ−ベクトル量子化器７３ベクトル化器７４選択回路７５，８１逆量子化器７６補間器７７正規化利得計算器７８ビット割当器７９，８２可変利得増幅器８０逆直交変換回路８３バッファ８４出力端子８５レベル数計算器 10 Input Terminals 11 Buffer 12 Variance Calculator 13, 18 Quantizer 14, 19 Inverse Quantizer 15, 23 Variable Gain Amplifier 16 Orthogonal Transform Circuit 17 In-Band Average Power Calculator 20 Interpolator 21 Normalized Gain Calculator 22 bits Allocator 24 Fuzzy-vector quantizer 25, 27 Distortion calculator 26 Vector quantizer 28 Distortion comparator 29 Selection circuit 30 Multiplexing circuit 31 Output terminal 32 Level number calculator 33 Bit number determiner 34 Unequal divider 70 Input terminal 71 Distributor 72 Fuzzy-vector quantizer 73 Vectorizer 74 Selection circuit 75,81 Inverse quantizer 76 Interpolator 77 Normalization gain calculator 78 Bit allocator 79,82 Variable gain amplifier 80 Inverse orthogonal transformation circuit 83 buffer 84 output terminal 85 level number calculator

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁶ 識別記号庁内整理番号ＦＩ技術表示箇所Ｈ０４Ｂ 14/00 Ｅ ─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁶ Identification code Internal reference number FI Technical indication H04B 14/00 E

Claims

[Claims]

1. The variance of an input voice or an audio signal is obtained, the input signal is normalized, then the time domain is transformed into the frequency domain, and the transformed spectrum is divided into a plurality of bands.
Obtaining the average power of each output, normalizing each of the conversion spectra by this average power, estimating the size of each individual spectrum by interpolation based on the average power, thereby the quantization width of each spectrum and the quantum to be assigned. In the adaptive transform coding method for calculating the number of coded bits, the number of spectra in each band is defined as the number of dimensions of the vector, and the number of levels obtained by multiplying this by the average spectrum coding rate of each band is vector-quantized, and An adaptive transform vector quantization coding method characterized by performing fuzzy-vector quantization represented by two linear vectors and one linear combination coefficient connecting them, and selecting the one having smaller distortion.

2. The variance of the input voice or audio signal is obtained, the input signal is normalized, then the time domain is transformed into the frequency domain, and the transformed spectrum is divided into a plurality of bands.
Obtaining the average power of each output, normalizing each of the conversion spectra by this average power, estimating the size of each individual spectrum by interpolation based on the average power, thereby the quantization width of each spectrum and the quantum to be assigned. In the adaptive transform coding method for calculating the number of coded bits, the number of spectra in each band is defined as the number of dimensions of the vector, and the number obtained by multiplying this by the average spectrum coding rate of each band is defined as the number of levels. An adaptive transform vector quantum characterized by performing fuzzy-vector quantization represented by one linear combination coefficient connecting them and applying conventional vector quantization when a sufficient number of bits is not allocated to two vectors. Coding method.

3. Obtaining the variance of an input voice or audio signal, normalizing the input signal, converting it from the time domain to the frequency domain, dividing the transformed spectrum into a plurality of bands,
Obtaining the average power of each output, normalizing each of the conversion spectra by this average power, estimating the size of each individual spectrum by interpolation based on the average power, thereby the quantization width of each spectrum and the quantum to be assigned. In the adaptive transform coding method for calculating the number of coded bits, vector quantization and fuzzy-vector quantization represented by one linear coefficient connecting the two vectors are applied to adaptively switch the coding method. An adaptive transform vector quantization coding method characterized in that the band is subdivided so that the number of dimensions of the vector becomes small in the band having a large average spectrum coding rate of each band.

4. Obtaining the variance of an input voice or audio signal, normalizing the input signal, converting it from the time domain to the frequency domain, dividing the transformed spectrum into a plurality of bands,
Obtaining the average power of each output, normalizing each of the conversion spectra by this average power, estimating the size of each individual spectrum by interpolation based on the average power, thereby the quantization width of each spectrum and the quantum to be assigned. In the adaptive transform coding method for calculating the number of quantization bits, vector quantization and 2
Applying fuzzy-vector quantization expressed by one vector and one linear coefficient connecting them, and adaptively switching coding method, the division is performed so that the dimensionality of the vector becomes small in the low frequency band. Characteristic adaptive transform vector quantization coding method.