JPH09120300A

JPH09120300A - Vector quantization device

Info

Publication number: JPH09120300A
Application number: JP7277367A
Authority: JP
Inventors: Masanao Suzuki; 政直鈴木; Takashi Ota; 恭士大田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1995-10-25
Filing date: 1995-10-25
Publication date: 1997-05-06

Abstract

PROBLEM TO BE SOLVED: To provide a vector quantization device that can vector-quantize a speech signal efficiently. SOLUTION: Analysis means 21 performs linear prediction analysis on an input speech signal S2 at different analysis window positions within the same frame to obtain first LSP (line spectrum pair) coefficients x_i and second LSP coefficients y_i, where the order i takes multiple values. A quantizer 22 vector-quantizes the first LSP coefficients x_i, an inverse quantizer 23 inversely quantizes the output quantized value CODE1 of the quantizer 22, and means 24 obtains a difference vector Δ_i consisting of the differences between the dequantized values x_qi and the second LSP coefficients y_i of the same order. Means 26 determines a plurality of properties of the signal S2 to obtain mode information m0 to mK corresponding to those properties, means 50 selects among quantizers 51 to 53 according to m0 to mK, and the selected quantizer quantizes Δ_i, so that the quantized value CODE2 is transmitted.

Description

Detailed Description of the Invention

[0001]

[Technical Field of the Invention] The present invention relates to a device that encodes speech by compressing the information in a speech signal, and more particularly to a vector quantization device that extracts LSP (Line Spectrum Pair) coefficients, which represent the spectral envelope of speech, and vector-quantizes them.

[0002] In recent years, devices of this kind have come into wide use not only in wired telephony but also in digital mobile communications such as mobile phones and car phones, and there is a demand for compressing speech efficiently while maintaining speech quality.

[0003]

[Prior Art] In speech coders, the LPC coefficients (linear prediction coefficients), which represent the frequency spectrum envelope of the speech signal, are commonly transmitted by first converting them into LSP coefficients, which are equivalent to the LPC coefficients but have better quantization and interpolation properties, and then quantizing the LSP coefficients.

[0004] In the speech coding schemes adopted in recent digital mobile communications, such as mobile phones and car phones, vector quantization is used to quantize the LSP coefficients in order to improve quantization efficiency further.

[0005] FIG. 10 is a block diagram of a conventional vector quantization device. In FIG. 10, reference numeral 11 denotes a codebook, 12 a selection switch, 13 an error minimization unit, and 14 a subtractor.

[0006] The input signal S1 is an LSP coefficient vector consisting of 10 LSP coefficients. The codebook 11 stores L code vectors of the same dimension (10) as the LSP coefficient vector S1, and the L code vectors are assigned the indices 1 to L.

[0007] The subtractor 14 obtains the error between the two vectors by subtracting the code vector currently selected by the switch 12 from the LSP coefficient vector S1; the switch 12 selects the code vectors one after another. The error minimization unit 13 controls the switch 12 so that the code vector with the smallest error is selected from the codebook 11.

[0008] When vector quantization is performed with this configuration, the code vector whose error (for example, Euclidean distance) from the LSP coefficient vector S1 is smallest is selected from among the L code vectors. The transmitted information is compressed because only the index (one of 1 to L) of the selected code vector is transmitted.
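The full-search procedure of FIG. 10 can be sketched in a few lines of Python. This is only an illustrative sketch: the function names `squared_error` and `full_search_vq` and the 2-dimensional toy codebook are assumptions made for the example (the text uses 10-dimensional vectors), and squared Euclidean distance is used as the error measure, which the text mentions only as one possibility.

```python
from typing import List, Sequence, Tuple


def squared_error(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def full_search_vq(
    lsp: Sequence[float], codebook: List[Sequence[float]]
) -> Tuple[int, Sequence[float]]:
    """Return the 1-based index and the code vector closest to `lsp`.

    Every code vector is compared with the input, mirroring the
    switch 12 / subtractor 14 / error minimization unit 13 loop.
    """
    best = min(range(len(codebook)), key=lambda k: squared_error(lsp, codebook[k]))
    return best + 1, codebook[best]  # indices run from 1 to L, as in the text


# Toy codebook with L = 3 entries (hypothetical values, 2 dimensions only).
toy_codebook = [[0.1, 0.2], [0.4, 0.5], [0.8, 0.9]]
index, code_vector = full_search_vq([0.42, 0.47], toy_codebook)
```

Only `index` would be sent to the receiver, which looks the code vector up in an identical codebook.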

[0009]

[Problem to Be Solved by the Invention] Speech coding based on the conventional vector quantization device described above is widely used in both wired and wireless systems.

[0010] In wireless systems, spectrum must be used efficiently, so reducing the transmission rate is essential. To increase the compression gain of vector quantization, there is therefore a tendency to lengthen the coding frame, which is the unit of coding (the number of input speech samples encoded in one coding operation).

[0011] However, a longer coding frame can make the coder worse at tracking changes in the speech over time. A common countermeasure is to vector-quantize in units of subframes obtained by dividing the frame further, but this has the contradictory effect of increasing the amount of information to be transmitted. This has been a serious problem for the quantization of linear prediction coefficients, which is indispensable in speech coding.

[0012] Various techniques have been proposed to solve this problem, for example matrix quantization, predictive vector quantization, and differential vector quantization.

[0013] Matrix quantization is an extension of vector quantization in which several vectors are treated as one matrix and quantized together. It achieves a high degree of compression, but the distance between matrices must be computed once for every matrix in the codebook, so the amount of computation becomes enormous.

[0014] Predictive vector quantization and differential vector quantization compress information by exploiting the correlation between the LSP coefficients of adjacent frames. When these techniques are used in wireless systems, however, transmission errors become a problem.

[0015] In wireless systems, transmission errors caused by fading and similar effects are unavoidable. Because predictive vector quantization and differential vector quantization use past LSP coefficients, a transmission error that corrupts the LSP coefficients continues to affect the subsequent frames, which is a serious problem.

[0016] In a speech coder, errors in the LSP coefficients are fatal: if speech is reproduced with incorrect LSP coefficients, it is highly likely that a signal completely different from the input speech will be reproduced.

[0017] The present invention has been made in view of these points, and its object is to provide a vector quantization device capable of vector-quantizing a speech signal efficiently.

[0018]

[Means for Solving the Problem] FIG. 1 shows the principle of the present invention. The vector quantization device shown in this figure analyzes an input speech signal S2 to extract linear prediction coefficients, then quantizes and transmits them. The device is characterized by comprising: analysis means 21 for obtaining first LSP coefficients x_i and second LSP coefficients y_i, where the order i takes multiple values, by performing linear prediction analysis on the input speech signal S2 at different analysis window positions within the same frame; a vector quantizer 22 that vector-quantizes the first LSP coefficients x_i; an inverse quantizer 23 that inversely quantizes the output quantized value CODE1 of the vector quantizer 22; difference detection means 24 for obtaining a difference vector Δ_i consisting of the differences, order by order, between the dequantized values x_qi output by the inverse quantizer 23 and the second LSP coefficients y_i; determination means 26 for determining a plurality of properties of the input speech signal S2 and obtaining mode information m0 to mK corresponding to the determined property; first to K-th difference vector quantizers 51 to 53 corresponding to the plurality of properties of the input speech signal S2; and selection means 50 for selecting, from the first to K-th difference vector quantizers 51 to 53, the one corresponding to the mode information m0 to mK output from the determination means 26 and supplying the difference vector Δ_i to the selected quantizer. The difference vector quantized by the first to K-th difference vector quantizers 51 to 53 is transmitted as CODE2.

[0019]

[Embodiments of the Invention] An embodiment of the present invention is described below with reference to the drawings. FIG. 2 is a block diagram of a vector quantization device according to an embodiment of the present invention.

[0020] In FIG. 2, reference numeral 21 denotes an LPC analysis unit, 22 a vector quantizer, 23 an inverse quantizer, 24, 27, and 28 subtractors, 25 a vector division unit, 26 a mode determination unit, 29 and 30 error minimization units, 31, 32, 33, 34, 35, 36, 37, 38, 39, and 40 selection switches, and B0, B1, B2, B3, C0, C1, C2, and C3 codebooks.

[0021] Consider the case in which a vector quantization device made up of these components vector-quantizes the LSP coefficients obtained by performing LPC analysis several times (twice in this example) within one frame of the input speech signal S2, as shown in FIG. 3. LPC analysis is a technique that estimates the frequency characteristics of a speech signal by exploiting the short-term correlation between speech samples. In other words, it estimates, from the input speech, the coefficients of a filter that approximates the characteristics of the vocal tract in a speech production model.

[0022] Performing LPC analysis several times in this way improves speech quality. However, as explained in the description of the prior art, if vector quantization is also performed several times per frame, the amount of information to be transmitted grows in proportion to the number of analyses compared with performing LPC analysis only once.

[0023] The LPC analysis unit 21 shown in FIG. 2 illustrates the case in which LPC analysis is performed twice. As shown in FIG. 3, the LSP coefficients obtained by the first and second LPC analyses are denoted LSP1_i (i = 1, …, 10) and LSP2_i (i = 1, …, 10), respectively.

[0024] The LSP1_i obtained in this way is vector-quantized by the vector quantizer 22, which determines the index CODE1 to be transmitted. Next, with CODE1 as input, the inverse quantizer 23 obtains the dequantized values LSP1q_i (i = 1, …, 10) of LSP1_i. The inverse quantizer 23 used here is identical to the inverse quantizer used on the receiving (decoder) side.

[0025] The subtractor 24 obtains the difference Δ_i between LSP1q_i and LSP2_i, where

Δ_i = LSP1q_i − LSP2_i, (i = 1, …, 10)   (1)

The subtraction may also be performed in the opposite direction:

Δ_i = LSP2_i − LSP1q_i, (i = 1, …, 10)   (2)

The following description assumes equation (1).
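As an illustration of equations (1) and (2), the subtractor 24 can be sketched as an element-wise subtraction. The function name `difference_vector`, the `reversed_sign` flag, and the 3-element toy vectors are assumptions made for this sketch; the text itself works with 10 coefficients per vector.

```python
from typing import List, Sequence


def difference_vector(
    lsp1q: Sequence[float], lsp2: Sequence[float], reversed_sign: bool = False
) -> List[float]:
    """Order-by-order difference between dequantized LSP1 and LSP2."""
    if len(lsp1q) != len(lsp2):
        raise ValueError("LSP vectors must have the same order")
    if reversed_sign:
        return [y - xq for xq, y in zip(lsp1q, lsp2)]  # equation (2)
    return [xq - y for xq, y in zip(lsp1q, lsp2)]      # equation (1)


# Hypothetical normalized LSP values (3 orders only, for brevity).
lsp1q = [0.10, 0.20, 0.35]
lsp2 = [0.12, 0.17, 0.35]
delta = difference_vector(lsp1q, lsp2)
delta_reversed = difference_vector(lsp1q, lsp2, reversed_sign=True)
```

Whichever sign convention is chosen, the decoder has to apply the matching one when it reconstructs LSP2_i.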

[0026] The value of each order of the LSP coefficients always lies between 0 and π, where π corresponds to the Nyquist frequency. Normalized by π, the LSP coefficients therefore lie between 0 and 1.

[0027] By contrast, the value of each order of the difference Δ_i obtained with equation (1) is known to lie in a narrower range, roughly between −0.25 and 0.25. That is, the range over which Δ_i is distributed is about half the range of the original LSP coefficients (here, LSP2_i).

[0028] The distribution of Δ_i also differs with the nature of the speech. Quantizing Δ_i while taking the nature of the speech into account is therefore more efficient than simply quantizing LSP2_i.

[0029] For this purpose, mode information representing the nature of the speech signal S2 in the frame currently being encoded is used. According to its nature, the speech signal S2 can be divided into several modes, such as a voiced portion, an unvoiced portion, an unvoiced-to-voiced transition portion, and a voiced-to-unvoiced transition portion.

[0030] The distribution of Δ_i also differs from mode to mode. By preparing a codebook optimized for Δ_i for each speech property, that is, for each mode, quantization can be performed more efficiently than with conventional vector quantization. The codebooks must be trained in advance.

[0031] As described above, the present invention divides the input speech signal S2 into a plurality of modes according to the nature of the speech and efficiently vector-quantizes the difference between LSP coefficients using a codebook prepared for each mode. The same reasoning holds if LSP1_i and LSP2_i are interchanged.

[0032] The advantage of obtaining the difference Δ_i between the LSPs (LSP1q_i and LSP2_i) and quantizing that difference with a vector quantizer that differs from mode to mode (corresponding to the codebooks B0 to B3 and C0 to C3) is explained next.

[0033] FIGS. 4 and 5 show histograms of the LSP difference Δ_i. The LSPs used here were obtained by LPC analysis of 400 short sentences (about 30 minutes of data) A/D-converted at a sampling frequency of 8 kHz. The analysis order was 10.

[0034] FIG. 4 shows Δ_i for voiced portions of speech, and FIG. 5 shows Δ_i for portions where the speech changes from voiced to unvoiced. For ease of explanation, FIGS. 4 and 5 show only the 1st, 2nd, 7th, and 8th orders as examples.

[0035] As is clear from FIGS. 4 and 5, the mean of Δ_i is approximately 0 and the values are concentrated near the mean. This is because the LSP coefficients change little over time.

[0036] The distribution also differs with the order of Δ_i: lower-order differences tend to have smaller variance and higher-order differences larger variance. The shape of the distribution of Δ_i also differs from mode to mode.

[0037] Accordingly, high quantization efficiency is obtained if the vector division unit 25 divides Δ_i (orders 1 to 10) into two vectors, a low-order vector Δ_L (for example, orders 1 to 4) and a high-order vector Δ_H (for example, orders 5 to 10), and each of the vectors Δ_L and Δ_H is quantized with the quantizer best suited to the nature of the speech in the input frame. The number of parts into which Δ_i is divided may be two or more.
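The division performed by the vector division unit 25 amounts to slicing the 10-element difference vector. The sketch below assumes the example split given in the text (orders 1 to 4 and 5 to 10); the function name `split_difference` and the `split_at` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple


def split_difference(
    delta: Sequence[float], split_at: int = 4
) -> Tuple[List[float], List[float]]:
    """Split delta into a low-order part (orders 1..split_at) and the rest."""
    if not 0 < split_at < len(delta):
        raise ValueError("split_at must fall strictly inside the vector")
    return list(delta[:split_at]), list(delta[split_at:])


# Placeholder values 1..10 stand in for the orders so the split is easy to see.
delta_low, delta_high = split_difference(list(range(1, 11)))
```

Splitting keeps each sub-codebook small, and it lets the low-order and high-order parts, whose variances differ, be quantized with separately trained codebooks.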

[0038] Specifically, the switch 31 selects one of the codebooks B0 to B3 according to the mode information mode0 to mode3 output from the mode determination unit 26. The code vectors of the selected codebook (for example, B0) are selected one after another by the switch 32, and the subtractor 27 subtracts the currently selected code vector from Δ_L to obtain the error between the two. The error minimization unit 29 controls the switch 32 so that the code vector with the smallest error is selected from the codebook B0. The codebooks C0 to C3 are controlled in the same way, so their description is omitted.

[0039] The operation of the vector quantization device configured in this way is described next. The input speech signal S2 is processed in units of frames consisting of a fixed number of samples. The speech signal S2 within a frame is input to the LPC analysis unit 21, which performs LPC analysis twice at different analysis window positions and obtains two sets of LSP coefficients, LSP1_i and LSP2_i (i = 1, …, 10).

[0040] The LSP coefficients may also be calculated by first obtaining the LPC coefficients (α coefficients) or the PARCOR coefficients and then converting them into LSP coefficients. Meanwhile, the mode determination unit 26 analyzes the nature of the speech signal S2 to determine the mode and outputs the mode information mode0 to mode3. Any previously proposed method can be used to determine the mode.

[0041] Here, as described above, speech is classified into four modes, namely the unvoiced portion, the voiced-to-unvoiced transition portion, the unvoiced-to-voiced transition portion, and the voiced portion, which are numbered 0, 1, 2, and 3, respectively. The mode information therefore takes one of the four values mode0 to mode3.
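The mode numbering can be written down as an enumeration. The text leaves the decision method open ("any previously proposed method"), so the helper `mode_from_voicing` below is purely hypothetical: it assumes that a voiced/unvoiced flag is already available for the start and the end of the frame, which is an assumption of this sketch and not something the text specifies.

```python
from enum import IntEnum


class Mode(IntEnum):
    """Mode numbers 0 to 3 as assigned in the text."""
    UNVOICED = 0
    VOICED_TO_UNVOICED = 1
    UNVOICED_TO_VOICED = 2
    VOICED = 3


def mode_from_voicing(start_voiced: bool, end_voiced: bool) -> Mode:
    """Hypothetical mapping from two voicing flags to a mode number."""
    if start_voiced and end_voiced:
        return Mode.VOICED
    if start_voiced:
        return Mode.VOICED_TO_UNVOICED
    if end_voiced:
        return Mode.UNVOICED_TO_VOICED
    return Mode.UNVOICED


mode = mode_from_voicing(start_voiced=True, end_voiced=False)
```

Because `Mode` is an `IntEnum`, its members can be used directly as positions in a list of per-mode codebooks.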

[0042] LSP1_i is quantized by quantizer 1 (the vector quantizer 22 in FIG. 2), yielding the code CODE1 as the quantization result. Any previously proposed quantization technique can be used for quantizer 1.

[0043] Next, CODE1 is input to the inverse quantizer 23 to obtain the quantized values LSP1q_i of LSP1_i. The subtractor 24 then obtains the difference Δ_i between LSP1q_i and LSP2_i according to the following equation:

Δ_i = LSP1q_i − LSP2_i, (i = 1, …, 10)   (3)

Δ_i is input to the vector division unit 25, which generates two vectors: Δ_L, consisting of the low-order side (for example, orders 1 to 4), and Δ_H, consisting of the high-order side (for example, orders 5 to 10).

[0044] Next, Δ_L and Δ_H are each vector-quantized, and the codebooks B0 to B3 and C0 to C3 used for this are selected according to the mode information mode0 to mode3. The codebooks B0, B1, B2, and B3 are the codebooks for Δ_L and correspond to modes 0, 1, 2, and 3, respectively.

[0045] The codebooks C0, C1, C2, and C3 are the codebooks for Δ_H and correspond to modes 0, 1, 2, and 3, respectively. Each codebook is assumed to have been trained in advance for its mode.

[0046] The vector quantization of Δ_L and Δ_H is described taking mode3 as an example, that is, the case in which the input speech signal S2 is voiced. First, the switch 31 selects the codebook B3 for Δ_L according to the mode3 information. The subtractor 27 calculates the error between Δ_L and every code vector stored in the codebook B3, the switch 35, controlled by the error minimization unit 29, selects the code vector with the smallest error, and the index of the selected code vector is transmitted.
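The mode-switched search can be sketched as follows. This is a toy illustration under stated assumptions: the function names `nearest_index` and `encode_difference`, the 2-dimensional sub-vectors, and all codebook values are invented for the example, trained codebooks are not available from the text, and the returned indices are 0-based list positions.

```python
from typing import List, Sequence, Tuple

Codebook = List[Sequence[float]]


def nearest_index(target: Sequence[float], codebook: Codebook) -> int:
    """0-based position of the code vector with the smallest squared error."""
    return min(
        range(len(codebook)),
        key=lambda k: sum((t - c) ** 2 for t, c in zip(target, codebook[k])),
    )


def encode_difference(
    delta_low: Sequence[float],
    delta_high: Sequence[float],
    mode: int,
    low_codebooks: List[Codebook],
    high_codebooks: List[Codebook],
) -> Tuple[int, int]:
    """Pick the B/C codebook for `mode`, then full-search each sub-vector."""
    return (
        nearest_index(delta_low, low_codebooks[mode]),
        nearest_index(delta_high, high_codebooks[mode]),
    )


# Hypothetical codebooks B0..B3 (low order) and C0..C3 (high order).
low_codebooks = [
    [[0.0, 0.0], [0.05, -0.05]],
    [[0.0, 0.0], [0.10, 0.10]],
    [[0.0, 0.0], [-0.10, -0.10]],
    [[0.0, 0.0], [0.01, -0.01]],
]
high_codebooks = [
    [[0.0, 0.0], [0.10, 0.10]],
    [[0.0, 0.0], [0.15, -0.05]],
    [[0.0, 0.0], [-0.15, 0.05]],
    [[0.0, 0.0], [0.03, 0.02]],
]
code2 = encode_difference([0.012, -0.008], [0.001, 0.002], 3, low_codebooks, high_codebooks)
```

Here mode 3 selects B3 and C3, and `code2` holds the two indices that would be transmitted for the frame.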

[0047] The Euclidean distance can be used as the error measure. To reduce the processing load, some form of preselection may also be performed so that the error is evaluated against only a subset of the code vectors in the codebook.

[0048] The processing of Δ_H is the same as that of Δ_L except that the codebook C3 is selected, so its description is omitted. The processing when another mode is selected is also the same except that different codebooks are used, so its description is likewise omitted.

[0049] On the decoding (receiving) side, holding exactly the same codebooks makes it easy to reproduce the quantized LSP values from the received indices. This embodiment requires as many codebooks as there are modes, but because it exploits the skewed distributions of Δ_L and Δ_H, it achieves efficient quantization with smaller codebooks (that is, with less transmitted information) than conventional vector quantization, which quantizes the LSP coefficients directly.
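The decoder side can be sketched by rearranging equation (1): since Δ_i = LSP1q_i − LSP2_i, a reconstruction of LSP2_i is LSP1q_i minus the quantized difference. The function name `decode_lsp2`, the dictionary-of-codebooks layout, the 0-based indices, and the toy values are assumptions of this sketch; the text only states that the decoder holds identical codebooks.

```python
from typing import Dict, List, Sequence

Codebook = List[Sequence[float]]


def decode_lsp2(
    lsp1q: Sequence[float],
    low_index: int,
    high_index: int,
    mode: int,
    low_codebooks: Dict[int, Codebook],
    high_codebooks: Dict[int, Codebook],
) -> List[float]:
    """Rebuild the quantized LSP2 from LSP1q and the two received indices."""
    # Concatenate the low-order and high-order code vectors for this mode.
    delta_q = list(low_codebooks[mode][low_index]) + list(high_codebooks[mode][high_index])
    if len(delta_q) != len(lsp1q):
        raise ValueError("codebook dimensions do not match the LSP order")
    # Equation (1) rearranged: LSP2 = LSP1q - delta.
    return [xq - d for xq, d in zip(lsp1q, delta_q)]


# Hypothetical mode-3 codebooks shared by encoder and decoder (toy dimensions).
low_cb = {3: [[0.0, 0.0], [0.01, -0.01]]}
high_cb = {3: [[0.0, 0.0], [0.03, 0.02]]}
lsp2q = decode_lsp2([0.10, 0.20, 0.50, 0.70], 1, 0, 3, low_cb, high_cb)
```

If the encoder used equation (2) instead, the last line of the function would add `d` rather than subtract it.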

[0050] The difference between PARCOR coefficients (also called K parameters or reflection coefficients) may be quantized instead of the difference between LSP coefficients. This case differs from the embodiment described above only in that the difference between PARCOR coefficients is quantized; everything else is the same, so its description is omitted.

[0051] To verify the performance of the quantizer of the present invention (the vector quantizer 22 shown in FIG. 2), the quantization error (conversion error) obtained when LSP2_i is quantized with the quantizer of the present invention is compared with the quantization error obtained when, in place of the quantizer 22 of the present invention, a previously proposed 24-bit two-split, two-stage vector quantizer (hereinafter called the reference quantizer) is used.

[0052] The reference quantizer is known to have characteristics equivalent to 30-bit scalar quantization and thus to offer high quantization efficiency. As the measure of quantizer performance, the LPC cepstrum distance (hereinafter called CD) between the LSP coefficients before quantization and the LSP coefficients after quantization is used. CD is a distance measure in the frequency domain and is defined by equation (4).

[0053]

[Equation 1]

[0054] Here, C_x(i) and C_y(i) are the LPC cepstrum coefficients obtained from the LSP coefficients before quantization and after quantization, respectively, and p = 30. A smaller CD value means a smaller quantization error.
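Equation (4) itself is not reproduced in this text. As a hedged illustration only, the sketch below uses a commonly cited form of the LPC cepstral distance in decibels, CD = (10 / ln 10) · sqrt(2 · Σ_{i=1..p} (C_x(i) − C_y(i))²); the exact form used in the patent may differ. The function name `cepstral_distance_db` and the toy coefficient values are assumptions, and the conversion from LSP coefficients to cepstrum coefficients is outside the scope of the sketch.

```python
import math
from typing import Sequence


def cepstral_distance_db(c_x: Sequence[float], c_y: Sequence[float]) -> float:
    """Cepstral distance in dB over the first p = len(c_x) cepstrum coefficients.

    Assumed form: (10 / ln 10) * sqrt(2 * sum((c_x[i] - c_y[i]) ** 2)).
    """
    if len(c_x) != len(c_y):
        raise ValueError("cepstrum vectors must have the same length")
    squared_sum = sum((a - b) ** 2 for a, b in zip(c_x, c_y))
    return (10.0 / math.log(10.0)) * math.sqrt(2.0 * squared_sum)


# Hypothetical cepstrum coefficients; the text uses p = 30 terms.
cd_same = cepstral_distance_db([0.5, -0.2, 0.1], [0.5, -0.2, 0.1])
cd_diff = cepstral_distance_db([1.0, 0.0], [0.0, 0.0])
```

Identical coefficient sets give a distance of zero, and the distance grows with the squared difference of the coefficients, which matches the statement that a smaller CD means a smaller quantization error.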

[0055] The evaluation used 40 short sentences (about 5 minutes) that were not used for training the codebooks. The sampling frequency was 8 kHz. FIG. 6 shows the CD characteristics for voiced portions of the speech signal S2, FIG. 7 for unvoiced portions, FIG. 8 for portions changing from voiced to unvoiced, and FIG. 9 for portions changing from unvoiced to voiced.

[0056] In every mode shown in FIGS. 6 to 9, the quantizer of the present invention outperforms the reference quantizer while using fewer bits. In particular, in the voiced mode shown in FIG. 6, the quantizer of the present invention achieves performance equivalent to that of the 24-bit reference quantizer with only 14 bits.

[0057] It was also found that the voiced-to-unvoiced mode shown in FIG. 8 and the unvoiced-to-voiced mode shown in FIG. 9 require a codebook 2 bits larger than that of the voiced mode.

[0058] This is thought to be because the variance of the LSP differences is larger in the voiced-to-unvoiced and unvoiced-to-voiced modes than in the voiced mode.

[0059] As described above, the vector quantization device of this embodiment can perform vector quantization far more efficiently than the conventional vector quantization device.

[0060]

[Effects of the Invention] As described above, the vector quantization device of the present invention exploits the nature of speech to vector-quantize the LSP coefficients efficiently, and thereby has the effect of enabling efficient vector quantization of a speech signal.

[Brief description of the drawings]

FIG. 1 is a diagram showing the principle of the present invention.

FIG. 2 is a block diagram of a vector quantization device according to an embodiment of the present invention.

FIG. 3 is a diagram for explaining the relationship between LPC analysis and frames.

FIG. 4 is a diagram showing histograms of the LSP differences in voiced speech.

FIG. 5 is a diagram showing histograms of the LSP differences in portions changing from voiced to unvoiced.

FIG. 6 is a diagram showing the CD characteristics in voiced portions.

FIG. 7 is a diagram showing the CD characteristics in unvoiced portions.

FIG. 8 is a diagram showing the CD characteristics in portions changing from voiced to unvoiced.

FIG. 9 is a diagram showing the CD characteristics in portions changing from unvoiced to voiced.

FIG. 10 is a block diagram of a conventional vector quantization device.

[Explanation of symbols]

21 Analysis means
22 Vector quantizer
23 Inverse quantizer
24 Difference detection means
26 Determination means
50 Selection means
51 to 53 First to K-th difference vector quantizers
S2 Input speech signal
x_i First LSP coefficient
y_i Second LSP coefficient
CODE1 Vector-quantized value of the first LSP coefficient x_i
x_qi Dequantized value of the vector-quantized value CODE1
Δ_i Difference vector
m0 to mK Mode information
CODE2 Quantized value of the difference vector Δ_i

Claims

[Claims]

1. A vector quantization device that analyzes an input speech signal to extract linear prediction coefficients and quantizes and transmits them, comprising: analysis means for obtaining first LSP coefficients x_i and second LSP coefficients y_i, the order i taking multiple values, by performing linear prediction analysis on the input speech signal at different analysis window positions within the same frame; a vector quantizer that vector-quantizes the first LSP coefficients x_i; an inverse quantizer that inversely quantizes the output quantized value of the vector quantizer; difference detection means for obtaining a difference vector consisting of the differences, order by order, between the dequantized values output by the inverse quantizer and the second LSP coefficients y_i; determination means for determining a plurality of properties of the input speech signal and obtaining mode information corresponding to the determined property; first to K-th difference vector quantizers corresponding to the plurality of properties of the input speech signal; and selection means for selecting, from the first to K-th difference vector quantizers, the one corresponding to the mode information output from the determination means and supplying the difference vector to the selected difference vector quantizer, wherein the difference vector quantized by the first to K-th difference vector quantizers is transmitted.

2. The vector quantization device, wherein the order i is tenth order, taking the values 1, …, 10.

3. The vector quantization device according to claim 1 or 2, wherein the mode information corresponds to a voiced portion, an unvoiced portion, an unvoiced-to-voiced transition portion, and a voiced-to-unvoiced transition portion of the input speech signal.

4. The vector quantization device according to claim 1, further comprising vector division means for dividing the difference vector into a low-order-group difference vector and a high-order-group difference vector, wherein the first to K-th difference vector quantizers comprise first to K-th low-order-group difference vector quantizers that quantize the low-order-group difference vector and first to K-th high-order-group difference vector quantizers that quantize the high-order-group difference vector, the selection means selects among the low-order-group and high-order-group first to K-th difference vector quantizers according to the mode information, and the difference vectors quantized by the selected low-order-group and high-order-group difference vector quantizers are transmitted.

5. The vector quantization device according to claim 1, wherein PARCOR coefficients are used in place of the LSP coefficients.