JPH0981196A

JPH0981196A - Vector quantizing device

Info

Publication number: JPH0981196A
Application number: JP7230856A
Authority: JP
Inventors: Tadashi Yonezaki; 崎正米
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 1995-09-08
Filing date: 1995-09-08
Publication date: 1997-03-28
Anticipated expiration: 2015-09-08
Also published as: JP3091828B2

Abstract

PROBLEM TO BE SOLVED: To reduce the distortion of synthesized voice in voice coding. SOLUTION: On the decoding side of the vector quantizing device, a distribution coding book 109 which stores the distribution of the vectors clustered to the reproduced vector is installed in addition to a coding book 108 which stores reproduced vectors. From the correlation between this distribution and the decoded vectors determined by a maximum likelihood detector 106 in the previous frame, the likelihood as the decoded vector at each point of the vector space having the vector reproduced by a decoder 104 as the center, is calculated by a likelihood calculator 105, and the point where the likelihood is judged to be maximum by a maximum likelihood detector 106 is regarded as the decoded vector.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、ディジタル電話やディ
ジタル録音器で必要な音声情報圧縮において使用される
ベクトル量子化装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a vector quantizer used in voice information compression required in digital telephones and digital recorders.

【０００２】[0002]

【従来の技術】近年、データを伝送または蓄積する媒体
が有限であることから、高圧縮率かつ高音声品質を可能
とする音声情報圧縮技術の実現が望まれている。特に、
複数のサンプルからなるベクトルを単位として量子化す
るベクトル量子化は、高能率圧縮の要素技術として用い
られており、歪みの少ないベクトル量子化装置の実現が
望まれている。2. Description of the Related Art In recent years, since the medium for transmitting or accumulating data is limited, it is desired to realize a voice information compression technique which enables a high compression rate and a high voice quality. Especially,
Vector quantization, which quantizes a vector consisting of a plurality of samples as a unit, is used as an element technology of high-efficiency compression, and realization of a vector quantizer with less distortion is desired.

【０００３】以下、従来のベクトル量子化装置につい
て、図６に示すブロック図を用いて説明する。図６にお
いて、４０１は音声を分析し、分析パラメータを抽出す
る音声分析器である。４０２は符号器で、抽出されたパ
ラメータ群（以下、ベクトルと記す）に最も近いベクト
ルを符号帳４０３から選択し、その符号番号を伝送路４
０７へ送出する。４０３は符号帳で、予め、学習サンプ
ル音声を分析して求めたベクトルが格納されている。以
上が符号化側の構成である。次に復号化側の構成を示
す。４０４は復号器で、伝送路４０７から伝送されてき
た符号から符号帳４０６を用いて、再生ベクトルを求め
る。４０５は音声合成器で、復号された再生ベクトルを
用いて音声合成を行う。４０６は復号化側の符号帳で、
符号化側の符号帳に対応した再生ベクトルが格納されて
いる。４０７は符号化側の装置と復号化側の装置を結ぶ
伝送路である。A conventional vector quantizer will be described below with reference to the block diagram shown in FIG. In FIG. 6, reference numeral 401 is a voice analyzer that analyzes voice and extracts analysis parameters. An encoder 402 selects a vector closest to the extracted parameter group (hereinafter, referred to as a vector) from the codebook 403, and the code number thereof is set to the transmission line 4
07. A codebook 403 stores vectors obtained by analyzing learning sample voices in advance. The above is the configuration on the encoding side. Next, the configuration on the decoding side is shown. A decoder 404 obtains a reproduction vector from the code transmitted from the transmission line 407 using a codebook 406. A voice synthesizer 405 performs voice synthesis using the decoded reproduction vector. 406 is a codebook on the decoding side,
A reproduction vector corresponding to the codebook on the encoding side is stored. Reference numeral 407 is a transmission path connecting the device on the encoding side and the device on the decoding side.

【０００４】以上のように構成されたベクトル量子化装
置について、以下、その動作について説明する。まず、
音声分析器４０１で音声を分析し、分析モデルに応じて
ベクトルが求められる。このベクトルは、符号器４０２
でベクトル量子化される。ベクトル量子化に際しては、
予め、学習サンプル音声を分析して得られた学習ベクト
ルを統計処理して求められる符号ベクトルで構成される
符号帳４０３を用いて、符号化時に分析されたベクトル
が最も近い符号ベクトルに量子化し、その符号番号を伝
送路４０７へ送出する。以上が符号化側の動作である。
復号化側では、復号器４０４が伝送路４０７を経て得た
符号番号の符号ベクトルを符号帳４０６から再生ベクト
ルとして選択する。符号帳４０６は、符号帳４０３と同
様に、予め学習サンプル音声を用いて求められる再生ベ
クトルで構成したものである。このようにして得られた
再生ベクトルを用いて音声合成器４０５で音声が合成さ
れる。The operation of the vector quantizer having the above configuration will be described below. First,
The voice analyzer 401 analyzes the voice and obtains a vector according to the analysis model. This vector is the encoder 402
Is vector quantized by. In vector quantization,
In advance, using a codebook 403 composed of a code vector obtained by statistically processing the learning vector obtained by analyzing the learning sample speech, the vector analyzed at the time of encoding is quantized to the closest code vector, The code number is transmitted to the transmission line 407. The above is the operation on the encoding side.
On the decoding side, the decoder 404 selects the code vector of the code number obtained via the transmission path 407 from the code book 406 as the reproduction vector. Like the codebook 403, the codebook 406 is composed of reproduction vectors obtained in advance using learning sample voices. The speech synthesizer 405 synthesizes speech using the reproduction vector thus obtained.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、上記の
ような従来のベクトル量子化装置では、分析により得ら
れたベクトルと再生ベクトルの間に生じる誤差を考慮し
ていないため、量子化歪みが生じてしまうという問題を
有していた。However, since the conventional vector quantizer as described above does not consider the error between the vector obtained by the analysis and the reproduction vector, quantization distortion occurs. It had the problem of being lost.

【０００６】本発明は、上記従来の問題を解決するもの
で、学習サンプル音声を分析して得られる学習ベクトル
を用いて作成される符号帳として、再生ベクトルだけで
なくその再生ベクトルにクラスタリングされる学習ベク
トルの分布を持つことにより、ベクトル量子化歪みを小
さくすることのできるベクトル量子装置を提供すること
を目的とする。The present invention solves the above-mentioned conventional problem. As a codebook created using a learning vector obtained by analyzing a learning sample speech, not only the reproduction vector but also the reproduction vector is clustered. An object of the present invention is to provide a vector quantum device capable of reducing vector quantization distortion by having a distribution of learning vectors.

【０００７】[0007]

【課題を解決するための手段】上記の目的を達成するた
めに、本発明の第１の構成は、音声符号化における量子
化装置において、符号化側に、音声を分析してパラメー
タを抽出する音声分析器と、分析されたパラメータをベ
クトル量子化する符号器と、ベクトル量子化をするため
の符号ベクトルを格納している符号帳を備え、復号化側
に、符号化側の符号帳に対応した再生ベクトルを格納し
ている符号帳と、伝送路から得た符号から符号帳を用い
て再生ベクトルを求める復号器と、再生ベクトルにクラ
スタリングされるベクトルの分布を格納している分布符
号帳と、再生ベクトルと分布符号帳を用いて求めたベク
トルの生起確率分布および前フレームで得られた復号ベ
クトルとの相関から、パラメータ空間内の尤度関数を算
出する尤度算出器と、算出された尤度を最大とするベク
トルを復号ベクトルとして求める最大尤度検出器と、得
られた復号ベクトルを用いて音声を合成する音声合成器
とを備えたものである。In order to achieve the above object, the first structure of the present invention is, in a quantizing device for speech coding, analyzes speech on the coding side to extract parameters. Equipped with a speech analyzer, an encoder that vector-quantizes the analyzed parameters, and a codebook that stores the code vector for vector quantization.The decoding side corresponds to the coding-side codebook. A codebook that stores the reproduced vector, a decoder that obtains the reproduced vector from the code obtained from the transmission line using the codebook, and a distributed codebook that stores the distribution of the vectors clustered in the reproduced vector. , A likelihood calculator that calculates a likelihood function in the parameter space from the correlation between the reproduction vector and the occurrence probability distribution of the vector obtained using the distribution codebook and the decoded vector obtained in the previous frame It is obtained by a speech synthesizer for synthesizing speech by using a maximum likelihood detector for determining a vector that maximizes the calculated likelihood as a decoded vector, the resulting decoded vector.

【０００８】本発明の第２の構成は、上記第１の構成に
加え、復号化側に、伝送路の符号誤りを検出する誤り検
出器と、検出された誤りに応じて尤度算出器で算出され
る尤度に重み付けをする重み切替器とを備えたものであ
る。A second structure of the present invention is, in addition to the above-mentioned first structure, an error detector for detecting a code error of a transmission line on the decoding side, and a likelihood calculator according to the detected error. And a weight switching device for weighting the calculated likelihood.

【０００９】本発明の第３の構成は、上記第１の構成に
加え、復号化側に、伝送路の符号誤りを検出する誤り検
出器と、検出された誤りに応じて分布符号帳から得られ
るベクトルの生起確率分布を調整する分布調整器とを備
えたものである。A third configuration of the present invention is, in addition to the above first configuration, obtained on the decoding side from an error detector for detecting a code error in a transmission line and a distributed codebook according to the detected error. And a distribution adjuster that adjusts the occurrence probability distribution of the generated vector.

【００１０】[0010]

【作用】本発明は、上記第１の構成により、復号ベクト
ルを求める際に再生ベクトルに対する復号ベクトルの生
起確率分布と前フレームからの連続性を考慮することに
より、聴感上の量子化歪みを小さくすることができる。
また上記第２の構成により、伝送路誤りに応じて尤度を
算出する際に、再生ベクトルまたは前フレームとの相関
の尤度に対する重み付けを切り替えることにより、伝送
路での符号誤り時の合成音声品質の劣化を低減すること
ができる。また、本発明の第３の構成により、伝送路誤
りに応じて、再生ベクトルの分布を変化させることによ
り、誤った再生ベクトルで音声を合成することを避ける
ことができ、伝送路での符号誤り時の合成音声品質の劣
化を低減することができる。According to the first aspect of the present invention, when the decoded vector is obtained, the occurrence probability distribution of the decoded vector with respect to the reproduction vector and the continuity from the previous frame are taken into consideration, thereby reducing the quantization distortion in the auditory sense. can do.
Further, according to the second configuration, when the likelihood is calculated according to the transmission path error, the weighting for the likelihood of the correlation with the reproduction vector or the previous frame is switched, so that the synthesized speech at the time of the code error on the transmission path is obtained. It is possible to reduce deterioration of quality. Further, according to the third configuration of the present invention, by changing the distribution of the reproduction vector according to the transmission path error, it is possible to avoid synthesizing the voice with an incorrect reproduction vector, and it is possible to avoid the code error in the transmission path. It is possible to reduce the deterioration of the synthesized voice quality.

【００１１】[0011]

【Example】

（実施例１）以下、本発明のベクトル量子化装置の実施
例について、図面を参照しながら説明する。図１は本発
明の第１の実施例におけるベクトル量子化装置のブロッ
ク図を示すものである。図１において、１０１は音声を
分析してパラメータを抽出する音声分析器である。１０
２は符号器で、分析されたパラメータをベクトル量子化
する。１０３は符号帳で、ベクトル量子化をするための
符号ベクトルを格納している。以上が符号化側の構成で
ある。次に復号化側の構成について説明する。１０４は
復号器で、伝送路１１０から得た符号から再生ベクトル
を求める。１０５は尤度算出器で、得られた再生ベクト
ルと復号ベクトルの生起確率分布および前フレームで得
られた復号ベクトルから、ベクトル空間上の全ての点に
対して尤度を算出する。１０６は最大尤度検出器で、算
出された尤度を最大とするベクトルを復号ベクトルとし
て求める。１０７は音声合成器で、このようにして得ら
れた復号ベクトルを用いて音声合成を行う。１０８は符
号帳で、符号化側の符号帳１０３に対応した再生ベクト
ルを格納している。１０９は分布符号帳で、符号帳１０
８の再生ベクトルにクラスタリングされる学習ベクトル
の分布が格納されている。１１０は符号化側の装置と復
号化側の装置を結ぶ伝送路である。(Embodiment 1) Hereinafter, an embodiment of the vector quantization apparatus of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram of a vector quantizer according to the first embodiment of the present invention. In FIG. 1, reference numeral 101 is a voice analyzer that analyzes voice and extracts parameters. 10
Reference numeral 2 is an encoder which vector-quantizes the analyzed parameters. A codebook 103 stores a code vector for vector quantization. The above is the configuration on the encoding side. Next, the configuration on the decoding side will be described. A decoder 104 obtains a reproduction vector from the code obtained from the transmission line 110. A likelihood calculator 105 calculates likelihoods for all points on the vector space from the occurrence probability distribution of the obtained reproduction vector and decoded vector and the decoded vector obtained in the previous frame. Reference numeral 106 denotes a maximum likelihood detector, which obtains a vector that maximizes the calculated likelihood as a decoded vector. Reference numeral 107 denotes a voice synthesizer, which performs voice synthesis using the decoded vector thus obtained. A codebook 108 stores reproduction vectors corresponding to the codebook 103 on the encoding side. 109 is a distributed codebook, which is the codebook 10.
The distribution of learning vectors clustered into eight reproduction vectors is stored. Reference numeral 110 is a transmission line connecting the device on the encoding side and the device on the decoding side.

【００１２】図２はこれらの符号帳について、２次元の
ベクトル空間上での例を示す。Ｌ_i（ｉ＝１，…，４）
の４つの学習ベクトルが同じクラスタにクラスタリング
されているとき、ベクトルが各軸で独立に正規分布する
と仮定すると、ｘ軸に対して分散は１／２｛１²＋（−
１）²｝＝１、ｙ軸に対して分散は１／２｛２²＋（−
２）²｝＝４となる。したがって、再生ベクトルとして
平均（０，０）が符号帳１０８に、それに対応する分布
として（１，４）が分布符号帳１０９に格納される。こ
のような分布符号帳１０９を用いることで、ｘ、ｙは独
立なので、再生ベクトル（０，０）に対して復号ベクト
ルの生起確率分布ｐ（ｘ，ｙ）が次式で求められる。FIG. 2 shows an example of these codebooks in a two-dimensional vector space. L _i (i = 1, ..., 4)
Assuming that the vectors are normally distributed independently in each axis when the four learning vectors of are clustered in the same cluster, the variance is 1/2 {1 ² + (-
1) ² } = 1, the variance with respect to the y-axis is 1/2 {2 ² + (-
2) ² } = 4. Therefore, the average (0,0) is stored in the codebook 108 as the reproduction vector, and (1,4) is stored in the distribution codebook 109 as the corresponding distribution. By using such distribution codebook 109, since x and y are independent, the occurrence probability distribution p (x, y) of the decoding vector with respect to the reproduction vector (0, 0) is obtained by the following equation.

【数１】また、このようなモデル化をせず、符号帳作成時に学習
ベクトルの分布をテーブルとして蓄積し、生起確率分布
としてもよい。[Equation 1] Instead of such modeling, the distribution of learning vectors may be accumulated as a table at the time of creating the codebook and used as the occurrence probability distribution.

【００１３】以上のように構成されたベクトル量子化装
置について、以下その動作を説明する。まず、音声分析
器１０１によって分析されたベクトルは、符号帳１０３
を用いて符号器１０２で符号化される。この符号を伝送
路１１０を通じて得た復号器１０４は、符号帳１０８を
用いて再生ベクトルを生成する。尤度算出器１０５で
は、再生ベクトルに対応する生起確率分布を分布符号帳
１０９を用いて取得し、これと、前フレームの復号ベク
トルから次式を用いてベクトル空間上の点ｖに対して、
次式で尤度ｌ（ｖ）を求める。The operation of the vector quantizer having the above configuration will be described below. First, the vector analyzed by the speech analyzer 101 is the codebook 103
Is encoded by the encoder 102. The decoder 104 which has obtained this code through the transmission line 110 generates a reproduction vector using the codebook 108. The likelihood calculator 105 acquires the occurrence probability distribution corresponding to the reproduction vector using the distribution codebook 109, and from this and the decoded vector of the previous frame, using the following equation, for the point v on the vector space,
The likelihood l (v) is calculated by the following formula.

【数２】このようにして算出された尤度から、最大尤度検出器１
０６で尤度が最大となるベクトルを復号ベクトルとして
復号する。以上の復号処理により得られた復号ベクトル
を用いて音声合成器１０７で音声を合成する。[Equation 2] From the likelihood calculated in this way, the maximum likelihood detector 1
The vector having the maximum likelihood at 06 is decoded as a decoding vector. The speech synthesizer 107 synthesizes speech using the decoding vector obtained by the above decoding process.

【００１４】このように、本実施例によれば、フレーム
間のつながりを考慮し、聴感上の量子化歪みを低減させ
ることが可能となり、効率良いベクトル量子化が実現で
きる。As described above, according to this embodiment, it is possible to reduce the audible quantization distortion in consideration of the connection between frames, and it is possible to realize efficient vector quantization.

【００１５】（実施例２）図３は本発明の第２の実施例
におけるベクトル量子化装置のブロック図を示すもので
ある。図３において、２０１は音声を分析してパラメー
タを抽出する音声分析器である。２０２は符号器で、分
析されたパラメータをベクトル量子化する。２０３は符
号帳で、ベクトル量子化をするための符号ベクトルを格
納している。以上が符号化側の構成である。次に復号化
側の構成について説明する。２０４は復号器で、伝送路
２１２から得た符号から再生ベクトルを求める。２０５
は尤度算出器で、得られた再生ベクトルと分布符号帳２
０９から得られる生起確率分布および前フレームで得ら
れた復号ベクトルから、重み切替器２１１による尤度重
みを考慮して、ベクトル空間上の点に対して尤度を算出
する。２０６は最大尤度検出器で、算出された尤度を最
大とするベクトルを復号ベクトルとして求める。２０７
は音声合成器で、このようにして得られた復号ベクトル
を用いて復号音声を合成する。２０８は符号帳で、符号
化側の符号帳２０３に対応した再生ベクトルを格納して
いる。２０９は分布符号帳で、符号帳２０８の再生ベク
トルに対するクラスタリングされる学習ベクトルの分布
が格納されている。２１０は誤り検出器で、伝送路２１
２での符号誤りを検出する。２１１は重み切替器で、検
出された誤りに応じて尤度計算器２０５で算出される尤
度に重み付けをする。２１２は符号化側の装置と復号化
側の装置を結ぶ伝送路である。(Embodiment 2) FIG. 3 is a block diagram of a vector quantizer according to a second embodiment of the present invention. In FIG. 3, 201 is a voice analyzer for analyzing voice and extracting parameters. Reference numeral 202 denotes an encoder that vector-quantizes the analyzed parameters. A codebook 203 stores a code vector for vector quantization. The above is the configuration on the encoding side. Next, the configuration on the decoding side will be described. A decoder 204 obtains a reproduction vector from the code obtained from the transmission line 212. 205
Is a likelihood calculator, and the obtained reproduction vector and distribution codebook 2
09, the likelihood is calculated for the point on the vector space from the occurrence probability distribution obtained from No. 09 and the decoded vector obtained in the previous frame, considering the likelihood weight by the weight switching unit 211. A maximum likelihood detector 206 obtains a vector that maximizes the calculated likelihood as a decoded vector. 207
Is a speech synthesizer, which synthesizes the decoded speech by using the decoded vector thus obtained. Reference numeral 208 denotes a codebook, which stores reproduction vectors corresponding to the codebook 203 on the encoding side. A distribution codebook 209 stores the distribution of learning vectors to be clustered with respect to the reproduction vector of the codebook 208. Reference numeral 210 denotes an error detector, which is a transmission line 21.
The code error at 2 is detected. A weight switching unit 211 weights the likelihood calculated by the likelihood calculator 205 according to the detected error. Reference numeral 212 is a transmission line connecting the device on the encoding side and the device on the decoding side.

【００１６】以上のように構成されたベクトル量子化装
置について、以下その動作を説明する。まず、実施例１
と同様、音声分析器２０１によって求められたパラメー
タは、符号器２０２で符号帳２０３を用いて符号化され
る。この符号を伝送路２１２を通じて受けとった復号器
２０４は、符号帳２０８を用いて再生ベクトルを求め
る。尤度算出器２０５では、再生ベクトルと復号ベクト
ルの生起確率分布および前フレームで復号された復号ベ
クトルから（２）式を用いて尤度を算出する。このと
き、（２）式のパラメータαは、重み切替器２１１によ
って決定する。最大尤度検出器２０６では、算出された
尤度が最大となるベクトルを復号ベクトルとして求め、
この復号ベクトルを用いて音声合成器２０７で音声を合
成する。一方、誤り検出器２１０で伝送路２１２での符
号誤りが検出されたとき、重み切替器２１１はαを１に
近付け、尤度算出において前フレームの復号ベクトルと
の相関に重みを付け、誤って再生された再生ベクトルの
尤度への影響を小さくする。The operation of the vector quantizer having the above configuration will be described below. First, Example 1
Similarly to, the parameters obtained by the voice analyzer 201 are encoded by the encoder 202 using the codebook 203. The decoder 204 receiving this code through the transmission path 212 obtains a reproduction vector using the codebook 208. The likelihood calculator 205 calculates the likelihood using the equation (2) from the occurrence probability distribution of the reproduction vector and the decoded vector and the decoded vector decoded in the previous frame. At this time, the parameter α of the equation (2) is determined by the weight switching unit 211. The maximum likelihood detector 206 obtains a vector having the maximum calculated likelihood as a decoded vector,
The speech synthesizer 207 synthesizes speech using this decoded vector. On the other hand, when the error detector 210 detects a code error in the transmission line 212, the weight switching unit 211 brings α close to 1, weights the correlation with the decoded vector of the previous frame in likelihood calculation, The influence of the reproduced reproduction vector on the likelihood is reduced.

【００１７】このように、本実施例によれば、伝送路で
符号誤りが生じた時に誤りを考慮して最尤復号すること
が可能となり、誤りによる合成音声品質の劣化を低減す
ることができる。As described above, according to this embodiment, when a code error occurs in the transmission path, it is possible to perform maximum likelihood decoding in consideration of the error, and it is possible to reduce the deterioration of the synthesized voice quality due to the error. .

【００１８】（実施例３）図４は本発明の第３の実施例
におけるベクトル量子化装置のブロック図を示すもので
ある。図４において、３０１は音声を分析してパラメー
タを抽出する音声分析器である。３０２は符号器で、分
析されたパラメータをベクトル量子化する。３０３は符
号帳で、ベクトル量子化をするための符号ベクトルを格
納している。以上が符号化側の構成である。次に復号化
側の構成について説明する。３０４は復号器で、伝送路
３１２から得た符号から再生ベクトルを求める。３０５
は尤度算出器で、得られた再生ベクトルと分布調整器３
１１で得られる生起確率分布および前フレームで得られ
た復号ベクトルから、ベクトル空間上の点に対して尤度
を算出する。３０６は最大尤度検出器で、算出された尤
度を最大とするベクトルを復号ベクトルとして求める。
３０７は音声合成器で、このようにして得られた復号ベ
クトルを用いて復号音声を合成する。３０８は符号帳
で、符号化側の符号帳３０３に対応した再生ベクトルを
格納している。３０９は分布符号帳で、符号帳３０８の
再生ベクトルにクラスタリングされる学習ベクトルの分
布が格納されている。３１０は誤り検出器で、伝送路３
１２での符号誤りを検出する。３１１は分布調整器で、
誤りが検出された際に分布符号帳３０９から得られる生
起確率分布を変化させる。３１２は符号化側の装置と復
号化側の装置を結ぶ伝送路である。(Embodiment 3) FIG. 4 is a block diagram of a vector quantizer according to a third embodiment of the present invention. In FIG. 4, reference numeral 301 is a voice analyzer that analyzes voice and extracts parameters. An encoder 302 vector-quantizes the analyzed parameters. A codebook 303 stores code vectors for vector quantization. The above is the configuration on the encoding side. Next, the configuration on the decoding side will be described. A decoder 304 obtains a reproduction vector from the code obtained from the transmission line 312. 305
Is a likelihood calculator, and the obtained reproduction vector and distribution adjuster 3
The likelihood is calculated for a point on the vector space from the occurrence probability distribution obtained in 11 and the decoded vector obtained in the previous frame. A maximum likelihood detector 306 obtains a vector that maximizes the calculated likelihood as a decoded vector.
A speech synthesizer 307 synthesizes decoded speech using the decoded vector thus obtained. A codebook 308 stores a reproduction vector corresponding to the codebook 303 on the coding side. A distribution codebook 309 stores the distribution of learning vectors clustered with the reproduction vector of the codebook 308. Reference numeral 310 denotes an error detector, which is a transmission line 3
A code error at 12 is detected. 311 is a distribution adjuster,
The occurrence probability distribution obtained from the distribution codebook 309 when an error is detected is changed. Reference numeral 312 is a transmission line connecting the device on the encoding side and the device on the decoding side.

【００１９】以上のように構成されたベクトル量子化装
置について、以下その動作を説明する。まず、実施例１
と同様、音声分析器３０１によって求められたパラメー
タは、符号器３０２で符号帳３０３を用いて符号化され
る。この符号を伝送路３１２を通じて受けとった復号器
３０４は、符号帳３０８を用いて再生ベクトルを求め
る。尤度算出器３０５では、再生ベクトルと分布調整器
３１１から得られた生起確率分布および前フレームで復
号された復号ベクトルから（２）式を用いて尤度を算出
する。最大尤度検出器３０６では、算出された尤度が最
大となるベクトルを復号ベクトルとして求め、この復号
ベクトルを用いて音声合成器３０７で音声を合成する。
一方、誤り検出器３１０で伝送路３１２での符号誤りが
検出されたとき、分布調整器３１１は、分布符号帳３０
９で得られる分布を調整する。The operation of the vector quantizer having the above configuration will be described below. First, Example 1
Similarly to, the parameters obtained by the speech analyzer 301 are encoded by the encoder 302 using the codebook 303. The decoder 304 that has received this code through the transmission line 312 obtains a reproduction vector using the codebook 308. The likelihood calculator 305 calculates the likelihood from the reproduction vector, the occurrence probability distribution obtained from the distribution adjuster 311, and the decoded vector decoded in the previous frame using the equation (2). The maximum likelihood detector 306 obtains a vector having the maximum calculated likelihood as a decoded vector, and the speech synthesizer 307 synthesizes speech using this decoded vector.
On the other hand, when the error detector 310 detects a code error in the transmission line 312, the distribution adjuster 311 causes the distribution codebook 30
Adjust the distribution obtained in 9.

【００２０】ここで、分布調整器３１１における生起確
率分布の調整例について、図５を用いて説明する。ただ
し、図５では１次元ベクトル空間上での例である。図５
において、伝送路誤りがないときの生起確率は、符号帳
中の隣接ベクトルとの距離の２等分点を境に０となる。
これは、２等分点を境に距離が近い方のベクトル量子化
されるために、復号ベクトルは２等分点より再生ベクト
ルに近い点に存在していることを示している。一方、誤
りが検出された時、隣接ベクトルとの２等分点に関係な
く復号ベクトルが存在する可能性があるので、図のよう
に２等分点以遠のベクトルも存在するような生起確率分
布を与える。Here, an example of adjusting the occurrence probability distribution in the distribution adjuster 311 will be described with reference to FIG. However, FIG. 5 shows an example on a one-dimensional vector space. FIG.
In, the occurrence probability when there is no transmission path error is 0 at the bisector of the distance from the adjacent vector in the codebook.
This indicates that the vector is quantized at a distance closer to the bisector, so that the decoded vector exists at a point closer to the reproduction vector than the bisector. On the other hand, when an error is detected, there is a possibility that a decoding vector exists regardless of the bisector of the adjacent vector. give.

【００２１】このように、本実施例によれば、実施例２
と同様に、伝送路で符号誤りが生じた時に誤りを考慮し
て最尤復号することが可能となり、誤りによる合成音声
品質の劣化を低減することができる。As described above, according to this embodiment, the second embodiment
Similarly, when a code error occurs in the transmission path, it is possible to perform maximum likelihood decoding in consideration of the error, and it is possible to reduce deterioration of synthesized speech quality due to the error.

【００２２】[0022]

【発明の効果】以上のように、本発明によれば、ベクト
ル量子化された符号の復号に際し、各符号ベクトルに対
してどのような確率で復号ベクトルが分布しているかと
いうことと、前フレームからの相関を考慮することによ
り、歪みを低減することができる。また、伝送路で符号
誤りが生じたときに算出される尤度を調整することによ
り、合成音声品質の劣化を低減することができる。As described above, according to the present invention, at the time of decoding a vector-quantized code, the probability that the decoded vector is distributed to each code vector, and the previous frame The distortion can be reduced by considering the correlation from Also, by adjusting the likelihood calculated when a code error occurs in the transmission path, it is possible to reduce the deterioration of the synthesized voice quality.

[Brief description of drawings]

【図１】実施例１におけるベクトル量子化装置のブロッ
ク図FIG. 1 is a block diagram of a vector quantization device according to a first embodiment.

【図２】各実施例における復号側の符号帳構成法の模式
図FIG. 2 is a schematic diagram of a decoding-side codebook configuration method in each embodiment.

【図３】実施例２におけるベクトル量子化装置のブロッ
ク図FIG. 3 is a block diagram of a vector quantization device according to a second embodiment.

【図４】実施例３におけるベクトル量子化装置のブロッ
ク図FIG. 4 is a block diagram of a vector quantization device according to a third embodiment.

【図５】実施例３における確率分布の調整法の模式図FIG. 5 is a schematic diagram of a probability distribution adjustment method according to the third embodiment.

【図６】従来例におけるベクトル量子化装置のブロック
図FIG. 6 is a block diagram of a vector quantization device in a conventional example.

[Explanation of symbols]

１０１、２０１、３０１音声分析器１０２、２０２、３０２符号器１０３、２０３、３０３符号帳１０４、２０４、３０４復号器１０５、２０５、３０５尤度算出器１０６、２０６、３０６最大尤度検出器１０７、２０７、３０７音声合成器１０８、２０８、３０８符号帳１０９、２０９、３０９分布符号帳１１０、２１２、３１２伝送路２１０、３１０誤り検出器２１１重み切替器３１１分布調整器 101, 201, 301 Speech analyzer 102, 202, 302 Encoder 103, 203, 303 Codebook 104, 204, 304 Decoder 105, 205, 305 Likelihood calculator 106, 206, 306 Maximum likelihood detector 107, 207, 307 Speech synthesizer 108, 208, 308 Codebook 109, 209, 309 Distribution codebook 110, 212, 312 Transmission line 210, 310 Error detector 211 Weight switcher 311 Distribution adjuster

─────────────────────────────────────────────────────
─────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成７年１０月１６日[Submission date] October 16, 1995

【手続補正１】[Procedure amendment 1]

【補正対象書類名】明細書[Document name to be amended] Statement

【補正対象項目名】図６[Correction target item name] Fig. 6

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図６】 FIG. 6

Claims

[Claims]

1. A vector quantizer in speech coding, wherein a speech analyzer for analyzing speech to extract parameters, a coder for vector quantizing the analyzed parameters, and a vector quantizer on a coding side. A codebook that stores a code vector for performing the decoding is provided, and on the decoding side, the codebook that stores the reproduction vector corresponding to the coding side codebook and the codebook obtained from the code obtained from the transmission path. , A distribution codebook that stores the distribution of the vectors clustered into the reproduction vector, the occurrence probability distribution of the vector obtained using the reproduction vector and the distribution codebook, and the previous frame From the correlation with the obtained decoding vector, the likelihood calculator that calculates the likelihood function in the parameter space and the vector that maximizes the calculated likelihood as the decoding vector. A vector quantizer comprising a maximum likelihood detector to be obtained and a speech synthesizer for synthesizing speech using the obtained decoded vector.

2. The decoding side is provided with an error detector that detects a code error in a transmission path, and a weight switcher that weights the likelihood calculated by the likelihood calculator according to the detected error. The vector quantizing device according to claim 1.

3. The decoding side is provided with an error detector that detects a code error in a transmission line, and a distribution adjuster that adjusts a probability distribution of occurrence of a vector obtained from a distributed codebook according to the detected error. The vector quantizing device according to claim 1.