JPH05173596A

JPH05173596A - Code excitation linear predicting and encoding method

Info

Publication number: JPH05173596A
Application number: JP3341177A
Authority: JP
Inventors: Hiromi Aoyanagi; 弘美青柳; Kenichiro Hosoda; 賢一郎細田; Hiroshi Katsuragawa; 浩桂川; Yoshihiro Ariyama; 義博有山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-12-24
Filing date: 1991-12-24
Publication date: 1993-07-13
Anticipated expiration: 2013-07-09
Also published as: JP2774003B2

Abstract

PURPOSE:To obtain superior synthesized speech quality even at a low bit rate by generating a specific impulse response according to a vocal cord parameter and performing convolution processing for a statistical excitation vector, and then adding it to an adaptive excitation vector and generating an excitation vector. CONSTITUTION:The statistical excitation vector esl is converted and corrected before an adder 115 adds the adaptive excitation vector eai outputted from an adaptive excitation code book 107 and the statistical excitation vector esl outputted from a statistical excitation code book 108 to generate the excitation vector (e). For the conversion correction, a code vector converting circuit 109 performs the convolution processing for impulse response showing the short-time characteristics (vocal cord parameter) or/and long-time characteristics (pitch lag) of an input speech vector. Then the frequency characteristics of the input speech vector S are included in the statistical excitation vector esl and low-bit encoding is performed to improve the synthesized speech quality.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、コード励振線形予測符
号化装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a code excitation linear predictive coding apparatus.

【０００２】[0002]

【従来の技術】過去の原音声ベクトルに対する最適な励
振ベクトルを蓄積した適応励振コードブックと、予め定
められた励振ベクトルを蓄積した統計励振コードブック
とを備えたコード励振線形予測符号化装置は、例えば、
次記文献で開示されている。文献名：Ｎ．Ｓ．ＪａｙａｎｔａｎｄＪ．Ｈ．Ｃｈ
ｅｎ，“ＳｐｅｅｃｈＣｏｄｉｎｇｗｉｔｈＴｉｍ
ｅ−ＶａｒｙｉｎｇＢｉｔＡｌｌｏｃａｔｉｏｎｓ
ｔｏＥｘｃｉｔａｔｉｏｎａｎｄＬＰＣＰａ
ｒａｍｅｔｅｒｓ”，Ｐｒｏｃ．ＩＣＡＳＳＰ，ｐｐ６
５−６８，１９８９．この文献では、音声合成のための残差信号が、周期性の
ある信号と、雑音性の信号で構成されていると考え、統
計励振コードブックを雑音性の信号に対応させ、適応励
振コードブックを周期性の信号に対応させ、原音声ベク
トルに対する最適な励振ベクトルを選択決定するため
に、そこから読みだした複数個の適応励振ベクトルと複
数個の統計励振ベクトルとに基づいて複数個の励振ベク
トルを作成する。2. Description of the Related Art A code-excited linear predictive coding apparatus provided with an adaptive excitation codebook accumulating optimum excitation vectors for past original speech vectors and a statistical excitation codebook accumulating predetermined excitation vectors, For example,
It is disclosed in the following documents. Reference name: N. S. Jayant and J. H. Ch
en, "SpeechCoding with Tim
e-Varying Bit Allocations
to Excitation and LPC Pa
parameters ”, Proc. ICASSP, pp6
5-68,1989. In this paper, it is considered that the residual signal for speech synthesis is composed of a periodic signal and a noisy signal, and the statistical excitation codebook is made to correspond to the noisy signal. Corresponding to the periodic signal, and in order to select and determine the optimum excitation vector for the original speech vector, a plurality of excitation vectors are read based on the plurality of adaptive excitation vectors and the plurality of statistical excitation vectors read therefrom. Create a vector.

【０００３】[0003]

【発明が解決しようとする課題】上記文献においては、
統計励振ベクトルは予め定められており、低ビットレー
ト化に伴いそのベクトル数を減少させると、固定のベク
トルでは対応しきれなくなり、合成音声品質が劣化して
しまうという問題があった。従って、本発明は、低ビッ
トレートにおいても優れた合成音声品質が得られるコー
ド励振線形予測符号化装置を提供することを目的とす
る。SUMMARY OF THE INVENTION In the above documents,
The statistical excitation vector is predetermined, and if the number of vectors is reduced as the bit rate is reduced, there is a problem that fixed vectors cannot be used and the synthesized speech quality deteriorates. Therefore, it is an object of the present invention to provide a code-excited linear predictive coding apparatus that can obtain excellent synthesized speech quality even at a low bit rate.

【０００４】[0004]

【課題を解決するための手段】本発明は、量子化された
声道パラメータと励振ベクトルとに基づいて合成音声ベ
クトルを作成し、入力音声ベクトルとこの合成音声ベク
トルとの誤差を評価することによって、現時刻の入力音
声ベクトルに最適な励振ベクトルに対応した適応及び統
計励振コードブックのインデックスを決定し、声道パラ
メータと両インデックスとをトータルコードとして出力
するコード励振線形予測符号化装置に関するものであ
る。そして、声道パラメータに基づいて特定のインパル
ス応答を作成し、且つ、当該インパルス応答を用いて、
統計励振ベクトルに、畳み込み処理を施した後、適応励
振ベクトルと加算して励振ベクトルを作成する手段を備
えている。第１の発明では、このインパルス応答は、声
道パラメータを用いて表現された伝達関数のインパルス
応答に対応するものであり、例えば、次の式（１）で示
す伝達関数Ｈ（Ｚ）のインパルス応答に対応するもので
ある。According to the present invention, a synthetic speech vector is created on the basis of a quantized vocal tract parameter and an excitation vector, and an error between the input speech vector and this synthetic speech vector is evaluated. , A code-excited linear predictive coding device that determines the index of the adaptive and statistical excitation codebook corresponding to the excitation vector most suitable for the input speech vector at the current time, and outputs the vocal tract parameter and both indexes as a total code. is there. Then, create a specific impulse response based on the vocal tract parameters, and using the impulse response,
The statistical excitation vector is provided with a unit for performing a convolution process and then adding the adaptive excitation vector to create the excitation vector. In the first invention, the impulse response corresponds to the impulse response of the transfer function expressed using the vocal tract parameter, and for example, the impulse of the transfer function H (Z) shown in the following equation (1). It corresponds to the response.

【０００５】[0005]

【数１】 [Equation 1]

【０００６】ただし、Ａ、Ｂは、０＜Ａ＜Ｂ＜１の定
数、ａ_qjはＬＰＣ逆量子化器１０４の出力、ｐは声道分
析次数である。However, A and B are constants of 0 <A <B <1, a _qj is an output of the LPC dequantizer 104, and p is a vocal tract analysis order.

【０００７】また、第２の発明においては、このインパ
ルス応答は、次の式で示す伝達関数Ｈ（Ｚ）のインパル
ス応答に対応するものである。Ｈ（Ｚ）＝１／（１−εＺ^-L）（２）ただし、εは０＜ε≦１なる定数、Ｌは適応励振ベクト
ルのインデックスから計算したピッチラグである。In the second aspect of the invention, the impulse response corresponds to the impulse response of the transfer function H (Z) expressed by the following equation. H (Z) = 1 / (1-εZ ^−L ) (2) where ε is a constant 0 <ε ≦ 1, and L is a pitch lag calculated from the index of the adaptive excitation vector.

【０００８】[0008]

【作用】本発明では、適応励振ベクトルと統計励振ベク
トルとを加算して励振ベクトルを作成する前に、統計励
振ベクトルを変換修正しておく。変換修正は、入力音声
ベクトルの短時間的な性質（声道パラメータ）もしくは
長時間的な性質（ピッチラグ）を表わすインパルス応答
の、又は双方を表わすインパルス応答の畳み込み処理に
よって行う。この畳み込み処理は、ｅｓｌを統計励振コ
ードブックの出力統計励振ベクトル、ｅｓｃｌを変換後
の統計励振ベクトル、ｈをインパルス応答として、次の
式（３）で示すことができる。ｅｓｃｌ＝ｅｓｌ＊ｈ（３）ただし、ｅｓｃｌ＝［ｘ₀、ｘ₁、、、ｘ_n-1］、ｅｓ
ｌ＝［ｙ₀、ｙ₁、、、ｙ_n-1］、ｈ＝［ｈ₀、
ｈ₁、、、ｈ_n-1］、［］は列ベクトル、ｘ、ｙ、ｈ
はそれぞれの要素、ｎはサブフレーム長（又はフレーム
長）である。In the present invention, the statistical excitation vector is converted and corrected before the adaptive excitation vector and the statistical excitation vector are added to create the excitation vector. The conversion correction is performed by convolving the impulse response representing the short-term property (vocal tract parameter) or the long-term property (pitch lag) of the input speech vector, or the impulse response representing both. This convolution process can be expressed by the following equation (3), where esl is the output statistical excitation vector of the statistical excitation codebook, escl is the converted statistical excitation vector, and h is the impulse response. escl = esl * h (3) where escl = [x ₀ , x _1, ..., x _n-1 ], es
l = [y ₀ , y _1, ..., y _n-1 ], h = [h ₀ ,
h _1, ..., H _n-1 ], [] are column vectors, x, y, h
Is each element, and n is the subframe length (or frame length).

【０００９】インパルス応答は、入力音声ベクトルの短
時間的な性質を用いる第１の発明の場合は、声道パラメ
ータを用いて表現した伝達関数のインパルス応答であ
り、長時間的な性質を用いる第２の発明の場合は、ピッ
チラグを用いて表現した伝達関数のインパルス応答であ
る。The impulse response is the impulse response of the transfer function expressed by using the vocal tract parameter in the case of the first invention which uses the short-term property of the input speech vector, and the first response which uses the long-term property. In the case of the invention of No. 2, it is the impulse response of the transfer function expressed using the pitch lag.

【００１０】理論的には、入力音声ベクトルの周波数特
性は声道パラメータ及びピッチラグが担い、残差信号は
周波数的にはフラットな性質を持つはずであるが、実際
にはそのときの（逆フィルタを通した場合の）残差信号
に周波数的な性質が残る。本発明はこれを利用（あるい
は逆に利用）して、統計励振ベクトルに入力音声ベクト
ルの周波数的特性を加味させるようにして、合成音声品
質を向上させたものである。Theoretically, the frequency characteristic of the input speech vector should be borne by the vocal tract parameter and the pitch lag, and the residual signal should have a flat characteristic in frequency. The residual signal (when passed through) has frequency characteristics. The present invention utilizes this (or vice versa) to improve the synthesized speech quality by adding the frequency characteristic of the input speech vector to the statistical excitation vector.

【００１１】[0011]

【実施例】図１は、本発明のコード励振線形予測符号化
器の第１実施例を示すブロック図である。図１におい
て、端子１０１よりフレーム単位にまとめられて、ベク
トルとして入力される入力音声ベクトルＳは、まず声道
分析回路１０２に入力され、声道予測パラメータａｊが
計算される。声道分析回路１０２は、声道予測パラメー
タａｊをＬＰＣ量子化器１０３に送出する。ＬＰＣ量子
化器１０３は、声道予測パラメータａｊを量子化し、そ
のコードＩｃ（ＬＰＣコード）をＬＰＣ逆量子化器１０
４、多重化回路１０６に送出する。ＬＰＣ逆量子化器１
０４は、ＬＰＣコードＩｃを声道予測パラメータａｑｊ
に逆変換して合成フィルタ１０５に送出する。次に、適
応励振コードブック１０７は適応励振ベクトルｅａｉ
（ｉ＝１〜ｎ）、統計励振コードブック１０８は統計励
振ベクトルｅｓｌ（ｌ＝１〜ｍ）、ＶＱゲインコードブ
ック１１０は励振ゲインβｋ、γｋ（ｋ＝１〜ｐ）を各
々出力する。1 is a block diagram showing a first embodiment of a code-excited linear predictive encoder according to the present invention. In FIG. 1, an input voice vector S, which is collected as a vector from a terminal 101 by a frame unit, is first input to a vocal tract analysis circuit 102 to calculate a vocal tract prediction parameter aj. The vocal tract analysis circuit 102 sends the vocal tract prediction parameter aj to the LPC quantizer 103. The LPC quantizer 103 quantizes the vocal tract prediction parameter aj, and its code Ic (LPC code) is quantized by the LPC dequantizer 10.
4. Send to the multiplexing circuit 106. LPC inverse quantizer 1
04 uses the LPC code Ic as the vocal tract prediction parameter aqj
To the synthesis filter 105. Next, the adaptive excitation codebook 107 uses the adaptive excitation vector eai.
(I = 1 to n), the statistical excitation codebook 108 outputs the statistical excitation vector esl (l = 1 to m), and the VQ gain codebook 110 outputs the excitation gains βk and γk (k = 1 to p), respectively.

【００１２】コードベクトル変換回路１０９は、次の式
（４）で示すフィルタ伝達関数Ｈ（Ｚ）のインパルス応
答を用いてコードブック１０８からの統計励振ベクトル
ｅｓｌを畳み込み、修正統計励振ベクトルｅｓｃｌを計
算する。The code vector conversion circuit 109 convolves the statistical excitation vector esl from the codebook 108 with the impulse response of the filter transfer function H (Z) represented by the following equation (4) to calculate the modified statistical excitation vector escl. To do.

【００１３】[0013]

【数２】 [Equation 2]

【００１４】但し、ａ_qjはＬＰＣ逆量子化器１０４の出
力、ｐは声道分析次数である。適応励振ベクトルｅａｉ
は乗算器１１３によりゲインβｋが乗ぜられベクトルｅ
ａｉｋとなり、修正統計励振ベクトルｅｓｃｌは乗算器
１１４によりゲインγｋが乗ぜられベクトルｅｓｃｌｋ
となる。加算器１１５は、ベクトルｅａｉｋとベクトル
ｅｓｃｌｋの成分単位の加算を行い励振ベクトルｅを計
算する。合成フィルタ１０５は、励振ベクトルｅに対す
る合成音声ベクトルＳｗを計算し、減算器１１６に送出
する。減算器１１６は、合成音声ベクトルＳｗと入力音
声ベクトルＳの成分単位の減算を行い、誤差ベクトルｅ
ｒを知覚フィルタ１１１に送出する。Here, a _qj is the output of the LPC dequantizer 104, and p is the vocal tract analysis order. Adaptive excitation vector eai
Is multiplied by the gain βk by the multiplier 113, and the vector e
aik, and the modified statistical excitation vector escl is multiplied by the gain γk by the multiplier 114 to obtain the vector esclk.
Becomes The adder 115 calculates the excitation vector e by adding the vector eaik and the vector esclk in component units. The synthesis filter 105 calculates a synthesized speech vector Sw for the excitation vector e and sends it to the subtractor 116. The subtractor 116 subtracts the synthesized speech vector Sw and the input speech vector S in component units to obtain an error vector e.
Send r to the perceptual filter 111.

【００１５】知覚フィルタ１１１は誤差ベクトルｅｒに
対する知覚誤差ベクトルｅｗを知覚誤差計算回路１１２
に送出する。知覚誤差計算回路１１２は、知覚誤差ベク
トルｅｗの各成分の２乗平均を計算し、この値が最小と
なる励振ベクトル（即ち、ｉ、ｌ、ｋの組み合わせ）を
現時刻の入力音声ベクトルの最適な励振ベクトルとして
決定する。そして、その時の各コードブックのインデッ
クスＩａ、Ｉｓ、Ｉｇを、適応励振コードブック１０
７、統計励振コードブック１０８、ＶＱゲインコードブ
ック１１０、多重化回路１０６に送出する。The perceptual filter 111 calculates the perceptual error vector ew for the error vector er from the perceptual error calculation circuit 112.
To send to. The perceptual error calculation circuit 112 calculates the root mean square of each component of the perceptual error vector ew, and determines the excitation vector (that is, the combination of i, l, k) that minimizes this value as the optimum input speech vector at the current time. It is determined as an exciting vector. Then, the indices Ia, Is, and Ig of the respective codebooks at that time are set to the adaptive excitation codebook 10
7, statistical excitation codebook 108, VQ gain codebook 110, and multiplexing circuit 106.

【００１６】適応励振コードブック１０７は、インデッ
クスＩａにより最適な適応励振ベクトルｅａｏ、統計励
振コードブック１０７は、インデックスＩｓにより最適
な統計励振ベクトルｅｓｏ、ＶＱゲインコードブック１
１０は、インデックスＩｇにより最適なＶＱゲインβ
ｏ、γｏを各々出力する。コードベクトル変換回路１０
９は、最適な統計励振ベクトルｅｓｏを最適な修正統計
励振ベクトルｅｓｃｏに変換する。これらｅａｏ、ｅｓ
ｃｏ、βｏ、γｏにより構成される最適な励振ベクトル
ｅｏｐｔは適応励振コードブック１０７に入力され、適
応励振コードブック１０７の内容が更新される。多重化
回路１０６は、Ｉｃ、Ｉａ、Ｉｓ、Ｉｇをトータルコー
ドＣとして出力端子１１３により受信側に伝送する。The adaptive excitation codebook 107 is the optimal adaptive excitation vector eao according to the index Ia, and the statistical excitation codebook 107 is the optimal statistical excitation vector eso according to the index Is, and the VQ gain codebook 1
10 is the optimum VQ gain β according to the index Ig
It outputs o and γo, respectively. Code vector conversion circuit 10
9 transforms the optimum statistical excitation vector eso into the optimum modified statistical excitation vector esco. These eao, es
The optimum excitation vector eopt composed of co, βo, and γo is input to the adaptive excitation codebook 107, and the contents of the adaptive excitation codebook 107 are updated. The multiplexing circuit 106 transmits Ic, Ia, Is, and Ig as the total code C to the receiving side through the output terminal 113.

【００１７】図２は、図１の符号化に対応したコード励
振線形予測復号化器のブロック図を示す。図２におい
て、入力端子２０１より入力されたトータルコードＣは
多重分離回路２１２によりＬＰＣコードＩｃ、適応励振
コードインデックスＩａ、統計励振コードインデックス
Ｉｓ、ＶＱゲインコードインデックスＩｇに分離され、
各々ＬＰＣ逆量子化器２０２、適応励振コードブック２
０４、統計励振コードブック２０５、ＶＱゲインコード
ブック２０７に送出される。ＬＰＣ逆量子化器２０２
は、ＬＰＣコードＩｃを声道予測パラメータａｊに変換
して合成フィルタ２０３に送出する。適応励振コードブ
ック２０４はインデックスＩａに相当する適応励振ベク
トルｅａ、統計励振コードブック２０６はインデックス
Ｉｓに相当する統計励振ベクトルｅｓ、ＶＱゲインコー
ドブック２０７はインデックスＩｇに相当する励振ゲイ
ンβ、γを各々出力する。FIG. 2 shows a block diagram of a code-excited linear predictive decoder corresponding to the encoding of FIG. In FIG. 2, the total code C input from the input terminal 201 is separated by the demultiplexing circuit 212 into an LPC code Ic, an adaptive excitation code index Ia, a statistical excitation code index Is, and a VQ gain code index Ig,
LPC dequantizer 202 and adaptive excitation codebook 2 respectively
04, statistical excitation codebook 205, and VQ gain codebook 207. LPC dequantizer 202
Converts the LPC code Ic into a vocal tract prediction parameter aj and sends it to the synthesis filter 203. The adaptive excitation codebook 204 is the adaptive excitation vector ea corresponding to the index Ia, the statistical excitation codebook 206 is the statistical excitation vector es corresponding to the index Is, and the VQ gain codebook 207 is the excitation gain β and γ corresponding to the index Ig, respectively. Output.

【００１８】コードベクトル変換回路２０６は、第１の
実施例の符号化器と同様にしてベクトルｅｓをベクトル
ｅｓｃに変換する。ｅａ、ｅｓｃ、β、γにより励振ベ
クトルｅが構成される。合成フィルタ２０３は励振ベク
トルｅに対する合成音声ベクトルＳを計算し、出力端子
２１１より出力する。適応励振コードブック２０４はベ
クトルｅによりその内容が更新される。The code vector conversion circuit 206 converts the vector es into the vector esc in the same manner as the encoder of the first embodiment. The excitation vector e is composed of ea, esc, β, and γ. The synthesis filter 203 calculates a synthesized speech vector S for the excitation vector e and outputs it from the output terminal 211. The contents of the adaptive excitation codebook 204 are updated by the vector e.

【００１９】図１を使い、本発明のコード励振線形予測
符号化器の第２実施例について説明する。コードベクト
ル変換回路１０９以外は、全て第１実施例の符号化器と
同じであるので、以下にコードベクトル変換回路１０９
の動作を説明する。コードベクトル変換回路１０９は、
以下に示すフィルタのインパルス応答を用いてベクトル
ｅｓｌを畳み込み、ベクトルｅｓｃｌを計算する。Ｈ（Ｚ）＝１／（１−εＺ^-L）（５）但し、ε＝１．０、Ｌは適応励振コードのインデックス
から求めたピッチラグである。なお、シフトタイプの適
応コードブック（特開平０１−５４４９７号公報参照）
では、適応励振コードのインデックスと、ピッチラグ
は、たとえば以下の様に１対１対応している。適応励振コードインデックス０１２・・・ｉ・・・ ↓ ↓ ↓ ↓ ピッチラグ２０２１２２・・２０＋ｉ・・・A second embodiment of the code-excited linear predictive encoder according to the present invention will be described with reference to FIG. The code vector conversion circuit 109 is the same as the encoder of the first embodiment except for the code vector conversion circuit 109.
The operation of will be described. The code vector conversion circuit 109
The vector esl is convolved with the impulse response of the filter shown below to calculate the vector escl. H (Z) = 1 / (1-εZ ^−L ) (5) where ε = 1.0 and L is the pitch lag obtained from the index of the adaptive excitation code. A shift type adaptive codebook (see Japanese Patent Laid-Open No. 01-54497)
Then, the index of the adaptive excitation code and the pitch lag are in one-to-one correspondence with each other as follows, for example. Adaptive excitation code index 0 1 2 ・・・ i ・・・ ↓ ↓ ↓ ↓ ↓ Pitch lag 20 21 22 ... 20 + i ...

【００２０】[0020]

【発明の効果】統計励振ベクトルを符号化時刻毎に変換
するため、少ないベクトル数でも入力音声ベクトルによ
く適合した励振ベクトルが得られ、優れた合成音声品質
が得られる。これは、低ビットレート化に有効である。Since the statistical excitation vector is converted for each coding time, an excitation vector that is well suited to the input speech vector can be obtained even with a small number of vectors, and excellent synthesized speech quality can be obtained. This is effective for lowering the bit rate.

[Brief description of drawings]

【図１】図１は本発明に係るコード励振線形予測符号化
装置のブロック図。FIG. 1 is a block diagram of a code-excited linear predictive coding apparatus according to the present invention.

【図２】図２は図１に対応したコード励振線形予測復号
化装置のブロック図。FIG. 2 is a block diagram of a code-excited linear predictive decoding apparatus corresponding to FIG.

[Explanation of symbols]

１０２声道分析回路１０３ＬＰＣ量子化器１０４ＬＰＣ逆量子化器１０５合成フィルタ１０６多重化回路１０７適応励振コードブック１０８統計励振コードブック１０９コードベクトル変換回路１１０ＶＱゲインコードブック１１１知覚フィルタ１１２知覚誤差計算回路１１３乗算器１１４乗算器１１５加算器Ｃトータルコードｅａｉ適応励振候補ベクトルｅｓｌ統計励振候補ベクトルＩａ適応励振コードインデックスＩｃＬＰＣコードＩｇゲインコードインデックスＩｓ統計励振コードインデックスＳ入力音声ベクトルＳｗ合成音声ベクトル βｋ適応励振候補ゲイン γｋ統計励振候補ゲイン 102 vocal tract analysis circuit 103 LPC quantizer 104 LPC inverse quantizer 105 synthesis filter 106 multiplexing circuit 107 adaptive excitation codebook 108 statistical excitation codebook 109 code vector conversion circuit 110 VQ gain codebook 111 perceptual filter 112 perceptual error calculation Circuit 113 Multiplier 114 Multiplier 115 Adder C Total code eai Adaptive excitation candidate vector esl Statistical excitation candidate vector Ia Adaptive excitation code index Ic LPC code Ig Gain code index Is Statistical excitation code index S Input speech vector Sw Synthetic speech vector βk Adaptive Excitation candidate gain γk Statistical excitation candidate gain

───────────────────────────────────────────────────── フロントページの続き (72)発明者有山義博東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Yoshihiro Ariyama 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd.

Claims

[Claims]

1. A means for obtaining a vocal tract parameter by linear predictive analysis of an input speech vector, and a voice for quantizing the vocal tract parameter and outputting a quantized vocal tract parameter and a vocal tract parameter code corresponding thereto. A road parameter quantizer, an adaptive excitation codebook that stores an adaptive excitation vector that represents the optimal excitation vector of past input speech vectors, and a statistical excitation vector that is a predetermined excitation vector. A statistical excitation codebook; and a means for creating an excitation vector by multiplying each of the adaptive excitation vector and the statistical excitation vector by an excitation gain and adding the excitation gain, and the quantized vocal tract parameter and the excitation vector Create a synthesized speech vector based on the above, and evaluate the error between the input speech vector and the synthesized speech vector. By determining the index of both excitation codebooks corresponding to the excitation vector optimum to the input speech vector at the current time, the code excitation linear prediction code that outputs the vocal tract parameter and both indexes as a total code In the digitizing device, a specific impulse response is created based on the vocal tract parameter, and, using the impulse response, the statistical excitation vector is subjected to convolution processing, and then the adaptive excitation vector is added to the statistical excitation vector. A code excitation linear predictive coding method, comprising means for creating an excitation vector, wherein the impulse response corresponds to an impulse response of a transfer function expressed using the vocal tract parameter.

2. A means for obtaining a vocal tract parameter by linear predictive analysis of an input speech vector, and a voice which quantizes the vocal tract parameter and outputs a quantized vocal tract parameter and a vocal tract parameter code corresponding thereto. A road parameter quantizer, an adaptive excitation codebook that stores an adaptive excitation vector that represents an optimal excitation vector of past input speech vectors, and a statistical excitation vector that is a predetermined excitation vector. A statistical excitation codebook; and a means for creating an excitation vector by multiplying each of the adaptive excitation vector and the statistical excitation vector by an excitation gain and adding the excitation gain, and the quantized vocal tract parameter and the excitation vector Create a synthesized speech vector based on the above, and evaluate the error between the input speech vector and the synthesized speech vector. By determining the index of both excitation codebooks corresponding to the excitation vector optimum for the input speech vector at the current time, the code excitation linear prediction code that outputs the vocal tract parameter and both indexes as a total code In the digitizing device, a specific impulse response is created based on the vocal tract parameter, and, using the impulse response, the statistical excitation vector is subjected to convolution processing, and then added to the adaptive excitation vector. The impulse response is a transfer function H (Z) represented by the following equation.
A code-excited linear predictive coding device characterized by the impulse response of. H (Z) = 1 / (1-εZ ^−L ) where ε is a constant 0 <ε ≦ 1, and L is a pitch lag calculated from the index of the adaptive excitation vector.