JPH05127699A

JPH05127699A - Code excitation linear prediction and encoding system

Info

Publication number: JPH05127699A
Application number: JP3251152A
Authority: JP
Inventors: Hiromi Aoyanagi; 弘美青柳; Hiroshi Katsuragawa; 浩桂川; Yoshihiro Ariyama; 義博有山; Kenichiro Hosoda; 賢一郎細田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-09-30
Filing date: 1991-09-30
Publication date: 1993-05-25
Anticipated expiration: 2015-04-10
Also published as: JP3031765B2

Abstract

PURPOSE:To obtain sufficient synthesized speech quality even when a transmission bit rate is low. CONSTITUTION:With Ia, Is, and Ig supplied from an auditory error calculating circuit 316, an adaptive excitation code book 305 outputs and supplies an optimum code vector eai to a multiplier 308, and a gain code book 306 outputs a pair of optimum excitation gains gak, gsk and supplies the gak to a multiplier 308 and the gsk to a multiplier 310. The multiplier 308 supplies a multiplication vector eaik to an adder 309. The multiplier 310 supplies a multiplication vector eslk to the adder 309. The adder 309 supplies the added excitation vector (e) to a multiplier 312, which multiplies it by an excitation gain sigma and supplies the excitation vector eg to a composing filter 313. The composing filter 313 filters the excitation vector eg with the LSP coefficient obtained by an LSP inverse quantizer 303 to obtain a synthesized speech.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、コード励振線形予測
符号化方式に関し、例えば音声信号などの高品質圧縮符
号化方式として適応し得るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a code-excited linear predictive coding system, which can be applied as a high-quality compression coding system for voice signals.

【０００２】[0002]

【従来の技術】従来例のコード励振線形予測符号化方式
は、例えば、文献１に示されている。文献１：『1989,Proc.IEEE Int.Conf.on Acoustics,Spe
ech and Signal Processing,PP.65-pp.68,N.S.Jayant a
nd J.H.Chen,"Speech Coding with Time-Varying Bit A
llocations to Excitation and LPC Parameters"』図２は、一例のコード励振線形予測符号化器の機能ブロ
ック図である。2. Description of the Related Art A conventional code-excited linear predictive coding system is disclosed in, for example, Document 1. Reference 1: "1989, Proc.IEEE Int.Conf.on Acoustics, Spe
ech and Signal Processing, PP.65-pp.68, NSJayant a
nd JH Chen, "Speech Coding with Time-Varying Bit A
llocations to Excitation and LPC Parameters "] FIG. 2 is a functional block diagram of an example of a code excitation linear predictive encoder.

【０００３】図２において、入力原音声ベクトルＳは、
分析回路１０２に供給され、ここでＰＡＲＣＯＲ(PARti
al auto-CORrelation:偏自己相関)分析などを行い、量
子化短時間予測係数αjpを多重化回路１１１に供給し、
声道予測係数αjqを短時間フィルタ１０３に供給する。In FIG. 2, the input original speech vector S is
It is supplied to the analysis circuit 102, where PARCOR (PARti
al auto-COR relation: partial autocorrelation) analysis, etc., and supplies the quantized short-time prediction coefficient αjp to the multiplexing circuit 111.
The vocal tract prediction coefficient αjq is supplied to the short-time filter 103.

【０００４】次に、スイッチ１０８と１１３を開き、適
応励振コードテーブル１０４は、適応励振コードベクト
ルｅa(i)（ｉ＝１〜ｎ）を供給される適応励振コードイ
ンデックスＩａに基づいて選択出力する。ゲイン制御回
路１１９は、原音声ベクトルＳと、適応励振コードベク
トルｅa(i)とから、適応励振ゲイン制御コードβapを生
成し多重化回路１１１に供給する。更に、ゲイン制御回
路１１９は、適応励振ゲイン制御信号βapを乗算器１２
２に供給して、適応励振コードベクトルβｅa(i)と乗算
してゲインの制御を行い、適応励振コードベクトルβea
(i)を加算器１１０に供給する。Next, the switches 108 and 113 are opened, and the adaptive excitation code table 104 selectively outputs the adaptive excitation code vector ea (i) (i = 1 to n) based on the supplied adaptive excitation code index Ia. .. The gain control circuit 119 generates an adaptive excitation gain control code βap from the original speech vector S and the adaptive excitation code vector ea (i) and supplies it to the multiplexing circuit 111. Further, the gain control circuit 119 outputs the adaptive excitation gain control signal βap to the multiplier 12
2 to the adaptive excitation code vector βea (i) to control the gain to obtain the adaptive excitation code vector βea (i).
(i) is supplied to the adder 110.

【０００５】加算器１１０は、適応励振コードベクトル
βea(i)をそのまま励振ベクトルｅ(i）として、短時間
フィルタに供給し、合成音声ベクトルＳｗを減算器１０
９に供給する。減算器１０９は、原音声ベクトルＳと合
成音声ベクトルＳｗの成分単位の減算を行い、誤差ベク
トルｅr(i)を聴覚フィルタ１０５に供給する。聴覚フィ
ルタ１０５は、フィルタリングを行いベクトルｅw(i)を
聴覚誤差計算回路１０６に供給する。聴覚誤差計算回路
１０６は、ベクトルｅw(i)の各成分単位の二乗平均ｇｉ
を計算し、このｇｉが最小となるｉを最適な適応励振コ
ードのインデックスＩａとして、適応励振コードテーブ
ル１０４と多重化回路１１１に供給する。次には、ス
イッチ１１３を閉じて、確率的励振コードテーブル１０
７から供給される確率的励振コードベクトルｅs(i)に対
して、ゲイン制御回路１２３で生成した確率的励振ゲイ
ン制御信号βsqを乗算器１２６で乗算してゲイン制御
し、確率的励振コードベクトルβｅs(i)を加算器１１０
に供給する。この時にゲイン制御回路１２３は、確率的
励振ゲイン制御コードβspを多重化回路１１１に供給す
る。加算器１１０は確率的励振コードベクトルβｅs(i)
と適応励振コードベクトルβeaの加算を行い、励振ベク
トルｅ（ｉ）を前述と同様に短時間フィルタ１０３に供
給して、以下同様な処理の方法で減算器１０９と、聴覚
フィルタ１０５と、聴覚誤差計算回路１０６で処理して
確率的励振コードインデックスＩｓを求めて、多重化回
路１１１と、確率的励振コードテーブル１０７に供給す
る。The adder 110 supplies the adaptive excitation code vector βea (i) as the excitation vector e (i) to the filter for a short time as it is, and the synthesized speech vector Sw is subtracted from the subtractor 10.
Supply to 9. The subtractor 109 subtracts the original speech vector S and the synthesized speech vector Sw in component units, and supplies the error vector er (i) to the auditory filter 105. The auditory filter 105 performs filtering and supplies the vector ew (i) to the auditory error calculation circuit 106. The auditory error calculation circuit 106 calculates the root mean square gi of each component unit of the vector ew (i).
Is calculated, and i with which gi is minimized is supplied to the adaptive excitation code table 104 and the multiplexing circuit 111 as the index Ia of the optimum adaptive excitation code. Next, the switch 113 is closed and the stochastic excitation code table 10
7, the stochastic excitation code vector es (i) is multiplied by the stochastic excitation gain control signal βsq generated by the gain control circuit 123 in the multiplier 126 to perform gain control, and the stochastic excitation code vector βes (i) is the adder 110
Supply to. At this time, the gain control circuit 123 supplies the stochastic excitation gain control code βsp to the multiplexing circuit 111. The adder 110 uses the stochastic excitation code vector βes (i)
And the adaptive excitation code vector βea are added, and the excitation vector e (i) is supplied to the short-time filter 103 in the same manner as described above, and thereafter, the subtracter 109, the auditory filter 105, and the auditory error are processed by the same processing method. The stochastic excitation code index Is is obtained by processing in the calculation circuit 106 and supplied to the multiplexing circuit 111 and the stochastic excitation code table 107.

【０００６】次に、スイッチ１０８と１１３を閉じて、
励振ベクトルｅ（ｉ）によって適応励振コードブック１
０４の内容を更新する。以上のようにして得られた確率
的励振コードインデックスＩｓと適応励振コードインデ
ックスＩａと確率的励振ゲイン制御コードβspと適応励
振ゲイン制御コードβapと量子化短時間予測係数αjpと
を多重化してトータルコードＣを出力していた。Next, the switches 108 and 113 are closed,
Adaptive excitation codebook 1 by excitation vector e (i)
The contents of 04 are updated. The stochastic excitation code index Is, the adaptive excitation code index Ia, the stochastic excitation gain control code βsp, the adaptive excitation gain control code βap, and the quantized short-time prediction coefficient αjp, which are obtained as described above, are multiplexed to obtain a total code. It was outputting C.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、以上の
従来の方式においては、入力音声信号に対するＰＡＲＣ
ＯＲ分析での量子化ビット数が少なくなると、再生した
合成音声に歪みなどが生じて、十分なＳ／Ｎなどが得ら
れないなど良好な音声合成品質が得られないという問題
があった。従って、量子化ビット数を少なくすることが
できず、従って伝送ビットレートを低速にするための障
害となっていた。However, in the above conventional method, the PARC for the input audio signal is used.
When the number of quantized bits in the OR analysis becomes small, the reproduced synthesized speech is distorted and the like, and there is a problem that a good speech synthesis quality cannot be obtained such that sufficient S / N cannot be obtained. Therefore, the number of quantization bits cannot be reduced, which is an obstacle to reducing the transmission bit rate.

【０００８】これは、ＰＡＲＣＯＲ分析の場合に、例え
ば、情報量を減らすために、フレーム間隔を例えば２０
msecまでに延ばし、その間に各パラメータの変わり方を
スムーズにするために、例えば、２．５msecごとに線形
補間を行う。しかしながら、この線形補間がＰＡＲＣＯ
Ｒ係数に必ずしも適応せず、そのために再生した合成音
声のスペクトルに歪みが増大して、音質が劣化してい
た。これは、ＰＡＲＣＯＲ係数が単一の明確な物理量に
対応せず、複雑な複合パラメータになっていることによ
るとも考えられる。This is because, in the case of PARCOR analysis, for example, in order to reduce the amount of information, the frame interval is set to 20.
For example, linear interpolation is performed every 2.5 msec in order to extend the time to msec and smooth the change of each parameter during that period. However, this linear interpolation is
This does not necessarily apply to the R coefficient, and as a result, distortion is increased in the spectrum of the reproduced synthesized voice, resulting in deterioration of sound quality. It can be considered that this is because the PARCOR coefficient does not correspond to a single definite physical quantity and is a complex compound parameter.

【０００９】また、適応励振コードブックと確率的励振
コードブックに対するゲインをそれぞれ独立にスカラー
量子化した係数でゲイン制御していたので、量子化ビッ
ト数が少なくなると、歪みなどが生じて再生した合成音
声のＳ／Ｎなどが十分に得られないため、量子化ビット
数を少なくすることができず、従って伝送ビットレート
を低速にするための障害となっていた。Further, since the gains for the adaptive excitation codebook and the stochastic excitation codebook are controlled independently by the scalar quantized coefficients, when the number of quantized bits becomes small, distortion or the like occurs, and the reproduced synthesis is performed. Since the S / N ratio of voice cannot be sufficiently obtained, the number of quantization bits cannot be reduced, which is an obstacle for reducing the transmission bit rate.

【００１０】適応励振ゲイン制御コードと、確率的励振
ゲイン制御コードを独立に生成して伝送していたので、
伝送ビットレートを高くする傾向にあり、励振源コード
ブックの種類がもっと多くなると、ますますゲイン制御
コードの種類も多くなり、伝送ビットレートを高くせざ
るを得なくなるという問題があった。Since the adaptive excitation gain control code and the stochastic excitation gain control code are independently generated and transmitted,
There is a problem that the transmission bit rate tends to be increased, and as the types of excitation source codebooks increase, the types of gain control codes also increase, and the transmission bit rate must be increased.

【００１１】現在のこのような符号化方式に対する要請
は、低い伝送ビットレートで再生した合成音声の品質が
良好に得られるということであるが、以上のような問題
によって実現が困難であった。The present demand for such an encoding method is that the quality of synthesized speech reproduced at a low transmission bit rate can be obtained satisfactorily, but it has been difficult to realize due to the above problems.

【００１２】この発明は、以上の課題に鑑み為されたも
のであり、その目的とするところは、低い伝送ビットレ
ートであっても、十分な合成音声品質が得られるコード
励振線形予測符号化方式を提供することである。The present invention has been made in view of the above problems, and an object of the present invention is to provide a code-excited linear predictive coding system capable of obtaining sufficient synthesized speech quality even at a low transmission bit rate. Is to provide.

【００１３】[0013]

【課題を解決するための手段】この発明は、以上の目的
を達成するために、複数種類の励振源情報を用いるコー
ド励振線形予測符号化方式において、以下の特徴的な手
段と方法で改良した。つまり、音声信号に対する線スペ
クトル対分析を行い、合成音声の生成に利用する線スペ
クトル対分析情報を生成する線スペクトル対分析手段
と、複数個のゲイン制御情報からなるゲイン制御情報組
を複数組格納するゲイン制御コードブックとを備えて、
最適なゲイン制御情報組中の複数個のゲイン制御情報に
よって上記複数種類の励振源情報に対してゲイン制御を
行い、ゲイン制御された複数種類の励振源情報と上記線
スペクトル対分析情報とを用いて、音声信号の符号化及
び復号化を行うことを特徴とする。In order to achieve the above object, the present invention has been improved by the following characteristic means and method in a code excitation linear predictive coding system using a plurality of types of excitation source information. .. That is, a line spectrum pair analysis means for performing line spectrum pair analysis on a voice signal to generate line spectrum pair analysis information used for generating a synthesized voice, and a plurality of sets of gain control information consisting of a plurality of gain control information are stored. With a gain control codebook
Gain control is performed on the plurality of types of excitation source information by a plurality of gain control information in the optimum gain control information set, and a plurality of types of gain-controlled excitation source information and the line spectrum pair analysis information are used. Then, the audio signal is encoded and decoded.

【００１４】[0014]

【作用】この発明によれば、入力音声信号に対して線ス
ペクトル対分析を行っているので、主要なホルマントに
対する線スペクトル対は、ホルマント周波数からのずれ
が少なくて、比較的良く対応する。従って、線形補間特
性が良く、線形補間によるスペクトル歪みが少なく音質
の劣化が少なくなる。そして、線スペクトル対の係数を
低ビット数に量子化しても、従来のＰＡＲＣＯＲ分析に
よるよりも、音質を良くすることができる。According to the present invention, since the line spectrum pair analysis is performed on the input voice signal, the line spectrum pairs for the main formants correspond relatively well with little deviation from the formant frequency. Therefore, the linear interpolation characteristic is good, the spectrum distortion due to the linear interpolation is small, and the deterioration of the sound quality is small. Even if the coefficient of the line spectrum pair is quantized to a low bit number, the sound quality can be improved as compared with the conventional PARCOR analysis.

【００１５】また、複数種類の励振源情報ごとのゲイン
情報を伝送する必要がない。つまり、各励振源情報に対
するゲイン制御情報が組に形成されているので、この組
を指定する情報を伝送するだけで良いので、伝送ビット
レートを従来に比べ低下させることができる。しかも、
ゲイン制御情報の組を予め最適なゲイン制御情報の組み
に設定しておくことによって、効率的に良好な音質を得
ることができる。Further, it is not necessary to transmit the gain information for each of the plural kinds of excitation source information. That is, since the gain control information for each excitation source information is formed in a set, only the information designating this set needs to be transmitted, so that the transmission bit rate can be reduced as compared with the conventional case. Moreover,
By setting the set of gain control information to the optimum set of gain control information in advance, it is possible to efficiently obtain good sound quality.

【００１６】[0016]

【実施例】次にこの発明に係るコード励振線形予測符号
化方式の好適な一実施例を図面を用いて説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the code-excited linear predictive coding system according to the present invention will be described with reference to the drawings.

【００１７】この実施例では、フォワード型のコード励
振線形予測符号化を行うための、コード励振線形予測符
号化器とその復号化器を例に説明する。In this embodiment, a code-excited linear predictive encoder and its decoder for performing forward code-excited linear predictive encoding will be described as an example.

【００１８】またこの実施例の目的は、低ビットレート
においても優れた合成音声品質を得られるコード励振線
形予測符号化器とその復号化器を実現することである。Another object of this embodiment is to realize a code-excited linear predictive encoder and a decoder thereof that can obtain excellent synthesized speech quality even at a low bit rate.

【００１９】この目的を実現するために、この実施例で
は、主に声道予測パラメータをＬＳＰ（ＬｉｎｅＳｐ
ｅｃｔｒｕｍＰａｉｒ）パラメータに変換して量子化
する回路と、ＬＳＰパラメータを逆量子化して補間し、
声道予測パラメータに変換する回路と、適応励振コード
ブックと確率的励振コードブックのゲインをベクトル量
子化したＶＱ（ベクトル量子化）ゲインコードブック
と、バックワード型の励振ゲイン予測回路とを設けてい
る。In order to realize this object, in this embodiment, the vocal tract prediction parameters are mainly used for LSP (Line Sp).
and a circuit for converting the signal into an quantized parameter and quantizing it, and dequantizing the LSP parameter for interpolation.
A circuit for converting into vocal tract prediction parameters, a VQ (vector quantization) gain codebook in which the gains of the adaptive excitation codebook and the stochastic excitation codebook are vector quantized, and a backward type excitation gain prediction circuit are provided. There is.

【００２０】図１は、この実施例のコード励振線形予測
符号化回路の機能ブロック図を示している。図３は、こ
の実施例のコード励振線形予測復号化器の機能ブロック
図を示している。FIG. 1 is a functional block diagram of the code excitation linear predictive coding circuit of this embodiment. FIG. 3 shows a functional block diagram of the code-excited linear predictive decoder of this embodiment.

【００２１】図１において、この実施例のコード励振線
形予測符号化回路は、声道分析回路３０１と、ＬＳＰ量
子化器３０２と、ＬＳＰ逆量子化器３０３と、多重化回
路３０４と、適応励振コードブック３０５と、ＶＱゲイ
ンコードブック３０６と、確率的励振コードブック３０
７と、乗算器３０８と、加算器３０９と、乗算器３１０
と、ゲイン制御回路３１１と、乗算器３１２と、合成フ
ィルタ３１３と、減算器３１４と、聴覚フィルタ３１５
と、聴覚誤差計算回路３１６とで構成されている。ここ
で、声道分析回路３０１と、ＬＳＰ量子化器３０２と、
ＬＳＰ逆量子化器３０３は、ＬＳＰ分析系を形成してい
る。In FIG. 1, the code excitation linear predictive coding circuit of this embodiment includes a vocal tract analysis circuit 301, an LSP quantizer 302, an LSP dequantizer 303, a multiplexing circuit 304, and an adaptive excitation. Codebook 305, VQ gain codebook 306, stochastic excitation codebook 30
7, a multiplier 308, an adder 309, and a multiplier 310
A gain control circuit 311, a multiplier 312, a synthesis filter 313, a subtractor 314, and an auditory filter 315.
And an auditory error calculation circuit 316. Here, the vocal tract analysis circuit 301, the LSP quantizer 302,
The LSP dequantizer 303 forms an LSP analysis system.

【００２２】次に図１の動作を説明する。フレーム単位
にまとめられた原音声ベクトルＳは、声道分析回路３０
１に供給され、声道予測パラメータａｊが求められる。
この声道予測パラメータａｊは、ＬＳＰ量子化器３０２
に供給され、声道予測パラメータａｊをＬＳＰパラメー
タに変換して量子化し、そのＬＳＰコードＩｗをＬＳＰ
逆量子化器３０３と多重化回路３０４に供給される。Ｌ
ＳＰ逆量子化器３０３は、コードＩｗよりＬＳＰパラメ
ータを逆量子化し、補間を行った後に声道予測パラメー
タａqjに逆変換して合成フィルタ３１３に供給する。Next, the operation of FIG. 1 will be described. The original speech vector S collected in frame units is the vocal tract analysis circuit 30.
1 and the vocal tract prediction parameter aj is obtained.
This vocal tract prediction parameter aj is calculated by the LSP quantizer 302.
, The vocal tract prediction parameter aj is converted into an LSP parameter and quantized, and the LSP code Iw is converted into an LSP parameter.
It is supplied to the inverse quantizer 303 and the multiplexing circuit 304. L
The SP dequantizer 303 dequantizes the LSP parameter from the code Iw, performs interpolation, and then inversely converts the LSP parameter into the vocal tract prediction parameter aqj, and supplies this to the synthesis filter 313.

【００２３】一方、適応励振コードブック３０５は、適
応励振コードベクトルｅａｉ（ｉ＝１〜ｎ）を乗算器３
０８に供給する。また、確率的励振コードブック３０７
は、確率的励振コードベクトルｅsl（ｌ＝１〜ｍ）を乗
算器３１０に供給する。また、ＶＱゲインコードブック
３０６は、励振ゲインｇak、ｇsk（ｋ＝１〜ｐ）の対を
出力し、励振ゲインｇakは、乗算器３０８に供給する。
また励振ゲインｇskは、乗算器３１０に供給する。そし
て、乗算器３０８は、前記適応励振コードベクトルｅai
と励振ゲインｇakを乗算してベクトルｅａｉｋを得て加
算器３０９に供給する。また、乗算器３１０は、前記確
率的励振コードベクトルｅslと励振ゲインｇskを乗算し
てベクトルｅｓｌｋを得て加算器３０９に供給する。On the other hand, the adaptive excitation codebook 305 multiplies the adaptive excitation code vector eai (i = 1 to n) by the multiplier 3
Supply to 08. Also, the stochastic excitation codebook 307
Supplies the stochastic excitation code vector esl (l = 1 to m) to the multiplier 310. Further, the VQ gain codebook 306 outputs a pair of excitation gains gak and gsk (k = 1 to p), and the excitation gain gak is supplied to the multiplier 308.
The excitation gain gsk is supplied to the multiplier 310. Then, the multiplier 308 outputs the adaptive excitation code vector eai.
Is multiplied by the excitation gain gak to obtain a vector eaik and supplied to the adder 309. The multiplier 310 multiplies the stochastic excitation code vector esl by the excitation gain gsk to obtain a vector eslk, and supplies the vector eslk to the adder 309.

【００２４】加算器３０９は、ベクトルｅａｉｋとベク
トルｅｓｌｋの成分単位の加算を行い励振ベクトルｅを
求めて乗算器３１２に供給する。ゲイン制御回路３１１
は、過去の最適励振より励振ゲインσを求めて乗算器３
１２に供給する。乗算器３１２は、前記励振ベクトルｅ
と励振ゲインσを乗算してベクトルｅｇを得て合成フィ
ルタ３１３に供給する。The adder 309 adds the vector eaik and the vector eslk in component units to obtain the excitation vector e and supplies it to the multiplier 312. Gain control circuit 311
Is the multiplier 3 obtained by obtaining the excitation gain σ from the past optimum excitation.
Supply to 12. The multiplier 312 receives the excitation vector e
Is multiplied by the excitation gain σ to obtain a vector eg, which is supplied to the synthesis filter 313.

【００２５】合成フィルタ３１３は、供給される声道予
測パラメータａqjをフィルタの例えばタップ係数などと
してベクトルｅｇをフィルタリングして合成音声ベクト
ルＳｗを求めて減算器３１４に供給する。減算器３１４
は、原音声ベクトルＳと合成音声ベクトルＳｗの成分単
位の減算を行い誤差ベクトルｅｒを求めて聴覚フィルタ
３１５に供給する。聴覚フィルタ３１５は、誤差ベクト
ルｅｒに対するフィルタリングを行い出力ベクトルｅｗ
を聴覚誤差計算回路３１６に供給する。聴覚誤差計算回
路３１６は、ベクトルｅｗの各成分の２乗平均を求め
て、この値が最小になるｉ、ｊ、ｋの組み合わせを最適
な各コードブックの適応励振コードインデックスＩａ、
確率的励振コードインデックスＩｓ、励振ゲインコード
インデックスＩｇとして、これらを多重化回路３０４に
供給すると共に、適応励振コードインデックスＩａを適
応励振コードブック３０５に供給し、確率的励振コード
インデックスＩｓを確率的励振コードブック３０７に供
給し、励振ゲインコードインデックスＩｇをＶＱゲイン
コードブック３０６に供給する。The synthesis filter 313 filters the vector eg using the supplied vocal tract prediction parameter aqj as, for example, a tap coefficient of the filter to obtain a synthesized speech vector Sw and supplies it to the subtractor 314. Subtractor 314
Is subtracting the original voice vector S and the synthesized voice vector Sw in component units to obtain an error vector er and supplies it to the auditory filter 315. The auditory filter 315 filters the error vector er and outputs the output vector ew.
Is supplied to the auditory error calculation circuit 316. The auditory error calculation circuit 316 obtains the root mean square of each component of the vector ew, and the combination of i, j, and k that minimizes this value is the optimum adaptive excitation code index Ia of each codebook,
The stochastic excitation code index Is and the excitation gain code index Ig are supplied to the multiplexing circuit 304, the adaptive excitation code index Ia is supplied to the adaptive excitation code book 305, and the stochastic excitation code index Is is stochastically excited. The excitation gain code index Ig is supplied to the codebook 307, and the excitation gain code index Ig is supplied to the VQ gain codebook 306.

【００２６】適応励振コードブック３０５は、適応励振
コードインデックスＩａによって最適な適応励振コード
ベクトルｅaoを出力して乗算器３０８に供給する。ま
た、確率的励振コードブック３０５は確率的励振コード
インデックスＩｓによって最適な確率的励振コードベク
トルｅsoを出力して乗算器３１０に供給する。また、Ｖ
Ｑゲインコードブック３０６は、励振ゲインコードイン
デックスＩｇによって最適な励振ゲインｇao、ｇsoの対
を出力し、励振ゲインｇaoは乗算器３０８に供給し、励
振ゲインｇsoは乗算器３１０に供給する。The adaptive excitation codebook 305 outputs the optimum adaptive excitation code vector eao according to the adaptive excitation code index Ia and supplies it to the multiplier 308. Further, the stochastic excitation codebook 305 outputs the optimal stochastic excitation code vector eso according to the stochastic excitation code index Is and supplies it to the multiplier 310. Also, V
The Q gain codebook 306 outputs an optimum pair of excitation gains gao and gso according to the excitation gain code index Ig, the excitation gain gao is supplied to the multiplier 308, and the excitation gain gso is supplied to the multiplier 310.

【００２７】乗算器３０８は、最適な適応励振コードベ
クトルｅaoと最適な励振ゲインｇaoの乗算を行い、ｅao
・ｇaoを加算器３０９に供給する。また、乗算器３１０
は、最適な確率的励振コードベクトルｅsoと最適な励振
ゲインｇsoの乗算を行い、ｅso・ｇsoを加算器３０９に
供給する。加算器３０９は、ｅao・ｇaoとｅso・ｇsoの
加算を行い、この加算値をｅｏｐｔとして適応励振コー
ドブック３０５に供給し、内容を更新する。The multiplier 308 multiplies the optimum adaptive excitation code vector eao by the optimum excitation gain gao to obtain eao.
Supply gao to the adder 309. Also, the multiplier 310
Multiplies the optimal stochastic excitation code vector eso by the optimal excitation gain gso, and supplies eso · gso to the adder 309. The adder 309 performs addition of eao · gao and eso · gso, supplies the added value to the adaptive excitation codebook 305 as eopt, and updates the content.

【００２８】また、各コードブックによる最適コードイ
ンデックスの探索順番は、いろいろな方法があるが特に
限定するものではない。例えば、適応励振コードベクト
ルをｅa1とし、確率的励振コードベクトルをｅs1とし、
励振ゲインを（ｇa1、ｇｓ1)〜（ｇap、ｇsp）に順番に
変えて探索し、最適な適応励振コードベクトルと確率的
励振コードベクトルと励振ゲインとを求めることもでき
る。There are various methods for searching the optimum code index by each codebook, but there is no particular limitation. For example, the adaptive excitation code vector is ea1, the stochastic excitation code vector is es1,
It is also possible to find the optimum adaptive excitation code vector, stochastic excitation code vector, and excitation gain by changing the excitation gain in order from (ga1, gs1) to (gap, gsp).

【００２９】多重化回路３０４は、以上のようにして得
られたＬＳＰコードＩｗと、適応励振コードインデック
スＩａと、確率的励振コードインデックスＩｓと、励振
ゲインコードインデックスＩｇとを多重化してこれをト
ータルコードＣとして出力し、復号化器に供給する。The multiplexing circuit 304 multiplexes the LSP code Iw thus obtained, the adaptive excitation code index Ia, the stochastic excitation code index Is, and the excitation gain code index Ig, and totals them. The code C is output and supplied to the decoder.

【００３０】図３は、この実施例のコード励振線形予測
復号化器の機能ブロック図を示している。FIG. 3 shows a functional block diagram of the code-excited linear predictive decoder of this embodiment.

【００３１】図３において、この実施例のコード励振線
形予測復号化回路は、多重分離回路４０２と、適応励振
コードブック４０３と、ＶＱゲインコードブック４０４
と、統計励振コードブック４０５と、乗算器４０６と、
乗算器４０７と、加算器４０８と、ゲイン制御回路４０
９と、乗算器４１０と、ＬＳＰ逆量子化器４１１と、合
成フィルタ４１２とで構成されている。In FIG. 3, the code-excited linear predictive decoding circuit of this embodiment includes a demultiplexing circuit 402, an adaptive excitation codebook 403, and a VQ gain codebook 404.
A statistical excitation codebook 405, a multiplier 406,
Multiplier 407, adder 408, and gain control circuit 40
9, a multiplier 410, an LSP dequantizer 411, and a synthesis filter 412.

【００３２】次に図３の動作を説明する。入力されるト
ータルコードＣは、多重分離回路４０２において、各信
号に分離されて、ＬＳＰコードＩｗはＬＳＰ逆量子化器
４１１に供給され、適応励振コードインデックスＩａは
適応励振コードブック４０３に供給され、確率的励振コ
ードインデックスＩｓは確率的励振コードブック４０５
に供給され、励振ゲインコードインデックスＩｇはＶＱ
ゲインコードブック４０４に供給される。Next, the operation of FIG. 3 will be described. The input total code C is separated into each signal in the demultiplexing circuit 402, the LSP code Iw is supplied to the LSP dequantizer 411, the adaptive excitation code index Ia is supplied to the adaptive excitation codebook 403, The probabilistic excitation code index Is is the probabilistic excitation codebook 405.
And the excitation gain code index Ig is VQ.
It is supplied to the gain codebook 404.

【００３３】ＬＳＰ逆量子化器４１１は、ＬＳＰコード
Ｉｗより、ＬＳＰパラメータを逆量子化して、更に補間
を行った後に声道予測パラメータａｊに変換して、合成
フィルタ４１２に供給する。The LSP dequantizer 411 dequantizes the LSP parameter from the LSP code Iw, further interpolates it, and then converts it into the vocal tract prediction parameter aj, and supplies it to the synthesis filter 412.

【００３４】適応励振コードブック４０３は、適応励振
コードインデックスＩａに相当する適応励振コードベク
トルｅａを乗算器４０６に供給する。また、確率的励振
コードブック４０５は、確率的励振コードインデックス
Ｉｓに相当する確率的励振コードベクトルｅｓを乗算器
４０７に供給する。ＶＱゲインコードブック４０４は、
励振ゲインコードインデックスＩｇに相当する励振ゲイ
ンｇａ、ｇｓを出力し、励振ゲインｇａは、乗算器４０
６に供給し、励振ゲインｇｓは、乗算器４０７に供給す
る。The adaptive excitation codebook 403 supplies the adaptive excitation code vector ea corresponding to the adaptive excitation code index Ia to the multiplier 406. The stochastic excitation codebook 405 also supplies the stochastic excitation code vector es corresponding to the stochastic excitation code index Is to the multiplier 407. The VQ gain codebook 404 is
The excitation gains ga and gs corresponding to the excitation gain code index Ig are output, and the excitation gain ga is calculated by the multiplier 40.
6 and the excitation gain gs is supplied to the multiplier 407.

【００３５】乗算器４０６は、適応励振コードベクトル
ｅａと励振ゲインｇａを乗算して得られたベクトルｅag
を加算器４０８に供給する。また、乗算器４０７は、確
率的励振コードベクトルｅｓと励振ゲインｇｓを乗算し
て得られたベクトルｅsgを加算器４０８に供給する。加
算器４０８は、ベクトルｅagとベクトルｅsgを加算して
得られたベクトルｅを乗算器４１０に供給する。また、
このベクトルｅは、適応励振コードブックにも供給され
て、このベクトルｅによって内容が更新される。The multiplier 406 calculates a vector eag obtained by multiplying the adaptive excitation code vector ea by the excitation gain ga.
Is supplied to the adder 408. Further, the multiplier 407 supplies the vector esg obtained by multiplying the stochastic excitation code vector es and the excitation gain gs to the adder 408. The adder 408 supplies the vector e obtained by adding the vector eag and the vector esg to the multiplier 410. Also,
This vector e is also supplied to the adaptive excitation codebook, and the content is updated by this vector e.

【００３６】ゲイン制御回路４０９は、過去の最適励振
より励振ゲインσを求めて乗算器４１０に供給する。乗
算器４１０は、前記励振ベクトルｅと励振ゲインσを乗
算してベクトルｅｇを得て合成フィルタ４１２に供給す
る。The gain control circuit 409 obtains the excitation gain σ from the past optimum excitation and supplies it to the multiplier 410. The multiplier 410 multiplies the excitation vector e by the excitation gain σ to obtain a vector eg and supplies the vector eg to the synthesis filter 412.

【００３７】合成フィルタ４１２は、供給される声道予
測パラメータａｊをフィルタの例えばタップ係数などと
してベクトルｅｇをフィルタリングして合成音声ベクト
ルＳを求めて出力する。The synthesis filter 412 filters the vector eg using the supplied vocal tract prediction parameter aj as a filter coefficient, for example, to obtain the synthesized speech vector S and output it.

【００３８】以上の実施例によれば、ＬＳＰ分析による
ＬＳＰパラメータのコードＩｗを伝送し、復号化器側で
は、このＬＳＰコードＩｗをＬＳＰ逆量子化して声道予
測パラメータａｊを求めて、合成フィルタで合成音声を
再生しているので、ＬＳＰパラメータの量子化ビット数
を少なくしても従来のＰＡＲＣＯＲ方式による分析に比
べ、再生した合成音声のＳ／Ｎを向上させることができ
る。また、ＶＱゲインコードブックで励振ゲインコード
インデックスＩｇによって最適な励振ゲインｇao、ｇso
の対を出力し、適応励振コードベクトルに最適な励振ゲ
インｇaoを乗算し、また、確率的励振コードベクトルに
最適な励振ゲインｇsoを乗算して符号化及び復号化して
いるので、伝送するのは最適な励振ゲインコードインデ
ックスＩｇだけでよい。従来は２系統のゲインコードイ
ンデックスを伝送していた。従って、低い伝送ビットレ
ートであっても、十分な音声品質を得ることができる。According to the above embodiment, the LSP parameter code Iw obtained by the LSP analysis is transmitted, and the decoder side LSP dequantizes the LSP code Iw to obtain the vocal tract prediction parameter aj, and the synthesis filter. Since the synthetic speech is reproduced in S., even if the number of quantization bits of the LSP parameter is reduced, the S / N of the reproduced synthetic speech can be improved as compared with the analysis by the conventional PARCOR method. Also, in the VQ gain code book, the optimum excitation gains gao and gso are determined by the excitation gain code index Ig.
Is output, and the adaptive excitation code vector is multiplied by the optimal excitation gain gao, and the stochastic excitation code vector is multiplied by the optimal excitation gain gso for encoding and decoding. Only the optimum excitation gain code index Ig is required. Conventionally, two systems of gain code indexes were transmitted. Therefore, sufficient voice quality can be obtained even at a low transmission bit rate.

【００３９】以上の実施例においては、フォワード型の
コード励振線形予測符号化器及び復号化器について説明
したが、この構成に限るものではなく、例えば、バック
ワード型のコード励振線形予測符号化器及び復号化器に
対しても適用することができる。Although the forward type code excitation linear predictive encoder and decoder have been described in the above embodiments, the present invention is not limited to this configuration. For example, a backward type code excitation linear predictive encoder and decoder. And can be applied to a decoder.

【００４０】また、ＶＱゲインコードブックは２種類の
励振ゲインを一対として格納していたが、これに限るも
のではなく、ゲイン制御する励振コードブックの種類数
に応じて複数個を一組として格納する構成であってもよ
い。The VQ gain codebook stores two types of excitation gains as a pair, but the present invention is not limited to this, and a plurality of VQ gain codebooks are stored as a set according to the number of types of excitation codebooks to be gain controlled. It may be configured to.

【００４１】[0041]

【発明の効果】以上述べたようにこの発明によれば、線
スペクトル対分析手段と、ゲイン制御コードブックとを
備えて、ゲイン制御された複数種類の励振源情報と上記
線スペクトル対分析情報とを用いて、音声信号の符号化
及び復号化を行っているので、低い伝送ビットレートで
あっても、十分な合成音声品質を得ることができる。As described above, according to the present invention, a plurality of kinds of gain-controlled excitation source information and the line spectrum pair analysis information are provided, which are provided with the line spectrum pair analysis means and the gain control codebook. Since the voice signal is encoded and decoded by using, the sufficient synthesized voice quality can be obtained even at a low transmission bit rate.

[Brief description of drawings]

【図１】この実施例に係るコード励振線形予測符号化器
の機能ブロック図である。FIG. 1 is a functional block diagram of a code excitation linear predictive encoder according to this embodiment.

【図２】従来例に係るコード励振線形予測符号化器の機
能ブロック図である。FIG. 2 is a functional block diagram of a code-excited linear predictive encoder according to a conventional example.

【図３】この実施例に係るコード励振線形予測復号化器
の機能ブロック図である。FIG. 3 is a functional block diagram of a code excitation linear prediction decoder according to this embodiment.

[Explanation of symbols]

３０１…声道分析回路、３０２…ＬＳＰ量子化器、３０
３、４１１…ＬＳＰ逆量子化器、３０４…多重化回路、
３０５…適応励振コードブック、３０６…ＶＱゲインコ
ードブック、３０７…確率的励振コードブック、３０
８、３１０、３１２、４０６、４０７、４１０…乗算
器、３０９、４０８…加算器、３１１、４０９…ゲイン
制御回路、３１３、４１２…合成フィルタ、３１４…減
算器、３１５…聴覚フィルタ、３１６…聴覚誤差計算回
路、４０２…多重分離回路。301 ... Vocal tract analysis circuit, 302 ... LSP quantizer, 30
3, 411 ... LSP dequantizer, 304 ... Multiplexing circuit,
305 ... Adaptive excitation codebook, 306 ... VQ gain codebook, 307 ... Stochastic excitation codebook, 30
8, 310, 312, 406, 407, 410 ... Multiplier, 309, 408 ... Adder, 311, 409 ... Gain control circuit, 313, 412 ... Synthesis filter, 314 ... Subtractor, 315 ... Auditory filter, 316 ... Auditory Error calculation circuit, 402 ... Demultiplexing circuit.

───────────────────────────────────────────────────── フロントページの続き (72)発明者細田賢一郎東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Kenichiro Hosoda 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd.

Claims

[Claims]

1. A code excitation linear predictive coding method using a plurality of types of excitation source information, wherein a line spectrum pair analysis is performed on a speech signal, and a line spectrum pair is generated which is used to generate synthetic speech. The gain control codebook for storing a plurality of sets of gain control information composed of a plurality of gain control information is provided, and the plurality of types of excitation are controlled by the plurality of gain control information in the optimum gain control information set. A code excitation line characterized by performing gain control on source information and performing encoding and decoding of a voice signal using a plurality of types of gain-controlled excitation source information and the line spectrum pair analysis information. Predictive coding method.