JP3031765B2

JP3031765B2 - Code-excited linear predictive coding

Info

Publication number: JP3031765B2
Application number: JP3251152A
Authority: JP
Inventors: 弘美青柳; 浩桂川; 義博有山; 賢一郎細田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-09-30
Filing date: 1991-09-30
Publication date: 2000-04-10
Anticipated expiration: 2015-04-10
Also published as: JPH05127699A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は、コード励振線形予測
符号化方式に関し、例えば音声信号などの高品質圧縮符
号化方式として適用し得るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a code-excited linear predictive coding method, and can be applied as a high-quality compression coding method for, for example, audio signals.

【０００２】[0002]

【従来の技術】従来のコード励振線形予測符号化方式
は、例えば、文献１に示されている。文献１：『1989,P
roc.IEEE Int.Conf.on Acoustics,Speech and Signal P
rocessing,PP.65-pp.68,N.S.Jayant and J.H.Chen,"Spe
ech Coding with Time-VaryingBit Allocations to Exc
itation and LPC Parameters"』図２は、文献１に開示
されているものを整理した一例のコード励振線形予測符
号化器の機能ブロック図である。Code Excited Linear Prediction encoding method of the Prior Art Conventionally, for example, has been shown in the literature 1. Reference 1: 『1989, P
roc.IEEE Int.Conf.on Acoustics, Speech and Signal P
rocessing, PP.65-pp.68, NSJayant and JHChen, "Spe
ech Coding with Time-VaryingBit Allocations to Exc
FIG. 2 is disclosed in Reference 1 .
FIG. 3 is a functional block diagram of an example of a code-excited linear prediction encoder that summarizes what is performed.

【０００３】図２において、入力原音声ベクトルＳは、
分析回路１０２に供給され、ここでＰＡＲＣＯＲ(PARti
al auto-CORrelation:偏自己相関)分析などを行い、量
子化短時間予測係数αjpを多重化回路１１１に供給し、
声道予測係数αjqを短時間フィルタ１０３に供給する。In FIG. 2, an input original speech vector S is
It is supplied to the analysis circuit 102, where the PARCO (PARti
al auto-CORrelation (partial autocorrelation) analysis, and supplies the quantized short-term prediction coefficient αjp to the multiplexing circuit 111,
The vocal tract prediction coefficient αjq is supplied to the short-time filter 103.

【０００４】次に、スイッチ１０８と１１３を開き、適
応励振コードブック１０４は、この状態において、適応
励振コードベクトルｅa(i)（ｉ＝１〜ｎ）を、供給され
る適応励振コードインデックスＩａに基づいて選択出力
する。ゲイン制御回路１１９は、原音声ベクトルＳと、
適応励振コードベクトルｅa(i)とから、適応励振ゲイン
制御コードβapを生成し多重化回路１１１に供給する。
また、ゲイン制御回路１１９は、適応励振ゲイン制御コ
ードβapを乗算器１２２に供給して、適応励振コードベ
クトルβｅa(i)と適応励振ゲイン制御コードβapとを乗
算させてゲインの制御を行い、乗算後の適応励振コード
ベクトルβea(i)を加算器１１０に供給させる。Next, the switches 108 and 113 are opened, and the adaptive excitation codebook 104 in this state stores the adaptive excitation code vector ea (i) (i = 1 to n) in the supplied adaptive excitation code index Ia. Selective output based on The gain control circuit 119 calculates an original speech vector S,
The adaptive excitation gain control code βap is generated from the adaptive excitation code vector ea (i) and supplied to the multiplexing circuit 111.
The gain control circuit 119, adaptive excitation gain control co
Supplies over de βap to the multiplier 122, the adaptive excitation code vector βea (i) and adaptive excitation gain control code βap and allowed to multiply <br/> calculated by and controls the gain, the adaptive excitation code vector after multiplication βea: (i) Ru is supplied to the adder 110.

【０００５】加算器１１０は、適応励振コードベクトル
βea(i)をそのまま励振ベクトルｅ(i）として、短時間
フィルタ１０３に供給し、短時間フィルタ１０３は、合
成音声ベクトルＳｗを減算器１０９に供給する。減算器
１０９は、原音声ベクトルＳと合成音声ベクトルＳｗの
成分単位の減算を行い、誤差ベクトルｅr(i)を聴覚フィ
ルタ１０５に供給する。聴覚フィルタ１０５は、フィル
タリングを行い、ベクトルｅw(i)を聴覚誤差計算回路１
０６に供給する。聴覚誤差計算回路１０６は、ベクトル
ｅw(i)の各成分単位の二乗平均ｇｉを計算し、この二乗
平均ｇｉが最小となるｉを最適な適応励振コードベクト
ルのインデックスＩａとして、適応励振コードブック１
０４と多重化回路１１１に供給する。The adder 110 supplies the adaptive excitation code vector βea (i) as it is to the short-time filter 103 as the excitation vector e (i), and the short-time filter 103 supplies the synthesized speech vector Sw to the subtractor 109. I do. The subtracter 109 performs subtraction on a component basis between the original speech vector S and the synthesized speech vector Sw, and supplies the error vector er (i) to the auditory filter 105. The auditory filter 105 performs filtering and converts the vector ew (i) into the auditory error calculation circuit 1
06. Hearing error calculation circuit 106 calculates a root mean gi of each component unit vector ew (i), the square
I that minimizes the average gi is the optimal adaptive excitation code vector
As the index Ia Le, adaptive excitation codebook 1
04 and the multiplexing circuit 111.

【０００６】次には、スイッチ１１３を閉じて、確率的
励振コードブック１０７から出力される確率的励振コー
ドベクトルｅs(i)に対して、ゲイン制御回路１２３で生
成した確率的励振ゲイン制御コードβsqを乗算器１２６
で乗算してゲイン制御し、乗算後の確率的励振コードベ
クトルβｅs(i)を加算器１１０に供給する。このとき、
ゲイン制御回路１２３は、確率的励振ゲイン制御コード
βspを多重化回路１１１にも供給する。加算器１１０は
確率的励振コードベクトルβｅs(i)と適応励振コードベ
クトルβea(i)の加算を行い、励振ベクトルｅ（ｉ）を
前述と同様に短時間フィルタ１０３に供給して、以下、
上述と同様な処理の方法で減算器１０９と、聴覚フィル
タ１０５と、聴覚誤差計算回路１０６で処理して、最適
な確率的励振コードベクトルのインデックスＩｓを求め
て、多重化回路１１１と、確率的励振コードブック１０
７に供給する。Next, the switch 113 is closed, and the stochastic excitation gain control code βsq generated by the gain control circuit 123 is applied to the stochastic excitation code vector es (i) output from the stochastic excitation codebook 107. To the multiplier 126
To control the gain, and supplies the multiplied stochastic excitation code vector βes (i) to the adder 110. At this time,
The gain control circuit 123 also supplies the stochastic excitation gain control code βsp to the multiplexing circuit 111. The adder 110 adds the probabilistic excitation code vector βes (i) and the adaptive excitation code vector βea (i) , and supplies the excitation vector e (i) to the short-time filter 103 as described above .
A subtracter 109 in the above and similar processing methods, the auditory filter 105, and treated with auditory error calculation circuit 106, the optimal
Seeking index Is of stochastic excitation code vector, a multiplexing circuit 111, stochastic excitation codebook 10
7

【０００７】次に、スイッチ１０８と１１３とを閉じ
て、励振ベクトルｅ（ｉ）によって適応励振コードブッ
ク１０４の内容を更新する。[0007] Next, by closing the switch 108 and 113, and updates the contents of the adaptive excitation codebook 104 by the excitation vector e (i).

【０００８】以上のようにして得られた確率的励振コー
ドインデックスＩｓと適応励振コードインデックスＩａ
と確率的励振ゲイン制御コードβspと適応励振ゲイン制
御コードβapと量子化短時間予測係数αjpとを、多重化
回路１１１が多重化してトータルコードＣを出力してい
た。[0008] The probabilistic excitation code index Is and the adaptive excitation code index Ia obtained as described above.
The stochastic excitation gain control code βsp the adaptive excitation gain control code βap quantization short prediction coefficients αjp and, multiplexing
The circuit 111 multiplexes and outputs the total code C.

【０００９】なお、コード励振線形予測復号化器におい
ては、上記処理と対称的な処理を行って再生合成音声信
号を得ていた。 [0009] In the code-excited linear prediction decoder,
In other words, by performing processing symmetrical to the above processing,
Issue.

【００１０】[0010]

【発明が解決しようとする課題】しかしながら、以上の
従来の方式においては、入力音声信号に対するＰＡＲＣ
ＯＲ分析での量子化ビット数が少なくなると、再生した
合成音声に歪みなどが生じて、十分なＳ／Ｎなどが得ら
れないなど良好な音声合成品質が得られないという問題
があった。従って、量子化ビット数を少なくすることが
できず、従って伝送ビットレートを低速にするための障
害となっていた。However, in the above conventional method, the PARC for the input voice signal is not used.
When the number of quantization bits in the OR analysis is small, there is a problem that distortion or the like is generated in the reproduced synthesized voice, and that a satisfactory S / N cannot be obtained, such as a sufficient S / N cannot be obtained. Therefore, the number of quantization bits cannot be reduced, which has been an obstacle to reducing the transmission bit rate.

【００１１】ＰＡＲＣＯＲ分析の場合、情報量を減らす
ために、フレーム間隔を例えば２０msecまでに延ばし、
その間に各パラメータの変わり方をスムーズにするため
に、例えば、２．５msecごとに線形補間を行う。しかし
ながら、この線形補間がＰＡＲＣＯＲ係数に必ずしも適
応せず、そのために再生した合成音声のスペクトルに歪
みが増大して、音質が劣化していた。これは、ＰＡＲＣ
ＯＲ係数が単一の明確な物理量に対応せず、複雑な複合
パラメータになっていることによるとも考えられる。[0011] For P ARCOR analysis, in order to reduce the information amount, extending the frame interval for example to 20 msec,
In the meantime, in order to smoothly change the parameters, linear interpolation is performed, for example, every 2.5 msec. However, this linear interpolation does not always adapt to the PARCOR coefficient, and therefore, the distortion of the reproduced synthesized speech spectrum increases, and the sound quality is degraded. This is PARC
It is also considered that the OR coefficient does not correspond to a single definite physical quantity and is a complex compound parameter.

【００１２】また、適応励振コードブックと確率的励振
コードブックに対するゲインをそれぞれ独立にスカラー
量子化した係数でゲイン制御していたので、量子化ビッ
ト数が少なくなると、歪みなどが生じて再生した合成音
声のＳ／Ｎなどが十分に得られないため、量子化ビット
数を少なくすることができず、その結果、伝送ビットレ
ートを低速にするための障害となっていた。Further, since the gain for the adaptive excitation codebook and the gain for the stochastic excitation codebook are independently controlled by coefficients that are scalar-quantized, when the number of quantization bits is reduced, distortion and the like are generated, and the reproduced image is synthesized. Since the S / N of the sound cannot be sufficiently obtained, the number of quantization bits cannot be reduced, and as a result, there has been an obstacle to lowering the transmission bit rate.

【００１３】適応励振ゲイン制御コードと、確率的励振
ゲイン制御コードを独立に生成して伝送していたので、
伝送ビットレートを高くする傾向にあり、励振源コード
ブックの種類がもっと多くなると、ますますゲイン制御
コードの種類も多くなり、伝送ビットレートを高くせざ
るを得なくなるという問題があった。Since the adaptive excitation gain control code and the stochastic excitation gain control code are independently generated and transmitted,
There is a tendency that the transmission bit rate tends to be high, and if the number of excitation source codebooks increases, the number of gain control codes increases, and the transmission bit rate must be increased.

【００１４】現在のこのような符号化方式に対する要請
は、低い伝送ビットレートで再生した合成音声の品質が
良好に得られるということであるが、以上のような問題
によって実現が困難であった。The demand for such an encoding system at present is that the quality of synthesized speech reproduced at a low transmission bit rate can be obtained well, but it has been difficult to realize the above problem due to the above problems.

【００１５】この発明は、以上の課題に鑑み為されたも
のであり、その目的とするところは、低い伝送ビットレ
ートであっても、十分な合成音声品質が得られるコード
励振線形予測符号化方式を提供することである。SUMMARY OF THE INVENTION The present invention has been made in view of the above problems, and has as its object to provide a code-excited linear predictive coding method capable of obtaining a sufficient synthesized speech quality even at a low transmission bit rate. It is to provide.

【００１６】[0016]

【課題を解決するための手段】この発明は、以上の目的
を達成するために、複数種類の励振源情報を用いるコー
ド励振線形予測符号化方式に対し、以下の特徴的な手段
と方法を導入した。SUMMARY OF THE INVENTION The present invention, in order to achieve the above object, against the code excited linear predictive coding scheme using a plurality of types of excitation source information, introducing the following characteristic units and methods did.

【００１７】すなわち、音声信号に対する線スペクトル
対分析を行い、合成音声の生成に利用する線スペクトル
対分析情報を生成すると共に、その線スペクトル対分析
情報に対応したコードが多重化符号化音声信号の一要素
となっている線スペクトル対分析手段と、上記複数種類
の励振源情報のそれぞれに対する複数個のゲイン制御情
報からなるゲイン制御情報組を複数組格納しており、そ
のいずれかの組を規定するインデックスが多重化符号化
音声信号の一要素となっているゲイン制御コードブック
と、ゲイン制御コードブックからの出力によって、それ
ぞれがゲイン制御された後の複数種類の励振源情報が加
算された加算励振源情報に対するゲインを、過去の最適
な加算励振源情報から形成して、上記加算励振源情報を
ゲイン制御するゲイン制御手段とを備えて、最適なゲイ
ン制御情報組における複数個のゲイン制御情報によって
上記複数種類の励振源情報に対してゲイン制御を行い、
ゲイン制御された複数種類の励振源情報を加算し、加算
励振源情報に対してもゲイン制御を行い、ゲイン制御さ
れた加算励振源情報と上記線スペクトル対分析情報とを
用いて、音声信号の符号化及び復号化を行うことを特徴
とする。 That is , a line spectrum pair analysis is performed on a speech signal to generate line spectrum pair analysis information used for generating a synthesized voice, and the line spectrum pair analysis information is generated .
The code corresponding to the information is one element of the multiplexed coded audio signal
A line spectrum pair analysis means which is a, the plurality of types
Of which stores a plurality of sets of gain control information set comprising a plurality of gain control information for each of the excitation source information, its
Index that specifies any pair of
The gain control codebook, which is an element of the audio signal, and the output from the gain control codebook
Each type of excitation source information after gain control is added.
The gain for the calculated excitation information
From the additional excitation source information.
Gain control means for performing gain control, performing gain control on the plurality of types of excitation source information by a plurality of gain control information in an optimal gain control information set,
Adds and adds multiple types of excitation source information with gain control
Gain control is also performed on the excitation source information,
The audio signal is encoded and decoded using the added excitation source information and the line spectrum pair analysis information.

【００１８】[0018]

【作用】この発明によれば、入力音声信号に対して線ス
ペクトル対分析を行っているので、主要なホルマントに
対する線スペクトル対は、ホルマント周波数からのずれ
が少なくて、比較的良く対応する。従って、線形補間特
性が良く、線形補間によるスペクトル歪みが少なく音質
の劣化が少なくなる。そして、線スペクトル対の係数を
低ビット数に量子化しても、従来のＰＡＲＣＯＲ分析に
よるよりも、音質を良くすることができる。According to the present invention, since the line spectrum pair analysis is performed on the input voice signal, the line spectrum pair corresponding to the main formant has a small deviation from the formant frequency and corresponds relatively well. Therefore, the linear interpolation characteristics are good, the spectral distortion due to the linear interpolation is small, and the deterioration of the sound quality is reduced. Then, even if the coefficients of the line spectrum pair are quantized to a low bit number, sound quality can be improved as compared with the conventional PARCOR analysis.

【００１９】また、複数種類の励振源情報ごとのゲイン
情報を伝送する必要がない。すなわち、各励振源情報に
対するそれぞれのゲイン制御情報が組として形成されて
いるので、この組を指定するインデックスを伝送するだ
けで良いので、伝送ビットレートを従来に比べ低下させ
ることができる。しかも、ゲイン制御情報の組を予め最
適なゲイン制御情報の組合せに設定しておくことによっ
て、効率的に良好な音質を得ることができる。Further, there is no need to transmit gain information for each of a plurality of types of excitation source information. That is, since the respective gain control information for each excitation source information is formed as a set , it is only necessary to transmit an index designating this set, so that the transmission bit rate can be reduced as compared with the related art. Moreover, by setting the combination of the gain control information to the optimum combination of the gain control information in advance, it is possible to efficiently obtain good sound quality.

【００２０】さらに、ゲイン制御された複数種類の励振
源情報を加算し、加算励振源情報に対してもゲイン制御
を行うので、再生音声の音質の向上を期待できる。 Furthermore, a plurality of types of excitations with gain control
Source information is added, and gain control is also performed on the added excitation source information.
Therefore, improvement in the sound quality of the reproduced sound can be expected.

【００２１】[0021]

【実施例】以下、この発明に係るコード励振線形予測符
号化方式の好適な一実施例を図面を用いて説明する。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the code excitation linear predictive coding system according to the present invention will be described below with reference to the drawings.

【００２２】この実施例は、フォワード型のコード励振
線形予測符号化を行うコード励振線形予測符号化器とそ
の復号化器である。[0022] This embodiment is a code excited linear predictive coding of forward type and row Uco over de-excited linear prediction encoder that decoder.

【００２３】なお、この実施例の目的は、低ビットレー
トにおいても優れた合成音声品質を得られるコード励振
線形予測符号化器とその復号化器を実現することであ
る。[0023] The object of this example is to realize a code excited linear predictive coder is also obtained good synthetic speech quality at low bit rates the decoder.

【００２４】この目的を実現するために、この実施例で
は、主に声道予測パラメータをＬＳＰ（ＬｉｎｅＳｐ
ｅｃｔｒｕｍＰａｉｒ）パラメータに変換して量子化
する回路と、ＬＳＰパラメータを逆量子化して補間し、
声道予測パラメータに変換する回路と、適応励振コード
ブックと確率的励振コードブックのゲインをベクトル量
子化したＶＱ（ベクトル量子化）ゲインコードブック
と、バックワード型の励振ゲイン予測回路とを設けてい
る。In order to realize this object, in this embodiment, the vocal tract prediction parameters are mainly set to LSP (Line Sp).
(Electrum Pair) parameters and a circuit for performing quantization, and LSP parameters are inversely quantized and interpolated.
A circuit for converting to a vocal tract prediction parameter, a VQ (vector quantization) gain codebook in which the gains of the adaptive excitation codebook and the stochastic excitation codebook are vector-quantized, and a backward-type excitation gain prediction circuit are provided. I have.

【００２５】図１は、この実施例のコード励振線形予測
符号化器の機能ブロック図を示している。図３は、この
実施例のコード励振線形予測復号化器の機能ブロック図
を示している。FIG. 1 is a functional block diagram of a code excitation linear prediction encoder according to this embodiment. FIG. 3 shows a functional block diagram of the code-excited linear prediction decoder of this embodiment.

【００２６】図１において、この実施例のコード励振線
形予測符号化器は、声道分析回路３０１と、ＬＳＰ量子
化器３０２と、ＬＳＰ逆量子化器３０３と、多重化回路
３０４と、適応励振コードブック３０５と、ＶＱゲイン
コードブック３０６と、確率的励振コードブック３０７
と、乗算器３０８と、加算器３０９と、乗算器３１０
と、ゲイン制御回路３１１と、乗算器３１２と、合成フ
ィルタ３１３と、減算器３１４と、聴覚フィルタ３１５
と、聴覚誤差計算回路３１６とで構成されている。In FIG. 1, the code-excited linear predictive encoder according to this embodiment includes a vocal tract analysis circuit 301, an LSP quantizer 302, an LSP inverse quantizer 303, a multiplexing circuit 304, and an adaptive excitation. Codebook 305, VQ gain codebook 306, stochastic excitation codebook 307
, A multiplier 308, an adder 309, and a multiplier 310
, A gain control circuit 311, a multiplier 312, a synthesis filter 313, a subtractor 314, and an auditory filter 315.
And a hearing error calculation circuit 316.

【００２７】ここで、声道分析回路３０１と、ＬＳＰ量
子化器３０２と、ＬＳＰ逆量子化器３０３は、ＬＳＰ分
析系を形成している。[0027] In here, the vocal tract analysis circuit 301, the LSP quantizer 302, LSP dequantizer 303 forms an LSP analysis system.

【００２８】次に、図１に示すコード励振線形予測符号
化器の動作を説明する。Next , the code-excited linear prediction code shown in FIG.
Illustrating an operation of the encoder.

【００２９】フレーム単位にまとめられた原音声ベクト
ルＳは、声道分析回路３０１に供給され、声道予測パラ
メータａｊが求められる。この声道予測パラメータａｊ
は、ＬＳＰ量子化器３０２に供給され、ＬＳＰ量子化器
３０２は、この声道予測パラメータａｊをＬＳＰパラメ
ータに変換して量子化し、そのＬＳＰコードＩｗをＬＳ
Ｐ逆量子化器３０３と多重化回路３０４に供給する。Ｌ
ＳＰ逆量子化器３０３は、ＬＳＰコードＩｗよりＬＳＰ
パラメータを逆量子化し、補間を行った後に声道予測パ
ラメータａqjに逆変換して合成フィルタ３１３に供給す
る。The frame units summarized original speech vector S is supplied to the vocal tract analysis circuit 301, it is required vocal tract predictive parameter aj. This vocal tract prediction parameter aj
Is supplied to the LSP quantizer 302, and the LSP quantizer
302 converts the vocal tract prediction parameter aj into an LSP parameter and quantizes the LSP code, and converts the LSP code Iw into an LS
It supplied to P inverse quantizer 303 and the multiplexing circuit 304. L
The SP inverse quantizer 303 calculates the LSP from the LSP code Iw.
After the parameters are inversely quantized and interpolated, they are inversely transformed into vocal tract prediction parameters aqj and supplied to the synthesis filter 313.

【００３０】一方、適応励振コードブック３０５は、適
応励振コードベクトルｅai（ｉ＝１〜ｎ）を乗算器３０
８に供給する。また、確率的励振コードブック３０７
は、確率的励振コードベクトルｅsl（ｌ＝１〜ｍ）を乗
算器３１０に供給する。さらに、ＶＱゲインコードブッ
ク３０６は、励振ゲインｇak、ｇsk（ｋ＝１〜ｐ）の対
を出力する。励振ゲインｇakは、乗算器３０８に供給さ
れ、また、励振ゲインｇskは、乗算器３１０に供給され
る。On the other hand, the adaptive excitation codebook 305 converts the adaptive excitation code vector eai (i = 1 to n) into the multiplier 30.
8 Also, the stochastic excitation codebook 307
Supplies the stochastic excitation code vector esl (l = 1 to m) to the multiplier 310. Further , the VQ gain codebook 306 outputs a pair of excitation gains gak and gsk (k = 1 to p). The excitation gain gak is supplied to the multiplier 308.
The excitation gain gsk is supplied to the multiplier 310.

【００３１】これにより、乗算器３０８は、前記適応励
振コードベクトルｅaiと励振ゲインｇakを乗算してベク
トルｅaikを得て加算器３０９に供給し、乗算器３１０
は、前記確率的励振コードベクトルｅslと励振ゲインｇ
skを乗算してベクトルｅslkを得て加算器３０９に供給
する。[0031] Accordingly, the multiplier 308 obtains a vector eaik supplied to the adder 309 by multiplying the adaptive excitation code vector eai the excitation gain GAK, multiplier 310
Is the stochastic excitation code vector esl and the excitation gain g
The vector eslk is obtained by multiplying by sk and supplied to the adder 309.

【００３２】加算器３０９は、ベクトルｅaikとベクト
ルｅslkの成分単位の加算を行い、励振ベクトルｅを求
めて乗算器３１２に供給する。ゲイン制御回路３１１
は、過去の最適励振ベクトルより励振ゲインσを求めて
乗算器３１２に供給する。乗算器３１２は、前記励振ベ
クトルｅと励振ゲインσを乗算してベクトルｅｇを得て
合成フィルタ３１３に供給する。The adder 309 performs addition of the component unit vector eaik and vector Eslk, supplied to the multiplier 312 obtains an excitation vector e. Gain control circuit 311
Calculates the excitation gain σ from the past optimal excitation vector and supplies it to the multiplier 312. The multiplier 312 multiplies the excitation vector e by the excitation gain σ to obtain a vector eg and supplies the vector eg to the synthesis filter 313.

【００３３】合成フィルタ３１３は、供給される声道予
測パラメータａqjをフィルタの例えばタップ係数などと
してベクトルｅｇをフィルタリングして合成音声ベクト
ルＳｗを求めて減算器３１４に供給する。減算器３１４
は、原音声ベクトルＳと合成音声ベクトルＳｗの成分単
位の減算を行い、誤差ベクトルｅｒを求めて聴覚フィル
タ３１５に供給する。聴覚フィルタ３１５は、誤差ベク
トルｅｒに対するフィルタリングを行い、フィルタリン
グ後のベクトルｅｗを聴覚誤差計算回路３１６に供給す
る。The synthesis filter 313 filters the vector eg using the supplied vocal tract prediction parameter aqj as, for example, a tap coefficient of the filter, obtains a synthesized speech vector Sw, and supplies the resultant to the subtractor 314. Subtractor 314
Performs subtraction between the original speech vector S synthesized speech vector Sw component unit, and supplies the auditory filter 315 seeking error vector er. Auditory filter 315 performs filtering on the error vector er, filtering
The resulting vector ew is supplied to the hearing error calculation circuit 316.

【００３４】聴覚誤差計算回路３１６は、ベクトルｅｗ
の各成分の２乗平均を求めて、この値が最小になるｉ、
ｊ、ｋの組み合わせを、最適な適応励振コードベクトル
のインデックスＩａ、最適な確率的励振コードベクトル
のインデックスＩｓ、最適な励振ゲインコードのインデ
ックスＩｇとして、これらを多重化回路３０４に供給す
ると共に、適応励振コードインデックスＩａを適応励振
コードブック３０５に供給し、確率的励振コードインデ
ックスＩｓを確率的励振コードブック３０７に供給し、
励振ゲインコードインデックスＩｇをＶＱゲインコード
ブック３０６に供給する。[0034] Hearing error calculation circuit 316, a vector ew
The root mean square of each component of
The combination of j and k is converted to the optimal adaptive excitation code vector
Index Ia, the optimum stochastic excitation codevector
Index Is, as indenyl <br/> box Ig optimal excitation gain code, and supplies them to the multiplexing circuit 304 supplies the adaptive excitation code index Ia to the adaptive excitation codebook 305, stochastic excitation code index Supplying Is to the stochastic excitation codebook 307,
The excitation gain code index Ig is supplied to the VQ gain codebook 306.

【００３５】次に、適応励振コードブック３０５の更新
方法を説明する。適応励振コードブック３０５は、適応
励振コードインデックスＩａによって最適な適応励振コ
ードベクトルｅaoを出力して乗算器３０８に供給する。
また、確率的励振コードブック３０５は確率的励振コー
ドインデックスＩｓによって最適な確率的励振コードベ
クトルｅsoを出力して乗算器３１０に供給する。また、
ＶＱゲインコードブック３０６は、励振ゲインコードイ
ンデックスＩｇによって最適な励振ゲインｇao、ｇsoの
対を出力し、一方の励振ゲインｇaoは乗算器３０８に供
給され、他方の励振ゲインｇsoは乗算器３１０に供給さ
れる。 Next, the adaptive excitation codebook 305 is updated.
The method will be described. The adaptive excitation codebook 305 outputs an optimal adaptive excitation code vector eao according to the adaptive excitation code index Ia and supplies the output to the multiplier 308.
Further, the stochastic excitation codebook 305 outputs an optimal probabilistic excitation code vector eso according to the stochastic excitation code index Is and supplies it to the multiplier 310. Also,
The VQ gain codebook 306 outputs an optimum pair of the excitation gains gao and gso according to the excitation gain code index Ig. One excitation gain gao is supplied to the multiplier 308, and the other excitation gain gso is supplied to the multiplier 310. Sa
Re that.

【００３６】乗算器３０８は、最適な適応励振コードベ
クトルｅaoと最適な励振ゲインｇaoの乗算を行い、乗算
後のベクトルｅao・ｇaoを加算器３０９に供給する。ま
た、乗算器３１０は、最適な確率的励振コードベクトル
ｅsoと最適な励振ゲインｇsoの乗算を行い、乗算後のベ
クトルｅso・ｇsoを加算器３０９に供給する。加算器３
０９は、これらベクトルｅao・ｇao及びｅso・ｇsoの加
算を行い、この加算値ｅoptを適応励振コードブック３
０５に供給し、内容を更新させる。The multiplier 308 multiplies the optimal adaptive excitation code vector eao and the optimal excitation gain gao by multiplication.
The subsequent vector eao · gao is supplied to the adder 309. Also, the multiplier 310 performs multiplication of the optimum stochastic excitation code vector eso and optimal excitation gain gso, base after multiplication
The vector eso · gso is supplied to the adder 309. Adder 3
09 adds these vectors eao · gao and eso · gso, and adds the added value e opt to the adaptive excitation codebook 3.
It is supplied to the 05, Ru to update the content.

【００３７】なお、各コードブック３０５、３０６、３
０７による最適コードインデックスの探索順番は、いろ
いろな方法があるが特に限定するものではない。例え
ば、適応励振コードベクトルをｅa1とし、確率的励振コ
ードベクトルをｅs1とした場合において、励振ゲインを
（ｇa1、ｇsl)〜（ｇap、ｇsp）の順番で変えて探索
し、最適な適応励振コードベクトルと確率的励振コード
ベクトルと励振ゲインとを求めることもできる。[0037] In addition, each codebook 305,306,3
There are various methods for searching the optimum code index according to 07, but there is no particular limitation. For example, the ea1 the adaptive excitation code vector, in case of a es1 stochastic excitation code vector, searches by changing the excitation gain (ga1, gsl) ~ (gap , gsp) in the order of, optimal adaptive excitation code vector And the stochastic excitation code vector and the excitation gain.

【００３８】多重化回路３０４は、以上のようにして得
られたＬＳＰコードＩｗと、適応励振コードインデック
スＩａと、確率的励振コードインデックスＩｓと、励振
ゲインコードインデックスＩｇとを多重化してこれをト
ータルコードＣとして出力し、コード励振線形予測復号
化器に供給する。The multiplexing circuit 304 multiplexes the LSP code Iw, the adaptive excitation code index Ia, the stochastic excitation code index Is, and the excitation gain code index Ig obtained as described above, and sums them. Output as code C and supply to code-excited linear prediction decoder.

【００３９】図３は、この実施例のコード励振線形予測
復号化器の機能ブロック図を示している。FIG. 3 shows a functional block diagram of the code-excited linear prediction decoder of this embodiment.

【００４０】図３において、この実施例のコード励振線
形予測復号化器は、多重分離回路４０２と、適応励振コ
ードブック４０３と、ＶＱゲインコードブック４０４
と、確率的励振コードブック４０５と、乗算器４０６
と、乗算器４０７と、加算器４０８と、ゲイン制御回路
４０９と、乗算器４１０と、ＬＳＰ逆量子化器４１１
と、合成フィルタ４１２とで構成されている。[0040] In FIG. 3, the code excited linear predictive decoder of this embodiment includes a demultiplexing circuit 402, an adaptive excitation codebook 403, VQ gain codebook 404
, A stochastic excitation codebook 405 and a multiplier 406
, A multiplier 407, an adder 408, a gain control circuit 409, a multiplier 410, and an LSP dequantizer 411.
And a synthesis filter 412.

【００４１】次に、図３に示すコード励振線形予測復号
化器の動作を説明する。Next , the code-excited linear predictive decoding shown in FIG.
Illustrating an operation of the encoder.

【００４２】入力されたトータルコードＣは、多重分離
回路４０２において、各コードに分離され、ＬＳＰコー
ドＩｗはＬＳＰ逆量子化器４１１に供給され、適応励振
コードインデックスＩａは適応励振コードブック４０３
に供給され、確率的励振コードインデックスＩｓは確率
的励振コードブック４０５に供給され、励振ゲインコー
ドインデックスＩｇはＶＱゲインコードブック４０４に
供給される。The total code C that is input, in the demultiplexer 402 is separated into each code, LSP code Iw is supplied to the LSP dequantizer 411, the adaptive excitation code index Ia adaptive excitation codebook 403
, And the probabilistic excitation code index Is is supplied to the probabilistic excitation codebook 405, and the excitation gain code index Ig is supplied to the VQ gain codebook 404.

【００４３】ＬＳＰ逆量子化器４１１は、ＬＳＰコード
Ｉｗより、ＬＳＰパラメータを逆量子化し、更に補間を
行った後に声道予測パラメータａｊに変換して、合成フ
ィルタ４１２に供給する。The LSP inverse quantizer 411 inversely quantizes the LSP parameters from the LSP code Iw , further performs interpolation, converts the LSP parameters into vocal tract prediction parameters aj, and supplies the parameters to the synthesis filter 412.

【００４４】適応励振コードブック４０３は、適応励振
コードインデックスＩａに対応する適応励振コードベク
トルｅａを乗算器４０６に供給する。また、確率的励振
コードブック４０５は、確率的励振コードインデックス
Ｉｓに対応する確率的励振コードベクトルｅｓを乗算器
４０７に供給する。ＶＱゲインコードブック４０４は、
励振ゲインコードインデックスＩｇに対応する励振ゲイ
ンｇａ、ｇｓを出力し、励振ゲインｇａは、乗算器４０
６に供給され、励振ゲインｇｓは、乗算器４０７に供給
される。The adaptive excitation codebook 403 supplies the adaptive excitation code vector ea corresponding to the adaptive excitation code index Ia to the multiplier 406. Further, the stochastic excitation codebook 405 supplies the stochastic excitation code vector es corresponding to the stochastic excitation code index Is to the multiplier 407. The VQ gain codebook 404 is
Excitation gains ga and gs corresponding to the excitation gain code index Ig are output.
6 and the excitation gain gs is supplied to the multiplier 407.
Ru is.

【００４５】乗算器４０６は、適応励振コードベクトル
ｅａと励振ゲインｇａを乗算して得られたベクトルｅag
を加算器４０８に供給する。また、乗算器４０７は、確
率的励振コードベクトルｅｓと励振ゲインｇｓを乗算し
て得られたベクトルｅsgを加算器４０８に供給する。加
算器４０８は、ベクトルｅagとベクトルｅsgを加算し、
得られたベクトルｅを乗算器４１０に供給する。また、
このベクトルｅは、適応励振コードブック４０３にも供
給されて、このベクトルｅによって内容が更新される。The multiplier 406 calculates a vector eag obtained by multiplying the adaptive excitation code vector ea by the excitation gain ga.
Is supplied to the adder 408. Further, the multiplier 407 supplies a vector esg obtained by multiplying the stochastic excitation code vector es and the excitation gain gs to the adder 408. The adder 408 adds the vector eag and the vector esg ,
The obtained vector e is supplied to the multiplier 410. Also,
The vector e is also supplied to the adaptive excitation codebook 403, and the content is updated by the vector e.

【００４６】ゲイン制御回路４０９は、過去の最適励振
ベクトルより励振ゲインσを求めて乗算器４１０に供給
する。乗算器４１０は、前記励振ベクトルｅと励振ゲイ
ンσを乗算してベクトルｅｇを得て合成フィルタ４１２
に供給する。The gain control circuit 409 is used to determine the past optimum excitation
The excitation gain σ is obtained from the vector and supplied to the multiplier 410. A multiplier 410 multiplies the excitation vector e by an excitation gain σ to obtain a vector eg, and obtains a synthesis filter 412
To supply.

【００４７】合成フィルタ４１２は、供給される声道予
測パラメータａｊをフィルタの例えばタップ係数などと
してベクトルｅｇをフィルタリングして合成音声ベクト
ルＳを求めて出力する。The synthesis filter 412 filters the vector eg using the supplied vocal tract prediction parameter aj as, for example, a tap coefficient of the filter, and obtains and outputs a synthesized speech vector S.

【００４８】上述した実施例によれば、符号化器側にお
いて、ＬＳＰ分析によるＬＳＰパラメータのコードＩｗ
を伝送し、復号化器側では、このＬＳＰコードＩｗをＬ
ＳＰ逆量子化して声道予測パラメータａｊを求めて、合
成フィルタで合成音声を再生しているので、ＬＳＰパラ
メータの量子化ビット数を少なくしても従来のＰＡＲＣ
ＯＲ方式による分析に比べ、再生した合成音声のＳ／Ｎ
を向上させることができる。According to the above-described embodiment, the encoder side
And LSP parameter code Iw by LSP analysis
And the LSP code Iw is
Since the vocal tract prediction parameter aj is obtained by SP inverse quantization and the synthesized speech is reproduced by the synthesis filter, the conventional PARC even if the number of quantization bits of the LSP parameter is reduced.
Compared with the analysis by the OR method, the S / N of the reproduced synthesized speech
Can be improved.

【００４９】また、ＶＱゲインコードブックで励振ゲイ
ンコードインデックスＩｇによって最適な励振ゲインｇ
ao、ｇsoの対を出力し、適応励振コードベクトルに最適
な励振ゲインｇaoを乗算し、また、確率的励振コードベ
クトルに最適な励振ゲインｇsoを乗算して符号化及び復
号化しているので、伝送するのは最適な励振ゲインコー
ドインデックスＩｇだけでよい。従来は２系統のゲイン
コードインデックスを伝送していた。従って、低い伝送
ビットレートであっても、十分な音声品質を得ることが
できる。[0049] In addition, the optimal excitation gain by the excitation gain code index Ig in the VQ gain codebook g
Since a pair of ao and gso is output, the adaptive excitation code vector is multiplied by the optimal excitation gain gao, and the stochastic excitation code vector is multiplied by the optimal excitation gain gso to encode and decode. Only the optimum excitation gain code index Ig needs to be performed. Conventionally, two systems of gain code indexes have been transmitted. Therefore, sufficient voice quality can be obtained even at a low transmission bit rate.

【００５０】上記実施例においては、フォワード型のコ
ード励振線形予測符号化器及び復号化器について説明し
たが、この構成に限るものではなく、例えば、バックワ
ード型のコード励振線形予測符号化器及び復号化器に対
しても、本発明を適用することができる。In the above embodiment, the forward type code-excited linear predictive encoder and decoder have been described. However, the present invention is not limited to this configuration. The present invention can be applied to a decoder.

【００５１】また、ＶＱゲインコードブックは２種類の
励振ゲインを一対として格納していたが、これに限るも
のではなく、ゲイン制御する励振コードブックの種類数
に応じて複数個を一組として格納する構成であってもよ
い。The VQ gain codebook stores two types of excitation gains as a pair. However, the present invention is not limited to this, and a plurality of excitation gains are stored as a set according to the number of types of excitation codebooks to be controlled. The configuration may be as follows.

【００５２】[0052]

【発明の効果】以上述べたようにこの発明によれば、線
スペクトル対分析手段と、ゲイン制御コードブックと、
ゲイン制御手段とを備えて、最適なゲイン制御情報組に
おける複数個のゲイン制御情報によって複数種類の励振
源情報に対してゲイン制御を行い、ゲイン制御された複
数種類の励振源情報を加算し、加算励振源情報に対して
もゲイン制御を行い、ゲイン制御された加算励振源情報
と線スペクトル対分析情報とを用いて、音声信号の符号
化及び復号化を行っているので、低い伝送ビットレート
であっても、十分な合成音声品質を得ることができる。As described above, according to the present invention, a line spectrum pair analysis means, a gain control codebook ,
With gain control means to optimize the gain control information set
Excitation by multiple gain control information
Gain control is performed on the source information, and the gain-controlled
Add several types of excitation source information, and add
Also performs gain control, and performs encoding and decoding of the audio signal using the gain-controlled addition excitation source information and the line spectrum pair analysis information. Synthesized speech quality can be obtained.

[Brief description of the drawings]

【図１】実施例に係るコード励振線形予測符号化器の機
能ブロック図である。1 is a functional block diagram of a code excited linear prediction encoder according to actual施例.

【図２】従来例に係るコード励振線形予測符号化器の機
能ブロック図である。FIG. 2 is a functional block diagram of a code excitation linear prediction encoder according to a conventional example.

【図３】実施例に係るコード励振線形予測復号化器の機
能ブロック図である。3 is a functional block diagram of a code excited linear predictive decoding apparatus according to the actual施例.

[Explanation of symbols]

３０１…声道分析回路、３０２…ＬＳＰ量子化器、３０
３、４１１…ＬＳＰ逆量子化器、３０４…多重化回路、
３０５…適応励振コードブック、３０６…ＶＱゲインコ
ードブック、３０７…確率的励振コードブック、３０
８、３１０、３１２、４０６、４０７、４１０…乗算
器、３０９、４０８…加算器、３１１、４０９…ゲイン
制御回路、３１３、４１２…合成フィルタ、３１４…減
算器、３１５…聴覚フィルタ、３１６…聴覚誤差計算回
路、４０２…多重分離回路。301 vocal tract analysis circuit, 302 LSP quantizer, 30
3, 411 ... LSP inverse quantizer, 304 ... multiplexing circuit,
305: adaptive excitation codebook, 306: VQ gain codebook, 307: stochastic excitation codebook, 30
8, 310, 312, 406, 407, 410 multiplier, 309, 408 adder, 311, 409 gain control circuit, 313, 412 synthesis filter, 314 subtractor, 315 audio filter, 316 audio Error calculation circuit, 402: demultiplexing circuit.

フロントページの続き (72)発明者細田賢一郎東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 (56)参考文献特開平３−243998（ＪＰ，Ａ) 特開平４−73699（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/12 H03M 7/30 ＪＩＣＳＴファイル（ＪＯＩＳ)Continuation of front page (72) Inventor Kenichiro Hosoda 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd. (56) References JP-A-3-243998 (JP, A) JP-A-4- 73699 (JP, A) (58) Field surveyed (Int. Cl. ⁷ , DB name) G10L 19/12 H03M 7/30 JICST file (JOIS)

Claims

(57) [Claims]

1. A plurality of types of code excited linear predictive coding method using excitation source information, performs a line spectrum pair analysis for the audio signal, to generate a line spectrum pair analysis information utilized to produce synthesized speech, the Code corresponding to line spectrum vs. analysis information
A plurality of sets of line spectrum pair analysis means, which is one element of the multiplexed coded audio signal, and a plurality of sets of gain control information including a plurality of pieces of gain control information for each of the plurality of types of excitation source information are stored.
And the index that defines one of the pairs is
And gain control codebook that is the one component of the multiplexed encoded audio signal, the output from the gain control codebook, respectively
Of multiple types of excitation source after gain control
The gain for the added excitation source information
The above-mentioned additional excitation source information is formed from
And a gain control means for down control, the optimal gain control information by a plurality of gain control information in the set performs gain control on said plurality of types of excitation source information, a plurality of types of excitation source information gain control Add
Gain control for the added excitation source information,
A code excitation linear predictive encoding method for encoding and decoding a speech signal using the in-controlled addition excitation source information and the line spectrum pair analysis information.