JPH05165497A

JPH05165497A - C0de exciting linear predictive enc0der and decoder

Info

Publication number: JPH05165497A
Application number: JP3327443A
Authority: JP
Inventors: Kenichiro Hosoda; 賢一郎細田; Hiroshi Katsuragawa; 浩桂川; Hiromi Aoyanagi; 弘美青柳; Yoshihiro Ariyama; 義博有山
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1991-12-11
Filing date: 1991-12-11
Publication date: 1993-07-02
Anticipated expiration: 2014-10-25
Also published as: JP2968109B2

Abstract

PURPOSE:To provide the code exciting linear predictive encoder and decoder which can reproduce voice signals containing pulse signals with high quality, can easily deal with low encoding speed and can improve the quality of repro duced voices. CONSTITUTION:A pulse code book 22 storing pulse codes composed of isolate impulse is provided and used alternatively with a statistic code book 21. An LSP analysis part 11 obtains an LSP parameter from the input voice signal, and this LSP parameter is coded and transmitted by an LSP parameter coding part 12. A frequency characteristic operation part 28 operates the frequency characteristic of a statistic code or the pulse code so as to make it close to the frequency characteristic of the input voice signal.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、コード励振線形予測符
号化方式（ＣＥＬＰ）に従う符号化器及び復号化器に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a code-excited linear predictive coding (CELP) coder / decoder.

【０００２】[0002]

【従来の技術】従来、デジタル移動通信分野における音
声信号（音響信号を含む）の高能率符号化方式として、
コード励振線形予測符号化方式やこのコード励振線形予
測符号化方式の変形であるベクトル加算励振線形予測符
号化方式（ＶＳＥＬＰ）が採用されてきた。2. Description of the Related Art Conventionally, as a high-efficiency coding system for voice signals (including acoustic signals) in the field of digital mobile communication,
A code-excited linear predictive coding system and a vector addition-excited linear predictive coding system (VSELP), which is a modification of the code-excited linear predictive coding system, have been adopted.

【０００３】音声信号に対する符号化方式の基本的構成
は、音声の声道特性を表現する声道パラメータ、音源情
報を表現する音源パラメータを求めることにある。最近
のコード励振線形予測符号化方式では、音源情報として
の励振信号を統計的に周期性の強い有声音に寄与する適
応コードと統計的に周期性の弱いランダム的な無声音に
寄与する統計コードとでコード化して、それぞれコード
ブックに格納しておき、入力音声信号と合成音声信号と
の重付け誤差電力和が最小となるように各コードブック
内の最適な適応コード及び統計コードを見付け出すこと
で符号化処理を行なっている。そして、入力音声信号か
ら声道パラメータを得るフォワード型の符号化方式であ
れ、合成音声信号から声道パラメータを得るバックワー
ド型の符号化方式であれ、少なくとも音源パラメータ、
従って最適な適応コード及び統計コードの情報を伝送す
る。The basic structure of a coding system for a voice signal is to obtain a vocal tract parameter expressing a vocal tract characteristic of a voice and a sound source parameter expressing a source information. In the recent code-excited linear predictive coding method, the excitation signal as the sound source information is composed of an adaptive code that contributes to a voiced sound with statistically strong periodicity and a statistical code that contributes to a random unvoiced sound with statistically weak periodicity. Find the optimum adaptive code and statistical code in each codebook so that the sum of powers of the weighted errors between the input speech signal and the synthesized speech signal is minimized. The encoding process is performed in. At least a sound source parameter, whether it is a forward-type coding method that obtains a vocal tract parameter from an input speech signal or a backward-type coding method that obtains a vocal tract parameter from a synthesized speech signal,
Therefore, information of the optimum adaptive code and statistical code is transmitted.

【０００４】このようなコード励振線形予測符号化方式
では、６kbit/s〜８kbit/sの符号化速度において高品質
な再生音声が得られることが知られている。It is known that such a code-excited linear predictive coding system can provide high-quality reproduced speech at a coding speed of 6 kbit / s to 8 kbit / s.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、通信シ
ステムの中には、より低い符号化速度を求めるものがあ
る。例えば、４kbit/s以下の符号化速度を求めるものが
ある。このような低速度の場合、声道パラメータ及び音
源パラメータを共に伝送するフォワード型であろうと、
音源パラメータを伝送するバックワード型であろうと、
音源パラメータに割当てられる符号化ビット数は当然に
少なくなり、適応コードブック及び統計コードブックに
格納されている適応コード及び統計コードの数も少なく
なる。その結果、このような低符号化速度においては再
生音声の品質が低下する。However, some communication systems require a lower coding rate. For example, there is one that obtains a coding rate of 4 kbit / s or less. In the case of such a low speed, whether it is a forward type that transmits both vocal tract parameters and sound source parameters,
Whether it is a backward type that transmits sound source parameters,
Naturally, the number of coded bits assigned to the excitation parameter is small, and the number of adaptive codes and statistical codes stored in the adaptive codebook and the statistical codebook is also small. As a result, the quality of reproduced voice is deteriorated at such a low encoding speed.

【０００６】また、適応コードブックは、最適な適応コ
ード及び統計コードの合成コードによって適応的に更新
されるものであるので、適応コードは統計コードに基づ
いて形成されるものであるということができる。そのた
め、周期性の強い有声音の立ち上がりが遅く、また、有
声音の定常部においても明確なパルス性の強いコードを
形成できず、再生音声が明瞭性に欠けるという欠点を有
する。Further, since the adaptive codebook is adaptively updated by the combined code of the optimal adaptive code and the statistical code, it can be said that the adaptive code is formed based on the statistical code. .. As a result, the voiced sound having a strong periodicity rises slowly, and a clear code having a strong pulse cannot be formed even in the stationary part of the voiced sound, so that the reproduced voice lacks intelligibility.

【０００７】本発明は、以上の点を考慮してなされたも
のであり、パルス性の強い雑音成分が入力音声信号に含
まれている場合にも、高品質の再生音声を得ることがで
きるコード励振線形予測符号化器及び復号化器を提供し
ようとするものである。The present invention has been made in consideration of the above points, and a code capable of obtaining a reproduced voice of high quality even when a noise component having a strong pulse characteristic is included in the input voice signal. The present invention seeks to provide an excited linear predictive encoder and decoder.

【０００８】また、本発明は、以上の点を考慮してなさ
れたものであり、低符号化速度の場合であっても、主に
音源パラメータ面から再生音声の品質を高めることがで
きるコード励振線形予測符号化器及び復号化器を提供し
ようとするものである。Further, the present invention has been made in consideration of the above points, and even in the case of a low coding speed, the code excitation which can improve the quality of reproduced voice mainly from the sound source parameter side. The present invention seeks to provide a linear predictive encoder and decoder.

【０００９】[0009]

【課題を解決するための手段】】かかる課題を解決する
ため、請求項１の本発明では、励振コードブックとして
適応コードブック及び統計コードブックを備えるコード
励振線形予測符号化器において、適応コードブック及び
統計コードブックに加えて、孤立インパルスでなるパル
ス性コードを格納しているパルス性コードブックを設け
た。In order to solve the above problems, according to the present invention of claim 1, an adaptive codebook is provided in a code excitation linear predictive encoder including an adaptive codebook and a statistical codebook as an excitation codebook. In addition to the statistical codebook, a pulsed codebook storing pulsed codes consisting of isolated impulses is provided.

【００１０】また、請求項２の本発明のコード励振線形
予測符号化器では、統計コードブック又はこのように設
けられたパルス性コードブックからの励振コードを選択
して使用し、その選択情報をコード励振線形予測復号化
器に送出することとした。In the code-excited linear predictive encoder of the present invention as defined in claim 2, the excitation code from the statistical codebook or the pulse-like codebook provided in this way is selected and used, and the selection information is selected. It was decided to send it to the code-excited linear predictive decoder.

【００１１】請求項３の本発明のコード励振線形予測符
号化器は、パルス性コードブックを備えると共に、コー
ド励振線形予測復号化器に出力する声道パラメータを線
スペクトル対パラメータとした。A code-excited linear predictive encoder according to a third aspect of the present invention includes a pulse codebook, and the vocal tract parameters output to the code-excited linear predictive decoder are line spectrum pair parameters.

【００１２】請求項４の本発明のコード励振線形予測符
号化器は、統計コードブック及び又はパルス性コードブ
ックから出力された固定コードの周波数特性を、入力音
声信号の周波数特性に近付ける周波数特性操作部を備え
る。According to a fourth aspect of the code-excited linear predictive encoder of the present invention, the frequency characteristic operation for bringing the frequency characteristic of the fixed code outputted from the statistical codebook and / or the pulse codebook close to the frequency characteristic of the input speech signal. Section.

【００１３】請求項５の本発明のコード励振線形予測復
号化器は、請求項１の符号化器に対応するものであり、
適応コードブック及び統計コードブックに加えて、孤立
インパルスでなるパルス性コードを格納しているパルス
性コードブックを備える。A code-excited linear predictive decoder according to the present invention of claim 5 corresponds to the encoder of claim 1,
In addition to the adaptive codebook and the statistical codebook, a pulsed codebook storing pulsed codes composed of isolated impulses is provided.

【００１４】請求項６の本発明のコード励振線形予測復
号化器は、統計コードブック及び上述したパルス性コー
ドブックからの励振コードを、対応する請求項２の本発
明のコード励振線形予測符号化器から与えられる選択情
報に基づいて、選択して使用することを特徴とする。The code-excited linear predictive decoder according to the present invention of claim 6 uses the excitation code from the statistical codebook and the above-mentioned pulse-like codebook to correspond to the code-excited linear predictive coding according to the present invention of claim 2. It is characterized in that it is selected and used based on the selection information given from the container.

【００１５】請求項７の本発明のコード励振線形予測復
号化器は、対応するコード励振線形予測符号化器から与
えられる声道パラメータが線スペクトル対パラメータで
あり、これを音声再生に利用することを特徴とする。In the code-excited linear predictive decoder according to the present invention as defined in claim 7, the vocal tract parameter given from the corresponding code-excited linear predictive encoder is a line spectrum pair parameter, which is utilized for speech reproduction. Is characterized by.

【００１６】請求項８の本発明のコード励振線形予測復
号化器は、統計コードブック及び又はパルス性コードブ
ックから出力された固定コードの周波数特性を、対応す
る請求項４の本発明のコード励振線形予測符号化器から
与えられた情報に基づいて操作する周波数特性操作部を
有することを特徴とする。The code-excited linear predictive decoder according to the present invention of claim 8 corresponds to the frequency characteristic of the fixed code outputted from the statistical codebook and / or the pulse-like codebook, and corresponds to the code excitation of the present invention according to claim 4. It is characterized in that it has a frequency characteristic operation section that operates based on information given from the linear predictive encoder.

【００１７】[0017]

【作用】請求項１の本発明のコード励振線形予測符号化
器及び請求項５の本発明のコード励振線形予測復号化器
は、周期性の強い有声音の立ち上がりを早くでき、ま
た、有声音の定常部においても明確なパルス性の強い励
振信号を形成できるように、適応コードブック及び統計
コードブックに加えて、孤立インパルスでなるパルス性
コードを格納しているパルス性コードブックを設けたも
のである。According to the code-excited linear predictive encoder of the present invention as defined in claim 1 and the code-excited linear predictive decoder as claimed in claim 5 of the present invention, it is possible to speed up the rise of a voiced sound having a strong periodicity, and In addition to the adaptive codebook and statistical codebook, a pulsed codebook that stores pulsed codes consisting of isolated impulses is provided so that a strong excitation signal with a clear pulsed characteristic can be formed even in the stationary part of Is.

【００１８】請求項２の本発明のコード励振線形予測符
号化器は、パルス性コードブックを設けた場合であって
も、低符号化速度を実現できるように、統計コードブッ
ク又はパルス性コードブックからの励振コードを選択し
て使用することとした。従って、この符号化器に対応す
る請求項６の本発明のコード励振線形予測復号化器も、
統計コードブック又はパルス性コードブックからの励振
コードを選択して使用する。この際に必要となる選択情
報は、符号化器から与えられる。According to the code-excited linear predictive encoder of the present invention as defined in claim 2, a statistical codebook or a pulse codebook is provided so that a low coding rate can be realized even when a pulse codebook is provided. It was decided to use the excitation code from. Therefore, the code-excited linear predictive decoder according to the present invention of claim 6 corresponding to this encoder also
The excitation code from the statistical codebook or the pulse codebook is selected and used. Selection information required at this time is given from the encoder.

【００１９】請求項３の本発明のコード励振線形予測符
号化器は、パルス性コードブックを備えて低符号化速度
での再生品質の向上を音源パラメータ面から期したもの
に関し、低符号化速度での再生品質の向上を声道パラメ
ータ面からも期して、コード励振線形予測復号化器に与
える声道パラメータを線スペクトル対パラメータとし
た。従って、この符号化器に対応する請求項７の本発明
のコード励振線形予測復号化器も、この線スペクトル対
パラメータを音声再生に利用する。A code-excited linear predictive encoder according to a third aspect of the present invention relates to a code-excited linear predictive encoder provided with a pulse codebook for improving reproduction quality at a low coding rate from the viewpoint of excitation parameter. In consideration of the vocal tract parameter in terms of the improvement of the playback quality in, the vocal tract parameter given to the code-excited linear predictive decoder is defined as the line spectrum pair parameter. Therefore, the code-excited linear predictive decoder according to the present invention of claim 7 corresponding to this encoder also utilizes this line spectrum pair parameter for speech reproduction.

【００２０】なお、請求項３及び７は、いわゆるフォワ
ード型のコード励振線形予測符号化器及び復号化器に限
定される。The claims 3 and 7 are limited to a so-called forward type code excitation linear predictive encoder and decoder.

【００２１】請求項４の本発明のコード励振線形予測符
号化器において、周波数特性操作部は、統計コードブッ
ク及び又はパルス性コードブックから出力された固定コ
ードの周波数特性を、入力音声信号の周波数特性に近付
ける操作を行なう。このようにしたのは、従来、励振信
号の周波数特性は理論的に白色としてモデル化されてき
たが、実際には白色的でなく、入力音声信号の周波数特
性に近い特性を有していることが実験的に確認されてお
り、統計コードやパルス性コードの周波数特性を、入力
音声信号の周波数特性に近付れば、それだけ高品質な合
成音声信号を得ることができるためであり、また、励振
信号の有効な周波数成分は量子化誤差信号よりかなり大
きくなって量子化誤差信号のマスキング効果が得られる
ためである。従って、この符号化器に対応する請求項８
の本発明のコード励振線形予測復号化器も、同様な処理
を行なう周波数特性操作部を有する。In the code-excited linear predictive encoder of the present invention as defined in claim 4, the frequency characteristic manipulating section determines the frequency characteristic of the fixed code outputted from the statistical codebook and / or the pulse codebook from the frequency of the input speech signal. Perform an operation that approaches the characteristics. This is because the frequency characteristic of the excitation signal has been theoretically modeled as white in the past, but in reality it is not white and has a characteristic close to the frequency characteristic of the input audio signal. Is experimentally confirmed, and if the frequency characteristics of the statistical code and the pulse-like code are brought close to the frequency characteristics of the input speech signal, it is possible to obtain a high quality synthesized speech signal. This is because the effective frequency component of the excitation signal is considerably larger than the quantization error signal and the masking effect of the quantization error signal is obtained. Therefore, claim 8 corresponding to this encoder
The code-excited linear predictive decoder of the present invention also has a frequency characteristic operation unit for performing similar processing.

【００２２】[0022]

【Example】

（Ａ）コード励振線形予測符号化器の一実施例以下、本発明によるコード励振線形予測符号化器の一実
施例を図面を参照しながら詳述する。ここで、図１がこ
の実施例の構成を示すブロック図である。(A) One Embodiment of Code Excited Linear Predictive Encoder One embodiment of the code excited linear predictive encoder according to the present invention will be described in detail below with reference to the drawings. Here, FIG. 1 is a block diagram showing the configuration of this embodiment.

【００２３】図１において、この実施例のコード励振線
形予測符号化器は、大きくは、入力音声処理部１と最適
合成音声探索部２と多重化部３とから構成されている。In FIG. 1, the code-excited linear predictive encoder of this embodiment is mainly composed of an input speech processing unit 1, an optimum synthesized speech searching unit 2 and a multiplexing unit 3.

【００２４】入力音声処理部１は、ＬＳＰ（線スペクト
ル対パラメータ）分析部１１、ＬＳＰパラメータ符号化
部１２、ＬＳＰパラメータ復号化部１３、ＬＰＣ（線形
予測係数）係数変換部１４、重付けフィルタ１５、合成
フィルタ零入力応答生成部１６、重付けフィルタ零入力
応答生成部１７及び２個の減算器１８、１９から構成さ
れており、入力音声信号が与えられたときに、復号化器
に伝送する声道パラメータを得ると共に、局部再生で形
成される合成音声信号の目標音声信号を形成するもので
ある。The input speech processing unit 1 includes an LSP (line spectrum pair parameter) analysis unit 11, an LSP parameter coding unit 12, an LSP parameter decoding unit 13, an LPC (linear prediction coefficient) coefficient conversion unit 14, and a weighting filter 15. , A synthesis filter zero-input response generation unit 16, a weighting filter zero-input response generation unit 17, and two subtractors 18 and 19, and transmits them to a decoder when an input voice signal is given. The vocal tract parameters are obtained and the target voice signal of the synthesized voice signal formed by local reproduction is formed.

【００２５】この実施例の場合、デジタル化された離散
的な入力音声信号系列は、声道パラメータを求めるため
の分析フレーム長に対応する時間だけ蓄積され、さら
に、この分析フレーム長は数個のサブフレームに分割さ
れて入力音声処理部１で処理される。In the case of this embodiment, the digitized discrete input speech signal sequence is accumulated for a time corresponding to the analysis frame length for obtaining the vocal tract parameter, and further, this analysis frame length is several. It is divided into subframes and processed by the input voice processing unit 1.

【００２６】入力音声信号はＬＳＰ分析部１１に与えら
れ、このＬＳＰ分析部１１によってＬＳＰ分析されて声
道パラメータとしてのＬＳＰパラメータに変換される。
このＬＳＰパラメータはＬＳＰパラメータ符号化部１２
によって符号化（例えばベクトル量子化）されて多重化
部３に与えられてコード励振線形予測復号化器側に伝送
される。また、符号化されたＬＳＰパラメータは、ＬＳ
Ｐパラメータ復号化部１３によって復号化（ベクトル逆
量子化）された後、ＬＰＣ係数変換部１４によってＬＰ
Ｃ係数に変換される。このように変換されたＬＰＣ係数
が、重付けフィルタ１５、合成フィルタ零入力応答生成
部１６、重付けフィルタ零入力応答生成部１７及び後述
する重付き合成フィルタ２９のタップ係数として使用さ
れる。また、後述する周波数特性操作部２８に与えられ
る。なお、ＬＳＰ分析部１１から出力されたＬＳＰパラ
メータを直接ＬＰＣ係数に変換するのではなく、符号、
復号処理を施したＬＳＰパラメータをＬＰＣ係数に変換
するのは、復号化器が利用するＬＰＣ係数と同様なＬＰ
Ｃ係数を局部再生で利用して音源パラメータを適切に決
定できるようにするためである。The input voice signal is given to the LSP analysis section 11, and is subjected to LSP analysis by this LSP analysis section 11 and converted into an LSP parameter as a vocal tract parameter.
This LSP parameter is the LSP parameter encoding unit 12
Is encoded (for example, vector quantization), given to the multiplexing unit 3, and transmitted to the code excitation linear prediction decoder side. Also, the encoded LSP parameters are LS
After being decoded (vector inverse quantization) by the P parameter decoding unit 13, the LPC coefficient conversion unit 14 outputs LP.
Converted to C coefficient. The LPC coefficients converted in this way are used as tap coefficients for the weighting filter 15, the synthesis filter zero-input response generation unit 16, the weighting filter zero-input response generation unit 17, and the weighted synthesis filter 29 described later. It is also given to the frequency characteristic operation unit 28 described later. In addition, the LSP parameter output from the LSP analysis unit 11 is not directly converted into an LPC coefficient, but a code,
The LSP parameters that have been subjected to the decoding process are converted into LPC coefficients by the same LP as the LPC coefficients used by the decoder.
This is because the sound source parameter can be appropriately determined by using the C coefficient in local reproduction.

【００２７】ここで、伝送に供する声道パラメータとし
てＬＳＰパラメータを用いるようにしたのは、声道の周
波数特性に対する補間特性が良くなること、ＬＳＰパラ
メータは少ない符号化ビット数で符号化してもＬＰＣパ
ラメータ等より声道スペクトルに与える歪みが小さいこ
と、ベクトル量子化法との組み合わせによって効率の良
い符号化ができることによる。Here, the LSP parameter is used as the vocal tract parameter for transmission because the interpolation characteristic with respect to the frequency characteristic of the vocal tract is improved, and the LSP parameter is LPC even if it is encoded with a small number of encoding bits. This is because the distortion given to the vocal tract spectrum is smaller than the parameters, etc., and efficient coding can be performed in combination with the vector quantization method.

【００２８】次に、入力音声信号から、局部再生される
合成音声信号に対する目標音声信号を形成する動作を説
明する。Next, the operation of forming the target voice signal for the locally reproduced synthesized voice signal from the input voice signal will be described.

【００２９】上述した入力音声信号は重付けフィルタ１
５に与えられ、人間の聴覚特性が考慮された重付けが施
された後、減算器１８に被減算入力として与えられる。
この減算器１８には、また、合成フィルタ零入力応答生
成部１６がＬＰＣ係数をタップ係数として用いて生成し
た、重み付き合成フィルタ２９に関する零入力応答信号
が減算入力として与えられる。かくして、直前の分析フ
レームにおける重み付き合成フィルタ２９の状態の影響
が除去された音声信号が得られ、これが減算器１９に被
減算入力として与えられる。この減算器１９には、ま
た、重付けフィルタ零入力応答生成部１７がＬＰＣ係数
をタップ係数として用いて生成した、重付けフィルタ１
５に関する零入力応答信号が減算入力として与えられ
る。かくして、直前の分析フレームにおける重付けフィ
ルタ１５の状態の影響が除去された音声信号が得られ、
これが目標音声信号として後述する減算器３０に与えら
れる。The above-mentioned input audio signal is a weighting filter 1
5 and weighted in consideration of human auditory characteristics, and then given to the subtractor 18 as a subtracted input.
The subtracter 18 is also supplied with a zero input response signal regarding the weighted synthesis filter 29 generated by the synthesis filter zero input response generation unit 16 using the LPC coefficient as a tap coefficient, as a subtraction input. Thus, an audio signal in which the influence of the state of the weighted synthesis filter 29 in the immediately preceding analysis frame is removed is obtained, and this audio signal is given to the subtractor 19 as a subtracted input. The weighting filter 1 generated by the weighting filter zero input response generation unit 17 using the LPC coefficient as the tap coefficient is also included in the subtractor 19.
The quiescent response signal for 5 is provided as the subtraction input. Thus, an audio signal in which the influence of the state of the weighting filter 15 in the immediately preceding analysis frame is removed is obtained,
This is given to the subtractor 30 which will be described later as a target voice signal.

【００３０】最適合成音声探索部２は、局部再生による
合成音声信号が最も目標音声信号に類似する音源パラメ
ータを探索するものであり、適応コードブック２０、統
計コードブック２１、パルス性コードブック２２、利得
コードブック２３、利得制御器２４、２７、加算器２
５、固定コード選択スイッチ２６、周波数特性操作部２
８、重み付き合成フィルタ２９、減算器３０、誤差電力
和計算部３１及び最小誤差電力和コード選択部３２から
構成されている。The optimum synthesized speech search unit 2 searches for a sound source parameter whose synthesized speech signal produced by local reproduction is most similar to the target speech signal. The adaptive codebook 20, statistical codebook 21, pulse codebook 22, Gain codebook 23, gain controllers 24 and 27, adder 2
5, fixed code selection switch 26, frequency characteristic operation unit 2
8, a weighted synthesis filter 29, a subtractor 30, an error power sum calculation unit 31, and a minimum error power sum code selection unit 32.

【００３１】適応コードブック２０、統計コードブック
２１及びパルス性コードブック２２はそれぞれ、励振信
号に係る波形コードである適応コード、統計コード、パ
ルス性コードを格納しているものであり、利得コードブ
ック２３は適応コード及び固定コード（統計コード及び
パルス性コードをまとめてこのように呼ぶ）に関する利
得コードを格納しているものである。The adaptive codebook 20, the statistical codebook 21, and the pulse codebook 22 store an adaptive code, a statistical code, and a pulse code, which are waveform codes related to the excitation signal, respectively. Reference numeral 23 stores a gain code relating to the adaptive code and the fixed code (the statistical code and the pulse characteristic code are collectively referred to as such).

【００３２】適応コード及び統計コードはそれぞれ、従
来と同様に、統計的に周期性の強い有声音に寄与する波
形コード、統計的に周期性の弱いランダム的な無声音に
寄与する波形コードである。なお、適応コードブック２
０の適応コードは後述するように適応的に更新される。
パルス性コードは、孤立インパルスよりなる波形コード
である。パルス性コードは、周期性の強い有声音の立ち
上がりや、パルス性が明確な有声音の定常部分に寄与す
ることを考慮したものである。利得コードは、例えばベ
クトル量子化されており、ベクトルの一成分が適応コー
ドの利得に関し、他成分が固定コードの利得に関するも
のとなる。The adaptive code and the statistical code are a waveform code contributing to a voiced sound having a statistically strong periodicity and a waveform code contributing to a random unvoiced sound having a statistically weak periodicity, as in the conventional case. In addition, adaptive codebook 2
The adaptive code of 0 is adaptively updated as described later.
The pulse code is a waveform code composed of isolated impulses. The pulse code considers that it contributes to the rise of a voiced sound with a strong periodicity and the steady part of a voiced sound with a clear pulse. The gain code is, for example, vector-quantized, and one component of the vector relates to the gain of the adaptive code and the other component relates to the gain of the fixed code.

【００３３】なお、パルス性の音源信号は、周期性を有
する単純な信号であるのでパルス信号発生部が発生する
ことも考えられるが、この実施例のようにコード化して
コードブック２２から読出すことで発生することが以下
の理由によって好ましい。すなわち、適応コードブック
２０からの出力と同期させ易く、また、統計コードブッ
ク２１と同一のブック構成とすることで後述するように
統計コード又はパルス性コードを選択して復号化器に伝
送する際の多重化処理等が容易になるためである。The pulsed sound source signal is a simple signal having periodicity and may be generated by the pulse signal generator. However, as in this embodiment, it is coded and read from the codebook 22. This is preferable because of the following reasons. That is, it is easy to synchronize with the output from the adaptive codebook 20, and by adopting the same book configuration as the statistical codebook 21, the statistical code or the pulse code is selected and transmitted to the decoder as described later. This is because the multiplexing process and so on are facilitated.

【００３４】このような各種コードを用いて局部再生し
た合成音声信号が目標音声信号に最も類似する、各種コ
ードの最適コードを求めてそのインデックスを多重化部
３に与えて、コード励振線形予測復号化器側に伝送す
る。この実施例は、低符号化速度を意識したものである
ので、固定コードについては統計コード又はパルス性コ
ードを選択してそのインデックスを伝送することとして
いる。従って、固定コードとしていずれを選択している
かの選択情報も、コード励振線形予測復号化器側に伝送
する。A code-excited linear predictive decoding is performed by finding an optimum code of various codes in which the synthesized speech signal locally reproduced using such various codes is most similar to the target speech signal and giving the index to the multiplexing section 3. It is transmitted to the chemical converter side. In this embodiment, since a low coding rate is taken into consideration, a statistical code or a pulsed code is selected for a fixed code and its index is transmitted. Therefore, the selection information indicating which one is selected as the fixed code is also transmitted to the code excitation linear prediction decoder side.

【００３５】このような最適コードの探索（統計コード
又はパルス性コードの選択処理を含む）は、この実施例
の場合、適応コード、統計コード、パルス性コード、利
得コードの順に実行される。In the case of this embodiment, the search for the optimum code (including the selection process of the statistical code or the pulsed code) is executed in the order of the adaptive code, the statistical code, the pulsed code, and the gain code.

【００３６】最適な適応コードの探索時においては、統
計コードブック２１及びパルス性コードブック２２から
の出力を０とし、また、利得制御器２４が適切な値の利
得係数（例えば１）を乗算するようになされている。こ
のような状態において、適応コードブック２０に格納さ
れている全ての適応コードを時間順次に又は並列的に出
力させ、利得制御器２４及び加算器２５を介して重み付
き合成フィルタ２９に励振信号として与える。重み付き
合成フィルタ２９は、ＬＰＣ係数変換部１４から与えら
れたＬＰＣ係数をタップ係数としてこの励振信号に対し
て畳み込み処理を行ない、音源パラメータとして適応コ
ードの内容だけが反映された合成音声信号を、全ての適
応コードについて求める。At the time of searching for the optimum adaptive code, the outputs from the statistical codebook 21 and the pulse codebook 22 are set to 0, and the gain controller 24 multiplies an appropriate value of the gain coefficient (for example, 1). It is done like this. In such a state, all the adaptive codes stored in the adaptive codebook 20 are output in time sequence or in parallel, and are output as an excitation signal to the weighted synthesis filter 29 via the gain controller 24 and the adder 25. give. The weighted synthesis filter 29 performs convolution processing on this excitation signal using the LPC coefficient provided from the LPC coefficient conversion unit 14 as a tap coefficient, and outputs a synthesized speech signal in which only the content of the adaptive code is reflected as a sound source parameter. Ask for all adaptive codes.

【００３７】減算器３０は、適応コードの内容だけが反
映された合成音声信号と目標音声信号との誤差信号を全
ての適応コードについて求めて誤差電力和計算部３１に
与える。誤差電力和計算部３１は誤差信号についてその
成分の２乗和（誤差電力和）を、全ての適応コードにつ
いて求めて最小誤差電力和コード選択部３２に与える。
最小誤差電力コード選択部３２は、誤差電力和が最小の
適応コードを最適なものと決定する。The subtractor 30 obtains an error signal between the synthesized voice signal and the target voice signal, in which only the content of the adaptive code is reflected, for all the adaptive codes, and supplies it to the error power sum calculation unit 31. The error power sum calculation unit 31 obtains the sum of squares of the components of the error signal (error power sum) for all adaptive codes and supplies the sum to the minimum error power sum code selection unit 32.
The minimum error power code selection unit 32 determines the adaptive code with the minimum error power sum as the optimum one.

【００３８】次に、最適な統計コードの探索が実行され
るが、この探索時においては、固定コード選択スイッチ
２６が統計コードブック２１側に切換えられ、適応コー
ドブック２０が出力を０とする（なお、最適適応コード
を出力することも考えられる）。このような状態におい
て、統計コードブック２１に格納されている全ての統計
コードを時間順次に又は並列的に出力させ、固定コード
選択スイッチ２６及び利得制御器２４を介して周波数特
性操作部２８に入力させる。Next, the search for the optimum statistical code is executed. At the time of this search, the fixed code selection switch 26 is switched to the statistical codebook 21 side, and the adaptive codebook 20 sets the output to 0 ( It is also possible to output the optimal adaptation code). In such a state, all statistical codes stored in the statistical code book 21 are time-sequentially or in parallel output, and input to the frequency characteristic operation unit 28 via the fixed code selection switch 26 and the gain controller 24. Let

【００３９】この周波数特性操作部２８は、後述する図
３に示す詳細構成を有し、入力された統計コードの周波
数特性を統計コードの時間的な長さに対応して入力音声
信号の周波数特性に近付けるように変換操作する。この
ように周波数特性が変換操作された全ての統計コードが
加算器２５（この場合ないのに等しい）を介して励振信
号として重み付き合成フィルタ２９に与えられる。これ
以降は、最適な適応コードの探索と同様に処理され、最
小誤差電力和コード選択部３２が最適な統計コードを決
定する。The frequency characteristic operation unit 28 has a detailed configuration shown in FIG. 3 which will be described later, and changes the frequency characteristic of the input statistical code into the frequency characteristic of the input voice signal corresponding to the temporal length of the statistical code. The conversion operation is performed so as to approach. In this way, all the statistical codes whose frequency characteristics have been converted are supplied to the weighted synthesis filter 29 as excitation signals via the adder 25 (which is equal to the value in this case). After that, the process is performed in the same manner as the search for the optimum adaptive code, and the minimum error power sum code selection unit 32 determines the optimum statistical code.

【００４０】ここで、周波数特性操作部２８を設けるよ
うにしたのは以下の理由による。従来、励振信号の周波
数特性は理論的に白色としてモデル化されてきたが、実
際には白色的でなく、入力音声信号の周波数特性に近い
特性を有していることが実験的に確認されている。従っ
て、統計コードやパルス性コードの周波数特性を、入力
音声信号の周波数特性に近付れば、それだけ高品質な合
成音声信号を得ることができ、また、励振信号の有効な
周波数成分は量子化誤差信号よりかなり大きくなって量
子化誤差信号のマスキング効果が得られる。そこで、周
波数特性操作部２８を設けている。ここで、入力音声信
号の周波数特性を表す情報としては、ＬＰＣ係数があ
り、また、ピッチ予測情報を意味する最適な適応コード
の情報（それに対する利得を含む）がある。従って、周
波数特性操作部２８はこれらの情報に基づいて、統計コ
ードやパルス性コードの周波数特性を操作する。The frequency characteristic operating section 28 is provided for the following reason. Conventionally, the frequency characteristic of the excitation signal has been theoretically modeled as white, but it has been experimentally confirmed that the frequency characteristic of the excitation signal is not white and has characteristics close to those of the input audio signal. There is. Therefore, if the frequency characteristics of the statistical code and the pulse-like code are brought close to the frequency characteristics of the input speech signal, a higher quality synthesized speech signal can be obtained, and the effective frequency component of the excitation signal is quantized. The masking effect of the quantized error signal can be obtained by making it considerably larger than the error signal. Therefore, the frequency characteristic operation unit 28 is provided. Here, as the information representing the frequency characteristic of the input speech signal, there is the LPC coefficient, and there is also the information (including the gain for it) of the optimum adaptive code that means the pitch prediction information. Therefore, the frequency characteristic operation unit 28 operates the frequency characteristics of the statistical code and the pulse code based on these pieces of information.

【００４１】このようにして最適な統計コードの探索が
終了すると、次には、最適なパルス性コードの探索を行
なう。この探索時においては、固定コード選択スイッチ
２６がパルス性コードブック２２側に切換えられ、適応
コードブック２０が出力を０とする（なお、最適適応コ
ードを出力することも考えられる）。このような状態に
おいて、パルス性コードブック２２に格納されている全
てのパルス性コードを時間順次に又は並列的に出力させ
る。以降の処理は、最適な統計コードの探索時と同様で
あるのでその説明は省略する。When the search for the optimum statistical code is completed in this way, the search for the optimum pulse code is next performed. During this search, the fixed code selection switch 26 is switched to the pulse codebook 22 side, and the adaptive codebook 20 sets the output to 0 (note that the optimal adaptive code may be output). In such a state, all the pulsed codes stored in the pulsed code book 22 are output in time sequence or in parallel. Subsequent processing is the same as that at the time of searching for the optimum statistical code, and therefore its explanation is omitted.

【００４２】このようにして最適なパルス性コードが決
定されたときには、最小誤差電力和コード選択部３２
は、最適な統計コードの誤差電力和と最適なパルス性コ
ードの誤差電力和とを比較し、誤差電力和が小さい方を
コード励振線形予測復号化器側に伝送する固定コードに
決定する。When the optimum pulse-like code is determined in this way, the minimum error power sum code selecting unit 32
Compares the error power sum of the optimum statistical code and the error power sum of the optimum pulsed code, and determines the smaller error power sum as the fixed code to be transmitted to the code excitation linear prediction decoder side.

【００４３】この後、最適な利得コードの探索を行な
う。この利得コードの探索時においては、適応コードブ
ック２０からは最適な適応コードが出力され、固定コー
ド選択スイッチ２６は選択された統計コードブック２１
又はパルス性コードブック２２に切換えられ、選択され
た固定コードブック２１又は２２からは最適な固定コー
ドが出力される。１個の利得コードブック２３は適応コ
ード用の利得と固定コード用の利得からなり、適応コー
ド用の利得は利得制御器２４に与えられ、固定コード用
の利得は利得制御器２７に与えられる。かくして、利得
制御された最適適応コードと、周波数特性操作と利得制
御とが施された最適固定コードとが加算器２５によって
加算され、励振信号として重み付き合成フィルタ２９に
与えられる。このような処理は、利得コードブック２３
内の全ての利得コードに対して時間順次に又は並列的に
実行される。重み付き合成フィルタ２９以降の探索時の
処理は、他のコードの探索時の処理と同様である。After that, the optimum gain code is searched. At the time of searching for this gain code, the adaptive codebook 20 outputs the optimum adaptive code, and the fixed code selection switch 26 selects the statistical codebook 21.
Alternatively, the pulse code code book 22 is switched to and the optimum fixed code is output from the selected fixed code book 21 or 22. One gain code book 23 is composed of a gain for adaptive code and a gain for fixed code. The gain for adaptive code is given to the gain controller 24 and the gain for fixed code is given to the gain controller 27. Thus, the optimum adaptive code whose gain has been controlled and the optimum fixed code which has been subjected to the frequency characteristic operation and gain control are added by the adder 25, and are added to the weighted synthesis filter 29 as an excitation signal. Such processing is performed by the gain codebook 23.
For all gain codes in time sequence or in parallel. The processing at the time of searching after the weighted synthesis filter 29 is the same as the processing at the time of searching for other codes.

【００４４】最小誤差電力和コード選択部３２は、最適
適応コード、最適固定コード、最適利得コードが得られ
ると、これらのインデックスを多重化部３に与えると共
に、統計コード及びパルス性コードのどちらを選択した
かを表す固定コード選択スイッチ情報も多重化部３に与
える。多重化部３は、ＬＳＰパラメータ符号化部１２か
ら与えられたＬＳＰパラメータと、これら情報とを多重
化してコード励振線形予測復号化器側に出力する。な
お、利得コードとしてベクトル量子化を適用している場
合には、伝送されるインデックスはベクトル番号であ
る。When the optimum adaptive code, the optimum fixed code, and the optimum gain code are obtained, the minimum error power sum code selecting section 32 gives these indexes to the multiplexing section 3 and selects either the statistical code or the pulse code. The fixed code selection switch information indicating whether the selection has been made is also given to the multiplexing unit 3. The multiplexing unit 3 multiplexes the LSP parameter given from the LSP parameter coding unit 12 and these pieces of information, and outputs the multiplexed information to the code excitation linear prediction decoder side. When vector quantization is applied as the gain code, the transmitted index is the vector number.

【００４５】また、最小誤差電力和コード選択部３２
は、多重化部３に与えるインデックス及び固定コード選
択スイッチ情報を、対応するコードブック（２０及び２
３と、２１又は２２）や固定コード選択スイッチ２６に
与える。このとき、スイッチ２６が切換えられ、各コー
ドブックから最適コードが出力される。これにより、今
回のサブフレーム処理時において最も目標音声信号に近
い合成音声信号を形成できる励振信号が加算器２５から
出力され、これが適応コードブック２０に与えられる。
そして、適応コードブック２０は適応コードの更新処理
を行なう。In addition, the minimum error power sum code selection unit 32
Of the index and fixed code selection switch information to be given to the multiplexing unit 3 in the corresponding codebooks (20 and 2).
3 and 21 or 22) and the fixed code selection switch 26. At this time, the switch 26 is switched, and the optimum code is output from each codebook. As a result, an excitation signal that can form a synthesized speech signal that is closest to the target speech signal during the current subframe processing is output from the adder 25, and this is provided to the adaptive codebook 20.
Then, the adaptive code book 20 performs an adaptive code update process.

【００４６】以上のような符号化処理がサブフレーム毎
に繰返され、符号化音声信号が順次コード励振線形予測
復号化器に送信される。The above encoding process is repeated for each subframe, and encoded speech signals are sequentially transmitted to the code excitation linear predictive decoder.

【００４７】図３は、上述した周波数特性操作部２８の
詳細構成を示すものである。図３において、この周波数
特性操作部２８は、縦属接続された２個のフィルタ２８
１及び２８２と、ピッチラグ決定部２８３とから構成さ
れている。FIG. 3 shows a detailed configuration of the above-mentioned frequency characteristic operating unit 28. In FIG. 3, the frequency characteristic operating unit 28 includes two filters 28 connected in cascade.
1 and 282 and a pitch lag determining unit 283.

【００４８】固定コード選択スイッチ２６から出力され
た固定コードは、第１のフィルタ２８１に与えられる。
この第１のフィルタ２８１のインパルス応答Ｈ２（ｚ）
は、次式に示すように選定されており、これによって入
力された固定コードに対する周波数変換操作を行なう。The fixed code output from the fixed code selection switch 26 is applied to the first filter 281.
Impulse response H2 (z) of this first filter 281
Is selected as shown in the following equation, and the frequency conversion operation is performed on the fixed code input by this.

【００４９】Ｈ１（ｚ）＝（１−Σ γⁱａi ｚ^-i）／（１−Σ νⁱａi ｚ^-i） …(1) 但し、ａi （ｉは１〜Ｍ）は、ＬＰＣ係数変換部１４か
ら供給される合成フィルタ２９に対するタップ係数であ
る。また、γ及びνはそれぞれ、予め定められた１より
小さい定数である。H1 (z) = (1−Σγ ⁱ ai z ⁻ⁱ ) / (1−Σ v ⁱ ai z ⁻ⁱ ) ... (1) where ai (i is 1 to M) is the LPC coefficient conversion It is a tap coefficient for the synthesis filter 29 supplied from the unit 14. Further, γ and ν are constants smaller than 1 which are set in advance.

【００５０】この第１のフィルタ２８１によって周波数
特性が操作された固定コードが、第２のフィルタ２８２
に入力される。ピッチラグ決定部２８３は適応コードブ
ック２０に対する最適適応コードのインデックスからピ
ッチラグＬを得て第２のフィルタ２８２に与える。この
第２のフィルタ２８２のインパルス応答Ｈ２（ｚ）は、
次式に示すように選定されており、これによって入力さ
れた固定コードに対する周波数変換操作を行なう。The fixed code of which the frequency characteristic is manipulated by the first filter 281 is the second code 282.
Entered in. The pitch lag determining unit 283 obtains the pitch lag L from the index of the optimum adaptive code for the adaptive codebook 20 and supplies it to the second filter 282. The impulse response H2 (z) of this second filter 282 is
It is selected as shown in the following equation, and the frequency conversion operation is performed on the fixed code input by this.

【００５１】Ｈ２（ｚ）＝１／（１−εｚ^-L） …(2) 但し、εは予め定められた１より小さい定数である。こ
の第２のフィルタ２８２の出力が、利得制御器２７に与
えられる。H2 (z) = 1 / (1-εz ^−L ) (2) where ε is a predetermined constant smaller than 1. The output of the second filter 282 is given to the gain controller 27.

【００５２】このような詳細構成を有する周波数特性操
作部２８によって、上述したように、入力された固定コ
ードの周波数特性を固定コードの時間的な長さに対応し
て入力音声信号の周波数特性に近付けるようにできてい
る。As described above, the frequency characteristic manipulating unit 28 having such a detailed configuration converts the frequency characteristic of the input fixed code into the frequency characteristic of the input voice signal corresponding to the temporal length of the fixed code. It is designed to be close.

【００５３】従って、上記実施例のコード励振線形予測
符号化器によれば、低符号化速度においても高品質の再
生音声を得ることができる。以下、かかる効果が得られ
ることを具体的に説明する。Therefore, according to the code-excited linear predictive encoder of the above embodiment, it is possible to obtain a reproduced voice of high quality even at a low encoding speed. Hereinafter, it will be specifically described that such an effect is obtained.

【００５４】(1) 低符号化速度を期した場合、音源パラ
メータに割当てられる符号化ビット数が少ないので、用
意される固定コードも少なくなり、入力音声信号に含ま
れているパルス性雑音を明確に再生でき難いが、この実
施例の場合、パルス性コードを利用しているので、この
ような場合の音声の再生品質を高めることができる。(1) When a low coding rate is achieved, the number of coding bits assigned to excitation parameters is small, so the number of fixed codes prepared is small, and the pulse noise included in the input speech signal is clarified. However, in the case of this embodiment, since the pulse code is used, the reproduction quality of the voice in such a case can be improved.

【００５５】また、パルス性コードと統計コードとを切
換えて用いているので、低符号化速度に対応できると共
に、音声の過渡部のようなランダム信号とパルス的信号
が混在する信号に対する再生品質を高めることができ
る。Since the pulse code and the statistic code are switched and used, it is possible to cope with a low coding rate and to improve the reproduction quality for a signal in which a random signal and a pulse signal are mixed such as a voice transient portion. Can be increased.

【００５６】(2) 低符号化速度を期した場合、音源パラ
メータに対する符号化ビット数も少なくなるが、声道パ
ラメータに対する符号化ビット数も少なくなる。この実
施例の場合、少ない符号化ビット数で符号化してもＬＰ
Ｃ等より声道スペクトルに与える歪みが小さいＬＳＰパ
ラメータを伝送するようにしているので、この点から再
生品質を高めることができる。(2) When a low coding rate is achieved, the number of coded bits for excitation parameters is reduced, but the number of coded bits for vocal tract parameters is also reduced. In the case of this embodiment, even if encoding is performed with a small number of encoding bits, the LP
Since the LSP parameter that causes less distortion in the vocal tract spectrum than C or the like is transmitted, the reproduction quality can be improved from this point.

【００５７】(3) 上述のように、実際の励振信号が入力
音声信号の周波数特性に近い周波数特性を有することを
考慮して周波数特性操作部を設けているので、実際に即
している分だけ再生品質を高めることができると共に、
この変換に伴い量子化誤差信号に対するマスキング効果
が生じて再生品質を高めることができる。(3) As described above, since the frequency characteristic operation unit is provided in consideration of the fact that the actual excitation signal has the frequency characteristic close to the frequency characteristic of the input voice signal, it is appropriate for the actual use. It is possible to improve the playback quality only,
Along with this conversion, a masking effect for the quantized error signal is generated, and the reproduction quality can be improved.

【００５８】（Ｂ）コード励振線形予測復号化器の一実
施例次に、本発明によるコード励振線形予測復号化器の一実
施例を図面を参照しながら詳述する。この実施例は、図
１に示すコード励振線形予測符号化器の実施例に対応す
るものである。図２がこの実施例の構成を示すブロック
図である。(B) One Embodiment of Code-Excited Linear Prediction Decoder Next, one embodiment of the code-excited linear prediction decoder according to the present invention will be described in detail with reference to the drawings. This embodiment corresponds to the embodiment of the code excitation linear predictive encoder shown in FIG. FIG. 2 is a block diagram showing the configuration of this embodiment.

【００５９】図２において、この実施例のコード励振線
形予測復号化器は、多重分離部４０、ＬＳＰパラメータ
復号化部４１、ＬＰＣ係数変換部４２、適応コードブッ
ク４３、統計コードブック４４、パルス性コードブック
４５、利得コードブック４６、利得制御器４７、４９、
固定コード選択スイッチ４８、周波数特性操作部５０、
加算器５１及び重み付き合成フィルタ５２から構成され
ている。In FIG. 2, the code-excited linear predictive decoder according to this embodiment has a demultiplexing section 40, an LSP parameter decoding section 41, an LPC coefficient conversion section 42, an adaptive codebook 43, a statistical codebook 44, and pulse characteristics. Codebook 45, gain codebook 46, gain controllers 47, 49,
Fixed code selection switch 48, frequency characteristic operation unit 50,
It is composed of an adder 51 and a weighted synthesis filter 52.

【００６０】コード励振線形予測符号化器側から与えら
れた符号化音声信号は、多重分離部４０に入力される。
多重分離部４０は、この符号化音声信号を、ＬＳＰパラ
メータ、最適適応コードのインデックス、最適固定コー
ドのインデックス、最適利得コードのインデックス及び
固定コード選択スイッチ情報に分離する。そして、ＬＳ
ＰパラメータはＬＳＰパラメータ復号化部４１に与え、
最適適応コードのインデックスを適応コードブック４３
に与え、最適利得コードのインデックスを利得コードブ
ック４６に与え、固定コード選択スイッチ情報を固定コ
ード選択スイッチ４８に与える。また、最適固定コード
のインデックスを、固定コード選択スイッチ情報に基づ
いて定まる統計コードブック４４又はパルス性コードブ
ック４５に与える。The coded speech signal supplied from the code excitation linear predictive encoder side is input to the demultiplexing unit 40.
The demultiplexing unit 40 demultiplexes the encoded voice signal into LSP parameters, an index of the optimum adaptive code, an index of the optimum fixed code, an index of the optimum gain code, and fixed code selection switch information. And LS
The P parameter is given to the LSP parameter decoding unit 41,
The index of the optimum adaptive code is the adaptive code book 43.
To the gain code book 46 and the fixed code selection switch information to the fixed code selection switch 48. In addition, the index of the optimum fixed code is given to the statistical codebook 44 or the pulse codebook 45 determined based on the fixed code selection switch information.

【００６１】ＬＳＰパラメータ復号化部４１は、与えら
れた符号化されているＬＳＰパラメータを復号化（例え
ばベクトル逆量子化）し、ＬＰＣ係数変換部４２はこの
ＬＳＰパラメータをＬＰＣ係数に変換する。このように
変換されたＬＰＣ係数が、周波数特性操作部５０及び重
付き合成フィルタ５２に声道パラメータ情報として与え
られる。The LSP parameter decoding unit 41 decodes (for example, vector dequantization) the given encoded LSP parameter, and the LPC coefficient conversion unit 42 converts this LSP parameter into an LPC coefficient. The LPC coefficient converted in this way is given to the frequency characteristic operation unit 50 and the weighted synthesis filter 52 as vocal tract parameter information.

【００６２】利得コードブック４６は、与えられたイン
デックスで定まる利得コードを、適応コード用と固定コ
ード用とに分離し、それぞれ適応コード用の利得制御器
４７、固定コード用の利得制御器４９に与える。The gain code book 46 separates a gain code determined by the given index into an adaptive code gain code and a fixed code gain code, and outputs them to an adaptive code gain controller 47 and a fixed code gain controller 49, respectively. give.

【００６３】適応コードブック４３は、与えられたイン
デックスで定まる適応コードを出力し、この適応コード
が利得制御器４７を介して利得制御されて加算器５１に
与えられる。また、適応コードブック４３は適応コード
を周波数特性操作部５０に与える。The adaptive code book 43 outputs an adaptive code determined by the given index, and this adaptive code is gain-controlled by the gain controller 47 and is given to the adder 51. Further, the adaptive code book 43 provides the adaptive code to the frequency characteristic operation unit 50.

【００６４】統計コードブック４４又はパルス性コード
ブック４５は、与えられたインデックスに対応する統計
コード又はパルス性コードを固定コード選択スイッチ４
８を介して周波数特性操作部５０に与えられ、周波数特
性操作部５０は、ＬＰＣ係数、適応コードのインデック
スに基づいてその周波数特性を操作する（入力音声信号
の周波数特性に近くなるように操作する）。周波数特性
操作部５０の詳細構成は、上述した図３に示す構成を有
している。このように周波数特性が操作された固定コー
ドが、利得制御器４９で利得制御されて加算器５１に与
えられる。The statistical codebook 44 or the pulse codebook 45 stores the statistical code or pulse code corresponding to the given index in the fixed code selection switch 4.
8 is applied to the frequency characteristic operation unit 50, and the frequency characteristic operation unit 50 operates the frequency characteristic based on the LPC coefficient and the index of the adaptive code (the operation is performed so as to be close to the frequency characteristic of the input voice signal). ). The detailed configuration of the frequency characteristic operation unit 50 has the configuration shown in FIG. 3 described above. The fixed code whose frequency characteristic is manipulated in this way is gain-controlled by the gain controller 49 and is given to the adder 51.

【００６５】加算器５１は、与えられた適応コードと固
定コードを加算してその加算信号を励振信号として重み
付き合成フィルタ５２に与える。重み付き合成フィルタ
５２は、この励振信号をＬＰＣ係数で畳み込んで合成音
声信号を得て出力する。The adder 51 adds the given adaptive code and fixed code and gives the addition signal as an excitation signal to the weighted synthesis filter 52. The weighted synthesis filter 52 convolves the excitation signal with the LPC coefficient to obtain and output a synthesized speech signal.

【００６６】加算器５１から出力された励振信号は、ま
た適応コードブック４３に与えられる。このとき、適応
コードブック４３は、この励振信号を用いて適応コード
の更新を行なう。The excitation signal output from the adder 51 is also supplied to the adaptive codebook 43. At this time, the adaptive code book 43 updates the adaptive code using this excitation signal.

【００６７】コード励振線形予測復号化器は、以上のよ
うな処理を符号化音声信号が与えられる毎に、従ってサ
ブフレーム毎に行なう。The code-excited linear predictive decoder performs the above-described processing every time a coded speech signal is given, that is, every subframe.

【００６８】従って、この実施例のコード励振線形予測
復号化器によれば、与えられたＬＳＰパラメータを処理
する構成を有し、音源としてパルス性コードブック４５
を有し、入力音声信号の周波数特性に固定音源の周波数
特性を近付ける周波数特性操作部５０を有するので、こ
れにより、上述したコード励振線形予測符号化器につい
ての効果が実際のものとなる。Therefore, according to the code-excited linear predictive decoder of this embodiment, it has a configuration for processing the given LSP parameter, and the pulse-like codebook 45 is used as the sound source.
And the frequency characteristic manipulating unit 50 that brings the frequency characteristic of the fixed sound source closer to the frequency characteristic of the input speech signal. Therefore, the effect of the above-described code excitation linear predictive encoder becomes actual.

【００６９】（Ｃ）他の実施例上記実施例において特徴的なことは、声道パラメータと
してＬＳＰパラメータを用いて伝送している点、音源パ
ラメータを与えるものとしてパルス性コードブックを備
えている点、固定コードの周波数特性を操作している点
である。このうち第２点及び第３点の特徴構成は、従来
では存在せず、各々が独立して符号化器及び復号化器に
盛り込まれても良い。(C) Other Embodiments The features of the above embodiment are that the LSP parameter is used as the vocal tract parameter for transmission, and that the pulse codebook is provided as the source parameter. The point is that the frequency characteristics of the fixed code are manipulated. Of these, the characteristic configurations of the second point and the third point do not exist conventionally, and each may be independently incorporated in the encoder and the decoder.

【００７０】また、上記実施例は、フォワード型のコー
ド励振線形予測符号化器及び復号化器に関するものであ
るが、本発明を、バックワード型のコード励振線形予測
符号化器及び復号化器に適用することもできる。Although the above embodiment relates to the forward type code excitation linear predictive encoder and decoder, the present invention is applied to the backward type code excitation linear predictive encoder and decoder. It can also be applied.

【００７１】さらに、上記実施例は、４bit/s 以下の符
号化速度を意識して構成されたものであるが、これより
高い符号化速度の符号化器及び復号化器に適用できるこ
とは勿論である。符号化速度が許すならば、統計コード
ブック及びパルス性コードブックを選択的ではなく、常
に両者を有効に動作させるものであっても良い。Further, although the above embodiment is constructed in consideration of the coding rate of 4 bits / s or less, it is needless to say that it can be applied to the encoder and the decoder of the coding rate higher than this. is there. If the coding speed permits, the statistical codebook and the pulse codebook may not be selective but may always be effective.

【００７２】上記実施例の周波数特性操作部２８及び５
０は、ＬＰＣ係数を利用した周波数特性操作とピッチラ
グを利用した周波数特性操作の両方を行なうものであっ
たが、少なくとも一方だけの操作を行なうものであって
も良い。Frequency characteristic operating units 28 and 5 of the above embodiment
Although 0 performs both the frequency characteristic operation using the LPC coefficient and the frequency characteristic operation using the pitch lag, at least one operation may be performed.

【００７３】[0073]

【発明の効果】請求項１及び請求項５のコード励振線形
予測符号化器及び復号化器によれば、適応コードブック
及び統計コードブックに加えて孤立インパルスでなるパ
ルス性コードを格納しているパルス性コードブックを設
けたので、周期性の強い有声音の立ち上がりを早くでき
たり、有声音の定常部においても明確なパルス性の強い
励振信号を形成できたりして明瞭な再生音声を得ること
ができる。また、ランダム信号とパルス的な信号が混在
している、例えば音声の過渡期の信号に対しても良好な
再生音声を得ることができる。According to the code-excited linear predictive encoder and decoder of claims 1 and 5, a pulse-like code consisting of isolated impulses is stored in addition to the adaptive codebook and the statistical codebook. Since a pulse codebook is provided, it is possible to obtain a clear reproduced voice by speeding up the rise of voiced sound with strong periodicity and forming a strong pulsed excitation signal even in the stationary part of voiced sound. You can Further, it is possible to obtain a good reproduced voice even for a signal in a transitional period of a voice in which a random signal and a pulse-like signal are mixed.

【００７４】請求項２及び請求項６のコード励振線形予
測符号化器及び復号化器によれば、統計コード及びパル
ス性コードを選択して使用するので、音源パラメータの
符号化ビット数が少ない状態で良好な再生音声を得るこ
とができる。According to the code-excited linear predictive encoder and decoder of claims 2 and 6, since the statistical code and the pulse-like code are selected and used, the number of encoded bits of the excitation parameter is small. It is possible to obtain good reproduced sound.

【００７５】請求項３及び請求項７のコード励振線形予
測符号化器及び復号化器によれば、音声合成に使用する
声道パラメータを線スペクトル対パラメータとしたの
で、声道パラメータの面からも低符号化速度での高品質
を実現できる。According to the code-excited linear predictive encoder and decoder of claims 3 and 7, since the vocal tract parameter used for speech synthesis is the line spectrum pair parameter, the vocal tract parameter is also taken into consideration. High quality at low coding speed can be realized.

【００７６】請求項４及び請求項８のコード励振線形予
測符号化器及び復号化器によれば、統計コードブック及
び又はパルス性コードブックから出力された固定コード
の周波数特性を、入力音声信号の周波数特性に近付ける
操作を行なうので、音源パラメータが実際の音源と良く
対応がとれ、この点から再生音声の品質を高めることが
できる。According to the code-excited linear predictive encoder and decoder of claims 4 and 8, the frequency characteristic of the fixed code output from the statistical codebook and / or the pulse-like codebook is converted into the input speech signal. Since the operation of approaching the frequency characteristic is performed, the sound source parameters correspond well to the actual sound source, and the quality of the reproduced sound can be improved from this point.

[Brief description of drawings]

【図１】コード励振線形予測符号化器の一実施例を示す
ブロック図である。FIG. 1 is a block diagram showing an embodiment of a code-excited linear predictive encoder.

【図２】コード励振線形予測復号化器の一実施例を示す
ブロック図である。FIG. 2 is a block diagram showing an embodiment of a code-excited linear predictive decoder.

【図３】実施例の周波数特性操作部２８（５０）の詳細
構成を示すブロック図である。FIG. 3 is a block diagram showing a detailed configuration of a frequency characteristic operation unit 28 (50) of the embodiment.

[Explanation of symbols]

３…多重化部、１１…ＬＳＰパラメータ分析部、１２…
ＬＳＰパラメータ符号化部、１３、４１…ＬＳＰパラメ
ータ復号化部、１４、４２…ＬＰＣ係数変換部、２０、
４３…適応コードブック、２１、４４…統計コードブッ
ク、２２、４５…パルス性コードブック、２５、５１…
加算器、２６、４８…固定コード選択スイッチ、２８、
５０…周波数特性操作部、２９、５２…重み付き合成フ
ィルタ、３０…減算器、３１…誤差電力和計算部、３２
…最小誤差電力コード選択部、４０…多重分離部。3 ... Multiplexing unit, 11 ... LSP parameter analyzing unit, 12 ...
LSP parameter coding unit, 13, 41 ... LSP parameter decoding unit, 14, 42 ... LPC coefficient conversion unit, 20,
43 ... Adaptive codebook, 21, 44 ... Statistical codebook, 22, 45 ... Pulse codebook, 25, 51 ...
Adder, 26, 48 ... Fixed code selection switch, 28,
50 ... Frequency characteristic operation unit, 29, 52 ... Weighted synthesis filter, 30 ... Subtractor, 31 ... Error power sum calculation unit, 32
... minimum error power code selection unit, 40 ... demultiplexing unit.

───────────────────────────────────────────────────── フロントページの続き (72)発明者有山義博東京都港区虎ノ門１丁目７番12号沖電気工業株式会社内 ─────────────────────────────────────────────────── ─── Continuation of front page (72) Inventor Yoshihiro Ariyama 1-7-12 Toranomon, Minato-ku, Tokyo Oki Electric Industry Co., Ltd.

Claims

[Claims]

1. A code-excited linear predictive encoder having an adaptive codebook and a statistical codebook as an excitation codebook, in which, in addition to the adaptive codebook and the statistical codebook, a pulsed code consisting of isolated impulses is stored. A code-excited linear predictive encoder having a pulse codebook.

2. The excitation code from the statistical codebook or the pulse codebook is selected and used, and the selection information is sent to a code excitation linear predictive decoder. Code-excited linear predictive encoder.

3. The code-excited linear predictive encoder according to claim 1, wherein the vocal tract parameters output to the code-excited linear predictive decoder are line spectrum pair parameters.

4. A code-excited linear predictive coding having a frequency characteristic operation unit for approximating the frequency characteristic of the excitation code output from the statistical codebook and / or the pulse codebook to the frequency characteristic of the input speech signal. vessel.

5. A code-excited linear predictive decoder having an adaptive codebook and a statistical codebook as an excitation codebook, wherein a pulsed code consisting of isolated impulses is stored in addition to the adaptive codebook and the statistical codebook. A code-excited linear predictive decoder having a pulse codebook.

6. The excitation code from the statistical codebook or the pulse codebook is selected based on selection information provided from a corresponding code excitation linear predictive encoder.
The code-excited linear predictive decoder according to claim 5, which is selected and used.

7. The code excitation according to claim 5, wherein the vocal tract parameters given by the corresponding code excitation linear predictive encoder are line spectrum pair parameters, which are used for speech reproduction. Linear predictive decoder.

8. A frequency characteristic manipulating section for manipulating the frequency characteristic of the fixed code output from the statistical codebook and / or the pulse codebook based on the information given from the corresponding code excitation linear predictive encoder. A code-excited linear predictive decoder having the above characteristics.