JP3230380B2

JP3230380B2 - Audio coding device

Info

Publication number: JP3230380B2
Application number: JP18325794A
Authority: JP
Inventors: 慶一舟木; 一範小澤; 和永吉田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1994-08-04
Filing date: 1994-08-04
Publication date: 2001-11-19
Anticipated expiration: 2016-11-19
Also published as: CA2144693A1; JPH0844396A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は音声符号化装置に関し、
特に音声信号を８〜４ｋｂ／ｓ程度の低いビットレート
で高品質に符号化するＣＥＬＰ方式の音声符号化装置に
関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech coding apparatus,
In particular, the present invention relates to a CELP-type audio encoding device that encodes an audio signal at a low bit rate of about 8 to 4 kb / s with high quality.

【０００２】[0002]

【従来の技術】近年、無線電波を媒体とした自動車電話
やコードレス電話のディジタル化が急激に進展してい
る。無線電波ではこの種の電話に使用可能な周波数帯域
が少ないため、占有帯域を低減するために低ビットレー
トの音声信号の符号化方式の開発は重要である。ビット
レートが８〜４ｋｂ／ｓ程度のこの種の符号化方式とし
て、例えば、１９８５年アメリカで出版されたアイキャ
スプ・プロシーディング８５，（ＩＣＡＳＳＰｐｒｏ
ｃｅｅｄｉｎｇｓ８５）第９３７〜９４０頁所載の論
文シュローダおよびアタル，コードエキサイテッド・リ
ニア・プレディクション：ハイクオリティスピーチ・ア
トロウビットレーツ（Ｍ．Ｓｃｈｒｏｅｄｅｒａｎｄ
Ｂ．Ｓ．Ａｔａｌ，”Ｃｏｄｅ−ｅｘｃｉｔｅｄｌ
ｉｅａｒｐｒｅｄｉｃｔｉｏｎ：Ｈｉｇｈｑｕａｌｉ
ｔｙｓｐｅｅｃｈａｔｌｏｗｂｉｔｒａｔｅ
ｓ”）（文献１）等に記載されているＣＥＬＰ（Ｃｏｄ
ｅＥｘｃｉｔｅｄＬＰＣＣｏｄｉｎｇ）が知られ
ている。2. Description of the Related Art In recent years, the digitization of automobile telephones and cordless telephones using radio waves as a medium has been rapidly advancing. Since a radio wave has a small frequency band that can be used for this type of telephone, it is important to develop a coding method for a low bit rate audio signal in order to reduce the occupied band. Examples of this type of coding system having a bit rate of about 8 to 4 kb / s include, for example, Eyecast Prop. 85, published in the United States in 1985 (ICASSP pro
ceedings 85) Schroeder and Atal, Code Excited Linear Prediction, pages 937-940: High Quality Speech Atrobitrates (M. Schroeder and
B. S. Atal, "Code-excited l
earprediction: High quali
ty speech at lowbit rate
s ") (Reference 1) and the like.
e Excited LPC Coding) is known.

【０００３】この文献１記載の従来の第１の音声符号化
装置であるＣＥＬＰにおいて、送信側では次の手順で符
号化処理が行われる。まず、フレーム毎（例えば２０ｍ
ｓ）に、符号化対象の音声信号から音声の周波数特性を
表す短期予測符号を抽出する（短期予測）。次にフレー
ムをさらに小区間のサブフレーム（例えば５ｍｓ）に分
割する。上記サブフレーム毎に、過去の音源信号から長
区間相関（ピッチ相関）を表すピッチパラメータを抽出
し、上記ピッチパラメータによりそのサブフレームの音
声信号を長期予測する。この長期予測は、上記過去の音
源信号を各遅延符号に対応する遅延サンプル分遅延させ
たサブフレーム長の音源信号（適応コードベクトル）か
ら成る適応コードブックを用いて、上記ピッチ相関を表
す遅延符号を次の手順で決定することによりなされる。
すなわち、上記遅延符号を適応コードブックのサイズ分
変化させ、各遅延符号に対応する適応コードベクトルを
抽出する。抽出された上記適応コードベクトルを用いて
合成信号を生成し上記音声信号との誤差電力を算出す
る。算出された上記誤差電力が最小になる最適遅延符号
と、この最適遅延符号に対応する適応コードベクトルと
そのゲインとを決定する。[0003] In the CELP which is the first conventional speech coding apparatus described in Document 1, a coding process is performed on the transmitting side in the following procedure. First, every frame (for example, 20m
In s), a short-term prediction code representing a frequency characteristic of the voice is extracted from the voice signal to be coded (short-term prediction). Next, the frame is further divided into sub-frames (for example, 5 ms) of a small section. For each sub-frame, a pitch parameter indicating a long-term correlation (pitch correlation) is extracted from a past sound source signal, and the speech signal of the sub-frame is predicted for a long time by the pitch parameter. The long-term prediction uses a delay code representing the pitch correlation using an adaptive codebook composed of a subframe-length excitation signal (adaptive code vector) obtained by delaying the past excitation signal by delay samples corresponding to each delay code. Is determined by the following procedure.
That is, the delay code is changed by the size of the adaptive codebook, and an adaptive code vector corresponding to each delay code is extracted. A composite signal is generated using the extracted adaptive code vector, and error power with respect to the audio signal is calculated. An optimal delay code that minimizes the calculated error power, an adaptive code vector corresponding to the optimal delay code, and its gain are determined.

【０００４】次に、あらかじめ用意された種類の量子化
符号である雑音信号（音源コードブック）から抽出した
音源コードベクトルから生成した合成信号と、上記長期
予測して求められた残差信号との誤差電力が最小になる
音源コードベクトルとそのゲインとを決定する（音源コ
ードブック探索）。このようにして決定された適応コー
ドベクトルならびに音源コードベクトルの種類を表すイ
ンデックスと各々の音源信号のゲインならびにスペクト
ルパラメータの種類を表すインデックスとを伝送する。Next, a synthesized signal generated from an excitation code vector extracted from a noise signal (excitation codebook), which is a kind of quantization code prepared in advance, and a residual signal obtained by the long-term prediction are described. A sound source code vector that minimizes error power and its gain are determined (sound source codebook search). An index indicating the type of the adaptive code vector and the excitation code vector determined in this way and an index indicating the type of the gain and the spectrum parameter of each excitation signal are transmitted.

【０００５】具体的な適応コードベクトルの遅延符号と
音源コードベクトルの量子化符号の探索法は次の手順で
行われる。先ず、入力された音声信号ｘ［ｎ］に対し聴
感上の重み付けおよび過去の影響信号の減算を行って信
号ｚ［ｎ］を算出する。次に、上述の短期予測で求め、
量子化および逆量子化を行ったスペクトルパラメータで
構成される合成フィルタＨを、量子化符号ｊのコードベ
クトルｅｊ［ｎ］で駆動して合成信号Ｈｅｊ［ｎ］を算
出する。次に、次式において、信号ｚ［ｎ］と信号Ｈｅ
ｊ［ｎ］の誤差電力Ｅが最小になる量子化符号ｊを求め
る。A specific method of searching for a delay code of an adaptive code vector and a quantization code of an excitation code vector is performed in the following procedure. First, a signal z [n] is calculated by performing weighting on audibility and subtracting a past influence signal from the input audio signal x [n]. Next, calculated by the above short-term forecast,
The synthesis filter H composed of the quantized and inversely quantized spectral parameters is driven by the code vector ej [n] of the quantization code j to calculate the synthesized signal Hej [n]. Next, in the following equation, the signal z [n] and the signal He are
A quantization code j that minimizes the error power E of j [n] is obtained.

【０００６】 [0006]

【０００７】ここで、Ｎ_sはサブフレーム長を、Ｈは合
成フィルタを実現する行列を、ｇ_ejはコードベクトルｅ
ｊのゲインをそれぞれ表す。実際には（１）式は、次の
ように展開される。Here, N _s is a subframe length, H is a matrix for realizing a synthesis filter, and g _ej is a code vector e.
j, respectively. In practice, equation (1) is expanded as follows.

【０００８】 [0008]

【０００９】（２）式の分子Ｃｊは相互相関、分母Ｇｊ
は自己相関であり、それぞれ次式で算出できる。The numerator Cj in the equation (2) is a cross-correlation, a denominator Gj
Is the autocorrelation, which can be calculated by the following equations.

【００１０】 [0010]

【００１１】これら自己相関Ｇｊと相互相関Ｃｊの計算
は、合成フィルタの駆動すなわちフィルタリングにより
信号Ｈｅｊ［ｎ］を算出の後、実行される。そのとき、
上記フィルタリング処理の演算は上述のコードブックの
サイズの分にわたり実行するため、処理対象のフレーム
に対する上記演算の演算量（積和の回数）は非常に多く
なる。The calculation of the auto-correlation Gj and the cross-correlation Cj is executed after the signal Hej [n] is calculated by driving the synthesis filter, that is, filtering. then,
Since the calculation of the filtering process is performed over the size of the codebook, the amount of calculation (the number of times of product sum) of the calculation for the frame to be processed becomes very large.

【００１２】この長期予測の時の演算量低減ための従来
の第２の音声符号化装置として、特願平２−２２８５８
１号公報（文献２）記載の「デジタル音声コーダーおよ
びそのコーダーに用いられるパラメータを求める方法」
に開示された遅延符号のオープン・クローズド探索がよ
く知られている。この方法は、オープンループにより遅
延符号の予備選択を行い、クローズドループにより上記
予備選択により決定された遅延符号の近傍の符号の探索
を行うことにより、長期予測を精度低下なしに低演算量
で実現している。Japanese Patent Application No. 22858/1990 discloses a second conventional speech coding apparatus for reducing the amount of computation during long-term prediction.
No. 1 (Reference 2), “Digital voice coder and method for obtaining parameters used in the coder”
Is well known. This method realizes a long-term prediction with a small amount of calculation without deteriorating accuracy by performing preliminary selection of a delay code by an open loop and searching for a code near the delay code determined by the preliminary selection by a closed loop. are doing.

【００１３】従来の第１および第２の音声符号化装置を
含むＣＥＬＰ方式の音声符号化装置をブロックで示す図
５を参照すると、この図に示す音声符号化装置は、音声
入力信号を符号化する符号化部１と、符号化信号を復号
化する復号化部２と、符号化部１と復号化部２とを接続
する伝送路３とを備える。FIG. 5 is a block diagram showing a conventional CELP type speech coding apparatus including first and second speech coding apparatuses. Referring to FIG. 5, the speech coding apparatus shown in FIG. And a transmission line 3 connecting the encoding unit 1 and the decoding unit 2 to each other.

【００１４】符号化部１は、入力端子ＴＩから入力した
音声信号を記憶するバッファ回路１１と、音声のスペク
トルパラメータであるＬＰＣ係数を抽出するＬＰＣ分析
回路１２と、ＬＰＣ係数を量子化するパラメータ量子化
回路１３と、音声信号に対し聴感重み付けを行う重み付
け回路１４と、過去の音源を蓄えておく適応コードブッ
ク１５と、ピッチ相関を表す遅延符号である適応コード
ベクトルを探索する長期予測回路１６と、長期予測残差
を表すサブフレーム長の音源コードベクトルが蓄えられ
たコードブックである音源コードブック１７と、音源コ
ードブックから最適な音源コードベクトルを決定する音
源コードブック探索回路１８と、適応コードベクトルと
音源コードベクトルのゲイン項を表すパラメータが蓄積
されているゲインコードブック１９と、適応コードベク
トルと音源コードベクトルの量子化ゲインをゲインコー
ドブックから決定するゲインコードブック探索回路４０
と、符号系列を組み合わせて出力するマルチプレクサ４
１とを備える。The encoding unit 1 includes a buffer circuit 11 for storing an audio signal input from an input terminal TI, an LPC analysis circuit 12 for extracting an LPC coefficient as an audio spectrum parameter, and a parameter quantization circuit for quantizing the LPC coefficient. Circuit 13, a weighting circuit 14 for weighting the audibility of an audio signal, an adaptive codebook 15 for storing past sound sources, and a long-term prediction circuit 16 for searching for an adaptive code vector that is a delay code representing a pitch correlation. An excitation codebook 17 which is a codebook in which excitation codes of subframe lengths representing long-term prediction residuals are stored; an excitation codebook search circuit 18 for determining an optimal excitation codevector from the excitation codebook; Gain that stores the parameters representing the gain terms of the vector and sound source code vector And Dobukku 19, the gain codebook searching circuit determines the quantization gain of the adaptive code vector and the excitation code vector from the gain codebook 40
And a multiplexer 4 for combining and outputting a code sequence
1 is provided.

【００１５】音源コードブック１７は、文献１記載の雑
音コードブックでも特願平２−２２９５５号公報や特願
平２−２２９５６号公報等記載のベクトル量子化（Ｖ
Ｑ）アルゴリズムにより学習された学習コードブックで
も構わない。The sound source codebook 17 uses the vector quantization (V) described in Japanese Unexamined Patent Application Publication No. 2-22955 or Japanese Unexamined Patent Application Publication No. 2-22956 as a noise codebook described in Reference 1.
Q) A learning codebook learned by an algorithm may be used.

【００１６】復号化部２は、供給を受けた伝送符号を所
定の各符号系列にデコードするデマルチプレクサ２１
と、適応コードブック１５と同一の適応コードブック２
２と、音源コードブック１７と同一の音源コードブック
２３と、ゲインコードブック１９と同一のゲインコード
ブック２４と、生成された音源と音声合成フィルタより
音声信号を再生する合成フィルタ２５と、出力端子ＴＯ
と、音声出力用の出力端子ＴＯとを備える。The decoding unit 2 is a demultiplexer 21 for decoding the supplied transmission code into predetermined code sequences.
And adaptive codebook 2 identical to adaptive codebook 15
2, a sound source codebook 23 that is the same as the sound source codebook 17, a gain codebook 24 that is the same as the gain codebook 19, a synthesis filter 25 that reproduces a sound signal from the generated sound source and the sound synthesis filter, and an output terminal. TO
And an output terminal TO for audio output.

【００１７】長期予測回路１６の構成をブロックで示す
図６を参照すると、この従来の長期予測回路１６は、遅
延符号を適応コードブックのサイズ分変化させる遅延符
号可変回路１６１と、遅延符号可変回路１６１で設定さ
れた遅延符号ｄに対応する適応コードベクトルｅ
_d（ｎ）をコードブック１６６に蓄えられるている過去
の信号より生成する適応コードベクトル生成回路１６２
と、適応コードベクトルｅ_d（ｎ）を入力とする合成信
号である重み付け適応コードベクトルＨ・ｅ_d（ｎ）を
生成する合成フィルタ１６３と、音声バッファに蓄積さ
れている音声信号と合成信号Ｈ・ｅ_d（ｎ）の誤差電力
を表す評価関数を算出する評価関数算出回路１６４と、
全ての可変させた遅延符号ｄに対応する評価関数を用い
て最適な遅延符号ＣＤを決定する最適遅延符号決定回路
１６５と、適応コードブック探索のための過去の音源信
号や残差信号や重み付け信号あるいは音声信号などの信
号を蓄積するバッファであるコードブック１６６と、符
号化処理を行う区間の音声信号を蓄積するバッファであ
り、この音声信号との誤差電力を最小にする遅延コード
が探索される音声バッファ１６７とを備える。Referring to FIG. 6 showing a block diagram of the configuration of the long-term prediction circuit 16, the conventional long-term prediction circuit 16 includes a delay code variable circuit 161 for changing a delay code by the size of an adaptive codebook, and a delay code variable circuit. The adaptive code vector e corresponding to the delay code d set in 161
Adaptive code vector generation circuit 162 for generating _d (n) from past signals stored in codebook 166
When the adaptive code vector e _d (n) and a synthetic signal for receiving the weighted adaptive code vector H · e _d synthesis filter 163 to generate a (n), synthesized signal H and the audio signal stored in the audio buffer An evaluation function calculation circuit 164 for calculating an evaluation function representing an error power of e _d (n);
An optimal delay code determination circuit 165 that determines an optimal delay code CD using an evaluation function corresponding to all variable delay codes d; a past excitation signal, a residual signal, and a weighting signal for adaptive codebook search; Alternatively, a codebook 166 which is a buffer for accumulating a signal such as an audio signal, and a buffer which accumulates an audio signal in a section where encoding processing is performed, are searched for a delay code which minimizes error power with respect to this audio signal. An audio buffer 167 is provided.

【００１８】次に、図５および図６を参照して、従来の
音声符号化回路の処理の流れについて説明すると、まず
符号化部１は、入力端子ＴＩより、音声信号を入力しバ
ッファ１１に格納する。このバッファ１１に蓄えられた
一定サンプルの音声信号を用いてＬＰＣ分析回路１２で
短期予測分析し、この音声信号のＬＰＣ係数を算出す
る。ＬＰＣ分析回路１２で求めた上記ＬＰＣ係数はパラ
メータ量子化回路１３で量子化され、上記ＬＰＣ係数の
量子化符号ＣＬがマルチプレクサ４１に送られると共
に、逆量子化され以後の符号化処理に用いられる。Next, with reference to FIGS. 5 and 6, a description will be given of the flow of processing of the conventional audio encoding circuit. First, the encoding unit 1 inputs an audio signal from an input terminal TI and sends it to a buffer 11. Store. The LPC analysis circuit 12 performs a short-term prediction analysis using the audio signal of a certain sample stored in the buffer 11, and calculates an LPC coefficient of the audio signal. The LPC coefficient obtained by the LPC analysis circuit 12 is quantized by the parameter quantization circuit 13, and the quantized code CL of the LPC coefficient is sent to the multiplexer 41, and is dequantized and used for the subsequent encoding processing.

【００１９】一方、バッファ１１に蓄えられた音声信号
は量子化／逆量子化されたＬＰＣ係数を用いて重み付け
回路１４で聴感上の重み付けをされた信号ＳＷとして長
期予測回路１６，音源コードブック探索回路１８，およ
びゲインコードブック探索回路４０にそれぞれ供給さ
れ、以降のコードブック探索に用いられる。On the other hand, the audio signal stored in the buffer 11 is converted into a perceptually weighted signal SW by the weighting circuit 14 using the quantized / dequantized LPC coefficients. The signal is supplied to the circuit 18 and the gain codebook search circuit 40, respectively, and used for the subsequent codebook search.

【００２０】次に、適応コードブック１５、音源コード
ブック１７、およびゲインコードブック１９の各々を用
いて信号ＳＷのそれぞれのコードブック探索を行う。ま
ず、最初に長期予測回路１６で長期予測を行い、ピッチ
相関を表す最適の遅延符号ＣＤを後述のように決定し、
その遅延符号ＣＤをマルチプレクサ４１に転送するとと
もに、対応の適応コードベクトルの生成を行なう。次
に、上記適応コードベクトルの影響を減算後、音源コー
ドブック探索回路１８で音源コードブック探索を行い、
量子化符号ＣＳを決定し、音源コードベクトルを生成す
るとともにこの量子化符号ＣＳをマルチプレクサ４１に
転送する。適応コードベクトルと音源コードベクトルと
を求めた後、ゲインコードブック探索回路４０でこれら
２つの音源のゲインを算出し、その符号ＣＧをマルチプ
レクサ４１に転送する。マルチプレクサ４１では、これ
ら符号ＣＬ，ＣＤ，ＣＳ，およびＣＧを組み合わせて伝
送符号ＣＴに変換し、この符号ＣＴを伝送路３を経由し
て復号化部２に転送する。Next, a codebook search for the signal SW is performed using each of the adaptive codebook 15, the sound source codebook 17, and the gain codebook 19. First, a long-term prediction is first performed by the long-term prediction circuit 16, and an optimum delay code CD representing the pitch correlation is determined as described later.
The delay code CD is transferred to the multiplexer 41, and a corresponding adaptive code vector is generated. Next, after subtracting the influence of the adaptive code vector, the sound source codebook search circuit 18 performs a sound source codebook search,
The quantization code CS is determined, an excitation code vector is generated, and the quantization code CS is transferred to the multiplexer 41. After obtaining the adaptive code vector and the excitation code vector, the gain codebook search circuit 40 calculates the gain of these two excitations and transfers the code CG to the multiplexer 41. The multiplexer 41 combines these codes CL, CD, CS, and CG and converts them into a transmission code CT, and transfers this code CT to the decoding unit 2 via the transmission path 3.

【００２１】復号化部２は、デマルチプレクサ２で、伝
送路３から入力された伝送符号ＣＴを符号ＣＬ，ＣＤ，
ＣＳ，およびＣＧの各々に分解する。ＬＰＣ係数対応の
符号ＣＬよりフィルタ係数をデコードし、合成フィルタ
２５に転送する。遅延符号ＣＤより適応コードブック１
５を用いて適応コードベクトルを生成する。音源対応の
量子化符号ＣＳより音源コードブック１７を用いて音源
コードベクトルを生成する。ゲイン対応の符号ＣＧより
適応コードベクトルと音源コードベクトルのゲインを算
出し、各音源にゲイン項を掛け合わせて合成フィルタの
入力信号を生成する。最後に入力信号を用いて合成フィ
ルタ２５で音声信号の合成を行ない端子ＴＯから出力す
る。The decoding unit 2 uses the demultiplexer 2 to convert the transmission code CT input from the transmission path 3 into codes CL, CD,
Decomposes into CS and CG respectively. The filter coefficient is decoded from the code CL corresponding to the LPC coefficient and transferred to the synthesis filter 25. Adaptive codebook 1 from delay code CD
5 to generate an adaptive code vector. An excitation code vector is generated from the quantization code CS corresponding to the excitation using the excitation codebook 17. The gain of the adaptive code vector and the excitation code vector is calculated from the code CG corresponding to the gain, and each excitation is multiplied by a gain term to generate an input signal of the synthesis filter. Finally, the synthesis signal is synthesized by the synthesis filter 25 using the input signal and output from the terminal TO.

【００２２】次に、図６を参照して長期予測回路１６の
動作について説明すると、まず、遅延符号可変回路１６
１は、信号ＳＷ対応の遅延符号を適応コードブックのサ
イズ分変化させ、遅延符号ｄを設定する。この遅延符号
ｄは小数点遅延を表す符号が望ましいが、整数遅延を表
す符号でもよい。次に、適応コードベクトル生成回路１
６２はこの遅延符号ｄに対応する適応コードベクトルｅ
_d（ｎ）をコードブック１６６から生成する。次に、合
成フィルタ１６３は適応コードベクトルｅ_d（ｎ）を入
力とする零状態合成信号である重み付け適応コードベク
トルＨ・ｅ_d（ｎ）を生成する。評価関数算出回路１６
４は重み付け適応コードベクトルＨ・ｅ_d（Ｎ）と、音
声バッファ１６７に格納されている符号化区間における
零入力応答減算信号ｚ（ｎ）の相互相関Ｃ_d，自己相関
Ｇ_dをそれぞれ算出し、誤差電力相当の評価関数Ｃ_d ²
／Ｇ_dを算出する。これら適応コードベクトル生成回路
１６２，合成フィルタ１６３，および評価関数算出回路
１６４の処理を遅延符号可変回路１６１の可変対象の全
遅延符号に対して行った後、最適遅延符号決定回路１６
５で評価関数Ｃ_d ²／Ｇ_dを最大にする遅延符号ｄを最
適遅延符号ＣＤとして決定する。Next, the operation of the long-term prediction circuit 16 will be described with reference to FIG.
1 sets the delay code d by changing the delay code corresponding to the signal SW by the size of the adaptive codebook. The delay code d is preferably a code representing a decimal point delay, but may be a code representing an integer delay. Next, the adaptive code vector generation circuit 1
62 is an adaptive code vector e corresponding to the delay code d.
_d (n) is generated from the codebook 166. Next, the synthesis filter 163 generates an adaptive code vector e _d (n) is the zero-state synthesis signal for receiving the weighted adaptive code vector H · e _d (n). Evaluation function calculation circuit 16
4 is a weighted adaptive code vector H · e _d (N), the cross-correlation C _d of the zero input response subtraction signal z (n) in the encoding section is stored, the autocorrelation G _d is calculated respectively in the audio buffer 167 , The evaluation function C _d ² corresponding to the error power
/ _Gd is calculated. After the processes of the adaptive code vector generation circuit 162, the synthesis filter 163, and the evaluation function calculation circuit 164 are performed on all delay codes to be changed by the delay code variable circuit 161, the optimum delay code determination circuit 16
In step 5, the delay code d that maximizes the evaluation function C _d ² / G _d is determined as the optimum delay code CD.

【００２３】[0023]

【発明が解決しようとする課題】上述した従来の第１の
音声符号化装置は、長期予測のコードブック探索時の演
算量が非常に多く、低ビットレートで良好な音質の音声
符号化の実現が困難であるという欠点があった。The above-mentioned first conventional speech coding apparatus requires a large amount of computation when searching for a long-term prediction codebook, and realizes speech coding with good sound quality at a low bit rate. There is a drawback that it is difficult.

【００２４】上記演算量の低減を図った従来の第２の音
声符号化装置でも、演算量はかなり多く、低ビットレー
トで良好な音質の音声符号化装置の実現にはより一層の
演算量の低減化を要するという問題点があった。Even in the conventional second speech coding apparatus in which the amount of calculation is reduced, the amount of calculation is considerably large, and further reduction in the amount of calculation is required for realizing a speech coding apparatus having a low bit rate and good sound quality. There was a problem that reduction was required.

【００２５】本発明の目的は、長期予測のときの演算量
を低減化し、４ｋｂ／ｓ程度の低ビットレートで音質の
良好な音声符号化装置を実現することにある。It is an object of the present invention to reduce the amount of computation in long-term prediction and to realize a speech encoding device with a low bit rate of about 4 kb / s and good sound quality.

【００２６】[0026]

【課題を解決するための手段】本発明の音声符号化装置
は、予め定めたフレーム長の音声信号を分析しこの音声
信号のスペクトルパラメータを抽出して対応の短期予測
符号を生成する音声分析手段と、前記フレーム長をさら
に予め定めたサブフレーム長に分割しこのサブフレーム
長単位の過去の音源信号を蓄積した適応コードブック格
納手段と、前記適応コードーブック格納手段から読出し
た前記音源信号を予め定めたピッチ周期の範囲で遅延さ
せ評価尺度である重み付け２乗誤差が最小となる遅延値
と遅延させた前記過去の音源に対するゲインとを算出し
て前記音声信号のピッチ相関を表す最適な遅延符号であ
る適応コードベクトルを探索して長期予測を行う長期予
測手段と、前記長期予測後の残差信号を示す量子化符号
である音源コードベクトルを蓄積する音源コードブック
格納手段と、前記音源コードブックから最適な音源コー
ドベクトルを決定する音源コードブック探索手段とを備
える音声符号化装置において、前記長期予測手段が、前
記適応コードブック格納手段探索のための前記過去の音
源信号と前記残差信号と重み付け信号を含む信号を蓄積
するバッファであるコードブックと、前記遅延符号を前
記コードブックのサイズ内で間引ながら可変させること
により低減した処理対象の遅延符号である間引き遅延符
号を生成する遅延符号間引可変手段と、前記間引き遅延
符号を入力しこの間引き遅延符号に対応する前記適応コ
ードベクトルの候補を前記コードブックから生成する適
応コードベクトル生成手段と、前記適応コードベクトル
の候補を入力し零状態合成信号である重み付け適応コー
ドベクトルを生成する合成フィルタ手段と、符号化処理
を行う区間である符号化区間の音声信号を格納した音声
バッファと、前記重み付け適応コードベクトルと前記音
声バッファに格納されている前記符号化区間における零
入力応答減算信号との相互相関と自己相関とから符号化
音声信号の前記音声信号に対する誤差電力相当の評価関
数を算出する評価関数算出手段と、前記重み付け２乗誤
差の最小化に対応するよう前記評価関数を最大とする前
記間引き遅延符号を前記最適遅延符号と決定する最適遅
延符号決定手段とを備えて構成されている。SUMMARY OF THE INVENTION A speech encoding apparatus according to the present invention analyzes a speech signal having a predetermined frame length, extracts a spectrum parameter of the speech signal, and generates a corresponding short-term prediction code. Adaptive codebook storage means for further dividing the frame length into predetermined subframe lengths and storing past excitation signals of this subframe length unit; and the excitation signal read from the adaptive codebook storage means. An optimal delay code representing a pitch correlation of the voice signal by calculating a delay value that minimizes a weighted square error as an evaluation scale by delaying within a predetermined pitch period and a gain for the delayed past sound source. A long-term prediction means for searching for an adaptive code vector that is a long-term prediction, and an excitation code that is a quantization code indicating a residual signal after the long-term prediction. A sound source code book storage means for storing vector, in the speech coding apparatus and a sound source code book search means for determining the optimum excitation code vector from the excitation codebook, said long-term prediction means, before
The past sound for searching for an adaptive codebook storage means
Stores a signal including a source signal, the residual signal, and a weighted signal
A codebook is a buffer that, before the delay code
A thinning-out delay code that is a processing-target delay code reduced by making it variable while thinning out within the codebook size.
Variable sign thinning variable means for generating a signal , and said thinning delay
Code and inputs the adaptive code corresponding to the thinning-out delay code.
Code vector candidates from the codebook.
Adaptive code vector generating means, and the adaptive code vector
Weighted adaptive code that is a zero-state synthesized signal
Synthesis filter means for generating a code vector and encoding processing
That stores the audio signal of the coding section that is the section where
A buffer, the weighted adaptive code vector and the sound
Zero in the coding interval stored in the voice buffer
Coding from cross-correlation with input response subtracted signal and auto-correlation
An evaluation function corresponding to error power of an audio signal with respect to the audio signal.
And evaluation function calculation means to calculate the number, the weighting 2 Noayama
Before maximizing the evaluation function to correspond to the minimization of the difference
Optimum delay for determining a thinning-out delay code as the optimal delay code
And an extension code determining means .

【００２７】[0027]

【実施例】次に、本発明を特徴ずける第１の実施例の長
期予測回路１６Ａの構成をブロックで示す図１を参照す
ると、この長期予測回路１６Ａは従来の技術の長期予測
回路１６の適応コードベクトル生成回路１６２と、合成
フィルタ１６３と、評価関数算出回路１６４と、最適遅
延符号決定回路１６５と、コードブック１６６と、音声
バッファ１６７とに加えて、遅延符号可変回路１６１の
代りに適応コードブックの遅延符号を均一に間引きなが
ら変化させる遅延符号均一間引可変回路１６８を備え
る。FIG. 1 is a block diagram showing the configuration of a long-term prediction circuit 16A according to a first embodiment of the present invention. In addition to the adaptive code vector generation circuit 162, the synthesis filter 163, the evaluation function calculation circuit 164, the optimum delay code determination circuit 165, the code book 166, and the audio buffer 167, the adaptive code vector generation circuit 162 is used instead of the delay code variable circuit 161. There is provided a delay code uniform thinning-out variable circuit 168 for changing the delay code of the code book while thinning it out uniformly.

【００２８】次に、図１を参照して本実施例の動作につ
いて説明すると、まず、遅延符号均一間引可変回路１６
８は遅延符号をコードブックのサイズ内で均一に間引き
ながら変化させ処理対象の遅延符号ｄを設定し、各遅延
符号ｄ毎に適応コードベクトル生成回路１６２による適
応コードベクトルｅ_d（ｎ）生成処理，合成フィルタ１
６３による重み付け適応コードベクトルＨ・ｅ_d（ｎ）
の生成処理，評価関数算出回路１６４による評価関数Ｃ
_d ²／Ｇ_dの算出処理を行う。上記均一間引方法とし
て、奇数番目の遅延符号だけを抽出する。これら適応コ
ードベクトル生成処理，重み付け適応コードベクトル生
成処理，および評価関数算出処理，および最適遅延符号
決定処理は従来と共通であるので説明を省略する。Next, the operation of this embodiment will be described with reference to FIG.
Reference numeral 8 designates a delay code d to be processed by changing the delay code while thinning it uniformly within the size of the codebook, and generates an adaptive code vector e _d (n) by the adaptive code vector generation circuit 162 for each delay code d. , Synthesis filter 1
The adaptive code vector H · _ed (n) weighted by 63
Generation processing, evaluation function C by evaluation function calculation circuit 164
Calculation processing of _d ² / G _d is performed. As the uniform thinning method, only odd-numbered delay codes are extracted. The adaptive code vector generation processing, the weighted adaptive code vector generation processing, the evaluation function calculation processing, and the optimum delay code determination processing are common to those in the related art, and thus description thereof is omitted.

【００２９】以上の各処理を可変対象の全ての遅延符号
ｄに対して行った後、従来と同様に最適遅延符号決定回
路１６５により評価関数Ｃ_d ²／Ｇ_dを最大にする遅延
符号ｄを最適遅延符号ＣＤとして決定し出力する。After performing each of the above processes on all the delay codes d to be changed, the optimum delay code determination circuit 165 determines the delay code _d that maximizes the evaluation function C _d ² / G _d in the same manner as before. It is determined and output as the optimum delay code CD.

【００３０】このように、本実施例では、長期予測時
に、遅延符号を間引くことにより、演算量低減化を実現
する。As described above, in this embodiment, the amount of calculation can be reduced by thinning out the delay codes at the time of long-term prediction.

【００３１】次に、本発明の第２の実施例の長期予測回
路１６Ｂの構成を図１と共通の構成要素には共通の参照
文字／数字を付して同様にブロックで示す図２を参照す
ると、この長期予測回路１６Ｂの第１の実施例の長期予
測回路１６Ａとの相違点は、遅延符号均一間引可変回路
１６８の代りに適応コードブックの遅延符号を非均一に
間引きながら遅延変化させる遅延符号非均一間引可変回
路１６９を備えることである。Next, the configuration of the long-term prediction circuit 16B according to the second embodiment of the present invention will be described with reference to FIG. Then, the difference between this long-term prediction circuit 16B and the long-term prediction circuit 16A of the first embodiment is that the delay code is changed while thinning out the delay code of the adaptive codebook non-uniformly instead of the delay code uniform thinning-out variable circuit 168. In other words, a variable delay code non-uniform thinning variable circuit 169 is provided.

【００３２】本実施例の動作は、遅延符号非均一間引可
変回路１６９が、遅延符号をコードブックのサイズ内で
非均一に間引きながら変化させ処理対象の遅延符号ｄを
設定する他は、上述の第１の実施例と同様である。The operation of this embodiment is the same as that described above except that the delay code non-uniform thinning-out variable circuit 169 changes the delay code while thinning it non-uniformly within the size of the codebook and sets the delay code d to be processed. Is similar to the first embodiment.

【００３３】上記非均一間引き方法として、遅延が小さ
い遅延符号は全ての符号を抽出し、遅延が大きい遅延符
号は偶数番目のみを抽出する。As the non-uniform thinning-out method, a delay code having a small delay extracts all codes, and a delay code having a large delay extracts only even-numbered delay codes.

【００３４】次に、本発明の第３の実施例の長期予測回
路１６Ｃの構成を図２と共通の構成要素には共通の参照
文字／数字を付して同様にブロックで示す図３を参照す
ると、この長期予測回路１６Ｂの第２の実施例の長期予
測回路１６Ｂとの相違点は、遅延符号非均一間引可変回
路１６９に加えて、合成フィルタ１６３および音声バッ
ファ１６７の各々の出力側に入力信号を１／Ｄ（任意の
整数）に間引くサンプル間引回路１７０，１８０を備え
ることである。Next, the configuration of the long-term prediction circuit 16C according to the third embodiment of the present invention will be described with reference to FIG. Then, the difference between this long-term prediction circuit 16B and the long-term prediction circuit 16B of the second embodiment is that, in addition to the delay code non-uniform thinning-out variable circuit 169, the output side of each of the synthesis filter 163 and the audio buffer 167. In other words, sample thinning circuits 170 and 180 for thinning the input signal to 1 / D (arbitrary integer) are provided.

【００３５】本実施例の動作は、第２の実施例と同様に
遅延符号非均一間引可変回路１６９が、遅延符号をコー
ドブックのサイズ内で非均一に間引きながら変化させ処
理対象の遅延符号ｄを設定し、各遅延符号ｄ毎に適応コ
ードベクトル生成回路１６２による適応コードベクトル
ｅ_d（ｎ）生成処理，合成フィルタ１６３による重み付
け適応コードベクトルＨ・ｅ_d（ｎ）の生成処理を行な
った後、サンプル間引回路１７０によりこの重み付け適
応コードベクトルＨ・ｅ_d（ｎ）を１／Ｄ（例えばＤ＝
２）にサンプル間引を行い間引コードベクトルＨ・ｅ_d
（ｎ）’を生成する。サンプル間引回路１８０は音声バ
ッファ１６７に格納されている符号化区間における零入
力応答減算信号ｚ（ｎ）を同様に１／Ｄにサンプル間引
を行い間引零入力応答減算信号ｚ（ｎ）’を生成する。
評価関数算出回路１６４はこれら間引コードベクトルＨ
・ｅ_d（ｎ）’と間引零入力応答減算信号ｚ（ｎ）’と
の相互相関Ｃ_d，自己相関Ｇ_dをそれぞれ算出し、これ
ら相互，自己相関Ｃ_d，Ｇ_dから実施例２と同様に評価
関数Ｃ_d ²／Ｇ_dを算出する。上記サンプル間引の実行
は、ロウパスフィルタによるダウンサンプリングによる
か単純な間引により行う。以上の他の処理は、上述の第
２の実施例と同様に行う。The operation of this embodiment is similar to that of the second embodiment, except that the delay code non-uniform thinning-out variable circuit 169 changes the delay code while thinning it out non-uniformly within the size of the codebook and changes the delay code to be processed. d, the adaptive code vector generation circuit 162 generates an adaptive code vector e _d (n) for each delay code d, and the synthesis filter 163 generates a weighted adaptive code vector H · _ed (n). Then, the weighted adaptive code vector H · _ed (n) is 1 / D (for example, D =
Thinning code vector H · e _d performs sample thinning 2)
(N) ′ is generated. The sample thinning circuit 180 thins out the sample of the zero input response subtracted signal z (n) in the coding section stored in the audio buffer 167 to 1 / D in the same manner, and thins out the zero input response subtracted signal z (n). 'Is generated.
The evaluation function calculation circuit 164 calculates these thinning code vectors H
The second embodiment calculates a cross-correlation C _d and an auto-correlation G _d between e _d (n) ′ and a decimation zero input response subtracted signal z (n) ′, respectively, and calculates the second embodiment from the cross-correlation and auto-correlation C _d and G _d. Similarly, the evaluation function C _d ² / G _d is calculated. The execution of the sample thinning is performed by downsampling using a low-pass filter or by simple thinning. The other processes are performed in the same manner as in the second embodiment.

【００３６】本実施例では、相関計算の実行時に、サン
プルを間引いた信号を用いることにより更なる演算量低
減化を実現する。In the present embodiment, a further reduction in the amount of calculation is realized by using a signal from which samples have been thinned out during execution of the correlation calculation.

【００３７】次に、本発明の第４の実施例の長期予測回
路１６Ｄの構成を図３と共通の構成要素には共通の参照
文字／数字を付して同様にブロックで示す図４を参照す
ると、この長期予測回路１６Ｄの第３の実施例の長期予
測回路１６Ｃとの相違点は、評価関数算出回路１６４の
出力の評価関数Ｃ_d ²／Ｇ_dを基準としてＭ個の最適遅
延符号ＣＤに対する候補すなわち遅延符号候補を予備選
択する遅延符号候補決定回路１７１と、最適遅延符号決
定回路１６５に代りこれらＭ個の遅延符号候補から最適
遅延符号ＣＤを決定する最終遅延符号決定回路１７２と
を備える。Next, the construction of the long-term prediction circuit 16D according to the fourth embodiment of the present invention will be described with reference to FIG. Then, the difference between the long-term prediction circuit 16D and the long-term prediction circuit 16C of the third embodiment is that M optimum delay codes CD based on the evaluation function C _d ² / G _d of the output of the evaluation function calculation circuit 164 are used as a reference. , And a final delay code determination circuit 172 that determines an optimal delay code CD from these M delay code candidates instead of the optimal delay code determination circuit 165. .

【００３８】本実施例の動作について説明すると、遅延
符号非均一間引回路１６９の可変対象の遅延符号ｄの全
てについての第３の実施例と同様の処理結果得られた評
価関数評価関数Ｃ_d ²／Ｇ_dは遅延符号候補決定回路１
７１に供給される。遅延符号候補決定回路１７１は、評
価関数評価関数Ｃ_d ²／Ｇ_dを基準にＭ（例えばＭ＝
５）個の遅延符号候補Ｄ１〜ＤＭ（＝５）を予備選択す
る。最終遅延符号決定回路１７２は、これら遅延符号候
補Ｄ１〜Ｄ５の中から、従来と同様の探索手法で最適遅
延符号ＣＤを決定する。The operation of the present embodiment will be described. The evaluation function evaluation function C _d obtained as a result of the same processing as in the third embodiment for all of the variable delay codes d of the delay code non-uniform thinning circuit 169 will be described. ² / _Gd is the delay code candidate decision circuit 1
71. The delay code candidate determination circuit 171 uses the evaluation function evaluation function C _d ² / G _d as a reference for M (for example, M =
5) Preliminarily select the delay code candidates D1 to DM (= 5). The final delay code determination circuit 172 determines an optimum delay code CD from among the delay code candidates D1 to D5 by a search method similar to the related art.

【００３９】本実施例では、サンプルの間引きを最適遅
延符号の予備選択に用い、予備選択された遅延符号候補
対応の遅延符号を用いて、従来と同様の探索を行うこと
により、精度を保持したまま演算量低減化を実現でき
る。In this embodiment, the precision is maintained by performing the same search as in the prior art using the thinning of the samples for the preliminary selection of the optimum delay code and using the delay code corresponding to the preselected delay code candidate. The amount of calculation can be reduced as it is.

【００４０】以上、本発明の実施例を説明したが、本発
明は上記実施例に限られることなく種々の変形が可能で
ある。Although the embodiments of the present invention have been described above, the present invention is not limited to the above embodiments, and various modifications can be made.

【００４１】例えば、評価関数として相互相関²／自己
相関を用いたが、相互相関²でも同様の効果が得られ
る。また、適応コードベクトルを合成した重み付け適応
コードベクトルを用いているが、合成処理を行わないま
まの適応コードベクトルそのものを用いても同様の効果
が得られる。さらに、音声バッファに格納する信号を零
状態減算信号とする代りに入力音声信号や残差信号や聴
感上の重み付け信号を用いてもよい。さらにまた、コー
ドブックとして過去の音源信号を用いる代りに、過去の
入力音声信号や残差信号や聴感上の重み付け信号を用い
ても同様の効果が得られる。さらに、相互相関や自己相
関の算出に合成フィルタＨでまともにフィルタリングす
る方法を用いたが、近似式を用いる方法でも同様の効果
が得られる。この近似法は、例えば、１９８６年アメリ
カで出版されたアイキャスプ８６・プロシーディング
（ＩＣＡＳＳＰｐｒｏｃｅｅｄｉｎｇｓ８６）第２
３７５〜２３７８頁所載の論文トランスコおよびアタ
ル，エフィシエントプロセジャーズ・フォー・ファイン
ディング・ジ・オプチマムイノベーション・イン・スト
キャスティックコーダ（ＩＭ．Ｔｒａｎｓｃｏ，Ｂ．
Ｓ．Ａｔａｌ，”ＥＦＦＩＣＩＥＮＴＰＲＯＣＥＤＵＲ
ＥＳＦＯＲＦＩＮＤＩＮＧＴＨＥＯＰＴＩＭＵ
ＭＩＮＮＯＶＡＴＩＯＮＩＮＳＴＯＣＡＳＴＩＣ
ＣＯＤＥＲＳ”）等を参照できる。また、最適遅延符号
を１個に絞る代りに複数候補を求め次のステップで本選
択を行うことや、以後のコードブック探索で同時最適探
索を行うことも同様の効果が得られる。さらに、ＬＰＣ
分析回路を用いて説明を行ったが、スペクトルパラメー
タを抽出するＢＵＲＧ法等の他の分析法を用いても同様
の効果が得られる。また、ＬＰＣ係数を用いて説明した
が、ＰＡＲＣＯＲ係数やＬＳＰ係数のような他のスペク
トルパラメータでも同様な効果が得られることは明白で
ある。さらにまた、音源コードブック探索回路を１段構
成とする代りに、多段構成にすることも、本発明の主旨
を逸脱しない限り適用できることは勿論である。For example, although the cross-correlation ² / auto-correlation is used as the evaluation function, the same effect can be obtained with the cross-correlation ² . Although the weighted adaptive code vector obtained by combining the adaptive code vectors is used, the same effect can be obtained by using the adaptive code vector itself without performing the combining process. Further, instead of using the signal stored in the audio buffer as the zero-state subtraction signal, an input audio signal, a residual signal, or a weighting signal for audibility may be used. Furthermore, the same effect can be obtained by using a past input speech signal, a residual signal, or a weighting signal on audibility instead of using a past sound source signal as a codebook. Furthermore, although the method of filtering with the synthesis filter H is used for calculating the cross-correlation and the auto-correlation, the same effect can be obtained by a method using an approximate expression. This approximation method is described, for example, in ICASP86 Proceedings 86, published in the United States in 1986.
Transco and Atal, Efficient Procedures for Finding the Optimum Innovation in Stochastic Coder, pp. 375-2378 (IM. Transco, B .;
S. Atal, "EFFICIENTPROCEDUR
ES FOR FINDING THE OPTIMU
MINNOVATION IN STOCASTIC
CODERS "), etc. Also, instead of narrowing down the optimal delay code to one, a plurality of candidates are obtained and the main selection is performed in the next step, and the simultaneous optimal search is performed in the subsequent codebook search. The effect can be obtained.
Although the description has been made using the analysis circuit, the same effect can be obtained by using another analysis method such as the BURG method for extracting a spectrum parameter. Although the description has been made using the LPC coefficient, it is apparent that a similar effect can be obtained with other spectral parameters such as the PARCOR coefficient and the LSP coefficient. Furthermore, it is needless to say that a multi-stage sound source codebook search circuit can be applied instead of a single-stage sound source code book search circuit without departing from the gist of the present invention.

【００４２】[0042]

【発明の効果】以上説明したように、本発明の音声符号
化装置は、長期予測手段が遅延符号を間引ながら可変さ
せる遅延符号間引可変手段を備えるので、上記遅延符号
を間引くことにより探索対象の遅延符号数が低減される
ため、適応コードブック探索における演算量が低減化さ
れるという効果がある。As described above, the speech coding apparatus of the present invention includes the delay code thinning-out variable means for changing the delay code while thinning out the delay code. Since the number of target delay codes is reduced, the amount of calculation in adaptive codebook search is reduced.

[Brief description of the drawings]

【図１】本発明の音声符号化装置の第１の実施例を示す
長期予測回路のブロック図である。FIG. 1 is a block diagram of a long-term prediction circuit showing a first embodiment of a speech encoding apparatus according to the present invention.

【図２】本発明の音声符号化装置の第２の実施例を示す
長期予測回路のブロック図である。FIG. 2 is a block diagram of a long-term prediction circuit showing a second embodiment of the speech coding apparatus of the present invention.

【図３】本発明の音声符号化装置の第３の実施例を示す
長期予測回路のブロック図である。FIG. 3 is a block diagram of a long-term prediction circuit showing a third embodiment of the speech coding apparatus of the present invention.

【図４】本発明の音声符号化装置の第４の実施例を示す
長期予測回路のブロック図である。FIG. 4 is a block diagram of a long-term prediction circuit showing a fourth embodiment of the speech coding apparatus of the present invention.

【図５】音声符号化装置の全体を示すブロック図であ
る。FIG. 5 is a block diagram illustrating an entire speech encoding apparatus.

【図６】従来の音声符号化装置の一例を示すブロック図
である。FIG. 6 is a block diagram illustrating an example of a conventional speech encoding device.

[Explanation of symbols]

１符号化部２復号化部３伝送路１１バッファ回路１２ＬＰＣ分析回路１３パラメータ量子化回路１４重み付け回路１５，２２適応コードブック１６長期予測回路１７，２３音源コードブック１８音源コードブック探索回路１９，２４ゲインコードブック４０ゲインコードブック探索回路４１マルチプレクサ２１デマルチプレクサ２５合成フィルタ１６１遅延符号可変回路１６２適応コードベクトル生成回路１６３合成フィルタ１６４評価関数算出回路１６５最適遅延符号決定回路１６６コードブック１６７音声バッファ１６８遅延符号均一間引可変回路１６９遅延符号非均一間引可変回路１７０，１８０サンプル間引回路１７１遅延符号候補決定回路１７２最終遅延符号決定回路 DESCRIPTION OF SYMBOLS 1 Encoding part 2 Decoding part 3 Transmission line 11 Buffer circuit 12 LPC analysis circuit 13 Parameter quantization circuit 14 Weighting circuit 15,22 Adaptive codebook 16 Long-term prediction circuit 17,23 Sound source codebook 18 Sound source codebook search circuit 19, Reference Signs List 24 gain codebook 40 gain codebook search circuit 41 multiplexer 21 demultiplexer 25 synthesis filter 161 delay code variable circuit 162 adaptive code vector generation circuit 163 synthesis filter 164 evaluation function calculation circuit 165 optimal delay code determination circuit 166 codebook 167 audio buffer 168 Delay code uniform thinning variable circuit 169 Delay code non-uniform thinning variable circuit 170, 180 Sample thinning circuit 171 Delay code candidate determination circuit 172 Final delay code determination circuit

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平４−301900（ＪＰ，Ａ) 特開平５−11799（ＪＰ，Ａ) 特開平５−61499（ＪＰ，Ａ) 特開平５−158497（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 9/14 H04B 14/04 H03M 7/30 ──────────────────────────────────────────────────続き Continuation of the front page (56) References JP-A-4-301900 (JP, A) JP-A-5-11799 (JP, A) JP-A-5-61499 (JP, A) 158497 (JP, A) (58) Field surveyed (Int. Cl. ⁷ , DB name) G10L 19/00-9/14 H04B 14/04 H03M 7/30

Claims

(57) [Claims]

1. A speech analyzing means for analyzing a speech signal having a predetermined frame length, extracting a spectrum parameter of the speech signal and generating a corresponding short-term prediction code, and further comprising a sub-frame length having a predetermined frame length. And an adaptive codebook storing means for storing the past sound source signal of this subframe length unit, and the sound source signal read from the adaptive codebook storing means is delayed by a predetermined pitch cycle as an evaluation scale. Calculate a delay value that minimizes a weighted square error and a gain for the delayed past sound source, search for an adaptive code vector that is an optimal delay code representing a pitch correlation of the voice signal, and perform long-term prediction. Long-term prediction means, and excitation codebook storage means for storing an excitation code vector which is a quantization code indicating the residual signal after the long-term prediction And a sound source codebook searching means for determining an optimum sound source code vector from the sound source codebook, wherein the long-term predicting means is used for searching for the adaptive codebook storing means.
Including the past sound source signal, the residual signal, and the weighting signal
A codebook which is a buffer for accumulating signals, and a delay code to be processed which is reduced by changing the delay code while thinning it within the size of the codebook.
Delay code thinning variable means for generating a certain thinning delay code
And the thinning-out delay code is input and corresponds to the thinning-out delay code.
The candidate of the adaptive code vector to be
Adaptive code vector generating means for generating from said adaptive
A candidate for a code vector is input and
Synthesis filter means for generating a matching adaptive code vector
And the audio signal of the coding section, which is the section where the coding process is performed,
Stored in the audio buffer, the weighted adaptive code vector and the audio buffer.
Zero input response subtraction in the stored coding interval
Before the coded speech signal from the cross-correlation with the signal and the auto-correlation
Calculate the evaluation function equivalent to the error power for the voice signal
Evaluation function calculating means; and the evaluation function corresponding to minimization of the weighted square error.
The decimation delay code that maximizes the function is replaced with the optimal delay code
An audio encoding apparatus comprising: an optimal delay code determining means for determining a signal .

2. A speech encoding apparatus according to claim 1, wherein said delay code thinning-out variable means includes a uniform thinning means for thinning out said delay code uniformly for every predetermined number.

3. The non-uniform thinning-out means according to claim 1, wherein said delay code thinning-out variable means comprises a non-uniform thinning-out means for thinning out said delayed codes in accordance with a predetermined non-uniform thinning-out target code determining method. Audio coding device.

4. The long-term predicting means thins out each of the speech signal and the weighted adaptive code vector corresponding to the delayed code, which are supplied to an evaluation function calculating means for calculating the weighted square error, at a predetermined rate. The speech encoding apparatus according to claim 1, further comprising first and second sample thinning means.

5. A delay candidate determining means for selecting a predetermined number of delay candidates which are candidates for the adaptive code vector, an optimum delay code corresponding to the adaptive code vector from the delay candidates. 2. The speech encoding apparatus according to claim 1, further comprising: a final delay code determination unit that determines the final delay code.