JPH06138896A

JPH06138896A - Device and method for encoding speech frame

Info

Publication number: JPH06138896A
Application number: JP4160233A
Authority: JP
Inventors: William C Yip; イップウィリアム・シー; David L Barron; バロンデビッド・エル
Original assignee: Motorola Inc
Current assignee: Motorola Solutions Inc
Priority date: 1991-05-31
Filing date: 1992-05-27
Publication date: 1994-05-20
Also published as: EP0516439A3; EP0516439A2

Abstract

PURPOSE: To relieve the burden of code exciting linear prediction(CELP) voice encoding in terms of a computer by selecting an adaptive code book vector having a min. error function. CONSTITUTION: An adaptive code book searcher 220' uses the inpulse sensible weighting response function H(n) of a short term LPC filter in a convolution generator 510' in order to generate a convolution signal W(n) and uses the frame of target voice X(n) which is sensibly weighted in order to execute a convolution. Then, a self correlative coefficient 552' stored in a table 555' before code book searching is used next for calculating energy as against respective candidate vectors from an adaptive code book 155 later, the result of the normalized correlation of the respective vectors is compared in a peak selector 570' and the vector Ck (m) having a max. mutual correlative value is identified as an optimum pitch cycle vector by the peak selector 570'.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、音声または他のアナロ
グ信号のデジタル符号化のための改良された手段および
方法に関し、より詳細には、コード励起またはコード駆
動線形予測符号化（ｃｏｄｅｅｘｃｉｔｅｄｌｉｎ
ｅａｒｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇ）に関す
る。FIELD OF THE INVENTION This invention relates to improved means and methods for digital encoding of speech or other analog signals, and more particularly to code excitation or code driven linear predictive coding.
ear predictive coding).

【０００２】[0002]

【従来の技術】コード励起線形予測（ＣＥＬＰ）符号化
はよく知られた音声通信のための確率的または推計的な
（ｓｔｏｃｈａｓｔｉｃ）符号化技術である。ＣＥＬＰ
符号化においては、短時間スペクトルおよび長時間ピッ
チが１組の時間変化する線形フィルタによりモデル化さ
れる。典型的な音声コーダをベースとした通信システム
においては、音声は送信されることが望まれる最も高い
周波数のほぼ２倍でＡ／Ｄ変換器によってサンプルさ
れ、たとえば、４ＫＨｚの音声帯域幅に対しては典型的
には８ＫＨｚのサンプリング周波数が使用される。ＣＥ
ＬＰ符号化は線形予測（ＬＰＣ）フィルタを励起または
駆動するためにエンコードされた励起情報を使用して音
声を合成する。フィルタへの入力として使用される、励
起または駆動（ｅｘｃｉｔａｔｉｏｎ）はホワイトガウ
ス信号のコードブックによりモデル化される。最適の励
起は候補の（ｃａｎｄｉｄａｔｅ）励起ベクトルのコー
ドブックをフレームごとのベースでサーチすることによ
り検出される。Code Excited Linear Prediction (CELP) coding is a well-known stochastic or stochastic coding technique for voice communications. CELP
In encoding, the short time spectrum and long time pitch are modeled by a set of time varying linear filters. In a typical voice coder-based communication system, voice is sampled by an A / D converter at approximately twice the highest frequency desired to be transmitted, eg, for a voice bandwidth of 4 KHz. Typically uses a sampling frequency of 8 KHz. CE
LP coding synthesizes speech using encoded excitation information to excite or drive a linear prediction (LPC) filter. The excitation or excitation used as input to the filter is modeled by a codebook of white Gaussian signals. The optimal excitation is detected by searching the codebook of candidate excitation vectors on a frame-by-frame basis.

【０００３】ＬＰＣ分析がＬＰＣパラメータを決定する
ために入力音声フレームに対して行われる。次にＬＰＣ
フィルタがテーブル、すなわち、コードブックからの種
々の候補ベクトルによって励起される時に、ＬＰＣフィ
ルタの出力をデジタル化された入力音声と比較すること
により分析が進められる。最善の候補ベクトルが候補励
起ベクトルを使用して合成された音声がどれだけよく入
力音声に整合するかに基づき選択される。これは通常音
声のいくつかのサブフレームに対して行われる。LPC analysis is performed on input speech frames to determine LPC parameters. Next LPC
The analysis proceeds by comparing the output of the LPC filter with the digitized input speech as the filter is excited by various candidate vectors from the table, the codebook. The best candidate vector is selected based on how well the speech synthesized using the candidate excitation vector matches the input speech. This is usually done for several subframes of speech.

【０００４】最善の整合が検出された後、最善のコード
ブック入力（エントリ：ｅｎｔｒｙ）を特定する情報、
ＬＰＣフィルタの係数およびゲイン係数がシンセサイザ
に送信される。シンセサイザはコードブックの同じコピ
ーを有し、かつそのコードブックにおける適切なエント
リにアクセスし、それを同じＬＰＣフィルタを励起する
ために使用する。Information identifying the best codebook entry after the best match is detected,
The LPC filter coefficients and gain coefficients are sent to the synthesizer. The synthesizer has the same copy of the codebook and accesses the appropriate entry in that codebook and uses it to excite the same LPC filter.

【０００５】コードブックはそれらの構成要素が連続し
た励起サンプルであるベクトルからなる。各ベクトルは
サブフレームまたはフレームにおいて音声サンプルがあ
るのと同じ数の励起サンプルを含む。励起サンプルは数
多くの異なる発生源（ｓｏｕｒｃｅｓ）からくることが
できる。ロングタームのピッチの符号化は適応コードブ
ックからのコードベクトルの適切な選択によって決定さ
れる。適応コードブックは１組の異なるピッチ周期のあ
らかじめ合成された音声励起波形である。A codebook consists of vectors whose components are consecutive excitation samples. Each vector contains as many excitation samples as there are audio samples in a subframe or frame. Excited samples can come from a number of different sources. The encoding of the long term pitch is determined by the proper selection of code vectors from the adaptive codebook. The adaptive codebook is a set of pre-synthesized speech excitation waveforms with different pitch periods.

【０００６】コードベクトルの最適の選択は、推計的ま
たは適応的コードブックのいずれからでも、知覚的に重
み付けられたエラー関数を最小化することに依存する。
このエラー関数は典型的にはコードブックの各ベクトル
に対し合成された音声と目標音声との比較から得られ
る。これらの徹底的な比較手順は多量の計算を必要とし
かつ通常単一のデジタル信号プロセッサ（ＤＳＰ）がリ
アルタイムで実行するには実用的ではない。音声品質を
犠牲にすることなく計算の複雑さを低減することが可能
なことがデジタル通信環境においては重要である。The optimal choice of code vectors depends on minimizing the perceptually weighted error function, either from the stochastic or adaptive codebook.
This error function is typically obtained by comparing the synthesized speech for each vector in the codebook with the target speech. These exhaustive comparison procedures are computationally intensive and are usually not practical for a single digital signal processor (DSP) to perform in real time. It is important in a digital communication environment to be able to reduce computational complexity without sacrificing voice quality.

【０００７】[0007]

【発明が解決しようとする課題】エラー関数、コードブ
ックベクトルサーチ、計算は励起情報のベクトルおよび
マトリクス操作およびＬＰＣフィルタを用いて行われ
る。問題は、多数の計算、たとえば、４．８Ｋｂｐｓの
ボコーダに対して毎秒約５×１０^８の乗算−加算演算を
行わなければならないことである。従来技術の構成は行
わなければならない計算の数を低減する上で完全には成
功していなかった。従って、音声品質を犠牲にすること
なく計算機的な負担を低減する改良されたＣＥＬＰ符号
化手段および方法の必要性が存在し続ける。Error functions, codebook vector searches, and calculations are performed using vector and matrix operations of excitation information and LPC filters. The problem is that many calculations have to be performed, for example about 5 × 10 ⁸ multiply-add operations per second for a 4.8 Kbps vocoder. Prior art arrangements have not been entirely successful in reducing the number of calculations that must be performed. Therefore, there continues to be a need for improved CELP coding means and methods that reduce the computational burden without sacrificing voice quality.

【０００８】従来技術の４．８ｋビット／秒のＣＥＬＰ
符号化システムが合衆国政府のＧｅｎｅｒａｌＳｅｒ
ｖｉｃｅｓＡｄｍｉｎｉｓｔｒａｔｉｏｎによって発
行されたＦｅｄｅｒａｌＳｔａｎｄａｒｄＦＥＤ−
ＳＴＤ−１０１６に述べられている。従来技術のＣＥＬ
Ｐボコーダシステムはたとえば、Ｋｅｔｃｈｕｍ他への
米国特許第４，８９９，３８５号および第４，９１０，
７８１号、Ａｔａｌへの第４，２２０，８１９号、Ｌｉ
ｎへの第４，７９７，９２５号、およびＧｅｒｓｏｎへ
の第４，８１７，１５７号に述べられており、これらは
ここに参照のため導入される。Prior art 4.8 kbit / sec CELP
The encoding system is the US Government's General Ser
Federal Standard FED-issued by Vice Administration
STD-1016. Prior art CEL
P-vocoder systems are described, for example, in US Pat. Nos. 4,899,385 and 4,910 to Ketchum et al.
781, No. 4,220,819 to Atal, Li
n, 4,797,925, and Gerson, 4,817,157, which are hereby incorporated by reference.

【０００９】典型的な従来技術のＣＥＬＰボコーダシス
テムは８ＫＨｚのサンプリングレートおよび４つの７．
５ミリセカンドのサブフレームに分割された３０ミリセ
カンドのフレーム期間を使用する。従来技術のＣＥＬＰ
符号化は３つの基本的な機能からなる。すなわち、
（１）短時間遅延「スペクトル」予測、（２）長時間遅
延「ピッチ」サーチ、および（３）残留（ｒｅｓｉｄｕ
ａｌ）「コードブック」サーチである。A typical prior art CELP vocoder system has a sampling rate of 8 KHz and four 7.
A 30 millisecond frame period divided into 5 millisecond subframes is used. Prior art CELP
The encoding consists of three basic functions. That is,
(1) short delay "spectrum" prediction, (2) long delay "pitch" search, and (3) residual (residu)
al) "Codebook" search.

【００１０】本発明は人間の音声を表わすアナログ信号
の場合につき説明されるが、これは単に説明の便宜のた
めのものに過ぎず、かつ、ここで使用されている、「音
声（ｓｐｅｅｃｈ）」なる用語はシステムのサンプリン
グ能力内での帯域幅の任意の形式のアナログ信号を含む
ことを意図している。Although the present invention is described in the case of an analog signal representing human speech, this is merely for convenience of description and as used herein, "speech". The term is intended to include any form of analog signal of bandwidth within the sampling capability of the system.

【００１１】[0011]

【課題を解決するための手段および作用】本発明は適応
的および推計的コードブックに基づくＣＥＬＰ音声符号
化の計算機的な負担を実質的に低減する改良された手段
および方法を提供する。SUMMARY OF THE INVENTION The present invention provides improved means and methods that substantially reduce the computational burden of adaptive and stochastic codebook based CELP speech coding.

【００１２】第１の実施例においては、再帰計算ループ
がそこから最適の励起ベクルを選択するために適応コー
ドブックのベクトルの票決をするために使用される。好
ましい実施例においては、ショートタームの知覚的に重
み付けられたフィルタのインパルス関数が適応コードブ
ックにおいて各ベクトルと相関された知覚的に重み付け
られた目標音声および結果とコンボルブされかつ自己相
関されたコードブックベクトルおよび自己相関されたイ
ンパルス関数と組合わされてエラー関数を生成する。最
小のエラー関数を有する適応コードブックベクトルが選
択されて調べられている特定の音声フレーム（またはサ
ブフレーム）を表わす。In the first embodiment, a recursive computation loop is used to vote the vector of the adaptive codebook to select the optimal excitation vector from it. In the preferred embodiment, the impulse function of the short-term perceptually weighted filter is convolved and autocorrelated with the perceptually weighted target speech and the result correlated with each vector in the adaptive codebook. Combined with the vector and auto-correlated impulse function to generate an error function. The adaptive codebook vector with the smallest error function is selected to represent the particular speech frame (or subframe) being examined.

【００１３】別の実施例はさらに適応コードブックのた
めの再帰ループを、その各々がＮのエントリを有する適
応コードブックのＫのベクトルに対し行わなければなら
ない自己相関演算の数を低減することにより簡略化す
る。自己相関は最初は各コードブックベクトルにおける
Ｎの自己相関係数に対しＰ＜＜Ｎの小さな数Ｐについて
のみ行われ、かつ検出された値はすべてのＫのコードブ
ックベクトルを通して操作するために使用され入力音声
に対し最善の整合を与えるＫのコードブックベクトル
（Ｓ＜＜Ｋ）の内のＳを探す。Ｓのベクトルに対する自
己相関関数は次にＲの自己相関係数（Ｐ＜Ｒ≦Ｎ）に対
し再計算されかつＳのコードブックベクトルはＳの適応
コードブックベクトルの内のどれが入力音声に対し最善
の整合を与えるかを決定するために再評価される。Another embodiment further provides a recursive loop for the adaptive codebook by reducing the number of autocorrelation operations that must be performed on the K vectors of the adaptive codebook, each having N entries. To simplify. Autocorrelation is initially performed only for a small number P of P << N for N autocorrelation coefficients in each codebook vector, and the detected value is used to operate through all K codebook vectors. Search for S of the K codebook vectors (S << K) that give the best match to the input speech. The autocorrelation function for the S vector is then recomputed for the R autocorrelation coefficient (P <R≤N) and the S codebook vector is any of the S adaptive codebook vectors for the input speech. Re-evaluated to determine which gives the best match.

【００１４】さらに別の実施例は、フレーム長Ｌより短
い長さＭのコードブックベトクルが評価されている時に
自己相関係数を決定するために実行されなければならな
い計算の数をさらに低減する。長さＭ＜Ｌの第１のベク
トルＣ_ｋ（ｎ）の自己相関係数Ｕ_ｋ（ｍ）が計算され、
ここでｋ＝１かつｍは自己相関遅れ指数（ｌａｇｉｎ
ｄｅｘ）でありまたｎはコードブックベクトルにおける
連続するサンプルの指数（ｉｎｄｅｘ）であり、さらに
Ｌは次式による分析フレーム長である。Ｕ_ｌ（ｍ）＝（Ｌ／Ｍ）Ｕ′_ｌ（ｍ）（２）なおｍ＝０〜Ｔ＜ｍである。残りのコードブックベクト
ルの自己相関係数Ｕ_ｋ（ｍ）は次式に従って増分的に計
算され、ここでｋ≧２である。Ｕ′_ｋ（ｍ）＝［Ｕ′_ｋ−１（ｍ）＋Ｃ_ｋ（Ｍ＋ｋ−１）・Ｃ_ｋ（Ｍ＋ｋ−１＋ｍ）］（３）Ｕ_ｋ（ｍ）＝｛Ｌ／（Ｍ＋ｋ−１）｝Ｕ′_ｋ（ｍ）（４）Yet another embodiment further reduces the number of calculations that must be performed to determine the autocorrelation coefficient when a codebook vector of length M less than the frame length L is being evaluated. . The autocorrelation coefficient U _k (m) of the first vector C _k (n) of length M <L is calculated,
Here, k = 1 and m is an autocorrelation delay index (lag in
dex) and n is the index of consecutive samples in the codebook vector, and L is the analysis frame length according to U ₁ (m) = (L / M) U ′ ₁ (m) (2) Note that m = 0 to T <m. The autocorrelation coefficient U _k (m) of the remaining codebook vectors is incrementally calculated according to the following equation, where k ≧ 2. U ′ _k (m) = [U ′ _k−1 (m) + C _k (M + k−1) · C _k (M + k−1 + m)] (3) U _k (m) = {L / (M + k−1)} _U'k (m) (4)

【００１５】ただし、ｍ＝０〜Ｔ＜Ｍ、かつプロセスは
（Ｍ＋ｋ−１）＝Ｌまで反復される。Ｔ＝Ｍ−１である
ことが好ましい。得られた値Ｕ_ｌ′（ｍ）およびＵ_ｋ′
（ｍ）は示されたスケーリングファクタによってスケー
リングされ、たとえば、ｋ＝１に対し（Ｌ／Ｍ）、ｋ＝
２に対しＬ／（Ｍ＋１）、そして同様に（Ｍ＋ｋ−１）
＝Ｌまで行われる。得られた自己相関係数はコードブッ
クベクトルＣ_ｋ（ｎ）の内のどれが入力音声と比較した
時最小のエラーを生成するかを決定する上で使用され
る。However, m = 0 to T <M, and the process is repeated until (M + k−1) = L. It is preferred that T = M-1. The obtained values U _l ′ (m) and U _k ′
(M) is scaled by the indicated scaling factor, eg (L / M) for k = 1, k =
L / (M + 1) for 2, and similarly (M + k-1)
= L is performed. The resulting autocorrelation coefficient is used in determining which of the codebook vectors C _k (n) produces the smallest error when compared to the input speech.

【００１６】さらに別の実施例においては、目標音声を
模写するための最適の推計的コードブックベクトルを識
別するためにＣＥＬＰ符号化プロセスによって発生され
る他のベクトルとともに推計的コードブックのベクトル
の相関係数をより迅速かつ容易に決定するための手段お
よび方法が提供される。より詳細には、ｎ＝１からＮま
でに至る指数ｎによって識別される値を有する第１のベ
クトルＶ（ｎ）、および１組の第２のベクトルＳ
_ｋ（ｎ）であって該第２のベクトルの各々は指数ｋによ
って識別されかつ該第２のベクトルの各々はゼロ（ｚｅ
ｒｏ）または非ゼロ（ｎｏｎ−ｚｅｒｏ）であってかつ
ｎ＝１からＮに至る指数ｎによって識別されるＮまでの
値を有するもの、がＳ_ｋ（ｎ_ｉ）が非ゼロとし異なるｋ
に対しＳ_ｋ（ｎ）の指数ｎ_ｋ，ｉを識別し、指数ｎ
_ｋ，ｉに対応するＶ（ｎ）の値を加算して和Ｑ（ｋ）を
形成し、最も大きな値Ｑ（ｋ＝ｊ）に対応する値ｋ＝ｊ
を識別し、かつＳ_ｋ＝ｊ（ｎ）を使用して音声を合成す
ることにより組合わされる。In yet another embodiment, the phase of the stochastic codebook vector along with other vectors generated by the CELP encoding process to identify the optimal stochastic codebook vector for replicating the target speech. Means and methods are provided for determining the number of relationships more quickly and easily. More specifically, a first vector V (n) having a value identified by an index n ranging from n = 1 to N, and a set of second vectors S
_k (n), each of the second vectors is identified by an index k, and each of the second vectors is zero (ze).
ro) or non-zero and having values up to N identified by the exponent n from n = 1 to N, where _k is different from S _k (n _i ).
To the index n _{k, i} of S _k (n) for
The values of V (n) corresponding to _{k and i} are added to form a sum Q (k), and the value k = j corresponding to the largest value Q (k = j).
, And combine them by synthesizing the speech using S _{k = j} (n).

【００１７】好ましい実施例においては、前記組の第２
のベクトルの連続するベクトルはオーバラップ量Δｋ，
Δｎに従って先行する第２のベクトルのオーバラップに
より決定され、かつ前記識別および加算ステップは、Ｓ
_１（ｎ_ｉ）が非ゼロであるとしＳ_ｋ（ｎ）の指数ｎ
_１，ｉをｋ＝１に対して識別する段階、ｎ_１，ｉからス
タートしかつ前記オーバラップ量Δｋ，Δｎを使用する
段階、Ｓ_ｋ（ｎ_ｉ′）が非ゼロであるとしｋ＞１に対し
指数ｎ_ｋ，ｉ′をさらに決定する段階、およびそのよう
な指数に対するＶ（ｎ）の値およびさらに他の指数を加
算して和Ｑ（ｋ）を形成する段階を具備する。In a preferred embodiment, the second of the set
The continuous vector of the vector is the overlap amount Δk,
Determined by the overlap of the preceding second vector according to Δn, and said identifying and adding step is S
₁ (n _i ) is non-zero, the index n of S _k (n)
_{1, i} for k = 1, starting from n _{1, i} and using said overlap amounts Δk, Δn, if S _k (n _{i ′} ) is non-zero, k> 1 , For further exponents n _{k, i ′} , and for summing the values of V (n) for such exponents and further exponents to form sum Q (k).

【００１８】さらに別の実施例においては、ｎ＝１〜Ｎ
の出力、ｎ＝１〜Ｎの第１の入力、第２の入力、および
ｎ＝１〜Ｎの選択手段を有するＮ×Ｎのマルチプレクサ
がコードブックベクトルを組合わせるために使用され
る。ｎ番目の選択手段に提供される第１の論理レベルは
ｎ番目の出力をｎ番目の第１の入力に結合しかつｎ番目
の選択手段に提供される第２の論理レベルはｎ番目の出
力を前記第２の入力に結合する。前記第２の入力は便宜
的には所定の論理レベルである。前記第１のベクトルの
ｎ＝１〜Ｎの値はマルチプレクサのｎ＝１〜Ｎの第１の
入力に供給されかつ指数ｋ＝１の第２のベクトルのｎ＝
１〜Ｎの値がマルチプレクサのｎ＝１〜Ｎの選択手段に
供給される。前記第２のベクトルはｎ＝１〜Ｎの選択手
段においてｎのある値に対し第１の論理レベルを提供し
かつｎの他の値に対し第２の論理レベルを提供する。マ
ルチプレクサ出力に現われる第１のベクトルの値はアキ
ュムレータにおいて加算され和（ｓｕｍ）を提供する。
前記提供および加算段階はｋのさらに他の値に対して繰
返され、かつどの第２のベクトルが目標音声に最も近い
整合を与える和を有するかに基づき音声が合成される。In yet another embodiment, n = 1 to N
, An N × N multiplexer having n = 1 to N first inputs, a second input, and n = 1 to N selection means are used to combine the codebook vectors. The first logic level provided to the nth selection means couples the nth output to the nth first input and the second logic level provided to the nth selection means is the nth output. To the second input. The second input is expediently a predetermined logic level. The values of n = 1 to N of said first vector are fed to the first inputs of n = 1 to N of the multiplexer and n = of the second vector of index k = 1.
The values 1 to N are supplied to the n = 1 to N selection means of the multiplexer. The second vector provides a first logic level for some values of n and a second logic level for other values of n in the selection means for n = 1 to N. The values of the first vector appearing at the multiplexer output are added in the accumulator to provide the sum.
The providing and summing steps are repeated for further values of k, and speech is synthesized based on which second vector has the sum that gives the closest match to the target speech.

【００１９】好ましい実施例においては、各々の第２の
ベクトルは２つの部分に分割され、すなわち、第２のベ
クトルの０，＋１の値のロケーションに対応する値０，
１を有する第１の部分、および第２のベクトルの値０，
−１のロケーションに対応する値０，１を有する第２の
部分に分割される。各部分は望ましくはＲＯＭに記憶さ
れる。各部分に関連して上に述べられかつ同様にして動
作するもののようなマルチプレクサが設けられ、各マル
チプレクサは前記第２のベクトルのその関連する部分の
０，１の値に基づく累算された出力和を提供する。前記
提供および加算段階は上に述べたｋの各値に対し各々の
ＲＯＭ−マルチプレクサの組合わせに対して反復され、
かつ各々から生ずる和はさらに他の加算器（または減算
器）において組合わされてどのベクトルが目標音声に対
し最も近い整合を与えるかを選択する上で使用するため
の組合わされた出力を提供する。In the preferred embodiment, each second vector is divided into two parts: the value 0, which corresponds to the location of the 0, + 1 value of the second vector.
The first part having 1 and the value 0 of the second vector,
It is divided into a second part having the values 0,1 corresponding to a location of -1. Each part is preferably stored in ROM. Multiplexers are provided for each part, such as those described above and operating in a similar manner, each multiplexer providing an accumulated output based on the 0,1 value of its associated part of the second vector. Offer the sum. The providing and adding steps are repeated for each ROM-multiplexer combination for each value of k as described above,
And the sums resulting from each are further combined in another adder (or subtractor) to provide a combined output for use in selecting which vector gives the closest match to the target speech.

【００２０】[0020]

【実施例】図１は、単純化されたブロック図形式で、Ｃ
ＥＬＰ符号化を利用したボコーダ送信システムを示す。
ＣＥＬＰコーダ１００は入力音声１０２を受信しかつＣ
ＥＬＰ符号化出力信号１０４を生成する。ＣＥＬＰ符号
化信号１０４は送信経路またはチャネル１０６を介して
ＣＥＬＰデコーダ３００に送信され、そこで元の音声信
号１０２の模写（ｆａｃｓｉｍｉｌｅ）３０２が合成に
より再構成される。送信チャネル１０６は任意の形式と
することができるが、典型的には制限された帯域幅の有
線または無線通信リンクである。ＣＥＬＰコーダ１００
はしばしば「アナライザ」と称されるが、それはその機
能が元の音声１０２を最もよく表すＣＥＬＰコードパラ
メータ１０４（たとえば、コードブックベクトル、ゲイ
ン情報、ＬＰＣフィルタのパラメータ、その他）を決定
することであるためである。ＣＥＬＰデコーダ３００は
しばしばシンセサイザと称されるが、それはその機能が
入力ＣＥＬＰ符号化信号１０４に基づき出力合成音声３
０２を再生することであるためである。ＣＥＬＰデコー
ダ３００は伝統的なものでありかつ本発明の一部ではな
く、従ってこれ以上の詳細は説明しない。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT FIG. 1 is a simplified block diagram form of C
1 illustrates a vocoder transmission system utilizing ELP coding.
CELP coder 100 receives input speech 102 and
An ELP encoded output signal 104 is generated. CELP encoded signal 104 is transmitted via transmission path or channel 106 to CELP decoder 300, where a facsimile 302 of original audio signal 102 is reconstructed by synthesis. The transmission channel 106 can be of any form, but is typically a limited bandwidth wired or wireless communication link. CELP coder 100
Is often referred to as an "analyzer", whose function is to determine the CELP code parameters 104 (eg, codebook vectors, gain information, LPC filter parameters, etc.) that best represent the original speech 102. This is because. CELP decoder 300 is often referred to as a synthesizer, whose function is to output synthesized speech 3 based on input CELP encoded signal 104.
02 is to be reproduced. CELP decoder 300 is conventional and not part of the present invention, and thus will not be described in further detail.

【００２１】図２の（ａ）および（ｂ）は、ＣＥＬＰコ
ーダ１００を非常に詳細にかつ本発明の好ましい実施例
に従って示す。入力アナログ音声信号１０２はまずフィ
ルタ１１０によって帯域ろ波されて（ｂａｎｄ−ｐａｓ
ｓｅｄ）エイリアシング（ａｌｉａｓｉｎｇ）を防止す
る。帯域ろ波されたアナログ音声信号１１１は次にアナ
ログ−デジタル（Ａ／Ｄ）コンバータ１１２によってサ
ンプルされる。サンプリングは通常ナイキストレート、
たとえば、４ＫＨｚのＣＥＬＰボコーダに対して８ＫＨ
ｚで行われる。他のサンプリングレートも使用すること
ができる。任意の適切なＡ／Ｄコンバータを使用でき
る。Ａ／Ｄコンバータ１１２からのデジタル化された信
号１１３はひとつながりのサンプル、たとえば、その振
幅が音声波形のエンベロープに対応するひとつながりの
狭いパルスからなる。2A and 2B show CELP coder 100 in greater detail and in accordance with a preferred embodiment of the present invention. The input analog audio signal 102 is first band-pass filtered by a filter 110 (band-pas).
sed) Prevents aliasing. The bandpass filtered analog audio signal 111 is then sampled by an analog-to-digital (A / D) converter 112. Sampling is usually Nike straight,
For example, 8KH for a 4K CELP vocoder
performed in z. Other sampling rates can also be used. Any suitable A / D converter can be used. The digitized signal 113 from the A / D converter 112 consists of a series of samples, for example a series of narrow pulses whose amplitude corresponds to the envelope of the speech waveform.

【００２２】テジタル化音声信号１１３は次にフレーム
またはブロックに分割され、すなわち、所定の数のデジ
タル化された音声サンプル、たとえば、フレームごとに
６０，１８０または２４０サンプル、を含む連続する時
間ブラケットに分割される。これはＣＥＬＰ処理におい
ては習慣的に「フレームレート」と称されている。他の
フレームレートも使用できる。これはフレーマ１１４に
おいて行われる。これを達成するための手段は技術上よ
く知られている。連続する音声フレーム１１５はフレー
ムメモリ１１６に記憶される。フレームメモリ１１６の
出力１１７はデジタル化音声１１５のフレーム１１７を
その機能につき以下に説明するブロック１２２，１４
２，１６２および２３５に送る。The digitized audio signal 113 is then divided into frames or blocks, ie, into consecutive time brackets containing a predetermined number of digitized audio samples, eg 60, 180 or 240 samples per frame. Will be divided. This is customarily called "frame rate" in CELP processing. Other frame rates can also be used. This is done in the framer 114. Means for achieving this are well known in the art. The consecutive audio frames 115 are stored in the frame memory 116. The output 117 of the frame memory 116 is a block 122, 14 whose function is described below for the frame 117 of the digitized audio 115.
2, 162 and 235.

【００２３】当業者はデジタル化音声のフレームはさら
にサブフレームに分割でき、かつ音声分析および合成は
サブフレームを使用して行うことができることを理解す
るであろう。ここで用いられている用語「フレーム」
は、単数または複数であっても、デジタル音声のフレー
ムおよびサブフレームの双方に言及することを考えてい
る。Those skilled in the art will appreciate that a frame of digitized speech can be further divided into subframes, and speech analysis and synthesis can be done using subframes. The term "frame" as used here
Considers to refer to both frames and subframes of digital speech, whether singular or plural.

【００２４】ＣＥＬＰコーダ１００は２つのコードブッ
ク、すなわち、適応コードブック１５５および推計的
（ｓｔｏｃｈａｓｔｉｃ）コードブック１８０を使用す
る（図２の（ｂ）を参照）。各音声フレーム１１５に対
し、コーダ１００は声道（ｖｏｃａｌｔｒａｃｔ）の
フォルマント特性を表すＬＰＣ係数１２３を計算する。
コーダ１００はまた推計的コードブック１８０および適
応的コードブック１５５の双方からの入力（ベクトル）
および関連するスケーリング（ゲイン）ファクタをサー
チし、これらは、ＬＰＣ係数１２３を有するフィルタを
励起または駆動するために使用された時、最もよく入力
音声フレーム１１７を近似するものがサーチされる。Ｌ
ＰＣケース、コードブックベクトルおよびスケーリング
（ゲイン係数）情報は処理されかつチャネルコーダ２１
０に送られ、そこでこれらは組合わされて符号化ＣＥＬ
Ｐ信号１０４を形成し、該信号１０４は経路１０６によ
ってＣＥＬＰデコーダ３００に送信される。以上のこと
が行われる処理につきより詳細に説明する。CELP coder 100 uses two codebooks, an adaptive codebook 155 and a stochastic codebook 180 (see FIG. 2 (b)). For each speech frame 115, the coder 100 calculates an LPC coefficient 123 that represents the formant characteristics of the vocal tract.
The coder 100 also inputs (vectors) from both the stochastic codebook 180 and the adaptive codebook 155.
And the associated scaling (gain) factors, which, when used to excite or drive a filter with LPC coefficients 123, are searched for the one that best approximates the input speech frame 117. L
The PC case, codebook vector and scaling (gain factor) information is processed and channel coder 21
0, where they are combined and encoded CEL
P signal 104 is formed and is transmitted to CELP decoder 300 by path 106. The processing in which the above is performed will be described in more detail.

【００２５】次に、ブロック１２２，１２５，１３０お
よび１３５を含むデータパス１２１を参照すると、ＬＰ
Ｃアナライザ１２２は入力音声フレーム１１７に応答し
てよく知られた技術を用いてＬＰＣ係数１２３を決定す
る。ＬＰＣ係数１２３はラインスペクトル対（ＬＳＰ）
またはラインスペクトル周波数（ＬＳＦ）の形式になっ
ており、これらの用語は技術上よく理解されている。Ｌ
ＳＰ１２３はコーダ１２５によって量子化され、かつ量
子化されたＬＰＣ出力信号１２６はチャネルコーダ２１
０に送信され、そこでその信号は送信チャネル１０６を
介してデコーダ３００に送信されるＣＥＬＰ信号１０４
の一部（すなわち、ＬＰＣフィルタ係数）を形成する。Referring now to data path 121, which includes blocks 122, 125, 130 and 135, LP
The C analyzer 122 is responsive to the input speech frame 117 to determine the LPC coefficient 123 using well known techniques. LPC coefficient 123 is line spectrum pair (LSP)
Or in the form of Line Spectral Frequency (LSF), these terms are well understood in the art. L
The SP 123 is quantized by the coder 125, and the quantized LPC output signal 126 is the channel coder 21.
0, where the signal is transmitted to decoder 300 via transmit channel 106.
Form a part (i.e., the LPC filter coefficient) of.

【００２６】量子化されたＬＰＣ係数１２６はデコーダ
１３０によってデコードされ、かつデコードされたＬＳ
Ｐは出力信号１３１，１３２を介してそれぞれ、データ
パス１４１および１６１に関連して説明する、スペクト
ルインバースフィルタ（ｓｐｅｃｔｒｕｍｉｎｖｅｒ
ｓｅｆｉｌｔｅｒｓ）１４５および１７０に送信さ
れ、かつ出力信号１３３を介して帯域幅拡張重み付け発
生器１３５に送信される。信号１３１，１３２および１
３３はデコードされた量子化ＬＰＣ係数に関する情報を
含む。コーダ１２５およびデコーダ１３０を実現する手
段は技術上よく知られている。The quantized LPC coefficients 126 are decoded by the decoder 130 and the decoded LS
P is a spectral inverse filter, described in connection with data paths 141 and 161, respectively, via output signals 131, 132.
se filters) 145 and 170, and via output signal 133 to bandwidth extension weighting generator 135. Signals 131, 132 and 1
33 contains information about the decoded quantized LPC coefficients. Means for implementing coder 125 and decoder 130 are well known in the art.

【００２７】帯域幅拡張重み付け発生器１３５はスケー
リングファクタ（典型的には＝０．８）を提供しかつフ
ォルマントの帯域幅拡張の機能を達成し、帯域幅拡張さ
れたＬＰＣフィルタ係数に関する情報を含む出力信号１
３６，１３７を生成する。信号１３６，１３７はそれぞ
れ、それらの機能については以下に説明するカスケード
重み付けフィルタ１５０および１７５に送信される。The bandwidth extension weight generator 135 provides a scaling factor (typically = 0.8) and performs the function of formant bandwidth extension and contains information about the bandwidth extended LPC filter coefficients. Output signal 1
36 and 137 are generated. Signals 136 and 137 are each sent to cascade weighting filters 150 and 175, whose functions are described below.

【００２８】次に、ブロック１４２，１４５および１５
０を含むデータパス１４１を参照すると、スペクトル予
測器メモリ減算器１４２は１１７を介してフレームメモ
リ１１６から到達する入力サンプル音声１１５からショ
ートタームスペクトル予測器フィルタ１９５（図２の
（ｂ）を参照）における前の状態１９６（すなわち、直
前のフレームにより残されたもの）を減算する。減算器
１４２は音声残留信号１４３を提供し該信号１４３はデ
ジタル化入力音声１１５から技術上フィルタリンギング
信号またはフィルタリングダウンと称されるものを減算
したものである。フィルタリンギング信号は与えられた
音声フレームに関連するフィルタ（たとえば、図２の
（ｂ）のＬＰＣフィルタ１９５）を駆動するために使用
されるインパルスがそのフレームの終りまでに完全に消
散せず、後続のフレームに広がるフィルタ駆動または励
起（すなわち、「リンギング」）を引き起こし得るため
に生ずる。Next, blocks 142, 145 and 15
Referring to the data path 141 including 0, the spectrum predictor memory subtractor 142 receives the short-term spectrum predictor filter 195 from the input sample speech 115 arriving from the frame memory 116 via 117 (see (b) of FIG. 2). Subtract the previous state at 196 (ie, the one left by the previous frame). Subtractor 142 provides speech residual signal 143, which is digitized input speech 115 minus what is technically referred to as a filter ringing signal or filtering down. The filter ringing signal is such that the impulse used to drive the filter associated with a given speech frame (eg, the LPC filter 195 of FIG. 2B) has not completely dissipated by the end of that frame, Occurs because it may cause a filter drive or excitation (ie, "ringing") that spans the frame of.

【００２９】このリンギング信号は後続のフレームにお
いてひずみとして現れるが、その理由はこのリンギング
信号がそのフレームの音声内容と無関係であるためであ
る。もし該リンギング信号が除去されなければ、それは
コードパラメータの選択に影響を与えかつデコーダ３０
０によって合成される音声の品質を劣化させる。The ringing signal appears as distortion in subsequent frames because it is independent of the speech content of that frame. If the ringing signal is not removed, it affects the selection of code parameters and the decoder 30.
0 deteriorates the quality of speech synthesized.

【００３０】音声１１５からフィルタリンギング信号１
９６をマイナス（減算）したものに関する情報を含む音
声残留または残差信号１４３はデコーダ１３０から信号
１３１とともにスペクトルインバースフィルタ１４５に
供給される。フィルタ１４５は典型的にはゼロフィルタ
として実施できるが（すなわち、Ａ（ｚ）＝Ａ_０＋Ａ
_１ｚ ^−１＋…＋Ａ_ｎｚ^−ｎであり、ここで各ＡはＬＰＣ
フィルタの係数でありかつｚは該フィルタの「Ｚ変換」
である）、しかし技術上よく知られた他の手段も使用で
きる。信号１３１および１４３はフィルタ１４５におい
てＬＰＣインバースろ波された音声を生成するためにコ
ンボリューションによって組合わされる。フィルタ１４
５の出力信号１４６はカスケード重み付けフィルタ１５
０に送られる。フィルタ１５０は典型的には極（ｐｏｌ
ｅ）フィルタとして実施されるが（すなわち、１／Ａ
（ｚ／ｒ）、ただしＡ（ｚ／ｒ）＝Ａ_０＋Ａ_１ｒｚ^−１
＋…＋Ａ_ｎｒ^ｎｚ^−ｎであり、かつ各ＡはＬＰＣフィル
タ係数であり、またｒは拡張ファクタでありかつｚは該
フィルタの「Ｚ変換」である）、しかし技術上よく知ら
れた他の手段を用いることもできる。Filter ringing signal 1 from voice 115
The speech residual or residual signal 143, which contains information about minus 96 (subtracted), is provided from the decoder 130 along with the signal 131 to the spectral inverse filter 145. Filter 145 can typically be implemented as a zero filter (ie, A (z) = A ₀ + A
_1z ^-1 + _... + a A ^{n z -n,} each A LPC here
Is the coefficient of the filter and z is the "Z transform" of the filter
However, other means well known in the art can also be used. Signals 131 and 143 are combined by convolution to produce LPC inverse filtered speech at filter 145. Filter 14
5 output signal 146 is a cascade weighting filter 15
Sent to 0. Filter 150 is typically a pole.
e) implemented as a filter (ie 1 / A
(Z / r), where A (z / r) = A ₀ + A ₁ rz ⁻¹
+ ... + A _n r ⁿ z ⁻ⁿ and each A is an LPC filter coefficient, r is an expansion factor and z is the “Z transform” of the filter), but are well known in the art. Other means can also be used.

【００３１】ブロック１５０からの出力信号１５２はブ
ロック１３５から到達する帯域幅拡張されたＬＰＣ係数
信号１３６によるインパルス関数（たとえば、１，０，
０，…，０）のコンボリューションから得られる知覚的
に重み付けられたＬＰＣインパルス関数Ｈ（ｎ）であ
る。信号１３６はまたブロック１５０においてコンボリ
ューションによって信号１４６と組合わされ出力１５１
においてパス１４１から得られる知覚的に重み付けられ
た短時間遅延目標音声信号Ｘ（ｎ）を生成する。The output signal 152 from block 150 is an impulse function (eg 1, 0, 0, ...) Due to the bandwidth expanded LPC coefficient signal 136 arriving from block 135.
0, ..., 0) is the perceptually weighted LPC impulse function H (n) obtained from the convolution of 0 ,. Signal 136 is also combined with signal 146 by convolution in block 150 to output 151.
Generate the perceptually weighted short delay target audio signal X (n) obtained from path 141 at.

【００３２】重み付けフィルタ１５０の出力１５１およ
び１５２は適応コードブックサーチャ２２０に供給され
る。目標音声信号１５１（すなわち、Ｘ（ｎ））および
知覚的に重み付けされたインパルス関数信号１５２（す
なわち、Ｈ（ｎ））はサーチャ２２０および適応コード
ブック１５５によって使用されてピッチ周期（すなわ
ち、フィルタ１９５のための駆動ベクトル）およびそれ
に対するゲインであって、デジタル化入力音声フレーム
１１７に最も近く対応するものを決定する。これが達成
される方法は図３および図４に関連してより詳細に説明
する。The outputs 151 and 152 of weighting filter 150 are provided to adaptive codebook searcher 220. The target speech signal 151 (ie, X (n)) and the perceptually weighted impulse function signal 152 (ie, H (n)) are used by the searcher 220 and the adaptive codebook 155 to produce a pitch period (ie, filter 195). Drive vector) and the gain therefor that most closely correspond to the digitized input speech frame 117. The way in which this is achieved will be explained in more detail in connection with FIGS.

【００３３】次に、ブロック１６２，１６５，１７０お
よび１７５を含むデータパス１６１を参照すると、ピッ
チ予測器メモリ減算器１６２が長時間遅延ピッチ予測器
フィルタ１９０における前のフィルタ状態１９２を１１
７を介してメモリ１１６から受信されたデジタル化入力
サンプル音声１１５から減算してサンプルされた音声マ
イナス長時間遅延ピッチ予測器フィルタ１９０のリンギ
ングからなる出力信号１６３を与える。出力信号１６３
はスペクトル予測器メモリ減算器１６５に供給される。Referring now to datapath 161, which includes blocks 162, 165, 170 and 175, pitch predictor memory subtractor 162 sets previous filter state 192 in long delay pitch predictor filter 190 to 11
7 provides an output signal 163 consisting of the sampled speech minus the long delay pitch predictor filter 190 ringing subtracted from the digitized input sample speech 115 received from memory 116 via 7. Output signal 163
Is supplied to the spectrum predictor memory subtractor 165.

【００３４】スペクトルメモリ減算器１６５はブロック
１４２に関して述べたのと同様の機能を達成し、かつ短
時間遅延スペクトル予測器（「スペクトルの」）フィル
タのリンギングまたはリングダウン信号１９６をピッチ
減算器１６２を介して送信されたデジタル化入力音声フ
レーム１１７から減算する。これは、現在のフレームの
サンプルされた音声１１７から前のフレームから残され
た長時間遅延（「ピッチ」）フィルタ１９０および短時
間遅延（「スペクトル」）フィルタ１９５のリンギング
を減算したものからなる剰余出力信号１６６を生成す
る。剰余信号１６６はブロック１４５と類似したスペク
トルインバースフィルタ１７０に供給される。Spectral memory subtractor 165 performs a function similar to that described with respect to block 142, and reduces the short delay spectral predictor ("spectral") filter ringing or ringdown signal 196 to pitch subtractor 162. Subtract from digitized input speech frame 117 transmitted over. This is the remainder consisting of the sampled speech 117 of the current frame minus the ringing of the long delay (“pitch”) filter 190 and the short delay (“spectral”) filter 195 left over from the previous frame. Generate an output signal 166. The remainder signal 166 is provided to a spectral inverse filter 170 similar to block 145.

【００３５】インバースフィルタ１７０は剰余信号１６
６およびデコーダ１３０の出力１３２を受信する。信号
１３２はデコードされた量子化ＬＰＣ係数に関する情報
を含む。フィルタ１７０はコンボリューションによって
信号１６６および１３２を組合わせＬＰＣインバースろ
波音声を備えた出力信号１７１を生成する。出力信号１
７１はブロック１５０に類似したカスケード重み付けフ
ィルタ１７５に送られる。The inverse filter 170 outputs the remainder signal 16
6 and the output 132 of the decoder 130. Signal 132 contains information about the decoded quantized LPC coefficients. The filter 170 combines the signals 166 and 132 by convolution to produce an output signal 171 with LPC inverse filtered speech. Output signal 1
71 is sent to a cascade weighting filter 175 similar to block 150.

【００３６】重み付けフィルタ１７５はフィルタ１７０
からの信号１７１および帯域幅拡張重み付け発生器１３
５からの信号１３７を受信する。信号１３７は帯域幅拡
張ＬＰＣ係数に関する情報を含む。カスケード重み付け
フィルタ１７５は出力信号１７６，１７７を発生する。
フィルタ１７５は典型的には極フィルタ（すなわち、複
素面に極のみ）として実施されるが、技術上よく知られ
た他の手段も使用できる。The weighting filter 175 is the filter 170.
171 from and the bandwidth extension weighting generator 13
The signal 137 from 5 is received. Signal 137 contains information about bandwidth-enhanced LPC coefficients. Cascade weighting filter 175 produces output signals 176 and 177.
Filter 175 is typically implemented as a pole filter (ie, only poles in the complex plane), although other means well known in the art can be used.

【００３７】信号１３７，１７１はフィルタ１７５にお
いてコンボリューションによって組合わされ出力１７７
においてパス１２１から得られた知覚的に重み付けられ
たＬＰＣインパルス関数Ｈ（ｎ）を生成し、かつ出力１
７６においてパス１６１から得られた知覚的に重み付け
られた長時間遅延および短時間遅延目標音声信号Ｙ
（ｎ）を生成する。出力信号１７６，１７７は推計的ま
たは確率的（ｓｔｏｃｈａｓｔｉｃ）サーチャ２２５に
送られる。確率的サーチャ２２５は確率的コードブック
１８０を使用して最適のホワイトノイズベクトルおよび
最適のスケーリング（ゲイン）ファクタを選択し、これ
らは、所定の係数のピッチおよびＬＰＣフィルタに印加
された時、入力デジタル音声フレーム１１７に対し最善
の整合を与える。確率的サーチャ２２５は技術上よく知
られた動作を行いかつ一般に図３および図４に関連して
より詳細に説明する適応サーチャ２２０によって達成さ
れるものと類似の動作を行う。Signals 137 and 171 are combined by convolution in filter 175 and output 177.
Generate the perceptually weighted LPC impulse function H (n) obtained from path 121 at
At 76, the perceptually weighted long and short delay target speech signal Y obtained from path 161.
(N) is generated. The output signals 176, 177 are sent to a stochastic or stochastic searcher 225. The probabilistic searcher 225 uses the probabilistic codebook 180 to select the optimal white noise vector and the optimal scaling (gain) factor, which when applied to the pitch of the predetermined coefficient and the LPC filter. Gives the best match to audio frame 117. Probabilistic searcher 225 operates in a manner well known in the art and generally operates similar to that achieved by adaptive searcher 220 described in more detail in connection with FIGS.

【００３８】要約すると、チェイン１４１においては、
スペクトルインバースフィルタ１４５はＬＳＰ１３１お
よび剰余１４３を受信しかつその出力１４６をカスケー
ド重み付けフィルタ１５０に送信して出力１５２に知覚
的に重み付けされたＬＰＣインパルス関数応答Ｈ（ｎ）
を発生しかつ出力１５１に知覚的に重み付けされた短時
間遅延目標音声信号Ｘ（ｎ）を発生する。チェイン１６
１においては、スペクトルインバースフィルタ１７０は
ＬＳＰ１３２と短時間遅延および長時間遅延音声残差
（ｒｅｓｉｄｕａｌ）１６６を受信し、かつその出力１
７１を重み付けフィルタ１７５に送り出力１７７に知覚
的に重み付けられたＬＰＣインパルス関数Ｈ（ｎ）を発
生しかつ出力１７６に知覚的に重み付けられたショート
タームおよびロングターム遅延目標音声信号Ｙ（ｎ）を
発生する。In summary, in the chain 141,
Spectral inverse filter 145 receives LSP 131 and residue 143 and sends its output 146 to cascade weighting filter 150 to output 152 a perceptually weighted LPC impulse function response H (n).
And a perceptually weighted short-time delayed target speech signal X (n) at output 151. Chain 16
1, the spectral inverse filter 170 receives the LSP 132 and the short and long delay speech residuals 166 and outputs 1
71 to the weighting filter 175 to generate the perceptually weighted LPC impulse function H (n) at the output 177 and the perceptually weighted short-term and long-term delayed target speech signal Y (n) at the output 176. Occur.

【００３９】集合的に２３０とラベル付けられたブロッ
ク１３５，１５０，１７５は知覚的重み付け機能を提供
する。チェイン１２１からのデコードされたＬＳＰはブ
ロック１３５において出力１３６，１３７に帯域幅拡張
重み付けファクタを発生するために使用される。重み付
けファクタ１３６，１３７はカスケード重み付けフィル
タ１５０および１７５において知覚的に重み付けられた
ＬＰＣインパルス関数Ｈ（ｎ）を発生するために使用さ
れる。知覚的重み付けブロック２３０の各要素はＬＰＣ
係数に応答して重要な音声内容を有することが知られて
いる音声の部分を強調するマトリクスの形式のスペクト
ル重み付け情報を計算する。このスペクトル重み付け情
報１／Ａ（ｚ／ｒ）はカスケード重み付けフィルタ１５
０および１７５の有限インパル応答Ｈ（ｎ）に基づく。
有限インパルス応答関数Ｈ（ｎ）の利用はコードブック
サーチャ２２０および２２５が達成しなければならない
計算の数を大幅に低減する。スペクトル重み付け情報は
サーチャによってコードブック１５５および１８０から
の駆動情報に対する最善の候補を決定するために利用さ
れる。Blocks 135, 150, 175 collectively labeled 230 provide a perceptual weighting function. The decoded LSP from chain 121 is used at block 135 to generate a bandwidth expansion weighting factor at outputs 136 and 137. Weighting factors 136, 137 are used to generate perceptually weighted LPC impulse function H (n) in cascade weighting filters 150 and 175. Each element of the perceptual weighting block 230 is an LPC.
In response to the coefficients, compute spectral weighting information in the form of a matrix that emphasizes portions of the speech known to have significant speech content. This spectrum weighting information 1 / A (z / r) is used as the cascade weighting filter 15.
Based on finite impal response H (n) of 0 and 175.
Utilizing the finite impulse response function H (n) significantly reduces the number of calculations that the codebook searchers 220 and 225 must accomplish. The spectral weighting information is utilized by the searcher to determine the best candidate for the driving information from codebooks 155 and 180.

【００４０】図２の（ａ）および（ｂ）を引き続き参照
すると、適応コードブックサーチャ２２０はチャネルコ
ーダ２１０に送られるべき最適の適応コードブックベク
トル指数２２１および関連するゲイン２２２を発生す
る。確率的コードブックサーチャ２２５はチャネルコー
ダ２１０に送信されるべき最適の確率的コードブックベ
クトル指数２２６、および関連するゲイン２２７を発生
する。これらの信号はチャネルコーダ２１０によってエ
ンコードされる。With continued reference to FIGS. 2A and 2B, adaptive codebook searcher 220 produces an optimal adaptive codebook vector index 221 and associated gain 222 to be sent to channel coder 210. Probabilistic codebook searcher 225 produces an optimal stochastic codebook vector index 226 to be transmitted to channel coder 210 and an associated gain 227. These signals are encoded by the channel coder 210.

【００４１】チャネルコーダ２１０は５つの信号を受信
する。すなわち、コーダ１２５からの量子化ＬＳＰ１２
６、最適確率的コードブックベクトル指数２２６および
そのためのゲイン設定２２７、および適応コードブック
ベクトル指数２２１とそのためのゲイン設定２２２であ
る。チャネルコーダ２１０の出力はエンコードされたパ
ラメータの直列的なビットストリーム１０４である。ビ
ットストリーム１０４はチャネル１０６を介してＣＥＬ
Ｐデコーダ３００（図１を参照）に送られ、そこで、デ
コードの後、回復されたＬＳＰ、コードブックーベクト
ルおよびゲイン設定は同じフィルタおよびコードブック
に印加されて合成された音声３０２を生成する。Channel coder 210 receives five signals. That is, the quantized LSP 12 from the coder 125
6, optimal stochastic codebook vector index 226 and gain setting 227 therefor, and adaptive codebook vector index 221 and gain setting 222 therefor. The output of channel coder 210 is a serial bitstream 104 of encoded parameters. Bitstream 104 is CEL over channel 106
It is sent to a P-decoder 300 (see FIG. 1), where after decoding, the recovered LSP, codebook vector and gain settings are applied to the same filter and codebook to produce synthesized speech 302.

【００４２】すでに説明したように、ＣＥＬＰコーダ１
００は分析、合成および比較のプロセスによりデコーダ
３００に送信されるべき最適のＣＥＬＰパラメータを決
定する。試験的なＣＥＬＰパラメータを使用した結果は
入力音声とフレームごとに比較され最適のＣＥＬＰパラ
メータが選択できるようにしなければならない。ブロッ
ク１９０，１９５，１９７，２００，２０５および２３
５はこれを達成するためにすでに図２の（ａ）および
（ｂ）において説明したブロックと組合わせて使用され
る。選択されたＣＥＬＰパラメータ（ＬＳＰ係数、コー
ドブックベクトルおよびゲイン、その他）は出力２１１
を介してデコーダ１８２に受け渡されそこでそれらのパ
ラメータはブロック１９０，１９５，１９７，２００，
２０５および２３５に分配されかつ次にすでに述べたブ
ロック１４２，１４５，１５０，１６２，１６５，１７
０および１７５に戻る。As already explained, the CELP coder 1
00 determines the optimal CELP parameters to be sent to the decoder 300 by the process of analysis, synthesis and comparison. The results of using the trial CELP parameters must be compared on a frame-by-frame basis with the input speech so that the optimal CELP parameters can be selected. Blocks 190, 195, 197, 200, 205 and 23
5 is used to achieve this in combination with the blocks already described in Figures 2a and 2b. The selected CELP parameters (LSP coefficient, codebook vector and gain, etc.) are output 211
To the decoder 182, where those parameters are passed to blocks 190, 195, 197, 200,
205 and 235 and blocks 142, 145, 150, 162, 165, 17 already mentioned above.
Return to 0 and 175.

【００４３】ブロック１８２は信号１２６，２２１，２
２２，２２６，２２７を回復するためにコーダ２１０か
らの信号２１１をデコードする機能を有する「チャネル
デコーダ」として識別される。しかしながら、当業者は
ブロック２１０−１８２によって示されるコード−デコ
ード動作は省略できかつ信号１２６，２２１，２２２，
２２６，２２７は符号化されない形式でブロック１８２
に供給されブロック１８２は単に該信号をブロック１９
０，１９５，１９７，２００，２０５および２３５に分
配するためのバッファとして作用するようにできること
を理解するであろう。いずれの構成も満足すべきもので
あり、かつ用語「チャネルコーダ１８２」、「コーダ１
８２」または「ブロック１８２」はそのような情報を受
け渡すための構成または任意の他の手段を示すことを意
図している。Block 182 represents signals 126, 221, 2
Identified as a "channel decoder" that has the ability to decode the signal 211 from the coder 210 to recover 22, 226, 227. However, those skilled in the art can omit the code-decode operation represented by blocks 210-182 and use signals 126, 221, 222,
226 and 227 are blocks 182 in uncoded form
Block 182 simply supplies the signal to block 19
It will be appreciated that it can be made to act as a buffer for distribution to 0, 195, 197, 200, 205 and 235. Both configurations are satisfactory and the terms "channel coder 182", "coder 1"
82 "or" block 182 "is intended to indicate an arrangement or any other means for passing such information.

【００４４】デコーダ１８２の出力信号は、ブロック１
９５に送られる量子化ＬＳＰ信号１２６、ブロック１９
０に送られる適応コードブック指数信号２２１、ブロッ
ク１９０に送られる適応コードブックベクトルゲイン指
数信号２２２、ブロック１８０に送られる確率的コード
ブック指数信号２２６、およびブロック１９７に送られ
る確率的コードブックベクトルゲイン指数信号２２７で
ある。これらの信号はフィルタ１９０を駆動し、それに
より適応コードブック１５５およびフィルタ１９５に供
給される出力１９１を生成する。出力１９１はコーダ１
８２の出力１２６と組合わせてさらにフィルタ１９５を
駆動し合成された音声１９６を生成する。The output signal of the decoder 182 is the block 1
Quantized LSP signal 126 to block 95, block 19
Adaptive codebook exponent signal 221, sent to block 190, adaptive codebook vector gain exponent signal 222, sent to block 180, stochastic codebook exponent signal 226, and stochastic codebook vector gain, sent to block 197. The index signal 227. These signals drive filter 190, thereby producing output 191 that is provided to adaptive codebook 155 and filter 195. Output 191 is coder 1
In combination with output 126 of 82, filter 195 is further driven to produce synthesized speech 196.

【００４５】シンセサイザ２２８はゲイン乗算器１９
７、長時間遅延ピッチ予測器１９０、および短時間遅延
スペクトル予測器１９５、減算器２３５、スペクトルイ
ンバースフィルタ２２０およびカスケード重み付けフィ
ルタ２０５を具備する。デコードされたパラメータ１２
６，２２１，２２２，２２６および２２７を使用して、
確率的コードベクトル１７９が選択されかつゲインパラ
メータ２２６によっスケーリングされるべきゲイン乗算
器１９７に送られる。ゲイン乗算器１９７の出力１９８
は長時間遅延ピッチ予測器１９０によって使用され音声
残差１９１を発生する。フィルタ状態出力情報１９２、
これはまた技術上予測器フィルタ１９０の音声残差（ｓ
ｐｅｅｃｈｒｅｓｉｄｕａｌ）とも称されるが、フィ
ルタメモリの更新のためにピッチメモリ減算器１６２に
送られる。そのパラメータが入力ＬＰＣパラメータ信号
１２６によってセットされるＬＰＣフィルタである、短
時間遅延スペクトル予測器１９５が音声残差１９１によ
って駆動され合成デジタル音声出力１９６を生成する。
同じ音声残差信号１９１は適応コードブック１５５の更
新のために使用される。The synthesizer 228 is a gain multiplier 19
7, a long delay pitch predictor 190, a short delay spectrum predictor 195, a subtractor 235, a spectrum inverse filter 220 and a cascade weighting filter 205. Decoded parameter 12
6,221,222,226 and 227,
The stochastic code vector 179 is selected and sent to the gain multiplier 197 to be scaled by the gain parameter 226. Output 198 of gain multiplier 197
Is used by the long delay pitch estimator 190 to generate the speech residual 191. Filter status output information 192,
This is also technically the speech residual of the predictor filter 190 (s
Also referred to as "peek residual", it is sent to the pitch memory subtractor 162 for updating the filter memory. A short delay spectrum predictor 195, which is an LPC filter whose parameters are set by the input LPC parameter signal 126, is driven by the speech residual 191 to produce a synthetic digital speech output 196.
The same speech residual signal 191 is used for updating the adaptive codebook 155.

【００４６】合成音声１９６は減算器２３５によってデ
ジタル化入力音声１９７から減算されデジタル音声剰余
出力信号２３６を生成する。音声剰余２３６はスペクト
ルインバースフィルタ２００に供給され残留誤差信号２
０２を発生する。出力信号２０２はカスケード重み付け
フィルタ２０５に供給され、かつ出力フィルタ状態情報
２０６，２０７は信号パス１４１および１６１と組合わ
せて前に述べたようにカスケード重み付けフィルタ１５
０および１７５を更新するために使用される。スペクト
ルインバースフィルタ２００のフィルタ状態情報であ
る、出力信号２０１，２０３はブロック１４５，１７０
に関して前に述べたようにスペクトルインバースフィル
タ１４５および１７０を更新するために使用される。The synthesized voice 196 is subtracted from the digitized input voice 197 by the subtractor 235 to generate a digital voice remainder output signal 236. The speech residue 236 is supplied to the spectrum inverse filter 200 and the residual error signal 2 is supplied.
02 is generated. The output signal 202 is provided to the cascade weighting filter 205, and the output filter state information 206, 207 in combination with the signal paths 141 and 161 is cascade weighting filter 15 as previously described.
Used to update 0 and 175. Output signals 201 and 203, which are filter state information of the spectrum inverse filter 200, are output to blocks 145 and 170.
Used to update the spectral inverse filters 145 and 170 as described above with respect to.

【００４７】図３および図４は適応コードブックサーチ
ャ２２０の単純化したブロック図である。図３は適応コ
ードブックサーチャ２２０の適切な構成を示しかつ図４
はさらに改良された構成を示す。図４の構成が好まし
い。FIGS. 3 and 4 are simplified block diagrams of adaptive codebook searcher 220. FIG. 3 illustrates a suitable configuration of adaptive codebook searcher 220 and FIG.
Shows a further improved configuration. The configuration of Figure 4 is preferred.

【００４８】次に図３および図４を概略的に参照する
と、適応コードブック１５５の情報は前のフレームから
の励起または駆動情報である。各々のフレームに対し、
前記駆動情報はサンプルされた元の音声と同じ数のサン
プルからなる。コードブック１５５は便宜的には巡回的
なリストとして編成され、それにより新しい組のサンプ
ルが単にコードブック１５５にシフト入力されて該コー
ドブックに現在ある前のサンプルに置き変わるようにさ
れる。新しい駆動サンプルは長時間遅延ピッチ予測器１
９０の出力１９１によって提供される。Referring now generally to FIGS. 3 and 4, the information in adaptive codebook 155 is the excitation or drive information from the previous frame. For each frame,
The drive information consists of the same number of samples as the original audio sampled. The codebook 155 is conveniently organized as a cyclic list so that a new set of samples is simply shifted into the codebook 155 to replace the previous sample currently in the codebook. New driving sample is long delay pitch predictor 1
90 provided by output 191.

【００４９】コードブック１５５からの駆動情報を使用
する場合、サーチャ２２０は組（ｓｅｔｓ）で、すなわ
ち、サブフレームで処理しかつ該ベクトルをばらばらに
なったサンプルとして取扱わない。サーチャ２２０はコ
ードブック１５５におけるサンプルをリニアアレイとし
て取扱う。たとえば、６０サンプルのフレームに対し、
サーチャ２２０はコードブック１５５からのサンプル１
〜サンプル６０を使用して第１の候補の組の情報を形成
し、サンプル２〜サンプル６１を使用して第２の組の候
補の情報を形成し、以下同様である。このタイプのコー
ドブックサーチはしばしばオーバラッピング・コードブ
ックサーチと称される。本発明はコードブック１５５の
構造および機能に関するものではなく、最適のコードブ
ックベクトルを識別するためにどのようにしてコードブ
ック１５５がサーチされるかに関するものである。When using the drive information from codebook 155, searcher 220 does not process the vectors in sets, ie, in subframes, and as disjoint samples. Searcher 220 treats the samples in codebook 155 as a linear array. For example, for a 60 sample frame,
Searcher 220 is sample 1 from codebook 155
~ Sample 60 is used to form the first set of candidate information, Samples 2 to 61 are used to form the second set of candidate information, and so on. This type of codebook search is often referred to as an overlapping codebook search. The invention does not relate to the structure and function of codebook 155, but to how codebook 155 is searched to identify the optimal codebook vector.

【００５０】適応コードブックサーチャ２２０は図２の
（ｂ）における出力１９１からすでに適応コードブック
１５５に格納された前に合成されたピッチ情報１５６を
アクセスし、かつ各々のそのような組の情報１５６を利
用してブロック１５０から受信された目標駆動１５１お
よびコードブック１５５からアクセスされた駆動１５６
の間のエラー基準を最小化する。スケーリングファクタ
またはゲイン指数２２２もまた各々のアクセスされた組
の情報１５６から計算されるが、それは適応コードブッ
ク１５５に記憶された情報は人間の声または他の入力信
号のダイナミックレンジの変化を考慮に入れないからで
ある。Adaptive codebook searcher 220 accesses previously synthesized pitch information 156 already stored in adaptive codebook 155 from output 191 in FIG. 2B, and each such set of information 156. Target drive 151 received from block 150 and drive 156 accessed from codebook 155 using
Minimize the error criterion between A scaling factor or gain index 222 is also calculated from each accessed set of information 156, which the information stored in the adaptive codebook 155 allows for changes in the dynamic range of the human voice or other input signal. Because I can't enter.

【００５１】使用される好ましいエラー基準は最小２乗
予測エラー（ＭＰＳＥ）であり、これはフレームメモリ
出力１１７からの元の音声フレーム１１５と図２の
（ｂ）のブロック１９５の出力において生成される合成
音声１９６の間の誤差の２乗である。合成音声１９６は
コードブック１５５から得られた試行的駆動情報１５６
に関して計算される。エラー基準はコードブック１５５
から得られた各々の候補ベクトルまたは組の駆動情報１
５６に対して評価され、かつ最も低いエラー値を与える
特定の組の駆動情報１５６′が現在のフレーム（または
サブフレーム）のために使用される組の情報である。The preferred error criterion used is Least Squares Prediction Error (MPSE), which is produced at the original speech frame 115 from the frame memory output 117 and at the output of block 195 of FIG. 2B. It is the square of the error between the synthetic speech 196. The synthetic speech 196 is the trial drive information 156 obtained from the codebook 155.
Is calculated with respect to. The error criterion is Codebook 155
Driving information 1 for each candidate vector or set obtained from
The particular set of drive information 156 'that is evaluated for 56 and gives the lowest error value is the set of information used for the current frame (or subframe).

【００５２】サーチャ２２０が対応する最善の整合のス
ケーリングファクタまたはゲイン２２２とともに使用さ
れるべき最善の組の駆動情報１５６′を決定した後、最
善の整合の指数１５６′に対応するベクトル指数出力信
号２２１および最善の整合のスケーリングファクタ２２
２′に対応するスケーリングファクタ２２２はチャネル
エンコーダ２１０に送信される。After the searcher 220 has determined the best set of driving information 156 'to be used with the corresponding best matching scaling factor or gain 222, the vector exponential output signal 221 corresponding to the best matching exponent 156'. And the best matching scaling factor 22
The scaling factor 222 corresponding to 2'is transmitted to the channel encoder 210.

【００５３】図３は第１の実施例による適応サーチャ２
２０のブロック図を示し、かつ図４はさらに改良されか
つ好ましい実施例による適応サーチャ２２０′を示す。
適応サーチャ２２０，２２０′は適応コードブック１５
５を介してベクトル指数Ｃ_１（ｎ）…Ｃ_ｋ（ｎ）の順次
的なサーチを行う。順次的（ｓｅｑｕｅｎｔｉａｌ）サ
ーチ動作の間、サーチャ２２０，２２０′はコードブッ
ク１５５から各々の候補励起または駆動ベクトルＣ
_ｋ（ｎ）をアクセスし、ここでｋはコードブックにおけ
る特定のベクトルを識別する１からＫに至る指数であ
り、かつｎはｎ＝１からｎ＝Ｎに至る別の指数であり、
Ｎは与えられたフレーム内のサンプルの数である。典型
的なＣＥＬＰのアプリケーションにおいては、Ｋ＝２５
６または５１２または１０２４であり、かつＮ＝６０ま
たは１２０または２４０であるが、ＫおよびＮの他の値
も使用することができる。FIG. 3 shows the adaptive searcher 2 according to the first embodiment.
20 shows a block diagram of 20 and FIG. 4 shows an adaptive searcher 220 'according to a further improved and preferred embodiment.
The adaptive searchers 220 and 220 'are the adaptive codebook 15
5, a vector index C ₁ (n) ... C _k (n) is sequentially searched. During a sequential search operation, the searchers 220, 220 ′ may search for each candidate excitation or drive vector C from the codebook 155.
Access _k (n), where k is an index from 1 to K that identifies a particular vector in the codebook, and n is another index from n = 1 to n = N,
N is the number of samples in a given frame. In a typical CELP application, K = 25
6 or 512 or 1024 and N = 60 or 120 or 240, but other values of K and N can also be used.

【００５４】適応コードブック１５５は前に合成された
音声波形から決定されるセットの異なるピッチ期間また
は周期を含む。第１のサンプルベクトルは合成音声波形
Ｃ_ｋ（Ｎ）のＮ番目のサンプルからスタートし、これは
現在の最後のサンプルの合成された音声波形からＮサン
プル戻った所に位置する。人間の声では、ピッチ周波数
は一般に４０Ｈｚ〜５００Ｈｚの付近である。これは約
２００〜１６サンプルに変換される。もし分数的な（ｆ
ｒａｃｔｉｏｎａｌ）ピッチが計算に含まれておれば、
Ｋはピッチ範囲を表すために２５６または５１２となり
得る。従って、適応コードブックは１組のＫのベクトル
Ｃ_ｋ（ｎ）を含み、これらは基本的に特定の周波数の１
つまたはそれ以上のピッチ期間のサンプルである。Adaptive codebook 155 includes a set of different pitch periods or periods determined from the previously synthesized speech waveform. The first sample vector starts from the Nth sample of the synthesized speech waveform C _k (N), which is N samples back from the synthesized speech waveform of the current last sample. For human voices, the pitch frequency is generally around 40 Hz to 500 Hz. This translates to about 200-16 samples. If fractional (f
rational) If pitch is included in the calculation,
K can be 256 or 512 to represent the pitch range. Therefore, the adaptive codebook contains a set of K vectors C _k (n), which are essentially one for a particular frequency.
Samples of one or more pitch periods.

【００５５】次に図３を参照すると、適応コードブック
サーチャ２２０のコンボリューション発生器５１０は各
々のコードブックベクトルＣ_ｋ（ｎ）、すなわち、信号
１５６、を知覚的に重み付けられたＬＰＣインパルス応
答関数Ｈ（ｎ）、すなわち、カスケード重み付けフィル
タ１５０からの信号１５２でコンボルブする。コンボリ
ューション発生器５１０の出力５１２は次に相互相関器
（ｃｒｏｓｓ−ｃｏｒｒｅｌａｔｏｒ）５２０において
目標音声残差信号Ｘ（ｎ）（すなわち、図２の（ａ）お
よび（ｂ）の信号１５１）と相互相関される。コンボリ
ューションおよび相関は各々のコードブックベクトルＣ
_ｋ（ｎ）に対して行われ、ここでｎ＝１，…，Ｎであ
る。コンボリューション発生器５１０によって行われる
動作は数学的に次の式（１）によって表される。相互相関発生器５２０によって行われる動作は数学的に
以下の式（２）によって表される。 Referring now to FIG. 3, the convolution generator 510 of the adaptive codebook searcher 220 provides a perceptually weighted LPC impulse response function for each codebook vector C _k (n), or signal 156. H (n), ie, convolve with the signal 152 from the cascade weighting filter 150. The output 512 of the convolution generator 510 is then cross-correlated with the target speech residual signal X (n) (ie, signal 151 of FIGS. 2A and 2B) in a cross-correlator 520. To be done. The convolution and correlation are for each codebook vector C
_{performed on k} (n), where n = 1, ..., N. The operation performed by the convolution generator 510 is mathematically represented by equation (1) below. The operation performed by cross-correlation generator 520 is mathematically represented by equation (2) below.

【００５６】コンボリューション発生器５１０の出力５
１２はまたエネルギ計算機５３５に供給され、該エネル
ギ計算機５３５は２乗器５５２およびアキュムレータ５
５３（アキュムレータ５５３は２乗器５５２によって決
定される２乗の和を提供する）を具備する。出力５５４
は除算器（ｄｉｖｉｄｅｒ）５３０に伝達され、該除算
器５３０は信号５５１および５５５の比率を計算する。
相互相関器５２０の出力５２１は２乗器５２５に供給さ
れ、該２乗器５２５の出力はまた除算器５３０に供給さ
れる。除算器５３０の出力５３１はピーク選択回路５７
０に供給され、該ピーク選択回路５７０の機能はＣ
_ｋ（ｎ）のどの値Ｃ_ｋ（ｍ）が最善の整合を生成する
か、すなわち、最大の相互相関を決定することである。
これは数学的には式（３ａ）および（３ｂ）によって表
される。式（３ａ）はエラーＥを表す。エラーＥを最小にすることは以下の式（３ｂ）により表
される相互相関を最大にすることであり、ここでＧ_ｋは
式（４）で規定される。 Output 5 of convolution generator 510
12 is also fed to an energy calculator 535, which has a squarer 552 and an accumulator 5.
53 (accumulator 553 provides the sum of squares determined by squarer 552). Output 554
Is transmitted to a divider 530, which calculates the ratio of signals 551 and 555.
The output 521 of the cross-correlator 520 is fed to a squarer 525, the output of which is also fed to a divider 530. The output 531 of the divider 530 is the peak selection circuit 57.
0, and the function of the peak selection circuit 570 is C
_It is to determine which value C _k (m) of _k (n) produces the best match, ie the maximum cross-correlation.
This is mathematically represented by equations (3a) and (3b). Equation (3a) represents the error E. Minimizing the error E is maximizing the cross-correlation represented by equation (3b) below, where G _k is defined by equation (4).

【００５７】最適のベクトル指数Ｃ_ｋ（ｍ）の識別子
（指数）は出力２２１に伝達される。ピーク選択器５７
０の出力５７１は最善の整合のピッチベクトルＣ
_ｋ（ｍ）に関連するゲインスケーリング情報をゲイン指
数出力２２２を提供するゲイン計算機５８０に伝達す
る。ゲイン計算機５８０によって行われる演算は数学的
に以下の式（４）により表される。 The identifier (index) of the optimal vector index C _k (m) is transmitted to the output 221. Peak selector 57
The output 571 of 0 is the best matching pitch vector C
_The gain scaling information associated with _k (m) is communicated to gain calculator 580, which provides gain index output 222. The operation performed by the gain calculator 580 is mathematically represented by the following equation (4).

【００５８】出力２２１および２２２はチャネルコーダ
２１０に送られる。コンボリューション発生器５１０、
相互相関発生器５２０、２乗器５２５および５５２（こ
れらは異なる入力に対し同様の機能を達成する）、アキ
ュムレータ５５３、除算器５３０、ピーク選択器５７０
およびゲイン計算機５８０を提供するための手段はそれ
ぞれ技術上よく知られている。The outputs 221 and 222 are sent to the channel coder 210. A convolution generator 510,
Cross-correlation generator 520, squarers 525 and 552 (these perform similar functions for different inputs), accumulator 553, divider 530, peak selector 570.
And means for providing gain calculator 580 are each well known in the art.

【００５９】図３の構成は満足すべき結果を与えるが、
各々のコードブックベクトルに対し必要なコンボリュー
ションを行いかつ相関を行うために必要以上の計算を必
要とする。これはコンボリューション５１０および相関
５２０が共に各々の音声フレーム１１７に対するコード
ブック１５５におけるすべての候補ベクトルに対し行わ
なければならないためである。図３の構成におけるこの
制限は図４の構成により克服される。The configuration of FIG. 3 gives satisfactory results,
It requires more computation than necessary to perform the necessary convolution and correlation for each codebook vector. This is because both convolution 510 and correlation 520 must be done for all candidate vectors in codebook 155 for each speech frame 117. This limitation in the configuration of FIG. 3 is overcome by the configuration of FIG.

【００６０】図４の適応コードブックサーチャ２２０′
はコンボリューション信号Ｗ（ｎ）を発生するためにコ
ンボリューション発生器５１０′におけるショートター
ムＬＰＣフィルタのインパルス知覚的重み付け応答関数
Ｈ（ｎ）（すなわち、図２のブロック１５０の出力１５
２）とコンボルブするために知覚的に重み付けされた目
標音声Ｘ（ｎ）のフレーム（すなわち、図２の（ａ）お
よび（ｂ）の信号１５１）を使用する。これは入力音声
のフレーム１１７ごとに１度だけ行われる。これはコー
ドブックにおける候補ベクトルの数にほぼ等しい大きな
ファクタで計算機的な負荷を直ちに低減する。これは非
常に大きな計算機的な節約になる。コンボリューション
発生器５１０によって行われる動作は以下の式（５）に
より数学的に表現される。 Adaptive codebook searcher 220 'of FIG.
Is the impulse perceptual weighting response function H (n) of the short-term LPC filter in convolution generator 510 'to generate convolution signal W (n) (ie, output 15 of block 150 of FIG. 2).
2) Use the perceptually weighted frame of the target speech X (n) (ie, signal 151 of FIGS. 2a and 2b) to convolve with 2). This is done only once per frame 117 of the input speech. This immediately reduces the computational load by a large factor that is approximately equal to the number of candidate vectors in the codebook. This is a huge computational savings. The operation performed by the convolution generator 510 is mathematically represented by equation (5) below.

【００６１】コンボリューション発生器５１０′の出力
５１２′は次に相互相関発生器５２０′により適応コー
ドブック１５５における各ベクトルＣ_ｋ（ｎ）と相関さ
れる。相互相関発生器５２０′により行われる演算は数
学的に次の式（６）によって表される。 The output 512 'of the convolution generator 510' is then correlated by the cross-correlation generator 520 'with each vector C _k (n) in the adaptive codebook 155. The operation performed by the cross-correlation generator 520 'is mathematically represented by the following equation (6).

【００６２】出力５５１′は２乗器５２５′により２乗
され候補ベクトルＣ_ｋ（ｎ）のエネルギにより正規化さ
れた各ベクトルＣ_ｋ（ｎ）の相関の２乗である出力５２
１′を生成する。これは自己相関発生器５６０′に各々
の候補ベクトルＣ_ｋ（ｎ）（出力１５６）を提供するこ
とにより、かつその出力が引き続き操作されかつ組合わ
される自己相関発生器５５０′に対しフィルタのインパ
ルス応答Ｈ（ｎ）（出力１５２からの）を提供すること
により達成される。自己相関発生器５５０′の出力５５
２′は後にその機能を説明するルックアップテーブル５
５０′に供給される。テーブル５５５′の出力５５６′
は乗算器５４３に供給され、そこで自己相関器５６０′
の出力５６１と組合わされる。The output 551 'is the square of the correlation of each vector _Ck (n) squared by the squarer 525' and normalized by the energy of the candidate vector _Ck (n).
1'is generated. This is by providing each candidate vector C _k (n) (output 156) to the autocorrelation generator 560 ', and the impulse of the filter for the autocorrelation generator 550' whose output is subsequently manipulated and combined. This is accomplished by providing the response H (n) (from output 152). Output 55 of autocorrelation generator 550 '
2'is a lookup table 5 whose function will be described later.
50 '. Output 556 'of table 555'
Is fed to a multiplier 543, where the autocorrelator 560 '
Output 561 of the.

【００６３】乗算器５４３′の出力５４５′はアキュム
レータ５４０′に供給され、該アキュムレータ５４０′
はｎの引き続く値に対する積を加算しかつその和５４
１′を除算器５３０に送り、そこで相互相関発生器５２
０′の出力５２１′と組合わされる。自己相関器５６
０′により行われる操作は数学的に以下の式（７）によ
って記述され、かつ自己相関器５５０′により行われる
演算は数学的に次の式（８）により記述される。この場合、Ｃ_ｋ（ｎ）はｋ番目の適応コードブックベク
トルであり、各ベクトルは１からＫに至る指数ｋにより
識別され、Ｈ（ｎ）は知覚的に重み付けされたＬＰＣイ
ンパルス応答であり、Ｎは分析フレームにおけるテジタ
ル化されたサンプルの数であり、そしてｍはダミー整数
指数であり、ｎは音声フレーム内のＮのサンプルの内ど
れが考慮されているかを示す整数指数である。The output 545 'of the multiplier 543' is supplied to the accumulator 540 ', and the accumulator 540'.
Adds the products of n to successive values and sums 54
1'is sent to the divider 530, where the cross-correlation generator 52
0's output 521 'combined. Autocorrelator 56
The operation performed by 0'is mathematically described by equation (7) below, and the operation performed by autocorrelator 550 'is mathematically described by equation (8) below. In this case, C _k (n) is the k-th adaptive codebook vector, each vector is identified by an index k ranging from 1 to K, and H (n) is a perceptually weighted LPC impulse response, N is the number of digitized samples in the analysis frame, and m is a dummy integer exponent, and n is an integer exponent indicating which of the N samples in the speech frame are considered.

【００６４】上記サーチ操作は各々の候補ベクトルＣ_ｋ
（ｎ）をＭＳＰＥサーチ基準を使用して目標音声残差Ｘ
（ｎ）と比較する。コードブック１５５の出力１５６か
ら受信された各々の候補ベクトルＣ_ｋ（ｎ）は自己相関
発生器５６０′に送られ、該自己相関発生器５６０′は
候補ベクトルのすべての自己相関係数を発生して自己相
関出力信号５６１′を生成し、該信号５６１′はブロッ
ク５４３′および５４０′を具備するエネルギ計算機５
３５′に供給される。The above search operation is performed for each candidate vector C _k.
(N) target speech residual X using MSPE search criteria
Compare with (n). Each candidate vector C _k (n) received from the output 156 of the codebook 155 is sent to an autocorrelation generator 560 'which generates all the autocorrelation coefficients of the candidate vector. To produce an autocorrelation output signal 561 ', which signal 561' comprises blocks 543 'and 540'.
35 '.

【００６５】自己相関発生器５５０′はＨ（ｎ）関数の
すべての自己相関係数を発生して自己相関出力信号５５
２′を生成し、該信号５５２′はテーブル５５５′およ
び出力５５６′を介してエネルギ計算機５３５′に供給
される。The autocorrelation generator 550 'generates all the autocorrelation coefficients of the H (n) function and outputs the autocorrelation output signal 55.
2 ', which signal 552' is provided to energy calculator 535 'via table 555' and output 556 '.

【００６６】エネルギ計算機５３５′は入力信号５５
６′および５６１′を候補ベクトルＣ_ｋ（ｎ）のすべて
の自己相関係数およびカスケード重み付けフィルタ１５
０により発生される知覚的に重み付けされたインパルス
関数Ｈ（ｎ）のすべての積の項を加算することにより組
合わせる。エルネギ計算機５３５′はＣ_ｋ（ｎ）の自己
相関係数をＨ（ｎ）の自己相関係数の同じ遅延項（信号
５６１′および５５２′）と乗算するための乗算器５４
３′、および乗算器５４３′の出力を加算して除算器５
３０に送られる候補ベクトルのエネルギに関する情報を
含む出力５４１′を生成するアキュムレータ５４０′を
具備する。除算器５３０′はゲインをセットするために
使用されるエネルギ正規化を行う。候補ベクトルＣ
_ｋ（ｎ）のエルネギは候補ベクトルＣ_ｋ（ｎ）のすべて
の自己相関係数および知覚的に重み付けされたショート
タームフィルタ１５０の知覚的に重み付けされたインパ
ルス関数Ｈ（ｎ）のすべての積の項を加算することによ
り非常に効率的に計算される。ループゲインＧ_ｋを決定
するための上に述べた演算は数学的に以下の式（９）に
よって記述される。この場合、Ｃ_ｋ（ｎ）、Ｘ（ｍ）、Ｈ（ｎ）、Φ
_ｋ（ｎ）、Ｕ_ｋ（ｎ）およびＮは前に規定されており、
かつＧ_ｋはｋ番目のコードベクトルのループゲインであ
る。Energy calculator 535 'receives input signal 55
6 ′ and 561 ′ are all _{autocorrelation} coefficients of the candidate vector C _k (n) and cascade weighting filter 15
Combine by summing all product terms of the perceptually weighted impulse function H (n) generated by 0. Ernegi Calculator 535 'is a multiplier 54 for multiplying the autocorrelation coefficient of C _k (n) with the same delay terms (signals 561' and 552 ') of the autocorrelation coefficient of H (n).
3 ', and the output of the multiplier 543' is added and the divider 5
It comprises an accumulator 540 'that produces an output 541' containing information about the energy of the candidate vector sent to 30. Divider 530 'performs the energy normalization used to set the gain. Candidate vector C
_k Erunegi the all products of candidate vector C _{k (n)} of all the autocorrelation coefficients and the perceptually weighted short-term filter 150 perceptually weighted impulse function H (n) of _(n) It is calculated very efficiently by adding the terms. The above-described operation for determining the loop gain G _k is mathematically described by equation (9) below. In this case, C _k (n), X (m), H (n), Φ
_k (n), U _k (n) and N have been previously defined,
And G _k is the loop gain of the kth code vector.

【００６７】テーブル５５５′は計算機的な負担をさら
に低減できるようにする。これはインパルス関数Ｈ
（ｎ）の自己相関係数５５２′が入力音声のフレーム１
１７ごとに１度だけ計算されればよいからである。これ
はコードブックサーチの前に１行うことができかつその
結果はテーブル５５５′に記憶することができる。コー
ドブックサーチの前にテーブル５５５′に記憶された自
己相関係数５５２′は次に後に適応コードブック１５５
から各候補ベクトルに対するエネルギを計算するために
使用される。これは計算をさらに大幅に節約できるよう
にする。The table 555 'enables the computational load to be further reduced. This is the impulse function H
The autocorrelation coefficient 552 'of (n) is the frame 1 of the input speech.
This is because it only has to be calculated once for every 17. This can be done one before the codebook search and the results can be stored in table 555 '. The autocorrelation coefficient 552 'stored in the table 555' prior to the codebook search is then followed by the adaptive codebook 155 '.
Is used to calculate the energy for each candidate vector. This allows to save even more computation.

【００６８】コードブック１５５における各ベクトルの
正規化された相関の結果はピークセレクタ５７０′にお
いて比較されかつ最大の相互相関値を有するベクトルＣ
_ｋ（ｍ）はピークセレクタ５７０′により最適のピッチ
周期ベクトルとして識別される。最大の相互相関は数学
的に以下の式（１０）によって表すことができる。この場合、Ｇ_ｋは式（９）において規定されており、か
つｍはダミー整数指数である。The normalized correlation results of each vector in codebook 155 are compared in peak selector 570 'and vector C having the largest cross-correlation value.
_k (m) is identified by peak selector 570 'as the optimal pitch period vector. The maximum cross-correlation can be mathematically represented by equation (10) below. In this case G _k is defined in equation (9) and m is a dummy integer exponent.

【００６９】ピッチ周期のロケーション、すなわち、コ
ードベクトルＣ_ｋ（ｍ）の指数はチャネルコーダ２１０
への送信のために出力２２１′に提供される。The location of the pitch period, that is, the index of the code vector C _k (m) is the channel coder 210.
Is provided at output 221 'for transmission to.

【００７０】ピッチゲインはゲイン計算機５８０′によ
り選択されたピッチ周期の候補ベクトルＣ_ｋ（ｍ）を使
用して計算されゲイン指数２２２′を発生する。ここに
述べた手段および方法は、実質的に音声品質の喪失を生
ずることなく計算機的な複雑さを低減する。計算機的な
複雑さが低減されるから、この構成を用いたボコーダは
単一のデジタル信号プロセッサ（ＤＳＰ）によってより
都合よく実施することができる。本発明の手段および方
法はまた、最小２乗予測エラー（ＭＰＳＥ）サーチ基準
を使用する、音声認識および音声識別のような他の領域
に適用することができる。The pitch gain is calculated using the pitch period candidate vector C _k (m) selected by the gain calculator 580 'to generate a gain index 222'. The means and methods described herein reduce the computational complexity with virtually no loss of voice quality. Vocoders using this configuration can be more conveniently implemented by a single digital signal processor (DSP) because of reduced computational complexity. The means and methods of the present invention can also be applied to other areas, such as speech recognition and speech identification, which use least squares prediction error (MPSE) search criteria.

【００７１】本発明が、ここに述べた方法および装置に
よって生成される、時には目標音声残差（ｔａｒｇｅｔ
ｓｐｅｅｃｈｒｅｓｉｄｕａｌ）と称される、知覚
的に重み付けされた目標音声信号Ｘ（ｎ）に関して述べ
られたが、本発明の方法は知覚的に重み付けされた目標
音声Ｘ（ｎ）を得るためにここで使用された特定の手段
および方法に限定されるものではなく、他の手段および
方法によって得られる目標音声と共にかつ知覚的重み付
けまたはフィルタのリンギングの除去を用いあるいは用
いずに使用するこができる。The present invention is sometimes produced by the methods and apparatus described herein, sometimes with a target speech target.
Although described with reference to a perceptually weighted target speech signal X (n), referred to as speech residual), the method of the present invention is now described to obtain a perceptually weighted target speech signal X (n). It is not limited to the particular means and method used and can be used with target speech obtained by other means and methods and with or without perceptual weighting or removal of filter ringing.

【００７２】「音声（ｓｐｅｅｃｈ）」または「目標音
声（ｔａｒｇｅｔｓｐｅｅｃｈ）」に適用される「残
差（ｒｅｓｉｄｕａｌ）」なる用語はフィルタのリンギ
ング信号が音声または目標音声から減算されている状況
を含むことを考えている。ここで用いられているよう
に、用語「音声残差」または「目標音声」または「目標
音声残差」および省略表現“Ｘ（ｎ）”はそのような変
形を含むものである。同じことはまた、有限または無限
インパルス応答関数とすることができ、かつ知覚的重み
付けを用いあるいは用いない、インパルス応答関数Ｈ
（ｎ）についても当てはまる。ここで用いられているよ
うに、用語「知覚的に重み付けされたインパルス応答関
数（ｐｅｒｃｅｐｔｕａｌｌｙｗｅｉｇｈｔｅｄｉ
ｍｐｕｌｓｅｒｅｓｐｏｎｓｅｆｕｎｃｔｉｏｎ）」
または「フィルタインパルス応答（ｆｉｌｔｅｒｉｍ
ｐｕｌｓｅｒｅｓｐｏｎｓｅ）」および“Ｈ（ｎ）”
なる表記は、そのような変形を含むことを意図してい
る。同様に、ワード「ゲイン指数（ｇａｉｎｉｎｄｅ
ｘ）」または「ゲインスケーリングファクタ（ｇａｉｎ
ｓｃａｌｉｎｇｆａｃｔｏｒ）」およびその表記Ｇ_ｋ
は、そのような「ゲイン」または「エネルギ」正規化信
号が音声のＣＥＬＰ符号化に関連して取り入れる多くの
形式を含むものである。The term "residual" as applied to "speech" or "target speech" includes the situation where the ringing signal of the filter is subtracted from the speech or target speech. Thinking. As used herein, the terms "voice residual" or "target voice" or "target voice residual" and the abbreviation "X (n)" are meant to include such variations. The same also applies to the impulse response function H, which can be a finite or infinite impulse response function and with or without perceptual weighting.
The same applies to (n). As used herein, the term “perceptually weighted impulse response function (perceptually weighted i.
mpulse response function ”
Or "filter impulse response (filter im
"pulse response)" and "H (n)"
Is intended to include such variations. Similarly, the word "gain inde
x) ”or“ gain scaling factor (gain)
scaling factor) ”and its notation G _k
Includes many forms that such "gain" or "energy" normalized signals will incorporate in connection with CELP coding of speech.

【００７３】図４に示された実施例によって提供される
有利性をもってしても、かなりの計算機的な負担が依然
として残っている。たとえば、図４のブロック５６０′
における自己相関係数の評価（前記式（７）を参照）
は、コードブック１５５におけるＫのベクトルに対する
エネルギ正規化（ゲイン）係数を計算するために（Ｋ）
・（Ｎ！）の乗算を必要とする。Ｋは典型的には５１２
または１０２４のオーダでありかつＮは典型的には６０
または１２０または２４０のオーダであるから、（Ｋ）
・（Ｎ！）＝（Ｋ）・（Ｎ）・（Ｎ−１）・（Ｎ−２）
…（２）は通常非常に大きな数となる。これらの計算
は、ブロック５１０′，５２０′，５５０′の動作に必
要とされるものおよび特定の適応コードブックベクトル
Ｃ_ｋ＝ｊ（ｎ）およびＧ_ｋ＝ｊの対応する値と共に、最
も適合する確率的コードブックベクトルおよび、入力音
声に対し目標音声Ｘ（ｎ）の最善の適合（最小のエラ
ー）を与える、対応するゲインファクタを再帰的に決定
するのに必要な他のものの他に必要である。これは必要
な計算を合理的な時間内に行うためにかなりの量の計算
機的な能力を必要とする。Even with the advantages offered by the embodiment shown in FIG. 4, a significant computational burden remains. For example, block 560 'of FIG.
Evaluation of autocorrelation coefficient in (see above equation (7))
To compute the energy normalization (gain) coefficients for the K vectors in codebook 155 (K)
• Requires (N!) Multiplication. K is typically 512
Or on the order of 1024 and N is typically 60
Or because it is on the order of 120 or 240, (K)
・ (N!) = (K) ・ (N) ・ (N-1) ・ (N-2)
(2) is usually a very large number. These calculations are best fit with what is required for the operation of blocks 510 ', 520', 550 'and the corresponding values of the particular adaptive codebook vectors C _{k = j} (n) and G _{k = j.} Needed in addition to the stochastic codebook vector and others needed to recursively determine the corresponding gain factor that gives the best fit (minimum error) of the target speech X (n) to the input speech. is there. This requires a significant amount of computational power to make the necessary calculations in a reasonable amount of time.

【００７４】ベクトル当たりＮのエントリのＫのベクト
ルを有するコードブックに対して行われることが必要な
自己相関演算の数は音声品質に大きな悪影響を与えるこ
となくかなり低減できることが分かった。これは、Ｎの
エントリの内の第１のＰ（Ｐ＜＜Ｎ）に対するコードブ
ックベクトルを自己相関してそれに対する第１の自己相
関値を決定する段階、Ｋのコードブックベクトルおよび
前記第１の自己相関値を使用して合成音声を生成しかつ
その結果を入力音声と比較することによってＫのコード
ブックベクトルを評価する段階、Ｋのコードブックベク
トルの内のどのＳ（Ｓ＜＜Ｋ）が評価されたＫ−Ｓの残
りのベクトルよりも入力音声と比較してより少ないエラ
ーを有する合成音声を提供するかを決定する段階、各コ
ードブックベクトルにおけるＲのエントリに対するＫの
ベクトルの内のこれらのＳに対するコードブックベクト
ル（Ｐ＜Ｒ≦Ｎ）を自己相関してそれに対する第２の自
己相関値を提供する段階、前記第２の自己相関値を使用
してＫのベクトルの内の前記Ｓを再評価しＳのコードブ
ックベクトルの内のどれが入力音声と比較して最少のエ
ラーを提供するかを識別する段階、そして最少のエラー
を提供するコードブックベクトルの識別を使用して音声
フレームに対するＣＥＬＰコードを形成する段階、を具
備する方法によって達成される。ここに述べられた大き
さのＫおよびＮに対しては、５≦Ｐ≦１０かつ１≦Ｓ≦
７の範囲のＰおよびＳが適切である。Ｒ＝ＮまたはＮ−
１であることが望ましい。It has been found that the number of autocorrelation operations that need to be performed on a codebook with K vectors of N entries per vector can be significantly reduced without a significant negative impact on speech quality. This comprises autocorrelating a codebook vector for a first P (P << N) of N entries to determine a first autocorrelation value for it, K codebook vectors and said first Evaluating the codebook vector of K by generating the synthesized speech using the autocorrelation value of S and comparing the result with the input speech, which S (S << K) of the K codebook vectors of K Determines whether to provide the synthesized speech with less error compared to the input speech than the rest of the evaluated K-S vectors, of the K vectors for the R entries in each codebook vector. Autocorrelating a codebook vector (P <R ≦ N) for these S to provide a second autocorrelation value for it, using said second autocorrelation value Re-evaluating the S of the codebook vectors of S to identify which of the S codebook vectors provide the least error compared to the input speech, and of the codebook vector providing the least error. Forming the CELP code for the speech frame using the identification. For K and N of the magnitudes stated here, 5 ≦ P ≦ 10 and 1 ≦ S ≦
P and S in the range of 7 are suitable. R = N or N-
It is desirable that it is 1.

【００７５】上に述べた動作はまたここに与えられた式
および図に関して説明することができる。たとえば、ｍ
＝０〜Ｎ−１に対する、ｎ＝１〜Ｎに対する、かつｋ＝
１〜ｋ＝Ｋの各々の値に対し、式（７）を再帰的に評価
する代わりに、以下の手順が用いられる。（１）ｍ＝０〜ｍ＝Ｐ、ここでＰ＜＜Ｎ、に対し、式
（７）に従ってブロック５５０′においてコードブック
ベクトルＣ_ｋ（ｎ）の自己相関を行い、（２）それによ
って検出されたＵ_ｋ（Ｐ）のＰの値を使用し、すべての
ＫのベクトルＣ_ｋ（ｎ）を再帰的に評価しかつＫのベク
トルＣ_ｋ（ｎ）の内のＳの、Ｓ＜＜Ｋ、入力音声に対し
最も近い整合を提供するものを選択し、次に（３）上の
ステップ（２）において選択されたＫのベクトルの内の
Ｓを最初に選択されたＰの値より多く、好ましくはすべ
てｍ＝０〜ｍ＝Ｎ−１の値、を使用して再帰的に再評価
し、式（７）におけるＵ_ｋ（ｍ）を決定し入力音声に対
する最善の適合を与えるｊ番目の値Ｃ_ｋ＝ｊ（ｎ）およ
び対応するゲイン指数またはファクタＧ_ｋ＝ｊを決定
し、そして（４）前と同様に、Ｃ_ｋ＝ｊ（ｎ）およびＧ
_ｋ＝ｊをチャネルコーダ２１０に送る。The operations described above can also be described with reference to the equations and figures provided herein. For example, m
= 0 to N−1, n = 1 to N, and k =
Instead of recursively evaluating equation (7) for each value of 1-k = K, the following procedure is used. (1) For m = 0 to m = P, where P << N, autocorrelate the codebook vector C _k (n) in block 550 ′ according to equation (7), and (2) detect it accordingly. Using the value of P of U _k (P) that has been calculated, recursively evaluate all K vectors C _k (n) and S of S of K vectors C _k (n), S << K , Selecting the one that provides the closest match to the input speech, and then (3) more S of the K vectors selected in step (2) above than the value of P originally selected, Recursively reevaluate using preferably all values of m = 0 to m = N−1 to determine U _k (m) in equation (7) and give the j-th best fit to the input speech. Determine the value C _{k = j} (n) and the corresponding gain index or factor G _{k = j} , and ( 4) As before, C _{k = j} (n) and G
Send _{k = j} to the channel coder 210.

【００７６】ここで使用されているように、「再帰的
（ｒｅｃｕｒｓｉｖｅｌｙ）」は反復的な合成による分
析（ａｎａｌｙｓｉｓ−ｂｙ−ｓｙｎｔｈｅｓｉｓ）コ
ードブックサーチおよび図２の（ａ）および（ｂ）と図
４に関して説明したエラー最少化手順を言及している。As used herein, "recursive" is an analysis-by-synthesis codebook search and FIGS. 2 (a) and 2 (b) and FIG. It refers to the error minimization procedure described above.

【００７７】出力音声品質はＰが約Ｐ＝１０まで増大す
るに応じて改善され、Ｐ＞１０に対してはそれ以上の少
しの改善があることが分かっている。良好な音声品質は
５≦Ｐ≦１０に対して得られる。音声品質はＰ＜５に対
し急速に劣化する。Ｎは通常６０またはそれ以上のオー
ダであるから、かなりの計算機的な節約が得られる。It has been found that the output voice quality improves as P increases up to about P = 10, with a slight improvement over P> 10. Good voice quality is obtained for 5 ≦ P ≦ 10. The voice quality deteriorates rapidly for P <5. Since N is typically on the order of 60 or more, considerable computational savings are obtained.

【００７８】有用な音声品質はＳの値をＳ＝１にまで小
さくし、かつ音声品質はＳが増大するにつれて増大する
ことが分かっている。約Ｓ＝７を超えると、音声品質の
それ以上の改善は検出するのが困難になる。従って、１
≦Ｓ≦７が有用な動作範囲であり、これは最適のコード
ブックベクトルおよび対応するゲイン指数またはファク
タに対する再帰的サーチの間に達成されなければならな
い計算の数を大幅に低減する。これは単一のデジタル信
号プロセッサを使用して所望のボコーダ機能を達成する
のをさらに容易にする。It has been found that useful voice quality reduces the value of S to S = 1, and voice quality increases as S increases. Above about S = 7, further improvement in voice quality becomes difficult to detect. Therefore, 1
≦ S ≦ 7 is a useful operating range, which significantly reduces the number of calculations that must be accomplished during the recursive search for the optimal codebook vector and corresponding gain index or factor. This makes it easier to achieve the desired vocoder function using a single digital signal processor.

【００７９】どのようにしてコードブックエントリが構
築されかつ自己相関が行われるかに関しさらに別の問題
が存在する。これはショートピッチ期間の識別に供する
ために従来技術においてしばしば使用されている「コピ
ーアップ（ｃｏｐｙ−ｕｐ）」と称される手順の結果と
して生ずる（たとえば、前述のケッチャム他の米国特許
を参照）。これは次のように説明される。Yet another problem exists as to how codebook entries are constructed and autocorrelated. This occurs as a result of a procedure called "copy-up" that is often used in the prior art to provide for the identification of short pitch periods (see, for example, the aforementioned Ketchum et al. Patent). . This is explained as follows.

【００８０】最適のピッチ周期に対する適応コードブッ
クサーチにおけるエラー関数のエネルギ項は２つの関数
の自己相関係数のリニアな組合わせに低減することがで
きる。これら２つの関数は知覚的に重み付けされたショ
ートターム・リニアフィルタのインパルス応答関数Ｈ
（ｎ）および適応コードブックのコードブックベクトル
Ｃ_ｋ（ｎ）である。適応コードブックについては計算機
的な複雑さが確率的コードブックより大きくなるが、そ
れは適応コードブックベクトルに対する自己相関係数が
予め計算しかつ記憶しておくことができないからであ
る。The energy term of the error function in the adaptive codebook search for the optimum pitch period can be reduced to a linear combination of the autocorrelation coefficients of the two functions. These two functions are the impulse response function H of the perceptually weighted short-term linear filter.
(N) and the codebook vector C _k (n) of the adaptive codebook. The computational complexity of the adaptive codebook is greater than that of the probabilistic codebook, because the autocorrelation coefficient for the adaptive codebook vector cannot be pre-computed and stored.

【００８１】各々の適応コードブックベクトルは、サン
プルまたは値とも称される、Ｎのエントリのリニアアレ
イである。各エントリは１〜Ｎに至るあるいはＮ〜１に
至る指数ｎにより識別される。コードブックにおける隣
接ベクトルは互いに１つのエントリだけ異なる、すなわ
ち、各引き続くベクトルは該ベクトルの終りに加えられ
た１つの新しいエントリを有しかつ該ベクトルの他の端
からドロップされる１つの古いエントリを有し介在する
エントリは同じままである。従って、該ベクトルの端部
を除き、隣接ベクトルは１つの指数だけずれた同じエン
トリを有する。もし隣接するベクトルが並んで配置され
れば、それらは１つのエントリまたはサンプルだけずれ
ておれば釣り合うことになる。これは０から９の間の任
意的なエントリ値および指数ｎ＝１−６０を有する仮想
的な隣接ベクトルｋ，ｋ′に対し以下に概略的に説明す
る。この変位（ｄｉｓｐｌａｃｅｍｅｎｔ）はコードブ
ック「オーバラップ」と称される。Each adaptive codebook vector is a linear array of N entries, also called samples or values. Each entry is identified by an index n ranging from 1 to N or N to 1. Adjacent vectors in the codebook differ from each other by one entry, that is, each subsequent vector has one new entry added at the end of the vector and one old entry dropped from the other end of the vector. The intervening entries have remain the same. Therefore, except for the ends of the vector, adjacent vectors have the same entry offset by one index. If adjacent vectors are placed side by side, they would be balanced if they were offset by one entry or sample. This is outlined below for a virtual adjacency vector k, k 'with an arbitrary entry value between 0 and 9 and an index n = 1-60. This displacement is referred to as the codebook "overlap".

【００８２】実例Ｉ−ベクトルオーバラップの例示 k(n): 1,2,3,4,5,6,7,……,55,56,57,58,59,60 （指数） 4,6,9,3,5,1,8,……, 0, 4, 6, 8, 2, 3 （値） k ′(n): 1,2,3,4,5,6,7,…… ,55,56,57,58,59,60 （指数） 6,9,3,5,1,8,5,…… , 4, 6, 8, 2, 3, 7 （値）ベクトルｋ′は１つの指数によって変位した隣接ベクト
ルｋと同じエントリを有し、かつ古いエントリはベクト
ルの一端からドロップされ（たとえば、値４は左端をド
ロップされ）かつ新しいエントリがパターンに加えられ
る（たとえば、値７が右端に加えられる）。Example I-Illustration of Vector Overlap k (n): 1,2,3,4,5,6,7, ..., 55,56,57,58,59,60 (Index) 4,6 , 9,3,5,1,8, ……, 0, 4, 6, 8, 2, 3 (value) k ′ (n): 1,2,3,4,5,6,7, …… , 55,56,57,58,59,60 (index) 6,9,3,5,1,8,5, ……, 4, 6, 8, 2, 3, 7 (value) Vector k ′ is It has the same entries as the neighbor vector k displaced by one exponent, and old entries are dropped from one end of the vector (eg, the value 4 is dropped on the left) and new entries are added to the pattern (eg, the value 7). Is added to the right edge).

【００８３】自己相関関数Ｕ_ｋ（ｍ）は方程式（７）で
与えられ、この場合ｍ＝０〜Ｎ−１は積Ｃ_ｋ（ｎ）＊Ｃ
_ｋ（ｎ＋ｍ）における「遅れ（ｌａｇ）」値であり、か
つｎ＝１〜Ｎはベクトルエントリの指数（ｉｎｄｅｘ）
である。今までは、ベクトル長Ｎ（すなわち、コードブ
ックベクトルごとのエントリの数）およびフレーム長Ｌ
（すなわち、分析フレームごとの音声サンプルの数）は
同じであると仮定してきた。しかしこれは常にそうであ
る訳ではない。ＮおよびＬが同じであるかあるいは異な
るかに応じて自己相関係数を決定するために異なる作戦
が使用される。The autocorrelation function U _k (m) is given by equation (7), where m = 0 to N−1 is the product C _k (n) * C
“lag” value in _k (n + m), and n = 1 to N is the index of the vector entry
Is. So far, the vector length N (ie the number of entries per codebook vector) and frame length L
(Ie the number of audio samples per analysis frame) has been assumed to be the same. But this is not always the case. Different strategies are used to determine the autocorrelation coefficient depending on whether N and L are the same or different.

【００８４】ベクトル長Ｎがフレーム長Ｌに等しいかま
たは大きい場合には、自己相関係数は加算−削除（ａｄ
ｄ−ｄｅｌｅｔｅ）エンドコレクションと呼ばれるプロ
セスによって計算できる。たとえば、引き続く適応コー
ドブックベクトルＣ_ｋ，Ｃ_ｋ′Ｃ_ｋ″などのゼロオーダ
またはゼロ遅延（遅れｍ＝０）自己相関係数は最初のベ
クトルに対し（Ｃ_ｋ（ｎ））^２の和を計算しかつエンド
コレクションによって他のベクトルを検出することによ
り決定できる。エンドコレクションは新しく加算された
ベクトル値の２乗を加算しかつたった今削除されたベク
トル値の２乗を減算することを必要とする。この同じ手
順が（いくつかの変化を伴って）ｍ＝１，２，３などに
対し続けられ、その結果計算機的な負担が各々のベクト
ルに対し別個に式（７）により各自己相関係数を計算す
る場合に比較して低減される。自己相関係数を決定する
ためのこの加算−削除エンドコレクション処理は技術上
よく知られている。If the vector length N is equal to or larger than the frame length L, the autocorrelation coefficient is added-deleted (ad
d-delete) It can be calculated by a process called end collection. For example, a zero-order or zero-delay (delay m = 0) autocorrelation coefficient, such as a subsequent adaptive codebook vector C _k , C _k ′ C _k ″, calculates the sum of (C _k (n)) ² for the first vector. And can be determined by detecting another vector with end collection, which requires adding the square of the newly added vector value and subtracting the square of the just deleted vector value. This same procedure is followed (with some changes) for m = 1, 2, 3, etc., so that the computational burden is calculated separately for each vector by equation (7) for each autocorrelation coefficient. This is reduced compared to the case of computing .. This add-delete end-correction process for determining the autocorrelation coefficient is well known in the art.

【００８５】ベクトルにおけるサンプルの数がフレーム
長Ｌより小さい場合には、フレームを満たすために該ベ
クトルを「コピーアップ（ｃｏｐｙ−ｕｐ）」すること
が普通である（たとえば、前述のケッチャム他の米国特
許を参照）。たとえば、もしフレーム長が６０でありか
つ２０のエントリのみが分析において使用されておれ
ば、これら２０のエントリは３回繰り返されて６０のベ
クトル長を得る。これは以下にベクトル値の指数に関し
て説明する。If the number of samples in the vector is less than the frame length L, it is common to "copy-up" the vector to fill the frame (eg, Ketchum et al., Supra, US). See patent). For example, if the frame length is 60 and only 20 entries are used in the analysis, these 20 entries are repeated 3 times to get 60 vector lengths. This is explained below in terms of vector-valued exponents.

【００８６】実例ＩＩ−コピーアップベクトル 1,2,................................59,60 コピーアップベクトル 1,2,...,19,20,1,2,...,19,20,1,2,...,19,20. Illustrative Example II- Copyup Vectors 1,2, .................. 59,60 Copyup Vectors 1,2, ..., 19,20,1,2, ..., 19,20,1,2, ..., 19,20.

【００８７】この複製または「コピーアップ」はもし自
己相関係数を計算するための前に述べた加算−削除エン
ドコレクション方法を用いようとするとエラーを生ず
る。これらのエラーは合成された音声の品質を劣化させ
る。This duplication or "copy-up" results in an error if one tries to use the previously mentioned add-delete end correction method for calculating the autocorrelation coefficient. These errors degrade the quality of the synthesized speech.

【００８８】エンドコレクションのエラーはｍのより大
きな値、すなわち、自己相関関数における高い次数の
（より大きな「遅れ」の項）に対し増大する。前に述べ
た単純な加算−削除エンドコレクション手順はコピーア
ップされたベクトルに対してはもはや満足に動作しな
い。従ってより小さな計算機的な負担（例えば、容易な
エンドコレクション）をもつためにより貧弱な音声品質
を受け入れるか、あるいはより高い音声品質および大き
な計算機的な負担（例えば、各ベクトルを別個に計算す
る）をもつかの望まない選択が残されることになる。ベ
クトルにおけるサンプルの数がフレーム長より小さい状
況に対して自己相関係数を得る計算機的な負担は以下に
述べる改良された計算機的手順および装置によって合成
された音声品質の喪失なく低減できることが分った。The error of the end correction increases for larger values of m, ie higher orders (larger "lag" terms) in the autocorrelation function. The simple add-delete end correction procedure described above no longer works satisfactorily for copied-up vectors. Thus accepting poorer voice quality due to having a smaller computational burden (eg easy end-collection), or higher voice quality and greater computational burden (eg computing each vector separately). There will be some unwanted choices left. It has been found that the computational burden of obtaining the autocorrelation coefficient for situations where the number of samples in the vector is less than the frame length can be reduced without loss of synthesized speech quality by the improved computational procedures and apparatus described below. It was

【００８９】分析フレームが長さＬ（例えば、６０）お
よびＮのサンプルまたは値（例えば、６０）を有するコ
ードブックが目標音声に対し最善の整合を生成する適応
コードブックベクトルを決定するために図２から図４ま
での装置および手順に関して使用されるものと仮定す
る。さらに、短時間ピッチ周期を迅速に検出するために
ベクトル値のより小さなサブセットＭ＜Ｎ（例えば、Ｍ
〜２０）が初めに分析のために使用されるものと仮定す
る。過去において、Ｍのサンプルまたは値が長さＬのフ
レームを満たすためにコピーアップされかつ分析はコピ
ーアップされたフレームに基づいていた。本発明の方法
によれば、Ｍの値のサブフレームをコピーアップする必
要はない。A codebook in which the analysis frame has lengths L (eg, 60) and N samples or values (eg, 60) is used to determine the adaptive codebook vector that produces the best match for the target speech. 2 to 4 are assumed to be used for the apparatus and procedure. In addition, a smaller subset of vector values M <N (eg M
~ 20) are initially used for the analysis. In the past, M samples or values were copied up to fill a frame of length L and the analysis was based on the copied up frames. According to the method of the present invention, it is not necessary to copy up a subframe of M values.

【００９０】この実施例に関連して与えられる説明は特
に適応コードブックベクトルの自己相関係数を効率的に
決定することに向けられており、かつより小さなエラー
および目標音声に対する最善の整合を有するコードブッ
クベクトルを選択するために使用される分析プロセスの
他の部分の説明については図２から図４までの説明を参
照すべきである。The description given in connection with this embodiment is specifically directed to the efficient determination of the autocorrelation coefficient of the adaptive codebook vector, and has a smaller error and the best match to the target speech. Reference should be had to FIGS. 2-4 for a description of the other parts of the analysis process used to select the codebook vector.

【００９１】また、式（７）を参照すべきであり、この
場合は積［Ｃ_ｋ（ｎ）＊Ｃ_ｋ（ｎ＋ｍ）］のｎ＝１〜Ｎ
かつｍ＝０〜Ｎ−１にわたる和Ｕ_ｋ（ｎ）はｋ番目のベ
クトルの自己相関係数である。指数ｍは通常０からＮ−
１におよびかつ自己相関係数を計算するために使用され
る「遅れ」を識別する。１〜Ｋの範囲の指数ｋはコード
ブックベクトルを識別しかつ指数ｎは該ベクトル内の個
々のサンプルまたは値を示す。分析において使用される
サンプルの数は検出されるピッチ周期に依存する。例え
ば、人間の声に関連する最も短いピッチ周期については
約２０のサンプルが必要とされかつ最も長いピッチ周期
に対しては約１４７のサンプルが必要とされる。Also, reference should be made to the equation (7), and in this case, n = 1 to N of the product [C _k (n) * C _k (n + m)].
And the sum U _k (n) over m = 0 to N−1 is the autocorrelation coefficient of the kth vector. The index m is usually 0 to N-
Identify the "lag" that is to be 1 and that is used to calculate the autocorrelation coefficient. An index k in the range 1 to K identifies a codebook vector and an index n indicates an individual sample or value within the vector. The number of samples used in the analysis depends on the pitch period detected. For example, about 20 samples are required for the shortest pitch period associated with the human voice and about 147 samples are required for the longest pitch period.

【００９２】０次の自己相関係数はｍ＝０に対応し、１
次の係数はｍ＝１に対応し、かつ以下同様である。「ピ
ッチ遅れ」Ｍ＜Ｎは分析のために使用されるべきベクト
ルにおける値の数として規定される。従って、短時間ピ
ッチ周期の音声成分に対する自己相関係数を決定する上
で、ｍはゼロからＭまで変化する。「フレームサイズ」
Ｌはフレームにおける音声のサンプル数として定義され
る。通常、Ｌ＝Ｎである。Ｌに対する典型的な値は６０
でありかつＭに対する典型的な値は２０であるが、Ｌ＜
Ｍであればこれら双方に対し他の値を使用することもで
きる。説明の便宜のため、以下の説明ではＬ＝６０およ
びＭ＝２０の値が仮定されている。しかしながら、当業
者はここに記述した説明に基づきこれらの値が制限的な
ものでないことおよびＭおよびＬの他の値も使用できる
ことを理解するであろう。The zero-order autocorrelation coefficient corresponds to m = 0, and 1
The next coefficient corresponds to m = 1 and so on. The "pitch lag" M <N is defined as the number of values in the vector to be used for analysis. Therefore, m varies from zero to M in determining the autocorrelation coefficient for the voice component of the short pitch period. "Frame size"
L is defined as the number of audio samples in the frame. Usually L = N. A typical value for L is 60
And a typical value for M is 20, but L <
If M, other values can be used for both of them. For convenience of description, values of L = 60 and M = 20 are assumed in the following description. However, those skilled in the art will understand based on the description provided herein that these values are not limiting and that other values for M and L can be used.

【００９３】本発明は自己相関係数を決定する場合の計
算機的な負担を低減しかつコピーアップのエラーを避け
るための手段および方法を提供する。それはコピーアッ
プが前に使用された、すなわち、最も短いピッチ周期を
迅速に識別するために限られた数のコードブックサンプ
ル（例えば、２０）が必要であるが、その限られた数の
サンプルはエネルギの正規化問題を避けるために分析フ
レーム長（例えば、６０）に拡張されなければならな
い、再起的な合成による分析手順の部分にあてはまる。
いったん第１のＭ＋ｋ−１のベクトルが分析されかつベ
クトルの拡張が完了しそれによりＮ＝Ｌになれば、自己
相関係数は前に述べた加算−削除エンドコレクション処
理によって計算される。The present invention provides means and methods for reducing the computational burden of determining autocorrelation coefficients and avoiding copy-up errors. It requires a limited number of codebook samples (eg, 20) for which copy-up was previously used, i.e. to quickly identify the shortest pitch period, but that limited number of samples is It applies to the part of the analysis procedure with recursive synthesis, which has to be extended to the analysis frame length (eg 60) to avoid energy normalization problems.
Once the first M + k-1 vectors have been analyzed and the vector expansion is complete, resulting in N = L, the autocorrelation coefficient is calculated by the add-delete end correction process described above.

【００９４】好ましい実施例においては、本発明の方法
は、（１）ｍ＝０〜Ｔ＜Ｍおよびｎ＝１〜Ｍに対し式
（７）を評価することにより最初のベクトルｋに対し自
己相関係数Ｕ_ｋを決定しかつ結果をＬ／Ｍによって乗算
する。この場合、Ｌ，Ｍ，Ｐ，ｎおよびｍは上に述べた
意味を有する。Ｌ＝６０およびＭ＝２０に対しては、Ｌ
／Ｍ＝３である。パラメータＴは自己相関遅れｍのどれ
だけ多くの値が使用されるか、すなわち、どれだけ多く
の自己相関係数が計算されるかを決定する。典型的に
は、Ｔ＝Ｍ−１であるが、Ｔの他のより小さな値も使用
できる。Ｔのより小さな値を使用することはコードブッ
クベクトルの支配的な値が計算されて支配的な自己相関
係数がｍの小さな値に対するものである場合に有利であ
る。（２）ステップ（１）において前に得られたｍの各
々の値に対し式（７）における積の和を求めかつ
（Ｃ_ｋ′（ｎ＝Ｍ＋１））^２をｍ＝０の項に加算し、Ｃ
_ｋ′（ｎ＝Ｍ＋１）＊Ｃ_ｋ′（ｎ＝Ｍ＋２）をｍ＝１の
項に加算し、Ｃ_ｋ′（ｎ＝Ｍ＋１）＊Ｃ_ｋ′（ｎ＝Ｍ＋
３）をｍ＝２の項に加算し、かつ以下同様にＴ番目の項
まで行なうことによって第２のベクトルｋ′に対する自
己相関係数Ｕ_ｋを決定し、かつその結果をＬ／（Ｍ＋
１）により乗算し、（３）前記ステップ（２）において
前に得られたｍの各値に対し前記積の和を行ないかつ
（Ｃ_ｋ′（ｎ＝Ｍ＋２））^２をｍ＝０の項に加算し、Ｃ
_ｋ″（ｎ＝Ｍ＋２）＊Ｃ_ｋ″（ｎ＝Ｍ＋３）をｍ＝１の
項に加算し、Ｃ_ｋ″（ｎ＝Ｍ＋２）＊Ｃ_ｋ″（ｎ＝Ｍ＋
４）をｍ＝２の項に加算し、かつ以下同様にＴ番目の項
まで行なうことによって第３のベクトルｋ″に対する自
己相関係数Ｕ_ｋを決定し、その結果をＬ／（Ｍ＋２）に
よって乗算し、そして（４）前記ステップ（１）〜
（３）を残りのベクトルに対し、Ｌ／（Ｍ＋ｋ−１）＝
１になるまで各々の付加的なベクトルに対し前記値を１
だけ増分して残りの自己相関係数を決定する。その後、
自己相関係数が前に述べた伝統的な従来技術の加算−削
除手順によって計算される。In the preferred embodiment, the method of the present invention self-phases for the first vector k by evaluating (7) for (1) m = 0 to T <M and n = 1 to M. Determine the relation number U _k and multiply the result by L / M. In this case L, M, P, n and m have the meanings given above. For L = 60 and M = 20, L
/ M = 3. The parameter T determines how many values of the autocorrelation delay m are used, ie how many autocorrelation coefficients are calculated. Typically T = M-1, but other smaller values of T can be used. Using a smaller value of T is advantageous when the dominant values of the codebook vector are calculated and the dominant autocorrelation coefficient is for small values of m. (2) Find the sum of the products in equation (7) for each value of m previously obtained in step (1) and add (C _k ′ (n = M + 1)) ² to the term m = 0. Then C
_k '(n = M + 1) * C _k ' (n = M + 2) is added to the term of m = 1, and C _k '(n = M + 1) * C _k ' (n = M +
3) is added to the m = 2 term, and so on until the Tth term is similarly determined, the autocorrelation coefficient U _k for the second vector k ′ is determined, and the result is L / (M +
1) and (3) sum the products for each value of m previously obtained in step (2) and add (C _k ′ (n = M + 2)) ² to the term m = 0. To C
_k ″ (n = M + 2) * C _k ″ (n = M + 3) is added to the term of m = 1, and C _k ″ (n = M + 2) * C _k ″ (n = M +
4) is added to the term of m = 2, and the same procedure is performed up to the T-th term to determine the autocorrelation coefficient U _k for the third vector k ″, and the result is determined by L / (M + 2). Multiply, and (4) steps (1)-
L / (M + k−1) = (3) for the remaining vectors
1 for each additional vector until 1
Increment by only to determine the remaining autocorrelation coefficient. afterwards,
The autocorrelation coefficient is calculated by the traditional prior art add-drop procedure described above.

【００９５】別の方法では、コードブックベクトルの自
己相関係数は以下の式（１１ａ）〜（１１ｂ）を使用し
てｍ＝０〜Ｔ＜Ｍに対し第１のベクトルｋ＝１の係数Ｕ
_ｋ（ｍ）を計算することにより決定される。Ｕ_１（ｍ）＝（Ｌ／Ｍ）Ｕ′_１（ｍ）（１１ｂ）Alternatively, the autocorrelation coefficient of the codebook vector can be calculated using the following equations (11a)-(11b) for the coefficient U of the first vector k = 1 for m = 0-T <M.
It is determined by calculating _k (m). U ₁ (m) = (L / M) U ′ ₁ (m) (11b)

【００９６】上記コードブックベクトルの自己相関係数
の決定は、次に以下の式（１２ａ）〜（１２ｂ）を使用
してｍ＝０〜Ｔ＜Ｍおよび（Ｍ＋ｋ−１）≦Ｌに対し残
りのコードブックベクトルの自己相関係数Ｕ_ｋ（ｍ）を
計算することにより決定される。Ｕ′_ｋ（ｍ）＝［Ｕ′_ｋ−１（ｍ）＋Ｃ_ｋ（Ｍ＋ｋ−１）Ｃ_ｋ（Ｍ＋ｋ−１＋ｍ）］（１２ａ）Ｕ_ｋ（ｍ）＝｛Ｌ／（Ｍ＋ｋ−１）｝Ｕ′_ｋ（ｍ）（１２ｂ）The determination of the autocorrelation coefficient of the above codebook vector then remains for m = 0-T <M and (M + k-1) ≤L using the following equations (12a)-(12b). It is determined by calculating the autocorrelation coefficient U _k (m) of the codebook vector of _U'k (m) = [ _U'k-1 (m) + _Ck (M + k-1) _Ck (M + k-1 + m)] (12a) _Uk (m) = {L / (M + k-1)} U ′ _K (m) (12b)

【００９７】合成による分析が増大する長さのベクトル
（およびそれらの対応する自己相関係数）を使用し、長
さＭのベクトルでスタートしかつベクトル長がフレーム
長に等しくなるまで、すなわち、（Ｍ＋ｋ−１）＝Ｌに
なるまで各々の引続くベクトルの長さを１サンプルだけ
増大しながら、行なわれる。フレーム長に整合するため
のショートピッチのサンプルの拡張が次に完了する。引
続くベクトルはフレーム長と同じ長さを有しかつオーバ
ラップするコードブックの各々の引続くベクトルはベク
トルの一端から古いサンプルを削除しかつ該ベクトルの
他端において新しいサンプルを加えることに対応する。
従来技術の加算−削除エンドコレクション方法が次に使
用されて分析される残りのベクトルの自己相関係数を決
定する。Analysis by synthesis uses vectors of increasing length (and their corresponding autocorrelation coefficients), starting with a vector of length M and until the vector length equals the frame length, ie ( This is done by increasing the length of each subsequent vector by one sample until M + k-1) = L. The extension of the short pitch samples to match the frame length is then complete. Subsequent vectors have the same length as the frame length and each successive vector in the overlapping codebook corresponds to removing old samples from one end of the vector and adding new samples at the other end of the vector. ..
Prior art add-delete end correction methods are then used to determine the autocorrelation coefficients of the remaining vectors to be analyzed.

【００９８】式（１１ａ）における積の和は第１のベク
トルに対し一度だけ評価される必要がありかつ次に（Ｍ
＋ｋ−１）までの他のベクトルは前記第１のベクトルの
項から含まれる付加的な値またはサンプルに対するＣ_ｋ
＊Ｃ_ｋの積の寄与分を加算することにより計算できる。
何らのコピーアップ手順も必要とされずかつコピーアッ
プによって生成される自己相関係数のエラーは生じな
い。これは実質的に図２から図４に関して説明した合成
による分析手順における計算機的な負担を低減する。The sum of products in equation (11a) need only be evaluated once for the first vector and then (M
Other vectors up to + k-1) are C _k for additional values or samples included from the first vector term.
It can be calculated by adding the product contributions of * C _k .
No copy-up procedure is required and the auto-correlation coefficient error produced by copy-up does not occur. This substantially reduces the computational burden in the synthetic analysis procedure described with respect to FIGS.

【００９９】従来技術のコピーアップ手順と本発明の手
順との間の差異がベクトル指数に関し以下に概略的に説
明される。自己相関係数の計算は遅れｍ、すなわち、ベ
クトルの相対変位の種々の量に対しそれ自身とのベクト
ルの積を加算することを含む。以下の例はコピーアップ
手法および本発明の手法に対する遅れ（ｌａｇ）ｍの種
々の量に対しどの値が一緒に乗算されるかを示す。この
例における数字はベクトル値またはエントリの指数であ
り、値そのものではなく、かつベクトルに沿った各エン
トリの位置の尺度として考えることができる。The differences between the prior art copy-up procedure and the procedure of the present invention are outlined below with respect to the vector index. The calculation of the autocorrelation coefficient involves adding the product of the vector with itself to the delay m, i.e. various amounts of relative displacement of the vector. The following example shows which values are multiplied together for various amounts of lag m for the copy-up method and the method of the present invention. The numbers in this example are vector values or exponents of entries, not the values themselves, but can be thought of as a measure of the position of each entry along the vector.

【０１００】実例ＩＩＩ−コピーアップ自己相関コピーアップに対しては、項ごとに乗算しかつ、各ｎお
よびｍに対し、加算を行なう。例えば、（ｋ＝１，ｍ＝０）に対しては、1,2,3,...,19,20,1,2,
3,...,19,20,1,2,3,...,19,20 を1,2,3,...,19,20,1,2,
3,...,19,20,1,2,3,...,19,20;により乗算し、（ｋ＝２，ｍ＝０）に対しては、1,2,...,19,20,21,1,
2,...,19,20,21,1,2,...,17,18 を1,2,...,19,20,21,1,
2,...,19,20,21,1,2,...,17,18;により乗算し、（ｋ＝３，ｍ＝０）に対しては、1,2,...,19,20,21,22,
1,2,...,20,21,22,1,2,...,15,16を1,2,...,19,20,21,2
2,1,2,...,20,21,22,1,2,...,15,16; により乗算し、以下同様にすべてのｋ，ｍおよびｎに対して行なう。Illustrative Example III-Copy-Up Autocorrelation For copy-up, we multiply by terms and add for each n and m. For example, for (k = 1, m = 0), 1,2,3, ..., 19,20,1,2,
3, ..., 19,20,1,2,3, ..., 19,20 to 1,2,3, ..., 19,20,1,2,
Multiply by 3, ..., 19,20,1,2,3, ..., 19,20 ;, for (k = 2, m = 0), 1,2, ..., 19,20,21,1,
2, ..., 19,20,21,1,2, ..., 17,18 to 1,2, ..., 19,20,21,1,
Multiply by 2, ..., 19,20,21,1,2, ..., 17,18; and for (k = 3, m = 0) 1,2, ..., 19,20,21,22,
1,2, ..., 20,21,22,1,2, ..., 15,16 to 1,2, ..., 19,20,21,2
2,1,2, ..., 20,21,22,1,2, ..., 15,16 ;, and so on for all k, m and n.

【０１０１】実例ＩＶ−改良された自己相関（ｍ＝０）本発明の構成に対しては、ｍ＝０〜Ｍ−１に対し第１の
（例えば、２０）エントリを乗算しかつ加算し、そして
次にｎ＝Ｍ＋１，ｎ＝Ｍ＋２，などのエントリの積を加
算する。例えば、（ｋ＝１，ｍ＝０）に対しては、1,2,3,...,19,20 ×1,
2,3,...,19,20 を計算し、かつＬ／Ｍにより乗算し、（ｋ＝２，ｍ＝０）に対しては、1,2,3,...,19,20,21
×1,2,3,...,19,20,21をｋ＝１に対して前に計算したも
のに２１・２１を加算することにより得、かつＬ／Ｍ＋
１により乗算し、（ｋ＝３，ｍ＝０）に対しては、1,2,3,...,19,20,21,2
2 ×1,2,3,...,19,20,21,22 をｋ＝２に対して前に計
算したものに２２・２２を加算することにより得、かつ
Ｌ／Ｍ＋２により乗算し、そしてベクトル長がフレーム
長に等しくなりかつ最後の項６０・６０が加算されるま
ですべてのｍに対し継続し、次に従来技術と同様に処理
する。Illustrative Example IV-Improved Autocorrelation (m = 0) For the configuration of the present invention, m = 0 to M-1 is multiplied and added by the first (eg, 20) entry, Then, products of entries such as n = M + 1 and n = M + 2 are added next. For example, for (k = 1, m = 0), 1,2,3, ..., 19,20 × 1,
Calculate 2,3, ..., 19,20 and multiply by L / M, and for (k = 2, m = 0), 1,2,3, ..., 19,20 ,twenty one
× 1,2,3, ..., 19,20,21 is obtained by adding 21 · 21 to the previously calculated value for k = 1, and L / M +
Multiply by 1, and for (k = 3, m = 0), 1,2,3, ..., 19,20,21,2
2 × 1,2,3, ..., 19,20,21,22 is obtained by adding 22 · 22 to the previously calculated for k = 2, and multiplied by L / M + 2, Then continue for all m until the vector length equals the frame length and the last term 60.60 has been added, then proceed as in the prior art.

【０１０２】従来技術の手法および本発明の手法に対す
る自己相関処理の上の例においては０次の項のみが示さ
れたが、当業者は本明細書の記載に基づきｍ＝１，ｍ＝
２，その他に対する積の項を表わすためにどのようにし
てベクトルをシフトするかを理解するであろう。その処
理の助けとして、ｋ＝１，ｋ＝２およびｍ＝１に対し本
発明のために以下の例が示される。実例Ｖ−改良された自己相関（ｍ＝１）（ｋ＝１，ｍ＝１）に対しては、1,2,3,...,..19,20 ×
1,2,3,...,18,19 を計算し、かつＬ／Ｍ＋１により乗算
し、（ｋ＝２，ｍ＝１）に対しては、1,2,3,...,19,20,21
×1,2,3,....,19,20をｋ＝１に対して前に計算されたも
のに２０・２１を加算することにより得、そしてＬ／Ｍ
＋２により乗算し、（ｋ＝３，ｍ＝１）に対しては、1,2,3,...,19,20,21,2
2 ×1,2,3,... ,19,20,21 をｋ＝２に対して前に計算
したものに２１・２２を加算することにより得、そして
Ｌ／Ｍ＋３により乗算し、そして評価されるすべてのｋ
およびｍに対しＬ／（Ｍ＋ｋ−１）＝１まで続ける。In the above example of the autocorrelation process for the prior art technique and the technique of the present invention, only the 0th order term was shown, but one skilled in the art will know based on the description herein that m = 1, m =
2, we will see how to shift the vector to represent the product term for others. As an aid to that process, the following examples are presented for the present invention for k = 1, k = 2 and m = 1. Example V-For improved autocorrelation (m = 1) (k = 1, m = 1), 1,2,3, ...,. 19,20 x
Calculate 1,2,3, ..., 18,19 and multiply by L / M + 1, for (k = 2, m = 1), 1,2,3, ..., 19 , 20,21
X 1,2,3, ..., 19,20 is obtained by adding 20 · 21 to the one previously calculated for k = 1, and L / M
Multiply by +2, and for (k = 3, m = 1), 1,2,3, ..., 19,20,21,2
2 × 1,2,3, ..., 19,20,21 is obtained by adding 21 · 22 to the previously calculated for k = 2 and then multiplied by L / M + 3 and evaluated All k done
And m until L / (M + k-1) = 1.

【０１０３】本発明の好ましい実施例に係わる上に述べ
たようにして自己相関係数を決定するのに適した装置が
図５に示されている。本発明に対応する自己相関装置６
００はベクトルサンプルＣ_ｋ（ｎ）が図４の適応コード
ブック１５５から受信される信号入力６０２を具備す
る。ベクトルサンプルまたは値Ｃ_ｋ（ｎ）は２つのパス
６０４，６０６に従う。パス６０６はスイッチ６０８を
介して初期ベクトル（すなわち、ｋ＝１）自己相関器６
１０に至る。初期ベクトル自己相関器６１０は前記式
（１１ａ）により示された機能を達成し、すなわち、そ
れはｋ＝１，ｍ＝１，２，３，…，Ｔ−１，Ｔに対応す
る自己相関係数Ｕ_１（ｍ）を計算する。これらの自己相
関係数はスイッチ６２０を介してエンドコレクション係
数計算機６２２に伝達される。An apparatus suitable for determining the autocorrelation coefficient as described above according to a preferred embodiment of the present invention is shown in FIG. Autocorrelation device 6 corresponding to the present invention
00 comprises a signal input 602 whose vector samples C _k (n) are received from the adaptive codebook 155 of FIG. The vector sample or value C _k (n) follows two paths 604,606. Path 606 passes through switch 608 the initial vector (ie, k = 1) autocorrelator 6
Up to 10. The initial vector autocorrelator 610 achieves the function illustrated by equation (11a) above, ie, it has an autocorrelation coefficient corresponding to k = 1, m = 1, 2, 3, ..., T-1, T. Calculate U ₁ (m). These autocorrelation coefficients are transmitted to the end correction coefficient calculator 622 via the switch 620.

【０１０４】第１のベクトルの自己相関係数計算機６１
０はレジスタ６１２および６１４を具備し、これらのレ
ジスタにはコードブックにおける最初のＭ（例えば、２
０）のサンプルがロードされる。レジスタ６１２，６１
４は便宜的にはよく知られた直列入力／並列出力レジス
タであるが、技術上よく知られた他の構成も使用するこ
とができる。First vector autocorrelation coefficient calculator 61
0 comprises registers 612 and 614, which contain the first M (eg 2
0) sample is loaded. Registers 612, 61
4 is a well known serial input / parallel output register for convenience, but other configurations well known in the art can be used.

【０１０５】前記サンプル値は自己相関器６１６に転送
され、該自己相関器６１６はｍ＝０に対する積Ｕ
_１（ｍ）＝ＳＵＭ［Ｃ_１（ｎ）Ｃ_１（ｎ＋ｍ）］（すな
わち、Ｕ_１（０））の和を決定しかつこの係数をスイッ
チ６２０を介してブロック６２２にクロック入力する。
自己相関器６１６は次にレジスタ６１４におけるサンプ
ルをｍ＝１に対応して、ブロック６１８を介し、１サン
プルだけシフトしかつＵ１（１）を計算し、これは次に
ブロック６２２にクロック出力される。この手順が初期
ベクトルＣ_１（ｎ）に対するすべての自己相関係数が決
定されかつブロック６２２にロードされるまで続けられ
る。スイッチ６０８および６２０は次に自己相関発生器
６１０をクロック６２２から切離す。The sampled values are transferred to an autocorrelator 616, which in turn multiplies the product U for m = 0.
₁ (m) = SUM [C ₁ (n) C ₁ (n + m)] (ie, U ₁ (0)) is determined and this coefficient is clocked into block 622 via switch 620.
The autocorrelator 616 then shifts the sample in register 614 by one sample and computes U1 (1), corresponding to m = 1, through block 618, which is then clocked out to block 622. . This procedure continues until all autocorrelation coefficients for the initial vector C ₁ (n) have been determined and loaded into block 622. Switches 608 and 620 then disconnect autocorrelation generator 610 from clock 622.

【０１０６】ブロック６２２は前記式（１１ｂ）および
（１２ａ）から（１２ｂ）までによって記述される機能
を達成する。これは便宜的にはレジスタ６２４、乗算器
６２６、加算器６２８、レジスタ−アキュムレータ６３
０、乗算器６３２および出力バッファ６３４の組合わせ
によって発生される。レジスタ６２４，６３０およびバ
ッファ６３４は便宜的には（例えば図５に示される）レ
ジスタ６１２，６１４と同じ長さを有するが、どれだけ
多くの自己相関係数が評価され引続くベクトルに対し更
新されることが望まれるかに応じてより長くあるいはよ
り短くすることができる。例えば、レジスタ６２４，６
３０およびバッファ６３４はフレーム長と同じくらいの
大きさとすることができる。Block 622 performs the functions described by equations (11b) and (12a) through (12b) above. For convenience, this is a register 624, a multiplier 626, an adder 628, a register-accumulator 63.
0, generated by the combination of multiplier 632 and output buffer 634. Registers 624, 630 and buffer 634 are expediently of the same length as registers 612, 614 (shown in FIG. 5, for example), but how many autocorrelation coefficients are evaluated and updated for subsequent vectors. It can be longer or shorter depending on what is desired. For example, registers 624, 6
30 and buffer 634 can be as large as the frame length.

【０１０７】レジスタの要素６３０はそれに対し後続の
ベクトルに対する自己相関係数を決定するためにエンド
コレクションが加えられるべき前に計算された自己相関
係数を含む。エンドコレクションは乗算器６２６と組合
わされたレジスタ６２４によって与えられる。乗算器６
２６からのエンドコレクションは加算器６２８において
レジスタ６３０からの前に計算された係数と加算され、
かつループ６２９を介してレジスタ６３０を更新するた
めフィードバックされる。レジスタ６３０から、自己相
関係数が乗算器６３２に転送され、そこでこれらは適切
なＬ／（Ｍ＋ｋ−１）のファクタによってスケーリング
されかつ出力バッファ６３４に送られ、該出力バッファ
においてそれらは、例えば、図４の出力５６１′を形成
し、この場合自己相関発生器６００がエレメント５６
０′を（Ｍ＋ｋ−１）≦Ｌに対しより詳細に記述する。Register element 630 contains the autocorrelation coefficient calculated before which the end correction should be added to determine the autocorrelation coefficient for the subsequent vector. The end correction is provided by register 624 associated with multiplier 626. Multiplier 6
The end correction from 26 is added with the previously calculated coefficient from register 630 in adder 628,
And is fed back via loop 629 to update register 630. From the register 630, the autocorrelation coefficients are transferred to a multiplier 632 where they are scaled by an appropriate factor of L / (M + k−1) and sent to an output buffer 634 where they are, for example, The output 561 'of FIG. 4 is formed, in this case the autocorrelation generator 600 is the element 56
0'is described in more detail for (M + k-1) ≤L.

【０１０８】ブロック６２２の動作をさらに詳細に説明
すると、レジスタ６２４はベクトル値によってレジスタ
６１２，６１４と同時にロードされる。レジスタ６３０
は自己相関器６１０がブロック６２２から切離される前
に第１のベクトルの自己相関係数発生器６１０の出力Ｕ
_１（ｍ）によりロードされる。これらの初期自己相関係
数は乗算器６３２にコピーされ、そこでこれらはＬ／Ｍ
によって乗算されかつバッファ６３４に送られ、該バッ
ファ６３４からそれらは図２から図４に関して説明した
合成による分析手順の間に抽出される。To describe the operation of block 622 in more detail, register 624 is loaded simultaneously with registers 612 and 614 by vector value. Register 630
Is the output U of the first vector autocorrelation coefficient generator 610 before the autocorrelator 610 is disconnected from the block 622.
Loaded by ₁ (m). These initial autocorrelation coefficients are copied to multiplier 632 where they are L / M.
Are sent to a buffer 634, from which they are extracted during the analysis procedure by synthesis described with respect to FIGS.

【０１０９】レジスタ６３０が最初のＴの自己相関係数
値によってロードされた後、付加的なベクトル値がレジ
スタ６２４にクロック入力されかつレジスタ６２４の各
段のベクトル値が矢印６２５で示されるようにクロック
出力される。初期ベクトルがＭの値を有するものと仮定
すると、今レジスタ６２４に存在する最も最近の値はｎ
＝Ｍ＋１である。これはベクトルｋ＝２に対応するが、
それは各ベクトルは前のベクトルとｎ＝（Ｍ＋ｋ−１）
＝Ｌまで１つのエントリの加算分だけ異なるからであ
る。After register 630 is loaded with the first T autocorrelation coefficient value, additional vector values are clocked into register 624 and the vector value of each stage of register 624 is clocked as indicated by arrow 625. Is output. Assuming the initial vector has a value of M, the most recent value now in register 624 is n.
= M + 1. This corresponds to the vector k = 2,
It means that each vector is the same as the previous vector, n = (M + k-1)
This is because one entry is different up to = L.

【０１１０】新しい値ｎ＝Ｍ＋１は乗算器６２６１にお
いてそれ自身により乗算されその結果は加算器６２８１
に伝達されそこでそれは既にレジスタ要素６３０１に記
憶されている０次のＵ_１（ｍ＝０）係数と組合わされ
る。レジスタ要素６３０１は次に矢印６２９１で示され
るように更新され、それによりＵ_１（０）＋Ｃ_ｋ（Ｍ＋
１）Ｃ_ｋ（Ｍ＋１）の和がいまやレジスタ要素６３０１
に存在しかつ乗算器６３２に転送されそこでＬ／（Ｍ＋
１）により乗算されかつ、６３２において同じファクタ
により乗算されたレジスタ６３０の他の要素からの他の
更新された係数値とともに、バッファ６３４にロードさ
れる。カウンタ６４０が設けられてレジスタ６２４にロ
ードされたコードブックベクトルのエントリの数を追跡
しかつ乗算器６３２における乗算ファクタを調整し、そ
れによりそれがｋ＝１に対してはＬ／（Ｍ）に対応し、
ｋ＝２に対してはＬ／（Ｍ＋１）に対応し、かつ以下同
様に（Ｍ＋ｋ−１）＝Ｌまで対応するようにする。The new value n = M + 1 is multiplied by itself in the multiplier 6261 and the result is the adder 6281.
Where it is transmitted to it is already U _{1 (m} = 0) of the 0th order stored in the register element 6301 combined with coefficient. Register element 6301 is then updated as indicated by arrow 6291, which causes U ₁ (0) + C _k (M +
1) The sum of C _k (M + 1) is now the register element 6301
Is present at L / (M +
1) and loaded in buffer 634 with other updated coefficient values from other elements of register 630 multiplied by 632 by the same factor. A counter 640 is provided to keep track of the number of entries in the codebook vector loaded into register 624 and adjust the multiplication factor in multiplier 632 so that it is L / (M) for k = 1. Correspondingly,
For k = 2, it corresponds to L / (M + 1), and similarly up to (M + k-1) = L.

【０１１１】レジスタ６２４からのサンプルＣ_ｋ（Ｍ）
は乗算器６２６２においてＣ_ｋ（Ｍ＋１）によって乗算
されかつ加算器６２８２においてレジスタ要素６３０２
からのＵ_１（１）と加算され、この和は接続６２９２を
介してレジスタ要素６３０２を更新する。更新された値
は乗算器６３２に送られ、そこでＬ／（Ｍ＋１）により
乗算されかつバッファ６３４に送られる。レジスタ６２
４における残りのサンプルは同様にして処理されかつ次
に他のサンプル、例えば、ｎ＝Ｍ＋２、がレジスタ６２
４にクロック入力されかつその処理が反復される。この
ようにして、自己相関係数が前のベクトルに対しもう１
つのサンプルを加算することにより形成された各々の新
しいベクトルに対しバッファ６３４において得られこれ
は実例ＩＶ−Ｖにおける単純化された形式で示されたも
のと同様にして行なわれる。Sample C _k (M) from register 624
Is multiplied by C _k (M + 1) in multiplier 6262 and register element 6302 in adder 6228.
From U ₁ (1) from which the sum updates register element 6302 via connection 6292. The updated value is sent to multiplier 632, where it is multiplied by L / (M + 1) and sent to buffer 634. Register 62
The remaining samples in 4 are processed in a similar manner and then another sample, for example n = M + 2, is registered in register 62.
4 is clocked in and the process is repeated. In this way, the autocorrelation coefficient is
Obtained in buffer 634 for each new vector formed by adding the two samples, this being done in a manner similar to that shown in simplified form in Examples IV-V.

【０１１２】一時的記憶要素６１２，６１４，６２４，
６３０および６３４はレジスタまたはバッファとして記
述されているが、当業者は本明細書の開示に基づきこれ
は単に例示の便宜のためのものでありかつ、例えばこれ
に限定されるものではないが、ランダムアクセス可能な
メモリ、内容アドレス可能なメモリ、その他のような他
の形式のデータ記憶装置も使用できることを理解するで
あろう。さらに、そのようなメモリは広範囲の物理的構
成、例えば、フリップフロップ、レジスタ、コアおよび
半導体メモリ素子とすることができる。ここで使用され
ている用語「レジスタ」および「バッファ」は、単数で
あれ複数であれ、どのような種類のものであれあるいは
構成であれ任意の修正可能な情報記憶を含むことを意図
している。同様に、例えば、自己相関器６１６、インデ
クス装置（ｉｎｄｅｘｅｒ）６１８、スイッチ６０８，
６２０、加算器６２８、乗算器６２６および／またはカ
ウンタ６４０として示される他のブロックは別個の素子
または素子の組合わせであれ、標準のまたはアプリケー
ション特定集積回路であれ、所望の機能を達成可能なプ
ログラムされた汎用プロセッサであれ、あるいはこれら
を別個にまたは組合わせた任意の形式の等価な機能を含
むことを意図している。Temporary storage elements 612, 614, 624,
Although 630 and 634 are described as registers or buffers, one of ordinary skill in the art, based on the disclosure herein, is for convenience of illustration only and is not limited to, but is not limited to, random. It will be appreciated that other types of data storage such as accessible memory, content addressable memory, etc. may be used. Further, such memory can be in a wide variety of physical configurations, such as flip-flops, registers, cores and semiconductor memory devices. The terms "register" and "buffer" as used herein are intended to include any modifiable information store, whether singular or plural, of any kind or configuration. .. Similarly, for example, an autocorrelator 616, an indexer 618, a switch 608,
620, adder 628, multiplier 626 and / or other blocks shown as counter 640, whether discrete elements or combinations of elements, standard or application specific integrated circuits, are programs capable of achieving the desired functionality. General purpose processor, or any combination thereof, either separately or in combination, is intended to be included.

【０１１３】本発明は、標準フレーム長に小さな数のコ
ードブックサンプルを拡張する上で生ずる従来のコピー
アップエラーを導入することなく短時間ピッチ周期を検
出するために必要なより短いセットのコードブックベク
トルサンプル（例えば、２０）に基づき標準分析フレー
ム長（例えば、６０）に対する自己相関係数を決定する
迅速かつ簡単な方法を提供する。音声品質の犠牲なしに
計算機的な負担が低減されるが、それは従来技術のコピ
ーアップ構成に関連するエンド自己相関加算−削除エラ
ーが避けられるからである。コピーアップは完全に避け
られる。The present invention provides a shorter set of codebooks needed to detect short pitch periods without introducing the traditional copy-up errors that occur in extending a small number of codebook samples to standard frame lengths. It provides a quick and easy way to determine the autocorrelation coefficient for a standard analysis frame length (eg 60) based on vector samples (eg 20). The computational burden is reduced without sacrificing voice quality because the end autocorrelation add-delete errors associated with prior art copy-up configurations are avoided. Copy-up is completely avoided.

【０１１４】自己相関係数を発生するための本発明の方
法がハードウェアレジスタ、自己相関器、乗算器、加算
器、スイッチなどに関連して上に述べられたが、当業者
はこれらはソフトウェアによって実現でき、それにより
前記装置についてここに説明したのと同じ機能を達成し
かつ本明細書に記載された実施例の詳細な説明に基づき
本発明の方法を実施するためにコンピュータを構成する
ことができ、かつそのような変形は本発明の範囲内であ
ることを理解するであろう。Although the method of the present invention for generating an autocorrelation coefficient has been described above in connection with hardware registers, autocorrelators, multipliers, adders, switches, etc., those skilled in the art will understand that these are software. And configuring a computer to carry out the method of the invention based on the detailed description of the embodiments described herein, which can be realized by the following, thereby achieving the same functions as described herein for the device. It will be understood that, and such variations are within the scope of the invention.

【０１１５】上に述べた改良は目標音声を複製するため
の最適のコードブックベクトルを決定することに関連す
る計算機的な負担を大幅に低減するが、さらなる改良が
望まれる。特に、確率的コードブックの最適のベクトル
が識別される様式において改良が望まれる。Although the improvements described above significantly reduce the computational burden associated with determining the optimal codebook vector for replicating the target speech, further improvements are desired. In particular, improvements are desired in the manner in which the optimal vector of the probabilistic codebook is identified.

【０１１６】米国特許第４，７９７，９２５号、リン
（Ｌｉｎ）はオーバラップする確率的ベクトルの使用に
より確率的コードブックにおけるすべてのベクトルを考
慮する計算機的な負担を低減する手順を述べている。リ
ンの構成によれば、コードブックにおける各々の引続く
ベクトルは古い値が該ベクトルの一端からドロップされ
かつ新しい値が該ベクトルの他端に加えられることによ
り先行するベクトルと異なっている。この構成によれ
ば、各々６０の値を有する１０２４のベクトルからなる
コードブックにおける独自的な値の数は１０２４×６０
＝６１，４４０から６０＋１０２３＝１，０８３に低減
される。これでもなお、分析を実行するために多数の計
算が依然として必要でありかつそのステップはそれらが
引続く乗算および加算を含むため時間を浪費するもので
ある。US Pat. No. 4,797,925, Lin, describes a procedure that reduces the computational burden of considering all vectors in a stochastic codebook by using overlapping stochastic vectors. . According to Lin's construction, each subsequent vector in the codebook differs from the preceding vector by the old value being dropped from one end of the vector and the new value being added to the other end of the vector. With this configuration, the number of unique values in the codebook of 1024 vectors, each having 60 values, is 1024 × 60.
= 61,440 to 60 + 1023 = 1,083. Even then, a large number of computations are still required to perform the analysis and the steps are time consuming as they involve subsequent multiplications and additions.

【０１１７】確率的コードブック１８０（図２の（ｂ）
を参照）は長さＮのＫのベクトルＳ_ｋ（ｎ）を含み、こ
の場合ｋ＝１〜Ｋかつｎ＝１〜Ｎ、そしてＫは便宜的に
は５１２，１０２４，２０４８，などであり、典型的に
は１０２４であり、かつＮは便宜的には２０，４０，６
０，１２０などであり、典型的には６０である。確率的
コードブックベクトルＳ_ｋ（ｎ）に対する指数ｋおよび
ｎは適応コードブックベクトルＣ_ｋ（ｎ）に対するもの
と同じ意味を持ち、すなわち、ｋはどのベクトルが考慮
されているかを識別しかつｎはベクトルｋ内で考慮され
ている値を識別する。確率的コードブック１８０のベク
トルに対する指数限界ＫおよびＮは適応コードブック１
５５のベクトルに対する指数限界ＫおよびＮと同じ大き
さを有することが都合がよいが、これは必須のものでは
ない。単に説明の便宜のためかつ限定的なものではなし
に、ＫおよびＮは両方のコードブックに対し同じ値を持
つようにすることができ、例えば、Ｋ＝１０２４かつＮ
＝６０とすることができる。Probabilistic codebook 180 ((b) of FIG. 2)
Contains a vector S _k (n) of K of length N, where k = 1 to K and n = 1 to N, and K is conveniently 512, 1024, 2048, and so on, Typically 1024, and N is conveniently 20,40,6
0, 120, etc., and typically 60. The indices k and n for the stochastic codebook vector S _k (n) have the same meaning as for the adaptive codebook vector C _k (n), ie k identifies which vector is considered and n is Identify the values considered in the vector k. The exponential bounds K and N for the vectors of the stochastic codebook 180 are adaptive codebook 1
It is convenient to have the same magnitude as the exponential limits K and N for the 55 vectors, but this is not essential. For convenience of explanation only and not by way of limitation, K and N may have the same value for both codebooks, eg K = 1024 and N.
= 60.

【０１１８】確率的コードブック１８０におけるベクト
ルは便宜的には擬似ランダムな０および１または０、１
および−１のリニアアレイである。すなわち、各ベクト
ルＳ_ｋ（ｎ）はＮの値のつながりであり、各々の値は指
数ｎによって識別される。図６は、コードブック１８０
に類似しているがＫ＝８かつＮ＝２０の例示的な３進の
（例えば、０，１，−１）確率的コードブック１８０′
を示す。当業者は本明細書の記載に基づき図６のコード
ブックの特徴的機能がどのようにしてＫおよびＮのより
大きな値に適用されるかを理解するであろう。さらに、
図６は３進の（例えば、０，１，−１）コードブックを
示しているが、２進の（例えば、０，１または０，−
１）あるいは他の形式のコードブックも使用できる。３
進コードブックが好ましい。Vectors in the probabilistic codebook 180 are conveniently pseudo-random 0 and 1 or 0,1.
And a linear array of -1. That is, each vector S _k (n) is a concatenation of N values, each value identified by an index n. FIG. 6 shows a codebook 180.
An exemplary ternary (e.g., 0,1, -1) stochastic codebook 180 'with K = 8 and N = 20, similar to
Indicates. Those of ordinary skill in the art will understand, based on the description herein, how the characteristic features of the codebook of FIG. 6 apply to larger values of K and N. further,
Although FIG. 6 shows a ternary (eg, 0,1, −1) codebook, a binary (eg, 0,1 or 0, −)
1) or other types of codebooks can also be used. Three
A hex codebook is preferred.

【０１１９】ｋの各々の引続く値に対する図６のベクト
ルＳ_ｋ（ｎ）はＮ−２だけオーバラップする。例えば、
ベクトルＳ_ｋ＝２（ｎ）はベクトルＳ_ｋ＝１（ｎ）から
ベクトルＳ_１（ｎ）の左端から２つの古い値をドロップ
させかつベクトルＳ_１（ｎ）の右端において２つの新し
い値を加えることにより異なっている。従って、ベクト
ルＳ_２（ｎ）の値はベクトルＳ_１（ｎ）に比較して左に
２つの場所だけシフトしておりかつ右端に２つの新しい
値がある。各々の連続するベクトルは前のベクトルと同
様に異なっている。オーバラップ量、例えば、図６のＮ
−２、の選択は便宜的なものであるが、必須のものでは
ない。任意の値のオーバラップ、例えば、１〜Ｎ−１を
使用することができる。また、前記ベクトルは右側に新
しい値を付加することにより左にシフトされるものとし
て説明されたが、逆の方法も使用することができ、すな
わち、右にシフトしかつ左側に新しい値を加算すること
もできる。The vector S _k (n) of FIG. 6 for each successive value of _k overlaps by N-2. For example,
Vector _{S k =} 2 (n) adds two new values at the right end of the vector _{S k =} 1 from (n) to drop two old value from the left end of the vector _S 1 (n) and the vector _S 1 (n) It depends on things. Therefore, the value of vector S ₂ (n) is shifted by two places to the left compared to vector S ₁ (n) and there are two new values at the right end. Each successive vector is as different as the previous vector. Overlap amount, for example N in FIG.
The selection of -2 is convenient, but not essential. Any value of overlap can be used, for example 1-N-1. Also, although the vector has been described as being shifted to the left by adding a new value to the right, the reverse method can also be used, i.e. shifting to the right and adding the new value to the left. You can also

【０１２０】最適の確率的コードブックベクトルを識別
するめたの分析手順は実質的に適応コードブックベクト
ルに対するものと同じであるが、Ｃ_ｋ（ｎ）の代わりに
Ｓ_ｋ（ｎ）が使用され、すなわち、コードブック１５５
の代りにコードブック１８０が使用され、かつ知覚的に
重み付けされた短時間遅延目標音声信号Ｘ（ｎ）（図２
の（ａ）から（ｂ）までの１５１を参照）の代りに知覚
的に重み付けされた短時間および長時間遅延目標音声信
号Ｙ（ｎ）（図２の（ａ）および（ｂ）の１７６を参
照）が使用されている。以下の式（１′），（２′），
（５′）および（６′）はそれぞれ前に与えられた式
（１），（２），（５），（６）と類似しているが、適
応コードブックに対し前に述べたものの代わりに確率的
コードブックのための適切な変数が使用されている。 The analysis procedure for identifying the optimal stochastic codebook vector is essentially the same as for the adaptive codebook vector, except that S _k (n) is used instead of C _k (n), That is, the codebook 155
Codebook 180 is used instead of and the perceptually weighted short delay target speech signal X (n) (FIG. 2).
Instead of (a) to (b) 151 of FIG. 2), the perceptually weighted short and long delay target speech signal Y (n) (176 of (a) and (b) of FIG. (See) is used. The following equations (1 '), (2'),
(5 ') and (6') are similar to equations (1), (2), (5), and (6) given above, respectively, but instead of those previously described for the adaptive codebook. Appropriate variables for probabilistic codebooks are used in.

【０１２１】確率的コードブックと適応コードブックと
の間の重要な差異は確率的コードブック１８０を構成す
るベクトルはコードブック１５５において生ずるように
合成による分析のプロセスの結果として変化せず、固定
されている。従って、式（１′）から（６′）までによ
って表わされる計算の多くはフレームごとに一回行なう
ことができかつ結果は記憶されかつ再使用される。例え
ば、確率的コードブックベクトルの自己相関は一度だけ
行なえばよいが、それはその結果が不変であるからであ
る。自己相関係数は便宜的にルックアップテーブルに記
憶されかつ再び計算する必要はない。これは計算機的な
負担を大幅に簡素化する。The important difference between the probabilistic codebook and the adaptive codebook is that the vectors that make up the probabilistic codebook 180 do not change as a result of the synthetic analysis process as occurs in the codebook 155 and are fixed. ing. Therefore, many of the calculations represented by equations (1 ') through (6') can be done once per frame and the results stored and reused. For example, autocorrelation of a stochastic codebook vector only needs to be done once, because the result is invariant. The autocorrelation coefficient is conveniently stored in a look-up table and does not need to be calculated again. This greatly simplifies the computational burden.

【０１２２】確率的コードブックベクトルのどれが目標
音声を最もよく表わすかを決定する上で必要なプロセス
は確率的コードブックベクトルＳ_ｋ（ｎ）の値の式
（１′），（２′），（５′），（６′）により名目上
必要とされる他の信号による乗算を除去することにより
実質的に単純化できかつより高速にすることができるこ
とが発見された。本発明の手段および方法は確率的コー
ドブックベクトルを含む相互相関操作に最も有用に適用
できるが、それはまた確率的コードブックベクトルを含
むコンボリューション操作にも適用できる。説明の便宜
のため、本発明の構成は相関操作または演算について説
明するが、当業者は本明細書の記述に基づきそれがどの
ようにしてコンボリューション操作に適用できるかを理
解するであろう。The process required to determine which of the probabilistic codebook vectors best represents the target speech is the equations (1 '), (2') of the values of the probabilistic codebook vector S _k (n). , (5 '), (6'), it has been found that it can be substantially simplified and faster by eliminating the multiplication by other signals nominally required. While the means and methods of the present invention are most usefully applicable to cross-correlation operations involving stochastic codebook vectors, they are also applicable to convolution operations involving stochastic codebook vectors. For convenience of explanation, the configuration of the present invention describes a correlation operation or operation, but those skilled in the art will understand how based on the description herein how it can be applied to a convolution operation.

【０１２３】相互相関は第１の実施例においてマルチプ
レクサ−アキュムレータの組合わせにより達成され、該
マルチプレクサの選択ラインは前記コードブックまたは
１つまたはそれ以上の前記コードブックの複製により駆
動される。これは図７から図１０を参照してより詳細に
説明する。Cross-correlation is achieved in the first embodiment by a multiplexer-accumulator combination, the select lines of the multiplexer being driven by the codebook or one or more replicas of the codebook. This will be explained in more detail with reference to FIGS.

【０１２４】図７は本発明による確率的コードブック相
互相関器７００の単純化されたブロック図である。相関
器７００は３進（例えば、０，１，−１）コードブック
の場合について示されている。当業者は本明細書の記載
に基づいて本発明は２進および他の形式のコードブック
にも同様に適用できることを理解するであろう。以下に
説明する手順はまたコードブックベクトルを他の信号に
よりコンボルブするために使用できる。FIG. 7 is a simplified block diagram of a stochastic codebook cross-correlator 700 according to the present invention. Correlator 700 is shown for the ternary (eg, 0,1, -1) codebook case. Those skilled in the art will understand based on the description herein that the present invention is equally applicable to binary and other forms of codebooks. The procedure described below can also be used to convolve the codebook vector with other signals.

【０１２５】相関器７００は入力７０１を有し、そこで
コードブックベクトルと相互相関されるべき信号（単数
または複数）７０２、制限的な意味ではなく例えば、式
（５′）からの信号Ｗ′（ｎ）、あるいはコードブック
ベクトルＳ_ｋ（ｎ）と相関されるべき他の信号、を受信
する。入力７０１において受信された信号７０２は一般
にある指数、例えばｎまたはｍであって１からＮに及ぶ
もの、によって識別されるＮの値を有するベクトルであ
る。例えば、式（６′）が評価または概算される場合
は、Ｗ′（ｎ）が入力７０１に現われる。もし式
（１′）が見積られる場合には、Ｈ（ｎ−ｍ＋１）が入
力７０１に現われる。本発明の構成は特に音声ボコーダ
（ＶＯＣＯＤＥＲＳ）に関して有用であるが、それはま
た任意の信号または同様の形式の信号ストリングに関し
て使用できる。Correlator 700 has an input 701, where signal (s) 702 are to be cross-correlated with the codebook vector, 702, but not in a limiting sense, eg, signal W '(from equation (5')). n), or another signal to be correlated with the codebook vector S _k (n). The signal 702 received at input 701 is generally a vector having N values identified by some index, for example n or m ranging from 1 to N. For example, if equation (6 ') is evaluated or estimated, W' (n) appears at input 701. If equation (1 ') is estimated, H (n-m + 1) appears at input 701. The arrangement of the invention is particularly useful for voice vocoders, but it can also be used for any signal or similar type of signal string.

【０１２６】説明の便宜のため、本発明の手段および方
法は式（６′）の評価（ｅｖａｌｕａｔｉｏｎ）に関し
て説明されるが、当業者は本明細書の記載に基づきそれ
は一方のベクトルまたはベクトルアレイが、例えば制限
的なものでないが１，０または−１，０または−１，
０，１のような、固定値を有し、他方が可変である２つ
のベクトルまたはベクトルアレイの積の任意の他の和に
適用できることを理解するであろう。式（６′）の評価
は指数ｋの各々の値に対し単一の相互相関値Ｑ（ｋ）を
生成し、すなわちを発生する。For convenience of explanation, the means and method of the present invention will be described in terms of evaluation of equation (6 '), but one of ordinary skill in the art will understand that one vector or vector array , For example, but not limiting, 1,0 or -1,0 or -1,
It will be appreciated that it is applicable to any other sum of the products of two vectors or vector arrays that have fixed values, such as 0, 1 and the other is variable. The evaluation of equation (6 ') produces a single cross-correlation value Q (k) for each value of index k, ie To occur.

【０１２７】入力７０１に供給されるベクトル信号７０
２（例えば、Ｗ′（ｎ））はマルチプレクサ７０４，７
０５に転送される。マルチプレクサ７０４は図８により
詳細に図示されておりかつマルチプレクサ７０５は実質
的にこれと同じである。マルチプレクサ７０４にはメモ
リ７０６、例えば、コードブック１８０における“１”
に対応する非ゼロエントリを有するＲＯＭまたはＥＰＲ
ＯＭが接続されている。図９はメモリ７０６に類似した
メモリ７０６′の内容を示しているが、Ｋ＝９かつＮ＝
２０であり、かつ図６のコードブック１８０′の内容に
対応している。指数ｋおよびｎはメモリ７０６（かつメ
モリ７０７）に関してコードブック１８０における場合
と同じ機能を有し、すなわち、ｋはベクトルまたはベク
トルに対応する他のデータストリングを識別し、かつｎ
は該ベクトルまたはストリング内の値を識別する。メモ
リ７０６，７０６′はコードブック１８０，１８０′に
１が表われるところを除きいずれにもゼロを有する（図
６と図９とを比較されたい）。メモリ７０６の出力はマ
ルチプレクサ７０４の選択ライン７０８に接続され、そ
れにより各々の値ｋ，ｎが入力７０１に与えられている
ベクトルの値に対して作用する特定の選択ラインｎを制
御する。Vector signal 70 applied to input 701
2 (eg W '(n)) is the multiplexer 704, 7
05. Multiplexer 704 is shown in more detail in FIG. 8 and multiplexer 705 is substantially the same. The multiplexer 704 has a memory 706, for example, “1” in the codebook 180.
ROM or EPR with non-zero entries corresponding to
OM is connected. FIG. 9 shows the contents of memory 706 ', which is similar to memory 706, but with K = 9 and N =
20 and corresponds to the contents of the codebook 180 'of FIG. The indices k and n have the same function with respect to memory 706 (and memory 707) as in codebook 180, that is, k identifies a vector or another data string corresponding to the vector, and n
Identifies the values in the vector or string. Memories 706 and 706 'have zeros except where ones appear in codebooks 180 and 180' (compare FIGS. 6 and 9). The output of memory 706 is connected to select line 708 of multiplexer 704, thereby controlling the particular select line n with which each value k, n acts on the value of the vector provided at input 701.

【０１２８】マルチプレクサ７０５にはメモリ７０６と
類似しているがコードブック１８０における−１に対応
する非ゼロエントリを有するメモリ７０７が接続されて
いる。図１０はメモリ７０７に類似しているがＫ＝８か
つＮ＝２０であり、かつ図６のコードブック１８０′に
対応するメモリ７０７′の内容を示す。メモリ７０７，
７０７′はコードブック１８０，１８０′に−１が表わ
れるすべてのところに１を有しかつそれ以外はゼロを有
する（図６および図１０を比較されたい）。メモリ７０
７の出力はマルチプレクサ７０５の選択ライン７０９に
接続され、それにより各々の値ｋ，ｎが入力７０１に与
えられたベクトルの値に作用する特定の選択ラインｎを
制御する。Connected to the multiplexer 705 is a memory 707 similar to the memory 706 but having a non-zero entry corresponding to -1 in the codebook 180. FIG. 10 shows the contents of memory 707 'which is similar to memory 707, but with K = 8 and N = 20, and which corresponds to codebook 180' of FIG. Memory 707,
707 'has a 1 everywhere a -1 appears in the codebook 180, 180' and a zero elsewhere (compare FIGS. 6 and 10). Memory 70
The output of 7 is connected to select line 709 of multiplexer 705, thereby controlling the particular select line n with which each value k, n acts on the value of the vector provided at input 701.

【０１２９】メモリ７０６，７０７はアドレスシーケン
サ７１４によって制御される。第１の（すなわち、ｋ＝
１）コードブックベクトルと相関されるために信号ベク
トル７０２が入力７０１に与えられると、シーケンサ７
１４はメモリ７０６，７０７のｋ＝１のデータセットを
アクセスし、かつｋ＝１に対する値ｎ＝１〜ｎ＝Ｎを選
択ライン７０８，７０９上の対応するマルチプレクサ７
０４，７０５に転送する。選択ライン７０８，７０９に
表われる値はマルチプレクサ７０４，７０５を入力ベク
トル７０２の適切な値がアキュムレータ７１２，７１３
に渡るようにし、該アキュムレータ７１２，７１３にお
いてそれらの値が加算されて出力７１６，７１７を発生
する。出力７１６，７１７はコンバイナ７２０において
組合わされ第１の相互相関、すなわち、Ｑ（１）を出力
７２１に提供する。The memories 706 and 707 are controlled by the address sequencer 714. The first (ie, k =
1) When the signal vector 702 is applied to the input 701 to be correlated with the codebook vector, the sequencer 7
14 accesses the k = 1 data set of the memories 706, 707 and outputs the values n = 1 to n = N for k = 1 to the corresponding multiplexers 7 on the select lines 708, 709.
04,705. The values appearing on select lines 708 and 709 pass through multiplexers 704 and 705 when the appropriate values of input vector 702 are accumulators 712 and 713.
And the values are added in the accumulators 712 and 713 to generate outputs 716 and 717. Outputs 716,717 are combined in combiner 720 to provide a first cross-correlation, Q (1), at output 721.

【０１３０】シーケンサ７１４は次にメモリ７０６，７
０７におけるｋ＝２の値を選択し、かつｋ＝２に対する
その中のｎ＝１〜Ｎの値をマルチプレクサ７０４，７０
５の選択ライン７０８，７０９に転送し、以下同様に第
２の相互相関、すなわち、Ｑ（２）を出力７２１に発生
する。このプロセスはある音声フレームに対する入力ベ
クトル信号７０２がメモリ７０６，７０７におけるエン
トリによって表わされるコードブックベクトルと相関さ
れ相互相関値Ｑ（１），…，Ｑ（Ｋ）を得るまで繰返さ
れる。Ｑ（ｋ＝ｊ）の大きな値を有する指数ｋ＝ｊの確
率的なベクトルは通常Ｑ（ｋ＝ｉ）のより小さな値を有
する他のベクトルｋ＝ｉよりも良好な音声表現を与え
る。The sequencer 714 then proceeds to the memory 706, 7
The value of k = 2 at 07 and the values of n = 1 to N therein for k = 2 are multiplexers 704, 70.
5 select lines 708 and 709, and so on, to produce a second cross-correlation, Q (2), at output 721. This process is repeated until the input vector signal 702 for a speech frame is correlated with the codebook vector represented by the entries in memories 706, 707 to obtain the cross-correlation values Q (1), ..., Q (K). Probabilistic vectors of index k = j with large values of Q (k = j) usually give better phonetic representation than other vectors k = i with smaller values of Q (k = i).

【０１３１】３進のコードブックに対しては２つのメモ
リ７０６，７０７を使用することが都合がよいが、コー
ドブック１８０において使用されるコーディングの形式
に従ってより多くまたはより少ないメモリを使用するこ
とができる。例えば、２進コードブックに対しては１つ
のメモリのみを使用する必要があり、かつコードブック
それ自体はもしそれがｎ＝１〜Ｎに対応する０，１の値
を各指数ｋに対してマルチプレクサの選択ラインに伝達
できればメモリとして十分である。従って、２進コード
ブックあるいはその等価物の場合は、別個のメモリは必
要でなくかつコードブックそれ自体がマルチプレクサの
選択ラインに信号を供給するために使用できる。Although it is convenient to use two memories 706, 707 for a ternary codebook, it may be possible to use more or less memory depending on the type of coding used in codebook 180. it can. For example, for a binary codebook it is necessary to use only one memory, and the codebook itself has a value of 0,1 for each index k, which corresponds to n = 1 to N. If it can be transmitted to the select line of the multiplexer, it is sufficient as a memory. Therefore, in the case of a binary codebook or its equivalent, no separate memory is needed and the codebook itself can be used to feed the select lines of the multiplexer.

【０１３２】次に図８を参照して、マルチプレクサ７０
４の動作を説明する。マルチプレクサ７０５の構成およ
び動作は同じである。マルチプレクサ７０４は一般に、
Ｇ１，…，ＧＮによって示される、Ｎのゲート７１５を
有するＮ×Ｎのマルチプレクサである。各ゲート７１５
の１つの入力は入力信号ベクトル７０２の特定の値（指
数ｎによって識別される）を受信するために入力７０１
に接続され、他の入力７０３はシステムの論理０の基準
レベル、たとえば、グランドに接続されている。ゲート
７１５は出力７１０を、選択ライン７０８上に存在する
論理信号によって決定されるように、入力７０１（すな
わち、信号７０２）または入力７０３（すなわち、
「０」）に接続する。図示された構成に対しては、たと
えば、選択ライン７０８のラインｎ＝ｉの１の値は入力
ベクトル７０２のｎ＝ｉの値（入力７０１のｎ＝ｉライ
ンに現われる）を出力７１０のｎ＝ｉのラインに転送さ
れるようにし、そうでなければ０の値が転送される。類
似の結果を有する任意の等価な論理構成も使用できる。Next, referring to FIG. 8, the multiplexer 70
The operation of No. 4 will be described. The configuration and operation of the multiplexer 705 are the same. The multiplexer 704 is generally
, GN, N × N multiplexer with N gates 715. Each gate 715
Input 701 to receive a particular value of input signal vector 702 (identified by index n).
And the other input 703 is connected to the system logic zero reference level, eg, ground. Gate 715 outputs 710 as input 701 (ie, signal 702) or input 703 (ie, as determined by the logic signal present on select line 708).
"0"). For the configuration shown, for example, a value of 1 on line n = i of select line 708 produces a value of n = i of input vector 702 (appears on line n = i of input 701), n = of output 710. to be transferred to line i, otherwise a value of 0 is transferred. Any equivalent logic configuration with similar results can be used.

【０１３３】マルチプレクサ７０４は入力７０１におけ
るＮの入力信号値７０２および選択ライン７０８上のＮ
の選択値を受信しかつメモリ７０６によって駆動される
選択ライン７０８が０または１にセットされているか否
かに従って入力信号７０２からのＮまでの値を出力７１
０に転送することが可能である。マルチプレクサ７０５
の動作は、入力７０２、メモリ７０７によって駆動され
る選択ライン７０９および出力７１１に関して同様であ
るが、例外としてマルチプレクサ７０５は入力７０１に
おける入力ベクトル信号７０２の値をコードブックベク
トル値が−１である場合に指数ｋ，ｎに対し出力７１１
に転送し、一方マルチプレクサ７０４は入力ベクトル値
７０２をコードブックベクトル値が＋１である場合に指
数ｋ，ｎに対し出力７１０に受け渡す。Multiplexer 704 receives N input signal values 702 at input 701 and N on select line 708.
71 of the input signals 702 to N depending on whether the select line 708 driven by the memory 706 is set to 0 or 1
It is possible to transfer to 0. Multiplexer 705
Is similar for input 702, select line 709 driven by memory 707 and output 711, except that multiplexer 705 changes the value of input vector signal 702 at input 701 to −1 if the codebook vector value is −1. Output 711 for indices k and n
While the multiplexer 704 passes the input vector value 702 to the output 710 for the indices k, n if the codebook vector value is +1.

【０１３４】出力７１０および７１１は、それぞれ、ア
キュムレータ７１２，７１３に接続され、マルチプレク
サ７０４，７０５を介して転送される入力ベクトル信号
値７０２は一緒に加算されて、それぞれ、Ｑ^＋（ｋ）お
よびＱ⁻（ｋ）相関値に対応する出力７１６，７１７を
生成する。出力７１６，７１７はコンバイナ７２０で組
合わされて７２１に相関出力値Ｑ（ｋ）を生成する。こ
の例のように、コードブック１８０が３進コードブック
である場合は、コンバイナ７２０はアキュムレータ７１
２，７１３からの出力７１６，７１７の差を取り出力７
２１、すなわち、Ｑ（ｋ）＝Ｑ^＋（ｋ）−Ｑ⁻（ｋ）を
発生する。これはマルチプレクサ７０５、メモリ７０７
およびアキュムレータ７１３によって行われる演算はコ
ードブック１８０、たとえば、図１０を参照、の−１の
値に対応することを考慮に入れている。コンバイナ７２
０はこの特定の実施例においては、減算を行うが、当業
者は本明細書の記載に基づき同じ結果を多くの他の手段
によって得ることが可能なことを理解するであろう。た
とえば、これは限定的な意図ではないが、同じ出力７２
１はマルチプレクサ７０５またはアキュムレータ７１３
の出力を反転しかつコンバイナ７２０を加算器とするこ
とによって得ることができる。Outputs 710 and 711 are connected to accumulators 712 and 713, respectively, and input vector signal values 702 transferred through multiplexers 704 and 705 are added together to produce Q ⁺ (k) and Q ⁺ , respectively. ^- (K) Generate outputs 716,717 corresponding to the correlation values. The outputs 716 and 717 are combined in combiner 720 to produce a correlated output value Q (k) at 721. If the codebook 180 is a ternary codebook, as in this example, then the combiner 720 will be the accumulator 71.
The difference between outputs 716 and 717 from 2, 713 is output 7
21, ^{i.e., Q (k) = Q +} (k) -Q - generating a (k). This is multiplexer 705, memory 707.
Taking into account that the operations performed by and accumulator 713 correspond to a value of -1 in codebook 180, eg, see FIG. Combiner 72
0 performs subtraction in this particular embodiment, but those skilled in the art will understand that the same result can be obtained by many other means based on the description herein. For example, this is not meant to be limiting, but the same output 72
1 is a multiplexer 705 or an accumulator 713
Can be obtained by inverting the output of and combining combiner 720 as an adder.

【０１３５】図７の相関発生器７００は、たとえば、図
３および図４の相関発生器５２０または５２０′に対応
し、かつ相関発生器７００の出力７２１は図３の出力５
２１または図４の出力５５１′に対応するが、どのよう
な特定の入力信号ベクトルが処理されるかに応じて、適
応コードブックベクトルＣ_ｋ（ｎ）よりはむしろ確率的
コードブックベクトルＳ_ｋ（ｎ）に対するものでありか
つＸ（ｎ）よりもむしろ目標音声信号Ｙ（ｎ）に対する
ものである。Correlation generator 700 of FIG. 7 corresponds, for example, to correlation generator 520 or 520 'of FIGS. 3 and 4, and output 721 of correlation generator 700 is output 5 of FIG.
21 or output 551 'of FIG. 4, but depending on what particular input signal vector is processed, the stochastic codebook vector S _k (rather than the adaptive codebook vector C _k (n). n) and for the target audio signal Y (n) rather than X (n).

【０１３６】本発明のさらに別の実施例を前記式
（６″）および図６、図９、図１０を参照して説明す
る。式（６″）を図６のコードブック１８０′に適用す
ることにより以下の表Ｉに示されるように、ｎ＝１〜２
０とし、Ｗ′（ｎ）の値に対し相関値Ｑ（１）からＱ
（８）が得られる。表Ｉ Q(1)=+W ′(04)-W′(05)-W′(09)+W′(14)-W′(18)+W′(19) Q(2)=+W ′(02)-W′(03)-W′(07)+W′(12)-W′(16)+W′(17) Q(3)=-W ′(01)-W′(05)+W′(10)-W′(14)+W′(15)+W′(20) Q(4)=-W ′(03)+W′(08)-W′(12)+W′(13)+W′(18)-W′(19) Q(5)=-W ′(01)+W′(06)-W′(10)+W′(11)+W′(16)-W′(17) Q(6)=+W ′(04)-W′(08)+W′(09)+W′(14)-W′(15)-W′(19) Q(7)=+W ′(02)-W′(06)+W′(07)+W′(12)-W′(13)-W′(17) Q(8)=-W ′(04)+W′(05)+W′(10)-W′(11)-W′(15)+W′(20) 表Ｉのアレイは再配列して、以下の表ＩＩに示されるよ
うにＱ（ｋ）＝［Ｑ^＋（ｋ）］−［Ｑ⁻（ｋ）］として
相関値を表すように＋１のコードブック値に対応する項
と−１のコードブック値に対応する項をグループ分けす
ることができる。Another embodiment of the present invention will be described with reference to the above equation (6 ″) and FIGS. 6, 9 and 10. The equation (6 ″) is applied to the codebook 180 ′ of FIG. Thus, as shown in Table I below, n = 1-2
0 and the correlation values Q (1) to Q (w) for the value of W '(n)
(8) is obtained. Table I Q (1) = + W '(04) -W' (05) -W '(09) + W' (14) -W '(18) + W' (19) Q (2) = + W ′ (02) -W ′ (03) -W ′ (07) + W ′ (12) -W ′ (16) + W ′ (17) Q (3) =-W ′ (01) -W ′ (05 ) + W ′ (10) -W ′ (14) + W ′ (15) + W ′ (20) Q (4) =-W ′ (03) + W ′ (08) -W ′ (12) + W ′ (13) + W ′ (18) -W ′ (19) Q (5) =-W ′ (01) + W ′ (06) -W ′ (10) + W ′ (11) + W ′ (16 ) -W ′ (17) Q (6) = + W ′ (04) -W ′ (08) + W ′ (09) + W ′ (14) -W ′ (15) -W ′ (19) Q ( 7) = + W ′ (02) -W ′ (06) + W ′ (07) + W ′ (12) -W ′ (13) -W ′ (17) Q (8) =-W ′ (04) + W '(05) + W' (10) -W '(11) -W' (15) + W '(20) The array of Table I was rearranged to produce the Q array as shown in Table II below. ^{(k) = [Q + (} k)] - is - ^{[(k) Q]} grouped term corresponding to the term and codebook values of -1, corresponding to the codebook value of +1 to represent a correlation value as be able to.

【０１３７】表ＩＩ Q(1)=[W ′(04)+W′(14)+W′(19)]-[W′(05)+W′(09)+W′(18)] Q(2)=[W ′(02)+W′(12)+W′(17)]-[W′(03)+W′(07)+W′(16)] Q(3)=[W ′(10)+W′(15)+W′(20)]-[W′(01)+W′(05)+W′(14)] Q(4)=[W ′(08)+W′(13)+W′(18)]-[W′(03)+W′(12)+W′(19)] Q(5)=[W ′(06)+W′(11)+W′(16)]-[W′(01)+W′(10)+W′(17)] Q(6)=[W ′(04)+W′(09)+W′(14)]-[W′(08)+W′(15)+W′(19)] Q(7)=[W ′(02)+W′(07)+W′(12)]-[W′(06)+W′(13)+W′(17)] Q(8)=[W ′(05)+W′(10)+W′(20)]-[W′(04)+W′(11)+W′(15)] 表ＩＩの最も左の括弧内に示された値はアキュムレータ
７１２への入力に対応し、かつ表ＩＩの最も右側の括弧
内の値はアキュムレータ７１３への入力に対応する。Table II Q (1) = [W ′ (04) + W ′ (14) + W ′ (19)]-[W ′ (05) + W ′ (09) + W ′ (18)] Q (2) = [W ′ (02) + W ′ (12) + W ′ (17)]-[W ′ (03) + W ′ (07) + W ′ (16)] Q (3) = [W ′ (10) + W ′ (15) + W ′ (20)]-[W ′ (01) + W ′ (05) + W ′ (14)] Q (4) = [W ′ (08) + W ′ (13) + W ′ (18)]-[W ′ (03) + W ′ (12) + W ′ (19)] Q (5) = [W ′ (06) + W ′ (11) + W ′ (16)]-[W ′ (01) + W ′ (10) + W ′ (17)] Q (6) = [W ′ (04) + W ′ (09) + W ′ (14)]- [W ′ (08) + W ′ (15) + W ′ (19)] Q (7) = [W ′ (02) + W ′ (07) + W ′ (12)]-[W ′ (06) + W ′ (13) + W ′ (17)] Q (8) = [W ′ (05) + W ′ (10) + W ′ (20)]-[W ′ (04) + W ′ (11) + W '(15)] The value shown in the leftmost bracket of Table II corresponds to the input to the accumulator 712, and the value in the rightmost bracket of Table II corresponds to the input to the accumulator 713. .

【０１３８】図６のコートブック１８０′を参照する
と、該コードブックがまばらに満たされている、すなわ
ち、大部分のエントリが０であることが明らかである。
さらに、表ＩおよびＩＩを参照すると、連続するベクト
ルのオーバラップ特性は相関値Ｑ（ｋ）を得るために加
算されるＷ′（ｎ）の値の指数において反映されている
ことが明らかである。従って、本コードブックの構造は
表ＩおよびＩＩに示された和を発生するより経済的な方
法に適している。これらについては以下に説明する。With reference to courtbook 180 'of FIG. 6, it is clear that the codebook is sparsely filled, that is, most entries are zero.
Further, referring to Tables I and II, it is clear that the overlap characteristics of successive vectors are reflected in the exponent of the value of W '(n) added to obtain the correlation value Q (k). . Therefore, the structure of this codebook is suitable for a more economical way of generating the sums shown in Tables I and II. These will be described below.

【０１３９】すべてのコードブック値を記憶するよりも
むしろ、ｋの各々の値に対する非ゼロエントリの指数
（すなわち、ｎの値）のみを記憶することができる。こ
れはＱ ^＋（ｋ）およびＱ⁻（ｋ）の値に対し別個に行う
と都合がよいが、しかしこれは必須のことではない。ｋ
の各々の値に対する相関値Ｑ^＋（ｋ）およびＱ⁻（ｋ）
は単にｋの各値に対しｎの記憶された値に対応するＷ′
（ｎ）を加算することにより、すなわち、表ＩまたはＩ
Ｉに示される和を実行することによって得られる。Than storing all codebook values
Rather, a non-zero entry exponent for each value of k
Only (ie, the value of n) can be stored. This
This is Q ⁺(K) and Q⁻Perform separately for the value of (k)
But this is not mandatory. k
Correlation value Q for each value of⁺(K) and Q⁻(K)
Is simply W ′ corresponding to the stored value of n for each value of k.
By adding (n), i.e. Table I or I
It is obtained by performing the sum shown in I.

【０１４０】計算機的なおよび／またはアドレス記憶の
要求はコードブックエントリのオーバラップ特性を考慮
に入れる再帰的計算方法を用いることによってさらに低
減できかつより迅速な動作が得られる。好ましい、この
手法により、ｋ＝１に対するコードブックエントリの指
数値ｎを記憶しかつベクトルｋ＝２，ｋ＝３，その他に
対する指数ｎをコードブックのオーバラップに基づきｋ
＝１の指数値から計算する。各ベクトルの端部に加えら
れるいずれかの新しいコードブックのエントリの指数も
また考慮される。Computational and / or address storage requirements can be further reduced and faster operation can be obtained by using a recursive calculation method that takes into account the overlapping properties of codebook entries. According to the preferred method, the exponent value n of the codebook entry for k = 1 is stored and the exponent n for the vectors k = 2, k = 3, etc. is k based on the codebook overlap.
Calculate from the index value of = 1. The exponents of any new codebook entries added to the end of each vector are also considered.

【０１４１】たとえば、図９の＋１のエントリおよび表
ＩＩのＱ^＋（ｋ）部分（すなわち、最も左の括弧内の
量）の場合には、ｎ＝４，１４，１９およびコードブッ
クのオーバラップを記憶し、この場合Ｎ−２であり（す
なわち、Δｋ＝＋１，Δｎ＝−２）かつ以下のように対
応するＷ′（ｎ）の値から生ずるＱ（ｋ）への寄与分を
計算する。すなわち、ｋ＝１，ｎ＝４の指数が最初に評
価されかつ以下の項のＱ^＋１（１）およびＱ^＋（２）の
値に寄与する。表ＩＩＩＱ^＋（１）＝Ｗ′（０４）Ｑ^＋（２）＝Ｗ′（０２）指数ｋ＝２，ｎ＝２に対するＱ（２）項Ｗ′（０２）は
コードブックのオーバラップ（Δｋ＝＋１，Δｎ＝−
２）を第１の指数ｋ＝１，ｎ＝４に適用することにより
決定される。For example, for the +1 entry of FIG. 9 and the Q ⁺ (k) portion of Table II (ie, the leftmost parenthesized quantity), n = 4, 14, 19 and codebook overlap. And compute the contribution to Q (k) resulting from the value of W ′ (n), which in this case is N−2 (ie Δk = + 1, Δn = −2) and is: . That is, the indices of k = 1, n = 4 are evaluated first and contribute to the values of Q ⁺ 1 (1) and Q ⁺ (2) in the following terms. Table III Q ⁺ (1) = W '(04) Q ⁺ (2) = W' (02) The Q (2) terms W '(02) for the indices k = 2, n = 2 are codebook overlaps ( Δk = + 1, Δn =-
2) is applied to the first index k = 1, n = 4.

【０１４２】次にｋ＝１，ｎ＝１４の指数が評価されか
つ付加的な項Ｗ′（１４），Ｗ′（１２），Ｗ′（１
０），Ｗ′（０８），Ｗ′（０６），Ｗ′（０４），
Ｗ′（０４）およびＷ′（０２）を与える。Ｗ′（１
４）を除くすべての項はコードブックのオーバラップを
開始指数ｋ＝１，ｎ＝１４に適用することにより決定さ
れる。この結果次のようになる。表ＩＶＱ^＋（１）＝Ｗ′（０４）＋Ｗ′（１４）Ｑ^＋（２）＝Ｗ′（０２）＋Ｗ′（１２）Ｑ^＋（３）＝Ｗ′（１０）Ｑ^＋（４）＝Ｗ′（０８）Ｑ^＋（５）＝Ｗ′（０６）Ｑ^＋（６）＝Ｗ′（０４）Ｑ^＋（７）＝Ｗ′（０２）The indices k = 1, n = 14 are then evaluated and the additional terms W '(14), W' (12), W '(1
0), W '(08), W' (06), W '(04),
Give W '(04) and W' (02). W '(1
All terms except 4) are determined by applying the codebook overlap to the starting indices k = 1, n = 14. This results in the following: Table IV Q ⁺ (1) = W '(04) + W' (14) Q ⁺ (2) = W '(02) + W' (12) Q ⁺ (3) = W '(10) Q ⁺ (4) = W '(08) Q ⁺ (5) = W' (06) Q ⁺ (6) = W '(04) Q ⁺ (7) = W' (02)

【０１４３】次にｋ＝１，ｎ＝１９の指数が評価されか
つ付加的な項Ｗ′（１９），Ｗ′（１７），Ｗ′（１
５），Ｗ′（１３），Ｗ′（１１），Ｗ′（０９），
Ｗ′（０７），Ｗ′（０５），Ｗ′（０３）およびＷ′
（０１）を与える。Ｗ′（１９）を除くすべての項はコ
ードブックのオーバラップを開始指数ｋ＝１，ｎ＝１９
に適用することにより検出される。この結果は次のよう
になるが、ここで高いベクトル番号に対しどのように寄
与が続くかを示すためにシーケンスがｋ＞８のベクトル
に対し拡張されている。表ＶＱ^＋（１）＝Ｗ′（０４）＋Ｗ′（１４）＋Ｗ′（１９）Ｑ^＋（２）＝Ｗ′（０２）＋Ｗ′（１２）＋Ｗ′（１７）Ｑ^＋（３）＝Ｗ′（１０）＋Ｗ′（１５）Ｑ^＋（４）＝Ｗ′（０８）＋Ｗ′（１３）Ｑ^＋（５）＝Ｗ′（０６）＋Ｗ′（１１）Ｑ^＋（６）＝Ｗ′（０４）＋Ｗ′（０９）Ｑ^＋（７）＝Ｗ′（０２）＋Ｗ′（０７）Ｑ^＋（８）＝Ｗ′（０５）Ｑ^＋（９）＝Ｗ′（０３）Ｑ^＋（１０）＝Ｗ′（０１）これはｋ＝１に対する指数およびコードブックのオーバ
ラップに基づきそこから決定できるすべての値を使い尽
くす。ベクトルｋ＝１，ｋ＝２の終りには何らの付加的
な非ゼロ値も現れず、従って相関値Ｑ^＋（１），Ｑ
^＋（２）は今や完全である。The indices of k = 1, n = 19 are then evaluated and the additional terms W '(19), W' (17), W '(1
5), W '(13), W' (11), W '(09),
W '(07), W' (05), W '(03) and W'
Give (01). All terms except W '(19) start codebook overlap index k = 1, n = 19
Detected by applying to. The result is then where the sequence is expanded for vectors with k> 8 to show how the contribution continues for higher vector numbers. Table V Q ⁺ (1) = W '(04) + W' (14) + W '(19) Q ⁺ (2) = W' (02) + W '(12) + W' (17) Q ⁺ (3) = W '(10) + W' (15) Q ⁺ (4) = W '(08) + W' (13) Q ⁺ (5) = W '(06) + W' (11) Q ⁺ (6) = W ' (04) + W '(09) Q ⁺ (7) = W' (02) + W '(07) Q ⁺ (8) = W' (05) Q ⁺ (9) = W '(03) Q ⁺ (10 ) = W '(01) This exhausts all the values that can be determined from it based on the index and codebook overlap for k = 1. No additional non-zero values appear at the end of the vectors k = 1, k = 2, so the correlation values Q ⁺ (1), Q
⁺ (2) is now perfect.

【０１４４】含めるべき次の指数はｋ＝３，ｎ＝２０で
ありかつ付加的な項Ｗ′（２０），Ｗ′（１８），Ｗ′
（１６），Ｗ′（１４），Ｗ′（１２），Ｗ′（１
０），Ｗ′（０８），Ｗ′（０６），Ｗ′（０４）およ
びＷ′（０２）を与える。再び、Ｗ′（２０）を除く項
はコードブックのオーバラップを開始指数ｋ＝３，ｎ＝
２０に適用することによって識別される。結果は以下の
ようになり、ここでより高いベクトル番号に対しどのよ
うに前記寄与が続くかを示すためにｋ＞１０のベクトル
に対しシーケンスが拡張されている。表ＶＩＱ^＋（１）＝Ｗ′（０４）＋Ｗ′（１４）＋Ｗ′（１９）Ｑ^＋（２）＝Ｗ′（０２）＋Ｗ′（１２）＋Ｗ′（１７）Ｑ^＋（３）＝Ｗ′（１０）＋Ｗ′（１５）＋Ｗ′（２０）Ｑ^＋（４）＝Ｗ′（０８）＋Ｗ′（１３）＋Ｗ′（１８）Ｑ^＋（５）＝Ｗ′（０６）＋Ｗ′（１１）＋Ｗ′（１６）Ｑ^＋（６）＝Ｗ′（０４）＋Ｗ′（０９）＋Ｗ′（１４）Ｑ^＋（７）＝Ｗ′（０２）＋Ｗ′（０７）＋Ｗ′（１２）Ｑ^＋（８）＝Ｗ′（０５）＋Ｗ′（１０）Ｑ^＋（９）＝Ｗ′（０３）＋Ｗ′（０８）Ｑ^＋（１０）＝Ｗ′（０１）＋Ｗ′（０６）Ｑ^＋（１２）＝Ｗ′（０４）Ｑ^＋（１３）＝Ｗ′（０２）これはｋ＝０からｋ＝７までに対する指数およびコード
ブックのオーバラップに基づきそこから決定できるすべ
ての値を使い尽くす。ベクトルｋ＝１からｋ＝７の端部
には何らの付加的な非ゼロ値も現れておらず、従って相
関値Ｑ^＋（１）からＱ^＋（７）は今や完全である。The next indices to be included are k = 3, n = 20 and the additional terms W '(20), W' (18), W '.
(16), W '(14), W' (12), W '(1
0), W '(08), W' (06), W '(04) and W' (02). Again, all terms except W '(20) start codebook overlap index k = 3, n =
Identified by applying 20. The result is as follows, where the sequence is expanded for vectors with k> 10 to show how the contribution continues for higher vector numbers. Table VI Q ⁺ (1) = W '(04) + W' (14) + W '(19) Q ⁺ (2) = W' (02) + W '(12) + W' (17) Q ⁺ (3) = W '(10) + W' (15) + W '(20) Q ⁺ (4) = W' (08) + W '(13) + W' (18) Q ⁺ (5) = W '(06) + W' ( 11) + W '(16) Q ⁺ (6) = W' (04) + W '(09) + W' (14) Q ⁺ (7) = W '(02) + W' (07) + W '(12) Q ⁺ (8) = W '(05) + W' (10) Q ⁺ (9) = W '(03) + W' (08) Q ⁺ (10) = W '(01) + W' (06) Q ⁺ ( 12) = W '(04) Q ⁺ (13) = W' (02) This can be determined from the index and codebook overlap for k = 0 to k = 7. Use all values. No additional non-zero values appear at the ends of the vectors k = 1 to k = 7, so the correlation values Q ⁺ (1) to Q ⁺ (7) are now perfect.

【０１４５】上に述べたプロセスがコードブックにおけ
る非ゼロのエントリが使い尽くされかつすべてのＱ
^＋（ｋ）相関値が決定されるまで継続する。Ｑ⁻（ｋ）
の値のために使用されるプロセスは実質的に同じであ
る。３進コードブックをＱ^＋（ｋ）およびＱ⁻（ｋ）を
計算するために別個の部分に分けることはコードブック
のオーバラップを活用してＱ（ｋ）の相関値を計算する
ための上述のプロセスの間に個々のエントリの符号を考
慮に入れる必要性を避けることができるが、それは妨げ
られない。Ｑ（ｋ）は差分Ｑ（ｋ）＝Ｑ^＋（ｋ）−Ｑ⁻
（ｋ）によって検出される。最大の相関値Ｑ（ｊ）を有
する指数ｋ＝ｊのベクトルは技術上よく知られた手段を
用いて、ｋ＝１からｋ＝Ｋに対し（または、少なくとも
そのサブセットのいくつかに対し）Ｑ（ｋ）の値を比較
することにより識別される。上のようにして決定された
計算値は最適の確率的コードブックベクトル、すなわ
ち、音声を合成するために使用された場合、入力目標音
声と比較して最小のエラーを与える確率的コードブック
ベクトル、を識別するために前に述べた合成による分析
プロセスにおいて他の情報と共に使用される。コードブ
ック１８０からのこの最適の確率的コードブックベクト
ルは次に最終的に受信機において入力音声を再び再生す
るために使用される送信されるボコード（ＶＯＣＯＤ
Ｅ）を構築するために使用される。The process described above exhausts all non-zero entries in the codebook and all Q's.
⁺ (K) Continue until the correlation value is determined. Q ^- (k)
The process used for the value of is substantially the same. Splitting the ternary codebook into separate parts to compute Q ⁺ (k) and Q ⁻ (k) is described above for computing the correlation value of Q (k) by taking advantage of the codebook overlap. While avoiding the need to take into account the sign of individual entries during the process of, it is not disturbed. Q (k) is the difference ^{Q (k) = Q + (} k) -Q -
Detected by (k). The vector of index k = j with the largest correlation value Q (j) is Q for k = 1 to k = K (or at least for some of its subsets) using means well known in the art. It is identified by comparing the values of (k). The calculated value determined as above is the optimal stochastic codebook vector, i.e., the stochastic codebook vector which, when used to synthesize the speech, gives the least error compared to the input target speech, Used with other information in the synthetic analysis process described above to identify the. This optimal stochastic codebook vector from codebook 180 is then finally transmitted at the receiver using the transmitted vocode (VOCOD) which is used to replay the input speech.
E) is used to build.

【０１４６】より一般的に説明すると、ｎ＝１〜Ｎにお
よぶ指数ｎによって識別される値を有する第１のベクト
ルＶ（ｎ）、および第２のベクトルＳ_ｋ（ｎ）のセット
であって該第２のベクトルの各々は指数ｋによって識別
されかつ該第２のベクトルの各々はゼロまたは非ゼロで
ありかつｎ＝１〜Ｎに至る指数ｎによって識別されるＮ
までの値を有する前記第２のベクトルＳ_ｋ（ｎ）のセッ
トとの組合わせを使用する上に述べた音声を符号化する
プロセスは、Ｓ_ｋ（ｎ_ｉ）が非ゼロであるとし異なるｋ
に対するＳ_ｋ（ｎ）の指数ｎ_ｋ，ｉを識別する段階、指
数ｎ_ｋ，ｉに対応するＶ（ｎ）の値を加算して和Ｑ
（ｋ）を形成する段階、最大の値Ｑ（ｋ＝ｊ）に対応す
る値ｋ＝ｊを識別する段階、そしてＳ_ｋ＝ｊ（ｎ）を使
用して音声を合成する段階、を具備する。More generally described is a set of a first vector V (n) and a second vector S _k (n) having values identified by an index n ranging from n = 1 to N. Each of the second vectors is identified by an index k and each of the second vectors is zero or non-zero and identified by an index n ranging from n = 1 to N.
The process of encoding speech described above using a combination with said set of second vectors S _k (n) having values up to and including _k is different for S _k (n _i ) is non-zero.
For identifying the index n _{k, i} of S _k (n) with respect to, the value of V (n) corresponding to the index n _{k, i} is added and the sum Q
Forming (k), identifying the value k = j corresponding to the maximum value Q (k = j), and synthesizing the speech using S _{k = j} (n). .

【０１４７】さらに、第２のベクトルの前記セットはオ
ーバラップ量Δｋ，Δｎに従って先行する第２のベクト
ルをオーバラップすることにより決定されることが望ま
しく、この場合前記識別および加算段階は、ｎ_１，ｉか
らスタートしかつオーバラップ量Δｋ，Δｎを使用して
Ｓ_ｉ（ｎ_ｉ）が非ゼロであるとし、ｋ＝１に対しＳ
_ｋ（ｎ）の指数ｎ_１，ｉを識別する段階、Ｓ
_ｋ（ｎ_ｉ′）が非ゼロであるとしｋ＞１に対しさらに指
数ｎ_ｋ，ｉ′を決定する段階、そしてそのような指数お
よびさらに他の指数に対し前記Ｖ（ｎ）の値を加算し和
Ｑ（ｋ）を形成する段階を具備する。さらに、ｋ≧２に
対しＳ_ｋ（ｎ_ｉ″）が非ゼロであるとし前に識別されて
いない第１の指数ｎ_ｋ，ｉ″を識別し、かつ次に、指数
ｎ_ｋ，ｉ″からスタートしてＳ_ｋ（ｎ_ｉ″′）が非ゼロ
であるとしオーバラップ量を使用してｋ≧３に対するさ
らに他の指数ｎ_{ｋ，ｉ″′}を決定し、かつそのようなさ
らに他の指数に対しＶ（ｎ）の値を加算してさらに和Ｑ
（ｋ）を形成すると好都合である。Furthermore, said set of second vectors is preferably determined by overlapping the preceding second vectors according to the amount of overlap Δk, Δn, in which case said identifying and summing step is n _{1 , I} , and using the amount of overlap Δk, Δn, let S _i (n _i ) be non-zero, and for k = 1, S
identifying the indices n _{1, i} of _k (n), S
determining further exponents n _{k, i ′} for k> 1 given that _k (n _{i ′} ) is non-zero, and adding the value of V (n) to such exponents and further exponents And forming a sum Q (k). Furthermore, for k ≧ 2, we identify S _k (n _{i ″} ) as non-zero and identify a previously unidentified first index n _{k, i ″} , and then from index n _{k, i ″} Starting and assuming that S _k (n _{i ″ ′} ) is non-zero, the amount of overlap is used to determine yet another index n _{k, i ″ ′} for k ≧ 3, and such an additional index. The value of V (n) is added to
It is convenient to form (k).

【０１４８】以上述べた方法は汎用目的のコンピュータ
によって実施できるが、その場合は該コンピュータは、
Ｓ_ｋ（ｎ_ｉ）が非ゼロであるとし異なるｋに対しＳ
_ｋ（ｎ）の指数ｎ_ｋ，ｉを識別するための手段、指数ｎ
_ｋ，ｉに対応するＶ（ｎ）の値を加算して和Ｑ（ｋ）を
形成するための手段、最大の値Ｑ（ｋ＝ｊ）に対応する
値ｋ＝ｊを識別するための手段、そしてＳ_ｋ＝ｊ（ｎ）
を使用して音声を合成するための手段、を提供するよう
プログラムされるべきである。当業者はどのようにして
これを行うかを理解するであろう。The method described above can be implemented by a general purpose computer, in which case the computer
S _k (n _i ) is non-zero and S for different k
Means for identifying index n _{k, i} of _k (n), index n
Means for adding the values of V (n) corresponding to _{k, i} to form the sum Q (k), and means for identifying the value k = j corresponding to the maximum value Q (k = j) , And S _{k = j} (n)
Should be programmed to provide means for synthesizing speech using. Those of ordinary skill in the art will understand how to do this.

【０１４９】さらに、前記コンピュータはＳ_１（ｎ_ｉ）
が非ゼロであるとしＳ_ｋ（ｎ）の指数ｎ_１，ｉをｋ＝１
に対し識別するための手段、Ｓ_ｋ（ｎ_ｉ′）が非ゼロで
あるとし、ｎ_１，ｉからスタートしかつオーバラップ量
Δｋ，Δｎを使用してｋ＞１に対するさらに他の指数ｎ
_ｋ，ｉ′を決定するための手段、そしてそのような指数
およびさらに他の指数に対しＶ（ｎ）の値を加算して和
Ｑ（ｋ）を形成するための手段、を提供するようプログ
ラムされることが望ましい。さらに、前記識別し、決定
しかつ加算するための手段は、Ｓ_ｋ（ｎ_ｉ″）が非ゼロ
であるとし前に識別されなかった第１の指数ｎ_ｋ，ｉ″
をｋ≧２に対し識別するための手段、Ｓ_ｋ（ｎ_ｉ″′）
が非ゼロであるとし指数ｎ_ｋ，ｉ″からスタートしかつ
オーバラップ量を使用してｋ≧３に対するさらに他の指
数ｎ_{ｋ，ｉ″′}を決定するための手段、そしてそのよう
なさらに他の指数に対しＶ（ｎ）の値を加算してさらに
和Ｑ（ｋ）を形成するための手段、を具備することが望
ましい。Further, the computer is S ₁ (n _i )
Is non-zero, the index n _{1, i} of S _k (n) is k = 1.
Means for identifying, S _k (n _{i ′} ) is non-zero, starting from n _{1, i} and using the overlap amounts Δk, Δn, yet another index n for k> 1.
_A program for providing means for determining _{k, i '} , and means for adding such values of V (n) to such and other indices to form a sum Q (k). It is desirable to be done. Further, the means for identifying, determining and adding has a first index n _{k, i ″} not previously identified as S _k (n _{i ″} ) being non-zero.
Means for identifying to _{_{k ≧ 2, S k (n}} i "')
Means for determining yet another exponent n _{k, i ″ ′} for k ≧ 3, starting from the exponent n _{k, i ″,} where _k is non-zero, and using the amount of overlap, and such further It is desirable to include means for adding the value of V (n) to the index of to further form the sum Q (k).

【０１５０】[0150]

【発明の効果】上述の、ｎ＝１〜Ｎの値を有する第１の
ベクトルとｎ＝１〜Ｎの値を有するｋ＝１〜Ｋの第２の
ベクトルのセットとの積の和の等価物をコードブックの
まばらな非ゼロ値および該コードブックベクトルのオー
バラッピング特性を活用することによって提供する手段
および方法は、従来技術に比較して計算機的な負担を実
質的に低減しかつより迅速に達成することができ、さら
に従来技術によって必要とされるものよりかなり少ない
計算機的な資源によって達成できるようにする。上に述
べたプロセスは、本明細書に記載しかつ表Ｉ−ＶＩに図
示した手順を実行するようプログラムされた、汎用目的
のコンピュータまたは特別の目的のコンピュータによっ
て達成されている。当業者は本明細書の記載に基づきか
つ技術上よく知られた手段を使用して上述のステップを
達成するためにどのようにコンピュータをプログラムす
るかを理解するであろう。EFFECT OF THE INVENTION Equivalence of the sum of the above-mentioned products of the first vector having the values of n = 1 to N and the second set of k = 1 to K having the values of n = 1 to N. Means and methods for providing an object by exploiting the sparse non-zero values of a codebook and the overlapping properties of the codebook vector are substantially less computationally intensive and faster than prior art techniques. And can be achieved with significantly less computational resources than required by the prior art. The processes described above are accomplished by a general purpose or special purpose computer programmed to perform the procedures described herein and illustrated in Tables I-VI. Those skilled in the art will understand how to program a computer based on the description herein and using means well known in the art to accomplish the above steps.

【０１５１】当業者には本明細書の記載に基づき上に述
べた手段および方法が、確率的コードブックベクトルの
どれが目標音声との最善の整合を提供するかを決定する
ことに関連する相互相関プロセスのために通常必要とさ
れる乗算ステップと同じ効果を生ずることは明らかであ
る。乗算操作を消去することにより、相関操作がより高
速になりかつベクトル値の必要な操作の数が低減され
る。これらの利点は極めて好都合なものである。Those skilled in the art will appreciate that the means and methods described above based on the description herein are related to determining which of the stochastic codebook vectors provides the best match with the target speech. It is clear that it has the same effect as the multiplication step normally required for the correlation process. Eliminating the multiply operations makes the correlation operations faster and reduces the number of vector-valued operations required. These advantages are extremely advantageous.

【０１５２】最後に、本発明の上に述べた実施例は例示
のためのみのものであると考えている。数多くの別の実
施例も本発明の精神および範囲から離れることなく考案
できる。Finally, the above-described embodiments of the present invention are considered to be exemplary only. Many alternative embodiments can be devised without departing from the spirit and scope of this invention.

[Brief description of drawings]

【図１】ＣＥＬＰボコーダシステムを一般的な形式で示
す単純化されたブロック図である。FIG. 1 is a simplified block diagram of a CELP vocoder system in general form.

【図２】本発明の好ましい実施例によるＣＥＬＰコーダ
を示す単純化されたブロック図である。FIG. 2 is a simplified block diagram showing a CELP coder according to a preferred embodiment of the present invention.

【図３】第１の実施例による、図２の（ｂ）のコーダの
１部を詳細に示すブロック図である。FIG. 3 is a block diagram showing in detail a part of the coder of FIG. 2 (b) according to the first embodiment.

【図４】本発明の好ましい実施例による、図２の（ｂ）
のコーダの１部を非常に詳細に示すブロック図である。FIG. 4 (b) of FIG. 2 according to a preferred embodiment of the present invention.
FIG. 3 is a block diagram showing in greater detail a portion of the coder of FIG.

【図５】本発明の好ましい実施例による適応コードブッ
クベクトルの自己相関係数を提供するための装置を示す
ブロック図である。FIG. 5 is a block diagram illustrating an apparatus for providing an autocorrelation coefficient of an adaptive codebook vector according to a preferred embodiment of the present invention.

【図６】ＣＥＬＰコーディングのために使用される形式
の小さな確率的コードブックの内容を示す説明図であ
る。FIG. 6 is an illustration showing the contents of a small probabilistic codebook of the type used for CELP coding.

【図７】本発明に係わる相互相関機能を説明する単純化
されたブロック図である。FIG. 7 is a simplified block diagram illustrating the cross-correlation function of the present invention.

【図８】図７において使用されているマルチプレクサの
さらに詳細を示すブロック回路図である。8 is a block circuit diagram showing further details of the multiplexer used in FIG. 7. FIG.

【図９】そのエントリが図６のコードブックの非ゼロエ
ントリに対応する第１のメモリ手段の内容を示す説明図
である。9 is an illustration showing the contents of a first memory means whose entry corresponds to a non-zero entry in the codebook of FIG.

【図１０】そのエントリが図６のコードブックの非ゼロ
エントリに対応する第２のメモリ手段の内容を示す説明
図である。10 is an illustration showing the contents of a second memory means whose entry corresponds to a non-zero entry in the codebook of FIG.

[Explanation of symbols]

１００ＣＥＬＰコーダ１０６送信経路または送信パス３００ＣＥＬＰデコーダ１１０バンドパスフィルタ１１２Ａ／Ｄ変換器１１４フレーマ１１６フレームメモリ１２２ＬＰＣアナライザ１２５コーダ１３０帯域幅拡張重み付け発生器１４２，１６５スペクトルメモリ減算器１４５，１７０スペクトルインバースフィルタＡ
（ｚ）１５０，１７５カスケード重み付けフィルタ１／Ａ
（ｚ／ｒ）１６２ピッチメモリ減算器１５５適応コードブック１８０確率的コードブック１８２チャネルデコーダ１９０長時間遅延ピッチ予測器１９５短時間遅延ベクトル予測器１／Ａ（ｚ）１９７ゲイン乗算器２００スペクトルインバースフィルタＡ（ｚ）２０５キャッシュ重み付けフィルタ１／Ａ（ｚ／ｒ）２１０チャネルコーダ２２０適応コードブックサーチャ２２５確率的コードブックサーチャ100 CELP coder 106 Transmission path or transmission path 300 CELP decoder 110 Bandpass filter 112 A / D converter 114 Framer 116 Frame memory 122 LPC analyzer 125 Coder 130 Bandwidth extension weight generator 142,165 Spectrum memory subtractor 145,170 Spectrum Inverse filter A
(Z) 150,175 Cascade weighting filter 1 / A
(Z / r) 162 Pitch memory subtractor 155 Adaptive codebook 180 Stochastic codebook 182 Channel decoder 190 Long delay pitch predictor 195 Short delay vector predictor 1 / A (z) 197 Gain multiplier 200 Spectral inverse filter A (z) 205 Cache weighting filter 1 / A (z / r) 210 Channel coder 220 Adaptive codebook searcher 225 Probabilistic codebook searcher

【手続補正書】[Procedure amendment]

【提出日】平成５年１０月２２日[Submission date] October 22, 1993

【手続補正１】[Procedure Amendment 1]

【補正対象書類名】図面[Document name to be corrected] Drawing

【補正対象項目名】全図[Correction target item name] All drawings

【補正方法】変更[Correction method] Change

【補正内容】[Correction content]

【図１】 [Figure 1]

【図３】 [Figure 3]

【図２】 [Fig. 2]

【図４】 [Figure 4]

【図６】 [Figure 6]

【図９】 [Figure 9]

【図１０】 [Figure 10]

【図５】 [Figure 5]

【図７】 [Figure 7]

【図８】 [Figure 8]

───────────────────────────────────────────────────── フロントページの続き (31)優先権主張番号７２２，５７２ (32)優先日 1991年６月27日 (33)優先権主張国米国（ＵＳ） (72)発明者デビッド・エルバロンアメリカ合衆国アリゾナ州85251、スコッツデイル、イースト・トーマス 8449 ─────────────────────────────────────────────────── ─── Continuation of front page (31) Priority claim number 722, 572 (32) Priority date June 27, 1991 (33) Priority claiming country United States (US) (72) Inventor David El Baron United States Arizona State 85251, Scottsdale, East Thomas 8449

Claims

[Claims]

1. An optimal codebook vector C _{k = j} (n) for coding a speech frame with N consecutive samples n = 1 to n = N of the input analog speech to best synthesize said speech frame. Device for determining
0,220 ') and K possible perceptually weighted drive vectors C
_An adaptive codebook (155) containing _k (n),
Where k is an integer exponent ranging from 1 to K for identifying the vector, and n = 1 to n = N is an integer exponent identifying consecutive audio samples in the audio frame. , One or more LPC filters (190, 19) for synthesizing a trial copy of the speech frame when driven by the codebook vector C _k (n)
5,200,205,170,145,175,15
0), and the LPC filters (190, 195, 2
00, 205, 170, 145, 175, 150) having an impulse response H (n), a codebook for determining an optimal codebook vector C _{k = j} (n) that best synthesizes the speech frame. One or more LPs according to the vector C _k (n)
C filter (190, 195, 200, 205, 17
0, 145, 175, 150), and the residual X of the target speech perceptually weighted for comparison with the result.
Means (150) for generating (n), for convolution of X (n) with H (n) for each value of n once per frame for transmission to the cross-correlator (520 '). Means (510 ') for producing a convolved output (512'), cross-correlate the convolved output (512 ') with C _k (n) for each value of n and k, and output (52
1 ') is connected to a dividing means (530') and is a squarer (5)
25 ') and the cross-correlated output (55
1 ') for producing (1') an autocorrelation of C _k (n) for each value of n and k for transmission to a multiplier means (543 '). Means (56) for providing a correlation output (561 ')
0 '), H (n) are autocorrelated to a second autocorrelated output (552', 552 ', for transmission to said multiplier means (543').
Means (550 ') for generating an output product for transmission to the adder (540') by multiplying the first (561 ') and second (552') autocorrelation outputs. Means (54 ') for generating (545')
3 '), the product is added to each value of k and a divider (53
0 ') means (540') for producing an added output (541 '), a squared cross-correlation output (521') and an added output of the adder (541). Dividing means (53) for detecting the ratio of ′ ′ and transmitting it to the selector means (570 ′).
0 '), and for transmission to the channel coder (210), selects the value C _{k = j} (n) of C _k (n) that produces the output of maximum amplitude from said dividing means (530'). An apparatus for encoding a speech frame, characterized in that it comprises means (570 ') for

2. Encoding a speech frame comprising N consecutive samples of the input analog speech, k being an integer index from 1 to K, and n being consecutive speech samples n = 1 in said speech frame. , ..., n = N is another integer index that identifies the speech frame using an adaptive codebook (155) containing perceptually weighted drive vectors C _k (n) of K targets. A method for determining an optimal codebook vector C _{k = j} (n) to synthesize best, the method comprising synthesizing a trial copy of the speech frame when driven by the codebook C _k (n) vector. One or more LPC filters (190, 19)
5,200,205,170,145,175,15
0), said one or more LPC filters (190, 195, 200, 205,
170, 145, 175, 150) are impulse responses H
With (n), one or more LPC filters (190, 195, 200, 2) depending on the codebook vector C _k (n)
05,170,145,175,150) to provide a perceptually weighted target speech residual X (n) for comparison with the result, H for each value of n once for each frame. Convolving X (n) with (n) to produce a convolved output W (n) for transmission to the cross-correlator (520 '), for each value of n and k C _k (n ), Cross-correlating the output W (n) convolved with the output to produce a cross-correlated output for transmission to a squarer (525 ') coupled to a divider (530'). autocorrelating C _k (n) for each value of n and k to provide a first autocorrelation output U _k (m) for transmission to a multiplier (543 ′), In this case m is m
A dummy exponent ranging from = 0 to m = N−1, a second autocorrelated output Φ (m) for transmission to the multiplier (543 ′) by autocorrelating H (n). In which m is a dummy exponent from m = 0 to m = N−1, wherein the multiplier (543 ′) multiplies the first and second autocorrelation outputs. Generating an output product (545 ') for transmission to an adder (540'), adding the product to each value of k and adding a divider (53).
0 ') to produce a summed output (541') for transmission to the peak selector (570 ') and the squared cross-correlation output (521') for addition to the adder The step of performing a division to obtain the ratio of the outputs (541 '), and C _k (n) producing the maximum magnitude output from the divider (530') for transmission to the channel coder (210). ) Value C _{k = j} (n) is selected in the peak selector.

3. A method for providing CELP coding for digitized input speech frames based on the use of a codebook (155) containing K vectors each having N entries, the method comprising: Autocorrelating the codebook vector for the first P (P << N) to determine a first autocorrelation value (561 ') for it, using K codebook vectors and said first autocorrelation value Evaluating the codebook vector of K by generating a synthesized speech and comparing the result with the input speech, which S (S << K) of the K codebook vectors is compared with the input speech. Determining whether to provide synthetic speech with a smaller error than the remaining vector of K-S evaluated by Autocorrelating a codebook vector for S (P <R ≦ N) of K vectors for a bird to provide a second autocorrelation value for it, using the second autocorrelation value Re-evaluating the S of the K vectors and comparing with the input speech to identify which of the S codebook vectors provides the minimum error, and the codebook giving the minimum error Forming a CELP code for the speech frame using a vector identifier.

4. The P, S and R are 5 ≦ P ≦ 10,
The method according to claim 3, wherein 1 ≦ S ≦ 7 and R = N or N−1.

5. The frame of digitized input speech designated X (n) comprises n = 1 to n = N consecutive samples of the input analog speech, and the codebook is K.
Target perceptually weighted drive vector C
_k (n), where k is an integer exponent ranging from 1 to K, and n is another integer exponent identifying consecutive speech samples n = 1 to n = N in the speech frame. , And C _{k = j} (n) represents the optimal codebook vector that best synthesizes the target speech frame X (n), and the first autocorrelation step is Autocorrelating with respect to m = 0 to m = P, where P << N is the codebook vector C _k (n), and said evaluating step comprises: U detected from said equation in a codebook searcher. comprising recursively evaluating all K vectors C _k (n) using the value of P of _k (P) to determine a mean square error probability, and said determining step comprising S << K, and selecting S of the K vectors C _k (n) that provide the closest match to the target speech X (n), and the second autocorrelation and reevaluation step comprises: In the codebook searcher all m = 0 to determine U _k (m) in the above equation
Recursively re-evaluate S of the K vectors chosen above using ~ m = N-1, thereby yielding the j-th value C
_{k = j} (n) and selecting the corresponding gain index _{Gk = j} that gives the best fit to the target speech X (n), and said forming step for transmitting to the CELP combiner. The method of claim 3, comprising transmitting C _{k = j} (n) and G _{k = j} to a channel coder.

6. An apparatus (100, 220, 220 ', 600) for CELP coding of speech using vector autocorrelation coefficients of an adaptive codebook (155), the analysis initially comprising a length. Utilizing a subset of samples M in association with L> M speech analysis frames, the apparatus is k = 1 and m is the autocorrelation delay index and n is the index of consecutive samples in the codebook vector. And the autocorrelation coefficient U _{k ′} (m) of the first vector C _k (n) of length M is given by the first equation, for m = 0 to T <M, Means (610) for determining according to k ≧ 2 and incrementally the autocorrelation coefficient U _k (m) of the remaining codebook vectors (M +) for m = 0 to T <M.
Up to k-1) = L, the second equation, _U'k (m) = [ _U'k-1 (m) + _Ck (M + k-
1) Means (624, 626, 62) for determining according to C _k (M + k-1 + m)]
8, 630), scaling the result of the first equation according to the following equation: U ₁ (m) = (L / M) U ′ ₁ (m) and the result of the second equation: , U _k (m) = {L / (M + k−1)} U ′ _k (m), and these scalings are m =
Means (632) for producing a result for each m and each k, for which 0 to T <M, and which codebook vector using said result is the smallest compared to the input speech. An apparatus for CELP coding of speech, characterized in that it comprises means (220, 220 ') for assessing whether to provide an error.

7. An adaptive codebook of vector length N (15)
CEL of speech using vector autocorrelation coefficient of 5)
A method for P coding, wherein the analysis initially uses a subset of samples of M <N having speech analysis frames of length L, said method being: k = 1 and m is an autocorrelation delay exponent. And n is the index of consecutive samples in the codebook vector, and for m = 0 to T <M, Calculating the autocorrelation coefficient U _k (m) of the first vector C _k (n) of length M according to U ₁ (m) = (L / M) U ′ ₁ (m), k ≧ 2 Then, for m = 0 to T <M, the following equation: U ′ _k (m) = [U ′ _k−1 (m) + C _k (M + k−
1) C _k (M + k−1 + m)] U _k (m) = {L / (M + k−1)} U ′ _k (m) according to the autocorrelation coefficient U of the remaining codebook vector
calculating _k (m), repeating the second calculation until (M + k−1) = L, and using the autocorrelation coefficient determined as described above, the codebook vector C _k Determining which of (n) produces the smallest error when compared to the input speech. A method for CELP coding of speech.

8. The method according to claim 6 or 7, wherein T and M have a relationship of T = M-1.

9. A method for CELP coding of speech using the autocorrelation coefficients of a vector of an adaptive codebook (155) identified by an index k, the analysis by synthesis being initially M <L. A codebook value is used, where L is the length of the speech analysis frame and m is an index from 0 to M-1 that describes the autocorrelation delay, the method is as follows: n is the codebook vector value N is the exponent of
Calculating the autocorrelation coefficient of m = 0 to M−1 of the first codebook vector k having values of = 1 to M therein, m = 0 to M−1 in the temporary memory (630). Storing the calculated autocorrelation coefficient of the multiplicative factor L /
Scale by M and output the result (634)
To n = M + j, where the codebook value is n = M +
multiplying by a codebook value falling from j to n = 1 and adding the product to the autocorrelation coefficients from said temporary store (630) for m = 0 to M-1, respectively, to produce a result. Replacing the autocorrelation coefficient in the temporary storage device (630) with the result, scaling the coefficient in the temporary storage device (630) by a multiplication factor L / (M + j) and outputting the result to the output (634). Transferring, repeating k, k = (L + 1−M) and j = 2 to j = k−1 for the multiplication, replacement, scaling and transfer steps, and using the autocorrelation coefficient transferred to the output And which codebook vector has better voice C
A method for CELP encoding of speech, comprising: determining whether to provide ELP encoding.

10. Apparatus (100,7) for performing CELP coding of speech by combining a first vector with a second set of vectors identified by an index k.
00) and the first and second vectors are n =
Having a value identified by an index n ranging from 1 to N, the device comprising: n = 1 to N outputs, n = 1 to N first inputs, second inputs, and n = 1 to N N × N having selection means of
Multiplexer (704) for providing a first logic level to the nth selecting means for coupling the nth output to the nth first input and to the nth selecting means. A second logic level provided is for coupling the nth output to the second input, n = 1 to N values of the first vector being n = of the first multiplexer (704). 1-N first input (70
1) means for supplying n = 1 to N values of the second vector of index k = 1 to n = 1 to N selection means of the first multiplexer (704) (708) 706), the second vector providing the first logic level for some values of n in the selection means of n = 1 to N and An output (710) of the first multiplexer (704) coupled to an output (710) of the first multiplexer for providing a second logic level for other values.
First accumulator means (712) for adding together the values of said first vector to provide a first sum (716), indexing k from k = 1 to k = K Means (714), and means (228) for synthesizing the speech based on which sum identifies the second vector that gives the closest match to the target speech. For CELP encoding of.

11. The second vector has values 0, + 1,-
Said means for providing a value of n = 1 to N of said second vector having two parts, namely an entry corresponding to the location of the value of 0, + 1 of said second vector. A first portion (706) having a value of 0,1 and a second portion (707) having a value of 0,1 corresponding to a location of a value 0, -1 of the second vector,
And said providing means provides its first part to said selecting means (708) of said first multiplexer (704), and said device further comprises: n = 1 to N outputs, n = 1 to N A second N × N having a first input, a second input, and n = 1-N selection means.
Multiplexer (705) for providing a first logic level to the nth selecting means for coupling the nth output to the nth first input and to the nth selecting means. A second logic level provided for coupling the nth output to the second input and a second of the providing means.
Is coupled to the selection means of the second multiplexer, the value n = 1 to N of the first vector to the first input n = 1 to N of the second multiplexer. Second means for supplying (701), together adding together the values of the first vector coupled to the output of the second multiplexer and transferred to the output of the second multiplexer to obtain a second sum ( Second accumulator means (713) for providing 717), and said first
Means for combining the (716) and second (717) sums to produce a result (721) that is used to determine which input vector (702) gives the closest match to the target speech (721). 720), The apparatus according to claim 10, further comprising:

12. A first vector V (n) having values identified by an index n ranging from n = 1 to N, and a second vector.
For the CELP coding of speech using a combination of a set of vectors S _k (n) of each of the second vectors, each of the second vectors being identified by an index k and each of the second vectors being Zero or non-zero and n = 1 to
Having values up to N identified by an index n leading to N,
The apparatus is means (704, 706) for identifying an index n _{k, i} of S _k (n) for different k, said S _k (n
_i ) is non-zero, means (712, 721) for adding the values of said V (n) corresponding to the exponent n _{k, i} to form sum Q (k), maximum value Q ( means (714) for identifying the value k = j corresponding to k = j) and means (228) for synthesizing the speech using S _{k = j} (n). A device for CELP encoding of speech.

13. Continuing vectors of said set of said second vectors are determined by the overlap of the preceding second vectors according to the amount of overlap Δk, Δn, means for said identification and means for addition. Is a means (705, 707) for identifying the index n _{1, i} of S _k (n) for k = 1, said S ₁ (n _i ) being non-zero, S _k (n _{i ′} ) Is non-zero, and for k> 1, n
_{1, i} and the overlap amount Δk, Δ
Means (705, 707) for determining yet another index n _{k, i '} using n, and adding the values of V (n) for such index and still other indices to sum Q. 13. Device according to claim 12, comprising means (713, 721) for forming (k).

14. The means for identifying, the means for determining, and the means for adding assume that S _k (n _{i ″} ) is non-zero and were not previously identified for k ≧ 2. First
Means for identifying the index n _{k, i ″} of S _k (n
_{i ″ ′} ) is non-zero, and for k ≧ 3, another index n
Means for determining _{ki "'} starting from the index n _{k, i"} and using said amount of overlap, and adding the values of V (n) for such yet another index and further summing Q 14. The apparatus of claim 13, comprising means for forming (k).

15. A method for CELP encoding speech by combining a first vector (701) with a second set of vectors identified by an index k, said first and second vectors. Has a value identified by the index n leading to n = 1 to N, the method comprising: n = 1 to N outputs, n = 1 to N first inputs, second inputs, and N × N first with n = 1 to N selection means
A multiplexer (704) of
The first logic level provided to the nth selection means couples the nth output to the nth first input, and the second logic level provided to the nth selection means is the nth output. A second output coupled to the second input, providing the n = 1 to N values of the first vector (701) to the n = 1 to N first inputs of the first multiplexer. Step, n = 1 to 2 of the second vector (702) with index k = 1
The value of N is set to n = 1 of the first multiplexer (704).
To N selection means, said second vector providing said first logic level for some values of n in said selection means of n = 1 to N and n. Providing the second logic level for other values, adding together the values of the first vector coupled to the output of the first multiplexer to provide a sum, and yet another of k Repeating the providing and summing steps for the values, and synthesizing the speech based on which sum identifies the second vector that gives the closest match to the target speech. For CELP encoding of speech to be played.

16. The second vector has values 0, + 1, −.
1 and providing a value of n = 1 to N of said second vector comprises a first entry having an entry 0,1 corresponding to a location of a 0, + 1 value of said second vector. Providing a portion to a first multiplexer and a second portion having a value 0,1 corresponding to a location of a value 0, -1 of the second vector, similar to the first multiplexer and Providing a second multiplexer responsive to the first input vector at its input and the second portion of the second vector at its selection means, similar to the first malplexer, said first multiplexer The values of the first vector coupled to the output of the multiplexer are added together to provide a first sum, and the values of the first vector coupled to the output of the second multiplexer are added together. To provide a second sum, the first and second sums are combined to provide an output useful for determining which input vector provides the closest match to the target speech. 16. The method of claim 15, comprising:

17. CELP for speech using a combination of a set of a first vector V (n) and a second vector S _k (n), where n = 1 to N and having a value identified by an index n. A method for encoding, wherein each of the second vectors is identified by an index k and each of the second vectors is zero or non-zero and n = 1 to N.
With values up to N identified by an exponent n, where S _k (n _i ) is nonzero and S is different for different k.
_k index _{n k} of the _(n), identifying a _i, the index _{n k,} sums by adding the value of V (n) corresponding to the _i Q
Forming (k), identifying the value k = j corresponding to the maximum value Q (k = j), and synthesizing the speech using S _{k = j} (n). A method for CELP encoding of speech, comprising:

18. The successive vectors of said set of said second vectors are determined by the overlap of the preceding second vectors according to the amount of overlap Δk, Δn, said discriminating and summing steps comprising S ₁ (n _i ) is non-zero, the index n of S _k (n)
_{Discriminating 1, i} with respect to k = 1, starting from n _{1, i} and overlapping amounts Δk, Δn
Using S _k (n _{i ′} ) is non-zero and k>
The step of determining yet another index n _{k, i ′} for _1, and adding the values of V (n) to such index and still other indices to form the sum Q (k). 18. The method of claim 17, comprising:

19. S _k (n _″ ) is non-zero, and
For k ≧ 2, the previously unidentified first index n
_{k, i ″} , and then using the amount of overlap, starting from the index n _{k, i ″} , S _k (n _{i ″ ″} )
Is non-zero, and for k ≧ 3, another index n
the step of determining _{k, i ″ ′} , and adding the values of V (n) for such further exponents and further summing Q
19. The method of claim 18, further comprising forming (k).