JPH11154894A

JPH11154894A - Direction-change echo canceler and method thereof

Info

Publication number: JPH11154894A
Application number: JP26738798A
Authority: JP
Inventors: Tom Hong Li; トム・ホン・リ
Original assignee: Motorola Inc
Current assignee: Motorola Solutions Inc
Priority date: 1997-09-04
Filing date: 1998-09-03
Publication date: 1999-06-08
Also published as: CN1212555A; GB9819078D0; FR2767941A1; GB2329097A; DE19836995A1; GB2329097B

Abstract

PROBLEM TO BE SOLVED: To provide an echo canceler, which can be executed in a comparatively small circuit with high calculation efficiency having low drain current characteristic to quickly converge and suppress echo signally more than 40 dB. SOLUTION: An echo canceler 200 converts a remote terminal input voice to its linear predictive remainder. At a coefficient generating circuit 204, an updating direction to the coefficient of the echo canceler is formed in two stages. In the first stage, the updating direction is formed in the region of the linear predictive remainder. On the second stage, the updating direction is converted from the remainder region to an audio region, so that an updating method to be used for adapting the coefficient can be provided.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、一般的に、エコー
・キャンセラに関し、更に特定すれば、高速収束エコー
・キャンセラ(fast converging echo canceller)に関す
るものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates generally to echo cancellers, and more particularly to a fast converging echo canceller.

【０００２】[0002]

【従来の技術】双方向通信システムの中には、反対方向
に送信する信号のために別個の経路を有するものがあ
る。かかるシステムでは、一方の経路の信号が反射し
て、他方の経路に侵入する場合がある。これら反射信号
は、一般的に、エコー信号と呼ばれ、所望の通信信号と
干渉する。その結果、反射信号を抑制するために、エコ
ー・キャンセラが開発された。2. Description of the Related Art Some two-way communication systems have a separate path for signals transmitting in the opposite direction. In such a system, a signal on one path may be reflected and enter the other path. These reflected signals are generally called echo signals and interfere with desired communication signals. As a result, echo cancellers have been developed to suppress reflected signals.

【０００３】エコー・キャンセラは、適応フィルタを用
いて、反射して信号経路の一方に侵入するエコー信号を
推定する。エコー信号を含む信号経路から、このエコー
推定値を減算することによって、ほぼエコーのない、即
ち、エコーを抑制した信号を発生する。[0003] An echo canceller uses an adaptive filter to estimate an echo signal that is reflected and enters one of the signal paths. Subtracting this echo estimate from the signal path containing the echo signal produces a signal that is substantially echoless, ie, echo suppressed.

【０００４】平均最小二乗（ＬＭＳ：least mean squar
e ）適応フィルタは比較的単純な構造を有し、計算上安
定かつ効率的であるので、エコー信号を推定するために
これらはエコーキャンセラに広く用いられている。しか
しながら、ＬＭＳ適応フィルタには、テレビ会議(telec
onference)やハンズ・フリー・セルラ通信(hands-free
cellular communication) のような、音響エコー・キャ
ンセレーション(acoustic echo cancellation)の用途に
用いた場合、収束速度(convergence rate)が遅いという
欠点がある。[0004] Mean least square (LMS)
e) Adaptive filters have a relatively simple structure and are computationally stable and efficient, so they are widely used in echo cancellers for estimating echo signals. However, LMS adaptive filters include video conferencing (telec
onference) and hands-free cellular communications (hands-free
When used for acoustic echo cancellation such as cellular communication, there is a disadvantage that the convergence rate is low.

【０００５】ＬＭＳ適応フィルタの別の欠点として、こ
れらが抑制するエコー・信号は多くの場合３０ｄＢ（デ
シベル）以下に過ぎないことがあげられる。典型的な音
響的用途では、エコー信号は非常に強力であり、所望の
通信信号と同じ位強い場合もある。かかる環境では、適
応フィルタは、素早く変化するエコー経路を模擬するた
めに、素早く収束しなければならず、エコー信号を少な
くとも４０ｄＢ抑制することが非常に望ましい。これら
２つの要件を満たすことができない場合、大きなエコー
推定誤差が生じ、これら大きな誤差の結果、所望の通信
信号の信号品質に著しい劣化が発生する。[0005] Another disadvantage of LMS adaptive filters is that they often suppress echo signals below 30 dB (decibel). In a typical acoustic application, the echo signal is very strong and can be as strong as the desired communication signal. In such an environment, the adaptive filter must converge quickly to simulate a rapidly changing echo path, and it is highly desirable to suppress the echo signal by at least 40 dB. If these two requirements cannot be met, large echo estimation errors will occur, which will result in significant degradation in the signal quality of the desired communication signal.

【０００６】繰り返し最小二乗（ＲＬＳ：recursive le
ast square) アルゴリズムアフィン投影アルゴリズム(a
ffine-projection algorithm) のような他のアルゴリズ
ムを用いた適応フィルタは、ＬＭＳフィルタを用いたエ
コー・キャンセラよりは素早く適応するが、これらは不
安定であるか、あるいは計算に費用がかかる。計算に費
用がかかるフィルタとは、大きな回路または大量の処理
資源を必要とするもののことである。不安定な適応フィ
ルタは、迷走性推定値(erratic estimate)を生成し、そ
のために十分に信頼性のある動作を達成することができ
ず、ユーザが通話を通じて明瞭に通信可能なことを保証
することができない。[0006] Repetitive least squares (RLS)
ast square) algorithm Affine projection algorithm (a
Adaptive filters using other algorithms (such as the ffine-projection algorithm) adapt faster than echo cancellers using LMS filters, but they are unstable or computationally expensive. Computationally expensive filters are those that require large circuits or large amounts of processing resources. Unstable adaptive filters generate erratic estimates and therefore cannot achieve sufficiently reliable behavior, ensuring that users can communicate clearly over a call Can not.

【０００７】[0007]

【発明が解決しようとする課題】素早い収束，安定性，
高い抑制，および計算上効率的な適応フィルタに対する
要望は、とりわけ、移動ハンズ・フリー電話機やハンズ
・フリー・テレビ会議のような環境では大きい。かかる
環境におけるエコー経路は、素早い変化，強いエコー，
および煩い背景ノイズを生じ易い。したがって、エコー
・キャンセラの適応フィルタは、素早く収束し、エコー
信号を４０ｄＢ以上抑制することが必要となる。また、
エコー・キャンセラは、計算上効率的であり、低ドレイ
ン電流特性を有する比較的小さな回路に実施可能である
ことも望まれている。[Problems to be solved by the invention] Fast convergence, stability,
The need for high suppression and computationally efficient adaptive filters is particularly acute in environments such as mobile hands-free telephones and hands-free video conferencing. The echo path in such an environment can be fast changing, strong echo,
And troublesome background noise is likely to occur. Therefore, the adaptive filter of the echo canceller needs to converge quickly and suppress the echo signal by 40 dB or more. Also,
It is also desirable that the echo canceller be computationally efficient and be implemented in relatively small circuits having low drain current characteristics.

【０００８】このため、音響エコーを発生し易い環境に
おいて、エコー経路が素早く変化し、エコー信号が強い
場合にも、安定で計算上効率的なフィルタであり続ける
ことが必要である。For this reason, in an environment in which an acoustic echo is likely to be generated, it is necessary to keep the filter stable and computationally efficient even when the echo path changes quickly and the echo signal is strong.

【０００９】[0009]

【発明の実施の形態】エコー・キャンセラは、遠端音声
サンプル(far-end speech sample) を、その線形予測残
余サンプル(linear prediction residual sample) に変
換する。エコー・キャンセラの係数に対する更新方向
を、線形予測残余領域において形成する。次に、残余領
域における更新方向(updating direction)を、音声領域
における更新方向に変換する。音声領域において変換さ
れた更新方向を用いて、エコー・キャンセラの係数適応
化を実行する。線形予測残余領域から音声領域への変換
によって更新方向を算出することにより、最適な更新方
向を得ることができる。これによって、エコー・キャン
セラは、エコー経路への適応化を大幅に高速化すること
ができ、安定性，効率，およびロバスト性を維持しつ
つ、エコー抑制の大幅な増大を可能にする。DETAILED DESCRIPTION OF THE INVENTION An echo canceller converts a far-end speech sample into its linear prediction residual sample. An update direction for the echo canceller coefficients is formed in the linear prediction residual area. Next, the updating direction in the remaining area is converted to the updating direction in the audio area. The coefficient adaptation of the echo canceller is performed using the updated direction converted in the voice domain. By calculating the update direction by converting the linear prediction residual area into the speech area, an optimal update direction can be obtained. This allows the echo canceller to significantly speed up the adaptation to the echo path, enabling a significant increase in echo suppression while maintaining stability, efficiency and robustness.

【００１０】従来の平均最小二乗（ＬＭＳ）エコー・キ
ャンセラ１００を、図１のデバイス１０１に示す。デバ
イス１０１は、ハンズ・フリー動作のためのマイクロフ
ォン１０４およびスピーカ１０２を含む。スピーカ経路
は、デコーダ１０６，および送受信機１２０の受信機出
力に接続されたデジタル−アナログ（Ｄ／Ａ）変換器１
０８を含む。マイクロフォン経路は、アナログ−デジタ
ル（Ａ／Ｄ）変換器１１４，コンバイナ１１２，および
送受信機１２０の送信機に結合されたエンコーダ１１６
を含む。したがって、信号ｘ（ｎ），ｙ（ｎ）およびｅ
（ｎ）はデジタル信号である。送受信機１２０は、無線
電話機内のアンテナ１２２に結合されている。A conventional Least Mean Square (LMS) echo canceller 100 is shown in device 101 of FIG. Device 101 includes a microphone 104 and a speaker 102 for hands-free operation. The speaker path is a digital-to-analog (D / A) converter 1 connected to the decoder 106 and the receiver output of the transceiver 120.
08. The microphone path includes an analog-to-digital (A / D) converter 114, a combiner 112, and an encoder 116 coupled to the transmitter of the transceiver 120.
including. Therefore, the signals x (n), y (n) and e
(N) is a digital signal. Transceiver 120 is coupled to antenna 122 in the wireless telephone.

【００１１】デバイス１０１は、移動無線電話機に接続
されたハンズ・フリー・アクセサリとすることができ、
あるいは無線電話機またはハンズ・フリー・テレビ会議
デバイスとすることもできる。エコー・キャンセラ１０
０は、デジタルおよび／またはアナログ回路を有する双
方向通信デバイス内に実施可能であることを、当業者は
認めよう。したがって、デコーダ１０６およびエンコー
ダ１１６は、例示の目的のためのものである。デコーダ
１０６は、送受信機１２０がアナログ・セルラ・ホーン
またはアナログ陸線ホーン内にある場合、またはデバイ
ス１０１がデジタル・セルラ・ホーンまたはデジタル陸
線ホーンのアナログ出力に接続されている場合は、Ａ／
Ｄ変換器とすることができる。あるいは、デコーダ１０
６は、デジタル・セルラ・ホーンのようなデジタル・デ
バイスにおいては、デジタル・デコーダとすることがで
きる。エンコーダ１１６は、送受信機１２０がアナログ
・セルラ・ホーンまたはアナログ陸線ホーン内にある場
合、またはデバイス１０１がデジタル・セルラ・ホーン
またはデジタル陸線ホーンのようなデジタル・デバイス
のアナログ出力に接続されている場合は、Ｄ／Ａ変換器
とすることができる。あるいは、エンコーダ１１６は、
デジタル・セルラ・ホーンのようなデジタル・デバイス
においては、デジタル・エンコーダとすることができ
る。Device 101 can be a hands-free accessory connected to a mobile radiotelephone,
Alternatively, it may be a wireless telephone or a hands-free video conferencing device. Echo Canceller 10
One skilled in the art will recognize that 0 can be implemented in a two-way communication device having digital and / or analog circuitry. Thus, decoder 106 and encoder 116 are for illustrative purposes. Decoder 106 may provide A / A if transceiver 120 is in an analog cellular horn or analog landline horn, or if device 101 is connected to the analog output of a digital cellular horn or digital landline horn.
It can be a D converter. Alternatively, the decoder 10
6 may be a digital decoder in a digital device such as a digital cellular horn. Encoder 116 may be connected when transceiver 120 is in an analog cellular horn or analog landline horn, or when device 101 is connected to the analog output of a digital device such as a digital cellular horn or digital landline horn. If so, it can be a D / A converter. Alternatively, the encoder 116
In a digital device such as a digital cellular horn, it can be a digital encoder.

【００１２】エコー・キャンセラ１００について、これ
より、図１ないし図３を参照しながら説明する。現サン
プリング時点をｎとし、遠端音声サンプルｘ（ｎ）がス
ピーカ１０２への出力であり、近端オーディオ信号ｙ
（ｎ）がマイクロフォン１０４から受信され、この説明
の目的のために、信号ｘ（ｎ），ｙ（ｎ）は同期されて
いるものとする。即ち、デジタル−アナログ変換器１０
８およびアナログ−デジタル変換器１１４は同じクロッ
クを用いることを意味する。近端オーディオ信号ｙ
（ｎ）は、近端音声ｓ（ｎ），エコーｔ（ｎ），および
近端ノイズＮ（ｎ）から成る。エコーｔ（ｎ）は、スピ
ーカ１０２によって出力された信号ｘ（ｎ）が反射して
マイクロフォン１０４に戻ってきた部分である。ノイズ
Ｎ（ｎ）は、例えば、車両の車室内における周囲ノイズ
である。The echo canceller 100 will now be described with reference to FIGS. Let n be the current sampling point, the far-end audio sample x (n) is output to the speaker 102, and the near-end audio signal y
(N) is received from microphone 104, and for the purposes of this description, signals x (n) and y (n) are assumed to be synchronized. That is, the digital-analog converter 10
8 and the analog-to-digital converter 114 mean using the same clock. Near-end audio signal y
(N) consists of near-end speech s (n), echo t (n), and near-end noise N (n). The echo t (n) is a portion where the signal x (n) output from the speaker 102 is reflected and returns to the microphone 104. The noise N (n) is, for example, ambient noise in a vehicle cabin.

【００１３】以下の説明では、適応フィルタの係数の適
応化を実行している間、近端音声ｓ（ｎ）をゼロ（即
ち、近端音声ｓ（ｎ）は存在しない）であると仮定す
る。近端音声および遠端音声が双方とも存在する場合、
即ち、一般的にダブル・トーク(double-talk) と呼ばれ
ている状態にある場合、適応化を停止しなければならな
い。この状況を検出し、この状態の間適応化を禁止する
ためのダブル・トーク検出器(double talk detector)は
当技術分野では既知であり、簡潔化のため以下では詳し
く説明しない。したがって、ここで引用する場合、近端
オーディオ信号ｙ（ｎ）はエコーｔ（ｎ）およびノイズ
Ｎ（ｎ）のみから成るものとする。In the following description, it is assumed that near-end speech s (n) is zero (ie, near-end speech s (n) does not exist) while performing the adaptation of the coefficients of the adaptive filter. . If both near-end and far-end audio are present,
That is, adaptation must be stopped in a state commonly referred to as double-talk. Double talk detectors for detecting this situation and inhibiting adaptation during this state are known in the art and will not be described in detail below for brevity. Therefore, when cited here, it is assumed that the near-end audio signal y (n) consists only of the echo t (n) and the noise N (n).

【００１４】エコー・キャンセラ１００は、エコー経路
をモデル化するために、係数発生回路１０９およびＬＭ
Ｓ適応フィルタ１１０を含む。ＬＭＳ適応フィルタ１１
０が発生するエコー推定値ｚ（ｎ）は、コンバイナ１１
２において信号ｙ（ｎ）と結合され、エコー信号の除去
即ち抑制が行われる。エコー推定誤差ｅ（ｎ）は、エコ
ー推定値ｚ（ｎ）と近端信号ｙ（ｎ）との間の差であ
る。ＬＭＳ適応フィルタ１１０の係数は、受信した遠端
音声サンプルおよびエコー推定誤差に基づいて、係数発
生回路１０９によって更新する。The echo canceller 100 includes a coefficient generation circuit 109 and an LM for modeling an echo path.
An S adaptive filter 110 is included. LMS adaptive filter 11
The estimated echo value z (n) at which 0 occurs is determined by the combiner 11
The signal is combined with the signal y (n) in 2 to remove or suppress the echo signal. The echo estimation error e (n) is the difference between the echo estimate z (n) and the near-end signal y (n). The coefficient of the LMS adaptive filter 110 is updated by the coefficient generation circuit 109 based on the received far-end voice sample and the echo estimation error.

【００１５】時点ｎにおいて、ＬＭＳ適応フィルタ１１
０の係数は、Ｗ（ｎ）＝［ｗ₀ （ｎ）ｗ₁ （ｎ）．．．
ｗ_L-1 （ｎ）］^T である。Ｌはフィルタ長であり、上付
き文字Ｔ（即ち［］^T ）は、ベクトルの転置を意味す
る。Ｌ個の遠端受信遠端音声サンプルは、Ｘ（ｎ）＝
［ｘ（ｎ），ｘ（ｎ−１），．．．，ｘ（ｎ−Ｌ＋
１）］^T である。ＬＭＳ適応フィルタのエコー推定値ｚ
（ｎ）は、次の式で表される。ｚ（ｎ）＝Ｘ（ｎ）^T Ｗ（ｎ）（１）ＬＭＳ適応フィルタ１１０に対するエコー推定誤差ｅ
（ｎ）は、次の式で表される。At time n, the LMS adaptive filter 11
Coefficient of 0, W (n) = [w 0 (n) w 1 (n). . .
w _L-1 (n)] ^T. L is the filter length, and the superscript T (ie, [] ^T ) means transposition of the vector. The L far-end received far-end speech samples are X (n) =
[X (n), x (n-1),. . . , X (n−L +
1)] ^T. Echo estimate z of LMS adaptive filter
(N) is represented by the following equation. z (n) = X (n) ^T W (n) (1) Echo estimation error e for LMS adaptive filter 110
(N) is represented by the following equation.

【００１６】ｅ（ｎ）＝ｙ（ｎ）−ｚ（ｎ）（２）ＬＭＳ適応フィルタ１１０の係数Ｗ（ｎ）は、次の式に
したがって更新される。E (n) = y (n) −z (n) (2) The coefficient W (n) of the LMS adaptive filter 110 is updated according to the following equation.

【００１７】[0017]

【数３】ここで、μは適応化の刻み幅(step size) であり、｜｜
Ｘ（ｎ）｜｜² ＝Ｘ（ｎ）^T Ｘ（ｎ）である。(Equation 3) Where μ is the adaptation step size, and
X (n) || ² = X (n) ^T X (n).

【００１８】Ｒ（ｎ）＝Ｅ｛Ｘ（ｎ）Ｘ（ｎ）^T ｝＝
｛ｒ_ij（ｎ）｜ｉ，ｊ＝０，１，．．．，Ｌ−１｝は、
遠端音声サンプルの自己相関マトリクスである。Ｅ
｛^* ｝は、統計的平均である。遠端音声信号は非定置状
(nonstationary) であるので、Ｒ（ｎ）は時間変動Ｌｘ
Ｌマトリクスであり、Ｌ個の時間変動正固有値｛λ_i
（ｎ）ｉ＝０，１，．．．，Ｌ−１｝およびＬ個の時間
変動固有ベクトル｛ｖ_i （ｎ）ｉ＝０，１，．．．，Ｌ
−１）｝を有する。Ｓ（ｎ）＝Ｅ｛ｙ（ｎ）Ｘ（ｎ）｝
＝［ｓ₀ （ｎ）ｓ₁ （ｎ）．．．Ｓ_L-1 （ｎ）］^T ＝
［Ｅ｛ｙ（ｎ）ｘ（ｎ）｝Ｅ｛ｙ（ｎ）ｘ（ｎ−
１）｝．．．Ｅ｛ｙ（ｎ）Ｘ（ｎ−Ｌ＋１）｝］^T は、
遠端音声サンプルおよび近端オーディオ信号の自己相関
ベクトルである。R (n) = E {X (n) X (n) ^T } =
{R _ij (n) | i, j = 0,1,. . . , L-1} is
6 is an autocorrelation matrix of far-end speech samples. E
｛ ^* ｝ Is the statistical average. Far-end audio signal is non-stationary
(nonstationary), R (n) is a time variation Lx
L matrix, L time-varying positive eigenvalues ｛λ _i
(N) i = 0, 1,. . . , L-1} and L time variation eigenvectors _{{v i (n) i =} 0,1 ,. . . , L
-1) has｝. S (n) = E {y (n) X (n)}
= [S ₀ (n) s ₁ (n). . . S _L-1 (n)] ^T =
[E ｛y (n) x (n)｝ E ｛y (n) x (n−
1) II. . . E {y (n) X (n-L + 1)}] ^T is
It is an autocorrelation vector of a far-end audio sample and a near-end audio signal.

【００１９】適応フィルタの性能は、Ｌ個の時間変動固
有値および固有ベクトルの値によって左右される。μは
固定であり、次の規則に従うように選択しなければなら
ないことは既知である。The performance of the adaptive filter depends on the values of the L time-varying eigenvalues and the eigenvectors. It is known that μ is fixed and must be chosen to obey the following rules:

【００２０】０＜μ＜２／λ_max （ｎ）（４）ここで、λ_max （ｎ）はＲ（ｎ）の最大固有値である。
遠端音声は非定置状であるので、λ_max （ｎ）は大きな
ダイナミック・レンジを有する。しかしながら、適応フ
ィルタの安定性を維持するためには、μは、常に式
（４）の範囲に収まるように、非常に小さくなければな
らない。この安定性を維持するための小さなμは、ＬＭ
Ｓ適応フィルタの収束が遅い原因の１つである。0 <μ <2 / λ _max (n) (4) where λ _max (n) is the maximum eigenvalue of R (n).
Since the far-end speech is non-stationary, λ _max (n) has a large dynamic range. However, in order to maintain the stability of the adaptive filter, μ must be very small so that it always falls within the range of equation (4). A small μ to maintain this stability is LM
This is one of the causes of the slow convergence of the S adaptive filter.

【００２１】ＬＭＳ適応フィルタ１１０の最適係数は、
Ｗ₀ ＝［ｗ₀ ｗ₁ ，．．．，ｗ_L-1］^T である。通常、
Ｗ₀ は、時間と共に非常にゆっくりと変化する。したが
って、ここでは、これを一定ベクトルと見なす。ＬＭＳ
適応フィルタが現係数Ｗ（ｎ）を用いて発生するエコー
推定誤差は、以下のように与えられる。The optimum coefficient of the LMS adaptive filter 110 is
W ₀ = [w ₀ w ₁ ,. . . , W _L-1 ] ^T. Normal,
W ₀ changes very slowly over time. Therefore, here, this is regarded as a constant vector. LMS
The echo estimation error generated by the adaptive filter using the current coefficient W (n) is given as follows.

【００２２】 ε（ｎ）＝ｙ（ｎ）−ｚ（ｎ）＝ｙ（ｎ）−Ｘ（ｎ）^T Ｗ（ｎ）（５）ここで、ｙ（ｎ）は近端エコー信号である。ｙ（ｎ）が
エコーおよびノイズで構成されている状況について、以
下で検討する。以下の分析では、近端ノイズは存在しな
いと仮定する。Ｗ（ｎ）がｙ（ｎ）およびＸ（ｎ）と相
関付けされていない場合（この想定は、Ｗ（ｎ）が収束
し、Ｗ（ｎ）が一定ベクトルと見なされる場合、近似的
に正しい）、ＬＭＳ適応エコー・キャンセラのエコー推
定値の平均二乗誤差は、以下によって定義される。Ε (n) = y (n) −z (n) = y (n) −X (n) ^T W (n) (5) where y (n) is a near-end echo signal. The situation where y (n) is composed of echo and noise is discussed below. The following analysis assumes that there is no near-end noise. If W (n) is not correlated with y (n) and X (n) (this assumption is approximately correct if W (n) converges and W (n) is considered a constant vector) ), The mean square error of the echo estimate of the LMS adaptive echo canceller is defined by:

【００２３】 ξ＝Ｅ｛ε² （ｎ）｝＝Ｅ｛［ｙ（ｎ）−Ｘ（ｎ）^T Ｗ（ｎ）］² ｝＝Ｅ｛ｙ（ｎ）² ｝＋Ｅ｛Ｗ（ｎ）^T Ｘ（ｎ）Ｘ（ｎ）^T Ｗ（ｎ）｝−２Ｅ｛ｙ（ｎ）Ｘ（ｎ）Ｗ（ｎ）^T ｝＝Ｅ｛ｙ（ｎ）² ｝＋Ｗ（ｎ）^T Ｅ｛Ｘ（ｎ）Ｘ（ｎ）^T ｝Ｗ（ｎ）−２Ｅ｛ｙ（ｎ）Ｘ（ｎ）｝Ｗ（ｎ）^T ＝Ｅ｛ｙ（ｎ）² ｝＋Ｗ（ｎ）^T Ｒ（ｎ）Ｗ（ｎ）−２Ｓ（ｎ）Ｗ（ｎ）^T （６）Ｗ₀ はＷ（ｎ）の最適解であり、ξはＷ（ｎ）の関数で
あるので、Ｗ₀ におけるξの傾斜はゼロである。これ
は、以下の式を満足する。Ξ = E ｛ε ² (n)｝ = E ｛[y (n) −X (n) ^T W (n)] ² ＝= E ｛y (n) ² ｝ + E ｛W (n) ^T X (n) X (n) T W (n)} - 2E {y (n) X (n) W (n) T} = E {y (n) 2} + W (n) T E {X (n ) X (n) ^T {W (n) -2E} y (n) X (n)} W (n) ^T = E {y (n) ² } + W (n) ^TR (n) W (n) −2S (n) W (n) ^T (6) Since W ₀ is the optimal solution of W (n) and ξ is a function of W (n), the slope of ξ at W ₀ is zero. This satisfies the following equation:

【００２４】 ∂ξ／∂Ｗ（ｎ）＝０＝＞２Ｒ（ｎ）Ｗ₀ −２Ｓ（ｎ）＝０＝＞Ｗ₀ ＝Ｒ（ｎ）^-1Ｓ（ｎ）（７）（７）式を（６）式に代入することによって、以下の式
を用いて、エコー推定値の最小平均二乗誤差を得ること
ができる。[0024] ∂ξ / ∂W (n) = 0 => 2R (n) W 0 -2S (n) = 0 => W 0 = R (n) -1 S (n) (7) (7) equation Into the equation (6), the minimum mean square error of the echo estimation value can be obtained using the following equation.

【００２５】 ξ_min ＝Ｅ｛ｙ（ｎ）² ｝−Ｓ（ｎ）^T Ｗ₀ （８） ξ_min は時間変動するが、時間と共に非常にゆっくり変
化するので、ここでは定数と見なす。式（８）および式
（７）を式（６）に代入することによって、以下の関係
を証明することができる。Ξ _min = E ｛y (n) ² ｝ −S (n) ^T W ₀ (8) Although ξ _min fluctuates with time, it changes very slowly with time, so it is regarded as a constant here. By substituting Equations (8) and (7) into Equation (6), the following relationship can be proven.

【００２６】 ξ＝ξ_min ＋［Ｗ（ｎ）−Ｗ₀ ］^T Ｒ（ｎ）［Ｗ（ｎ）−Ｗ₀ ］（９）ＬＭＳ適応フィルタは、Ｗ（ｎ）をその真値Ｗ₀ に近づ
けようとする。即ち、ξをξ_min に近づけようとする。
適応フィルタの収束挙動は、係数が２つの場合、即ち、
Ｌ＝２で、Ｗ（ｎ）＝［ｗ₀ （ｎ）ｗ₁ （ｎ）］および
Ｗ₀ ＝［ｗ₀ ｗ₁ ］という場合について理解することが
できる。したがって、図２および図３に示す等平均二乗
曲線(equal mean square curve) を参照しながら、適応
フィルタの収束挙動について説明する。これらの曲線
は、ξを異なる一定値に設定し、ｗ₀ （ｎ）およびｗ₁
（ｎ）を軸として用いることによって、プロットしたも
のである。曲線は、ｗ₀ （ｎ）−ｗ₁ （ｎ）面では楕円
形であり、Ｒ（ｎ）の固有ベクトルが楕円の主軸を規定
し、固有値が誤差面の急降度(steepness) を規定する。
したがって、固有値が大きい程、軸が長く、マトリクス
Ｒ（ｎ）の固有値は、軸の長さを定義する。Ξ = ξ _min + [W (n) −W ₀ ] ^TR (n) [W (n) −W ₀ ] (9) The LMS adaptive filter converts W (n) to its true value W ₀ . Try to get closer. That is, ξ is made to approach ξ _min .
The convergence behavior of the adaptive filter is the case where there are two coefficients,
It is possible to understand the case where L = 2 and W (n) = [w ₀ (n) w ₁ (n)] and W ₀ = [w ₀ w ₁ ]. Therefore, the convergence behavior of the adaptive filter will be described with reference to the equal mean square curves shown in FIGS. These curves set ξ to different constant values, w ₀ (n) and w ₁
This is plotted by using (n) as an axis. Curve is w ₀ (n) oval with -w ₁ (n) plane, the eigenvectors of R (n) is defined the major axis of the ellipse, the eigenvalues defines a sudden Hod (steepness) of the error surface.
Thus, the larger the eigenvalue, the longer the axis, and the eigenvalue of the matrix R (n) defines the length of the axis.

【００２７】音声信号は非定置であるので、等平均二乗
誤差曲線は、時間変動楕円形(time-varying ellipses)
となる。ＬＭＳ適応フィルタは、更新方向として、図２
における１１および図３における１３の傾斜方向を採用
する。傾斜方向は、単純で信頼性の高い唯一の更新方向
である。図からわかるように、方向１１は、図２に示す
最適方向１２から大きく変動し、更新方向１３は、マト
リクスＲ（ｎ）が時間変動固有値を有するこれら２つの
例を含む殆どの場合、図３に示す最適方向から大きく変
動する。Since the audio signal is non-stationary, the equal mean square error curve has a time-varying ellipses.
Becomes The LMS adaptive filter uses the update direction shown in FIG.
3 and 13 in FIG. The tilt direction is the only simple and reliable update direction. As can be seen, the direction 11 fluctuates significantly from the optimal direction 12 shown in FIG. 2 and the update direction 13 is almost always the same as FIG. 3 where the matrix R (n) includes these two examples with time-varying eigenvalues. Greatly fluctuates from the optimal direction shown in FIG.

【００２８】等平均二乗誤差曲線の形状およびサイズ
は、時間と共に変化する。したがって、ＬＭＳ適応フィ
ルタは、多くの場合、最適な方向から大きな発散を有す
る更新方向を用いる。これは、何故ＬＭＳ適応フィルタ
の収束が遅く、音響エコー・キャンセレーションのよう
な用途においてエコー信号の４０ｄＢ以上の抑制が不可
能な別の理由である。The shape and size of the mean-square error curve change over time. Thus, LMS adaptive filters often use update directions that have a large divergence from the optimal direction. This is another reason why the convergence of the LMS adaptive filter is so slow that it is not possible to suppress the echo signal by more than 40 dB in applications such as acoustic echo cancellation.

【００２９】改善されたエコー・キャンセラ２００のシ
ステム・アーキテクチャを図４に示す。エコー・キャン
セラ２００は、マイクロプロセッサ，デジタル信号プロ
セッサ，マイクロコンピュータ，コンピュータ，または
その他の適切な回路であればいかなるものを用いても、
実施可能である。エコー・キャンセラ２００は、線形予
測回路２０２を含み、入力信号である遠端音声信号ｘ
（ｎ）を受信するように接続され、線形予測係数および
残余信号ｄ（ｎ）を出力２０３に出力する。線形予測回
路は、いずれか適切な線形予測フィルタ回路を用いて実
施することができる。線形予測誤差フィルタは、その係
数を推定するために処理速度が速い多くのアルゴリズム
が入手可能であるので有利である。好ましくは、計算上
効率的でありかつ安定な高速アルゴリズムを用いる。こ
れらの特性を有する線形予測誤差フィルタを選択するこ
とによって、エコー・キャンセラ２００が安定で効率的
であることを保証する。The system architecture of the improved echo canceller 200 is shown in FIG. Echo canceller 200 may be implemented using a microprocessor, digital signal processor, microcomputer, computer, or any other suitable circuit.
It is feasible. The echo canceller 200 includes a linear prediction circuit 202, and a far-end audio signal x as an input signal.
(N), and outputs a linear prediction coefficient and a residual signal d (n) to an output 203. The linear prediction circuit can be implemented using any suitable linear prediction filter circuit. The linear prediction error filter is advantageous because many algorithms are available with a fast processing speed to estimate its coefficients. Preferably, a fast algorithm that is computationally efficient and stable is used. Choosing a linear prediction error filter with these characteristics ensures that the echo canceller 200 is stable and efficient.

【００３０】線形予測分析は、非常に精度が高い音声パ
ラメータの推定値を与えることができる。線形予測シス
テムは、発声音声(voiced speech) に対するインパルス
列、または非発声音声(unvoiced speech) に対するラン
ダム・ノイズ・シーケンスによって励起される。線形予
測システムのパラメータは、時間と共に非常にゆっくり
と変動する。音声サンプルｓ（ｎ）と励起ｕ（ｎ）との
間の関係は、次の式によって表される。Linear prediction analysis can provide very accurate estimates of speech parameters. Linear prediction systems are excited by an impulse train for voiced speech or a random noise sequence for unvoiced speech. The parameters of a linear prediction system vary very slowly over time. The relationship between the audio sample s (n) and the excitation u (n) is represented by the following equation.

【００３１】[0031]

【数４】ここで、ｐは時間変動デジタル・フィルタの次数、Ｇは
ゲイン・ファクタ、および｛α_k ｜ｋ＝１，
２，．．．，ｐ｝は、時間変動デジタル・フィルタの係
数である。(Equation 4) Where p is the order of the time-varying digital filter, G is the gain factor, and ｛α _k | k = 1,
2,. . . , P} are the coefficients of the time-varying digital filter.

【００３２】音声サンプルの線形予測は、過去のサンプ
ルの線形結合(linear combination)から現音声サンプル
の推定値を生成する。ｓ（ｎ）ハットをｓ（ｎ）の線形
予測値とすると、ｓ（ｎ）ハットは、以下のように、予
測フィルタ係数｛α_k ｜ｋ＝１，２，．．．，ｐ｝を用
いて、線形予測器によって得ることができる。Linear prediction of speech samples generates an estimate of the current speech sample from a linear combination of past samples. Assuming that the s (n) hat is a linear prediction value of s (n), the s (n) hat has prediction filter coefficients {α _k | k = 1, 2,. . . , P} using a linear predictor.

【００３３】[0033]

【数５】式（１１）の線形予測器の予測誤差は、以下の式で与え
られる。(Equation 5) The prediction error of the linear predictor of Expression (11) is given by the following expression.

【００３４】[0034]

【数６】式（１２）から、予測誤差シーケンスは、線形予測誤差
フィルタの出力であり、その伝達関数は、以下の通りで
あることがわかる。(Equation 6) From equation (12), it can be seen that the prediction error sequence is the output of the linear prediction error filter, and its transfer function is as follows.

【００３５】[0035]

【数７】線形予測誤差フィルタ係数｛α_k ｜ｋ＝１，
２，．．．，ｐ｝を推定する繰り返し高速アルゴリズム
は、既知である。線形予測誤差フィルタは、白色ノイズ
の特性を有する残余信号を生成するので、白色化フィル
タ(whitening filter)である。本発明では、Levinson-D
urbin 繰り返しアルゴリズムが特に有利である。このフ
ィルタは、多数の望ましい特性を有する。これは、音声
予測には普及しており、したがって非常によく理解さ
れ、開発されている。これは計算上効率的であるので、
比較的小さな回路に実施することができる。このアルゴ
リズムの別の重要な特性としては、このフィルタを採用
した線形予測誤差フィルタは安定であることがあげられ
る。(Equation 7) Linear prediction error filter coefficient ｛α _k | k = 1,
2,. . . , P} are known. The linear prediction error filter is a whitening filter because it generates a residual signal having white noise characteristics. In the present invention, Levinson-D
The urbin iterative algorithm is particularly advantageous. This filter has a number of desirable properties. It is popular for speech prediction and is therefore very well understood and developed. This is computationally efficient, so
It can be implemented in relatively small circuits. Another important property of this algorithm is that the linear prediction error filter employing this filter is stable.

【００３６】エコー・キャンセラ２００は、受信オーデ
ィオ・サンプルのフレーム単位で動作する。受信オーデ
ィオ・サンプルとは、スピーカ１０２に出力される遠端
信号およびマイクロフォン１０４から入力される近端信
号双方からの信号サンプルを意味する。遠端信号および
近端信号双方からのオーディオ・サンプルは同期が取ら
れている。フレーム・サイズＫは、通常５０ないし２０
０サンプルである。一旦遠端および近端双方からＫ個の
新しいオーディオ・サンプルのフレームが受信される
と、アルゴリズムの動作が開始する。Ｋ個の受信遠端音
声サンプルは、ｘ（ｎ），ｘ（ｎ−１），．．．ｘ（ｎ
−Ｋ＋１）であり、Ｋ個の受信近端信号サンプルは、ｙ
（ｎ），ｙ（ｎ−１），．．．ｙ（ｎ−Ｋ＋１）であ
る。ここで、ｎは現サンプル時点である。The echo canceller 200 operates on a frame basis of a received audio sample. The received audio samples refer to signal samples from both the far-end signal output to the speaker 102 and the near-end signal input from the microphone 104. Audio samples from both the far-end and near-end signals are synchronized. The frame size K is usually 50 to 20
0 samples. Once the frames of K new audio samples have been received from both the far end and the near end, the operation of the algorithm begins. The K received far-end speech samples are x (n), x (n-1),. . . x (n
−K + 1), and the K received near-end signal samples are y
(N), y (n-1),. . . y (n−K + 1). Here, n is the current sample time.

【００３７】動作において、Ｋ個の受信遠端音声サンプ
ル、即ち、入力信号サンプルの線形予測分析を、最初に
線形予測回路２０２において実行する。線形予測誤差フ
ィルタ係数｛α_k ｜ｋ＝１，２，．．．，ｐ｝は、例え
ば、Levinson-Durbin アルゴリズムのような種々のアル
ゴリズムによって、推定することができる。線形予測回
路２０２に用いる線形予測誤差フィルタの次数ｐは、例
えば、５ないし１５の範囲とするとよい。線形予測回路
２０２に対する線形予測誤差フィルタは、以下の式で表
される伝達関数Ａ（ｚ）を有する。In operation, a linear prediction analysis of the K received far-end speech samples, ie, the input signal samples, is first performed in the linear prediction circuit 202. Linear prediction error filter coefficients {α _k | k = 1, 2,. . . , P} can be estimated by various algorithms such as the Levinson-Durbin algorithm. The order p of the linear prediction error filter used in the linear prediction circuit 202 may be, for example, in the range of 5 to 15. The linear prediction error filter for the linear prediction circuit 202 has a transfer function A (z) represented by the following equation.

【００３８】[0038]

【数８】適応フィルタの係数は、係数更新回路である、係数発生
回路２０４において更新される。ＬＭＳ適応フィルタ１
１０において更新された係数を用いて、適応エコー・キ
ャンセレーションを、Ｋ個の受信オーディオ・サンプ
ル、即ち、入力信号サンプル上で実行する。時間インデ
ックスｉが、Ｋ個の新たな受信オーディオ・サンプルの
時間範囲内にある。即ち、ｎ≦ｉ≦ｎ−Ｋ＋１である。
Ｗ（ｉ）＝［ｗ₀ （ｉ）ｗ₁ （ｉ）．．．ｗ_L-1
（ｉ）］^T は、時点ｉにおける適応エコー・キャンセラ
の係数であり、Ｌは適応エコー・キャンセラの長さであ
り、Ｘ（ｉ）＝［Ｘ（ｉ）ｘ（ｉ−１）．．．ｘ（ｉ−
Ｌ＋１）］^T は、時点ｉにおけるＬ個の受信遠端音声サ
ンプルのベクトルである。Ｌは、Ｋよりも小さくても大
きくてもよい。(Equation 8) The coefficients of the adaptive filter are updated in a coefficient generating circuit 204, which is a coefficient updating circuit. LMS adaptive filter 1
Using the updated coefficients at 10, adaptive echo cancellation is performed on the K received audio samples, ie, the input signal samples. The time index i is within the time range of the K new received audio samples. That is, n ≦ i ≦ n−K + 1.
_{W (i) = [w 0} (i) w 1 (i). . . w _L-1
(I)] ^T is the coefficient of the adaptive echo canceller at time point i, L is the length of the adaptive echo canceller, and X (i) = [X (i) x (i−1). . . x (i-
L + 1)] ^T is a vector of L received far-end speech samples at time i. L may be smaller or larger than K.

【００３９】Ｋ個の受信サンプルに対して、各サンプル
毎に、即ち、ｉ＝ｎ−Ｋ＋１，ｎ−ｋ＋２，．．．，ｎ
に対して、エコー・キャンセラ２００は、エコー・キャ
ンセレーションを実行する。エコー・キャンセラ２００
は：ｉ＝ｎ−Ｋ＋１において、係数Ｗ（ｎ−Ｋ＋１）を
用いてエコー・エコーキャンセレーションを実行し、係
数Ｗ（ｎ−Ｋ＋１）を係数Ｗ（ｎ−Ｋ＋２）に更新す
る。For each of the K received samples, i.e., i = n-K + 1, nk + 2,. . . , N
In response, the echo canceller 200 executes the echo cancellation. Echo canceller 200
Performs echo echo cancellation using the coefficient W (n-K + 1) at i = n-K + 1, and updates the coefficient W (n-K + 1) to the coefficient W (n-K + 2).

【００４０】時点ｉ＝ｎ−Ｋ＋２において、係数（ｎ−
Ｋ＋２）を用いてエコー・キャンセレーションを実行
し、係数（ｎ−Ｋ＋２）を係数（ｎ−Ｋ＋３）に更新す
る。At time i = n−K + 2, the coefficient (n−
The echo cancellation is performed using (K + 2), and the coefficient (n−K + 2) is updated to the coefficient (n−K + 3).

【００４１】更に、最後のサンプル時点ｉ＝ｎにおい
て、係数Ｗ（ｎ）を用いてエコー・キャンセレーション
を実行し、係数Ｗ（ｎ）が係数Ｗ（ｎ＋１）に更新され
るまで、連続する各サンプル時点においてこの動作を繰
り返し、次のフレームを待ち、当該フレームに対してこ
のシーケンスを繰り返す。Further, at the last sampling time point i = n, echo cancellation is performed using the coefficient W (n), and each successive echo is canceled until the coefficient W (n) is updated to the coefficient W (n + 1). This operation is repeated at the sample time, waiting for the next frame, and this sequence is repeated for that frame.

【００４２】指定した時点ｉにおいて、適応エコー・キ
ャンセラのエコー推定誤差は、以下のようになる。At the specified time point i, the echo estimation error of the adaptive echo canceller is as follows.

【００４３】ｅ（ｉ）＝ｙ（ｉ）−Ｘ（ｉ）^T Ｗ（ｉ）（１５）ここで、ｙ（ｉ）は時点ｉにおける近端オーディオ・サ
ンプルである。[0043] Here e (i) = y (i ) -X (i) T W (i) (15), y (i) is the near-end audio sample at time i.

【００４４】時点ｉにおいて、適応エコー・キャンセラ
の係数更新方向は、線形予測残余領域において構築され
る。受信遠端音声サンプルのＬ個の線形予測残余サンプ
ルｄ（ｉ），ｄ（ｉ−１），．．．ｄ（ｉ−Ｌ＋１）
は、線形予測誤差フィルタＡ（ｚ）を用いて、時点ｉに
おける受信遠端音声サンプルをフィルタ処理することに
よって生成される。At time i, the coefficient update direction of the adaptive echo canceller is constructed in the linear prediction residual area. The L linear prediction residual samples d (i), d (i-1),. . . d (i-L + 1)
Is generated by filtering the received far-end speech sample at time point i using a linear prediction error filter A (z).

【００４５】[0045]

【数９】線形予測残余領域における適応エコー・キャンセラの係
数更新方向は、Ｑ（ｉ）であり、以下のように表され
る。(Equation 9) The coefficient update direction of the adaptive echo canceller in the linear prediction residual region is Q (i), and is expressed as follows.

【００４６】[0046]

【数１０】ここで、Ｄ（ｉ）＝［ｄ（ｉ），ｄ（ｉ−１），．．．
ｄ（ｉ−Ｌ＋１）］^T 、およびＱ（ｉ）＝［ｑ₀
（ｉ），ｑ₁ （ｉ），．．．ｑ_L-1 （ｉ）］^T ，μは適
応化の刻み幅であり、｜｜Ｄ（ｉ）｜｜² ＝Ｄ（ｉ）^T
Ｄ（ｉ）である。(Equation 10) Here, D (i) = [d (i), d (i-1),. . .
d (i−L + 1)] ^T , and Q (i) = [q ₀
(I), q ₁ (i),. . . q _L-1 (i)] ^T and μ are the steps of adaptation, and || D (i) || ² = D (i) ^T
D (i).

【００４７】線形予測残余領域において適応エコー・キ
ャンセラの係数変更方向を得た後、係数発生回路２０４
では、音声領域における更新方法を、Ａ（ｚ）およびＱ
（ｉ）に基づいて算出することができる。音声領域にお
ける適応エコー・キャンセラの係数更新方向は、Ｇ
（ｉ）＝［ｇ₀ （ｉ）ｇ₁ （ｉ）．．．ｇ_L-1 （ｉ）］
^Tである。Ｇ（ｉ）の算出は、以下の式によって、Ｇ
（ｉ）の各成分について行う。After obtaining the coefficient change direction of the adaptive echo canceller in the linear prediction residual area, the coefficient generation circuit 204
Then, the update method in the audio domain is represented by A (z) and Q
It can be calculated based on (i). The coefficient update direction of the adaptive echo canceller in the voice domain is G
(I) = [g ₀ (i) g ₁ (i). . . g _L-1 (i)]
^T. G (i) is calculated by the following equation.
This is performed for each component of (i).

【００４８】[0048]

【数１１】ここで、ｇ_j （ｉ）＝１（ｊ＝−１，−２，．．．，−
ｐ）は、先の式を計算するために仮定されたものであ
る。[Equation 11] Here, g _j (i) = 1 (j = −1, −2,.
p) was assumed to calculate the above equation.

【００４９】式（１８）および式（１６）は、異なる動
作方向を有するフィルタを表す。式（１６）は、前進線
形予測フィルタ処理(forward linear prediction filte
ring) を行い、音声領域からの遠端音声サンプルを、線
形予測残余領域に変換する。式（１８）は、逆進線形予
測(backward linear prediction)を行い、更新方向を線
形予測残余領域から音声領域に変換する。Equations (18) and (16) represent filters with different operating directions. Equation (16) is used for forward linear prediction filter processing (forward linear prediction filter processing).
ring), and converts the far-end speech sample from the speech area into a linear prediction residual area. Equation (18) performs backward linear prediction and converts the update direction from the linear prediction residual area to the speech area.

【００５０】Ｇ（ｉ）を得た後、適応エコー・キャンセ
ラの係数は、以下のように、直接更新することができ
る。After obtaining G (i), the coefficients of the adaptive echo canceller can be updated directly as follows.

【００５１】Ｗ（ｉ＋１）＝Ｗ（ｉ）＋Ｇ（ｉ）（１９）ここで、Ｗ（ｉ＋１）は、時点ｉ＋１において次のサン
プルに用いるために更新された係数である。W (i + 1) = W (i) + G (i) (19) Here, W (i + 1) is a coefficient updated at time point i + 1 for use in the next sample.

【００５２】音声信号に対する線形予測誤差フィルタの
係数は、時間と共にゆっくりと変化するので、各フレー
ム内部では、線形予測誤差フィルタの係数は、事実上１
フレームの間では変化しない。したがって、これは、音
声サンプルの１フレームの内側では、線形時間不変ＦＩ
Ｒフィルタ(linear time-invariant FIR filter)と見な
すことができる。音声サンプルの１フレームに対する係
数は、Levinson-Durbin アルゴリズムによって推定す
る。また、線形予測誤差フィルタは、白色化フィルタで
あり、線形予測残余は事実上白色ノイズである。したが
って、線形予測残余の自己相関マトリクスは、白色ノイ
ズのそれである。Since the coefficients of the linear prediction error filter for the speech signal change slowly with time, the coefficients of the linear prediction error filter within each frame are substantially 1
It does not change between frames. Therefore, within one frame of the audio sample, this is a linear time-invariant FI
It can be regarded as an R filter (linear time-invariant FIR filter). The coefficient for one frame of the audio sample is estimated by the Levinson-Durbin algorithm. Also, the linear prediction error filter is a whitening filter, and the linear prediction residual is effectively white noise. Therefore, the autocorrelation matrix of the linear prediction residual is that of white noise.

【００５３】Ｐ（ｉ）＋Ｅ｛Ｄ（ｉ）Ｄ（ｉ）^T ｝＝
｛ｐ_ij（ｎ）｜ｉ，ｊ＝０，１，．．．，Ｌ−１｝は、
線形予測残余の自己相関マトリクスである。線形予測残
余領域における更新方向Ｑ（ｉ）を、適応化において更
新方向として用いる場合、時点ｉにおけるＬＭＳエコー
推定値の平均二乗誤差β（ｉ）は、式（５）ないし
（９）にしたがって、以下の式から得られる。P (i) + E {D (i) D (i) ^T } =
{P _ij (n) | i, j = 0,1,. . . , L-1} is
It is an autocorrelation matrix of a linear prediction residual. When the update direction Q (i) in the linear prediction residual region is used as the update direction in the adaptation, the mean square error β (i) of the LMS echo estimation value at the time point i is given by the following equations (5) to (9). It is obtained from the following equation.

【００５４】 β（ｉ）＝β_min ＋［Ｖ（ｉ）−Ｖ₀ ］^T Ｐ（ｉ）［Ｖ（ｉ）−Ｖ₀ ］（２０）ここで、β_min は、β（ｉ）の最小値である。先と同
様、これは、時間と共にゆっくりと変化するので、定数
と見なすことができる。Ｖ（ｉ）は、時点ｉにおけるエ
コー・キャンセラの係数であり、（１／Ａ（ｚ））によ
るＷ（ｉ）の畳み込みである。Ｖ₀ は、先に定義した係
数Ｖ（ｉ）の最適解であり、これは時間と共にゆっくり
と変化するので、一定ベクトルと見なされる。Ｖ（ｉ）
は、Ｗ₀ および（１／Ａ（ｚ））の畳み込みである。[0054] _{β (i) = β min +} [V (i) -V 0] T P (i) [V (i) -V 0] (20) Here, β _min is the smallest of the β (i) Value. As before, this can be considered a constant since it changes slowly over time. V (i) is a coefficient of the echo canceller at time point i, and is a convolution of W (i) by (1 / A (z)). V ₀ is the optimal solution of the previously defined coefficient V (i), which changes slowly with time and is therefore considered a constant vector. V (i)
Is the convolution of W ₀ and (1 / A (z)).

【００５５】線形予測残余領域に対する等平均二乗誤差
曲線を図５に示す。Ｐ（ｉ）は、Ｌ個の等しい固有値を
有し、等平均二乗誤差曲線は円である。その結果、傾斜
方向５１が、ＬＭＳフィルタに得られる唯一の信頼性の
ある更新方向となり、傾斜方向５１は、円上のいずれの
点においても最適な方向となる。Ｖ（ｉ）は、Ｗ（ｉ）
がＷ₀ に収束するよりも、格段に速くＶ₀ に収束する。
尚、Ｖ（ｉ）はＷ（ｉ）に等しくなく、実際には、（１
／Ａ（ｚ））によるＷ（ｉ）の畳み込みである。最適な
更新方向は、残余領域においてＱ（ｉ）として求めるこ
とができる。FIG. 5 shows an equal mean square error curve for the linear prediction residual region. P (i) has L equal eigenvalues and the equal mean square error curve is a circle. As a result, the tilt direction 51 is the only reliable update direction obtained for the LMS filter, and the tilt direction 51 is the optimal direction at any point on the circle. V (i) is W (i)
Converges to V ₀ much faster than converges to W ₀ .
Note that V (i) is not equal to W (i), and in fact, (1)
/ A (z)) is the convolution of W (i). The optimal update direction can be obtained as Q (i) in the remaining area.

【００５６】Ｗ（ｉ）に対する音声領域における最適更
新方向は、Ｑ（ｉ）およびＡ（ｚ）の得られる情報から
求めることができる。以下の式を、Ｖ（ｉ）の適応化に
ついて検討する。The optimal update direction in the voice domain for W (i) can be obtained from the information obtained from Q (i) and A (z). Consider the following equation for the adaptation of V (i):

【００５７】Ｖ（ｉ＋１）＝Ｖ（ｉ）＋Ｑ（ｉ）（２１）これは、線形予測残余領域における更新方向を用いるこ
とを除いて、図１のＬＭＳ適応フィルタと同一である。
収束の後、係数は、真の係数Ｗ（ｉ）と等価ではない
が、（１／Ａ（ｚ））による畳み込みと等価である。線
形予測誤差フィルタＡ（ｚ）は、受信遠端音声サンプル
の１フレームの内側では、時間不変であるので、Ｗ
（ｉ）はＶ（ｉ）およびＡ（ｚ）の畳み込みである。式
（２１）のＡ（ｚ）による畳み込みは、以下のように行
われる。V (i + 1) = V (i) + Q (i) (21) This is the same as the LMS adaptive filter of FIG. 1, except that the update direction in the linear prediction residual region is used.
After convergence, the coefficients are not equivalent to the true coefficients W (i), but are equivalent to convolution with (1 / A (z)). Since the linear prediction error filter A (z) is time-invariant within one frame of the received far-end speech sample, W
(I) is the convolution of V (i) and A (z). The convolution according to A (z) in equation (21) is performed as follows.

【００５８】[0058]

【数１２】ここで、丸印内のｘは、２つのフィルタの畳み込みを表
す。したがって、上述の適応方程式は、残余領域におけ
る更新方向のＡ（ｚ）による畳み込みによって、更新方
向を導出し、エコー・キャンセラ２００において用いら
れるのは、この変換更新方向である。(Equation 12) Here, x in a circle represents convolution of two filters. Therefore, the above adaptive equation derives the update direction by convolution of the update direction in the residual area with A (z), and it is the conversion update direction that is used in the echo canceller 200.

【００５９】変換更新方向は、未だ音声領域における最
適方向を示す。これは、以下によって立証される。（１）Ｑ（ｉ）は、Ｖ（ｉ）の適応化に対する最適方向
であるので、Ｖ（ｉ）＋ηＱ（ｉ）−Ｖ₀ ＝０となるよ
うなスカラηが存在する。（２）Ａ（ｚ）とＱ（ｉ）の畳み込みがＷ（ｉ）に対す
る最適更新方向でない場合、The conversion update direction indicates the optimum direction in the audio area. This is demonstrated by: (1) Since Q (i) is the optimal direction for the adaptation of V (i), there exists a scalar η such that V (i) + ηQ (i) −V ₀ = 0. (2) If the convolution of A (z) and Q (i) is not the optimal update direction for W (i),

【００６０】[0060]

【数１３】となるようなスカラαは存在しない。（３）Ａ（ｚ）は線形時間不変フィルタであり、(Equation 13) Does not exist. (3) A (z) is a linear time invariant filter,

【００６１】[0061]

【数１４】であるので、上述の式は以下のように書き換えることが
できる。[Equation 14] Therefore, the above equation can be rewritten as follows.

【００６２】[0062]

【数１５】（４）明らかに、上述の式を成り立たせる少なくとも１
つのスカラα＝ηがある。したがって、Ａ（ｚ）および
Ｑ（ｉ）の畳み込みによって、Ｗ（ｉ）に対する最適更
新方向が生成される。(Equation 15) (4) Clearly, at least one that satisfies the above equation
There are two scalars α = η. Therefore, the convolution of A (z) and Q (i) produces an optimal update direction for W (i).

【００６３】同様に、エコー・キャンセラ２００のエコ
ー推定値の二乗平均誤差は、式（５）ないし（９）にし
たがって、以下のように表すことができる。Similarly, the root-mean-square error of the echo estimated value of the echo canceller 200 can be expressed as follows according to equations (5) to (9).

【００６４】 ε（ｎ）＝ｙ（ｎ）−ｚ（ｎ）＝ｙ（ｎ）−Ｘ（ｎ）^T Ｗ（ｎ）（２３）ここで、ｙ（ｎ）は近端信号である。Ｗ（ｎ）＝［ｗ₀
（ｎ）ｗ₁ （ｎ）．．．ｗ_L-1 （ｎ）］^T は、エコー・
キャンセラ２００の係数である。Ｘ（ｎ）＝［ｘ
（ｎ），ｘ（ｎ−１），．．．，ｘ（ｎ−Ｌ＋１）］^T
は、受信遠端音声サンプルである。この新たな適応エコ
ー・キャンセラのエコー推定値の二乗平均誤差は、以下
によって定義される。[0064] ε (n) = y (n ) -z (n) = y (n) -X (n) T W (n) (23) where, y (n) is the near-end signal. W (n) = [w ₀
(N) w ₁ (n). . . w _L-1 (n)] ^T
This is the coefficient of the canceller 200. X (n) = [x
(N), x (n-1),. . . , X (n−L + 1)] ^T
Is the received far-end speech sample. The root mean square error of the echo estimate of this new adaptive echo canceller is defined by:

【００６５】 ξ_N ＝Ｅ｛ε² （ｎ）｝＝Ｅ｛［ｙ（ｎ）−Ｘ（ｎ）^T Ｗ（ｎ）］² ｝＝ξ_min ＋［Ｗ（ｎ）−Ｗ₀ ］^T Ｒ（ｎ）［Ｗ（ｎ）−Ｗ₀ ］ここで、ξ_min ＝Ｅ｛ｙ（ｎ）² ｝−Ｓ（ｎ）^T Ｗ₀
は、エコー・キャンセラ２００による最小平均二乗誤差
である。ｗ₀ ＝［ｗ₀ ｗ₁ ，．．．，Ｗ_L-1 ］^T は、エ
コー・キャンセラ２００の最適係数である。Ｒ（ｎ）＝
｛Ｘ（ｎ）Ｘ（ｎ）^T ｝は、以前と同様に定義される。
等平均二乗誤差曲線は、図１のＬＭＳエコー・キャンセ
ラのそれと同一である。しかしながら、図６に示すよう
に、更新方向は改善されている。[0065] _{^{ξ N = E {ε 2 (}} n)} = E {[y (n) -X (n) T W (n)] 2} = ξ min + [W (n) -W 0] T R (n) [W (n) -W 0] _{where, ξ min = E {y (} n) 2} -S (n) T W 0
Is the minimum mean square error by the echo canceller 200. w ₀ = [w ₀ w ₁ ,. . . , WL _-1 ] ^T are the optimal coefficients of the echo canceller 200. R (n) =
{X (n) X (n) ^T } is defined as before.
The equal mean square error curve is identical to that of the LMS echo canceller of FIG. However, as shown in FIG. 6, the update direction has been improved.

【００６６】２つの係数、即ち、Ｌ＝２で、Ｗ（ｉ）＝
［ｗ₀ （ｉ）ｗ₁ （ｉ）］およびＷ₀ ＝［ｗ₀ ｗ₁ ］の
場合について、新たな適応フィルタの二乗平均誤差曲線
を図６に示す。新たな適応フィルタは、その更新方向と
して、図１のＬＭＳ適応フィルタの場合のように、傾斜
方向６１を採用していない。係数発生回路２０４は、２
段階で係数更新方向を決定する。まず、遠端音声の線形
予測残余領域において、更新方向を求める。第２に、こ
の更新方向を音声領域に変換し、最適更新方向６２を得
る。係数発生回路２０４は、常に最適更新方向を用いる
ので、従来技術のＬＭＳフィルタにおいて発生する遠端
音声の固有ベクトルおよび固有値の変化の有害な影響を
回避する。Two coefficients, L = 2, W (i) =
FIG. 6 shows the root mean square error curve of the new adaptive filter for [w ₀ (i) w ₁ (i)] and W ₀ = [w ₀ w ₁ ]. The new adaptive filter does not employ the tilt direction 61 as its update direction as in the case of the LMS adaptive filter of FIG. The coefficient generation circuit 204
The coefficient updating direction is determined in stages. First, an update direction is obtained in the linear prediction residual area of the far-end speech. Second, the update direction is converted into a voice domain to obtain an optimal update direction 62. Since the coefficient generation circuit 204 always uses the optimal update direction, it avoids the detrimental effects of changes in eigenvectors and eigenvalues of far-end speech that occur in prior art LMS filters.

【００６７】エコー・キャンセラ２００（図２）は、従
来のＬＭＳ適応エコー・キャンセラよりも、格段に速く
収束する。また、エコー・キャンセラ２００は、６０ｄ
Ｂ以上音響エコーを抑制することができる。これは、音
響エコー・キャンセレーションの用途に利用する場合に
特に有用である。The echo canceller 200 (FIG. 2) converges much faster than the conventional LMS adaptive echo canceller. In addition, the echo canceller 200
B or more acoustic echo can be suppressed. This is particularly useful when used for acoustic echo cancellation applications.

【００６８】したがって、エコー・キャンセラ２００
（図２）は従来のＬＭＳ適応フィルタ１１０を用いるこ
とがわかる。しかしながら、係数発生回路２０４によっ
て、改善された更新方向が用いられる。更新方向は、線
形予測誤差フィルタの出力から導出され、このフィルタ
は、ブロック７００（図７）に示すように、入力信号サ
ンプルから線形予測係数を算出する。線形予測誤差フィ
ルタ係数を算出するためのアルゴリズムは、容易に実装
され、高速であり、しかも安定である。Levinson-Durbi
n アルゴリズムは、既知であり、計算上効率的であり、
エコー・キャンセラ回路のサイズを過度に増大させるこ
となく実装可能であるので、有利に採用することができ
る。他にも既知の低次の有限インパルス応答（ＦＩＲ）
フィルタがあり、線形予測情報を算出するために用いる
ことができる。Therefore, the echo canceller 200
2 shows that the conventional LMS adaptive filter 110 is used. However, an improved update direction is used by the coefficient generation circuit 204. The update direction is derived from the output of the linear prediction error filter, which calculates linear prediction coefficients from the input signal samples, as shown in block 700 (FIG. 7). The algorithm for calculating the linear prediction error filter coefficients is easily implemented, fast, and stable. Levinson-Durbi
The n algorithm is known, computationally efficient,
Since the echo canceller circuit can be mounted without excessively increasing the size of the echo canceller circuit, it can be advantageously employed. Other known low order finite impulse response (FIR)
There is a filter that can be used to calculate linear prediction information.

【００６９】線形予測残余領域における更新方向は、ブ
ロック７０１に示すように、線形予測残余から導出され
る。これは、遠端音声サンプルを線形予測残余に変換し
た後に行われる。これは、入力信号サンプルのＦＩＲフ
ィルタ処理を伴う。ＦＩＲフィルタの次数は、例えば、
約１０とすることができる。したがって、計算上要求さ
れる電力は小さく、安定性も保証される。したがって、
更新方向は、最初に線形予測残余領域にある（線形予測
フィルタによって処理される信号は、線形予測領域にあ
る）。The update direction in the linear prediction residual region is derived from the linear prediction residual, as shown in block 701. This is done after converting the far-end speech samples into linear prediction residuals. This involves FIR filtering of the input signal samples. The order of the FIR filter is, for example,
It can be about 10. Therefore, the power required for calculation is small, and stability is guaranteed. Therefore,
The update direction is initially in the linear prediction residual region (the signal processed by the linear prediction filter is in the linear prediction region).

【００７０】次に、ブロック７０２に示すように、更新
方向を線形予測残余領域から音声領域に変換する。音声
領域は、デジタル化音声である。領域変換は、ＦＩＲフ
ィルタを用いて行うことができる。音声領域における更
新方向は、係数発生回路２０４が用いて適応フィルタ係
数を更新する。Next, as shown in block 702, the update direction is converted from the linear prediction residual area to the speech area. The audio area is a digitized audio. The domain conversion can be performed using an FIR filter. The update direction in the voice domain is used by the coefficient generation circuit 204 to update the adaptive filter coefficients.

【００７１】ＬＭＳエコー・キャンセラは、簡素であ
り、ロバスト性が高く、しかも計算上効率的であること
が知られている。エコー・キャンセラ２００は、ＬＭＳ
フィルタを採用し、改善された更新方向を用いることに
よって適応化の収束速度を向上させつつ、ＬＭＳエコー
・キャンセラのこれらの特性を利用することが好まし
い。改善されたエコー・キャンセラは、容易に実施さ
れ、安定で、ロバスト性が高く、しかも計算上効率的で
あると結論付けることができる。シミュレーションによ
って、エコー・キャンセラ２００はノイズが多い環境に
おいてもロバスト性が高いことが示された。更新方向の
変換によって、収束速度が格段に高速化し、収束の高速
化によってエコー抑制も大幅に増大した。当業者は、変
換した適応方向の利点を、非ＬＭＳエコー・キャンセラ
に採用可能であることも認めよう。したがって、ここで
用いる場合「適応フィルタ」および「エコー・キャンセ
ラ」は、ＬＭＳ適応フィルタおよびＬＭＳエコー・キャ
ンセラには限定される訳ではない。It is known that the LMS echo canceller is simple, has high robustness, and is computationally efficient. The echo canceller 200 is an LMS
It is preferable to take advantage of these characteristics of the LMS echo canceller while employing filters and increasing the convergence speed of the adaptation by using an improved update direction. It can be concluded that the improved echo canceller is easy to implement, stable, robust and computationally efficient. Simulations have shown that the echo canceller 200 has high robustness even in noisy environments. The conversion of the update direction significantly increased the convergence speed, and the convergence speed increased the echo suppression. One skilled in the art will also recognize that the benefits of the transformed adaptation direction can be employed in non-LMS echo cancellers. Therefore, as used herein, "adaptive filter" and "echo canceller" are not limited to LMS adaptive filters and LMS echo cancellers.

[Brief description of the drawings]

【図１】従来技術によるエコー・キャンセラを示す回路
構成図。FIG. 1 is a circuit diagram showing an echo canceller according to a conventional technique.

【図２】係数が２つのみ、即ち、Ｌ＝２の場合に、ＬＭ
Ｓ適応エコー・キャンセラの係数の不正確な推定値によ
って発生したエコー推定値の等平均二乗誤差曲線の例を
示すグラフ。FIG. 2 shows that when only two coefficients are used, that is, when L = 2, LM
5 is a graph showing an example of an equal mean square error curve of an echo estimate generated by an incorrect estimate of the coefficient of the S adaptive echo canceller.

【図３】係数が２つのみ、即ち、Ｌ＝２の場合に、ＬＭ
Ｓ適応エコー・キャンセラの係数の不正確な推定値によ
って発生したエコー推定値の等平均二乗誤差曲線の別の
例を示すグラフ。FIG. 3 shows that when only two coefficients are used, that is, when L = 2, LM
9 is a graph illustrating another example of an equal mean square error curve of an echo estimate generated by an incorrect estimate of the coefficient of the S adaptive echo canceller.

【図４】エコー・キャンセラを示す回路構成図。FIG. 4 is a circuit diagram showing an echo canceller.

【図５】係数を２つのみ、即ち、Ｌ＝２として、線形予
測残余領域における更新方向を用いた場合に、適応エコ
ー・キャンセラの係数の不正確な推定値によって発生し
たエコー推定値の等平均二乗誤差曲線の例を示すグラ
フ。FIG. 5 shows an example of an echo estimation value generated by an inaccurate estimation value of a coefficient of an adaptive echo canceller when only two coefficients are used, that is, L = 2, and an update direction in a linear prediction residual region is used. 7 is a graph showing an example of a mean square error curve.

【図６】係数を２つのみ、即ち、Ｌ＝２として、線形予
測残余領域における更新方向から変換された、音声領域
における更新方向を用いた場合に、適応エコー・キャン
セラの係数の不正確な推定値によって発生するエコー推
定値の等平均二乗誤差曲線の例を示す図。FIG. 6 shows an inaccurate adaptive echo canceller coefficient when using only two coefficients, ie, L = 2, and using the update direction in the voice domain, which is transformed from the update direction in the linear prediction residual area. The figure which shows the example of the equal mean square error curve of the echo estimation value generated by an estimation value.

【図７】更新方向の発生を示すフロー・チャート。FIG. 7 is a flow chart showing the occurrence of an update direction.

[Explanation of symbols]

１００平均最小二乗（ＬＭＳ）エコー・キャンセラ１０１デバイス１０２スピーカ１０４マイクロフォン１０６デコーダ１０８デジタル−アナログ（Ｄ／Ａ）変換器１１２コンバイナ１１４アナログ−デジタル（Ａ／Ｄ）変換器１１６エンコーダ１２０送受信機２００エコー・キャンセラ２０２線形予測回路２０３出力２０４係数発生回路 REFERENCE SIGNS LIST 100 Mean least square (LMS) echo canceller 101 Device 102 Speaker 104 Microphone 106 Decoder 108 Digital-to-analog (D / A) converter 112 Combiner 114 Analog-to-digital (A / D) converter 116 Encoder 120 Transceiver 200 Echo Canceller 202 Linear prediction circuit 203 Output 204 Coefficient generation circuit

Claims

[Claims]

1. A method for generating an update direction in an echo canceller including an adaptive filter, comprising: calculating an update direction in a linear prediction residual area; and converting the update direction from the linear prediction residual area to a speech domain. Generating an audio domain update direction for use in updating the coefficients of the adaptive filter.

2. The method according to claim 1, further comprising generating a linear prediction residual signal using a Levinson-Durbin algorithm, and calculating the update direction in the linear prediction residual region.

3. The method according to claim 1, wherein the step of calculating the update direction in the linear prediction residual area includes a step of deriving a linear prediction residual signal calculated using a linear prediction coefficient. Method.

4. The method of claim 3, wherein said linear prediction residual signal is generated using a linear prediction error filter having a plurality of linear prediction coefficients.

5. The linear prediction residual signal is d (n). Where p is the order of the filter, x (n) and x (nk) are the current and past samples of the input signal, respectively, and {α _k } is the linear prediction coefficient of the linear prediction error filter. 5. The method according to claim 4, wherein
The described method.

6. The step of converting includes performing a filtering process in the updating direction in a finite impulse response filter,
4. The method of claim 3 including obtaining the update direction in the audio domain.

7. The method of claim 3, wherein the step of converting includes the step of converting the update direction to the audio domain using the linear prediction coefficients.

8. The step of transforming the update direction comprises updating the update direction g _n in the audio domain according to the following equation:
Calculating (i), Here, L is the length of the adaptive filter, p is the order of the linear prediction error filter, and Q (i) = [q ₀ (i), q ₁
(I),. . . , Q _L−1 (i)] ^T is the update direction in the linear prediction residual region, and G (i) = [g ₀ (i), g ₁
(I),. . . g _L-1 (i)] ^T is the voice region update direction, and {α _k } is a coefficient of the linear prediction error filter.

9. Estimating an echo estimate using the current coefficients of the adaptive filter and far-end speech samples; generating a linear prediction filter coefficient based on the far-end speech samples; Generating a linear prediction residual signal from the linear prediction residual signal; calculating the update direction for the echo canceller in the linear prediction residual region from the linear prediction residual signal; and, based on the current coefficient and the voice region update direction, 2. The method of claim 1, further comprising: generating update coefficients for the adaptive filter.

10. An echo canceller operating according to the method defined in claim 1. Description: