JP4642711B2

JP4642711B2 - Echo canceller

Info

Publication number: JP4642711B2
Application number: JP2006194704A
Authority: JP
Inventors: 健作藤井
Original assignee: Toa Corp
Current assignee: Toa Corp
Priority date: 2006-07-14
Filing date: 2006-07-14
Publication date: 2011-03-02
Anticipated expiration: 2026-07-14
Also published as: JP2008022478A

Description

本願発明は、エコーキャンセラに関し、特に、エコー経路変動とダブルトークとを識別することにより安定した動作を行うエコーキャンセラに関する。 The present invention relates to an echo canceller, and more particularly to an echo canceller that performs stable operation by discriminating echo path fluctuations and double talk.

通話形態の変化やハードウェアの進展に伴って、我々がマイクロホンを用いる機会は増える傾向にある。我々の聴覚は目的としている音に強く注意を向けることができるために、余分な雑音や残響音はそれほど気にならない。しかし、マイクロホンは必要とする音声だけでなく不要な雑音や残響音をも取得するため、それがそのままマイクロホンの出力となる。近年、ハンズフリーフォンやテレビ会議システムなど話者がマイクロホンから離れた位置で話すことが多くなっている。そのため、必要とする音声のレベルが低下し、相対的に雑音や残響音の影響が大きくなってきている。その影響を軽減する装置の一つに音響エコーキャンセラがある。エコーとは、スピーカーから出力された音声や室内の壁面で反射された間接波のマイクロホンへの回り込み信号である。このエコーを消去するシステムが音響エコーキャンセラである。音響エコーキャンセラは話者が部屋の中を動き回る場合やマイクロホンやスピーカーの位置を変えるような部屋の環境の変化、すなわちエコー経路の変化に自動的に素早く適応するために適応ディジタルフィルタによって構成される。 Along with changes in call formats and hardware, we have a greater chance of using microphones. Our auditory attention can be focused on the target sound, so extraneous noise and reverberation aren't bothered. However, since the microphone acquires not only the necessary sound but also unwanted noise and reverberation sound, it becomes the output of the microphone as it is. In recent years, speakers such as hands-free phones and video conferencing systems often talk at a position away from a microphone. For this reason, the level of the required voice is lowered, and the influence of noise and reverberant sound is relatively increased. One device that reduces this effect is an acoustic echo canceller. The echo is a sneak signal to the microphone of the sound output from the speaker or the indirect wave reflected by the wall surface in the room. A system for canceling this echo is an acoustic echo canceller. Acoustic echo cancellers are composed of adaptive digital filters to automatically adapt quickly to changes in the room environment, such as changing the location of microphones and speakers, as the speaker moves around the room, or changes in the echo path .

また、このエコーキャンセラにおける最も重要な特性の一つにダブルトークに対する安定性がある。ダブルトークとは近端話者が話すことである。適応フィルタの更新に利用される誤差信号はエコーを擬似エコーで相殺したあとの残差信号である。近端話者が話すとき、残差信号には近端話者の音声が残るので、この状態で適応フィルタを更新すれば発散の危険性が高まる。このダブルトークを検出するためにはダブルトークとエコー経路変動を区別する必要がある。それは、ダブルトークもエコー経路変動も残差信号を増加させるためである。この識別処理に関して、これまでに多くの方法が提案されている［例えば、特許文献１、非特許文献１、非特許文献２、非特許文献３、非特許文献４、非特許文献５参照］。当然ながら、その識別を誤ったり、時間がかかればハウリングする危険が高まる。また、その識別に正確を期せば、小さな（推定誤差の小さい）エコー経路変動に対して判定がさらに遅れるという問題が生じる。 One of the most important characteristics of this echo canceller is stability against double talk. Double talk is a near-end speaker speaking. The error signal used for updating the adaptive filter is a residual signal after the echo is canceled by a pseudo echo. When the near-end speaker speaks, since the near-end speaker's voice remains in the residual signal, the risk of divergence increases if the adaptive filter is updated in this state. In order to detect this double talk, it is necessary to distinguish between double talk and echo path fluctuation. This is because both double talk and echo path variations increase the residual signal. Many methods have been proposed so far regarding this identification processing [see, for example, Patent Document 1, Non-Patent Document 1, Non-Patent Document 2, Non-Patent Document 3, Non-Patent Document 4, and Non-Patent Document 5]. Naturally, the risk of howling will increase if the identification is wrong or it takes time. Further, if the identification is made accurate, there arises a problem that the determination is further delayed with respect to a small echo path variation (small estimation error).

このように、音響エコーキャンセラにおいて推定誤差が増加する原因の一つに、エコー経路の変動が挙げられるが、これはダブルトークとエコー経路変動の識別を行うことで解決できる。本願発明は、ダブルトークとエコー経路変動の識別のための新たな識別方法を採用したエコーキャンセラを提示する。
特開２００５−３１８５７１号公報 S.Minami and T.Kawasaki，“A double talk detection for an echo canceller，”ICC’85 Rec．，pp．1492-1497，1985．藤井健作，大賀寿郎，“音響エコーキャンセラのためのダブルトークとエコー経路変動検出に関する検討，”信学技法，EA-91-82，Jan．1991． Hau Ye and Bo-Xiu wu，“ A new double-talk detection algorithm based on the orthogonality theorem，”IEEE tranc．Commun．，vol．COM-39，no．11，pp．1542-1545，Nov．1991．藤井健作，大賀寿郎，“ ダブルトークとエコー経路変動の識別，”信学技法，EA-94-15，May 1994．藤井健作，大賀寿郎，“ エコー経路変動の検出を併用するダブルトーク検出法，”信学論(A)，vol．J78-A，no．3，pp．314-322，March 1995． Thus, one of the causes of the increase in the estimation error in the acoustic echo canceller is the fluctuation of the echo path, which can be solved by identifying the double talk and the echo path fluctuation. The present invention presents an echo canceller that employs a new identification method for identifying double talk and echo path variations.
JP 2005-318571 A S. Minami and T. Kawasaki, “A double talk detection for an echo canceller,” ICC'85 Rec. , Pp. 1492-1497, 1985. Kensaku Fujii and Toshiro Oga, “Study on double talk and echo path fluctuation detection for acoustic echo canceller,” IEICE Tech., EA-91-82, Jan. 1991. Hau Ye and Bo-Xiu wu, “A new double-talk detection algorithm based on the orthogonality theorem,” IEEE tranc. Commun. , Vol. COM-39, no. 11, pp. 1542-1545, Nov. 1991. Kensaku Fujii, Toshiro Oga, “Distinction of Double Talk and Echo Path Variation,” IEICE Tech., EA-94-15, May 1994. Kensaku Fujii, Toshiro Oga, “Double-talk detection method combined with detection of echo path fluctuations,” IEICE (A), vol. J78-A, no. 3, pp. 314-322, March 1995.

本願発明の目的は、エコー経路変動とダブルトークを正確に識別して、エコー経路変動に対しては係数の収束を早め、ダブルトークに対しては安定したエコー消去量の維持を可能とするエコーキャンセラの実現にある。 The object of the present invention is to accurately distinguish between echo path fluctuations and double talk, to accelerate the convergence of coefficients for echo path fluctuations, and to enable stable echo cancellation for double talk. It is in the realization of a canceller.

上記課題を解決するために、本願発明に係るエコーキャンセラは、スピーカに供給される受信信号が該スピーカからマイクロホンへ回り込むことによって生成されるエコー信号を、該マイクロホンの出力信号から除去するエコーキャンセラであって、該受信信号に基づいて疑似エコー信号を生成する係数更新可能なＦＩＲフィルタと、該マイクロホンの出力信号から該ＦＩＲフィルタが生成した疑似エコー信号を差し引くことによって誤差信号を生成する減算器と、該受信信号と該誤差信号とに基づいて該誤差信号が最小となるように適応アルゴリズムによって該ＦＩＲフィルタの係数を更新する係数更新部とを備え、第１値と第２値との比値に基づいてダブルトーク状態が発生しているか否かを判断し、該第１値が、該ＦＩＲフィルタの疑似エコー信号のパワー値に基づいて定められる値であり、該第２値が、該誤差信号のパワー値に基づいて定められる値であり、該比値の算出が所定時間間隔で繰り返され、今回算出された該比値と、前回算出された該比値との差の大きさに基づき、ダブルトーク状態が発生しているか否かを判断する。 In order to solve the above-described problems, an echo canceller according to the present invention is an echo canceller that removes an echo signal generated when a reception signal supplied to a speaker wraps around from the speaker to a microphone from the output signal of the microphone. A coefficient updateable FIR filter that generates a pseudo echo signal based on the received signal, and a subtractor that generates an error signal by subtracting the pseudo echo signal generated by the FIR filter from the output signal of the microphone; A coefficient updating unit for updating a coefficient of the FIR filter by an adaptive algorithm so that the error signal is minimized based on the received signal and the error signal, and a ratio value between the first value and the second value it is determined whether double talk state is generated based on, first value, suspected of the FIR filter A value determined based on the power value of the echo signal, the second value, Ri value der determined based on the power value of the error signal, the calculation of the ratio value is repeated at predetermined time intervals, the current Based on the magnitude of the difference between the calculated ratio value and the previously calculated ratio value, it is determined whether or not a double talk state has occurred.

また、上記課題を解決するために、本願発明に係るもう一つのエコーキャンセラは、スピーカに供給される受信信号が該スピーカから第１マイクロホンへ回り込むことによって生成されるエコー信号を、該第１マイクロホンの出力信号から除去するエコーキャンセラであって、該受信信号に基づいて疑似エコー信号を生成する係数更新可能なＦＩＲフィルタと、該第１マイクロホンの出力信号から該ＦＩＲフィルタが生成した疑似エコー信号を差し引くことによって誤差信号を生成する減算器と、該受信信号と該誤差信号とに基づいて該誤差信号が最小となるように適応アルゴリズムによって該ＦＩＲフィルタの係数を更新する係数更新部とを備え、第１値と第２値との比値に基づいてダブルトーク状態が発生しているか否かを判断し、該第１値が、該ＦＩＲフィルタの疑似エコー信号のパワー値に基づいて定められる値であり、該第２値が、該誤差信号のパワー値と、該第１マイクロホンの周囲の周囲騒音信号のパワー値とに基づいて定められる値であり、該周囲騒音信号が、該第１マイクロホンの近傍に設置された第２マイクロホンによって取得され、該比値の算出が所定時間間隔で繰り返され、今回算出された該比値と、前回算出された該比値との差の大きさに基づき、ダブルトーク状態が発生しているか否かを判断する。 Further, in order to solve the above problems, another echo canceller according to the present invention, the echo signals received signal is supplied to a speaker is produced by sneaking from the speaker to the first microphone, the first microphone a echo canceller for removing from the output signal, and the coefficient updatable FIR filter for generating an echo replica signal based on the received signal, the pseudo echo signal which the FIR filter is generated from the output signal of the first microphone A subtractor that generates an error signal by subtraction, and a coefficient updating unit that updates the coefficient of the FIR filter by an adaptive algorithm so that the error signal is minimized based on the received signal and the error signal, it is determined whether double talk state has occurred based on the ratio value between the first value and the second value, said Value is a value determined based on the power value of the pseudo echo signal of the FIR filter, the second value, the power value of the error signal, and the power value of the ambient noise signal around the first microphone value der determined based on is, the ambient noise signal is acquired by the second microphone that is placed in the vicinity of the first microphone, the calculation of the ratio value is repeated at predetermined time intervals, the calculated time Based on the difference between the ratio value and the previously calculated ratio value, it is determined whether or not a double talk state has occurred.

また、上記エコーキャンセラにおいて、該比値が、該第１値に対する該第２値の比率であり、今回算出された該比率と、前回算出された該比率との差の大きさが所定閾値以上であるとき、ダブルトーク状態が発生していると判断するようにしてもよい。 In the echo canceller, the ratio value is a ratio of the second value to the first value, and a difference between the ratio calculated this time and the ratio calculated last time is equal to or greater than a predetermined threshold value. In such a case, it may be determined that a double talk state has occurred.

本願のエコーキャンセラによれば、ダブルトークとエコー経路変動が正確に識別され、安定した動作が保証される。 According to the echo canceller of the present application, double talk and echo path fluctuation are accurately identified, and stable operation is guaranteed.

以下、本願発明の一実施形態たるエコーキャンセラを図面を参照しつつ説明する。
（１．はじめに）
本願発明の実施形態では適応アルゴリズムとしてブロック実行型学習同定法を用い、そのブロック長を遠端話者の音声パワーで制御すると共に、ステップサイズを残差信号のパワーで制御する方法を行う。この併用制御法によれば、推定誤差が−２０ｄＢ以下となるような小さなエコー経路変動ではダブルトークとの識別が不要となるので、推定誤差が−２０ｄＢを超える大きなエコー経路変動とダブルトークとを識別する方法を提案する。
（２．適応アルゴリズムと制御法）
［２．１ブロック実行型学習同定法］
音響エコーキャンセラの基本となる構造を図１Ａ、図１Ｂに示す。図１Ａは、図１Ｂにおけるスピーカ201からマイクロホン200までの経路を伝達関数ｈを用いて示したものであり、図１Ａと図１Ｂの構造は等価である。さらに、図１Ｃは、図１Ｂと等価の構造を示す図であるが、図１Ｂにおける適応ＦＩＲフィルタ212の構造がより詳細に示されている。これらの図において、エコーキャンセラは、通話装置で適用されている。 Hereinafter, an echo canceller according to an embodiment of the present invention will be described with reference to the drawings.
(1. Introduction)
In the embodiment of the present invention, a block execution learning identification method is used as an adaptive algorithm, and the block length is controlled by the voice power of the far-end speaker and the step size is controlled by the power of the residual signal. According to this combined control method, since it is not necessary to distinguish from double talk for small echo path fluctuations where the estimation error is -20 dB or less, large echo path fluctuations and double talk with an estimation error exceeding -20 dB are eliminated. A method of identification is proposed.
(2. Adaptive algorithm and control method)
[2.1 Block execution learning identification method]
The basic structure of the acoustic echo canceller is shown in FIGS. 1A and 1B. FIG. 1A shows the path from the speaker 201 to the microphone 200 in FIG. 1B using the transfer function h, and the structures of FIG. 1A and FIG. 1B are equivalent. Further, FIG. 1C is a diagram showing a structure equivalent to FIG. 1B, but the structure of the adaptive FIR filter 212 in FIG. 1B is shown in more detail. In these figures, the echo canceller is applied in a telephone device.

図１Ｃを参照して説明すると、通話装置はマイクロホン200とスピーカ201とを有している。エコーキャンセラは、主に、係数更新可能なＦＩＲフィルタ212aと、係数更新部212bと、減算器211とによって構成される。なお、ＦＩＲフィルタ212aと係数更新部212bとによって、適応ＦＩＲフィルタ212が構成される。 Referring to FIG. 1C, the call device has a microphone 200 and a speaker 201. The echo canceller mainly includes an FIR filter 212a capable of updating coefficients, a coefficient updating unit 212b, and a subtractor 211. The FIR filter 212a and the coefficient updating unit 212b constitute an adaptive FIR filter 212.

通話装置が受信した信号ｘ_j(受信信号)は、スピーカ201に供給される。スピーカ201から放射された音は、マイクロホン200へ回り込む。マイクロホン200の出力信号ｙ_jは、この回り込みによって生成されるエコー信号ｇ_jを含んでいる。なおｎ_j は、マイクロホン200に入力される近端話者の音声と周囲騒音である。 A signal x _j (received signal) received by the call device is supplied to the speaker 201. Sound radiated from the speaker 201 goes around the microphone 200. The output signal y _j of the microphone 200 includes an echo signal g _j generated by this wraparound. Note that n _j is the near-end speaker's voice and ambient noise input to the microphone 200.

信号ｘ_jは、ＦＩＲフィルタ212aにも入力される。ＦＩＲフィルタ212aは、Ｉ個の係数タップを備えており、各係数タップには係数として、Ｈ(１)，Ｈ(２)，Ｈ(３)，・・・，Ｈ(Ｉ)が設定されている。ＦＩＲフィルタ212aからは、出力信号（疑似エコー信号）として信号Ｇ_jが出力される。 The signal x _j is also input to the FIR filter 212a. The FIR filter 212a includes I coefficient taps, and H (1), H (2), H (3),..., H (I) are set as coefficients for each coefficient tap. Yes. The FIR filter 212a outputs a signal G _j as an output signal (pseudo echo signal).

減算器211には、信号ｙ_jと信号Ｇ_jとが入力される。減算器211は、信号ｙ_jから信号Ｇ_jを差し引く。減算結果としての信号（誤差信号）ｅ_jは送信信号として通話装置から送出される。 The subtractor 211 receives the signal y _j and the signal G _j . The subtractor 211 subtracts the signal G _j from the signal y _j . Subtraction result as a signal (error signal) e _j is sent from the communication device as a transmission signal.

係数更新部212bには、信号ｅ_jが誤差信号として入力される。係数更新部212bには、信号ｘ_jも入力される。係数更新部212bにおいて、適応アルゴリズム（係数更新アルゴリズム）が作動している。係数更新部212bの適応アルゴリズム（係数更新アルゴリズム）は、信号ｘ_jと信号ｅ_jとに基づいて、信号ｅ_jが最小となるようにＦＩＲフィルタの係数タップの係数Ｈ(１)，Ｈ(２)，Ｈ(３)，・・・，Ｈ(Ｉ)を更新する。 The coefficient updating unit 212b, the signal e _j is inputted as an error signal. A signal x _j is also input to the coefficient updating unit 212b. In the coefficient update unit 212b, an adaptive algorithm (coefficient update algorithm) is operating. Adaptive algorithm of the coefficient update unit 212b (the coefficient update algorithm), based on the signal x _j and the signal e _j, coefficients of the coefficient taps of the FIR filter so that the signal e _j is minimized H (1), H (2 ), H (3),..., H (I) are updated.

このようなシステムにより、発生するエコーｇ_j は、適応ＦＩＲフィルタ212で合成された擬似エコーＧ_j によって相殺され、その残差信号 By such a system, the generated echo g _j is canceled by the pseudo echo G _j synthesized by the adaptive FIR filter 212, and the residual signal is obtained.

が遠端側に伝送される。ここで、ｎ_j はマイクロホンに入力された外乱、すなわち近端話者の音声と周囲騒音である。 Is transmitted to the far end. Here, n _j is the disturbance input to the microphone, that is, the near-end speaker's voice and ambient noise.

本実施形態ではその適応フィルタの係数を更新する適応アルゴリズムとしてブロック実行型学習同定法 In this embodiment, a block execution type learning identification method is used as an adaptive algorithm for updating the coefficient of the adaptive filter.

を用いる。ここで、Ｊはブロック長、ρはステップサイズである。 Is used. Here, J is the block length and ρ is the step size.

［２．２ブロック長制御］
学習同定法では適応フィルタの係数の収束時において推定誤差の２乗平均は [2.2 Block length control]
In the learning identification method, the mean square of the estimation error when the adaptive filter coefficient converges is

と見積もられる。ここで、μ はステップサイズ、σ_N ² は外乱の平均パワー、σ_x ² は参照信号の平均パワーである。この推定誤差に関し、学習同定法とブロック実行型学習同定法との間には It is estimated. Here, μ is the step size, σ _N ² is the average power of the disturbance, and σ _x ² is the average power of the reference signal. Regarding this estimation error, between the learning identification method and the block execution type learning identification method,

の関係がある。例えば、学習同定法における μ＝１は ρ＝Ｊに相当する。この関係を式（３）に代入すれば、ブロック実行型学習同定法で得られる推定誤差の２乗平均は There is a relationship. For example, μ = 1 in the learning identification method corresponds to ρ = J. By substituting this relationship into Equation (3), the mean square of the estimation error obtained by the block execution learning identification method is

と見積もられる。この推定誤差の２乗平均とブロック長Ｊとの関係を単純化して制御を容易にするために、式（５）を２Ｊ≫ρ と近似できる場合について整理すると It is estimated. In order to simplify the relationship between the mean square of the estimation error and the block length J and facilitate control, formula (5) can be summarized for the case where 2J >> ρ can be approximated.

となる。この式から外乱が周囲騒音である場合の平均パワーＱ₀ に対して所要の推定誤差Ｃ₀ を得るのに必要な参照信号のパワーは It becomes. From this equation, the power of the reference signal required to obtain the required estimation error C ₀ with respect to the average power Q ₀ when the disturbance is ambient noise is

と求まる。さらに、この参照信号のパワーσ_x ²は加算ノルム It is obtained. Furthermore, the power σ _x ² of this reference signal is the addition norm

から推定でき、適応フィルタのタップ数をＩとすれば平均して If the number of taps of the adaptive filter is I, the average

の大きさをもつ。このことからブロック長Ｊを制御して Has the size of From this, control the block length J

となるときに係数を更新すれば参照信号のパワー変動に関係なく推定誤差がＣ₀ まで減少すると期待される。また、本実施形態ではダブルトークとエコー経路変動の識別を安定的に行うために、最短ブロック長を定め、Ｊ₀ とおく。 If the coefficient is updated at this time, the estimation error is expected to decrease to C ₀ regardless of the power fluctuation of the reference signal. Further, in order to perform the identification of double talk and echo path change stably in this embodiment, defines the shortest block length, put the J _0.

［２．３ステップサイズ制御］
ステップサイズ制御には外乱パワーの算定が必要である。音響エコーキャンセラにおいてその算定に利用できるのはマイクロホンの出力だけである。しかし、その出力にはエコーｇ_j が混じる。そこで本実施形態では外乱ｎ_j のパワーの算定にエコーｇ_j が擬似エコーＧ_j によって十分に相殺されているときの残差信号 [2.3 Step size control]
Step size control requires calculation of disturbance power. In the acoustic echo canceller, only the output of the microphone can be used for the calculation. However, the echo g _j is mixed in the output. Therefore, in the present embodiment, the residual signal when the echo g _j is sufficiently canceled by the pseudo echo G _{j in} the calculation of the power of the disturbance n _j.

が外乱ｎ_j に近似されることを用いて残差信号の２乗平均 Is approximated by the disturbance n _j , the mean square of the residual signal

を外乱パワーに近似させる。この外乱パワーの推定値Ｑｎと同時に計算される加算ノルムＰｎを用いて Is approximated to the disturbance power. Using the addition norm Pn calculated simultaneously with the estimated value Qn of the disturbance power

をステップサイズとしてブロック単位で制御すればダブルトークの区間においても適応フィルタの係数を更新しても推定誤差は所要値Ｃ₀ に維持される。ただし、Ｐｎのばらつきと音声の有色性を考慮して学習同定法の μ＝０.２５に相当する ρ₀ ＝１６をステップサイズ ρ_n の上限とする。
（３．ダブルトークとエコー経路変動の識別法）
［３．１識別パラメータ］
本実施形態では擬似エコーのパワー平均Ｇｎと残差信号のパワー平均Ｑｎを２次元に配置して識別する手法を提案する。 If the step size is controlled in units of blocks, the estimation error is maintained at the required value C ₀ even if the coefficient of the adaptive filter is updated in the double talk section. However, ρ ₀ = 16 corresponding to μ = 0.25 of the learning identification method is set as the upper limit of the step size ρ _n in consideration of the variation of Pn and the color of the speech.
(3. Discrimination method of double talk and echo path fluctuation)
[3.1 Identification parameters]
In the present embodiment, a method is proposed in which the power average Gn of the pseudo echo and the power average Qn of the residual signal are two-dimensionally arranged and identified.

例えば、式（１５）では、疑似エコー信号Ｇ_j のパワー値Ｇ_j ² が時間幅Ｊの時間ブロック（時間範囲）で累積され、かつ、Ｊで除算されることによって、Ｇｎが得られることが示されている。式（１６）では、同様にして誤差信号ｅ_j からＱｎが得られることが示されている。 For example, in equation (15), Gn can be obtained by accumulating power values G _j ² of the pseudo echo signal G _j in a time block (time range) having a time width J and dividing by J. It is shown. In equation (16), has been shown to Qn is obtained from the error signal e _j in the same manner.

ＧｎとＱｎの比は式（１５）,（１６）よりＧ_j とｅ_j の平均パワーの比に近似される。ここで、エコー経路変動の場合で近端話者の音声が入っていないときは式（１）においてｎ_j が無視できるのでＧｎとＱｎの比は推定誤差に近似される。今係数の更新はゆっくりとなされるのでＧｎとＱｎの比はほぼ一定となる。一方、ダブルトークの場合で近端話者の音声が入っているときは式（１）においてｎ_j の影響が大きくなるのでＧｎとＱｎの比は一定とならない。図２（ａ）は推定誤差が０ｄＢ，−１０ｄＢ，−２０ｄＢとなるエコー経路変動が発生したときの同比の分布、同図（ｂ）はエコーが十分に低減された状態でダブルトークが発生したときの同比の分布である。ここで、図２は ρ₀ ＝１６，Ｊ₀ ＝６４，Ｃ₀ ＝０.００１，Ｉ＝５１２で、遠端話者と周囲騒音のパワー比が３０ｄＢ、近端話者と周囲騒音のパワー比が３０ｄＢとしてシミュレーションを行っている。以下のシミュレーションにおいても同様の条件である。 The ratio of the Gn and Qn formula (15), which approximates the ratio of the average power of G _j and e _j from (16). Here, in the case of echo path fluctuation, when the near-end speaker's voice is not contained, n _j can be ignored in equation (1), so the ratio between Gn and Qn is approximated to the estimation error. Since the coefficient is now updated slowly, the ratio between Gn and Qn becomes almost constant. On the other hand, in the case of double talk, when the near-end speaker's voice is _input , the influence of n _j becomes large in equation (1), so the ratio between Gn and Qn is not constant. FIG. 2 (a) shows the distribution of the same ratio when an echo path variation with an estimation error of 0 dB, −10 dB, and −20 dB occurs, and FIG. 2 (b) shows the occurrence of double talk in a state where the echo is sufficiently reduced. It is the distribution of the same ratio. Here, in FIG. 2, ρ ₀ = 16, J ₀ = 64, C ₀ = 0.001, I = 512, the power ratio between the far-end speaker and the ambient noise is 30 dB, and the power of the near-end speaker and the ambient noise. The simulation is performed with a ratio of 30 dB. The same conditions apply in the following simulation.

図２を見ると明らかにエコー経路変動で直線的であり、ダブルトークで不規則な分布を与えている。すなわち、この分布の違いから隣接時間での傾きの変化が少ないときにエコー経路変動、大きく変化するときにダブルトークと判別することができると期待される。また、傾き（Ｇｎに対するＱｎの比率）は As can be seen from FIG. 2, the echo path variation is linear, and double talk gives an irregular distribution. That is, it is expected from this difference in distribution that it is possible to discriminate echo path fluctuations when there is little change in slope in the adjacent time, and double talks when there is a large change. The slope (ratio of Qn to Gn) is

と計算される。 Is calculated.

上式（式(１７)）において、分母のＧｎを第１値、分子のＱｎを第２値とすると、ｒ_n は第１値に対する第２値の比率と考えることができる。 In the above equation (equation (17)), the Gn denominator first value, the Qn molecules when the second value, r _n can be thought of as the ratio of the second value to the first value.

図３（ａ）は上からエコー経路変動に対して得られた傾きの時間変化と遠端話者の音声、同図（ｂ）は上からダブルトークに対して得られた傾きの時間変化と近端話者の音声（上段）、遠端話者の音声（下段）である。ただし、図では見やすくするために傾きの上限を３として表示している。これを見ると明らかにダブルトークのときは近端話者が無音声となる区間を除いて大きく変化していることが分かる。そして、エコー経路変動ではほぼ一定の傾きが続いていることが分かる。しかし、遠端話者が無音声となる区間では傾きが大きい。これは無音声の区間で周囲騒音の影響が強く出るためである。そこで、傾き（Ｇｎに対する（Ｑｎ−Ｑ₀ ）の比率）を FIG. 3 (a) shows the time variation of the slope obtained with respect to the echo path fluctuation from above and the voice of the far-end speaker, and FIG. 3 (b) shows the time change of the slope obtained with respect to double talk from above. The voice of the near-end speaker (upper) and the voice of the far-end speaker (lower). However, in the figure, the upper limit of inclination is displayed as 3 for easy viewing. As can be seen from the graph, in the case of double talk, there is a significant change except for the section where the near-end speaker is silent. It can be seen that the echo path variation continues with a substantially constant slope. However, the slope is large in the section where the far-end speaker is silent. This is because the influence of ambient noise is strong in the silent section. Therefore, the slope (ratio of (Qn−Q ₀ ) to Gn) is

と計算する。ここで、Ｑ₀ は、周囲騒音パワーである。 And calculate. Here, Q ₀ is the ambient noise power.

上式（式(１８)）において、分母項を第１値、分子項を第２値とし、Ｒ_n は第１値に対する第２値の比率と考えることができる。分子項は、Ｑｎから周囲騒音パワーを減算した減算結果値である。この周囲騒音の信号は、近端話者と遠端話者との両方が無音声となるときのマイクロホン200（図１Ｂ、図１Ｃ参照）からの出力信号から得ることができる。また、この周囲騒音の信号を、マイクロホン200の近傍に設置された他のマイクロホンの出力信号から得るようにしてもよい。 In the above formula (formula (18)), the denominator term is the first value, the numerator term is the second value, and R _n can be considered as the ratio of the second value to the first value. The numerator term is a subtraction result value obtained by subtracting the ambient noise power from Qn. This ambient noise signal can be obtained from an output signal from the microphone 200 (see FIGS. 1B and 1C) when both the near-end speaker and the far-end speaker are silent. Further, the ambient noise signal may be obtained from the output signal of another microphone installed in the vicinity of the microphone 200.

また、上式（式(１８)）におけるＱ₀ を、時間的に変化しない所定値とみなし、（つまりＱ₀ を定数とし）、該所定値（Ｑ₀ ）をＱｎから減算することによって、第２値としての分子項を得てもよい。マイクロホン200の周囲の騒音レベルが既知であり、かつ、その騒音レベルの変化が少ないような場合に、このような方法が有効である。このようにすると、周囲騒音の信号をマイクロホンから検出する必要がなくなるので、システムを簡略化できる。 Further, Q ₀ in the above equation (Equation (18)) is regarded as a predetermined value that does not change with time (that is, Q ₀ is set as a constant), and the predetermined value (Q ₀ ) is subtracted from Qn. A molecular term as a binary value may be obtained. Such a method is effective when the noise level around the microphone 200 is known and the change in the noise level is small. This eliminates the need to detect ambient noise signals from the microphone, thus simplifying the system.

式(１８)の計算の結果を図４に示す。図３（ａ）と比較してエコー経路変動において周囲騒音の影響が小さくなったことが分かる。 The result of calculation of equation (18) is shown in FIG. Compared to FIG. 3A, it can be seen that the influence of ambient noise is smaller in the echo path variation.

このことから、式（１７）のｒ_n よりも、式（１８）のＲｎを用いた方が、ダブルトークとエコー経路変動の識別をより正確に行うことができると期待することができる。以下ではこの傾きＲｎ（式（１８）のＲｎ）を用いてダブルトークとエコー経路変動の識別を行う例を示す。ここ（式（１８））で、絶対値をとっているのは傾きの値がマイナスにならないようにするためである。 Therefore, than r _n of formula (17), who was using Rn in the formula (18), it can be expected to be able to perform identification of double talk and echo path change more accurately. In the following, an example in which double talk and echo path variation are identified using this slope Rn (Rn in equation (18)) will be described. In this (Equation (18)), the absolute value is taken to prevent the slope value from becoming negative.

この傾きの変化の大きさからダブルトークとエコー経路変動の識別が可能となるのであれば、該識別を行ってその結果に応じて、適応ＦＩＲフィルタのステップサイズを変更させたり、フィルタ係数更新を停止させたり再開させたりするなど、適応ＦＩＲフィルタの動作を制御すればよいのである。より具体的に言えば、ダブルトーク状態が発生していると判断したときには、適応ＦＩＲフィルタのステップサイズを小さくしてフィルタ係数の更新量を少なくしたり、又は、フィルタ係数の更新を停止させるようにしてもよい。また、エコー経路変動状態が発生していると判断したときは、一旦停止させたフィルタ係数の更新を再開させたり、又は、フィルタ係数の更新量を多くしてもよい。 If it is possible to discriminate double talk and echo path variation from the magnitude of the change in the slope, the identification is performed and the step size of the adaptive FIR filter is changed or the filter coefficient is updated according to the result. It is only necessary to control the operation of the adaptive FIR filter, such as stopping or restarting. More specifically, when it is determined that the double talk state has occurred, the step size of the adaptive FIR filter is reduced to reduce the update amount of the filter coefficient, or the update of the filter coefficient is stopped. It may be. Further, when it is determined that the echo path variation state has occurred, the updating of the filter coefficient once stopped may be resumed, or the update amount of the filter coefficient may be increased.

この傾きの変化の大きさからダブルトークとエコー経路変動の識別の可能性を探るために、変化の大きさを計算した結果を図５に示す。（ａ）はエコー経路変動での傾きの変化の大きさと遠端話者の音声、（ｂ）はダブルトークでの傾きの変化の大きさと近端話者の音声を表している。この結果を見ると、エコー経路変動では遠端話者の音声が無音声となるときに傾きの変化が大きくなり、ダブルトークでは近端話者が無音声となるときに傾きの変化が小さくなることが分かる。すなわち、両者とも無音声となる区間で識別誤りが発生しやすいことが分かる。 FIG. 5 shows the result of calculating the magnitude of the change in order to investigate the possibility of discriminating double talk and echo path fluctuations from the magnitude of the change in slope. (A) shows the magnitude of the change in tilt due to the echo path variation and the voice of the far-end speaker, and (b) shows the magnitude of the change in tilt and the voice of the near-end talker during double talk. As can be seen from this result, the change in the slope is large when the far-end speaker's voice is silent in the echo path variation, and the slope change is small in the double-talk when the near-end speaker is silent. I understand that. That is, it can be seen that identification errors are likely to occur in a section where both are silent.

［３．２識別が不要となる区間］
この問題を解決するために、遠端側と近端側で同時に無音声となる場合の制御の状況を調べる。この場合はシングルトークとして最大のステップサイズ ρ₀ ＝１６が設定され、ブロック長制御によって推定誤差がＣ₀ に収束するように係数が更新される。この遠端話者が無音声の状態で近端話者が発声すると、参照信号のパワーの方がはるかに小さく、この場合はステップサイズ制御で十分であり、
１．遠端話者が無音声と判断されるときは識別が不要
となる。同様に、近端話者が無音声となる区間は遠端話者の発生の有無に関わらずシングルトークの状態であり、ステップサイズ制御で十分である。すなわち
２．近端話者が無音声と判断されるときは識別が不要
となる。 [3.2 Sections that do not require identification]
In order to solve this problem, the state of control when there is no sound at the far end side and near end side simultaneously is examined. In this case, the maximum step size ρ ₀ = 16 is set as single talk, and the coefficient is updated so that the estimation error converges to C ₀ by block length control. When the far-end speaker is silent and the near-end speaker utters, the power of the reference signal is much smaller, in which case step size control is sufficient,
1. When the far-end speaker is determined to be silent, identification is not necessary. Similarly, the section where the near-end speaker is silent is a single talk state regardless of whether or not the far-end speaker is generated, and step size control is sufficient. That is, 2. When the near-end speaker is determined to be silent, identification is not necessary.

この２つの判断はＰｎとＱｎを観測し、これらがＱ₀ 以上となることをもって判断することができる。ただし、観測誤差を考慮して These two judgments can be judged by observing Pn and Qn and when these are equal to or higher than Q ₀ . However, considering the observation error

が成り立つときは識別が不要となる。 When is true, identification is not necessary.

この識別が不要な区間を取り除くと図７のようになる。ただし、見やすいようにＧｎ，Ｑｎ共０.０１以下の部分だけ表示している。不要な区間を取り除く前である図６と比べると、ＧｎやＱｎが小さい値をとるときは識別不要として取り除かれていることが確認できる。また、識別が不要な区間を０として傾きの変化の大きさを調べると図８のようになる。
（４．シミュレーション）
シミュレーションによりダブルトークとエコー経路変動の識別が行われているか確認する。シミュレーションの条件は図２などと同じである。 If a section that does not require this identification is removed, the result is as shown in FIG. However, for the sake of easy understanding, only Gn and Qn are displayed for portions below 0.01. Compared with FIG. 6 before removing unnecessary sections, it can be confirmed that when Gn and Qn have small values, they are removed as unnecessary identification. Further, when the magnitude of the change in the inclination is examined with 0 as the section where identification is not required, it is as shown in FIG.
(4. Simulation)
Check if double talk and echo path variation are identified by simulation. The simulation conditions are the same as in FIG.

［４．１ダブルトークとエコー経路変動の識別結果］
識別の条件として傾きの変化の大きさが０.１未満ならばエコー経路変動、それ以上はダブルトークとした。つまり、０．１を閾値（所定閾値）としてダブルトーク状態が発生しているか否かの判断に使うのである。この条件は本実施形態で暫定的に定めたものであるので今後の検討が必要である。この条件で識別を行い、ダブルトークとしてステップサイズ制御を行うときは１を、識別不要は０を、エコー経路変動として ρ＝ρ₀ を与えるときは−１を出力させる。ここで、エコー経路変動のときは推定誤差を０ｄＢ、ダブルトークのときは推定誤差を−３０ｄＢとする。このときの識別結果を図９に示す。これを見ると同図（ａ）は識別不要な区間以外はすべてエコー経路変動と判断しているので識別は成功している。また、同図（ｂ）の近端話者が無音となる１０,０００サンプル付近を見ると、遠端話者が発声しているときにエコー経路変動、無音声のときに識別不要と正しく判断されているのが分かる。さらに、同図（ｂ）の９,０００サンプル付近に見られるようにダブルトークであるにもかかわらずエコー経路変動と判断されている。これは両話者の音声のパワーが大きく変化しない場合に傾きの変化が小さくなるためであると考えられる。この誤りを少なくするために、ＧｎとＱｎの変化の大きさが小さいとき、すなわち [4.1 Discrimination results of double talk and echo path fluctuation]
As an identification condition, if the magnitude of the change in inclination is less than 0.1, the echo path is changed. That is, 0.1 is used as a threshold value (predetermined threshold value) to determine whether or not a double talk state has occurred. Since this condition is provisionally determined in the present embodiment, further study is necessary. Identification is performed under these conditions, and 1 is output when step size control is performed as double talk, ₀ is output when identification is not required, and -1 is output when ρ = ρ ₀ is given as echo path variation. Here, the estimation error is set to 0 dB when the echo path is changed, and the estimation error is set to −30 dB when the echo is double talk. FIG. 9 shows the identification result at this time. As can be seen from FIG. 5A, all the sections other than the section where identification is not necessary are judged to be echo path fluctuations, so that the identification is successful. Also, looking at the vicinity of 10,000 samples where the near-end speaker is silent in Fig. 5 (b), it is correctly determined that the echo path changes when the far-end speaker is speaking, and that identification is unnecessary when there is no speech. You can see that. Further, as seen in the vicinity of 9,000 samples in FIG. 9B, it is determined that the echo path changes despite the double talk. This is considered to be because the change in inclination becomes small when the power of the voices of both speakers does not change greatly. To reduce this error, when the magnitude of changes in Gn and Qn is small, ie

ならば前回の処理を続けるという条件を加える。本実施形態では仮に α＝０.００１とおく。このとき、識別結果は図１０のようになる。明らかに、正しく判断されていることが確認できる。 Then, the condition that the previous process is continued is added. In the present embodiment, it is assumed that α = 0.001. At this time, the identification result is as shown in FIG. Obviously, it can be confirmed that the judgment is correct.

［４．２ダブルトークとエコー経路変動時の収束特性］
次に、上記の識別条件を用いて識別を行ったときの収束特性を確認する。ただし、ρ₀＝６.４とし、遠端話者として図１１、近端話者として図１２を用いた。また、安定化のために式（２）の第２項である [4.2 Convergence characteristics when double talk and echo path change]
Next, the convergence characteristics when the identification is performed using the above-described identification conditions are confirmed. However, ρ ₀ = 6.4 was used, and FIG. 11 was used as the far-end speaker and FIG. 12 was used as the near-end speaker. Moreover, it is the second term of the formula (2) for stabilization.

の上限を決める必要がある。本実施形態では、上限をＡ(ｍ)＝０.１／Ｉとおく。その収束特性を図１３に示す。これを見るとシングルトークのときは推定が進み、ダブルトークのときは推定が進んでいないことが確認できる。また、図１４は５０,０００サンプルで、図１５は１５０,０００サンプルでエコー経路である未知フィルタｈ_j の値を変化させた。すると、推定誤差の値が上がった。この原因を確認するために変化した地点を拡大した図１６と図１７を見る。これを見るとρ_n の値が変化させた地点では最大の ρ₀＝６.４となっているのが分かる。ダブルトークのときはρ_n の値は小さくなるので、このときはエコー経路変動だと確認できる。 It is necessary to decide the upper limit. In the present embodiment, the upper limit is set to A (m) = 0.1 / I. The convergence characteristics are shown in FIG. From this, it can be confirmed that the estimation progresses in the case of single talk and the estimation does not advance in the case of double talk. Further, FIG. 14 shows 50,000 samples, and FIG. 15 shows that the value of the unknown filter h _j that is an echo path is changed with 150,000 samples. Then, the value of the estimation error increased. FIG. 16 and FIG. 17 are enlarged in order to confirm the cause. From this, it can be seen that at the point _where the value of ρ _n is changed, the maximum value is ρ ₀ = 6.4. Since the value of ρ _n is small in the case of double talk, it can be confirmed that this is an echo path fluctuation.

［４．３むすび］
以上のとおり、シミュレーションを行い識別が有効に動作しているかを確認した。この結果から識別が有効に行われていることは確認できた。また、今後の課題としてはＡ(ｍ)の上限について検討し、そのほかのパラメータについても検討することが挙げられる。
（５．システムの概略構成）
図１８は、通話装置に適用された、この実施形態に係る本発明（エコーキャンセラ）の概略構成および各機能を示すブロック図である。 [4.3 Conclusion]
As described above, a simulation was performed to confirm whether identification was operating effectively. From this result, it was confirmed that the identification was performed effectively. Further, as a future problem, the upper limit of A (m) will be examined and other parameters will be examined.
(5. System outline)
FIG. 18 is a block diagram showing a schematic configuration and each function of the present invention (echo canceller) according to this embodiment applied to a communication device.

図１８を参照すると、通話装置はマイクロホン200とスピーカ201とを有している。エコーキャンセラは適応ＦＩＲフィルタ212を有する。適応ＦＩＲフィルタ212の構成は、図１Ｃに示した適応ＦＩＲフィルタ212と同様である。 Referring to FIG. 18, the call device has a microphone 200 and a speaker 201. The echo canceller has an adaptive FIR filter 212. The configuration of the adaptive FIR filter 212 is the same as that of the adaptive FIR filter 212 shown in FIG. 1C.

信号ｘ_jは、適応ＦＩＲフィルタ212にも入力され、適応ＦＩＲフィルタ212からは、出力信号（疑似エコー信号）として信号Ｇ_jが出力される。 The signal x _j is also input to the adaptive FIR filter 212, and the adaptive FIR filter 212 outputs a signal G _j as an output signal (pseudo echo signal).

減算器211には、信号ｙ_jと信号Ｇ_jとが入力される。減算器211は、信号ｙ_jから信号Ｇ_jを差し引く。減算結果としての信号ｅ_jは送信信号として通話装置から送出される。 The subtractor 211 receives the signal y _j and the signal G _j . The subtractor 211 subtracts the signal G _j from the signal y _j . Signal e _j as the subtraction result is sent from the communication device as a transmission signal.

計算部213には、適応ＦＩＲフィルタ212が生成した疑似エコー信号Ｇ_j と、誤差信号ｅ_j とが入力され、これらに基づいて式（１７）の傾き（比率）ｒ_n 、または、式（１８）の傾き（比率）Ｒｎが計算部213において算出される。そして、その計算値（ｒ_n 又はＲｎ）が識別部214に送出され、ここ（識別部214）においてダブルトーク状態が発生しているのか、エコー経路変動状態であるのかが識別（判断）される。識別部214はこの識別（判断）の結果を識別信号として出力する。この識別信号に基づいて制御部（図示せず）が、適応ＦＩＲフィルタ212のステップサイズを変更させたり、フィルタ係数更新を停止させたり再開させたりするなど、適応ＦＩＲフィルタ212の動作を制御する。
（６．まとめ）
以上、本発明によればダブルトークとエコー経路変動が正確に識別され、安定した動作がシステムに保証される。ここでの本発明の説明は、適応アルゴリズムをブロック実行型の学習同定法として行った。しかし、本発明はブロック実行型の学習同定法以外の適応アルゴリズムにも適用可能であることに注意が必要である。 The calculation unit 213, a pseudo echo signal G _j of adaptive FIR filter 212 has generated, is input and the error signal e _j, or slope (ratio) r _n, of the formula (17) Based on these equations (18 ) Is calculated by the calculation unit 213. Then, the calculated value (r _n or Rn) is sent to the identification unit 214, where it is identified (determined) whether a double talk state has occurred or an echo path fluctuation state has occurred. . The identification unit 214 outputs the identification (judgment) result as an identification signal. Based on the identification signal, a control unit (not shown) controls the operation of the adaptive FIR filter 212, such as changing the step size of the adaptive FIR filter 212, stopping or restarting the filter coefficient update.
(6. Summary)
As described above, according to the present invention, double talk and echo path fluctuation are accurately identified, and stable operation is guaranteed for the system. In the description of the present invention, the adaptive algorithm is used as a block execution type learning identification method. However, it should be noted that the present invention can be applied to adaptive algorithms other than the block execution type learning identification method.

例えば、ブロック実行型ではない学習同定法を用いることもできるし、学習同定法以外の適応アルゴリズムを用いることもできる。さらに、ブロック実行型である、学習同定法以外の適応アルゴリズムを用いることもできる。ブロック実行型の適応アルゴリズムの時間ブロックを短縮してゆくと、最終的には、ブロック実行型ではない適応アルゴリズムと同一となる。よって、上記した本発明の効果と同様の効果が、ブロック実行型ではない適応アルゴリズムにおいても生ずるであろうことが期待できる。 For example, a learning identification method that is not a block execution type can be used, or an adaptive algorithm other than the learning identification method can be used. Furthermore, an adaptive algorithm other than the learning identification method, which is a block execution type, can also be used. If the time block of the block execution type adaptive algorithm is shortened, the result is finally the same as that of the non-block execution type adaptive algorithm. Therefore, it can be expected that the same effect as the effect of the present invention described above will also occur in an adaptive algorithm that is not a block execution type.

本願のエコーキャンセラによれば、ダブルトークとエコー経路変動とをより正確に識別することができるので、例えば、ハンズフリー通話装置のような電気音響の分野においても利用できる。 According to the echo canceller of the present application, double talk and echo path fluctuation can be more accurately identified, and can be used in the field of electroacoustics such as hands-free communication devices.

音響エコーキャンセラの原理図である。It is a principle diagram of an acoustic echo canceller. 音響エコーキャンセラの原理図である。It is a principle diagram of an acoustic echo canceller. 音響エコーキャンセラの原理図である。It is a principle diagram of an acoustic echo canceller. 識別パラメータを示す図である。It is a figure which shows an identification parameter. 傾きＲ₀ を示す図である。It is a figure which shows inclination _R0 . エコー経路変動のときの傾きＲｎを示す図である。It is a figure which shows inclination Rn at the time of echo path | route fluctuation | variation. 傾きの変化を示す図である。It is a figure which shows the change of inclination. 識別パラメータを示す図である。It is a figure which shows an identification parameter. 識別不要部を除いたときの識別パラメータを示す図である。It is a figure which shows an identification parameter when an identification unnecessary part is removed. 識別の不要部分を除いた傾きの変化を示す図である。It is a figure which shows the change of the inclination except the unnecessary part of identification. 識別結果を示す図である。It is a figure which shows an identification result. ダブルトークの識別結果を示す図である。It is a figure which shows the identification result of a double talk. 遠端話者の音声を示す図である。It is a figure which shows the voice of a far-end speaker. 近端話者の音声を示す図である。It is a figure which shows the voice of a near-end speaker. 収束特性を示す図である。It is a figure which shows a convergence characteristic. 収束特性を示す図である。It is a figure which shows a convergence characteristic. 収束特性を示す図である。It is a figure which shows a convergence characteristic. 収束特性等の拡大図である。It is an enlarged view of convergence characteristics. 収束特性等の拡大図である。It is an enlarged view of convergence characteristics. 通話装置に適用されたエコーキャンセラの概略構成および各機能を示すブロック図である。It is a block diagram which shows schematic structure and each function of the echo canceller applied to the telephone apparatus.

Explanation of symbols

200 マイクロホン
201 スピーカ
211 減算部
212 適応ＦＩＲフィルタ
213 計算部
214 識別部
200 microphone
201 Speaker
211 Subtraction part
212 Adaptive FIR filter
213 Calculator
214 Identification part

Claims

An echo canceller that removes an echo signal generated by a reception signal supplied to a speaker from the speaker to the microphone, from the output signal of the microphone,
A coefficient updatable FIR filter that generates a pseudo echo signal based on the received signal;
A subtractor that generates an error signal by subtracting a pseudo echo signal generated by the FIR filter from an output signal of the microphone;
A coefficient updating unit for updating the coefficient of the FIR filter by an adaptive algorithm so that the error signal is minimized based on the received signal and the error signal;
Determining whether or not a double talk state has occurred based on a ratio value between the first value and the second value;
The first value is a value determined based on a power value of a pseudo echo signal of the FIR filter;
Second value, Ri value der determined based on the power value of the error signal,
The calculation of the ratio value is repeated at predetermined time intervals,
An echo canceller that determines whether or not a double talk state has occurred based on a magnitude of a difference between the ratio value calculated this time and the ratio value calculated last time .

The echo signals received signal is supplied to a speaker is produced by sneaking from the speaker to the first microphone, a echo canceller for removing from the output signal of the first microphone,
A coefficient updatable FIR filter that generates a pseudo echo signal based on the received signal;
A subtracter for generating an error signal by subtracting the pseudo echo signal which the FIR filter is generated from the output signal of the first microphone,
A coefficient updating unit for updating the coefficient of the FIR filter by an adaptive algorithm so that the error signal is minimized based on the received signal and the error signal;
Determining whether or not a double talk state has occurred based on a ratio value between the first value and the second value;
The first value is a value determined based on a power value of a pseudo echo signal of the FIR filter;
Second value, Ri value der determined based the power value of the error signal, to the power value of the ambient noise signal around the first microphone,
The ambient noise signal is acquired by a second microphone installed in the vicinity of the first microphone;
The calculation of the ratio value is repeated at predetermined time intervals,
An echo canceller that determines whether or not a double talk state has occurred based on a magnitude of a difference between the ratio value calculated this time and the ratio value calculated last time .

The ratio value is a ratio of the second value to the first value;
The echo canceller according to claim 1 , wherein when the magnitude of the difference between the ratio calculated this time and the ratio calculated last time is equal to or greater than a predetermined threshold, it is determined that a double talk state has occurred.