JP2004274412A

JP2004274412A - Echo canceler, canceling method, and canceling program

Info

Publication number: JP2004274412A
Application number: JP2003062656A
Authority: JP
Inventors: Suehiro Shimauchi; 末廣島内; Yoichi Haneda; 陽一羽田; Akitoshi Kataoka; 章俊片岡
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2003-03-10
Filing date: 2003-03-10
Publication date: 2004-09-30
Anticipated expiration: 2023-03-10
Also published as: JP3920795B2

Abstract

<P>PROBLEM TO BE SOLVED: To quicken convergence of a filter coefficient being determined without using a conversion region echo simulation filter coefficient restricting means requiring a large operational amount including inverse linear transform, windowing, and linear transform processing. <P>SOLUTION: The echo canceler comprises a means for obtaining a reproduction signal sequence by storing sound reproduction signals for a predetermined time, a means for obtaining a sound collection signal sequence by storing sound collection signals for a predetermined time, a means for obtaining a reproduction signal conversion sequence by performing linear conversion on the reproduction signal sequence, a means for generating a simulation echo signal conversion sequence simulating the linear conversion sequence of an echo signal leaking from the sound signal reproducing means to the sound signal collecting means by receiving the reproduction signal conversion sequence, a means for receiving the simulation echo signal conversion sequence and the sound collection signal sequence and outputting the simulation difference of the simulation echo signal conversion sequence, and a means for generating an updated sequence of a conversion region echo simulation filter coefficient having any one or both of a means for providing nonlinear processing to a reproduction signal conversion sequence being fed to a conversion region echo simulation filter coefficient updating means and a simulation error nonlinear processing means for providing nonlinear processing to a simulation error. <P>COPYRIGHT: (C)2004,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、相手話者からの受話再生音声を送話音声と重畳して収音することによって生じる音響エコー（受話再生音声）を除去するために、模擬誤差信号に基づいて擬似反響路を適応的に推定し、擬似反響路に基づく擬似反響を収音信号から除去して送話音声とする反響消去装置、方法、及び反響消去プログラムに関する。
【０００２】
【従来の技術】
スピーカ１からマイクロホン２へ回り込む反響を消去する反響消去装置は、図１のように接続される。
従来、装置内では、スピーカとマイクロホン間の反響路のインパルス応答ｈを推定し、推定したインパルス応答ｈ’と再生信号ｘの畳み込み演算ｈ’＊ｘにより模擬反響信号ｙ’を生成し、実際の反響信号ｙから減算することで、反響消去信号ｅを得る。しかし、推定したインパルス応答と再生信号の畳み込み演算には、多くの演算量を必要とし、実装上の問題となっている。近年、この問題を解決するために、再生信号や反響信号を一旦線形変換し、反響路のインパルス応答の線形変換に対応したパラメータを推定し、畳み込みの代わりに乗算処理を用いたり（非特許文献１参照）、あるいはより小さい畳み込み演算に分割したりする（特許文献１参照）などをして、演算量を削減する方法が提案されている。線形変換の例としては、（高速）離散フーリエ変換、（高速）離散コサイン変換、（高速）ハートレー変換などがある。
図１で音響信号再生手段としてスピーカを挙げているが、音響信号再生手段としては、再生前段の増幅器やバッファも含む。また、同様に音響信号収音手段は、マイクロホンの後段の増幅器やバッファも含む。
【０００３】
【非特許文献１】
ＳｉｍｏｎＨａｙｋｉｎ著「適応フィルタ理論科学技術出版、２００１年１月１０日ｐ．５００−５４１
【特許文献１】
特開平９−１１６４７２号公報（図６）
【０００４】
【発明が解決しようとする課題】
従来の技術で挙げた非特許文献１に記載されている２つの構成を図２、図３に示す。図２の構成は拘束付の構成と呼ばれ、図３の構成は拘束の無い構成と呼ばれている。
図２、図３の構成は、ともに以下の手段を含んでいる。つまり、音響信号再生手段１へと出力される再生信号を入力し一定時間蓄積し再生信号列を得る再生信号入力手段１０１と、前記音響信号再生手段１と同一空間に存在する音響信号収音手段２から収音信号を入力し一定時間蓄積し収音信号列を得る収音信号入力手段１０２と、前記再生信号列を線形変換し再生信号変換列を得る再生信号変換手段１０３と、前記再生信号変換列を入力し前記音響信号再生手段１から前記音響信号収音手段２へと回り込む反響信号の線形変換列を模擬する模擬反響信号変換列を生成する模擬反響生成手段（変換領域反響模擬フィルタ）１０４と、前記模擬反響信号変換列と前記収音信号列とを入力し模擬反響信号変換列の模擬誤差を出力する模擬誤差出力手段１０５と、前記再生信号変換列と前記模擬誤差を入力し前記模擬反響生成手段（変換領域反響模擬フィルタ）１０４の変換領域反響模擬フィルタ係数の更新列（変換領域反響模擬フィルタ係数の配列の線形（周波数）変換に相当する配列）を生成する変換領域反響模擬フィルタ係数更新手段１０６を含む。
模擬反響生成手段１０４は、変換領域反響模擬フィルタ係数更新手段１０６から得られる変換領域反響模擬フィルタ係数更新列をもとに、変換領域反響模擬フィルタ係数を書き換える手段を含む。変換領域反響模擬フィルタ係数更新手段１０６に入力されるために模擬誤差出力手段１０５から出力される模擬誤差は、線形変換領域で評価できる形式となっている。
【０００５】
図２（従来例１）と図３（従来例２）の構成の違いは、従来例１の構成が変換領域反響模擬フィルタ係数拘束手段１０７を有する点である。
変換領域反響模擬フィルタ係数拘束手段１０７の詳細は図４に示される。
非特許文献１では、図４の逆線形変換手段７１を（高速）逆離散フーリエ変換で実現し、線形変換手段７３を（高速）離散フーリエ変換で実現している。また、窓掛け手段７２では、変換領域反響模擬フィルタ係数更新列の逆線形変換列の半分を強制的に零としている。この拘束付けの根拠は、求めるべき変換領域反響模擬フィルタ係数が、真の反響路の長さＬのインパルス応答と同一の長さＬの零信号列を結合した長さ２Ｌの信号列の線形変換に相当することを、更新時に反映させることにある。非特許文献１で、変換領域反響模擬フィルタ係数拘束手段の無い従来例２の構成では、変換領域反響模擬フィルタ係数の求めるべき値への収束が遅くなることが指摘されている。
しかしながら、演算量の観点からは、線形逆変換、窓掛け、線形変換の一連の処理は、望ましいものではない。そこで、本発明は、図４に示すような演算量の多い変換領域反響模擬フィルタ係数拘束手段を用いずに、求めるべき変換領域反響模擬フィルタ係数への収束を速めることを課題とする。
【０００６】
【課題を解決するための手段】
非特許文献１で図２，図３の反響消去装置に適用されている高速ＬＭＳ（ＬｅａｓｔｍｅａｎＳｑｕａｒｅ）アルゴリズムでは、参照信号である再生信号と、真の反響に対する模擬反響の模擬誤差が、無相関となるように変換領域反響模擬フィルタ係数を更新する。
ここで、変換領域反響模擬フィルタ係数拘束手段が無い場合、変換領域反響模擬フィルタ係数の逆線形変換の信号列の半分が零になっている保障はない。この非零の影響は、模擬誤差に巡回畳み込みによる折り返し成分として現れるが、この折り返し成分と、再生信号とは、通常無相関とみなせる。従って、無相関化、すなわち２次の統計量の評価により、変換領域反響模擬フィルタ係数を決定する高速ＬＭＳアルゴリズムでは、積極的に折り返しの影響を除去することができない。すなわち、非拘束なフィルタ更新による反響消去では、周波数領域変換において巡回畳み込み処理による折り返し成分が反響路成分の推定に影響し収束性能の低下を招く。
【０００７】
従来例の高速ＬＭＳは、受話信号の変換係数の複素共役と、誤差信号の変換係数との積を用いて上記変換領域反響模擬フィルタ係数を更新するものであるのに対して、本発明では少なくとも、受話信号か、誤差信号の一方に、線形変換の前後いずれかで、非線形処理を施し、上記の更新用の積を求める。また、少なくとも一方に非線形処理を施した受話信号と、誤差信号との、変換（周波数）領域での積は、受話信号と誤差信号の高次の相関（高速ＬＭＳでは、２次（通常の相関））量を含む。つまり本発明は、変換領域反響模擬フィルタ係数の更新において非線形処理を施した受話信号と誤差信号との変換領域での積を用いるアルゴリズムであり、高速ＬＭＳの変形により可能になる。
【０００８】
本発明では、模擬誤差に含まれる折り返し成分を、参照信号（再生信号）と独立化、すなわち高次の統計量の評価により、変換領域反響模擬フィルタ係数を決定することにより、変換領域反響模擬フィルタ係数拘束手段を用いずに、求めるべき変換領域反響模擬フィルタ係数への収束を速める。具体的には、図５に示すように、図３（従来例２）の構成に、変換領域反響模擬フィルタ係数更新手段１０６へ入力する前記再生信号変換列に非線形処理を与える再生信号非線形処理手段１０８と、前記変換領域反響模擬フィルタ係数更新手段１０６へ入力する前記模擬誤差に非線形処理を与える模擬誤差非線形処理手段１０９のうち、少なくともどちらかを含む構成である。再生信号非線形処理手段１０８、または、模擬誤差非線形処理手段１０９により、処理された信号列に対して、従来技術と同様にして、変換領域反響模擬フィルタ係数更新手段１０６により、変換領域反響模擬フィルタ係数の更新分に相当する更新列を計算する。但し、非特許文献１にも記載のある更新列の相対的大きさを調節するステップサイズパラメータの値は、必ずしも同じではない。
【０００９】
本発明は、高次の統計量を用い、模擬誤差に含まれる折り返し成分を積極的に評価し、変換領域反響模擬フィルタ係数の更新列を決定できる。
【００１０】
【発明の実施の形態】
（実施例１）
本発明の基本構成は、図５の構成となるが、容易に類推できる代替構成として、再生信号非線形処理手段１０８を、再生信号変換手段１０３よりも前段に配置する構成、すなわち再生信号列を線形変換する前に、非線形処理を施す構成がある。また、模擬誤差出力手段１０５において、模擬誤差を線形変換前の信号列に対して求める手段と、この線形変換前の模擬誤差列を線形変換する手段と、線形変換後の模擬誤差列を出力する手段を有する場合、模擬誤差非線形処理手段１０９は、模擬誤差出力手段１０５内の線形変換前の模擬誤差列を線形変換する手段の前段に配置する構成、すなわち、線形変換前の模擬誤差列に対して、非線形処理を施す構成も採り得る。
再生信号変換手段１０３の線形変換としてＤＦＴのみならずＤＣＴ、ハートレイ変換が用いられる。なお、変換後の周波数領域係数で複素数となった場合での非線形処理の一例として、振幅を正規化して、位相項のみ取り出し、これにより信号の大きさによらず、信号の位相レベルで独立性や、相関を（正規化して）評価することができる。
【００１１】
図５における再生信号非線形処理手段１０８、または模擬誤差非線形処理手段１０９における、非線形処理は、例えば、サイン関数や、双曲線正接関数が利用できる。
サイン関数は、ある入力要素ｚに対して、
【数１】

で与えられる。また、双曲線正接関数は、ｚが複素数のとき、位相歪みが生じるため、サイン関数と組み合わせて
ｓｇｎ（ｚ）・ｔａｎｈ（｜ｚ｜）
を用いることもできる。また、ｚのある範囲の値に対して任意のａ_ｎ（ｎ＝０，・・・，∞）によって、
【数２】

と表せる任意の関数を用いることもできる。
再生信号非線形処理手段１０８と模擬誤差非線形処理手段１０９とは、同じ非線形関数により処理する場合と、異なる非線形関数により処理する場合とがある。
また、図５における再生信号非線形処理手段１０８、または模擬誤差非線形処理手段１０９において、入力される前記再生信号変換列あるいは前記模擬誤差の列の各要素ごとに、上に示した非線形関数から異なる関数を選択し、各要素ごとに異なる非線形処理を施す構成も採り得る。さらに、前記再生信号変換列あるいは前記模擬誤差の列の各要素の一部に対して、非線形処理を施さない構成も採り得る。例えば、線形変換が（高速）離散フーリエ変換によって実現される場合には、音声成分の少ない高い周波数に対応する線形変換列の要素に対する非線形処理を省略することなどが考えられる。
【００１２】
（実施例２）
図６は、図５の再生信号非線形処理手段１０８と模擬誤差非線形処理手段１０９とにおいて、同じ非線形関数ｆ（ｚ）を用い、かつｆ（ｚ）がｆ（ａ）ｆ（ｂ）＝ｆ（ａｂ）を満足する場合、例えばサイン関数などを用いた場合の図５の代替実施例である。図５の再生信号非線形処理手段１０８と模擬誤差非線形処理手段１０９は、変換領域反響模擬フィルタ係数の更新列に対してまとめて非線形処理、すなわち、非線形処理後の受話信号（再生信号）と、非線形処理後の誤差信号との積が、処理前の受話信号と誤差信号の積に非線形処理したものと等しい場合は、後段にまとめて非線形処理を施す、例えば受話、誤差信号共にサイン関数を施す場合等、を行う更新列非線形処理手段１１０により代替されている。
【００１３】
（実施例３）
図７は、図５の構成に対し、上記収音信号の短時間平均レベルの上記再生信号の短時間平均レベルに対する比として音響結合量を求める音響結合量測定手段１１１を追加した構成である。この音響結合量測定手段１１１により測定された音響結合量の大小に対応して、前記変換領域反響模擬フィルタ係数更新手段１０６では、前記変換領域反響模擬フィルタ係数の更新量の大小を調節するステップサイズパラメータを決定する。これは、例えば、再生信号非線形処理手段１０８と模擬誤差非線形処理手段１０９とにおいて、ともにサイン関数を採用した場合などには、変換領域反響模擬フィルタ係数の更新列の大きさが、再生信号や模擬誤差の大きさに依存しなくなる。
【００１４】
（高速）ＬＭＳ等では、誤差が小さくなるほど、適応フィルタの更新量（フィルタを修正するベクトルの大きさ）は小さくなり、収束が進むにつれて、より小差な修正が可能となる。その反面、本発明のように、サイン関数（出力が±１）を位相に適用した場合、適応フィルタの更新量は、誤差の大小に関わらず一定で、ある誤差範囲より精度良くは収束しない。音響結合利得が１のときは、ステップサイズを０．０１としておけば、＋０．０１，−０．０１で更新することになり、誤差１％の範囲で収束させることができる。逆に利得が１００００の場合に、ステップサイズを０．０１としたら、収束速度が非常に遅くなるので、この場合はステップサイズが１００でも構わない。このように、利得に応じてステップサイズを調整する。その例として測定された音響結合量に推定誤差許容率を掛け合わせる等の方法により、上記の例では、音響結合１００００×誤差許容率０．０１（１％）＝ステップサイズ１００とする。
【００１５】
このように図７の構成では、音響結合量が大きい場合は、相対的に大きなステップサイズパラメータを与え、音響結合量が小さい場合には、相対的に小さなステップサイズパラメータを与える。なお、音響結合量の測定は、図７においては、線形変換前の再生信号、収音信号を用いて実施しているが、線形変換後の再生信号、収音信号を用いた実施も容易に類推できる。このとき、例えば、（高速）離散フーリエ変換により線形変換を行った場合、各周波数成分ごとに音響結合量を測定し、これら各測定値に基づき、各周波数要素毎にステップサイズパラメータを与えることもできる。また、音響結合量の情報が測定により得られない場合であっても、高い周波数成分のステップサイズパラメータを小さくするなど、線形変換の各要素毎に異なるステップサイズパラメータ（事前に固定パラメータとして設定しておいても、音響結合量が周波数毎に得られた場合には、その都度算出される）を与える構成も採り得る。ここで、測定された音響結合量をステップサイズパラメータに反映させること自体は、一般に図１の構成をとる反響消去装置において、反響路のインパルス応答の推定に有効な方法といえる。
また、前記模擬反響生成手段１０４に過去の前記再生信号変換列を記憶する手段と前記記憶された再生信号変換列に対応した変換領域反響模擬フィルタ係数を有する場合には、前記再生信号変換列の記憶時期に従い記憶時期の古い前記再生信号変換列に対応する前記変換領域反響模擬フィルタ係数ほど更新量を小さくするなどの調節ができるように変換領域反響模擬フィルタ係数更新手段１０６を図８のような構成により実施することが考えられる。
【００１６】
（変換領域反響模擬フィルタ係数更新手段（実施例１））
図８の構成では、１ステップ過去までの再生信号変換列を記憶できる構成となっているが、この構成は容易に複数ステップ過去の再生信号変換列を記憶した場合に拡張できる。図８において、現時刻での（非線形処理後の）再生信号変換列は信号列保持手段６１に保持され、信号列保持手段６２により保持されている（非線形処理後の）模擬誤差列と、信号列乗算手段６３において、列の要素毎に乗算され、変換領域反響模擬フィルタ係数更新列が得られる。この変換領域反響模擬フィルタ係数更新列は、利得調節手段６３１により適切なステップサイズパラメータを個々の更新列要素毎に乗じられ、再度、変換領域反響模擬フィルタ係数更新列として、更新列保持手段６５により保持され、模擬反響生成手段１０４に渡される。さらに１ステップ過去の（非線形処理後の）再生信号変換列は、信号列遅延器６７を通して、信号列保持手段６８に保持されており、信号列保持手段６２により保持されている（非線形処理後の）模擬誤差列と、信号列乗算手段６３において、列の要素毎に乗算され、１ステップ過去の（非線形処理後の）再生信号変換列に対応する変換領域反響模擬フィルタ係数更新列を得られる。この変換領域反響模擬フィルタ係数更新列は、利得調節手段６９１により利得調節手段６３１とは異なるステップサイズパラメータを個々の更新列要素毎に乗じられ、再度、変換領域反響模擬フィルタ係数更新列として、更新列保持手段６５により、先に得られた現時刻の（線形処理後の）再生信号変換列に対応する変換領域反響模擬フィルタ係数とは独立して保持され、模擬反響生成手段１０４に渡される。ここで、利得調節手段６９１のステップサイズパラメータは、実際の反響路とインパルス応答の指数減衰性を考慮に入れると、利得調節手段６３１のステップサイズパラメータよりも小さくするほうがよい。また、図７のように、音響結合量測定手段１１１からの情報が得られる場合は、利得調節手段６３１と利得調節手段６９１とにおいて、ステップサイズパラメータの相対関係（大小関係）は保持したまま、音響結合量評価手段６６により、各々のステップサイズパラメータをスケーリングさせることもできる。音響結合量測定手段１１１からの情報が得られない場合は、音響結合量評価手段６６は省略できる。ここで、図８の構成は、非線形処理手段を含まない図２や図３の構成などにおいても、反響路のインパルス応答の特性推定の高速化または高精度化に効果があるといえる。
【００１７】
（変換領域反響模擬フィルタ係数更新手段（実施例２））
さらに、前記模擬反響生成手段に過去の前記再生信号変換列を記憶する手段と前記記憶された再生信号変換列に対応した変換領域反響模擬フィルタ係数を有する場合において、前記再生信号変換列の記憶時期によって、異なる非線形処理を実現するために、再生信号非線形処理手段１０８と模擬誤差非線形処理手段１０９を有する図５の基本構成の代わりに、従来技術の図３の構成において、変換領域反響模擬フィルタ係数更新手段１０６の内部に、図９に示す非線形処理手段６０１、非線形処理手段６０２、非線形処理手段６０３、非線形処理手段６０４を配置した構成が考えられる。非線形処理手段６０１は、現時刻の再生信号変換列に非線形処理を実施し、非線形処理手段６０２は、１ステップ過去の再生信号変換列に非線形処理を実施し、非線形処理手段６０３は、現時刻の再生信号変換列に対応した模擬誤差の列に非線形処理を実施し、非線形処理手段６０４は、１ステップ過去の再生信号変換列に対応した模擬誤差の列に非線形処理を実施する。これら、４個の非線形処理手段は、異なる非線形関数により実現され得る。また、１部の非線形処理手段を省略した構成も採り得る。これにより、反響路のインパルス応答においてエネルギーの集中する前半と、あまりエネルギーの無い後半の推定精度、推定速度を個別に調節可能となる。また、図３の構成に対し、図７と同様に、音響結合量測定手段１１１を設ければ、図９内の音響結合量評価手段６６によりステップサイズパラメータの制御も可能となるが、音響結合量測定手段１１１を設けられない場合は、音響結合量評価手段６６は省略できる。図９の構成では、１ステップ過去までの再生信号変換列を記憶できる構成となっているが、この構成は容易に複数ステップ過去の再生信号変換列を記憶した場合に拡張できる。
【００１８】
本発明の反響消去装置は、ＣＰＵやメモリ等を有するコンピュータと、利用者端末と、ＣＤ−ＲＯＭ、磁気ディスク装置、半導体メモリ等の記録媒体とから構成することができる。
記録媒体に記録された反響除去プログラム、あるいは回線を介して伝送された反響除去プログラムは、コンピュータに読み取られ、コンピュータ上に前述した各構成要素を実現し、各処理を実行する。
【００１９】
【発明の効果】
本発明による反響消去装置は、線形変換領域で真の反響路のインパルス応答に対応する変換領域反響模擬フィルタ係数を、演算量の増大する変換領域反響模擬フィルタ係数更新の拘束付けを省略しながらも、高速に求めることができる。また、非線形関数の導入により、高次統計量を評価して、再生信号と模擬誤差の独立性に基づき、変換領域反響模擬フィルタ係数を更新しているため、マイクロホンに混入する雑音や音声信号などが、再生信号と独立と見なせる場合は、それらの影響も受けにくい。
【図面の簡単な説明】
【図１】反響消去装置の概要構成を示す図。
【図２】変換領域反響模擬フィルタ係数拘束手段を有する反響消去装置（従来例１）の構成を示す図。
【図３】変換領域反響模擬フィルタ係数拘束手段の無い反響消去装置（従来例２）の構成を示す図。
【図４】従来例１の変換領域反響模擬フィルタ係数拘束手段の構成を示す図。
【図５】本発明の反響消去装置（実施例１）の構成を示す図。
【図６】本発明の反響消去装置（実施例２）の構成を示す図。
【図７】本発明の反響消去装置（実施例３）の構成を示す図。
【図８】変換領域反響模擬フィルタ係数更新手段（実施例１）の構成を示す図。
【図９】変換領域反響模擬フィルタ係数更新手段（実施例２）の構成を示す図。
【符号の説明】
１００・・・反響消去装置、１０１・・・再生信号入力手段、１０２・・・収音信号入力手段、１０３・・・再生信号変換手段、１０４・・・模擬反響生成手段、１０５・・・模擬誤差出力手段、１０６・・・変換領域反響模擬フィルタ係数更新手段、１０８・・・再生信号非線形処理手段、１０９・・・模擬誤差非線形処理手段、１１０・・・更新列非線形処理手段、１１１・・・音響結合量測定手段[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention adapts a pseudo echo path based on a simulated error signal in order to remove an acoustic echo (received reproduction sound) generated by superimposing a reception reproduction sound from a partner speaker on a transmission sound and collecting the sound. The present invention relates to a reverberation elimination apparatus, method, and reverberation elimination program, which estimates a pseudo reverberation based on a pseudo reverberation path, and removes the pseudo reverberation from a picked-up signal to obtain a transmitted voice.
[0002]
[Prior art]
A reverberation canceling device for canceling reverberation circulating from the speaker 1 to the microphone 2 is connected as shown in FIG.
Conventionally, in a device, an impulse response h of a reverberation path between a speaker and a microphone is estimated, and a simulated reverberation signal y ′ is generated by a convolution operation h ′ * x of the estimated impulse response h ′ and a reproduction signal x. By subtracting from the echo signal y, an echo cancellation signal e is obtained. However, the convolution operation of the estimated impulse response and the reproduced signal requires a large amount of operation, which is a problem in mounting. In recent years, in order to solve this problem, a reproduction signal or a reverberation signal is once linearly transformed, a parameter corresponding to a linear transformation of an impulse response of a reverberation path is estimated, and multiplication processing is used instead of convolution (Non-patent Documents). 1) or a method of dividing into smaller convolution operations (see Patent Document 1), and the like, to reduce the amount of operation has been proposed. Examples of linear transformations include (fast) discrete Fourier transform, (fast) discrete cosine transform, and (fast) Hartley transform.
FIG. 1 shows a speaker as the sound signal reproducing means, but the sound signal reproducing means also includes an amplifier and a buffer in a stage before reproduction. Similarly, the sound signal collecting means also includes an amplifier and a buffer at the subsequent stage of the microphone.
[0003]
[Non-patent document 1]
Simon Haykin, “Adaptive Filter Theory, Science and Technology Publishing, January 10, 2001, pp. 500-541.
[Patent Document 1]
JP-A-9-116472 (FIG. 6)
[0004]
[Problems to be solved by the invention]
FIGS. 2 and 3 show two configurations described in Non-Patent Document 1 cited in the prior art. The configuration in FIG. 2 is called a configuration with constraints, and the configuration in FIG. 3 is called a configuration without constraints.
2 and 3 both include the following means. That is, a reproduction signal input unit 101 that inputs a reproduction signal output to the audio signal reproduction unit 1 and accumulates the reproduction signal for a certain period of time to obtain a reproduction signal sequence, and an audio signal sound collection unit that exists in the same space as the audio signal reproduction unit 1 2, a sound pickup signal input means 102 for inputting a sound pickup signal and accumulating it for a certain period of time to obtain a sound pickup signal sequence, a reproduction signal conversion means 103 for linearly converting the reproduction signal sequence to obtain a reproduction signal conversion sequence, Simulated reverberation generating means (conversion area reverberation simulating filter) for generating a simulated reverberation signal conversion sequence that simulates a linear conversion sequence of a reverberation signal that enters a conversion sequence and circulates from the sound signal reproducing means 1 to the sound signal collecting means 2. 104, simulation error output means 105 for inputting the simulated echo signal conversion sequence and the collected sound signal sequence and outputting a simulation error of the simulated echo signal conversion sequence, and inputting the reproduced signal conversion sequence and the simulation error. A conversion region echo that generates an updated sequence of the conversion region reflection simulation filter coefficients (an array corresponding to a linear (frequency) conversion of an array of the conversion region reflection simulation filter coefficients) of the simulation echo generation means (conversion region reflection simulation filter) 104. A simulation filter coefficient updating unit 106 is included.
The simulated reverberation generating means 104 includes means for rewriting the transformed area reverberation simulation filter coefficients based on the transformed area reverberation simulation filter coefficient update sequence obtained from the transformed area reverberation simulation filter coefficient updating section 106. The simulation error output from the simulation error output unit 105 to be input to the transformation region echo simulation filter coefficient updating unit 106 has a format that can be evaluated in the linear transformation domain.
[0005]
The difference between the configuration of FIG. 2 (conventional example 1) and that of FIG. 3 (conventional example 2) is that the configuration of conventional example 1 has a transform domain echo simulation filter coefficient constraining means 107.
FIG. 4 shows the details of the transform region echo simulation filter coefficient restricting means 107.
In Non-Patent Document 1, the inverse linear transform means 71 of FIG. 4 is realized by (high-speed) inverse discrete Fourier transform, and the linear transform means 73 is realized by (high-speed) discrete Fourier transform. Also, the windowing means 72 forcibly sets half of the inverse linear transformation sequence of the transformation region echo simulation filter coefficient update sequence to zero. The reason for this constraint is that the conversion region echo simulation filter coefficient to be obtained is a linear transformation of a 2L-length signal train obtained by combining a zero-signal train of the same length L with the impulse response of the true echo path length L. Is to be reflected at the time of updating. Non-Patent Document 1 points out that in the configuration of Conventional Example 2 without the transform region echo simulation filter coefficient restricting means, the convergence of the transform domain echo simulation filter coefficient to the value to be obtained becomes slow.
However, from the viewpoint of the amount of computation, a series of processes of linear inverse transformation, windowing, and linear transformation is not desirable. Therefore, an object of the present invention is to speed up the convergence to the transform region reverberation simulation filter coefficient to be obtained without using the transform region reverberation simulation filter coefficient restricting means as shown in FIG.
[0006]
[Means for Solving the Problems]
In the high-speed LMS (Least Mean Square) algorithm applied to the echo canceller shown in FIGS. 2 and 3 in Non-Patent Document 1, the reproduction error as a reference signal and the simulation error of the simulation echo with respect to the true echo are uncorrelated. The transform domain echo simulation filter coefficient is updated so that
Here, when there is no transform region echo simulation filter coefficient constraining means, there is no guarantee that half of the signal sequence of the inverse linear conversion of the transform domain echo simulation filter coefficients is zero. This non-zero effect appears in the simulation error as a folded component due to cyclic convolution, and the folded component and the reproduced signal can be generally regarded as uncorrelated. Therefore, the high-speed LMS algorithm that determines the transform-domain reverberation simulation filter coefficient by decorrelation, that is, evaluation of the second-order statistic, cannot positively remove the influence of aliasing. That is, in the echo cancellation by the unconstrained filter update, the aliasing component by the cyclic convolution process in the frequency domain transform affects the estimation of the echo path component, and causes a reduction in convergence performance.
[0007]
The conventional high-speed LMS updates the above-mentioned transform area reverberation simulation filter coefficient by using the product of the complex conjugate of the transform coefficient of the received signal and the transform coefficient of the error signal. , The received signal or the error signal is subjected to non-linear processing either before or after the linear transformation, and the above-mentioned updating product is obtained. In addition, the product of the received signal subjected to nonlinear processing on at least one side and the error signal in the conversion (frequency) domain is a higher-order correlation between the received signal and the error signal (second-order (normal correlation in a high-speed LMS)). )) Including quantity. In other words, the present invention is an algorithm that uses the product in the conversion domain of the received signal subjected to the non-linear processing and the error signal in updating the conversion domain reverberation simulation filter coefficient, and is enabled by the modification of the high-speed LMS.
[0008]
According to the present invention, the aliasing component included in the simulation error is made independent of the reference signal (reproduced signal), that is, the conversion region echo simulation filter coefficient is determined by evaluating a higher-order statistic. The convergence to the transform domain reverberation simulation filter coefficient to be obtained is expedited without using the coefficient constraint means. Specifically, as shown in FIG. 5, a reproduced signal non-linear processing means for applying a non-linear processing to the reproduced signal conversion sequence inputted to the conversion area reverberation simulating filter coefficient updating means 106 in the configuration of FIG. 3 (conventional example 2). 108 and a simulation error non-linear processing means 109 for applying a non-linear process to the simulation error input to the transform domain echo simulation filter coefficient updating means 106. For the signal sequence processed by the reproduced signal nonlinear processing means 108 or the simulated error nonlinear processing means 109, the conversion domain echo simulating filter coefficient An update column corresponding to the update of is calculated. However, the value of the step size parameter for adjusting the relative size of the update sequence described in Non-Patent Document 1 is not necessarily the same.
[0009]
According to the present invention, it is possible to positively evaluate the aliasing component included in the simulation error by using a higher-order statistic, and determine an update sequence of the transform area echo simulation filter coefficient.
[0010]
BEST MODE FOR CARRYING OUT THE INVENTION
(Example 1)
Although the basic configuration of the present invention is the configuration shown in FIG. 5, as an alternative configuration that can be easily analogized, a configuration in which the reproduced signal non-linear processing means 108 is arranged at a stage preceding the reproduced signal conversion means 103, that is, the reproduced signal sequence is linear. There is a configuration in which nonlinear processing is performed before conversion. The simulation error output unit 105 outputs a simulation error for the signal sequence before the linear conversion, a unit for linearly converting the simulation error sequence before the linear conversion, and outputs the simulation error sequence after the linear conversion. In the case where the simulation error non-linear processing means 109 is provided, the simulation error non-linear processing means 109 is arranged before the means for linearly converting the simulation error sequence before linear conversion in the simulation error output means 105, that is, for the simulation error sequence before linear conversion. Thus, a configuration for performing non-linear processing may be employed.
As the linear transformation of the reproduction signal transformation means 103, not only DFT but also DCT and Hartley transformation are used. As an example of nonlinear processing when a complex number is obtained using the frequency domain coefficient after conversion, the amplitude is normalized, and only the phase term is extracted, so that the independence is obtained at the phase level of the signal regardless of the signal size. Alternatively, the correlation can be evaluated (normalized).
[0011]
For the nonlinear processing in the reproduced signal nonlinear processing means 108 or the simulated error nonlinear processing means 109 in FIG. 5, for example, a sine function or a hyperbolic tangent function can be used.
The sine function is, for a certain input element z,
(Equation 1)

Given by In addition, since the hyperbolic tangent function causes phase distortion when z is a complex number, sgn (z) · tanh (| z |) is combined with a sine function.
Can also be used. Also, any _a n for a range of values with a z (n = 0, ···, ∞) by,
(Equation 2)

Any function that can be expressed can also be used.
The reproduced signal non-linear processing means 108 and the simulation error non-linear processing means 109 may be processed by the same non-linear function or may be processed by different non-linear functions.
In the reproduced signal non-linear processing means 108 or the simulated error non-linear processing means 109 in FIG. 5, a function different from the above-described non-linear function is provided for each element of the input reproduced signal conversion sequence or the simulated error sequence. May be adopted to perform different nonlinear processing for each element. Further, a configuration may be adopted in which non-linear processing is not performed on a part of each element of the reproduced signal conversion sequence or the simulation error sequence. For example, when the linear transformation is realized by a (fast) discrete Fourier transform, it is conceivable to omit the non-linear processing on the elements of the linear transformation sequence corresponding to a high frequency with few voice components.
[0012]
(Example 2)
FIG. 6 shows that the reproduced signal nonlinear processing means 108 and the simulated error nonlinear processing means 109 of FIG. 5 use the same nonlinear function f (z), and f (z) is f (a) f (b) = f ( FIG. 5 is an alternative embodiment of FIG. 5 when ab) is satisfied, for example, when a sine function or the like is used. The reproduction signal non-linear processing means 108 and the simulation error non-linear processing means 109 of FIG. 5 collectively perform the non-linear processing on the updated sequence of the transform domain echo simulation filter coefficients, that is, the reception signal (reproduction signal) after the non-linear processing and the non-linear processing. When the product of the processed error signal and the product of the received speech signal and the error signal before processing is equal to the product obtained by performing non-linear processing on the product, the non-linear processing is collectively performed at the subsequent stage. , Etc., is replaced by the updated column non-linear processing means 110.
[0013]
(Example 3)
FIG. 7 shows a configuration in which an acoustic coupling amount measuring means 111 for obtaining an acoustic coupling amount as a ratio of the short-time average level of the collected signal to the short-time average level of the reproduced signal is added to the configuration of FIG. In accordance with the magnitude of the acoustic coupling amount measured by the acoustic coupling amount measuring unit 111, the transform region echo simulation filter coefficient updating unit 106 adjusts the size of the update amount of the transform domain echo simulation filter coefficient. Determine the parameters. This is because, for example, when the sine function is adopted in both the reproduced signal nonlinear processing means 108 and the simulated error nonlinear processing means 109, the size of the update sequence of the transform area reverberation simulating filter coefficient becomes larger than the reproduced signal or the simulated error. It no longer depends on the magnitude of the error.
[0014]
In a (high-speed) LMS or the like, the smaller the error is, the smaller the update amount of the adaptive filter (the size of the vector for correcting the filter) becomes, and the smaller the correction becomes, the more the convergence proceeds. On the other hand, when the sine function (output is ± 1) is applied to the phase as in the present invention, the update amount of the adaptive filter is constant regardless of the magnitude of the error, and does not converge more accurately than a certain error range. When the acoustic coupling gain is 1, if the step size is set to 0.01, the update is performed at +0.01, -0.01, and the error can be converged within a range of 1%. Conversely, if the step size is 0.01 when the gain is 10000, the convergence speed becomes very slow. In this case, the step size may be 100. Thus, the step size is adjusted according to the gain. In the above example, the acoustic coupling amount is set to 10000 × the allowable error rate 0.01 (1%) = step size 100 by a method such as multiplying the measured acoustic coupling amount by the estimated error allowable rate.
[0015]
As described above, in the configuration of FIG. 7, a relatively large step size parameter is given when the acoustic coupling amount is large, and a relatively small step size parameter is given when the acoustic coupling amount is small. In FIG. 7, the measurement of the acoustic coupling amount is performed using the reproduced signal and the collected signal before the linear conversion, but the measurement using the reproduced signal and the collected signal after the linear conversion can be easily performed. I can analogy. At this time, for example, when a linear transformation is performed by a (high-speed) discrete Fourier transform, the acoustic coupling amount is measured for each frequency component, and a step size parameter may be given to each frequency element based on each of the measured values. it can. Further, even when information on the amount of acoustic coupling cannot be obtained by measurement, a step size parameter that differs for each element of the linear transformation (for example, is set as a fixed parameter in advance), such as reducing the step size parameter of a high frequency component. However, if the acoustic coupling amount is obtained for each frequency, it is calculated each time). Here, reflecting the measured acoustic coupling amount in the step size parameter itself can be said to be an effective method for estimating the impulse response of the reverberation path in the reverberation canceller generally having the configuration of FIG.
Further, when the simulated reverberation generating means 104 has means for storing the past reproduced signal conversion sequence and a conversion area reverberation simulation filter coefficient corresponding to the stored reproduced signal conversion sequence, The conversion area reverberation simulation filter coefficient updating means 106 is configured as shown in FIG. 8 so that the conversion area reverberation simulation filter coefficient corresponding to the reproduced signal conversion sequence having an earlier storage time can be adjusted to have a smaller update amount. It is conceivable that the configuration is implemented.
[0016]
(Transform area echo simulation filter coefficient updating means (Example 1))
Although the configuration of FIG. 8 has a configuration in which the reproduced signal conversion sequence up to one step in the past can be stored, this configuration can be easily extended to the case where the reproduced signal conversion sequence in a plurality of steps in the past is stored. In FIG. 8, the reproduced signal conversion sequence at the current time (after the non-linear processing) is held in the signal sequence holding unit 61, and the simulation error sequence (after the non-linear processing) held by the signal sequence holding unit 62 and the signal The column multiplication means 63 multiplies each element of the column to obtain a transformed domain echo simulation filter coefficient update sequence. The update region reverberation simulation filter coefficient update sequence is multiplied by an appropriate step size parameter for each individual update column element by the gain adjusting unit 631, and is again converted by the update column holding unit 65 as a conversion region reverberation simulation filter coefficient update sequence. It is held and passed to the simulated reverberation generating means 104. Further, the reproduced signal conversion sequence of one step past (after the non-linear processing) is held in the signal sequence holding means 68 through the signal sequence delay device 67 and held by the signal sequence holding means 62 (after the non-linear processing). ) The simulated error sequence and the signal sequence multiplying means 63 are multiplied for each element of the sequence to obtain a conversion region reverberation simulated filter coefficient update sequence corresponding to the reproduced signal conversion sequence one step past (after the non-linear processing). The conversion region echo simulation filter coefficient update sequence is multiplied by a step size parameter different from that of the gain adjustment unit 631 for each update sequence element by the gain adjustment unit 691, and updated again as the conversion region echo simulation filter coefficient update sequence. The column holding means 65 independently holds the conversion area reverberation simulating filter coefficient corresponding to the previously obtained current signal (after linear processing) conversion signal reverberation sequence, and passes it to the simulated reverberation generating means 104. Here, the step size parameter of the gain adjusting means 691 is preferably smaller than the step size parameter of the gain adjusting means 631 in consideration of the actual reverberation path and the exponential decay of the impulse response. In addition, as shown in FIG. 7, when information from the acoustic coupling amount measuring unit 111 is obtained, the gain adjusting unit 631 and the gain adjusting unit 691 maintain the relative relationship (the magnitude relationship) of the step size parameter, The acoustic coupling amount evaluation means 66 can also scale each step size parameter. When the information from the acoustic coupling amount measuring unit 111 cannot be obtained, the acoustic coupling amount evaluating unit 66 can be omitted. Here, it can be said that the configuration of FIG. 8 is effective for speeding up or estimating the characteristic of the impulse response of the echo path even in the configurations of FIGS. 2 and 3 that do not include the non-linear processing means.
[0017]
(Transform area echo simulation filter coefficient updating means (second embodiment))
Further, in a case where the simulated reverberation generating means has a means for storing the past reproduced signal conversion sequence and a conversion area reverberation simulated filter coefficient corresponding to the stored reproduced signal conversion sequence, the storage time of the reproduced signal conversion sequence In order to realize different nonlinear processing, instead of the basic configuration of FIG. 5 having the reproduced signal nonlinear processing means 108 and the simulated error nonlinear processing means 109, in the configuration of FIG. A configuration in which the non-linear processing unit 601, the non-linear processing unit 602, the non-linear processing unit 603, and the non-linear processing unit 604 shown in FIG. The non-linear processing means 601 performs non-linear processing on the reproduced signal conversion sequence at the current time, the non-linear processing means 602 performs non-linear processing on the reproduced signal conversion sequence one step past, and the non-linear processing means 603 performs the non-linear processing on the current time. The nonlinear processing is performed on the simulation error sequence corresponding to the reproduced signal conversion sequence, and the non-linear processing unit 604 performs the nonlinear processing on the simulation error sequence corresponding to the reproduction signal conversion sequence one step past. These four non-linear processing means can be realized by different non-linear functions. Further, a configuration in which a part of the non-linear processing means is omitted may be employed. This makes it possible to individually adjust the estimation accuracy and estimation speed of the first half where energy is concentrated in the impulse response of the echo path and the second half where there is not much energy. If the acoustic coupling amount measuring means 111 is provided in the configuration of FIG. 3 similarly to FIG. 7, the step size parameter can be controlled by the acoustic coupling amount evaluating means 66 in FIG. If the amount measuring unit 111 cannot be provided, the acoustic coupling amount evaluating unit 66 can be omitted. Although the configuration of FIG. 9 is configured to be able to store the reproduced signal conversion sequence up to one step past, this configuration can be easily extended to the case where the reproduced signal conversion sequence of a plurality of steps past is stored.
[0018]
The echo canceller of the present invention can be constituted by a computer having a CPU and a memory, a user terminal, and a recording medium such as a CD-ROM, a magnetic disk device, and a semiconductor memory.
The echo elimination program recorded on the recording medium or the echo elimination program transmitted via a line is read by a computer, and realizes each of the above-described components on the computer and executes each process.
[0019]
【The invention's effect】
The reverberation canceling device according to the present invention, while omitting the constraint of updating the transform region reverberation simulating filter coefficient that increases the amount of calculation, corresponds to the transform region reverberation simulating filter coefficient corresponding to the impulse response of the true reverberation path in the linear transform region. , Can be found at high speed. In addition, the introduction of a nonlinear function evaluates higher-order statistics and updates the transform-domain echo simulation filter coefficients based on the independence of the reproduced signal and simulation error. However, when it can be regarded as independent from the reproduction signal, they are hardly affected.
[Brief description of the drawings]
FIG. 1 is a diagram showing a schematic configuration of an echo canceller.
FIG. 2 is a diagram showing a configuration of an echo canceller (conventional example 1) having a transform domain echo simulation filter coefficient constraining unit.
FIG. 3 is a diagram showing a configuration of an echo canceller (conventional example 2) having no transform domain echo simulation filter coefficient restricting means.
FIG. 4 is a diagram showing a configuration of a transform-domain reverberation simulating filter coefficient constraining means according to Conventional Example 1.
FIG. 5 is a diagram showing a configuration of an echo canceller (Example 1) of the present invention.
FIG. 6 is a diagram showing a configuration of an echo canceller (Example 2) of the present invention.
FIG. 7 is a diagram showing a configuration of an echo canceller (third embodiment) of the present invention.
FIG. 8 is a diagram showing a configuration of a transform area echo simulation filter coefficient updating unit (first embodiment).
FIG. 9 is a diagram showing a configuration of a transform area echo simulation filter coefficient updating unit (second embodiment).
[Explanation of symbols]
Reference numeral 100: echo canceller, 101: reproduction signal input means, 102: sound pickup signal input means, 103: reproduction signal conversion means, 104: simulated echo generation means, 105: simulation Error output means, 106: Transformation area echo simulation filter coefficient updating means, 108: Reproduction signal non-linear processing means, 109: Simulated error non-linear processing means, 110: Updated column non-linear processing means, 111 ...・ Acoustic coupling amount measurement means

Claims

A reproduction signal input unit that inputs a reproduction signal output to an acoustic signal reproduction unit and accumulates the reproduction signal for a certain period to obtain a reproduction signal sequence;
Sound collection signal input means for inputting a sound collection signal from the sound signal sound collection means existing in the same space as the sound signal reproduction means and accumulating for a certain period of time to obtain a sound collection signal sequence;
Reproduction signal conversion means for linearly converting the reproduction signal sequence to obtain a reproduction signal conversion sequence,
A simulated reverberation generating means for inputting the reproduced signal conversion sequence and generating a simulated reverberation signal conversion sequence simulating a linear conversion sequence of a reverberation signal wrapping from the acoustic signal reproducing means to the acoustic signal collecting means,
Simulation error output means for inputting the simulation echo signal conversion sequence and the sound collection signal sequence and outputting a simulation error of the simulation echo signal conversion sequence,
An echo canceller having a conversion area reverberation simulation filter coefficient updating unit that receives the reproduction signal conversion sequence and the simulation error and generates an update sequence of the conversion region reverberation simulation filter coefficient of the simulation reverberation generation unit,
Reproduction signal nonlinear processing means for applying non-linear processing to the reproduced signal conversion sequence input to the conversion area reverberation simulation filter coefficient updating means, and simulation for applying non-linear processing to the simulation error input to the conversion area reverberation simulation filter coefficient updating means An echo canceller comprising one or both of error non-linear processing means.

The echo canceller according to claim 1,
Dividing at least one of the reproduced signal nonlinear processing means and the simulated error nonlinear processing means by the size of the nonlinear processing object, or realizing the nonlinear processing by applying a hyperbolic tangent function to the nonlinear processing object. Characteristic echo canceller.

The echo canceller according to claim 1,
Echo cancellation characterized by realizing non-linear processing with a different non-linear function for each element of the reproduced signal conversion sequence or the simulated error sequence input to at least one of the reproduced signal non-linear processing means and the simulated error non-linear processing means. apparatus.

The echo canceller according to claim 1,
A non-linear process is not performed on a part of each element of the reproduction signal conversion sequence or the simulation error sequence input to at least one of the reproduction signal nonlinear processing unit and the simulation error nonlinear processing unit. apparatus.

The echo canceller according to claim 1,
Acoustic coupling amount measuring means for determining an acoustic coupling amount as a ratio of the short-time average level of the collected signal to the short-time average level of the reproduction signal,
The reverberation canceling apparatus according to claim 1, wherein the conversion area reverberation simulation filter coefficient updating means includes means for determining the magnitude of the update amount of the conversion domain reverberation simulation filter coefficient in accordance with the magnitude of the acoustic coupling amount.

The echo canceller according to claim 1,
The simulated reverberation generating means includes means for storing the past reproduced signal conversion sequence and a conversion area reverberation simulated filter coefficient corresponding to the stored reproduced signal conversion sequence,
The conversion area reverberation simulation filter coefficient updating means includes means for reducing the update amount as the conversion area reverberation simulation filter coefficient corresponding to the reproduction signal conversion string having an earlier storage time in accordance with the storage time of the reproduction signal conversion string. Characteristic echo canceller.

A step of inputting an acoustic signal to be reproduced, accumulating the audio signal for a certain period of time, and obtaining a reproduced signal sequence;
A step of inputting a collected sound signal collected from the same space as the space in which the acoustic signal is reproduced and accumulating the collected sound signal for a certain period of time to obtain a collected sound signal sequence;
A step of linearly converting the reproduction signal sequence to obtain a reproduction signal conversion sequence;
A step of generating a simulated reverberation signal conversion sequence that simulates a linear conversion sequence of a reverberation signal in which the reproduced sound signal is input to the reproduction signal conversion sequence and wraps around to the collected signal,
A step of inputting the simulated reverberation signal conversion sequence and the collected sound signal sequence and outputting a simulated error of the simulated reverberation signal conversion sequence,
A step of inputting the reproduction signal conversion sequence and the simulation error and generating an update sequence of a transform domain echo simulation filter coefficient for generating the simulation echo signal conversion sequence;
The echo cancellation method having
One or both of a reproduction signal non-linear processing procedure for applying non-linear processing to the reproduction signal conversion sequence and a simulation error non-linear processing procedure for applying non-linear processing to the simulation error, which is used for updating the conversion area echo simulation filter coefficient. An echo cancellation method characterized by comprising:

The echo cancellation method according to claim 7,
It is necessary to divide at least one of the reproduced signal non-linear processing procedure and the simulated error non-linear processing procedure by the size of the non-linear processing target, or to realize the non-linear processing by applying a hyperbolic tangent function to the non-linear processing target. Characteristic echo cancellation method.

The echo cancellation method according to claim 7,
Echo cancellation characterized by realizing non-linear processing by a different non-linear function for each element of the reproduction signal conversion sequence or the simulation error sequence input to at least one of the reproduction signal non-linear processing procedure and the simulation error non-linear processing procedure. Method.

The echo cancellation method according to claim 7,
The echo canceller characterized in that non-linear processing is not performed on a part of each element of the reproduction signal conversion sequence or the simulation error sequence input to at least one of the reproduction signal nonlinear processing procedure and the simulation error nonlinear processing procedure. Method.

The echo cancellation method according to claim 7,
A procedure for determining the amount of acoustic coupling as a ratio of the short-time average level of the collected signal to the short-time average level of the reproduced signal;
Updating the conversion area echo simulation filter coefficient, which determines the magnitude of the update amount of the conversion area echo simulation filter coefficient in accordance with the magnitude of the acoustic coupling amount.

The echo cancellation method according to claim 7,
The generation of the simulated reverberation stores the past reproduced signal conversion sequence, and includes a conversion region reverberation simulation filter corresponding to the stored reproduced signal conversion sequence,
Updating the conversion area reverberation simulation filter coefficient by reducing the update amount as much as the conversion area reverberation simulation filter coefficient corresponding to the reproduction signal conversion row having an earlier storage time according to the storage time of the reproduction signal conversion row. Characteristic echo cancellation method.

A process of inputting an acoustic signal to be reproduced, accumulating for a certain period of time, and obtaining a reproduced signal sequence;
A process of inputting a collected sound signal collected from the same space as the space in which the sound signal is reproduced and accumulating the collected sound signal for a certain period of time to obtain a collected sound signal sequence;
A process of linearly converting the reproduction signal sequence to obtain a reproduction signal conversion sequence;
A process of generating a simulated reverberation signal conversion sequence that simulates a linear conversion sequence of a reverberation signal in which the reproduced signal is input and the reproduced acoustic signal is wrapped around to the collected signal,
A process of inputting the simulated reverberation signal conversion sequence and the collected sound signal sequence and outputting a simulated error of the simulated reverberation signal conversion sequence;
A reverberation canceling program for causing a computer to execute the process of generating an updated sequence of transform domain reverberation simulating filter coefficients by inputting the reproduction signal conversion sequence and the simulated error and generating the simulated reverberation signal conversion sequence,
Used for updating the transform region echo simulation filter coefficient, and includes one or both of a reproduction signal non-linear process for applying a non-linear process to the reproduction signal conversion sequence and a simulation error non-linear process for applying a non-linear process to the simulation error. Echo cancellation program.

The echo cancellation program according to claim 13,
A reverberation canceling program that divides at least one of a reproduced signal nonlinear process and a simulated error nonlinear process by a size of the nonlinear process target, or realizes a nonlinear process by applying a hyperbolic tangent function to the nonlinear process target.

The echo cancellation program according to claim 13,
A reverberation canceling program for realizing a nonlinear process by a different nonlinear function for each element of the reproduced signal conversion sequence or the model error sequence input to at least one of the reproduced signal nonlinear process and the simulation error nonlinear process.

The echo cancellation program according to claim 13,
A reverberation canceling program, wherein non-linear processing is not performed on a part of each element of the reproduction signal conversion sequence or the simulation error sequence input to at least one of the reproduction signal nonlinear processing and the simulation error nonlinear processing.

The echo cancellation program according to claim 13,
A process of obtaining an acoustic coupling amount as a ratio of the short-time average level of the collected signal to the short-time average level of the reproduction signal;
Determining the magnitude of the update amount of the transform area reverberation simulation filter coefficient in accordance with the magnitude of the acoustic coupling amount.

The echo cancellation program according to claim 13,
The generation of the simulated reverberation stores the past reproduced signal conversion sequence, and includes a conversion region reverberation simulated filter coefficient corresponding to the stored reproduced signal conversion sequence,
A reverberation erasing program, characterized in that the reverberation erasing program has a process of reducing the update amount as the conversion area reverberation simulating filter coefficient corresponding to the reproduction signal conversion sequence having an earlier storage time in accordance with the storage time of the reproduced signal conversion sequence.