JP2012205123A

JP2012205123A - Acoustic echo canceller

Info

Publication number: JP2012205123A
Application number: JP2011068480A
Authority: JP
Inventors: Shigenobu Minami; 重信南
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2011-03-25
Filing date: 2011-03-25
Publication date: 2012-10-22

Abstract

PROBLEM TO BE SOLVED: To provide an acoustic echo canceller capable of efficiently achieving stable echo cancellation capability.SOLUTION: The acoustic echo canceller includes: an audio signal decoding unit 2 for decoding a stereo audio signal using a stereo parameter generated on the basis of correlation between input left and right stereo audio signals; and an acoustic echo cancellation unit 6, including a stereo adaptive filter 62 which estimates an acoustic echo produced by the decoded stereo audio signal, for eliminating the acoustic echo estimated from the input audio signals. The acoustic echo cancellation unit 6 further includes a stereo echo canceller controller 61 for controlling the estimation operation of the acoustic echo in the stereo adaptive filter 62, on the basis of the stereo parameter.

Description

本発明の実施形態は、音響エコーキャンセラに関する。 Embodiments described herein relate generally to an acoustic echo canceller.

テレビやカーナビゲーションシステムなどの情報家電機器では、デジタル化されたステレオ・オーディオ信号を復号してステレオ・スピーカーで受聴する方式が一般化されている。 In information home appliances such as a TV and a car navigation system, a method of decoding a digitized stereo audio signal and listening to it with a stereo speaker is generalized.

国際標準方式の多くのステレオ・オーディオ復号器は、左右のチャンネル信号間の相関を利用してステレオパラメタを生成、利用することで効率的に帯域圧縮を行っている。例えば、ＭＰＥＧで規定するオーディオ符号化モードの一つであるインテンシティ・ステレオ方式は、周波数領域のオーディオ信号から、モノラルの信号と、左右のステレオのレベル比をステレオパラメタとして抽出して符号化することにより、帯域圧縮を効率的に行っている。 Many stereo audio decoders of the international standard system efficiently perform band compression by generating and using stereo parameters using the correlation between left and right channel signals. For example, the intensity stereo system, which is one of the audio encoding modes defined by MPEG, extracts and encodes a monaural signal and a left / right stereo level ratio as a stereo parameter from an audio signal in a frequency domain. Thus, band compression is efficiently performed.

近年、このような機器にマイクロフォンを接続し、音声認識や電話・テレビ電話などを可能とする新たな機能が注目されている。例えば、テレビにマイクロフォンを接続し、マイクロフォンを通じて音声ボリュームを調整したり、テレビ電話を使用したりする機能が開発されている。また、例えば、カーナビゲーションシステムにマイクロフォンを接続し、ナビゲーションシステムの制御を音声で行う機能が開発されている。 In recent years, attention has been paid to a new function that enables a microphone to be connected to such a device and enables voice recognition, telephone / videophone, and the like. For example, functions have been developed in which a microphone is connected to a television and the sound volume is adjusted through the microphone or a videophone is used. In addition, for example, a function has been developed in which a microphone is connected to a car navigation system and the navigation system is controlled by voice.

一方、ステレオ・オーディオ再生時にハンズフリー会話や音声認識などを行おうとすると、ステレオ・オーディオ信号のエコーがマイクロフォンに回り込み、音響エコーによる通話障害や音声認識性能の劣化、音声品質の劣化などを引き起こすという問題がある。 On the other hand, if you try to do hands-free conversation or voice recognition during stereo audio playback, the echo of the stereo audio signal wraps around the microphone, causing call failure, voice recognition performance degradation, voice quality degradation, etc. due to acoustic echo There's a problem.

このような障害・劣化を防ぐために、通常は、ステレオ音響エコーキャンセラを用いることにより、マイクロフォンで収音した信号からマイクロフォンに回り込んだステレオ・オーディオ信号を除去している。 In order to prevent such obstacles / deterioration, a stereo audio echo canceller is usually used to remove a stereo audio signal that has sneak into the microphone from a signal collected by the microphone.

一般的なステレオ音響エコーキャンセラは、システム同定を用いてスピーカーとマイクロフォンとの間のエコーパス特性を推定する。このシステム同定には、一般的に適応フィルタが用いられる。 A typical stereo acoustic echo canceller estimates the echo path characteristics between a speaker and a microphone using system identification. An adaptive filter is generally used for this system identification.

しかし、従来の構成の音響エコーキャンセラでは、適応フィルタに入力される左右のステレオ・オーディオ信号の間に相関があると、正しいステレオエコーパス特性が推定できず、相関が変化したときにエコーキャンセル能力が著しく劣化するという問題があった。特に、ステレオ・オーディオ信号に本質的にモノラル信号である単独発言がある場合にこの問題は顕著となっていた。 However, with a conventional acoustic echo canceller, if there is a correlation between the left and right stereo audio signals that are input to the adaptive filter, the correct stereo echo path characteristics cannot be estimated, and the echo cancellation capability when the correlation changes There has been a problem that the material deteriorates significantly. This problem is particularly noticeable when the stereo audio signal has a single statement that is essentially a monaural signal.

この問題を解決するために、左右のステレオ・オーディオ信号間の相関を低減すべく、各チャンネルに独立な歪みを与える方法が提案されているが、ステレオ・オーディオ信号自体を歪めてしまうために、音声品質の劣化などを引き起こすといった大きな問題が生じてしまう。 In order to solve this problem, in order to reduce the correlation between the left and right stereo audio signals, a method of giving independent distortion to each channel has been proposed, but in order to distort the stereo audio signal itself, A big problem of causing degradation of voice quality occurs.

特開２００６−３１４０５１号公報JP 2006-314051 A

本実施形態が解決しようとする課題は、効率的に安定したエコーキャンセル能力を実現することができる音響エコーキャンセラを提供することである。 The problem to be solved by the present embodiment is to provide an acoustic echo canceller that can realize an efficient and stable echo cancellation capability.

本実施形態の音響エコーキャンセラは、入力された左右のステレオ・オーディオ信号の間の相関に基づき生成されたステレオパラメタを用いて、前記ステレオ・オーディオ信号を復号するオーディオ信号復号部と、前記復号されたステレオ・オーディオ信号に起因する音響エコーを推定するステレオエコー推定器を備え、入力された音声信号から前記推定された音響エコーを除去する音響エコーキャンセル部とを具備している。前記音響エコーキャンセル部は、前記ステレオパラメタに基づき、前記ステレオエコー推定器における前記音響エコーの推定動作を制御するステレオエコーキャンセラ制御器をさらに備えていることを特徴とする。 The acoustic echo canceller of the present embodiment includes an audio signal decoding unit that decodes the stereo audio signal using a stereo parameter generated based on a correlation between the input left and right stereo audio signals, and the decoded A stereo echo estimator for estimating the acoustic echo caused by the stereo audio signal, and an acoustic echo canceling unit for removing the estimated acoustic echo from the input voice signal. The acoustic echo cancellation unit further includes a stereo echo canceller controller that controls an estimation operation of the acoustic echo in the stereo echo estimator based on the stereo parameter.

第１及び第２の実施形態に係わる音響エコーキャンセラの構成の一例を説明する概略ブロック図。The schematic block diagram explaining an example of the structure of the acoustic echo canceller concerning 1st and 2nd embodiment. 第１の実施形態に係わるステレオエコーキャンセラ制御器６１による制御手順を説明するフローチャート。6 is a flowchart for explaining a control procedure by a stereo echo canceller controller 61 according to the first embodiment.

以下、図面を参照して実施形態を説明する。 Hereinafter, embodiments will be described with reference to the drawings.

（第１の実施形態）
始めに、図１を参照して、本発明の第１の実施形態に係わる音響エコーキャンセラの構成を説明する。図１は、本発明の第１及び第２の実施形態に係わる音響エコーキャンセラの構成の一例を説明する概略ブロック図である。なお、図１には、音響エコーキャンセラと接続して使用される周辺機器も図示している。 (First embodiment)
First, the configuration of an acoustic echo canceller according to the first embodiment of the present invention will be described with reference to FIG. FIG. 1 is a schematic block diagram for explaining an example of the configuration of an acoustic echo canceller according to the first and second embodiments of the present invention. FIG. 1 also shows peripheral devices used in connection with an acoustic echo canceller.

図１に示すように、本発明の第１の実施形態に係わる音響エコーキャンセラは、チューナ等からステレオ・オーディオ信号を受信して符号化する受信器１と、符号化された信号を復号してデジタルオーディオ信号を出力するオーディオ信号復号部２と、マイクロフォン５などから入力された音声信号から音響エコーを除去する音響エコーキャンセル部６とを備えている。 As shown in FIG. 1, an acoustic echo canceller according to the first embodiment of the present invention receives a stereo audio signal from a tuner or the like and encodes it, and decodes the encoded signal. An audio signal decoding unit 2 that outputs a digital audio signal and an acoustic echo canceling unit 6 that removes an acoustic echo from a voice signal input from a microphone 5 or the like are provided.

オーディオ信号復号部２には、符号化された信号を複数の帯域に分割するデータ分解器２１と、割り当てられた帯域について、符号化された信号を復号して左右のステレオ・オーディオ信号を再生するステレオ信号復号部２２と、全帯域の再生されたステレオ・オーディオ信号を合成する帯域合成器２３とを備えている。 The audio signal decoding unit 2 includes a data decomposer 21 that divides the encoded signal into a plurality of bands, and reproduces the left and right stereo audio signals by decoding the encoded signals for the allocated bands. A stereo signal decoding unit 22 and a band synthesizer 23 for synthesizing the reproduced stereo audio signal of the entire band are provided.

ステレオ信号復号部２２は、データ分解器２１によって分割される帯域の数Ｔと同数のステレオ信号復号器２２_０、２２_１、…、２２_Ｔ−１で構成されている。例えば、データ分解器２１によって符号化された信号が１６個の帯域に分割される場合、ステレオ信号復号部２２には１６個のステレオ信号復号器２２_０、２２_１、…、２２_１５が配置される。各ステレオ信号復号器２２_０、２２_１、…、２２_Ｔ−１は、それぞれ帯域分割モノラルオーディオデコーダ２４と、相関生成器２５とを備えている。 The stereo signal decoding unit 22 is composed of the same number of stereo signal decoders 22 ₀ , 22 ₁ ,..., 22 _T−1 as the number T of bands divided by the data decomposer 21. For example, if the signal coded by the data decomposer 21 is divided into 16 bands, 16 stereo signal decoder ₂₂ 0 is the stereo signal decoding section _22, 22 1, ..., _{22 15} are arranged The Each of the stereo signal decoders 22 ₀ , 22 ₁ ,..., 22 _T−1 includes a band division monaural audio decoder 24 and a correlation generator 25.

一方、音響エコーキャンセル部６は、左右のステレオ・オーディオ信号からスピーカー３、４とマイクロフォン５との間のエコーパス特性を推定して疑似エコー信号を生成する、ステレオエコー推定器としてのステレオ適応フィルタ６２と、左右のステレオ・オーディオ信号のどちらか一方の信号であるモノラル信号に基づきスピーカー３、４とマイクロフォン５との間のエコーパス特性を推定するモノラル適応フィルタ６３とを備えている。モノラル適応フィルタ６３には、切り替えスイッチ６７を介して左右のステレオ・オーディオ信号のいずれかが入力される。ステレオ適応フィルタ６２とモノラル適応フィルタ６３とはステレオエコーパス推定制御器６４で接続されている。 On the other hand, the acoustic echo canceling unit 6 estimates the echo path characteristics between the speakers 3 and 4 and the microphone 5 from the left and right stereo audio signals, and generates a pseudo echo signal, as a stereo adaptive filter 62 as a stereo echo estimator. And a monaural adaptive filter 63 that estimates an echo path characteristic between the speakers 3 and 4 and the microphone 5 based on a monaural signal that is one of the left and right stereo audio signals. One of the left and right stereo audio signals is input to the monaural adaptive filter 63 via the changeover switch 67. The stereo adaptive filter 62 and the monaural adaptive filter 63 are connected by a stereo echo path estimation controller 64.

ステレオ適応フィルタ６２で推定され生成された疑似エコー信号は、加算器６５によってマイクロフォン５から入力された音声信号から減ぜられる。疑似エコー信号が減ぜられてエコーが除去された音声信号は、外部機器に出力されると同時にステレオ適応フィルタ６２にフィードバックされ、エコーパス特性を推定する学習に用いられる。 The pseudo echo signal estimated and generated by the stereo adaptive filter 62 is subtracted from the audio signal input from the microphone 5 by the adder 65. The audio signal from which the pseudo echo signal is reduced and the echo is removed is output to an external device and simultaneously fed back to the stereo adaptive filter 62 to be used for learning for estimating the echo path characteristic.

また、モノラル適応フィルタ６３で推定され生成された疑似エコー信号は、加算器６６によってマイクロフォン５から入力された音声信号から減ぜられる。モノラル適応フィルタ６３で生成された疑似エコー信号が減ぜられた音声信号は、モノラル適応フィルタ６３にフィードバックされ、エコーパス特性を推定する学習に用いられる。 Further, the pseudo echo signal estimated and generated by the monaural adaptive filter 63 is subtracted from the sound signal input from the microphone 5 by the adder 66. The audio signal from which the pseudo echo signal generated by the monaural adaptive filter 63 is reduced is fed back to the monaural adaptive filter 63 and used for learning to estimate the echo path characteristic.

また、音響エコーキャンセル部６は、音響エコーキャンセル部６全体の制御を行うステレオエコーキャンセラ制御器６１を備えている。ステレオエコーキャンセル部６は、ステレオエコーパス推定制御器６４を介して、ステレオ適応フィルタ６２におけるエコーパス特性の推定に用いられる係数などを制御する。また、ステレオエコーキャンセラ制御器６１は、切り替えスイッチ６７を制御して、モノラル適応フィルタ６３に入力する信号を選択・切り替えさせる。 The acoustic echo canceling unit 6 includes a stereo echo canceller controller 61 that controls the entire acoustic echo canceling unit 6. The stereo echo cancellation unit 6 controls a coefficient used for estimation of echo path characteristics in the stereo adaptive filter 62 via the stereo echo path estimation controller 64. Further, the stereo echo canceller controller 61 controls the changeover switch 67 to select / switch the signal input to the monaural adaptive filter 63.

受信器１は、チューナ等からステレオ・オーディオ信号を受信し、帯域圧縮を施して符号化してオーディオ信号復号部２に出力する。 The receiver 1 receives a stereo audio signal from a tuner or the like, performs band compression, encodes it, and outputs it to the audio signal decoding unit 2.

例えば、最近標準化されたステレオ対応のオーディオ復号方式では、ＤＣＴ（Discrete Cosine Transform、離散コサイン変換）や帯域分割フィルタを用い、ステレオ・オーディオ信号を複数の帯域成分に分割して効率的に符号化する。その際、ステレオ・オーディオ信号には、各チャンネル間に相関があることが多い。そこで、フレーム区間ｉ、帯域ｊごとに、以下のような直交化処理を施してチャンネル間の相関を除去した後に、モノラルの符号化方式を各チャンネルに施す。 For example, in a recently standardized audio decoding method for stereo, a DCT (Discrete Cosine Transform) or a band division filter is used to efficiently divide a stereo audio signal into a plurality of band components. . In that case, the stereo audio signal often has a correlation between the channels. Therefore, after performing orthogonalization processing as described below for each frame section i and band j to remove the correlation between channels, a monaural encoding method is applied to each channel.

例えば、ｊ番目の帯域におけるｉ番目のフレームについて、左右のチャンネルのステレオ・オーディオ信号をそれぞれＸ_Ｌｉｊ（ｚ）、Ｘ_Ｒｉｊ（ｚ）とし、チャンネル間相関によるチャンネル間伝達関数をＧ_ＲＲｉｊ（ｚ）、Ｇ_ＲＬｉｊ（ｚ）、Ｇ_ＬＲｉｊ（ｚ）、Ｇ_ＬＬｉｊ（ｚ）とすると、左右のチャンネルのステレオ・オーディオ信号の和信号と差信号をそれぞれチルダＸ_Ｍｉｊ（ｚ）、チルダＸ_Ｓｉｊ（ｚ）は以下の式（１）で表すことができる。

For example, for the i-th frame in the j-th band, the left and right channel stereo audio signals are X _Lij (z) and X _Rij (z), respectively, and the inter-channel transfer function based on the inter-channel correlation is G _RRij (z). , _GRLij (z), _GLRij (z), and _GLLij (z), the tilde X _Mij (z) and the tilde X _Sij (z) are the sum and difference signals of the stereo audio signals of the left and right channels, respectively. Can be represented by the following formula (1).

例えば、左チャンネル信号と右チャンネル信号との間に以下の式（２）のような関係がある場合、

For example, when there is a relationship like the following formula (2) between the left channel signal and the right channel signal,

（ただし、Ｅ_Ｌｉｊ（ｚ）はチャンネル間の非相関成分を表す。）
式（１）右辺におけるチャンネル間伝達関数の共役複素数のマトリクスは、以下の式（３）のように表される。

(E _Lij (z) represents a non-correlated component between channels.)
A matrix of conjugate complex numbers of the transfer function between channels on the right side of Expression (1) is expressed as Expression (3) below.

式（２）と式（３）とを式（１）に代入すると、式（４）が得られる。

When Expression (2) and Expression (3) are substituted into Expression (1), Expression (4) is obtained.

すなわち、チルダＸ_Ｍｉｊ（ｚ）、チルダＸ_Ｓｉｊ（ｚ）は、相関成分と非相関成分とに分離することができる。このとき、非相関成分Ｅ_Ｌｉｊ（ｚ）が無視できるとすると、チルダＸ_Ｓｉｊ（ｚ）はゼロとなるので、チルダＸ_Ｍｉｊ（ｚ）とチャンネル間の相関情報を符号化し、オーディオ信号復号部２に出力すればよいことになる。 That is, tilde X _Mij (z) and tilde X _Sij (z) can be separated into a correlated component and a non-correlated component. At this time, if the non-correlation component E _Lij (z) can be ignored, the tilde X _Sij (z) becomes zero. Therefore, the correlation information between the tilde X _Mij (z) and the channel is encoded, and the audio signal decoding unit 2 It is sufficient to output to.

より具体的な実現例であるＭＰＥＧ１やＭＰＥＧ２のインテンシティ・ステレオ方式では、ステレオ・オーディオ信号Ｘ_Ｒｉｊ（ｚ）、Ｘ_Ｌｉｊ（ｚ）をそれぞれ式（５）、（６）のように仮定する。

In the intensity stereo system of MPEG1 or MPEG2, which is a more specific implementation example, the stereo audio signals X _Rij (z) and X _Lij (z) are assumed as shown in equations (5) and (6), respectively.

なお、式（５）、式（６）において、ｌ_Ｒｉｊとｌ_Ｌｉｊとは信号のレベル値を表す。すると、式（１）の右辺におけるチャンネル間伝達関数の共役複素数のマトリクスは、以下の式（７）のように表される。

In Expressions (5) and (6), l _Rij and l _Lij represent signal level values. Then, the matrix of conjugate complex numbers of the transfer function between channels on the right side of Expression (1) is expressed as Expression (7) below.

式（５）、式（６）、式（７）を、式（１）に挿入すると、式（８）の関係が得られる。

When Expression (5), Expression (6), and Expression (7) are inserted into Expression (1), the relationship of Expression (8) is obtained.

すなわち、式（８）の関係に基づき、ステレオ・オーディオ信号に直交化を行って、左右のチャンネルの和信号チルダＸ_Ｍｉｊ（ｚ）と、レベル情報ｌ_Ｒｉｊとｌ_Ｌｉｊとを符号化する。これらの符号化方式は非相関成分がゼロとなるように処理を行うので、符号化するのはステレオ・オーディオ信号の主成分と、左右のチャンネルの相関を生成するためのチャンネル間相関情報（伝達関数）もしくはその近似であるレベル値である。 That is, based on the relationship of Expression (8), the stereo audio signal is orthogonalized to encode the left and right channel sum signal tilde X _Mij (z) and the level information l _Rij and l _Lij . Since these encoding methods perform processing so that the uncorrelated component becomes zero, the main component of the stereo audio signal and the inter-channel correlation information (transmission) for generating the correlation between the left and right channels are encoded. Function) or an approximate level value.

符号化された信号は、オーディオ信号復号部２のデータ分解器２１に入力される。データ分解器２１では、受信した信号をあらかじめ設定された個数Ｔ（例えば１６個）の帯域に分割する。この分割処理により、具体的には、ｉ番目のフレームのｊ番目の帯域については、ステレオ・オーディオ信号の主成分の符号化コードＩ_Ｍｉｊと、チャンネル間相関情報の符号化コードであるステレオパラメタ信号ｐ_ｉｊが分割される。 The encoded signal is input to the data decomposer 21 of the audio signal decoding unit 2. The data decomposer 21 divides the received signal into a preset number T (for example, 16) of bands. By this division processing, specifically, for the j-th band of the i-th frame, the stereo parameter signal which is the main component encoding code I _Mij of the stereo audio signal and the inter-channel correlation information encoding code. p _ij is divided.

ステレオ・オーディオ信号の主成分の符号化コードＩ_Ｍｉｊは、それぞれの帯域に対応したステレオ信号復号器２２_ｊの帯域分割モノラルオーディオ復号器２４に入力され、復号処理される。また、ステレオパラメタ信号ｐ_ｉｊは、それぞれの帯域に対応したステレオ信号復号器２２_ｊの相関生成器２５に入力される。 The encoded code I _Mij as the main component of the stereo audio signal is input to the band division monaural audio decoder 24 of the stereo signal decoder 22 _j corresponding to each band and is decoded. Further, the stereo parameter signal p _ij is input to the correlation generator 25 of the stereo signal decoder 22 _j corresponding to each band.

帯域分割モノラルオーディオ復号器２４に入力された符号化コードＩ_Ｍｉｊは、復号処理されることで各帯域のオーディオ主信号Ｘ_Ｍｉｊ（ｚ´）となる。モノラル信号であるオーディオ主信号Ｘ_Ｍｉｊ（ｚ´）は、同じステレオ信号復号器２２_ｊに配置された相関生成器２５に出力される。 The encoded code I _Mij input to the band division monaural audio decoder 24 is decoded to become the audio main signal X _Mij (z ′) of each band. The audio main signal X _Mij (z ′) which is a monaural signal is output to the correlation generator 25 arranged in the same stereo signal decoder 22 _j .

相関生成器２５では、データ分解器２１から入力されたステレオパラメタ信号ｐ_ｉｊをもとに、チャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）が生成される。チャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）と、帯域分割モノラルオーディオ復号器２４から入力されたオーディオ主信号Ｘ_Ｍｉｊ（ｚ´）とを用い、以下の式（９）、式（１０）に基づく演算処理を行って、部分帯域ごとに左右のステレオ・オーディオ信号Ｘ_Ｌｉｊ（ｚ´）、Ｘ_Ｒｉｊ（ｚ´）を生成する。

The correlation generator 25 generates inter-channel transfer functions G _Rij (z ′) and G _Lij (z ′) based on the stereo parameter signal p _ij input from the data decomposer 21. Using the inter-channel transfer functions G _Rij (z ′), G _Lij (z ′) and the audio main signal X _Mij (z ′) input from the band division monaural audio decoder 24, the following equation (9), An arithmetic processing based on Expression (10) is performed to generate left and right stereo audio signals X _Lij (z ′) and X _Rij (z ′) for each partial band.

ステレオ信号復号器２２_ｊで生成された、部分帯域ごと左右のステレオ・オーディオ信号Ｘ_Ｌｉｊ（ｚ´）、Ｘ_Ｒｉｊ（ｚ´）は、帯域合成器２３に出力される。帯域合成器２３では、部分帯域の信号から全帯域の信号を復元する関数太字Ｆを用い、すべてのステレオ復号器２２_０、２２_１、…、２２_Ｔ−１から入力されたステレオ・オーディオ信号が、式（１１）、式（１２）に従って合成される。

The left and right stereo audio signals X _Lij (z ′) and X _Rij (z ′) generated by the stereo signal decoder 22 _j for each partial band are output to the band synthesizer 23. The band synthesizer 23, using a function bold F to recover the signals of all the bands from the signal of the partial band, all the stereo decoder 22 _0, 22 1, _..., stereo audio signal is inputted from the 22 _T-1 , And are synthesized according to Equation (11) and Equation (12).

合成された右側のステレオ・オーディオ信号チルダＸ_Ｒｉ（ｚ）は、右側のスピーカー３に出力されて再生される。同様に、左側のステレオ・オーディオ信号チルダＸ_Ｌｉ（ｚ）は、左側のスピーカー４に出力されて再生される。 The synthesized right stereo audio signal tilde X _Ri (z) is output to the right speaker 3 and reproduced. Similarly, the left stereo audio signal tilde X _Li (z) is output to the left speaker 4 and reproduced.

帯域合成器２３で合成された左右のステレオ・オーディオ信号チルダＸ_Ｒｉ（ｚ）、Ｘ_Ｌｉ（ｚ）は、音響エコーキャンセル部６のステレオ適応フィルタ６２にも出力される。ステレオ適応フィルタ６２は、システム同定を用いてスピーカー３、４とマイクロフォン５との間のエコーパス特性（疑似ステレオエコーパス特性）を推定し、疑似エコー信号ハットＹ_ｉ（ｚ）を生成する。 The left and right stereo audio signals tilde X _Ri (z) and X _Li (z) synthesized by the band synthesizer 23 are also output to the stereo adaptive filter 62 of the acoustic echo canceling unit 6. The stereo adaptive filter 62 estimates the echo path characteristic (pseudo stereo echo path characteristic) between the speakers 3 and 4 and the microphone 5 using the system identification, and generates the pseudo echo signal hat Y _i (z).

左右のスピーカーとマイクロフォンとの間のステレオエコーパス特性をそれぞれＨ_Ｌｉ（ｚ）、Ｈ_Ｒｉ（ｚ）とし、送話音声と室雑音との合成信号をＮ_ｉ（ｚ）とすると、ｉ番目のフレームにおけるマイクロフォンに入力される音声信号Ｙ_ｉ（ｚ）は式（１３）のように表すことができる。

If the stereo echo path characteristics between the left and right speakers and the microphone are H _Li (z) and H _Ri (z), and the synthesized signal of the transmitted voice and room noise is N _i (z), the i th The audio signal Y _i (z) input to the microphone in the frame can be expressed as in Expression (13).

一方、適応フィルタは、左右のスピーカーとマイクロフォンとの間のステレオエコーパス特性を推定し、左右の疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）として内部に保持しており、これらと左右のスピーカーから出力されるステレオ・オーディオ信号であるチルダＸ_Ｌｉ（ｚ）、チルダＸ_Ｒｉ（ｚ）とを用いて式（１４）に表す疑似エコー信号を生成する。

On the other hand, the adaptive filter estimates the stereo echo path characteristics between the left and right speakers and the microphone, and holds them internally as the left and right pseudo stereo echo path characteristics hat H _Li (z) and hat H _Ri (z). These and the tilde X _Li (z) and tilde X _Ri (z), which are stereo audio signals output from the left and right speakers, are used to generate a pseudo echo signal represented by Expression (14).

なお、疑似ステレオエコーパス特性は、左右のスピーカーから出力されるステレオ・オーディオ信号とそれ自身とを用いてサンプリングの時間領域で学習され修正される。式（１３）で表される音声信号Ｙ_ｉ（ｚ）から式（１４）で表される疑似エコー信号ハットＹ_ｉ（ｚ）を差し引くことにより、マイクロフォンに回り込んだステレオ・オーディオ信号を除去し、出力信号Ｅ_ｉ（ｚ）を得る。（式（１５）参照。）

The pseudo stereo echo path characteristic is learned and corrected in the sampling time domain using the stereo audio signal output from the left and right speakers and itself. By subtracting the pseudo echo signal hat Y _i (z) represented by the equation (14) from the audio signal Y _i (z) represented by the equation (13), the stereo audio signal that wraps around the microphone is removed. The output signal E _i (z) is obtained. (See equation (15).)

ここで、適応フィルタが収束していれば、下記の式（１６）、式（１７）のように推定した疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）が、真のステレオエコーパス特性Ｈ_Ｌｉ（ｚ）、Ｈ_Ｒｉ（ｚ）と一致する。

Here, if the adaptive filter has converged, the pseudo stereo echo path characteristics hat H _Li (z) and hat H _Ri (z) estimated as in the following expressions (16) and (17) are true. It matches the stereo echo path characteristics H _Li (z), H _Ri (z).

従って、式（１８）に示すように出力信号Ｅ_ｉ（ｚ）と送話音声と室雑音との合成信号Ｎ_ｉ（ｚ）とが一致する。すなわち、適応フィルタを収束させることによってエコーを抑制することができる。

Therefore, as shown in Expression (18), the output signal E _i (z) matches the synthesized signal N _i (z) of the transmitted voice and room noise. That is, echoes can be suppressed by converging the adaptive filter.

音響エコーの抑制効果を向上させるためには、上述した（１６）式、（１７）式に示されるように、疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）を、真のステレオエコーパス特性Ｈ_Ｌｉ（ｚ）、Ｈ_Ｒｉ（ｚ）と一致させることが重要になる。疑似ステレオエコーパス特性は、左右のスピーカー３、４から出力されるステレオ・オーディオ信号とそれ自身とを用いてサンプリングの時間領域で学習され修正される。 In order to improve the acoustic echo suppression effect, the pseudo stereo echo path characteristics hat H _Li (z) and hat H _Ri (z) are set to true as shown in the above equations (16) and (17). It is important to match the stereo echo path characteristics H _Li (z) and H _Ri (z). The pseudo stereo echo path characteristic is learned and corrected in the sampling time domain using the stereo audio signal output from the left and right speakers 3 and 4 and itself.

ステレオ適応フィルタ６２における疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）の学習は、実際にはサンプリングの時間領域で行われる。例えば、よく知られたＮＬＭＳ法（Normalized Least Mean Square、正規化最小平均二乗）では、次の式（１９）、及び式（２０）で与えられる。

The learning of the pseudo stereo echo path characteristics hat H _Li (z) and hat H _Ri (z) in the stereo adaptive filter 62 is actually performed in the sampling time domain. For example, in the well-known NLMS method (Normalized Least Mean Square), the following equations (19) and (20) are given.

式（１９）及び式（２０）において、太字ハットＨ_Ｌ（ｋ）、太字ハットＨ_Ｒ（ｋ）、太字Ｘ_Ｌ（ｋ）、及び、太字Ｘ_Ｒ（ｋ）は、Ｎタップの適応ＦＩＲフィルタの係数からなるＮ次元ベクトルである。また、ｅ（ｋ）はエコー除去の残差信号、αは収束速度を決めるステップゲインと呼ばれる制御パラメータである。 In Expression (19) and Expression (20), the bold hat H _L (k), the bold hat H _R (k), the bold _XL (k), and the bold X _R (k) are N-tap adaptive FIR filters. It is an N-dimensional vector composed of the coefficients of Further, e (k) is a residual signal for echo cancellation, and α is a control parameter called step gain that determines the convergence speed.

太字Ｘ_Ｌ（ｋ）、及び、太字Ｘ_Ｒ（ｋ）には、それぞれ帯域合成器２３で合成された左右のステレオ・オーディオ信号チルダＸ_Ｌｉ（ｚ）、Ｘ_Ｒｉ（ｚ）が適用され、ｅ（ｋ）は、加算器６５を経て音響エコーが除去された後の音声信号が適用される。ステップゲインであるαの値は、ステレオエコーパス推定制御器６４によって設定される。 The left and right stereo audio signal tildes X _Li (z) and X _Ri (z) synthesized by the band synthesizer 23 are applied to the bold letters X _L (k) and the bold letters X _R (k), respectively, e For (k), the sound signal after the acoustic echo is removed through the adder 65 is applied. The value of α as the step gain is set by the stereo echo path estimation controller 64.

なお、モノラル適応フィルタ６３も、システム同定を用いてスピーカー３、４のいずれか片方とマイクロフォン５との間のエコーパス特性（疑似モノラルエコーパス特性）ハットＨ_ｉ（ｚ）を推定する。また、疑似モノラルエコーパス特性ハットＨ_ｉ（ｚ）は、左右のスピーカー３、４から出力される信号のうちスイッチ６７によって選択された一方のステレオ・オーディオ信号（モノラル信号）とそれ自身などを用い、ステレオ適応フィルタ６２と同様にサンプリングの時間領域で学習され修正される。 The monaural adaptive filter 63 also estimates an echo path characteristic (pseudo monaural echo path characteristic) hat H _i (z) between one of the speakers 3 and 4 and the microphone 5 using system identification. The pseudo monaural echo path characteristic hat H _i (z) uses one stereo audio signal (monaural signal) selected by the switch 67 among the signals output from the left and right speakers 3 and 4 and the like. In the same manner as the stereo adaptive filter 62, learning is performed and corrected in the sampling time domain.

ステレオエコーパス推定制御器６４は、ステレオ適応フィルタ６２における疑似ステレオエコーパス特性の学習に用いられるステップゲインαの値を、ステレオエコーキャンセラ制御器６１からの指示に従って制御する。また、モノラル適応フィルタ６３における疑似モノラルエコーパス特性の学習に用いられるステップゲインα´の値も、ステレオエコーキャンセラ制御器６１からの指示に従って制御する。また、疑似モノラルエコーパス特性と疑似ステレオエコーパス特性との間の相互変換も行う。 The stereo echo path estimation controller 64 controls the value of the step gain α used for learning the pseudo stereo echo path characteristic in the stereo adaptive filter 62 in accordance with an instruction from the stereo echo canceller controller 61. Further, the value of the step gain α ′ used for learning the pseudo monaural echo path characteristic in the monaural adaptive filter 63 is also controlled in accordance with an instruction from the stereo echo canceller controller 61. Further, mutual conversion between the pseudo monaural echo path characteristic and the pseudo stereo echo path characteristic is also performed.

ステレオエコーキャンセラ制御器６１は、チャンネル間相関情報の符号化コードであるステレオパラメタ信号ｐ_ｉｊないしは、ステレオパラメタ信号ｐ_ｉｊから生成されたチャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）の変化を監視し、監視結果に基づきステップゲインα、α´の値を制御する。 The stereo echo canceller controller 61 is a stereo parameter signal p _ij that is an encoding code of inter-channel correlation information or an inter-channel transfer function G _Rij (z ′), G _Lij (z ′) generated from the stereo parameter signal p _ij. ) And the step gains α and α ′ are controlled based on the monitoring result.

例えば、チャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）をレベル値のみで以下の式（２１）、（２２）のように近似する場合、

For example, when the inter-channel transfer functions G _Rij (z ′) and G _Lij (z ′) are approximated by only the level values as in the following equations (21) and (22):

左右のチャンネル間伝達関数の比率変化指標μ_ｉを式（２３）によって求める。

The ratio change index μ _i of the transfer function between the left and right channels is obtained by Expression (23).

この比率変化指標μ_ｉの監視結果に基づくステップゲインαの値の制御方法を、図２を用いて説明する。図２は、第１の実施形態に係わるステレオエコーキャンセラ制御器６１による制御手順を説明するフローチャートである。 A method of controlling the value of the step gain α based on the monitoring result of the ratio change index μ _i will be described with reference to FIG. FIG. 2 is a flowchart for explaining a control procedure by the stereo echo canceller controller 61 according to the first embodiment.

まず、ステップＳ１において、当該フレームにおける左右のチャンネル間伝達関数の比率変化指標μ_ｉを式（２３）によって求める。次に、μ_ｉと予め設定された比率変化指標の閾値μ_ＴＨとを比較する（ステップＳ２）。 First, in step S1, the ratio change index μ _i of the transfer function between the left and right channels in the frame is obtained by Expression (23). Next, μ _i is compared with a preset ratio change index threshold value μ _TH (step S2).

ステップＳ２においてμ_ｉが閾値μ_ＴＨを下回った場合、左右のステレオ・オーディオ信号の相関が強いと判定してステップＳ４に進み、ステップゲインαの値を０にセットするようステレオエコーパス推定制御器６４を介してステレオ適応フィルタ６２を制御する。同時にステップゲインα´の値を１にセットするようステレオエコーパス推定制御器６４を介してモノラル適応フィルタ６３を制御する。 If μ _i falls below the threshold μ _TH in step S2, the stereo echo path estimation controller determines that the correlation between the left and right stereo audio signals is strong and proceeds to step S4, and sets the value of step gain α to 0. The stereo adaptive filter 62 is controlled via 64. At the same time, the monaural adaptive filter 63 is controlled via the stereo echo path estimation controller 64 so as to set the value of the step gain α ′ to 1.

なお、ステレオエコーキャンセラ制御器６１は、ステップＳ４においてステップゲインα´の値を１にセットするとともに、左右のステレオ・オーディオ信号のうちレベル値の高い一方とモノラル適用フィルタ６３とを接続するように切り替えスイッチ６７を制御する。また、ステレオエコーパス推定制御器６４に対し、それまでに学習した疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）から周知の変換式を用いて疑似モノラルエコーパス特性ハットＨ_ｉ（ｚ）を算出し、モノラル適応フィルタ６３にセットするよう制御指示を与える。 The stereo echo canceller controller 61 sets the value of the step gain α ′ to 1 in step S4 and connects the monaural filter 63 with one of the left and right stereo audio signals having a higher level value. The changeover switch 67 is controlled. Further, the stereo echo path estimation controller 64 uses the pseudo stereo echo path characteristic hat H _Li (z) and the hat H _Ri (z) learned so far to use the pseudo monaural echo path characteristic hat H using a known conversion formula. _i (z) is calculated, and a control instruction is given to set it to the monaural adaptive filter 63.

一方、ステップＳ２においてμ_ｉが閾値μ_ＴＨ以上である場合、左右のステレオ・オーディオ信号の相関が弱いと判定してステップＳ３に進み、ステップゲインαの値を１にセットするようステレオエコーパス推定制御器６４を介してステレオ適応フィルタ６２を制御する。同時にステップゲインα´の値を０にセットするようステレオエコーパス推定制御器６４を介してモノラル適応フィルタ６３を制御する。 On the other hand, if μ _i is greater than or equal to the threshold μ _TH in step S2, it is determined that the correlation between the left and right stereo audio signals is weak, the process proceeds to step S3, and the stereo echo path estimation is performed so that the value of step gain α is set to 1. The stereo adaptive filter 62 is controlled via the controller 64. At the same time, the monaural adaptive filter 63 is controlled via the stereo echo path estimation controller 64 so that the value of the step gain α ′ is set to zero.

ステップＳ３、Ｓ４が終了すると、共にステップＳ５に進み、監視対象のフレームを次のフレームに進めてステップＳ１に戻る。このようにして、時系列に送信されてくるステレオパラメタ信号ｐ_ｉｊないしは、ステレオパラメタ信号ｐ_ｉｊから生成されたチャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）の監視を繰り返す。 When steps S3 and S4 are completed, the process proceeds to step S5, advances the frame to be monitored to the next frame, and returns to step S1. In this way, the monitoring of the inter-channel transfer functions G _Rij (z ′) and G _Lij (z ′) generated from the stereo parameter signal p _ij or the stereo parameter signal p _ij transmitted in time series is repeated.

左右のステレオ・オーディオ信号間に強い相関がある場合には、ステレオ適応フィルタ６２での疑似ステレオエコーパス特性の学習を続けても正しい方向に収束せずに誤推定を招くために、音響エコーキャンセル効果が著しく低下する問題を招いていた。しかしながら、本実施形態の音響エコーキャンセラでは、左右のステレオ・オーディオ信号間に強い相関がある場合にはステレオ適応フィルタ６２での疑似ステレオエコーパス特性の学習を停止して、相関が弱くなると学習を再開させるため、安定した音響エコーキャンセル効果を実現できる。 When there is a strong correlation between the left and right stereo audio signals, acoustic echo cancellation is not possible because the stereo adaptive filter 62 does not converge in the correct direction even if learning of the pseudo stereo echo path characteristics is continued. This has caused a problem that the effect is remarkably reduced. However, in the acoustic echo canceller of this embodiment, when there is a strong correlation between the left and right stereo audio signals, the learning of the pseudo stereo echo path characteristic in the stereo adaptive filter 62 is stopped, and the learning is performed when the correlation becomes weak. In order to resume, a stable acoustic echo cancellation effect can be realized.

このように、本実施の形態の音響エコーキャンセラでは、左右のステレオ・オーディオ信号間に強い相関がある場合には、ステレオ適応フィルタ６２での疑似ステレオエコーパス特性の学習を停止させて誤推定を防ぐため、音響エコーキャンセル能力の低下を防ぐことができる。 As described above, in the acoustic echo canceller according to the present embodiment, when there is a strong correlation between the left and right stereo audio signals, the pseudo adaptive echo path characteristic learning in the stereo adaptive filter 62 is stopped and erroneous estimation is performed. In order to prevent this, it is possible to prevent a decrease in acoustic echo cancellation capability.

また、オーディオ信号復号部２で用いられているチャンネル間相関情報の符号化コードであるステレオパラメタ信号ｐ_ｉｊないしは、ステレオパラメタ信号ｐ_ｉｊから生成されたチャンネル間伝達関数Ｇ_Ｒｉｊ（ｚ´）、Ｇ_Ｌｉｊ（ｚ´）を利用して左右のステレオ・オーディオ信号間の相関の強さを判定するため、新たな相関検出手段を必要としない。従って、経済的に安定的なキャンセル能力を有する音響エコーキャンセラを実現することができる。 Further, the stereo parameter signal p _ij which is the encoded code of the inter-channel correlation information used in the audio signal decoding unit 2 or the inter-channel transfer function G _Rij (z ′), G generated from the stereo parameter signal p _ij _Since the strength of correlation between the left and right stereo audio signals is determined using _Lij (z ′), no new correlation detection means is required. Therefore, an acoustic echo canceller having an economically stable cancellation capability can be realized.

なお、左右のステレオ・オーディオ信号間の相関が強い場合に、ステップゲインαの値はゼロに固定的に制御される必要はなく、各種特性などに応じてゼロに近い所定の値に制御するようにしてもよい。 Note that when the correlation between the left and right stereo audio signals is strong, the value of the step gain α does not need to be fixedly controlled to zero, and is controlled to a predetermined value close to zero according to various characteristics. It may be.

（第２の実施形態）
上述した実施形態においては、左右のステレオ・オーディオ信号間の相関が強い区間が１区間出現した場合において、疑似ステレオエコーパス特性の学習を停止させることで誤推定を防ぎエコーキャンセル能力の低下を防ぐよう制御されていた。一方、本実施形態の音響エコーキャンセラでは、左右のステレオ・オーディオ信号間の相関が強い区間が複数区間出現した場合に、同区間の間に学習した疑似モノラルエコーパス特性を用いて疑似ステレオエコーパス特性を推定することで、エコーキャンセル能力をさらに向上させる点が異なっている。 (Second Embodiment)
In the above-described embodiment, when one section having a strong correlation between the left and right stereo audio signals appears, the false stereo echo path characteristic learning is stopped to prevent erroneous estimation and prevent the echo canceling ability from deteriorating. It was controlled as such. On the other hand, in the acoustic echo canceller according to the present embodiment, when a plurality of sections having a strong correlation between the left and right stereo audio signals appear, the pseudo stereo echo path using the pseudo monaural echo path characteristics learned during the section is used. The point that the echo cancellation capability is further improved by estimating the characteristics is different.

本実施形態の音響エコーキャンセラの構成は、図１用いて説明した第１の実施形態と同様であるので説明を省略する。 The configuration of the acoustic echo canceller of this embodiment is the same as that of the first embodiment described with reference to FIG.

本実施形態の音響エコーキャンセラも、第１の実施形態と同様に、ステレオエコーキャンセラ制御器６１において左右のチャンネル間伝達関数の比率変化指標μ_ｉを定常的に監視する。フレームｉからフレームｉ―ｖ_ｉ＋１の区間において比率変化指標が閾値μ_ＴＨを下回り、また、フレームｉ―κ_ｉからフレームｉ―κ_ｉ―ｖ_ｉ―κｉ＋１の区間においても比率変化指標がμ_ＴＨを下回り、さらに、これらの区間におけるチャンネル間相関情報であるステレオパラメタ信号ｐ_ｉｊとｐ_{ｉ―κｉｊ}とが異なる場合、これらの区間の間に学習した疑似モノラルエコーパス特性を用いて疑似ステレオエコーパス特性を推定する。 Similarly to the first embodiment, the acoustic echo canceller of the present embodiment also regularly monitors the ratio change index μ _i of the transfer function between the left and right channels in the stereo echo canceller controller 61. Ratio change index in the interval of frame _i-v i +1 from the frame i is below the threshold _{mu TH,} The ratio variation index even in frame i-kappa _i interval -v _i-.kappa.i +1 from frame i-kappa _i is mu _If the stereo parameter signals p _ij and p _i-κij that are the correlation information between channels in these sections are different from each _other and differ, the pseudo stereo echo using the pseudo monaural echo path characteristics learned during these sections is used. Estimate path characteristics.

具体的には、フレームｉの疑似モノラルエコーパス特性ハットＨ_ｉ（ｚ）、フレームｉ―κ_ｉの疑似モノラルエコーパス特性ハットＨ_ｉ―κｉ（ｚ）、及び、式（２４）に定義された、各々に対応するステレオ生成伝達関数Ｇ_ＲＬｉ（ｚ）、Ｇ_{ＲＬｉ−κｉ}（ｚ）を用いて、周知の式（２５）の変換式に従い、左右の疑似ステレオエコーパス特性ハットＨ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）を推定する。

Specifically, the pseudo monaural echo path characteristic hat H _i (z) of the frame i, the pseudo monaural echo path characteristic hat H _i-κi (z) of the frame i-κ _i , and the expression (24) are defined. , Left and right pseudo stereo echo path characteristic hats H _Li (z) according to the well-known conversion equation (25) using the stereo generation transfer functions G _RLi (z) and G _RLi-κi (z) corresponding to each of them. , And estimate the hat H _Ri (z).

上記のように二区間の疑似モノラルエコーパス特性を用いた疑似ステレオエコーパス特性の推定は、ステレオエコーキャンセラ制御器６１からの制御指示に基づきステレオエコーパス推定制御器６４で行われる。推定された疑似ステレオエコーパス特性Ｈ_Ｌｉ（ｚ）、ハットＨ_Ｒｉ（ｚ）は、ステレオ適応フィルタ６２にセットされ、引き続き行われる疑似ステレオエコーパス特性の学習に用いられる。 As described above, the estimation of the pseudo stereo echo path characteristic using the two-section pseudo monaural echo path characteristic is performed by the stereo echo path estimation controller 64 based on the control instruction from the stereo echo canceller controller 61. The estimated pseudo stereo echo path characteristics H _Li (z) and hat H _Ri (z) are set in the stereo adaptive filter 62 and used for subsequent pseudo stereo echo path characteristics learning.

このように、本実施の形態の音響エコーキャンセラでは、左右のステレオ・オーディオ信号間に強い相関がある区間が複数区間出現した場合に、同区間で推定された疑似モノラルエコーパス特性を用いて、左右の疑似ステレオエコーパス特性を推定する。これにより、音響エコーキャンセル能力がさらに向上する。なお、出現する強い相関がある複数区間は、連続していてもかまわない。 Thus, in the acoustic echo canceller of the present embodiment, when a plurality of sections having a strong correlation between the left and right stereo audio signals appear, using the pseudo monaural echo path characteristic estimated in the same section, Estimate the left and right pseudo stereo echo path characteristics. Thereby, the acoustic echo cancellation capability is further improved. Note that a plurality of sections having a strong correlation that appear may be continuous.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

２…オーディオ信号復号部、６…音響エコーキャンセル部、６２…ステレオ適応フィルタ、 2 ... audio signal decoding unit, 6 ... acoustic echo canceling unit, 62 ... stereo adaptive filter,

Claims

An audio signal decoding unit that decodes the stereo audio signal using a stereo parameter generated based on a correlation between the input left and right stereo audio signals;
A stereo echo estimator for estimating an acoustic echo caused by the decoded stereo audio signal; and an acoustic echo canceling unit for removing the estimated acoustic echo from the input speech signal;
An acoustic echo canceller comprising:
The acoustic echo cancellation unit further includes a stereo echo canceller controller that controls an estimation operation of the acoustic echo in the stereo echo estimator based on the stereo parameter.
Acoustic echo canceller.

The acoustic echo canceller according to claim 1, wherein the stereo echo canceller controller controls an estimation operation of the acoustic echo in the stereo echo estimator based on a variation degree of the stereo parameter.

The stereo echo estimator is composed of a multidimensional adaptive filter, and the stereo echo canceller controller controls a convergence speed of coefficients of the multidimensional adaptive filter based on a degree of variation of the stereo parameter. The acoustic echo canceller according to claim 1.

The stereo echo canceller controller suppresses the estimation operation of the acoustic echo in the stereo echo estimator when the degree of variation of the stereo parameter is smaller than a set threshold value. Item 4. The acoustic echo canceller according to any one of Items 3 to 3.

The stereo echo canceller controller has different values for the first stereo parameter in the first section and the second stereo parameter in the first section and the second section, and When the degree of fluctuation of one stereo parameter and the degree of fluctuation of the second stereo parameter are both smaller than a set threshold, the first and second stereo parameters and the estimation in the first section The acoustic echo canceller according to claim 1, wherein a third echo path characteristic is estimated based on the first echo path characteristic obtained and the second echo path characteristic estimated in the second section. .