JP6537997B2

JP6537997B2 - Echo suppressor, method thereof, program, and recording medium

Info

Publication number: JP6537997B2
Application number: JP2016079702A
Authority: JP
Inventors: 小林　和則; 和則小林
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2016-04-12
Filing date: 2016-04-12
Publication date: 2019-07-03
Anticipated expiration: 2036-04-12
Also published as: JP2017191992A

Description

本発明は、ハンズフリー通話装置においてスピーカからマイクロホンへ回り込んだ音（音響エコー）を抑圧するための技術に関する。 The present invention relates to a technique for suppressing a sound (sound echo) that has traveled from a speaker to a microphone in a hands-free communication device.

エコー抑圧技術の従来技術として特許文献１が知られている。 Patent Document 1 is known as a prior art of echo suppression technology.

特許文献１のエコー抑圧装置の前段に適応フィルタを用いたエコー消去装置を設けた構成を図１に示す。以下、図１の構成の処理の概要を説明する。 FIG. 1 shows a configuration in which an echo cancellation device using an adaptive filter is provided at the front stage of the echo suppression device of Patent Document 1. The outline of the process of the configuration of FIG. 1 will be described below.

適応フィルタ部９１は、再生手段２に入力される受話信号x(t)(以下、単に再生手段２の受話信号ともいう)に対して適応フィルタを用いてフィルタリングを行い、エコー成分の推定値y'₁(t)を求める。ただし、tは離散化された時刻を示すインデックスである。減算部９２は、収音手段３で収音した収音信号y₁(t)からエコー成分の推定値y'₁(t)を減じて、誤差信号y₂(t)を求める。なお、適応フィルタ部９１は、受話信号x(t)と誤差信号y₂(t)とを用いて適応フィルタのフィルタ係数を更新する。周波数領域変換部９３は、時間領域の誤差信号y₂(t)を周波数領域の誤差信号Y₂(ω)に変換する。ただし、ωは周波数を示すインデックスである。周波数領域変換部９４は受話信号x(t)を周波数領域の受話信号X(ω)に変換する。音響結合量推定部９５は、周波数領域の誤差信号Y₂(ω)と周波数領域の受話信号X(ω)とを用いて、音響結合量D(ω)を推定する。エコーレベル推定部９６は、周波数領域の受話信号X(ω)のレベルに音響結合量を乗じて、周波数領域の誤差信号Y₂(ω)に含まれるエコー成分のレベルを推定する。ゲイン取得部９７は、エコー成分のレベルの推定値R(ω)と周波数領域の誤差信号Y₂(ω)とを用いてゲインG(ω)を取得する。乗算部９８は、周波数領域の誤差信号Y₂(ω)にゲインG(ω)を乗じて、周波数領域の送話信号Y₃(ω)を求める。時間領域変換部９９は周波数領域の送話信号Y₃(ω)を時間領域の送話信号y₃(t)に変換し、エコー消去装置９０の出力値として出力する。 The adaptive filter unit 91 performs filtering on the reception signal x (t) (hereinafter, also simply referred to as a reception signal of the reproduction unit 2) input to the reproduction unit 2 using an adaptive filter, and estimates an echo component value y. Find the ₁ (t). Here, t is an index indicating a discrete time. The subtraction unit 92 subtracts the estimated value y ′ ₁ (t) of the echo component from the sound collection signal y ₁ (t) collected by the sound collection means 3 to obtain an error signal y ₂ (t). The adaptive filter unit 91 updates the filter coefficient of the adaptive filter using the reception signal x (t) and the error signal y ₂ (t). The frequency domain conversion unit 93 converts the time domain error signal y ₂ (t) into the frequency domain error signal Y ₂ (ω). Here, ω is an index indicating a frequency. The frequency domain conversion unit 94 converts the reception signal x (t) into the reception signal X (ω) in the frequency domain. The acoustic coupling amount estimation unit 95 estimates the acoustic coupling amount D (ω) using the error signal Y ₂ (ω) in the frequency domain and the reception signal X (ω) in the frequency domain. The echo level estimation unit 96 multiplies the level of the reception signal X (ω) in the frequency domain by the amount of acoustic coupling to estimate the level of the echo component included in the error signal Y ₂ (ω) in the frequency domain. The gain acquisition unit 97 acquires the gain G (ω) using the estimated value R (ω) of the level of the echo component and the error signal Y ₂ (ω) in the frequency domain. The multiplication unit 98 multiplies the error signal Y ₂ (ω) in the frequency domain by the gain G (ω) to obtain the transmission signal Y ₃ (ω) in the frequency domain. The time domain conversion unit 99 converts the transmission signal Y ₃ (ω) in the frequency domain into the transmission signal y ₃ (t) in the time domain, and outputs it as an output value of the echo canceller 90.

このような構成により、特許文献１の構成では、前段の適応フィルタを用いたエコー消去装置で消去しきれなかった残留エコーの抑圧を行う。 With such a configuration, in the configuration of Patent Document 1, the residual echo that can not be erased by the echo cancellation apparatus using the adaptive filter at the previous stage is suppressed.

特開２００８−５０９４号公報JP 2008-5094 A

前段の適応フィルタを用いたエコー消去装置のエコー消去が安定していれば、特許文献１のエコー抑圧装置で残留エコーを抑圧可能である。 If the echo cancellation of the echo cancellation apparatus using the preceding adaptive filter is stable, the echo suppression apparatus of Patent Document 1 can suppress the residual echo.

しかしながら、再生手段２のスピーカユニットやスピーカアンプで再生音に歪が生じるような過大な受話信号が特許文献１のエコー抑圧装置に入力されると、その歪成分は前段の適応フィルタを用いたエコー消去装置では消去できず、大きな残留エコーとなる。従来のエコー抑圧装置では、急激に増加する残留エコーに対応できず、抑圧が十分に行われないという問題が生じる。なお、前段に適応フィルタを用いたエコー消去装置を設けずに、特許文献１のエコー抑圧装置を単体でエコー抑圧装置として用いた場合にも同様に、急激に増加するエコーに対応できず、抑圧が十分に行われないという問題が生じる。 However, when an excessive reception signal that causes distortion in the reproduced sound by the speaker unit or the speaker amplifier of the reproduction means 2 is input to the echo suppression apparatus of Patent Document 1, the distortion component is an echo using the adaptive filter of the previous stage. The erasing device can not erase the signal, resulting in a large residual echo. The conventional echo suppressor can not cope with the rapidly increasing residual echo, resulting in a problem that the suppression is not sufficiently performed. Even when the echo suppression apparatus of Patent Document 1 is used alone as an echo suppression apparatus without providing an echo cancellation apparatus using an adaptive filter in the previous stage, it can not cope with the rapidly increasing echo as well. There is a problem that is not done enough.

本発明は、過大な受話信号が入力されスピーカユニットやスピーカアンプで歪が生じても、安定してエコーを抑圧することができるエコー抑圧装置、その方法、プログラム、及び記録媒体を提供することである。 The present invention provides an echo suppression apparatus capable of stably suppressing an echo even when distortion occurs in a speaker unit or a speaker amplifier by inputting an excessive reception signal, a method, a program, and a recording medium. is there.

上記の課題を解決するために、本発明の一態様によれば、エコー抑圧装置は、周波数領域の収音信号に基づく値と、周波数領域の受話信号との比から、再生手段と収音手段との間の音響結合量を周波数領域毎に推定する音響結合量推定部と、周波数領域の受話信号のレベルに音響結合量を乗じて収音信号に含まれるエコー成分のレベルを周波数帯域毎に推定するエコーレベル推定部と、受話信号のレベルと、エコー成分のレベルの推定値と、収音信号のレベルとを用いて、再生手段において受話信号を再生する際に、受話信号のレベルが大きいために再生音に歪が生じる可能性がある場合、仮に歪が生じない場合に用いるゲインよりも、抑圧量の大きいゲインG(ω)を周波数毎に求めるゲイン取得部と、周波数領域の収音信号に基づく値にゲインG(ω)を乗じるエコー抑圧部とを含む。 In order to solve the above-mentioned problems, according to one aspect of the present invention, an echo suppression apparatus is provided with a reproduction means and a sound collection means from the ratio of a value based on a collected sound signal in the frequency domain to a reception signal in the frequency domain. And the level of the reception signal in the frequency domain multiplied by the amount of acoustic coupling to calculate the level of the echo component contained in the collected signal for each frequency band. When reproducing the reception signal in the reproduction means using the echo level estimation unit to be estimated, the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large If there is a possibility that distortion occurs in the reproduced sound, a gain acquisition unit for obtaining, for each frequency, a gain G (ω) having a larger amount of suppression than the gain used if distortion does not temporarily occur, and sound collection in the frequency domain Gay to value based on signal And an echo suppression unit multiplied by the G (ω).

上記の課題を解決するために、本発明の他の態様によれば、エコー抑圧方法は、周波数領域の収音信号に基づく値と、周波数領域の受話信号との比から、再生手段と収音手段との間の音響結合量を周波数領域毎に推定する音響結合量推定ステップと、周波数領域の受話信号のレベルに音響結合量を乗じて収音信号に含まれるエコー成分のレベルを周波数帯域毎に推定するエコーレベル推定ステップと、受話信号のレベルと、エコー成分のレベルの推定値と、収音信号のレベルとを用いて、再生手段において受話信号を再生する際に、受話信号のレベルが大きいために再生音に歪が生じる可能性がある場合、仮に歪が生じない場合に用いるゲインよりも、抑圧量の大きいゲインG(ω)を周波数毎に求めるゲイン取得ステップと、周波数領域の収音信号に基づく値にゲインG(ω)を乗じるエコー抑圧ステップとを含む。 In order to solve the above problems, according to another aspect of the present invention, an echo suppression method comprises: reproducing means and sound collection from the ratio of a value based on a collected sound signal in the frequency domain to a reception signal in the frequency domain An acoustic coupling amount estimation step of estimating the acoustic coupling amount between the means and the frequency domain, and multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount to obtain the level of the echo component included in the collected signal per frequency band When the receiving signal is reproduced by the reproduction means using the echo level estimation step to be estimated, the level of the receiving signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the receiving signal is If distortion is likely to occur in the reproduced sound because it is large, a gain acquisition step for obtaining for each frequency a gain G (ω) having a larger amount of suppression than a gain used if distortion does not temporarily occur, and sound Comprising a value based on the No. and echo suppression step of multiplying a gain G (ω).

本発明によれば、過大な受話信号が入力されスピーカユニットやスピーカアンプで歪が生じても、安定してエコーを抑圧することができるという効果を奏する。 According to the present invention, it is possible to stably suppress an echo even if an excessive reception signal is input and distortion occurs in the speaker unit or the speaker amplifier.

従来技術のエコー消去装置の機能ブロック図。FIG. 1 is a functional block diagram of a prior art echo canceler. 第一実施形態に係るエコー消去装置の機能ブロック図。FIG. 2 is a functional block diagram of an echo cancellation apparatus according to the first embodiment. 第一実施形態に係るエコー消去装置の処理フローの例を示す図。The figure which shows the example of the processing flow of the echo cancellation apparatus which concerns on 1st embodiment. 第一実施形態に係るゲイン取得部の機能ブロック図。FIG. 2 is a functional block diagram of a gain acquisition unit according to the first embodiment. 第一実施形態に係るゲイン取得部の処理フローの例を示す図。The figure which shows the example of the processing flow of the gain acquisition part which concerns on 1st embodiment. 第二実施形態に係るゲイン取得部の機能ブロック図。FIG. 8 is a functional block diagram of a gain acquisition unit according to a second embodiment. 第二実施形態に係るゲイン取得部の処理フローの例を示す図。The figure which shows the example of the processing flow of the gain acquisition part which concerns on 2nd embodiment. 第三実施形態に係るゲイン取得部の機能ブロック図。The functional block diagram of the gain acquisition part which concerns on 3rd embodiment. 第三実施形態に係るゲイン取得部の処理フローの例を示す図。The figure which shows the example of the processing flow of the gain acquisition part which concerns on 3rd embodiment.

以下、本発明の実施形態について、説明する。なお、以下の説明に用いる図面では、同じ機能を持つ構成部や同じ処理を行うステップには同一の符号を記し、重複説明を省略する。ベクトルや行列の各要素単位で行われる処理は、特に断りが無い限り、そのベクトルやその行列の全ての要素に対して適用されるものとする。 Hereinafter, embodiments of the present invention will be described. In the drawings used in the following description, the same reference numerals are given to constituent parts having the same functions and steps for performing the same processing, and redundant description will be omitted. The processing performed for each element of a vector or matrix is applied to all elements of the vector or matrix unless otherwise noted.

＜第一実施形態＞
図２は第一実施形態に係るエコー抑圧装置１００の機能ブロック図を、図３はその処理フローの例を示す。 First Embodiment
FIG. 2 shows a functional block diagram of the echo suppressor 100 according to the first embodiment, and FIG. 3 shows an example of its processing flow.

エコー抑圧装置１００は、適応フィルタ部１０１、減算部１０２、周波数領域変換部１０３、周波数領域変換部１０４、音響結合量推定部１０５、エコーレベル推定部１０６、過大レベル検出部１１０、ゲイン取得部１２０、エコー抑圧部１０８及び時間領域変換部１０９を含む。 The echo suppression apparatus 100 includes an adaptive filter unit 101, a subtraction unit 102, a frequency domain conversion unit 103, a frequency domain conversion unit 104, an acoustic coupling amount estimation unit 105, an echo level estimation unit 106, an excessive level detection unit 110, and a gain acquisition unit 120. , And an echo suppression unit 108 and a time domain conversion unit 109.

エコー抑圧装置１００は、再生手段２で再生する受話信号x(t)と収音手段３で収音した収音信号y₁(t)とを入力とし、収音信号y₁(t)からエコー成分の推定値を消去及び抑圧した送話信号y₃(t)を求め、出力する。以下、各部の処理内容を説明する。 The echo suppressor 100 receives the reception signal x (t) reproduced by the reproduction means 2 and the collected signal y ₁ (t) collected by the sound collecting means 3 as an input, and the echo signal y ₁ (t) A transmission signal y ₃ (t) obtained by eliminating and suppressing the estimated value of the component is obtained and output. The processing content of each part will be described below.

再生手段２は、スピーカ、スピーカユニット、スピーカアンプ等からなり、受話信号x(t)を再生する。収音手段３はマイクロホン等からなり、収音信号y₁(t)を出力する。 The reproduction means 2 is composed of a speaker, a speaker unit, a speaker amplifier, etc., and reproduces the received signal x (t). The sound collection means 3 is composed of a microphone or the like, and outputs a sound collection signal y ₁ (t).

＜適応フィルタ部１０１＞
適応フィルタ部１０１は、再生手段２の受話信号x(t)と誤差信号y₂(t)とを受け取り、これらの値を用いて、収音手段３の収音信号y₁(t)に含まれるエコー成分の推定値y'₁(t)を求め（Ｓ１０１）、出力する。 <Adaptive filter unit 101>
The adaptive filter unit 101 receives the reception signal x (t) of the reproduction means 2 and the error signal y ₂ (t), and uses these values to be included in the sound collection signal y ₁ (t) of the sound collection means 3. An estimated value y ′ ₁ (t) of the echo component to be obtained is obtained (S 101) and output.

例えば、受話信号x(t)と後述するフィルタ係数H(t)を用いて、次式により、推定値y'₁(t)を求める。
y'₁(t)=H(t)^TX(t) (1)
H(t)=(h(0), h(1), ... , h(L-1))^T (2)
X(t)=(x(t), x(t-1), ... , x(t-L+1))^T (3)
ただし、上付き添え字Tは転置を表し、A^TはベクトルAの転置を表し、Lは適応フィルタのタップ長を表す。 For example, using the reception signal x (t) and a filter coefficient H (t) described later, an estimated value y ′ ₁ (t) is obtained by the following equation.
y ' ₁ (t) = H (t) ^T X (t) (1)
H (t) = (h (0), h (1), ..., h (L-1)) ^T (2)
X (t) = (x (t), x (t-1), ..., x (t-L + 1)) ^T (3)
However, the superscript T denotes the transpose, A ^T denotes the transpose of a vector A, L represents a tap length of the adaptive filter.

ここで、フィルタ係数H(t)は、適応フィルタ部１０１内部の図示しないフィルタ係数更新部において、更新される。例えば、NLMSアルゴリズムを用いる場合には次式によりフィルタ係数H(t)を更新する。
H(t+1)=H(t)+aX(t)y₂(t)/X(t)^TX(t) (4)
0<a<2 (5)
ただし、aはNLMSアルゴリズムのステップサイズを表す。フィルタ係数H(t)の更新方法や求め方はこの方法に限らず、従来の方法を用いればよい。 Here, the filter coefficient H (t) is updated in a filter coefficient update unit (not shown) inside the adaptive filter unit 101. For example, when using the NLMS algorithm, the filter coefficient H (t) is updated by the following equation.
H (t + 1) = H (t) + aX (t) y 2 (t) / X (t) T X (t) (4)
0 <a <2 (5)
Where a represents the step size of the NLMS algorithm. The method of updating or determining the filter coefficient H (t) is not limited to this method, and a conventional method may be used.

＜減算部１０２＞
減算部１０２は、収音手段３の収音信号y₁(t)とエコー成分の推定値y'₁(t)とを受け取り、その差分y₁(t)-y'₁(t)を求め（Ｓ１０２）、誤差信号y₂(t)(=y₁(t)-y'₁(t))として出力する。 <Subtractor 102>
The subtraction unit 102 receives the sound collection signal y ₁ (t) of the sound collection means 3 and the estimated value y ′ ₁ (t) of the echo component, and obtains the difference y ₁ (t) −y ′ ₁ (t) (S102) The error signal y ₂ (t) (= y ₁ (t) −y ′ ₁ (t)) is output.

＜周波数領域変換部１０３及び周波数領域変換部１０４＞
周波数領域変換部１０３は、誤差信号y₂(t)を受け取り、周波数領域の誤差信号Y₂(ω)に変換し（Ｓ１０３）、出力する。変換方法としてはFFT(短時間フーリエ変換)等を用いることができる。 <Frequency domain transform unit 103 and frequency domain transform unit 104>
The frequency domain conversion unit 103 receives the error signal y ₂ (t), converts it into an error signal Y ₂ (ω) in the frequency domain (S 103), and outputs it. As a conversion method, FFT (short time Fourier transform) or the like can be used.

周波数領域変換部１０４は、受話信号x(t)を受け取り、周波数領域変換部１０３と同様の変換方法を用いて、周波数領域の受話信号X(ω)に変換し（Ｓ１０４）、出力する。 The frequency domain conversion unit 104 receives the reception signal x (t), converts it to the reception signal X (ω) in the frequency domain using the same conversion method as the frequency domain conversion unit 103 (S104), and outputs it.

＜音響結合量推定部１０５＞
音響結合量推定部１０５は、周波数領域の誤差信号Y₂(ω)と周波数領域の受話信号X(ω)とを受け取り、誤差信号Y₂(ω)と、周波数領域の受話信号との比から、再生手段と収音手段との間の音響結合量D(ω)を周波数領域毎に推定し（Ｓ１０５）、出力する。例えば、音響結合量D(ω)は、再生手段２と収音手段３との間の伝達特性の振幅値であり、周波数領域の誤差信号Y₂(ω)と周波数領域の受話信号X(ω)の絶対値の比で求められる。また、音響結合量の精度を向上するために時間平滑化が行われる。音響結合量D(ω)は次式により求められる。
D(ω)=E{|Y₂(ω)|/|X(ω)|} (6)
ただし、E{A}はAの平均値を取ることを表し、|A|はAの絶対値をとることを表す。 <Acoustic Coupling Amount Estimation Unit 105>
The acoustic coupling amount estimation unit 105 receives the error signal Y ₂ (ω) in the frequency domain and the reception signal X (ω) in the frequency domain, and determines the ratio between the error signal Y ₂ (ω) and the reception signal in the frequency domain. The acoustic coupling amount D (ω) between the reproduction means and the sound collection means is estimated for each frequency domain (S105) and output. For example, the acoustic coupling amount D (ω) is an amplitude value of the transfer characteristic between the reproducing means 2 and the sound collecting means 3, and the error signal Y ₂ (ω) in the frequency domain and the received signal X (ω) in the frequency domain The ratio of the absolute value of In addition, time smoothing is performed to improve the accuracy of the amount of acoustic coupling. The acoustic coupling amount D (ω) is obtained by the following equation.
D (ω) = E {| Y ₂ (ω) | / | X (ω) |} (6)
However, E {A} represents taking an average value of A, and | A | represents taking an absolute value of A.

＜エコーレベル推定部１０６＞
エコーレベル推定部１０６は、周波数領域の受話信号X(ω)と音響結合量D(ω)とを受け取り、周波数領域の受話信号X(ω)のレベルに音響結合量D(ω)を乗じて収音信号に含まれるエコー成分のレベルを周波数帯域毎に推定し（Ｓ１０６）、推定値R(ω)を出力する。 <Echo level estimation unit 106>
The echo level estimation unit 106 receives the reception signal X (ω) in the frequency domain and the acoustic coupling amount D (ω), and multiplies the level of the reception signal X (ω) in the frequency domain by the acoustic coupling amount D (ω). The level of the echo component included in the collected signal is estimated for each frequency band (S106), and an estimated value R (ω) is output.

例えば、部屋の反響を無視した場合、エコー成分のレベルは、受話信号X(ω)に音響結合量D(ω)を乗じることで推定可能できる。しかし、実際には部屋の音響が存在するため、反響成分も含めてエコー成分を推定する必要がある。通常、部屋の音響成分は時間とともに指数減衰するので、次式により、エコー成分のレベルの推定を行う。
R(ω)=D(ω)・P(ω)
P(ω)=|X(ω)| for P'(ω)≦|X(ω)|
P(ω)=u・P'(ω)+(l-u)・|X(ω)| for P'(ω)>|X(ω)| (7)
ただし、P(ω)は反響に相当する時間平滑を行ったあとの受話信号であり、P'(ω)は1フレーム前のP(ω)であり、uは反響の長さ（残響時間）の想定値を調整するための係数でありあらかじめ固定値が設定される。uは例えば0≦u<1の値をとり、1に近いほど残響時間の長い環境が模擬され、0に近いほど残響時間の短い環境が模擬される。 For example, when the room echo is ignored, the level of the echo component can be estimated by multiplying the reception signal X (ω) by the acoustic coupling amount D (ω). However, since there is room sound in practice, it is necessary to estimate the echo component including the echo component. Usually, since the acoustic component of the room decays exponentially with time, the level of the echo component is estimated by the following equation.
R (ω) = D (ω) · P (ω)
P (ω) = | X (ω) | for P '(ω) ≦ | X (ω) |
P (ω) = u · P ′ (ω) + (lu) · | X (ω) | for P ′ (ω)> | X (ω) | (7)
Where P (ω) is the received signal after time smoothing corresponding to the echo, P ′ (ω) is P (ω) one frame before, and u is the echo length (reverberation time) It is a coefficient for adjusting the assumed value of and a fixed value is set in advance. For example, u takes a value of 0 ≦ u <1, and an environment with longer reverberation time is simulated as closer to 1 and an environment with shorter reverberation time is simulated as closer to 0.

＜過大レベル検出部１１０＞
過大レベル検出部１１０は、時間領域の受話信号x(t)を受け取り、受話信号x(t)のレベルs(t)を求め（Ｓ１１０）、出力する。受話信号x(t)のレベルs(t)は、受話信号x(t)の絶対値s(t)=|x(t)|や、絶対値を平滑化した信号s(t)=α・s'(t)+(l-α)・|x(t)|や、以下の式で計算される受話信号x(t)の最大値保持レベルを用いる。
s(t)=|x(t)| for s'(t)≦|x(t)|
s(t)=α・s'(t)+(l-α)・|x(t)| for s'(t)>|x(t)|
s(t)は反響に相当する時間平滑を行ったあとの受話信号であり、s'(t)は1フレーム前のs(t)であり、αは平滑化関数であり、0から１の間の値をとる。 <Excessive Level Detection Unit 110>
The excessive level detection unit 110 receives the reception signal x (t) in the time domain, obtains the level s (t) of the reception signal x (t) (S110), and outputs it. The level s (t) of the reception signal x (t) is the absolute value s (t) = | x (t) | of the reception signal x (t) or the signal s (t) =. Alpha. The maximum value holding level of the reception signal x (t) calculated using the following equation is used: s ′ (t) + (l−α) · x (t) |
s (t) = | x (t) | for s' (t) ≦ | x (t) |
s (t) = α · s ′ (t) + (l−α) · x (t) | for s ′ (t)> | x (t) |
s (t) is a received signal after time smoothing corresponding to the echo, s' (t) is s (t) one frame before, α is a smoothing function, 0 to 1 Take the value between

＜ゲイン取得部１２０＞
ゲイン取得部１２０は、エコー成分のレベルの推定値R(ω)と、受話信号x(t)のレベルs(t)と、周波数領域の誤差信号Y₂(ω)とを受け取り、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性がある場合、仮に歪が生じない場合に用いるゲインよりも、抑圧量の大きいゲインG(ω)を周波数毎に求め（Ｓ１２０）、出力する。 <Gain Acquisition Unit 120>
Gain acquisition section 120 receives estimated value R (ω) of the level of the echo component, level s (t) of reception signal x (t), and error signal Y ₂ (ω) in the frequency domain, and receives reception signal x If there is a possibility that distortion occurs in the reproduced sound because the level s (t) of (t) is large, gain G (ω) with a larger amount of suppression for each frequency than the gain used if distortion does not occur temporarily Determine (S120) and output.

図４はゲイン取得部１２０の機能ブロック図を、図５はその処理フローの例を示す。 FIG. 4 shows a functional block diagram of the gain acquisition unit 120, and FIG. 5 shows an example of its processing flow.

ゲイン取得部１２０は、通常時乗算係数記憶部１２１、過大時乗算係数記憶部１２２、係数選択部１２３、係数乗算部１２４及びエコー抑圧ゲイン取得部１２５を含む。
（通常時乗算係数記憶部１２１及び過大時乗算係数記憶部１２２）
通常時乗算係数記憶部１２１及び過大時乗算係数記憶部１２２には、予め通常時乗算係数γ₁及び過大時乗算係数γ₂をそれぞれ記憶しておく。γ₁＜γ₂とする。 The gain acquisition unit 120 includes a normal time multiplication coefficient storage unit 121, an excessive time multiplication coefficient storage unit 122, a coefficient selection unit 123, a coefficient multiplication unit 124, and an echo suppression gain acquisition unit 125.
(Normal multiplication coefficient storage unit 121 and excessive multiplication coefficient storage unit 122)
The normal multiplication coefficient storage unit 121 and an excessive time multiplier coefficient storage unit 122, previously stored normal multiplication coefficient gamma ₁ and excessive time multiplication factor gamma _2, respectively. It is assumed that γ ₁ <γ ₂ .

（係数選択部１２３）
係数選択部１２３は、受話信号x(t)のレベルs(t)を受け取り、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性がある場合、過大時乗算係数記憶部１２２から過大時乗算係数γ₂を取り出し、係数乗算部１２４に出力する（Ｓ１２３）。また、歪が生じない場合、通常時乗算係数記憶部１２１から通常時乗算係数γ₁を取り出し、係数乗算部１２４に出力する（Ｓ１２３）。例えば、受話信号x(t)のレベルs(t)があらかじめ設定した閾値β₁を超えた場合に、再生音に歪が生じる可能性がある(以下、このレベルを過大レベルともいう)と判定する。閾値β₁は再生手段２に合わせて実験、シミュレーション等により予め調べておけばよい。 (Coefficient selection unit 123)
The coefficient selection unit 123 receives the level s (t) of the reception signal x (t), and if the level s (t) of the reception signal x (t) is large, distortion may occur in the reproduced sound. taking out an excessive time multiplication coefficient gamma ₂ from the time of multiplying the coefficient storage unit 122, and outputs the coefficient multiplication unit 124 (S123). Also, if the strain does not occur, it is taken out normal multiplication coefficient gamma ₁ from normal multiplication coefficient storage unit 121, and outputs the coefficient multiplication unit 124 (S123). For example, if the level of the received signal x (t) s (t) exceeds the threshold value beta ₁ set in advance, there is a possibility that distortion occurs in the reproduced sound (hereinafter, also referred to the level excessive level) determination Do. The threshold value β ₁ may be checked in advance by experiment, simulation or the like in accordance with the regeneration means 2.

（係数乗算部１２４）
係数乗算部１２４は、エコー成分のレベルの推定値R(ω)と、係数選択部１２３において選択された過大時乗算係数γ₂または通常時乗算係数γ₁とを乗じて積(γ₁R(ω)またはγ₂R(ω))を求め（Ｓ１２４）、出力する。乗じた後の信号(γ₁R(ω)またはγ₂R(ω))がエコー成分のレベルの推定値として利用される。γ₁＜γ₂なので、歪が生じる可能性があることを示す場合には、エコー成分のレベルが高く見積もられることとなる。 (Coefficient multiplication unit 124)
Coefficient multiplying unit 124, the estimated value of the level of the echo component R (omega), the product by multiplying the excess time multiplication factor gamma ₂ or normal multiplication coefficient gamma ₁ selected in the coefficient selection unit 123 (γ ₁ R ( ω) or γ ₂ R (ω)) is obtained (S 124) and output. The multiplied signal (γ ₁ R (ω) or γ ₂ R (ω)) is used as an estimate of the level of the echo component. Since γ ₁ <γ ₂ indicates that distortion may occur, the level of the echo component will be highly estimated.

（エコー抑圧ゲイン取得部１２５）
エコー抑圧ゲイン取得部１２５は、積(γ₁R(ω)またはγ₂R(ω))と周波数領域の誤差信号Y₂(ω)とを受け取り、周波数帯域毎に、積(γ₁R(ω)またはγ₂R(ω))と誤差信号Y₂(ω)のレベルとを比較し、積が大きい程抑圧量の大きいゲインを設定し、後述するエコー抑圧部１０８で用いるゲインG(ω)とし（Ｓ１２５）、出力する。 (Echo suppression gain acquisition unit 125)
The echo suppression gain acquisition unit 125 receives the product (γ ₁ R (ω) or γ ₂ R (ω)) and the error signal Y ₂ (ω) in the frequency domain, and calculates the product (γ ₁ R (R) for each frequency band. comparing the level of omega) or γ ₂ R (ω)) and the error signal Y ₂ (omega), and set a large gain suppression amount as the product is large, the gain G (omega used in the echo suppressing unit 108 to be described later ) And output (S125).

例えば、特許文献１と同様の方法により、ゲインを設定することができる。まず、誤差信号Y₂(ω)にエコー成分が多く含まれている場合、積(γ₁R(ω)またはγ₂R(ω))と誤差信号Y₂(ω)のレベルが近い値をとるので、積(γ₁R(ω)またはγ₂R(ω))に予め設定した固定値、例えば1以上の固定係数Cを乗じた値より、誤差信号Y₂(ω)のレベルが小さい場合に、エコー成分が多く含まれる期間として検出する。例えば、誤差信号Y₂(ω)のレベルをW(ω)とすると、この条件は次式で表される。
W(ω)≦C・γR(ω) (8)
ただし、γはγ₁またはγ₂である。なお、誤差信号Y₂(ω)のレベルW(ω)としては、誤差信号Y₂(ω)の絶対値や、絶対値を平滑化した信号を用いればよい。例えば、
W(ω)=|Y(ω)| for W'(ω)≦|Y(ω)|
W(ω)=u・W'(ω)+(l-u)・|Y(ω)| for W'(ω)>|Y(ω)|
とする。ただし、W(ω)は反響に相当する時間平滑を行ったあとの受話信号であり、W'(ω)は1フレーム前のW(ω)であり、uは反響の長さ（残響時間）の想定値を調整するための係数でありあらかじめ固定値が設定される。uは例えば0≦u<1の値をとり、1に近いほど残響時間の長い環境が模擬され、0に近いほど残響時間の短い環境が模擬される。 For example, the gain can be set by the same method as that of Patent Document 1. First, when the error signal Y ₂ (ω) contains many echo components, the level of the product (γ ₁ R (ω) or γ ₂ R (ω)) and the level of the error signal Y ₂ (ω) are close Therefore, the level of the error signal Y ₂ (ω) is smaller than the product (γ ₁ R (ω) or γ ₂ R (ω)) multiplied by a preset fixed value, for example, a fixed coefficient C of 1 or more. In this case, it is detected as a period in which a large amount of echo component is contained. For example, assuming that the level of the error signal Y ₂ (ω) is W (ω), this condition is expressed by the following equation.
W (ω) ≦ C · γR (ω) (8)
However, γ is γ ₁ or γ ₂ . As the level W (omega) of the error signal Y ₂ (omega), the absolute value and the error signal Y ₂ (omega), the absolute value may be used smoothed signal. For example,
W (ω) = | Y (ω) | for W '(ω) ≦ | Y (ω) |
W (ω) = u · W ′ (ω) + (lu) · | Y (ω) | for W ′ (ω)> | Y (ω) |
I assume. Where W (ω) is the received signal after time smoothing corresponding to echo, W '(ω) is W (ω) one frame before, and u is the echo length (reverberation time) It is a coefficient for adjusting the assumed value of and a fixed value is set in advance. For example, u takes a value of 0 ≦ u <1, and an environment with longer reverberation time is simulated as closer to 1 and an environment with shorter reverberation time is simulated as closer to 0.

エコー成分が多く含まれる期間として検出されたら、その帯域の瞬時利得係数g(ω)を、あらかじめ固定値で設定したエコー抑圧量Dに設定する。ただし、エコー抑圧量Dは例えば0≦D<1の値をとり、小さい値にするほどエコー抑圧量が増加するが、ダブルトーク時の近端話者音声の劣化が増加する。次に、エコー成分が多く含まれる期間として検出されなかった場合は、エコー成分が小さいので、瞬時利得係数g(ω)を予め設定した固定値、例えば1に設定し、誤差信号Y₂(ω)をそのまま通過させる。このゲイン制御を式で表せば次式となる。
g(ω)＝D for W(ω)≦C・R(ω)
g(ω)＝1 for W(ω)＞C・R(ω)
次に、瞬時利得係数g(ω)を時間平滑化して、エコー抑圧部１０８に出力するゲインG(ω)を求める。時間平滑化することでゲインの急激な変化による音質劣化を抑えることができる。時間平滑化は、例えば次式のように行われる。
G(ω)＝a・G'(ω)+(l-a)・g(ω) for g(ω)≦G'(ω)
G(ω)＝b・G'(ω)+(l-b)・g(ω) for g(ω)＞G'(ω) (9)
ただし、G'(ω)は1フレーム前のゲインG(ω)である。aはゲイン下降時の平滑化係数、bはゲイン上昇時の平滑化係数であり、あらかじめ固定値で設定される。aとbは0から1の間の値をとり、1に近いほど長い時間での平滑化となり、0に近いほど短い時間での時間平滑化となる。 When it is detected as a period in which a large amount of echo components are contained, the instantaneous gain coefficient g (ω) of that band is set to an echo suppression amount D set in advance as a fixed value. However, the amount of echo suppression D takes, for example, a value of 0 ≦ D <1, and the smaller the value, the more the echo suppression amount increases, but the deterioration of the near-end speaker voice at the time of double talk increases. Next, when the echo component is not detected as a period including a large amount of echo components, the echo component is small, so the instantaneous gain coefficient g (ω) is set to a preset fixed value, for example, 1 and the error signal Y ₂ (ω Let pass). This gain control can be expressed by the following equation.
g (ω) = D for W (ω) ≦ C · R (ω)
g (ω) = 1 for W (ω)> C · R (ω)
Next, the instantaneous gain coefficient g (ω) is time-smoothed to obtain a gain G (ω) to be output to the echo suppression unit 108. By smoothing the time, it is possible to suppress the sound quality deterioration due to the abrupt change of the gain. Temporal smoothing is performed as follows, for example.
G (ω) = a · G ′ (ω) + (la) · g (ω) for g (ω) ≦ G ′ (ω)
G (ω) = b · G ′ (ω) + (lb) · g (ω) for g (ω)> G ′ (ω) (9)
Here, G ′ (ω) is a gain G (ω) one frame before. a is a smoothing coefficient at the time of gain decrease, b is a smoothing coefficient at the time of gain increase, and is set in advance as a fixed value. a and b take values between 0 and 1, and the closer to 1, the longer time is smoothed, and the closer to 0, the shorter time is smoothed.

なお、上述のゲイン取得方法は、例示であって、周波数帯域毎に、積(γ₁R(ω)またはγ₂R(ω))と誤差信号Y₂(ω)のレベルとを比較し、積が大きい程抑圧量の大きいゲインを設定することができれば、他の方法であってもよい。例えば、時間平滑化を行わなず、g(ω)をそのままゲインG(ω)としても用いてもよい。 The above gain acquisition method is an example, and the product (γ ₁ R (ω) or γ ₂ R (ω)) is compared with the level of the error signal Y ₂ (ω) for each frequency band, Other methods may be used as long as the larger the product, the larger the amount of suppression can be set. For example, g (ω) may be used as gain G (ω) as it is without performing time smoothing.

＜エコー抑圧部１０８＞
エコー抑圧部１０８は、周波数領域の誤差信号Y₂(ω)とゲインG(ω)とを受け取り、周波数領域の誤差信号Y₂(ω)にゲインG(ω)を乗じ、送話信号Y₃(ω)(Y₃(ω)=G(ω)Y₂(ω))を求め（Ｓ１０８）、出力する。 <Echo Suppression Unit 108>
Echo suppressing unit 108 receives the error signal Y ₂ in the frequency domain and (omega) and a gain G (omega), multiplied by the gain G (omega) the error signal Y ₂ (omega) of the frequency domain, transmission signal Y ₃ (ω) (Y ₃ (ω) = G (ω) Y ₂ (ω)) is obtained (S 108) and output.

＜時間領域変換部１０９＞
時間領域変換部１０９は、送話信号Y₃(ω)を受け取り、時間領域の送話信号y₃(t)に変換し（Ｓ１０９）、出力する。変換方法としては、周波数領域変換部１０３及び周波数領域変換部１０４で用いた変換方法に対応するものを用いればよい。例えば、IFFT(逆短時間フーリエ変換)等を用いることができる。 <Time domain conversion unit 109>
The time domain conversion unit 109 receives the transmission signal Y ₃ (ω), converts it into a transmission signal y ₃ (t) in the time domain (S 109), and outputs it. As a conversion method, a method corresponding to the conversion method used in the frequency domain conversion unit 103 and the frequency domain conversion unit 104 may be used. For example, IFFT (inverse short time Fourier transform) or the like can be used.

＜効果＞
以上の構成により、過大な受話信号を検出した場合のみ、大きな係数をエコーレベルに乗算することで、スピーカユニットやスピーカアンプの歪によるエコーの増加分を含んだエコーレベルに近い値が推定され、安定したエコー抑圧を行うことが可能である。 <Effect>
With the above configuration, a value close to the echo level including an increase in echo due to distortion of the speaker unit or the speaker amplifier is estimated by multiplying the echo level by a large coefficient only when an excessive reception signal is detected. It is possible to perform stable echo suppression.

＜変形例＞
必ずしも適応フィルタ部１０１、減算部１０２を含まなくともよい。その場合、Ｓ１０３以降の処理では、誤差信号y₂(t)に代えて収音信号y₁(t)を用いればよい。なお、誤差信号y₂(t)は収音信号y₁(t)からエコー成分の推定値y'₁(t)を減じた値であり、収音信号y₁(t)に基づく値と言える。もちろん、収音信号y₁(t)自体も収音信号y₁(t)に基づく値と言える。 <Modification>
The adaptive filter unit 101 and the subtraction unit 102 may not necessarily be included. In that case, in the processing after S103, the collected sound signal y ₁ (t) may be used instead of the error signal y ₂ (t). Incidentally, it can be said from the error signal y ₂ (t) are collected signal y ₁ (t) is a value obtained by subtracting the estimated value y _'1 (t) of the echo component, a value based on the collected signal y ₁ (t) . Of course, the collected signal y ₁ (t) itself can also be said to be a value based on the collected signal y ₁ (t).

本実施形態では、過大レベル検出部１１０において、受話信号x(t)のレベルs(t)を求めるだけだが、レベルs(t)が閾値β₁を超えるか否かを判定し、判定結果を出力する構成としてもよい。係数選択部１２３では、判定結果(レベルs(t)が閾値β₁を超えるか否か、言い換えると、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性があるか否かを示す判定結果)に従って、係数を選択すればよい。 In the present embodiment, the excessive level detection unit 110 only determines the level s (t) of the reception signal x (t), but determines whether the level s (t) exceeds the threshold β ₁ and determines the determination result. It may be configured to output. In coefficient selection section 123, whether the determination result (level s (t) exceeds threshold β ₁ or not, in other words, distortion may occur in the reproduced sound because the level s (t) of reception signal x (t) is large. The coefficient may be selected according to the judgment result indicating whether or not there is a sex.

＜第二実施形態＞
第一実施形態と異なる部分を中心に説明する。
第二実施形態では、ゲイン取得部１２０に代えて、ゲイン取得部２２０を含む。 Second Embodiment
Description will be made focusing on parts different from the first embodiment.
In the second embodiment, a gain acquisition unit 220 is included instead of the gain acquisition unit 120.

＜ゲイン取得部２２０＞
ゲイン取得部２２０は、エコー成分のレベルの推定値R(ω)と、受話信号x(t)のレベルs(t)と、周波数領域の誤差信号Y₂(ω)とを受け取り、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性がある場合、仮に歪が生じない場合に用いるゲインよりも、抑圧量の大きいゲインG(ω)を周波数毎に求め（Ｓ２２０）、出力する。 <Gain Acquisition Unit 220>
Gain acquisition section 220 receives estimated value R (ω) of the level of the echo component, level s (t) of reception signal x (t), and error signal Y ₂ (ω) in the frequency domain, and receives reception signal x If there is a possibility that distortion occurs in the reproduced sound because the level s (t) of (t) is large, gain G (ω) with a larger amount of suppression for each frequency than the gain used if distortion does not occur temporarily Determine (S220) and output.

図６はゲイン取得部２２０の機能ブロック図を、図７はその処理フローの例を示す。 FIG. 6 shows a functional block diagram of the gain acquisition unit 220, and FIG. 7 shows an example of its processing flow.

ゲイン取得部２２０は、エコー抑圧ゲイン取得部２２５、過大時ゲイン記憶部２２６及びゲイン選択部２２７を含む。 The gain acquisition unit 220 includes an echo suppression gain acquisition unit 225, an excessive gain storage unit 226, and a gain selection unit 227.

（エコー抑圧ゲイン取得部２２５）
エコー抑圧ゲイン取得部２２５は、エコー成分のレベルの推定値R(ω)と誤差信号Y₂(ω)とを受け取り、エコー成分のレベルの推定値R(ω)と誤差信号Y₂(ω)のレベルとを比較し、エコー成分のレベルの推定値R(ω)が大きい程抑圧量の大きいゲインG(1,ω)を周波数帯域毎に設定し（Ｓ２２５）、出力する。具体的な処理は、エコー抑圧ゲイン取得部１２５と同様であり、積(γ₁R(ω)またはγ₂R(ω))に代えて、エコー成分のレベルの推定値R(ω)を用いる点が異なる。 (Echo suppression gain acquisition unit 225)
The echo suppression gain acquisition unit 225 receives the estimated value R (ω) of the level of the echo component and the error signal Y ₂ (ω), and the estimated value R (ω) of the level of the echo component and the error signal Y ₂ (ω) The gain G (1, .omega.) Having a larger suppression amount is set for each frequency band as the estimated value R (.omega.) Of the level of the echo component is larger for each frequency band (S225). The specific process is the same as that of the echo suppression gain acquisition unit 125, and uses the estimated value R (ω) of the level of the echo component instead of the product (γ ₁ R (ω) or γ ₂ R (ω)). The point is different.

（過大時ゲイン記憶部２２６）
過大時ゲイン記憶部２２６には、予め過大時ゲインG(2,ω)を記憶しておく。なお、エコー抑圧ゲイン取得部２２５で得られるどのようなG(1,ω)に対してもG(2,ω)<G(1,ω)を満たすように、過大時ゲインG(2,ω)を設定する。要は、過大時ゲインG(2,ω)が、ゲインG(1,ω)よりも抑圧量が大きいものとする。 (Excessive gain storage unit 226)
The excessive gain G storage unit 226 stores the excessive gain G (2, ω) in advance. Note that the over-time gain G (2, ω) is satisfied so that G (2, ω) <G (1, ω) is satisfied for any G (1, ω) obtained by the echo suppression gain acquisition unit 225. Set). The point is that the amount of suppression G is larger than the gain G (1, ω).

（ゲイン選択部２２７）
ゲイン選択部２２７は、過大レベル検出部１１０で求めた受話信号x(t)のレベルs(t)を受け取り、受話信号x(t)のレベルs(t)のレベルが大きいために再生音に歪が生じる可能性がある場合、過大時ゲイン記憶部２２６から過大時ゲインG(2,ω)を取り出し、エコー抑圧部１０８で用いるゲインG(ω)として出力し（Ｓ２２７）、歪が生じない場合にはエコー抑圧ゲイン取得部２２５で求めたゲインG(1,ω)をエコー抑圧部１０８で用いるゲインG(ω)として出力する（Ｓ２２７）。例えば、受話信号x(t)のレベルs(t)があらかじめ設定した閾値β₂を超えた場合に、再生音に歪が生じる可能性があると判定する。閾値β₂は再生手段２に合わせて実験、シミュレーション等により予め調べておけばよい。 (Gain selection unit 227)
The gain selection unit 227 receives the level s (t) of the reception signal x (t) obtained by the excessive level detection unit 110, and the level of the level s (t) of the reception signal x (t) is large. If distortion is likely to occur, the in-excess gain G (2, ω) is taken out from the in-excess gain storage unit 226 and is output as the gain G (ω) used by the echo suppression unit 108 (S227). In this case, the gain G (1, ω) obtained by the echo suppression gain acquisition unit 225 is output as the gain G (ω) used by the echo suppression unit 108 (S227). For example, it is determined that if the level of the received signal x (t) s (t) exceeds the threshold value beta ₂ set in advance, there is a possibility that distortion occurs in the reproduced sound. Threshold beta ₂ experiments in accordance with the reproduction unit 2, it is sufficient to examine in advance by simulation or the like.

＜効果＞
このような構成とすることで、第一実施形態と同様の効果を得ることができる。なお、第一実施形態の変形例と本実施形態とを組合せてもよい。 <Effect>
With such a configuration, the same effect as that of the first embodiment can be obtained. A modification of the first embodiment may be combined with the present embodiment.

例えば、本実施形態では、過大レベル検出部１１０において、受話信号x(t)のレベルs(t)を求めるだけだが、レベルs(t)が閾値β₂を超えるか否かを判定し、判定結果を出力する構成としてもよい。ゲイン選択部２２７では、判定結果(レベルs(t)が閾値β₂を超えるか否か、言い換えると、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性があるか否かを示す判定結果)に従って、ゲインを選択すればよい。 For example, in the present embodiment, the excessive level detector 110, but just finding the level s (t) of the received signal x (t), determines whether the level s (t) exceeds the threshold value beta _2, determination The result may be output. In gain selection section 227, whether or not the determination result (level s (t) exceeds threshold β ₂ or not, in other words, distortion may occur in the reproduced sound because level s (t) of reception signal x (t) is large. The gain may be selected according to the judgment result indicating whether or not there is a sex.

＜第三実施形態＞
第一実施形態と異なる部分を中心に説明する。第三実施形態は、第一実施形態と第二実施形態とを組み合わせた構成である。
第三実施形態では、ゲイン取得部１２０に代えて、ゲイン取得部３２０を含む。 Third Embodiment
Description will be made focusing on parts different from the first embodiment. The third embodiment is a configuration in which the first embodiment and the second embodiment are combined.
In the third embodiment, a gain acquisition unit 320 is included instead of the gain acquisition unit 120.

＜ゲイン取得部３２０＞
ゲイン取得部３２０は、エコー成分のレベルの推定値R(ω)と、受話信号x(t)のレベルs(t)と、周波数領域の誤差信号Y₂(ω)とを受け取り、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性がある場合、仮に歪が生じない場合に用いるゲインよりも、抑圧量の大きいゲインG(ω)を周波数毎に求め（Ｓ３２０）、出力する。 <Gain Acquisition Unit 320>
Gain acquisition section 320 receives estimated value R (ω) of the level of the echo component, level s (t) of received signal x (t), and error signal Y ₂ (ω) in the frequency domain, and receives received signal x If there is a possibility that distortion occurs in the reproduced sound because the level s (t) of (t) is large, gain G (ω) with a larger amount of suppression for each frequency than the gain used if distortion does not occur temporarily Obtain (S320) and output.

図８はゲイン取得部３２０の機能ブロック図を、図９はその処理フローの例を示す。 FIG. 8 shows a functional block diagram of the gain acquisition unit 320, and FIG. 9 shows an example of its processing flow.

ゲイン取得部３２０は、通常時乗算係数記憶部１２１、過大時乗算係数記憶部１２２、係数選択部１２３、係数乗算部１２４、エコー抑圧ゲイン取得部３２５、過大時ゲイン記憶部２２６及びゲイン選択部２２７を含む。 The gain acquisition unit 320 is a normal multiplication coefficient storage unit 121, an excess multiplication coefficient storage unit 122, a coefficient selection unit 123, a coefficient multiplication unit 124, an echo suppression gain acquisition unit 325, an excess gain storage unit 226, and a gain selection unit 227. including.

なお、係数選択部１２３で用いる閾値β₁とゲイン選択部２２７で用いる閾値β₂とは、β₁<β₂となるように設定する。 Note that the threshold value beta ₂ used in the threshold beta ₁ and gain selection unit 227 used in the coefficient selector 123 is set to be a β ₁ <β _2.

（エコー抑圧ゲイン取得部３２５）
エコー抑圧ゲイン取得部３２５は、積(γ₁R(ω)またはγ₂R(ω))と周波数領域の誤差信号Y₂(ω)とを受け取り、周波数帯域毎に、積(γ₁R(ω)またはγ₂R(ω))と誤差信号Y₂(ω)のレベルとを比較し、積が大きい程抑圧量の大きいゲインを設定し、ゲインG(1,ω)をゲイン選択部２２７に出力する（Ｓ３２５）。 (Echo suppression gain acquisition unit 325)
The echo suppression gain acquisition unit 325 receives the product (γ ₁ R (ω) or γ ₂ R (ω)) and the error signal Y ₂ (ω) in the frequency domain, and calculates the product (γ ₁ R (R) for each frequency band. comparing the level of omega) or γ ₂ R (ω)) and the error signal Y ₂ (omega), and set a large gain suppression amount as the product is large, the gain G (1, omega) the gain selector 227 (S325).

過大時ゲイン記憶部２２６及びゲイン選択部２２７の処理内容は第二実施形態と同様である。このとき、β₂＜s(t)の場合には、ゲイン選択部２２７において、過大時ゲインG(2,ω)が選択されることは明らかなので、係数選択部１２３、係数乗算部１２４、エコー抑圧ゲイン取得部３２５の処理は省略してもよい。 The processing contents of the excessive gain storage unit 226 and the gain selection unit 227 are the same as in the second embodiment. At this time, in the case of β ₂ <s (t), it is clear that the gain G (2, ω) is selected in the gain selection unit 227, so the coefficient selection unit 123, the coefficient multiplication unit 124, the echo The processing of the suppression gain acquisition unit 325 may be omitted.

＜効果＞
このような構成とすることで、第一実施形態、第二実施形態と同様の効果を得ることができる。本実施形態では、閾値β₁を超え閾値β₂以下の場合は第一実施形態に示すようにエコー成分に過大時乗算係数を乗じ、閾値β₂を超えた場合は第二実施形態に示すようにゲインを強制的に過大時ゲインに置き換える。 <Effect>
With such a configuration, the same effects as those of the first embodiment and the second embodiment can be obtained. In the present embodiment, in the case of less than the threshold value beta ₂ exceeds the threshold value beta ₁ multiplied by the excess time multiplication coefficient to the echo component, as shown in the first embodiment, if the threshold is exceeded beta ₂ as shown in the second embodiment Force the gain to be replaced with the overtime gain.

このようにすることにより、過大な受話信号で歪が、それほど大きくいない場合は、推定エコーを大きく見積もることで対応し、歪が大きい場合は、送話音声を完全に抑える。なお、第一実施形態の変形例と本実施形態とを組合せてもよい。 By doing this, when the distortion is not so large with an excessive reception signal, the estimated echo is largely estimated, and when the distortion is large, the transmission voice is completely suppressed. A modification of the first embodiment may be combined with the present embodiment.

例えば、本実施形態では、過大レベル検出部１１０において、受話信号x(t)のレベルs(t)を求めるだけだが、レベルs(t)が閾値β₁またはβ₂を超えるか否かを判定し、判定結果を出力する構成としてもよい。係数選択部１２３では、判定結果(レベルs(t)が閾値β₁を超えるか否か、言い換えると、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性があるか否かを示す判定結果)に従って、係数を選択すればよい。ゲイン選択部２２７では、判定結果(レベルs(t)が閾値β₂を超えるか否か、言い換えると、受話信号x(t)のレベルs(t)が大きいために再生音に歪が生じる可能性があるか否かを示す判定結果)に従って、ゲインを選択すればよい。 For example, in the present embodiment, the excessive level detector 110, but just finding the level s (t) of the received signal x (t), determines whether the level s (t) exceeds the threshold value beta ₁ or beta ₂ And the determination result may be output. In coefficient selection section 123, whether the determination result (level s (t) exceeds threshold β ₁ or not, in other words, distortion may occur in the reproduced sound because the level s (t) of reception signal x (t) is large. The coefficient may be selected according to the judgment result indicating whether or not there is a sex. In gain selection section 227, whether or not the determination result (level s (t) exceeds threshold β ₂ or not, in other words, distortion may occur in the reproduced sound because level s (t) of reception signal x (t) is large. The gain may be selected according to the judgment result indicating whether or not there is a sex.

＜その他の変形例＞
本発明は上記の実施形態及び変形例に限定されるものではない。例えば、上述の各種の処理は、記載に従って時系列に実行されるのみならず、処理を実行する装置の処理能力あるいは必要に応じて並列的にあるいは個別に実行されてもよい。その他、本発明の趣旨を逸脱しない範囲で適宜変更が可能である。 <Other Modifications>
The present invention is not limited to the above embodiments and modifications. For example, the various processes described above may be performed not only in chronological order according to the description, but also in parallel or individually depending on the processing capability of the apparatus that executes the process or the necessity. In addition, changes can be made as appropriate without departing from the spirit of the present invention.

＜プログラム及び記録媒体＞
また、上記の実施形態及び変形例で説明した各装置における各種の処理機能をコンピュータによって実現してもよい。その場合、各装置が有すべき機能の処理内容はプログラムによって記述される。そして、このプログラムをコンピュータで実行することにより、上記各装置における各種の処理機能がコンピュータ上で実現される。 <Program and Recording Medium>
In addition, various processing functions in each device described in the above-described embodiment and modification may be realized by a computer. In that case, the processing content of the function that each device should have is described by a program. By executing this program on a computer, various processing functions in each of the above-described devices are realized on the computer.

この処理内容を記述したプログラムは、コンピュータで読み取り可能な記録媒体に記録しておくことができる。コンピュータで読み取り可能な記録媒体としては、例えば、磁気記録装置、光ディスク、光磁気記録媒体、半導体メモリ等どのようなものでもよい。 The program describing the processing content can be recorded in a computer readable recording medium. As the computer readable recording medium, any medium such as a magnetic recording device, an optical disc, a magneto-optical recording medium, a semiconductor memory, etc. may be used.

また、このプログラムの流通は、例えば、そのプログラムを記録したＤＶＤ、ＣＤ−ＲＯＭ等の可搬型記録媒体を販売、譲渡、貸与等することによって行う。さらに、このプログラムをサーバコンピュータの記憶装置に格納しておき、ネットワークを介して、サーバコンピュータから他のコンピュータにそのプログラムを転送することにより、このプログラムを流通させてもよい。 Further, this program is distributed, for example, by selling, transferring, lending, etc. a portable recording medium such as a DVD, a CD-ROM or the like in which the program is recorded. Furthermore, the program may be stored in a storage device of a server computer, and the program may be distributed by transferring the program from the server computer to another computer via a network.

このようなプログラムを実行するコンピュータは、例えば、まず、可搬型記録媒体に記録されたプログラムもしくはサーバコンピュータから転送されたプログラムを、一旦、自己の記憶部に格納する。そして、処理の実行時、このコンピュータは、自己の記憶部に格納されたプログラムを読み取り、読み取ったプログラムに従った処理を実行する。また、このプログラムの別の実施形態として、コンピュータが可搬型記録媒体から直接プログラムを読み取り、そのプログラムに従った処理を実行することとしてもよい。さらに、このコンピュータにサーバコンピュータからプログラムが転送されるたびに、逐次、受け取ったプログラムに従った処理を実行することとしてもよい。また、サーバコンピュータから、このコンピュータへのプログラムの転送は行わず、その実行指示と結果取得のみによって処理機能を実現する、いわゆるＡＳＰ（Application Service Provider）型のサービスによって、上述の処理を実行する構成としてもよい。なお、プログラムには、電子計算機による処理の用に供する情報であってプログラムに準ずるもの（コンピュータに対する直接の指令ではないがコンピュータの処理を規定する性質を有するデータ等）を含むものとする。 For example, a computer that executes such a program first temporarily stores a program recorded on a portable recording medium or a program transferred from a server computer in its own storage unit. Then, at the time of execution of the process, the computer reads the program stored in its storage unit and executes the process according to the read program. In another embodiment of the program, the computer may read the program directly from the portable recording medium and execute processing in accordance with the program. Furthermore, each time a program is transferred from this server computer to this computer, processing according to the received program may be executed sequentially. In addition, a configuration in which the above-described processing is executed by a so-called ASP (Application Service Provider) type service that realizes processing functions only by executing instructions and acquiring results from the server computer without transferring the program to the computer It may be Note that the program includes information provided for processing by a computer that conforms to the program (such as data that is not a direct command to the computer but has a property that defines the processing of the computer).

また、コンピュータ上で所定のプログラムを実行させることにより、各装置を構成することとしたが、これらの処理内容の少なくとも一部をハードウェア的に実現することとしてもよい。 In addition, although each device is configured by executing a predetermined program on a computer, at least a part of the processing content may be realized as hardware.

Claims

An acoustic coupling amount estimation unit for estimating an acoustic coupling amount between the reproduction means and the sound pickup means for each frequency domain from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation unit that estimates the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition unit which obtains, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
An echo suppression unit that multiplies the gain G (ω) by a value based on the collected sound signal in the frequency domain;
The gain acquisition unit
If the normal multiplication coefficient is smaller than the excessive multiplication coefficient, and if there is a possibility that distortion occurs in the reproduced sound because the level of the reception signal is large, the excessive multiplication coefficient is selected, and no distortion occurs. And a coefficient selection unit for selecting a multiplication coefficient at a normal time,
A coefficient multiplication unit which obtains a product by multiplying the estimated value of the level of the echo component and the over-time multiplication coefficient or the normal-time multiplication coefficient selected in the coefficient selection unit;
An echo suppression gain acquisition unit which compares, for each frequency band, the product with the level of a value based on the collected sound signal, sets a larger gain as the product is larger, and sets the gain as the gain G (ω); including,
Echo suppressor.

An acoustic coupling amount estimation unit for estimating an acoustic coupling amount between the reproduction means and the sound pickup means for each frequency domain from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation unit that estimates the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition unit which obtains, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
An echo suppression unit that multiplies the gain G (ω) by a value based on the collected sound signal in the frequency domain;
The gain acquisition unit
The estimated value of the level of the echo component is compared with the level of the value based on the collected sound signal, and the larger the estimated value of the level of the echo component, the larger the gain G (1, ω) of the suppression amount is for each frequency band An echo suppression gain acquisition unit to be set;
It is assumed that the excessive gain G (2, ω) has a larger suppression amount than the gain G (1, ω), and if the level of the reception signal is large, distortion may occur in the reproduced sound. Selecting a time gain G (2, ω), and selecting no gain G (1, ω) if distortion does not occur;
Echo suppressor.

An acoustic coupling amount estimation unit for estimating an acoustic coupling amount between the reproduction means and the sound pickup means for each frequency domain from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation unit that estimates the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition unit which obtains, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
An echo suppression unit that multiplies the gain G (ω) by a value based on the collected sound signal in the frequency domain;
The gain acquisition unit
It is assumed that β ₁ <β ₂ , the level of the reception signal is s (t), the normal time multiplication factor is smaller than the overtime multiplication factor, and if β ₁ <s (t) ≦ β ₂ , the overtime multiplication is performed A coefficient selection unit which selects a coefficient, and in the case of s (t) ≦ β ₁ selects a normal multiplication coefficient;
A coefficient multiplication unit which obtains a product by multiplying the estimated value of the level of the echo component and the over-time multiplication coefficient or the normal-time multiplication coefficient selected in the coefficient selection unit;
An echo suppression gain acquisition unit that compares, for each frequency band, the product with the level of a value based on the collected signal, and sets a gain G (1, ω) with a larger amount of suppression as the product is larger;
The overrun gain G (2, ω) is larger than the gain G (1, ω) by the amount of suppression, and in the case of β ₂ <s (t), the overrun gain G (2, ω) is selected, in the case of s (t) ≦ β ₂ selects the gain G (1, ω), and a gain selection unit for the gain G (ω),
Echo suppressor.

An acoustic coupling amount estimation step of estimating, for each frequency domain, an acoustic coupling amount between the reproduction means and the sound collecting means from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation step of estimating the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition step of obtaining, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
Look including the echo suppression step of multiplying the gain G (omega) the value based on the collected sound signal in the frequency domain,
The gain acquisition step is
If the normal multiplication coefficient is smaller than the excessive multiplication coefficient, and if there is a possibility that distortion occurs in the reproduced sound because the level of the reception signal is large, the excessive multiplication coefficient is selected, and no distortion occurs. And a coefficient selection step of selecting a multiplication coefficient normally.
A coefficient multiplication step for obtaining a product by multiplying the estimated value of the level of the echo component and the over-time multiplication coefficient or the normal-time multiplication coefficient selected in the coefficient selection step;
An echo suppression gain acquisition step of comparing the product with the level of a value based on the collected sound signal for each frequency band, setting a larger gain as the product is larger, and setting the gain as the gain G (ω); including,
Echo suppression method.

An acoustic coupling amount estimation step of estimating, for each frequency domain, an acoustic coupling amount between the reproduction means and the sound collecting means from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation step of estimating the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition step of obtaining, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
Look including the echo suppression step of multiplying the gain G (omega) the value based on the collected sound signal in the frequency domain,
The gain acquisition step is
The estimated value of the level of the echo component is compared with the level of the value based on the collected sound signal, and the larger the estimated value of the level of the echo component, the larger the gain G (1, ω) of the suppression amount is for each frequency band An echo suppression gain acquisition step to be set;
It is assumed that the excessive gain G (2, ω) has a larger suppression amount than the gain G (1, ω), and if the level of the reception signal is large, distortion may occur in the reproduced sound. Selecting a gain G (2, ω) and selecting a gain G (1, ω) if no distortion occurs, and selecting the gain G (ω).
Echo suppression method.

An acoustic coupling amount estimation step of estimating, for each frequency domain, an acoustic coupling amount between the reproduction means and the sound collecting means from a ratio between a value based on a collected sound signal in the frequency domain and a reception signal in the frequency domain;
An echo level estimation step of estimating the level of the echo component included in the collected signal by multiplying the level of the reception signal in the frequency domain by the acoustic coupling amount;
When the reproduction signal is reproduced by the reproduction means using the level of the reception signal, the estimated value of the level of the echo component, and the level of the collected signal, the level of the reception signal is large. A gain acquisition step of obtaining, for each frequency, a gain G (ω) having a larger amount of suppression than a gain used if distortion does not occur if distortion may occur in the reproduced sound;
Look including the echo suppression step of multiplying the gain G (omega) the value based on the collected sound signal in the frequency domain,
The gain acquisition step is
It is assumed that β ₁ <β ₂ , the level of the reception signal is s (t), the normal time multiplication factor is smaller than the overtime multiplication factor, and if β ₁ <s (t) ≦ β ₂ , the overtime multiplication is performed select the coefficients, a coefficient selection step of selecting a normal multiplication factor in the case of s (t) ≦ β _1,
A coefficient multiplication step for obtaining a product by multiplying the estimated value of the level of the echo component and the over-time multiplication coefficient or the normal-time multiplication coefficient selected in the coefficient selection step;
An echo suppression gain acquisition step of comparing the product with the level of a value based on the collected sound signal for each frequency band, and setting a gain G (1, ω) with a larger amount of suppression as the product is larger;
The overrun gain G (2, ω) is larger than the gain G (1, ω) by the amount of suppression, and in the case of β ₂ <s (t), the overrun gain G (2, ω) is selected, in the case of s (t) ≦ β ₂ selects the gain G (1, ω), and a gain selection step of the gain G (ω),
Echo suppression method.

A program for causing a computer to function as the echo suppressor according to any one of claims 1 to 3 .

A computer-readable recording medium having recorded thereon a program for causing a computer to function as the echo suppressor according to any one of claims 1 to 3 .