JP4818014B2

JP4818014B2 - Signal processing device

Info

Publication number: JP4818014B2
Application number: JP2006203418A
Authority: JP
Inventors: 隆須藤
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2005-07-28
Filing date: 2006-07-26
Publication date: 2011-11-16
Anticipated expiration: 2026-07-26
Also published as: JP2007060644A

Description

本発明は、信号処理装置に係り、音声信号の高品質化処理に関する。 The present invention relates to a signal processing apparatus and relates to a process for improving the quality of an audio signal.

音声信号の高品質化処理、例えば、通話装置などにおいて、通話を行うに際し、通話信
号以外の信号、すなわち音響エコーなどを抑圧する処理が種々知られている。また、この
処理を、前段の処理と後段の処理との２段階に分けて行うことにより、高い性能が得られ
ることが知られている。 Various processes are known for improving the quality of an audio signal, for example, a process for suppressing a signal other than a call signal, that is, an acoustic echo or the like when making a call in a call device or the like. In addition, it is known that high performance can be obtained by performing this process in two stages of a pre-stage process and a post-stage process.

例えば、前段では、エコー抑圧処理を行い、後段では、後段の出力である、エコーなど
が抑圧された送話出力信号を用いた短時間スペクトラル振幅推定を行い、その推定値に基
づいてエコー抑圧を行う技術が知られている（例えば、特許文献１参照。）。 For example, the first stage performs echo suppression processing, and the second stage performs short-term spectral amplitude estimation using the transmission output signal with echoes suppressed, which is the output of the second stage, and performs echo suppression based on the estimated value. The technique to perform is known (for example, refer patent document 1).

また、前段では適応フィルタによって、マイクロホンから入力された送話入力信号に含
まれるエコーを抑圧する処理を行い、後段では、前段で生じた擬似エコー信号に基づいて
エコーまたはノイズの抑圧を行う技術が知られている（例えば、特許文献２参照。）。
特許第３４２０７０５号公報（第２−７頁、図１）特開２００４−５６４５３号公報（第２−４頁、図１） In the first stage, an adaptive filter performs processing to suppress echo included in the transmission input signal input from the microphone, and in the second stage, technology to suppress echo or noise based on the pseudo echo signal generated in the previous stage. It is known (for example, refer to Patent Document 2).
Japanese Patent No. 3420705 (page 2-7, FIG. 1) JP 2004-56453 A (page 2-4, FIG. 1)

しかしながら、上述した特許文献１に開示されている方法では、前段の線形エコー抑圧
処理（適応フィルタ）と後段の非線形エコー抑圧処理とを結合させた場合、ダブルトーク
かつエコーパス変動時に、エコー経路の音響結合量（エコーパスロス）は大幅に変化しな
いものと仮定して適応フィルタのフィルタ係数を転用する。このため、ダブルトーク中の
エコーパスロスの変動時など適応フィルタが追従できない場合、充分にエコー抑圧できな
い問題点があった。 However, in the method disclosed in Patent Document 1 described above, when the linear echo suppression process (adaptive filter) in the previous stage and the nonlinear echo suppression process in the subsequent stage are combined, the acoustics of the echo path can be detected at the time of double talk and echo path fluctuation. The filter coefficient of the adaptive filter is diverted assuming that the coupling amount (echo path loss) does not change significantly. For this reason, there is a problem that the echo cannot be sufficiently suppressed when the adaptive filter cannot follow, such as when the echo path loss varies during double talk.

また、特許文献２に開示されている方法では、擬似エコー信号を必ず用いるため、適応
フィルタによる前段のエコーパス追従性に、最終の出力である送話出力信号のエコーパス
追従性が依存することになる問題点があった。つまり、前段で、適応フィルタによるエコ
ー推定が精度よくできる前提となっており、急激なエコーパス変動やダブルトーク中のエ
コーパスロスの変動では後段の出力である送話出力信号にエコーが残留しやすい問題点が
あった。 In addition, since the method disclosed in Patent Document 2 always uses a pseudo echo signal, the echo path followability of the transmission output signal that is the final output depends on the echo path followability of the previous stage by the adaptive filter. There was a problem. In other words, it is assumed that the echo estimation by the adaptive filter can be performed with high accuracy in the first stage, and the echo tends to remain in the transmission output signal, which is the second stage output, due to a sudden echo path fluctuation or echo path loss fluctuation during double talk. There was a point.

即ち、上記のような従来技術において、例えばダブルトーク中のエコーパスロス変動や
急激なエコーパス変動時に、線形処理であるか非線形処理であるかを問わず、前段で行わ
れるエコー抑圧の性能が充分でない場合、後段で行われるエコー抑圧処理、またはノイズ
抑圧処理、更には、それらを組み合わせた処理において、前段の出力信号中の残留エコー
により、エコー量推定及びノイズ量推定の精度が劣化する。この劣化のため、後段で行わ
れるエコー抑圧処理の性能、及びノイズ抑圧処理の性能が充分でないという問題があった
。この問題点は、後段で行われるボイススイッチにおけるエコーサプレス処理においても
顕著である。 That is, in the prior art as described above, the performance of echo suppression performed in the preceding stage is not sufficient regardless of whether the processing is linear processing or nonlinear processing, for example, when echo path loss fluctuation or abrupt echo path fluctuation during double talk is performed. In this case, in echo suppression processing or noise suppression processing performed in the subsequent stage, and further in a combination thereof, the accuracy of echo amount estimation and noise amount estimation deteriorates due to residual echo in the output signal of the previous stage. Due to this deterioration, there is a problem that the performance of the echo suppression processing performed at the later stage and the performance of the noise suppression processing are not sufficient. This problem is conspicuous also in echo suppression processing in a voice switch that is performed later.

本発明は、上記問題点を解決するためになされたもので、エコー抑圧性能を向上させ、
またはノイズ抑圧性能を向上させ、エコーパスロスの変動を含むエコーパス変動に対する
頑健性を高めた信号処理装置を提供することを目的とする。 The present invention has been made to solve the above problems, improve the echo suppression performance,
Another object of the present invention is to provide a signal processing apparatus that has improved noise suppression performance and improved robustness against echo path fluctuations including fluctuations in echo path loss.

実施形態によれば、信号処理装置は、入力信号に含まれる少なくともエコーを抑圧してエコー低減信号を出力する第１の信号処理手段と、前記入力信号と前記エコー低減信号とを入力し、エコー及びノイズの少なくとも一方を抑圧する第２の信号処理手段と、前記入力信号と前記エコー低減信号とのパワースペクトル間の相違を示す量を計算することにより前記第１の信号処理手段のエコー抑圧量を算出するエコー抑圧量算出手段と、を備え、前記第２の信号処理手段は、前記エコー抑圧量算出手段によって算出された前記第１の信号処理手段のエコー抑圧量が十分でないと判定される場合は前記入力信号を選択し、それ以外の場合は前記エコー低減信号を選択する選択手段を有し、前記選択手段によって選択された前記入力信号及び前記エコー低減信号のいずれか一方の前記エコー及び前記ノイズの少なくとも一方を抑圧する。 According to the embodiment, the signal processing device inputs the first signal processing means for suppressing the echo included in the input signal and outputting the echo reduction signal, the input signal and the echo reduction signal, And second signal processing means for suppressing at least one of noise, and an echo suppression amount of the first signal processing means by calculating an amount indicating a difference between power spectra of the input signal and the echo reduction signal. And the second signal processing means determines that the echo suppression amount of the first signal processing means calculated by the echo suppression amount calculation means is not sufficient. If selects the input signal, otherwise have a selecting means for selecting the echo reduction signal, the input signal selected by said selecting means and said error It suppresses at least one of either said echo and said noise over reduction signal.

本発明によれば、エコー抑圧性能を向上させ、またはノイズ抑圧性能を向上させ、エコ
ーパスロスの変動を含むエコーパス変動に対する頑健性を高めた信号処理装置を提供する
ことができる。 ADVANTAGE OF THE INVENTION According to this invention, the signal processing apparatus which improved the echo suppression performance or improved the noise suppression performance, and improved the robustness with respect to the echo path fluctuation | variation including the fluctuation | variation of an echo path loss can be provided.

以下に、本発明による信号処理装置の実施の形態を、図面を参照して説明する。 Embodiments of a signal processing apparatus according to the present invention will be described below with reference to the drawings.

（第１の実施形態）
図１は、本発明の第１の実施形態に係る信号処理装置の構成を示すブロック図である。
この信号処理装置は、通信部（ＣＯＭ）１０１と、遅延処理部（ＤＥＬＡＹ）１０２と、
Ｄ／Ａ変換器（Ｄ／Ａ）１０３と、受話増幅器１０４と、スピーカ１０５と、マイクロホ
ン１０６と、送話増幅器１０７と、Ａ／Ｄ変換器（Ａ／Ｄ）１０８と、ハイパスフィルタ
部（ＨＰＦ）１０９と、エコーキャンセラ部（ＥＣ）１１０と、エコーリダクション部（
ＥＲ）１１１とからなる。 (First embodiment)
FIG. 1 is a block diagram showing a configuration of a signal processing apparatus according to the first embodiment of the present invention.
The signal processing apparatus includes a communication unit (COM) 101, a delay processing unit (DELAY) 102,
D / A converter (D / A) 103, receiver amplifier 104, speaker 105, microphone 106, transmitter amplifier 107, A / D converter (A / D) 108, and high-pass filter section (HPF) ) 109, an echo canceller (EC) 110, and an echo reduction unit (
ER) 111.

図２は、エコーキャンセラ部（ＥＣ）１１０の構成を示すブロック図である。このエコ
ーキャンセラ部（ＥＣ）１１０は、遅延処理部（ＤＥＬＡＹ）１０２と接続される適応フ
ィルタ部（ＡＤＦ）１１０ａと、ハイパスフィルタ部（ＨＰＦ）１０９と接続され、また
、後述するエコーリダクション部（ＥＲ）１１１の周波数領域変換処理部（ＦＴ）１１１
ｃと接続される信号減算処理部１１０ｂと、遅延処理部（ＤＥＬＡＹ）１０２と接続され
、後述するエコーリダクション部（ＥＲ）１１１の周波数領域変換処理部（ＦＴ）１１１
ｃ及び制御部（ＣＴＲＬ）１１１ｋと接続されるダブルトーク検出部（ＤＴＤ）１１０ｃ
とからなる。 FIG. 2 is a block diagram illustrating a configuration of the echo canceller unit (EC) 110. The echo canceller unit (EC) 110 is connected to an adaptive filter unit (ADF) 110a connected to the delay processing unit (DELAY) 102 and a high-pass filter unit (HPF) 109, and an echo reduction unit (ER) described later. ) 111 frequency domain transform processing unit (FT) 111
a signal subtraction processing unit 110b connected to c and a delay processing unit (DELAY) 102, and a frequency domain transform processing unit (FT) 111 of an echo reduction unit (ER) 111 described later.
c and a double talk detector (DTD) 110c connected to the controller (CTRL) 111k
It consists of.

図３は、エコーリダクション部（ＥＲ）１１１の構成を示すブロック図である。このエ
コーリダクション部（ＥＲ）１１１は、遅延処理部（ＤＥＬＡＹ）１０２と接続される周
波数領域変換処理部（ＦＴ）１１１ａと、ハイパスフィルタ部（ＨＰＦ）１０９と接続さ
れる周波数領域変換処理部（ＦＴ）１１１ｂと、エコーキャンセラ部（ＥＣ）１１０の信
号減算処理部１１０ｂと接続される周波数領域変換処理部（ＦＴ）１１１ｃと、受話パワ
ー算出部（ＰＯＷ）１１１ｄと、送話パワー算出部（ＰＯＷ）１１１ｅと、残差パワー算
出部（ＰＯＷ）１１１ｆと、音響結合量推定部（ＡＣＬＥ）１１１ｇと、エコー量推定部
（ＥＬＥ）１１１ｈと、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉと、周波数領域ダブル
トーク検出部（ＦＤＴＤ）１１１ｊと、エコーキャンセラ部（ＥＣ）１１０のダブルトー
ク検出部（ＤＴＤ）１１０ｃと接続される制御部（ＣＴＲＬ）１１１ｋと、スペクトル選
択部１１１Ｌと、ゲイン格納部（ＧＴＢＬ）１１１ｍと、エコー抑圧ゲイン算出部（ＧＣ
ＡＬ）１１１ｎと、信号抑圧部（ＳＳ）１１１ｏと、通信部（ＣＯＭ）１０１と接続され
る周波数領域逆変換処理部（ＩＦＴ）１１１ｐとからなる。 FIG. 3 is a block diagram showing a configuration of the echo reduction unit (ER) 111. The echo reduction unit (ER) 111 includes a frequency domain conversion processing unit (FT) 111 a connected to the delay processing unit (DELAY) 102 and a frequency domain conversion processing unit (FT) connected to the high pass filter unit (HPF) 109. ) 111b, a frequency domain conversion processing unit (FT) 111c connected to the signal subtraction processing unit 110b of the echo canceller unit (EC) 110, a received power calculation unit (POW) 111d, and a transmission power calculation unit (POW) 111e, residual power calculation unit (POW) 111f, acoustic coupling amount estimation unit (ACLE) 111g, echo amount estimation unit (ELE) 111h, echo suppression amount estimation unit (ECLE) 111i, and frequency domain double talk Detection unit (FDTD) 111j and double-talk detection unit (DTD) 1 of echo canceller unit (EC) 110 Control unit connected with 0c (CTRL) and 111k, a spectrum selection portion 111L, gain storage section (GTBL) 111m and the echo suppression gain calculator (GC
AL) 111n, a signal suppression unit (SS) 111o, and a frequency domain inverse transform processing unit (IFT) 111p connected to the communication unit (COM) 101.

上記のように構成された、本発明の第１の実施形態に係る信号処理装置の各部の動作を
、図１〜図４を参照して説明する。 The operation of each part of the signal processing device according to the first embodiment of the present invention configured as described above will be described with reference to FIGS.

通信部（ＣＯＭ）１０１は、通信相手から受信された受信データを入力し、事前に決め
られた処理時間の単位、つまり１フレーム（Ｎサンプル）ごとにディジタル信号に復号化
する。但し、このサンプリング周波数をｆ_Ｓ［Ｈｚ］とする。そして、通信部（ＣＯＭ）
１０１はこの復号化されたディジタル信号を１フレームごとに受話入力信号ｘ［ｎ］（ｎ
＝０，１，・・・，Ｎ−１）として出力する。 A communication unit (COM) 101 receives reception data received from a communication partner, and decodes it into a digital signal every predetermined unit of processing time, that is, every one frame (N samples). However, this sampling frequency is assumed to be f _S [Hz]. And communication department (COM)
101 receives the decoded digital signal for each frame as a received input signal x [n] (n
= 0, 1, ..., N-1).

また、通信部（ＣＯＭ）１０１は、周波数領域逆変換処理部（ＩＦＴ）１１１ｐから出
力された送話信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を入力とし、符号化して、
送信データとして通信相手に出力する。 The communication unit (COM) 101 receives the transmission signal s ′ [n] (n = 0, 1,..., N−1) output from the frequency domain inverse transform processing unit (IFT) 111p. , Encode
Output to the communication partner as transmission data.

遅延処理部（ＤＥＬＡＹ）１０２は、通信部（ＣＯＭ）１０１から１フレームごとに出
力された受話入力信号ｘ［ｎ］を入力とし、一時的に格納して、事前に決められた時間分
（Ｄサンプル）ディジタル処理によって遅延させる処理を行い出力する。 The delay processing unit (DELAY) 102 receives the reception input signal x [n] output from the communication unit (COM) 101 for each frame, temporarily stores it, and stores it for a predetermined time (D Sample) Delayed by digital processing and output.

Ｄ／Ａ変換器（Ｄ／Ａ）１０３は、通信部（ＣＯＭ）１０１から出力された受話入力信
号ｘ［ｎ］を入力とし、アナログ信号に変換して出力する。 The D / A converter (D / A) 103 receives the received input signal x [n] output from the communication unit (COM) 101, converts it into an analog signal, and outputs it.

受話増幅器１０４は、Ｄ／Ａ変換器（Ｄ／Ａ）１０３から出力されたアナログ信号を入
力とし、増幅して出力する。 The receiving amplifier 104 receives the analog signal output from the D / A converter (D / A) 103, amplifies it, and outputs it.

スピーカ１０５は、受話増幅器１０４から出力された増幅したアナログ信号を入力とし
、音響空間へ信号ｘ（ｔ）として出力する。 The speaker 105 receives the amplified analog signal output from the receiving amplifier 104 and outputs it to the acoustic space as a signal x (t).

マイクロホン１０６は、上記のようにしてスピーカ１０５から音響空間に出力された信
号ｘ（ｔ）と、送話音声信号ｓ（ｔ）とが音響結合した信号ｚ（ｔ）を収音し、アナログ
信号に変換して出力する。 The microphone 106 collects a signal z (t) obtained by acoustically coupling the signal x (t) output from the speaker 105 to the acoustic space as described above and the transmitted voice signal s (t), and outputs an analog signal. Convert to and output.

送話増幅器１０７は、マイクロホン１０６から出力されたこのアナログ信号を入力とし
、増幅して出力する。 The transmission amplifier 107 receives the analog signal output from the microphone 106, amplifies it, and outputs it.

Ａ／Ｄ変換器（Ａ／Ｄ）１０８は、送話増幅器１０７から出力された増幅されたアナロ
グ信号を入力とし、１フレームごとにディジタル信号に変換して出力する。 An A / D converter (A / D) 108 receives the amplified analog signal output from the transmission amplifier 107, converts it into a digital signal for each frame, and outputs it.

ハイパスフィルタ部（ＨＰＦ）１０９は、Ａ／Ｄ変換器（Ａ／Ｄ）１０８から出力され
たディジタル信号を入力とし、オフセット（直流成分）除去をし、送話入力信号ｚ［ｎ］
（ｎ＝０，１，・・・，Ｎ−１）として出力する。 The high-pass filter unit (HPF) 109 receives the digital signal output from the A / D converter (A / D) 108, removes the offset (DC component), and transmits the transmission input signal z [n].
(N = 0, 1,..., N−1) is output.

エコーキャンセラ部（ＥＣ）１１０は、ハイパスフィルタ部（ＨＰＦ）１０９から出力
された送話入力信号ｚ［ｎ］と、遅延処理部（ＤＥＬＡＹ）１０２から出力された遅延し
た受話入力信号ｘ［ｎ−Ｄ］を入力とし、送話入力信号ｚ［ｎ］からエコー成分を抑圧し
、そのエコー抑圧後の信号を残差信号ｅ［ｎ］（ｎ＝０，１，・・・，Ｎ−１）として出
力する。更に、ダブルトーク情報ＥＣｓｔａｔｅ［ｎ］を出力する。 The echo canceller unit (EC) 110 transmits the transmission input signal z [n] output from the high-pass filter unit (HPF) 109 and the delayed reception input signal x [n− output from the delay processing unit (DELAY) 102. D] as an input, the echo component is suppressed from the transmission input signal z [n], and the signal after the echo suppression is used as a residual signal e [n] (n = 0, 1,..., N−1). Output as. Furthermore, double talk information ECstate [n] is output.

適応フィルタ部（ＡＤＦ）１１０ａは、長さLのフィルタ係数ｈ［ｉ］（ｉ＝０，１，
・・・，Ｌ−１）が可変のトランスバーサルフィルタ（Transversal Filter）で構成され
る適応フィルタである。 The adaptive filter unit (ADF) 110a has a length L filter coefficient h [i] (i = 0, 1,
..., L-1) is an adaptive filter composed of a variable transversal filter.

適応フィルタ部（ＡＤＦ）１１０ａは遅延処理部（ＤＥＬＡＹ）１０２から出力された
遅延した受話入力信号ｘ［ｎ−Ｄ］と、信号減算処理部１１０ｂから出力されたエコー抑
圧後の１サンプル前の残差信号ｅ［ｎ−１］と、ダブルトーク検出部（ＤＴＤ）１１０ｃ
から出力されたダブルトーク情報ＥＣｓｔａｔｅ［ｎ］とを入力とし、ダブルトーク情報
ＥＣｓｔａｔｅ［ｎ］がダブルトーク状態でなかった場合はフィルタ係数ｈ［ｉ］をサン
プルｎごとに適応学習し、ダブルトーク情報ＥＣｓｔａｔｅ［ｎ］がダブルトーク状態で
あった場合は、適応学習を行わない。 The adaptive filter unit (ADF) 110a receives the delayed received input signal x [n−D] output from the delay processing unit (DELAY) 102 and the remaining one sample before the echo suppression output from the signal subtraction processing unit 110b. Difference signal e [n−1] and double talk detector (DTD) 110c
The double talk information ECstate [n] output from the input is input, and when the double talk information ECstate [n] is not in the double talk state, the filter coefficient h [i] is adaptively learned for each sample n, and the double talk information is obtained. When ECstate [n] is in the double talk state, adaptive learning is not performed.

また、適応フィルタ部（ＡＤＦ）１１０ａは、遅延処理部（ＤＥＬＡＹ）１０２から出
力された遅延した受話入力信号ｘ［ｎ−Ｄ］とフィルタ係数ｈ［ｉ］を用いて擬似エコー
信号ｙ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を算出して出力する。 The adaptive filter unit (ADF) 110a uses the delayed received input signal x [n-D] output from the delay processing unit (DELAY) 102 and the filter coefficient h [i] to generate a pseudo echo signal y ′ [n. ] (N = 0, 1,..., N−1) are calculated and output.

適応フィルタ部（ＡＤＦ）１１０ａは、フィルタ係数ｈ［ｉ］の更新幅を制御する固定
あるいは可変のステップサイズμ_T［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を用いて適応
学習を行う。 The adaptive filter unit (ADF) 110a uses a fixed or variable step size μ _T [n] (n = 0, 1,..., N−1) for controlling the update width of the filter coefficient h [i]. Perform adaptive learning.

また、適応フィルタ部（ＡＤＦ）１１０ａは、例えばＬＭＳ（Least-Mean-Square）ア
ルゴリズム、ＮＬＭＳ（Normalized-Least-Mean-Square）アルゴリズム、学習同定法、ア
フィン射影（ＡＰ：Affine-Projection）アルゴリズム、逐次最小二乗（ＲＬＳ：Recursi
ve-Least-Squares）アルゴリズムなどの線形適応アルゴリズムに基づく適応フィルタや勾
配制限型学習同定法（Gradient-limited Normalized-Least-Mean-Square）、適応ボルテ
ラフィルタ（Adaptive Volterra Filter）などの非線形適応アルゴリズムに基づく適応フ
ィルタで構成される。また、本実施形態では時間領域型適応フィルタの例を示しているが
、サブバンド型（帯域分割型）・周波数領域型で用いる適応フィルタで構成してもよい。 The adaptive filter unit (ADF) 110a includes, for example, an LMS (Least-Mean-Square) algorithm, an NLMS (Normalized-Least-Mean-Square) algorithm, a learning identification method, an affine projection (AP) algorithm, and a sequential. Least squares (RLS: Recursi
Non-linear adaptive algorithms such as adaptive filters based on linear adaptive algorithms such as the ve-Least-Squares algorithm, gradient-limited normalized-least-mean-square, and adaptive Volterra filters. Based on adaptive filter. In this embodiment, an example of a time domain type adaptive filter is shown, but an adaptive filter used in a subband type (band division type) or frequency domain type may be used.

信号減算処理部１１０ｂは、ハイパスフィルタ部（ＨＰＦ）１０９から出力された送話
入力信号ｚ［ｎ］と、適応フィルタ部（ＡＤＦ）１１０ａから出力された擬似エコー信号
ｙ’［ｎ］を入力とし、送話入力信号ｚ［ｎ］から擬似エコー信号ｙ’［ｎ］をサンプル
ｎごとに減算することでエコー成分を抑圧し、そのエコー抑圧後の信号である残差信号ｅ
［ｎ］を出力する。 The signal subtraction processing unit 110b receives the transmission input signal z [n] output from the high pass filter unit (HPF) 109 and the pseudo echo signal y ′ [n] output from the adaptive filter unit (ADF) 110a. The echo component is suppressed by subtracting the pseudo echo signal y ′ [n] from the transmission input signal z [n] for each sample n, and the residual signal e which is the signal after the echo suppression is suppressed.
[N] is output.

ダブルトーク検出部（ＤＴＤ）１１０ｃは、ハイパスフィルタ部（ＨＰＦ）１０９から
出力された送話入力信号ｚ［ｎ］と、遅延処理部（ＤＥＬＡＹ）１０２から出力された遅
延した受話入力信号ｘ［ｎ−Ｄ］と、信号減算処理部１１０ｂから出力された１サンプル
前の残差信号ｅ［ｎ−１］を入力とし、サンプルｎごとにダブルトーク状態か否かを判定
する。 The double-talk detector (DTD) 110c transmits the transmission input signal z [n] output from the high-pass filter (HPF) 109 and the delayed reception input signal x [n] output from the delay processor (DELAY) 102. −D] and the residual signal e [n−1] one sample before output from the signal subtraction processing unit 110b are input, and it is determined whether or not the sample is in the double talk state for each sample n.

具体的には、ダブルトーク検出部（ＤＴＤ）１１０ｃは、送話入力信号ｚ［ｎ］のパワ
ー特性（パワー値またはピーク値。以下、「パワー特性」と称する。）Ｐ_Ｚ［ｎ］（ｎ＝
０，１，・・・，Ｎ−１）と遅延した受話入力信号ｘ［ｎ−Ｄ］のパワー特性Ｐ_Ｘ［ｎ］
（ｎ＝０，１，・・・，Ｎ−１）と残差信号ｅ［ｎ］のパワー特性Ｐ_Ｅ［ｎ］（ｎ＝０，
１，・・・，Ｎ−１）とをサンプルｎごとに算出し、Ｐ_Ｅ［ｎ］＞λ［ｎ］・Ｐ_Ｘ［ｎ］
またはＰ_Ｚ［ｎ］＞δ・Ｐ_Ｘ［ｎ］となる場合にダブルトーク状態と判定する。ここで、
λ［ｎ］（ｎ＝０，１，・・・，Ｎ−１）はエコーパスロスの推定値であり、フィルタ係
数ｈ［ｉ］（ｉ＝０，１，・・・，Ｌ−１）を適応学習したサンプルｎごとに算出し、適
応学習が進めば小さくなり、適応学習が間違っていれば大きくなる可変量である。また、
δは動作開始前に外部から予め設定可能な固定値である。そして、ダブルトーク検出部（
ＤＴＤ）１１０ｃは、ダブルトーク状態か否かの情報であるダブルトーク情報ＥＣｓｔａ
ｔｅ［ｎ］を出力する。 Specifically, the double talk detector (DTD) 110c has a power characteristic (power value or peak value; hereinafter referred to as “power characteristic”) P _Z [n] (n =
0, 1,..., N−1) and the delayed power input signal x [n−D] power characteristic P _X [n]
(N = 0, 1,..., N−1) and the power characteristic P _E [n] (n = 0,
1,..., N−1) for each sample n, and P _E [n]> λ [n] · P _X [n]
Alternatively, when P _Z [n]> δ · P _X [n], the double-talk state is determined. here,
λ [n] (n = 0, 1,..., N−1) is an estimate of the echo path loss, and the filter coefficient h [i] (i = 0, 1,. It is calculated for each sample n that has been adaptively learned, and is a variable that decreases as adaptive learning progresses and increases when adaptive learning is incorrect. Also,
δ is a fixed value that can be preset from the outside before the operation starts. And the double talk detector (
DTD) 110c is double talk information ECsta which is information indicating whether or not a double talk state exists.
te [n] is output.

この場合、エコーキャンセラ部（ＥＣ）１１０は、フィルタ係数ｈ［ｉ］（ｉ＝０，１
，・・・，Ｌ−１）、ステップサイズμ_T［ｎ］、エコーパスロスの推定値λ［ｎ］、ダ
ブルトーク情報ＥＣｓｔａｔｅ［ｎ］、受話入力信号のパワー特性Ｐ_Ｘ［ｎ］、送話入力
信号のパワー特性Ｐ_Ｚ［ｎ］、残差信号のパワー特性Ｐ_Ｅ［ｎ］（ｎ＝０，１，・・・，
Ｎ−１）を内部状態としてメモリに保持している。ここで、内部状態とは少なくとも時間
に従って変化する変数の集合を指し、これ以降説明を省略する。 In this case, the echo canceller unit (EC) 110 uses the filter coefficient h [i] (i = 0, 1).
,..., L-1), step size μ _T [n], echo path loss estimated value λ [n], double talk information ECstate [n], received input signal power characteristic P _X [n], transmission Input signal power characteristic P _Z [n], residual signal power characteristic P _E [n] (n = 0, 1,...,
N-1) is held in the memory as an internal state. Here, the internal state refers to a set of variables that change at least according to time, and a description thereof will be omitted.

ダブルトーク検出部（ＤＴＤ）１１０ｃを備えないエコーキャンセラ部（ＥＣ）１１０
であっても構わない。この場合、適応フィルタ部（ＡＤＦ）１１０ａと制御部（ＣＴＲＬ
）１１１ｋは、ダブルトーク情報ＥＣｓｔａｔｅ［ｎ］がダブルトーク状態でないことを
示す場合の動作をする。 Echo canceller (EC) 110 without double talk detector (DTD) 110c
It does not matter. In this case, the adaptive filter unit (ADF) 110a and the control unit (CTRL)
) 111k operates when the double talk information ECstate [n] indicates that it is not in the double talk state.

エコーリダクション部（ＥＲ）１１１は、遅延処理部（ＤＥＬＡＹ）１０２から出力さ
れた遅延した受話入力信号ｘ［ｎ−Ｄ］と、ハイパスフィルタ部（ＨＰＦ）１０９から出
力された送話入力信号ｚ［ｎ］と、信号減算処理部１１０ｂから出力された残差信号ｅ［
ｎ］と、ダブルトーク検出部（ＤＴＤ）１１０ｃから出力されたダブルトーク情報ＥＣｓ
ｔａｔｅ［ｎ］とを入力とし、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の少なく
とも一方からエコー成分を抑圧し、そのエコー抑圧後の信号を送話出力信号ｓ’［ｎ］（
ｎ＝０，１，・・・，Ｎ−１）として１フレームごとに出力する。 The echo reduction unit (ER) 111 includes a delayed received input signal x [n−D] output from the delay processing unit (DELAY) 102 and a transmitted input signal z [output from the high pass filter unit (HPF) 109. n] and the residual signal e [
n] and the double talk information ECs output from the double talk detector (DTD) 110c
ate [n] as an input, the echo component is suppressed from at least one of the transmission input signal z [n] or the residual signal e [n], and the signal after the echo suppression is transmitted as the transmission output signal s ′ [n. ] (
n = 0, 1,..., N−1) and output every frame.

周波数領域変換処理部（ＦＴ）１１１ａは、遅延処理部（ＤＥＬＡＹ）１０２から出力
された遅延した受話入力信号ｘ［ｎ−Ｄ］を入力として、ＦＦＴ（Fast Fourier Transfo
rm）などの処理によって周波数領域に変換して、受話入力信号の周波数スペクトルＸ［ｆ
，ω］を算出して出力する。 The frequency domain transform processing unit (FT) 111a receives the delayed received input signal x [n−D] output from the delay processing unit (DELAY) 102 as an input, and performs FFT (Fast Fourier Transform).
rm) and the like, and the frequency spectrum X [f
, Ω] is calculated and output.

周波数領域変換処理部（ＦＴ）１１１ｂは、ハイパスフィルタ部（ＨＰＦ）１０９から
出力された送話入力信号ｚ［ｎ］をＦＦＴなどによって周波数領域に変換して、送話入力
信号の周波数スペクトルＺ［ｆ，ω］を算出して出力する。 The frequency domain transform processing unit (FT) 111b converts the transmission input signal z [n] output from the high-pass filter unit (HPF) 109 into the frequency domain by FFT or the like, and the frequency spectrum Z [ f, ω] is calculated and output.

周波数領域変換処理部（ＦＴ）１１１ｃは、信号減算処理部１１０ｂから出力された残
差信号ｅ［ｎ］をＦＦＴなどによって周波数領域に変換して、残差信号の周波数スペクト
ルＥ［ｆ，ω］を算出して出力する。 The frequency domain transform processing unit (FT) 111c transforms the residual signal e [n] output from the signal subtraction processing unit 110b into the frequency domain by FFT or the like, and the frequency spectrum E [f, ω] of the residual signal. Is calculated and output.

周波数領域変換処理部（ＦＴ）１１１ａ、周波数領域変換処理部（ＦＴ）１１１ｂ及び
周波数領域変換処理部（ＦＴ）１１１ｃは、適宜、ハミング窓などによる窓掛けや、過去
のフレームを用いたり零補間したりオーバーラップを行う。例えば、過去１フレーム分と
当該フレームからＦＦＴ点数分の信号を取り出し、ハミング窓による窓掛けを行い、ＦＦ
Ｔを行う。 The frequency domain transformation processing unit (FT) 111a, the frequency domain transformation processing unit (FT) 111b, and the frequency domain transformation processing unit (FT) 111c appropriately use windowing with a Hamming window or the like, use past frames, or perform zero interpolation. Or overlap. For example, a signal corresponding to the number of FFT points is extracted from the past one frame and the corresponding frame, windowed by a Hamming window, and FF
Do T.

受話パワー算出部（ＰＯＷ）１１１ｄは、周波数領域変換処理部（ＦＴ）１１１ａから
出力された受話入力信号の周波数スペクトルＸ［ｆ，ω］を入力とし、そのパワースペク
トルである受話パワースペクトル｜Ｘ［ｆ，ω］｜^２を算出して出力する。そして、音響
結合量は通常時間的に急に変化しないので、瞬時値を用いるよりもスムージングした値を
用いた方が音響結合量を精度良く推定できるため、受話パワー算出部（ＰＯＷ）１１１ｄ
は、例えば、式１に示すように１フレーム前の値｜Ｘ_Ｓ［ｆ−１，ω］｜^２を用いてスム
ージングした受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２を算出して出力する。ただし、
α_Ｘ［ω］は、０．７５〜０．９９９程度が望ましい。

The received power calculation unit (POW) 111d receives the frequency spectrum X [f, ω] of the received input signal output from the frequency domain transform processing unit (FT) 111a, and receives the received power spectrum | X [ f, ω] | ² is calculated and output. Since the amount of acoustic coupling usually does not change abruptly in time, the amount of acoustic coupling can be estimated with higher accuracy by using a smoothed value than by using an instantaneous value. Therefore, the received power calculation unit (POW) 111d
For example, as shown in Equation 1, the received power spectrum | X _S [f, ω] | ² smoothed using the value | X _S [f−1, ω] | ² of the previous frame is calculated and output. To do. However,
α _X [ω] is preferably about 0.75 to 0.999.

送話パワー算出部（ＰＯＷ）１１１ｅは、周波数領域変換処理部（ＦＴ）１１１ｂから
出力された送話入力信号の周波数スペクトルＺ［ｆ，ω］を入力とし、そのパワースペク
トルである送話パワースペクトル｜Ｚ［ｆ，ω］｜^２を算出して出力する。そして、音響
結合量は通常時間的に急に変化しないので、瞬時値を用いるよりもスムージングした値を
用いた方が音響結合量を精度良く推定できるため、送話パワー算出部（ＰＯＷ）１１１ｅ
は、例えば、式２に示すように１フレーム前の値｜Ｚ_Ｓ［ｆ−１，ω］｜^２を用いてスム
ージングした送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２を算出して出力する。ただし、
α_Ｚ［ω］は、０．７５〜０．９９９程度が望ましい。

The transmission power calculation unit (POW) 111e receives the frequency spectrum Z [f, ω] of the transmission input signal output from the frequency domain conversion processing unit (FT) 111b as an input, and the transmission power spectrum which is the power spectrum | Z [f, ω] | ² is calculated and output. Since the amount of acoustic coupling usually does not change abruptly in time, the amount of acoustic coupling can be estimated more accurately by using a smoothed value than by using an instantaneous value. Therefore, the transmission power calculation unit (POW) 111e.
Is, for example, the value of the previous frame as shown in Equation _{2 | Z S [f-1} , ω] | sending power spectra were smoothed using a ^{_{2 | Z S [f, ω}} ] | 2 calculates the Output. However,
α _Z [ω] is preferably about 0.75 to 0.999.

残差パワー算出部（ＰＯＷ）１１１ｆは、周波数領域変換処理部（ＦＴ）１１１ｃから
出力された残差信号の周波数スペクトルＥ［ｆ，ω］を入力とし、そのパワースペクトル
である残差パワースペクトル｜Ｅ［ｆ，ω］｜^２を算出して出力する。そして、音響結合
量は通常時間的に急に変化しないので、瞬時値を用いるよりもスムージングした値を用い
た方が音響結合量を精度良く推定できるため、残差パワー算出部（ＰＯＷ）１１１ｆは、
例えば、式３に示すのように１フレーム前の値｜Ｅ_Ｓ［ｆ−１，ω］｜^２を用いてスムー
ジングした残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２を算出して出力する。ただし、α
_Ｅ［ω］は、０．７５〜０．９９９程度が望ましい。

The residual power calculation unit (POW) 111f receives the frequency spectrum E [f, ω] of the residual signal output from the frequency domain transform processing unit (FT) 111c as an input, and the residual power spectrum, which is the power spectrum | E [f, ω] | ² is calculated and output. Since the amount of acoustic coupling does not usually change suddenly in time, it is possible to estimate the amount of acoustic coupling more accurately by using a smoothed value than by using an instantaneous value. Therefore, the residual power calculation unit (POW) 111f is ,
For example, one frame previous value as shown in Equation _{3 | E S [f-1} , ω] | residual was smoothed using a ^second power spectrum _{| E S [f, ω]} | 2 calculated by the output To do. Where α
_E [ω] is preferably about 0.75 to 0.999.

音響結合量推定部（ＡＣＬＥ）１１１ｇは、受話パワー算出部（ＰＯＷ）１１１ｄから
出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送話パワ
ー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル｜
Ｚ_Ｓ［ｆ，ω］｜^２と、周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊから出力さ
れる周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］とを入力とし、エコーキャン
セラ部（ＥＣ）１１０の性能の影響を受けないように残差信号に基づく｜Ｅ_Ｓ［ｆ，ω］
｜^２を用いずに送話入力信号に基づく｜Ｚ_Ｓ［ｆ，ω］｜^２を用いて、周波数帯域ω毎に
音響結合量｜Ｈ［ｆ，ω］｜^２を例えば、以下の式４によって算出する。このようにする
ことで、後述する第５の実施形態に係る信号処理装置と比較して、計算量を少なくするこ
とができ、各パラメータの格納に必要なメモリ量も少なくすることができる。

The acoustic coupling amount estimation unit (ACLE) 111g includes a smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d, and a transmission power calculation unit (POW) 111e. Smoothed transmission power spectrum output from
Z _S [f, ω] | ² and frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detector (FDTD) 111j are input, and the performance of the echo canceller unit (EC) 110 is obtained. | E _S [f, ω] based on residual signal so as not to be affected by
Using | Z _S [f, ω] | ² based on the transmission input signal without using | ² , the acoustic coupling amount | H [f, ω] | ² for each frequency band ω, for example, Calculated by By doing in this way, compared with the signal processing apparatus which concerns on 5th Embodiment mentioned later, a calculation amount can be decreased and the memory amount required for storage of each parameter can also be decreased.

そして、音響結合量推定部（ＡＣＬＥ）１１１ｇは、以下の式５のように１フレーム前
の値を用いてスムージングした音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出して出力する。た
だし、α_Ｈ［ω］は、０．０３〜０．９９程度が望ましい。

Then, the acoustic coupling amount estimation unit (ACLE) 111 g calculates and outputs the acoustic coupling amount | H _S [f, ω] | ² that is smoothed by using the value one frame before as shown in the following Expression 5. However, α _H [ω] is preferably about 0.03 to 0.99.

ここで、通話開始初期時、例えば通話開始から５秒間程度はα_Ｈ［ω］を大きくするこ
とで音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２の更新を速めておく。このようにすることで通話開
始初期時は音響結合量が初期化されているので、通話開始初期時に抑圧量が少なくなって
しまうのを防止することができる。 Here, at the initial stage of the call, for example, by increasing α _H [ω] for about 5 seconds from the start of the call, the update of the acoustic coupling amount | H _S [f, ω] | ² is accelerated. By doing so, since the acoustic coupling amount is initialized at the beginning of the call, it is possible to prevent the suppression amount from being reduced at the beginning of the call.

ただし、周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊから出力される周波数領
域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］がダブルトーク状態である場合、あるいは
音響結合量が急激に変化する場合、即ち、｜Ｈ_Ｓ［ｆ，ω］｜^２＞β_Ｈ［ω］・｜Ｈ_Ｓ［
ｆ−１，ω］｜^２が満たされる場合、あるいは受話入力信号が十分に大きくない場合、即
ち、｜Ｘ_Ｓ［ｆ，ω］｜^２＜β_Ｘ［ω］が満たされる場合には、エコーパス変動への高速
な追従性を保ちつつ、ダブルトークとなる周波数帯域での音響結合量の算出を行わないよ
うにするため、音響結合量推定部（ＡＣＬＥ）１１１ｇは、音響結合量を更新しないで１
フレーム前の過去の音響結合量｜Ｈ_Ｓ［ｆ−１，ω］｜^２を用いる。極端な音響結合量の
変化はダブルトークの可能性があるので、このように音響結合量を更新しないでおくこと
で送話音質劣化を防止することができる。ただし、β_Ｈ［ω］は０．９〜３０程度が望ま
しい。β_Ｘ［ω］は３０ｄＢ〜４０ｄＢ程度が望ましい。 However, when the frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detector (FDTD) 111j is in a double talk state, or when the acoustic coupling amount changes abruptly, that is, | H _S [f, ω] | ² > β _H [ω] · | H _S [
If f−1, ω] | ² is satisfied, or if the received input signal is not sufficiently large, that is, if | X _S [f, ω] | ² <β _X [ω] is satisfied, the echo path The acoustic coupling amount estimation unit (ACLE) 111g does not update the acoustic coupling amount in order to avoid calculating the acoustic coupling amount in the frequency band that is a double talk while maintaining high-speed followability to fluctuation. 1
The past acoustic coupling amount | H _S [f−1, ω] | ² before the frame is used. Since an extreme change in the amount of acoustic coupling has the possibility of double talk, it is possible to prevent deterioration in transmitted sound quality by not updating the amount of acoustic coupling in this way. However, β _H [ω] is preferably about 0.9 to 30. β _X [ω] is desirably about 30 dB to 40 dB.

エコー量推定部（ＥＬＥ）１１１ｈは、受話パワー算出部（ＰＯＷ）１１１ｄから出力
されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、音響結合量推
定部（ＡＣＬＥ）１１１ｇから出力された音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２とを入力とし
、送話入力信号の周波数スペクトルＺ［ｆ，ω］に含まれるエコー量｜Ｙ［ｆ，ω］｜^２
を以下の式６のように周波数帯域ω毎に推定して出力する。

The echo amount estimation unit (ELE) 111h includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d and the acoustic coupling amount estimation unit (ACLE) 111g. The output acoustic coupling amount | H _S [f, ω] | ² is used as an input, and the echo amount | Y [f, ω] | ² included in the frequency spectrum Z [f, ω] of the transmission input signal.
Is estimated and output for each frequency band ω as shown in Equation 6 below.

そして、エコー量推定部（ＥＬＥ）１１１ｈは、瞬時のエコー量｜Ｙ［ｆ，ω］｜^２を
用いるよりもスムージングした値を用いた方がエコー抑圧後の信号をより自然な信号にで
きるため、以下の式７のように１フレーム前の値を用いてスムージングしたエコー量｜Ｙ
_Ｓ［ｆ，ω］｜^２を周波数帯域ω毎に算出して出力する。ただし、αＹ［ω］は０．７〜
０．９９程度が望ましい。

Then, the echo amount estimation unit (ELE) 111h can make the signal after echo suppression a more natural signal by using the smoothed value rather than using the instantaneous echo amount | Y [f, ω] | ^2. The amount of echo smoothed using the value one frame before | Y
_S [f, ω] | ² is calculated and output for each frequency band ω. However, αY [ω] is 0.7 to
About 0.99 is desirable.

エコー抑圧量推定部（ＥＣＬＥ）１１１ｉは、送話パワー算出部（ＰＯＷ）１１１ｅか
ら出力された送話パワースペクトル｜Ｚ［ｆ，ω］｜^２と、残差パワー算出部（ＰＯＷ）
１１１ｆから出力された残差パワースペクトル｜Ｅ［ｆ，ω］｜^２とを入力とし、それら
のパワースペクトル間の相違を示す量を計算することによって、エコーキャンセラ部（Ｅ
Ｃ）１１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］を周波数帯域ω毎に推定して出力
する。具体的には、以下に示す式８に示すように算出する。もちろん、これら２つのパワ
ースペクトルの差などを用いてもよい。

The echo suppression amount estimation unit (ECLE) 111i includes a transmission power spectrum | Z [f, ω] | ² output from the transmission power calculation unit (POW) 111e, and a residual power calculation unit (POW).
The residual power spectrum | E [f, ω] | ² output from 111f is input, and an echo canceller (E
C) The echo suppression amount ECL [f, ω] suppressed at 110 is estimated and output for each frequency band ω. Specifically, it is calculated as shown in Equation 8 below. Of course, the difference between these two power spectra may be used.

周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊは、受話パワー算出部（ＰＯＷ）
１１１ｄから出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２
と、残差パワー算出部（ＰＯＷ）１１１ｆから出力されたスムージングされた残差パワー
スペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２と、エコー量推定部（ＥＬＥ）１１１ｈから出力された
１フレーム前のエコー量｜Ｙ［ｆ−１，ω］｜^２とを入力とし、周波数帯域ω毎に以下に
示す式９が満たされるか否かの判定を行う。 The frequency domain double talk detector (FDTD) 111j is a received power calculator (POW).
Smoothed received power spectrum output from 111d | X _S [f, ω] | ²
And the smoothed residual power spectrum | E _S [f, ω] | ² output from the residual power calculation unit (POW) 111f and the frame one frame before output from the echo amount estimation unit (ELE) 111h. The echo amount | Y [f−1, ω] | ² is input, and it is determined whether or not the following Expression 9 is satisfied for each frequency band ω.

ここでは、エコーリダクション部（ＥＲ）１１１のエコー抑圧処理後のパワースペクト
ルを残差パワースペクトルとエコー量の距離で代用し、それが十分に受話パワースペクト
ルよりも小さくなっているか否かを判定に用いる。そして、閾値β_Ｙ［ω］よりも大きけ
れば、即ち、式９が満たされる場合、ダブルトーク状態と判定し、式９が満たされない場
合、ダブルトーク状態でないと判定して、ダブルトーク状態か否かの情報を周波数領域ダ
ブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］として出力する。これにより、計算量が小さい
周波数領域ＤＴＤが実現できる。 Here, the power spectrum after the echo suppression processing of the echo reduction unit (ER) 111 is substituted with the distance between the residual power spectrum and the echo amount, and it is determined whether or not it is sufficiently smaller than the received power spectrum. Use. If it is larger than the threshold β _Y [ω], that is, if Expression 9 is satisfied, it is determined that the state is a double talk state. If Expression 9 is not satisfied, it is determined that the state is not a double talk state. Such information is output as frequency domain double talk information ERstate [f, ω]. As a result, a frequency domain DTD with a small calculation amount can be realized.

もちろん、周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊを備えないエコーリダ
クション部（ＥＲ）１１１であっても構わない。この場合、音響結合量推定部（ＡＣＬＥ
）１１１ｇと制御部（ＣＴＲＬ）１１１ｋは、周波数領域ダブルトーク情報ＥＲｓｔａｔ
ｅ［ｆ，ω］がダブルトーク状態でないことを示す場合の動作をする。ただし、β_Ｙ［ω
］は１．０〜２０程度が望ましい。

Of course, an echo reduction unit (ER) 111 that does not include the frequency domain double talk detection unit (FDTD) 111j may be used. In this case, the acoustic coupling amount estimation unit (ACLE)
) 111g and control unit (CTRL) 111k are frequency domain double talk information ERstat.
The operation is performed when e [f, ω] indicates that it is not in the double talk state. However, β _Y [ω
] Is preferably about 1.0 to 20.

制御部（ＣＴＲＬ）１１１ｋは、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力さ
れたエコー抑圧量ＥＣＬ［ｆ，ω］と、ダブルトーク検出部（ＤＴＤ）１１０ｃから出力
されたダブルトーク情報ＥＣｓｔａｔｅ［ｎ］と、周波数領域ダブルトーク検出部（ＦＤ
ＴＤ）１１１ｊから出力される周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］と
を入力とし、周波数帯域ω毎にダブルトーク状態かつエコーキャンセラ部（ＥＣ）１１０
のエコー抑圧量が十分でない周波数帯域を検出して、ダブルトーク状態かつエコーキャン
セラ部（ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域であるか否かの情報である
制御情報ＥＲｃｏｎｔｒｏｌ１［ｆ，ω］及びＥＲｃｏｎｔｒｏｌ２［ｆ，ω］を出力す
る。 The control unit (CTRL) 111k includes the echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 111i and the double talk information ECstate [n] output from the double talk detection unit (DTD) 110c. ] And a frequency domain double talk detector (FD
TD) frequency domain double talk information ERstate [f, ω] output from 111j, and a double talk state and echo canceller (EC) 110 for each frequency band ω.
The control information ERcontrol1 [f, ω, which is information indicating whether or not the frequency band in which the echo suppression amount is not sufficient is detected, and whether or not the frequency band in which the echo suppression amount of the echo canceller unit (EC) 110 is insufficient is detected. ] And ERcontrol2 [f, ω] are output.

まず、制御部（ＣＴＲＬ）１１１ｋは、周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［
ｆ，ω］を用いて周波数帯域ω毎に周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊ
でダブルトーク状態と判定されているか否か、ダブルトーク情報ＥＣｓｔａｔｅ［ｎ］を
用いてエコーキャンセラ部（ＥＣ）１１０のダブルトーク検出部（ＤＴＤ）１１０ｃが当
該フレームでダブルトーク状態と１サンプルでも判定されているか否か、によって周波数
帯域ω毎にダブルトーク状態であるか否かを判定する。 First, the control unit (CTRL) 111k performs frequency domain double talk information ERstate [
f, ω] for each frequency band ω, a frequency domain double talk detector (FDTD) 111j
It is determined whether or not the double talk state is determined by the double talk detection unit (DTD) 110c of the echo canceller unit (EC) 110 using the double talk information ECstate [n]. It is determined whether or not a double talk state is set for each frequency band ω depending on whether or not it is performed.

ここでは、周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］あるいはダブルトー
ク情報ＥＣｓｔａｔｅ［ｎ］のいずれか１つ、または双方がダブルトーク状態であること
を示していれば、制御部（ＣＴＲＬ）１１１ｋは周波数帯域ω毎にダブルトーク状態であ
ると判定する。 Here, if one or both of the frequency domain double talk information ERstate [f, ω] and the double talk information ECstate [n] indicate that they are in a double talk state, the control unit (CTRL) 111k It is determined that the double talk state is set for each frequency band ω.

このようにエコーキャンセラ部（ＥＣ）１１０のダブルトーク検出部（ＤＴＤ）１１０
ｃから出力されたダブルトーク情報ＥＣｓｔａｔｅ［ｎ］を用いれば、エコーキャンセラ
部（ＥＣ）１１０のエコー抑圧量が十分でない状況は、ダブルトーク中のエコーパス変動
で生じやすいため、ダブルトーク状態かつエコーキャンセラ部（ＥＣ）１１０のエコー抑
圧量が十分でない周波数帯域を精度よく検出することできる。 Thus, the double talk detector (DTD) 110 of the echo canceller (EC) 110
If the double talk information ECstate [n] output from c is used, a situation in which the echo suppression amount of the echo canceller (EC) 110 is not sufficient is likely to occur due to echo path fluctuations during double talk. The frequency band where the echo suppression amount of the unit (EC) 110 is not sufficient can be detected with high accuracy.

次に、制御部（ＣＴＲＬ）１１１ｋは、周波数帯域ω毎にＥＣＬ［ｆ，ω］＞β_Ｚ１［
ω］を満たした周波数帯域をエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分で
ないと判定する。ただし、β_Ｚ１［ω］は＋８ｄＢ〜−１５ｄＢ程度が望ましい。本実施
形態では以下、β_Ｚ１［ω］を０ｄＢとして説明する。 Next, the control unit (CTRL) 111k has ECL [f, ω]> β _Z1 [for each frequency band ω.
It is determined that the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient for the frequency band satisfying ω]. However, β _Z1 [ω] is preferably about +8 dB to −15 dB. In the present embodiment, hereinafter, β _Z1 [ω] will be described as 0 dB.

１つの周波数帯域でのみエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が少ないと
いうことはありえにくい、つまりエコーキャンセラ部（ＥＣ）１１０の性能は時間領域で
切り替わると考えられるため、制御部（ＣＴＲＬ）１１１ｋは、エコーキャンセラ部（Ｅ
Ｃ）１１０のエコー抑圧量が十分でない周波数帯域を精度よく検出するために、併せて、
１フレーム当りの不等式ＥＣＬ［ｆ，ω］＞β_Ｚ１［ω］が成立した帯域数ｃｏｕｎｔも
加味してエコー抑圧量が少ないと判定する。 Since it is unlikely that the echo suppression amount of the echo canceller unit (EC) 110 is small only in one frequency band, that is, the performance of the echo canceller unit (EC) 110 is considered to be switched in the time domain, the control unit (CTRL) 111k is an echo canceller (E
C) In order to accurately detect a frequency band where the echo suppression amount of 110 is not sufficient,
It is determined that the amount of echo suppression is small in consideration of the number of bands count in which the inequality ECL [f, ω]> β _Z1 [ω] per frame is established.

つまり、ＥＣＬ［ｆ，ω］＞β_Ｚ１［ω］かつｃｏｕｎｔ＞β_Ｃ、となる場合にエコー
抑圧量が少ないと判定する。ただし、β_Ｃ１は全周波数帯域数の１０％〜４０％程度が望
ましい。このようにすることで周波数帯域毎だけで判定するよりも受話入力信号と送話入
力信号の音量差に対して頑健になりエコー抑圧量が十分でないフレームを精度よく検出す
ることができる。そして、先の周波数帯域ω毎にダブルトーク状態の判定と併せて、制御
部（ＣＴＲＬ）１１１ｋは、ダブルトーク状態かつエコーキャンセラ部（ＥＣ）１１０の
エコー抑圧量が十分でない周波数帯域を検出して制御情報ＥＲｃｏｎｔｒｏｌ１［ｆ，ω
］を出力する。 That is, it is determined that the echo suppression amount is small when ECL [f, ω]> β _Z1 [ω] and count> β _C. However, β _C1 is desirably about 10% to 40% of the total number of frequency bands. By doing so, it is more robust to the volume difference between the received input signal and the transmitted input signal than it is determined only for each frequency band, and a frame with an insufficient echo suppression amount can be accurately detected. In addition to the determination of the double talk state for each frequency band ω, the control unit (CTRL) 111k detects the frequency band in which the double talk state and the echo suppression amount of the echo canceller unit (EC) 110 are not sufficient. Control information ERcontrol1 [f, ω
] Is output.

同様にして、制御部（ＣＴＲＬ）１１１ｋは、１フレーム当りの式ＥＣＬ［ｆ，ω］＞
β_Ｚ２［ω］が成立した帯域数ｃｏｕｎｔも加味して、ＥＣＬ［ｆ，ω］＞β_Ｚ２［ω］
かつｃｏｕｎｔ＞β_Ｃ２、となる場合にエコー抑圧量が少ないと判定して、先の周波数帯
域ω毎にダブルトーク状態の判定と併せて、制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］を
出力する。ただし、β_Ｚ２［ω］は＋８ｄＢ〜−１５ｄＢ程度、β_Ｃ２は全周波数帯域数
の１０％〜４０％程度が望ましい。 Similarly, the control unit (CTRL) 111k uses the equation ECL [f, ω]>
ECL [f, ω]> β _Z2 [ω] in consideration of the number of bands count in which β _Z2 [ω] is established.
When count> β _C2 , it is determined that the echo suppression amount is small, and the control information ERcontrol 2 [f, ω] is output for each frequency band ω together with the determination of the double talk state. However, β _Z2 [ω] is preferably about +8 dB to −15 dB, and β _C2 is preferably about 10% to 40% of the total number of frequency bands.

スペクトル選択部１１１Ｌは、周波数領域変換処理部（ＦＴ）１１１ｂから出力された
送話入力信号の周波数スペクトルＺ［ｆ，ω］と、周波数領域変換処理部（ＦＴ）１１１
ｃから出力された残差信号の周波数スペクトルＥ［ｆ，ω］と、制御部（ＣＴＲＬ）１１
１ｋから出力されたダブルトーク状態かつエコーキャンセラ部（ＥＣ）１１０のエコー抑
圧量が十分でない周波数帯域であるか否かの情報である制御情報ＥＲｃｏｎｔｒｏｌ１［
ｆ，ω］とを入力とし、送話入力信号の周波数スペクトルＺ［ｆ，ω］あるいは残差信号
の周波数スペクトルＥ［ｆ，ω］のいずれか一方を周波数スペクトルとして選択して出力
する。 The spectrum selection unit 111L includes the frequency spectrum Z [f, ω] of the transmission input signal output from the frequency domain transform processing unit (FT) 111b and the frequency domain transform processing unit (FT) 111.
The frequency spectrum E [f, ω] of the residual signal output from c and the control unit (CTRL) 11
The control information ERcontrol1 [, which is information indicating whether or not the frequency range is the double talk state output from 1k and the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient.
f, ω] as an input, and one of the frequency spectrum Z [f, ω] of the transmission input signal and the frequency spectrum E [f, ω] of the residual signal is selected and output as a frequency spectrum.

具体的には、制御情報ＥＲｃｏｎｔｒｏｌ１［ｆ，ω］がダブルトーク状態かつエコー
キャンセラ部（ＥＣ）１１０のエコー抑圧量が十分でないと検出された周波数帯域であっ
た場合は、スペクトル選択部１１１Ｌは、周波数スペクトルとして送話入力信号の周波数
スペクトルＺ［ｆ，ω］を選択する。それ以外の周波数帯域は周波数スペクトルとして残
差信号の周波数スペクトルＥ［ｆ，ω］を選択する。 Specifically, when the control information ERcontrol 1 [f, ω] is a frequency band that is detected as a double talk state and an echo suppression amount of the echo canceller unit (EC) 110 is not sufficient, the spectrum selection unit 111L The frequency spectrum Z [f, ω] of the transmission input signal is selected as the frequency spectrum. For the other frequency bands, the frequency spectrum E [f, ω] of the residual signal is selected as the frequency spectrum.

このようにすることで周波数帯域ごとに、エコーキャンセラ部（ＥＣ）１１０のエコー
抑圧量が十分でない場合、つまりエコーキャンセラ部（ＥＣ）１１０の推定精度が十分と
れない場合はエコーリダクション部（ＥＲ）１１１を単体で動作させることができる。 By doing so, when the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient for each frequency band, that is, when the estimation accuracy of the echo canceller unit (EC) 110 is not sufficient, the echo reduction unit (ER) 111 can be operated alone.

ゲイン格納部（ＧＴＢＬ）１１１ｍは、事前に設定された非線形エコー抑圧量を制御す
るパラメータγ［ω］を格納して出力する。ただし、γ［ω］は１．０〜２．０程度が望
ましい。 The gain storage unit (GTBL) 111m stores and outputs a parameter γ [ω] that controls a preset nonlinear echo suppression amount. However, γ [ω] is preferably about 1.0 to 2.0.

エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、送話パワー算出部（ＰＯＷ）１１１
ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、エ
コー量推定部（ＥＬＥ）１１１ｈから出力されたスムージングしたエコー量｜Ｙ_Ｓ［ｆ，
ω］｜^２と、ゲイン格納部（ＧＴＢＬ）１１１ｍから出力されたパラメータγ［ω］と、
エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力されたエコー抑圧量ＥＣＬ［ｆ，ω］
と、制御部（ＣＴＲＬ）１１１ｋから出力されたダブルトーク状態かつエコーキャンセラ
部（ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域であるか否かの情報である制御
情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］とを入力とし、エコー抑圧ゲインＧ［ｆ，ω］を算
出して出力する。 The echo suppression gain calculation unit (GCAL) 111n is a transmission power calculation unit (POW) 111.
The smoothed transmission power spectrum | Z _S [f, ω] | ² output from e and the smoothed echo amount output from the echo amount estimation unit (ELE) 111 h | Y _S [f,
ω] | ² and the parameter γ [ω] output from the gain storage unit (GTBL) 111m;
Echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 111i
Control information ERcontrol2 [f, ω], which is information indicating whether the frequency range is a double talk state output from the control unit (CTRL) 111k and the echo suppression amount of the echo canceller unit (EC) 110 is insufficient. Is input and the echo suppression gain G [f, ω] is calculated and output.

具体的には、制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］からダブルトーク状態かつエコ
ーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分でないと検出された周波数帯域につ
いては、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、エコー抑圧ゲインＧ［ｆ，ω
］をウィナー・フィルタ（Wiener Filter）法を用いて、式１０によって算出する。この
ように算出することで、ダブルトーク状態かつエコーキャンセラ部（ＥＣ）１１０のエコ
ー抑圧量が十分でないと検出された周波数帯域については、エコーリダクション部（ＥＲ
）１１１単体でエコー抑圧することができる。

Specifically, for a frequency band detected from the control information ERcontrol2 [f, ω] in a double talk state and the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient, an echo suppression gain calculation unit (GCAL) 111n Is the echo suppression gain G [f, ω
] Is calculated by Equation 10 using the Wiener Filter method. By calculating in this way, for the frequency band detected when the echo suppression amount of the double talk state and the echo canceller unit (EC) 110 is not sufficient, the echo reduction unit (ER
) The echo can be suppressed by 111 alone.

それ以外の周波数帯域では、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、エコー
キャンセラ部（ＥＣ）１１０のエコー抑圧量が十分であり、エコーキャンセラ部（ＥＣ）
１１０が正常に機能していると見なして、エコーリダクション（ＥＲ）１１１のエコー抑
圧量を多くするように、エコーキャンセラ部（ＥＣ）１１０のエコー抑圧量ＥＣＬ［ｆ，
ω］を用いて補正し、エコー抑圧ゲインＧ［ｆ，ω］を以下の式１１が示すように算出す
る。 In other frequency bands, the echo suppression gain calculation unit (GCAL) 111n has a sufficient echo suppression amount of the echo canceller unit (EC) 110, and the echo canceller unit (EC).
110, the echo suppression amount ECL [f, f of the echo canceller unit (EC) 110 is increased so that the echo suppression amount of the echo reduction (ER) 111 is increased.
is corrected using [omega]], and the echo suppression gain G [f, [omega]] is calculated as shown in Equation 11 below.

このように算出することで、後述する第５の実施形態に係る信号処理装置と比較して、
計算量を少なくすることができ、各パラメータの格納に必要なメモリ量も少なくすること
ができる。

By calculating in this way, compared with a signal processing apparatus according to a fifth embodiment to be described later,
The amount of calculation can be reduced, and the amount of memory required for storing each parameter can also be reduced.

また、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、過剰なエコー抑圧により送話
音声の品質が劣化するのを防止し、背景雑音の断続的な抑圧を防止するため、エコー抑圧
ゲインＧ［ｆ，ω］が所定の下限値以下にならないように制御する。 Further, the echo suppression gain calculation unit (GCAL) 111n prevents the quality of the transmitted voice from being deteriorated due to excessive echo suppression, and prevents the background noise from being intermittently suppressed, so that the echo suppression gain G [f, Control is performed so that ω] does not fall below a predetermined lower limit.

図４は、その下限値の一例を示す。図４の実線、破線が示すように、エコー抑圧ゲイン
の下限値を帯域別にバスタブ曲線状、即ち、最低域（０からｆ_Ｓ／１６［Ｈｚ］）側及び
最高域（７・ｆ_Ｓ／１６からｆ_Ｓ／２［Ｈｚ］）側を０．０４から０．１２程度に大きく
し、中域（ｆ_Ｓ／８から３・ｆ_Ｓ／８［Ｈｚ］）付近を０．０１から０．０５程度に小さ
くするように設定する。なぜなら、最低域側や最高域側に抑圧できなかったエコー成分が
残ると耳障りであるので、そのエコー成分を抑圧しやすくすることができて送話音声を聴
感上よくすることができる。勿論、図４の点線が示すようにバスタブ曲線を段階的に変化
する一定値として設定してもかまわない。 FIG. 4 shows an example of the lower limit value. As indicated by the solid and broken lines in FIG. 4, the lower limit value of the echo suppression gain is bathtub-shaped for each band, that is, the lowest range (0 to f _S / 16 [Hz]) side and the highest range (7 · f _S / 16 To f _S / 2 [Hz]) side is increased from 0.04 to 0.12, and the vicinity of the middle range (f _S / 8 to 3 · f _S / 8 [Hz]) is 0.01 to 0.05. Set to be as small as possible. This is because it is harsh if an echo component that could not be suppressed remains on the lowest side or the highest side, so that the echo component can be easily suppressed and the transmitted voice can be improved in hearing. Of course, as shown by the dotted line in FIG. 4, the bathtub curve may be set as a constant value that changes stepwise.

さらに、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、過剰なエコー抑圧により送
話音声の品質が劣化するのを防止するため、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［
ｎ］の有音区間ではない区間の信号レベルを用いて周波数帯域ごとに背景雑音レベルを算
出しておき、背景雑音レベルよりも抑圧しないようにエコー抑圧ゲインを制御しても構わ
ない。 Further, the echo suppression gain calculation unit (GCAL) 111n prevents the transmission voice quality from deteriorating due to excessive echo suppression, so that the transmission input signal z [n] or the residual signal e [
n], a background noise level may be calculated for each frequency band using a signal level of a section that is not a voiced section, and the echo suppression gain may be controlled so as not to suppress the background noise level.

さらにまた、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、送話音質が劣化するの
を防止するために、エコー抑圧ゲインＧ［ｆ，ω］を以下の式１２−１あるいは式１２−
２が示すように周波数方向にスムージングして出力してもよい。例えば、ε_ｊは、［０．
１、０．２、０．４、０．２、０．１］、η_ｊは、［０．１、０．２、０．４、０．８、
０．４、０．２、０．１］のようにしてよい。

Furthermore, the echo suppression gain calculation unit (GCAL) 111n sets the echo suppression gain G [f, ω] to the following expression 12-1 or expression 12- in order to prevent the transmission sound quality from deteriorating.
As indicated by 2, the output may be smoothed in the frequency direction. For example, ε _j is [0.
1, 0.2, 0.4, 0.2, 0.1], η _j is [0.1, 0.2, 0.4, 0.8,
0.4, 0.2, 0.1].

信号抑圧部（ＳＳ）１１１ｏは、スペクトル選択部１１１Ｌから出力された周波数スペ
クトルと、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎから出力されたエコー抑圧ゲイ
ンＧ［ｆ，ω］とを入力として、スペクトル選択部１１１Ｌから出力された周波数スペク
トルのエコーを抑圧し、以下の式１３−１あるいは式１３−２が示すように送話出力信号
のスペクトルＳ’［ｆ，ω］として出力する。

The signal suppression unit (SS) 111o receives the frequency spectrum output from the spectrum selection unit 111L and the echo suppression gain G [f, ω] output from the echo suppression gain calculation unit (GCAL) 111n as input. The echo of the frequency spectrum output from the unit 111L is suppressed and output as the spectrum S ′ [f, ω] of the transmission output signal as shown in the following expression 13-1 or expression 13-2.

このとき、送話入力信号の周波数スペクトルＺ［ｆ，ω］と残差信号の周波数スペクト
ルＥ［ｆ，ω］とのどちらの周波数スペクトルがスペクトル選択部１１１Ｌで選択されて
も、エコーキャンセラ部（ＥＣ）１１０は時間領域での処理であるから、位相スペクトル
に差がないため、出力された送話出力信号は聴感上滑らかにつなぐことができる。 At this time, even if any frequency spectrum of the frequency spectrum Z [f, ω] of the transmission input signal or the frequency spectrum E [f, ω] of the residual signal is selected by the spectrum selection unit 111L, the echo canceller unit ( Since EC) 110 is processing in the time domain, there is no difference in the phase spectrum, so that the output transmission output signal can be smoothly connected in terms of audibility.

制御部（ＣＴＲＬ）１１１ｋからスペクトル選択部１１１Ｌへ出力される制御情報ＥＲ
ｃｏｎｔｒｏｌ１［ｆ，ω］と、制御部（ＣＴＲＬ）１１１ｋからエコー抑圧ゲイン算出
部（ＧＣＡＬ）１１１ｎへ出力される制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］とは異な
っていても構わない。例えば、制御部（ＣＴＲＬ）１１１ｋのパラメータβ_Ｚ１［ω］及
びβ_Ｃ１の値とβ_Ｚ２［ω］及びβ_Ｃ２を異なる値にすることで、制御情報ＥＲｃｏｎｔ
ｒｏｌ１［ｆ，ω］のダブルトーク状態かつエコー抑圧量が十分でないと検出される条件
を包含するように、制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］のダブルトーク状態かつエ
コー抑圧量が十分でないと検出される条件を広く設定する。こうするに、２つの制御情報
を用いることによって、エコー抑圧ゲインをスムージングした場合に、送話音声の品質劣
化を防止できる。 Control information ER output from the control unit (CTRL) 111k to the spectrum selection unit 111L
The control 1 [f, ω] may be different from the control information ERcontrol 2 [f, ω] output from the control unit (CTRL) 111k to the echo suppression gain calculation unit (GCAL) 111n. For example, the values of the parameters β _Z1 [ω] and β _C1 and β _Z2 [ω] and β _C2 of the control unit (CTRL) 111k are set to different values, so that the control information ERcont
It is detected that the double talk state of control information ERcontrol2 [f, ω] and the amount of echo suppression are not sufficient so as to include the condition that the double talk state of roll1 [f, ω] is detected and the amount of echo suppression is not sufficient. Set a wide range of conditions. In this way, by using two pieces of control information, when the echo suppression gain is smoothed, it is possible to prevent deterioration of the quality of the transmitted voice.

周波数領域逆変換処理部（ＩＦＴ）１１１ｐは、信号抑圧部（ＳＳ）１１１ｏから出力
された周波数スペクトルＳ’［ｆ，ω］を入力とし、ＩＦＦＴ（Inverse Fast Fourier T
ransform）などによって送話出力信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を算出
して出力する。このとき適宜、周波数領域変換処理部（ＦＴ）１１１ｂ及び周波数領域変
換処理部（ＦＴ）１１１ｃの窓掛けを考慮して、過去のフレームのｓ’［ｎ］を用いてオ
ーバーラップを戻す処理を行う。 The frequency domain inverse transform processing unit (IFT) 111p receives the frequency spectrum S ′ [f, ω] output from the signal suppression unit (SS) 111o as an input, and performs IFFT (Inverse Fast Fourier T).
ransform) or the like to calculate and output a transmission output signal s ′ [n] (n = 0, 1,..., N−1). At this time, considering the windowing of the frequency domain transform processing unit (FT) 111b and the frequency domain transform processing unit (FT) 111c as appropriate, a process of returning overlap using s ′ [n] of the past frame is performed. .

上記のように構成された、第１の実施形態に係る信号処理装置の処理の流れを、図５〜
図７を参照して説明する。図５は、全体の処理の流れを示すフローチャートであり、図６
はエコーキャンセラ部（ＥＣ）１１０における処理の流れを示すフローチャートであり、
図７はエコーリダクション部（ＥＲ）１１１における処理の流れを示すフローチャートで
ある。 The processing flow of the signal processing apparatus according to the first embodiment configured as described above is shown in FIGS.
This will be described with reference to FIG. FIG. 5 is a flowchart showing the overall processing flow.
Is a flowchart showing the flow of processing in the echo canceller (EC) 110,
FIG. 7 is a flowchart showing the flow of processing in the echo reduction unit (ER) 111.

図５において、発呼又は着呼があると、通信部（ＣＯＭ）１０１は通信リンクを確立す
る処理を行い、また各パラメータや各バッファの初期化などの初期設定処理を行う（ステ
ップＳ１００１）。通信リンクが確立することにより、通信相手と双方向通話が可能な状
態となり、双方向の通話が開始されると、通信部（ＣＯＭ）１０１にあり図示されないデ
コーダは１フレームごとに復号化され受話入力信号ｘ［ｎ］として読み込む。また、マイ
クロホン１０６を介して送話入力信号ｚ［ｎ］が読み込まれる（ステップＳ１００２）。 In FIG. 5, when there is an outgoing call or an incoming call, the communication unit (COM) 101 performs processing for establishing a communication link, and performs initial setting processing such as initialization of each parameter and each buffer (step S1001). When the communication link is established, it becomes possible to make a two-way call with the communication partner. When the two-way call is started, a decoder (not shown) in the communication unit (COM) 101 decodes every frame and receives the call. Read as input signal x [n]. Further, the transmission input signal z [n] is read through the microphone 106 (step S1002).

そして、ハイパスフィルタ部（ＨＰＦ）１０９は、送話入力信号ｚ［ｎ］のオフセット
除去処理を行う（ステップＳ１００３）。また、遅延処理部（ＤＥＬＡＹ）１０２は受話
入力信号ｘ［ｎ］を一時的に蓄えて遅延させる処理を行う（ステップＳ１００４）。これ
ら遅延した受話入力信号ｘ［ｎ−Ｄ］とオフセット除去された送話入力信号ｚ［ｎ］を入
力として、エコーキャンセラ部（ＥＣ）１１０はエコーキャンセラ処理を行う（ステップ
Ｓ１００５）。 Then, the high pass filter unit (HPF) 109 performs an offset removal process on the transmission input signal z [n] (step S1003). The delay processing unit (DELAY) 102 performs processing for temporarily storing and delaying the received input signal x [n] (step S1004). The echo canceller unit (EC) 110 performs an echo canceller process using the delayed received input signal x [n−D] and the transmission input signal z [n] from which the offset is removed as input (step S1005).

そして、遅延した受話入力信号ｘ［ｎ−Ｄ］と、オフセット除去された送話入力信号ｚ
［ｎ］と、エコーキャンセラ部（ＥＣ）１１０から出力されたエラーキャンセラ処理後の
信号である残差信号ｅ［ｎ］を入力として、エコーリダクション部（ＥＲ）１１１は非線
形エコー抑圧処理であるエコーリダクション処理を行う（ステップＳ１００６）。そして
、ステップＳ１００２からステップＳ１００６の処理を、通話が終了するまで行う（ステ
ップＳ１００７）。 Then, the delayed reception input signal x [n−D] and the transmission input signal z with the offset removed.
[N] and the residual signal e [n], which is the signal after error canceller processing output from the echo canceller (EC) 110, are input, and the echo reduction unit (ER) 111 is an echo that is nonlinear echo suppression processing. Reduction processing is performed (step S1006). Then, the processing from step S1002 to step S1006 is performed until the call is finished (step S1007).

図６に示すエコーキャンセラ部（ＥＣ）１１０の処理は、次のように行われる。まず、
ダブルトーク検出部（ＤＴＤ）１１０ｃがダブルトーク検出処理を行う（ステップＳ１１
０１）。次に、適応フィルタ部（ＡＤＦ）１１０ａはダブルトーク情報ＥＣｓｔａｔｅ［
ｎ］の制御を受けながら、適応フィルタ処理を行う（ステップＳ１１０２）。そして、信
号減算処理部１１０ｂは、送話入力信号ｚ［ｎ］から、適応フィルタ部（ＡＤＦ）１１０
ａから出力された擬似エコー信号ｙ’［ｎ］を減算し、残差信号ｅ［ｎ］を算出して出力
し、エコーキャンセラ処理が終了する。 The processing of the echo canceller (EC) 110 shown in FIG. 6 is performed as follows. First,
The double talk detection unit (DTD) 110c performs a double talk detection process (step S11).
01). Next, the adaptive filter unit (ADF) 110a double-talk information ECstate [
n], adaptive filter processing is performed (step S1102). Then, the signal subtraction processing unit 110b generates an adaptive filter unit (ADF) 110 from the transmission input signal z [n].
The pseudo echo signal y ′ [n] output from “a” is subtracted to calculate and output the residual signal e [n], and the echo canceller processing ends.

図７に示すエコーリダクション部（ＥＲ）１１１の処理は、次のように行われる。まず
、周波数領域変換処理部（ＦＴ）１１１ａ、周波数領域変換処理部（ＦＴ）１１１ｂ、周
波数領域変換処理部（ＦＴ）１１１ｃは、それぞれ周波数領域に変換するためのバッファ
である受話フレーム、送話フレーム、残差フレームを更新する（ステップＳ１２０１ｒ、
Ｓ１２０１ｓ、Ｓ１２０１ｅ）。 The processing of the echo reduction unit (ER) 111 shown in FIG. 7 is performed as follows. First, the frequency domain transform processing unit (FT) 111a, the frequency domain transform processing unit (FT) 111b, and the frequency domain transform processing unit (FT) 111c are respectively a reception frame and a transmission frame that are buffers for conversion to the frequency domain. , Update the residual frame (step S1201r,
S1201s, S1201e).

次に、周波数領域変換処理部（ＦＴ）１１１ａは遅延した受話入力信号ｘ［ｎ−Ｄ］を
周波数領域に変換して、受話入力信号の周波数スペクトルＸ［ｆ，ω］を算出して（ステ
ップＳ１２０２ｒ）、受話パワー算出部（ＰＯＷ）１１１ｄは受話パワースペクトル｜Ｘ
［ｆ，ω］｜^２及びスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２を算
出する（ステップＳ１２０３ｒ）。 Next, the frequency domain transform processing unit (FT) 111a converts the delayed received input signal x [n-D] into the frequency domain and calculates the frequency spectrum X [f, ω] of the received input signal (step). S1202r), the received power calculation unit (POW) 111d receives the received power spectrum | X
[F, ω] | ² and the smoothed received power spectrum | X _S [f, ω] | ² are calculated (step S1203r).

同様に、周波数領域変換処理部（ＦＴ）１１１ｂは送話入力信号ｚ［ｎ］を周波数領域
に変換して、送話入力信号の周波数スペクトルＺ［ｆ，ω］を算出して（ステップＳ１２
０２ｓ）、送話パワー算出部（ＰＯＷ）１１１ｅは送話パワースペクトル｜Ｚ［ｆ，ω］
｜^２及びスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２を算出する（ス
テップＳ１２０３ｓ）。 Similarly, the frequency domain transform processing unit (FT) 111b converts the transmission input signal z [n] into the frequency domain and calculates the frequency spectrum Z [f, ω] of the transmission input signal (step S12).
02s), the transmission power calculation unit (POW) 111e transmits the transmission power spectrum | Z [f, ω].
| ² and the smoothed transmission power spectrum | Z _S [f, ω] | ² are calculated (step S1203s).

さらに同様に、周波数領域変換処理部（ＦＴ）１１１ｃは残差信号ｅ［ｎ］を周波数領
域に変換して、残差信号の周波数スペクトルＥ［ｆ，ω］を算出して（ステップＳ１２０
２ｅ）、残差パワー算出部（ＰＯＷ）１１１ｆは残差パワースペクトル｜Ｅ［ｆ，ω］｜
^２及びスムージングされた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２を算出する（ステ
ップＳ１２０３ｅ）。 Similarly, the frequency domain transform processing unit (FT) 111c transforms the residual signal e [n] into the frequency domain and calculates the frequency spectrum E [f, ω] of the residual signal (step S120).
2e), the residual power calculation unit (POW) 111f receives the residual power spectrum | E [f, ω] |
² and the smoothed residual power spectrum | E _S [f, ω] | ² are calculated (step S1203e).

そして、周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊは周波数領域ダブルトー
ク情報ＥＲｓｔａｔｅ［ｆ，ω］を出力し、音響結合量推定部（ＡＣＬＥ）１１１ｇは、
スムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とスムージングされた送
話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、周波数領域ダブルトーク情報ＥＲｓｔａｔ
ｅ［ｆ，ω］とを入力として音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出する（ステップＳ１
２０４）。エコー量推定部（ＥＬＥ）１１１ｈは、音響結合量｜Ｈ_s［ｆ，ω］｜^２とス
ムージングした受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とを入力として送話入力信号
に含まれるエコー量｜Ｙ_Ｓ［ｆ，ω］｜^２を推定する（ステップＳ１２０５）。 The frequency domain double talk detector (FDTD) 111j outputs frequency domain double talk information ERstate [f, ω], and the acoustic coupling amount estimator (ACLE) 111g
Smoothed reception power spectrum | X _S [f, ω] | ² and smoothed transmission power spectrum | Z _S [f, ω] | ² and frequency domain double talk information ERstat
The acoustic coupling amount | H _S [f, ω] | ² is calculated using e [f, ω] as an input (step S 1
204). Echo amount estimating unit (ELE) 111h is acoustic coupling amount | contained in ² and transmission input signal as an input _{^{| H s [f, ω]}} | 2 and smoothing the received power spectrum _{| X} S [f, _ω] The echo amount | Y _S [f, ω] | ² is estimated (step S1205).

次に、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉは、送話パワースペクトル｜Ｚ［ｆ，
ω］｜^２と残差パワースペクトル｜Ｅ［ｆ，ω］｜^２とを入力として、エコーキャンセラ
部（ＥＣ）１１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］を推定する（ステップＳ１
２０６）。そして、制御部（ＣＴＲＬ）１１１ｋは、ダブルトーク状態であるか否か、及
びエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分であるか否かの判定処理を行
い、制御情報ＥＲｃｏｎｔｒｏｌ１［ｆ，ω］及びＥＲｃｏｎｔｒｏｌ２［ｆ，ω］を出
力する（ステップＳ１２０７）。 Next, the echo suppression amount estimation unit (ECLE) 111i transmits the transmission power spectrum | Z [f,
ω] | ² and residual power spectrum | E [f, ω] | ² are input, and the echo suppression amount ECL [f, ω] suppressed by the echo canceller unit (EC) 110 is estimated (step S1).
206). Then, the control unit (CTRL) 111k determines whether or not the state is a double talk state and whether or not the echo suppression amount of the echo canceller unit (EC) 110 is sufficient, and the control information ERcontrol1 [f, ω] and ERcontrol2 [f, ω] are output (step S1207).

エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎは、送話パワー算出部（ＰＯＷ）１１１
ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、エ
コー量推定部（ＥＬＥ）１１１ｈから出力されたスムージングしたエコー量｜Ｙ_Ｓ［ｆ，
ω］｜^２と、ゲイン格納部（ＧＴＢＬ）１１１ｍから出力されたパラメータγ［ω］と、
エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力されたエコー抑圧量ＥＣＬ［ｆ，ω］
と、制御部（ＣＴＲＬ）１１１ｋから出力された制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω
］とを入力として、ダブルトーク状態かつエコー抑圧量が十分でないと検出された周波数
帯域と、それ以外の周波数帯域とで、それぞれ異なる計算によってエコー抑圧ゲインＧ［
ｆ，ω］を算出する。また、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎはエコー抑圧
ゲインＧ［ｆ，ω］を所定の下限値以下にならないように制御する（ステップＳ１２０８
）。 The echo suppression gain calculation unit (GCAL) 111n is a transmission power calculation unit (POW) 111.
The smoothed transmission power spectrum | Z _S [f, ω] | ² output from e and the smoothed echo amount output from the echo amount estimation unit (ELE) 111 h | Y _S [f,
ω] | ² and the parameter γ [ω] output from the gain storage unit (GTBL) 111m;
Echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 111i
And control information ERcontrol2 [f, ω output from the control unit (CTRL) 111k
] In the double talk state and the frequency band detected when the amount of echo suppression is not sufficient, and the other frequency band, the echo suppression gain G [
f, ω] is calculated. Further, the echo suppression gain calculation unit (GCAL) 111n controls the echo suppression gain G [f, ω] so as not to be equal to or lower than a predetermined lower limit value (step S1208).
).

一方、スペクトル選択部１１１Ｌは、制御部（ＣＴＲＬ）１１１ｋから出力された制御
情報ＥＲｃｏｎｔｒｏｌ１［ｆ，ω］を入力として、ダブルトーク状態かつエコー抑圧量
が十分でないと検出された周波数帯域では、周波数スペクトルとして送話入力信号の周波
数スペクトルＺ［ｆ，ω］を選択し、それ以外の周波数帯域では周波数スペクトルとして
残差信号の周波数スペクトルＥ［ｆ，ω］を選択する（ステップＳ１２０９）。 On the other hand, the spectrum selection unit 111L receives the control information ERcontrol1 [f, ω] output from the control unit (CTRL) 111k as an input, and in the frequency band detected that the echo suppression amount is not sufficient in the double talk state, the frequency spectrum Is selected as the frequency spectrum Z [f, ω] of the transmission input signal, and the frequency spectrum E [f, ω] of the residual signal is selected as the frequency spectrum in other frequency bands (step S1209).

そして、信号抑圧部（ＳＳ）１１１ｏは、スペクトル選択部１１１Ｌで選択された周波
数スペクトルと、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎで算出されたエコー抑圧
ゲインＧ［ｆ，ω］を入力として、選択された周波数スペクトルのエコーを抑圧する（ス
テップＳ１２１０）。最終的に、周波数領域逆変換処理部（ＩＦＴ）１１１ｐは、信号抑
圧部（ＳＳ）１１１ｏから出力された周波数スペクトルＳ’［ｆ，ω］を周波数逆変換処
理することによって（ステップＳ１２１１）、エコーリダクション処理が終了する。 Then, the signal suppression unit (SS) 111o selects the frequency spectrum selected by the spectrum selection unit 111L and the echo suppression gain G [f, ω] calculated by the echo suppression gain calculation unit (GCAL) 111n as inputs. The echo of the frequency spectrum is suppressed (step S1210). Finally, the frequency domain inverse transform processing unit (IFT) 111p performs frequency inverse transform processing on the frequency spectrum S ′ [f, ω] output from the signal suppression unit (SS) 111o (step S1211), thereby performing echo. The reduction process ends.

このようにしてエコーリダクション部（ＥＲ）１１１から出力された送話出力信号ｓ’
［ｎ］は、通信部（ＣＯＭ）１０１にあり図示しないエンコーダによって１フレームごと
に符号化され、この符号化によって得られたデータは、通信部（ＣＯＭ）１０１を通じて
、送信データとして通信相手に送信される。 The transmission output signal s ′ output from the echo reduction unit (ER) 111 in this way.
[N] is encoded in each frame by an encoder (not shown) in the communication unit (COM) 101, and data obtained by this encoding is transmitted to the communication partner as transmission data through the communication unit (COM) 101. Is done.

上記では、エコーリダクション部（ＥＲ）１１１はＦＦＴによる周波数領域型で周波数
帯域毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域をグ
ループでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯域分
割フィルタなどの周波数領域型を実現してもよい。 In the above description, the echo reduction unit (ER) 111 is described as operating as a method of processing for each frequency band in the frequency domain type by FFT. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

なお、ダブルトーク中のエコーパスロス変動時では、エコーキャンセラ部（ＥＣ）１１
０の処理によって送話入力信号ｚ［ｎ］よりも残差信号ｅ［ｎ］が大きくなってしまい、
エコーキャンセラ部（ＥＣ）１１０の処理によるエコー抑圧量が十分でない場合がある。 When echo path loss fluctuates during double talk, the echo canceller (EC) 11
Due to the processing of 0, the residual signal e [n] becomes larger than the transmission input signal z [n],
In some cases, the amount of echo suppression by the processing of the echo canceller unit (EC) 110 is not sufficient.

その場合、以上説明した信号処理装置の動作により、エコーキャンセラ部（ＥＣ）１１
０の処理によるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（ＥＣＬＥ）
１１１ｉと制御部（ＣＴＲＬ）１１１ｋで判定して、スペクトル選択部１１１Ｌとエコー
抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎにおいて周波数帯域毎にエコーキャンセラ部（Ｅ
Ｃ）１１０の処理とエコーリダクション部（ＥＲ）１１１の処理の重みを変化させるよう
に制御することができるので、エコーパスロス変動に対して頑健にすることが可能であり
、高品質な送話出力信号を出力することが可能である。 In this case, the echo canceller (EC) 11 is operated by the operation of the signal processing apparatus described above.
Echo suppression amount estimation unit (ECLE) in a frequency region where the echo suppression amount due to processing of 0 is not sufficient
111i and the control unit (CTRL) 111k determine, and the spectrum selection unit 111L and the echo suppression gain calculation unit (GCAL) 111n perform echo canceller units (E
C) Since it is possible to control to change the weight of the processing of 110 and the processing of the echo reduction unit (ER) 111, it is possible to make robust against the echo path loss fluctuation, and high quality transmission output It is possible to output a signal.

（第１の実施形態の変形例）
本発明の第１の実施形態の変形例に係る信号処理装置が第１の実施形態に係る信号処理
装置と異なる点は、エコーリダクション部（ＥＲ）１１１に代えて、エコーリダクション
部（ＥＲ）１１１２を有する点である。図８に、本発明の第１の実施形態の変形例に係る
信号処理装置のエコーリダクション部（ＥＲ）１１１２の構成を示すブロック図を示す。 (Modification of the first embodiment)
The signal processing device according to the modification of the first embodiment of the present invention is different from the signal processing device according to the first embodiment in that an echo reduction unit (ER) 1112 is used instead of the echo reduction unit (ER) 111. It is a point which has. FIG. 8 is a block diagram showing the configuration of the echo reduction unit (ER) 1112 of the signal processing device according to the modification of the first embodiment of the present invention.

エコーリダクション部（ＥＲ）１１１２がエコーリダクション部（ＥＲ）１１１と異な
る点は、音響結合量推定部（ＡＣＬＥ）１１１ｇに代えて音響結合量推定部（ＡＣＬＥ）
１１１ｇ２を有し、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎに代えてエコー抑圧ゲ
イン算出部（ＧＣＡＬ）１１１ｎ２を有する点にある。 The echo reduction unit (ER) 1112 is different from the echo reduction unit (ER) 111 in that an acoustic coupling amount estimation unit (ACLE) is used instead of the acoustic coupling amount estimation unit (ACLE) 111g.
111g2 and an echo suppression gain calculation unit (GCAL) 111n2 instead of the echo suppression gain calculation unit (GCAL) 111n.

更に、エコーリダクション部（ＥＲ）１１１２において、送話パワー算出部（ＰＯＷ）
１１１ｅの出力をエコー抑圧量推定部（ＥＣＬＥ）１１１ｉのみに入力させ、残差パワー
算出部（ＰＯＷ）１１１ｆの出力をエコー抑圧量推定部（ＥＣＬＥ）１１１ｉと周波数領
域ダブルトーク検出部（ＦＤＴＤ）１１１ｊに加えて、音響結合量推定部（ＡＣＬＥ）１
１１ｇ２とエコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎ２にも入力させた点にあり、そ
の他の部分は同じである。そこで、このエコーリダクション部（ＥＲ）１１１２で、エコ
ーリダクション部（ＥＲ）１１１と同じ部分については、同じ符号を付してその説明を省
略する。 Further, in the echo reduction unit (ER) 1112, the transmission power calculation unit (POW)
The output of 111e is input only to the echo suppression amount estimation unit (ECLE) 111i, and the output of the residual power calculation unit (POW) 111f is input to the echo suppression amount estimation unit (ECLE) 111i and the frequency domain double talk detection unit (FDTD) 111j. In addition to the acoustic coupling amount estimation unit (ACLE) 1
11g2 and the echo suppression gain calculation unit (GCAL) 111n2 are input, and the other parts are the same. Therefore, in the echo reduction unit (ER) 1112, the same parts as those of the echo reduction unit (ER) 111 are denoted by the same reference numerals and description thereof is omitted.

音響結合量推定部（ＡＣＬＥ）１１１ｇ２は、受話パワー算出部（ＰＯＷ）１１１ｄか
ら出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、残差パ
ワー算出部（ＰＯＷ）１１１ｆから出力されたスムージングされた残差パワースペクトル
｜Ｅ_Ｓ［ｆ，ω］｜^２と、周波数領域ダブルトーク検出部（ＦＤＴＤ）１１１ｊから出力
される周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］とを入力とし、周波数帯域
ω毎に音響結合量｜Ｈ［ｆ，ω］｜^２を、｜Ｅ_Ｓ［ｆ，ω］｜^２を用いて以下の式１４に
よって算出する。

The acoustic coupling amount estimation unit (ACLE) 111g2 includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d, and the residual power calculation unit (POW) 111f. And the smoothed residual power spectrum | E _S [f, ω] | ² output from the frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detector (FDTD) 111j. As an input, for each frequency band ω, the acoustic coupling amount | H [f, ω] | ² is calculated by the following expression 14 using | E _S [f, ω] | ² .

そして、音響結合量推定部（ＡＣＬＥ）１１１ｇ２は、以下の式１５のように１フレー
ム前の値を用いてスムージングした音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出して出力する
。ただし、α_Ｈ［ω］は、０．０３〜０．９９程度が望ましい。

Then, the acoustic coupling amount estimation unit (ACLE) 111g2 calculates and outputs a smoothed acoustic coupling amount | H _S [f, ω] | ² using a value one frame before as in the following Expression 15. However, α _H [ω] is preferably about 0.03 to 0.99.

エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎ２は、残差パワー算出部（ＰＯＷ）１１
１ｆから出力されたスムージングされた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２と、
エコー量推定部（ＥＬＥ）１１１ｈから出力されたスムージングしたエコー量｜Ｙ_Ｓ［ｆ
，ω］｜^２と、ゲイン格納部（ＧＴＢＬ）１１１ｍから出力されたパラメータγ［ω］と
、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力されたエコー抑圧量ＥＣＬ［ｆ，ω
］と、制御部（ＣＴＲＬ）１１１ｋから出力されたダブルトーク状態かつエコーキャンセ
ラ部（ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域であるか否かの情報である制
御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］とを入力とし、エコー抑圧ゲインＧ［ｆ，ω］を
算出して出力する。 The echo suppression gain calculation unit (GCAL) 111n2 is a residual power calculation unit (POW) 11
The smoothed residual power spectrum | E _S [f, ω] | ² output from If
Smoothed echo amount output from the echo amount estimation unit (ELE) 111h | Y _S [f
, Ω] | ² , the parameter γ [ω] output from the gain storage unit (GTBL) 111m, and the echo suppression amount ECL [f, ω output from the echo suppression amount estimation unit (ECLE) 111i.
] And control information ERcontrol2 [f, ω] which is information indicating whether the frequency range is a double talk state output from the control unit (CTRL) 111k and the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient. Are input and the echo suppression gain G [f, ω] is calculated and output.

具体的には、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎ２は、制御部（ＣＴＲＬ）
１１１ｋから出力された制御情報ＥＲｃｏｎｔｒｏｌ２［ｆ，ω］からダブルトーク状態
かつエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分でないと検出された周波数
帯域については、エコーキャンセラ部（ＥＣ）１１０のエコー抑圧量の分を増した抑圧量
をエコーリダクション（ＥＲ）１１１２のエコー抑圧量にするため、エコー抑圧ゲインＧ
［ｆ，ω］をエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量ＥＣＬ［ｆ，ω］を用い
て抑圧量を補正し以下の式１６のように算出する。

Specifically, the echo suppression gain calculation unit (GCAL) 111n2 is a control unit (CTRL).
For the frequency band detected from the control information ERcontrol2 [f, ω] output from 111k in the double talk state and the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient, the echo of the echo canceller unit (EC) 110 is echoed. In order to use the suppression amount obtained by increasing the suppression amount as the echo suppression amount of the echo reduction (ER) 1112, the echo suppression gain G
[F, ω] is calculated by using the echo suppression amount ECL [f, ω] of the echo canceller unit (EC) 110 to correct the suppression amount, as shown in Equation 16 below.

それ以外の周波数帯域では、エコー抑圧ゲイン算出部（ＧＣＡＬ）１１１ｎ２は、エコ
ー抑圧ゲインＧ［ｆ，ω］をウィナー・フィルタ（Wiener Filter）法を用いて、式１７
−１によって算出する。あるいは、エコーキャンセラ部（ＥＣ）１１０が正常に機能して
いると見なして、エコーリダクション（ＥＲ）１１１のエコー抑圧量を多くするように、
エコーキャンセラ部（ＥＣ）１１０のエコー抑圧量ＥＣＬ［ｆ，ω］を用いて補正し、エ
コー抑圧ゲインＧ［ｆ，ω］を以下の式１７−２が示すように算出する。

In other frequency bands, the echo suppression gain calculation unit (GCAL) 111n2 uses the Wiener Filter method to express the echo suppression gain G [f, ω] using Equation 17
Calculated by -1. Alternatively, assuming that the echo canceller (EC) 110 is functioning normally, the echo suppression amount of the echo reduction (ER) 111 is increased.
Correction is performed using the echo suppression amount ECL [f, ω] of the echo canceller (EC) 110, and the echo suppression gain G [f, ω] is calculated as shown in the following Expression 17-2.

以上説明した第１の実施形態の変形例に係る信号処理装置の動作により、エコーパス変
動はほとんどなく、エコーキャンセラ部（ＥＣ）１１０の抑圧性能が安定していることが
多い用途では、エコーキャンセラ部（ＥＣ）１１０とエコーリダクション（ＥＲ）１１１
２の直列接続が選択されやすくなるので、過剰なエコー抑圧を防ぐことができ、送話音質
が劣化するのを防止することが可能である。 The echo canceller unit is used in applications in which the echo path fluctuation hardly occurs and the suppression performance of the echo canceller unit (EC) 110 is often stable due to the operation of the signal processing device according to the modification of the first embodiment described above. (EC) 110 and echo reduction (ER) 111
Since it becomes easy to select the serial connection of 2, it is possible to prevent excessive echo suppression and to prevent the transmission sound quality from deteriorating.

（第２の実施形態）
第２の実施形態に係る信号処理装置が第１の実施形態に係る信号処理装置と異なる点は
、図９に示すようにエコーリダクション部（ＥＲ）１１１を有さず、ノイズリダクション
部（ＮＲ）２１１を有する点にあり、その他の部分は同じである。そこで、同じ部分につ
いては、同じ符号を付してその説明を省略し、図面を参照して第２の実施形態に係るノイ
ズリダクション部（ＮＲ）２１１を説明する。なお、このノイズリダクション部（ＮＲ）
２１１で、第１の実施形態に係るエコーリダクション部（ＥＲ）１１１と同じ部分につい
ては、同じ符号を付してその説明を省略する。 (Second Embodiment)
The signal processing apparatus according to the second embodiment is different from the signal processing apparatus according to the first embodiment in that it does not have the echo reduction unit (ER) 111 as shown in FIG. 9, but the noise reduction unit (NR). The other parts are the same. Therefore, the same parts are denoted by the same reference numerals and description thereof is omitted, and the noise reduction unit (NR) 211 according to the second embodiment will be described with reference to the drawings. This noise reduction part (NR)
In FIG. 21, the same parts as those of the echo reduction unit (ER) 111 according to the first embodiment are denoted by the same reference numerals and description thereof is omitted.

図１０は、第２の実施形態に係る信号処理装置のノイズリダクション部（ＮＲ）２１１
の構成を示すブロック図である。このノイズリダクション部（ＮＲ）２１１は、周波数領
域変換処理部（ＦＴ）１１１ｂと、周波数領域変換処理部（ＦＴ）１１１ｃと、送話パワ
ー算出部（ＰＯＷ）１１１ｅと、残差パワー算出部（ＰＯＷ）１１１ｆと、エコー抑圧量
推定部（ＥＣＬＥ）１１１ｉと、制御部（ＣＴＲＬ）２１１ｋと、スペクトル選択部２１
１Ｌと、ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎと、信号抑圧部（ＳＳ）２１１ｏ
と、周波数領域逆変換処理部（ＩＦＴ）１１１ｐと、雑音レベル推定部（ＮＬＥ）２１１
ｑとからなる。 FIG. 10 shows a noise reduction unit (NR) 211 of the signal processing apparatus according to the second embodiment.
It is a block diagram which shows the structure of these. The noise reduction unit (NR) 211 includes a frequency domain conversion processing unit (FT) 111b, a frequency domain conversion processing unit (FT) 111c, a transmission power calculation unit (POW) 111e, and a residual power calculation unit (POW). ) 111f, echo suppression amount estimation unit (ECLE) 111i, control unit (CTRL) 211k, and spectrum selection unit 21
1L, a noise suppression gain calculation unit (GCAL) 211n, and a signal suppression unit (SS) 211o
A frequency domain inverse transform processing unit (IFT) 111p, and a noise level estimation unit (NLE) 211.
q.

ノイズリダクション部（ＮＲ）２１１は、ハイパスフィルタ部（ＨＰＦ）１０９から出
力された送話入力信号ｚ［ｎ］と、信号減算処理部１１０ｂから出力された残差信号ｅ［
ｎ］とを入力とし、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の少なくとも一方か
らノイズ成分を抑圧し、そのノイズ抑圧後の信号を送話出力信号ｓ’［ｎ］（ｎ＝０，１
，・・・，Ｎ−１）として１フレームごとに出力する。 The noise reduction unit (NR) 211 receives the transmission input signal z [n] output from the high-pass filter unit (HPF) 109 and the residual signal e [
n] as an input, a noise component is suppressed from at least one of the transmission input signal z [n] or the residual signal e [n], and the signal after the noise suppression is transmitted as a transmission output signal s ′ [n] ( n = 0, 1
,..., N-1) are output every frame.

雑音レベル推定部（ＮＬＥ）２１１ｑは、周波数領域変換処理部（ＦＴ）１１１ｂから
出力された送話スペクトルＺ［ｆ，ω］と、周波数領域変換処理部（ＦＴ）１１１ｃから
出力された残差スペクトルＥ［ｆ，ω］とを入力とし、有音ではない区間のスペクトルを
測定し、周波数帯域ω毎に送話スペクトルＺ［ｆ，ω］に含まれる雑音レベル｜Ｎ_Ｚ［ｆ
，ω］｜^２と残差スペクトルＥ［ｆ，ω］に含まれる雑音レベル｜Ｎ_Ｅ［ｆ，ω］｜^２と
を算出して出力する。 The noise level estimator (NLE) 211q is a transmission spectrum Z [f, ω] output from the frequency domain transform processor (FT) 111b and a residual spectrum output from the frequency domain transform processor (FT) 111c. E [f, ω] as an input, a spectrum in a non-sound period is measured, and a noise level | N _Z [f included in the transmission spectrum Z [f, ω] for each frequency band ω.
, Ω] | ² and the noise level | N _E [f, ω] | ² included in the residual spectrum E [f, ω] are calculated and output.

制御部（ＣＴＲＬ）２１１ｋは、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力さ
れたエコー抑圧量ＥＣＬ［ｆ，ω］を入力とし、周波数帯域ω毎にエコーキャンセラ部（
ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域を検出して、エコーキャンセラ部（
ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域であるか否かの情報である制御情報
ＮＲｃｏｎｔｒｏｌ［ｆ，ω］を出力する。 The control unit (CTRL) 211k receives the echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 111i as an input, and for each frequency band ω, an echo canceller unit (
EC) The frequency band where the echo suppression amount of 110 is not sufficient is detected, and the echo canceller (
EC) Outputs control information NRcontrol [f, ω], which is information indicating whether or not the echo suppression amount of 110 is an insufficient frequency band.

具体的には、制御部（ＣＴＲＬ）２１１ｋは、周波数帯域ω毎に不等式ＥＣＬ［ｆ，ω
］＞β_Ｚ［ω］を満たした周波数帯域であり、かつ１フレーム当りの不等式ＥＣＬ［ｆ，
ω］＞β_Ｚ［ω］が成立した帯域数ｃｏｕｎｔがβ_Ｃより大きい場合にエコー抑圧量が少
ないと判定する。ただし、β_Ｚ［ω］は＋８ｄＢ〜−１５ｄＢ程度、β_Ｃは全周波数帯域
数の１０％〜４０％程度が望ましい。本実施形態では以下、β_Ｚ［ω］を０ｄＢとして説
明する。 Specifically, the control unit (CTRL) 211k performs the inequality ECL [f, ω for each frequency band ω.
]> Β _Z [ω], and the inequality ECL [f,
When the number of bands count in which ω]> β _Z [ω] is established is larger than β _C, it is determined that the echo suppression amount is small. However, β _Z [ω] is preferably about +8 dB to −15 dB, and β _C is preferably about 10% to 40% of the total number of frequency bands. In the present embodiment, hereinafter, β _Z [ω] will be described as 0 dB.

スペクトル選択部２１１Ｌは、周波数領域変換処理部（ＦＴ）１１１ｂから出力された
送話入力信号の周波数スペクトルＺ［ｆ，ω］と、周波数領域変換処理部（ＦＴ）１１１
ｃから出力された残差信号の周波数スペクトルＥ［ｆ，ω］と、制御部（ＣＴＲＬ）２１
１ｋから出力されたエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分でない周波
数帯域であるか否かの情報である制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］とを入力とし、
送話入力信号の周波数スペクトルＺ［ｆ，ω］あるいは残差信号の周波数スペクトルＥ［
ｆ，ω］のいずれか一方を周波数スペクトルとして選択して出力する。 The spectrum selection unit 211L includes the frequency spectrum Z [f, ω] of the transmission input signal output from the frequency domain transform processing unit (FT) 111b and the frequency domain transform processing unit (FT) 111.
The frequency spectrum E [f, ω] of the residual signal output from c and the control unit (CTRL) 21
The control information NRcontrol [f, ω], which is information indicating whether or not the echo suppression amount of the echo canceller (EC) 110 output from 1k is a frequency band that is not sufficient, is input,
Frequency spectrum Z [f, ω] of the transmission input signal or frequency spectrum E [
f, ω] is selected as a frequency spectrum and output.

具体的には、制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］がエコーキャンセラ部（ＥＣ）１
１０のエコー抑圧量が十分でないと検出された周波数帯域であった場合は、周波数スペク
トルとして送話入力信号の周波数スペクトルＺ［ｆ，ω］を選択する。それ以外の周波数
帯域であった場合は周波数スペクトルとして残差信号の周波数スペクトルＥ［ｆ，ω］を
選択する。 Specifically, the control information NRcontrol [f, ω] is the echo canceller (EC) 1
If the frequency band detected that the echo suppression amount of 10 is not sufficient, the frequency spectrum Z [f, ω] of the transmission input signal is selected as the frequency spectrum. If the frequency band is other than that, the frequency spectrum E [f, ω] of the residual signal is selected as the frequency spectrum.

このようにすることで周波数帯域ごとに、エコーキャンセラ部（ＥＣ）１１０のエコー
抑圧量が十分でない場合は、ノイズリダクション部（ＮＲ）２１１を単体で動作させ、雑
音レベルの推定がエコーの影響を受けないようにすることができる。 In this way, when the echo suppression amount of the echo canceller (EC) 110 is not sufficient for each frequency band, the noise reduction unit (NR) 211 is operated alone, and noise level estimation affects the influence of echo. It can be made not to receive.

ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎは、雑音レベル推定部（ＮＬＥ）２１１
ｑから出力された雑音レベル｜Ｎ_Ｚ［ｆ，ω］｜^２及び｜Ｎ_Ｅ［ｆ，ω］｜^２と、送話パ
ワー算出部（ＰＯＷ）１１１ｅから出力された送話パワースペクトル｜Ｚ［ｆ，ω］｜^２
と、制御部（ＣＴＲＬ）２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］
と、信号抑圧部（ＳＳ）１１１ｏから出力された１フレーム前の送話出力信号のスペクト
ルＳ’［ｆ−１，ω］を入力とし、ノイズ抑圧ゲインＧ［ｆ，ω］を算出して出力する。 The noise suppression gain calculation unit (GCAL) 211n is a noise level estimation unit (NLE) 211.
The noise level | N _Z [f, ω] | ² and | N _E [f, ω] | ² output from q and the transmission power spectrum | Z [output from the transmission power calculation unit (POW) 111e f, ω] | ²
And control information NRcontrol [f, ω] output from the control unit (CTRL) 211k.
And the spectrum S ′ [f−1, ω] of the transmission output signal one frame before output from the signal suppression unit (SS) 111o as an input, and calculate and output a noise suppression gain G [f, ω]. To do.

具体的には、ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎは、まず、制御部（ＣＴＲ
Ｌ）２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］によってエコー抑圧
量が十分でないと検出された周波数帯域については、エコーキャンセラ部（ＥＣ）１１０
の処理後の残留エコーの影響で雑音推定の精度が取れていないと判定して、雑音レベルと
して｜Ｎ_Ｚ［ｆ，ω］｜^２を選択する。それ以外の周波数帯域では雑音レベルとして｜Ｎ
_Ｅ［ｆ，ω］｜^２を選択する。 Specifically, the noise suppression gain calculation unit (GCAL) 211n first has a control unit (CTR)
L) For the frequency band detected by the control information NRcontrol [f, ω] output from 211k that the echo suppression amount is not sufficient, the echo canceller (EC) 110
It is determined that the accuracy of noise estimation has not been obtained due to the influence of the residual echo after the above processing, and | N _Z [f, ω] | ² is selected as the noise level. In other frequency bands, the noise level is | N
_E [f, ω] | ² is selected.

そして、周波数帯域ω毎に選択された雑音レベルに基づいて、以下のアルゴリズムまた
はそれらの組み合わせでノイズ抑圧ゲインＧ［ｆ，ω］を算出する。即ち、一般のノイズ
リダクションであるスペクトル・サブトラクション（Spectral Subtraction）法（S. F.
Boll, “Suppression of acoustic noise in speech using spectral subtraction”, IE
EE Trans. Acoustics, Speech, and Signal Processing, vol.ASSP-29, pp.113-120 (197
9).）、ウィナー・フィルター（Wiener Filter）法（J. S. Lim, A. V. Oppenheim, “En
hancement and bandwidth compression of noisy speech”, Proc. IEEE Vol.67, No.12,
pp.1586-1604, Dec.1979.）及び最尤推定（Maximum Likelihood）法（R. J. McAulay, M
. L. Malpass, “Speech enhancement using a soft-decision noise suppression filte
r”, IEEE Trans. on Acoustics, Speech, and Signal Processing, vol.ASSP-28, no.2,
pp.137-145, Apr.1980.）などである。 Then, based on the noise level selected for each frequency band ω, the noise suppression gain G [f, ω] is calculated by the following algorithm or a combination thereof. That is, the spectral subtraction method (SF) that is a general noise reduction.
Boll, “Suppression of acoustic noise in speech using spectral subtraction”, IE
EE Trans.Acoustics, Speech, and Signal Processing, vol.ASSP-29, pp.113-120 (197
9).), Wiener Filter method (JS Lim, AV Oppenheim, “En
hancement and bandwidth compression of noisy speech ”, Proc. IEEE Vol.67, No.12,
pp.1586-1604, Dec.1979.) and Maximum Likelihood method (RJ McAulay, M
L. Malpass, “Speech enhancement using a soft-decision noise suppression filte
r ”, IEEE Trans. on Acoustics, Speech, and Signal Processing, vol.ASSP-28, no.2,
pp.137-145, Apr.1980.).

また、送話出力信号の１フレーム前の周波数スペクトルＳ’［ｆ−１，ω］を用いて事
前ＳＮ比ＳＮＲ_ＰＲＩＯ［ｆ，ω］及び事後ＳＮ比ＳＮＲ_ＰＯＳＴ［ｆ，ω］を推定算出
してノイズ抑圧ゲインを精度よく推定する手法（例えば、P. Scalart, J. V. Filho, ”S
peech enhancement based on a priori signal to noise estimation”, Proc. ICASSP96
, pp.629-632, May 1996.）、ＭＭＳＥ−ＳＴＳＡ（Minimum Mean-Square Error Short-T
ime Spectral Amplitude estimator）法（Y. Ephraim, D. Malah, “Speech enhancement
using a minimum mean-square error short-time spectral amplitude estimator”, IE
EE Trans. on Acoustics, Speech, and Signal Processing, vol.ASSP-32, no.6, pp.110
9-1121, Dec.1984.）及びJoint MAP法（T. Lotter, P. Vary, “Noise reduction by max
imum a posteriori spectral amplitude estimation with super Gaussian speech model
ing”, Proc. IWAENC, pp.83-86, Sep.2003.）などである。 Further, the pre-SNR SNR _PRIO [f, ω] and the post-SNR SNR _POST [f, ω] are estimated and calculated using the frequency spectrum S ′ [f−1, ω] one frame before the transmission output signal. To accurately estimate the noise suppression gain (eg, P. Scalart, JV Filho, “S
peech enhancement based on a priori signal to noise estimation ”, Proc. ICASSP96
, pp.629-632, May 1996), MMSE-STSA (Minimum Mean-Square Error Short-T
ime Spectral Amplitude estimator method (Y. Ephraim, D. Malah, “Speech enhancement
using a minimum mean-square error short-time spectral amplitude estimator ”, IE
EE Trans.on Acoustics, Speech, and Signal Processing, vol.ASSP-32, no.6, pp.110
9-1121, Dec. 1984.) and Joint MAP method (T. Lotter, P. Vary, “Noise reduction by max
imum a posteriori spectral amplitude estimation with super Gaussian speech model
ing ”, Proc. IWAENC, pp. 83-86, Sep. 2003.).

例えば、雑音レベル｜Ｎ_Ｚ［ｆ，ω］｜^２が選択されたときは、Ｐ［・］を半波整流と
して、事前ＳＮ比ＳＮＲ_ＰＲＩＯ［ｆ，ω］及び事後ＳＮ比ＳＮＲ_ＰＯＳＴ［ｆ，ω］は
、それぞれ、以下の式１８−１、式１８−２により求められ、この場合ウィナー・フィル
ター法を用いるとノイズ抑圧ゲインＧ［ｆ，ω］は、以下の式１８−３により算出される
。但し、μ［ω］は０．９〜０．９９９程度の忘却係数である。一方、雑音レベル｜Ｎ_Ｅ
［ｆ，ω］｜^２が選択されたときは、同様にして以下の式１８−１、式１８−２における
｜Ｎ_Ｚ［ｆ，ω］｜^２を｜Ｎ_Ｅ［ｆ，ω］｜^２に置き換えてノイズ抑圧ゲインＧ［ｆ，ω
］を以下の式１８−３により算出する。

For example, when the noise level | N _Z [f, ω] | ² is selected, P [•] is set as a half-wave rectification, and the prior SN ratio SNR _PRIO [f, ω] and the subsequent SN ratio SNR _POST [f, ω] is obtained by the following equations 18-1 and 18-2. In this case, when the Wiener filter method is used, the noise suppression gain G [f, ω] is calculated by the following equation 18-3. The However, μ [ω] is a forgetting factor of about 0.9 to 0.999. On the other hand, noise level | N _E
When [f, ω] | ² is selected, | N _Z [f, ω] | ² in the following equations 18-1 and 18-2 is similarly changed to | N _E [f, ω] | ² Is replaced with the noise suppression gain G [f, ω
] Is calculated by the following equation 18-3.

また、ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎは、過剰なノイズ抑圧により送話
音質が劣化するのを防止し、背景雑音の断続的な抑圧を防止するため、ノイズ抑圧ゲイン
Ｇ［ｆ，ω］が所定の下限値以下にならないように制御する。即ち、図４を参照して説明
したエコー抑圧ゲインの下限値の帯域別の設定が、ノイズ抑圧ゲインの下限値の帯域別の
設定にも適用される。 In addition, the noise suppression gain calculation unit (GCAL) 211n prevents the transmission sound quality from being deteriorated due to excessive noise suppression, and prevents noise suppression gain G [f, ω] in order to prevent intermittent suppression of background noise. Is controlled so as not to fall below a predetermined lower limit. That is, the setting for each band of the lower limit value of the echo suppression gain described with reference to FIG. 4 is also applied to the setting for each band of the lower limit value of the noise suppression gain.

さらに、過剰なノイズ抑圧により送話音質が劣化するのを防止するため、送話入力信号
ｚ［ｎ］あるいは残差信号ｅ［ｎ］の有音区間ではない区間の信号レベルを用いて周波数
帯域ごとに背景雑音レベルを算出しておき、背景雑音レベルよりも抑圧しないようにノイ
ズ抑圧ゲインを制御しても構わない。 Further, in order to prevent the transmission sound quality from deteriorating due to excessive noise suppression, a frequency band is used by using a signal level of a section that is not a voiced section of the transmission input signal z [n] or the residual signal e [n]. The background noise level may be calculated for each, and the noise suppression gain may be controlled so as not to suppress the background noise level.

信号抑圧部（ＳＳ）２１１ｏは、スペクトル選択部２１１Ｌから出力された周波数スペ
クトルと、ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎから出力されたノイズ抑圧ゲイ
ンＧ［ｆ，ω］とを入力として、スペクトル選択部２１１Ｌから出力された周波数スペク
トルのノイズを抑圧し、以下の式１９−１あるいは式１９−２が示すようにして送話出力
信号のスペクトルＳ’［ｆ，ω］として出力する。

The signal suppression unit (SS) 211o receives the frequency spectrum output from the spectrum selection unit 211L and the noise suppression gain G [f, ω] output from the noise suppression gain calculation unit (GCAL) 211n as the spectrum selection. The noise of the frequency spectrum output from the unit 211L is suppressed and output as the spectrum S ′ [f, ω] of the transmission output signal as shown in the following Expression 19-1 or 19-2.

このとき、送話入力信号の周波数スペクトルＺ［ｆ，ω］と残差信号の周波数スペクト
ルＥ［ｆ，ω］のどちらの周波数スペクトルがスペクトル選択部２１１Ｌで選択されても
、エコーキャンセラ部（ＥＣ）１１０は時間領域での処理であるから、位相スペクトルに
差がないため、出力された送話出力信号は聴感上滑らかにつなぐことができる。 At this time, regardless of which frequency spectrum Z [f, ω] of the transmission input signal or frequency spectrum E [f, ω] of the residual signal is selected by the spectrum selection unit 211L, the echo canceller unit (EC ) 110 is a process in the time domain, so there is no difference in the phase spectrum, so that the output transmission output signal can be smoothly connected in terms of hearing.

次に、上記のように構成された第２の実施形態に係る信号処理装置の処理の流れを説明
する。図１１は、第２の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the second embodiment configured as described above will be described. FIG. 11 is a flowchart showing an overall processing flow of the signal processing apparatus according to the second embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００５のエコーキャンセラ処理の後、信号処理装置は、第１の実施形態に
おいてはステップＳ１００６でエコーリダクション処理を行ったのに代えて、第２の実施
形態においては、ステップＳ１００５で得られたエコーキャンセラ処理後の信号を用いて
ノイズリダクション処理を行う（ステップＳ２００６）。そして、ステップＳ１００７の
終話か否かの判断に移る。 After the echo canceller processing in step S1005, the signal processing apparatus replaces the echo reduction processing in step S1006 in the first embodiment with the echo obtained in step S1005 in the second embodiment. Noise reduction processing is performed using the signal after the canceller processing (step S2006). Then, the process proceeds to step S1007 to determine whether or not the call is an end story.

第２の実施形態に係るエコーキャンセル部（ＥＣ）１１０は、第１の実施形態に係るエ
コーキャンセル部（ＥＣ）１１０と同じであるので、それぞれの実施形態における処理の
流れは、当然に同じであり説明を省略する。 Since the echo cancellation unit (EC) 110 according to the second embodiment is the same as the echo cancellation unit (EC) 110 according to the first embodiment, the flow of processing in each embodiment is naturally the same. There is no explanation.

図１２は、第２の実施形態に係るノイズリダクション部（ＮＲ）２１１における処理の
流れを示すフローチャートである。なお、図７を参照して説明した第１の実施形態に係る
エコーリダクション部（ＥＲ）１１１における動作と同じ動作ステップについては、同じ
符号を付してその部分の説明を省略する。 FIG. 12 is a flowchart showing the flow of processing in the noise reduction unit (NR) 211 according to the second embodiment. Note that the same operation steps as the operations in the echo reduction unit (ER) 111 according to the first embodiment described with reference to FIG. 7 are denoted by the same reference numerals, and description thereof is omitted.

第２の実施形態に係るノイズリダクション部（ＮＲ）２１１は、ステップＳ１２０１ｒ
の受話フレーム更新処理、ステップＳ１２０２ｒの受話周波数変換処理、ステップＳ１２
０３ｒの受話パワースペクトル算出処理、ステップＳ１２０４の音響結合量算出処理、ス
テップＳ１２０５のエコーレベル算出処理、ステップＳ１２０７の判定処理からステップ
Ｓ１２１０の送話信号抑圧処理までの動作はない。 The noise reduction unit (NR) 211 according to the second embodiment performs step S1201r.
Received frame update processing, received frequency conversion processing of step S1202r, step S12
There is no operation from the reception power spectrum calculation process of 03r, the acoustic coupling amount calculation process of step S1204, the echo level calculation process of step S1205, the determination process of step S1207 to the transmission signal suppression process of step S1210.

即ち、第２の実施形態に係るノイズリダクション部（ＮＲ）２１１は、ノイズリダクシ
ョン処理を開始して、ステップＳ１２０３ｓの送話パワースペクトル算出処理、及びステ
ップＳ１２０３ｅの残差パワースペクトル算出処理の後、雑音レベル推定部（ＮＬＥ）２
１１ｑは、有音区間でない区間のスペクトルを測定することにより、周波数帯域ω毎に送
話入力信号の周波数スペクトルに含まれる雑音レベルと、残差信号の周波数スペクトルに
含まれる雑音レベルをそれぞれ算出する（ステップＳ２２０５）。その後、ステップＳ１
２０６のエコー抑圧量算出処理を行い、制御部（ＣＴＲＬ）２１１ｋは、エコーキャンセ
ラ部（ＥＣ）１１０のエコー抑圧量が十分であるか否かの判定処理を行い、制御情報ＮＲ
ｃｏｎｔｒｏｌ［ｆ，ω］を出力する（ステップＳ２２０７）。 That is, the noise reduction unit (NR) 211 according to the second embodiment starts the noise reduction process, and after the transmission power spectrum calculation process in step S1203s and the residual power spectrum calculation process in step S1203e, Level estimation unit (NLE) 2
11q calculates the noise level included in the frequency spectrum of the transmission input signal and the noise level included in the frequency spectrum of the residual signal for each frequency band ω by measuring the spectrum of the section that is not a sound section. (Step S2205). Then, step S1
An echo suppression amount calculation process 206 is performed, and the control unit (CTRL) 211k performs a determination process as to whether or not the echo suppression amount of the echo canceller unit (EC) 110 is sufficient, and performs control information NR.
Control [f, ω] is output (step S2207).

そして、ノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎは、送話パワー算出部（ＰＯＷ
）１１１ｅから出力された送話パワースペクトル｜Ｚ［ｆ，ω］｜^２と、制御部（ＣＴＲ
Ｌ）２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］を入力として、エコ
ー抑圧量が十分でないと検出された周波数帯域と、それ以外の周波数帯域とで、それぞれ
異なる計算によってノイズ抑圧ゲインＧ［ｆ，ω］を算出する。また、ノイズ抑圧ゲイン
算出部（ＧＣＡＬ）２１１ｎはノイズ抑圧ゲインＧ［ｆ，ω］を所定の下限値以下になら
ないように制御する（ステップＳ２２０８）。 The noise suppression gain calculation unit (GCAL) 211n then transmits the transmission power calculation unit (POW).
) 111e output power spectrum | Z [f, ω] | ² output from 111e and the control unit (CTR)
L) The control information NRcontrol [f, ω] output from 211k is used as an input, and the noise suppression gain G [ f, ω] is calculated. Further, the noise suppression gain calculation unit (GCAL) 211n controls the noise suppression gain G [f, ω] so as not to be equal to or lower than a predetermined lower limit value (step S2208).

そして、スペクトル選択部２１１Ｌは、制御部（ＣＴＲＬ）２１１ｋから出力された制
御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］を入力として、エコー抑圧量が十分でないと検出さ
れた周波数帯域について、周波数スペクトルとして送話入力信号の周波数スペクトルＺ［
ｆ，ω］を選択し、それ以外の周波数帯域では周波数スペクトルとして残差信号の周波数
スペクトルＥ［ｆ，ω］を選択する（ステップＳ２２０９）。 Then, the spectrum selection unit 211L receives the control information NRcontrol [f, ω] output from the control unit (CTRL) 211k as an input, and transmits the transmission frequency as a frequency spectrum for the frequency band detected that the echo suppression amount is not sufficient. Signal frequency spectrum Z [
f, ω] is selected, and in the other frequency bands, the frequency spectrum E [f, ω] of the residual signal is selected as the frequency spectrum (step S2209).

そして、信号抑圧部（ＳＳ）２１１ｏは、スペクトル選択部２１１Ｌで選択された周波
数スペクトルとノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎで算出されたノイズ抑圧ゲ
インＧ［ｆ，ω］を入力として、選択された周波数スペクトルのノイズを抑圧する（ステ
ップＳ２２１０）。そして、ステップＳ１２１１の周波数逆変換処理に移って、ノイズリ
ダクション処理が終了する。 The signal suppression unit (SS) 211o is selected with the frequency spectrum selected by the spectrum selection unit 211L and the noise suppression gain G [f, ω] calculated by the noise suppression gain calculation unit (GCAL) 211n as inputs. The frequency spectrum noise is suppressed (step S2210). Then, the process proceeds to the frequency inverse transform process in step S1211, and the noise reduction process ends.

以上、ノイズリダクション部（ＮＲ）２１１はＦＦＴによる周波数領域型で周波数帯域
毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域をグルー
プでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯域分割フ
ィルタなどの周波数領域型を実現してもよい。 As described above, the noise reduction unit (NR) 211 has been described as operating as a method of processing for each frequency band in the frequency domain type by FFT. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

ダブルトーク中のエコーパスロス変動時では、エコーキャンセラ部（ＥＣ）１１０の処
理によって送話入力信号ｚ［ｎ］よりも残差信号ｅ［ｎ］が大きくなってしまい、エコー
キャンセラ部（ＥＣ）１１０の処理によるエコー抑圧量が十分でない場合がある。 At the time of echo path loss fluctuation during double talk, the residual signal e [n] becomes larger than the transmission input signal z [n] by the processing of the echo canceller unit (EC) 110, and the echo canceller unit (EC) 110. In some cases, the amount of echo suppression by this processing is not sufficient.

以上説明した第２の実施形態に係る信号処理装置の動作により、エコーキャンセラ部（
ＥＣ）１１０の処理によるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（
ＥＣＬＥ）１１１ｉと制御部（ＣＴＲＬ）２１１ｋで検出して、周波数帯域毎にノイズリ
ダクション部（ＮＲ）２１１の雑音レベル推定部（ＮＬＥ）２１１ｑとスペクトル選択部
２１１Ｌとノイズ抑圧ゲイン算出部（ＧＣＡＬ）２１１ｎによって雑音レベルの推定がエ
コーの影響を受けないようにすることができるので、エコーパスロス変動に対して頑健に
することが可能であり、高品質な信号を出力することが可能である。 By the operation of the signal processing apparatus according to the second embodiment described above, an echo canceller (
EC) The frequency region where the echo suppression amount by the processing of 110 is not sufficient is selected as an echo suppression amount estimation unit (
ECLE) 111i and control unit (CTRL) 211k, and for each frequency band, noise level estimation unit (NLE) 211q, spectrum selection unit 211L, noise suppression gain calculation unit (GCAL) 211n of noise reduction unit (NR) 211 This makes it possible to prevent the noise level estimation from being affected by echoes, so that it is possible to make robust against echo path loss fluctuations and output high-quality signals.

（第３の実施形態）
図１３は、第３の実施形態に係る信号処理装置の構成を示すブロック図である。この信
号処理装置が第１の実施形態に係る信号処理装置と異なる点は、エコーリダクション部（
ＥＲ）１１１を有さず、エコーノイズリダクション部（ＥＮＲ）３１１を有する点にあり
、その他の部分は同じである。そこで、同じ部分については、同じ符号を付してその説明
を省略し、図面を参照して第３の実施形態に係るエコーノイズリダクション部（ＥＮＲ）
３１１を説明する。なお、このエコーノイズリダクション部（ＥＮＲ）３１１で、第１の
実施形態に係るエコーリダクション部（ＥＲ）１１１及び第２の実施形態に係るノイズリ
ダクション部（ＮＲ）２１１と同じ部分については、同じ符号を付してその説明を省略す
る。 (Third embodiment)
FIG. 13 is a block diagram illustrating a configuration of a signal processing device according to the third embodiment. This signal processing device is different from the signal processing device according to the first embodiment in that an echo reduction unit (
ER) 111 and an echo noise reduction unit (ENR) 311, and the other parts are the same. Therefore, the same parts are denoted by the same reference numerals and description thereof is omitted, and an echo noise reduction unit (ENR) according to the third embodiment with reference to the drawings.
311 will be described. In this echo noise reduction unit (ENR) 311, the same parts as those of the echo reduction unit (ER) 111 according to the first embodiment and the noise reduction unit (NR) 211 according to the second embodiment are denoted by the same reference numerals. The description is omitted.

図１４は、第３の実施形態に係る信号処理装置のエコーノイズリダクション部（ＥＮＲ
）３１１の構成を示すブロック図である。このエコーノイズリダクション部（ＥＮＲ）３
１１は、周波数領域変換処理部（ＦＴ）１１１ａと、周波数領域変換処理部（ＦＴ）１１
１ｂと、周波数領域変換処理部（ＦＴ）１１１ｃと、受話パワー算出部（ＰＯＷ）１１１
ｄと、送話パワー算出部（ＰＯＷ）１１１ｅと、残差パワー算出部（ＰＯＷ）１１１ｆと
、音響結合量推定部（ＡＣＬＥ）１１１ｇと、エコー量推定部（ＥＬＥ）１１１ｈ、エコ
ー抑圧量推定部（ＥＣＬＥ）１１１ｉと、制御部（ＣＴＲＬ）２１１ｋと、スペクトル選
択部２１１Ｌと、ゲイン格納部（ＧＴＢＬ）１１１ｍと、エコーノイズ抑圧ゲイン算出部
（ＧＣＡＬ）３１１ｎと、信号抑圧部（ＳＳ）３１１ｏと、周波数領域逆変換処理部（Ｉ
ＦＴ）１１１ｐと、雑音レベル推定部（ＮＬＥ）３１１ｑとからなる。 FIG. 14 illustrates an echo noise reduction unit (ENR) of the signal processing device according to the third embodiment.
3) is a block diagram showing the configuration of 311. This echo noise reduction unit (ENR) 3
11 is a frequency domain transformation processing unit (FT) 111a and a frequency domain transformation processing unit (FT) 11
1b, a frequency domain conversion processing unit (FT) 111c, and a received power calculation unit (POW) 111
d, transmission power calculation unit (POW) 111e, residual power calculation unit (POW) 111f, acoustic coupling amount estimation unit (ACLE) 111g, echo amount estimation unit (ELE) 111h, echo suppression amount estimation unit (ECLE) 111i, control unit (CTRL) 211k, spectrum selection unit 211L, gain storage unit (GTBL) 111m, echo noise suppression gain calculation unit (GCAL) 311n, signal suppression unit (SS) 311o, Frequency domain inverse transform processing unit (I
FT) 111p and a noise level estimation unit (NLE) 311q.

エコーノイズリダクション部（ＥＮＲ）３１１は、ハイパスフィルタ部（ＨＰＦ）１０
９から出力された送話入力信号ｚ［ｎ］と、信号減算処理部１１０ｂから出力された残差
信号ｅ［ｎ］とを入力とし、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の少なくと
も一方からエコー成分及びノイズ成分を抑圧し、そのエコー抑圧及びノイズ抑圧後の信号
を送話出力信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）として１フレームごとに出力
する。 An echo noise reduction unit (ENR) 311 is a high-pass filter unit (HPF) 10.
9 is input with the transmission input signal z [n] output from the signal 9 and the residual signal e [n] output from the signal subtraction processing unit 110b, and the transmission input signal z [n] or the residual signal e [ n] suppresses an echo component and a noise component from at least one of them, and a signal after the echo suppression and noise suppression is set as a transmission output signal s ′ [n] (n = 0, 1,..., N−1). Output every frame.

雑音レベル推定部（ＮＬＥ）３１１ｑは、スペクトル選択部２１１Ｌから出力された周
波数スペクトル（送話スペクトルＺ［ｆ，ω］あるいは残差スペクトルＥ［ｆ，ω］）を
入力とし、有音ではない区間のスペクトルを測定し、周波数帯域ω毎に周波数スペクトル
に含まれる雑音レベル｜Ｎ［ｆ，ω］｜^２を算出して出力する。 The noise level estimation unit (NLE) 311q receives the frequency spectrum (transmission spectrum Z [f, ω] or residual spectrum E [f, ω]) output from the spectrum selection unit 211L, and is not a voiced section And a noise level | N [f, ω] | ² included in the frequency spectrum is calculated and output for each frequency band ω.

エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、エコー量推定部（ＥＬＥ）１
１１ｈから出力されたスムージングしたエコー量｜Ｙ_Ｓ［ｆ，ω］｜^２と、ゲイン格納部
（ＧＴＢＬ）１１１ｍから出力されたパラメータγ［ω］と、送話パワー算出部（ＰＯＷ
）１１１ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜
^２と、雑音レベル推定部（ＮＬＥ）３１１ｑから出力された雑音レベル｜Ｎ［ｆ，ω］｜
^２と、制御部（ＣＴＲＬ）２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω
］とを入力とし、エコーノイズ抑圧ゲインＧ［ｆ，ω］を算出して出力する。 The echo noise suppression gain calculation unit (GCAL) 311n is an echo amount estimation unit (ELE) 1
The smoothed echo amount | Y _S [f, ω] | ² output from 11h, the parameter γ [ω] output from the gain storage unit (GTBL) 111m, and the transmission power calculation unit (POW)
) Smoothed transmission power spectrum output from 111e | Z _S [f, ω] |
² and the noise level | N [f, ω] | output from the noise level estimator (NLE) 311q
² and control information NRcontrol [f, ω output from the control unit (CTRL) 211k
] Is input and the echo noise suppression gain G [f, ω] is calculated and output.

具体的には、エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、まず、制御部（
ＣＴＲＬ）２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］によってエコ
ー抑圧量が十分でないと検出された周波数帯域については、過剰なエコー抑圧により送話
音質が劣化するのを防止するため、スムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ
，ω］｜^２から雑音レベル｜Ｎ［ｆ，ω］｜^２を除去して、エコー抑圧ゲインＧ_ＥＲ［ｆ
，ω］を式２０によって算出する。

Specifically, the echo noise suppression gain calculation unit (GCAL) 311n first has a control unit (
(CTRL) 211k, the frequency band detected by the control information NRcontrol [f, ω] output from the control information NRcontrol is not smoothed in order to prevent the transmission sound quality from being deteriorated due to excessive echo suppression. Transmission power spectrum | Z _S [f
, Ω] | ² removes the noise level | N [f, ω] | ² from the echo suppression gain G _ER [f
, Ω] is calculated by Equation 20.

それ以外の周波数帯域では、エコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分
であり、エコーキャンセラ部（ＥＣ）１１０が正常に機能していると見なして、エコーノ
イズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、エコーノイズリダクション部（ＥＮＲ）
３１１のエコー抑圧量を多くするように、エコーキャンセラ部（ＥＣ）１１０のエコー抑
圧量ＥＣＬ［ｆ，ω］を用いて補正し、エコー抑圧ゲインＧ_ＥＲ［ｆ，ω］を以下の式２
１が示すように算出する。

In other frequency bands, the echo cancellation amount of the echo canceller unit (EC) 110 is sufficient, and it is assumed that the echo canceller unit (EC) 110 is functioning normally, and an echo noise suppression gain calculation unit (GCAL) 311n is an echo noise reduction unit (ENR)
The echo suppression amount ECL [f, ω] of the echo canceller unit (EC) 110 is corrected so as to increase the echo suppression amount 311, and the echo suppression gain G _ER [f, ω] is expressed by the following equation 2
1 is calculated.

また、エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、周波数帯域ω毎に選択
された雑音レベル｜Ｎ［ｆ，ω］｜^２に基づいて、前述した一般のノイズリダクションで
あるスペクトル・サブトラクション（Spectral Subtraction）法やウィナー・フィルター
（Wiener Filter）法や最尤推定（Maximum Likelihood）法などのアルゴリズムでノイズ
抑圧ゲインＧ_ＮＲ［ｆ，ω］を算出する。 Also, the echo noise suppression gain calculation unit (GCAL) 311n, based on the noise level | N [f, ω] | ² selected for each frequency band ω, spectral subtraction (Spectral), which is the general noise reduction described above. The noise suppression gain G _NR [f, ω] is calculated by an algorithm such as a subtraction method, a Wiener Filter method, or a maximum likelihood estimation method.

そして、式２２に示すように、エコー抑圧ゲインＧ_ＥＲ［ｆ，ω］とノイズ抑圧ゲイン
Ｇ_ＮＲ［ｆ，ω］の積を算出し、エコーノイズ抑圧ゲインＧ［ｆ，ω］とする。

Then, as shown in Equation 22, the product of the echo suppression gain G _ER [f, ω] and the noise suppression gain G _NR [f, ω] is calculated and set as the echo noise suppression gain G [f, ω].

また、過剰なエコー抑圧により送話音質が劣化するのを防止するため、エコーノイズ抑
圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、エコーノイズ抑圧ゲインＧ［ｆ，ω］は所定の
下限値以下にならないように制御する。即ち、図４を参照して説明したエコー抑圧ゲイン
の下限値の帯域別の設定が、エコーノイズ抑圧ゲインの下限値の帯域別の設定にも適用さ
れる。 In order to prevent the transmission sound quality from being deteriorated due to excessive echo suppression, the echo noise suppression gain calculation unit (GCAL) 311n prevents the echo noise suppression gain G [f, ω] from being below a predetermined lower limit value. To control. That is, the setting for each band of the lower limit value of the echo suppression gain described with reference to FIG. 4 is also applied to the setting for each band of the lower limit value of the echo noise suppression gain.

さらに、過剰なエコー及びノイズ抑圧により送話音質が劣化するのを防止するため、エ
コーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、送話入力信号ｚ［ｎ］あるいは残
差信号ｅ［ｎ］の有音区間ではない区間の信号レベルを用いて周波数帯域ごとに背景雑音
レベルを算出しておき、背景雑音レベルよりも抑圧しないようにエコーノイズ抑圧ゲイン
を制御しても構わない。 Further, in order to prevent the transmission sound quality from deteriorating due to excessive echo and noise suppression, the echo noise suppression gain calculation unit (GCAL) 311n performs the transmission input signal z [n] or the residual signal e [n]. A background noise level may be calculated for each frequency band using a signal level of a section that is not a sound section, and the echo noise suppression gain may be controlled so as not to suppress the background noise level.

さらにまた、エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、送話音質が劣化
するのを防止するために、エコーノイズ抑圧ゲインＧ［ｆ，ω］を以下の式２３−１ある
いは式２３−２が示すように周波数方向にスムージングして出力してもよい。例えば、ε
_ｊは、［０．１、０．２、０．４、０．２、０．１］、η_ｊは、［０．１、０．２、０．
４、０．８、０．４、０．２、０．１］のようにしてよい。

Furthermore, the echo noise suppression gain calculation unit (GCAL) 311n sets the echo noise suppression gain G [f, ω] to the following Expression 23-1 or Expression 23-2 in order to prevent the transmission sound quality from deteriorating. As shown in FIG. 5, the output may be smoothed in the frequency direction. For example, ε
_j is [0.1, 0.2, 0.4, 0.2, 0.1], and η _j is [0.1, 0.2, 0.
4, 0.8, 0.4, 0.2, 0.1].

信号抑圧部（ＳＳ）３１１ｏは、スペクトル選択部２１１Ｌから出力された周波数スペ
クトルと、エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎから出力されたエコーノ
イズ抑圧ゲインＧ［ｆ，ω］とを入力として、スペクトル選択部２１１Ｌから出力された
周波数スペクトルのエコー及びノイズを抑圧し、以下の式２４−１または式２４−２のよ
うに送話出力信号のスペクトルＳ’［ｆ，ω］を算出して出力する。

The signal suppression unit (SS) 311o receives the frequency spectrum output from the spectrum selection unit 211L and the echo noise suppression gain G [f, ω] output from the echo noise suppression gain calculation unit (GCAL) 311n as inputs. The echo and noise of the frequency spectrum output from the spectrum selection unit 211L are suppressed, and the spectrum S ′ [f, ω] of the transmission output signal is calculated and output as in the following Expression 24-1 or Expression 24-2. To do.

次に、上記のように構成された第３の実施形態に係る信号処理装置の処理の流れを説明
する。図１５は、第３の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the third embodiment configured as described above will be described. FIG. 15 is a flowchart illustrating an overall processing flow of the signal processing apparatus according to the third embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００５のエコーキャンセラ処理の後、信号処理装置は、第１の実施形態に
おいてはステップＳ１００６でエコーリダクション処理を行ったのに代えて、第３の実施
形態においては、ステップＳ１００５で得られたエコーキャンセラ処理後の信号を用いて
エコーノイズリダクション処理を行う（ステップＳ３００６）。そして、ステップＳ１０
０７の終話か否かの判断に移る。 After the echo canceller processing in step S1005, the signal processing apparatus replaces the echo reduction processing in step S1006 in the first embodiment with the echo obtained in step S1005 in the third embodiment. An echo noise reduction process is performed using the signal after the canceller process (step S3006). And step S10
The process proceeds to the judgment of whether or not the end talk of 07.

第３の実施形態に係るエコーキャンセル部（ＥＣ）１１０は、第１の実施形態に係るエ
コーキャンセル部（ＥＣ）１１０と同じであるので、それぞれの実施形態における処理の
流れは、当然に同じであり説明を省略する。 Since the echo cancellation unit (EC) 110 according to the third embodiment is the same as the echo cancellation unit (EC) 110 according to the first embodiment, the flow of processing in each embodiment is naturally the same. There is no explanation.

図１６は、第３の実施形態に係るエコーノイズリダクション部（ＥＮＲ）３１１におけ
る処理の流れを示すフローチャートである。なお、図７を参照して説明した第１の実施形
態に係るエコーリダクション部（ＥＲ）１１１及び図１２を参照して説明した第２の実施
形態に係るノイズリダクション部（ＮＲ）２１１における動作と同じ動作ステップについ
ては、同じ符号を付してその部分の説明を省略する。 FIG. 16 is a flowchart showing the flow of processing in the echo noise reduction unit (ENR) 311 according to the third embodiment. The operation of the echo reduction unit (ER) 111 according to the first embodiment described with reference to FIG. 7 and the operation of the noise reduction unit (NR) 211 according to the second embodiment described with reference to FIG. About the same operation step, the same code | symbol is attached | subjected and description of the part is abbreviate | omitted.

第３の実施形態に係るエコーノイズリダクション部（ＥＮＲ）３１１は、エコーノイズ
リダクション処理を開始して、ステップＳ１２０１ｒ〜Ｓ１２０３ｒ、Ｓ１２０１ｓ〜Ｓ
１２０３ｓ及びＳ１２０１ｅ〜Ｓ１２０３ｅの受話、送話、残差の各信号に対するフレー
ム更新処理、周波数変換処理、パワースペクトル算出処理、ステップＳ１２０４の音響結
合量算出処理、ステップＳ１２０５のエコーレベル算出処理、ステップＳ１２０６のエコ
ー抑圧量算出処理を行う。 The echo noise reduction unit (ENR) 311 according to the third embodiment starts an echo noise reduction process, and steps S1201r to S1203r, S1201s to S
Frame update processing, frequency conversion processing, power spectrum calculation processing, acoustic coupling amount calculation processing in step S1204, echo level calculation processing in step S1205, and step S1206 in steps 1203s and S1201e to S1203e. Echo suppression amount calculation processing is performed.

そして、ステップＳ２２０７の判定処理、及びステップＳ２２０９のスペクトル選択処
理の後、雑音レベル推定部（ＮＬＥ）３１１ｑから出力された雑音レベル｜Ｎ［ｆ，ω］
｜^２を算出する（ステップＳ３２０５）。 Then, after the determination processing in step S2207 and the spectrum selection processing in step S2209, the noise level | N [f, ω] output from the noise level estimation unit (NLE) 311q.
| ² is calculated (step S3205).

そして、エコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎは、雑音レベル推定部（
ＮＬＥ）３１１ｑから出力された雑音レベル｜Ｎ［ｆ，ω］｜^２と、制御部（ＣＴＲＬ）
２１１ｋから出力された制御情報ＮＲｃｏｎｔｒｏｌ［ｆ，ω］とを入力として、エコー
抑圧量が十分でないと検出された周波数帯域と、それ以外の周波数帯域とで、それぞれ異
なる計算によってエコーノイズ抑圧ゲインＧ［ｆ，ω］を算出する。また、エコーノイズ
抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎはエコーノイズ抑圧ゲインＧ［ｆ，ω］を所定の
下限値以下にならないように制御する（ステップＳ３２０８）。 Then, the echo noise suppression gain calculation unit (GCAL) 311n has a noise level estimation unit (
NLE) The noise level | N [f, ω] | ² output from 311q and the control unit (CTRL)
The control information NRcontrol [f, ω] output from the 211k is used as an input, and the echo noise suppression gain G [] is calculated by different calculations for the frequency band detected when the echo suppression amount is not sufficient and the other frequency bands. f, ω] is calculated. Further, the echo noise suppression gain calculation unit (GCAL) 311n controls the echo noise suppression gain G [f, ω] so as not to be equal to or lower than a predetermined lower limit value (step S3208).

そして、信号抑圧部（ＳＳ）３１１ｏは、スペクトル選択部２１１Ｌで選択された周波
数スペクトルとエコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎで算出されたエコー
ノイズ抑圧ゲインＧ［ｆ，ω］を入力として、選択された周波数スペクトルのエコー及び
ノイズを抑圧する（ステップＳ３２１０）。そして、ステップＳ１２１１の周波数逆変換
処理に移って、エコーノイズリダクション処理が終了する。 Then, the signal suppression unit (SS) 311o receives the frequency spectrum selected by the spectrum selection unit 211L and the echo noise suppression gain G [f, ω] calculated by the echo noise suppression gain calculation unit (GCAL) 311n as inputs. The echo and noise of the selected frequency spectrum are suppressed (step S3210). Then, the process proceeds to the frequency inverse transform process in step S1211, and the echo noise reduction process ends.

以上、エコーノイズリダクション部（ＥＮＲ）３１１はＦＦＴによる周波数領域型で周
波数帯域毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域
をグループでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯
域分割フィルタなどの周波数領域型を実現してもよい。 As described above, the echo noise reduction unit (ENR) 311 has been described as operating in a frequency domain type by FFT and processing for each frequency band. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

以上説明した第３の実施形態に係る信号処理装置の動作により、エコーキャンセラ部（
ＥＣ）１１０の処理によるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（
ＥＣＬＥ）１１１ｉと制御部（ＣＴＲＬ）２１１ｋで判定して、スペクトル選択部２１１
Ｌとエコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎにおいて周波数帯域毎にエコー
キャンセラ部（ＥＣ）１１０の処理とエコーノイズリダクション部（ＥＮＲ）３１１の処
理の重みを変化させるように制御することができるので、エコーパスロス変動に対して頑
健にすることが可能であり、高品質な信号を出力することが可能である。 By the operation of the signal processing apparatus according to the third embodiment described above, the echo canceller unit (
EC) The frequency region where the echo suppression amount by the processing of 110 is not sufficient is selected as an echo suppression amount estimation unit (
ECLE) 111i and control unit (CTRL) 211k make determination, and spectrum selection unit 211
Since L and the echo noise suppression gain calculation unit (GCAL) 311n can be controlled to change the weights of the processing of the echo canceller unit (EC) 110 and the processing of the echo noise reduction unit (ENR) 311 for each frequency band. It is possible to make robust against echo path loss fluctuations and to output a high-quality signal.

また、非線形抑圧処理である周波数領域のエコーリダクションと周波数領域でのノイズ
リダクションをエコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）３１１ｎと信号抑圧部（ＳＳ
）３１１ｏで共通化してエコーノイズリダクション部（ＥＮＲ）３１１とすることで、計
算量が少なくすることができ、エコーリダクションとノイズリダクションを直列接続して
処理するよりも送話音質の劣化が防止できる。 In addition, echo reduction in the frequency domain, which is nonlinear suppression processing, and noise reduction in the frequency domain are performed by an echo noise suppression gain calculation unit (GCAL) 311n and a signal suppression unit (SS).
) By using 311o as the common echo noise reduction unit (ENR) 311, the amount of calculation can be reduced, and deterioration of the transmission sound quality can be prevented as compared with the case where echo reduction and noise reduction are connected in series. .

（第４の実施形態）
図１７は、第４の実施形態に係る信号処理装置の構成を示すブロック図である。この信
号処理装置が第１の実施形態に係る信号処理装置と異なる点は、図１７に示すようにエコ
ーリダクション部（ＥＲ）１１１を有さず、エコーサプレッサ部（ＥＳ）４１１を有する
点と、遅延処理部（ＤＥＬＡＹ）１０２に代えて遅延処理部（ＤＥＬＡＹ）４０２を有す
る点、及びＤ／Ａ変換器（Ｄ／Ａ）１０３に代えてＤ／Ａ変換器（Ｄ／Ａ）４０３を有す
る点であり、その他の部分は同じである。そこで、同じ部分については、同じ符号を付し
てその説明を省略する。 (Fourth embodiment)
FIG. 17 is a block diagram illustrating a configuration of a signal processing device according to the fourth embodiment. This signal processing device is different from the signal processing device according to the first embodiment in that it does not have the echo reduction unit (ER) 111 and has the echo suppressor unit (ES) 411 as shown in FIG. A point having a delay processing unit (DELAY) 402 instead of the delay processing unit (DELAY) 102 and a point having a D / A converter (D / A) 403 instead of the D / A converter (D / A) 103 And other parts are the same. Therefore, the same parts are denoted by the same reference numerals and the description thereof is omitted.

また、遅延処理部（ＤＥＬＡＹ）４０２と、Ｄ／Ａ変換器（Ｄ／Ａ）４０３とは、エコ
ーサプレッサ部（ＥＳ）４１１から出力される受話出力信号ｘ’［ｎ］を入力とすること
が異なるものの、それぞれ遅延処理部（ＤＥＬＡＹ）１０２と、Ｄ／Ａ変換器（Ｄ／Ａ）
１０３との動作を同じ動作をするので、それらの説明を省略し、図面を参照して第４の実
施形態に係るエコーサプレッサ部（ＥＳ）４１１を説明する。 Further, the delay processing unit (DELAY) 402 and the D / A converter (D / A) 403 may receive the reception output signal x ′ [n] output from the echo suppressor unit (ES) 411. Although different, the delay processing unit (DELAY) 102 and the D / A converter (D / A)
Since the same operation as that of 103 is performed, the description thereof is omitted, and an echo suppressor unit (ES) 411 according to the fourth embodiment is described with reference to the drawings.

図１８は、第４の実施形態に係る信号処理装置のエコーサプレッサ部（ＥＳ）４１１の
構成を示すブロック図である。このエコーサプレッサ部（ＥＳ）４１１は、受話パワー算
出部（ＰＯＷ）４１１ｄと、送話パワー算出部（ＰＯＷ）４１１ｅと、残差パワー算出部
（ＰＯＷ）４１１ｆと、エコー抑圧量推定部（ＥＣＬＥ）４１１ｉと、制御部（ＣＴＲＬ
）４１１ｋと、信号選択部４１１Ｌと、ゲイン格納部（ＧＴＢＬ）４１１ｍと、エコー抑
圧ゲイン算出部（ＧＣＡＬ）４１１ｎと、送話信号抑圧部（ＳＳ）４１１ｏと、受話信号
抑圧部（ＳＳ）４１１ｒとからなる。 FIG. 18 is a block diagram illustrating a configuration of an echo suppressor unit (ES) 411 of the signal processing device according to the fourth embodiment. The echo suppressor (ES) 411 includes a reception power calculation unit (POW) 411d, a transmission power calculation unit (POW) 411e, a residual power calculation unit (POW) 411f, and an echo suppression amount estimation unit (ECLE). 411i and the control unit (CTRL
) 411k, a signal selection unit 411L, a gain storage unit (GTBL) 411m, an echo suppression gain calculation unit (GCAL) 411n, a transmission signal suppression unit (SS) 411o, and a reception signal suppression unit (SS) 411r, Consists of.

エコーサプレッサ部（ＥＳ）４１１は、通信部（ＣＯＭ）１０１から出力された受話入
力信号ｘ［ｎ］と、ハイパスフィルタ部（ＨＰＦ）１０９から出力された送話入力信号ｚ
［ｎ］と、信号減算処理部１１０ｂから出力された残差信号ｅ［ｎ］とを入力とし、受話
出力信号ｘ’［ｎ］及び送話出力信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を１フ
レームごとに出力する。ここで、受話入力信号ｘ［ｎ］あるいは送話信号（送話入力信号
ｚ［ｎ］または残差信号ｅ［ｎ］）の少なくとも一方に抑圧処理を施し、そのエコー抑圧
後の信号を出力する。 The echo suppressor unit (ES) 411 includes a reception input signal x [n] output from the communication unit (COM) 101 and a transmission input signal z output from the high-pass filter unit (HPF) 109.
[N] and the residual signal e [n] output from the signal subtraction processing unit 110b are input, and the received output signal x ′ [n] and the transmitted output signal s ′ [n] (n = 0, 1). ,..., N-1) are output for each frame. Here, at least one of the reception input signal x [n] or the transmission signal (transmission input signal z [n] or residual signal e [n]) is subjected to suppression processing, and the signal after the echo suppression is output. .

受話パワー算出部（ＰＯＷ）４１１ｄは、通信部（ＣＯＭ）１０１から出力された受話
入力信号ｘ［ｎ］を入力とし、受話パワーＰ_Ｘ［ｆ］を以下の式２５−１が示すようにフ
レーム単位で算出して出力する。受話パワーＰ_Ｘ［ｆ］は以下の式２５−２が示すように
１フレーム前の値Ｐ_ＳＸ［ｆ−１］を用いてスムージングした値Ｐ_ＳＸ［ｆ］を用いても
よい。ただし、α_Ｘは、０．３７５〜０．９９９程度が望ましい。

The reception power calculation unit (POW) 411d receives the reception input signal x [n] output from the communication unit (COM) 101, and receives the reception power P _X [f] as shown in the following expression 25-1. Calculate and output in units. As the reception power P _X [f], a value P _SX [f] smoothed by using a value P _SX [f−1] one frame before may be used as shown in the following Expression 25-2. However, α _X is preferably about 0.375 to 0.999.

送話パワー算出部（ＰＯＷ）４１１ｅは、ハイパスフィルタ部（ＨＰＦ）１０９から出
力された送話入力信号ｚ［ｎ］を入力とし、送話パワーＰ_Ｚ［ｆ］を以下の式２６−１が
示すようにフレーム単位で算出して出力する。送話パワーＰ_Ｚ［ｆ］は以下の式２６−２
が示すように１フレーム前の値Ｐ_ＳＺ［ｆ−１］を用いてスムージングした値Ｐ_ＳＺ［ｆ
］を用いてもよい。ただし、α_Ｚは、０．３７５〜０．９９９程度が望ましい。

The transmission power calculation unit (POW) 411e receives the transmission input signal z [n] output from the high-pass filter unit (HPF) 109, and the transmission power P _Z [f] is expressed by the following equation 26-1. As shown, it is calculated and output in units of frames. The transmission power P _Z [f] is expressed by the following equation 26-2.
The value _P SZ [f was smoothed using one previous frame values _P SZ [f-1] as shown in the
] May be used. However, α _Z is, is preferably about 0.375 to 0.999.

残差パワー算出部（ＰＯＷ）４１１ｆは、信号減算処理部１１０ｂから出力された残差
信号ｅ［ｎ］を入力とし、残差パワーＰ_Ｅ［ｆ］を以下の式２７−１が示すようにフレー
ム単位で算出して出力する。残差パワーＰ_Ｅ［ｆ］は以下の式２７−２が示すように１フ
レーム前の値Ｐ_ＳＥ［ｆ−１］を用いてスムージングした値Ｐ_ＳＥ［ｆ］を用いてもよい
。ただし、α_Ｅは、０．３７５〜０．９９９程度が望ましい。

The residual power calculation unit (POW) 411f receives the residual signal e [n] output from the signal subtraction processing unit 110b, and the residual power P _E [f] is expressed by the following equation 27-1. Calculate and output in units of frames. Residual power _P E [f] may be used a value _P SE [f] were smoothed using the following one frame as shown by equation 27-2 previous value _P SE [f-1]. However, α _E is preferably about 0.375 to 0.999.

エコー抑圧量推定部（ＥＣＬＥ）４１１ｉは、送話パワー算出部（ＰＯＷ）４１１ｅか
ら出力された送話パワーＰ_Ｚ［ｆ］と、残差パワー算出部（ＰＯＷ）４１１ｆから出力さ
れた残差パワーＰ_Ｅ［ｆ］とを入力とし、エコーキャンセラ部（ＥＣ）１１０で抑圧され
たエコー抑圧量ＥＣＬ［ｆ］を推定して出力する。具体的には、以下に示す式２８によっ
て算出する。もちろん、これら２つのパワーの差などを用いてもよい。

The echo suppression amount estimation unit (ECLE) 411i transmits the transmission power P _Z [f] output from the transmission power calculation unit (POW) 411e and the residual power output from the residual power calculation unit (POW) 411f. P _E [f] is input, and the echo suppression amount ECL [f] suppressed by the echo canceller unit (EC) 110 is estimated and output. Specifically, it is calculated by the following equation 28. Of course, the difference between these two powers may be used.

制御部（ＣＴＲＬ）４１１ｋは、エコー抑圧量推定部（ＥＣＬＥ）４１１ｉから出力さ
れたＥＣＬ［ｆ］を入力とし、エコーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分
でないフレームであるか否かを判定し、その判定結果情報である制御情報ＥＳｃｏｎｔｒ
ｏｌ［ｆ］を出力する。具体的には、ＥＣＬ［ｆ］＞β_Ｚを満たしたフレームはエコーキ
ャンセラ部（ＥＣ）１１０のエコー抑圧量が少ないと判定する。ただし、β_Ｚは＋８ｄＢ
〜−１５ｄＢ程度が望ましい。本実施形態では以下、β_Ｚを０ｄＢとして説明する。 The control unit (CTRL) 411k receives ECL [f] output from the echo suppression amount estimation unit (ECLE) 411i as an input, and determines whether the echo suppression amount of the echo canceller unit (EC) 110 is insufficient. Control information EScontr which is the determination result information
ol [f] is output. Specifically, the frame filled with ECL [f]> β _Z determines that the echo suppression amount of the echo canceller (EC) 110 is small. However, β _Z + 8dB
About -15 dB is desirable. In the present embodiment will be described below beta _Z as 0 dB.

また、制御部（ＣＴＲＬ）４１１ｋは、受話パワー算出部（ＰＯＷ）４１１ｄから出力
された受話パワーＰ_Ｘ［ｆ］と、送話パワー算出部（ＰＯＷ）４１１ｅから出力された送
話パワーＰ_Ｚ［ｆ］とを入力とし、受話信号抑圧部（ＳＳ）４１１ｒによって受話入力信
号ｘ［ｎ］を抑圧するか、送話信号抑圧部（ＳＳ）４１１ｏによって送話信号（送話入力
信号ｚ［ｎ］または残差信号ｅ［ｎ］）を抑圧するか否かを判定し、その判定結果情報で
ある状態情報ＥＳｓｔａｔｅ［ｆ］を出力する。 Further, the control unit (CTRL) 411k receives the reception power P _X [f] output from the reception power calculation unit (POW) 411d and the transmission power P _Z [output from the transmission power calculation unit (POW) 411e. f] as an input, the received signal suppression unit (SS) 411r suppresses the received input signal x [n], or the transmitted signal suppression unit (SS) 411o transmits the transmitted signal (transmitted input signal z [n]. Alternatively, it is determined whether or not the residual signal e [n]) is to be suppressed, and state information ESstate [f] that is the determination result information is output.

例えば、受話パワーＰ_Ｘ［ｆ］と受話側の雑音レベルを表す可変閾値とを用いて受話側
が有音であるか否か検出し、送話パワーＰ_Ｚ［ｆ］と送話側の雑音レベルを表す可変閾値
とを用いて送話側が有音であるか否かを検出し、受話側が有音であれば送話信号を抑圧す
るように状態情報ＥＳｓｔａｔｅ［ｆ］を設定し、送話側が有音であれば受話信号を抑圧
するように状態情報ＥＳｓｔａｔｅ［ｆ］を設定し、そして受話パワーＰ_Ｘ［ｆ］と送話
パワーＰ_Ｚ［ｆ］の差が大きくなるまで以前の状態情報ＥＳｓｔａｔｅ［ｆ］を継続する
ように設定する。 For example, using the reception power P _X [f] and a variable threshold value representing the noise level on the reception side, it is detected whether or not the reception side is voiced, and the transmission power P _Z [f] and the noise level on the transmission side are detected. The state information ESstate [f] is set so as to suppress the transmission signal if the receiving side is sounded, and the transmitting side sets the state information ESstate [f]. If there is sound, the state information ESstate [f] is set so as to suppress the reception signal, and the previous state information ESstate until the difference between the reception power P _X [f] and the transmission power P _Z [f] becomes large. [F] is set to continue.

なおここで送話パワーＰ_Ｓ［ｆ］として、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］によって
エコー抑圧量が十分でないと判定されたフレームについてはＰ_Ｚ［ｆ］を用い、エコー抑
圧量が十分であると判定されたフレームではＰ_Ｅ［ｆ］を用いてもよい。 Here, as the transmission power P _S [f], P _Z [f] is used for a frame for which the echo suppression amount is determined to be insufficient by the control information EScontrol [f], and it is determined that the echo suppression amount is sufficient. P _E [f] may be used in the frame that has been processed.

信号選択部４１１Ｌは、ハイパスフィルタ部（ＨＰＦ）１０９から出力された送話入力
信号ｚ［ｎ］と、信号減算処理部１１０ｂから出力された残差信号ｅ［ｎ］と、制御部（
ＣＴＲＬ）４１１ｋから出力された制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］とを入力とし、送
話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］のいずれか一方をフレーム単位で出力する
。具体的には、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］がエコーキャンセラ部（ＥＣ）１１０
のエコー抑圧量が十分でないと検出されたフレームであった場合は、信号として送話入力
信号Ｚ［ｎ］を選択する。それ以外のフレームでは信号として残差信号ｅ［ｎ］を選択す
る。 The signal selection unit 411L includes a transmission input signal z [n] output from the high-pass filter unit (HPF) 109, a residual signal e [n] output from the signal subtraction processing unit 110b, and a control unit (
CTRL) 411k and control information EScontrol [f] are input, and either one of transmission input signal z [n] or residual signal e [n] is output in frame units. Specifically, the control information EScontrol [f] is stored in the echo canceller (EC) 110.
If the frame is detected to have an insufficient echo suppression amount, the transmission input signal Z [n] is selected as a signal. In other frames, the residual signal e [n] is selected as a signal.

このようにすることでフレームごとに、エコー抑圧量が十分でない場合、つまりエコー
キャンセラ部（ＥＣ）１１０の推定精度が十分とれない場合はエコーサプレッサ部（ＥＳ
）４１１を単体で動作させることができる。 In this way, when the echo suppression amount is not sufficient for each frame, that is, when the estimation accuracy of the echo canceller (EC) 110 is not sufficient, the echo suppressor (ES
) 411 can be operated alone.

ゲイン格納部（ＧＴＢＬ）４１１ｍは、事前に設定されたゲインγを格納しておく。た
だし、γは通話開始前に事前にアナログフロントエンド（Ｄ／Ａ変換器（Ｄ／Ａ）４０３
、受話増幅器１０４、スピーカ１０５、音響空間、マイクロホン１０６、送話増幅器１０
７、Ａ／Ｄ変換器（Ａ／Ｄ）１０８）のエコーリターンロスに基づいて設定されているこ
とが望ましい。 The gain storage unit (GTBL) 411m stores a gain γ set in advance. However, γ is an analog front end (D / A converter (D / A) 403) in advance before a call is started.
, Receiver amplifier 104, speaker 105, acoustic space, microphone 106, transmitter amplifier 10
7. It is desirable to set based on the echo return loss of the A / D converter (A / D) 108).

エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎは、ゲイン格納部（ＧＴＢＬ）４１１ｍ
から出力されたゲインγと、エコー抑圧量推定部（ＥＣＬＥ）４１１ｉから出力されたＥ
ＣＬ［ｆ］と、制御部（ＣＴＲＬ）４１１ｋから出力された制御情報ＥＳｃｏｎｔｒｏｌ
［ｆ］及び状態情報ＥＳｓｔａｔｅ［ｆ］とを入力とし、エコー抑圧ゲインＧ［ｆ］を算
出して出力する。 The echo suppression gain calculation unit (GCAL) 411n is a gain storage unit (GTBL) 411m.
Gain γ output from E, and E output from echo suppression amount estimation unit (ECLE) 411i.
CL [f] and control information EScontrol output from the control unit (CTRL) 411k
[F] and state information ESstate [f] are input, and an echo suppression gain G [f] is calculated and output.

具体的には、エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎは、状態情報ＥＳｓｔａｔ
ｅ［ｆ］が送話信号（送話入力信号ｚ［ｎ］または残差信号ｅ［ｎ］）を抑圧すると判定
されたことを示す場合は次のようにする。まず、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］がエ
コーキャンセラ部（ＥＣ）１１０のエコー抑圧量が十分でないと検出されたフレームであ
った場合は、エコー抑圧ゲインＧ［ｆ］をＧ［ｆ］＝γとする。 Specifically, the echo suppression gain calculation unit (GCAL) 411n receives the state information ESstat.
When e [f] indicates that it is determined to suppress the transmission signal (transmission input signal z [n] or residual signal e [n]), the following is performed. First, when the control information EScontrol [f] is a frame detected when the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient, the echo suppression gain G [f] is set to G [f] = γ. .

それ以外のフレームではエコー抑圧ゲインＧ［ｆ］を、エコーキャンセラ部（ＥＣ）１
１０で十分にできているため、抑圧量を強めてもよいと判定し、Ｇ［ｆ］＝γ・ＥＣＬ［
ｆ］と算出する。あるいは、過剰な抑圧を防止するためエコーキャンセラ部（ＥＣ）１１
０のエコー抑圧量分を低減した抑圧量にするようにＧ［ｆ］＝γ／ＥＣＬ［ｆ］と算出す
る。一方、状態情報ＥＳｓｔａｔｅ［ｆ］が受話入力信号を抑圧すると判定されたことを
示す場合は、エコー抑圧ゲインＧ［ｆ］をＧ［ｆ］＝γとする。 In other frames, the echo suppression gain G [f] is set to the echo canceller (EC) 1
10 is sufficient, it is determined that the amount of suppression may be increased, and G [f] = γ · ECL [
f]. Alternatively, an echo canceller (EC) 11 is used to prevent excessive suppression.
G [f] = γ / ECL [f] is calculated so that the echo suppression amount of 0 is reduced. On the other hand, when the state information ESstate [f] indicates that it is determined to suppress the incoming input signal, the echo suppression gain G [f] is set to G [f] = γ.

さらに、過剰なエコー抑圧により送話音質が劣化するのを防止するため、送話入力信号
ｚ［ｎ］あるいは残差信号ｅ［ｎ］の有音区間ではない区間の信号レベルを用いて背景雑
音レベルを算出しておき、背景雑音レベルよりも抑圧しないようにエコー抑圧ゲインを制
御してもよい。 Further, in order to prevent the transmission sound quality from being deteriorated due to excessive echo suppression, background noise is used by using the signal level of the transmission input signal z [n] or the residual signal e [n] that is not a voiced section. The level may be calculated, and the echo suppression gain may be controlled so that the level is not suppressed below the background noise level.

送話信号抑圧部（ＳＳ）４１１ｏは、制御部（ＣＴＲＬ）４１１ｋから出力された状態
情報ＥＳｓｔａｔｅ［ｆ］と、エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎから出力さ
れたエコー抑圧ゲインＧ［ｆ］と、信号選択部４１１Ｌで選択された送話信号（送話入力
信号ｚ［ｎ］または残差信号ｅ［ｎ］）とを入力とし、状態情報ＥＳｓｔａｔｅ［ｆ］に
よって送話信号（送話入力信号ｚ［ｎ］または残差信号ｅ［ｎ］）を抑圧すると判定され
たことを示す場合、信号選択部４１１Ｌで選択された信号とエコー抑圧ゲイン算出部（Ｇ
ＣＡＬ）４１１ｎで算出された抑圧ゲインＧ［ｆ］を用いて以下の式２９−１または式２
９−２が示すように信号を抑圧し、送話出力信号ｓ’［ｎ］を算出して出力する。 The transmission signal suppression unit (SS) 411o includes state information ESstate [f] output from the control unit (CTRL) 411k, echo suppression gain G [f] output from the echo suppression gain calculation unit (GCAL) 411n, and , The transmission signal (transmission input signal z [n] or residual signal e [n]) selected by the signal selection unit 411L is input, and the transmission signal (transmission input signal) is transmitted by the state information ESstate [f]. z [n] or the residual signal e [n]) is determined to be suppressed, the signal selected by the signal selection unit 411L and the echo suppression gain calculation unit (G
CAL) Using the suppression gain G [f] calculated in 411n, the following Expression 29-1 or Expression 2
The signal is suppressed as indicated by 9-2, and the transmission output signal s ′ [n] is calculated and output.

また、状態情報ＥＳｓｔａｔｅ［ｆ］によって送話信号（送話入力信号ｚ［ｎ］または
残差信号ｅ［ｎ］）を抑圧しないと判定された場合は、以下の式２９−３または式２９−
４が示すように送話信号をそのまま送話出力信号ｓ’［ｎ］として出力する。

If it is determined that the transmission signal (transmission input signal z [n] or residual signal e [n]) is not suppressed by the state information ESstate [f], the following Expression 29-3 or Expression 29-
As shown in FIG. 4, the transmission signal is output as it is as the transmission output signal s ′ [n].

これらのとき、どちらの信号が信号選択部４１１Ｌで選択されても、出力された送話出
力信号ｓ’[ｎ]が聴感上滑らかに繋がるように、時間方向にスムージングしたエコー抑圧
ゲインＧ［ｆ］を乗じてもよい。 At these times, regardless of which signal is selected by the signal selection unit 411L, the echo suppression gain G [f smoothed in the time direction is smoothly connected so that the transmitted transmission output signal s ′ [n] is smoothly connected for hearing. ] May be multiplied.

受話信号抑圧部（ＳＳ）４１１ｒは、制御部（ＣＴＲＬ）４１１ｋから出力された状態
情報ＥＳｓｔａｔｅ［ｆ］と、エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎから出力さ
れたエコー抑圧ゲインＧ［ｆ］と、通信部（ＣＯＭ）１０１から出力された受話入力信号
ｘ［ｎ］とを入力とし、状態情報ＥＳｓｔａｔｅ［ｆ］によって受話入力信号ｘ［ｎ］を
抑圧すると判定された場合、受話信号ｘ［ｎ］に対してエコー抑圧ゲイン算出部（ＧＣＡ
Ｌ）４１１ｎで算出されたエコー抑圧ゲインＧ［ｆ］を用いて、以下の式３０−１が示す
ように信号を抑圧し、受話出力信号ｘ’［ｎ］を算出して出力する。 The received signal suppression unit (SS) 411r includes state information ESstate [f] output from the control unit (CTRL) 411k, echo suppression gain G [f] output from the echo suppression gain calculation unit (GCAL) 411n, When the reception input signal x [n] output from the communication unit (COM) 101 is input, and it is determined that the reception input signal x [n] is suppressed by the state information ESstate [f], the reception signal x [n] Echo suppression gain calculator (GCA)
L) Using the echo suppression gain G [f] calculated in 411n, the signal is suppressed as shown in the following Expression 30-1, and the received output signal x ′ [n] is calculated and output.

また、状態情報ＥＳｓｔａｔｅ［ｆ］によって受話入力信号ｘ［ｎ］を抑圧しないと判
定された場合は、以下の式３０−２が示すように受話入力信号ｘ［ｎ］をそのまま受話出
力信号ｘ’［ｎ］として出力する。

If it is determined that the received input signal x [n] is not suppressed by the state information ESstate [f], the received input signal x [n] is directly used as the received output signal x ′ as shown in the following Expression 30-2. Output as [n].

次に、上記のように構成された第４の実施形態に係る信号処理装置の処理の流れを説明
する。図１９は、第４の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the fourth embodiment configured as described above will be described. FIG. 19 is a flowchart showing an overall processing flow of the signal processing apparatus according to the fourth embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００３のオフセット除去処理の後、遅延処理部（ＤＥＬＡＹ）４０２は受
話信号抑圧部（ＳＳ）４１１ｒから出力された受話出力信号ｘ’［ｎ］を一時的に蓄えて
遅延させる処理を行う（ステップＳ４００４）。そしてステップＳ１００５のエコーキャ
ンセラ処理に移る。ステップＳ１００５のエコーキャンセラ処理の後、信号処理装置は、
第１の実施形態においてはステップＳ１００６でエコーリダクション処理を行ったのに代
えて、第４の実施形態においては、ステップＳ１００５で得られたエコーキャンセラ処理
後の信号を用いてエコーサプレッサ処理を行う（ステップＳ４００６）。そして、ステッ
プＳ１００７の終話か否かの判断に移る。 After the offset removal processing in step S1003, the delay processing unit (DELAY) 402 performs processing for temporarily storing and delaying the reception output signal x ′ [n] output from the reception signal suppression unit (SS) 411r (step S1003). S4004). Then, the process proceeds to the echo canceller process in step S1005. After the echo canceller process in step S1005, the signal processing apparatus
Instead of performing echo reduction processing in step S1006 in the first embodiment, echo suppression processing is performed using the signal after echo canceller processing obtained in step S1005 in the fourth embodiment ( Step S4006). Then, the process proceeds to step S1007 to determine whether or not the call is an end story.

第４の実施形態に係るエコーキャンセル部（ＥＣ）１１０は、第１の実施形態に係るエ
コーキャンセル部（ＥＣ）１１０と同じであるので、それぞれの実施形態における処理の
流れは、当然に同じであり説明を省略する。 Since the echo cancellation unit (EC) 110 according to the fourth embodiment is the same as the echo cancellation unit (EC) 110 according to the first embodiment, the flow of processing in each embodiment is naturally the same. There is no explanation.

図２０は、第４の実施形態に係るエコーサプレッサ部（ＥＳ）４１１における処理の流
れを示すフローチャートである。まず、受話パワー算出部（ＰＯＷ）４１１ｄは、受話入
力信号ｘ［ｎ］から受話パワーＰ_Ｘ［ｆ］を算出し（ステップＳ４２０１ｒ）、送話パワ
ー算出部（ＰＯＷ）４１１ｅは、送話入力信号ｚ［ｎ］から送話パワーＰ_Ｚ［ｆ］を算出
し（ステップＳ４２０１ｓ）、残差パワー算出部（ＰＯＷ）４１１ｆは、残差信号ｅ［ｎ
］から残差パワーＰ_Ｅ［ｆ］を算出する（ステップＳ４２０１ｅ）。 FIG. 20 is a flowchart showing the flow of processing in the echo suppressor unit (ES) 411 according to the fourth embodiment. First, the received power calculator (POW) 411d calculates the received power P _X [f] from the received input signal x [n] (step S4201r), and the transmitted power calculator (POW) 411e receives the transmitted input signal. The transmission power P _Z [f] is calculated from z [n] (step S4201s), and the residual power calculation unit (POW) 411f receives the residual signal e [n.
] To calculate the residual power P _E [f] (step S4201e).

エコー抑圧量推定部（ＥＣＬＥ）４１１ｉは、送話パワーＰ_Ｚ［ｆ］と残差パワーＰ_Ｅ
［ｆ］を入力として、エコーキャンセラ部（ＥＣ）１１０で抑圧されたエコー抑圧量ＥＣ
Ｌ［ｆ］を推定する（ステップＳ４２０２）。 The echo suppression amount estimation unit (ECLE) 411i transmits the transmission power P _Z [f] and the residual power P _E.
Echo suppression amount EC suppressed by echo canceller unit (EC) 110 using [f] as input
L [f] is estimated (step S4202).

制御部（ＣＴＲＬ）４１１ｋは、エコー抑圧量ＥＣＬ［ｆ］を入力としてフレーム毎に
エコー抑圧量が十分であるか否かを判定し、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］を出力す
る。また、制御部（ＣＴＲＬ）４１１ｋは、受話パワーＰ_Ｘ［ｆ］と送話パワーＰ_Ｚ［ｆ
］に基づいて、受話信号抑圧部（ＳＳ）４１１ｒによって受話入力信号ｘ［ｎ］を抑圧す
るか、送話信号抑圧部（ＳＳ）４１１ｏによって送話信号（送話入力信号ｚ［ｎ］または
残差信号ｅ［ｎ］）を抑圧するか判定し、状態情報ＥＳｓｔａｔｅ［ｆ］を出力する。（
ステップＳ４２０３）。 The control unit (CTRL) 411k receives the echo suppression amount ECL [f] as an input, determines whether or not the echo suppression amount is sufficient for each frame, and outputs control information EScontrol [f]. In addition, the control unit (CTRL) 411k receives the reception power P _X [f] and the transmission power P _Z [f].
], The received input signal x [n] is suppressed by the received signal suppression unit (SS) 411r, or the transmitted signal (transmitted input signal z [n] or the remaining signal is suppressed by the transmitted signal suppressing unit (SS) 411o. It is determined whether the difference signal e [n]) is suppressed, and the state information ESstate [f] is output. (
Step S4203).

そして、信号選択部４１１Ｌは、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］を入力として、エ
コー抑圧量が十分でないと判定されたフレームについて、信号としてｚ［ｎ］を選択して
出力し、エコー抑圧量が十分であると判定されたフレームでは信号としてｅ［ｎ］を選択
して出力する（ステップＳ４２０４）。 Then, the signal selection unit 411L receives the control information EScontrol [f] as an input, selects and outputs z [n] as a signal for the frame determined to have insufficient echo suppression amount, and the echo suppression amount is sufficient. In a frame determined to be present, e [n] is selected and output as a signal (step S4204).

エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎは、制御情報ＥＳｃｏｎｔｒｏｌ［ｆ］
とエコー抑圧量ＥＣＬ［ｆ］を入力として、エコー抑圧量が十分でないと検出されたフレ
ームと、それ以外のフレームとで、それぞれ異なる計算によってエコー抑圧ゲインＧ［ｆ
］を算出して出力する（ステップＳ４２０５）。 The echo suppression gain calculation unit (GCAL) 411n controls the control information EScontrol [f].
And the echo suppression amount ECL [f] as an input, the echo suppression gain G [f] is calculated by a different calculation for each of the frames detected as having insufficient echo suppression amount and the other frames.
] Is calculated and output (step S4205).

送話信号抑圧部（ＳＳ）４１１ｏは、エコー抑圧ゲインＧ［ｆ］と状態情報ＥＳｓｔａ
ｔｅ［ｆ］を入力として、送話信号（送話入力信号ｚ［ｎ］または残差信号ｅ［ｎ］）を
抑圧すると判定された場合に、信号選択部４１１Ｌで選択された信号とエコー抑圧ゲイン
算出部（ＧＣＡＬ）４１１ｎで算出されたエコー抑圧ゲインＧ［ｆ］を用いて、信号を抑
圧して送話出力信号ｓ’［ｎ］として出力する。あるいは、送話信号を抑圧しないと判定
された場合は、信号選択部４１１Ｌで選択された信号を送話出力信号ｓ’［ｎ］として出
力する（ステップＳ４２０６ｓ）。一方、受話信号抑圧部（ＳＳ）４１１ｒは、エコー抑
圧ゲインＧ［ｆ］と状態情報ＥＳｓｔａｔｅ［ｆ］を入力として、受話入力信号ｘ［ｎ］
を抑圧すると判定された場合、受話信号ｘ［ｎ］に対してエコー抑圧ゲイン算出部（ＧＣ
ＡＬ）４１１ｎで算出されたエコー抑圧ゲインＧ［ｆ］を用いて、信号を抑圧して受話出
力信号ｘ’［ｎ］として出力する。あるいは、受話入力信号ｘ［ｎ］を抑圧しないと判定
された場合は、受話信号ｘ［ｎ］を受話出力信号ｘ’［ｎ］として出力する（ステップＳ
４２０６ｒ）。これにより、エコーサプレッサ処理が終了する。 The transmission signal suppression unit (SS) 411o has an echo suppression gain G [f] and state information ESsta.
When it is determined to suppress the transmission signal (transmission input signal z [n] or residual signal e [n]) using te [f] as an input, the signal selected by the signal selection unit 411L and echo suppression The echo suppression gain G [f] calculated by the gain calculation unit (GCAL) 411n is used to suppress the signal and output it as a transmission output signal s ′ [n]. Alternatively, if it is determined not to suppress the transmission signal, the signal selected by the signal selection unit 411L is output as the transmission output signal s ′ [n] (step S4206s). On the other hand, the received signal suppression unit (SS) 411r receives the echo suppression gain G [f] and the state information ESstate [f] as input, and receives the received input signal x [n].
Is determined to be suppressed, an echo suppression gain calculation unit (GC) is applied to the received signal x [n].
AL) Using the echo suppression gain G [f] calculated in 411n, the signal is suppressed and output as a reception output signal x ′ [n]. Alternatively, when it is determined not to suppress the reception input signal x [n], the reception signal x [n] is output as the reception output signal x ′ [n] (step S).
4206r). As a result, the echo suppressor process ends.

以上、エコーサプレッサ部（ＥＳ）４１１は、時間領域型でフレーム毎に処理する方式
として動作するものとして説明した。ＦＦＴやフィルタバンクなどの帯域分割フィルタな
どを用いて周波数領域型に処理する方式で実現してもよい。 As described above, the echo suppressor unit (ES) 411 has been described as operating as a time domain type processing method for each frame. You may implement | achieve by the system processed to a frequency domain type | mold using band division filters, such as FFT and a filter bank.

以上説明した信号処理装置の動作により、エコーキャンセラ部（ＥＣ）１１０の処理に
よるエコー抑圧量が十分でない場合をエコー抑圧量推定部（ＥＣＬＥ）４１１ｉと制御部
（ＣＴＲＬ）４１１ｋで検出して、エコー抑圧ゲイン算出部（ＧＣＡＬ）４１１ｎと送話
信号抑圧部（ＳＳ）４１１ｏにおいてエコーキャンセラ部（ＥＣ）１１０のエコー抑圧量
を考慮することにより過剰な送話音声に対する抑圧を避けることができるので、エコーパ
スロス変動に対して頑健で高品質な信号を出力することができる。 By the operation of the signal processing apparatus described above, the echo suppression amount estimation unit (ECLE) 411i and the control unit (CTRL) 411k detect when the echo suppression amount due to the processing of the echo canceller unit (EC) 110 is not sufficient, and the echo Since the suppression gain calculation unit (GCAL) 411n and the transmission signal suppression unit (SS) 411o take into account the echo suppression amount of the echo canceller unit (EC) 110, it is possible to avoid suppression of excessive transmission speech. It is possible to output a high-quality signal that is robust against path loss fluctuations.

（第５の実施形態）
第５の実施形態に係る信号処理装置が第１の実施形態に係る信号処理装置と異なる点は
、エコーリダクション部（ＥＲ）１１１を有さず、エコーリダクション部（ＥＲ）５１１
を有する点にある。一方、その他の部分は同じである。そこで、同じ部分については、同
じ符号を付してその説明を省略する。 (Fifth embodiment)
The signal processing device according to the fifth embodiment is different from the signal processing device according to the first embodiment in that it does not have the echo reduction unit (ER) 111, but the echo reduction unit (ER) 511.
It is in the point which has. On the other hand, the other parts are the same. Therefore, the same parts are denoted by the same reference numerals and the description thereof is omitted.

図２１に、本発明の第５の実施形態に係る信号処理装置のエコーリダクション部（ＥＲ
）５１１の構成を示すブロック図を示す。本発明の第５の実施形態に係る信号処理装置の
エコーリダクション部（ＥＲ）５１１が第１の実施形態に係る信号処理装置のエコーリダ
クション部（ＥＲ）１１１と異なる点を以下に説明する。なお、第１の実施形態に係るエ
コーリダクション部（ＥＲ）１１１及び第２の実施形態に係るノイズリダクション部（Ｎ
Ｒ）２１１と同じ部分については、同じ符号を付してその説明を省略する。 FIG. 21 shows an echo reduction unit (ER) of a signal processing device according to the fifth embodiment of the invention.
) Shows a block diagram showing the configuration of 511. The difference between the echo reduction unit (ER) 511 of the signal processing device according to the fifth embodiment of the present invention and the echo reduction unit (ER) 111 of the signal processing device according to the first embodiment will be described below. Note that the echo reduction unit (ER) 111 according to the first embodiment and the noise reduction unit (N) according to the second embodiment.
The same parts as those in R) 211 are given the same reference numerals and their description is omitted.

エコーリダクション部（ＮＲ）５１１は、周波数領域変換処理部（ＦＴ）１１１ａと、
周波数領域変換処理部（ＦＴ）１１１ｂと、周波数領域変換処理部（ＦＴ）１１１ｃと、
受話パワー算出部（ＰＯＷ）１１１ｄと、送話パワー算出部（ＰＯＷ）１１１ｅと、残差
パワー算出部（ＰＯＷ）１１１ｆと、音響結合量推定部（ＡＣＬＥ）５１１ｇと、エコー
量推定部（ＥＬＥ）５１１ｈと、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉと、制御部（
ＣＴＲＬ）５１１ｋと、スペクトル選択部５１１Ｌと、ゲイン格納部（ＧＴＢＬ）１１１
ｍ、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎと、周波数領域逆変換処理部（ＩＦＴ
）１１１ｐと、送話信号抑圧部（ＳＳ）５１１ｓと、残差信号抑圧部（ＥＳ）５１１ｔと
からなる。 The echo reduction unit (NR) 511 includes a frequency domain transform processing unit (FT) 111a,
A frequency domain transform processing unit (FT) 111b, a frequency domain transform processing unit (FT) 111c,
Received power calculation unit (POW) 111d, transmission power calculation unit (POW) 111e, residual power calculation unit (POW) 111f, acoustic coupling amount estimation unit (ACLE) 511g, and echo amount estimation unit (ELE) 511h, an echo suppression amount estimation unit (ECLE) 111i, a control unit (
CTRL) 511k, spectrum selection unit 511L, and gain storage unit (GTBL) 111
m, an echo suppression gain calculator (GCAL) 511n, and a frequency domain inverse transform processor (IFT)
) 111p, a transmission signal suppression unit (SS) 511s, and a residual signal suppression unit (ES) 511t.

エコーリダクション部（ＥＲ）５１１は、遅延処理部（ＤＥＬＡＹ）１０２から出力さ
れた遅延した受話入力信号ｘ［ｎ−Ｄ］と、ハイパスフィルタ部（ＨＰＦ）１０９から出
力された送話入力信号ｚ［ｎ］と、信号減算処理部１１０ｂから出力された残差信号ｅ［
ｎ］とを入力とし、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の少なくとも一方か
らエコー成分を抑圧し、そのエコー抑圧後の信号を送話出力信号ｓ’［ｎ］（ｎ＝０，１
，・・・，Ｎ−１）として１フレームごとに出力する。 The echo reduction unit (ER) 511 receives the delayed received input signal x [n−D] output from the delay processing unit (DELAY) 102 and the transmitted input signal z [output from the high pass filter unit (HPF) 109. n] and the residual signal e [
n] as an input, the echo component is suppressed from at least one of the transmission input signal z [n] or the residual signal e [n], and the signal after the echo suppression is transmitted as the transmission output signal s ′ [n] ( n = 0, 1
,..., N-1) are output every frame.

音響結合量推定部（ＡＣＬＥ）５１１ｇは、受話パワー算出部（ＰＯＷ）１１１ｄから
出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送話パワ
ー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル｜
Ｚ_s［ｆ，ω］｜^２と、残差パワー算出部（ＰＯＷ）１１１ｆから出力されたスムージン
グされた残差パワースペクトル｜Ｅ_s［ｆ，ω］｜^２とを入力とし、音響結合量｜Ｈ_Z［ｆ
，ω］｜^２及び｜Ｈ_Ｅ［ｆ，ω］｜^２を、それぞれ式３１−１、式３１−２よって算出す
る。

The acoustic coupling amount estimation unit (ACLE) 511g includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d and the transmission power calculation unit (POW) 111e. Smoothed transmission power spectrum output from
Z _s [f, ω] | ² and the smoothed residual power spectrum | E _s [f, ω] | ² output from the residual power calculation unit (POW) 111f are input, and the acoustic coupling amount | H _Z [f
, Ω] | ² and | H _E [f, ω] | ² are calculated by Expression 31-1 and Expression 31-2, respectively.

そして、音響結合量推定部（ＡＣＬＥ）５１１ｇは、以下の式３２−１及び式３２−２
が示すように１フレーム前の値を用いてスムージングした音響結合量｜Ｈ_ＺＳ［ｆ，ω］
｜^２及び｜Ｈ_ＥＳ［ｆ，ω］｜^２を算出して出力する。ただし、α_ＨＺ［ω］及びα_ＨＥ
［ω］は、０．０３〜０．９９程度が望ましい。

And the acoustic coupling amount estimation part (ACLE) 511g is the following formula 32-1 and formula 32-2.
Indicates that the amount of acoustic coupling smoothed using the value one frame before | H _ZS [f, ω]
| ² and | H _ES [f, ω] | ² are calculated and output. However, α _HZ [ω] and α _HE
[Ω] is preferably about 0.03 to 0.99.

ここで、通話開始初期時、例えば通話開始から５秒間程度はα_ＨＺ［ω］及びα_ＨＥ［
ω］を大きくすることで音響結合量｜Ｈ_ＺＳ［ｆ，ω］｜^２及び｜Ｈ_ＥＳ［ｆ，ω］｜^２
の更新を速めておく。このようにすることで通話開始初期時は音響結合量が初期化されて
いるので、通話開始初期時に抑圧量が少なくなってしまうのを防止することができる。 Here, at the beginning of the call, for example, about 5 seconds after the start of the call, α _HZ [ω] and α _HE [
The acoustic coupling amount | H _ZS [f, ω] | ² and | H _ES [f, ω] | ² by increasing ω].
Speed up the update. By doing so, since the acoustic coupling amount is initialized at the beginning of the call, it is possible to prevent the suppression amount from being reduced at the beginning of the call.

ただし、音響結合量が急激に変化する場合、即ち、不等式｜Ｈ_Z［ｆ，ω］｜^２＞β_Ｚ
_Ｈ［ω］・｜Ｈ_ＺＳ［ｆ−１，ω］｜^２及び｜Ｈ_Ｅ［ｆ，ω］｜^２＞β_ＥＨ［ω］・｜Ｈ
_ＥＳ［ｆ−１，ω］｜^２が成り立つ場合、及び受話入力信号が十分に大きくない場合、即
ち、不等式｜Ｘ_Ｓ［ｆ，ω］｜^２＜β_Ｘ［ω］が成り立つ場合には、エコーパス変動への
高速な追従性を保ちつつ、ダブルトークとなる周波数帯域での音響結合量の算出を行わな
いようにするため、音響結合量を更新しないで１フレーム前の過去の音響結合量｜Ｈ_ＺＳ
［ｆ−１，ω］｜^２及び｜Ｈ_ＥＳ［ｆ−１，ω］｜^２を用いる。 However, when the amount of acoustic coupling changes abruptly, that is, the inequality | H _Z [f, ω] | ² > β _Z
_H [ω] · | H _ZS [f−1, ω] | ² and | H _E [f, ω] | ² > β _EH [ω] · | H
_{If ES} [f−1, ω] | ² holds and if the received input signal is not large enough, that is, if the inequality | X _S [f, ω] | ² <β _X [ω] holds, In order not to calculate the amount of acoustic coupling in the frequency band where double talk occurs while maintaining high-speed follow-up to echo path fluctuations, the past acoustic coupling amount one frame before without updating the acoustic coupling amount | H _ZS
[F-1, ω] | ² and | H _ES [f-1, ω] | ² are used.

極端な音響結合量の変化はダブルトークの可能性があるので、このように音響結合量を
更新しないでおくことで送話音質劣化を防止することができる。ただし、β_ＺＨ［ω］及
びβ_ＥＨ［ω］は０．９〜３０程度が望ましい。β_Ｘ［ω］は３０ｄＢ〜４０ｄＢ程度が
望ましい。 Since an extreme change in the amount of acoustic coupling has the possibility of double talk, it is possible to prevent deterioration in transmitted sound quality by not updating the amount of acoustic coupling in this way. However, β _ZH [ω] and β _EH [ω] are preferably about 0.9 to 30. β _X [ω] is desirably about 30 dB to 40 dB.

エコー量推定部（ＥＬＥ）５１１ｈは、受話パワー算出部（ＰＯＷ）１１１ｄから出力
されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、音響結合量推
定部（ＡＣＬＥ）５１１ｇから出力された音響結合量｜Ｈ_ＺＳ［ｆ，ω］｜^２及び｜Ｈ_Ｅ
_Ｓ［ｆ，ω］｜^２とを入力とし、送話入力信号の周波数スペクトルＺ［ｆ，ω］に含まれ
る送話エコー量｜Ｙ_Ｚ［ｆ，ω］｜^２及び残差信号の周波数スペクトルＥ［ｆ，ω］に含
まれる残差エコー量｜Ｙ_Ｅ［ｆ，ω］｜^２を、それぞれ以下の式３３−１及び式３３−２
が示すように周波数帯域ω毎に推定する。

The echo amount estimation unit (ELE) 511h includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d and the acoustic coupling amount estimation unit (ACLE) 511g. Output acoustic coupling amount | H _ZS [f, ω] | ² and | H _E
_S [f, ω] | ² as an input, the amount of transmitted echoes | Y _Z [f, ω] | ² included in the frequency spectrum Z [f, ω] of the transmission input signal and the frequency spectrum of the residual signal The residual echo amount | Y _E [f, ω] | ² included in _E [f, ω] is expressed by the following equations 33-1 and 33-2, respectively.
As shown in FIG.

そして、エコー量推定部（ＥＬＥ）５１１ｈは、瞬時のエコー量｜Ｙ_Ｚ［ｆ，ω］｜^２
及び｜Ｙ_Ｅ［ｆ，ω］｜^２を用いるよりもスムージングした値を用いた方がエコー抑圧後
の信号をより自然な信号にできるため、以下の式３４−１及び式３４−２が示すように１
フレーム前の値を用いてスムージングしたエコー量｜Ｙ_ＺＳ［ｆ，ω］｜^２及び｜Ｙ_ＥＳ
［ｆ，ω］｜^２を周波数帯域ω毎に算出して出力する。ただし、α_ＺＹ［ω］及びα_ＥＹ
［ω］は０．７〜０．９９程度が望ましい。

Then, the echo amount estimation unit (ELE) 511h provides an instantaneous echo amount | Y _Z [f, ω] | ²
And | Y _E [f, ω] | ² , the smoothed value can be used to make the signal after echo suppression a more natural signal. Therefore, the following equations 34-1 and 34-2 are shown. 1
Echo amount smoothed using values before frame | Y _ZS [f, ω] | ² and | Y _ES
[F, ω] | ² is calculated and output for each frequency band ω. Where α _ZY [ω] and α _EY
[Ω] is preferably about 0.7 to 0.99.

制御部（ＣＴＲＬ）５１１ｋは、エコー抑圧量推定部（ＥＣＬＥ）１１１ｉから出力さ
れたエコー抑圧量ＥＣＬ［ｆ，ω］を入力とし、エコーキャンセラ部（ＥＣ）１１０のエ
コー抑圧量が十分でない周波数帯域であるか否かの情報である制御情報ＥＲｃｏｎｔｒｏ
ｌ［ｆ，ω］を出力する。具体的には、制御部（ＣＴＲＬ）５１１ｋは、周波数帯域ω毎
にＥＣＬ［ｆ，ω］＞β_Ｚ［ω］を満たした周波数帯域であり、かつ１フレーム当りの式
ＥＣＬ［ｆ，ω］＞β_Ｚ［ω］が成立した帯域数ｃｏｕｎｔがβ_Ｃより大きい場合にエコ
ー抑圧量が少ないと判定する。ただし、β_Ｚ［ω］は＋８ｄＢ〜−１５ｄＢ程度、β_Ｃは
全周波数帯域数の１０％〜４０％程度が望ましい。本実施形態では以下、β_Ｚ［ω］を０
ｄＢとして説明する。 The control unit (CTRL) 511k receives the echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 111i, and the frequency band where the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient. Control information ERcontrol which is information on whether or not
l [f, ω] is output. Specifically, the control unit (CTRL) 511k is a frequency band satisfying ECL [f, ω]> β _Z [ω] for each frequency band ω, and the expression ECL [f, ω] per frame. When the number of bands count in which> β _Z [ω] is established is larger than β _C, it is determined that the echo suppression amount is small. However, β _Z [ω] is preferably about +8 dB to −15 dB, and β _C is preferably about 10% to 40% of the total number of frequency bands. In the present embodiment, β _Z [ω] is set to 0 below.
It will be described as dB.

エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、送話パワー算出部（ＰＯＷ）１１１
ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、残
差パワー算出部（ＰＯＷ）１１１ｆから出力されたスムージングされた残差パワースペク
トル｜Ｅ_Ｓ［ｆ，ω］｜^２と、エコー量推定部（ＥＬＥ）５１１ｈから出力されたスムー
ジングした送話エコー量｜Ｙ_ＺＳ［ｆ，ω］｜^２及びスムージングした残差エコー量｜Ｙ
_ＥＳ［ｆ，ω］｜^２と、ゲイン格納部（ＧＴＢＬ）１１１ｍから出力されたパラメータγ
［ω］とを入力とし、送話エコー抑圧ゲインＧ_Ｚ［ｆ，ω］及び残差エコー抑圧ゲインＧ
_Ｅ［ｆ，ω］を算出して出力する。 The echo suppression gain calculation unit (GCAL) 511n is a transmission power calculation unit (POW) 111.
The smoothed transmission power spectrum | Z _S [f, ω] | ² output from e and the smoothed residual power spectrum | E _S [f, output from the residual power calculation unit (POW) 111 f ω] | ² and the smoothed transmission echo amount | Y _ZS [f, ω] | ² and the smoothed residual echo amount | Y output from the echo amount estimation unit (ELE) 511h
_ES [f, ω] | ² and the parameter γ output from the gain storage unit (GTBL) 111m
[Ω] as an input, the transmission echo suppression gain G _Z [f, ω] and the residual echo suppression gain G
_E [f, ω] is calculated and output.

具体的には、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、送話エコー抑圧ゲイン
Ｇ_Ｚ［ｆ，ω］及び残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］をウィナー・フィルタ（Wiener
Filter）法を用いて、以下に示す式３５−１及び式３５−２によって算出する。

Specifically, the echo suppression gain calculation unit (GCAL) 511n converts the transmission echo suppression gain G _Z [f, ω] and the residual echo suppression gain G _E [f, ω] to a Wiener filter (Wiener
Using the (Filter) method, calculation is performed according to the following Expression 35-1 and Expression 35-2.

また、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、過剰なエコー抑圧により送話
音質が劣化するのを防止し、背景雑音の断続的な抑圧を防止するため、送話エコー抑圧ゲ
インＧ_Ｚ［ｆ，ω］及び残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］が所定の下限値以下になら
ないように制御する。 Further, the echo suppression gain calculation unit (GCAL) 511n prevents the transmission sound quality from being deteriorated due to excessive echo suppression, and prevents the background noise from being intermittently suppressed, so that the transmission echo suppression gain G _Z [f , Ω] and residual echo suppression gain G _E [f, ω] are controlled so as not to fall below a predetermined lower limit value.

さらに、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、過剰なエコー抑圧により送
話音質が劣化するのを防止するため、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の
有音区間ではない区間の信号レベルを用いて周波数帯域ごとに背景雑音レベルを算出して
おき、背景雑音レベルよりも抑圧しないように送話エコー抑圧ゲインＧ_Ｚ［ｆ，ω］及び
残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］を制御しても構わない。 Further, the echo suppression gain calculation unit (GCAL) 511n prevents the transmission sound quality from deteriorating due to excessive echo suppression, so that the voiced interval of the transmission input signal z [n] or the residual signal e [n] The background noise level is calculated for each frequency band using the signal level of the section that is not, and the transmission echo suppression gain G _Z [f, ω] and the residual echo suppression gain G are prevented from being suppressed below the background noise level. _E [f, ω] may be controlled.

さらにまた、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、送話音質が劣化するの
を防止するために、エコー抑圧ゲインＧ_Ｚ［ｆ，ω］を以下の式３６−１あるいは式３６
−２に示すように、そして、Ｇ_Ｅ［ｆ，ω］を以下の式３６−３あるいは式３６−４に示
すように周波数方向にスムージングして出力してもよい。例えば、ε_ｊは、［０．１、０
．２、０．４、０．２、０．１］、η_ｊは、［０．１、０．２、０．４、０．８、０．４
、０．２、０．１］のようにしてよい。

Furthermore, the echo suppression gain calculation unit (GCAL) 511n sets the echo suppression gain G _Z [f, ω] to the following Expression 36-1 or Expression 36 in order to prevent the transmission sound quality from deteriorating.
-2 and G _E [f, ω] may be smoothed and output in the frequency direction as shown in the following Expression 36-3 or Expression 36-4. For example, ε _j is [0.1, 0
. 2, 0.4, 0.2, 0.1], η _j is [0.1, 0.2, 0.4, 0.8, 0.4
, 0.2, 0.1].

送話信号抑圧部（ＳＳ）５１１ｓは、周波数領域変換処理部（ＦＴ）１１１ｂから出力
された送話入力信号の周波数スペクトルＺ［ｆ，ω］と、エコー抑圧ゲイン算出部（ＧＣ
ＡＬ）５１１ｎから出力された送話エコー抑圧ゲインＧ_Ｚ［ｆ，ω］とを入力として、送
話入力信号の周波数スペクトルのエコーを抑圧し、以下の式３７が示すようにスペクトル
Ｓ_Ｚ’［ｆ，ω］を出力する。

The transmission signal suppression unit (SS) 511s includes a frequency spectrum Z [f, ω] of the transmission input signal output from the frequency domain conversion processing unit (FT) 111b, and an echo suppression gain calculation unit (GC).
AL) The transmission echo suppression gain G _Z [f, ω] output from 511n is used as an input to suppress the echo of the frequency spectrum of the transmission input signal, and the spectrum S _Z ′ [ f, ω] is output.

残差信号抑圧部（ＥＳ）５１１ｔは、周波数領域変換処理部（ＦＴ）１１１ｃから出力
された残差信号の周波数スペクトルＥ［ｆ，ω］と、エコー抑圧ゲイン算出部（ＧＣＡＬ
）５１１ｎから出力された残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］とを入力として、残差信
号の周波数スペクトルのエコーを抑圧し、以下のようにスペクトルＳ_Ｅ’［ｆ，ω］を出
力する。

The residual signal suppression unit (ES) 511t includes a frequency spectrum E [f, ω] of the residual signal output from the frequency domain transform processing unit (FT) 111c, and an echo suppression gain calculation unit (GCAL).
) The residual echo suppression gain G _E [f, ω] output from 511n is input, and the echo of the frequency spectrum of the residual signal is suppressed, and the spectrum S _E ′ [f, ω] is output as follows: To do.

スペクトル選択部５１１Ｌは、送話信号抑圧部（ＳＳ）５１１ｓから出力されたエコー
抑圧された送話入力信号の周波数スペクトルＳ_Ｚ’［ｆ，ω］と、残差信号抑圧部（ＥＳ
）５１１ｔから出力されたエコー抑圧された残差信号の周波数スペクトルＳ_Ｅ’［ｆ，ω
］と、制御部（ＣＴＲＬ）５１１ｋから出力されたエコーキャンセラ部（ＥＣ）１１０の
エコー抑圧量が十分でない周波数帯域であるか否かを示す情報である制御情報ＥＲｃｏｎ
ｔｒｏｌ［ｆ，ω］とを入力とし、エコー抑圧された送話入力信号の周波数スペクトルＳ
_Ｚ’［ｆ，ω］か、エコー抑圧された残差信号の周波数スペクトルＳ_Ｅ’［ｆ，ω］のい
ずれか一方を周波数スペクトルとして選択して出力する。 The spectrum selection unit 511L includes the frequency spectrum S _Z ′ [f, ω] of the echo-suppressed transmission input signal output from the transmission signal suppression unit (SS) 511s and the residual signal suppression unit (ES
) Frequency spectrum S _E ′ [f, ω of echo-suppressed residual signal output from 511t
] And control information ERcon that is information indicating whether or not the echo suppression amount of the echo canceller unit (EC) 110 output from the control unit (CTRL) 511k is an insufficient frequency band
trol [f, ω] as an input and the frequency spectrum S of the echo-suppressed transmission input signal
Either _Z ′ [f, ω] or the frequency spectrum S _E ′ [f, ω] of the echo-suppressed residual signal is selected and output as a frequency spectrum.

具体的には、制御情報ＥＲｃｏｎｔｒｏｌ［ｆ，ω］がエコーキャンセラ部（ＥＣ）１
１０のエコー抑圧量が十分でないと検出された周波数帯域であった場合は、スペクトル選
択部５１１Ｌは、周波数スペクトルとしてＳ’_Ｚ［ｆ，ω］を選択する。それ以外の周波
数帯域は周波数スペクトルとしてＳ’_Ｅ［ｆ，ω］を選択する。 Specifically, the control information ERcontrol [f, ω] is the echo canceller (EC) 1
When the frequency band detected that the echo suppression amount of 10 is not sufficient, the spectrum selection unit 511L selects S ′ _Z [f, ω] as the frequency spectrum. For other frequency bands, S ′ _E [f, ω] is selected as the frequency spectrum.

このようにすることで周波数帯域ごとに、エコーキャンセラ部（ＥＣ）１１０のエコー
抑圧量が十分でない場合、つまりエコーキャンセラ部（ＥＣ）１１０の推定精度が十分と
れない場合はエコーリダクション部（ＥＲ）５１１を単体で動作させることができる。 By doing so, when the echo suppression amount of the echo canceller unit (EC) 110 is not sufficient for each frequency band, that is, when the estimation accuracy of the echo canceller unit (EC) 110 is not sufficient, the echo reduction unit (ER) 511 can be operated alone.

次に、上記のように構成された第５の実施形態に係る信号処理装置の処理の流れを説明
する。図２２は、第５の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the fifth embodiment configured as described above will be described. FIG. 22 is a flowchart showing an overall processing flow of the signal processing apparatus according to the fifth embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００５のエコーキャンセラ処理の後、信号処理装置は、第１の実施形態に
おいてはステップＳ１００６でエコーリダクション処理を行ったのに代えて、第５の実施
形態においては、ステップＳ１００５で得られたエコーキャンセラ処理後の信号を用いて
エコーリダクション処理を行う（ステップＳ５００６）。そして、ステップＳ１００７の
終話か否かの判断に移る。 After the echo canceller process in step S1005, the signal processing apparatus replaces the echo reduction process in step S1006 in the first embodiment with the echo obtained in step S1005 in the fifth embodiment. An echo reduction process is performed using the signal after the canceller process (step S5006). Then, the process proceeds to step S1007 to determine whether or not the call is an end story.

第５の実施形態に係るエコーキャンセル部（ＥＣ）１１０は、第１の実施形態に係るエ
コーキャンセル部（ＥＣ）１１０と同じであるので、それぞれの実施形態における処理の
流れは、当然に同じであり説明を省略する。 Since the echo cancellation unit (EC) 110 according to the fifth embodiment is the same as the echo cancellation unit (EC) 110 according to the first embodiment, the flow of processing in each embodiment is naturally the same. There is no explanation.

図２３は、第５の実施形態に係るエコーリダクション部（ＥＲ）５１１における処理の
流れを示すフローチャートである。なお、図７を参照して説明した第１の実施形態に係る
エコーリダクション部（ＥＲ）１１１及び図１２を参照して説明した第２の実施形態に係
るノイズリダクション部（ＮＲ）２１１における動作と同じ動作ステップについては、同
じ符号を付してその部分の説明を省略する。 FIG. 23 is a flowchart showing the flow of processing in the echo reduction unit (ER) 511 according to the fifth embodiment. The operation of the echo reduction unit (ER) 111 according to the first embodiment described with reference to FIG. 7 and the operation of the noise reduction unit (NR) 211 according to the second embodiment described with reference to FIG. About the same operation step, the same code | symbol is attached | subjected and description of the part is abbreviate | omitted.

第５の実施形態に係るエコーリダクション部（ＥＲ）５１１は、エコーリダクション処
理を開始して、ステップＳ１２０３ｒの受話パワースペクトル算出処理、ステップＳ１２
０３ｓの送話パワースペクトル算出処理、及びステップＳ１２０３ｅの残差パワースペク
トル算出処理の後、音響結合量推定部（ＡＣＬＥ）５１１ｇは、スムージングされた受話
パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とスムージングされた送話パワースペクトル｜Ｚ
_Ｓ［ｆ，ω］｜^２を入力として音響結合量｜Ｈ_ＺＳ［ｆ，ω］｜^２を算出する（ステップ
Ｓ５２０４ｓ）。同様に、音響結合量推定部（ＡＣＬＥ）５１１ｇは、スムージングされ
た受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とスムージングされた残差パワースペクト
ル｜Ｅ_Ｓ［ｆ，ω］｜^２を入力として音響結合量｜Ｈ_ＥＳ［ｆ，ω］｜^２を算出する（ス
テップＳ５２０４ｅ）。 The echo reduction unit (ER) 511 according to the fifth embodiment starts the echo reduction process, and the received power spectrum calculation process in step S1203r, step S12.
Sending power spectrum calculation process of 03s, and after the residual power spectrum calculation processing in step S1203e, the acoustic coupling amount estimating unit (ACLE) 511 g is smoothed received power spectrum _{| X S [f, ω]} | 2 and Smoothed transmission power spectrum | Z
_The acoustic coupling amount | H _ZS [f, ω] | ² is calculated by using _S [f, ω] | ² as an input (step S5204s). Similarly, the acoustic coupling amount estimating unit (ACLE) 511 g is smoothed received power spectrum _{| X S [f, ω]} | 2 and the smoothed residual power spectrum _{| E S [f, ω]} | Input ² As a result, the acoustic coupling amount | H _ES [f, ω] | ² is calculated (step S5204e).

次に、エコー量推定部（ＥＬＥ）５１１ｈは、音響結合量｜Ｈ_ＺＳ［ｆ，ω］｜^２とス
ムージングした受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とを入力として送話エコー量
｜Ｙ_ＺＳ［ｆ，ω］｜^２を推定する（ステップＳ５２０５ｓ）。同様に、エコー量推定部
（ＥＬＥ）５１１ｈは、音響結合量｜Ｈ_ＥＳ［ｆ，ω］｜^２とスムージングした受話パワ
ースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２とを入力として残差エコー量｜Ｙ_ＥＳ［ｆ，ω］｜^２
を推定する（ステップＳ５２０５ｅ）。 Then, the echo estimation unit (ELE) 511h is acoustic coupling amount _{| H ZS [f, ω]} | 2 and smoothing the received power spectrum _{| X S [f, ω]} | 2 and transmission echo amount as input | Y _ZS [f, ω] | ² is estimated (step S5205s). Similarly, the echo estimation unit (ELE) 511h is acoustic coupling amount _{| H ES [f, ω]} | 2 and smoothing the received power spectrum _{| X S [f, ω]} | 2 and residual echo quantity as an input | Y _ES [f, ω] | ²
Is estimated (step S5205e).

そして、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、エコー量推定部（ＥＬＥ）
５１１ｈから出力された送話エコー量｜Ｙ_ＺＳ［ｆ，ω］｜^２と、送話パワー算出部（Ｐ
ＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω
］｜^２とを入力として、送話エコー抑圧ゲインＧ_Ｚ［ｆ，ω］を算出し、所定の下限値以
下にならないように制御する（ステップＳ１２０８ｓ）。 Then, the echo suppression gain calculation unit (GCAL) 511n is an echo amount estimation unit (ELE).
511h and the transmission echo amount | Y _ZS [f, ω] | ² and the transmission power calculation unit (P
OW) 111e and the smoothed transmission power spectrum | Z _S [f, ω
] | ² is input, and the transmission echo suppression gain G _Z [f, ω] is calculated, and control is performed so as not to be below a predetermined lower limit value (step S1208s).

同様に、エコー抑圧ゲイン算出部（ＧＣＡＬ）５１１ｎは、エコー量推定部（ＥＬＥ）
５１１ｈから出力された残差エコー量｜Ｙ_ＥＳ［ｆ，ω］｜^２と、残差パワー算出部（Ｐ
ＯＷ）１１１ｆから出力されたスムージングされた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω
］｜^２とを入力として、残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］を算出し、所定の下限値以
下にならないように制御する（ステップＳ１２０８ｅ）。 Similarly, an echo suppression gain calculation unit (GCAL) 511n is an echo amount estimation unit (ELE).
511h output residual echo amount | Y _ES [f, ω] | ² and residual power calculation unit (P
OW) 111f smoothed residual power spectrum | E _S [f, ω
] ² is input, and the residual echo suppression gain G _E [f, ω] is calculated and controlled so as not to be equal to or lower than a predetermined lower limit (step S1208e).

その後、送話信号抑圧部（ＳＳ）５１１ｓは、周波数領域変換処理部（ＦＴ）１１１ｂ
から出力された送話入力信号の周波数スペクトルＺ［ｆ，ω］と、エコー抑圧ゲイン算出
部（ＧＣＡＬ）５１１ｎで算出された送話エコー抑圧ゲインＧ_Ｚ［ｆ，ω］を入力として
、送話入力信号の周波数スペクトルのエコーを抑圧する（ステップＳ５２１０ｓ）。 After that, the transmission signal suppression unit (SS) 511s is a frequency domain transform processing unit (FT) 111b.
The frequency spectrum Z [f, ω] of the transmission input signal output from the signal and the transmission echo suppression gain G _Z [f, ω] calculated by the echo suppression gain calculation unit (GCAL) 511n are input. The echo of the frequency spectrum of the input signal is suppressed (step S5210s).

同様に、残差信号抑圧部（ＥＳ）５１１ｔは、周波数領域変換処理部（ＦＴ）１１１ｃ
から出力された残差信号の周波数スペクトルＥ［ｆ，ω］と、エコー抑圧ゲイン算出部（
ＧＣＡＬ）５１１ｎで算出された残差エコー抑圧ゲインＧ_Ｅ［ｆ，ω］を入力として、残
差信号の周波数スペクトルのエコーを抑圧する（ステップＳ５２１０ｅ）。 Similarly, the residual signal suppression unit (ES) 511t is a frequency domain transform processing unit (FT) 111c.
The frequency spectrum E [f, ω] of the residual signal output from, and the echo suppression gain calculation unit (
GCAL) The residual echo suppression gain G _E [f, ω] calculated in 511n is used as an input to suppress the echo of the frequency spectrum of the residual signal (step S5210e).

次に、ステップＳ１２０６のエコー抑圧量算出処理及びステップＳ２２０７の判定処理
の後、スペクトル選択部５１１Ｌは、送話信号抑圧部（ＳＳ）５１１ｓから出力されたエ
コー抑圧された送話入力信号の周波数スペクトルＳ_Ｚ’［ｆ，ω］と、残差信号抑圧部（
ＥＳ）５１１ｔから出力されたエコー抑圧された残差信号の周波数スペクトルＳ_Ｅ’［ｆ
，ω］と、制御部（ＣＴＲＬ）５１１ｋから出力された制御情報ＥＲｃｏｎｔｒｏｌ［ｆ
，ω］とを入力とし、エコー抑圧量が十分でないと検出された周波数帯域では、エコー抑
圧された送話入力信号の周波数スペクトルＳ_Ｚ’［ｆ，ω］を周波数スペクトルとして選
択して出力し、それ以外の周波数帯域ではエコー抑圧された残差信号の周波数スペクトル
Ｓ_Ｅ’［ｆ，ω］を周波数スペクトルとして選択して出力する（ステップＳ５２０９）。 Next, after the echo suppression amount calculation processing in step S1206 and the determination processing in step S2207, the spectrum selection unit 511L performs the frequency spectrum of the echo-suppressed transmission input signal output from the transmission signal suppression unit (SS) 511s. S _Z '[f, ω] and the residual signal suppression unit (
ES) Frequency spectrum S _E ′ [f of the echo-suppressed residual signal output from 511t
, Ω] and control information ERcontrol [f output from the control unit (CTRL) 511k.
, Ω] as an input, and in the frequency band where the echo suppression amount is detected to be insufficient, the frequency spectrum S _Z ′ [f, ω] of the echo-suppressed transmission input signal is selected and output as the frequency spectrum. In other frequency bands, the frequency spectrum S _E ′ [f, ω] of the residual signal subjected to echo suppression is selected and output as a frequency spectrum (step S5209).

そして、ステップＳ１２１１の周波数逆変換処理に移って、エコーリダクション処理が
終了する。 Then, the process proceeds to the frequency inverse transform process in step S1211, and the echo reduction process ends.

以上、エコーリダクション部（ＥＲ）５１１はＦＦＴによる周波数領域型で周波数帯域
毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域をグルー
プでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯域分割フ
ィルタなどの周波数領域型を実現してもよい。 As described above, the echo reduction unit (ER) 511 has been described as operating as a method of processing for each frequency band in the frequency domain type by FFT. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

以上説明した信号処理装置の動作により、エコーキャンセラ部（ＥＣ）１１０の処理に
よるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（ＥＣＬＥ）１１１ｉと
制御部（ＣＴＲＬ）５１１ｋで判定して、スペクトル選択部５１１Ｌとエコー抑圧ゲイン
算出部（ＧＣＡＬ）５１１ｎにおいて周波数帯域毎にエコーキャンセラ部（ＥＣ）１１０
の処理とエコーリダクション部（ＥＲ）５１１の処理の重みを変化させるように制御する
ことができるので、エコーパスロス変動に対して頑健にすることが可能であり、高品質な
信号を出力することが可能である。 By the operation of the signal processing apparatus described above, the echo suppression amount estimation unit (ECLE) 111i and the control unit (CTRL) 511k determine the frequency region where the echo suppression amount by the processing of the echo canceller unit (EC) 110 is not sufficient, An echo canceller (EC) 110 for each frequency band in the spectrum selector 511L and the echo suppression gain calculator (GCAL) 511n.
And the processing weight of the echo reduction unit (ER) 511 can be controlled to be changed, so that it is possible to make robust against echo path loss fluctuations and to output a high-quality signal. Is possible.

また、エコーリダクション部（ＥＲ）５１１の音響結合量推定部（ＡＣＬＥ）５１１ｇ
とエコー量推定部（ＥＬＥ）５１１ｈにおいて、送話入力信号ｚ［ｎ］に含まれるエコー
量と残差信号ｅ［ｎ］に含まれるエコー量を別々に推定し、エコー抑圧ゲイン算出部（Ｇ
ＣＡＬ）５１１ｎにおいて別々にエコー抑圧ゲインを算出し、送話信号抑圧部（ＳＳ）５
１１ｓ及び残差信号抑圧部（ＥＳ）５１１ｔにおいて別々に抑圧しているため、過剰なエ
コー抑圧を防止し送話音質が劣化するのを防止することが可能である。 Also, the acoustic coupling amount estimation unit (ACLE) 511g of the echo reduction unit (ER) 511
And the echo amount estimation unit (ELE) 511h separately estimate the echo amount contained in the transmission input signal z [n] and the echo amount contained in the residual signal e [n], and the echo suppression gain calculation unit (G
(CAL) 511n separately calculates the echo suppression gain, and the transmission signal suppression unit (SS) 5
11s and the residual signal suppression unit (ES) 511t are separately suppressed, so that it is possible to prevent excessive echo suppression and prevent deterioration of the transmission sound quality.

（第６の実施形態）
図２４は、第６の実施形態に係る信号処理装置の構成を示すブロック図である。この信
号処理装置が第１の実施形態に係る信号処理装置と異なる点は、図２４に示すようにエコ
ーキャンセラ部（ＥＣ）１１０とエコーリダクション部（ＥＲ）１１１を有さず、エコー
キャンセラ部（ＥＣ）６１０とエコーリダクション部（ＥＲ）６１１を有し、エコーリダ
クション部（ＥＲ）６１１の外部にエコー抑圧量推定部（ＥＣＬＥ）６１２を有する点に
あり、その他の部分は同じである。 (Sixth embodiment)
FIG. 24 is a block diagram illustrating a configuration of a signal processing device according to the sixth embodiment. This signal processing device is different from the signal processing device according to the first embodiment in that it does not have an echo canceller (EC) 110 and an echo reduction unit (ER) 111 as shown in FIG. EC) 610 and an echo reduction unit (ER) 611, and an echo suppression amount estimation unit (ECLE) 612 outside the echo reduction unit (ER) 611. The other parts are the same.

そこで、第１の実施形態に係る信号処理装置と同じ部分については、同じ符号を付して
その説明を省略する。なお、このエコーリダクション部（ＥＲ）６１１で、第１の実施形
態に係るエコーリダクション部（ＥＲ）１１１と同じ部分については、同じ符号を付して
その説明を省略する。 Therefore, the same parts as those of the signal processing apparatus according to the first embodiment are denoted by the same reference numerals and description thereof is omitted. In the echo reduction unit (ER) 611, the same parts as those of the echo reduction unit (ER) 111 according to the first embodiment are denoted by the same reference numerals, and description thereof is omitted.

図２５は、エコーキャンセラ部（ＥＣ）６１０の構成を示すブロック図である。このエ
コーキャンセラ部（ＥＣ）６１０は、周波数領域変換処理部（ＦＴ）６１０ｄ、周波数領
域適応フィルタ部（ＦＤＡＤＦ）６１０ｅ、周波数領域逆変換処理部（ＩＦＴ）６１０ｆ
、信号減算処理部６１０ｇ、周波数領域変換処理部（ＦＴ）６１０ｈ、周波数領域ダブル
トーク検出部（ＦＤＤＴＤ）６１０ｉからなる。 FIG. 25 is a block diagram showing the configuration of the echo canceller unit (EC) 610. The echo canceller (EC) 610 includes a frequency domain transform processor (FT) 610d, a frequency domain adaptive filter unit (FDADF) 610e, and a frequency domain inverse transform processor (IFT) 610f.
, A signal subtraction processing unit 610g, a frequency domain conversion processing unit (FT) 610h, and a frequency domain double talk detection unit (FDDTD) 610i.

図２６は、エコーリダクション部（ＥＲ）６１１の構成を示すブロック図である。この
エコーリダクション部（ＥＲ）６１１は、周波数領域変換処理部（ＦＴ）１１１ａと、周
波数領域変換処理部（ＦＴ）１１１ｂと、周波数領域変換処理部（ＦＴ）１１１ｃと、受
話パワー算出部（ＰＯＷ）１１１ｄと、送話パワー算出部（ＰＯＷ）１１１ｅと、音響結
合量推定部（ＡＣＬＥ）６１１ｇと、エコー量推定部（ＥＬＥ）１１１ｈと、周波数領域
ダブルトーク検出部（ＦＤＴＤ）６１１ｊと、制御部（ＣＴＲＬ）６１１ｋと、スペクト
ル選択部１１１Ｌと、ゲイン格納部（ＧＴＢＬ）１１１ｍと、エコー抑圧ゲイン算出部（
ＧＣＡＬ）６１１ｎと、信号抑圧部（ＳＳ）１１１ｏと、周波数領域逆変換処理部（ＩＦ
Ｔ）１１１ｐと、送話出力パワー算出部（ＰＯＷ）６１１ｕとからなる。 FIG. 26 is a block diagram showing a configuration of the echo reduction unit (ER) 611. The echo reduction unit (ER) 611 includes a frequency domain transformation processing unit (FT) 111a, a frequency domain transformation processing unit (FT) 111b, a frequency domain transformation processing unit (FT) 111c, and a received power calculation unit (POW). 111d, transmission power calculation unit (POW) 111e, acoustic coupling amount estimation unit (ACLE) 611g, echo amount estimation unit (ELE) 111h, frequency domain double talk detection unit (FDTD) 611j, and control unit ( CTRL) 611k, spectrum selection unit 111L, gain storage unit (GTBL) 111m, echo suppression gain calculation unit (
GCAL) 611n, signal suppressor (SS) 111o, and frequency domain inverse transform processor (IF
T) 111p and a transmission output power calculation unit (POW) 611u.

図２７は、音響結合量推定部（ＡＣＬＥ）６１１ｇの構成を示すブロック図である。こ
の音響結合量推定部（ＡＣＬＥ）６１１ｇは、音響結合量推定部（ＣＡＣＬ）６１１ｇ１
と、音響結合量補正部（ＡＤＪ）６１１ｇ２と、音響結合量平滑部（ＳＭＡＣＬ）６１１
ｇ３とからなる。 FIG. 27 is a block diagram illustrating a configuration of the acoustic coupling amount estimation unit (ACLE) 611g. The acoustic coupling amount estimation unit (ACLE) 611g is an acoustic coupling amount estimation unit (CACL) 611g1.
And an acoustic coupling amount correction unit (ADJ) 611g2 and an acoustic coupling amount smoothing unit (SMACL) 611
g3.

上記のように構成された、本発明の第６の実施形態に係る信号処理装置の各部の動作を
、図２４〜図２７を参照して説明する。 The operation of each part of the signal processing device according to the sixth embodiment of the present invention configured as described above will be described with reference to FIGS.

エコーキャンセラ部（ＥＣ）６１０は、ハイパスフィルタ部（ＨＰＦ）１０９から出力
された送話入力信号ｚ［ｎ］と、遅延処理部（ＤＥＬＡＹ）１０２から出力された遅延し
た受話入力信号ｘ［ｎ−Ｄ］を入力とし、オーバーラップ保存法（Overlap-Save Method
）、あるいはオーバーラップ加算法（Overlap-Add Method）に基づき、送話入力信号ｚ［
ｎ］からエコー成分を抑圧し、そのエコー抑圧後の信号を残差信号ｅ［ｎ］（ｎ＝０，１
，・・・，Ｎ−１）として出力し、またフィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］及びエコーパ
スロスの推定値λ_ＦＤＡＦ［ｆ，ω］を出力する。 The echo canceller unit (EC) 610 includes a transmission input signal z [n] output from the high-pass filter unit (HPF) 109 and a delayed reception input signal x [n− output from the delay processing unit (DELAY) 102. D] as an input, Overlap-Save Method
), Or based on the overlap-add method, the transmission input signal z [
n] suppresses the echo component, and the signal after the echo suppression is the residual signal e [n] (n = 0, 1
,..., N−1), and the filter coefficient H _FDAF [f, ω] and the estimated echo path loss λ _FDAF [f, ω] are output.

周波数領域変換処理部（ＦＴ）６１０ｄは、遅延処理部（ＤＥＬＡＹ）１０２から出力
された遅延した受話入力信号ｘ［ｎ−Ｄ］を入力とし、ＦＦＴ（Fast Fourier Transform
）などによって周波数領域に変換して、受話入力信号の周波数スペクトルＸ_ＦＤＡＦ［ｆ
，ω］を算出して出力する。このとき適宜、オーバーラップ保存法（Overlap-Save Metho
d）、あるいはオーバーラップ加算法（Overlap-Add Method）に基づき、ハミング窓など
による窓掛けや、過去のフレームを用いたり零補間したりオーバーラップを行う。 The frequency domain transform processing unit (FT) 610d receives the delayed received input signal x [n−D] output from the delay processing unit (DELAY) 102 as input, and performs FFT (Fast Fourier Transform).
) _{Or the like, and} the frequency spectrum _XFDAF [f
, Ω] is calculated and output. At this time, the overlap-save method (Overlap-Save Metho
d) Or, based on the overlap-add method, perform windowing using a Hamming window, etc., use past frames, perform zero interpolation, or overlap.

周波数領域適応フィルタ部（ＦＤＡＤＦ）６１０ｅは、フィルタ係数Ｈ_ＦＤＡＦ［ｆ，
ω］が可変のトランスバーサルフィルタ（Transversal Filter）で構成される周波数領域
の適応フィルタである。また、周波数領域適応フィルタ部（ＦＤＡＤＦ）６１０ｅは、周
波数領域変換処理部（ＦＴ）６１０ｄから出力された受話入力信号の周波数スペクトルＸ
_ＦＤＡＦ［ｆ，ω］と、周波数領域変換処理部（ＦＴ）６１０ｈから出力された１フレー
ム前の残差信号の周波数スペクトルＥ_ＦＤＡＦ［ｆ−１，ω］と、周波数領域ダブルトー
ク検出部（ＦＤＤＴＤ）６１０ｉから出力されたダブルトーク情報ＥＣｓｔａｔｅ［ｆ，
ω］とを入力とし、ダブルトーク情報ＥＣｓｔａｔｅ［ｆ，ω］がダブルトーク状態でな
かった場合はフィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］をフレームf及び周波数帯域ωごとに適
応学習し、ダブルトーク情報ＥＣｓｔａｔｅ［ｆ，ω］がダブルトーク状態であった場合
は適応学習を行わない。このようにしてフィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］を算出して出
力する。 The frequency domain adaptive filter unit (FDADF) 610e includes a filter coefficient H _FDAF [f,
ω] is a frequency domain adaptive filter composed of a variable transversal filter. Also, the frequency domain adaptive filter unit (FDADF) 610e is a frequency spectrum X of the received input signal output from the frequency domain transform processing unit (FT) 610d.
_FDAF [f, ω], the frequency spectrum E _FDAF [f−1, ω] of the residual signal one frame before output from the frequency domain transform processing unit (FT) 610h, and the frequency domain double talk detecting unit (FDDTD) ) Double talk information ECstate [f, output from 610i
ω] and the double talk information ECstate [f, ω] is not in the double talk state, the filter coefficient H _FDAF [f, ω] is adaptively learned for each frame f and frequency band ω, and the double talk information is obtained. If ECstate [f, ω] is in the double talk state, adaptive learning is not performed. In this way, the filter coefficient H _FDAF [f, ω] is calculated and output.

また、周波数領域適応フィルタ部（ＦＤＡＤＦ）６１０ｅは、周波数領域変換処理部（
ＦＴ）６１０ｄから出力された受話入力信号の周波数スペクトルＸ_ＦＤＡＦ［ｆ，ω］と
、フィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］とを用いて擬似エコー信号の周波数スペクトルＹ’
_ＦＤＡＦ［ｆ，ω］をＹ’_ＦＤＡＦ［ｆ，ω］＝Ｈ_ＦＤＡＦ［ｆ，ω］・Ｘ_ＦＤＡＦ［ｆ
，ω］として算出して出力する。 In addition, the frequency domain adaptive filter unit (FDADF) 610e includes a frequency domain conversion processing unit (
FT) The frequency spectrum Y ′ of the pseudo echo signal using the frequency spectrum X _FDAF [f, ω] of the received input signal output from _610d and the filter coefficient H _FDAF [f, ω].
_FDAF [f, ω] is changed to Y ′ _FDAF [f, ω] = H _FDAF [f, ω] · X _FDAF [f
, Ω].

周波数領域適応フィルタ部（ＦＤＡＤＦ）６１０ｅは、フィルタ係数Ｈ_ＦＤＡＦ［ｆ，
ω］の更新幅を制御する固定あるいは可変のステップサイズμ_F［ｆ，ω］を用いて、適
応学習を行う。 The frequency domain adaptive filter unit (FDADF) 610e includes a filter coefficient H _FDAF [f,
Adaptive learning is performed using a fixed or variable step size μ _F [f, ω] for controlling the update width of ω].

また、周波数領域適応フィルタ部（ＦＤＡＤＦ）６１０ｅは、例えばＬＭＳ（Least-Me
an-Square）アルゴリズム、ＮＬＭＳ（Normalized-Least-Mean-Square）アルゴリズム、
学習同定法、アフィン射影（ＡＰ：Affine-Projection）アルゴリズム、逐次最小二乗（
ＲＬＳ：Recursive-Least-Squares）アルゴリズムなどの線形適応アルゴリズムに基づく
適応フィルタや勾配制限型学習同定法（Gradient-limited Normalized-Least-Mean-Squar
e）、適応ボルテラフィルタ（Adaptive Volterra Filter）などの非線形適応アルゴリズ
ムに基づく適応フィルタで構成される。また、本実施形態では勾配拘束のない（gradient
unconstrained）周波数領域型適応フィルタの例を示しているが、勾配拘束のある（grad
ient constrained）周波数領域型適応フィルタで構成してもよい。 Further, the frequency domain adaptive filter unit (FDADF) 610e is, for example, an LMS (Least-Me).
an-Square) algorithm, NLMS (Normalized-Least-Mean-Square) algorithm,
Learning identification method, affine projection (AP) algorithm, sequential least squares (
Adaptive filter based on linear adaptive algorithm such as RLS (Recursive-Least-Squares) algorithm and gradient-limited learning identification method (Gradient-limited Normalized-Least-Mean-Square)
e), composed of adaptive filters based on non-linear adaptive algorithms such as Adaptive Volterra Filter. In this embodiment, there is no gradient constraint (gradient
An example of an unconstrained frequency domain adaptive filter, but with gradient constraints (grad
(ient constrained) frequency domain type adaptive filter.

周波数領域逆変換処理部（ＩＦＴ）６１０ｆは、周波数領域適応フィルタ部（ＦＤＡＤ
Ｆ）６１０ｅから出力された擬似エコー信号の周波数スペクトルＹ’_ＦＤＡＦ［ｆ，ω］
を入力とし、ＩＦＦＴ（Inverse Fast Fourier Transform）などによって擬似エコー信号
ｙ’_ＦＤＡＦ［ｎ］（ｎ＝０，１，・・・，Ｎ−１）を算出して出力する。このとき適宜
、オーバーラップ保存法（Overlap-Save Method）あるいはオーバーラップ加算法（Overl
ap-Add Method）に基づき、過去のフレームを用いたり零補間したりオーバーラップを戻
したりする処理を行う。 The frequency domain inverse transform processing unit (IFT) 610f includes a frequency domain adaptive filter unit (FDAD).
F) Frequency spectrum Y ′ _FDAF [f, ω] of the pseudo echo signal output from _610e
, And the pseudo echo signal y ′ _FDAF [n] (n = 0, 1,..., N−1) is calculated and output by IFFT (Inverse Fast Fourier Transform) or the like. At this time, the overlap-save method (overlap-save method) or the overlap-add method (overl
Based on ap-Add Method), the past frame is used, zero interpolation is performed, and overlap is returned.

信号減算処理部６１０ｇは、ハイパスフィルタ部（ＨＰＦ）１０９から出力された送話
入力信号ｚ［ｎ］と、周波数領域逆変換処理部（ＩＦＴ）６１０ｆから出力された擬似エ
コー信号ｙ’_ＦＤＡＦ［ｎ］とを入力とし、送話入力信号ｚ［ｎ］から擬似エコー信号ｙ
’_ＦＤＡＦ［ｎ］をサンプルｎごとに減算し、エコー成分を抑圧し、そのエコー抑圧後の
信号である残差信号ｅ［ｎ］を算出して出力する。 Signal subtraction unit 610g includes a high-pass filter unit and (HPF) output from 109 sending speech input signal z [n], the pseudo echo signal y _'FDAF [n output from the frequency domain inverse transform unit (IFT) 610f ] And the pseudo-echo signal y from the transmission input signal z [n]
' _FDAF [n] is subtracted for each sample n, the echo component is suppressed, and a residual signal e [n] which is a signal after the echo suppression is calculated and output.

周波数領域変換処理部（ＦＴ）６１０ｈは、信号減算処理部６１０ｇから出力された時
間領域の残差信号ｅ［ｎ］を入力として、ＦＦＴ（Fast Fourier Transform）などによっ
て周波数領域に変換して、残差信号の周波数スペクトルＥ_ＦＤＡＦ［ｆ，ω］を算出して
出力する。このとき適宜、オーバーラップ保存法（Overlap-Save Method）、あるいはオ
ーバーラップ加算法（Overlap-Add Method）に基づき、ハミング窓などによる窓掛けや、
過去のフレームを用いたり零補間したりオーバーラップを行う。 The frequency domain transform processing unit (FT) 610h receives the time domain residual signal e [n] output from the signal subtraction processing unit 610g, converts it into the frequency domain by FFT (Fast Fourier Transform), etc. The frequency spectrum E _FDAF [f, ω] of the difference signal is calculated and output. At this time, based on the overlap-save method (Overlap-Save Method) or overlap-add method (Overlap-Add Method),
Use past frames, perform zero interpolation, or overlap.

周波数領域ダブルトーク検出部（ＦＤＤＴＤ）６１０ｉは、周波数領域変換処理部（Ｆ
Ｔ）６１０ｄから出力された受話信号の周波数スペクトルＸ_ＦＤＡＦ［ｆ，ω］と、周波
数領域変換処理部（ＦＴ）６１０ｈから出力された１フレーム前の残差信号の周波数スペ
クトルＥ_ＦＤＡＦ［ｆ−１，ω］とを、入力とし、フレームf及び周波数帯域ωごとにダ
ブルトーク状態か否かを判定し、ダブルトーク状態か否かを示す情報であるダブルトーク
情報ＥＣｓｔａｔｅ［ｆ，ω］と、エコーパスロスの推定値であるλ_ＦＤＡＦ［ｆ，ω］
を算出して出力する。 The frequency domain double talk detector (FDDTD) 610i is a frequency domain conversion processor (F
T) Frequency spectrum X _FDAF [f, ω] of the received signal output from 610d and frequency spectrum E _FDAF [f−1] of the residual signal one frame before output from the frequency domain transform processing unit (FT) 610h. , Ω] as input, it is determined for each frame f and frequency band ω whether or not it is in a double talk state, double talk information ECstate [f, ω] that is information indicating whether or not it is in a double talk state, and an echo Λ _FDAF [f, ω] which is the estimated path loss
Is calculated and output.

具体的には、まず周波数領域ダブルトーク検出部（ＦＤＤＴＤ）６１０ｉは、受話信号
のパワースペクトル｜Ｘ_ＦＤＡＦ［ｆ，ω］｜^２と１フレーム前の残差信号のパワースペ
クトル｜Ｅ_ＦＤＡＦ［ｆ−１，ω］｜^２とをフレームf及び周波数帯域ωごとに算出する
。そして、不等式｜Ｅ_ＦＤＡＦ［ｆ−１，ω］｜^２＞λ_ＦＤＡＦ［ｆ，ω］・｜Ｘ_ＦＤＡ
_Ｆ［ｆ，ω］｜^２が成り立つ場合にダブルトーク状態と判定する。ここでλ_ＦＤＡＦ［ｆ
，ω］は、エコーパスロスの推定値であり、適応学習が進めば小さくなり、適応学習が間
違っていれば大きくなる可変量である。また、λ_ＦＤＡＦ［ｆ，ω］は、フィルタ係数Ｈ
_ＦＤＡＦ［ｆ，ω］を適応学習したフレームf及び周波数帯域ωごとに更新して算出する
。 Specifically, first, the frequency domain double-talk detector (FDDDTD) 610i determines the power spectrum of the received signal | X _FDAF [f, ω] | ² and the power spectrum of the residual signal one frame before | E _FDAF [f− 1, ω] | ² for each frame f and frequency band ω. And the inequality | E _FDAF [f−1, ω] | ² > λ _FDAF [f, ω] · | X _FDA
_{If F} [f, ω] | ² holds, it is determined that the state is a double talk state. Where λ _FDAF [f
, Ω] is an estimated value of the echo path loss, and is a variable amount that decreases as adaptive learning progresses and increases when adaptive learning is incorrect. Also, λ _FDAF [f, ω] is the filter coefficient H
_FDAF [f, ω] is updated and calculated for each adaptively learned frame f and frequency band ω.

この場合、エコーキャンセラ部（ＥＣ）６１０は、フィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］
、ステップサイズμ_F［ｆ，ω］、エコーパスロスの推定値λ_ＦＤＡＦ［ｆ，ω］、ダブ
ルトーク情報ＥＣｓｔａｔｅ［ｆ，ω］、受話信号のパワースペクトル｜Ｘ_ＦＤＡＦ［ｆ
，ω］｜^２、残差信号のパワースペクトル｜Ｅ_ＦＤＡＦ［ｆ，ω］｜^２を内部状態として
メモリに保持している。 In this case, the echo canceller unit (EC) 610 has a filter coefficient H _FDAF [f, ω].
, Step size μ _F [f, ω], estimated echo path loss λ _FDAF [f, ω], double talk information ECstate [f, ω], received signal power spectrum | X _FDAF [f
, Ω] | ² and the power spectrum of the residual signal | E _FDAF [f, ω] | ² are held in the memory as internal states.

エコー抑圧量推定部（ＥＣＬＥ）６１２は、周波数領域適応フィルタ部（ＦＤＡＤＦ）
６１０ｅから出力されるフィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］と、周波数領域ダブルトーク
検出部（ＦＤＤＴＤ）６１０ｉから出力されるエコーパスロスの推定値λ_ＦＤＡＦ［ｆ，
ω］とを入力とし、エコーキャンセラ部（ＥＣ）６１０で抑圧されたエコー抑圧量ＥＣＬ
［ｆ，ω］を周波数帯域ω毎に推定して出力する。具体的には、ダブルトーク状態でない
場合に｜Ｅ_ＦＤＡＦ［ｆ，ω］｜^２を｜Ｅ_ＦＤＡＦ［ｆ，ω］｜^２＝λ_ＦＤＡＦ［ｆ，ω
］・｜Ｘ_ＦＤＡＦ［ｆ，ω］｜^２のように近似し、送話入力信号のパワースペクトル｜Ｙ
’_ＦＤＡＦ［ｆ，ω］＋Ｅ_ＦＤＡＦ［ｆ，ω］｜^２との比をエコー抑圧量ＥＣＬ［ｆ，ω
］として、以下に示す式３９のように算出する。

An echo suppression amount estimation unit (ECLE) 612 is a frequency domain adaptive filter unit (FDADF).
The filter coefficient H _FDAF [f, ω] output from the 610e and the estimated echo path loss λ _FDAF [f, ω] output from the frequency domain double talk detector (FDDTD) 610i.
ω] as an input and the echo suppression amount ECL suppressed by the echo canceller unit (EC) 610
[F, ω] is estimated and output for each frequency band ω. Specifically, when not in the double talk state, | E _FDAF [f, ω] | ^{2 is changed} to | E _FDAF [f, ω] | ² = λ _FDAF [f, ω
] · | X _FDAF [f, ω] | ² and approximate the power spectrum of the transmission input signal | Y
'The _{ratio of FDAF} [f, ω] + E _FDAF [f, ω] | ² is the echo suppression amount ECL [f, ω
] Is calculated as shown in Equation 39 below.

エコーリダクション部（ＥＲ）６１１は、受話信号ｘ［ｎ］及び送話入力信号ｚ［ｎ］
及び残差信号ｅ［ｎ］に基づいてエコー成分を抑圧するものであって、そのエコー抑圧後
の信号を送話出力信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）として出力する。 The echo reduction unit (ER) 611 receives the received signal x [n] and the transmitted input signal z [n].
And the echo component is suppressed based on the residual signal e [n], and the signal after the echo suppression is transmitted as the transmission output signal s ′ [n] (n = 0, 1,..., N− Output as 1).

音響結合量推定部（ＡＣＬＥ）６１１ｇは、受話パワー算出部（ＰＯＷ）１１１ｄから
出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送話パワ
ー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル｜
Ｚ_Ｓ［ｆ，ω］｜^２と、エコー抑圧量推定部（ＥＣＬＥ）６１２から出力されたエコー抑
圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊから出力
された周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］と、エコー量推定部（ＥＬ
Ｅ）１１１ｈから出力された１フレーム前のエコー量｜Ｙ_Ｓ［ｆ−１，ω］｜^２とを入力
とし、スムージングした補正音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出して出力する。 The acoustic coupling amount estimation unit (ACLE) 611g includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d, and the transmission power calculation unit (POW) 111e. Smoothed transmission power spectrum output from
Z _S [f, ω] | ² , the echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 612, and the frequency output from the frequency domain double talk detection unit (FDTD) 611j Area double talk information ERstate [f, ω] and an echo amount estimation unit (EL
E) 111h echo of one frame before output from _{| Y S [f-1,} ω] | ² and the input smoothing correction acoustic coupling amount _{| H S [f, ω]} | 2 calculates the Output.

音響結合量推定部（ＣＡＣＬ）６１１ｇ１は、受話パワー算出部（ＰＯＷ）１１１ｄか
ら出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送話パ
ワー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル
｜Ｚ_Ｓ［ｆ，ω］｜^２とを入力とし、エコーキャンセラ部（ＥＣ）６１０の性能の影響を
受けないように残差信号に基づく｜Ｅ_Ｓ［ｆ，ω］｜^２を用いずに送話入力信号に基づく
｜Ｚ_Ｓ［ｆ，ω］｜^２を用いて、周波数帯域ω毎に音響結合量｜Ｈ［ｆ，ω］｜^２を以下
に示す式４０のように算出して出力する。

The acoustic coupling amount estimation unit (CACL) 611g1 includes the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d and the transmission power calculation unit (POW) 111e. sending power spectrum is smoothed output from _{| Z S [f, ω]} | ² and the input, so as not to be affected by the performance of the echo canceller (EC) 610 based on the residual signal | E _S [f, ω] ^{| 2} based on the transmission input signal without using the _{| Z S [f, ω]} | 2 with an acoustic coupling amount for each frequency band ω | H [f, ω] | 2 below Is calculated and output as shown in Equation 40.

音響結合量補正部（ＡＤＪ）６１１ｇ２は、音響結合量推定部（ＣＡＣＬ）６１１ｇ１
から出力された音響結合量｜Ｈ［ｆ，ω］｜^２と、エコー抑圧量推定部（ＥＣＬＥ）６１
２から出力されたエコー抑圧量ＥＣＬ［ｆ，ω］と、送話パワー算出部（ＰＯＷ）１１１
ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、エ
コー量推定部（ＥＬＥ）１１１ｈから出力された１フレーム前のエコー量｜Ｙ_Ｓ［ｆ−１
，ω］｜^２とを入力とし、エコー抑圧量ＥＣＬ［ｆ，ω］に基づいて音響結合量｜Ｈ［ｆ
，ω］｜^２を補正した補正音響結合量｜Ｈ’［ｆ，ω］｜^２を算出して出力する。 The acoustic coupling amount correction unit (ADJ) 611g2 is an acoustic coupling amount estimation unit (CACL) 611g1.
Acoustic output amount | H [f, ω] | ² output from the signal and an echo suppression amount estimation unit (ECLE) 61
2 and echo transmission amount ECL [f, ω] output from the transmission power calculation unit (POW) 111.
The smoothed transmission power spectrum | Z _S [f, ω] | ² output from e and the echo amount one frame before output from the echo amount estimation unit (ELE) 111 h | Y _S [f−1
, Ω] | ² as an input, and the acoustic coupling amount | H [f based on the echo suppression amount ECL [f, ω]
, Ω] | ² , the corrected acoustic coupling amount | H ′ [f, ω] | ² is calculated and output.

具体的には、音響結合量補正部（ＡＤＪ）６１１ｇ２は、まず以下のようにエコーキャ
ンセラ部（ＥＣ）６１０で抑圧されたエコー抑圧量が十分であるか判定を行う。周波数帯
域ω毎に、不等式ＥＣＬ［ｆ，ω］＞β_ＡＣＬ［ω］が成立した１フレーム当りの帯域数
ｃｏｕｎｔも加味し、ＥＣＬ［ｆ，ω］＞β_ＡＣＬ［ω］かつｃｏｕｎｔ＞β_ＣＡＣＬと
なる周波数帯域をエコーキャンセラ部（ＥＣ）６１０のエコー抑圧量が十分でないと判定
する。ただし、β_ＡＣＬ［ω］は＋８ｄＢ〜−３５ｄＢ程度が、β_ＣＡＣＬは全周波数帯
域数の１０％〜４０％程度が望ましい。本実施形態では以下、β_ＡＣＬ［ω］を０ｄＢと
して説明する。 Specifically, the acoustic coupling amount correction unit (ADJ) 611g2 first determines whether the echo suppression amount suppressed by the echo canceller unit (EC) 610 is sufficient as follows. For each frequency band ω, ECL [f, ω]> β _ACL [ω] and count> β _CACL are also taken into account, with the number of bands per frame in which the inequality ECL [f, ω]> β _ACL [ω] is established. It is determined that the echo suppression amount of the echo canceller unit (EC) 610 is not sufficient. However, β _ACL [ω] is preferably about +8 dB to −35 dB, and β _CACL is preferably about 10% to 40% of the total number of frequency bands. In the present embodiment, hereinafter, β _ACL [ω] is described as 0 dB.

エコー抑圧量が十分であると判定された周波数帯域について、音響結合量補正部（ＡＤ
Ｊ）６１１ｇ２は、１フレーム前のエコー量｜Ｙ_Ｓ［ｆ−１，ω］｜^２よりも送話パワー
スペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２が大きい場合にのみ、以下に示す式４１のように音響結
合量｜Ｈ［ｆ，ω］｜^２を補正し、補正音響結合量｜Ｈ’［ｆ，ω］｜^２を算出する。

For the frequency band for which the echo suppression amount is determined to be sufficient, the acoustic coupling amount correction unit (AD
J) 611g2 is an expression 41 shown below only when the transmission power spectrum | Z _S [f, ω] | ² is larger than the echo amount | Y _S [f-1, ω] | ^{2 of} one frame before. The acoustic coupling amount | H [f, ω] | ² is corrected as shown below, and the corrected acoustic coupling amount | H ′ [f, ω] | ² is calculated.

上記の条件が満たされない場合、またエコーキャンセラ部（ＥＣ）６１０のエコー抑圧
量が十分でないと判定された周波数帯域では、音響結合量補正部（ＡＤＪ）６１１ｇ２は
、｜Ｈ’［ｆ，ω］｜^２＝｜Ｈ［ｆ，ω］｜^２として補正を行わない。 When the above condition is not satisfied, and in a frequency band in which the echo suppression amount of the echo canceller unit (EC) 610 is determined to be insufficient, the acoustic coupling amount correction unit (ADJ) 611g2 performs | H ′ [f, ω] ^{| 2 = | H [f,} ω] | 2 as not corrected.

音響結合量平滑部（ＳＭＡＣＬ）６１１ｇ３は、音響結合量補正部（ＡＤＪ）６１１ｇ
２から出力された補正音響結合量｜Ｈ’［ｆ，ω］｜^２と、周波数領域ダブルトーク検出
部（ＦＤＴＤ）６１１ｊから出力された周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ
，ω］とを入力として、以下に示す式４２のように１フレーム前の値を用いてスムージン
グした｜Ｈ_Ｓ［ｆ，ω］｜^２を算出して出力する。ただし、α_Ｈ［ω］は、０．０３〜０
．９９程度が望ましい。

The acoustic coupling amount smoothing unit (SMCL) 611g3 is an acoustic coupling amount correction unit (ADJ) 611g.
2 and the corrected acoustic coupling amount | H ′ [f, ω] | ² output from the frequency domain double talk information ERstate [f] output from the frequency domain double talk detector (FDTD) 611j.
As input and omega], smoothed by using the value of one frame before as shown in Equation 42 below _{| H S [f, ω]} | 2 and outputs them. However, α _H [ω] is 0.03 to 0.
. About 99 is desirable.

ここで、音響結合量平滑部（ＳＭＡＣＬ）６１１ｇ３は、通話開始初期時、例えば通話
開始から５秒間程度はα_Ｈ［ω］を大きくすることで音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２の
更新を速めておく。このようにすることで通話開始初期時は音響結合量が初期化されてい
るので、通話開始初期時に抑圧量が少なくなってしまうのを防止することができる。 Here, the acoustic coupling amount smoothing unit (SMCL) 611g3 increases the acoustic coupling amount | H _S [f, ω] | ² by increasing α _H [ω] at the beginning of the call, for example, for about 5 seconds from the start of the call. Speed up the update. By doing so, since the acoustic coupling amount is initialized at the beginning of the call, it is possible to prevent the suppression amount from being reduced at the beginning of the call.

ただし、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊから出力された周波数領
域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］によりダブルトーク状態であると判定され
る場合、音響結合量が急激に変化する場合、即ち、不等式｜Ｈ_Ｓ［ｆ，ω］｜^２＞β_Ｈ［
ω］・｜Ｈ_Ｓ［ｆ−１，ω］｜^２が成り立つ場合、及び受話信号が十分に大きくない場合
、即ち、不等式｜Ｘ_Ｓ［ｆ，ω］｜^２＜β_Ｘ［ω］が成り立つ場合には、エコーパス変動
への高速な追従性を保ちつつ、ダブルトークとなる周波数帯域での音響結合量の算出を行
わないようにするため、音響結合量平滑部（ＳＭＡＣＬ）６１１ｇ３は、音響結合量を更
新しないで１フレーム前の過去の音響結合量｜Ｈ_Ｓ［ｆ−１，ω］｜^２を用いる。 However, if the frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detector (FDTD) 611j is determined to be in the double talk state, the acoustic coupling amount changes abruptly, that is, , Inequality | H _S [f, ω] | ² > β _H [
ω] · | H _S [f−1, ω] | ² holds, and when the received signal is not sufficiently large, that is, the inequality | X _S [f, ω] | ² <β _X [ω] holds. In this case, the acoustic coupling amount smoothing unit (SMCL) 611g3 does not calculate the acoustic coupling amount in the frequency band where double talk is maintained while maintaining high-speed followability to the echo path fluctuation. The past acoustic coupling amount | H _S [f−1, ω] | ² one frame before is used without updating the amount.

極端な音響結合量の変化はダブルトークの可能性があるので、このように音響結合量を
更新しないでおくことで送話音質劣化を防止することができる。ただし、β_Ｈ［ω］は０
．９〜３０程度が望ましい。β_Ｘ［ω］は３０ｄＢ〜４０ｄＢ程度が望ましい。 Since an extreme change in the amount of acoustic coupling has the possibility of double talk, it is possible to prevent deterioration in transmitted sound quality by not updating the amount of acoustic coupling in this way. However, β _H [ω] is 0.
. About 9-30 is desirable. β _X [ω] is desirably about 30 dB to 40 dB.

周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊは、受話パワー算出部（ＰＯＷ）
１１１ｄから出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２
と、送話出力パワー算出部（ＰＯＷ）６１１ｕから出力された１フレーム前のスムージン
グされた送話出力パワースペクトル｜Ｓ’_Ｓ［ｆ−１，ω］｜^２とを入力とし、ダブルト
ーク状態か否かを示す情報を周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］を算
出して出力する。具体的には、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊは、
不等式｜Ｓ’_Ｓ［ｆ−１，ω］｜^２＞λ_ER［ｆ，ω］・｜Ｘ_Ｓ［ｆ，ω］｜^２が成り立つ
場合にダブルトーク状態と判定する。 The frequency domain double talk detector (FDTD) 611j is a received power calculator (POW).
Smoothed received power spectrum output from 111d | X _S [f, ω] | ²
And the transmission output power spectrum | S ′ _S [f−1, ω] | ² that has been smoothed one frame before output from the transmission output power calculation unit (POW) 611u, and is in a double talk state. The frequency domain double talk information ERstate [f, ω] is calculated and output as information indicating whether or not. Specifically, the frequency domain double talk detector (FDTD) 611j
Inequality _{| S 'S [f-1} , ω] | 2> λ ER [f, ω] · | X S [f, ω] | determines that the double-talk state in the case ^{where 2} is true.

ここでλ_ER［ｆ，ω］は、エコーパスロスの推定値であり、音響結合量｜Ｈ_Ｓ［ｆ，ω
］｜^２を更新したフレームf及び周波数帯域ωごとに更新し、音響結合量｜Ｈ_Ｓ［ｆ，ω
］｜^２の更新が進めば小さくなり、音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２の更新が進まなけれ
ば大きくなる可変量である。 Here, λ _ER [f, ω] is an estimated value of the echo path loss, and the acoustic coupling amount | H _S [f, ω
] ² is updated for each updated frame f and frequency band ω, and the acoustic coupling amount | H _S [f, ω
] | ^{2 is} a variable amount that decreases as the update proceeds, and increases as the acoustic coupling amount | H _S [f, ω] | ² does not progress.

制御部（ＣＴＲＬ）６１１ｋは、エコー抑圧量推定部（ＥＣＬＥ）６１２から出力され
たエコー抑圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１
ｊから出力された周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］とを入力とし、
周波数帯域ω毎にダブルトーク状態かつエコーキャンセラ部（ＥＣ）６１０のエコー抑圧
量が十分でない周波数帯域を検出して、ダブルトーク状態かつエコーキャンセラ部（ＥＣ
）６１０のエコー抑圧量が十分でない周波数帯域であるか否かを示す情報である制御情報
ＥＲｃｏｎｔｒｏｌ［ｆ，ω］を出力する。 The control unit (CTRL) 611k includes an echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 612 and a frequency domain double talk detection unit (FDTD) 611.
frequency domain double talk information ERstate [f, ω] output from j as input,
For each frequency band ω, the frequency band ω detects a frequency band in which the double talk state and the echo suppression amount of the echo canceller (EC) 610 are not sufficient, thereby detecting the double talk state and the echo canceller (EC
) The control information ERcontrol [f, ω], which is information indicating whether the echo suppression amount of 610 is a frequency band that is not sufficient, is output.

まず、制御部（ＣＴＲＬ）６１１ｋは、周波数帯域ω毎に周波数領域ダブルトーク検出
部（ＦＤＴＤ）６１１ｊでダブルトーク状態であると判定されているか否かによって周波
数帯域ω毎にダブルトーク状態であるか否かを判定する。 First, whether the control unit (CTRL) 611k is in the double talk state for each frequency band ω depending on whether or not the frequency domain double talk detection unit (FDTD) 611j determines that it is in the double talk state for each frequency band ω. Determine whether or not.

次に、制御部（ＣＴＲＬ）６１１ｋは、周波数帯域ω毎に不等式ＥＣＬ［ｆ，ω］＞β
_Ｚ［ω］が成り立ち、かつ不等式ｃｏｕｎｔ＞β_Ｃが成り立つ場合に周波数帯域をエコー
キャンセラ部（ＥＣ）６１０のエコー抑圧量が少ないと判定する。ただし、ｃｏｕｎｔは
１フレーム当りの式ＥＣＬ［ｆ，ω］＞β_Ｚ［ω］が成立した帯域数であり、β_Ｚ［ω］
は＋８ｄＢ〜−１５ｄＢ程度、β_Ｃは全周波数帯域数の１０％〜４０％程度が望ましい。
本実施形態では以下、β_Ｚ［ω］＝２．５（＋８ｄＢ）として説明する。 Next, the control unit (CTRL) 611k performs the inequality ECL [f, ω]> β for each frequency band ω.
_{When Z} [ω] is satisfied and the inequality count> β _C is satisfied, it is determined that the echo suppression amount of the echo canceller (EC) 610 is small in the frequency band. However, count is the number of bands in which the expression ECL [f, ω]> β _Z [ω] per frame is established, and β _Z [ω]
Is preferably about +8 dB to −15 dB, and β _C is preferably about 10% to 40% of the total number of frequency bands.
In the present embodiment, the following description will be made assuming that β _Z [ω] = 2.5 (+8 dB).

エコー抑圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、送話パワー算出部（ＰＯＷ）１１１
ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、エ
コー量推定部（ＥＬＥ）１１１ｈから出力されたスムージングしたエコー量｜Ｙ_Ｓ［ｆ，
ω］｜^２と、エコー抑圧量推定部（ＥＣＬＥ）６１２から出力されたエコー抑圧量ＥＣＬ
［ｆ，ω］と、制御部（ＣＴＲＬ）６１１ｋから出力された制御情報ＥＲｃｏｎｔｒｏｌ
［ｆ，ω］と、ゲイン格納部（ＧＴＢＬ）１１１ｍから出力されたパラメータγ［ω］と
を入力とし、スムージングされたエコー抑圧ゲインＧ_Ｓ［ｆ，ω］を算出して出力する。 The echo suppression gain calculation unit (GCAL) 611n is a transmission power calculation unit (POW) 111.
The smoothed transmission power spectrum | Z _S [f, ω] | ² output from e and the smoothed echo amount output from the echo amount estimation unit (ELE) 111 h | Y _S [f,
ω] | ² and the echo suppression amount ECL output from the echo suppression amount estimation unit (ECLE) 612
[F, ω] and control information ERcontrol output from the control unit (CTRL) 611k
[F, ω] and the parameter γ [ω] output from the gain storage unit (GTBL) 111m are input, and the smoothed echo suppression gain G _S [f, ω] is calculated and output.

具体的には、エコー抑圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、制御情報ＥＲｃｏｎｔ
ｒｏｌ［ｆ，ω］からダブルトーク状態かつエコーキャンセラ部（ＥＣ）６１０のエコー
抑圧量が十分でないと検出された周波数帯域については、エコー抑圧ゲインＧ［ｆ，ω］
をウィナー・フィルタ（Wiener Filter）法を用いて、式４３によって算出する。

Specifically, the echo suppression gain calculation unit (GCAL) 611n controls the control information ERcont.
For the frequency band detected from rol [f, ω] in a double talk state and the echo suppression amount of the echo canceller (EC) 610 is not sufficient, the echo suppression gain G [f, ω]
Is calculated by Equation 43 using the Wiener Filter method.

それ以外の周波数帯域では、エコーキャンセラ部（ＥＣ）６１０のエコー抑圧量が十分
であり、エコーキャンセラ部（ＥＣ）６１０が正常に機能していると見なして、エコー抑
圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、エコーリダクション部（ＥＲ）６１１のエコー
抑圧量を多くするように、エコーキャンセラ部（ＥＣ）６１０のエコー抑圧量ＥＣＬ［ｆ
，ω］を引数とした不等式０≦Ｆ（ＥＣＬ［ｆ，ω］）≦１を満たす関数Ｆ（ＥＣＬ［ｆ
，ω］）を用いて補正し、エコー抑圧ゲインＧ［ｆ，ω］を以下に示す式４４のように算
出する。

In other frequency bands, the echo suppression amount of the echo canceller unit (EC) 610 is sufficient and the echo canceller unit (EC) 610 is regarded as functioning normally, and the echo suppression gain calculation unit (GCAL) 611n. Is the echo suppression amount ECL [f of the echo canceller unit (EC) 610 so that the echo suppression amount of the echo reduction unit (ER) 611 is increased.
, Ω] as an argument, the function F (ECL [f [f] satisfying the inequality 0 ≦ F (ECL [f, ω]) ≦ 1.
, Ω]), and the echo suppression gain G [f, ω] is calculated as shown in Equation 44 below.

ここで関数Ｆ（ＥＣＬ［ｆ，ω］）は、以下のようにする。ただし、δ_Ｃは周波数帯域
数を考慮した正値にしておくことが望ましい。
０≦ＥＣＬ［ｆ，ω］≦１のとき式４５−１により、
１≦ＥＣＬ［ｆ，ω］≦β_Ｚ［ω］のとき式４５−２により、
β_Ｚ［ω］≦ＥＣＬ［ｆ，ω］のとき式４５−３あるいは式４５−４による。

Here, the function F (ECL [f, ω]) is as follows. However, [delta] _C it is desirable to positively value considering the number of frequency bands.
When 0 ≦ ECL [f, ω] ≦ 1, Equation 45-1 gives
When 1 ≦ ECL [f, ω] ≦ β _Z [ω]
_{β Z [ω] ≦ ECL [} f, ω] according to equation 45-3 or Formula 45-4 when.

このように抑圧ゲインの補正ゲインとしての関数Ｆ（ＥＣＬ［ｆ，ω］）を不等式０≦
Ｆ（ＥＣＬ［ｆ，ω］）≦１を満たすように設定することで、補正後の抑圧ゲインが不自
然な値になることを防ぐことができ、送話音質の劣化を防ぐことができる。また、このよ
うに関数Ｆ（ＥＣＬ［ｆ，ω］）を非線形な関数にすることで、エコーキャンセラ部（Ｅ
Ｃ）６１０のエコー抑圧量が十分でない場合（β_Ｚ［ω］≦ＥＣＬ［ｆ，ω］）に抑圧量
が大きくなるようにゲインを補正することができる。 Thus, the function F (ECL [f, ω]) as the correction gain of the suppression gain is expressed by the inequality 0 ≦
By setting so as to satisfy F (ECL [f, ω]) ≦ 1, it is possible to prevent the corrected suppression gain from becoming an unnatural value, and it is possible to prevent deterioration of the transmission sound quality. Further, by making the function F (ECL [f, ω]) a non-linear function in this way, the echo canceller (E
C) When the echo suppression amount of 610 is not sufficient (β _Z [ω] ≦ ECL [f, ω]), the gain can be corrected so as to increase the suppression amount.

また、エコー抑圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、過剰なエコー抑圧により送話
音質が劣化するのを防止し、背景雑音の断続抑圧を防止するため、抑圧ゲインＧ［ｆ，ω
］は所定の下限値以下にならないように制御する。 The echo suppression gain calculation unit (GCAL) 611n prevents the transmission sound quality from being deteriorated due to excessive echo suppression, and prevents the background noise from being intermittently suppressed.
] Is controlled so as not to fall below a predetermined lower limit value.

さらに、過剰なエコー抑圧により送話音質が劣化するのを防止するため、エコー抑圧ゲ
イン算出部（ＧＣＡＬ）６１１ｎは、送話入力信号ｚ［ｎ］あるいは残差信号ｅ［ｎ］の
有音区間ではない区間の信号レベルを用いて周波数帯域ごとに背景雑音レベルを算出して
おき、背景雑音レベルよりも抑圧しないようにエコー抑圧ゲインを制御しても構わない。 Further, in order to prevent the transmission sound quality from deteriorating due to excessive echo suppression, the echo suppression gain calculation unit (GCAL) 611n is provided with a sound interval of the transmission input signal z [n] or the residual signal e [n]. A background noise level may be calculated for each frequency band using a signal level of a section that is not, and the echo suppression gain may be controlled so as not to suppress the background noise level.

さらにまた、エコー抑圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、送話音質が劣化するの
を防止するために、エコー抑圧ゲインＧ［ｆ，ω］を以下の式４６−１あるいは式４６−
２が示すように周波数方向にスムージングして出力してもよい。例えば、ε_ｊは、［０．
１，０．２，０．４，０．２，０．１］、η_ｊは、［０．１，０．２，０．４，０．８，
０．４，０．２，０．１］のようにしてよい。

Furthermore, the echo suppression gain calculation unit (GCAL) 611n sets the echo suppression gain G [f, ω] to the following Expression 46-1 or Expression 46- in order to prevent the transmission sound quality from deteriorating.
As indicated by 2, the output may be smoothed in the frequency direction. For example, ε _j is [0.
1, 0.2, 0.4, 0.2, 0.1], η _j is [0.1, 0.2, 0.4, 0.8,
0.4, 0.2, 0.1].

送話出力パワー算出部（ＰＯＷ）６１１ｕは、信号抑圧部（ＳＳ）１１１ｏから出力さ
れた送話出力信号の周波数スペクトルＳ’［ｆ，ω］を入力とし、そのパワースペクトル
である送話出力パワースペクトル｜Ｓ’［ｆ，ω］｜^２を算出し、以下に示す式４７のよ
うに１フレーム前の値｜Ｓ’_Ｓ［ｆ−１，ω］｜^２を用いてスムージングした送話出力パ
ワースペクトル｜Ｓ’_Ｓ［ｆ，ω］｜^２を算出して出力する。ただし、α_Ｓ［ω］は、０
．７５〜０．９９９程度が望ましい。

The transmission output power calculation unit (POW) 611u receives the frequency spectrum S ′ [f, ω] of the transmission output signal output from the signal suppression unit (SS) 111o as an input, and the transmission output power that is the power spectrum. spectrum | S '[f, ω] | 2 is calculated, and the previous frame values as in equation 47 below _{| S' S [f-1} , ω] | 2 sending output power was smoothed using The spectrum | S ′ _S [f, ω] | ² is calculated and output. However, α _S [ω] is 0
. About 75 to 0.999 is desirable.

次に、上記のように構成された第６の実施形態に係る信号処理装置の処理の流れを説明
する。図２８は、第６の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the sixth embodiment configured as described above will be described. FIG. 28 is a flowchart showing an overall processing flow of the signal processing apparatus according to the sixth embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００４の遅延処理の後、遅延した受話入力信号ｘ［ｎ−Ｄ］とオフセット
除去された送話入力信号ｚ［ｎ］を入力として、エコーキャンセラ部（ＥＣ）６１０はエ
コーキャンセラ処理を行う（ステップＳ６００５）。 After the delay process in step S1004, the echo canceller unit (EC) 610 performs the echo canceller process using the delayed received input signal x [n-D] and the transmission input signal z [n] from which the offset is removed as input (step S1004). Step S6005).

次に、エコー抑圧量推定部（ＥＣＬＥ）６１２は、周波数領域適応フィルタ部（ＦＤＡ
ＤＦ）６１０ｅから出力されるフィルタ係数Ｈ_ＦＤＡＦ［ｆ，ω］と、周波数領域ダブル
トーク検出部（ＦＤＤＴＤ）６１０ｉから出力されるエコーパスロスの推定値λ_ＦＤＡＦ
［ｆ，ω］とを入力とし、エコーキャンセラ部（ＥＣ）６１０で抑圧されたエコー抑圧量
ＥＣＬ［ｆ，ω］を推定する（ステップＳ６００８）。 Next, the echo suppression amount estimation unit (ECLE) 612 includes a frequency domain adaptive filter unit (FDA).
DF) 610e filter coefficient H _FDAF [f, ω] and frequency domain double talk detector (FDDTD) 610i estimated echo path loss λ _FDAF
[F, ω] is used as an input, and the echo suppression amount ECL [f, ω] suppressed by the echo canceller (EC) 610 is estimated (step S6008).

そして、遅延した受話入力信号ｘ［ｎ−Ｄ］と、オフセット除去された送話入力信号ｚ
［ｎ］と、エコーキャンセラ部（ＥＣ）６１０から出力されたエラーキャンセラ処理後の
信号である残差信号ｅ［ｎ］を入力として、エコーリダクション部（ＥＲ）６１１は非線
形エコー抑圧処理であるエコーリダクション処理を行う（ステップＳ６００６）。そして
、ステップＳ１００７の終話か否かの判断に移る。 Then, the delayed reception input signal x [n−D] and the transmission input signal z with the offset removed.
[N] and the residual signal e [n], which is a signal after error canceller processing output from the echo canceller unit (EC) 610, are input, and the echo reduction unit (ER) 611 is an echo that is nonlinear echo suppression processing. Reduction processing is performed (step S6006). Then, the process proceeds to step S1007 to determine whether or not the call is an end story.

図２９は、第６の実施形態に係るエコーキャンセラ部（ＥＣ）６１０における処理の流
れを示すフローチャートである。 FIG. 29 is a flowchart showing the flow of processing in the echo canceller unit (EC) 610 according to the sixth embodiment.

エコーキャンセラ部（ＥＣ）６１０の処理は、次のように行われる。まず、受話入力信
号ｘ［ｎ−Ｄ］を周波数領域に変換して、受話入力信号の周波数スペクトルＸ_ＦＤＡＦ［
ｆ，ω］を算出し、残差信号ｅ［ｎ］を周波数領域に変換して、残差信号の周波数スペク
トルＥ_ＦＤＡＦ［ｆ，ω］を算出する。次に、周波数領域ダブルトーク検出部（ＦＤＤＴ
Ｄ）６１０ｉが受話信号の周波数スペクトルＸ_ＦＤＡＦ［ｆ，ω］と、１フレーム前の残
差信号の周波数スペクトルＥ_ＦＤＡＦ［ｆ−１，ω］とを用いて、周波数領域ダブルトー
ク検出処理を行う（ステップＳ６１０１）。そして、周波数領域適応フィルタ部（ＦＤＡ
ＤＦ）６１０ｅはダブルトーク情報ＥＣｓｔａｔｅ［ｆ，ω］の制御を受けながら、受話
信号の周波数スペクトルＸ_ＦＤＡＦ［ｆ，ω］と、１フレーム前の残差信号の周波数スペ
クトルＥ_ＦＤＡＦ［ｆ−１，ω］とを用いて周波数領域適応フィルタ処理を行う（ステッ
プＳ６１０２）。次に、擬似エコー信号の周波数スペクトルＹ’_ＦＤＡＦ［ｆ，ω］を周
波数領域逆変換して、擬似エコー信号ｙ’_ＦＤＡＦ［ｎ］を算出する。そして、信号減算
処理部６１０ｇは、送話入力信号ｚ［ｎ］から、周波数領域適応フィルタ部（ＦＤＡＤＦ
）６１０ｅから出力された擬似エコー信号ｙ’_ＦＤＡＦ［ｎ］を減算し、残差信号ｅ［ｎ
］を算出して出力、エコーキャンセラ処理が終了する。 The processing of the echo canceller (EC) 610 is performed as follows. First, the received input signal x [n−D] is converted into the frequency domain, and the frequency spectrum X _FDAF [
f, ω] is calculated, the residual signal e [n] is converted into the frequency domain, and the frequency spectrum E _FDAF [f, ω] of the residual signal is calculated. Next, the frequency domain double talk detector (FDDT)
D) 610i uses the frequency spectrum _XFDAF [f, ω] of the received signal and the frequency spectrum E _FDAF [f-1, ω] of the residual signal one frame before to perform frequency domain double talk detection processing. (Step S6101). The frequency domain adaptive filter unit (FDA
DF) 610e is double-talk information ECstate [f, ω] while receiving control of the frequency spectrum _{X FDAF} of the received signal [f, omega] a, the frequency spectrum of the previous frame of the residual signal _E FDAF [f-1, ω] is used to perform frequency domain adaptive filter processing (step S6102). Next, the frequency spectrum Y ′ _FDAF [f, ω] of the pseudo echo signal is inversely transformed in the frequency domain to calculate the pseudo echo signal y ′ _FDAF [n]. Then, the signal subtraction processing unit 610g generates a frequency domain adaptive filter unit (FDADF) from the transmission input signal z [n].
) The pseudo echo signal y ′ _FDAF [n] output from 610 e is subtracted and the residual signal e [n]
] Is calculated and the output and echo canceller processing ends.

図３０は、第６の実施形態に係るエコーリダクション部（ＥＲ）６１１における処理の
流れを示すフローチャートである。なお、図７を参照して説明した第１の実施形態に係る
エコーリダクション部（ＥＲ）１１１における動作と同じ動作ステップについては、同じ
符号を付してその部分の説明を省略する。 FIG. 30 is a flowchart showing the flow of processing in the echo reduction unit (ER) 611 according to the sixth embodiment. Note that the same operation steps as the operations in the echo reduction unit (ER) 111 according to the first embodiment described with reference to FIG. 7 are denoted by the same reference numerals, and description thereof is omitted.

第６の実施形態に係るエコーリダクション部（ＥＲ）６１１は、ステップＳ１２０３ｅ
の残差パワースペクトル算出処理の動作はない。 The echo reduction unit (ER) 611 according to the sixth embodiment performs step S1203e.
There is no operation of the residual power spectrum calculation process.

即ち、第６の実施形態に係るエコーリダクション部（ＥＲ）６１１は、エコーリダクシ
ョン処理を開始して、ステップＳ１２０３ｒの受話パワースペクトル算出処理、ステップ
Ｓ１２０３ｓの送話パワースペクトル算出処理、及びステップＳ１２０２ｅの残差周波数
変換処理の後、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊは、ダブルトーク状
態か否かの情報である周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］を算出して
出力し、音響結合量推定部（ＡＣＬＥ）６１１ｇは、スムージングされた受話パワースペ
クトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、スムージングされた送話パワースペクトル｜Ｚ_Ｓ［ｆ，
ω］｜^２と、エコー抑圧量推定部（ＥＣＬＥ）６１２から出力されたエコー抑圧量ＥＣＬ
［ｆ，ω］と、周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］とに基づいてスム
ージングした補正音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出する（ステップＳ６２０４）。 That is, the echo reduction unit (ER) 611 according to the sixth embodiment starts the echo reduction process, and performs the received power spectrum calculation process in step S1203r, the transmitted power spectrum calculation process in step S1203s, and the remaining steps in step S1202e. After the difference frequency conversion process, the frequency domain double talk detection unit (FDTD) 611j calculates and outputs frequency domain double talk information ERstate [f, ω], which is information indicating whether or not it is in a double talk state. The estimation unit (ACLE) 611g includes the smoothed reception power spectrum | X _S [f, ω] | ² and the smoothed transmission power spectrum | Z _S [f,
ω] | ² and the echo suppression amount ECL output from the echo suppression amount estimation unit (ECLE) 612
Based on [f, ω] and the frequency domain double talk information ERstate [f, ω], a corrected acoustic coupling amount | H _S [f, ω] | ² is calculated (step S6204).

その後ステップＳ１２０５でエコーレベル算出処理を行った後、制御部（ＣＴＲＬ）６
１１ｋは、エコー抑圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブルトーク情報ＥＲｓｔａｔ
ｅ［ｆ，ω］とを入力とし、周波数帯域ω毎にダブルトーク状態かつエコーキャンセラ部
（ＥＣ）１１０のエコー抑圧量が十分でない周波数帯域を検出して、制御情報ＥＲｃｏｎ
ｔｒｏｌ［ｆ，ω］を出力する（ステップＳ６２０７）。 After performing echo level calculation processing in step S1205, the control unit (CTRL) 6
11k is an echo suppression amount ECL [f, ω] and frequency domain double talk information ERstat.
e [f, ω] as input, a frequency band in which the double talk state and the echo suppression amount of the echo canceller (EC) 110 are not sufficient is detected for each frequency band ω, and the control information ERcon is detected.
trol [f, ω] is output (step S6207).

次に、エコー抑圧ゲイン算出部（ＧＣＡＬ）６１１ｎは、スムージングされた送話パワ
ースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、エコー量｜Ｙ_Ｓ［ｆ，ω］｜^２と、エコー抑圧量
ＥＣＬ［ｆ，ω］と、制御情報ＥＲｃｏｎｔｒｏｌ［ｆ，ω］と、パラメータγ［ω］と
を入力とし、エコー抑圧ゲインＧ［ｆ，ω］を算出して出力する（ステップＳ６２０８）
。 Next, the echo suppression gain calculation unit (GCAL) 611n includes the smoothed transmission power spectrum | Z _S [f, ω] | ² , the echo amount | Y _S [f, ω] | ^2, and the echo suppression amount. ECL [f, ω], control information ERcontrol [f, ω], and parameter γ [ω] are input, and echo suppression gain G [f, ω] is calculated and output (step S6208).
.

その後ステップＳ１２０９でスペクトル選択処理、ステップＳ１２１０で送話信号抑圧
処理を行った後、送話出力パワー算出部（ＰＯＷ）６１１ｕは、送話出力パワースペクト
ルを算出する（ステップＳ６２０３ｓｓ）。そして、ステップＳ１２１１の周波数逆変換
処理に移って、エコーリダクション処理が終了する。 Then, after performing spectrum selection processing in step S1209 and transmission signal suppression processing in step S1210, the transmission output power calculation unit (POW) 611u calculates a transmission output power spectrum (step S6203ss). Then, the process proceeds to the frequency inverse transform process in step S1211, and the echo reduction process ends.

上記では、エコーキャンセラ部（ＥＣ）６１０は、本実施形態ではオーバーラップ保存
法（Overlap-Save Method）、あるいはオーバーラップ加算法（Overlap-Add Method）に
基づいた周波数領域型適応フィルタの例を示しているが、巡回畳み込み法（Circular-Con
volution Method）に基づいた周波数領域型適応フィルタで構成してもよい。 In the above, the echo canceller unit (EC) 610 shows an example of a frequency domain adaptive filter based on the overlap-save method or the overlap-add method in this embodiment. However, the circular convolution method (Circular-Con
A frequency domain type adaptive filter based on the “volution method” may be used.

また上記では、エコーリダクション部（ＥＲ）６１１はＦＦＴによる周波数領域型で周
波数帯域毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域
をグループでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯
域分割フィルタなどの周波数領域型を実現してもよい。 In the above description, the echo reduction unit (ER) 611 is described as operating in a frequency domain type using FFT and processing for each frequency band. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

以上説明した信号処理装置の動作により、エコーキャンセラ部（ＥＣ）６１０の処理に
よるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（ＥＣＬＥ）６１２と制
御部（ＣＴＲＬ）６１１ｋで判定して、スペクトル選択部１１１Ｌとエコー抑圧ゲイン算
出部（ＧＣＡＬ）６１１ｎにおいて周波数帯域毎にエコーキャンセラ部（ＥＣ）６１０の
処理とエコーリダクション部（ＥＲ）６１１の処理の重みを変化させるように制御するこ
とができるので、エコーパスロス変動に対して頑健にすることが可能であり、高品質な信
号を出力することが可能である。 By the operation of the signal processing apparatus described above, the echo suppression amount estimation unit (ECLE) 612 and the control unit (CTRL) 611k determine the frequency region where the echo suppression amount due to the processing of the echo canceller unit (EC) 610 is not sufficient, The spectrum selection unit 111L and the echo suppression gain calculation unit (GCAL) 611n can be controlled to change the weights of the processing of the echo canceller unit (EC) 610 and the processing of the echo reduction unit (ER) 611 for each frequency band. Therefore, it is possible to make robust against echo path loss fluctuations and to output a high quality signal.

（第６の実施形態の変形例）
図３１は、本発明の第６の実施形態の変形例に係る信号処理装置のエコーリダクション
部（ＥＲ）６１１２の構成を示すブロック図である。この第６の実施形態に係るエコーリ
ダクション部（ＥＲ）６１１２が第６の実施形態に係るエコーリダクション部（ＥＲ）６
１１と異なる点は、音響結合量推定部（ＡＣＬＥ）６１１ｇを有さず、音響結合量推定部
（ＡＣＬＥ）６１１ｇ−２を有する点にあり、その他の部分は同じである。そこで、同じ
部分については、同じ符号を付してその説明を省略する。 (Modification of the sixth embodiment)
FIG. 31 is a block diagram showing a configuration of an echo reduction unit (ER) 6112 of a signal processing device according to a modification of the sixth embodiment of the present invention. The echo reduction unit (ER) 6112 according to the sixth embodiment is the echo reduction unit (ER) 6 according to the sixth embodiment.
11 is that the acoustic coupling amount estimation unit (ACLE) 611g is not included and the acoustic coupling amount estimation unit (ACLE) 611g-2 is included, and the other portions are the same. Therefore, the same parts are denoted by the same reference numerals and the description thereof is omitted.

図３２は、第６の実施形態の変形例に係る音響結合量推定部（ＡＣＬＥ）６１１ｇ−２
の構成を示すブロック図である。この音響結合量推定部（ＡＣＬＥ）６１１ｇ−２は、音
響結合量推定部（ＣＡＣＬ）６１１ｇ１−２と、音響結合量平滑部（ＳＭＡＣＬ）６１１
ｇ３−２と、音響結合量補正部（ＡＤＪ）６１１ｇ２−２とからなる。 FIG. 32 shows an acoustic coupling amount estimation unit (ACLE) 611g-2 according to a modification of the sixth embodiment.
It is a block diagram which shows the structure of these. The acoustic coupling amount estimation unit (ACLE) 611g-2 includes an acoustic coupling amount estimation unit (CACL) 611g1-2 and an acoustic coupling amount smoothing unit (SMACL) 611.
g3-2 and an acoustic coupling amount correction unit (ADJ) 611g2-2.

音響結合量推定部（ＣＡＣＬ）６１１ｇ１−２は、受話パワー算出部（ＰＯＷ）１１１
ｄから出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送
話パワー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペク
トル｜Ｚ_Ｓ［ｆ，ω］｜^２と、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊから
出力された周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］とを入力とし、エコー
キャンセラ部（ＥＣ）６１０の性能の影響を受けないように残差信号に基づく｜Ｅ_Ｓ［ｆ
，ω］｜^２を用いずに送話入力信号に基づく｜Ｚ_Ｓ［ｆ，ω］｜^２を用いて、周波数帯域
ω毎に音響結合量｜Ｈ［ｆ，ω］｜^２を以下に示す式４８のように算出して出力する。

The acoustic coupling amount estimation unit (CACL) 611g1-2 receives the received power calculation unit (POW) 111.
The smoothed reception power spectrum | X _S [f, ω] | ² output from d and the smoothed transmission power spectrum | Z _S [f, ω output from the transmission power calculation unit (POW) 111e. ] ² and the frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detection unit (FDTD) 611j, and the performance of the echo canceller unit (EC) 610 is not affected. Based on the residual signal | E _S [f
, Ω] | ^2, and using | Z _S [f, ω] | ² based on the transmission input signal, the acoustic coupling amount | H [f, ω] | ² is shown below for each frequency band ω. It calculates and outputs like Formula 48.

ただし、周波数領域ダブルトーク検出部（ＦＤＴＤ）６１１ｊから出力された周波数領
域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］によりダブルトーク状態であると判定され
たことが示されている場合や、音響結合量が急激に変化する場合、即ち、不等式｜Ｈ［ｆ
，ω］｜^２＞β_Ｈ［ω］・｜Ｈ［ｆ−１，ω］｜^２が成り立つ場合、及び受話信号が十分
に大きくない場合、即ち、不等式｜Ｘ_Ｓ［ｆ，ω］｜^２＜β_Ｘ［ω］が成り立つ場合には
、エコーパス変動への高速な追従性を保ちつつ、ダブルトークとなる周波数帯域での音響
結合量の算出を行わないようにするため、音響結合量推定部（ＣＡＣＬ）６１１ｇ１−２
は、音響結合量を更新しないで１フレーム前の過去の音響結合量｜Ｈ［ｆ−１，ω］｜^２
を用いる。 However, when the frequency domain double talk information ERstate [f, ω] output from the frequency domain double talk detection unit (FDTD) 611j indicates that the double talk state is determined, or the amount of acoustic coupling is If it changes rapidly, that is, the inequality | H [f
, Ω] | ² > β _H [ω] · | H [f−1, ω] | ² and when the received signal is not sufficiently large, that is, the inequality | X _S [f, ω] | ² When <β _X [ω] is satisfied, an acoustic coupling amount estimation unit is provided so as not to calculate the acoustic coupling amount in the frequency band that causes double talk while maintaining high-speed follow-up to echo path fluctuations. (CACL) 611g1-2
Does not update the acoustic coupling amount, and the past acoustic coupling amount one frame before | H [f−1, ω] | ²
Is used.

音響結合量平滑部（ＳＭＡＣＬ）６１１ｇ３−２は、音響結合量推定部（ＣＡＣＬ）６
１１ｇ１−２から出力された音響結合量｜Ｈ［ｆ，ω］｜^２と、エコー抑圧量推定部（Ｅ
ＣＬＥ）６１２から出力されたエコー抑圧量ＥＣＬ［ｆ，ω］とを入力とし、以下に示す
式４９のように１フレーム前の値を用いてスムージングした音響結合量｜Ｈ_Ｓ’［ｆ，ω
］｜^２を算出して出力する。

The acoustic coupling amount smoothing unit (SMACL) 611 g 3-2 is an acoustic coupling amount estimation unit (CACL) 6.
11g1-2, the acoustic coupling amount | H [f, ω] | ² and the echo suppression amount estimation unit (E
CLE) echo suppression amount output from 612 ECL [f, ω] as input and the acoustic coupling amount was smoothed by using the value of the previous frame as in Equation 49 below | H _S '[f, ω
] | ² is calculated and output.

ただし、α_Ｈ［ω］は、０．０３〜０．９９程度が望ましい。このとき、エコーキャン
セラ部（ＥＣ）６１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］に基づいて、周波数帯
域ω毎にα_Ｈ［ω］を可変にする。つまり、エコー抑圧量ＥＣＬ［ｆ，ω］が十分ある程
、α_Ｈ［ω］を０に近づけ、エコー抑圧量ＥＣＬ［ｆ，ω］が十分ではない程α_Ｈ［ω］
を１に近づける。 However, α _H [ω] is preferably about 0.03 to 0.99. At this time, α _H [ω] is made variable for each frequency band ω based on the echo suppression amount ECL [f, ω] suppressed by the echo canceller unit (EC) 610. That, as the echo suppression amount ECL [f, ω] is enough, close the alpha _H [omega] 0, as the echo suppression amount ECL [f, omega] is not sufficient alpha _H [omega]
Is brought close to 1.

音響結合量補正部（ＡＤＪ）６１１ｇ２−２は、音響結合量平滑部（ＳＭＡＣＬ）６１
１ｇ３−２から出力されたスムージングした音響結合量｜Ｈ_Ｓ’［ｆ，ω］｜^２と、エコ
ー抑圧量推定部（ＥＣＬＥ）６１２から出力されたエコー抑圧量ＥＣＬ［ｆ，ω］とを入
力とし、補正音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出して出力する。 The acoustic coupling amount correction unit (ADJ) 611g2-2 is connected to an acoustic coupling amount smoothing unit (SMACL) 61.
The smoothed acoustic coupling amount | H _S ′ [f, ω] | ² output from 1g3-2 and the echo suppression amount ECL [f, ω] output from the echo suppression amount estimation unit (ECLE) 612 are input. The corrected acoustic coupling amount | H _S [f, ω] | ² is calculated and output.

具体的には、音響結合量補正部（ＡＤＪ）６１１ｇ２−２は、まず周波数帯域毎にエコ
ーキャンセラ部（ＥＣ）６１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］が十分である
か否かを判定する。そして、エコーキャンセラ部（ＥＣ）６１０で抑圧されたエコー抑圧
量ＥＣＬ［ｆ，ω］が十分であると判定された周波数帯域では、０＜β_ＨＡＣＬ［ω］≦
１なる係数β_ＨＡＣＬ［ω］を用いて以下に示す式５０のようにスムージングされた音響
結合量｜Ｈ_Ｓ’［ｆ，ω］｜^２を補正し、補正音響結合量｜Ｈ_Ｓ［ｆ，ω］｜^２を算出し
て出力する。

Specifically, the acoustic coupling amount correction unit (ADJ) 611g2-2 first determines whether or not the echo suppression amount ECL [f, ω] suppressed by the echo canceller unit (EC) 610 for each frequency band is sufficient. Determine. Then, in a frequency band in which the echo suppression amount ECL [f, ω] suppressed by the echo canceller unit (EC) 610 is determined to be sufficient, 0 <β _HACL [ω] ≦
The smoothed acoustic coupling amount | H _S ′ [f, ω] | ² using the coefficient β _HACL [ω] of 1 is _corrected as shown in Equation 50 below, and the corrected acoustic coupling amount | H _S [f, ω] | ² is calculated and output.

一方、エコーキャンセラ部（ＥＣ）６１０のエコー抑圧量ＥＣＬ［ｆ，ω］が十分でな
いと判定された周波数帯域では、音響結合量補正部（ＡＤＪ）６１１ｇ２−２は、｜Ｈ_Ｓ
［ｆ，ω］｜^２＝｜Ｈ_Ｓ’［ｆ，ω］｜^２としてスムージングされた音響結合量｜Ｈ_s’
［ｆ，ω］｜^２の補正を行わない。このようにすることで、エコーパス変動がそれほど起
きない場合に送話音質が劣化することを防げる。 On the other hand, the echo suppression amount ECL [f, ω] of the echo canceller unit (EC) 610 is the determined frequency bands not sufficient, the acoustic coupling amount correction unit (ADJ) 611g2-2 is, | _{H S}
^{[F, ω] | 2 =} | H S '[f, ω] | smoothed acoustic coupling amount as ² | H _s'
[F, ω] | ² is not corrected. By doing so, it is possible to prevent the transmission sound quality from deteriorating when the echo path fluctuation does not occur so much.

（第７の実施形態）
図３３は、第７の実施形態に係る信号処理装置の構成を示すブロック図である。この信
号処理装置が第６の実施形態に係る信号処理装置と異なる点は、図３３に示すようにエコ
ーキャンセラ部（ＥＣ）６１０とエコーリダクション部（ＥＲ）６１１とエコー抑圧量推
定部（ＥＣＬＥ）６１２を有さず、エコーキャンセラ部（ＥＣ）７１０とエコーリダクシ
ョン部（ＥＲ）７１１とエコー抑圧量推定部（ＥＣＬＥ）７１２とを有する点にあり、そ
の他の部分は同じである。そこで、第６の実施形態に係る信号処理装置と同じ部分につい
ては、同じ符号を付してその説明を省略する。 (Seventh embodiment)
FIG. 33 is a block diagram illustrating a configuration of a signal processing device according to the seventh embodiment. This signal processing device is different from the signal processing device according to the sixth embodiment in that an echo canceller (EC) 610, an echo reduction (ER) 611, and an echo suppression amount estimator (ECLE) as shown in FIG. 612, but has an echo canceller unit (EC) 710, an echo reduction unit (ER) 711, and an echo suppression amount estimation unit (ECLE) 712, and other parts are the same. Therefore, the same parts as those of the signal processing apparatus according to the sixth embodiment are denoted by the same reference numerals, and the description thereof is omitted.

また、第７の実施形態に係るエコーリダクション部（ＥＲ）７１１で、第１の実施形態
に係るエコーリダクション部（ＥＲ）１１１及び第６の実施形態に係るエコーリダクショ
ン部（ＥＲ）６１１と同じ部分については、同じ符号を付してその説明を省略する。 Further, the echo reduction unit (ER) 711 according to the seventh embodiment is the same as the echo reduction unit (ER) 111 according to the first embodiment and the echo reduction unit (ER) 611 according to the sixth embodiment. Are denoted by the same reference numerals, and the description thereof is omitted.

図３４は、エコーキャンセラ部（ＥＣ）７１０の構成を示すブロック図である。このエ
コーキャンセラ部（ＥＣ）７１０は、時間領域エコーキャンセラ部（ＴＤＡＦ）と周波数
領域型エコーキャンセラ部（ＦＤＡＦ）からなり、これらを直列接続したものである。す
なわち、時間領域エコーキャンセラ部（ＴＤＡＦ）での処理後の信号を用いて周波数領域
型エコーキャンセラ部（ＦＤＡＦ）の処理を行う。 FIG. 34 is a block diagram showing the configuration of the echo canceller unit (EC) 710. The echo canceller unit (EC) 710 includes a time domain echo canceller unit (TDAF) and a frequency domain echo canceller unit (FDAF), which are connected in series. That is, processing of the frequency domain echo canceller (FDAF) is performed using the signal after processing in the time domain echo canceller (TDAF).

時間領域エコーキャンセラ部（ＴＤＡＦ）は、第１の実施形態に係るエコーキャンセラ
部（ＥＣ）１１０と同じであり、周波数領域型エコーキャンセラ部（ＦＤＡＦ）は、第６
の実施形態に係るエコーキャンセラ部（ＥＣ）６１０と同じである。 The time domain echo canceller (TDAF) is the same as the echo canceller (EC) 110 according to the first embodiment, and the frequency domain echo canceller (FDAF)
This is the same as the echo canceller unit (EC) 610 according to the embodiment.

そして、遅延処理部（ＤＥＬＡＹ）１０２から出力された遅延した受話入力信号ｘ［ｎ
−Ｄ］を、エコーキャンセラ部（ＥＣ）１１０及びエコーキャンセラ部（ＥＣ）６１０に
入力させる。更に、エコーキャンセラ部（ＥＣ）１１０によって出力された残差信号ｅ［
ｎ］を、エコーキャンセラ部（ＥＣ）６１０に送話入力信号ｚ［ｎ］として入力させて、
直列接続する。この場合、エコーキャンセラ部（ＥＣ）７１０は、エコーキャンセラ部（
ＥＣ）１１０の内部状態とエコーキャンセラ部（ＥＣ）６１０の内部状態を併せて、内部
状態としてメモリに保持している。 The delayed received input signal x [n output from the delay processing unit (DELAY) 102
-D] is input to the echo canceller unit (EC) 110 and the echo canceller unit (EC) 610. Further, the residual signal e [
n] is input to the echo canceller (EC) 610 as a transmission input signal z [n],
Connect in series. In this case, the echo canceller (EC) 710 is connected to the echo canceller (
The internal state of the EC) 110 and the internal state of the echo canceller unit (EC) 610 are stored in the memory as the internal state.

そこで、第１の実施形態に係るエコーキャンセラ部（ＥＣ）１１０及び第６の実施形態
に係るエコーキャンセラ部（ＥＣ）６１０と同じ符号を付して、エコーキャンセラ部（Ｅ
Ｃ）７１０の説明を省略する。 Therefore, the echo canceller unit (EC) 110 according to the first embodiment and the echo canceller unit (EC) 610 according to the sixth embodiment are denoted by the same reference numerals, and the echo canceller unit (E
C) Description of 710 is omitted.

図３５は、エコーリダクション部（ＥＲ）７１１の構成を示すブロック図である。この
エコーリダクション部（ＥＲ）７１１は、周波数領域変換処理部（ＦＴ）１１１ａと、周
波数領域変換処理部（ＦＴ）１１１ｂと、周波数領域変換処理部（ＦＴ）１１１ｃと、受
話パワー算出部（ＰＯＷ）１１１ｄと、送話パワー算出部（ＰＯＷ）１１１ｅと、残差パ
ワー算出部（ＰＯＷ）１１１ｆと、エコー量推定部（ＥＬＥ）７１１ｈと、周波数領域ダ
ブルトーク検出部（ＦＤＴＤ）７１１ｊと、制御部（ＣＴＲＬ）６１１ｋと、スペクトル
選択部１１１Ｌと、ゲイン格納部（ＧＴＢＬ）１１１ｍと、エコー抑圧ゲイン算出部（Ｇ
ＣＡＬ）６１１ｎと、信号抑圧部（ＳＳ）１１１ｏと、周波数領域逆変換処理部（ＩＦＴ
）１１１ｐとからなる。 FIG. 35 is a block diagram showing the configuration of the echo reduction unit (ER) 711. As shown in FIG. The echo reduction unit (ER) 711 includes a frequency domain transformation processing unit (FT) 111a, a frequency domain transformation processing unit (FT) 111b, a frequency domain transformation processing unit (FT) 111c, and a received power calculation unit (POW). 111d, transmission power calculation unit (POW) 111e, residual power calculation unit (POW) 111f, echo amount estimation unit (ELE) 711h, frequency domain double talk detection unit (FDTD) 711j, control unit ( CTRL) 611k, spectrum selection unit 111L, gain storage unit (GTBL) 111m, echo suppression gain calculation unit (G
CAL) 611n, signal suppressor (SS) 111o, frequency domain inverse transform processor (IFT)
) 111p.

図３６は、エコー量推定部（ＥＬＥ）７１１ｈの詳細な構成を示すブロック図である。
このエコー量推定部（ＥＬＥ）７１１ｈは、信号減算処理部７１１ｈ１、エコー量推定制
御部（ＥＬＥＣＴＲＬ）７１１ｈ２、可変ゲイン７１１ｈ３、７１１ｈ４、信号加算処理
部７１１ｈ５からなる。 FIG. 36 is a block diagram showing a detailed configuration of the echo amount estimation unit (ELE) 711h.
The echo amount estimation unit (ELE) 711h includes a signal subtraction processing unit 711h1, an echo amount estimation control unit (ELECTRL) 711h2, variable gains 711h3 and 711h4, and a signal addition processing unit 711h5.

上記のように構成された、本発明の第７の実施形態に係る信号処理装置の各部の動作を
、図３３〜図３６を参照して説明する。 The operation of each part of the signal processing device according to the seventh embodiment of the present invention configured as described above will be described with reference to FIGS.

まず、エコー抑圧量推定部（ＥＣＬＥ）７１２は、ハイパスフィルタ部（ＨＰＦ）１０
９から出力された送話入力信号ｚ［ｎ］と、エコーキャンセラ部（ＥＣ）７１０の信号減
算処理部６１０ｇから出力された時間領域の残差信号ｅ［ｎ］を入力とし、エコーキャン
セラ部（ＥＣ）６１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］を周波数帯域ω毎に推
定して出力する。 First, an echo suppression amount estimation unit (ECLE) 712 includes a high-pass filter unit (HPF) 10.
9 and the time domain residual signal e [n] output from the signal subtraction processing unit 610g of the echo canceller unit (EC) 710 as inputs, and the echo canceller unit ( EC) The echo suppression amount ECL [f, ω] suppressed in 610 is estimated for each frequency band ω and output.

具体的には、エコー抑圧量推定部（ＥＣＬＥ）７１２は、まず送話入力信号ｚ［ｎ］を
窓掛け及びオーバーラップして、ＦＦＴによって周波数領域に変換し、送話パワースペク
トル｜Ｚ’［ｆ，ω］｜^２を算出する。次に、残差信号ｅ［ｎ］を窓掛け及びオーバーラ
ップして、ＦＦＴによって周波数領域に変換し、残差パワースペクトル｜Ｅ’［ｆ，ω］
｜^２を算出する。そして、以下に示す式５１のように送話パワースペクトル｜Ｚ’［ｆ，
ω］｜^２と残差パワースペクトル｜Ｅ’［ｆ，ω］｜^２の比をエコーキャンセラ部（ＥＣ
）７１０で抑圧されたエコー抑圧量ＥＣＬ［ｆ，ω］として算出して出力する。

Specifically, the echo suppression amount estimation unit (ECLE) 712 first windows and overlaps the transmission input signal z [n], converts it to the frequency domain by FFT, and transmits the transmission power spectrum | Z ′ [ f, ω] | ² is calculated. Next, the residual signal e [n] is windowed and overlapped and transformed into the frequency domain by FFT, and the residual power spectrum | E ′ [f, ω]
| ² is calculated. Then, the transmission power spectrum | Z ′ [f,
ω] | ² and the residual power spectrum | E ′ [f, ω] | ²
) Calculate and output as the echo suppression amount ECL [f, ω] suppressed in 710.

次に、図３５及び図３６を参照して第７の実施形態に係るエコーリダクション部（ＥＲ
）７１１の動作を説明する。 Next, referring to FIG. 35 and FIG. 36, the echo reduction unit (ER
) The operation of 711 will be described.

エコーリダクション部（ＥＲ）７１１は、受話信号ｘ［ｎ］及び送話入力信号ｚ［ｎ］
及び残差信号ｅ［ｎ］に基づいてエコー成分を抑圧するものであって、そのエコー抑圧後
の信号を送話出力信号ｓ’［ｎ］（ｎ＝０，１，・・・，Ｎ−１）として出力する。 The echo reduction unit (ER) 711 receives the reception signal x [n] and the transmission input signal z [n].
And the echo component is suppressed based on the residual signal e [n], and the signal after the echo suppression is transmitted as the transmission output signal s ′ [n] (n = 0, 1,..., N− Output as 1).

エコー量推定部（ＥＬＥ）７１１ｈは、受話パワー算出部（ＰＯＷ）１１１ｄから出力
されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、送話パワー算
出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワースペクトル｜Ｚ_Ｓ
［ｆ，ω］｜^２と、残差パワー算出部（ＰＯＷ）１１１ｆから出力されたスムージングさ
れた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２と、エコー抑圧量推定部（ＥＣＬＥ）７
１２から出力されたエコー抑圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブルトーク検出部（
ＦＤＴＤ）７１１ｊから出力された周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω
］とを入力とし、スムージングしたエコー量｜Ｙ_Ｓ［ｆ，ω］｜^２を算出して出力する。 The echo amount estimation unit (ELE) 711h receives the smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d and the transmission power calculation unit (POW) 111e. Output smoothed transmission power spectrum | Z _S
[F, ω] | ² , the smoothed residual power spectrum output from the residual power calculation unit (POW) 111 f | E _S [f, ω] | ^2, and the echo suppression amount estimation unit (ECLE) 7
12 and the echo suppression amount ECL [f, ω] output from the frequency domain double talk detector (
FDTD) frequency domain double talk information ERstate [f, ω output from 711j
] Is input and a smoothed echo amount | Y _S [f, ω] | ² is calculated and output.

信号減算処理部７１１ｈ１は、送話パワー算出部（ＰＯＷ）１１１ｅから出力された送
話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、残差パワー算出部（ＰＯＷ）１１１ｆから
出力された残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２とを入力とし、送話パワースペク
トル｜Ｚ_Ｓ［ｆ，ω］｜^２から残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２をフレームf
及び周波数帯域ω毎に減算して、減算した信号｜Ｚ_Ｓ［ｆ，ω］｜^２−｜Ｅ_Ｓ［ｆ，ω］
｜^２を出力する。 The signal subtraction processing unit 711h1 transmits the transmission power spectrum | Z _S [f, ω] | ² output from the transmission power calculation unit (POW) 111e and the residual output from the residual power calculation unit (POW) 111f. differential power spectrum _{| E S [f, ω]} | ² and the input, sending power spectrum _{| Z S [f, ω]} | 2 residual power spectrum from _{| E S [f, ω]} | ² frames f
And the subtracted signal | Z _S [f, ω] | ² − | E _S [f, ω]
| ² is output.

エコー量推定制御部（ＥＬＥＣＴＲＬ）７１１ｈ２は、受話パワー算出部（ＰＯＷ）１
１１ｄから出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と
、送話パワー算出部（ＰＯＷ）１１１ｅから出力されたスムージングされた送話パワース
ペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、残差パワー算出部（ＰＯＷ）１１１ｆから出力された
スムージングされた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２と、エコー抑圧量推定部
（ＥＣＬＥ）７１２から出力されたエコー抑圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブル
トーク検出部（ＦＤＴＤ）７１１ｊから出力された周波数領域ダブルトーク情報ＥＲｓｔ
ａｔｅ［ｆ，ω］とを入力とし、２つの可変ゲインＧ_１［ｆ，ω］及びＧ_２［ｆ，ω］を
算出して出力する。 The echo amount estimation control unit (ELECTRL) 711h2 is a received power calculation unit (POW) 1
The smoothed reception power spectrum | X _S [f, ω] | ² output from 11d and the smoothed transmission power spectrum | Z _S [f, ω output from the transmission power calculation unit (POW) 111e. ] | ² , the smoothed residual power spectrum output from the residual power calculation unit (POW) 111 f | E _S [f, ω] | ^2, and output from the echo suppression amount estimation unit (ECLE) 712 Echo suppression amount ECL [f, ω] and frequency domain double talk information ERst output from frequency domain double talk detector (FDTD) 711j
ate [f, ω] is input, and two variable gains G ₁ [f, ω] and G ₂ [f, ω] are calculated and output.

具体的には、まず周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］がダブルトー
ク状態であると判定されたことを示している場合を除いたシングルトーク状態時に、エコ
ー量推定制御部（ＥＬＥＣＴＲＬ）７１１ｈ２は、エコー量｜Ｙ_Ｓ［ｆ，ω］｜^２を精度
よく推定するために以下に示す式５２−１、式５２−２、式５２−３のようにＧ_ＨＺ、Ｇ
_ＨＥ、Ｇ_ＨＺＥの３つのゲインをそれぞれ算出する。

Specifically, first, the echo amount estimation control unit (ELECTRL) 711h2 in the single talk state except when the frequency domain double talk information ERstate [f, ω] indicates that it is determined that the state is the double talk state. In order to estimate the amount of echo | Y _S [f, ω] | ² with high accuracy, G _HZ , G as shown in Equations 52-1, 52-2, and 52-3 shown below.
Three gains _HE and _GHZE are calculated.

次に、エコー量推定制御部（ＥＬＥＣＴＲＬ）７１１ｈ２は、エコー抑圧量ＥＣＬ［ｆ
，ω］を引数とし値域が０以上１以下である３つの関数Ｆ_ＨＺ（ＥＣＬ［ｆ，ω］）、Ｆ
_ＨＥ（ＥＣＬ［ｆ，ω］）、及びＦ_ＨＺＥ（ＥＣＬ［ｆ，ω］）を以下のように制御する
。エコー抑圧量ＥＣＬ［ｆ，ω］が十分ではない場合、残差パワースペクトル｜Ｅ_Ｓ［ｆ
，ω］｜^２の影響を少なくするために、Ｆ_ＨＥ（ＥＣＬ［ｆ，ω］）及びＦ_ＨＺＥ（ＥＣ
Ｌ［ｆ，ω］）を０に近づけるかあるいは小さい値に設定し、Ｆ_ＨＺ（ＥＣＬ［ｆ，ω］
）を１に近づける。 Next, the echo amount estimation control unit (ELECTRL) 711h2 selects the echo suppression amount ECL [f
, Ω] as an argument, and three functions F _HZ (ECL [f, ω]), F having a range of 0 to 1
_HE (ECL [f, ω]) and F _HZE (ECL [f, ω]) are controlled as follows. If the echo suppression amount ECL [f, ω] is not sufficient, the residual power spectrum | E _S [f
, Ω] | ² to reduce the influence of F _HE (ECL [f, ω]) and F _HZE (EC
L [f, ω]) is set to a value close to 0 or small, and F _HZ (ECL [f, ω]) is set.
) Is brought close to 1.

一方、エコー抑圧量ＥＣＬ［ｆ，ω］が十分である場合、残差パワースペクトル｜Ｅ_Ｓ
［ｆ，ω］｜^２を信頼して用いるように、Ｆ_ＨＺ（ＥＣＬ［ｆ，ω］）を０に近づけるか
あるいは小さい値に設定し、Ｆ_ＨＥ（ＥＣＬ［ｆ，ω］）及びＦ_ＨＺＥ（ＥＣＬ［ｆ，ω
］）を１に近づける。このとき、エコー抑圧量ＥＣＬ［ｆ，ω］が十分であるか否かの判
定条件として、制御部（ＣＴＲＬ）６１１ｋと同様の判定条件を用いてもよい。そして、
２つの可変ゲインＧ_１［ｆ，ω］及びＧ_２［ｆ，ω］を、それぞれ以下に示す式５３−１
、式５３−２のように算出する。

On the other hand, if the echo suppression amount ECL [f, ω] is sufficient, the residual power spectrum | _{E S}
F _HZ (ECL [f, ω]) is set to a value close to 0 or small so that [f, ω] | ² is used reliably, and F _HE (ECL [f, ω]) and F _HZE (ECL [f, ω
]) Close to 1. At this time, a determination condition similar to that of the control unit (CTRL) 611k may be used as a determination condition as to whether or not the echo suppression amount ECL [f, ω] is sufficient. And
Two variable gains G ₁ [f, ω] and G ₂ [f, ω] are expressed by the following equations 53-1:
, As shown in Equation 53-2.

可変ゲイン７１１ｈ３は、受話パワー算出部（ＰＯＷ）１１１ｄから出力されたスムー
ジングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２と、エコー量推定制御部（ＥＬ
ＥＣＴＲＬ）７１１ｈ２から出力された可変ゲインＧ_１［ｆ，ω］とを入力とし、｜Ｘ_Ｓ
［ｆ，ω］｜^２にＧ_１［ｆ，ω］を乗じて、受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２
を増幅あるいは減衰させて出力する。 The variable gain 711h3 includes a smoothed reception power spectrum | X _S [f, ω] | ² output from the reception power calculation unit (POW) 111d, and an echo amount estimation control unit (EL
ECTh) 711h2 and variable gain G ₁ [f, ω] as input, | X _S
[F, ω] | ² is multiplied by G ₁ [f, ω], and the received power spectrum | X _S [f, ω] | ²
Is amplified or attenuated and output.

可変ゲイン７１１ｈ４は、信号減算処理部７１１ｈ１から出力された信号｜Ｚ_Ｓ［ｆ，
ω］｜^２−｜Ｅ_Ｓ［ｆ，ω］｜^２と、エコー量推定制御部（ＥＬＥＣＴＲＬ）７１１ｈ２
から出力された可変ゲインＧ_２［ｆ，ω］とを入力とし、｜Ｚ_Ｓ［ｆ，ω］｜^２−｜Ｅ_Ｓ
［ｆ，ω］｜^２にＧ_２［ｆ，ω］を乗じて、増幅あるいは減衰させて出力する。 The variable gain 711h4 is the signal | Z _S [f, output from the signal subtraction processing unit 711h1.
ω] | ² − | E _S [f, ω] | ² and an echo amount estimation control unit (ELECTRL) 711h2.
And variable gain _G 2 [f, _ω] and an input that is output _{from, | Z S [f, ω} ] | 2 - | E S
Multiply [f, ω] | ² by G ₂ [f, ω], and output after amplification or attenuation.

信号加算処理部７１１ｈ５は、可変ゲイン７１１ｈ３から出力されたＧ_１［ｆ，ω］を
乗じた｜Ｘ_Ｓ［ｆ，ω］｜^２と、可変ゲイン７１１ｈ４から出力されたＧ_２［ｆ，ω］を
乗じた｜Ｚ_Ｓ［ｆ，ω］｜^２−｜Ｅ_Ｓ［ｆ，ω］｜^２とを入力とし、スムージングされた
エコー量｜Ｙ_Ｓ［ｆ，ω］｜^２を算出して出力する。具体的には、以下に示す式５４−１
のようにフレームf及び周波数帯域ω毎に、Ｇ_１［ｆ，ω］を乗じた｜Ｘ_Ｓ［ｆ，ω］｜
^２とＧ_２［ｆ，ω］を乗じた｜Ｚ_Ｓ［ｆ，ω］｜^２−｜Ｅ_Ｓ［ｆ，ω］｜^２を加算してエ
コー量｜Ｙ［ｆ，ω］｜^２を算出する。 Signal addition processing unit 711h5 is, _G 1 output from the variable gain 711h3 [f, ω] multiplied by _{| X S [f, ω]} | a ^2, _G 2 outputted from the variable gain 711h4 [f, ω] Is inputted as | Z _S [f, ω] | ² − | E _S [f, ω] | ² as an input, and a smoothed echo amount | Y _S [f, ω] | ² is calculated and output. . Specifically, the following formula 54-1
| X _S [f, ω] | multiplied by G ₁ [f, ω] for each frame f and frequency band ω
² multiplied by G ₂ [f, ω] | Z _S [f, ω] | ² − | E _S [f, ω] | ² is added to calculate the echo amount | Y [f, ω] | ² To do.

そして以下に示す式５４−２のように、エコー量｜Ｙ［ｆ，ω］｜^２をスムージングし
たエコー量｜Ｙ_s［ｆ，ω］｜^２を算出して出力する。ただし、αＹ［ω］は０．７〜０
．９９程度が望ましい。勿論、スムージングしないエコー量｜Ｙ［ｆ，ω］｜^２を出力し
てもよい。

And as shown in Equation 54-2 below, the echo amount ^{| Y [f, ω] |} 2 smoothed echo amount _{| Y s [f, ω]} | 2 and outputs them. However, αY [ω] is 0.7-0.
. About 99 is desirable. Of course, the echo amount | Y [f, ω] | ² that is not smoothed may be output.

周波数領域ダブルトーク検出部（ＦＤＴＤ）７１１ｊは、受話パワー算出部（ＰＯＷ）
１１１ｄから出力されたスムージングされた受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜^２
と、エコー量推定部（ＥＬＥ）７１１ｈから出力された１フレーム前のスムージングした
エコー量｜Ｙ_Ｓ［ｆ−１，ω］｜^２と、残差パワー算出部（ＰＯＷ）１１１ｆから出力さ
れたスムージングされた残差パワースペクトル｜Ｅ_Ｓ［ｆ，ω］｜^２とを入力とし、ダブ
ルトーク状態か否かを示す情報である周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，
ω］を算出して出力する。 The frequency domain double talk detector (FDTD) 711j is a received power calculator (POW).
Smoothed received power spectrum output from 111d | X _S [f, ω] | ²
, The smoothed echo amount | Y _S [f−1, ω] | ² output from the echo amount estimation unit (ELE) 711h and the smoothing output from the residual power calculation unit (POW) 111f residual power spectrum _{| E S [f, ω]} | ² and the input is information indicating whether or not a double-talk state frequency domain double talk information ERstate [f,
ω] is calculated and output.

具体的には、周波数領域ダブルトーク検出部（ＦＤＴＤ）７１１ｊは、周波数帯域ω毎
に以下のように閾値よりも大きい、即ち、以下に示す不等式５５が成り立てば、ダブルト
ーク状態と判定する。ただし、β_Ｙ［ω］は１．０〜２０程度が望ましい。

Specifically, the frequency domain double-talk detector (FDTD) 711j determines that the state is a double-talk state if the frequency band ω is larger than the threshold as follows, that is, if the inequality 55 shown below holds. However, β _Y [ω] is preferably about 1.0 to 20.

次に、上記のように構成された第７の実施形態に係る信号処理装置の処理の流れを説明
する。図３７は、第７の実施形態に係る信号処理装置の全体の処理の流れを示すフローチ
ャートである。なお、図５を参照して説明した第１の実施形態に係る信号処理装置の動作
と同じ動作ステップについては、同じ符号を付してその部分の説明を省略する。 Next, a processing flow of the signal processing device according to the seventh embodiment configured as described above will be described. FIG. 37 is a flowchart showing an overall processing flow of the signal processing apparatus according to the seventh embodiment. Note that the same operation steps as those of the signal processing apparatus according to the first embodiment described with reference to FIG. 5 are denoted by the same reference numerals, and description thereof is omitted.

ステップＳ１００４の遅延処理の後、遅延した受話入力信号ｘ［ｎ−Ｄ］とオフセット
除去された送話入力信号ｚ［ｎ］を入力として、エコーキャンセラ部（ＥＣ）７１０はエ
コーキャンセラ処理を行う（ステップＳ７００５）。 After the delay processing in step S1004, the echo canceller unit (EC) 710 performs echo canceller processing using the delayed reception input signal x [n-D] and the transmission input signal z [n] from which the offset is removed as input (FIG. Step S7005).

次に、エコー抑圧量推定部（ＥＣＬＥ）７１２は、ハイパスフィルタ部（ＨＰＦ）１０
９から出力された送話入力信号ｚ［ｎ］と信号減算処理部６１０ｇから出力された時間領
域の残差信号ｅ［ｎ］を入力とし、エコーキャンセラ部（ＥＣ）７１０で抑圧されたエコ
ー抑圧量ＥＣＬ［ｆ，ω］を推定する（ステップＳ７００８）。 Next, an echo suppression amount estimation unit (ECLE) 712 includes a high-pass filter unit (HPF) 10.
Echo suppression suppressed by an echo canceller (EC) 710 using the transmission input signal z [n] output from 9 and the time domain residual signal e [n] output from the signal subtraction processor 610g as inputs. The quantity ECL [f, ω] is estimated (step S7008).

そして、遅延した受話入力信号ｘ［ｎ−Ｄ］と、オフセット除去された送話入力信号ｚ
［ｎ］と、エコーキャンセラ部（ＥＣ）７１０から出力されたエラーキャンセラ処理後の
信号である残差信号ｅ［ｎ］を入力として、エコーリダクション部（ＥＲ）７１１は非線
形エコー抑圧処理であるエコーリダクション処理を行う（ステップＳ７００６）。そして
、ステップＳ１００７の終話か否かの判断に移る。 Then, the delayed reception input signal x [n−D] and the transmission input signal z with the offset removed.
[N] and the residual signal e [n], which is the signal after error canceller processing output from the echo canceller unit (EC) 710, are input, and the echo reduction unit (ER) 711 is an echo that is nonlinear echo suppression processing. Reduction processing is performed (step S7006). Then, the process proceeds to step S1007 to determine whether or not the call is an end story.

図３８は、第７の実施形態に係るエコーキャンセラ部（ＥＣ）７１０における処理の流
れを示すフローチャートである。なお、図６を参照して説明した第１の実施形態に係るエ
コーキャンセラ部（ＥＣ）１１０の動作と同じ動作ステップについては、同じ符号を付し
てその部分の説明を省略する。 FIG. 38 is a flowchart showing the flow of processing in the echo canceller (EC) 710 according to the seventh embodiment. Note that the same operation steps as the operation of the echo canceller unit (EC) 110 according to the first embodiment described with reference to FIG. 6 are denoted by the same reference numerals, and description thereof is omitted.

エコーキャンセラ部（ＥＣ）７１０の処理は、まず、ステップＳ１１０１でダブルトー
ク検出処理を行った後、ステップＳ１１０２で適応フィルタ処理を行い、信号減算処理部
１１０ｂは、送話入力信号ｚ［ｎ］から、適応フィルタ部（ＡＤＦ）１１０ａから出力さ
れた擬似エコー信号ｙ’［ｎ］を減算し、残差信号ｅ_TDAF［ｎ］を算出する。次に、周波
数領域ダブルトーク検出処理を行い（ステップＳ６１０１）、周波数領域適応フィルタ処
理を行い、信号減算処理部６１０ｇは、残差信号ｅ_TDAF［ｎ］から、周波数領域適応フィ
ルタ部（ＦＤＡＤＦ）６１０ｅから出力された擬似エコー信号ｙ’_ＦＤＡＦ［ｎ］を減算
し、残差信号ｅ［ｎ］を算出して出力し（ステップＳ６１０２）、エコーキャンセラ処理
が終了する。 The echo canceller unit (EC) 710 first performs double talk detection processing in step S1101, and then performs adaptive filter processing in step S1102. The signal subtraction processing unit 110b uses the transmission input signal z [n]. The pseudo echo signal y ′ [n] output from the adaptive filter unit (ADF) 110a is subtracted to calculate a residual signal e _TDAF [n]. Next, frequency domain double talk detection processing is performed (step S6101), frequency domain adaptive filter processing is performed, and the signal subtraction processing unit _{610g performs} frequency domain adaptive filter unit (FDADF) 610e from the residual signal e _TDAF [n]. _Is subtracted from the pseudo echo signal y ′ _FDAF [n] output from, and the residual signal e [n] is calculated and output (step S6102), and the echo canceller processing is terminated.

図３９は、第７の実施形態に係るエコーリダクション部（ＥＲ）７１１における処理の
流れを示すフローチャートである。なお、図７を参照して説明した第１の実施形態に係る
エコーリダクション部（ＥＲ）１１１及び図３０を参照して説明した第６の実施形態に係
るエコーリダクション部（ＥＲ）６１１における動作と同じ動作ステップについては、同
じ符号を付してその部分の説明を省略する。 FIG. 39 is a flowchart showing the flow of processing in the echo reduction unit (ER) 711 according to the seventh embodiment. The operation of the echo reduction unit (ER) 111 according to the first embodiment described with reference to FIG. 7 and the operation of the echo reduction unit (ER) 611 according to the sixth embodiment described with reference to FIG. About the same operation step, the same code | symbol is attached | subjected and description of the part is abbreviate | omitted.

第７の実施形態に係るエコーリダクション部（ＥＲ）７１１は、エコーリダクション処
理を開始して、ステップＳ１２０３ｒの受話パワースペクトル算出処理、ステップＳ１２
０３ｓの送話パワースペクトル算出処理、及びステップＳ１２０３ｅの残差パワースペク
トル算出処理の後、周波数領域ダブルトーク検出部（ＦＤＴＤ）７１１ｊは、ダブルトー
ク状態か否かを示す情報である周波数領域ダブルトーク情報ＥＲｓｔａｔｅ［ｆ，ω］出
力し、エコー量推定部（ＥＬＥ）７１１ｈは、受話パワースペクトル｜Ｘ_Ｓ［ｆ，ω］｜
^２と、送話パワースペクトル｜Ｚ_Ｓ［ｆ，ω］｜^２と、残差パワースペクトル｜Ｅ_Ｓ［ｆ
，ω］｜^２と、エコー抑圧量ＥＣＬ［ｆ，ω］と、周波数領域ダブルトーク情報ＥＲｓｔ
ａｔｅ［ｆ，ω］とを入力とし、スムージングしたエコー量｜Ｙ_Ｓ［ｆ，ω］｜^２を算出
して出力する（ステップＳ７２０５）。 The echo reduction unit (ER) 711 according to the seventh embodiment starts the echo reduction process, and the received power spectrum calculation process in step S1203r, step S12.
After the transmission power spectrum calculation process of 03s and the residual power spectrum calculation process of step S1203e, the frequency domain double talk detection unit (FDTD) 711j is frequency domain double talk information that is information indicating whether or not a double talk state is present. ERstate [f, ω] is output, and the echo amount estimation unit (ELE) 711h receives the received power spectrum | X _S [f, ω] |
² , transmission power spectrum | Z _S [f, ω] | ² and residual power spectrum | E _S [f
, Ω] | ² , echo suppression amount ECL [f, ω], and frequency domain double talk information ERst
Using ate [f, ω] as an input, the smoothed echo amount | Y _S [f, ω] | ² is calculated and output (step S7205).

その後、ステップＳ６２０７で判定処理、ステップＳ６２０８でエコー抑圧ゲイン算出
処理、ステップＳ１２０９でスペクトル選択処理、ステップＳ１２１０で送話信号抑圧処
理を行った後、ステップＳ１２１１の周波数逆変換処理に移って、エコーリダクション処
理が終了する。 Then, after performing the determination process in step S6207, the echo suppression gain calculation process in step S6208, the spectrum selection process in step S1209, and the transmission signal suppression process in step S1210, the process proceeds to the frequency inverse transform process in step S1211, and echo reduction is performed. The process ends.

上記では、エコーリダクション部（ＥＲ）７１１はＦＦＴによる周波数領域型で周波数
帯域毎に処理する方式として動作するものとして説明した。ＦＦＴによる周波数帯域をグ
ループでまとめて周波数帯域グループ毎に処理する方式や、フィルタバンクなどの帯域分
割フィルタなどの周波数領域型を実現してもよい。 In the above description, the echo reduction unit (ER) 711 has been described as operating as a method of processing for each frequency band in the frequency domain type by FFT. You may implement | achieve the frequency domain type | molds, such as the method which puts together the frequency band by FFT and processes for every frequency band group, and band division filters, such as a filter bank.

以上説明した信号処理装置の動作により、エコーキャンセラ部（ＥＣ）７１０の処理に
よるエコー抑圧量が十分でない周波数領域をエコー抑圧量推定部（ＥＣＬＥ）７１２と制
御部（ＣＴＲＬ）６１１ｋで判定して、スペクトル選択部１１１Ｌとエコー抑圧ゲイン算
出部（ＧＣＡＬ）６１１ｎにおいて周波数帯域毎にエコーキャンセラ部（ＥＣ）７１０の
処理とエコーリダクション部（ＥＲ）７１１の処理の重みを変化させるように制御するこ
とができるので、エコーパスロス変動に対して頑健にすることが可能であり、高品質な信
号を出力することが可能である。 By the operation of the signal processing apparatus described above, the echo suppression amount estimation unit (ECLE) 712 and the control unit (CTRL) 611k determine the frequency region where the echo suppression amount by the processing of the echo canceller unit (EC) 710 is not sufficient, The spectrum selection unit 111L and the echo suppression gain calculation unit (GCAL) 611n can be controlled to change the weights of the processing of the echo canceller unit (EC) 710 and the processing of the echo reduction unit (ER) 711 for each frequency band. Therefore, it is possible to make robust against echo path loss fluctuations and to output a high quality signal.

（第７の実施形態の変形例）
図４０は、本発明の第７の実施形態の変形例に係る信号処理装置の構成を示すブロック
図である。この第７の実施形態の変形例に係る信号処理装置が第７の実施形態に係る信号
処理装置と異なる点は、エコーキャンセラ部（ＥＣ）７１０、エコーリダクション部（Ｅ
Ｒ）７１１、エコー抑圧量推定部（ＥＣＬＥ）７１２に代えて、エコーキャンセラ部（Ｅ
Ｃ）７１０ｒ及びエコーリダクション部（ＥＲ）７１１ｒ及びエコー抑圧量推定部（ＥＣ
ＬＥ）７１２ｒを有することである。そして、送話入力信号格納部（ＢＵＦＦ）７１３と
受話入力信号格納部（ＢＵＦＦ）７１４と、遅延処理部（ＤＥＬＡＹ）７０２ｒを有する
ことである。 (Modification of the seventh embodiment)
FIG. 40 is a block diagram showing a configuration of a signal processing device according to a modification of the seventh embodiment of the present invention. The signal processing device according to the modification of the seventh embodiment is different from the signal processing device according to the seventh embodiment in that an echo canceller (EC) 710, an echo reduction unit (E
R) 711 and echo suppression amount estimation unit (ECLE) 712 instead of echo canceller unit (E
C) 710r, echo reduction unit (ER) 711r, and echo suppression amount estimation unit (EC
LE) 712r. Then, the transmission input signal storage unit (BUFF) 713, the reception input signal storage unit (BUFF) 714, and the delay processing unit (DELAY) 702r are included.

そして、図３３に示す第７の実施形態に係る信号処理装置では受話出力信号が送話入力
信号に回り込み音響結合することで発生していたエコーを受話入力信号を参照として送話
入力信号から抑圧する構成であったのに対し、この第７の実施形態の変形例に係る信号処
理装置では送話出力信号が図示しない遠端において回り込んで音響結合したエコーを送話
出力信号を参照として受話入力信号から抑圧する構成になっている点であり、その他の部
分は同じである。そこで、同じ部分については、同じ符号を付してその説明を省略する。 Then, in the signal processing apparatus according to the seventh embodiment shown in FIG. 33, the echo generated when the reception output signal wraps around the transmission input signal and is acoustically coupled is suppressed from the transmission input signal with reference to the reception input signal. In contrast, in the signal processing apparatus according to the modification of the seventh embodiment, the transmission output signal circulates at the far end (not shown) and the acoustically coupled echo is received with reference to the transmission output signal. This is a configuration that suppresses the input signal, and the other parts are the same. Therefore, the same parts are denoted by the same reference numerals and the description thereof is omitted.

送話入力信号格納部（ＢＵＦＦ）７１３と受話入力信号格納部（ＢＵＦＦ）７１４では
、例えば、特開２００５−１４２８８６号公報に記載の技術、あるいは送話入力信号と受
話入力信号の相互相関を用いる遅延時間推定技術によって、図示しない遠端において発生
していたエコーについて遅延などの処理を用いて同期が合う構成になっている。 In the transmission input signal storage unit (BUFF) 713 and the reception input signal storage unit (BUFF) 714, for example, the technique described in Japanese Patent Application Laid-Open No. 2005-142886 or the cross-correlation between the transmission input signal and the reception input signal is used. With the delay time estimation technique, an echo generated at a far end (not shown) is synchronized using a process such as a delay.

遅延処理部（ＤＥＬＡＹ）７０２ｒ、エコーキャンセラ部（ＥＣ）７１０ｒ、エコーリ
ダクション部（ＥＲ）７１１ｒ及びエコー抑圧量推定部（ＥＣＬＥ）７１２ｒは、入力さ
れる信号が異なるが、第７の実施形態に係る遅延処理部（ＤＥＬＡＹ）１０２、エコーキ
ャンセラ部（ＥＣ）７１０、エコーリダクション部（ＥＲ）７１１及びエコー抑圧量推定
部（ＥＣＬＥ）７１２と同様の処理を行う。 The delay processing unit (DELAY) 702r, the echo canceller unit (EC) 710r, the echo reduction unit (ER) 711r, and the echo suppression amount estimation unit (ECLE) 712r are different in input signals, but are related to the seventh embodiment. The same processing as that of the delay processing unit (DELAY) 102, the echo canceller unit (EC) 710, the echo reduction unit (ER) 711, and the echo suppression amount estimation unit (ECLE) 712 is performed.

このような構成にすることで、遠端において発生していた音響エコーについてもエコー
抑圧することができる。 By adopting such a configuration, it is possible to suppress echo even for an acoustic echo generated at the far end.

なお、本発明は通話装置に適用されるものであって、通話装置は、通話機能を有する装
置全般をいい、例えば、ハンズフリー通話機能を有するハンズフリー通話装置や、電話機
、インターフォン、携帯電話、ＰＨＳ、ＶｏＩＰソフトウェア、ＶｏＩＰシステム、ＴＶ
会議システムなどを含む。 Note that the present invention is applied to a call device, and the call device generally refers to a device having a call function. For example, a hands-free call device having a hands-free call function, a telephone, an interphone, a mobile phone, PHS, VoIP software, VoIP system, TV
Includes a conference system.

少なくともエコー成分を低減させる信号処理の例として、エコーキャンセラ、エコーリ
ダクション、エコーサプレッサ、エコーノイズリダクションを、少なくともノイズ成分を
低減させる信号処理の例として、ノイズリダクション、エコーノイズリダクションを挙げ
て説明した。このように記載したのは一般的な呼称であるからであり、本発明はこれら呼
称に限定されるものではない。また、本発明の要旨を逸脱しない範囲でこれらの信号処理
の組み合わせにおいても実施し得ることが可能である。 As an example of signal processing for reducing at least the echo component, echo canceller, echo reduction, echo suppressor, and echo noise reduction have been described, and as an example of signal processing for reducing at least the noise component, noise reduction and echo noise reduction have been described. This is because they are general names, and the present invention is not limited to these names. In addition, the present invention can be implemented in a combination of these signal processings without departing from the gist of the present invention.

本発明は、上記各実施の形態に限ることなく、その他、実施段階ではその要旨を逸脱し
ない範囲で種々の変形を実施し得ることが可能である。さらに、上記各実施形態には、種
々の段階の発明が含まれており、開示される複数の構成要件における適宜な組合せにより
種々の発明が抽出され得る。 The present invention is not limited to the above-described embodiments, and various modifications can be made without departing from the scope of the invention at the stage of implementation. Further, the above embodiments include inventions at various stages, and various inventions can be extracted by appropriately combining a plurality of disclosed constituent elements.

また、例えば各実施形態に示される全構成要件から幾つかの構成要件が削除されても、
発明が解決しようとする課題の欄で述べた課題が解決でき、発明の効果で述べられている
効果が得られる場合には、この構成要件が削除された構成が発明として抽出され得る。本
発明は装置のみならず、当該装置の機能を実現する方法やプログラムとしても成立する。 In addition, for example, even if some configuration requirements are deleted from all the configuration requirements shown in each embodiment,
When the problem described in the column of the problem to be solved by the invention can be solved and the effect described in the effect of the invention can be obtained, a configuration from which this constituent requirement is deleted can be extracted as the invention. The present invention is established not only as a device but also as a method and program for realizing the function of the device.

本発明の第１の実施形態に係る信号処理装置の構成を示すブロック図。1 is a block diagram showing a configuration of a signal processing device according to a first embodiment of the present invention. 本発明の第１の実施形態に係るエコーキャンセラ部（ＥＣ）の構成を示すブロック図。The block diagram which shows the structure of the echo canceller part (EC) which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the 1st Embodiment of this invention. 本発明の実施形態に係る抑圧ゲインの下限値の一例を示す図。The figure which shows an example of the lower limit value of the suppression gain which concerns on embodiment of this invention. 本発明の第１の実施形態に係る信号処理装置の動作を示すフローチャート。3 is a flowchart showing the operation of the signal processing apparatus according to the first embodiment of the present invention. 本発明の第１の実施形態に係るエコーキャンセラ部（ＥＣ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo canceller part (EC) which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態に係るエコーリダクション部（ＥＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo reduction part (ER) which concerns on the 1st Embodiment of this invention. 本発明の第１の実施形態の変形例に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the modification of the 1st Embodiment of this invention. 本発明の第２の実施形態に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係るノイズリダクション部（ＮＲ）の構成を示すブロック図。The block diagram which shows the structure of the noise reduction part (NR) which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 2nd Embodiment of this invention. 本発明の第２の実施形態に係るノイズリダクション部（ＮＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the noise reduction part (NR) which concerns on the 2nd Embodiment of this invention. 本発明の第３の実施形態に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施形態に係るエコーノイズリダクション部（ＥＮＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo noise reduction part (ENR) which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 3rd Embodiment of this invention. 本発明の第３の実施形態に係るエコーノイズリダクション部（ＥＮＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo noise reduction part (ENR) which concerns on the 3rd Embodiment of this invention. 本発明の第４の実施形態に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the 4th Embodiment of this invention. 本発明の第４の実施形態に係るエコーサプレッサ部（ＥＳ）の構成を示すブロック図。The block diagram which shows the structure of the echo suppressor part (ES) which concerns on the 4th Embodiment of this invention. 本発明の第４の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 4th Embodiment of this invention. 本発明の第４の実施形態に係るエコーサプレッサ部（ＥＳ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo suppressor part (ES) which concerns on the 4th Embodiment of this invention. 本発明の第５の実施形態に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the 5th Embodiment of this invention. 本発明の第５の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 5th Embodiment of this invention. 本発明の第５の実施形態に係るエコーリダクション部（ＥＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo reduction part (ER) which concerns on the 5th Embodiment of this invention. 本発明の第６の実施形態に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係るエコーキャンセラ部（ＥＣ）の構成を示すブロック図。The block diagram which shows the structure of the echo canceller part (EC) which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係る音響結合量推定部（ＡＣＬＥ）の構成を示すブロック図。The block diagram which shows the structure of the acoustic coupling amount estimation part (ACLE) which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係るエコーキャンセラ部（ＥＣ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo canceller part (EC) which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態に係るエコーリダクション部（ＥＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo reduction part (ER) which concerns on the 6th Embodiment of this invention. 本発明の第６の実施形態の変形例に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the modification of the 6th Embodiment of this invention. 本発明の第６の実施形態の変形例に係る音響結合量推定部（ＡＣＬＥ）の構成を示すブロック図。The block diagram which shows the structure of the acoustic coupling amount estimation part (ACLE) which concerns on the modification of the 6th Embodiment of this invention. 本発明の第７の実施形態に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係るエコーキャンセラ部（ＥＣ）の構成を示すブロック図。The block diagram which shows the structure of the echo canceller part (EC) which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係るエコーリダクション部（ＥＲ）の構成を示すブロック図。The block diagram which shows the structure of the echo reduction part (ER) which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係るエコー量推定部（ＥＬＥ）の構成を示すブロック図。The block diagram which shows the structure of the echo quantity estimation part (ELE) which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係る信号処理装置の動作を示すフローチャート。The flowchart which shows operation | movement of the signal processing apparatus which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係るエコーキャンセラ部（ＥＣ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo canceller part (EC) which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態に係るエコーリダクション部（ＥＲ）の動作を示すフローチャート。The flowchart which shows operation | movement of the echo reduction part (ER) which concerns on the 7th Embodiment of this invention. 本発明の第７の実施形態の変形例に係る信号処理装置の構成を示すブロック図。The block diagram which shows the structure of the signal processing apparatus which concerns on the modification of the 7th Embodiment of this invention.

Explanation of symbols

１１０、６１０、７１０、７１０ｒエコーキャンセラ部（ＥＣ）
１１０ａ適応フィルタ部（ＡＤＦ）
１１０ｂ、６１０ｇ、７１１ｈ１信号減算処理部
１１０ｃダブルトーク検出部（ＤＴＤ）
１１１、５１１、６１１、７１１、７１１ｒ、１１１２エコーリダクション部（ＥＲ）
１１１ａ、１１１ｂ、１１１ｃ、６１０ｄ、６１０ｈ周波数領域変換処理部（ＦＴ）
１１１ｄ、４１１ｄ受話パワー算出部（ＰＯＷ）
１１１ｅ、４１１ｅ送話パワー算出部（ＰＯＷ）
１１１ｆ、４１１ｆ残差パワー算出部（ＰＯＷ）
１１１ｇ、１１１ｇ２、５１１ｇ、６１１ｇ、６１１ｇ−２音響結合量推定部（ＡＣＬ
Ｅ）
１１１ｈ、５１１ｈ、７１１ｈエコー量推定部（ＥＬＥ）
１１１ｉ、４１１ｉ、６１２、７１２、７１２ｒエコー抑圧量推定部（ＥＣＬＥ）
１１１ｊ、６１１ｊ、７１１ｊ周波数領域ダブルトーク検出部（ＦＤＴＤ）
１１１ｋ、２１１ｋ、４１１ｋ、５１１ｋ、６１１ｋ制御部（ＣＴＲＬ）
１１１Ｌ、２１１Ｌ、５１１Ｌスペクトル選択部
１１１ｍ、４１１ｍゲイン格納部（ＧＴＢＬ）
１１１ｎ、１１１ｎ２、４１１ｎ、５１１ｎ、６１１ｎエコー抑圧ゲイン算出部（ＧＣ
ＡＬ）
１１１ｏ、２１１ｏ、３１１ｏ信号抑圧部（ＳＳ）
１１１ｐ、６１０ｆ周波数領域逆変換処理部（ＩＦＴ）
２１１ノイズリダクション部（ＮＲ）
２１１ｎノイズ抑圧ゲイン算出部（ＧＣＡＬ）
２１１ｑ、３１１ｑ雑音レベル推定部（ＮＬＥ）
３１１エコーノイズリダクション部（ＥＮＲ）
３１１ｎエコーノイズ抑圧ゲイン算出部（ＧＣＡＬ）
４１１エコーサプレッサ部（ＥＳ）
４１１Ｌ信号選択部
４１１ｏ、５１１ｓ送話信号抑圧部（ＳＳ）
４１１ｒ受話信号抑圧部（ＳＳ）
５１１ｔ残差信号抑圧部（ＥＳ）
６１０ｅ周波数領域適応フィルタ部（ＦＤＡＤＦ）
６１０ｉ周波数領域ダブルトーク検出部（ＦＤＤＴＤ）
６１１ｇ１、６１１ｇ１−２音響結合量推定部（ＣＡＣＬ）
６１１ｇ２、６１１ｇ２−２音響結合量補正部（ＡＤＪ）
６１１ｇ３、６１１ｇ３−２音響結合量平滑部（ＳＭＡＣＬ）
６１１ｕ送話出力パワー算出部（ＰＯＷ）
７１１ｈ２エコー量推定制御部（ＥＬＥＣＴＲＬ）
７１１ｈ３、７１１ｈ４可変ゲイン
７１１ｈ５信号加算処理部 110, 610, 710, 710r Echo canceller (EC)
110a Adaptive filter unit (ADF)
110b, 610g, 711h1 Signal subtraction processor 110c Double talk detector (DTD)
111, 511, 611, 711, 711r, 1112 Echo reduction part (ER)
111a, 111b, 111c, 610d, 610h Frequency domain transform processing unit (FT)
111d, 411d Received power calculation unit (POW)
111e, 411e Transmission power calculator (POW)
111f, 411f Residual power calculator (POW)
111g, 111g2, 511g, 611g, 611g-2 acoustic coupling amount estimation unit (ACL)
E)
111h, 511h, 711h Echo amount estimation unit (ELE)
111i, 411i, 612, 712, 712r Echo suppression amount estimation unit (ECLE)
111j, 611j, 711j Frequency domain double talk detector (FDTD)
111k, 211k, 411k, 511k, 611k control unit (CTRL)
111L, 211L, 511L Spectrum selection unit 111m, 411m Gain storage unit (GTBL)
111n, 111n2, 411n, 511n, 611n Echo suppression gain calculator (GC
AL)
111o, 211o, 311o Signal suppression unit (SS)
111p, 610f Frequency domain inverse transform processing unit (IFT)
211 Noise Reduction Unit (NR)
211n Noise suppression gain calculator (GCAL)
211q, 311q Noise level estimator (NLE)
311 Echo Noise Reduction Unit (ENR)
311n Echo noise suppression gain calculator (GCAL)
411 Echo Suppressor (ES)
411L Signal selection unit 411o, 511s Transmission signal suppression unit (SS)
411r Received signal suppressor (SS)
511t Residual signal suppression unit (ES)
610e Frequency domain adaptive filter unit (FDADF)
610i Frequency domain double talk detector (FDDTD)
611g1, 611g1-2 Acoustic coupling amount estimation unit (CACL)
611g2, 611g2-2 Acoustic coupling amount correction unit (ADJ)
611g3, 611g3-2 Acoustic coupling amount smoothing section (SMACL)
611u Transmission output power calculation unit (POW)
711h2 Echo amount estimation control unit (ELECTRL)
711h3, 711h4 Variable gain 711h5 Signal addition processing unit

Claims

First signal processing means for suppressing an echo contained in the input signal and outputting an echo reduction signal;
Second signal processing means for inputting the input signal and the echo reduction signal and suppressing at least one of echo and noise;
Echo suppression amount calculating means for calculating an echo suppression amount of the first signal processing means by calculating an amount indicating a difference in power spectrum between the input signal and the echo reduction signal;
With
The second signal processing means includes
If an echo suppression amount of the first signal processing means which is calculated by the echo suppression quantity calculating means is determined not to be sufficient to select the input signal, otherwise selecting the echo reduction signal Having a selection means,
You suppress at least one of either said echo and said noise selected the input signal and the echo-reduced signal by said selecting means
Signal processing apparatus.

The second signal processing means includes
At least one of a suppression amount calculated by estimating at least one of the echo and the noise included in at least one of the input signal and the echo reduction signal, and an echo suppression amount calculated by the echo suppression amount calculation means And a suppression amount setting means for setting the suppression amount of the signal selected by the selection means,
You suppressed by the suppression amount set by said suppression amount setting means
The signal processing apparatus according to 請 Motomeko 1.

The second signal processing means includes
Based on the echo suppression amount calculated by the echo suppression amount calculation means , included in the first suppression amount and the echo reduction signal signal calculated by estimating at least one of the echo and the noise included in the input signal Suppression amount setting means for selecting one of the second suppression amounts calculated by estimating at least one of the echo and the noise ,
It suppressed by one suppression amount either previous SL suppression amount the first suppression amount is selected by the setting means and said second suppression amount
The signal processing apparatus according to 請 Motomeko 1.

The second signal processing means includes
When the input signal is selected by the selection unit, the first suppression amount calculated by estimating the echo or the noise included in the input signal is selected, and the echo reduction signal is selected by the selection unit. If there is a suppression amount setting means for selecting a second suppression amount calculated by estimating the echo or the noise included in the echo reduction signal ,
It suppressed by suppressing the amount selected by the suppression amount setting means
The signal processing apparatus according to 請 Motomeko 1.

First signal processing means for suppressing an echo at least with respect to an input signal and outputting an echo reduction signal;
Second signal processing means for inputting the input signal and the echo reduction signal, and suppressing and outputting at least one of echo and noise;
Echo suppression amount calculating means for calculating an echo suppression amount of the first signal processing means by calculating an amount indicating a difference in power spectrum between the input signal and the echo reduction signal;
With
The second signal processing means includes
First suppression amount setting means for calculating a first suppression amount by estimating at least one of the echo and the noise included in the input signal;
Second suppression amount setting means for calculating a second suppression amount by estimating at least one of the echo and the noise included in the echo reduction signal;
When it is determined that the echo suppression amount of the first signal processing means calculated by the echo suppression amount calculation means is not sufficient, at least one of the echo and the noise included in the input signal is used as the first suppression. A signal suppressed by the first suppression amount estimated by the amount setting means is selected; otherwise , at least one of the echo and the noise included in the echo reduction signal is selected by the second suppression amount setting means. and selection means for selecting a signal which has been suppressed in the second suppression amount estimated,
You output the signal selected by said selection means
Signal processing apparatus.

The suppression amount setting means includes:
Echo amount estimation means for estimating an echo amount based on the echo suppression amount calculated by the echo suppression amount calculation means;
To set the reduction quantity of the second signal processing means using the echo quantity estimated by said echo estimation means
The signal processing device according to claim 2 .

The suppression amount setting unit estimates an echo amount for each frequency region by the echo amount estimation unit, and uses the echo amount estimated for each frequency region to determine the suppression amount of the second signal processing unit for each frequency region. the signal processing apparatus according to 請 Motomeko 6 to set.

It said suppression amount setting means, the signal processing apparatus according to any one of the second claim to set the amount of suppression of the signal processing means for each frequency region 2 to claim 5.

The echo amount estimating means includes
An acoustic coupling amount estimating means for estimating an acoustic coupling amount;
An acoustic coupling amount correction unit that corrects the acoustic coupling amount estimated by the acoustic coupling amount estimation unit based on the echo suppression amount calculated by the echo suppression amount calculation unit;
Have
The signal processing apparatus according to 請 Motomeko 6 you estimate the echo amount using the acoustic coupling amount corrected by the acoustic coupling amount correcting means.

The echo amount estimation means estimates the echo amount for each frequency region,
The acoustic coupling amount estimation means estimates the acoustic coupling amount for each frequency region,
The acoustic coupling amount correcting means, correct the acoustic coupling amount for each of the frequency domain
The signal processing apparatus according to 請 Motomeko 9.

The suppression amount setting means includes:
Echo amount estimation means for estimating the echo amount;
A suppression gain calculating means for calculating a suppression gain using the echo amount estimated by the echo amount estimating means;
Suppression gain correction means for correcting the suppression gain calculated by the suppression gain calculation means based on the echo suppression amount calculated by the echo suppression amount calculation means;
Have
To set the suppression gain corrected by the suppression gain correcting means as a suppression amount
The signal processing device according to claim 2 .

The echo amount estimation means estimates the echo amount for each frequency region,
The suppression gain calculation means calculates the suppression gain for each frequency domain,
The suppression gain correcting means, you correcting the suppression gain for each frequency domain
The signal processing apparatus according to 請 Motomeko 11.

It said selecting means, the signal processing apparatus according to any one of 請 Motomeko 1 to claim 12 that select either of the two signals for each frequency region.

The echo suppression amount calculation means calculates the echo suppression amount of the first signal processing means for each frequency domain ,
The selection means selects one of the two signals for each frequency domain.
The signal processing device according to any one of claims 1 to 13 .

Wherein the input signal is a signal processing apparatus according to any one of the transmission signal der Ru請 Motomeko 1 to claim 14 picked up by the microphone.

Wherein the input signal is a signal processing apparatus according to any one of the received signals Der Ru請 Motomeko 1 to claim 15.