JP2006074642A

JP2006074642A - Conference telephone system

Info

Publication number: JP2006074642A
Application number: JP2004257932A
Authority: JP
Inventors: Kenichi Taniguchi; 賢一谷口
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2004-09-06
Filing date: 2004-09-06
Publication date: 2006-03-16

Abstract

<P>PROBLEM TO BE SOLVED: To make noise outputted by an acoustic echo canceler to natural noise. <P>SOLUTION: This conference telephone system has the acoustic echo cancelers 104, 105 which cancel echos to voice signals outputted by a microphone 103 using the voice signal from a remote speaker as a reference signal, a near end environmental noise detection part 106 which defines power by adding a first threshold to transmission power minimized from the minimum transmission power by a plurality of first sections as second threshold power in a second section and detects a signal section of transmission power smaller than transmission power by averaging the minimum transmission power by each first section smaller than the second threshold power in the second section, a near end noise storage part 107 which stores signals of the detection part from an input point and has a plurality of output points and a near end noise processing part 108 which adds signals of the plurality of output points of the near end noise storage part 107 to outputs of the acoustic echo canceler. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、環境雑音のある中でも不自然な雑音を受信および送信することのない会議電話装置に関するものである。 The present invention relates to a conference telephone apparatus that does not receive and transmit unnatural noise even in the presence of environmental noise.

会議電話装置では、音響側のパスに対するエコーキャンセラが実装されており、また、アナログ回線に対するエコーキャンセラでは、ハイブリッド回路に対するエコーキャンセラも実装されている。音響パスに対するエコーキャンセラのエコー消去性能の限界は、マイクで収音したときの環境雑音に依存し、ハイブリッド回路に対するエコーキャンセラは遠端話者からの背景雑音および回線雑音に依存する。音響パスでスピーカからマイクに入ってくる信号を環境雑音より小さくはキャンセルすることができないし、また、ハイブリッド回路において送話側から受話側に入ってくる信号を環境雑音および回線雑音より小さくはキャンセルすることができないという制約がある。その際に、キャンセルすることができない信号は白色化された雑音としてそれぞれのエコーキャンセラから出力される。 In the conference telephone apparatus, an echo canceller for the path on the acoustic side is mounted, and an echo canceller for the analog circuit is also mounted with an echo canceller for the hybrid circuit. The limit of the echo cancellation performance of the echo canceller for the acoustic path depends on the environmental noise when picked up by the microphone, and the echo canceller for the hybrid circuit depends on the background noise and the line noise from the far-end speaker. The signal that enters the microphone from the speaker in the acoustic path cannot be canceled if it is smaller than the environmental noise, and the signal that enters the receiver side from the transmitter side in the hybrid circuit is canceled if it is smaller than the environmental noise and line noise. There is a restriction that it cannot be done. At that time, a signal that cannot be canceled is output from each echo canceller as whitened noise.

しかし、受話側では遠端話者の音声が小さいときには回線エコーキャンセラの出力する雑音が気になるという不具合があり、送話側では近端話者の音声が小さいときには音響エコーキャンセラの出力する雑音が気になるという不具合があった。 However, there is a problem that the noise output by the line echo canceller is anxious when the far-end speaker's voice is low on the receiving side, and the noise output by the acoustic echo canceller when the near-end speaker's voice is low on the transmitting side. There was a problem that I was worried about.

そのため、音響エコーキャンセラの出力がある閾値より大きいときのみ送話信号を出力し、音響エコーキャンセラの出力する雑音を送話しないようにし、その代わりに近端話者の環境雑音を周波数分析し、ホワイトノイズにそのスペクトル包絡特性を持たせた雑音を付加するという方法で自然な環境雑音に重畳した送話音声を出力するようにしていた。同様に、回線エコーキャンセラについても、遠端話者と回線からの雑音を周波数分析し、そのホワイトノイズにそのスペクトル包絡特性を持たせた雑音を付加するという方法で自然な環境雑音に重畳した受話音声を出力するようにしていた（例えば、特許文献１参照）。 Therefore, only when the output of the acoustic echo canceller is larger than a certain threshold, the transmission signal is output, the noise output by the acoustic echo canceller is not transmitted, and instead, the environmental noise of the near-end speaker is frequency-analyzed, The transmission voice superimposed on the natural environmental noise is output by adding the noise having the spectral envelope characteristic to the white noise. Similarly, for line echo cancellers, the frequency of noise from far-end speakers and lines is analyzed, and the white noise is added to the noise with the spectral envelope characteristics. Audio was output (see, for example, Patent Document 1).

また、音響あるいは回線のエコーキャンセラが出力する雑音を全く消してしまう方法として、ノンリニアプロセッサ処理がある（例えば、非特許文献１参照）。これは振幅の小さい部分は出力しないようにする処理である。
特開２００２−４１１００号公報ＩＴＵ−Ｔ勧告Ｇ．１６５ Further, there is a nonlinear processor process as a method for completely eliminating noise output from an acoustic or line echo canceller (for example, see Non-Patent Document 1). This is a process in which a portion with a small amplitude is not output.
JP 2002-41100 A ITU-T recommendation 165

しかしながら、従来の会議電話装置では、遠端話者および近端話者からの環境雑音を周波数分析し、そのホワイトノイズにそのスペクトル包絡特性を持たせるようにしたが、そのスペクトル包絡を持つ雑音は、環境雑音としては近い周波数特性をもっているが、雑音生成の元になる信号がホワイトノイズであり、周波数特性が時刻により変動しない特性を持っているため、不自然な信号になってしまうという問題点を有していた。また、エコーキャンセラが出力する雑音を全く消してしまうノンリニアプロセッサ処理では、最適なクリッピングレベルを決定しなければならないという問題点を有していた。 However, in the conventional conference telephone device, the environmental noise from the far-end speaker and the near-end speaker is frequency-analyzed, and the white envelope has the spectral envelope characteristic. However, the noise having the spectral envelope is However, environmental noise has similar frequency characteristics, but the signal that generates noise is white noise, and the frequency characteristics do not vary with time, resulting in an unnatural signal. Had. In addition, the nonlinear processor processing that completely eliminates the noise output from the echo canceller has a problem that an optimum clipping level must be determined.

この会議電話装置では、音響エコーキャンセラまたは回線エコーキャンセラが出力する雑音を自然な雑音とすることが要求されている。 In this conference telephone apparatus, it is required that the noise output from the acoustic echo canceller or the line echo canceller is a natural noise.

本発明は、この要求を満たすため、音響エコーキャンセラまたは回線エコーキャンセラ
が出力する雑音を自然な雑音とすることができる会議電話装置を提供することを目的とする。 In order to satisfy this requirement, an object of the present invention is to provide a conference telephone apparatus that can make noise output from an acoustic echo canceller or a line echo canceller natural noise.

この課題を解決するために本発明は、遠端話者からの音声信号を音声として再生するスピーカと、スピーカで再生された音声と近端話者から発声された音声とを収音して音声信号を出力するマイクと、適応フィルタと加算器から構成され、遠端話者からの音声信号をリファレンス信号として、マイクから出力される音声信号に対してエコーキャンセルを行う音響エコーキャンセラと、マイクから出力される音声信号から、第１の区間ごとの送信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの送信最低パワーから最小となる送信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの送信最低パワーを平均化した送信パワーより小さい送信パワーの信号区間を検出する近端環境雑音検出部と、近端環境雑音検出部から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める近端雑音記憶部と、近端雑音記憶部の複数の出力ポイントの信号を加算して音響エコーキャンセラの出力に加算する近端雑音加工部とを有する構成を備えている。 In order to solve this problem, the present invention provides a speaker that reproduces a voice signal from a far-end speaker as a voice, a voice that is reproduced by the speaker, and a voice that is uttered by a near-end speaker. A microphone that outputs a signal, an adaptive filter, and an adder. An acoustic echo canceller that performs echo cancellation on the audio signal output from the microphone, using the audio signal from the far-end speaker as a reference signal. From the audio signal to be output, the transmission minimum power for each first interval and the transmission power that becomes the minimum from the transmission minimum power for each of the plurality of first intervals in the second interval wider than the first interval are first. The power to which the threshold is added is defined as the second threshold power, and the transmission power lower than the transmission power obtained by averaging the transmission minimum power for each of the first intervals smaller than the second threshold power in the second interval. The near-end environmental noise detection unit that detects the power signal interval and the signal interval signal output from the near-end environmental noise detection unit are stored from the input point and has multiple output points, each input point and output point A near-end noise storage unit that advances the signal in the direction of time, and a near-end noise processing unit that adds signals from a plurality of output points of the near-end noise storage unit and adds them to the output of the acoustic echo canceller. Yes.

これにより、音響エコーキャンセラが出力する雑音を自然な雑音とすることができる会議電話装置が得られる。 Thereby, the conference telephone apparatus which can make the noise which an acoustic echo canceller outputs natural noise is obtained.

本発明の会議電話装置は、適応フィルタと加算器から構成され、遠端話者からの音声信号をリファレンス信号として、マイクから出力される音声信号に対してエコーキャンセルを行う音響エコーキャンセラと、マイクから出力される音声信号から、第１の区間ごとの送信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの送信最低パワーから最小となる送信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの送信最低パワーを平均化した送信パワーより小さい送信パワーの信号区間を検出する近端環境雑音検出部と、近端環境雑音検出部から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める近端雑音記憶部と、近端雑音記憶部の複数の出力ポイントの信号を音響エコーキャンセラの出力に加算する近端雑音加工部とを有することにより、近端雑音記憶部の複数の出力ポイントから読み出して時間的に異なる環境雑音を音響エコーキャンセラの出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、音響エコーキャンセラが出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができるという有利な効果が得られる。 The conference telephone device according to the present invention includes an adaptive filter and an adder, and an acoustic echo canceller that performs echo cancellation on a voice signal output from a microphone using a voice signal from a far-end speaker as a reference signal, and a microphone From the audio signal output from the first minimum transmission power for each first interval and the minimum transmission power from the minimum transmission power for each of the plurality of first intervals in the second interval wider than the first interval. The power obtained by adding the threshold is set as the second threshold power, and the signal interval of the transmission power smaller than the transmission power obtained by averaging the lowest transmission power for each first interval smaller than the second threshold power in the second interval is detected. The near-end environmental noise detection unit and the signal of the signal section output from the near-end environmental noise detection unit are stored from the input point, and have multiple output points, By having a near-end noise storage unit that advances the input point and output point in the direction of time, and a near-end noise processing unit that adds the signals of the multiple output points of the near-end noise storage unit to the output of the acoustic echo canceller Because it can be read from multiple output points of the near-end noise storage unit and temporally different environmental noise can be superimposed on the output signal of the acoustic echo canceller, natural and averaged environmental noise can be realized The noise output by the acoustic echo canceller can be natural noise, and since it is an averaged environmental noise, even if there is a misjudgment between the utterance section and the environmental noise section, the misjudgment part is conspicuous The advantageous effect that it can be eliminated is obtained.

さらに、近端話者からの信号である近端話者送話信号を２線−４線変換して遠端話者に送信し、遠端話者からの信号である遠端話者受話信号を４線−２線変換して受信するハイブリッド回路と、適応フィルタと加算器から構成され、ハイブリッド回路へ入力される近端話者送話信号をリファレンス信号とし、ハイブリッド回路から出力される遠端話者受話信号をエコーキャンセルする回線エコーキャンセラと、ハイブリッド回路から出力される遠端話者受話信号から、第１の区間ごとの受信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの受信最低パワーから最小となる受信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの受信パワーを平均化した受信パワーより小さい受信パワーの信号区間を検出する遠端環境雑音検出部と、遠端環境雑音検出部から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出
力ポイントを時間の進行方向に進める遠端雑音記憶部と、遠端雑音記憶部の複数の出力ポイントの信号を回線エコーキャンセラの出力に加算する遠端雑音加工部とを有することにより、遠端雑音記憶部の複数の出力ポイントから読み出して時間的に異なる環境雑音を回線エコーキャンセラの出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、回線エコーキャンセラが出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができるという有利な効果が得られる。 Further, the near-end speaker transmission signal, which is a signal from the near-end speaker, is subjected to 2-wire-4 wire conversion and transmitted to the far-end speaker, and the far-end speaker reception signal, which is a signal from the far-end speaker. 4 is a hybrid circuit that receives 4-wire-to-wire conversion, an adaptive filter, and an adder. The near-end speaker transmission signal input to the hybrid circuit is used as a reference signal, and the far-end output from the hybrid circuit. From the line echo canceller for echo canceling the speaker reception signal and the far-end speaker reception signal output from the hybrid circuit, a plurality of reception minimum powers for each first interval and a second interval wider than the first interval Power obtained by adding the first threshold to the minimum received power from the lowest received power for each first interval of the first interval as the second threshold power, and for each first interval smaller than the second threshold power in the second interval Receiver A far-end environmental noise detector that detects a signal section with a received power smaller than the average received power, and a signal section signal that is output from the far-end environmental noise detector is stored from the input point, and multiple output points A far-end noise storage unit that advances each input point and output point in the direction of time, and a far-end noise processing unit that adds signals from multiple output points of the far-end noise storage unit to the output of the line echo canceller Can be read from multiple output points of the far-end noise storage unit and temporally different environmental noise can be superimposed on the output signal of the line echo canceller, realizing natural and averaged environmental noise Therefore, the noise output by the line echo canceller can be natural noise, and since it is an averaged environmental noise, Even if judgment mistakes section and environmental noise section, advantageous effect can be obscured portion of determination errors is obtained.

さらに、近端雑音記憶部は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることにより、記憶された環境雑音を更に有効に用いることができるという有利な効果が得られる。 Furthermore, the near-end noise storage unit is advantageous in that the stored environmental noise can be more effectively used by advancing at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input point. An effect is obtained.

さらに、遠端雑音記憶部は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることにより、記憶された環境雑音を更に有効に用いることができるという有利な効果が得られる。 Further, the far-end noise storage unit is advantageous in that the stored environmental noise can be used more effectively by advancing at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input point. An effect is obtained.

さらに、近端環境雑音検出部で検出した平均化した送信パワーのレベルに対応する振幅を用いて、音響エコーキャンセラ出力をセンタークリッピング処理する近端センタークリッピング処理部を備えたことにより、送信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができるという有利な効果が得られる。 Furthermore, a transmission power level is provided by providing a near-end center clipping processing unit that performs center clipping processing on the acoustic echo canceller output using an amplitude corresponding to the averaged transmission power level detected by the near-end environmental noise detection unit. Can be used to synchronize the clipping level and the level when separating into the voice and environmental noise sections, and add the environmental noise with the same level of averaged features, reducing the unnaturalness of clipping The advantageous effect that it can be made is obtained.

さらに、遠端環境雑音検出部で検出した平均化した受信パワーのレベルに対応する振幅を用いて、回線エコーキャンセラ出力をセンタークリッピング処理する遠端センタークリッピング処理部を備えたことにより、受信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができるという有利な効果が得られる。 Furthermore, a reception power level is provided by providing a far-end center clipping processing unit that performs center clipping processing on the line echo canceller output using an amplitude corresponding to the average received power level detected by the far-end environmental noise detection unit. Can be used to synchronize the clipping level and the level when separating into the voice and environmental noise sections, and add the environmental noise with the same level of averaged features, reducing the unnaturalness of clipping The advantageous effect that it can be made is obtained.

本発明は、音響エコーキャンセラまたは回線エコーキャンセラが出力する雑音を自然な雑音とするという目的を、マイクまたは回線エコーキャンセラから出力される信号をパワーレベルにより発声区間と環境雑音区間に分離し、環境雑音区間の信号を環境雑音記憶部に記憶し、環境雑音記憶部から複数のポイントから信号を順次読み出し、時間的に異なる環境雑音を重畳させて自然で且つ平均化された環境雑音とすることにより実現した。 The present invention aims to make the noise output from the acoustic echo canceller or line echo canceller natural noise, and separates the signal output from the microphone or line echo canceller into an utterance section and an environmental noise section according to the power level. By storing the signal of the noise section in the environmental noise storage unit, sequentially reading out the signal from a plurality of points from the environmental noise storage unit, and superimposing the environmental noise different in time to make a natural and averaged environmental noise It was realized.

上記課題を解決するためになされた第１の発明は、遠端話者からの音声信号を音声として再生するスピーカと、スピーカで再生された音声と近端話者から発声された音声とを収音して音声信号を出力するマイクと、適応フィルタと加算器から構成され、遠端話者からの音声信号をリファレンス信号として、マイクから出力される音声信号に対してエコーキャンセルを行う音響エコーキャンセラと、マイクから出力される音声信号から、第１の区間ごとの送信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの送信最低パワーから最小となる送信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの送信最低パワーを平均化した送信パワーより小さい送信パワーの信号区間を検出する近端環境雑音検出部と、近端環境雑音検出部から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める近端雑音記憶部と、近端雑音記憶部の複数の出力ポイントの信号を音響エコーキ
ャンセラの出力に加算する近端雑音加工部とを有することとしたものであり、近端雑音記憶部の複数の出力ポイントから読み出して時間的に異なる環境雑音を音響エコーキャンセラの出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、音響エコーキャンセラが出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができるという作用・効果を有する。 A first invention made to solve the above problem is to collect a speaker that reproduces a voice signal from a far-end speaker as a voice, a voice reproduced by the speaker, and a voice uttered by a near-end speaker. An acoustic echo canceller that consists of a microphone that emits sound and outputs an audio signal, an adaptive filter, and an adder, and that performs echo cancellation on the audio signal output from the microphone using the audio signal from the far-end speaker as a reference signal And the minimum transmission power from the minimum transmission power for each first interval and the minimum transmission power for each of the plurality of first intervals in the second interval wider than the first interval from the audio signal output from the microphone. The power obtained by adding the first threshold to the second threshold power, and the transmission power obtained by averaging the transmission minimum power for each first section smaller than the second threshold power in the second section The near-end environmental noise detector that detects a signal section with a smaller transmission power, and the signal of the signal section that is output from the near-end environmental noise detector is stored from the input points, and has multiple output points. And a near-end noise storage unit that advances the output point in the direction of time, and a near-end noise processing unit that adds signals from multiple output points of the near-end noise storage unit to the output of the acoustic echo canceller It can be read from a plurality of output points of the near-end noise storage unit and temporally different environmental noise can be superimposed on the output signal of the acoustic echo canceller, and natural and averaged environmental noise can be realized. Therefore, the noise output from the acoustic echo canceller can be natural noise, and since it is the averaged environmental noise, Even if judgment mistake between, having operations and effects that can be obscured portion of the judgment mistake.

上記課題を解決するためになされた第２の発明は、近端話者からの信号である近端話者送話信号を２線−４線変換して遠端話者に送信し、遠端話者からの信号である遠端話者受話信号を４線−２線変換して受信するハイブリッド回路と、適応フィルタと加算器から構成され、ハイブリッド回路へ入力される近端話者送話信号をリファレンス信号とし、ハイブリッド回路から出力される遠端話者受話信号をエコーキャンセルする回線エコーキャンセラと、ハイブリッド回路から出力される遠端話者受話信号から、第１の区間ごとの受信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの受信最低パワーから最小となる受信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの受信パワーを平均化した受信パワーより小さい受信パワーの信号区間を検出する遠端環境雑音検出部と、遠端環境雑音検出部から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める遠端雑音記憶部と、遠端雑音記憶部の複数の出力ポイントの信号を回線エコーキャンセラの出力に加算する遠端雑音加工部とを有することとしたものであり、遠端雑音記憶部の複数の出力ポイントから読み出して時間的に異なる環境雑音を回線エコーキャンセラの出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、回線エコーキャンセラが出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができるという作用・効果を有する。 A second invention made to solve the above-mentioned problem is that the near-end speaker transmission signal, which is a signal from the near-end speaker, is subjected to 2-wire to 4-wire conversion and transmitted to the far-end speaker, and the far-end speaker is transmitted. A near-end speaker transmission signal input to the hybrid circuit, which is composed of a hybrid circuit that receives the far-end speaker reception signal, which is a signal from the speaker, by performing 4-to-2 wire conversion, and an adaptive filter and an adder. As a reference signal, a line echo canceller for echo canceling the far-end speaker reception signal output from the hybrid circuit, and a reception minimum power for each first interval from the far-end speaker reception signal output from the hybrid circuit, In the second section wider than the first section, the power obtained by adding the first threshold to the minimum received power for each of the plurality of first sections is set as the second threshold power, and the second section The second threshold at A far-end environmental noise detector for detecting a signal section having a received power smaller than the received power obtained by averaging the received power for each first section smaller than the word, and a signal in the signal section output from the far-end environmental noise detector. A far-end noise storage unit that stores a plurality of output points and advances each input point and output point in the direction of time progress, and a signal from a plurality of output points of the far-end noise storage unit. A far-end noise processing unit that adds to the output of the far-end noise, and reads from multiple output points of the far-end noise storage unit and superimposes temporally different environmental noise on the output signal of the line echo canceller Since natural and averaged environmental noise can be realized, the noise output by the line echo canceller should be natural noise. Can also because it is averaged ambient noise, even if judgment mistakes vocal section and environmental noise section, has an action and effect that it is possible to obscure the portion of the judgment mistake.

上記課題を解決するためになされた第３の発明は、近端雑音記憶部は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることとしたものであり、記憶された環境雑音を更に有効に用いることができるという作用・効果を有する。また、音響エコーキャンセラについて、環境雑音の周波数特性が自然なものとなるので、話者に対して周囲音の違和感を与えないようにすることができ、円滑な会話をすることができるという作用・効果を有する。 According to a third aspect of the invention made to solve the above problem, the near-end noise storage unit advances at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input points. The stored environmental noise can be used more effectively. In addition, the acoustic echo canceller has a natural frequency characteristic of environmental noise, so it can prevent the speaker from feeling uncomfortable with ambient sounds and can have a smooth conversation. Has an effect.

上記課題を解決するためになされた第４の発明は、遠端雑音記憶部は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることとしたものであり、記憶された環境雑音を更に有効に用いることができるという作用・効果を有する。また、回線エコーキャンセラについて、環境雑音の周波数特性が自然なものとなるので、話者に対して周囲音の違和感を与えないようにすることができ、円滑な会話をすることができるという作用・効果を有する。 A fourth invention made to solve the above-described problem is that the far-end noise storage unit advances at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input points. The stored environmental noise can be used more effectively. In addition, because the frequency characteristics of environmental noise are natural for line echo cancellers, it is possible to prevent the speaker from feeling uncomfortable with ambient sounds and to have a smooth conversation. Has an effect.

上記課題を解決するためになされた第５の発明は、近端環境雑音検出部で検出した平均化した送信パワーのレベルに対応する振幅を用いて、音響エコーキャンセラ出力をセンタークリッピング処理する近端センタークリッピング処理部を備えることとしたものであり、送信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができるという作用・効果を有する。 A fifth invention made to solve the above-described problem is a near-end that performs center clipping processing on an acoustic echo canceller output using an amplitude corresponding to an averaged transmission power level detected by a near-end environmental noise detector. An environment with an averaged feature of the same level, which is equipped with a center clipping processing unit and can synchronize the clipping level with the level when separating into the utterance interval and the environmental noise interval according to the transmission power level Since noise can be added, there is an effect that the unnaturalness of clipping can be reduced.

上記課題を解決するためになされた第６の発明は、遠端環境雑音検出部で検出した平均
化した受信パワーのレベルに対応する振幅を用いて、回線エコーキャンセラ出力をセンタークリッピング処理する遠端センタークリッピング処理部を備えることとしたものであり、受信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができるという作用・効果を有する。 A sixth aspect of the invention made to solve the above-described problem is that a far-end which performs center clipping processing on the line echo canceller output using an amplitude corresponding to the level of the average received power detected by the far-end environmental noise detector. An environment with an averaged feature of the same level, which is equipped with a center clipping processing unit, which can synchronize the level and clipping level when separating into the utterance interval and the environmental noise interval according to the received power level Since noise can be added, there is an effect that the unnaturalness of clipping can be reduced.

（実施の形態１）
図１は、本発明の実施の形態１による会議電話装置を示すブロック図である。 (Embodiment 1)
FIG. 1 is a block diagram showing a conference telephone apparatus according to Embodiment 1 of the present invention.

図１において、１０１は公衆回線からの２線−４線変換を行うハイブリッド回路、１０２は遠端話者から受信した通話信号（遠端話者受話信号）を近端話者側で再生するスピーカである。図１では、デジタル処理された遠端話者受話信号をアナログ信号に変換するＤ／Ａ変換器およびスピーカを駆動するためのパワーアンプは省略している。１０３はスピーカ１０２で再生した遠端話者の受信音声と近端話者の音声を収音して音声信号として出力するマイクロホン（マイク）である。同様に、マイクロホン１０３から出力される音声信号をＡ／Ｄ変換するためのＡ／Ｄ変換器は省略している。１０４は音響適応フィルタ１０５と共に動作してエコーキャンセル処理を行う加算器、１０６はマイク１０３から出力した信号から、近端環境雑音のレベルを判定し、後述の近端雑音記憶部１０７に近端環境雑音信号として出力する近端環境雑音検出部、１０７は記憶した近端環境雑音信号を後述の近端雑音加工部１０８に出力する近端雑音記憶部、１０８は近端雑音記憶部１０７から出力される近端環境雑音信号を加工し、自然な近端環境雑音を生成する近端雑音加工部、１０９は加算器１０４からの出力信号と近端雑音加工部１０８からの出力信号を加算し、ハイブリッド回路１０１へ出力する加算器、１１０は近端センタークリッピング処理のクリッピングレベルを決定する近端センタークリッピング処理部、１１１は回線適応フィルタ１１２と共に動作してエコーキャンセル処理を行う加算器、１１３はハイブリッド回路１０１から出力した信号から、遠端環境雑音のレベルを判定し、後述の遠端雑音記憶部１１４に遠端環境雑音信号として出力する遠端環境雑音検出部、１１４は記憶した遠端環境雑音信号を後述の遠端雑音加工部１１５に出力する遠端雑音記憶部、１１５は遠端雑音記憶部１１４の遠端環境雑音信号を加工し、自然な遠端環境雑音を生成する遠端雑音加工部、１１６は加算器１１１からの出力信号と遠端雑音加工部１１５からの出力信号とを加算してスピーカ１０２へ出力する加算器、１１７は遠端センタークリッピング処理のクリッピングレベルを決定する遠端センタークリッピング処理部である。 In FIG. 1, 101 is a hybrid circuit that performs 2-wire to 4-wire conversion from a public line, and 102 is a speaker that reproduces a call signal (far end talker received signal) received from a far end talker on the near end talker side. It is. In FIG. 1, a D / A converter that converts a digitally processed far-end speaker reception signal into an analog signal and a power amplifier for driving a speaker are omitted. Reference numeral 103 denotes a microphone (microphone) that picks up the received voice of the far-end speaker and the voice of the near-end speaker reproduced by the speaker 102 and outputs them as voice signals. Similarly, an A / D converter for A / D converting the audio signal output from the microphone 103 is omitted. Reference numeral 104 denotes an adder that operates together with the acoustic adaptive filter 105 to perform echo cancellation processing, and 106 determines the level of near-end environmental noise from the signal output from the microphone 103, and the near-end environment storage unit 107 (to be described later) stores the near-end environment noise level. A near-end environmental noise detection unit 107 that outputs as a noise signal, 107 is a near-end noise storage unit that outputs a stored near-end environmental noise signal to a near-end noise processing unit 108 described later, and 108 is output from the near-end noise storage unit 107. A near-end noise processing unit 109 for processing a near-end environmental noise signal to generate natural near-end environmental noise, and 109 adds the output signal from the adder 104 and the output signal from the near-end noise processing unit 108 to obtain a hybrid An adder to be output to the circuit 101, 110 is a near-end center clipping processing unit that determines a clipping level of the near-end center clipping process, and 111 is a line adaptive filter. 112, an adder that operates together with 112 to perform echo cancellation processing, determines the level of far-end environmental noise from the signal output from the hybrid circuit 101, and outputs it as a far-end environmental noise signal to the far-end noise storage unit 114 described later The far-end environmental noise detection unit 114 outputs a stored far-end environmental noise signal to a later-described far-end noise processing unit 115, and 115 represents the far-end environmental noise signal of the far-end noise storage unit 114. A far-end noise processing unit that processes and generates natural far-end environmental noise, and 116 adds an output signal from the adder 111 and an output signal from the far-end noise processing unit 115 and outputs the result to the speaker 102 Reference numeral 117 denotes a far end center clipping processing unit that determines a clipping level of the far end center clipping process.

このように構成された会議電話装置についてまず、近端環境雑音検出部１０６の動作について図２（ａ）を用いて述べる。図２（ａ）は、近端環境雑音検出部１０６を示すブロック図である。 Regarding the conference telephone apparatus configured as described above, first, the operation of the near-end environmental noise detection unit 106 will be described with reference to FIG. FIG. 2A is a block diagram showing the near-end environmental noise detection unit 106.

図２（ａ）において、２０１は近端話者からの受話信号について一定期間Ｔ１毎の平均受信パワー（数１）を計算する短時間パワー検出部、２０２は平均受信パワー（数１）を過去期間Ｔ２分記憶する短時間パワー記憶部、２０３は過去期間Ｔ２分の平均受信パワー（数１）から最低レベルの受信パワー（数２）を探索する最低パワー検出部、２０４は所定の条件を満たす平均受信パワーを探索し、その平均値を平均受信雑音パワーとする平均送話雑音パワー計算部である。 In FIG. 2A, 201 is a short-time power detection unit that calculates an average received power (Equation 1) for each predetermined period T1 for a received signal from a near-end speaker, and 202 is an average received power (Equation 1) in the past. A short-time power storage unit that stores the period T2 and 203 is a minimum power detection unit that searches for the lowest level received power (Equation 2) from the average received power (Equation 1) for the past period T2, and 204 satisfies a predetermined condition An average transmission noise power calculation unit that searches for average reception power and uses the average value as the average reception noise power.

このように構成された近端環境雑音検出部１０６の動作を説明する。 The operation of the near-end environmental noise detection unit 106 configured as described above will be described.

短時間パワー検出部２０１において、近端話者受信信号について一定期間Ｔ１毎の平均受信パワー（数１）を計算し、短時間パワー記憶部２０２に過去期間Ｔ２分記憶する。ｎはＴ２期間分の平均受信パワーのインデックスを表す。次に、最低パワー検出部２０３において、過去期間Ｔ２分の平均受信パワー（数１）から最低レベルの受信パワー（数２）を探索する。次に、平均受話雑音パワー計算部２０４において、（数３）を満たす過去期間Ｔ２分の平均受信パワーを探索し、その平均値を平均受信雑音パワーとする。αは最低レベルの受信パワーから雑音の範囲とするばらつきを吸収するパラメータである。 The short-time power detection unit 201 calculates the average received power (Equation 1) for each fixed period T1 for the near-end speaker reception signal, and stores it in the short-time power storage unit 202 for the past period T2. n represents an index of average received power for the period T2. Next, the lowest power detection unit 203 searches for the lowest level received power (Equation 2) from the average received power (Equation 1) for the past period T2. Next, the average reception noise power calculation unit 204 searches for the average reception power for the past period T2 that satisfies (Equation 3), and sets the average value as the average reception noise power. α is a parameter that absorbs the variation from the lowest level received power to the noise range.

これにより、近端話者が発声していない区間で、ばらつきを考慮した近端側の環境雑音と回線雑音が重畳した受信信号のパワーを求めることができる。このパワーと比較してパワーが小さい区間においてマイク１０３で収音した信号を近端側の環境雑音とみなし、近端雑音記憶部１０７へ出力する。 As a result, the power of the received signal on which the near-end environmental noise and the line noise are superimposed in consideration of the variation can be obtained in a section where the near-end speaker is not speaking. A signal picked up by the microphone 103 in a section where the power is smaller than this power is regarded as environmental noise on the near end side, and is output to the near end noise storage unit 107.

次に、近端雑音記憶部１０７の詳細な構成を図３を用いて説明する。図３は環境雑音信号の入力ポイント、その入力ポイント進行方向、環境雑音信号の出力ポイント、その出力ポイント進行方向を示す説明図である。近端雑音記憶部１０７では、近端環境雑音信号を順次、リングバッファ上の近端環境雑音入力ポイントのメモリに記憶し、ポインタを近端環境雑音入力ポイント進行方向に進める。同時に環境雑音信号出力ポイント１から環境雑音信号出力ポイント４まで記憶されている環境雑音信号を読み出し、近端雑音加工部１０８へ出力する。その後、環境雑音信号出力ポイント１を環境雑音信号出力ポイント１進行方向へ、環境雑音信号出力ポイント２を環境雑音信号出力ポイント２進行方向へ、環境雑音信号出力ポイント３を環境雑音信号出力ポイント３進行方向へ、環境雑音信号出力ポイント４を環境雑音信号出力ポイント４進行方向へ進める。 Next, a detailed configuration of the near-end noise storage unit 107 will be described with reference to FIG. FIG. 3 is an explanatory diagram showing the input point of the environmental noise signal, the input point traveling direction, the output point of the environmental noise signal, and the output point traveling direction. The near-end noise storage unit 107 sequentially stores near-end environmental noise signals in the memory of the near-end environmental noise input point on the ring buffer, and advances the pointer in the traveling direction of the near-end environmental noise input point. At the same time, the environmental noise signals stored from the environmental noise signal output point 1 to the environmental noise signal output point 4 are read and output to the near-end noise processing unit 108. After that, environmental noise signal output point 1 proceeds in the direction of environmental noise signal output point 1, environmental noise signal output point 2 proceeds in the direction of environmental noise signal output point 2, and environmental noise signal output point 3 proceeds in environmental noise signal output point 3. The environmental noise signal output point 4 is advanced in the direction of travel.

また、記憶された環境雑音をさらに有効に用いるために、進行方向が逆のポインタを設定しても良い。進行方向が逆の環境雑音信号出力ポイント５から環境雑音信号出力ポイント８まで記憶されている環境雑音信号を読み出し、近端雑音加工部１０８へ出力する。その後、環境雑音信号出力ポイント５を環境雑音信号出力ポイント５進行方向へ、環境雑音信号出力ポイント６を環境雑音信号出力ポイント６進行方向へ、環境雑音信号出力ポイント７を環境雑音信号出力ポイント７進行方向へ、環境雑音信号出力ポイント８を環境雑音信号出力ポイント８進行方向へ進める。 Further, in order to use the stored environmental noise more effectively, a pointer having a reverse traveling direction may be set. The environmental noise signal stored from the environmental noise signal output point 5 to the environmental noise signal output point 8 whose traveling direction is reverse is read and output to the near-end noise processing unit 108. Thereafter, the environmental noise signal output point 5 proceeds in the traveling direction of the environmental noise signal output point 5, the environmental noise signal output point 6 travels in the traveling direction of the environmental noise signal output point 6, and the environmental noise signal output point 7 travels in the environmental noise signal output point 7. The environmental noise signal output point 8 is advanced in the direction of travel.

近端雑音加工部１０８では、近端雑音記憶部１０７から出力された複数の環境雑音信号を加算し、加算器１０９へ出力する。加算器１０９では、加算器１０４の出力信号と近端雑音加工部１０８の出力信号とを加算して、ハイブリッド回路１０１を経由し遠端話者へ送信する。 The near-end noise processing unit 108 adds a plurality of environmental noise signals output from the near-end noise storage unit 107 and outputs the result to the adder 109. The adder 109 adds the output signal of the adder 104 and the output signal of the near-end noise processing unit 108 and transmits the result to the far-end speaker via the hybrid circuit 101.

ここで、遠端環境雑音検出部１１３の動作は近端環境雑音検出部１０６と同様の動作を行い、遠端雑音記憶部１１４は近端雑音記憶部１０７と同様の動作を行い、遠端雑音加工部１１５は近端雑音加工部１０８と同様の動作を行い、加算器１１６では、加算器１１１の出力信号と遠端雑音加工部１１５の出力信号とを加算して、スピーカ１０２を経由し近端話者への再生音となる。 Here, the operation of the far-end environmental noise detection unit 113 performs the same operation as that of the near-end environmental noise detection unit 106, and the far-end noise storage unit 114 performs the same operation as that of the near-end noise storage unit 107. The processing unit 115 performs the same operation as the near-end noise processing unit 108, and the adder 116 adds the output signal of the adder 111 and the output signal of the far-end noise processing unit 115 and passes through the speaker 102 to the near-end noise processing unit 108. Plays back to the end speaker.

次に、近端センタークリッピング処理のクリッピングレベルおよび遠端センタークリッピング処理のクリッピングレベルについて説明する。近端環境雑音検出部１０６および遠端環境雑音検出部１１３で、それぞれ近端側、遠端側の環境雑音レベルが求まっている。近端環境雑音検出部１０６の近端環境雑音レベルにより近端センタークリッピング処理部１１０が近端センタークリッピング処理のクリッピングレベルを決定することにより、近端側の雑音と判断された区間ではクリッピング処理され、同じレベルの近端環境雑音が近端雑音加工部１０８から出力され加算されて、遠端話者へハイブリッド回路１０１を経由して送信される。 Next, the clipping level of the near-end center clipping process and the clipping level of the far-end center clipping process will be described. The near-end environmental noise detection unit 106 and the far-end environmental noise detection unit 113 obtain environmental noise levels on the near-end side and the far-end side, respectively. The near-end center clipping processing unit 110 determines the clipping level of the near-end center clipping process based on the near-end environment noise level of the near-end environmental noise detection unit 106, so that clipping processing is performed in the section determined to be near-end side noise. The near-end environmental noise of the same level is output from the near-end noise processing unit 108, added, and transmitted to the far-end speaker via the hybrid circuit 101.

同様に、遠端環境雑音検出部１１３の遠端環境雑音レベルにより遠端センタークリッピング処理部１１７が遠端センタークリッピング処理のクリッピングレベルを決定することにより、遠端側の雑音と判断された区間ではクリッピング処理され、同じレベルの遠端環境雑音が遠端雑音加工部１１５から出力され加算されて、近端話者へスピーカ１０２を経由して再生される。 Similarly, the far-end center clipping processing unit 117 determines the clipping level of the far-end center clipping process based on the far-end environmental noise level of the far-end environmental noise detection unit 113, so that in the section determined as the far-end side noise. Clipping processing is performed, and far-end environmental noise of the same level is output from the far-end noise processing unit 115, added, and reproduced to the near-end speaker via the speaker 102.

次に、遠端環境雑音検出部１１３の動作について図２（ｂ）を用いて述べる。図２（ｂ）は、遠端環境雑音検出部１１３を示すブロック図である。図２（ｂ）において、短時間パワー検出部２０１、短時間パワー記憶部２０２、最低パワー検出部２０３、平均受話雑音パワー検出部２０４は図２（ａ）と同様のものであり、同一符号を付し、説明は省略する。 Next, the operation of the far-end environmental noise detection unit 113 will be described with reference to FIG. FIG. 2B is a block diagram showing the far-end environmental noise detection unit 113. In FIG. 2B, the short-time power detection unit 201, the short-time power storage unit 202, the minimum power detection unit 203, and the average received noise power detection unit 204 are the same as those in FIG. The description is omitted.

遠端環境雑音検出部１１３の動作内容は、近端環境雑音検出部１０６がマイクロフォン１０３からの信号を入力とするのに対し、遠端環境雑音検出部１１３がハイブリッド回路１０１からの信号を入力とする点が異なるが、遠端環境雑音検出部１１３の動作は近端環境雑音検出部１０６の動作と同様である。 The operation content of the far-end environmental noise detection unit 113 is that the near-end environmental noise detection unit 106 receives a signal from the microphone 103 while the far-end environmental noise detection unit 113 receives a signal from the hybrid circuit 101. However, the operation of the far-end environmental noise detection unit 113 is the same as the operation of the near-end environmental noise detection unit 106.

以上のように本実施の形態によれば、音響適応フィルタ１０５と加算器１０４から構成され、遠端話者からの音声信号をリファレンス信号として、マイク１０３から出力される音声信号に対してエコーキャンセルを行う音響エコーキャンセラと、マイク１０３から出力される音声信号から、第１の区間ごとの送信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの送信最低パワーから最小となる送信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの送信最低パワーを平均化した送信パワーより小さい送信パワーの信号区間を検出する近端環境雑音検出部１０６と、近端環境雑音検出部１０６から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める近端雑音記憶部１０７と、近端雑音記憶部１０７の複数の出力ポイントの信号を音響エコーキャンセラ（加算器１０４、音響適応フィルタ１０５）の出力に加算する近端雑音加工部１０８とを有することにより、近端雑音記憶部１０７の複数の出力ポイントから読み出して時間的に異なる環境雑音を音響エコーキャンセラ（加算器１０４、音響適応フィルタ１０５）の出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、音響エコーキャンセラ（加算器１０４、音響適応フィルタ１０５）が出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができる。 As described above, according to the present embodiment, the acoustic adaptive filter 105 and the adder 104 are used, and echo cancellation is performed on the audio signal output from the microphone 103 using the audio signal from the far-end speaker as a reference signal. And a minimum transmission power for each first interval and a minimum transmission power for each of a plurality of first intervals in a second interval wider than the first interval from the acoustic echo canceller that performs the sound and the audio signal output from the microphone 103 The power obtained by adding the first threshold to the minimum transmission power from the second threshold power is used as the second threshold power, and the transmission power obtained by averaging the transmission minimum power for each first section smaller than the second threshold power in the second section A near-end environmental noise detection unit 106 that detects a signal section having a smaller transmission power, and a signal in the signal section output from the near-end environmental noise detection unit 106 Near-end noise storage unit 107 that has a plurality of output points, advances each input point and output point in the direction of time progress, and acoustic echoes of signals at the plurality of output points of near-end noise storage unit 107 By having a near-end noise processing unit 108 that adds to the output of the canceller (adder 104, acoustic adaptive filter 105), environmental noises that differ in time from the plurality of output points of the near-end noise storage unit 107 are acoustically read. Since it can be superimposed on the output signal of the echo canceller (adder 104, acoustic adaptive filter 105) and natural and average environmental noise can be realized, the acoustic echo canceller (adder 104, acoustic adaptive filter) 105) can be a natural noise, and since it is an averaged environmental noise, Even if judgment mistakes section and environmental noise section, can be made inconspicuous part of the judgment mistake.

さらに、近端話者からの信号である近端話者送話信号を２線−４線変換して遠端話者に送信し、遠端話者からの信号である遠端話者受話信号を４線−２線変換して受信するハイブリッド回路１０１と、回線適応フィルタ１１２と加算器１１１から構成され、ハイブリッド回路１０１へ入力される近端話者送話信号をリファレンス信号とし、ハイブリッド回路１０１から出力される遠端話者受話信号をエコーキャンセルする回線エコーキャンセラと、ハイブリッド回路１０１から出力される遠端話者受話信号から、第１の区間ごとの受信最低パワーと、第１の区間より広い第２の区間において複数の第１の区間ごとの受信最低パワーから最小となる受信パワーに第１の閾値を加えたパワーを第２の閾値パワーとし、第２の区間において第２の閾値パワーより小さい第１の区間ごとの受信パワーを平均化した受信パワーより小さい受信パワーの信号区間を検出する遠端環境雑音検出部１１３と、遠端環境雑音検出部１１３から出力される信号区間の信号を入力ポイントから記憶し、複数の出力ポイントを持ち、それぞれの入力ポイントと出力ポイントを時間の進行方向に進める遠端雑音記憶部１１４と、遠端雑音記憶部１１４の複数の出力ポイントの信号を回線エコーキャンセラ（加算器１１１、回線適応フィルタ１１２）の出力に加算する遠端雑音加工部１１５とを有することにより、遠端雑音記憶部１１４の複数の出力ポイントから読み出して時間的に異なる環境雑音を回線エコーキャンセラ（加算器１１１、回線適応フィルタ１１２）の出力信号に重畳させることができ、自然で且つ平均化された環境雑音を実現することができるので、回線エコーキャンセラ（加算器１１１、回線適応フィルタ１１２）が出力する雑音を自然な雑音とすることができ、また、平均化された環境雑音であるので、発声区間と環境雑音区間の判定間違いがあっても、判定間違いの部分を目立たなくすることができる。 Further, the near-end speaker transmission signal, which is a signal from the near-end speaker, is subjected to 2-wire-4 wire conversion and transmitted to the far-end speaker, and the far-end speaker reception signal, which is a signal from the far-end speaker. 4 is a hybrid circuit 101 that receives the signal after four-line to two-line conversion, a line adaptive filter 112, and an adder 111. The hybrid circuit 101 uses a near-end speaker transmission signal input to the hybrid circuit 101 as a reference signal. From the line echo canceller for echo-cancelling the far-end speaker received signal output from, and from the far-end speaker received signal output from the hybrid circuit 101, the minimum received power for each first section and the first section The power obtained by adding the first threshold to the minimum received power from the lowest received power for each of the plurality of first intervals in the wide second interval is defined as the second threshold power, and the second threshold is determined in the second interval. A far-end environmental noise detection unit 113 for detecting a signal section with a reception power smaller than the reception power obtained by averaging the reception power for each first section smaller than the power, and a signal section output from the far-end environmental noise detection unit 113 A signal is stored from the input point, has a plurality of output points, and advances the respective input points and output points in the direction of time travel, and signals from a plurality of output points of the far end noise storage unit 114 And a far-end noise processing unit 115 for adding the signal to the output of the line echo canceller (adder 111, line adaptive filter 112), so that the environment is different in time by reading from a plurality of output points of the far-end noise storage unit 114. Noise can be superimposed on the output signal of the line echo canceller (adder 111, line adaptive filter 112). Therefore, the noise output from the line echo canceller (adder 111, line adaptive filter 112) can be natural noise, and the averaged environmental noise can be realized. Therefore, even if there is a determination error between the utterance section and the environmental noise section, the determination error portion can be made inconspicuous.

さらに、近端雑音記憶部１０７は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることにより、記憶された環境雑音を更に有効に用いることができる。 Furthermore, the near-end noise storage unit 107 can use the stored environmental noise more effectively by advancing at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input points.

さらに、遠端雑音記憶部１１４は、複数の出力ポイントのうち少なくとも一つの進行方向を入力ポイントの進行方向と逆方向に進めることにより、記憶された環境雑音を更に有効に用いることができる。 Furthermore, the far-end noise storage unit 114 can further effectively use the stored environmental noise by advancing at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input point.

さらに、近端環境雑音検出部１０６で検出した平均化した送信パワーのレベルに対応する振幅を用いて、音響エコーキャンセラ出力をセンタークリッピング処理する近端センタークリッピング処理部１１０を備えたことにより、送信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができる。 Further, by including a near-end center clipping processing unit 110 that performs center clipping processing on the acoustic echo canceller output using the amplitude corresponding to the averaged transmission power level detected by the near-end environmental noise detection unit 106, transmission is performed. The clipping level can be synchronized with the speech level and the environmental noise interval according to the power level, and environmental noise with the same level of averaged features can be added. Can be reduced.

さらに、遠端環境雑音検出部１１３で検出した平均化した受信パワーのレベルに対応する振幅を用いて、回線エコーキャンセラ出力をセンタークリッピング処理する遠端センタークリッピング処理部１１７を備えたことにより、受信パワーレベルにより発声区間と環境雑音区間に分離する際のレベルとクリッピングレベルを同期させることができ、同じレベルの平均化された特徴を持つ環境雑音を加算することができるので、クリッピングの不自然さを減少させることができる。 In addition, since a far-end center clipping processing unit 117 that performs center-clipping processing on the line echo canceller output using the amplitude corresponding to the average received power level detected by the far-end environmental noise detection unit 113 is provided. The clipping level can be synchronized with the speech level and the environmental noise interval according to the power level, and environmental noise with the same level of averaged features can be added. Can be reduced.

本発明は、環境雑音のある中でも不自然な雑音を受信および送信することのない会議電話装置に関し、音響エコーキャンセラまたは回線エコーキャンセラが出力する雑音を自然な雑音とすることができる。 The present invention relates to a conference telephone apparatus that does not receive and transmit unnatural noise even in the presence of environmental noise, and can make noise output from an acoustic echo canceller or line echo canceller natural noise.

本発明の実施の形態１による会議電話装置を示すブロック図1 is a block diagram showing a conference telephone device according to Embodiment 1 of the present invention. （ａ）近端環境雑音検出部を示すブロック図、（ｂ）遠端環境雑音検出部を示すブロック図(A) Block diagram showing the near-end environmental noise detection unit, (b) Block diagram showing the far-end environmental noise detection unit 環境雑音信号の入力ポイント、その入力ポイント進行方向、環境雑音信号の出力ポイント、その出力ポイント進行方向を示す説明図An explanatory diagram showing the input point of the environmental noise signal, the input point travel direction, the output point of the environmental noise signal, and the output point travel direction

Explanation of symbols

１０１ハイブリッド回路
１０２スピーカ
１０３マイクロホン（マイク）
１０４、１０９、１１１、１１６加算器
１０５音響適応フィルタ
１０６近端環境雑音検出部
１０７近端雑音記憶部
１０８近端雑音加工部
１１０近端センタークリッピング処理部
１１２回線適応フィルタ
１１３遠端環境雑音検出部
１１４遠端雑音記憶部
１１５遠端雑音加工部
１１７遠端センタークリッピング処理部
２０１短時間パワー検出部
２０２短時間パワー記憶部
２０３最低パワー検出部
２０４平均受話雑音パワー計算部 101 Hybrid circuit 102 Speaker 103 Microphone (microphone)
104, 109, 111, 116 Adder 105 Acoustic adaptive filter 106 Near-end environmental noise detection unit 107 Near-end noise storage unit 108 Near-end noise processing unit 110 Near-end center clipping processing unit 112 Line adaptive filter 113 Far-end environmental noise detection unit 114 Far-end noise storage unit 115 Far-end noise processing unit 117 Far-end center clipping processing unit 201 Short-time power detection unit 202 Short-time power storage unit 203 Minimum power detection unit 204 Average received noise power calculation unit

Claims

A speaker that reproduces a voice signal from a far-end speaker as voice;
A microphone that picks up the voice reproduced by the speaker and the voice uttered by a near-end speaker and outputs a voice signal;
An acoustic echo canceller configured with an adaptive filter and an adder, using the voice signal from the far-end speaker as a reference signal, and performing echo cancellation on the voice signal output from the microphone;
From the audio signal output from the microphone, the transmission becomes the minimum from the minimum transmission power for each first interval and the minimum transmission power for each of the plurality of first intervals in the second interval wider than the first interval. The power obtained by adding the first threshold value to the power is set as the second threshold power, and is smaller than the transmission power obtained by averaging the minimum transmission power for each of the first intervals smaller than the second threshold power in the second interval. A near-end environmental noise detection unit for detecting a signal section of transmission power;
A signal of a signal section output from the near-end environmental noise detection unit is stored from an input point, has a plurality of output points, and a near-end noise storage unit that advances each input point and output point in the direction of time travel,
A conference telephone apparatus comprising: a near-end noise processing unit that adds signals of a plurality of output points of the near-end noise storage unit to an output of the acoustic echo canceller.

The near-end talker transmission signal, which is a signal from the near-end talker, is 2-wire to 4-wire converted and transmitted to the far-end talker, and the far-end talker reception signal, which is a signal from the far-end talker, is 4 A hybrid circuit that receives a line-to-line conversion;
A line echo canceller comprising an adaptive filter and an adder, wherein the near-end speaker transmission signal input to the hybrid circuit is used as a reference signal, and the far-end speaker reception signal output from the hybrid circuit is echo-cancelled When,
From the far-end speaker reception signal output from the hybrid circuit, the lowest reception power for each first interval and the lowest reception for each of the plurality of first intervals in a second interval wider than the first interval. The power obtained by adding the first threshold value to the minimum received power from the power is defined as the second threshold power, and the received power for each of the first intervals smaller than the second threshold power in the second interval is averaged. A far-end environmental noise detector that detects a signal section of received power smaller than the received power;
A signal of a signal section output from the far-end environmental noise detection unit is stored from an input point, has a plurality of output points, and a far-end noise storage unit that advances each input point and output point in the direction of time travel,
A conference telephone apparatus comprising: a far-end noise processing unit that adds signals at a plurality of output points of the far-end noise storage unit to an output of the line echo canceller.

The conference telephone device according to claim 1, wherein the near-end noise storage unit advances a traveling direction of at least one of the plurality of output points in a direction opposite to the traveling direction of the input point.

The conference telephone device according to claim 2, wherein the far-end noise storage unit advances at least one traveling direction of the plurality of output points in a direction opposite to the traveling direction of the input points.

A near-end center clipping processing unit that performs center clipping on the output of the acoustic echo canceller using an amplitude corresponding to an averaged transmission power level detected by the near-end environmental noise detection unit. Item 2. The conference telephone device according to Item 1.

5. A far-end center clipping processing unit that performs center clipping processing on the line echo canceller output using an amplitude corresponding to the level of the average received power detected by the far-end environmental noise detection unit. Item 3. The conference telephone device according to Item 2.