JP2006217649A

JP2006217649A - Signal processor

Info

Publication number: JP2006217649A
Application number: JP2006077342A
Authority: JP
Inventors: Hitoshi Nagata; 仁史永田
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2006-03-20
Filing date: 2006-03-20
Publication date: 2006-08-17
Anticipated expiration: 2017-01-22
Also published as: JP4256400B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a signal processor which surely suppresses only the noise, by setting an allowable arrival angle range of a target signal with high precision by a small amount of operation, when the noise is suppressed by performing filter operation processing to signals of a plurality of channels. <P>SOLUTION: Noise is suppressed by an adaptive beam former 11 and the filter operation processing for outptting the signal from a target sound source, inputted from the input direction perpendicular to a straight line connecting the two microphones, is performed to signals of two channels from a microphone array, consisting of two microphones arranged by differing the directions of the directivity of each. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、複数チャネルの信号を入力して雑音を抑圧し目的信号のみを抽出する処理を行う信号処理装置に係り、特に音声認識装置やテレビ会議装置などの音声入力のためにマイクロホンアレイを用いて雑音を抑圧し目的の音声を抽出するマイクロホンアレイ処理装置に適した信号処理装置に関する。 The present invention relates to a signal processing apparatus that inputs a signal of a plurality of channels and suppresses noise and extracts only a target signal. In particular, the present invention uses a microphone array for voice input of a voice recognition apparatus, a video conference apparatus, and the like. The present invention relates to a signal processing apparatus suitable for a microphone array processing apparatus that suppresses noise and extracts a target voice.

配列された複数のマイクロホンからなるマイクロホンアレイを用い、このマイクロホンアレイの出力信号を処理することにより、雑音を抑圧して目的音源からの目的信号のみを抽出するマイクロホンアレイ処理装置は、音声認識装置やテレビ会議装置などにおける音声入力を目的として開発が進められている。中でも、少ないマイクロホン数で大きな雑音抑圧効果が得られる適応ビームフォーマを利用したマイクロホンアレイ処理装置については、例えば文献１：電子情報通信学会編「音響システムとデジタル処理」や、文献２：Heykin著、“Adaptive Filter Theory(Plentice Hall) ”に述べられているように、一般化サイドローブキャンセラ、フロスト型ビームフォーマおよび参照信号法など、種々の方法が知られている。 A microphone array processing apparatus that extracts a target signal from a target sound source by suppressing a noise by using a microphone array composed of a plurality of arranged microphones and processing an output signal of the microphone array is a speech recognition apparatus or Development is underway for the purpose of voice input in video conferencing equipment. Among them, for microphone array processing apparatuses using an adaptive beamformer that can obtain a large noise suppression effect with a small number of microphones, for example, Reference 1: Electronic Information Communication Society edited by “Acoustic System and Digital Processing”, Reference 2: Heykin, As described in “Adaptive Filter Theory (Plentice Hall)”, various methods such as a generalized sidelobe canceller, a frosted beamformer, and a reference signal method are known.

適応ビームフォーマ処理は、基本的には雑音源の方向に死角を有する指向性ビームを形成したフィルタにより雑音を抑圧する処理であり、その中でも特に一般化サイドローブキャンセラ（ＧＳＣ）は、比較的性能が良いことで知られている。しかし、ＧＳＣでは設定された目的音源の方向とずれた方向から目的信号が到来した際に、目的信号がキャンセルされて劣化するという問題点がある。これを防ぐため、設定された目的音源方向の近傍の方向から到来する信号があったときにはＧＳＣの適応を停止させる必要がある。そこで、例えば文献３：Julie E.Greenburg et.al.: “Evaluation of anadaptive beamforming method for heating aids”,pp.1662-1676,Jarnul of Acous.Soc.of Am.91(3),1992に開示されている補聴器用のマイクロホンアレイ処理装置では、二つのマイクロホン出力信号間の相関に基づいたしきい値処理により適応停止を実現している。 The adaptive beamformer processing is basically processing for suppressing noise by a filter in which a directional beam having a blind spot in the direction of the noise source is formed. Among them, the generalized sidelobe canceller (GSC) in particular has relatively high performance. Is known for being good. However, GSC has a problem that when the target signal arrives from a direction deviating from the set direction of the target sound source, the target signal is canceled and deteriorated. In order to prevent this, it is necessary to stop the adaptation of GSC when there is a signal arriving from a direction near the set target sound source direction. Therefore, for example, it is disclosed in Reference 3: Julie E. Greenburg et.al .: “Evaluation of anadaptive beamforming method for heating aids”, pp.1662-1676, Jarnul of Acous. Soc. Of Am. 91 (3), 1992. In the hearing aid microphone array processing apparatus, adaptive stop is realized by threshold processing based on the correlation between two microphone output signals.

しかしながら、このようにＧＳＣの適応停止を相関に基づいて行う方法では、最適なしきい値の設定は難しい。すなわち、二つのマイクロホン出力信号間の相関の値は、目的音源の方向が同じで目的信号が一定方向から到来する場合でも、目的信号の強度や雑音の状況により変化するため、目的信号の到来を許容する角度範囲を精度よく設定することができない。従って、例えば音声入力中に雑音の状況が変わった場合、今まで入力できていた音声が一時的に入力できなくなることがあり、安定性の面で問題がある。 However, it is difficult to set an optimum threshold value in the method in which adaptive stop of GSC is performed based on the correlation as described above. That is, the value of the correlation between the two microphone output signals changes depending on the intensity of the target signal and the noise situation even when the target sound source has the same direction and the target signal arrives from a certain direction. The allowable angle range cannot be set with high accuracy. Therefore, for example, if the noise situation changes during voice input, the voice that could be input up to now may not be input temporarily, which is problematic in terms of stability.

また、適応ビームフォーマは設定した目的音源の方向と実際の目的信号到来方向がずれた場合、周波数特性が平坦とならずに歪むことがある。このような周波数特性の歪みが存在すると、例えば適応ビームフォーマを音声認識の前処理に使う場合、認識性能を低下させるおそれがある。この問題に対しては、多数のセンサを用いて音源方向を推定する方向検出処理に基づいて、適応ビームフォーマに設定される目的音源の方向を定めることにより、設定される目的音源の方向を目的信号の実際の到来方向に自動追随させる方法が提案されている。方向検出処理の他の方式として、高分解能方式と呼ばれる手法も知られている。 Also, the adaptive beamformer may be distorted without flattening the frequency characteristics when the set direction of the target sound source and the actual direction of arrival of the target signal deviate. If such distortion of frequency characteristics exists, for example, when an adaptive beamformer is used for preprocessing of speech recognition, there is a possibility that the recognition performance is lowered. To solve this problem, the direction of the target sound source to be set is determined by determining the direction of the target sound source set in the adaptive beamformer based on the direction detection process that estimates the sound source direction using a large number of sensors. A method for automatically following the direction of arrival of a signal has been proposed. As another method of direction detection processing, a method called a high resolution method is also known.

しかし、センサを用いて目的音源の方向を推定する方法は、センサ数が例えば２個と少ない場合には精度よく目的音源の方向を検出するのは困難であり、また高分解能方式では入力信号の相関行列の推定と空間走査処理のために目的音源の方向検出に要する演算量が多くなるという問題がある。 However, in the method of estimating the direction of the target sound source using the sensor, it is difficult to accurately detect the direction of the target sound source when the number of sensors is as small as two, for example. There is a problem that the amount of calculation required for detecting the direction of the target sound source for estimation of the correlation matrix and spatial scanning processing increases.

上述したように、従来のＧＳＣを用いた適応ビームフォーマ処理により雑音を抑圧するマイクロホンアレイ処理装置において、設定された目的音源方向とずれた方向から目的信号が到来した際に目的信号がキャンセルされて劣化する現象を防ぐために、二つのマイクロホン出力信号間の相関に基づいたしきい値処理により、設定された目的音源方向の近傍の方向から到来する信号があったときＧＳＣの適応を停止させる方法は、目的信号が一定方向から到来する場合でも相関の値が目的信号の強度や雑音の状況により変化するため、目的信号の許容到来角度範囲を精度よく設定することができず、音声入力中に雑音の状況が変わった場合に音声が一時的に入力できなくなるなど、安定性の面で問題があった。 As described above, in the microphone array processing apparatus that suppresses noise by adaptive beamformer processing using the conventional GSC, the target signal is canceled when the target signal arrives from a direction deviating from the set target sound source direction. In order to prevent the phenomenon of deterioration, a method for stopping GSC adaptation when there is a signal arriving from a direction near the set target sound source direction by threshold processing based on the correlation between two microphone output signals is as follows. Even if the target signal arrives from a certain direction, the correlation value changes depending on the strength of the target signal and the noise condition, so the allowable arrival angle range of the target signal cannot be set accurately, and noise is generated during voice input. There was a problem in terms of stability, such as temporarily disabling voice input when the situation changed.

一方、多数のセンサによる目的音源の方向検出と適応ビームフォーマを組み合わせることにより目的音源の方向に自動追随する方式においては、センサ数が少ない場合に精度よく目的音源の方向を検出することが難しく、さらに高分解能方式では入力信号の相関行列の推定と空間走査処理のために目的音源の方向検出に要する演算量が多くなるという問題があった。 On the other hand, in the method of automatically following the direction of the target sound source by combining the direction detection of the target sound source with many sensors and an adaptive beamformer, it is difficult to detect the direction of the target sound source accurately when the number of sensors is small, Furthermore, the high resolution method has a problem that the amount of calculation required for detecting the direction of the target sound source for estimation of the correlation matrix of the input signal and spatial scanning processing increases.

本発明は上記の問題点を解決するためになされたもので、複数チャネルの信号に対しフィルタ演算処理を施して雑音を抑圧する場合に、雑音のみを確実に抑圧することを可能とした信号処理装置を提供することにある。 The present invention has been made to solve the above-described problems, and in the case where noise is suppressed by performing filter operation processing on a signal of a plurality of channels, it is possible to reliably suppress only noise. To provide an apparatus.

より具体的には、少ない演算量で目的信号の許容到来角度範囲を高精度に設定できる信号処理装置を提供することにある。 More specifically, an object of the present invention is to provide a signal processing apparatus that can set an allowable arrival angle range of a target signal with high accuracy with a small amount of calculation.

また、少ない演算量で目的信号の到来方向を高精度に推定できる信号処理装置を提供することにある。 It is another object of the present invention to provide a signal processing apparatus that can accurately estimate the direction of arrival of a target signal with a small amount of calculation.

上記の課題を解決するため、本発明に係る第１の信号処理装置は、それぞれの指向性の向きを異ならせて配置された２つのマイクロホンよりなるマイクロホンアレイと、前記マイクロホンアレイからの２チャネルの信号に対して雑音を抑圧し、前記２つのマイクロホンを結ぶ直線に垂直な入力方向より入力される目的音源からの信号を出力するためのフィルタ演算処理を施す適応ビームフォーマとを具備することを特徴とする。 In order to solve the above-described problem, a first signal processing apparatus according to the present invention includes a microphone array including two microphones arranged with different directivity directions, and two channels from the microphone array. And an adaptive beamformer that performs a filter calculation process for suppressing a noise with respect to a signal and outputting a signal from a target sound source input from an input direction perpendicular to a straight line connecting the two microphones. And

本発明に係る第２の信号処理装置は、それぞれの指向性の向きを異ならせて配置された２つのマイクロホンよりなるマイクロホンアレイと、前記マイクロホンアレイからの２チャネルの信号に対して雑音を抑圧し、前記２つのマイクロホンを結ぶ直線に一致した入力方向または該直線に９０°以外の任意の角度をなす入力方向より入力される目的音源からの信号を出力するためのフィルタ演算処理を施す適応ビームフォーマと、前記方向から信号が到来する場合に前記適応ビームフォーマに入力される２チャネルの信号の位相を一致させるための遅延器とを具備することを特徴とする。 A second signal processing apparatus according to the present invention suppresses noise with respect to a microphone array composed of two microphones arranged with different directivity directions and two-channel signals from the microphone array. An adaptive beamformer that performs a filter calculation process for outputting a signal from a target sound source that is input from an input direction that coincides with a straight line connecting the two microphones or an input direction that forms an arbitrary angle other than 90 ° to the straight line And a delay unit for matching the phases of the two-channel signals input to the adaptive beamformer when signals arrive from the direction.

本発明に係る第３の信号処理装置は、複数チャネルの信号に対して雑音を抑圧し、設定された入力方向より入力される目的音源からの信号を出力するためのフィルタ演算処理を施す第１の適応ビームフォーマと、前記複数チャネルの信号に対して前記目的音源からの信号を抑圧し、設定された入力方向より入力される雑音源からの信号を出力するためのフィルタ演算処理を施す第２の適応ビームフォーマと、前記複数チャネルの信号の伝播位相遅れを表す方向ベクトルと前記第１の適応ビームフォーマにより生成されるフィルタ係数の各周波数成分との内積の絶対値の２乗を周波数について加算して得られる方向毎の感度が最小となる方向を雑音源の方向と推定する雑音源方向推定手段と、前記方向ベクトルと前記第２の適応ビームフォーマにより生成されるフィルタ係数の各周波数成分との内積の絶対値の２乗を周波数について加算して得られる方向毎の感度が最小となる方向を前記目的音源の方向と推定する目的音源方向推定手段と、前記雑音源方向推定手段により推定された前記雑音源の方向を前記第２の適応ビームフォーマの入力方向として設定する第１の制御手段と、前記目的音源方向推定手段により推定された前記目的音源の方向を前記第１の適応ビームフォーマの入力方向として設定する第２の制御手段とを具備することを特徴とする。 The third signal processing apparatus according to the present invention is a first signal processing unit that performs a filter calculation process for suppressing noise with respect to a signal of a plurality of channels and outputting a signal from a target sound source input from a set input direction. A second adaptive beamformer, and a filter calculation process for suppressing a signal from the target sound source and outputting a signal from a noise source input from a set input direction with respect to the signals of the plurality of channels. The square of the absolute value of the inner product of the adaptive beamformer, the direction vector representing the propagation phase delay of the signals of the plurality of channels, and the frequency components of the filter coefficients generated by the first adaptive beamformer with respect to the frequency Noise source direction estimating means for estimating the direction in which the sensitivity for each direction obtained as a minimum is the direction of the noise source, the direction vector, and the second adaptive beamformer Target sound source direction estimating means for estimating the direction of the target sound source as the direction in which the sensitivity for each direction obtained by adding the square of the absolute value of the inner product of the filter coefficient generated by each frequency component to the frequency is minimized. First control means for setting the direction of the noise source estimated by the noise source direction estimation means as an input direction of the second adaptive beamformer, and the purpose estimated by the target sound source direction estimation means And second control means for setting the direction of the sound source as the input direction of the first adaptive beamformer.

本発明によれば複数チャネルの信号に対しフィルタ演算処理を施して雑音を抑圧する場合に雑音のみを確実に抑圧可能であり、具体的には少ない演算量で目的信号の到来方向を高精度に推定することができる。 According to the present invention, it is possible to reliably suppress only noise when performing filter arithmetic processing on a signal of a plurality of channels to suppress noise. Specifically, the arrival direction of a target signal can be accurately determined with a small amount of calculation. Can be estimated.

すなわち、本発明ではマイクロホンアレイ処理装置において、マイクロホンアレイをそれぞれの指向性の向きを異ならせて配置された複数のマイクロホンにより構成し、このマイクロホンアレイからの複数チャネルの信号に対して雑音を抑圧するためのフィルタ演算処理を施すことにより、いわゆる空間的エリアジングの影響を緩和して、全ての周波数において雑音源の方向の感度を低く抑えることができる。 That is, according to the present invention, in the microphone array processing apparatus, the microphone array is configured by a plurality of microphones arranged with different directivity directions, and noise is suppressed with respect to signals of a plurality of channels from the microphone array. By performing the filter calculation process for this purpose, it is possible to alleviate the influence of so-called spatial aliasing and to suppress the sensitivity of the direction of the noise source at all frequencies.

さらに、本発明によれば、雑音を抑圧して目的音源からの信号を出力するための第１のフィルタ演算手段とは別に、目的音源からの信号を抑圧する第２のフィルタ演算手段を設けて、第２のフィルタ演算手段内のフィルタの指向性から目的音源の方向を推定し、その推定結果に基づいて第１のフィルタ演算手段の設定を行うことにより、相関行列の推定や逆行列、固有値展開などの演算を必要とせず、またチャネル数が少ない場合でも、少ない演算量で目的音源からの信号に追随することが容易に可能となる。 Further, according to the present invention, the second filter calculation means for suppressing the signal from the target sound source is provided separately from the first filter calculation means for suppressing the noise and outputting the signal from the target sound source. By estimating the direction of the target sound source from the directivity of the filter in the second filter calculation means and setting the first filter calculation means based on the estimation result, the correlation matrix estimation, inverse matrix, eigenvalue It is possible to easily follow the signal from the target sound source with a small amount of computation even when computation such as expansion is not required and the number of channels is small.

以下、図面を参照して本発明の実施形態を説明する。
（第１の実施形態）
本実施形態では、適応ビームフォーマにおける目的信号の誤キャンセル動作を防ぐため、複数チャネルの信号間のインパルス応答に基づいて適応ビームフォーマの制御を行う信号処理装置について説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.
(First embodiment)
In the present embodiment, a signal processing apparatus that controls an adaptive beamformer based on an impulse response between signals of a plurality of channels in order to prevent erroneous cancellation of a target signal in the adaptive beamformer will be described.

本実施形態に係る信号処理装置は、図１に示されるように入力端子１０−１〜１０ｎからのｎチャネルｃｈ１〜ｃｈｎの信号を入力とし、これらの信号に対して雑音を抑圧するためのフィルタ演算処理（適応ビームフォーマ処理）を施す適応ビームフォーマ１１と、チャネルｃｈ１〜ｃｈｎの信号間のインパルス応答を推定するインパルス応答推定部１２と、このインパルス応答推定部１２の推定結果に従って適応ビームフォーマ１１の適応処理を制御する制御部１３とにより構成される。 As shown in FIG. 1, the signal processing apparatus according to the present embodiment receives n-channel ch1 to chn signals from input terminals 10-1 to 10n as input, and filters for suppressing noise with respect to these signals. An adaptive beamformer 11 that performs arithmetic processing (adaptive beamformer processing), an impulse response estimation unit 12 that estimates an impulse response between signals of channels ch1 to chn, and an adaptive beamformer 11 according to the estimation result of the impulse response estimation unit 12 And a control unit 13 for controlling the adaptive processing.

この信号処理装置は、具体的には例えばマイクロホンアレイ処理装置であり、入力端子１０−１〜１０ｎには、図示しないｎ個のマイクロホンからなるマイクロホンアレイからのｎチャネルｃｈ１〜ｃｈｎの信号が入力される。ここでは、簡単のためチャネル数ｎが２の場合を例にとり説明する。 Specifically, this signal processing apparatus is, for example, a microphone array processing apparatus, and n-channel ch1 to chn signals from a microphone array including n microphones (not shown) are input to the input terminals 10-1 to 10n. The Here, for simplicity, a case where the number of channels n is 2 will be described as an example.

適応ビームフォーマ１１の内部の処理方法は、例えば先に挙げた文献１や２に述べられているように種々の方法が知られており、一般化サイドローブキャンセラ（ＧＳＣ）、適応ノイズキャンセラ、フロスト型ビームフォーマおよび参照信号法などがある。本実施形態は適応ビームフォーマであればどのようなものにも適用が可能であるが、ここでは２チャネルのＧＳＣを例にとり説明する。 As the processing method inside the adaptive beamformer 11, for example, various methods are known as described in the above-mentioned documents 1 and 2, and a generalized sidelobe canceller (GSC), adaptive noise canceller, and frost type are known. There are beamformers and reference signal methods. The present embodiment can be applied to any adaptive beamformer, but here, a two-channel GSC will be described as an example.

図２に、適応ビームフォーマ１１の例として、２チャネルのＧＳＣの中で一般的なJim-Griffith型のＧＳＣの構成を示す。これは例えば文献２に示されているように、減算器２１、加算器２２、遅延器２３、適応フィルタ２４および減算器２５からなるＧＳＣである。適応フィルタ２４はＬＭＳ、ＲＬＳ、射影型ＬＭＳなど種々のものが使用可能であり、フィルタ長Ｌａは例えばＬａ＝５０である。遅延器２３の遅延量は例えばＬａ／２である。 FIG. 2 shows a configuration of a general Jim-Griffith type GSC in a two-channel GSC as an example of the adaptive beamformer 11. This is a GSC composed of a subtracter 21, an adder 22, a delay unit 23, an adaptive filter 24 and a subtracter 25, as shown in Document 2, for example. Various adaptive filters 24 such as LMS, RLS, and projective LMS can be used, and the filter length La is, for example, La = 50. The delay amount of the delay unit 23 is, for example, La / 2.

図３に、インパルス応答推定部１２の構成を示す。このインパルス応答推定部１２は複数チャネルの信号間のインパルス応答、この例では第１チャネルｃｈ１と第２チャネルｃｈ２の信号間のインパルス応答を逐次推定するものであり、遅延器３１と適応フィルタ３２および減算器３３からなる。なお、インパルス応答の推定にはクロススペクトル法なども適用可能であるが、演算量の点から図３のように適応フィルタ３２を用いることが望ましい。適応フィルタ３２の種類は、先に述べたように何でもよいが、フィルタ長Ｌｂは適応ビームフォーマ１１における図２中の適応フィルタ２４のフィルタ長Ｌａより小さくすることが可能であり、例えばＬｂ＝１０が用いられる。これにより、インパルス応答推定部１２の演算量は非常に小さくできる。 FIG. 3 shows the configuration of the impulse response estimation unit 12. The impulse response estimation unit 12 sequentially estimates an impulse response between signals of a plurality of channels, in this example, an impulse response between signals of the first channel ch1 and the second channel ch2, and includes a delay unit 31, an adaptive filter 32, and It consists of a subtracter 33. In addition, although the cross spectrum method etc. are applicable to estimation of an impulse response, it is desirable to use the adaptive filter 32 as shown in FIG. 3 from the point of computational complexity. The type of the adaptive filter 32 may be anything as described above, but the filter length Lb can be made smaller than the filter length La of the adaptive filter 24 in FIG. 2 in the adaptive beamformer 11, for example, Lb = 10 Is used. Thereby, the amount of calculation of the impulse response estimation part 12 can be made very small.

本実施形態においては、二つのチャネルｃｈ１，ｃｈ２の入力信号により適応ビームフォーマ１１のビームフォーマ処理と並行してインパルス応答推定部１２によるインパルス応答の推定を行い、推定したインパルス応答の成分の値に基づいて、制御部１３により適応ビームフォーマ１１の適応の制御を行う。この制御は、より詳しくは適応ビームフォーマ１１の適応停止および適応続行の制御である。この適応ビームフォーマ１１の適応の制御は、適応フィルタ２４にＬＭＳ適応フィルタや射影ＬＭＳ適応フィルタを用いている場合には、ステップサイズμの値を変えることにより行うことができる。 In the present embodiment, the impulse response is estimated by the impulse response estimation unit 12 in parallel with the beamformer processing of the adaptive beamformer 11 by using the input signals of the two channels ch1 and ch2, and the estimated impulse response component values are obtained. Based on this, the control unit 13 performs adaptive control of the adaptive beamformer 11. More specifically, this control is control of adaptive stop and adaptive continuation of the adaptive beamformer 11. The adaptive control of the adaptive beamformer 11 can be performed by changing the value of the step size μ when an LMS adaptive filter or a projection LMS adaptive filter is used as the adaptive filter 24.

図１の適応ビームフォーマ１１を構成する図２に示した２チャネルのJim-Griffith型ＧＳＣの適応フィルタ２４にＬＭＳ適応フィルタを用いた場合、この適応フィルタの更新は、時刻をｎとして適応フィルタ２４の係数をＷ（ｎ）、第ｉチャネルの入力信号をｘ_i（ｎ）、第ｉチャネルの入力信号ベクトルをＸ_i（ｎ）＝（ｘ_i（ｎ−Ｌ−１），ｘ_i（ｎ−Ｌ−１），…，ｘ_i（ｎ−Ｌ−２））とおくと、次式で表される。 When an LMS adaptive filter is used as the adaptive filter 24 of the two-channel Jim-Griffith GSC shown in FIG. 2 constituting the adaptive beamformer 11 of FIG. 1, the adaptive filter is updated by setting the time to n as the adaptive filter 24. , The i-th channel input signal x _i (n), and the i-th channel input signal vector X _i (n) = (x _i (n−L−1), x _i (n −L−1),..., X _i (n−L−2)), it is expressed by the following equation.

ｙ（ｎ）＝ｘ₀（ｎ）＋ｘ₁（ｎ）
Ｘ′（ｎ）＝Ｘ₁（ｎ）−Ｘ₀（ｎ）
ｅ（ｎ）＝ｙ（ｎ）−Ｗ（ｎ）Ｘ′（ｎ）
Ｗ（ｎ＋１）＝Ｗ（ｎ）−μＸ′（ｎ）ｅ（ｎ）（１）
ここで、ステップサイズμを小さくすれば適応が遅くなるので、このμの値をインパルス応答の成分によって変化させる。 y (n) = x ₀ (n) + x ₁ (n)
X ′ (n) = X ₁ (n) −X ₀ (n)
e (n) = y (n) -W (n) X '(n)
W (n + 1) = W (n) −μX ′ (n) e (n) (1)
Here, since the adaptation becomes slow if the step size μ is reduced, the value of μ is changed according to the component of the impulse response.

一方、インパルス応答推定部１２でのインパルス応答の推定も、同様に図３の適応フィルタ３２にＬＭＳ適応フィルタを用いて行うことができる。このとき、図３に示すようにインパルス応答推定に用いる二つのチャネルｃｈ１，ｃｈ２のうちの一方のチャネルｃｈ１の信号を遅延器３１により例えばフィルタ長Ｌｂの半分Ｌｂ／２だけ遅延させて目的信号とし、他方のチャネルｃｈ２の信号を遅延させずに参照信号として用いる。その場合、インパルス応答推定部１２の適応フィルタ３２のフィルタ係数をＷｂとおくと、チャネルｃｈ１，ｃｈ２間の時間遅れ０の成分は、ＷｂのうちのＬｂ／２番目の値Ｗｂ（Ｌｂ／２）となる。 On the other hand, the impulse response estimation in the impulse response estimation unit 12 can be similarly performed using an LMS adaptive filter as the adaptive filter 32 in FIG. At this time, as shown in FIG. 3, the signal of one channel ch1 of the two channels ch1 and ch2 used for impulse response estimation is delayed by a delay unit 31 by, for example, half Lb / 2 of the filter length Lb to obtain the target signal. The signal of the other channel ch2 is used as a reference signal without being delayed. In this case, if the filter coefficient of the adaptive filter 32 of the impulse response estimation unit 12 is set to Wb, the component of the time delay 0 between the channels ch1 and ch2 is the Lb / 2th value Wb (Lb / 2) of Wb. It becomes.

図１の制御部１３において、適応ビームフォーマ１１の適応の制御にインパルス応答推定部１２で推定されたインパルス応答のどの成分値を用いるかは、目的信号についてのチャネルｃｈ１，ｃｈ２間の時間遅れに基づいて決めるようにすればよい。 In the control unit 13 of FIG. 1, which component value of the impulse response estimated by the impulse response estimation unit 12 is used for adaptive control of the adaptive beamformer 11 depends on the time delay between the channels ch1 and ch2 for the target signal. You may decide based on it.

例えば、図４に示すように二つのチャネルｃｈ１，ｃｈ２の信号としてマイクロホンＭ１，Ｍ２からの信号が入力されており、両マイクロホンＭ１，Ｍ２を結ぶ線と垂直な方向を目的信号の到来方向（目的音源の方向）と設定した場合、インパルス応答推定部１２で推定されたインパルス応答の成分のうち、時間遅れ０に相当する１個の成分Ｗｂ（Ｌｂ／２）の値を適応ビームフォーマ１１の適応の制御に使うようにする。 For example, as shown in FIG. 4, signals from microphones M1 and M2 are input as signals of two channels ch1 and ch2, and the direction perpendicular to the line connecting both microphones M1 and M2 is the arrival direction of the target signal (target Sound source direction), the value of one component Wb (Lb / 2) corresponding to a time delay of 0 among the components of the impulse response estimated by the impulse response estimator 12 is applied by the adaptive beamformer 11. It is used to control.

一方、図５に示すように二つのマイクロホンＭ１，Ｍ２を結ぶ線と垂直な方向を目的音源の方向であると設定し、さらに±θに示した角度範囲から到来する信号についても目的信号であると許容する場合には、インパルス応答推定部１２で推定されるインパルス応答の成分のうち、矢印ａで示した方向から到来する信号の時間遅れと矢印ｂで示した方向から到来する信号の時間遅れの間の全ての時間遅れに相当する成分に関して、例えばその最大値を適応ビームフォーマ１１の適応の制御に用いる。 On the other hand, as shown in FIG. 5, the direction perpendicular to the line connecting the two microphones M1 and M2 is set as the direction of the target sound source, and the signal coming from the angle range indicated by ± θ is also the target signal. In the case of allowing, the time delay of the signal arriving from the direction indicated by the arrow a and the time delay of the signal arriving from the direction indicated by the arrow b among the impulse response components estimated by the impulse response estimation unit 12 For the components corresponding to all the time delays between, for example, the maximum value is used for adaptive control of the adaptive beamformer 11.

図６に示すようにマイクロホンＭ１，Ｍ２の間隔をｄ、音速をｃとすると、信号の到来角度φとチャネルｃｈ１，ｃｈ２間の時間遅れτは、
τ＝ｄｓｉｎ（φ）／ｃ（２）
により計算できる。これより、インパルス応答推定部１２中の適応フィルタ３２における時間遅れの成分の番号は、Ｔｓをサンプリング周期として、τ／Ｔｓの整数をとることにより得られる。図５の±θのように目的信号の到来角度範囲を設定する場合、式（２）により求めたτ＝ｄｓｉｎ（θ）／ｃから、τ／Ｔｓの整数部をｋとすると、
｛Ｗｂ（−ｋ＋Ｌｄ／２），Ｗｂ（−ｋ＋Ｌｂ／２＋１），…Ｗｂ（Ｌｂ／２），Ｗｃ（Ｌｃ／２＋１），…，Ｗｂ（Ｌｂ／２＋ｋ）｝
の中の最大値をインパルス応答の成分値ｈとする。 As shown in FIG. 6, when the distance between the microphones M1 and M2 is d and the sound speed is c, the time delay τ between the signal arrival angle φ and the channels ch1 and ch2 is
τ = dsin (φ) / c (2)
Can be calculated by Thus, the number of the time delay component in the adaptive filter 32 in the impulse response estimation unit 12 is obtained by taking an integer of τ / Ts with Ts as the sampling period. When setting the arrival angle range of the target signal as ± θ in FIG. 5, from τ = dsin (θ) / c obtained by Equation (2), if the integer part of τ / Ts is k,
{Wb (-k + Ld / 2), Wb (-k + Lb / 2 + 1), ... Wb (Lb / 2), Wc (Lc / 2 + 1), ..., Wb (Lb / 2 + k)}
The maximum value among the values is the component value h of the impulse response.

次に、上述した本実施形態の信号処理装置における処理の流れを図７に示すフローチャートを用いて説明する。
まず、初期設定として目的音源方向（目的信号到来方向）の許容角度範囲±θを設定し（ステップＳ１０１）、次いでインパルス応答推定部１２で入力信号を適応フィルタ３２に入力してインパルス応答を推定する（ステップＳ１０２）。このステップＳ１０２では、前述のように予め定めた時間遅れのインパルス応答成分の成分値ｈを求め、さらに次式により成分値ｈの平均化を行って平均値ｂを求める。
ｂ＝ｈ＊α＋ｂ＊（１−α）（３）
ここでαは忘却係数であり、例えばα＝０．１を用いる。 Next, the flow of processing in the signal processing apparatus of the present embodiment described above will be described using the flowchart shown in FIG.
First, an allowable angle range ± θ in the target sound source direction (target signal arrival direction) is set as an initial setting (step S101), and then the impulse response estimation unit 12 inputs the input signal to the adaptive filter 32 to estimate the impulse response. (Step S102). In step S102, the component value h of the impulse response component having a predetermined time delay as described above is obtained, and the component value h is averaged by the following equation to obtain the average value b.
b = h * α + b * (1−α) (3)
Here, α is a forgetting factor, and α = 0.1 is used, for example.

次に、成分値ｈについてしきい値判定、すなわち現在の成分値ｈが予めたしきい値ｓと平均値ｂの和の値より大きいか否かの判定を行い、大きければ適応停止の信号を適応ビームフォーマ１１に出力し、小さければ適応続行の信号を出力する。すなわち、ｈ＞ｂ＋ｓならば適応停止、そうでなければ適応続行とする（ステップＳ１０３）。しきい値ｓの値は、例えばｓ＝０．１５を用いる。 Next, it is determined whether or not the component value h is a threshold value, that is, whether or not the current component value h is larger than the sum of the threshold value s and the average value b. The signal is output to the adaptive beamformer 11, and if it is smaller, a signal to continue adaptation is output. That is, if h> b + s, the adaptation is stopped, otherwise the adaptation is continued (step S103). For example, s = 0.15 is used as the threshold value s.

次に、例えば式（１）に従って適応ビームフォーマ１１の処理を行う。このとき、ステップＳ１０３の判定結果に基づいて、適応停止信号が出ている場合は式（１）中のステップサイズμをμ＝０として適応を停止し（ステップＳ１０４）、また適応続行の信号が出ている場合はμを予め定めた値、例えばμ＝０．０１として適応処理を行う（ステップＳ１０５）。 Next, the adaptive beamformer 11 is processed according to, for example, equation (1). At this time, if an adaptive stop signal is output based on the determination result of step S103, the adaptation is stopped by setting the step size μ in the equation (1) to μ = 0 (step S104), and an adaptation continuation signal is received. If it appears, the adaptive processing is performed with μ set to a predetermined value, for example, μ = 0.01 (step S105).

そして、上述したステップＳ１０１〜Ｓ０１５の処理を入力信号の１サンプル毎に繰り返す。信号入出力の効率化のため、繰り返しの単位を例えば２５６サンプル毎のブロック処理としてもよい。すなわち、ステップＳ１０２とステップＳ１０４，Ｓ１０５の適応フィルタ処理については、２５６サンプル分まとめて行い、またステップＳ１０２における成分値ｈの平均化とステップＳ１０３のしきい値処理については、２５６点に１回行うようにする。 And the process of step S101-S015 mentioned above is repeated for every sample of an input signal. In order to increase the efficiency of signal input / output, the repetition unit may be a block process for every 256 samples, for example. That is, the adaptive filter processing in step S102 and steps S104 and S105 is performed for 256 samples collectively, and the averaging of the component value h in step S102 and the threshold processing in step S103 is performed once for 256 points. Like that.

文献３に述べられている相関に基づく適応処理の制御は、時間遅れ０に関する相関のみに着目しているため、大きくずれた目的信号の到来角度に関する制御が不安定になっていたが、本実施形態ではインパルス応答推定部１２で複数チャネルの信号間のインパルス応答を求め、制御部１３により該インパルス応答の成分に基づいて適応ビームフォーマ１１の適応処理を制御するため、より少ない演算量で高精度に広い角度範囲から到来する信号を目的信号であると許容するように設定できる。すなわち、目的信号の許容到来角度範囲を精度よくかつ広範囲に設定することができる。従って、例えばマイクロホンへの音声入力中に雑音の状況が変わった場合でも、連続して安定に音声の入力を行うことができる。 The control of adaptive processing based on the correlation described in Reference 3 focuses only on the correlation related to time delay 0, and thus the control related to the arrival angle of the target signal greatly deviated has become unstable. In the embodiment, the impulse response estimation unit 12 obtains an impulse response between signals of a plurality of channels, and the control unit 13 controls the adaptive processing of the adaptive beamformer 11 based on the impulse response component. It can be set to allow a signal coming from a wide angle range to be a target signal. That is, the allowable arrival angle range of the target signal can be set accurately and over a wide range. Therefore, for example, even when the noise situation changes during the voice input to the microphone, the voice can be input continuously and stably.

（第２の実施形態）
次に、適応マイクロホンアレイ処理装置の入力に用いるマイクロホンとして、指向性マイクロホンを使う場合の実施形態について説明する。 (Second Embodiment)
Next, an embodiment in which a directional microphone is used as a microphone used for input of the adaptive microphone array processing apparatus will be described.

一般に、適応ビームフォーマの入力としては、無指向性のマイクロホンやアンテナ素子からの信号が用いられるが、例えば二つの無指向性のマイクロホンを使った適応ビームフォーマでは、雑音を消去できない周波数帯域ができる。 In general, the signal from an omnidirectional microphone or antenna element is used as the input of an adaptive beamformer. For example, an adaptive beamformer using two omnidirectional microphones can generate a frequency band that cannot eliminate noise. .

図８は、２チャネルのJim-Griffith型ＧＳＣを適応ビームフォーマに用い、これを図９に示したマイクロホンＭ１，Ｍ２の配置と雑音源の方向を仮定して適応させた場合のＧＳＣ内のフィルタの周波数特性であり、直線ａは目的音源の方向（目的信号の到来方向）、曲線ｂは雑音源の方向（白色雑音の到来方向）の周波数特性である。 FIG. 8 shows a filter in the GSC when a two-channel Jim-Griffith type GSC is used as an adaptive beamformer and is adapted assuming the arrangement of the microphones M1 and M2 and the direction of the noise source shown in FIG. The straight line a is the frequency characteristic of the direction of the target sound source (the direction of arrival of the target signal), and the curve b is the frequency characteristic of the direction of the noise source (the direction of arrival of white noise).

曲線ｂは大部分の周波数でパワーが数十ｄＢ低下しており、雑音除去が働いていることを示しているが、０周波数付近とｆｏで示した周波数付近においては雑音が除去されないことを示している。これは、図９に示したようなマイクロホンＭ１，Ｍ２の配置と雑音源の方向の場合に、設定した雑音源方向からの到来雑音が各マイクロホンＭ１，Ｍ２に到達する際の時間遅れに相当する位相が目的信号の時間遅れの位相と２πの整数倍で一致してしまうためであり、空間的エリアジングと呼ばれている。この現象は、多数のマイクロホンを使う場合は避けられるが、このような２チャネル構成の場合は回避が困難であった。 Curve b shows that the power is reduced by several tens of dB at most frequencies, and that noise removal is working. However, the noise is not removed near the zero frequency and the frequency indicated by fo. ing. This corresponds to a time delay when arrival noise from the set noise source direction reaches each of the microphones M1 and M2 in the case of the arrangement of the microphones M1 and M2 and the direction of the noise source as shown in FIG. This is because the phase coincides with the phase of the time delay of the target signal by an integral multiple of 2π, which is called spatial aliasing. This phenomenon can be avoided when a large number of microphones are used, but it has been difficult to avoid in the case of such a two-channel configuration.

このような問題を解決するため、本実施形態では無指向性マイクロホンに代えて指向性マイクロホンを用い、それらを指向性の向きを異ならせて配置することにより、空間的エリアジングの影響を小さくしている。本実施形態における指向性マイクロホンの種々の配置と指向性パターンの例を図１０に示す。 In order to solve such problems, in this embodiment, directional microphones are used instead of omnidirectional microphones, and these are arranged with different directivity directions to reduce the effect of spatial aliasing. ing. Examples of various arrangements of directivity microphones and directivity patterns in this embodiment are shown in FIG.

まず、図１０（ａ）では二つの単一指向性マイクロホンを用い、指向性の向きを目的方向（目的音源の方向）に対して角度±φだけずらせて対称となるように各マイクロホンを並べている。空間的エリアジング抑圧の効果を十分出すには、ずれ角φの値を２０°≦φ≦１８０°の範囲に設定することが望ましい。マイクロホンの指向性は図１０（ａ）のような単一指向性に限る必要はなく、双指向性でも狭指向性でもよい。二つのマイクロホンの指向性パターンが同じであれば、図１０（ａ）のような対称な配置でよい。 First, in FIG. 10A, two unidirectional microphones are used, and the microphones are arranged so as to be symmetrical by shifting the directivity direction by an angle ± φ with respect to the target direction (the direction of the target sound source). . In order to obtain a sufficient effect of suppressing spatial aliasing, it is desirable to set the value of the shift angle φ in a range of 20 ° ≦ φ ≦ 180 °. The directivity of the microphone need not be limited to the single directivity as shown in FIG. 10A, and may be bidirectional or narrow directivity. If the directivity patterns of the two microphones are the same, a symmetrical arrangement as shown in FIG.

一方、二つのマイクロホンの指向性パタンは必ずしも同じである必要はなく、例えば図１０（ｂ）に示すように一方が無指向性、他方が単一指向性であってもよい。この場合、目的方向に対する感度が二つのマイクロホンで一致するように感度の調整を行う必要がある。例えば、図１０（ｂ）の配置の場合、第１チャネルｃｈ１の単一指向性マイクロホンの目的方向の感度ｇ１と、第２チャネルｃｈ２の無指向性マイクロホンの感度ｇ２が一致する角度を選ぶか、両マイクロホンの出力信号に対して振幅が一致するような補正を施すようにする。信号振幅の補正は、二つのマイクロホンの感度差を振幅に変換した係数を両マイクロホンの出力信号に乗じればよい。 On the other hand, the directivity patterns of the two microphones are not necessarily the same. For example, as shown in FIG. 10B, one may be omnidirectional and the other may be unidirectional. In this case, it is necessary to adjust the sensitivity so that the sensitivity with respect to the target direction matches between the two microphones. For example, in the case of the arrangement shown in FIG. 10B, an angle at which the sensitivity g1 of the target direction of the unidirectional microphone of the first channel ch1 matches the sensitivity g2 of the omnidirectional microphone of the second channel ch2 is selected. Corrections are made so that the amplitudes coincide with the output signals of both microphones. The signal amplitude can be corrected by multiplying the output signals of both microphones by a coefficient obtained by converting the sensitivity difference between the two microphones into an amplitude.

また、目的方向は二つのマイクロホンを結ぶ直線ｌと垂直である必要はなく、図１０（ｃ）に示すように二つのマイクロホンを結ぶ直線ｌと目的方向とが一致したり、図１０（ｄ）に示すように直線ｌと目的方向が任意の角度をなしてもよい。これらの場合、図１１に示すようにビームフォーマ本体１５の前に、遅延器１４を配置し、目的方向から信号が到来した場合の各チャネルｃｈ１，ｃｈ２の信号の位相が一致するように、一方のチャネルｃｈ２の信号を遅延時間ｄ／ｃだけ遅延させるようにする。ここでｃは信号の伝播速度である。 Further, the target direction does not need to be perpendicular to the straight line l connecting the two microphones. As shown in FIG. 10C, the straight line l connecting the two microphones coincides with the target direction, or FIG. As shown, the straight line l and the target direction may form an arbitrary angle. In these cases, as shown in FIG. 11, a delay device 14 is arranged in front of the beamformer main body 15 so that the phases of the signals of the channels ch1 and ch2 when the signals arrive from the target direction coincide with each other. The channel ch2 signal is delayed by the delay time d / c. Here, c is a signal propagation speed.

以上述べたマイクロホン配置を用いた場合の２チャネルJim-Griffith型ＧＳＣ内のフィルタの周波数特性を図８に重ねて示す。図８の曲線ｃ，ｄ，ｅは、図９と同じ配置のマイクロホン各々を単一指向性にし、ずれ角φ＝１０°，３０°，６０°にした場合の雑音到来方向に対する周波数特性である。この結果から、ずれ角φを大きくするに従い、曲線ｂの無指向性の場合に出ていた空間的エリアジングのピークが小さくなり、全ての周波数において雑音源の方向の感度が低くなっていることが分かる。 FIG. 8 shows the frequency characteristics of the filter in the two-channel Jim-Griffith type GSC when the microphone arrangement described above is used. Curves c, d, and e in FIG. 8 are frequency characteristics with respect to the noise arrival direction when the microphones having the same arrangement as in FIG. 9 are made unidirectional and the deviation angles φ = 10 °, 30 °, and 60 °. . From this result, as the deviation angle φ is increased, the spatial aliasing peak that appears in the case of the omnidirectional curve b decreases, and the sensitivity in the direction of the noise source decreases at all frequencies. I understand.

（第３の実施形態）
本実施形態では、目的信号が設定された目的音源の方向とずれた方向から到来した場合でも、その方向にビームフォーマに設定される目的音源の方向を追随させることにより、目的信号を劣化させずに出力することを可能とした信号処理装置について説明する。 (Third embodiment)
In the present embodiment, even when the target signal arrives from a direction deviating from the direction of the set target sound source, the target signal is not deteriorated by following the direction of the target sound source set in the beam former in that direction. A signal processing apparatus that can output the signal will be described.

従来、信号の到来方向の推定は、例えば文献１に述べられているように最小分散法やＭＵＳＩＣ法により行う場合、入力信号の相関行列を推定して、相関行列の逆行列や固有ベクトルを求め、さらに空間的な走査処理を行う必要があった。しかし、これらの処理は演算量が多く、また環境変化への追随性を考慮した相関行列の推定を行わなければならないという煩雑さがある。 Conventionally, when the direction of arrival of a signal is estimated by, for example, the minimum variance method or the MUSIC method as described in Reference 1, a correlation matrix of an input signal is estimated to obtain an inverse matrix or eigenvector of the correlation matrix, Furthermore, it was necessary to perform a spatial scanning process. However, these processes require a large amount of calculation, and there is a complexity that it is necessary to estimate a correlation matrix in consideration of the ability to follow environmental changes.

これに対し、本実施形態では目的音源の方向に追随するため、目的信号を出力する第１のビームフォーマとは別に第２のビームフォーマを設けて、第２のビームフォーマ内のフィルタの指向性から目的音源の方向を推定し、その推定結果に基づいて第１のビームフォーマの設定を行う。これにより、相関行列の推定や逆行列、固有値展開などの演算を必要とせず、少ない演算量で目的信号への追随を可能にしている。 On the other hand, in this embodiment, in order to follow the direction of the target sound source, a second beam former is provided separately from the first beam former that outputs the target signal, and the directivity of the filter in the second beam former is provided. Then, the direction of the target sound source is estimated, and the first beamformer is set based on the estimation result. This eliminates the need for operations such as correlation matrix estimation, inverse matrix, and eigenvalue expansion, and enables tracking of the target signal with a small amount of calculation.

図１２に、本実施形態に係る信号処理装置の構成を示す。入力端子４０−１，４０−２からのチャネルｃｈ１，ｃｈ２の信号は、第１および第２のビームフォーマ４１，４２に入力される。第１のビームフォーマ４１では目的信号が出力され、第２のビームフォーマ４２では目的信号が抑圧され雑音が出力される。 FIG. 12 shows the configuration of the signal processing apparatus according to the present embodiment. The signals of the channels ch1 and ch2 from the input terminals 40-1 and 40-2 are input to the first and second beamformers 41 and 42. The first beamformer 41 outputs a target signal, and the second beamformer 42 suppresses the target signal and outputs noise.

雑音源方向推定部４３は、第１のビームフォーマ４１内のフィルタのフィルタ係数から雑音源の方向（雑音信号の到来方向）を推定し、その推定結果を第１の制御部４４に与える。目的音源方向推定部４５は、第２のビームフォーマ４２内のフィルタのフィルタ係数から目的音源の方向（目的信号の到来方向）を推定し、その推定結果を第２の制御部４６に与える。以降、第１および第２ビームフォーマ４１，４２において設定されている目的音源の方向を入力方向と呼ぶことにする。 The noise source direction estimation unit 43 estimates the direction of the noise source (the arrival direction of the noise signal) from the filter coefficient of the filter in the first beamformer 41, and gives the estimation result to the first control unit 44. The target sound source direction estimation unit 45 estimates the direction of the target sound source (the arrival direction of the target signal) from the filter coefficient of the filter in the second beamformer 42 and gives the estimation result to the second control unit 46. Hereinafter, the direction of the target sound source set in the first and second beam formers 41 and 42 is referred to as an input direction.

第１の制御部４４は、雑音源方向推定部４３により推定された雑音源の方向が入力方向として設定されるように第２のビームフォーマ４２を制御する。第２の制御部４６は、目的音源方向推定部４５により推定された目的音源の方向が入力方向として設定されるように第１のビームフォーマ４１を制御する。第２のビームフォーマ４２の入力方向を雑音源の方向に設定するのは、第２のビームフォーマ４２により雑音源の方向が推定されるのを防ぐためである。 The first control unit 44 controls the second beamformer 42 so that the direction of the noise source estimated by the noise source direction estimation unit 43 is set as the input direction. The second control unit 46 controls the first beamformer 41 so that the direction of the target sound source estimated by the target sound source direction estimation unit 45 is set as the input direction. The reason why the input direction of the second beam former 42 is set to the direction of the noise source is to prevent the second beam former 42 from estimating the direction of the noise source.

第１および第２のビームフォーマ４１，４２は、既に述べたようにＧＳＣでもフロスト型でも参照信号型でもよい。この場合、第１のビームフォーマ４１内のフィルタでは雑音源の方向に、第２のビームフォーマ４２内のフィルタでは目的音源の方向にそれぞれ感度の低い死角ができているため、各々のフィルタのフィルタ係数からその指向性を調べることにより、雑音源および目的音源の方向を推定することができる。 As described above, the first and second beam formers 41 and 42 may be GSC, frost type, or reference signal type. In this case, since a blind spot with low sensitivity is formed in the direction of the noise source in the filter in the first beam former 41 and in the direction of the target sound source in the filter in the second beam former 42, the filter of each filter By examining the directivity from the coefficients, the direction of the noise source and the target sound source can be estimated.

一般に、ビームフォーマの入力方向の設定は、よく知られているように入力方向からの信号がチャネルどうし同位相になるように入力信号に遅延を与えることにより行われる。このため本実施形態においても、第１および第２のビームフォーマ４１，４２の内部を図１１に示した構成として、遅延器１４により上記の遅延を与えている。なお、第１および第２のビームフォーマ４１，４２が参照信号型の場合は、参照信号の到来方向を入力方向に合わせてもよいし、フロスト型の場合は、例えば文献４：Kazuaki Takao et.al.: “An adaptive antenna array under directional constraint”,IEEE Teans.on Antenna Propag.Vol.AP-24,No.5,Sep.1976 に開示されているように、任意の入力方向からの到来信号が減衰しないように線形拘束条件を設定することによっても行うことができる。 Generally, the input direction of the beamformer is set by delaying the input signal so that signals from the input direction are in phase with each other as is well known. For this reason, also in the present embodiment, the first and second beam formers 41 and 42 are configured as shown in FIG. When the first and second beamformers 41 and 42 are of the reference signal type, the arrival direction of the reference signal may be matched with the input direction. In the case of the frost type, for example, Reference 4: Kazuaki Takao et. al .: “An adaptive antenna array under directional constraint”, IEEE Teans.on Antenna Propag.Vol.AP-24, No.5, Sep.1976, the incoming signal from any input direction This can also be done by setting linear constraint conditions so as not to attenuate.

雑音源方向推定部４３と目的音源方向推定部４５では、前述のように第１および第２のビームフォーマ４１，４２内のフィルタの指向性から雑音源および目的音源の方向を推定するため、図１３のフローチャートに示すような手順で処理を行う。なお、ここでは入力信号が２チャネルの場合の処理を例として述べるが、２チャネルに限定されるものではない。 Since the noise source direction estimation unit 43 and the target sound source direction estimation unit 45 estimate the directions of the noise source and the target sound source from the directivities of the filters in the first and second beam formers 41 and 42 as described above, The processing is performed according to the procedure shown in the flowchart of FIG. In addition, although the process in case an input signal is 2 channels is described here as an example, it is not limited to 2 channels.

まず、初期設定として探索範囲θ_r、フィルタ長Ｌ、ＦＦＴ長（ＦＦＴポイント数）Ｎ、チャネル数Ｍなどを設定する（ステップＳ２０１）。第１のビームフォーマ４１では、任意の雑音源の方向に対応するために、目的音源の方向を基準（０°）として探索範囲θ_rは９０°とし、第２のビームフォーマ４２では目的信号の到来範囲内のみを探索するため、例えばθ_r＝２０°とする。探索角度範囲は、入力方向±θ_rの範囲とする。フィルタ長Ｌとチャネル数Ｍはビームフォーマ４１，４２と同じ値を用い、例えば各々Ｌ＝５０、Ｍ＝２とし、ＦＦＴ長Ｎは、フィルタ長Ｌより大きい値、例えばこの場合は６４点とする。 First, as initial settings, a search range θ _r , a filter length L, an FFT length (FFT point number) N, a channel number M, and the like are set (step S201). In the first beam former 41, in order corresponding to the direction of any noise sources, the search range theta _r a direction of the target sound source as a reference (0 °) and 90 °, the target signal in the second beam former 42 In order to search only within the arrival range, for example, θ _r = 20 °. The search angle range is the range of the input direction ± θ _r . The filter length L and the number of channels M are the same as those of the beam formers 41 and 42, for example, L = 50 and M = 2, respectively, and the FFT length N is a value larger than the filter length L, for example, 64 points in this case. .

次に、ビームフォーマ４１，４２がＧＳＣならばフィルタ変換を行い（ステップＳ２０３）、そうでなければＦＦＴを行う（ステップＳ２０４）。 Next, if the beam formers 41 and 42 are GSC, filter conversion is performed (step S203), and if not, FFT is performed (step S204).

ステップＳ２０３では、ビームフォーマ４１，４２がＧＳＣの場合、フィルタ係数をトランスバーサル型のビームフォーマと等価な形に変換する。例えば、２チャネルのJim Griffith型ＧＳＣの場合、ＧＳＣ内の適応フィルタの特性をｗｇ＝（ｗ₀，ｗ₁，ｗ₂，…，ｗ_L-2，ｗ_L-1）とおくと、第１チャネルｃｈ１の等価フィルタの特性は、ｗ_e1＝（−ｗ₀，−ｗ₁，−ｗ₂，…，−ｗ_L/2+1，…，−ｗ_L-2，−ｗ_L-1）、第２チャネルｃｈ２の等価フィルタの特性は、ｗ_e2＝（ｗ₀，ｗ₁，ｗ₂，…，ｗ_L/2-1，…，ｗ_L-2，ｗ_L-1）とおけばよい。 In step S203, when the beamformers 41 and 42 are GSC, the filter coefficient is converted into a form equivalent to a transversal beamformer. For example, in the case of a two-channel Jim Griffith type GSC, if the characteristics of the adaptive filter in the GSC are set as wg = (w ₀ , w ₁ , w ₂ ,..., W _L−2 , w _L−1 ), characteristics of the equivalent filter of the channel ch1 _{_{is, w e1 = (- w 0}} , -w 1, -w 2, ..., -w L / 2 + 1, ..., -w L-2, -w L-1), characteristics of the equivalent filter of the second channel ch2 _{_{is, w e2 = (w 0,}} w 1, w 2, ..., w L / 2 -1, ..., w L-2, w L-1) and may be put.

次に、チャネル毎にフィルタ係数のＦＦＴ（高速フーリエ変換）を行い、その周波数成分Ｗ_ei(k) を求める（ステップＳ２０４）。ここで、ｋは周波数成分の番号、ｉはチャネルの番号である。 Next, FFT (fast Fourier transform) of the filter coefficient is performed for each channel, and the frequency component W _ei (k) is obtained (step S204). Here, k is a frequency component number, and i is a channel number.

次に、探索範囲の中のある一つの方向をθとすると、θ方向から到来する信号に関する各チャネルの伝播位相遅れを表わす方向ベクトルＳ（ｋ，θ）を生成する（ステップＳ２０５）。方向ベクトルＳ（ｋ，θ）は、例えば図６に示したマイクロホン配置の場合、第１チャネルｃｈ１を基準にすると、Ｓ（ｋ，θ）＝（１，ｅ^{-jk/Nfsdsin(θ)}）となる。ｆｓはサンプリング周波数である。 Next, assuming that one direction in the search range is θ, a direction vector S (k, θ) representing a propagation phase delay of each channel related to a signal arriving from the θ direction is generated (step S205). For example, in the case of the microphone arrangement shown in FIG. 6, the direction vector S (k, θ) is given by S (k, θ) = (1, e ^{−jk / Nfsdsin (θ)} ) with reference to the first channel ch1. Become. fs is a sampling frequency.

次に、ＦＦＴにより求めたフィルタの周波数成分Ｗ_e＝（Ｗ_e1(k) 、Ｗ_e2(k))と方向ベクトルＳ（ｋ，θ）＝（１，ｅ^{-jk/Nfsdsin(θ)}）の内積の絶対値の２乗｜Ｓ・Ｗ｜²を求める（ステップＳ２０６）。 Then, the frequency components of the filter determined by the _{_{FFT W e = (W e1 (}} k), W e2 (k)) and the direction vector S (k, θ) = ( 1, e -jk / Nfsdsin (θ)) of The absolute value square of the inner product | S · W | ² is obtained (step S206).

ステップＳ２０５〜Ｓ２０６の処理を全ての周波数、すなわちｋ＝１からｋ＝２／Ｎまでについて行い、求めた内積の２乗和を方向θ毎に周波数ｋについて加算し、全帯域についてまとめた方向毎の感度
Ｄ（θ）＝Σ｜Ｗ(k) Ｓ（ｋ，θ）｜² （４）
を求める（ステップＳ２０７〜Ｓ２０８）。このとき、方向を例えば１°ずつ変化させ、探索範囲の全ての方向について調べるようにする。 The processing in steps S205 to S206 is performed for all frequencies, that is, k = 1 to k = 2 / N, the square sum of the obtained inner products is added for frequency k for each direction θ, and for each direction collected for all bands. Sensitivity
D (θ) = Σ | W (k) S (k, θ) | ² (4)
Is obtained (steps S207 to S208). At this time, the direction is changed by 1 °, for example, and all directions in the search range are examined.

次に、求めた方向毎の感度が最小となる方向θmin をＤ（θ）から求め、これを信号（目的信号または雑音信号）の到来方向とする（ステップＳ２０９）。このとき、雑音による推定誤差の影響を軽減するため、推定された音源方向に対しビームフォーマの出力により重み付けを行い、過去に推定された音源方向との平均化を行いながら、入力方向を更新するようにする。例えば、次式に従って計算を行う。 Next, the direction θmin in which the obtained sensitivity for each direction is minimized is obtained from D (θ), and this is set as the arrival direction of the signal (target signal or noise signal) (step S209). At this time, in order to reduce the influence of the estimation error due to noise, the input direction is updated while the estimated sound source direction is weighted by the output of the beamformer and averaged with the previously estimated sound source direction. Like that. For example, the calculation is performed according to the following equation.

θ₁＝θ′₁＊ｇ₁−（１．０−ｇ₁）＊θ₀
θ₂＝θ′₂＊ｇ₂−（１．０−ｇ₂）＊θ_n
ｇ₁＝１．０−１／（ｆ（ｐ₁−ｐ_th) ｆ（ｒ₁−ｒ_th）＋１．０）
ｇ₂＝１．０−１／（ｆ（ｐ₂−ｐ_th) ｆ（ｒ₂−ｒ_th）＋１．０）
ｒ₁＝ｐ₁／ｐ₂、ｒ₂＝ｐ₂／ｐ₁
ここで、ｇ₁，ｇ₂はそれぞれ第１および第２のビームフォーマ４１，４２の入力方向更新のための重み、ｐ₁，ｐ₂はそれぞれ第１および第２のビームフォーマ４１，４２のブロック単位の出力パワー、ｐ_thはパワーのしきい値、ｒ_thはパワー比のしきい値、θ′₁，θ′₂はそれぞれ第１および第２のビームフォーマ４１，４２の更新前の入力方向、θ₁，θ₂はそれぞれ第１および第２のビームフォーマ４１，４２の更新後の入力方向、θ₀，θ_nはそれぞれ新たに推定された目的音源と雑音源の方向、ｆ（ｘ）はｘが正ならばｘ、負ならば０を与える関数である。 θ ₁ = θ ′ ₁ * g ₁ − (1.0−g ₁ ) * θ ₀
θ ₂ = θ ′ ₂ * g ₂ − (1.0−g ₂ ) * θ _n
_{g 1 = 1.0-1 / (f (} p 1 -p th) f (r 1 -r th) +1.0)
_{g 2 = 1.0-1 / (f (} p 2 -p th) f (r 2 -r th) +1.0)
r ₁ = p ₁ / p ₂ , r ₂ = p ₂ / p ₁
Here, g ₁ and g ₂ are weights for updating the input directions of the first and second beam formers 41 and 42, respectively, and p ₁ and p ₂ are blocks of the first and second beam formers 41 and 42, respectively. Unit output power, p _th is a power threshold value, r _th is a power ratio threshold value, θ ′ ₁ and θ ′ ₂ are input directions before updating the first and second beam formers 41 and 42, respectively. , Θ ₁ , θ ₂ are the updated input directions of the first and second beam formers 41, 42, respectively, θ ₀ , θ _n are the newly estimated directions of the target sound source and noise source, f (x) Is a function that gives x if x is positive and 0 if negative.

このような更新により、目的音源からの目的音のパワーが大きく、雑音のパワーが小さいときには更新を速くし、それ以外では更新を遅くするように制御している。ｇ₁，ｇ₂、ｐ_th、ｒ_thの値は、例えば入力信号が１６ビットのディジタル信号で、２５６点毎に出力パワーを求める場合、ｇ₁＝０．３、ｇ₂＝０．０２、ｐ_th＝１．０、ｒ_th＝０．５とするが、状況に応じて実験的に決めるのがよい。 By such an update, the control is performed so that the update is accelerated when the power of the target sound from the target sound source is large and the power of the noise is small, and the update is delayed otherwise. The values of g ₁ , g ₂ , p _th and r _th are, for example, when the input signal is a 16-bit digital signal and the output power is obtained every 256 points, g ₁ = 0.3, g ₂ = 0.02, Although p _th = 1.0 and r _th = 0.5, it is preferable to determine experimentally according to the situation.

次に、上述した信号方向推定の処理を含む本実施形態の全体的な処理の流れを図１４に示すフローチャートを参照しながら説明する。なお、ここでは２チャネルアレイの処理を例として述べるが、２チャネルに限定されず、多チャネルでも応用可能である。 Next, the overall processing flow of this embodiment including the above-described signal direction estimation processing will be described with reference to the flowchart shown in FIG. Note that the processing of the two-channel array is described here as an example, but the present invention is not limited to two channels and can be applied to multiple channels.

まず、初期設定として目的音源の方向として許容する範囲Φを設定し、第１のビームフォーマ４１の入力方向θ１を例えば０°に、第２のビームフォーマ４２の入力方向θ２を例えば９０°に、雑音源方向推定部４３の探索範囲θr1を例えば９０°に、目的音源方向推定部４５の探索範囲θr2を例えば２０°にそれぞれ設定する（ステップＳ３０１）。目的音源方向の許容範囲Φは、第２のビームフォーマ４２の探索範囲θｒと同じ値とし、Φ＝θr2＝２０°とする。この設定により、±Φの角度範囲に到来した信号を目的信号とみなすようになる。ここで、方向は二つのマイクロホンを結ぶ直線に垂直な方向を０°としている。 First, a range Φ allowed as the direction of the target sound source is set as an initial setting, the input direction θ1 of the first beamformer 41 is set to, for example, 0 °, and the input direction θ2 of the second beamformer 42 is set to, for example, 90 °. The search range θr1 of the noise source direction estimation unit 43 is set to 90 °, for example, and the search range θr2 of the target sound source direction estimation unit 45 is set to 20 °, for example (step S301). The allowable range Φ in the target sound source direction is set to the same value as the search range θr of the second beamformer 42, and Φ = θr2 = 20 °. With this setting, a signal that arrives within an angle range of ± Φ is regarded as a target signal. Here, the direction is 0 ° perpendicular to the straight line connecting the two microphones.

次に、第１のビームフォーマ４１の入力方向を設定する（ステップ３０２）。ここでは、２チャネルの信号に遅延を与えることにより、設定した入力方向からの信号が等価的にアレイに同時に到達するようにする。このため、図１１の構成の第１のビームフォーマ４１において、遅延器１４により第１チャネルｃｈ１の信号に与える遅延をτ＝ｄｓｉｎ（θ１）／ｃにより計算する。 Next, the input direction of the first beamformer 41 is set (step 302). Here, by delaying the signals of the two channels, the signals from the set input direction can equivalently reach the array simultaneously. Therefore, in the first beamformer 41 having the configuration shown in FIG. 11, the delay given to the signal of the first channel ch1 by the delay unit 14 is calculated by τ = dsin (θ1) / c.

次に、第１のビームフォーマ４１の処理を行い（ステップＳ３０３）、得られたフィルタ係数から上述した方法により探索範囲±θr1内で雑音源の方向を推定する（ステップＳ３０４）。推定された雑音源の方向をθｎとする。 Next, the first beamformer 41 is processed (step S303), and the direction of the noise source is estimated within the search range ± θr1 by the method described above from the obtained filter coefficient (step S304). Let θn be the direction of the estimated noise source.

次に、ステップＳ３０４で推定された雑音源の方向θｎが目的音源の方向の近傍（０°±Φ）にあるか否かを判断し（ステップＳ３０５）、近傍にある場合はそのままステップＳ３０８に進む。 Next, it is determined whether or not the direction θn of the noise source estimated in step S304 is in the vicinity (0 ° ± Φ) of the direction of the target sound source (step S305). .

一方、ステップＳ３０４で推定された雑音源の方向θｎが目的音源の方向の近傍でない場合は、推定された雑音源の方向を入力方向とするように第２のビームフォーマ４２の入力方向を設定する（ステップＳ３０７）。すなわち、θ２の値を先にのべた平均化により更新する。ステップＳ３０２と同様に、第２チャネルｃｈ２の信号に遅延を与えて入力方向からの信号が等価的にアレイに同時に到達するようにするため、図１１の構成の第２のビームフォーマ４２において、遅延器１４により第１チャネルｃｈ１に与える遅延をτ＝ｄｓｉｎ（θ２）／ｃにより計算する）。 On the other hand, if the direction θn of the noise source estimated in step S304 is not near the direction of the target sound source, the input direction of the second beamformer 42 is set so that the estimated direction of the noise source is the input direction. (Step S307). That is, the value of θ2 is updated by the above-mentioned averaging. As in step S302, in order to delay the signal of the second channel ch2 so that the signal from the input direction reaches the array at the same time, the second beamformer 42 having the configuration of FIG. The delay given to the first channel ch1 by the device 14 is calculated by τ = dsin (θ2) / c).

次に、第２のビームフォーマ４２の処理を行い（ステップＳ３０８）、探索範囲±θr2の中で目的音源の方向を推定し（ステップＳ３０９）、再びステップＳ３０２に戻って、推定された目的音源の方向を入力方向とするように第１のビームフォーマ４１の入力方向を設定する。このときも、先に述べた平均化により入力方向を更新する。 Next, the processing of the second beamformer 42 is performed (step S308), the direction of the target sound source is estimated within the search range ± θr2 (step S309), and the process returns to step S302 again to determine the estimated target sound source. The input direction of the first beamformer 41 is set so that the direction is the input direction. Also at this time, the input direction is updated by the averaging described above.

以降、以上の処理を繰り返す。なお、処理の単位は効率化のため例えば２５６点を１ブロックとしたブロック単位で行ってもよい。この場合、ビームフォーマ４１，４２の処理は１ブロック分のデータを続けて処理した後のフィルタ係数を次のステップの入力とするようにする。 Thereafter, the above processing is repeated. The unit of processing may be performed in units of blocks, for example, with 256 points as one block for efficiency. In this case, the processing of the beam formers 41 and 42 uses the filter coefficient after processing one block of data as input for the next step.

このように本実施形態によれば、二つのビームフォーマ４１，４２のうち、第２のビームフォーマ４２内のフィルタのフィルタ係数から目的音源の方向（目的信号の到来方向）を推定するようにしたため、目的音源の方向が設定と大きくずれた各ビームフォーマ４１，４２の入力方向から目的信号が到来した場合に目的信号がキャンセルされる現象を抑えて、少ない演算量で高精度に目的信号の抽出を行うことができる。 As described above, according to the present embodiment, the direction of the target sound source (the arrival direction of the target signal) is estimated from the filter coefficient of the filter in the second beam former 42 out of the two beam formers 41 and 42. The target signal is accurately extracted with a small amount of computation by suppressing the phenomenon that the target signal is canceled when the target signal arrives from the input direction of each beam former 41, 42 in which the direction of the target sound source is greatly deviated from the setting. It can be performed.

（第４の実施形態）
本実施形態では、第３の実施形態で述べた目的信号への追随を用いたビームフォーマ処理の際、第１の実施形態で述べたインパルス応答成分に基づいた適応制御を組み合わせることにより、信号方向推定の精度を高めるようにした信号処理装置について説明する。 (Fourth embodiment)
In the present embodiment, the signal direction is determined by combining the adaptive control based on the impulse response component described in the first embodiment in the beamformer process using the follow-up to the target signal described in the third embodiment. A signal processing apparatus that improves the accuracy of estimation will be described.

図１５に、本実施形態に係る信号処理装置の構成を示す。この信号処理装置は図１２に示した第３の実施形態の構成に、インパルス応答推定部４７が追加された構成となっている。インパルス応答推定部４７は、チャネルｃｈ１，ｃｈ２の信号間のインパルス応答を推定し、その推定結果であるインパルス応答の成分値を第１および第２の制御部４４，４６に与える。第１および第２の制御部４４，４６は、該インパルス応答の成分値に基づいて第１および第２のビームフォーマ４１，４２の適応処理を制御する。 FIG. 15 shows the configuration of the signal processing apparatus according to the present embodiment. This signal processing apparatus has a configuration in which an impulse response estimation unit 47 is added to the configuration of the third embodiment shown in FIG. The impulse response estimation unit 47 estimates the impulse response between the signals of the channels ch1 and ch2, and gives the impulse response component values, which are the estimation results, to the first and second control units 44 and 46. The first and second control units 44 and 46 control the adaptive processing of the first and second beam formers 41 and 42 based on the component values of the impulse response.

第１の実施形態で述べたように、インパルス応答は複数チャネルの信号間の時間遅れに相当した成分の値が大きくなる。この時間遅れは信号の到来方向に対応しているので、第１のビームフォーマ４１の入力方向に対応した時間遅れを求め、この時間遅れに相当するインパルス応答の成分値が一定値以上のときには第１のビームフォーマ４１の適応を停止して第２のビームフォーマ４２の適応を行い、一定値以下のときにはその逆の処理とすることにより、各ビームフォーマ４１，４２が自身の入力方向からの信号に適応する目的信号キャンセルの現象を防ぐことができるので、適応フィルタのフィルタ係数を用いた信号方向推定精度の劣化を防ぐことができ、結果として全体の性能向上を図ることができる。 As described in the first embodiment, the impulse response has a component value corresponding to a time delay between signals of a plurality of channels. Since this time delay corresponds to the arrival direction of the signal, the time delay corresponding to the input direction of the first beamformer 41 is obtained, and when the component value of the impulse response corresponding to this time delay is greater than or equal to a certain value, the first time delay is obtained. The adaptation of the first beamformer 41 is stopped and the second beamformer 42 is adapted, and when the value is below a certain value, the opposite process is performed so that each beamformer 41, 42 receives a signal from its own input direction. Therefore, it is possible to prevent the target signal cancellation phenomenon that is adapted to the above, and it is possible to prevent the deterioration of the signal direction estimation accuracy using the filter coefficient of the adaptive filter. As a result, the overall performance can be improved.

次に、図１６に示すフローチャートを参照しながら本実施形態の処理の流れについて説明する。 Next, the processing flow of this embodiment will be described with reference to the flowchart shown in FIG.

なお、ここでは２チャネルアレイの処理を例として述べるが、２チャネルに限定されず、多チャネルでも応用可能である。 Note that the processing of the two-channel array is described here as an example, but the present invention is not limited to two channels and can be applied to multiple channels.

まず、初期設定として第３の実施形態と同様に、目的信号の到来方向として許容する範囲Φを設定し、第１のビームフォーマ４１の入力方向θ１を例えば０°に、第２のビームフォーマ４２の入力方向θ２を例えば９０°に、雑音源方向推定部の４３探索範囲θr1を例えば９０°に、目的音源方向推定部４５の探索範囲θr2を例えば２０°にそれぞれ設定する（ステップＳ４０１）。目的音源方向の許容範囲Φは、第２のビームフォーマ４２の探索範囲θｒと同じ値とし、Φ＝θr2＝２０°とする。この設定により、±Φの角度範囲に到来した信号を目的信号とみなすようになる。ここで、方向は二つのマイクロホンを結ぶ直線に垂直な方向を０°としている。 First, as in the case of the third embodiment, a range Φ that is allowed as the arrival direction of the target signal is set as an initial setting, the input direction θ1 of the first beamformer 41 is set to 0 °, for example, and the second beamformer 42 is set. Is set to 90 °, for example, 43 search range θr1 of the noise source direction estimation unit is set to 90 °, for example, and search range θr2 of the target sound source direction estimation unit 45 is set to 20 °, for example (step S401). The allowable range Φ in the target sound source direction is set to the same value as the search range θr of the second beamformer 42, and Φ = θr2 = 20 °. With this setting, a signal that arrives within an angle range of ± Φ is regarded as a target signal. Here, the direction is 0 ° perpendicular to the straight line connecting the two microphones.

次に、第１のビームフォーマ４１の入力方向を設定する（ステップ４０２）。ここでは、２チャネルの信号に遅延を与えることにより、設定した入力方向からの信号が等価的にアレイに同時に到達するようにする。このため、図１１の構成の第１のビームフォーマ４１において、遅延器１４により第１チャネルｃｈ１の信号に与える遅延をτ＝ｄｓｉｎ（θ１）／ｃにより計算する。 Next, the input direction of the first beamformer 41 is set (step 402). Here, by delaying the signals of the two channels, the signals from the set input direction can equivalently reach the array simultaneously. Therefore, in the first beamformer 41 having the configuration shown in FIG. 11, the delay given to the signal of the first channel ch1 by the delay unit 14 is calculated by τ = dsin (θ1) / c.

次に、同様に第１チャネルｃｈ１の信号に与える遅延をτ＝ｄｓｉｎ（θ２）／ｃにより計算して、第２のビームフォーマ４２の入力方向を設定する（ステップＳ４０３）。 Next, similarly, the delay given to the signal of the first channel ch1 is calculated by τ = dsin (θ2) / c, and the input direction of the second beamformer 42 is set (step S403).

次に、インパルス応答推定部４７においてチャネルｃｈ１，ｃｈ２の信号間のインパルス応答を推定し、第１の実施形態で述べたと同様に、定めた時間遅れのインパルス応答の成分値ｈとその平均値ｂを求める（ステップＳ４０４）。 Next, the impulse response estimation unit 47 estimates the impulse response between the signals of the channels ch1 and ch2, and, as described in the first embodiment, the component value h of the determined time delay impulse response and the average value b thereof Is obtained (step S404).

次に、現在の成分値ｈが予めたしきい値ｓと平均値ｂの和の値より大きいか否かを判定し（ステップＳ４０５）、大きければステップＳ４０６の第１のビームフォーマ４１の処理に進み、小さければステップＳ４１０の第１のビームフォーマ４１の処理に進む。ステップＳ４０６では、適応を停止して第１のビームフォーマ４１の処理を行う。すなわち、適応フィルタのステップサイズμを０とおいて処理する。ステップＳ４０７では、適応を停止せず通常の第２のビームフォーマの処理を行う。すなわち、適応フィルタのステップサイズμを所定の値、例えば０．０１として処理する。 Next, it is determined whether or not the current component value h is larger than the sum of the threshold value s and the average value b (step S405). If larger, the first beamformer 41 is processed in step S406. If it is smaller, the process proceeds to the process of the first beamformer 41 in step S410. In step S406, adaptation is stopped and the first beamformer 41 is processed. That is, processing is performed with the step size μ of the adaptive filter set to zero. In step S407, normal second beamformer processing is performed without stopping adaptation. That is, the step size μ of the adaptive filter is processed as a predetermined value, for example, 0.01.

次に、第２のビームフォーマ４２のフィルタ係数から、第３の実施形態に述べた方法により、目的音源の方向を推定し（ステップＳ４０８）、推定した方向から第１のビームフォーマ４１の入力方向を第３の実施形態で述べた平均化を用いて設定し（ステップＳ４０９）、ステップＳ４０４のインパルス応答推定処理に戻る。 Next, the direction of the target sound source is estimated from the filter coefficient of the second beam former 42 by the method described in the third embodiment (step S408), and the input direction of the first beam former 41 is estimated from the estimated direction. Is set using the averaging described in the third embodiment (step S409), and the process returns to the impulse response estimation process of step S404.

ステップＳ４１０では、適応を停止せず通常の第１のビームフォーマ４１の処理を行い、得られたフィルタ係数から第１の実施形態に述べた方法により雑音源の方向を推定する（ステップＳ４１１）。 In step S410, the normal first beamformer 41 is processed without stopping adaptation, and the direction of the noise source is estimated from the obtained filter coefficients by the method described in the first embodiment (step S411).

次に、推定した雑音源の方向が目的音源の方向の近傍（θ１±Φ）にあるか否かを判断し（ステップＳ４１２）、近傍にある場合はそのままステップＳ４０４のインパルス応答処理に戻り、近傍でない場合は推定した雑音源の方向から第３の実施形態で述べた平均化を用いて第２のビームフォーマ４２の入力方向を設定し（ステップＳ４１３）、その後ステップＳ４０４のインパルス応答推定処理に戻る。 Next, it is determined whether or not the estimated noise source direction is in the vicinity (θ1 ± Φ) of the direction of the target sound source (step S412). If it is in the vicinity, the process returns to the impulse response processing in step S404 as it is. If not, the input direction of the second beamformer 42 is set from the estimated direction of the noise source using the averaging described in the third embodiment (step S413), and then the process returns to the impulse response estimation process of step S404. .

以降、以上の処理を繰り返す。なお、処理の単位は効率化のため例えば２５６点を１ブロックとしたブロック処理で行ってもよい。この場合、第１および第２のビームフォーマ４１，４２の処理は１ブロック分のデータを続けて処理した後のフィルタ係数を次のステップの方向推定処理の入力とするようにする。 Thereafter, the above processing is repeated. The unit of processing may be performed by block processing with 256 points as one block, for example, for efficiency. In this case, the processing of the first and second beamformers 41 and 42 uses the filter coefficient after processing one block of data as input for the direction estimation processing of the next step.

このように本実施形態では、チャネルｃｈ１，ｃｈ２の信号間のインパルス応答の成分値に基づいて二つのビームフォーマ４１，４２の適応処理の制御を行うようにしたことにより、各ビームフォーマ４１，４２の入力方向から信号が到来した場合の信号キャンセルによる劣化とフィルタの劣化を防ぐことができ、目的音源方向へのビームフォーマ４１，４２の追随と、これによる目的信号の抽出を高精度に行うことができる。 As described above, in the present embodiment, the adaptive processing of the two beam formers 41 and 42 is controlled based on the component value of the impulse response between the signals of the channels ch1 and ch2. Deterioration due to signal cancellation and filter deterioration when a signal arrives from the input direction of the signal can be prevented, and the beam formers 41 and 42 follow the target sound source direction and the target signal is extracted with high accuracy. Can do.

なお、以上の実施形態ではマイクロホンアレイ処理装置について説明したが、本発明はマイクロホンアレイ以外の素子、例えば複数個のアンテナ素子を配列したアンテナアレイから信号を扱う信号処理装置にも適用でき、要するに複数チャネルの信号を入力してフィルタ演算処理を施すことにより雑音（不要信号）を抑圧する信号処理装置であれば基本的に本発明を適用することができる。 Although the microphone array processing apparatus has been described in the above embodiment, the present invention can be applied to a signal processing apparatus that handles signals from elements other than the microphone array, for example, an antenna array in which a plurality of antenna elements are arranged. The present invention can be basically applied to any signal processing apparatus that suppresses noise (unnecessary signal) by inputting a channel signal and performing filter calculation processing.

本発明の第１の実施形態に係る信号処理装置の構成を示すブロック図The block diagram which shows the structure of the signal processing apparatus which concerns on the 1st Embodiment of this invention. 図１における適応ビームフォーマの構成を示すブロック図Block diagram showing the configuration of the adaptive beamformer in FIG. 図１におけるインパルス応答推定部の構成を示すブロック図The block diagram which shows the structure of the impulse response estimation part in FIG. 二つのマイクロホンの配置と目的音源の方向の関係を示す図Diagram showing the relationship between the arrangement of two microphones and the direction of the target sound source 二つのマイクロホンの配置と目的音源の方向および目的音源の許容到来角度範囲の関係を示す図Diagram showing the relationship between the arrangement of two microphones, the direction of the target sound source, and the allowable arrival angle range of the target sound source 二つのマイクロホンからの信号間の時間遅れについて説明するための図A diagram for explaining the time delay between signals from two microphones 同実施形態における一連の処理の流れを示すフローチャートA flowchart showing a flow of a series of processing in the embodiment 本発明の第２の実施形態を説明するための２チャネルのJim-Griffith型ＧＳＣにおけるフィルタの目的音源方向および雑音源方向の周波数特性を示す図The figure which shows the frequency characteristic of the target sound source direction of a filter, and the noise source direction in 2 channel Jim-Griffith type | mold GSC for demonstrating the 2nd Embodiment of this invention. 図８の周波数特性に対応するマイクロホンの配置と目的音源よび雑音源の方向の関係を示す図The figure which shows the relationship between the arrangement | positioning of the microphone corresponding to the frequency characteristic of FIG. 8, and the direction of a target sound source and a noise source 同実施形態における二つのマイクロホンの指向性および配置の種々の例を示す図The figure which shows the various directivity and arrangement | positioning of two microphones in the embodiment 一方のチャネルの入力側に遅延器を挿入したビームフォーマの構成を示すブロック図Block diagram showing the configuration of a beamformer with a delay unit inserted on the input side of one channel 本発明の第３の実施形態に係る信号処理装置の構成を示すブロック図The block diagram which shows the structure of the signal processing apparatus which concerns on the 3rd Embodiment of this invention. 同実施形態における雑音源および目的音源の方向の推定処理の手順を示すフローチャートThe flowchart which shows the procedure of the estimation process of the direction of a noise source and the target sound source in the embodiment 同実施形態における全体的な処理の流れを示すフローチャートThe flowchart which shows the flow of the whole process in the same embodiment 本発明の第４の実施形態に係る信号処理装置の構成を示すブロック図The block diagram which shows the structure of the signal processing apparatus which concerns on the 4th Embodiment of this invention. 同実施形態における処理の流れを示すフローチャートA flowchart showing a flow of processing in the embodiment

Explanation of symbols

１０−１〜１０−ｎ…入力端子
１１…適応ビームフォーマ
１２…インパルス応答推定部
１３…制御部
１４…遅延器
１５…ビームフォーマ本体
２１…減算器
２２…加算器
２３…遅延器
２４…適応フィルタ
２５…減算器
３１…遅延器
３２…適応フィルタ
３３…減算器
４１…第１のビームフォーマ
４２…第２のビームフォーマ
４３…雑音源方向推定部
４４…第１の制御部
４５…目的音源方向推定部
４６…第２の制御部
４７…インパルス応答推定部 DESCRIPTION OF SYMBOLS 10-1-10-n ... Input terminal 11 ... Adaptive beamformer 12 ... Impulse response estimation part 13 ... Control part 14 ... Delayer 15 ... Beamformer main body 21 ... Subtractor 22 ... Adder 23 ... Delayer 24 ... Adaptive filter DESCRIPTION OF SYMBOLS 25 ... Subtractor 31 ... Delay device 32 ... Adaptive filter 33 ... Subtractor 41 ... 1st beam former 42 ... 2nd beam former 43 ... Noise source direction estimation part 44 ... 1st control part 45 ... Target sound source direction estimation Unit 46 ... Second control unit 47 ... Impulse response estimation unit

Claims

A microphone array composed of two microphones arranged with different directivity directions;
An adaptive beam that performs filter calculation processing for suppressing noise with respect to the two-channel signal from the microphone array and outputting a signal from a target sound source input from an input direction perpendicular to a straight line connecting the two microphones A signal processing apparatus comprising: a former.

A microphone array composed of two microphones arranged with different directivity directions;
Noise is suppressed with respect to the two-channel signal from the microphone array, and the target sound source is input from an input direction that coincides with a straight line connecting the two microphones or an input direction that forms an arbitrary angle other than 90 ° with respect to the straight line. An adaptive beamformer that performs filter calculation processing to output a signal from
A signal processing apparatus comprising: a delay unit configured to match phases of two-channel signals input to the adaptive beamformer when a signal arrives from the direction.

A first adaptive beamformer that suppresses noise with respect to a signal of a plurality of channels and performs a filter calculation process for outputting a signal from a target sound source input from a set input direction;
A second adaptive beamformer that performs a filter calculation process for suppressing a signal from the target sound source with respect to the signals of the plurality of channels and outputting a signal from a noise source that is input from a set input direction;
For each direction obtained by adding the square of the absolute value of the inner product of the direction vector representing the propagation phase delay of the signals of the plurality of channels and each frequency component of the filter coefficient generated by the first adaptive beamformer with respect to the frequency. Noise source direction estimation means for estimating the direction in which the sensitivity of the noise is the minimum as the direction of the noise source,
The direction in which the sensitivity in each direction obtained by adding the square of the absolute value of the inner product of the direction vector and each frequency component of the filter coefficient generated by the second adaptive beamformer with respect to the frequency is minimized A target sound source direction estimating means for estimating the direction of the sound source;
First control means for setting the direction of the noise source estimated by the noise source direction estimation means as an input direction of the second adaptive beamformer;
And a second control unit configured to set the direction of the target sound source estimated by the target sound source direction estimation unit as an input direction of the first adaptive beamformer.