JP4724054B2

JP4724054B2 - Specific direction sound collection device, specific direction sound collection program, recording medium

Info

Publication number: JP4724054B2
Application number: JP2006165492A
Authority: JP
Inventors: 裕輔日岡; 和則小林; 章俊片岡; 賢一古家
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2006-06-15
Filing date: 2006-06-15
Publication date: 2011-07-13
Anticipated expiration: 2026-06-15
Also published as: JP2007336232A

Description

本発明は音声通話や機器の操作などハンズフリー方式で音声を取得する収音装置に関するものであり、特に収音装置から見て特定の方向に存在する音源からの音だけを強調して収音したい場合に適用して好適な特定方向収音装置に関する。 The present invention relates to a sound collection device that acquires a voice in a hands-free manner such as a voice call or operation of a device, and particularly emphasizes only sound from a sound source existing in a specific direction when viewed from the sound collection device. The present invention relates to a specific direction sound pickup device that is suitable for use when desired.

従来技術では、図１８に示すようにｘ−ｙ平面状のＭ個の異なる位置（ｐ_１，ｑ_１）〜（ｐ_Ｍ，ｑ_Ｍ）に配置されたマイクロホンｍｉｃ．１〜ｍｉｃ．Ｍを用いて、任意の角度θ_Ｓの方向にある音源から発生される音を信号とし、それ以外の方向で発せられる音を雑音とした場合に、信号のみを強調して高いＳＮＲ（信号雑音比）で収音する。図１９は従来の強調収音法の構成を示すブロック図である。位置（ｘ_ｍ，ｙ_ｍ）に配置されたマイクロホンｍで受音した信号ｘ_ｍ（ｎ）（ｍ＝１…Ｍ）に対し、式（１）のように遅延Ｄ_ｍを付加することにより信号ｙ_ｍ（ｎ）を得る。 In the conventional technique, as shown in FIG. 18, microphones mic. Arranged at _M different positions (p ₁ , q ₁ ) to (p _M , q _M ) on the xy plane are arranged. 1-mic. When using M as a sound generated from a sound source in the direction of an arbitrary angle θ _S as a signal and a sound generated in other directions as noise, only the signal is emphasized and a high SNR (signal noise) Ratio). FIG. 19 is a block diagram showing the configuration of a conventional enhanced sound collection method. The signal x _m (n) (m = 1... M) received by the microphone m arranged at the position (x _m , y _m ) is added by adding a delay D _m as shown in the equation (1). Obtain y _m (n).

ｙ_ｍ（ｎ）＝ｘ_ｍ（ｎ−Ｄ_ｍ）（１）
このとき遅延量Ｄ_ｍは、あらかじめ与えられた所望音源の方向θ_Ｓから、それぞれ式（２）により導出することができる。
Ｄ_ｍ＝（ｄ_ｍ／ｃ）ｓｉｎθ_Ｓ（２）
ここでｃは音速であり、ｄ_ｍは図１８においてθ_Ｓ方向から到来した音波から見たときの、マイクｍと基準点ｏの間の距離で、式（３）により表される。
ｄ_ｍ＝ｐ_ｍｓｉｎθ＋ｑ_ｍｃｏｓθ （３）
次にいま得られたｙ_ｍ（ｎ）を式（４）のように加算することで、所望位置から発せられる音を強調した信号ｚ（ｎ）が求められる。 y _m (n) = x _m (n−D _m ) (1)
At this time, the delay amount D _m can be derived from the desired sound source direction θ _S given in advance by the equation (2).
D _m = (d _m / c) sin θ _S (2)
Where c is the speed of sound, d _m is when viewed from the sound wave arriving from theta _S direction in FIG. 18, the distance between the microphone m and a reference point o, represented by the formula (3).
d _m = p _m sin θ + q _m cos θ (3)
Next, y _m (n) obtained now is added as shown in Expression (4) to obtain a signal z (n) that emphasizes the sound emitted from the desired position.

以上が従来の強調収音法（非特許文献１）である。この従来技術で形成される指向特性には図２０に示すように、主ビームＢＭに近接して比較的大きい利得を持つサイドローブＳＢとよばれる領域が生じるため、雑音を十分に抑圧することができない。またこのサイドローブＳＢの利得を極力小さく抑えるためには、マイクロホン数を増やし、またマイクロホンアレーを大型にする必要がある（非特許文献１）。
大賀寿郎、山崎芳男、金田豊共著、音響システムとディジタル処理、電子情報通信学会Ｐ．１８１〜Ｐ．１８６７．１．２．

The above is the conventional enhanced sound collection method (Non-Patent Document 1). As shown in FIG. 20, the directivity formed by this conventional technique has a region called a side lobe SB having a relatively large gain close to the main beam BM, so that noise can be sufficiently suppressed. Can not. In order to keep the gain of the side lobe SB as small as possible, it is necessary to increase the number of microphones and increase the size of the microphone array (Non-Patent Document 1).
Toshiro Oga, Yoshio Yamazaki, Yutaka Kaneda, Acoustic systems and digital processing, IEICE 181-P. 186 7.1.2.

従来技術を用いて収音装置の指向特性をある特定の方向に向け、その方向で発せられる音を強調し、それ以外の方向で発せられる音を抑圧して収音する場合に、従来技術により形成される指向特性はサイドローブを持つことから、本来抑圧したい方向から発せられる音が十分に抑圧されずに収音されてしまう問題があった。
このため強調したい音源の方向以外に非常に大きな音を発する雑音源が存在する場合に、従来技術の収音装置は所望音源に対する十分な強調効果が得られなかった。
また従来技術において、サイドローブを低減するには、マイクロホン数を増やし、またマイクロホンアレーを大型にしなければならず、実用する際には設置、運搬が困難であった。さらに従来技術による収音装置の指向特性は周波数によって変化するため、所望音や雑音のもつ周波数構造によっては、十分な強調効果が得られない問題があった。 When the sound collecting device is directed to a specific direction using conventional technology, the sound emitted in that direction is emphasized, and the sound emitted in other directions is suppressed and collected. Since the formed directional characteristic has side lobes, there is a problem in that sound emitted from the direction in which it is desired to be suppressed is collected without being sufficiently suppressed.
For this reason, when there is a noise source that emits a very loud sound other than the direction of the sound source to be emphasized, the sound collecting device of the prior art cannot obtain a sufficient enhancement effect for the desired sound source.
Further, in the prior art, in order to reduce the side lobes, the number of microphones must be increased and the microphone array must be increased in size, which is difficult to install and transport in practical use. Furthermore, since the directivity characteristics of the sound collecting device according to the prior art change depending on the frequency, there is a problem that a sufficient emphasis effect cannot be obtained depending on the frequency structure of the desired sound or noise.

本発明は以上の課題を解決されるためになされたもので、マイクロホンアレーの規模を拡大することなく、従来技術よりも高いＳＮＲで所望音源からの音を強調して収音する装置を実現することにある。 The present invention has been made to solve the above-described problems, and realizes an apparatus for enhancing and collecting sound from a desired sound source with an SNR higher than that of the prior art without increasing the scale of a microphone array. There is.

本発明では図３に示すように、収音する領域を複数の方向の角度領域Θ_１〜Θ_Ｑに分割し、マイクロホンアレーの指向性をそれぞれの方向の角度領域に向けるように制御して受音した信号を用いる。このときマイクロホンアレーによって処理された信号は、その処理前と比較して音源の存在する方向に応じてパワーが変化する。本発明ではこのパワーの変化量を利用して、それぞれの方向領域から到来する信号のパワーを推定する。そして推定されたパワーから、事前に与えられた方向の角度領域から到来する信号を強調するための利得係数を算出し、その利得係数を特定方向から到来した信号に乗算した信号を最終的な出力信号として得る。 In the present invention, as shown in FIG. 3, the sound collecting area is divided into angular areas Θ _{1 to} Θ _Q in a plurality of directions, and the directivity of the microphone array is controlled so as to be directed to the angular areas in the respective directions. Use the sound signal. At this time, the power of the signal processed by the microphone array changes in accordance with the direction in which the sound source exists, as compared to before the processing. In the present invention, the amount of change in power is used to estimate the power of a signal arriving from each direction area. Then, from the estimated power, a gain coefficient for emphasizing a signal coming from an angle region in a given direction is calculated, and a signal obtained by multiplying the signal arriving from a specific direction by the gain coefficient is finally output. Get as a signal.

本発明による特定方向収音装置の具体的な構成としては複数のマイクロホンを搭載して構成されるマイクロホンアレーの出力信号を利用してそれぞれが異なる方向の角度領域から到来する音を強調して収音する複数のビームフォーマー部と、複数のビームフォーマー部が収音した角度領域信号のそれぞれを複数の帯域成分に分割した周波数領域信号に変換する周波数領域変換部と、各周波数領域変換部が出力する周波数領域信号の中の所望方向の角度領域の角度領域信号に所属する特定方向周波数領域信号を選択する特定方向選択部と、各角度領域に含まれる信号の総和をその角度領域の信号量とし、複数のビームフォーマー部の各角度領域に対する指向特性から求められるパラメータを要素とするゲイン行列の逆行列に対し、各周波数領域変換部が出力する周波数領域信号の信号量を乗じ、各角度領域の信号量を推定する信号量推定部と、所望方向の角度領域における信号量と、全ての角度領域における信号量の総和との比により、周波数帯域毎の利得係数を算出する利得係数算出部と、利得係数算出部が算出した利得係数を特定方向周波数領域信号の各対応する周波数帯域の信号量に乗算する乗算部とを備えることを特徴とする。 As a specific configuration of the sound collecting device in a specific direction according to the present invention, the output signals of a microphone array configured by mounting a plurality of microphones are used to emphasize and collect sounds coming from angular regions in different directions. A plurality of beamformer units for sound, a frequency domain conversion unit for converting each of the angle domain signals collected by the plurality of beamformer units into a frequency domain signal divided into a plurality of band components, and each frequency domain conversion unit A specific direction selection unit that selects a specific direction frequency domain signal belonging to an angular domain signal of an angular domain in a desired direction from among the frequency domain signals output by the signal, and a sum of signals included in each angular domain is a signal of the angular domain Each frequency domain with respect to the inverse matrix of the gain matrix whose elements are parameters obtained from the directivity characteristics for each angular domain of multiple beamformer units The signal amount estimation unit that estimates the signal amount of each angle region by multiplying the signal amount of the frequency region signal output by the conversion unit, the signal amount in the angle region of the desired direction, and the sum of the signal amount in all angle regions A gain coefficient calculation unit that calculates a gain coefficient for each frequency band according to the ratio, and a multiplication unit that multiplies the signal amount of each corresponding frequency band of the specific direction frequency domain signal by the gain coefficient calculated by the gain coefficient calculation unit. It is characterized by that.

本発明による特定方向集音装置は更に複数のマイクロホンを搭載して構成されているマイクロホンアレーの出力信号を利用してそれぞれが異なる方向の角度領域から到来する音を強調して収音する複数のビームフォーマー部と、複数のビームフォーマー部が収音した角度領域信号のそれぞれを複数の帯域成分に分割した周波数領域信号に変換する周波数領域変換部と、周波数領域変換部が出力する周波数領域信号を周波数帯域成分に分割する帯域分割部と、帯域分割部が出力する帯域成分を同一帯域成分毎に収集する同一帯域成分収集部と、同一帯域成分収集部の出力側に設けられ、各同一帯域成分収集部から出力される帯域成分を合成する帯域合成部とを備え、同一帯域成分収集部は、ビームフォーマー部の何れかが収音する所望方向の角度領域信号に所属する特定方向帯域成分を選択する特定方向選択部と、各角度領域に含まれる信号の総和をその角度領域の信号量とし、複数のビームフォーマー部の各角度領域及び周波数帯域に対する指向特性から求められるパラメータを要素とするゲイン行列の逆行列に対し、各帯域分割部が出力する周波数帯域成分の信号量を乗じ、各角度領域の信号量を推定する信号量推定部と、所望方向の角度領域における信号量と、全ての角度領域における信号量の総和との比により、利得係数を算出する利得係数算出部と、利得係数算出部が算出した利得係数を前記特定方向帯域成分に乗算する乗算部とを備えることを特徴とする。 The specific direction sound collecting apparatus according to the present invention further uses a plurality of microphones configured to mount a plurality of microphones to emphasize and collect a plurality of sounds that are collected from angle regions in different directions. A beamformer unit, a frequency domain conversion unit that converts each of the angle domain signals collected by a plurality of beamformer units into a frequency domain signal divided into a plurality of band components, and a frequency domain output by the frequency domain conversion unit Provided on the output side of the same band component collection unit, the band division unit that divides the signal into frequency band components, the same band component collection unit that collects the band components output by the band division unit for each same band component, and the same A band synthesizing unit that synthesizes the band components output from the band component collecting unit, and the same band component collecting unit is an angle in a desired direction in which any of the beam former units collects sound. A specific direction selection unit for selecting a specific direction band component belonging to the area signal, the sum of the signals included in each angular region as the signal of the angle regions, for each angle and frequency bands of a plurality of beamformer unit A signal amount estimation unit that estimates the signal amount of each angle region by multiplying the inverse matrix of the gain matrix whose elements are parameters obtained from directivity characteristics by the signal amount of the frequency band component output by each band dividing unit ; A gain coefficient calculation unit that calculates a gain coefficient based on a ratio of a signal amount in a direction angle region and a sum of signal amounts in all angle regions, and the gain coefficient calculated by the gain factor calculation unit as the specific direction band component And a multiplication unit for multiplying.

更に本発明による特定方向収音装置は前記した特定方向収音装置において、周波数領域信号の信号量は、信号のパワー値である、または、前記周波数帯域成分の信号量は、信号のパワー値であることを特徴とする。
更に、本発明による特定方向収音装置の特徴とする点は、前記した特定方向収音装置において、周波数領域信号の信号量は、信号の絶対値である、または、周波数帯域成分の信号量は、信号の絶対値であることを特徴とする。
更に、本発明による特定方向収音装置の特徴とする点は、前記した特定方向収音装置において、利得係数算出部の利得係数算出特性は所望方向の角度領域における信号量に対して、その他の角度領域における信号量が無視できる程度に微少値である場合、利得係数の値は所定の最大値で与えられ、所望方向の角度領域における信号量がその他の角度領域における信号量に対して、特定領域信号の量が無視できる程度に微少値である場合は利得係数の値はほぼ０に近い値となる特性で算出することを特徴とする。 Furthermore, in the specific direction sound pickup device according to the present invention, the signal amount of the frequency domain signal is a signal power value, or the signal amount of the frequency band component is a signal power value. characterized in that there.
Further, the specific direction sound pickup device according to the present invention is characterized in that, in the above-described specific direction sound pickup device, the signal amount of the frequency domain signal is an absolute value of the signal, or the signal amount of the frequency band component is The absolute value of the signal .
Moreover, the point which is characterized in a particular direction and collection device according to the present invention, in the specific direction and collecting apparatus described above, the gain factor calculating characteristics of the gain coefficient calculation unit for the signal amount in the desired direction of angular regions, other When the signal amount in the angle region is negligibly small, the gain coefficient value is given as a predetermined maximum value, and the signal amount in the angle region in the desired direction is specified relative to the signal amount in other angle regions . When the amount of the area signal is a negligible value, the gain coefficient value is calculated with a characteristic that is almost close to zero.

更に、本発明による特定方向収音装置の特徴とする点は、前記した特定方向収音装置において、利得係数算出部の利得係数算出特性は、所望方向の角度領域における信号量に対してその他の角度領域の信号量が小さい帯域では利得係数の値を最大値乃至は最大値に近い値に維持させ、所望方向の角度領域における信号量に対し、その他の角度領域の信号量が大きい領域では利得係数の値をほぼ０乃至０に近い値に維持させる特性で算出することを特徴とする。 Further, the specific direction sound pickup device according to the present invention is characterized in that, in the above-described specific direction sound pickup device, the gain coefficient calculation characteristic of the gain coefficient calculation unit is different from the signal amount in the angle region of the desired direction . the maximum value or the value of the signal amount is small bandwidth the gain coefficient of the angular region is kept at a value close to the maximum value, against the amount of signal in the desired direction of angular region, the gain in the region signal amount is large and other angular regions The coefficient is calculated with a characteristic that maintains the value of the coefficient at a value close to 0 to nearly 0.

本発明による特定方向収音装置によれば、所望方向の音源が発する音を強調して収音する際の強調効果を改善するために、マイクロホンアレーによって受音した信号を用いて複数のビームフォーマー部処理の結果から各音源が発する音信号のパワーを推定し、ＳＮ比に対応する利得係数を計算して所望音信号を強調する。このため従来技術で同様の効果を得るためにはマイクロホンの本数の増大やマイクロホンアレーの大型化を図る必要が有ったが、本発明によればこれらの必要が無く、実用において設置や運搬が容易な小規模システムのまま強調効果を改善する効果がある。
また本発明による特定方向収音装置では、マイクロホンアレーによって受音された信号のパワー値又は絶対値を周波数成分ごとに推定して強調処理を行うため、所望音や雑音の持つ周波数構造によらず高い強調効果を得ることができる。 According to the specific direction sound collecting apparatus of the present invention, in order to improve the enhancement effect when sound is collected by enhancing the sound emitted from a sound source in a desired direction, a plurality of beam forks are used using signals received by the microphone array. The power of the sound signal emitted from each sound source is estimated from the result of the image processing, and the desired sound signal is emphasized by calculating a gain coefficient corresponding to the SN ratio. For this reason, in order to obtain the same effect with the prior art, it was necessary to increase the number of microphones and increase the size of the microphone array. It has the effect of improving the emphasis effect with an easy small system.
Further, in the specific direction sound pickup device according to the present invention, the power value or absolute value of the signal received by the microphone array is estimated for each frequency component and the enhancement process is performed. Therefore, regardless of the frequency structure of the desired sound or noise. A high emphasis effect can be obtained.

本発明による特定方向収音装置を実施する場合、全てをハードウェアによって構成することも可能であるが、最も簡素に実施するにはコンピュータに、本発明による特定方向収音プログラムをインストールし、コンピュータに特定方向収音装置として機能させる実施形態が最良の形態である。
コンピュータに本発明による特定方向収音装置として機能させるには、コンピュータにインストールした特定方向収音プログラムによりコンピュータ内にマイクロホンアレーの出力信号を利用してそれぞれが異なる方向の角度領域から到来する音を強調して収音する複数のビームフォーマー部と、複数のビームフォーマー部が収音した角度領域信号のそれぞれを複数の帯域成分に分割した周波数領域信号に変換する周波数領域変換部と、各周波数領域変換部が出力する周波数領域信号の中の所望方向の角度領域の角度領域信号に所属する特定方向周波数領域信号を選択する特定方向選択部と、特定方向周波数領域信号の信号量と、他の方向の角度領域信号から周波数領域信号に変換された周波数領域信号の総和の量を推定する信号量推定部と、特定方向周波数領域信号の信号量と、他の方向の角度領域信号から周波数領域信号に変換された周波数領域信号の総和量を推定する信号量推定部と、特定方向周波数領域信号の信号量と、この特定方向周波数領域信号の信号量を含む他の周波数領域信号の総和量との比により周波数領域毎の利得係数を算出する利得係数算出部と、利得係数算出部が算出した利得係数を特定方向領域信号の各対応する周波数帯域の信号量に乗算する乗算部とを構築し、特定方向収音装置として機能させる。 When implementing the specific direction sound pickup apparatus according to the present invention, it is possible to configure all by hardware, but in the simplest implementation, the specific direction sound pickup apparatus according to the present invention is installed in a computer, and the computer The best mode is to let the device function as a specific direction sound pickup device.
In order for a computer to function as a specific direction sound pickup device according to the present invention, sound that arrives from angular regions in different directions by using a microphone array output signal in the computer by a specific direction sound pickup program installed in the computer. A plurality of beamformer sections for emphasizing and collecting sound, a frequency domain conversion section for converting each of the angle domain signals collected by the plurality of beamformer sections into frequency domain signals divided into a plurality of band components, and A specific direction selection unit that selects a specific direction frequency domain signal belonging to an angular domain signal of an angular domain in a desired direction from among the frequency domain signals output by the frequency domain conversion unit, a signal amount of the specific direction frequency domain signal, and the like A signal amount estimation unit for estimating the sum of frequency domain signals converted from frequency domain signals into angle domain signals in the direction of A signal amount estimation unit that estimates a signal amount of a specific direction frequency domain signal, a total amount of frequency domain signals converted from a frequency domain signal from an angle domain signal in another direction, and a signal amount of a specific direction frequency domain signal, A gain coefficient calculation unit that calculates a gain coefficient for each frequency domain based on a ratio with the total amount of other frequency domain signals including the signal quantity of the specific direction frequency domain signal, and the gain coefficient calculated by the gain coefficient calculation unit in a specific direction A multiplication unit that multiplies the signal amount of each corresponding frequency band of the region signal is constructed and functions as a specific direction sound collecting device.

はじめに本発明の全体の概要を説明する。図１は本発明の特定方向収音装置の全体構成を示している。Ｍ（≧２）個のマイクロホンから構成されるマイクロホンアレー１１によって受音された信号ｘ_ｍ（ｎ）（ｍ＝１，２，…，Ｍ）は、第１ビームフォーマー部１２−１から第Ｑビームフォーマー部１２−ＱまでのＱ個のビームフォーマー部１２−１〜１２−Ｑに入力される。ここでｎは離散時間信号のサンプル番号を表す。
ビームフォーマー部１２−１〜１２−Ｑでは、例えば図２に示すような指向性のビームＢＭを、図３であらかじめ与えられたＱ個の方向領域Θ_１〜Θ_Ｑのいずれかに向け、該当する方向領域で発せられる音を強調して収音する処理を行い、結果を出力する。各ビームフォーマー部１２−１〜１２−Ｑの出力信号ｙ_１（ｎ）、ｙ_２（ｎ）、…、ｙ_Ｑ（ｎ）はそれぞれ周波数領域変換部１３−１〜１３−Ｑに入力され、周波数領域変換部１３−１〜１３−Ｑの出力信号Ｙ_１（ω，ｌ）、Ｙ_２（ω，ｌ）、…Ｙ_Ｑ（ω，ｌ）は信号量推定部１４と特定方向選択部１５にそれぞれ入力される。 First, an overall outline of the present invention will be described. FIG. 1 shows the overall configuration of a sound collecting device in a specific direction according to the present invention. Signals x _m (n) (m = 1, 2,..., M) received by the microphone array 11 composed of M (≧ 2) microphones are transmitted from the first beam former unit 12-1. The signals are input to Q beam former units 12-1 to 12-Q up to the Q beam former unit 12-Q. Here, n represents the sample number of the discrete time signal.
In the beam former units 12-1 to 12-Q, for example, a directional beam BM as shown in FIG. 2 is directed to any one of the _Q direction regions Θ _{1 to} Θ Q given in advance in FIG. A process of collecting sound by emphasizing the sound emitted in the corresponding direction area is performed, and the result is output. The output signals y ₁ (n), y ₂ (n),..., Y _Q (n) of the beam former units 12-1 to 12-Q are respectively input to the frequency domain transform units 13-1 to 13-Q. The output signals Y ₁ (ω, l), Y ₂ (ω, l),... Y _Q (ω, l) of the frequency domain transforming units 13-1 to 13-Q are the signal amount estimating unit 14 and the specific direction selecting unit. 15 respectively.

信号量推定部１４は、入力されたビームフォーマー部１２−１〜１２−Ｑの出力信号パワーから各方向領域Θ_１〜Θ_Ｑにおける音源から発せられる音信号の総和のパワー成分を求め、これを１つのベクトルにまとめた信号パワーベクトルＸ_ｅｓｔ※（ω，ｌ）（以下文中で※が付された英文字はベクトルを表わす。また数式中肉太文字はベクトルを表わす）を出力する。
特定方向選択部１５は、強調したい方向領域に指向性のビームを向けたビームフォーマー部の出力を選択しＹ_Ｓ（ω，ｌ）として出力する。 The signal amount estimation unit 14 obtains a power component of the sum of sound signals emitted from the sound sources in the respective direction regions Θ _{1 to} Θ _Q from the output signal power of the input beam former units 12-1 to 12 _-Q . The signal power vector X _est * (ω, l) in which the symbols are combined into one vector (in the following sentence, the English character with * represents a vector, and the bold character in the formula represents a vector).
The specific direction selection unit 15 selects the output of the beam former unit that directs the directional beam to the direction region to be emphasized, and outputs it as Y _S (ω, l).

利得係数算出部１６は、入力された信号パワーベクトルＸ_ｅｓｔ※（ω，ｌ）から利得係数Ｒ（ω，ｌ）を算出し、出力する。利得係数Ｒ（ω，ｌ）は乗算部１７に入力される。乗算部１７は入力された利得係数Ｒ（ω，ｌ）と特定方向選択部１５の出力Ｙ_Ｓ（ω，ｌ）を同じ周波数の成分ごとに掛け算した結果を出力する。乗算部１７の出力信号Ｙ_ＳＲ（ω，ｌ）は逆周波数領域変換部１８に入力され、逆離散フーリエ変換を行って時間信号に復元された信号ｙ（ｎ）が出力される。この信号ｙ（ｎ）が本発明の装置によって所望音が強調されて収音された信号である。
ビームフォーマー部１２−１〜１２−Ｑ、信号量推定部１４，利得係数算出部１６，特定方向選択部１５の詳細は別の図を用いて以下に順に説明する。 The gain coefficient calculation unit 16 calculates the gain coefficient R (ω, l) from the input signal power vector X _est * (ω, l) and outputs it. The gain coefficient R (ω, l) is input to the multiplication unit 17. The multiplier 17 outputs the result of multiplying the input gain coefficient R (ω, l) and the output Y _S (ω, l) of the specific direction selector 15 for each component of the same frequency. The output signal Y _SR (ω, l) of the multiplication unit 17 is input to the inverse frequency domain transform unit 18, and a signal y (n) restored to a time signal by performing inverse discrete Fourier transform is output. This signal y (n) is a signal picked up by enhancing the desired sound by the apparatus of the present invention.
The details of the beam former units 12-1 to 12-Q, the signal amount estimation unit 14, the gain coefficient calculation unit 16, and the specific direction selection unit 15 will be described in order below with reference to another drawing.

（ビームフォーマー部）
図４はビームフォーマー部１２−１〜１２−Ｑの中の一つの例えばビームフォーマー部１２−ｑの構成を示している。同様の処理がすべてのビームフォーマー部において行われる。入力された信号ｘ_ｍ（ｎ）（ｍ＝１，２，…，Ｍ）はフィルタ処理部ＦＣ１〜ＦＣＭに入力される。フィルタ処理部ＦＣ１〜ＦＣＭではあらかじめ与えられた（決定方法は後述する）フィルタ係数Ｗ_ｑｍ（ｎ）を、式（５）に示す畳み込み演算に代入して得られる信号ｘ’_ｑｍ（ｎ）を出力する。

各フィルタ処理部ＦＣ１〜ＦＣＭの出力信号は加算部ＡＤＤに入力される。加算部ＡＤＤでは入力信号を式（６）のように加算し、ビームフォーマー部の出力信号ｙ_ｑ（ｎ）（ｑ＝１…Ｑ）を得る。

ここでフィルタ係数Ｗ_ｑｍ（ｎ）は、それぞれのビームフォーマー部１２−１〜１２−Ｑの指向特性Ｄ_ｑ（ω，θ）が、図３に示すあらかじめ与えられた第Ｑ方向領域Θ_Ｑで発せられる音を強調して受音し、それ以外の方向で発せられる音を抑圧するように構成される。 (Beam former part)
FIG. 4 shows the configuration of one of the beam formers 12-1 to 12-Q, for example, the beam former 12-q. Similar processing is performed in all beam former units. The input signal x _m (n) (m = 1, 2,..., M) is input to the filter processing units FC1 to FCM. The filter processing units FC1 to FCM output a signal x ′ _qm (n) obtained by substituting a filter coefficient W _qm (n) given in advance (determination method will be described later) into the convolution operation shown in Expression (5). To do.

The output signals of the filter processing units FC1 to FCM are input to the adding unit ADD. The adder ADD adds the input signals as shown in Expression (6) to obtain the output signal y _q (n) (q = 1... Q) of the beam former.

Here, the filter coefficient W _qm (n) indicates that the directivity characteristics D _q (ω, θ) of the respective beam former units 12-1 to 12-Q are given in the Q-direction region Θ _Q given in advance shown in FIG. It is configured to emphasize and receive the sound emitted from and to suppress the sound emitted in other directions.

（信号量推定部）
図５は信号量推定部１４の構成を示している。信号量推定部１４に入力される周波数成分Ｙ_１（ω，ｌ）、Ｙ_２（ω，ｌ）、…、Ｙ_Ｑ（ω，ｌ）はそれぞれパワー演算部ＰＷ−１〜ＰＷ−Ｑに入力され、信号のパワー値｜Ｙ_１（ω，ｌ）｜^２、｜Ｙ_２（ω，ｌ）｜^２、…、｜Ｙ_Ｑ（ω，ｌ）｜^２が出力され、ベクトル化部１４Ａに入力される。ベクトル化部１４Ａでは、入力されたパワー値を式（７）のようにベクトル形式でまとめた、ビームフォーマー部出力パワーベクトルＹ※（ω，ｌ）を出力する。

ビームフォーマー部出力パワーベクトルＹ※（ω，ｌ）は乗算部１４Ｂに入力される。乗算部１４Ｂのもう一方の入力であるパワー推定行列Ｔ^−１※は、逆行列演算部１４Ｃの出力信号である。逆行列演算部１４Ｃには式（８）により定義されるゲイン行列Ｔ※が入力され、その逆行列Ｔ^−１※を出力する。 (Signal amount estimation unit)
FIG. 5 shows the configuration of the signal amount estimation unit 14. The frequency components Y ₁ (ω, l), Y ₂ (ω, l),..., Y _Q (ω, l) input to the signal amount estimation unit 14 are input to the power calculation units PW-1 to PW-Q, respectively. The signal power values | Y ₁ (ω, l) | ² , | Y ₂ (ω, l) | ² ,..., | Y _Q (ω, l) | ² are output and input to the vectorization unit 14A. Is done. The vectorization unit 14A outputs a beamformer unit output power vector Y * (ω, l) in which input power values are collected in a vector format as shown in Expression (7).

The beamformer unit output power vector Y * (ω, l) is input to the multiplication unit 14B. The power estimation matrix T ⁻¹ *, which is the other input of the multiplier 14B, is an output signal of the inverse matrix calculator 14C. A gain matrix T * defined by Expression (8) is input to the inverse matrix calculation unit 14C, and the inverse matrix T ⁻¹ * is output.

ゲイン行列Ｔ※の各要素は、図６に示すように各ビームフォーマー部１２−１〜１２−Ｑの各方向領域に対する指向特性から求められるパラメータであり、例えば式（９）に示すような指向特性の周波数および方向に関する平均値を用いる。なお指向特性は、例えば前述の非特許文献１に記載されている周知の技術を用いてフィルタ係数Ｗ_ｍ（ｎ）より求めることができる。

α_ｐｑは第ｐビームフォーマー部の第ｑ方向領域に対する指向特性の平均値である。乗算部１４Ｂは式（１６）に示すように入力されたビームフォーマー部出力パワーベクトルとパワー推定行列の乗算を周波数成分ごとに行い、推定信号パワーベクトルＸ_ｅｓｔ※（ω，ｌ）を出力する。

Each element of the gain matrix T * is a parameter obtained from directivity characteristics with respect to each direction region of each beamformer unit 12-1 to 12-Q as shown in FIG. 6, for example, as shown in Expression (9). The average value for the frequency and direction of the directivity is used. The directivity can be obtained from the filter coefficient W _m (n) using a known technique described in Non-Patent Document 1, for example.

α _pq is an average value of directivity with respect to the q-th direction region of the p-th beam former part. The multiplication unit 14B performs multiplication of the input beamformer unit output power vector and the power estimation matrix for each frequency component as shown in Expression (16), and outputs an estimated signal power vector X _est * (ω, l). .

（特定方向選択部）
図７は特定方向選択部１５の構成を示している。特定方向選択部１５では各ビームフォーマー部１２−１〜１２−Ｑより周波数領域変換部１３−１〜１３−Ｑを経て入力された周波数成分Ｙ_１（ω，ｌ）からＹ_Ｑ（ω，ｌ）のうち、強調したい第ｑｓ方向領域に対応するものを選択してＹ_Ｓ（ω，ｌ）として出力する。
Ｙ_Ｓ（ω，ｌ）＝Ｙ_ｑｓ（ω，ｌ）（１１） (Specific direction selector)
FIG. 7 shows the configuration of the specific direction selection unit 15. In the specific direction selection unit 15, the frequency components Y ₁ (ω, l) input from the beam former units 12-1 to 12 -Q through the frequency domain conversion units 13-1 to 13 -Q to Y _Q (ω, Among the l), the one corresponding to the qs direction region to be emphasized is selected and output as Y _S (ω, l).
Y _S (ω, l) = Y _qs (ω, l) (11)

（利得係数算出部）
図８は利得係数算出部１６における処理の流れを示している。信号量推定部１４より入力された推定信号パワーベクトルＸ_ｅｓｔ※（ω，ｌ）はベクトル要素抽出部１６Ａに入力される。ベクトル要素抽出部１６Ａでは式（１２）に示すように、入力された推定信号パワーベクトルの第１成分を第１方向領域信号推定パワー｜Ｓ_１（ω，ｌ）｜^２、第２成分を第２方向領域信号推定パワー、｜Ｓ_２（ω，ｌ）｜^２のように第ｑ成分を第ｑ方向領域からの信号推定パワーとしてそれぞれ出力し、それらはＳＮ比推定部１６Ｂに入力される。

ＳＮ比推定部１６Ｂでは式（１３）を用いて所望方向領域の信号を強調する利得係数Ｒ（ω，ｌ）を計算し出力する。

利得係数Ｒ（ω，ｌ）は図９に示すように、特定方向信号｜Ｓ_ｑｓ（ω，ｌ）｜^２と角度領域の信号の総和Σ_ｑ｜Ｓ_ｑ（ω，ｌ）｜^２の関係が｜Ｓ_ｑｓ（ω，ｌ）｜^２≒Σ_ｑ｜Ｓ_ｑ（ω，ｌ）｜^２であった場合は利得係数Ｒ（ω，ｌ）は最大値である例えば「１」が与えられ、｜Ｓ_ｑｓ（ω，ｌ）｜^２＜＜Σ_ｑ｜Ｓ_ｑ（ω，ｌ）｜^２であった場合は利得係数Ｒ（ω，ｌ）は≒０を与える。 (Gain coefficient calculator)
FIG. 8 shows the flow of processing in the gain coefficient calculation unit 16. The estimated signal power vector X _est * (ω, l) input from the signal amount estimation unit 14 is input to the vector element extraction unit 16A. In the vector element extraction unit 16A, as shown in Expression (12), the first component of the input estimated signal power vector is the first direction area signal estimated power | S ₁ (ω, l) | ² , and the second component is the second component. The q-th component is output as the signal estimation power from the q-th direction area as in the two-direction area signal estimation power | S ₂ (ω, l) | ² , and these are input to the SN ratio estimation unit 16B.

The signal-to-noise ratio estimation unit 16B calculates and outputs a gain coefficient R (ω, l) that enhances the signal in the desired direction region using Expression (13).

Gain factor R (omega, l), as shown in FIG. 9, a specific direction signal _{| S qs (ω, l)} | sum of signals ² and angular region _{_{Σ q | S q (ω,}} l) | 2 of the relationship Is | S _qs (ω, l) | ² ≈Σ _q | S _q (ω, l) | ² , the gain coefficient R (ω, l) is a maximum value, for example, “1”, When | S _qs (ω, l) | ² << Σ _q | S _q (ω, l) | ² , the gain coefficient R (ω, l) gives ≈0.

図９に示した利得係数の特性によれば特定方向信号成分Ｓ_ｑｓ（ω，ｌ）が、他の角度領域の信号成分Ｓ_ｑ（ω，ｌ）より大きい程利得係数Ｒ（ω，ｌ）は「１」に近づき、その周波数帯域の信号は減衰されることなく、そのまま出力される。特定方向信号成分Ｓ_ｑｓ（ω，ｌ）が、他の角度領域信号成分Ｓ_ｑ（ω，ｌ）より小さい程、利得係数Ｒ（ω，ｌ）は「０」に近い値となり、この場合は信号成分は利得係数Ｒ（ω，ｌ）の乗算によって減衰される。この結果、利得係数Ｒ（ω，ｌ）を特定方向選択部１５で取り出した特定方向信号に乗算することにより特定方向信号成分が強調されて出力される。 According to the characteristics of the gain coefficient shown in FIG. 9, the gain coefficient R (ω, l) increases as the specific direction signal component S _qs (ω, l) is larger than the signal component S _q (ω, l) in the other angle region. Approaches “1”, and the signal in that frequency band is output without being attenuated. As the specific direction signal component S _qs (ω, l) is smaller than the other angle region signal component S _q (ω, l), the gain coefficient R (ω, l) becomes closer to “0”. The signal component is attenuated by multiplication of the gain factor R (ω, l). As a result, the specific direction signal component is emphasized and output by multiplying the specific direction signal extracted by the specific direction selection unit 15 by the gain coefficient R (ω, l).

ここで本発明により所望音を選択強調した収音が可能になる原理について説明する。ビームフォーマー部出力パワーベクトルＹ※（ω，ｌ）の各要素である、各ビームフォーマー部の出力パワーは、式（１４）に示すように、マイクロホンアレーが受音した信号Ｘ_θ※（ω，ｌ）のパワーにその信号の音源方向および周波数に基づく指向特性が乗算された形で近似することができる。ただしここでは各音源の発する音は互いに無相関とし、すべてのマイクロホンにおいて音は同レベルで受音されると仮定している。 Here, the principle that enables the sound collection by selectively emphasizing the desired sound according to the present invention will be described. The output power of each beamformer section, which is each element of the beamformer section output power vector Y * (ω, l), is a signal X _θ * (*) received by the microphone array as shown in Expression (14). It can be approximated by multiplying the power of ω, l) by the directivity characteristics based on the sound source direction and frequency of the signal. However, here, it is assumed that the sounds emitted by the sound sources are uncorrelated with each other, and the sound is received at the same level in all microphones.

いま信号Ｘ_Θｑ※（ω，ｌ）は図３で示したいずれかの方向領域Θ_ｑに含まれる信号の総和を表すとする。このときビームフォーマー部の指向特性は方向領域内で一様であると仮定すると、Ｙ※（ω，ｌ）は式（１５）により表される。本実施例では各領域に対する指向特性の代表値として式（９）により求めた指向特性の平均値を用いている。

Now it signals _X Θq ※ (ω, l) is a representative of the sum of the signal included in either direction region theta _q shown in FIG. At this time, assuming that the directivity characteristic of the beam former is uniform in the direction region, Y * (ω, l) is expressed by Expression (15). In this embodiment, the average value of the directivity obtained by the equation (9) is used as the representative value of the directivity for each region.

以上の関係より、あらかじめ与えられている行列Ｔ※の逆行列を左側からビームフォーマー部出力ベクトルＹ※（ω，ｌ）にかけることで、Ｘ＾※（ω，ｌ）の推定値である推定信号パワーベクトルＸ_ｅｓｔ※（ω，ｌ）が求められる。

From the above relationship, the estimated value of X ^ * (ω, l) is obtained by applying the inverse matrix of the matrix T * given in advance to the beamformer unit output vector Y * (ω, l) from the left side. An estimated signal power vector X _est * (ω, l) is obtained.

第２の実施例は、実施例１の利得係数算出部１６における手順に変更を加えたものである。図１０は第２の実施例において用いられる利得係数算出部１６の処理手順を示したものである。実施例１における利得係数算出部１６との相違点は非線形処理部１６Ｃが追加された点である。非線形処理部１６ＣではＳＮ比推定部１６Ｂから出力される図９に示した特性の元利得係数をＲ’（ω，ｌ）と表記することとし、この元利得係数Ｒ’（ω，ｌ）に０から１の間で変動する非線形関数を乗算した計算結果である利得係数Ｒ（ω，ｌ）を出力する。ここでは非線形関数はあらかじめ与えられているもので、元利得係数Ｒ’（ω，ｌ）が大きい領域つまりΣ_ｑ｜Ｓ_ｑ（ω，ｌ）｜^２≒｜Ｓ_ｑｓ（ω，ｌ）｜^２の領域では１または１に近い値を維持し、元利得係数Ｒ’（ω，ｌ）が小さい領域つまりΣ_ｑ｜Ｓ_ｑ（ω，ｌ）｜^２＞＞｜Ｓ_ｑｓ（ω，ｌ）｜^２の領域では０または０に近い値を維持する関数で、たとえば式（１７）に示すハイポブリックタンジェント（例えばＺ１：Ｒ（ω，ｌ）＝１／２＋１／２ｔａｎｈ（ρＲ’（ω，ｌ）−ν））や式（１８）に示す対数関数（例えばＺ２：Ｒ（ω，ｌ）＝１／２＋１／２ｔａｎｈ（ρ（１０ｌｏｇ_１０）Ｒ’（ω，ｌ）−ν））と組み合わせたものなどが用いられる。図１１に非線形関数の一例を示す。 In the second embodiment, the procedure in the gain coefficient calculation unit 16 of the first embodiment is modified. FIG. 10 shows a processing procedure of the gain coefficient calculation unit 16 used in the second embodiment. The difference from the gain coefficient calculation unit 16 in the first embodiment is that a nonlinear processing unit 16C is added. In the nonlinear processing unit 16C, the original gain coefficient of the characteristic shown in FIG. 9 output from the SN ratio estimation unit 16B is expressed as R ′ (ω, l), and this original gain coefficient R ′ (ω, l) A gain coefficient R (ω, l), which is a calculation result obtained by multiplying a nonlinear function that varies between 0 and 1, is output. Here, the nonlinear function is given in advance, and the region where the original gain coefficient R ′ (ω, l) is large, that is, Σ _q | S _q (ω, l) | ² ≈ | S _qs (ω, l) | ² In the region, the value is maintained at 1 or close to 1, and the original gain coefficient R ′ (ω, l) is small, that is, Σ _q | S _q (ω, l) | ² >> | S _qs (ω, l) | ² is a function that maintains 0 or a value close to 0. For example, the hypothetical tangent shown in the equation (17) (for example, Z1: R (ω, l) = 1/2 + 1 / 2tanh (ρR ′ (ω, l) -Ν)) and a logarithmic function (for example, Z2: R (ω, l) = 1/2 + 1 / 2tanh (ρ (10 log ₁₀ ) R ′ (ω, l) −ν)) shown in equation (18) Things are used. FIG. 11 shows an example of the nonlinear function.

ここでρ、νは非線形関数の特定を変化させるパラメータで任意に設定される。これ以外の部分に関しては実施例１と同様であるので説明を省略する。図１１に示した非線形特性によれば、所望音声が優勢な周波数領域ではその周波数成分を強調し、背景雑音が優勢な周波数領域ではその周波数成分を抑圧することができ、本発明による特定方向強調量を向上させる効果がある。

Here, ρ and ν are arbitrarily set as parameters for changing the specification of the nonlinear function. Since other parts are the same as those in the first embodiment, the description thereof is omitted. According to the non-linear characteristic shown in FIG. 11, the frequency component can be emphasized in the frequency region where the desired speech is dominant, and the frequency component can be suppressed in the frequency region where the background noise is dominant. There is an effect of improving the amount.

第３の実施例は、実施例１の信号量推定部１４と、利得係数算出部１６と、乗算部１７における処理手順に変更を加えたものである。図１２は第３の実施例における本発明の全体構成を示している。実施例１との相違点は周波数領域変換部１３−１〜１３−Ｑの各後段に帯域分割部１９−１〜１９−Ｑを設けた点と、この帯域分割部１９−１〜１９−Ｑで帯域分割した各帯域成分を同一周波数帯域毎に収集する同一帯域成分収集部２０−１〜２０−Ωとを設けた構成とした点である。
各同一帯域成分収集部２０−１〜２０−Ωにはそれぞれに信号量推定部１４と、特定方向選択部１５と、利得係数算出部１６と、乗算部１７とを設ける。従って、この実施例３での特徴は信号量推定部１４と、特定方向選択部１５と、利得係数算出部１６，乗算部１７の各処理がΩ個の異なる同一帯域成分収集部２０−１、２０−２、…２０−Ωにおいて別々に行われ、同一帯域成分収集部２０−１〜２０−Ωに設けた乗算部１７の出力が帯域合成部２１により合成されている点を特徴とする。 In the third embodiment, the processing procedure in the signal amount estimation unit 14, the gain coefficient calculation unit 16, and the multiplication unit 17 in the first embodiment is changed. FIG. 12 shows the overall configuration of the present invention in the third embodiment. The difference from the first embodiment is that band division units 19-1 to 19-Q are provided in the subsequent stages of the frequency domain conversion units 13-1 to 13-Q, and the band division units 19-1 to 19-Q. It is the point which set it as the structure which provided the same band component collection part 20-1-20- (omega | ohm) which collects each band component divided | segmented by (5) for every same frequency band.
Each of the same band component collection units 20-1 to 20-Ω includes a signal amount estimation unit 14, a specific direction selection unit 15, a gain coefficient calculation unit 16, and a multiplication unit 17. Therefore, the feature of the third embodiment is that the signal amount estimating unit 14, the specific direction selecting unit 15, the gain coefficient calculating unit 16, and the multiplying unit 17 are Ω different same band component collecting units 20-1, 20-... 20-Ω separately, and the output of the multiplier 17 provided in the same band component collection unit 20-1 to 20 -Ω is synthesized by the band synthesizing unit 21.

図１３と図１４はそれぞれ帯域分割部１９−１〜１９−Ｑと帯域合成部２１の処理手順を示している。帯域分割部１９−１〜１９−Ωでは入力された周波数成分Ｙ_１（ω、ｌ）〜Ｙ_Ｑ（ω、ｌ）がΩ_１からΩ_Ωまでの帯域に分割され、それぞれ出力される。一方帯域合成部２１では、入力されたΩ_１からΩ_Ωまでの帯域の周波数成分をまとめてひとつの信号Ｙ（ω、ｌ）として出力する。また本実施例においては、パワー推定部において用いられるゲイン行列Ｔ※の各要素は、例えば式（１９）および式（２０）に示すように与えられた帯域内での平均値を用いる。 FIGS. 13 and 14 show processing procedures of the band dividing units 19-1 to 19-Q and the band synthesizing unit 21, respectively. In the band dividing units 19-1 to 19-Ω, the input frequency components Y ₁ (ω, l) to Y _Q (ω, l) are divided into bands from Ω ₁ to Ω _Ω and output respectively. On the other hand, the band synthesizing unit 21 outputs the frequency components of the input bands from Ω ₁ to Ω _Ω together as one signal Y (ω, l). In this embodiment, each element of the gain matrix T * used in the power estimation unit uses an average value within a given band as shown in, for example, Expression (19) and Expression (20).

以上の部分以外については実施例１と同様であるので説明を省略する。一般にビームフォーマー部の指向特性は周波数に応じて変化するが、実施例１ではすべての周波数においてゲインは均一であると仮定し、ゲイン行列の要素を代入する値を決定していた。本実施例ではゲイン行列を周波数帯域ごとに用意して処理を行うことができるため、信号量推定部１４においてより正しい指向特性に近い処理を行うことができ、推定信号パワーベクトルの推定精度を向上させる効果がある。

Since the other parts are the same as those in the first embodiment, description thereof is omitted. In general, the directivity characteristic of the beamformer unit varies depending on the frequency. In the first embodiment, however, the gain is assumed to be uniform at all frequencies, and the value for substituting the elements of the gain matrix is determined. In this embodiment, the gain matrix can be prepared for each frequency band and processing can be performed, so that the signal amount estimation unit 14 can perform processing closer to the correct directivity and improve the estimation accuracy of the estimated signal power vector. There is an effect to make.

第４の実施例は、実施例１の信号量推定部１４と、利得係数算出部１６における手順に変更を加えたものである。第４の実施例に用いられる信号量推定部１４の構成を図１５に、利得係数算出部１６の構成を図１６に示す。信号量推定部１４に入力される周波数成分Ｙ_１（ω，ｌ）〜Ｙ_Ｑ（ω，ｌ）はそれぞれ絶対値演算部１４Ｄ−１〜１４Ｄ−Ｑに入力され、信号の絶対値｜Ｙ_１（ω，ｌ）｜、｜Ｙ_２（ω，ｌ）｜、｜Ｙ_３（ω，ｌ）｜、…｜Ｙ_Ｑ（ω，ｌ）｜を出力し、ベクトル化部１４Ａに入力される。ベクトル化部１４Ａでは入力された信号を式（２１）に示す絶対値ベクトルＹ※（ω，ｌ）を出力する。 In the fourth example, the procedure in the signal amount estimation unit 14 and the gain coefficient calculation unit 16 in the first example is modified. FIG. 15 shows the configuration of the signal amount estimation unit 14 used in the fourth embodiment, and FIG. 16 shows the configuration of the gain coefficient calculation unit 16. The frequency components Y ₁ (ω, l) to Y _Q (ω, l) input to the signal amount estimation unit 14 are input to the absolute value calculation units 14D-1 to 14D-Q, respectively, and the absolute value | Y _{1 of the} signal (Ω, l) |, | Y ₂ (ω, l) |, | Y ₃ (ω, l) |,... | Y _Q (ω, l) | are output and input to the vectorization unit 14A. The vectorization unit 14A outputs an absolute value vector Y * (ω, l) shown in Expression (21) from the input signal.

絶対値ベクトルは乗算部１４Ｂに入力される。乗算部１４Ｂのもう一方の入力である絶対値推定行列Ｔ^−１※は、逆行列演算部１４Ｃの出力信号である。逆行列演算部１４Ｃは入力されたゲイン行列Ｔ※の逆行列Ｔ^−１※を出力する。
ゲイン行列Ｔ※の各要素は式（８）と同様にビームフォーマー部の各方向領域に対する指向特性が求められるパラメータであり、フィルタ係数から計算される指向特性のゲイン量から式（２２）により定義され、事前に与えられる。

The absolute value vector is input to the multiplication unit 14B. The absolute value estimation matrix T ⁻¹ *, which is the other input of the multiplication unit 14B, is an output signal of the inverse matrix calculation unit 14C. The inverse matrix calculator 14C outputs an inverse matrix T ⁻¹ * of the input gain matrix T *.
Each element of the gain matrix T * is a parameter for which the directivity characteristic with respect to each direction region of the beamformer unit is obtained in the same manner as Expression (8). From the gain amount of the directivity characteristic calculated from the filter coefficient, Expression (22) Defined and given in advance.

乗算部１４Ｂに入力されたビームフォーマー部出力パワーベクトルとパワー推定行列の乗算を周波数成分ごとに行い、推定信号絶対値ベクトルＸ_ｅｓｔ※（ω，ｌ）を出力する。
次に図１６に示す利得係数算出部１６ではベクトル要素抽出部１６Ａは式（２３）に示すように入力された推定信号絶対値ベクトルの第１成分を｜Ｓ_１（ω，ｌ）｜、第２成分を｜Ｓ_２（ω，ｌ）｜、第３成分を｜Ｓ_３（ω，ｌ）｜、…としてそれぞれ出力し、それらはＳＮ比推定部１６Ｂに入力される。

The beamformer output power vector input to the multiplier 14B is multiplied by the power estimation matrix for each frequency component, and an estimated signal absolute value vector X _est * (ω, l) is output.
Next, in the gain coefficient calculation unit 16 shown in FIG. 16, the vector element extraction unit 16A sets the first component of the input estimated signal absolute value vector as | S ₁ (ω, l) | The two components are output as | S ₂ (ω, l) |, the third component is output as | S ₃ (ω, l) |,..., And these are input to the SN ratio estimation unit 16B.

これら以外の部分に関しては第１の実施例と同じであるのでこれ以上の説明を省略する。この第４の実施例によれば実施例１に比べて２乗計算をする必要がないことから演算量を削減することができる。
なおこの実施例４は実施例２の信号量推定部１４と利得係数算出部１６に対しても適用することができる。図１７は実施例２に本実施例４の変更を加えた場合の利得係数算出部１６の構成を示す。

Since other parts are the same as those of the first embodiment, further explanation is omitted. According to the fourth embodiment, it is not necessary to perform the square calculation as compared with the first embodiment, so that the calculation amount can be reduced.
The fourth embodiment can also be applied to the signal amount estimating unit 14 and the gain coefficient calculating unit 16 of the second embodiment. FIG. 17 shows the configuration of the gain coefficient calculation unit 16 when the modification of the fourth embodiment is added to the second embodiment.

以上説明した本発明による収音装置は全てをハードウェアによって構成することも可能であるが、最も簡素に実現するには上述した各手順をコンピュータが解読可能なプログラム言語によって記述した本発明による収音プログラムを作成し、この収音プログラムをコンピュータにインストールし、コンピュータに収音プログラムを実行させ、コンピュータに収音装置として機能させる実施例が最良である。本発明による収音プログラムはコンピュータが読み取り可能な例えば磁気媒体、ＣＤ−ＲＯＭ、半導体メモリ等の記録媒体に記録され、これらの記録媒体から、或いは通信回線を通じてコンピュータにインストールされる。インストールされた収音プログラムはコンピュータに備えられたＣＰＵにより解読され、コンピュータを収音装置として機能させる。 The sound collecting device according to the present invention described above can be entirely configured by hardware. However, in order to achieve the simplest implementation, the above-described procedures are recorded by a program language readable by a computer. An embodiment in which a sound program is created, this sound collection program is installed in a computer, the computer executes the sound collection program, and the computer functions as a sound collection device is the best. The sound collection program according to the present invention is recorded on a computer-readable recording medium such as a magnetic medium, a CD-ROM, or a semiconductor memory, and is installed in the computer from the recording medium or through a communication line. The installed sound collection program is decoded by a CPU provided in the computer, and causes the computer to function as a sound collection device.

本発明による収音装置は例えば電話会議システム等のハンズフリー通話装置の分野で活用される。 The sound pickup device according to the present invention is utilized in the field of hands-free call devices such as a telephone conference system.

本発明の第１の実施例を説明するためのブロック図。The block diagram for demonstrating the 1st Example of this invention. 本発明に用いるビームフォーマー部の指向特性を説明するための図。The figure for demonstrating the directivity of the beam former part used for this invention. 本発明に用いるビームフォーマー部に設定する指向方向領域の分割例を説明するための図。The figure for demonstrating the example of a division | segmentation of the directivity direction area | region set to the beam former part used for this invention. 本発明に用いるビームフォーマー部の構成の一例を説明するためのブロック図。The block diagram for demonstrating an example of a structure of the beam former part used for this invention. 本発明に用いる信号量推定部の構成の一例を説明するためのブロック図。The block diagram for demonstrating an example of a structure of the signal amount estimation part used for this invention. 本発明に用いるビームフォーマー部の指向特性の一例を説明するための図。The figure for demonstrating an example of the directional characteristic of the beam former part used for this invention. 本発明に用いる特定方向選択部の構成の一例を説明するためのブロック図。The block diagram for demonstrating an example of a structure of the specific direction selection part used for this invention. 本発明に用いる利得係数算出部の構成の一例を説明するためのブロック図。The block diagram for demonstrating an example of a structure of the gain coefficient calculation part used for this invention. 本発明に用いる利得係数の特性の一例を説明するためのグラフ。The graph for demonstrating an example of the characteristic of the gain coefficient used for this invention. 本発明の第２の実施例を説明するためのブロック図。The block diagram for demonstrating the 2nd Example of this invention. 図１０に示した利得係数算出部が算出する利得係数の特性を説明するためのグラフ。The graph for demonstrating the characteristic of the gain coefficient which the gain coefficient calculation part shown in FIG. 10 calculates. 本発明の第３の実施例を説明するためのブロック図。The block diagram for demonstrating the 3rd Example of this invention. 本発明の第３の実施例に用いた帯域分割部の構成の一例を説明するためのブロック図。The block diagram for demonstrating an example of a structure of the band division part used for the 3rd Example of this invention. 本発明の第３の実施例に用いた帯域合成部の構成を説明するためのブロック図。The block diagram for demonstrating the structure of the zone | band synthetic | combination part used for the 3rd Example of this invention. 本発明の第４の実施例を説明するためのブロック図。The block diagram for demonstrating the 4th Example of this invention. 本発明の第４の実施例に用いる利得係数算出部の構成を説明するためのブロック図。The block diagram for demonstrating the structure of the gain coefficient calculation part used for the 4th Example of this invention. 本発明の第４の実施例に用いる利得係数算出部の他の構成を説明するためのブロック図。The block diagram for demonstrating the other structure of the gain coefficient calculation part used for the 4th Example of this invention. 従来技術によるマイクロホンアレーの収音方法を説明するための図。The figure for demonstrating the sound collection method of the microphone array by a prior art. 従来の特定方向収音装置を説明するためのブロック図。The block diagram for demonstrating the conventional specific direction sound-collecting apparatus. 従来の特定方向収音装置の収音特性を説明するための図。The figure for demonstrating the sound collection characteristic of the conventional specific direction sound collection apparatus.

Explanation of symbols

１１マイクロホンアレー１６利得係数算出部
１２−１〜１２−Ｑビームフォーマー部１７乗算部
１３−１〜１３−Ｑ周波数領域変換部１８逆周波数領域変換部
１４信号量推定部１９−１〜１９−Ｑ帯域分割部
１５特定方向選択部２０−１〜２０−Ω 同一帯域成分収集部
２１帯域合成部 DESCRIPTION OF SYMBOLS 11 Microphone array 16 Gain coefficient calculation part 12-1 to 12-Q Beam former part 17 Multiplication part 13-1 to 13-Q Frequency domain conversion part 18 Inverse frequency domain conversion part
14 Signal amount estimation unit 19-1 to 19-Q Band division unit
15 Specific direction selector 20-1 to 20-Ω Same band component collector
21 Band synthesis unit

Claims

A plurality of beamformer sections that emphasize and collect sound coming from angular regions in different directions using output signals of a microphone array configured with a plurality of microphones;
A frequency domain conversion unit that converts each of the angle domain signals collected by the plurality of beamformer units into a frequency domain signal divided into a plurality of band components;
A specific direction selection unit that selects a specific direction frequency domain signal belonging to an angular domain signal of an angular domain in a desired direction among the frequency domain signals output by each frequency domain transform unit;
For each inverse of the gain matrix whose element is a parameter obtained from the directivity characteristics of each of the plurality of beamformer units for each angle region, with the sum of the signals included in each angle region as the signal amount of that angle region. A signal amount estimation unit that multiplies the signal amount of the frequency domain signal output by the region conversion unit and estimates the signal amount of each angle region;
A gain coefficient calculation unit that calculates a gain coefficient for each frequency band by a ratio of a signal amount in an angle region of a desired direction and a sum of signal amounts in all angle regions;
A multiplier that multiplies the signal amount in each corresponding frequency band of the specific direction frequency domain signal by the gain coefficient calculated by the gain coefficient calculator;
A specific direction sound pickup device comprising:

A plurality of beamformer units that use the output signals of a microphone array configured with a plurality of microphones to emphasize and collect sound coming from angular regions in different directions; and
A frequency domain conversion unit that converts each of the angle domain signals collected by the plurality of beamformer units into a frequency domain signal divided into a plurality of band components;
A band dividing unit that divides the frequency domain signal output by the frequency domain converting unit into frequency band components;
The same band component collecting unit that collects the band components output by the band dividing unit for each same band component;
A band synthesizing unit that is provided on the output side of the same band component collecting unit and synthesizes band components output from each same band component collecting unit;
With
The same band component collecting unit
A specific direction selection unit that selects a specific direction band component belonging to an angle region signal of a desired direction in which any of the beam former units collects sound;
For the inverse matrix of the gain matrix whose elements are the parameters obtained from the directivity characteristics for each angle region and frequency band of the plurality of beamformer units, with the sum of signals included in each angle region as the signal amount of that angle region. A signal amount estimation unit that multiplies the signal amount of the frequency band component output by each band dividing unit to estimate the signal amount of each angle region;
A gain coefficient calculation unit that calculates a gain coefficient by a ratio of a signal amount in an angle region of a desired direction and a sum of signal amounts in all angle regions;
A multiplier for multiplying the specific direction band component by the gain coefficient calculated by the gain coefficient calculator;
A specific direction sound pickup device comprising:

In certain direction pickup device according to Claim 1, the signal amount of the frequency domain signal, a specific direction sound collection device which is a power value of the signal.

2. The specific direction sound pickup apparatus according to claim 1, wherein a signal amount of the frequency domain signal is an absolute value of the signal.

The specific direction sound pickup apparatus according to claim 2, wherein the signal amount of the frequency band component is a signal power value.

The specific direction sound pickup apparatus according to claim 2, wherein the signal amount of the frequency band component is an absolute value of the signal.

In certain direction sound collection device according to any one of claims 1 to 6, with respect to the signal amount gain factor calculating characteristics of said gain coefficient calculation unit in the desired direction of angular regions, signal in the other angles regions When the amount is negligibly small, the gain coefficient value is given as a predetermined maximum value, so that the signal amount in the angle region of the desired direction is negligible with respect to the signal amount in other angle regions. The specific direction sound collecting device, wherein the gain coefficient value is calculated with a characteristic that is a value close to 0 when the value is very small.

In certain direction sound collection device according to any one of claims 1 to 6, the gain factor calculating characteristics of said gain coefficient calculation unit, signals of the other angular region for the signal amount in the desired direction of angular regions in a small band is maintained at a value close to the maximum value to the maximum value the value of the gain factor, with respect to the signal amount in the desired direction of angular regions, wherein the gain factor in the region signal amount is large and other angular regions The specific direction sound collecting device is characterized in that it is calculated with a characteristic that maintains the value of 0 at a value close to approximately 0 to 0.

A specific direction sound collecting program which is written in a computer readable program language and causes the computer to function as the specific direction sound collecting device according to any one of claims 1 to 8 .

A recording medium comprising a recording medium readable by a computer, wherein the sound recording program for a specific direction according to claim 9 is recorded on the recording medium.