JP2016149612A

JP2016149612A - Microphone interval control device and program

Info

Publication number: JP2016149612A
Application number: JP2015024526A
Authority: JP
Inventors: 克之高橋; Katsuyuki Takahashi
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2015-02-10
Filing date: 2015-02-10
Publication date: 2016-08-18
Anticipated expiration: 2035-02-10
Also published as: JP6481397B2

Abstract

PROBLEM TO BE SOLVED: To provide a microphone interval control device capable of rapidly controlling a microphone interval in response to the frequency characteristics of a noise component.SOLUTION: A microphone interval control device is based on fixedly arranged multiple microphones. Since a microphone interval is changed by the combination of two microphones among the multiple microphones, a microphone interval is selected by selecting two acquisition signals to be output to a next stage device. A feature amount expressing the frequency and largeness of change in the inclination direction of a signal waveform concerning the acquisition signal of the predetermined microphone is calculated in order to perform the selection. The feature amount reflects the level of the inclusion of a high range component. The two acquisition signals are selected in response to the calculated feature amount.SELECTED DRAWING: Figure 1

Description

本発明はマイクロホン間隔制御装置及びプログラムに関し、例えば、複数のマイクロホンが捕捉した入力信号における雑音成分を抑圧する場合に適用し得るものである。 The present invention relates to a microphone interval control device and a program, and can be applied to, for example, a case where noise components in input signals captured by a plurality of microphones are suppressed.

複数のマイクロホンを有するマイクロホンアレイを用いて雑音成分を抑圧する技術として、特許文献１の記載技術を挙げることができる。 As a technique for suppressing a noise component using a microphone array having a plurality of microphones, a technique described in Patent Document 1 can be cited.

特許文献１の記載技術では、位置関係が固定された２個のマイクロホンから構成されるマイクロホンアレイを用いた信号処理を行うことを想定していた。しかし、従来、信号処理に供するマイクロホンの位置関係は必ずしも固定であるとは限らない。例えば、図９のように、３個以上のマイクロホンを有するマイクロホンアレイを適用し、マイクロホンアレイの中から何らかの基準に基づいて２個のマイクロホンを選択して信号処理を行う場合もある。 The technique described in Patent Document 1 assumes that signal processing is performed using a microphone array composed of two microphones whose positional relationship is fixed. However, conventionally, the positional relationship of microphones used for signal processing is not always fixed. For example, as shown in FIG. 9, there may be a case where a microphone array having three or more microphones is applied, and two microphones are selected from the microphone array based on some criteria to perform signal processing.

ところで、２つのマイクロホンの間隔が広いほど、低域成分の指向性が鋭くなる一方で、高域成分には空間サンプリング定理による誤差が生じる。逆に、マイクロホン間隔が狭いほど、低域成分の指向性が広くなるものの、高域成分の指向性が正確になる。以上のように、マイクロホン間隔に応じて、鋭い指向性が形成される帯域が変化する。そのため、マイクロホンアレイを用いて背景雑音の抑圧処理を行う場合、雑音成分の周波数特性に応じてマイクロホン間隔を切り替えることにより、適切な雑音抑圧処理を実現できる。例えば、低域成分にパワーが集中しているような背景雑音が生じている場合には、マイクロホン間隔が広いマイクロホンアレイを適用して低域成分の指向性を鋭くし、捕捉信号に背景雑音の成分を適切に反映させることで、雑音抑圧効果を十分に発揮させる。 By the way, the wider the interval between two microphones, the sharper the directivity of the low frequency component, while the high frequency component has an error due to the spatial sampling theorem. On the contrary, as the microphone interval is narrower, the directivity of the low frequency component becomes wider, but the directivity of the high frequency component becomes more accurate. As described above, the band in which the sharp directivity is formed changes according to the microphone interval. Therefore, when background noise suppression processing is performed using a microphone array, appropriate noise suppression processing can be realized by switching the microphone interval according to the frequency characteristics of the noise component. For example, if background noise is generated in which power is concentrated in the low frequency components, a microphone array with a wide microphone interval is applied to sharpen the directivity of the low frequency components. By appropriately reflecting the components, the noise suppression effect can be sufficiently exerted.

以上のように、様々な雑音下でマイクロホンアレイの効果を高めるため、雑音成分の周波数特性に応じてマイクロホンアレイを構成するマイクロホンの間隔を制御することが行われている。ここで、雑音成分の周波数特性を調べる方法として、従来、ローパスフィルタやハイパスフィルタなどのフィルタを用いて注目する周波数成分を抽出する手法が採られてきた。 As described above, in order to enhance the effect of the microphone array under various noises, the interval between the microphones constituting the microphone array is controlled according to the frequency characteristic of the noise component. Here, as a method for examining the frequency characteristics of the noise component, a method of extracting a frequency component of interest using a filter such as a low-pass filter or a high-pass filter has been conventionally employed.

特開２０１３−１２６０２６号公報JP 2013-1206026 A 特開２０１４−１０６３３７号公報JP 2014-106337 A

ＮａｏｆｕｍｉＡｏｋｉ，”ＡＢａｎｄＥｘｔｅｎｓｉｏｎＴｅｃｈｎｉｑｕｅｆｏｒＮａｒｒｏｗ−ＢａｎｄＴｅｌｅｐｈｏｎｙＳｐｅｅｃｈＢａｓｅｄｏｎＦｕｌｌＷａｖｅＲｅｃｔｉｆｉｃａｔｉｏｎ”，ＩＥＩＣＥＴｒａｎｓ．Ｃｏｍｍｕｎ．，Ｖｏｌ．Ｅ９３−Ｂ（３），ｐｐ．７２９−７３１，２０１０Naofumi Aoki, “A Band Extension Technology for Narrow-Band Telephony Speech Base on Full Wave Rectification”, IEICE Trans. Commun. , Vol. E93-B (3), pp. 729-731, 2010

しかしながら、従来のフィルタを用いた雑音成分の周波数特性の把握方法では、（ｉ）フィルタ処理には処理遅延が伴うために突発的な雑音特性の変動に追従できない、（ｉｉ）フィルタ処理をソフトウェア的に行う場合には、演算量が増すためにリアルタイム性が損なわれ、必要なリソース（例えばメモリ）の確保のためにコストが増す、といったような課題がある。 However, in the conventional method of grasping the frequency characteristic of the noise component using the filter, (i) the filter process involves a processing delay, so that it cannot follow a sudden change in the noise characteristic. (Ii) The filter process is software-like. However, there is a problem that real-time performance is impaired due to an increase in the amount of computation, and costs are increased in order to secure necessary resources (for example, memory).

そのため、雑音成分の周波数特性に応じてマイクロホン間隔を速やかに制御できるマイクロホン間隔制御装置及びプログラムが望まれている。 Therefore, a microphone interval control device and a program that can quickly control the microphone interval according to the frequency characteristics of the noise component are desired.

第１の本発明のマイクロホン間隔制御装置は、（１）複数のマイクロホンによる捕捉信号のうちの少なくとも１つの捕捉信号に基づき、マイクロホンの捕捉信号における、信号波形の傾き方向が変化する回数とその大きさを表し、かつ高域成分の含まれ具合を反映した特徴量を算出する特徴量算出手段と、（２）算出された上記特徴量に応じて、次段装置に出力する２つの捕捉信号を出力したマイクロホン間のマイクロホン間隔を調整するマイクロホン間隔最適化手段とを有することを特徴とする。 The microphone interval control apparatus according to the first aspect of the present invention is (1) the number of times and the magnitude of the change in the inclination direction of the signal waveform in the captured signal of the microphone based on at least one captured signal of the captured signals from the plurality of microphones And a feature amount calculating means for calculating a feature amount that reflects the degree of inclusion of the high frequency component, and (2) two captured signals to be output to the next-stage device according to the calculated feature amount. And a microphone interval optimizing means for adjusting a microphone interval between the output microphones.

第２の本発明のマイクロホン間隔制御プログラムは、コンピュータを、（１）複数のマイクロホンによる捕捉信号のうちの少なくとも１つの捕捉信号に基づき、マイクロホンの捕捉信号における、信号波形の傾き方向が変化する回数とその大きさを表し、かつ高域成分の含まれ具合を反映した特徴量を算出する特徴量算出手段と、（２）算出された上記特徴量に応じて、次段装置に出力する２つの捕捉信号を出力したマイクロホン間のマイクロホン間隔を調整するマイクロホン間隔最適化手段として機能させることを特徴とする。 A microphone interval control program according to a second aspect of the present invention is a computer that controls (1) the number of times that the inclination direction of a signal waveform in a captured signal of a microphone changes based on at least one captured signal among captured signals from a plurality of microphones. And a feature amount calculation means for calculating a feature amount that reflects the degree of inclusion of the high-frequency component, and (2) two output to the next-stage device according to the calculated feature amount It is made to function as a microphone space | interval optimization means which adjusts the microphone space | interval between the microphones which output the capture signal.

本発明によれば、雑音成分の周波数特性に応じてマイクロホン間隔を速やかに制御できるマイクロホン間隔制御装置及びプログラムを提供できる。 ADVANTAGE OF THE INVENTION According to this invention, the microphone space | interval control apparatus and program which can control a microphone space | interval rapidly according to the frequency characteristic of a noise component can be provided.

第１の実施形態のマイクロホン間隔制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the microphone space | interval control apparatus of 1st Embodiment. 第１の実施形態のマイクロホン間隔制御装置におけるマイクロホン選択制御部の詳細構成を示すブロック図である。It is a block diagram which shows the detailed structure of the microphone selection control part in the microphone space | interval control apparatus of 1st Embodiment. 第１の実施形態のマイクロホン間隔制御装置におけるマイクロホン選択制御部内のｍｏｄＧＩ／選択制御信号対応記憶部（テーブル）の構成を示す説明図である。It is explanatory drawing which shows the structure of the modGI / selection control signal corresponding | compatible memory | storage part (table) in the microphone selection control part in the microphone space | interval control apparatus of 1st Embodiment. 第２の実施形態のマイクロホン間隔制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the microphone space | interval control apparatus of 2nd Embodiment. 第２の実施形態のマイクロホン間隔制御装置におけるマイクロホン選択・遅延制御部内のｍｏｄＧＩ／選択・遅延制御信号対応記憶部（テーブル）の構成を示す説明図である。It is explanatory drawing which shows the structure of the modGI / selection / delay control signal corresponding | compatible memory | storage part (table) in the microphone selection / delay control part in the microphone space | interval control apparatus of 2nd Embodiment. 第３の実施形態のマイクロホン間隔制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the microphone space | interval control apparatus of 3rd Embodiment. 第３の実施形態のマイクロホン間隔制御装置におけるマイクロホン移動制御部内のｍｏｄＧＩ／マイクロホン位置対応記憶部（テーブル）の構成を示す説明図である。It is explanatory drawing which shows the structure of the modGI / microphone position corresponding | compatible memory | storage part (table) in the microphone movement control part in the microphone space | interval control apparatus of 3rd Embodiment. 第４の実施形態のマイクロホン間隔制御装置の構成を示すブロック図である。It is a block diagram which shows the structure of the microphone space | interval control apparatus of 4th Embodiment. マイクロホンアレイの説明図である。It is explanatory drawing of a microphone array.

（Ａ）第１の実施形態
以下、本発明によるマイクロホン間隔制御装置及びプログラムの第１の実施形態を、図面を参照しながら説明する。 (A) First Embodiment Hereinafter, a first embodiment of a microphone interval control device and a program according to the present invention will be described with reference to the drawings.

（Ａ−１）第１の実施形態の構成
図１は、第１の実施形態のマイクロホン間隔制御装置１０の構成を示すブロック図であり、当該マイクロホン間隔制御装置１０から出力されたマイクロホン捕捉信号が入力されるマイクロホンアレイ処理部２０も記載している。 (A-1) Configuration of the First Embodiment FIG. 1 is a block diagram showing the configuration of the microphone interval control device 10 of the first embodiment, and the microphone capture signal output from the microphone interval control device 10 is The input microphone array processing unit 20 is also described.

図１に示すマイクロホン間隔制御装置１０の構成中、３個のマイクロホンｍ１〜ｍ２を除く構成部分は、ハードウェア的な各種構成要素を接続して構築されたものであっても良く、また、ＣＰＵ、ＲＯＭ、ＲＡＭなどのプログラムの実行構成を適用してその機能を実現するように構築されたものであっても良い。いずれの構築方法を適用した場合であっても、マイクロホン間隔制御装置１０の機能的な詳細構成は、図１で表す構成となっている。 In the configuration of the microphone interval control device 10 shown in FIG. 1, the components other than the three microphones m1 and m2 may be constructed by connecting various hardware components, and the CPU Further, it may be constructed so as to realize the function by applying an execution configuration of a program such as a ROM or a RAM. Regardless of which construction method is applied, the functional detailed configuration of the microphone interval control device 10 is the configuration shown in FIG.

図１において、第１の実施形態のマイクロホン間隔制御装置１０は、第１〜第３のマイクロホンｍ１〜ｍ３、ｍｏｄＧＩ計算部１１、マイクロホン選択制御部１２及びマイクロホン選択部１３を有する。 In FIG. 1, the microphone interval control device 10 according to the first embodiment includes first to third microphones m1 to m3, a modGI calculation unit 11, a microphone selection control unit 12, and a microphone selection unit 13.

第１〜第３のマイクロホンｍ１〜ｍ３はそれぞれ、周囲の音声、音響などの音波を捕捉して電気信号（アナログ信号）に変換するものである。各マイクロホンｍ１〜ｍ３は無指向性若しくは後述する正面方向に緩い指向性を有するものである。図１では省略しているが、各マイクロホンｍ１〜ｍ３による捕捉信号（アナログ信号）をデジタル信号に変換するアナログ／デジタル変換器が設けられており、デジタル信号に変換された捕捉信号ｓ１（ｎ）〜ｓ３（ｎ）に与えられるようになされている。ここで、「ｎ」は時刻を表すパラメータであるが、以降の説明においては記述を省略する。 Each of the first to third microphones m1 to m3 captures sound waves such as ambient sounds and sounds and converts them into electrical signals (analog signals). Each of the microphones m1 to m3 has non-directionality or directivity that is loose in the front direction described later. Although not shown in FIG. 1, an analog / digital converter is provided for converting captured signals (analog signals) from the microphones m1 to m3 into digital signals, and the captured signals s1 (n) converted into digital signals. To s3 (n). Here, “n” is a parameter representing time, but the description is omitted in the following description.

第１の実施形態の場合、第１〜第３のマイクロホンｍ１〜ｍ３はこの順序で１直線上に配置されており、第１及び第２のマイクロホンｍ１及びｍ２のマイクロホン間隔Ｄ_１２は、第２及び第３のマイクロホンｍ２及びｍ３のマイクロホン間隔Ｄ_２３より狭くなっている。当然に、第１及び第３のマイクロホンｍ１及びｍ３のマイクロホン間隔Ｄ_１３は、第１及び第２のマイクロホンｍ１及びｍ２のマイクロホン間隔Ｄ_１２や、第２及び第３のマイクロホンｍ２及びｍ３のマイクロホン間隔Ｄ_２３より広い。 For the first embodiment, the first to third microphone m1~m3 are arranged in one straight line in this order, the microphone spacing _{D 12} between the first and second microphones m1 and m2, second and it is smaller than a microphone interval _{D 23} of the third microphone m2 and m3. Naturally, the microphone spacing _{D 13} between the first and third microphones m1 and m3 are and microphone distance _{D 12} of the first and second microphones m1 and m2, a microphone interval between the second and third microphones m2 and m3 wider than the D _23.

第１〜第３のマイクロホンｍ１〜ｍ３からの捕捉信号ｓ１〜ｓ３はマイクロホン選択部１３に与えられる。マイクロホン選択部１３は、３入力２出力構成のものであり、マイクロホン選択制御部１２から出力された選択制御信号ｃｎｔｌｍｉｃに応じて、自己へ入力された３つの捕捉信号ｓ１〜ｓ３から、２つの捕捉信号ＳＡ及びＳＢを選択してマイクロホンアレイ処理部２０に与えるものである。マイクロホン選択部１３は、例えば、２個の２入力１出力スイッチｓｗ１及びｓｗ２を備え、第１のスイッチｓｗ１は、捕捉信号ｓ１及びｓ２から一方を選択し、選択後の一方の捕捉信号ＳＡとしてマイクロホンアレイ処理部２０に与え、第２のスイッチｓｗ２は、捕捉信号ｓ２及びｓ３から一方を選択し、選択後の他方の捕捉信号ＳＢとしてマイクロホンアレイ処理部２０に与える。 Capture signals s1 to s3 from the first to third microphones m1 to m3 are given to the microphone selection unit 13. The microphone selection unit 13 has a configuration of three inputs and two outputs. In accordance with the selection control signal cntlmic output from the microphone selection control unit 12, two capture signals s1 to s3 input to the microphone selection unit 13 are captured. The signals SA and SB are selected and given to the microphone array processing unit 20. The microphone selection unit 13 includes, for example, two two-input one-output switches sw1 and sw2, and the first switch sw1 selects one of the captured signals s1 and s2, and the microphone as one captured signal SA after the selection. The second switch sw2 selects one of the captured signals s2 and s3, and supplies the selected signal to the microphone array processing unit 20 as the other captured signal SB after selection.

マイクロホンアレイ処理部２０に与えられる捕捉信号ＳＡ及びＳＢを、選択前の捕捉信号ｓ１〜ｓ３の組み合わせで表現すると、（ｉ）ｓ１及びｓ２、（ii）ｓ２及びｓ３、（iii）ｓ１及びｓ３である。この３種類の組み合わせに対応する２つのマイクロホンの間隔はそれぞれＤ_１２、Ｄ_２３、Ｄ_１３となっており、従って、マイクロホン選択部１３は、マイクロホン間隔を選択しているということができる。 When the captured signals SA and SB given to the microphone array processing unit 20 are expressed by a combination of captured signals s1 to s3 before selection, (i) s1 and s2, (ii) s2 and s3, (iii) s1 and s3. is there. The intervals between the two microphones corresponding to the three types of combinations are D ₁₂ , D ₂₃ , and D ₁₃ , respectively. Therefore, it can be said that the microphone selection unit 13 selects the microphone interval.

第１の実施形態の場合、第１のマイクロホンｍ１から出力された捕捉信号ｓ１がｍｏｄＧＩ計算部１１に与えられる。ｍｏｄＧＩ計算部１１は、捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩを計算し、得られたｍｏｄＧＩ値ｍｏｄＧＩをマイクロホン選択制御部１２に与える。ｍｏｄＧＩ計算部１１は、例えば、捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩを（１）式に従って算出する。

In the case of the first embodiment, the captured signal s1 output from the first microphone m1 is given to the modGI calculation unit 11. The modGI calculation unit 11 calculates a modGI value modGI for the captured signal s1 and gives the obtained modGI value modGI to the microphone selection control unit 12. The mod GI calculation unit 11 calculates, for example, a mod GI value mod GI for the captured signal s 1 according to the equation (1).

ｍｏｄＧＩ値について簡単に説明する（詳細については、特許文献２参照）。ｍｏｄＧＩは、修正されたグラディエント・インデックス（ＧｒａｄｉｅｎｔＩｎｄｅｘ：以下、ＧＩと呼ぶ）を意味している。修正される前のＧＩについては、非特許文献１に記載されている。 The modGI value will be briefly described (refer to Patent Document 2 for details). modGI means a modified gradient index (hereinafter referred to as GI). The GI before being corrected is described in Non-Patent Document 1.

ＧＩは、信号波形の傾き方向が変化する回数とその大きさを測る指標である。ＧＩは、傾き方向が変化したときの、相前後するサンプルの差分絶対値の総和を、そのフレームのパワーの平方根で除算したものとして求められる。従って、ＧＩは、１フレーム内の傾きの変化回数が多いほど大きくなり易く、また、傾きが変化したときの変化量が大きいほど大きくなり易いものである。このような性質から、ＧＩは、入力波形に含まれる高域成分の量と直結しているということもできる。 GI is an index for measuring the number of times and the magnitude of the change in the inclination direction of the signal waveform. The GI is obtained by dividing the sum of absolute difference values of successive samples when the tilt direction is changed by the square root of the power of the frame. Therefore, the GI is likely to increase as the number of changes in inclination within one frame increases, and also increases as the amount of change when the inclination changes increases. From such a property, it can be said that GI is directly connected to the amount of high frequency components included in the input waveform.

しかしながら、ＧＩは、変数ΔΨ（ｎ）という０又は２の２値しかとらない、時系列的に値の大きな飛び跳ねが多発するパラメータを算出要素としているため、値が不規則に大きくなったり小さくなったりするという特徴（「値が暴れる」）がある。 However, since GI uses as a calculation element a variable ΔΨ (n) that takes only two values of 0 or 2 and has many jumps with large values in time series, the value increases or decreases irregularly. There is a characteristic ("value goes wild").

ｍｏｄＧＩは、ＧＩの値が暴れる（値の大きな飛び跳ねを有する）という性質を有することに鑑み、ＧＩに代えて、ＧＩと高い相関を持ちながら、値の大きな飛び跳ねを抑制した変化が安定した新しい特徴量として提案されたものである。ｍｏｄＧＩは、特徴量算出対象の任意の信号（本願では捕捉信号ｓ１）に関し、その「算出対象信号のパワー」で正規化された、その「算出対象信号の２階差分のパワー」（これを定数倍したものも含まれる）として定義される。 In view of the fact that modGI has the property that the value of GI is rampant (has a jump with a large value), instead of GI, it has a high correlation with GI, and a new feature with stable changes that suppresses a large jump in value It is proposed as a quantity. modGI is the “power of the second-order difference of the calculation target signal” normalized by the “power of the calculation target signal” for an arbitrary signal (captured signal s1 in the present application) of the feature quantity calculation target (this is a constant). Doubled is also included).

ｍｏｄＧＩは、ＧＩと高い相関を持つので、信号波形の傾き方向が変化する回数とその大きさを測る安定した指標として機能し、また、入力波形に含まれる高域成分の量を反映したものとして機能する。 Since modGI has a high correlation with GI, it functions as a stable index that measures the number and magnitude of changes in the slope direction of the signal waveform, and reflects the amount of high frequency components included in the input waveform. Function.

以上のように、ｍｏｄＧＩは、信号波形（捕捉信号ｓ１）に含まれる高域成分の量と直結しているため、ｍｏｄＧＩの観察によって雑音成分の周波数特性を推定することができ、しかも、ｍｏｄＧＩの演算が単純なのでコストを増すこともない。そのため、雑音成分の周波数特性を推定する指標（パラメータ）としてｍｏｄＧＩを利用することとした。 As described above, modGI is directly connected to the amount of high-frequency components included in the signal waveform (capture signal s1), so that the frequency characteristics of noise components can be estimated by observing modGI, and modGI Since the calculation is simple, it does not increase the cost. Therefore, modGI is used as an index (parameter) for estimating the frequency characteristic of the noise component.

（１）式は、特許文献２の（１３）式と同じ算出式であるが、特許文献２に記載されている（５）式や（１０）式〜（１２）式を適用してｍｏｄＧＩ値ｍｏｄＧＩを計算するようにしても良い。 The expression (1) is the same calculation expression as the expression (13) in Patent Document 2, but the modGI value obtained by applying the expressions (5) and (10) to (12) described in Patent Document 2. You may make it calculate modGI.

マイクロホン選択制御部１２は、ｍｏｄＧＩ計算部１１が得た捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩに基づき、言い換えると、捕捉信号ｓ１における雑音成分が有する周波数特性に基づき、３つの捕捉信号ｓ１〜ｓ３から２つの捕捉信号を選択させる選択制御信号ｃｎｔｌｍｉｃを生成してマイクロホン選択部１３に与える。 The microphone selection control unit 12 is based on the modGI value modGI of the acquisition signal s1 obtained by the modGI calculation unit 11, in other words, based on the frequency characteristics of the noise component in the acquisition signal s1, and from the two acquisition signals s1 to s3 to 2. A selection control signal cntlmic for selecting two captured signals is generated and provided to the microphone selection unit 13.

図２は、マイクロホン選択制御部１２の詳細構成を示すブロック図である。図２において、マイクロホン選択制御部１２は、ｍｏｄＧＩ取得部２１、ｍｏｄＧＩ照合・選択制御信号生成部２２、ｍｏｄＧＩ／選択制御信号対応記憶部２３及び選択制御信号送信部２４を有する。 FIG. 2 is a block diagram showing a detailed configuration of the microphone selection control unit 12. In FIG. 2, the microphone selection control unit 12 includes a modGI acquisition unit 21, a modGI collation / selection control signal generation unit 22, a modGI / selection control signal correspondence storage unit 23, and a selection control signal transmission unit 24.

ｍｏｄＧＩ取得部２１は、ｍｏｄＧＩ計算部１１が得たｍｏｄＧＩ値ｍｏｄＧＩを取込むものである。ｍｏｄＧＩ照合・選択制御信号生成部２２は、取込んだｍｏｄＧＩ値ｍｏｄＧＩをキーとして、ｍｏｄＧＩ／選択制御信号対応記憶部２３における該当する情報を取り出して選択制御信号ｃｎｔｌｍｉｃを得る（生成する）ものである。ｍｏｄＧＩ／選択制御信号対応記憶部２３は、ｍｏｄＧＩ値ｍｏｄＧＩの複数の範囲と、その範囲に対応付けられた選択制御信号とを記憶しているものである。選択制御信号送信部２４は、ｍｏｄＧＩ照合・選択制御信号生成部２２が得た選択制御信号ｃｎｔｌｍｉｃをマイクロホン選択部１３に送信するものである。 The modGI acquisition unit 21 takes in the modGI value modGI obtained by the modGI calculation unit 11. The modGI collation / selection control signal generation unit 22 obtains (generates) a selection control signal cntlmic by extracting the corresponding information in the modGI / selection control signal correspondence storage unit 23 using the acquired modGI value modGI as a key. . The modGI / selection control signal correspondence storage unit 23 stores a plurality of ranges of the modGI value modGI and selection control signals associated with the ranges. The selection control signal transmission unit 24 transmits the selection control signal cntlmic obtained by the modGI matching / selection control signal generation unit 22 to the microphone selection unit 13.

図３は、ｍｏｄＧＩ／選択制御信号対応記憶部２３の構成例を示す説明図であり、図３は、テーブル構成の場合を示している。図３の例では、２つの境界値α及びβ（β＞α）を用いてｍｏｄＧＩ値ｍｏｄＧＩが取り得る値の範囲を３つに分けている。 FIG. 3 is an explanatory diagram illustrating a configuration example of the modGI / selection control signal correspondence storage unit 23, and FIG. 3 illustrates a table configuration. In the example of FIG. 3, the range of values that the modGI value modGI can take is divided into three using two boundary values α and β (β> α).

ｍｏｄＧＩ取得部２１が取得したｍｏｄＧＩ値ｍｏｄＧＩが最も小さな値の範囲に属する場合、すなわち、ｍｏｄＧＩ≦αの場合には、第１及び第３のマイクロホンｍ１及びｍ３からの捕捉信号ｓ１、ｓ３を選択させる選択制御信号ｃｎｔｌｍｉｃの内容「１０」になっている。第１及び第３のマイクロホンｍ１及びｍ３の間隔Ｄ_１３は３つの間隔の最大であるので、捕捉信号ｓ１における雑音成分の周波数特性が低域寄りの場合に好適な間隔である。 When the modGI value modGI acquired by the modGI acquisition unit 21 belongs to the smallest value range, that is, when modGI ≦ α, the captured signals s1 and s3 from the first and third microphones m1 and m3 are selected. The content of the selection control signal cntlmic is “10”. The spacing D ₁₃ between the first and third microphones m1 and m3 is the largest of the three intervals, the frequency characteristic of the noise component in the acquisition signal s1 is the preferred spacing when the low-frequency closer.

ｍｏｄＧＩ取得部２１が取得したｍｏｄＧＩ値ｍｏｄＧＩが中間の範囲に属する場合、すなわち、α＜ｍｏｄＧＩ≦βの場合には、第２及び第３のマイクロホンｍ２及びｍ３からの捕捉信号ｓ２、ｓ３を選択させる選択制御信号ｃｎｔｌｍｉｃの内容「０１」になっている。第２及び第３のマイクロホンｍ２及びｍ３の間隔Ｄ_２３は３つの間隔の中の中間であるので、捕捉信号ｓ１における雑音成分の周波数特性が低域寄りでも高域寄りでもない場合に好適な間隔である。 When the modGI value modGI acquired by the modGI acquisition unit 21 belongs to an intermediate range, that is, when α <modGI ≦ β, the captured signals s2 and s3 from the second and third microphones m2 and m3 are selected. The content of the selection control signal cntlmic is “01”. The spacing D ₂₃ of the second and third microphones m2 and m3 is in the middle of the three intervals, suitable intervals when the frequency characteristic of the noise component in the acquisition signal s1 is not a high-pass nearer the low range nearer It is.

ｍｏｄＧＩ取得部２１が取得したｍｏｄＧＩ値ｍｏｄＧＩが最も大きい範囲に属する場合、すなわち、β＜ｍｏｄＧＩの場合には、第１及び第２のマイクロホンｍ１及びｍ２からの捕捉信号ｓ１、ｓ２を選択させる選択制御信号ｃｎｔｌｍｉｃの内容「１１」になっている。第１及び第２のマイクロホンｍ１及びｍ２の間隔Ｄ_１２は３つの間隔の中の最小であるので、捕捉信号ｓ１における雑音成分の周波数特性が高域寄りの場合に好適な間隔である。 Selection control for selecting captured signals s1 and s2 from the first and second microphones m1 and m2 when the modGI value modGI acquired by the modGI acquisition unit 21 belongs to the largest range, that is, when β <modGI. The content of the signal cntlmic is “11”. The spacing D ₁₂ between the first and second microphones m1 and m2 are the smallest of the three intervals are suitable distance when the frequency characteristic of the noise component of the high-frequency closer in the acquisition signal s1.

２ビットの選択制御信号ｃｎｔｌｍｉｃの上位ビットは、マイクロホン選択部１３の第１のスイッチｓｗ１の有効となる入力端子を規定しており、上位ビットの「１」は第１のスイッチｓｗ１のａ入力端子を有効とし、上位ビットの「０」は第１のスイッチｓｗ１のｂ入力端子を有効とすることを表している。２ビットの選択制御信号ｃｎｔｌｍｉｃの下位ビットは、マイクロホン選択部１３の第２のスイッチｓｗ２の有効となる入力端子を規定しており、下位ビットの「１」は第２のスイッチｓｗ２のａ入力端子を有効とし、下位ビットの「０」は第２のスイッチｓｗ２のｂ入力端子を有効とすることを表している。 The upper bit of the 2-bit selection control signal cntlmic defines an input terminal that is valid for the first switch sw1 of the microphone selection unit 13, and the upper bit “1” is the a input terminal of the first switch sw1. The upper bit “0” indicates that the b input terminal of the first switch sw1 is enabled. The lower bit of the 2-bit selection control signal cntlmic defines the input terminal that is effective for the second switch sw2 of the microphone selection unit 13, and the lower bit “1” is the a input terminal of the second switch sw2. And “0” in the lower bit indicates that the b input terminal of the second switch sw2 is enabled.

マイクロホンアレイ処理部２０は、入力された２つの捕捉信号ＳＡ及びＳＢを、２つのマイクロホンでなるマイクロホンアレイからの入力信号として公知の手法に基づいた処理を施すものである。例えば、マイクロホンアレイ処理部２０が、出力信号ｏｕｔとして指向性信号を形成するものである場合には、特許文献１の図４（Ａ）に示すような内部構成を有し、遅延減算処理を施して出力信号ｏｕｔ（ｎ）を得るものとなる。 The microphone array processing unit 20 performs processing based on a known technique on the two captured signals SA and SB that are input as input signals from a microphone array composed of two microphones. For example, when the microphone array processing unit 20 forms a directional signal as the output signal out, the microphone array processing unit 20 has an internal configuration as shown in FIG. As a result, an output signal out (n) is obtained.

（Ａ−２）第１の実施形態の動作
次に、第１の実施形態に係るマイクロホン間隔制御装置１０の動作を説明する。以下では、全体動作、マイクロホン選択制御部１２における動作の順に説明する。 (A-2) Operation of the First Embodiment Next, the operation of the microphone interval control apparatus 10 according to the first embodiment will be described. Below, it demonstrates in order of the whole operation | movement and the operation | movement in the microphone selection control part 12. FIG.

周囲の音声、音響などの音波を、第１〜第３のマイクロホンｍ１〜ｍ３が捕捉して得た第１〜第３の捕捉信号ｓ１〜ｓ３はマイクロホン選択部１３に与えられる。また、第１の捕捉信号Ｓ１は、ｍｏｄＧＩ計算部１１に与えられる。 The first to third captured signals s1 to s3 obtained by capturing the surrounding sound waves such as sound and sound by the first to third microphones m1 to m3 are given to the microphone selecting unit 13. The first acquisition signal S1 is given to the modGI calculation unit 11.

第１の捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩが、ｍｏｄＧＩ計算部１１において（１）式に基づいて計算され、得られたｍｏｄＧＩ値ｍｏｄＧＩがマイクロホン選択制御部１２に与えられる。そして、マイクロホン選択制御部１２において、計算されたｍｏｄＧＩ値ｍｏｄＧＩに基づいて、言い換えると、捕捉信号ｓ１における雑音成分が有する周波数特性に基づいて、３つの捕捉信号ｓ１〜ｓ３から２つの捕捉信号を選択させる選択制御信号ｃｎｔｌｍｉｃが生成されてマイクロホン選択部１３に与えられる。 The modGI value modGI for the first acquisition signal s 1 is calculated based on the equation (1) in the modGI calculation unit 11, and the obtained modGI value modGI is given to the microphone selection control unit 12. Then, the microphone selection control unit 12 selects two captured signals from the three captured signals s1 to s3 based on the calculated modGI value modGI, in other words, based on the frequency characteristics of the noise component in the captured signal s1. A selection control signal cntlmic is generated and supplied to the microphone selection unit 13.

第１〜第３のマイクロホンｍ１〜ｍ３からの３つの捕捉信号ｓ１〜ｓ３のうち、選択制御信号ｃｎｔｌｍｉｃが指示する２つの捕捉信号ＳＡ及びＳＢが、マイクロホン選択部１３において選択されてマイクロホンアレイ処理部２０に与えられる。 Of the three captured signals s1 to s3 from the first to third microphones m1 to m3, two captured signals SA and SB indicated by the selection control signal cntlmic are selected by the microphone selecting unit 13 and the microphone array processing unit. 20 is given.

そして、マイクロホンアレイ処理部２０において、２つの捕捉信号ＳＡ及びＳＢは、２つのマイクロホンでなるマイクロホンアレイからの入力信号として公知の手法に基づいて処理される。 Then, in the microphone array processing unit 20, the two captured signals SA and SB are processed based on a known method as input signals from a microphone array composed of two microphones.

次に、マイクロホン選択制御部１２における詳細動作を説明する。 Next, a detailed operation in the microphone selection control unit 12 will be described.

ｍｏｄＧＩ計算部１１が得たｍｏｄＧＩ値ｍｏｄＧＩが、ｍｏｄＧＩ取得部２１によって取込まれてｍｏｄＧＩ照合・選択制御信号生成部２２に与えられる。ｍｏｄＧＩ照合・選択制御信号生成部２２によって、取込まれたｍｏｄＧＩ値ｍｏｄＧＩがキーとなってｍｏｄＧＩ／選択制御信号対応記憶部２３に対する照合が実行され、取込まれたｍｏｄＧＩ値ｍｏｄＧＩが属する範囲に応じた選択制御信号ｃｎｔｌｍｉｃ（図３参照）がｍｏｄＧＩ／選択制御信号対応記憶部２３から取り出される。そして、その選択制御信号ｃｎｔｌｍｉｃが、選択制御信号送信部２４によってマイクロホン選択部１３に送信される。 The modGI value modGI obtained by the modGI calculation unit 11 is taken in by the modGI acquisition unit 21 and given to the modGI collation / selection control signal generation unit 22. The modGI collation / selection control signal generation unit 22 performs collation with the modGI / selection control signal correspondence storage unit 23 by using the captured modGI value modGI as a key, and according to the range to which the acquired modGI value modGI belongs. The selection control signal cntlmic (see FIG. 3) is extracted from the modGI / selection control signal correspondence storage unit 23. Then, the selection control signal cntlmic is transmitted to the microphone selection unit 13 by the selection control signal transmission unit 24.

（Ａ−３）第１の実施形態の効果
以上のように、第１の実施形態によれば、ｍｏｄＧＩ値ｍｏｄＧＩによって、捕捉信号ｓ１（言い換えると、捕捉信号ｓ１〜ｓ３）における雑音成分の周波数特性を推定し、その結果に基づいて、雑音成分の処理（例えば雑音抑圧）に適した２つのマイクロホンを選択してマイクロホンアレイを構成することができ、マイクロホンアレイからの捕捉信号を処理する処理部の処理精度を向上させることができる。ここで、ｍｏｄＧＩ値ｍｏｄＧＩを雑音成分の周波数特性を反映したパラメータに用いているため、処理遅延が少なく、演算も容易である。 (A-3) Effects of the First Embodiment As described above, according to the first embodiment, the frequency characteristics of the noise components in the captured signal s1 (in other words, the captured signals s1 to s3) by the modGI value modGI. Based on the result, two microphones suitable for noise component processing (for example, noise suppression) can be selected to form a microphone array, and a processing unit for processing a captured signal from the microphone array can be configured. Processing accuracy can be improved. Here, since the modGI value modGI is used as a parameter reflecting the frequency characteristic of the noise component, the processing delay is small and the calculation is easy.

第１の実施形態のマイクロホン間隔制御装置１０を、例えば、通信端末（スマートフォンや携帯電話端末など）やテレビ会議システムなどに適用することにより通話音質の改善などの効果を期待でき、また、第１の実施形態のマイクロホン間隔制御装置１０を、例えば、カーナビゲーションシステムや家電装置のボイスコマンド入力装置などに適用することにより音声認識精度の改善などの効果を期待できる。例えば、スマートフォンは使用者が種々の環境下に携帯するものであり、携帯者が乗り込んだ自動車の車内の背景雑音（窓を閉めている場合の背景雑音）は車外に比べて低域成分が中心的である。また、カーナビゲーションシステム用のボイスコマンド入力装置近傍の背景雑音の特性は、例えば、走行速度が速いか遅いか、エアコンが使用されているか否か、さらには、自動車の窓（特に前列の窓）が開いているか閉まっているかなどの様々な条件で大きく変わる。このような背景雑音の特性が種々変化する装置へ、第１の実施形態のマイクロホン間隔制御装置１０を適用することは非常に有効である。 By applying the microphone interval control device 10 of the first embodiment to, for example, a communication terminal (such as a smartphone or a mobile phone terminal) or a video conference system, an effect such as improvement in call sound quality can be expected. By applying the microphone interval control device 10 of the embodiment to, for example, a car navigation system or a voice command input device of a home appliance, an effect such as improvement of voice recognition accuracy can be expected. For example, a smartphone is carried by a user in various environments, and background noise inside the car (background noise when the window is closed) of a car in which the user gets in is mainly low-frequency components compared to outside the vehicle. Is. In addition, the background noise characteristics in the vicinity of the voice command input device for the car navigation system include, for example, whether the traveling speed is fast or slow, whether an air conditioner is used, and further, an automobile window (especially a front row window). It varies greatly depending on various conditions such as whether the is open or closed. It is very effective to apply the microphone interval control device 10 of the first embodiment to such a device in which the characteristics of the background noise change variously.

（Ｂ）第２の実施形態
次に、本発明によるマイクロホン間隔制御装置及びプログラムの第２の実施形態を、図面を参照しながら説明する。 (B) Second Embodiment Next, a second embodiment of the microphone interval control device and program according to the present invention will be described with reference to the drawings.

図４は、第２の実施形態のマイクロホン間隔制御装置１０Ａの構成を示すブロック図であり、上述した第１の実施形態に係る図１との同一、対応部分には同一、対応符号を付して示している。 FIG. 4 is a block diagram showing the configuration of the microphone interval control apparatus 10A of the second embodiment. The same and corresponding parts as those in FIG. 1 according to the first embodiment described above are assigned the same and corresponding reference numerals. It shows.

図４において、第２の実施形態のマイクロホン間隔制御装置１０Ａは、第１〜第３のマイクロホンｍ１〜ｍ３、ｍｏｄＧＩ計算部１１、マイクロホン選択・遅延制御部１２Ａ、マイクロホン選択部１３及び付与遅延選択部１４を有する。 In FIG. 4, the microphone interval control apparatus 10A of the second embodiment includes first to third microphones m1 to m3, a modGI calculation unit 11, a microphone selection / delay control unit 12A, a microphone selection unit 13, and an added delay selection unit. 14

上述した第１の実施形態の場合、第１〜第３のマイクロホンｍ１〜ｍ３はこの順序で１直線上に配置されていた。第２の実施形態の場合、３つのマイクロホンｍ１〜ｍ３は直線状ではなく、図４に示すように、三角形（例えば、直角三角形若しくは鈍角三角形）の３つの頂点の位置に固定的に配置されている。但し、マイクロホン間隔の大小関係は、第１の実施形態と同様になっている。すなわち、第１及び第２のマイクロホンｍ１及びｍ２のマイクロホン間隔Ｄ_１２は、第２及び第３のマイクロホンｍ２及びｍ３のマイクロホン間隔Ｄ_２３より狭く、第１及び第３のマイクロホンｍ１及びｍ３のマイクロホン間隔Ｄ_１３は、第１及び第２のマイクロホンｍ１及びｍ２のマイクロホン間隔Ｄ_１２や、第２及び第３のマイクロホンｍ２及びｍ３のマイクロホン間隔Ｄ_２３より広くなっている。 In the case of the first embodiment described above, the first to third microphones m1 to m3 are arranged on one straight line in this order. In the case of the second embodiment, the three microphones m1 to m3 are not linear, and are fixedly arranged at the positions of the three vertices of a triangle (for example, a right triangle or an obtuse triangle) as shown in FIG. Yes. However, the relationship between the microphone intervals is the same as in the first embodiment. That is, the microphone spacing _{D 12} between the first and second microphones m1 and m2 are narrower than the microphone spacing _{D 23} of the second and third microphones m2 and m3, the microphone interval between the first and third microphones m1 and m3 D ₁₃ is wider than the microphone interval D ₁₂ between the first and second microphones m 1 and m 2 and the microphone interval D ₂₃ between the second and third microphones m 2 and m 3.

ここで、マイクロホン間隔制御装置１０Ａを搭載している装置（例えば、ボイスコマンド入力装置）の設計者が意図している音波の方向（指向性）が、第１及び第３のマイクロホンｍ１及びｍ３を結ぶ線分の直交方向であったとする。この場合、３つのマイクロホンｍ１〜ｍ３が三角形の３つの頂点の位置に固定的に配置されているため、２つのマイクロホンとして第１及び第３のマイクロホンｍ１及びｍ３が選択された場合には、選択後の２つの捕捉信号ＳＡ及びＳＢにおける設計者が意図している方向からの成分には時間差が生じないが、２つのマイクロホンとして第１及び第２のマイクロホンｍ１及びｍ２が選択された場合や、２つのマイクロホンとして第２及び第３のマイクロホンｍ２及びｍ３が選択された場合には、選択後の２つの捕捉信号ＳＡ及びＳＢには、設計者が意図している方向からの成分に、図４の距離Ｌに相当する時間差が生じる。 Here, the direction of the sound wave (directivity) intended by the designer of a device (for example, a voice command input device) on which the microphone interval control device 10A is mounted is determined by the first and third microphones m1 and m3. Suppose that it is the orthogonal direction of the connecting line segment. In this case, since the three microphones m1 to m3 are fixedly arranged at the positions of the three vertices of the triangle, the selection is made when the first and third microphones m1 and m3 are selected as the two microphones. There is no time difference in the components from the direction intended by the designer in the latter two captured signals SA and SB, but when the first and second microphones m1 and m2 are selected as the two microphones, When the second and third microphones m2 and m3 are selected as the two microphones, the two captured signals SA and SB after the selection have components from directions intended by the designer, as shown in FIG. A time difference corresponding to the distance L is generated.

そのため、第２の実施形態では、付与遅延選択部１４が設けている。付与遅延選択部１４は、３つのマイクロホンｍ１〜ｍ３からの２つのマイクロホンの選択結果に応じて、選択された２つのマイクロホンからの捕捉信号ＳＡ及びＳＢの時間差を解消するものである。 Therefore, in the second embodiment, the provision delay selection unit 14 is provided. The provision delay selection unit 14 eliminates the time difference between the captured signals SA and SB from the two selected microphones according to the selection result of the two microphones from the three microphones m1 to m3.

付与遅延選択部１４は、上述した機能を発揮できれば、その詳細構成は問われないものであるが、一例として、図４に示すように構成することができる。以下では、距離Ｌに対応する遅延量（時間）がｄであるとして説明する。付与遅延選択部１４は、捕捉信号ＳＡを時間ｄだけ遅延させる遅延部１４Ａ１と、捕捉信号ＳＡをそのまま通過させるか、遅延部１４Ａ１からの遅延信号を通過させるかを切り替える２入力１出力スイッチ１４Ａ２と、捕捉信号ＳＢを時間ｄだけ遅延させる遅延部１４Ｂ１と、捕捉信号ＳＢをそのまま通過させるか、遅延部１４Ｂ１からの遅延信号を通過させるかを切り替える２入力１出力スイッチ１４Ｂ２とを有している。なお、２つのスイッチ１４Ａ１及び１４Ｂ１は、制御ビットが「１」のときにａ入力端子を選択し、制御ビットが「０」のときにｂ入力端子を選択するものである。 The detailed configuration of the grant delay selection unit 14 is not limited as long as the above-described function can be exhibited, but can be configured as shown in FIG. 4 as an example. In the following description, it is assumed that the delay amount (time) corresponding to the distance L is d. The addition delay selection unit 14 delays the capture signal SA by time d, a 2-input 1-output switch 14A2 that switches between passing the capture signal SA as it is or passing the delay signal from the delay unit 14A1. The delay unit 14B1 delays the capture signal SB by time d, and the 2-input 1-output switch 14B2 that switches between passing the capture signal SB as it is or passing the delay signal from the delay unit 14B1. The two switches 14A1 and 14B1 select the a input terminal when the control bit is “1”, and select the b input terminal when the control bit is “0”.

２つのマイクロホンとして第１及び第３のマイクロホンｍ１及びｍ３が選択された場合には、選択後の２つの捕捉信号ＳＡ及びＳＢに時間差が生じていないので、付与遅延選択部１４は、マイクロホン選択部１３からの２つの捕捉信号ＳＡ及びＳＢを共に遅延させることなく、マイクロホンアレイ処理部２０に与えれば良い。また、２つのマイクロホンとして第２及び第３のマイクロホンｍ２及びｍ３が選択された場合には、選択後の２つの捕捉信号ＳＡ及びＳＢ間に捕捉信号ＳＢ側が時間差ｄだけ進んでいる関係があるので、付与遅延選択部１４は、捕捉信号ＳＢだけを時間差ｄだけ遅延させてマイクロホンアレイ処理部２０に与えれば良い。さらに、２つのマイクロホンとして第１及び第２のマイクロホンｍ１及びｍ２が選択された場合には、選択後の２つの捕捉信号ＳＡ及びＳＢ間に捕捉信号ＳＡ側が時間差ｄだけ進んでいる関係があるので、付与遅延選択部１４は、捕捉信号ＳＡだけを時間差ｄだけ遅延させてマイクロホンアレイ処理部２０に与えれば良い。 When the first and third microphones m1 and m3 are selected as the two microphones, there is no time difference between the two captured signals SA and SB after selection. The two captured signals SA and SB from 13 may be supplied to the microphone array processing unit 20 without being delayed. Further, when the second and third microphones m2 and m3 are selected as the two microphones, there is a relationship in which the captured signal SB side advances by the time difference d between the two captured signals SA and SB after selection. The provision delay selection unit 14 may delay the capture signal SB by the time difference d and provide it to the microphone array processing unit 20. Further, when the first and second microphones m1 and m2 are selected as the two microphones, there is a relationship in which the capture signal SA side advances by the time difference d between the two capture signals SA and SB after selection. The provision delay selection unit 14 may delay the capture signal SA by the time difference d and provide it to the microphone array processing unit 20.

マイクロホン選択・遅延制御部１２Ａは、マイクロホン（の捕捉信号）の選択に応じて、マイクロホン選択後の捕捉信号に対する、以上のような遅延制御を実現できるように、第１の実施形態のｍｏｄＧＩ／選択制御信号対応記憶部２３に代えて、ｍｏｄＧＩ／選択・遅延制御信号対応記憶部２３Ａを有している。 The microphone selection / delay control unit 12A mod_GI / selection according to the first embodiment so that the delay control as described above can be realized with respect to the captured signal after selection of the microphone according to the selection of the microphone (captured signal). Instead of the control signal correspondence storage unit 23, a modGI / selection / delay control signal correspondence storage unit 23A is provided.

図５は、ｍｏｄＧＩ／選択・遅延制御信号対応記憶部２３Ａの構成例を示す説明図であり、上述した第１の実施形態の図３に対応する図面である。 FIG. 5 is an explanatory diagram illustrating a configuration example of the modGI / selection / delay control signal correspondence storage unit 23A, and corresponds to FIG. 3 of the first embodiment described above.

ｍｏｄＧＩ／選択・遅延制御信号対応記憶部２３Ａも、ｍｏｄＧＩ取得部２１が取得したｍｏｄＧＩ値ｍｏｄＧＩが属する範囲に応じて、３つのマイクロホンｍ１〜ｍ３（の捕捉信号）から２つのマイクロホン（の捕捉信号）を選択させる選択制御信号ｃｎｔｌｍｉｃを得ることができるようになされている。これに加え、ｍｏｄＧＩ／選択・遅延制御信号対応記憶部２３Ａは、ｍｏｄＧＩ取得部２１が取得したｍｏｄＧＩ値ｍｏｄＧＩが属する範囲に応じて、言い換えると、選択された２つのマイクロホンの組み合わせに応じて、選択後の捕捉信号ＳＡ及びＳＢの時間差を解消させる遅延制御信号ｃｎｔｌｄｌを得ることができるようになされている。遅延制御信号ｃｎｔｌｄｌは、付与遅延選択部１４を上述したように制御するものである。 The modGI / selection / delay control signal correspondence storage unit 23A also has two microphones (capture signals) from three microphones m1 to m3 (capture signals) according to the range to which the modGI value modGI acquired by the modGI acquisition unit 21 belongs. The selection control signal cntlmic for selecting can be obtained. In addition, the modGI / selection / delay control signal correspondence storage unit 23A selects according to the range to which the modGI value modGI acquired by the modGI acquisition unit 21 belongs, in other words, according to the combination of two selected microphones. A delay control signal cntldl that eliminates the time difference between the subsequent acquisition signals SA and SB can be obtained. The delay control signal cntldl controls the added delay selection unit 14 as described above.

第２の実施形態によっても、第１の実施形態と同様な効果を奏することができる。これに加え、第２の実施形態によれば、複数のマイクロホン間で、所定方向からの音波の捕捉タイミングにずれがある場合でも、そのずれを解消して、選択された捕捉信号をマイクロホンアレイ処理部に与えることができる。 According to the second embodiment, the same effect as that of the first embodiment can be obtained. In addition, according to the second embodiment, even when there is a deviation in the timing of capturing sound waves from a predetermined direction among a plurality of microphones, the deviation is eliminated and the selected acquisition signal is processed by microphone array processing. Can be given to the department.

なお、第１の実施形態に係る図１のように、複数のマイクロホンが１直線上に配置されていても、所定方向が、その直線の直交方向でない場合には、第２の実施形態におけるような捕捉タイミングのずれを解消する手段を設けることを要する。 As shown in FIG. 1 according to the first embodiment, even when a plurality of microphones are arranged on one straight line, when the predetermined direction is not the orthogonal direction of the straight line, as in the second embodiment. It is necessary to provide a means for eliminating a large shift in capture timing.

（Ｃ）第３の実施形態
次に、本発明によるマイクロホン間隔制御装置及びプログラムの第３の実施形態を、図面を参照しながら説明する。 (C) Third Embodiment Next, a third embodiment of the microphone interval control apparatus and program according to the present invention will be described with reference to the drawings.

図６は、第３の実施形態のマイクロホン間隔制御装置１０Ｂの構成を示すブロック図であり、上述した第１の実施形態に係る図１との同一、対応部分には同一、対応符号を付して示している。 FIG. 6 is a block diagram showing the configuration of the microphone interval control apparatus 10B of the third embodiment. The same and corresponding parts as those in FIG. 1 according to the first embodiment described above are assigned the same and corresponding reference numerals. It shows.

図６において、第３の実施形態のマイクロホン間隔制御装置１０Ｂは、２つのマイクロホンｍｆ、ｍｍ、ｍｏｄＧＩ計算部１１、マイクロホン移動制御部１２Ｂ及びマイクロホン移動機構１５を有する。 In FIG. 6, the microphone interval control apparatus 10 </ b> B of the third embodiment includes two microphones mf, mm, modGI calculation unit 11, microphone movement control unit 12 </ b> B, and microphone movement mechanism 15.

２つのマイクロホンｍｆ及びｍｍのうち、一方のマイクロホンｍｆは固定的に設けられており、他方のマイクロホンｍｍはマイクロホン移動制御部１２Ｂによって移動可能に設けられている。マイクロホンｍｍの移動方向は、固定設置のマイクロホンｍｆに近付き又は遠ざかる方向である。すなわち、この第３の実施形態の場合、マイクロホンの数は２つであるが、その２つのマイクロホンｍｆ及びｍｍ間の間隔を変えることができるようになされている。 Of the two microphones mf and mm, one microphone mf is fixedly provided, and the other microphone mm is provided movably by the microphone movement control unit 12B. The moving direction of the microphone mm is a direction to approach or move away from the microphone mf that is fixedly installed. That is, in the case of the third embodiment, the number of microphones is two, but the interval between the two microphones mf and mm can be changed.

マイクロホン移動機構１５は、移動可能なマイクロホンｍｍを移動する機構であり、その詳細構成は限定されない。一例を挙げれば以下の通りである。マイクロホン移動機構１５は、駆動源としてステッピングモータを備え、ステッピングモータの回転軸にはピニオンが取り付けられている。このピニオンと係合するラックはガイドによって直動可能となされており、このラックの側面や裏面などの一部に移動可能なマイクロホンｍｍが固定的に取り付けられている。 The microphone moving mechanism 15 is a mechanism for moving the movable microphone mm, and its detailed configuration is not limited. An example is as follows. The microphone moving mechanism 15 includes a stepping motor as a drive source, and a pinion is attached to the rotation shaft of the stepping motor. A rack that engages with the pinion can be linearly moved by a guide, and a movable microphone mm is fixedly attached to a part of the side or back of the rack.

ｍｏｄＧＩ計算部１１は、マイクロホンｍｆから出力された捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩを計算し、得られたｍｏｄＧＩ値ｍｏｄＧＩをマイクロホン移動制御部１２Ｂに与える。 The modGI calculation unit 11 calculates a modGI value modGI for the captured signal s1 output from the microphone mf, and gives the obtained modGI value modGI to the microphone movement control unit 12B.

マイクロホン移動制御部１２Ｂは、計算されたｍｏｄＧＩ値ｍｏｄＧＩに基づいて、言い換えると、捕捉信号ｓ１における雑音成分が有する周波数特性に基づいて、２つのマイクロホンｍｆ及びｍｍ間の間隔を最適化するマイクロホンｍｍの移動量を定め、移動制御信号ｃｎｔｌｍｖをマイクロホン移動機構１５に与える。 Based on the calculated modGI value modGI, in other words, based on the frequency characteristic of the noise component in the captured signal s1, the microphone movement control unit 12B optimizes the distance between the two microphones mf and mm. A movement amount is determined, and a movement control signal cntlmv is given to the microphone moving mechanism 15.

上述した機能を発揮できれば、マイクロホン移動制御部１２Ｂの内部構成は問われない。一例を挙げると、以下のとおりである。マイクロホン移動制御部１２Ｂは、現状のマイクロホンｍｍの位置１２Ｂ１（の情報）を記憶している。また、マイクロホン移動制御部１２Ｂは、ｍｏｄＧＩ値ｍｏｄＧＩが属する範囲と、その範囲に適したマイクロホンｍｍの位置とを対応付けた、図７に示すような構成のｍｏｄＧＩ／マイクロホン位置対応記憶部（テーブル）２３Ｂを内蔵している。マイクロホン移動制御部１２Ｂは、ｍｏｄＧＩ／マイクロホン位置対応記憶部（テーブル）２３Ｂから、計算されたｍｏｄＧＩ値ｍｏｄＧＩに適したマイクロホンｍｍの位置を取り出し、取り出した位置と現状のマイクロホンｍｍの位置１２Ｂ１との差を求め、現状のマイクロホンｍｍの位置１２Ｂ１から取り出した位置へマイクロホンｍｍを移動させる移動制御信号ｃｎｔｌｍｖを生成してマイクロホン移動機構１５に与える。 As long as the above-described functions can be exhibited, the internal configuration of the microphone movement control unit 12B is not limited. An example is as follows. The microphone movement control unit 12B stores the current position 12B1 (information) of the microphone mm. Further, the microphone movement control unit 12B associates a range to which the modGI value modGI belongs with a position of the microphone mm suitable for the range, and a modGI / microphone position correspondence storage unit (table) configured as shown in FIG. 23B is built-in. The microphone movement control unit 12B extracts the position of the microphone mm suitable for the calculated modGI value modGI from the modGI / microphone position correspondence storage unit (table) 23B, and the difference between the extracted position and the current position 12B1 of the microphone mm. And a movement control signal cntlmv for moving the microphone mm to the position taken out from the current position 12B1 of the microphone mm is generated and given to the microphone moving mechanism 15.

第３の実施形態によっても、捕捉信号における雑音成分の周波数特性を反映させたｍｏｄＧＩを計算し、それに応じて、２つのマイクロホンの間隔を制御するようにしたので、第１の実施形態と同様な効果を奏することができる。 Also in the third embodiment, the modGI reflecting the frequency characteristic of the noise component in the captured signal is calculated, and the interval between the two microphones is controlled accordingly, so the same as in the first embodiment. There is an effect.

（Ｄ）第４の実施形態
次に、本発明によるマイクロホン間隔制御装置及びプログラムの第４の実施形態を、図面を参照しながら説明する。 (D) Fourth Embodiment Next, a fourth embodiment of the microphone interval control device and program according to the present invention will be described with reference to the drawings.

図８は、第４の実施形態のマイクロホン間隔制御装置１０Ｃの構成を示すブロック図であり、上述した第１の実施形態に係る図１との同一、対応部分には同一、対応符号を付して示している。 FIG. 8 is a block diagram showing the configuration of the microphone interval control apparatus 10C of the fourth embodiment. The same and corresponding parts as those in FIG. 1 according to the first embodiment described above are assigned the same and corresponding reference numerals. It shows.

図８において、第４の実施形態のマイクロホン間隔制御装置１０Ｃは、第１〜第３のマイクロホンｍ１〜ｍ３、ｍｏｄＧＩ計算部１１Ｃ、マイクロホン選択制御部１２及びマイクロホン選択部１３に加え、音声区間検出部１６を有する。 In FIG. 8, the microphone interval control device 10C according to the fourth embodiment includes a voice section detection unit in addition to the first to third microphones m1 to m3, the modGI calculation unit 11C, the microphone selection control unit 12, and the microphone selection unit 13. 16

音声区間検出部１６は、第１の捕捉信号ｓ１が音声区間か否かを、公知の音声検出技術によって検出するものであり、検出結果ＶをｍｏｄＧＩ計算部１１Ｃに与えるものである。 The voice section detection unit 16 detects whether or not the first captured signal s1 is a voice section by a known voice detection technique, and gives the detection result V to the modGI calculation unit 11C.

第４の実施形態のｍｏｄＧＩ計算部１１Ｃは、検出結果Ｖが非音声区間を表しているときに捕捉信号ｓ１についてのｍｏｄＧＩ値ｍｏｄＧＩを計算し、検出結果Ｖが音声区間を表しているときに直前のｍｏｄＧＩ値ｍｏｄＧＩを維持させ、以上のように非音声区間でのみ更新されたｍｏｄＧＩ値ｍｏｄＧＩをマイクロホン選択制御部１２に与えるものである。 The modGI calculation unit 11C of the fourth embodiment calculates the modGI value modGI for the captured signal s1 when the detection result V represents a non-speech interval, and immediately before the detection result V represents a speech interval. The present mode GI value mod GI is maintained, and the mod GI value mod GI updated only in the non-voice interval as described above is given to the microphone selection control unit 12.

従って、第４の実施形態においては、非音声区間（言い換えると雑音区間）においてのみマイクロホン間隔の見直しが実行される。 Therefore, in the fourth embodiment, the microphone interval is reviewed only in the non-voice section (in other words, the noise section).

第４の実施形態によれば、非音声区間でのみｍｏｄＧＩ値を更新してマイクロホンの選択を制御するので、第１の実施形態より高い効果を期待できる。 According to the fourth embodiment, since the mod GI value is updated only in the non-speech period and the selection of the microphone is controlled, a higher effect than in the first embodiment can be expected.

（Ｅ）他の実施形態
上記各実施形態ではｍｏｄＧＩを適用する場合を示したが、修正される前のＧＩ（特許文献２の（４）式参照）も、信号波形の傾き方向が変化する回数とその大きさを測る指標であるので、上記各実施形態におけるｍｏｄＧＩに代えてＧＩを適用するようにしても良い。 (E) Other Embodiments In each of the above embodiments, mod GI is applied. However, the number of times that the inclination direction of the signal waveform changes also in the GI before correction (see Equation (4) in Patent Document 2). Therefore, the GI may be applied instead of the modGI in each of the above embodiments.

上記各実施形態では、マイクロホン間隔の見直しを常時行うように説明したが、背景雑音の周波数特性は定常的であることが多く、マイクロホン間隔の見直しを一時的又は間欠的に実行するように装置を構成しても良い。例えば、通信端末の電源オン時など、装置の使用開始時にのみマイクロホン間隔の見直しを稼働させ、それ以降は停止させるようにしても良い。 In each of the embodiments described above, the microphone interval is constantly reviewed. However, the frequency characteristics of the background noise are often steady, and the apparatus is arranged so that the microphone interval is reviewed temporarily or intermittently. It may be configured. For example, the review of the microphone interval may be activated only at the start of use of the apparatus, such as when the communication terminal is turned on, and may be stopped thereafter.

マイクロホンを選択する上記第１、第２、第４の実施形態では、マイクロホンの数が３個の場合を示したが、マイクロホン数は３個に限定されないことは勿論である。 In the first, second, and fourth embodiments in which the microphone is selected, the case where the number of microphones is three has been described, but it is needless to say that the number of microphones is not limited to three.

上記各実施形態では、一つの捕捉信号ｓ１から各種制御部へ与えるｍｏｄＧＩ値を得るものを示したが、各種の制御部へ与えるｍｏｄＧＩ値の取得方法はこれに限定されるものではない。例えば、３種類の捕捉信号ｓ１、ｓ２、ｓ３のそれぞれについてｍｏｄＧＩ値を計算し、得られた３つのｍｏｄＧＩ値の平均値や中央値などを制御部へ与えるｍｏｄＧＩ値とするようにしても良い。 In each of the above embodiments, a method for obtaining a modGI value to be given to various control units from one captured signal s1 has been described. However, a method for obtaining a modGI value to be given to various control units is not limited to this. For example, a modGI value may be calculated for each of the three types of captured signals s1, s2, and s3, and an average value, a median value, and the like of the obtained three modGI values may be used as a modGI value to be given to the control unit.

上記第３の実施形態では、一方のマイクロホンだけ移動可能なものを示したが、両方のマイクロホンが共に移動可能なものであっても良い。 In the third embodiment, one of the microphones is movable. However, both microphones may be movable.

上記では、マイクロホンの選択と、マイクロホンの移動とが択一的な実施形態を説明したが、マイクロホンの選択とマイクロホンの移動とを組み合わせた実施形態を構成するようにしても良い。要は、マイクロホンアレイに入力される２つの捕捉信号を得たマイクロホンの間隔をｍｏｄＧＩ値に制御できるものであれば良い。 In the above description, the embodiment in which the selection of the microphone and the movement of the microphone are alternatives has been described. However, an embodiment in which the selection of the microphone and the movement of the microphone are combined may be configured. In short, it is only necessary that the interval between the microphones from which the two captured signals input to the microphone array are obtained can be controlled to the modGI value.

上記各実施形態では、ｍｏｄＧＩ値が新しい範囲に属すると直ちに制御信号を切り換えるものを示したが、ｍｏｄＧＩ値が新しい範囲に属することが所定回数連続したときに初めて制御信号を切り換えるようにしても良い。 In each of the above embodiments, the control signal is switched immediately when the modGI value belongs to the new range. However, the control signal may be switched only when the modGI value belongs to the new range for a predetermined number of times. .

マイクロホンアレイ処理部の中には、遅延部を内部に有するものがある（特許文献１の図４参照）。第２の実施形態における付与遅延選択部１４内の遅延部として、マイクロホンアレイ処理部内の遅延部を流用するようにしても良い。 Some microphone array processing units have a delay unit therein (see FIG. 4 of Patent Document 1). The delay unit in the microphone array processing unit may be used as the delay unit in the added delay selection unit 14 in the second embodiment.

上記では音声信号が目的音のように説明したが、機械音などの音響信号が目的音の場合にも、本発明の技術思想を適用することができる。 In the above description, the audio signal is described as the target sound. However, the technical idea of the present invention can be applied to the case where the acoustic signal such as a mechanical sound is the target sound.

上記各実施形態では、マイクロホンアレイからの捕捉信号を直ちに処理する装置やプログラムを示したが、マイクロホンアレイからの捕捉信号を記録媒体に記録し、それを再生する場合にも、本発明を適用することができる。 In each of the above embodiments, an apparatus or a program for immediately processing a captured signal from the microphone array has been described. However, the present invention is also applied to a case where the captured signal from the microphone array is recorded on a recording medium and reproduced. be able to.

ｍ１〜ｍ３、ｍｆ、ｍｍ、…マイクロホン、１０、１０Ａ、１０Ｂ、１０Ｃ…マイクロホン間隔制御装置、１１、１１Ｃ…ｍｏｄＧＩ計算部、１２…マイクロホン選択制御部、１２Ａ…マイクロホン選択・遅延制御部、１２Ｂ…マイクロホン移動制御部、１３…マイクロホン選択部、１４…付与遅延選択部、１５…マイクロホン移動機構、１６…音声区間検出部、２１…ｍｏｄＧＩ取得部、２２…ｍｏｄＧＩ照合・選択制御信号生成部、２３…ｍｏｄＧＩ／選択制御信号対応記憶部、２３Ａ…ｍｏｄＧＩ／選択・遅延制御信号対応記憶部、２４…選択制御信号送信部。 m1 to m3, mf, mm,... microphone, 10, 10A, 10B, 10C... microphone interval control device, 11, 11C... modGI calculation unit, 12 .. microphone selection control unit, 12A .. microphone selection / delay control unit, 12B. Microphone movement control unit, 13 ... microphone selection unit, 14 ... grant delay selection unit, 15 ... microphone movement mechanism, 16 ... voice interval detection unit, 21 ... modGI acquisition unit, 22 ... modGI collation / selection control signal generation unit, 23 ... modGI / selection control signal correspondence storage unit, 23A... modGI / selection / delay control signal correspondence storage unit, 24... selection control signal transmission unit.

Claims

Based on at least one of the signals captured by a plurality of microphones, it represents the number and magnitude of changes in the direction of the signal waveform slope in the captured signal of the microphone, and reflected the inclusion of high-frequency components. A feature amount calculating means for calculating a feature amount;
A microphone interval control device, comprising: a microphone interval optimizing unit that adjusts a microphone interval between microphones that output two captured signals to be output to the next-stage device according to the calculated feature amount.

The microphone interval optimizing means selects capture signals from two microphones according to the calculated feature amount from capture signals obtained from three or more microphones fixedly installed, and The microphone interval control apparatus according to claim 1, wherein the microphone interval is virtually changed.

2. The microphone according to claim 1, wherein the microphone interval optimization unit changes the microphone interval by moving at least one movable microphone to a position corresponding to the calculated feature amount. Interval control device.

The feature amount calculating means includes:
A target sound section detector for detecting a target sound section in the captured signal;
The microphone interval according to any one of claims 1 to 3, further comprising: a feature amount calculation main body that recalculates the feature amount only when the detection result of the target sound interval detection unit is not the target sound interval. Control device.

5. The coefficient feature quantity calculating means calculates a value obtained by normalizing the power of the second-order difference of the capture signal with the power of the capture signal as a feature quantity. Target sound component detection device.

Computer
Based on at least one of the signals captured by a plurality of microphones, it represents the number and magnitude of changes in the direction of the signal waveform slope in the captured signal of the microphone, and reflected the inclusion of high-frequency components. A feature amount calculating means for calculating a feature amount;
A microphone interval control program that functions as a microphone interval optimizing unit that adjusts a microphone interval between microphones that output two captured signals to be output to the next stage device in accordance with the calculated feature amount. .