JP4483105B2

JP4483105B2 - Microphone device

Info

Publication number: JP4483105B2
Application number: JP2001063628A
Authority: JP
Inventors: 良和高橋; 誠赤羽
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2001-03-07
Filing date: 2001-03-07
Publication date: 2010-06-16
Anticipated expiration: 2021-03-07
Also published as: JP2002271885A

Description

【０００１】
【発明の属する技術分野】
本発明は、例えば、目的音源である話者の位置が絶えず変動するような家庭の居間やオフィスの会議室などのような環境において、簡単に指向性の向きを変えることができる音声収録や音声認識のためのマイクロホン装置に関するものである。
【０００２】
【従来の技術】
後述する参考文献［１］は、［２］に記載される３マイクロホンによるマイクロホンシステムを拡張し、無指向性マイクカプセル５つを用いて、３００Ｈｚ〜５ｋＨｚ，ビーム幅約１２０度の広帯域狭角度指向性マイクロホンの作成に成功している。図１５に従来のマイクロホンシステムのブロック図を示す。図１５において、マイクロホンＭＩＣ０，ＭＩＣ２，ＭＩＣ３，ＭＩＣ１Ａ，ＭＩＣ１Ｂは、４cm×７cmの平面領域に収納されている。そして、加算器１５０、１５１により２つのマイクロホンＭＩＣ２およびＭＩＣ３、ＭＩＣ１ＡおよびＭＩＣ１Ｂの差分を用いている。さらに、上述の減算器１５０、１５１の差分出力に対して、積分器１５６、１５２を用いて位相成分の排除と広帯域化を図っている。また、求める指向性の関数を先に求めて、これに対してフーリエ級数を用いて必要な成分１５２および１５３を求めている。また、共振点を有するローパスフィルタ（ＬＰＦ）１５５を用いて、さらに高域の補正を行っている。
【０００３】
参考文献［１］河野、中村、大和、高島、“広帯域狭角度指向性マイクロホンシステム”、信学技報ＥＡ９９−８５、１９９９年１２月
参考文献［２］Nakamura, Kouno, Yamato, Sakiyama, "Realization of Wide-Band Directivity with Three Microphones", IEICE Trans. Fundamentals, vol. E82-A, No. 4, April 1999
【０００４】
【発明が解決しようとする課題】
しかし、上述した従来のマイクロホンシステムでは、トランジスタやオペアンプなどのハードウエアで構成したため、指向性の主軸が固定となり、特にビーム主軸を任意に制御することができないという不都合があった。
【０００５】
また、抵抗器やコンデンサなどの定数の誤差が指向性の制御に影響するという不都合があった。
【０００６】
そこで、本発明は、かかる点に鑑みてなされたものであり、指向性の主軸を任意に制御することができ、指向性の鋭度を向上させることができると共に、１つのマイクロホン装置のみを使用するだけで、例えばマイクロホンを中心とする左右の音源を分離してリアルタイムに音声収録または音声認識をすることができるマイクロホン装置を提供することを課題とする。
【０００７】
【課題を解決するための手段】
本発明のマイクロホン装置は、音源からの音波が入力されるマイクロホンを用いてマイクロホンの指向特性を制御するマイクロホン装置において、基準マイクロホンと、基準マイクロホンを中心に等間隔に配置される第１の１対のマイクロホンと、基準マイクロホンを中心に第１の１対のマイクロホンに直交して等間隔に配置される第２の１対のマイクロホンと、基準マイクロホンを中心に第１の１対のマイクロホンおよび第２の１対のマイクロホンに対して４５度傾けて等間隔に配置される第３の１対のマイクロホンと、基準マイクロホン、第１、第２および第３の１対の各マイクロホンの出力をそれぞれディジタル信号に変換するＡ／Ｄ変換部と、Ａ／Ｄ変換部からのディジタル信号に対して信号処理を施す演算処理部とを備え、基準マイクロホン、第１、第２および第３の１対の各マイクロホンは同一平面上に配置され、演算処理部は、第１の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｃｏｓθで振幅が変化する第１の中間生成出力を得る処理と、第２の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｓｉｎθで振幅が変化する第２の中間生成出力を得る処理と、第２の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｃｏｓ２θで振幅が変化する第３の中間生成出力を得る処理と、第３の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｓｉｎ２θで振幅が変化する第４の中間生成出力を得る処理と、目標とする指向特性を、次数が２次のフーリエ級数の係数α０，α１，β１，α２，β２によって表し、基準マイクロホンの出力，第１の中間生成出力，第２の中間生成出力，第３の中間生成出力，第４の中間生成出力を、それぞれ係数α０，α１，β１，α２，β２を用いて重み付けして加算する処理とを行うようにしたものである。
【０００８】
従って本発明によれば、以下の作用をする。基準マイクロホン、第１、第２および第３の１対の各マイクロホンの出力に対してＡ／Ｄ変換部により各ディジタル信号を得た後に、演算処理部により各ディジタル信号に対して信号処理を施す。
【０００９】
演算処理部において施される信号処理は、次のとおりである。
第１の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｃｏｓθで振幅が変化する第１の中間生成出力を得る。
第２の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｓｉｎθで振幅が変化する第２の中間生成出力を得る。
第２の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｃｏｓ２θで振幅が変化する第３の中間生成出力を得る。
第３の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｓｉｎ２θで振幅が変化する第４の中間生成出力を得る。
目標とする指向特性を、次数が２次のフーリエ級数の係数α０，α１，β１，α２，β２によって表し、基準マイクロホンの出力，第１の中間生成出力，第２の中間生成出力，第３の中間生成出力，第４の中間生成出力を、それぞれ係数α０，α１，β１，α２，β２を用いて重み付けして加算する。
【００１３】
これらの演算処理部での処理により、容易に指向性の主軸を任意に制御することができ、さらに指向性の鋭度を向上させる。
【００１４】
【発明の実施の形態】
以下に、本発明の実施の形態を説明する。
本実施の形態のマイクロホン装置は、３つ〜７つのマイクロホンを組み合わせ、これらをディジタル信号処理することにより、主軸を容易に可変することができ、音声認識に適した広帯域、狭指向性を実現すると共に、複数の主軸方向からの音声を分離して取得することができるため、音声認識システムやテレビ会議の収録システムに最適なものである。
【００１５】
図１は、本実施の形態が適用されるマイクカプセルの配置図である。
図１において、基準マイクロホンＭＩＣ０と、基準マイクロホンＭＩＣ０を中心に配置される第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２と、基準マイクロホンＭＩＣ０を中心に第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２に直交して配置される第２の１対のマイクロホンＭＩＣ３，ＭＩＣ４と、基準マイクロホンＭＩＣ０を中心に第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２および第２の１対のマイクロホンＭＩＣ３，ＭＩＣ４に対して４５度傾けて配置される第３の１対のマイクロホンＭＩＣ５，ＭＩＣ６とがそれぞれ配置される。これらのマイクロホンＭＩＣ０〜ＭＩＣ６は、平面空間に配置されている。また、ＭＩＣ１，ＭＩＣ０，ＭＩＣ２に基づく基準軸に対する音波の入射角度をｓ度とする。
【００１６】
ここで、基準マイクロホンＭＩＣ０の位置Ｐに対して、第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２はそれぞれ等間隔ｄ１で配置され、第２の１対のマイクロホンＭＩＣ３，ＭＩＣ４はそれぞれ等間隔ｄ２で配置され、第３の１対のマイクロホンＭＩＣ５，ＭＩＣ６はそれぞれ等間隔ｄ３で配置される。
【００１７】
図２は、各マイクロホンの配置に応じた音源からの距離差を示す図である。
図２Ａは第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２の配置に応じた音源からの時間差を示す。図２Ａにおいて、入射角ｓ度で入射した音源からの音声は、基準マイクロホンＭＩＣ０の位置Ｐに対して、マイクロホンＭＩＣ１には距離（＋ｄ１ｃｏｓ（ｓ））に対応した時間だけ短い時間で到達し、マイクロホンＭＩＣ２には距離（＋ｄ１ｃｏｓ（ｓ））に対応した時間だけ長い時間で到達する。
【００１８】
図２Ｂは第２の１対のマイクロホンＭＩＣ３，ＭＩＣ４の配置に応じた音源からの距離差を示す。図２Ｂにおいて、入射角ｓ度で入射した音源からの音声は、基準マイクロホンＭＩＣ０の位置Ｐに対して、マイクロホンＭＩＣ３には距離（＋ｄ２ｓｉｎ（ｓ））に対応した時間だけ短い時間で到達し、マイクロホンＭＩＣ４には距離（＋ｄ２ｓｉｎ（ｓ））に対応した時間だけ長い時間で到達する。
【００１９】
図２Ｃは第３の１対のマイクロホンＭＩＣ５，ＭＩＣ６の配置に応じた音源からの距離差を示す。図２Ｃにおいて、入射角ｓ度で入射した音源からの音声は、基準マイクロホンＭＩＣ０の位置Ｐに対して、マイクロホンＭＩＣ６には距離（＋ｄ３ｓｉｎ（４５−ｓ））に対応した時間だけ短い時間で到達し、マイクロホンＭＩＣ５には距離（＋ｄ３ｓｉｎ（４５−ｓ））に対応した時間だけ長い時間で到達する。
【００２０】
図３は、上述したマイクロホンを用いたマイクロホン装置のハードウエア構成図である。図３において、マイクロホン装置は、ＭＩＣ０（１），ＭＩＣ１（２），ＭＩＣ２（３），ＭＩＣ３（４），ＭＩＣ４（５），ＭＩＣ５（６），ＭＩＣ６（７）と、各ＭＩＣ０（１），ＭＩＣ１（２），ＭＩＣ２（３），ＭＩＣ３（４），ＭＩＣ４（５），ＭＩＣ５（６），ＭＩＣ６（７）からの信号を信号処理可能に増幅するアンプ８、アンプ９、アンプ１０、アンプ１１、アンプ１２、アンプ１３、アンプ１４と、各アンプ８、アンプ９、アンプ１０、アンプ１１、アンプ１２、アンプ１３、アンプ１４で増幅された信号をディジタル信号に変換するＡ／Ｄ変換器１５、Ａ／Ｄ変換器１６、Ａ／Ｄ変換器１７、Ａ／Ｄ変換器１８、Ａ／Ｄ変換器１９、Ａ／Ｄ変換器２０、Ａ／Ｄ変換器２１と、各Ａ／Ｄ変換器１５、Ａ／Ｄ変換器１６、Ａ／Ｄ変換器１７、Ａ／Ｄ変換器１８、Ａ／Ｄ変換器１９、Ａ／Ｄ変換器２０、Ａ／Ｄ変換器２１で変換されたディジタル信号に対して信号処理を施す演算処理装置２２と、演算処理装置２２で信号処理された結果を収録処理または音声認識処理する収録機器または音声認識装置２３とを有して構成される。
【００２１】
各マイクロホンに入力される信号に対して、演算処理装置２２において施される信号処理を図４のフローチャートに示す。図４において、ステップＳ１で、既にｉ＝０とする処理の初期化が行われている。
【００２２】
ここで、ＭＩＣ０からＲだけ離れた音源からの音をまとめると、以下の数１式、数２式、数３式、数４式、数５式、数６式、数７式、数８式のようになる。上述において、数１式のＸｓ（ｔ）は音源信号を表し、数２式のｘＭＩＣ０（ｔ）はＭＩＣ０の位置で時刻ｔに観測される信号であり、数３式のｘＭＩＣ１（ｔ）はＭＩＣ１の位置で時刻ｔに観測される信号であり、数４式のｘＭＩＣ２（ｔ）はＭＩＣ２の位置で時刻ｔに観測される信号であり、数５式のｘＭＩＣ３（ｔ）はＭＩＣ３の位置で時刻ｔに観測される信号であり、数６式のｘＭＩＣ４（ｔ）はＭＩＣ４の位置で時刻ｔに観測される信号であり、数７式のｘＭＩＣ５（ｔ）はＭＩＣ５の位置で時刻ｔに観測される信号であり、数８式のｘＭＩＣ６（ｔ）はＭＩＣ６の位置で時刻ｔに観測される信号である。ここで、ｋ＝ω／ｃ、ωは信号の角周波数を表し、ｃは音速を表し、θは音源のマイクロホンの基準軸に対する入射角を表す。
【００２３】
【数１】

【００２４】
【数２】

【００２５】
【数３】

【００２６】
【数４】

【００２７】
【数５】

【００２８】
【数６】

【００２９】
【数７】

【００３０】
【数８】

【００３１】
これらの音声信号は、各ＭＩＣ０（１），ＭＩＣ１（２），ＭＩＣ２（３），ＭＩＣ３（４），ＭＩＣ４（５），ＭＩＣ５（６），ＭＩＣ６（７）において、電気信号に変換され、各アンプ８、アンプ９、アンプ１０、アンプ１１、アンプ１２、アンプ１３、アンプ１４で増幅された後に、各Ａ／Ｄ変換器１５、Ａ／Ｄ変換器１６、Ａ／Ｄ変換器１７、Ａ／Ｄ変換器１８、Ａ／Ｄ変換器１９、Ａ／Ｄ変換器２０、Ａ／Ｄ変換器２１でディジタル信号に変換される。ただし、上述した各マイクロホンの感度および各アンプのゲインは、一定であると仮定する。
【００３２】
このディジタル信号は、演算処理装置２２の中で以下のような処理が施される。図４においてステップＳ２でサンプリングが行われる。具体的には、フレーム期間毎にディジタル信号のサンプリングが行われる。ステップＳ３でマイクロホン出力のミキシングが行われる。具体的には、各マイクロホンから得られたディジタル信号は、演算処理装置２２により以下の数９式、数１０式、数１１式、数１２式に示すようなミキシング処理が施されることにより、ｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）に変換される。
【００３３】
【数９】

【００３４】
【数１０】

【００３５】
【数１１】

【００３６】
【数１２】

【００３７】
上述した数９式、数１０式、数１１式、数１２式に示すｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）の各信号は、ＭＩＣ０での信号に対して、それぞれ以下の数１３式、数１４式、数１５式、数１６式に示すような信号である。
【００３８】
【数１３】

【００３９】
【数１４】

【００４０】
【数１５】

【００４１】
【数１６】

【００４２】
すなわち、数１３式、数１４式、数１５式、数１６式に示すｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）の各信号は、ＭＩＣ０で観測される数２式のｘＭＩＣ０（ｔ）信号に対して、それぞれ、ｊｓｉｎ（ｋｄ１ｃｏｓθ）、ｊｓｉｎ（ｋｄ２ｓｉｎθ）、ｃｏｓ（ｋｄ２ｓｉｎθ）、ｃｏｓ（ｋｄ３ｓｉｎ（π／４−θ））の特性が加わっていることが分かる。
【００４３】
すなわち、数１３式、数１４式、数１５式、数１６式に示すｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）の各信号は、入力される信号の角周波数ω（ただし、ｋ＝ω／ｃ）と入射角θによって特性が変化することになる。また、虚数成分ｊを含む数１３式に示すｘＡ（ｔ）および数１４式に示すｘＢ（ｔ）は、ＭＩＣ０で観測される数２式のｘＭＩＣ０（ｔ）信号に対して、位相が９０度進んでいることが分かる。
【００４４】
ステップＳ４で、ミキシングされた各信号はバッファーにストアーされる。具体的には、数１３式、数１４式、数１５式、数１６式に示すｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）の各信号は、それぞれフレーム処理で用いられるサンプル数Ｎに応じたバッファー数Ｎのフレームバッファに蓄えられる。
【００４５】
ステップＳ５で、処理の回数を示すｉをインクリメントする。ステップＳ６で、ｉ＝Ｎであるか否かを判断する。ステップＳ６でｉ＝Ｎでないときは、ステップＳ２へ戻り、ステップＳ２〜ステップＳ６までの処理および判断を繰り返す。
【００４６】
ステップＳ６でｉ＝Ｎとなったときは、ステップＳ７で前処理を行う。具体的には、バッファー数Ｎのフレームバッファがすべてに数１３式、数１４式、数１５式、数１６式に示すｘＡ（ｔ）、ｘＢ（ｔ）、ｘＣ（ｔ）、ｘＤ（ｔ）の各信号を蓄えられているが、このフレームバッファがすべて埋まった時点で、フレーム処理の前処理として、連続音声のフレーミングの影響を軽減するためのハミング窓またはハニング窓などの窓処理が行われる。
【００４７】
ステップＳ８で、フレーム処理が行われる。具体的には、高速フーリエ変換（ＦＦＴ）を用いて、位相変換および振幅特性の補正の各処理が行われる。
【００４８】
まず、数１３式に示すｘＡ（ｔ）に対するＦＦＴの出力ＸＡ（ω）について説明する。ｘＡ（ｔ）の振幅成分であるｓｉｎ（ｋｄ１ｃｏｓθ）（ここで、ｄ１＝０．００８ｍとする。）の入射角度依存特性を図５に示す。図５において、ｓｉｎ（ｋｄ１ｃｏｓθ）の入射角度依存特性は、信号の角周波数ω（ただし、ｋ＝ω／ｃ）（１０００Ｈｚ、２０００Ｈｚ、３０００Ｈｚ、４０００Ｈｚ、５０００Ｈｚ、６０００Ｈｚ）に応じて変化していることが分かる。
【００４９】
そこで、ｓｉｎ（ｋｄ１ｃｏｓθ）／ｓｉｎ（ｋｄ１）の入射角度依存特性を図６に示す。いま、ＸＡ（ω）／ｓｉｎ（ｋｄ１）について考えてみる。図６において、ｓｉｎ（ｋｄ１ｃｏｓθ）／ｓｉｎ（ｋｄ１）の入射角度依存特性は、信号の角周波数ω（ただし、ｋ＝ω／ｃ）（１０００Ｈｚ、２０００Ｈｚ、３０００Ｈｚ、４０００Ｈｚ、５０００Ｈｚ、６０００Ｈｚ）による変動がほぼなくなることが分かる。
【００５０】
また、上述したように、虚数成分ｊを含む数１３式に示すｘＡ（ｔ）は、ＭＩＣ０で観測される数２式のｘＭＩＣ０（ｔ）信号に対して、位相が９０度進んでいるので、数１７式、数１８式のようにＸ’ＲＡ（ω）、Ｘ’ＩＡ（ω）とすると、位相進みがなくなる。ここで、数１７式、数１８式におけるφ_A（ω）は、Ｘ_A（ω）の位相を表わすものである。
【００５１】
【数１７】

【００５２】
【数１８】

【００５３】
ここで、Ｘ’Ａ（ω）＝Ｘ’ＲＡ（ω）＋ｊＸ’ＩＡ（ω）であり、数１７式、数１８式は位相変換後のスペクトルを表す。さらに、ｋｄ１＜＜１とすると、ｓｉｎ（ｋｄ１ｃｏｓ（θ））はｋｄ１ｃｏｓθに近似できるので、以下の数１９式の関係となり、数１３式に示すｘＡ（ｔ）に対するＦＦＴの出力ＸＡ（ω）から、ＭＩＣ０で入力される信号に対して、ｃｏｓθで振幅が変化する成分を得ることができる。
【００５４】
【数１９】

【００５５】
同様にして、ＭＩＣ０で入力される信号に対して考えると以下のようになる。数１４式に示すｘＢ（ｔ）に対するＦＦＴの出力ＸＢ（ω）について説明する。ｘＢ（ｔ）の振幅成分であるｓｉｎ（ｋｄ２ｓｉｎ（θ））（ここで、ｄ２＝０．００８ｍとする。）の入射角度依存特性を図７に示す。図７において、ｓｉｎ（ｋｄ２ｓｉｎ（θ））の入射角度依存特性は、信号の角周波数ω（ただし、ｋ＝ω／ｃ）（１０００Ｈｚ、２０００Ｈｚ、３０００Ｈｚ、４０００Ｈｚ、５０００Ｈｚ、６０００Ｈｚ）に応じて変化していることが分かる。
【００５６】
そこで、ｓｉｎ（ｋｄ２ｓｉｎ（θ））／ｓｉｎ（ｋｄ２）の入射角度依存特性を図８に示す。いま、ＸＢ（ω）／ｓｉｎ（ｋｄ２）について考えてみる。図８において、ｓｉｎ（ｋｄ２ｓｉｎθ）／ｓｉｎ（ｋｄ２）の入射角度依存特性は、信号の角周波数ω（ただし、ｋ＝ω／ｃ）（１０００Ｈｚ、２０００Ｈｚ、３０００Ｈｚ、４０００Ｈｚ、５０００Ｈｚ、６０００Ｈｚ）による変動がほぼなくなることが分かる。
【００５７】
また、上述したように、虚数成分ｊを含む数１４式に示すｘＢ（ｔ）は、ＭＩＣ０で観測される数２式のｘＭＩＣ０（ｔ）信号に対して、位相が９０度進んでいるので、数２０式、数２１式のようにＸ’ＲＢ（ω）、Ｘ’ＩＢ（ω）とすると、位相進みがなくなる。ここで、数２０式、数２１式におけるφ_B（ω）は、Ｘ_B（ω）の位相を表わすものである。
【００５８】
【数２０】

【００５９】
【数２１】

【００６０】
ここで、Ｘ’Ｂ（ω）＝Ｘ’ＲＢ（ω）＋ｊＸ’ＩＢ（ω）であり、数２０式、数２１式は、位相変換後のスペクトルを表す。従って、ｋｄ２＜＜１とすると、ｓｉｎ（ｋｄ２ｓｉｎθ）はｋｄ２ｓｉｎθに近似できるので、以下の数２２式の関係となり、数１４式に示すｘＢ（ｔ）に対するＦＦＴの出力ＸＢ（ω）から、ＭＩＣ０で入力される信号に対して、ｓｉｎθで振幅が変化する成分を得ることができる。
【００６１】
【数２２】

【００６２】
次に、数１５式に示すｘＣ（ｔ）に対するＦＦＴの出力ＸＣ（ω）について説明する。ｘＣ（ｔ）の振幅成分であるｃｏｓ（ｋｄ２ｓｉｎ（θ））は、テーラー展開を使って、以下の数２３式のように表される。ここで、λは近似誤差を示す。
【００６３】
【数２３】

【００６４】
これより、数２４式の関係となり、数１５式に示すｘＣ（ｔ）に対するＦＦＴの出力ＸＣ（ω）から、ＭＩＣ０で入力される信号に対して、ｃｏｓ２θで振幅が変化する成分を得ることができる。なお、λは参考文献［１］を用いている。
【００６５】
【数２４】

【００６６】
次に、数１６式に示すｘＤ（ｔ）に対するＦＦＴの出力ＸＤ（ω）について説明する。ｘＤ（ｔ）の振幅成分であるｃｏｓ（ｋｄ３ｓｉｎ（π／４−θ））は、テーラー展開を使って、以下の数２５式のように表される。ここで、γは近似誤差を示す。
【００６７】
【数２５】

【００６８】
これより、数２６式の関係となり、数１６式に示すｘＤ（ｔ）に対するＦＦＴの出力ＸＤ（ω）から、ＭＩＣ０で入力される信号に対して、ｓｉｎ２θで振幅が変化する成分を得ることができる。なお、γは参考文献［１］を用いている。
【００６９】
【数２６】

【００７０】
図９に、フーリエ級数で近似目標とする指向特性ψ（θ）を示す。図９に示す指向特性ψ（θ）とＭＩＣ０の出力を加えたとき、指向性Ｄ（θ）＝１＋ψ（θ）が得られれば、ビーム以外の感度を抑えることができる。ここで、主軸の中心角をθｃ（度）、また、ビームの幅をθｗ（度）とする。このとき、ψ（θ）はフーリエ級数展開により、以下の数２７式のように表される。
【００７１】
【数２７】

【００７２】
実際には、上述の数１３式〜数２６式までの処理では、ｃｏｓθ、ｓｉｎθ、ｃｏｓ２θ、ｓｉｎ２θまでしか求められていないので、θｗ＝６０度がビーム外の感度を抑制するために適した値である。各係数α０、αｉ、βｉは以下の数２８式、数２９式、数３０式により求められる。
【００７３】
【数２８】

【００７４】
【数２９】

【００７５】
【数３０】

【００７６】
θｃ＝６０度及びθｗ＝６０度としたときのフーリエ級数でのψ（θ）の例を図１０に示す。
【００７７】
上述した数２７式において、Ｍ＝２として、数３１式に示すように、上述の中間生成出力を重み付き加算すると、主軸方向のみに指向性を持たせる特性とすることができる。
【００７８】
【数３１】

【００７９】
ただし、数３１式において、各中間生成出力Ｙｃｏｓ（ω）、ＹＲｃｏｓ（ω）、ＹＩｃｏｓ（ω）は、それぞれ以下の数３２式、数３３式、数３４式で表される。また、各中間生成出力Ｙｓｉｎ（ω）、ＹＲｓｉｎ（ω）、ＹＩｓｉｎ（ω）は、それぞれ以下の数３５式、数３６式、数３７式で表される。ここで、φＡ（ω）、φＢ（ω）は、それぞれ、ＸＡ（ω）、ＸＢ（ω）の位相を示す。
【００８０】
【数３２】

【００８１】
【数３３】

【００８２】
【数３４】

【００８３】
【数３５】

【００８４】
【数３６】

【００８５】
【数３７】

【００８６】
また、数３１式において、各中間生成出力Ｙｃｏｓ（２ω）、Ｙｓｉｎ（２ω）は、それぞれ以下の数３８式、数３９式で表される。
【００８７】
【数３８】

【００８８】
【数３９】

【００８９】
ここで、ｄ１＝ｄ２＝ｄ３＝０．００８ｍとしたときのシミュレーション結果を図１１、図１２に示す。図１１は、θｃ＝０度としたときの指向特性のシミュレーション結果、図１２は、θｃ＝１３５度としたときの指向特性のシミュレーション結果である。それぞれ、周波数依存性のない指向性を示していることが分かる。また、これらの指向性は最終的にフーリエ級数の係数αｉ、βｉで決定しているので、予めθｃについて複数のαｉ、βｉの組を用意しておけば、各中間生成信号の重み付け加算を行うだけで、リアルタイムに複数の主軸からの音声を分離して取得することが可能となる。
【００９０】
また、上述の処理においては、基準マイクロホンＭＩＣ０を使用しているが、これらの機能は、ＭＩＣ１〜ＭＩＣ４までを使用することにより、基準マイクロホンＭＩＣ０の代用をすることができる。すなわち、数４０式にＭＩＣ１〜ＭＩＣ４までの出力和を示す。
【００９１】
【数４０】

【００９２】
ここで、ｄ１＝ｄ２＝０．００８ｍとしたとき、上述した数４０式における振幅成分である（ｃｏｓ（ｋｄ１ｃｏｓθ）＋ｃｏｓ（ｋｄ２ｓｉｎθ））／２の値は、図１３に示すＭＩＣ１〜ＭＩＣ４の出力和の入射角度依存特性に示すとおりである。これにより、ＭＩＣ１〜ＭＩＣ４の出力和は、高域では入射角度θによる値の依存性があるものの、信号の角周波数ω（ただし、ｋ＝ω／ｃ）（１０００Ｈｚ、２０００Ｈｚ、３０００Ｈｚ、４０００Ｈｚ、５０００Ｈｚ、６０００Ｈｚ）についてほぼ一定した値をとることが分かる。これらはθ＝２２．５度で平均値をとるので、以下の数４１式で示すような補正を行うことで角周波数ωについてもほぼ依存しない特性を得ることができ、近似を行うことができる。
【００９３】
【数４１】

【００９４】
これにより、基準マイクロホンＭＩＣ０を省略して、ＭＩＣ１〜ＭＩＣ６までの６つのマイクロホンを使用することにより、指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができる。
【００９５】
このようにして得られた数３１式で示すＹ（ω）は、図４においてステップＳ９で、出力の処理が行われる。具体的には、出力Ｙ（ω）は周波数分析されたものであるので演算処理装置２２の中で、そのまま音声の分析結果として扱ったり、またはさらなる音声分析の入力として使用することができ、または音声認識装置２３により音声認識のための音声分析に使用することができる。またＹ（ω）を、逆フーリエ変換することにより、周波数領域の信号から時間領域の波形信号に戻すことにより、収録機器２３により音声収録などに使用することができる。
【００９６】
その後、ステップＳ１０で、ｉ＝０として初期化処理が行われた後に、ステップＳ２へ戻って、ステップＳ２〜ステップＳ６までの処理および判断を繰り返す。
【００９７】
また、図１４にＭＩＣの省略を示す。
以下に、図１４Ａに示すＭＩＣ５、６の省略、および図１４Ｃに示すＭＩＣ３、４の省略について説明する。
【００９８】
以下に示す数４２式、数４３式、数４４式、数４５式から、数４６式が得られる。
【００９９】
【数４２】

【０１００】
【数４３】

【０１０１】
【数４４】

【０１０２】
【数４５】

【０１０３】
【数４６】

【０１０４】
このようにして、数４６式により、数１４式に示すｘＢ（ｔ）に対するＦＦＴの出力ＸＢ（ω）は、ＸＡ（ω）を用いて表すことにより、ｓｉｎθ成分を生成することができる。
【０１０５】
また、以下に示す数４７式、数４８式から、数２６式に示すＸＤ（ω）は数４９式のように、ＸＡ（ω）を用いて表すことにより、倍角成分であるｓｉｎ２θ成分を生成することができ、また、数２４式に示すＸＭＩＣ０（ω）ｃｏｓ２θは数５０式のように、ＸＡ（ω）を用いて表すことにより、倍角成分であるｃｏｓ２θ成分を生成することができる。
【０１０６】
これにより、数４９式により数２６式によるｓｉｎ２θ成分の算出が不要となるため、ｘＭＩＣ５（ｔ），ｘＭＩＣ６（ｔ）のミキシング出力が不要となるため、図１４Ａに示すようにＭＩＣ５、６を省略することができる。
【０１０７】
これにより、ＭＩＣ５、６を省略して、ＭＩＣ０〜ＭＩＣ４までの５つのマイクロホンを使用することにより、簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができる。
【０１０８】
なお、上述した数４１式により、ＭＩＣ０は不要となるため、図１４Ｂに示すＭＩＣ０を省略することができる。
【０１０９】
これにより、ＭＩＣ０、５、６を省略して、ＭＩＣ１〜ＭＩＣ４までの４つのマイクロホンを使用することにより、より簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができる。
【０１１０】
また、数４６式により数２２式によるｓｉｎθ成分の算出が不要となると共に、数５０式により数２４式におけるｃｏｓ２θ成分の算出が不要となるため、ｘＭＩＣ３（ｔ），ｘＭＩＣ４（ｔ）のミキシング出力が不要となるため、図１４Ｃに示すＭＩＣ３、４を省略することができる。
【０１１１】
これにより、ＭＩＣ５、６、３、４を省略して、ＭＩＣ０〜ＭＩＣ２までの３つのマイクロホンを使用することにより、さらに簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができる。
【０１１２】
【数４７】

【０１１３】
【数４８】

【０１１４】
【数４９】

【０１１５】
【数５０】

【０１１６】
なお、図１４Ｃにおいて、ＭＩＣ３、４を省略した際に、ＭＩＣ０を新たに設けたのは、ＭＩＣ０の信号をＭＩＣ１〜ＭＩＣ４の信号から求めていたがＭＩＣ３、４を省略したことから、必要となったためである。
【０１１７】
なお、上述した本実施の形態では、倍角成分を示す２次のフーリエ級数展開について説明したが、これに限らず、３次以上のフーリエ級数展開に適用するようにしても良い。
【０１１８】
つまり、数５１式、数５２式を利用することにより、数５３式のように、ＸＡ（ω）を用いて表すことにより、３倍角成分であるｃｏｓ３θ成分を生成することができ、また、数５４式に示すように、ＸＡ（ω）を用いて表すことにより、３倍角成分であるｓｉｎ３θ成分を生成することができる。
【０１１９】
これにより、３倍角以上の成分を生成することができ、これにより、フーリエ級数を３倍角以上に近似することができるので、さらに高次のフーリエ級数展開を可能とすることができる。
【０１２０】
【数５１】

【０１２１】
【数５２】

【０１２２】
【数５３】

【０１２３】
【数５４】

【０１２４】
【発明の効果】
この発明のマイクロホン装置は、音源からの音波が入力されるマイクロホンを用いてマイクロホンの指向特性を制御するマイクロホン装置において、基準マイクロホンと、基準マイクロホンを中心に等間隔に配置される第１の１対のマイクロホンと、基準マイクロホンを中心に第１の１対のマイクロホンに直交して等間隔に配置される第２の１対のマイクロホンと、基準マイクロホンを中心に第１の１対のマイクロホンおよび第２の１対のマイクロホンに対して４５度傾けて等間隔に配置される第３の１対のマイクロホンと、基準マイクロホン、第１、第２および第３の１対の各マイクロホンの出力をそれぞれディジタル信号に変換するＡ／Ｄ変換部と、Ａ／Ｄ変換部からのディジタル信号に対して信号処理を施す演算処理部とを備え、基準マイクロホン、第１、第２および第３の１対の各マイクロホンは同一平面上に配置され、演算処理部は、第１の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｃｏｓθで振幅が変化する第１の中間生成出力を得る処理と、第２の１対のマイクロホンの出力の差を求め、この差をフーリエ変換することにより、基準マイクロホンの出力と位相を合わせ且つ基準マイクロホンの出力に対してｓｉｎθで振幅が変化する第２の中間生成出力を得る処理と、第２の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｃｏｓ２θで振幅が変化する第３の中間生成出力を得る処理と、第３の１対のマイクロホンの出力の和を求め、この和をフーリエ変換することにより、基準マイクロホンの出力に対してｓｉｎ２θで振幅が変化する第４の中間生成出力を得る処理と、目標とする指向特性を、次数が２次のフーリエ級数の係数α０，α１，β１，α２，β２によって表し、基準マイクロホンの出力，第１の中間生成出力，第２の中間生成出力，第３の中間生成出力，第４の中間生成出力を、それぞれ係数α０，α１，β１，α２，β２を用いて重み付けして加算する処理とを行うようにしたので、指向性の主軸を任意に制御することができ、指向性の精度を向上させることができるという効果を奏する。
【０１２５】
また、この発明のマイクロホン装置は、上述において、演算処理部は、予め複数の主軸の中心角についてそれぞれ係数α１，β１，α２，β２の組を用意しておき、それらの組のうち音声を分離しようとする主軸の中心角に応じた係数を用いて第１乃至第４の中間生成出力を重み付けすることにより、リアルタイムに複数の主軸からの音声を分離して取得することが可能となるので、１つのマイクロホン装置のみを使用するだけで、例えばマイクロホンを中心とする左右の音源を分離してリアルタイムに音声収録または音声認識をすることができるという効果を奏する。
【０１２６】
また、この発明のマイクロホン装置は、上述において、基準マイクロホンを省略して、演算処理部により、第１および第２の１対の各マイクロホンの出力和で基準マイクロホンの出力を近似するので、マイクロホン装置を小型化かつ容易に構成することができると共に、基準マイクロホンを省略して６つのマイクロホンを使用することにより、指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができるという効果を奏する。
【０１２７】
また、この発明のマイクロホン装置は、上述において、第３の１対のマイクロホンを省略して、演算処理部により、第４の中間生成出力を、第１の中間生成出力を用いて表すので、マイクロホン装置を小型化かつ容易に構成することができると共に、第３の１対のマイクロホンを省略して５つのマイクロホンを使用することにより、簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができるという効果を奏する。
【０１２８】
また、この発明のマイクロホン装置は、上述において、第３の１対のマイクロホンを省略して、演算処理部により、第４の中間生成出力を、第１の中間生成出力を用いて表すので、基準マイクロホンおよび第３の１対のマイクロホンを省略して、４つのマイクロホンを使用することにより、より簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができるという効果を奏する。
【０１２９】
また、この発明のマイクロホン装置は、上述において、第２の１対のマイクロホンを省略して、演算処理部により、第２の中間生成出力を、第１の中間生成出力を用いて表すとともに、基準マイクロホンの出力に対してｃｏｓ２θで振幅が変化する成分を、第１の中間生成出力を用いて表すので、第２および第３の１対のマイクロホンを省略して、３つのマイクロホンを使用することにより、さらに簡易に指向特性の主軸を可変に制御して、目的とする音源に指向性を容易に向けることができるという効果を奏する。
【図面の簡単な説明】
【図１】本実施の形態が適用されるマイクカプセルの配置図である。
【図２】各マイクロホンの配置に応じたＭＩＣ０からの距離差を示す図であり、図２Ａは第１の１対のマイクロホンＭＩＣ１，ＭＩＣ２の配置に応じたＭＩＣ０からの距離差、図２Ｂは第２の１対のマイクロホンＭＩＣ３，ＭＩＣ４の配置に応じたＭＩＣ０からの距離差、図２Ｃは第３の１対のマイクロホンＭＩＣ５，ＭＩＣ６の配置に応じたＭＩＣ０からの距離差である。
【図３】マイクロホン装置のハードウエア構成図である。
【図４】演算処理装置における信号処理のフローチャートである。
【図５】ｓｉｎ（ｋｄｃｏｓθ）の入射角度依存特性を示す図である。
【図６】ｓｉｎ（ｋｄｃｏｓθ）／ｓｉｎ（ｋｄ）の入射角度依存特性を示す図である。
【図７】ｓｉｎ（ｋｄｓｉｎθ）の入射角度依存特性を示す図である。
【図８】ｓｉｎ（ｋｄｓｉｎθ）／ｓｉｎ（ｋｄ）の入射角度依存特性を示す図である。
【図９】フーリエ級数で近似目標とする指向特性を示す図である。
【図１０】フーリエ級数での指向特性例を示す図である。
【図１１】θｃ＝０度としたときの指向特性のシミュレーション結果を示す図である。
【図１２】θｃ＝１３５度としたときの指向特性のシミュレーション結果を示す図である。
【図１３】ＭＩＣ１〜ＭＩＣ４の出力和の入射角度依存特性を示す図である。
【図１４】ＭＩＣの省略を示す図であり、図１４ＡはＭＩＣ５、６の省略した５つのＭＩＣ、図１４ＢはＭＩＣ０、５、６を省略した４つのＭＩＣ、図１４ＣはＭＩＣ３、４、５、６を省略した３つのＭＩＣを示す。
【図１５】従来のマイクロホンシステムのブロック図である。
【符号の説明】
１……ＭＩＣ０、２……ＭＩＣ１、３……ＭＩＣ２、４……ＭＩＣ３、５……ＭＩＣ４、６……ＭＩＣ５、７……ＭＩＣ６、８〜１４……アンプ、１５〜２１……Ａ／Ｄ変換器、２２……演算処理装置、２３……収録機器または音声認識装置
[0001]
[Technical field of the invention]
The present invention relates to a microphone device for audio recording and speech recognition whose directivity can easily be redirected in environments, such as a home living room or an office conference room, where the position of the speaker who is the target sound source changes constantly.
[0002]
[Prior art]
Reference [1], cited below, extends the three-microphone system described in reference [2] and, using five omnidirectional microphone capsules, succeeds in building a wideband, narrow-angle directional microphone covering 300 Hz to 5 kHz with a beam width of about 120 degrees. FIG. 15 shows a block diagram of this conventional microphone system. In FIG. 15, the microphones MIC0, MIC2, MIC3, MIC1A, and MIC1B are housed in a 4 cm × 7 cm planar area. Adders 150 and 151 form the differences between the two microphone pairs, MIC2 and MIC3, and MIC1A and MIC1B. Integrators 156 and 152 are then applied to the difference outputs of the above subtracters 150 and 151 to remove the phase component and widen the bandwidth. The desired directivity function is determined first, and the necessary components 152 and 153 are obtained from it using a Fourier series. In addition, a low-pass filter (LPF) 155 with a resonance point applies a further correction at high frequencies.
[0003]
Reference [1] Kono, Nakamura, Yamato, Takashima, "Wideband Narrow-Angle Directional Microphone System", IEICE Technical Report EA99-85, December 1999
Reference [2] Nakamura, Kouno, Yamato, Sakiyama, "Realization of Wide-Band Directivity with Three Microphones", IEICE Trans. Fundamentals, vol. E82-A, No. 4, April 1999
[0004]
[Problems to be solved by the invention]
However, because the conventional microphone system described above is built from hardware such as transistors and operational amplifiers, it has the drawback that the main axis of directivity is fixed; in particular, the main axis of the beam cannot be controlled arbitrarily.
[0005]
In addition, there is a disadvantage that errors in constants such as resistors and capacitors affect the directivity control.
[0006]
The present invention has been made in view of these points. Its object is to provide a microphone device that can control the main axis of directivity arbitrarily, can improve the sharpness of the directivity, and, using only a single microphone device, can separate sound sources, for example those to the left and right of the microphone, and perform audio recording or speech recognition in real time.
[0007]
[Means for Solving the Problems]
The microphone device of the present invention controls the directivity characteristics of microphones that receive sound waves from a sound source. It comprises: a reference microphone; a first pair of microphones arranged at equal distances about the reference microphone; a second pair of microphones arranged at equal distances about the reference microphone, orthogonal to the first pair; a third pair of microphones arranged at equal distances about the reference microphone, inclined at 45 degrees to the first and second pairs; an A/D conversion unit that converts the outputs of the reference microphone and of each microphone of the first, second, and third pairs into digital signals; and an arithmetic processing unit that applies signal processing to the digital signals from the A/D conversion unit. The reference microphone and the microphones of the first, second, and third pairs are arranged on the same plane. The arithmetic processing unit performs the following processes: obtaining the difference between the outputs of the first pair and Fourier transforming it to obtain a first intermediate output whose phase matches that of the reference microphone output and whose amplitude varies as cos θ relative to the reference microphone output; obtaining the difference between the outputs of the second pair and Fourier transforming it to obtain a second intermediate output whose phase matches that of the reference microphone output and whose amplitude varies as sin θ relative to the reference microphone output; obtaining the sum of the outputs of the second pair and Fourier transforming it to obtain a third intermediate output whose amplitude varies as cos 2θ relative to the reference microphone output; obtaining the sum of the outputs of the third pair and Fourier transforming it to obtain a fourth intermediate output whose amplitude varies as sin 2θ relative to the reference microphone output; and expressing the target directivity characteristic by the coefficients α0, α1, β1, α2, β2 of a second-order Fourier series, weighting the reference microphone output and the first, second, third, and fourth intermediate outputs by α0, α1, β1, α2, β2, respectively, and adding them.
[0008]
Accordingly, the present invention operates as follows. The A/D conversion unit converts the outputs of the reference microphone and of each microphone of the first, second, and third pairs into digital signals, and the arithmetic processing unit then applies signal processing to each digital signal.
[0009]
  The signal processing performed in the arithmetic processing unit is as follows.
The difference between the outputs of the first pair of microphones is obtained and Fourier transformed, giving a first intermediate output whose phase matches that of the reference microphone output and whose amplitude varies as cos θ relative to the reference microphone output.
The difference between the outputs of the second pair of microphones is obtained and Fourier transformed, giving a second intermediate output whose phase matches that of the reference microphone output and whose amplitude varies as sin θ relative to the reference microphone output.
The sum of the outputs of the second pair of microphones is obtained and Fourier transformed, giving a third intermediate output whose amplitude varies as cos 2θ relative to the reference microphone output.
The sum of the outputs of the third pair of microphones is obtained and Fourier transformed, giving a fourth intermediate output whose amplitude varies as sin 2θ relative to the reference microphone output.
The target directivity characteristic is expressed by the coefficients α0, α1, β1, α2, β2 of a second-order Fourier series, and the reference microphone output and the first, second, third, and fourth intermediate outputs are weighted by α0, α1, β1, α2, β2, respectively, and added.
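The weighted addition described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the intermediate outputs are taken to be ideal (exactly the reference output scaled by cos θ, sin θ, cos 2θ, sin 2θ), and the names `steer_output`, `ideal_intermediates`, and `example_coeffs` are hypothetical. The example coefficients realize the simple pattern (1 + cos(θ − θc) + cos 2(θ − θc)) / 3 and are not the coefficients the patent derives from its own Fourier-series formulas.

```python
import math

def steer_output(x0, y_cos, y_sin, y_cos2, y_sin2, coeffs):
    """Weight the reference output and the four intermediate outputs, then add."""
    a0, a1, b1, a2, b2 = coeffs
    return a0 * x0 + a1 * y_cos + b1 * y_sin + a2 * y_cos2 + b2 * y_sin2

def ideal_intermediates(x0, theta):
    """Idealized intermediate outputs for a source at angle theta (radians)."""
    return (x0 * math.cos(theta), x0 * math.sin(theta),
            x0 * math.cos(2 * theta), x0 * math.sin(2 * theta))

def example_coeffs(theta_c):
    """Hypothetical second-order coefficients that point the beam at theta_c."""
    s = 1.0 / 3.0
    return (s,
            s * math.cos(theta_c), s * math.sin(theta_c),
            s * math.cos(2 * theta_c), s * math.sin(2 * theta_c))

def gain(theta, theta_c):
    """Output level for a unit-amplitude source at theta with the beam at theta_c."""
    x0 = 1.0
    return steer_output(x0, *ideal_intermediates(x0, theta), example_coeffs(theta_c))
```

Changing only the five coefficients moves the main axis; the intermediate outputs themselves do not need to be recomputed.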
[0013]
Through this processing in the arithmetic processing unit, the main axis of directivity can easily be controlled arbitrarily, and the sharpness of the directivity is further improved.
[0014]
DETAILED DESCRIPTION OF THE INVENTION
Hereinafter, embodiments of the present invention will be described.
The microphone device of this embodiment combines three to seven microphones and processes their signals digitally. This makes the main axis easy to vary, achieves the wide band and narrow directivity suited to speech recognition, and allows sounds from several main-axis directions to be captured separately, so the device is well suited to speech recognition systems and video-conference recording systems.
[0015]
FIG. 1 is a layout diagram of microphone capsules to which the present exemplary embodiment is applied.
In FIG. 1, the following are arranged: the reference microphone MIC0; a first pair of microphones MIC1 and MIC2 placed about MIC0; a second pair of microphones MIC3 and MIC4 placed about MIC0 orthogonally to the first pair; and a third pair of microphones MIC5 and MIC6 placed about MIC0 at 45 degrees to the first and second pairs. The microphones MIC0 to MIC6 lie in a plane. The angle of incidence of the sound wave relative to the reference axis through MIC1, MIC0, and MIC2 is denoted s degrees.
[0016]
Here, with respect to the position P of the reference microphone MIC0, the first pair of microphones MIC1 and MIC2 are arranged at equal intervals d1, and the second pair of microphones MIC3 and MIC4 are arranged at equal intervals d2. The third pair of microphones MIC5 and MIC6 are arranged at equal intervals d3.
[0017]
FIG. 2 is a diagram illustrating a difference in distance from the sound source according to the arrangement of each microphone.
FIG. 2A shows the arrival-time difference from the sound source for the first pair of microphones MIC1 and MIC2. In FIG. 2A, sound from a source incident at angle s degrees reaches microphone MIC1 earlier than the position P of the reference microphone MIC0 by the time corresponding to the distance d1 cos(s), and reaches microphone MIC2 later by the time corresponding to the same distance d1 cos(s).
[0018]
FIG. 2B shows the distance difference from the sound source for the second pair of microphones MIC3 and MIC4. In FIG. 2B, sound from a source incident at angle s degrees reaches microphone MIC3 earlier than the position P of the reference microphone MIC0 by the time corresponding to the distance d2 sin(s), and reaches microphone MIC4 later by the time corresponding to the same distance d2 sin(s).
[0019]
FIG. 2C shows the distance difference from the sound source for the third pair of microphones MIC5 and MIC6. In FIG. 2C, sound from a source incident at angle s degrees reaches microphone MIC6 earlier than the position P of the reference microphone MIC0 by the time corresponding to the distance d3 sin(45 − s), and reaches microphone MIC5 later by the time corresponding to the same distance d3 sin(45 − s).
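The path differences of FIGS. 2A to 2C translate directly into arrival-time leads and lags. The sketch below follows the signs as stated in the text; the function name `arrival_leads` is hypothetical, and the speed of sound of 340 m/s is an assumed value, since the text only names the symbol c.

```python
import math

SPEED_OF_SOUND = 340.0  # m/s; assumed value, the text only names c

def arrival_leads(s_deg, d1, d2, d3, c=SPEED_OF_SOUND):
    """Time in seconds by which each microphone leads MIC0 (negative = lags)."""
    s = math.radians(s_deg)
    lead_pair1 = d1 * math.cos(s) / c                     # MIC1 leads, MIC2 lags
    lead_pair2 = d2 * math.sin(s) / c                     # MIC3 leads, MIC4 lags
    lead_pair3 = d3 * math.sin(math.radians(45) - s) / c  # MIC6 leads, MIC5 lags
    return {
        "MIC1": lead_pair1, "MIC2": -lead_pair1,
        "MIC3": lead_pair2, "MIC4": -lead_pair2,
        "MIC6": lead_pair3, "MIC5": -lead_pair3,
    }
```

With d1 = 0.008 m and s = 0 degrees, for instance, MIC1 leads MIC0 by 0.008 / 340 seconds, roughly 24 microseconds, while the second pair shows no time difference at all.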
[0020]
FIG. 3 is a hardware configuration diagram of a microphone device using the microphones described above. In FIG. 3, the microphone device comprises: MIC0 (1), MIC1 (2), MIC2 (3), MIC3 (4), MIC4 (5), MIC5 (6), and MIC6 (7); amplifiers 8 through 14, which amplify the signals from MIC0 (1) through MIC6 (7), respectively, to a level suitable for signal processing; A/D converters 15 through 21, which convert the signals amplified by amplifiers 8 through 14, respectively, into digital signals; an arithmetic processing unit 22, which applies signal processing to the digital signals from A/D converters 15 through 21; and a recording device or speech recognition device 23, which records, or performs speech recognition on, the result of the signal processing by the arithmetic processing unit 22.
[0021]
The signal processing performed in the arithmetic processing unit 22 on the signal input to each microphone is shown in the flowchart of FIG. In FIG. 4, in step S1, initialization of i = 0 has already been performed.
[0022]
Here, the sound from a source located at distance R from MIC0 can be summarized by Equations 1 through 8 below. In these, Xs(t) in Equation 1 is the source signal; x_MIC0(t) in Equation 2 is the signal observed at time t at the position of MIC0; x_MIC1(t) in Equation 3 is the signal observed at time t at the position of MIC1; x_MIC2(t) in Equation 4 is the signal observed at time t at the position of MIC2; x_MIC3(t) in Equation 5 is the signal observed at time t at the position of MIC3; x_MIC4(t) in Equation 6 is the signal observed at time t at the position of MIC4; x_MIC5(t) in Equation 7 is the signal observed at time t at the position of MIC5; and x_MIC6(t) in Equation 8 is the signal observed at time t at the position of MIC6. Here, k = ω/c, where ω is the angular frequency of the signal and c is the speed of sound, and θ is the angle of incidence of the sound source relative to the reference axis of the microphones.
[0023]
[Equation 1]

[0024]
[Equation 2]

[0025]
[Equation 3]

[0026]
[Equation 4]

[0027]
[Equation 5]

[0028]
[Equation 6]

[0029]
[Equation 7]

[0030]
[Equation 8]

[0031]
These acoustic signals are converted into electrical signals by MIC0 (1) through MIC6 (7), amplified by amplifiers 8 through 14, and then converted into digital signals by A/D converters 15 through 21. The sensitivity of each microphone and the gain of each amplifier are assumed to be constant.
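Equations 1 through 8 are not reproduced in this text, so the following is only a sketch of one signal model consistent with the surrounding description: a unit-amplitude plane wave written as a complex phasor, with each microphone's phase shifted by k times its path difference. The unit amplitude, the function name `mic_signals`, and the default value c = 340 m/s are assumptions.

```python
import cmath
import math

def mic_signals(freq_hz, theta, t=0.0, R=1.0, d=(0.008, 0.008, 0.008), c=340.0):
    """Complex phasors at MIC0..MIC6 for a plane wave arriving from angle theta (radians)."""
    omega = 2 * math.pi * freq_hz
    k = omega / c                                 # wavenumber, k = omega / c
    d1, d2, d3 = d
    x0 = cmath.exp(1j * (omega * t - k * R))      # assumed unit amplitude at MIC0
    p1 = k * d1 * math.cos(theta)                 # phase lead of MIC1
    p2 = k * d2 * math.sin(theta)                 # phase lead of MIC3
    p3 = k * d3 * math.sin(math.pi / 4 - theta)   # phase lead of MIC6
    return {
        "MIC0": x0,
        "MIC1": x0 * cmath.exp(1j * p1), "MIC2": x0 * cmath.exp(-1j * p1),
        "MIC3": x0 * cmath.exp(1j * p2), "MIC4": x0 * cmath.exp(-1j * p2),
        "MIC5": x0 * cmath.exp(-1j * p3), "MIC6": x0 * cmath.exp(1j * p3),
    }
```

Under the constant-sensitivity assumption stated above, every microphone sees the same amplitude and only the phase differs from one microphone to the next.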
[0032]
These digital signals are processed in the arithmetic processing unit 22 as follows. In FIG. 4, sampling is performed in step S2; specifically, the digital signals are sampled in each frame period. In step S3, the microphone outputs are mixed. Specifically, the arithmetic processing unit 22 applies the mixing shown in Equations 9, 10, 11, and 12 below to the digital signals obtained from the microphones, converting them into x_A(t), x_B(t), x_C(t), and x_D(t).
[0033]
[Equation 9]

[0034]
[Equation 10]

[0035]
[Equation 11]

[0036]
[Equation 12]

[0037]
Relative to the signal at MIC0, the signals x_A(t), x_B(t), x_C(t), and x_D(t) of Equations 9, 10, 11, and 12 above take the forms shown in Equations 13, 14, 15, and 16 below, respectively.
[0038]
[Equation 13]

[0039]
[Equation 14]

[0040]
[Equation 15]

[0041]
[Equation 16]

[0042]
That is, relative to the signal x_MIC0(t) of Equation 2 observed at MIC0, the signals x_A(t), x_B(t), x_C(t), and x_D(t) of Equations 13, 14, 15, and 16 carry the additional characteristics j sin(kd1 cos θ), j sin(kd2 sin θ), cos(kd2 sin θ), and cos(kd3 sin(π/4 − θ)), respectively.
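These four characteristics can be checked numerically. Equations 9 through 12 do not appear in this text, so the mixing below (half-difference for the first and second pairs, half-sum for the second and third pairs) is an assumption chosen to reproduce the factors just listed; `mix` and `plane_wave_pairs` are hypothetical names.

```python
import cmath
import math

def plane_wave_pairs(k, theta, d1, d2, d3, x0=1 + 0j):
    """Phasors at MIC1..MIC6 for a plane wave; MIC1, MIC3 and MIC6 lead MIC0."""
    p1 = k * d1 * math.cos(theta)
    p2 = k * d2 * math.sin(theta)
    p3 = k * d3 * math.sin(math.pi / 4 - theta)
    return (x0 * cmath.exp(1j * p1), x0 * cmath.exp(-1j * p1),   # MIC1, MIC2
            x0 * cmath.exp(1j * p2), x0 * cmath.exp(-1j * p2),   # MIC3, MIC4
            x0 * cmath.exp(-1j * p3), x0 * cmath.exp(1j * p3))   # MIC5, MIC6

def mix(x1, x2, x3, x4, x5, x6):
    """Assumed mixing: differences give the j*sin terms, sums give the cos terms."""
    x_a = (x1 - x2) / 2   # -> j sin(k d1 cos(theta)) * x0
    x_b = (x3 - x4) / 2   # -> j sin(k d2 sin(theta)) * x0
    x_c = (x3 + x4) / 2   # -> cos(k d2 sin(theta)) * x0
    x_d = (x5 + x6) / 2   # -> cos(k d3 sin(pi/4 - theta)) * x0
    return x_a, x_b, x_c, x_d
```

The factor j on the two difference signals is the 90-degree phase lead that the later phase-conversion step removes.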
[0043]
In other words, the characteristics of the signals x_A(t), x_B(t), x_C(t), and x_D(t) of Equations 13, 14, 15, and 16 change with the angular frequency ω of the input signal (where k = ω/c) and with the angle of incidence θ. Moreover, x_A(t) of Equation 13 and x_B(t) of Equation 14, which contain the imaginary component j, lead the signal x_MIC0(t) of Equation 2 observed at MIC0 by 90 degrees in phase.
[0044]
In step S4, each mixed signal is stored in a buffer. Specifically, the signals x_A(t), x_B(t), x_C(t), and x_D(t) of Equations 13, 14, 15, and 16 are each stored in a frame buffer with N entries, matching the number of samples N used in frame processing.
[0045]
In step S5, i indicating the number of processes is incremented. In step S6, it is determined whether i = N. If i = N is not satisfied in step S6, the process returns to step S2, and the processes and determinations from step S2 to step S6 are repeated.
[0046]
When i = N in step S6, preprocessing is performed in step S7. Specifically, once the frame buffers of length N have been completely filled with the signals xA(t), xB(t), xC(t), and xD(t) of Equations 13, 14, 15, and 16, windowing with a window such as a Hamming window or a Hanning window is performed as preprocessing for frame processing, in order to reduce the effect of cutting continuous audio into frames.
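Steps S2 to S7 can be sketched as follows. This is an illustrative sketch only: it assumes non-overlapping frames (the counter i is reset to 0 after each frame, as in step S10) and a Hamming window, and the function names are hypothetical.

```python
import math


def hamming_window(n):
    """Hamming window of length n (n >= 2)."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def windowed_frames(samples, n):
    """Collect samples into frames of n values (steps S2 to S6) and apply
    the window to each complete frame (step S7)."""
    window = hamming_window(n)
    buffer = []
    for sample in samples:        # one mixed sample per sampling period
        buffer.append(sample)
        if len(buffer) == n:      # i = N: the frame buffer is full
            yield [w * x for w, x in zip(window, buffer)]
            buffer = []           # corresponds to resetting i = 0
```

One such buffer would be kept for each of xA(t), xB(t), xC(t), and xD(t). Samples left over at the end that do not fill a frame are simply not emitted in this sketch.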
[0047]
In step S8, frame processing is performed. Specifically, each process of phase conversion and amplitude characteristic correction is performed using fast Fourier transform (FFT).
[0048]
First, the FFT output XA(ω) for xA(t) of Equation 13 will be described. FIG. 5 shows the incident angle dependence of sin(kd1 cos θ) (here, d1 = 0.008 m), which is the amplitude component of xA(t). FIG. 5 shows that the incident angle dependence of sin(kd1 cos θ) varies with the angular frequency ω of the signal (where k = ω/c), plotted for 1000 Hz, 2000 Hz, 3000 Hz, 4000 Hz, 5000 Hz, and 6000 Hz.
[0049]
Therefore, XA(ω)/sin(kd1) is considered. FIG. 6 shows the incident angle dependence of sin(kd1 cos θ)/sin(kd1). FIG. 6 shows that the variation of sin(kd1 cos θ)/sin(kd1) with the angular frequency ω of the signal (where k = ω/c; 1000 Hz, 2000 Hz, 3000 Hz, 4000 Hz, 5000 Hz, 6000 Hz) has almost disappeared.
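The effect of the normalization by sin(kd1) can be reproduced with a few lines of arithmetic. The sketch below assumes a speed of sound of 340 m/s (not stated in this text) and uses hypothetical function names. It compares the raw amplitude sin(kd1 cos θ) with the normalized amplitude and measures how far the latter is from cos θ.

```python
import math

C_SOUND = 340.0  # assumed speed of sound in m/s
D1 = 0.008       # d1 in metres, as in the text


def raw_amplitude(freq_hz, theta):
    """sin(k d1 cos(theta)), the amplitude component of xA(t)."""
    k = 2.0 * math.pi * freq_hz / C_SOUND
    return math.sin(k * D1 * math.cos(theta))


def normalized_amplitude(freq_hz, theta):
    """sin(k d1 cos(theta)) / sin(k d1)."""
    k = 2.0 * math.pi * freq_hz / C_SOUND
    return math.sin(k * D1 * math.cos(theta)) / math.sin(k * D1)


def worst_deviation_from_cos(freq_hz):
    """Largest |normalized amplitude - cos(theta)| over a 1 degree grid."""
    return max(
        abs(normalized_amplitude(freq_hz, math.radians(a)) - math.cos(math.radians(a)))
        for a in range(360)
    )


for f in (1000, 2000, 3000, 4000, 5000, 6000):
    print(f, raw_amplitude(f, 0.0), worst_deviation_from_cos(f))
```

The raw peak grows with frequency, while the normalized curve stays close to cos θ across the band, with the deviation increasing gradually toward 6000 Hz.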
[0050]
Further, as described above, xA(t) of Equation 13, which contains the imaginary unit j, is advanced in phase by 90 degrees relative to the signal xMIC0(t) of Equation 2 observed at MIC0. Using X′RA(ω) and X′IA(ω) as in Equations 17 and 18 therefore eliminates the phase advance. Here, φA(ω) in Equations 17 and 18 represents the phase of XA(ω).
[0051]
[Expression 17]

[0052]
[Expression 18]

[0053]
Here, X′A(ω) = X′RA(ω) + jX′IA(ω), and Equations 17 and 18 represent the spectrum after phase conversion. Further, if kd1 << 1, sin(kd1 cos θ) can be approximated by kd1 cos θ, so Equation 19 below holds, and from the FFT output XA(ω) for xA(t) of Equation 13 a component whose amplitude varies as cos θ relative to the signal input at MIC0 can be obtained.
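The phase conversion and amplitude correction can be sketched as below. Equations 17 to 19 are not reproduced in this text, so the sketch assumes that the phase conversion keeps the magnitude of XA(ω) and rotates its phase φA(ω) by -90 degrees, and that the amplitude correction divides by sin(kd1). Both are assumptions made for illustration, and the function names are hypothetical.

```python
import cmath
import math


def phase_converted(x, shift=-math.pi / 2.0):
    """Keep |x| and rotate the phase by `shift` radians:
    real part |x| cos(phi + shift), imaginary part |x| sin(phi + shift)."""
    magnitude = abs(x)
    phi = cmath.phase(x)
    return complex(magnitude * math.cos(phi + shift),
                   magnitude * math.sin(phi + shift))


def cos_component(x_a, k, d1=0.008):
    """Estimate X_MIC0 * cos(theta) from the difference spectrum x_a."""
    return phase_converted(x_a) / math.sin(k * d1)
```

With x_a equal to j sin(kd1 cos θ) times the MIC0 spectrum, the rotation cancels the factor j, and for kd1 << 1 the result approaches the MIC0 spectrum multiplied by cos θ, including the sign change for sources behind the array.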
[0054]
[Equation 19]

[0055]
The FFT output XB(ω) for xB(t) of Equation 14 is treated in the same way. FIG. 7 shows the incident angle dependence of sin(kd2 sin θ) (here, d2 = 0.008 m), which is the amplitude component of xB(t). FIG. 7 shows that the incident angle dependence of sin(kd2 sin θ) varies with the angular frequency ω of the signal (where k = ω/c), plotted for 1000 Hz, 2000 Hz, 3000 Hz, 4000 Hz, 5000 Hz, and 6000 Hz.
[0056]
Therefore, XB(ω)/sin(kd2) is considered. FIG. 8 shows the incident angle dependence of sin(kd2 sin θ)/sin(kd2). FIG. 8 shows that the variation of sin(kd2 sin θ)/sin(kd2) with the angular frequency ω of the signal (where k = ω/c; 1000 Hz, 2000 Hz, 3000 Hz, 4000 Hz, 5000 Hz, 6000 Hz) has almost disappeared.
[0057]
Further, as described above, xB(t) of Equation 14, which contains the imaginary unit j, is advanced in phase by 90 degrees relative to the signal xMIC0(t) of Equation 2 observed at MIC0. Using X′RB(ω) and X′IB(ω) as in Equations 20 and 21 therefore eliminates the phase advance. Here, φB(ω) in Equations 20 and 21 represents the phase of XB(ω).
[0058]
[Expression 20]

[0059]
[Expression 21]

[0060]
Here, X′B(ω) = X′RB(ω) + jX′IB(ω), and Equations 20 and 21 represent the spectrum after phase conversion. Accordingly, if kd2 << 1, sin(kd2 sin θ) can be approximated by kd2 sin θ, so Equation 22 below holds, and from the FFT output XB(ω) for xB(t) of Equation 14 a component whose amplitude varies as sin θ relative to the signal input at MIC0 can be obtained.
[0061]
[Expression 22]

[0062]
Next, the FFT output XC(ω) for xC(t) of Equation 15 will be described. The amplitude component of xC(t), cos(kd2 sin θ), is expressed by Equation 23 below using a Taylor expansion. Here, λ represents the approximation error.
[0063]
[Expression 23]

[0064]
From this, the relationship of Equation 24 is obtained, and from the FFT output XC(ω) for xC(t) of Equation 15 a component whose amplitude varies as cos 2θ relative to the signal input at MIC0 can be obtained. The value of λ is taken from reference [1].
[0065]
[Expression 24]

[0066]
Next, the FFT output XD(ω) for xD(t) of Equation 16 will be described. The amplitude component of xD(t), cos(kd3 sin(π/4 - θ)), is expressed by Equation 25 below using a Taylor expansion. Here, γ represents the approximation error.
[0067]
[Expression 25]

[0068]
From this, the relationship of Equation 26 is obtained, and from the FFT output XD(ω) for xD(t) of Equation 16 a component whose amplitude varies as sin 2θ relative to the signal input at MIC0 can be obtained. The value of γ is taken from reference [1].
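How the double-angle components fall out of the two sum signals can be illustrated with a second-order Taylor expansion. Equations 23 to 26 and the error terms λ and γ from reference [1] are not reproduced in this text, so the sketch below drops the error terms and uses only cos(x) ≈ 1 - x²/2 together with sin²θ = (1 - cos 2θ)/2 and sin²(π/4 - θ) = (1 - sin 2θ)/2. The function names are hypothetical.

```python
import math


def cos2_estimate(amp_c, kd2):
    """Estimate cos(2*theta) from amp_c = cos(k d2 sin(theta)).
    Second-order Taylor: amp_c ~ 1 - (k d2)^2 / 4 * (1 - cos(2*theta))."""
    return 1.0 - 4.0 * (1.0 - amp_c) / kd2 ** 2


def sin2_estimate(amp_d, kd3):
    """Estimate sin(2*theta) from amp_d = cos(k d3 sin(pi/4 - theta)).
    Second-order Taylor: amp_d ~ 1 - (k d3)^2 / 4 * (1 - sin(2*theta))."""
    return 1.0 - 4.0 * (1.0 - amp_d) / kd3 ** 2


# Example at 1000 Hz with d2 = d3 = 0.008 m and an assumed c = 340 m/s.
kd = 2.0 * math.pi * 1000.0 / 340.0 * 0.008
theta = math.radians(40.0)
print(cos2_estimate(math.cos(kd * math.sin(theta)), kd), math.cos(2.0 * theta))
print(sin2_estimate(math.cos(kd * math.sin(math.pi / 4 - theta)), kd), math.sin(2.0 * theta))
```

The residual of this plain expansion grows with (kd)², which is consistent with the text introducing explicit approximation error terms for the higher frequencies.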
[0069]
[Equation 26]

[0070]
FIG. 9 shows the directivity characteristic ψ(θ) that is the target of approximation by a Fourier series. If the directivity characteristic ψ(θ) of FIG. 9 is added to the output of MIC0 to give the directivity D(θ) = 1 + ψ(θ), sensitivity outside the beam can be suppressed. Here, the center angle of the main axis is θc (degrees) and the beam width is θw (degrees). ψ(θ) is then expressed by Equation 27 below through Fourier series expansion.
[0071]
[Expression 27]

[0072]
In practice, the processing of Equations 13 to 26 yields only the cos θ, sin θ, cos 2θ, and sin 2θ components, so θw = 60 degrees is a suitable value for suppressing sensitivity outside the beam. The coefficients α0, αi, and βi are obtained from Equations 28, 29, and 30 below.
[0073]
[Expression 28]

[0074]
[Expression 29]

[0075]
[Expression 30]

[0076]
FIG. 10 shows an example of ψ (θ) in the Fourier series when θc = 60 degrees and θw = 60 degrees.
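A second-order Fourier approximation of a target characteristic can be computed numerically as sketched below. FIG. 9 and Equations 27 to 30 are not reproduced in this text, so the sketch assumes that ψ(θ) is 0 inside the beam and -1 outside it, and that the coefficients use the usual normalization (1/2π for α0, 1/π for αi and βi). Both are assumptions, and the function names are hypothetical.

```python
import math


def target_psi(theta, theta_c, theta_w):
    """Assumed target: 0 inside the beam of width theta_w centred on theta_c,
    -1 outside it (angles in radians)."""
    diff = (theta - theta_c + math.pi) % (2.0 * math.pi) - math.pi
    return 0.0 if abs(diff) <= theta_w / 2.0 else -1.0


def fourier_coefficients(theta_c, theta_w, order=2, steps=3600):
    """Midpoint-rule estimates of alpha0, [alpha1..], [beta1..]."""
    h = 2.0 * math.pi / steps
    thetas = [(i + 0.5) * h for i in range(steps)]
    psi = [target_psi(t, theta_c, theta_w) for t in thetas]
    alpha0 = sum(psi) * h / (2.0 * math.pi)
    alphas = [sum(p * math.cos(n * t) for p, t in zip(psi, thetas)) * h / math.pi
              for n in range(1, order + 1)]
    betas = [sum(p * math.sin(n * t) for p, t in zip(psi, thetas)) * h / math.pi
             for n in range(1, order + 1)]
    return alpha0, alphas, betas


def directivity(theta, alpha0, alphas, betas):
    """D(theta) = 1 + truncated Fourier series of psi(theta)."""
    value = 1.0 + alpha0
    for n, (a, b) in enumerate(zip(alphas, betas), start=1):
        value += a * math.cos(n * theta) + b * math.sin(n * theta)
    return value
```

For θc = 60 degrees and θw = 60 degrees, calling fourier_coefficients(math.radians(60), math.radians(60)) gives the five coefficients, and directivity() can then be evaluated over θ to plot the truncated series under these assumptions.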
[0077]
Setting M = 2 in Equation 27 and weighting and adding the intermediate generation outputs described above, as shown in Equation 31, gives a characteristic that has directivity only in the main axis direction.
[0078]
[Expression 31]

[0079]
In Equation 31, the intermediate generation outputs Ycos(ω), YRcos(ω), and YIcos(ω) are given by Equations 32, 33, and 34 below, respectively, and the intermediate generation outputs Ysin(ω), YRsin(ω), and YIsin(ω) are given by Equations 35, 36, and 37 below, respectively. Here, φA(ω) and φB(ω) denote the phases of XA(ω) and XB(ω), respectively.
[0080]
[Expression 32]

[0081]
[Expression 33]

[0082]
[Expression 34]

[0083]
[Expression 35]

[0084]
[Expression 36]

[0085]
[Expression 37]

[0086]
Further, in Equation 31, the intermediate generation outputs Ycos2(ω) and Ysin2(ω) are given by Equations 38 and 39 below, respectively.
[0087]
[Formula 38]

[0088]
[Expression 39]

[0089]
Simulation results for d1 = d2 = d3 = 0.008 m are shown in FIGS. 11 and 12. FIG. 11 shows the simulated directivity for θc = 0 degrees, and FIG. 12 shows the simulated directivity for θc = 135 degrees. It can be seen that each shows directivity without frequency dependence. Since these directivities are ultimately determined by the Fourier series coefficients αi and βi, preparing multiple sets of αi and βi in advance for different values of θc and performing the weighted addition of the intermediate generation signals with each set makes it possible to separate and acquire sounds from a plurality of main axes in real time.
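The reuse of one set of intermediate generation outputs for several main axes can be sketched as follows. Equations 28 to 31 are not reproduced in this text, so the weights here are the closed-form second-order Fourier coefficients of a unit rectangular beam (1 inside the beam, 0 outside), chosen as a hypothetical stand-in for the patent's α0, α1, β1, α2, and β2. The intermediate outputs are idealized, and all names are hypothetical.

```python
import math


def beam_weights(theta_c, theta_w=math.radians(60.0)):
    """Hypothetical weights (w0, a1, b1, a2, b2) for a beam centred on theta_c."""
    weights = [theta_w / (2.0 * math.pi)]
    for n in (1, 2):
        gain = 2.0 * math.sin(n * theta_w / 2.0) / (n * math.pi)
        weights.append(gain * math.cos(n * theta_c))
        weights.append(gain * math.sin(n * theta_c))
    return tuple(weights)


def ideal_intermediate(x0, theta):
    """Idealized outputs (X0, X0 cos, X0 sin, X0 cos2, X0 sin2) for one bin."""
    return (x0,
            x0 * math.cos(theta), x0 * math.sin(theta),
            x0 * math.cos(2.0 * theta), x0 * math.sin(2.0 * theta))


def beam_output(intermediate, weights):
    """Weighted addition of the five intermediate outputs."""
    return sum(w * y for w, y in zip(weights, intermediate))


# The intermediate outputs are computed once and reused for every main axis.
axes = {name: beam_weights(math.radians(deg)) for name, deg in (("front", 0.0), ("rear-left", 135.0))}
bin_outputs = ideal_intermediate(1.0 + 0.0j, math.radians(0.0))  # source at 0 degrees
for name, weights in axes.items():
    print(name, abs(beam_output(bin_outputs, weights)))
```

Because only the final weighted addition depends on θc, adding another main axis costs five multiplications per frequency bin in this sketch, which is what makes simultaneous beams practical in real time.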
[0090]
The processing described above uses the reference microphone MIC0, but its function can be substituted by using MIC1 to MIC4. The output sum of MIC1 to MIC4 is shown in Equation 40.
[0091]
[Formula 40]

[0092]
Here, when d1 = d2 = 0.008 m, the value of (cos(kd1 cos θ) + cos(kd2 sin θ))/2, which is the amplitude component in Equation 40, is as shown in FIG. 13, the incident angle dependence of the output sum of MIC1 to MIC4. FIG. 13 shows that, although the output sum of MIC1 to MIC4 depends on the incident angle θ at high frequencies, its value is almost constant at each angular frequency ω of the signal (where k = ω/c; 1000 Hz, 2000 Hz, 3000 Hz, 4000 Hz, 5000 Hz, 6000 Hz). Since the output sum takes its average value at θ = 22.5 degrees, a characteristic that is substantially independent of the angular frequency ω can be obtained by the correction shown in Equation 41 below, and the approximation is possible.
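The behavior of the output sum can be checked numerically. Equation 41 is not reproduced in this text, so the sketch below assumes that the correction divides the amplitude component by its value at θ = 22.5 degrees for the same frequency. The speed of sound (340 m/s) and the function names are likewise assumptions.

```python
import math

C_SOUND = 340.0  # assumed speed of sound in m/s
D = 0.008        # d1 = d2 in metres, as in the text


def sum_amplitude(freq_hz, theta):
    """(cos(k d1 cos(theta)) + cos(k d2 sin(theta))) / 2 with d1 = d2 = D."""
    k = 2.0 * math.pi * freq_hz / C_SOUND
    return (math.cos(k * D * math.cos(theta)) + math.cos(k * D * math.sin(theta))) / 2.0


def corrected_amplitude(freq_hz, theta):
    """Amplitude divided by its value at 22.5 degrees (assumed correction)."""
    return sum_amplitude(freq_hz, theta) / sum_amplitude(freq_hz, math.radians(22.5))


for f in (1000, 2000, 3000, 4000, 5000, 6000):
    worst = max(abs(corrected_amplitude(f, math.radians(a)) - 1.0) for a in range(0, 360, 5))
    print(f, sum_amplitude(f, math.radians(22.5)), worst)
```

The uncorrected amplitude falls as the frequency rises, while the corrected amplitude stays close to 1 at every angle, so the corrected sum behaves approximately like an omnidirectional reference.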
[0093]
[Expression 41]

[0094]
Thus, by omitting the reference microphone MIC0 and using the six microphones MIC1 to MIC6, the main axis of the directivity can be variably controlled and the directivity can easily be directed toward the target sound source.
[0095]
Y(ω) of Equation 31 obtained in this way is output in step S9 of FIG. 4. Specifically, since the output Y(ω) has already undergone frequency analysis, it can be used as is as a speech analysis result in the arithmetic processing unit 22, used as input for further speech analysis, or used by the voice recognition device 23 for speech analysis for voice recognition. Y(ω) can also be used for audio recording by the recording device 23 after an inverse Fourier transform returns the frequency-domain signal to a time-domain waveform signal.
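Returning Y(ω) to a time-domain waveform can be sketched with a direct inverse discrete Fourier transform. The sketch uses a naive O(N²) transform in place of an FFT for brevity, so it illustrates the operation rather than an efficient implementation. The function names are hypothetical.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n))
            for f in range(n)]


def idft(spectrum):
    """Naive inverse discrete Fourier transform; returns complex samples."""
    n = len(spectrum)
    return [sum(spectrum[f] * cmath.exp(2j * math.pi * f * t / n) for f in range(n)) / n
            for t in range(n)]
```

For a spectrum that came from a real frame and was weighted with real coefficients, the imaginary parts of the recovered samples are at rounding-error level and can be discarded before recording.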
[0096]
Then, after initialization processing is performed with i = 0 in step S10, the process returns to step S2 to repeat the processing and determination from step S2 to step S6.
[0097]
FIG. 14 shows the omission of MICs. Hereinafter, the omission of MICs 5 and 6 shown in FIG. 14A and the omission of MICs 3 and 4 shown in FIG. 14C will be described.
[0098]
Equation 46 is obtained from Equation 42, Equation 43, Equation 44, Equation 45 shown below.
[0099]
[Expression 42]

[0100]
[Equation 43]

[0101]
[Expression 44]

[0102]
[Equation 45]

[0103]
[Equation 46]

[0104]
In this way, by expressing the FFT output XB(ω) for xB(t) of Equation 14 using Equation 46, the sin θ component can be generated.
[0105]
In addition, from Equations 47 and 48 below, XD(ω) of Equation 26 can be expressed using XA(ω) as shown in Equation 49, so that the sin 2θ component, which is a double-angle component, can be generated. Likewise, XMIC0(ω) cos 2θ of Equation 24 can be expressed using XA(ω) as shown in Equation 50, so that the cos 2θ component, which is a double-angle component, can be generated.
[0106]
Because Equation 49 makes the calculation of the sin 2θ component by Equation 26 unnecessary, the mixed output of xMIC5(t) and xMIC6(t) is no longer needed, and MICs 5 and 6 can therefore be omitted as shown in FIG. 14A.
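As an illustration of how a double-angle component can be derived from a single-angle one, the sketch below applies the standard identity cos 2θ = 2cos²θ - 1 to an idealized cos θ component. Equations 47 to 50 are not reproduced in this text, so this may differ from the expressions actually used. The function name is hypothetical and a nonzero MIC0 spectrum value is assumed.

```python
import math


def cos2_from_cos(y_cos, x0):
    """Return x0 * cos(2*theta) given y_cos = x0 * cos(theta) and a nonzero x0,
    using the identity cos(2*theta) = 2*cos(theta)**2 - 1."""
    ratio = y_cos / x0
    return x0 * (2.0 * ratio * ratio - 1.0)


# Example with an ideal cos(theta) component for a source at 70 degrees.
theta = math.radians(70.0)
x0 = 0.4 + 0.9j
print(abs(cos2_from_cos(x0 * math.cos(theta), x0) - x0 * math.cos(2.0 * theta)))
```

In this idealized case the difference printed is at rounding-error level. With measured spectra the ratio would only be approximately real, so a practical implementation would need to decide how to handle its imaginary part.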
[0107]
Thus, by omitting MICs 5 and 6 and using the five microphones MIC0 to MIC4, the main axis of the directivity can be variably controlled with a simple configuration and the directivity can easily be directed toward the target sound source.
[0108]
In addition, since Equation 41 described above makes MIC0 unnecessary, MIC0 can be omitted as shown in FIG. 14B.
[0109]
As a result, by omitting MICs 0, 5, and 6 and using the four microphones MIC1 to MIC4, the main axis of the directivity can be variably controlled and the directivity can easily be directed toward the target sound source.
[0110]
In addition, Equation 46 makes the calculation of the sin θ component by Equation 22 unnecessary, and Equation 50 makes the calculation of the cos 2θ component by Equation 24 unnecessary. The mixed output of xMIC3(t) and xMIC4(t) is therefore no longer needed, and MICs 3 and 4 can be omitted as shown in FIG. 14C.
[0111]
Thus, by omitting MICs 3, 4, 5, and 6 and using the three microphones MIC0 to MIC2, the main axis of the directivity can be variably controlled with an even simpler configuration and the directivity can easily be directed toward the target sound source.
[0112]
[Equation 47]

[0113]
[Formula 48]

[0114]
[Formula 49]

[0115]
[Equation 50]

[0116]
In FIG. 14C, MIC0 is provided again when MICs 3 and 4 are omitted. This is because the MIC0 signal was being obtained from the signals of MIC1 to MIC4, which is no longer possible once MICs 3 and 4 are omitted.
[0117]
In the above-described embodiment, the second-order Fourier series expansion indicating a double angle component has been described. However, the present invention is not limited to this, and may be applied to third-order or higher-order Fourier series expansion.
[0118]
That is, using Equations 51 and 52, the cos 3θ component, which is a triple-angle component, can be generated from XA(ω) as shown in Equation 53, and the sin 3θ component, which is a triple-angle component, can be generated from XA(ω) as shown in Equation 54.
[0119]
As a result, components of triple angle and higher can be generated, so the Fourier series approximation can be extended to triple angle and higher, enabling higher-order Fourier series expansion.
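For reference, the standard triple-angle identities express the third-order components in terms of the first-order ones. Equations 51 to 54 are not reproduced in this text, so the following is offered only as the textbook form of such a relationship, not as the patent's own expressions.

```latex
\begin{align}
\cos 3\theta &= 4\cos^{3}\theta - 3\cos\theta \\
\sin 3\theta &= 3\sin\theta - 4\sin^{3}\theta
\end{align}
```

Applied to components normalized by the MIC0 spectrum, identities of this kind would allow third-order terms of the Fourier series to be formed without additional microphones.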
[0120]
[Formula 51]

[0121]
[Formula 52]

[0122]
[Equation 53]

[0123]
[Formula 54]

[0124]
[Effects of the Invention]
  The microphone device of the present invention controls the directivity characteristics of a microphone using microphones to which sound waves from a sound source are input. It comprises a reference microphone; a first pair of microphones arranged at equal distances about the reference microphone; a second pair of microphones arranged at equal distances about the reference microphone, orthogonal to the first pair; a third pair of microphones arranged at equal distances about the reference microphone, inclined at 45 degrees to the first and second pairs; an A/D conversion unit that converts the outputs of the reference microphone and of each microphone of the first, second, and third pairs into digital signals; and an arithmetic processing unit that performs signal processing on the digital signals from the A/D conversion unit. The reference microphone and the microphones of the first, second, and third pairs are arranged on the same plane. The arithmetic processing unit performs: processing that obtains the difference between the outputs of the first pair of microphones and Fourier-transforms the difference to obtain a first intermediate generation output that is matched in phase with the output of the reference microphone and whose amplitude varies as cos θ relative to that output; processing that obtains the difference between the outputs of the second pair of microphones and Fourier-transforms the difference to obtain a second intermediate generation output that is matched in phase with the output of the reference microphone and whose amplitude varies as sin θ relative to that output; processing that obtains the sum of the outputs of the second pair of microphones and Fourier-transforms the sum to obtain a third intermediate generation output whose amplitude varies as cos 2θ relative to the output of the reference microphone; processing that obtains the sum of the outputs of the third pair of microphones and Fourier-transforms the sum to obtain a fourth intermediate generation output whose amplitude varies as sin 2θ relative to the output of the reference microphone; and processing that represents the target directivity characteristic by the coefficients α0, α1, β1, α2, and β2 of a second-order Fourier series and weights the output of the reference microphone and the first, second, third, and fourth intermediate generation outputs with the coefficients α0, α1, β1, α2, and β2, respectively, and adds them. As a result, the main axis of the directivity can be controlled arbitrarily and the sharpness of the directivity can be improved.
[0125]
  Further, in the microphone device of the present invention as described above, the arithmetic processing unit prepares in advance sets of the coefficients α1, β1, α2, and β2 for the center angles of a plurality of main axes, and weights the first to fourth intermediate generation outputs using the coefficients corresponding to the center angle of the main axis whose sound is to be separated. Sounds from a plurality of main axes can thus be separated and acquired in real time, so that with a single microphone device it is possible, for example, to separate sound sources to the left and right of the microphone and to record or recognize speech in real time.
[0126]
  Further, in the microphone device of the present invention as described above, the reference microphone is omitted and the arithmetic processing unit approximates the output of the reference microphone by the sum of the outputs of the first and second pairs of microphones. The microphone device can therefore be made smaller and simpler, and by omitting the reference microphone and using six microphones, the main axis of the directivity can be variably controlled and the directivity can easily be directed toward the target sound source.
[0127]
  Further, in the microphone device of the present invention as described above, the third pair of microphones is omitted and the arithmetic processing unit expresses the fourth intermediate generation output using the first intermediate generation output. The microphone device can therefore be made smaller and simpler, and by omitting the third pair of microphones and using five microphones, the main axis of the directivity can be variably controlled with a simple configuration and the directivity can easily be directed toward the target sound source.
[0128]
  Further, in the microphone device of the present invention as described above, the third pair of microphones is omitted and the arithmetic processing unit expresses the fourth intermediate generation output using the first intermediate generation output. Therefore, by omitting the reference microphone and the third pair of microphones and using four microphones, the main axis of the directivity can be variably controlled and the directivity can easily be directed toward the target sound source.
[0129]
  Further, in the microphone device of the present invention as described above, the second pair of microphones is omitted, and the arithmetic processing unit expresses the second intermediate generation output using the first intermediate generation output and also expresses, using the first intermediate generation output, the component whose amplitude varies as cos 2θ relative to the output of the reference microphone. Therefore, by omitting the second and third pairs of microphones and using three microphones, the main axis of the directivity can be variably controlled with an even simpler configuration and the directivity can easily be directed toward the target sound source.
[Brief description of the drawings]
FIG. 1 is a layout diagram of microphone capsules to which the exemplary embodiment is applied;
FIG. 2 is a diagram illustrating the distance difference from MIC0 according to the arrangement of each microphone; FIG. 2A shows the distance difference from MIC0 for the first pair of microphones MIC1 and MIC2, FIG. 2B shows the distance difference from MIC0 for the second pair of microphones MIC3 and MIC4, and FIG. 2C shows the distance difference from MIC0 for the third pair of microphones MIC5 and MIC6.
FIG. 3 is a hardware configuration diagram of the microphone device.
FIG. 4 is a flowchart of signal processing in the arithmetic processing unit.
FIG. 5 is a graph showing an incident angle dependency characteristic of sin (kdcos θ).
FIG. 6 is a graph showing an incident angle dependency characteristic of sin (kdcos θ) / sin (kd).
FIG. 7 is a graph showing incident angle dependence characteristics of sin (kdsinθ).
FIG. 8 is a graph showing incident angle dependence characteristics of sin (kdsinθ) / sin (kd).
FIG. 9 is a diagram showing directivity characteristics that are approximate targets in a Fourier series.
FIG. 10 is a diagram illustrating an example of directivity characteristics in a Fourier series.
FIG. 11 is a diagram showing a simulation result of directivity when θc = 0 degrees.
FIG. 12 is a diagram showing a simulation result of directivity when θc = 135 degrees.
FIG. 13 is a diagram showing incident angle dependence characteristics of output sums of MIC1 to MIC4.
FIG. 14 is a diagram showing the omission of MICs; FIG. 14A shows five MICs with MICs 5 and 6 omitted, FIG. 14B shows four MICs with MICs 0, 5, and 6 omitted, and FIG. 14C shows three MICs with MICs 3, 4, 5, and 6 omitted.
FIG. 15 is a block diagram of a conventional microphone system.
[Explanation of symbols]
1 ... MIC0, 2 ... MIC1, 3 ... MIC2, 4 ... MIC3, 5 ... MIC4, 6 ... MIC5, 7 ... MIC6, 8-14 ... Amplifier, 15-21 ... A / D Converter, 22 ... arithmetic processing device, 23 ... recording device or voice recognition device

Claims

In a microphone device that controls the directional characteristics of a microphone using a microphone to which sound waves from a sound source are input,
A reference microphone;
A first pair of microphones arranged at equal intervals around the reference microphone;
A second pair of microphones arranged at equal intervals around the reference microphone, orthogonal to the first pair of microphones;
A third pair of microphones arranged at equal intervals with an inclination of 45 degrees with respect to the first pair of microphones and the second pair of microphones around the reference microphone;
An A / D conversion unit that converts the output of each of the reference microphone and the first, second, and third pairs of microphones into a digital signal;
An arithmetic processing unit that performs signal processing on the digital signal from the A / D conversion unit,
The reference microphone and the first, second, and third pair of microphones are arranged on the same plane,
The arithmetic processing unit performs:
Processing of obtaining a difference between the outputs of the first pair of microphones and Fourier-transforming the difference to obtain a first intermediate generation output that is matched in phase with the output of the reference microphone and whose amplitude changes with cos θ with respect to the output of the reference microphone;
Processing of obtaining a difference between the outputs of the second pair of microphones and Fourier-transforming the difference to obtain a second intermediate generation output that is matched in phase with the output of the reference microphone and whose amplitude changes with sin θ with respect to the output of the reference microphone;
Processing of obtaining a sum of the outputs of the second pair of microphones and Fourier-transforming the sum to obtain a third intermediate generation output whose amplitude changes with cos 2θ with respect to the output of the reference microphone;
Processing of obtaining a sum of the outputs of the third pair of microphones and Fourier-transforming the sum to obtain a fourth intermediate generation output whose amplitude changes with sin 2θ with respect to the output of the reference microphone; and
Processing of representing the target directivity characteristic by the coefficients α0, α1, β1, α2, and β2 of a second-order Fourier series, and weighting and adding the output of the reference microphone, the first intermediate generation output, the second intermediate generation output, the third intermediate generation output, and the fourth intermediate generation output using the coefficients α0, α1, β1, α2, and β2, respectively.

The microphone device according to claim 1, wherein
The arithmetic processing unit prepares in advance sets of the coefficients α1, β1, α2, and β2 for the central angles of a plurality of main axes, and weights the first to fourth intermediate generation outputs using the coefficients corresponding to the central angle of the main axis from which sound is to be separated.

The microphone device according to claim 1, wherein
The reference microphone is omitted, and
The arithmetic processing unit approximates the output of the reference microphone by the sum of the outputs of the first and second pairs of microphones.

The microphone device according to claim 1, wherein
The third pair of microphones is omitted, and
The arithmetic processing unit expresses the fourth intermediate generation output using the first intermediate generation output.

The microphone device according to claim 3, wherein
The third pair of microphones is omitted, and
The arithmetic processing unit expresses the fourth intermediate generation output using the first intermediate generation output.

The microphone device according to claim 4, wherein
The second pair of microphones is omitted, and
The arithmetic processing unit expresses the second intermediate generation output using the first intermediate generation output, and expresses a component whose amplitude changes with cos 2θ with respect to the output of the reference microphone using the first intermediate generation output.