JPH04212600A

JPH04212600A - Voice input device

Info

Publication number: JPH04212600A
Application number: JP2405395A
Authority: JP
Inventors: Toru Miyamae (宮前 徹)
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1990-12-05
Filing date: 1990-12-05
Publication date: 1992-08-04

Abstract

PURPOSE: To provide a voice input device that selects only the audio signal from a sound source at a desired position by using a plurality of directional microphones. CONSTITUTION: The power of the audio signals collected by microphones A1-An is measured by power measuring instruments B1-Bn, and the SNR is then measured by SNR measuring instruments C1-Cn. A comparator circuit 11 judges from the powers S1-Sn whether the audio signal comes from the sound source at the desired position, using a conditional expression such as Smax (maximum value) / Smin (minimum value) < 1 + Th (threshold value). When it judges that the audio signal comes from the sound source at the desired position, it supplies a selection enable signal to a selecting circuit 12. On receiving the selection enable signal, the selecting circuit 12 detects the maximum value of the SNR information or the power information, selects the audio signal of the microphone corresponding to that maximum, and outputs it to a voice recognizing device 2.

Description

[Detailed description of the invention]

[0001]

[Field of Industrial Application] The present invention relates to a voice input device that uses a plurality of directional microphones, selects the optimal audio signal from among them, and supplies that audio signal to a voice recognition device or the like.

[0002]

[Prior Art] To raise the recognition rate of a voice recognition device, the noise contained in the audio signal supplied from a microphone or the like to its voice input section must be kept as low as possible. Various methods of removing noise for voice recognition have therefore been considered: (1) collecting sound so that no noise is included when a microphone or the like converts sound waves in space into an electrical signal, and (2) converting the sound waves into an electrical signal with a microphone or the like and then applying signal processing to extract only the desired audio signal.

[0003] In method (1), using a microphone with a narrow directivity angle, for example, makes it easy to obtain an electrical signal containing only the sound waves from a sound source within a specific narrow range.

[0004] In method (2), the background noise surrounding the electrical signal (for example, an audio signal) from the desired sound source is removed with a filter that passes only the desired audio signal.

[0005]

[Problems to be Solved by the Invention] However, the voice input methods described above have the following problems. Even when a microphone with a narrow directivity angle is used, method (1) cannot remove noise that arrives from the direction in which the microphone is pointed. Method (2) can cope with stationary noise, but has difficulty coping with sudden (random) noise.

[0006] The present invention has been made in view of the above problems, and its object is to provide a voice input device that selects the optimal audio signal and supplies it externally.

[0007]

[Means for Solving the Problems] To achieve the above object, the present invention improves the voice input device so that it measures the power and SNR (signal-to-noise ratio) of the audio signals collected by a plurality of microphones with narrow directivity angles and selects the microphone that supplies the optimal audio signal.

[0008] That is, a voice input device that collects audio signals using a plurality of directional microphones and obtains a desired audio signal from the collected audio signals is characterized by having determining means for determining whether the audio signals collected by the plurality of microphones were produced by a sound source at a desired position, and selection means for selecting, when the audio signal is determined to have been produced at the desired position, the desired audio signal according to the maximum value of signal-to-noise ratio information or power information of the audio signals.

[0009]

[Operation] According to the present invention, the determining means determines whether the audio signals collected by the plurality of directional microphones were produced by a sound source at the desired position, and thereby whether they were uttered by the desired speaker. If the audio signal is determined to come from a sound source at the desired position, the selection means selects the audio signal with the maximum SNR information or power information among the plurality of audio signals. The optimal audio signal is thus obtained, and noise signals from sound sources at other positions are not selected.

[0010]

[Embodiment] A preferred embodiment of the voice input device according to the present invention is described next with reference to the drawings. Identical members are given identical reference numerals in the figures.

[0011] FIG. 1 is a functional block diagram of the voice input device 1. In the figure, A1 to An are microphones, B1 to Bn are power measuring devices, C1 to Cn are SNR measuring devices, 11 is a comparison circuit, 12 is a selection circuit, and 2 is a voice recognition device.

[0012] In this voice input device 1, microphones A1 to An with narrow directivity angles are suitably arranged in space. Each microphone converts sound into an electrical audio signal Si1 to Sin, which is supplied to the corresponding power measuring device B1 to Bn. The power measuring devices B1 to Bn measure the power of the supplied audio signal and the power of the noise. If the audio signal is written x(t), the power P is expressed as the mean value of x^2(t). The audio signal powers obtained here are denoted S1 to Sn; the signal present when no audio signal is being input is treated as noise, and its powers are denoted N1 to Nn. The audio signal powers S1 to Sn and the noise powers N1 to Nn are supplied from the respective power measuring devices B1 to Bn to the SNR measuring devices C1 to Cn and to the selection circuit 12.
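As an illustration only, the power measurement described above (the mean value of x^2(t)) might be sketched in software as follows. The function name `mean_square_power` and the sample blocks are hypothetical; the patent describes hardware measuring devices B1 to Bn and does not specify block lengths or sampling.

```python
def mean_square_power(samples):
    """Return the mean of x(t)^2 over one block of samples.

    Hypothetical software stand-in for one power measuring device Bi.
    """
    if not samples:
        raise ValueError("samples must not be empty")
    return sum(x * x for x in samples) / len(samples)


# Assumed example blocks: one captured while the user speaks,
# one captured while no audio signal is being input (treated as noise).
speech_block = [0.5, -0.5, 0.5, -0.5]
noise_block = [0.1, -0.1, 0.1, -0.1]

S1 = mean_square_power(speech_block)  # audio signal power for microphone A1
N1 = mean_square_power(noise_block)   # noise power for microphone A1
```

Measuring the noise power on a block with no speech mirrors the text's definition of noise as the signal present when no audio signal is input.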

[0013] The SNR measuring devices C1 to Cn calculate the SNR from the input audio signal powers S1 to Sn and noise powers N1 to Nn. The values SNR1 to SNRn obtained by the respective SNR measuring devices C1 to Cn are supplied to the selection circuit 12. The comparison circuit 11 checks whether the input audio-signal power signals S1 to Sn originate from a sound source within a specified area. If it judges that the sound was produced within the specified area, it supplies a selection enable signal to the selection circuit 12.
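The SNR calculation can be sketched as a plain power ratio, SNRi = Si/Ni, which is the form the description uses later for SNR1 and SNR2. The function name `snr` and the small floor `eps` that guards against division by zero are assumptions added for the sketch; the patent does not say how a zero noise power is handled.

```python
def snr(signal_power, noise_power, eps=1e-12):
    """Return the linear signal-to-noise ratio S/N.

    eps is a hypothetical floor on the noise power so that a silent
    noise block does not cause a division by zero.
    """
    return signal_power / max(noise_power, eps)


# Assumed powers for two microphones.
signal_powers = [0.5, 0.5]    # S1, S2
noise_powers = [0.125, 0.25]  # N1, N2
snrs = [snr(s, n) for s, n in zip(signal_powers, noise_powers)]
```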

[0014] Whether the sound originates from a sound source within the specified area is judged as follows. Among the power signals S1 to Sn supplied from the power measuring devices B1 to Bn, let the maximum power be Smax and the minimum power be Smin; the circuit then judges whether condition (1), Smax/Smin < 1 + Th, is satisfied. Here Th is a threshold, a constant set by adjustment according to the spatial arrangement of the microphones, the microphone gains, and so on. This condition is only one example.
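A minimal sketch of condition (1), Smax/Smin < 1 + Th, is given below. The function name `satisfies_condition_1` is hypothetical, and treating a non-positive Smin as a failed check is an assumption made to avoid dividing by zero; the text does not cover that case.

```python
def satisfies_condition_1(powers, th):
    """Return True when Smax / Smin < 1 + Th for the given signal powers."""
    s_max = max(powers)
    s_min = min(powers)
    if s_min <= 0:
        # Assumption: with no measurable power at some microphone the
        # ratio is undefined, so the check is treated as failed.
        return False
    return s_max / s_min < 1 + th


# Powers that are nearly equal across microphones pass the check;
# strongly unequal powers do not.
print(satisfies_condition_1([1.0, 1.05], th=0.1))  # True
print(satisfies_condition_1([1.0, 2.0], th=0.1))   # False
```

A small Th demands nearly equal power at every microphone, which is why the text ties its value to microphone placement and gain.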

[0015] If this condition is not satisfied, the circuit judges that the sound was not produced within the directivity region of the microphones and does not supply the selection enable signal to the selection circuit 12. If the condition is judged to be satisfied, the selection enable signal is supplied to the selection circuit 12. This processing in the comparison circuit 11 serves as preprocessing for the selection circuit 12, described below, which finally selects the single audio signal from the optimal microphone.

[0016] An example circuit configuration for the comparison circuit 11 is shown in FIG. 2. In the figure, the comparison circuit 11 comprises a maximum value detector 111, a minimum value detector 112, a divider 113, and a comparator 114. The audio-signal power signals S1 to Sn supplied from the SNR measuring devices C1 to Cn are fed to both the maximum value detector 111 and the minimum value detector 112, which hold the maximum value Smax and the minimum value Smin, respectively. The divider 113 then divides Smax by Smin, and the comparator 114 checks whether the quotient is smaller than the externally supplied value (1 + Th). If it is smaller, the selection enable signal is output; otherwise it is not.

[0017] The audio-signal power signals Si that satisfied condition (1), together with their SNR information, are supplied to the selection circuit 12, where one of them is selected. The selection method is as follows: among the input SNRs, the power signal Si of the audio signal of the microphone showing the highest SNR is selected.

[0018] An example circuit configuration for the selection circuit 12 is described next with reference to FIG. 3. In the figure, the selection circuit 12 comprises maximum value detectors 121 and 122 and a selection switch 123. The SNR1 to SNRn information supplied from the SNR measuring devices C1 to Cn is fed to the maximum value detector 122, which selects the maximum SNR. The power signals P1 to Pn supplied from the power measuring devices B1 to Bn are likewise fed to the maximum value detector 121, which selects the maximum power signal. If the selection enable signal from the comparison circuit 11 has been supplied to the selection switch 123, the audio signal that yielded the maximum SNR is selected from among Si1 to Sin and output.
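The behaviour of the selection circuit can be sketched as a gated argmax. The function name `select_output`, the `use_power` flag, and the returned `(index, signal)` tuple are hypothetical conveniences; the patent describes detectors 121 and 122 and switch 123 as circuit blocks, not as software.

```python
def select_output(signals, snrs, powers, enable, use_power=False):
    """Return (index, signal) of the chosen microphone, or None.

    enable    -- selection enable signal from the comparison circuit
    use_power -- assumed flag: pick by maximum power (detector 121)
                 instead of maximum SNR (detector 122)
    """
    if not enable:
        return None  # no selection enable signal: nothing is output
    metric = powers if use_power else snrs
    best = max(range(len(metric)), key=lambda i: metric[i])
    return best, signals[best]


# Assumed inputs for two microphones; signals are placeholder labels.
signals = ["Si1", "Si2"]
print(select_output(signals, snrs=[2.0, 8.0], powers=[1.0, 0.9], enable=True))
# (1, 'Si2')
print(select_output(signals, snrs=[2.0, 8.0], powers=[1.0, 0.9], enable=False))
# None
```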

[0019] The audio signal selected here is finally output as the output signal of the voice input device 1. When this output signal is supplied to, for example, the voice input section of the voice recognition device 2 for recognition, the recognition rate can be raised over the prior art by the amount that the influence of noise is suppressed. In this way, the audio signal with the best SNR can be selected automatically from the audio signals collected by the plurality of microphones.

[0020] In the above embodiment the SNR value was used as the index for selecting the optimal audio signal. More simply, in some cases the optimal audio signal can also be selected by comparing the values of the input audio signal power P instead of the SNR and selecting the largest power. This makes it possible to omit the SNR measuring devices C1 to Cn and so reduce the scale of the hardware.

[0021] Next, FIG. 4 shows how voice recognition using the voice input device 1 is operated in practice with two microphones. In the figure, user C sits in front of a personal computer 3. Two microphones A1 and A2, placed in front of user C to the left and right, collect the words spoken by user C. The collected audio signal is processed by the personal computer 3 via the voice input device 1 and the voice recognition device 2, and the words spoken by user C are displayed as character strings on the display of the personal computer 3 to compose text.

[0022] Let the directivity angle of microphone A1 be θ1 and that of microphone A2 be θ2, and let the region where the directivity ranges of microphones A1 and A2 intersect be the intersection range D (hatched region). A noise source A lies within the directivity range of microphone A1, behind user C, and a noise source B lies within the directivity range of microphone A2, also behind user C.

[0023] The operation of the voice input device 1 under this arrangement is now described, starting with the case in FIG. 4 where only noise source A is present and emitting noise. Noise source A lies within the directivity range of microphone A1 and not within that of microphone A2. Let N1 be the noise power and S1 the power of user C's voice as collected by microphone A1 and measured by power measuring device B1, and let N2 be the noise power and S2 the power of user C's voice as collected by microphone A2 and measured by power measuring device B2. Because noise source A lies within the directivity range of microphone A1, the noise powers naturally satisfy N1 > N2.

[0024] Consequently, the SNR of the audio signal collected by microphone A1, SNR1 (= S1/N1), and that of the audio signal collected by microphone A2, SNR2 (= S2/N2), satisfy SNR1 < SNR2. The selection circuit 12 therefore supplies the audio signal collected by microphone A2 to the voice recognition device 2, so the noise signal from noise source A does not enter the voice recognition device and the recognition rate is not degraded by the noise.

[0025] Next, the operation of the voice input device 1 when noise source B suddenly emits noise is described. Noise source B lies within the directivity range of microphone A2 and not within that of microphone A1. In this case the noise powers satisfy N1 < N2; the noise power collected by microphone A2 is the larger.

[0026] Therefore SNR1 > SNR2, and the selection circuit 12 supplies the audio signal collected by microphone A1 to the voice recognition device 2.
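The two noise cases can be traced with concrete numbers. All power values below are assumed for illustration (equal voice power at both microphones, unequal noise power); the function name `pick_microphone` is hypothetical.

```python
def pick_microphone(signal_powers, noise_powers):
    """Return (index of the microphone with the highest S/N, list of SNRs)."""
    snrs = [s / n for s, n in zip(signal_powers, noise_powers)]
    best = max(range(len(snrs)), key=lambda i: snrs[i])
    return best, snrs


# Case 1: only noise source A is active, so N1 > N2.
idx_a, snrs_a = pick_microphone([1.0, 1.0], [0.5, 0.125])
print(idx_a, snrs_a)  # 1 [2.0, 8.0]  -> microphone A2 is selected

# Case 2: noise source B emits a sudden noise, so N1 < N2.
idx_b, snrs_b = pick_microphone([1.0, 1.0], [0.125, 0.5])
print(idx_b, snrs_b)  # 0 [8.0, 2.0]  -> microphone A1 is selected
```

Index 0 corresponds to microphone A1 and index 1 to microphone A2.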

[0027] Next, consider the case where noise source B is a person, and the voice recognition device would malfunction by responding to speech from noise source B while user C is silent. The following explains how the voice input device 1 of this embodiment prevents this malfunction.

[0028] To prevent this malfunction, the device judges whether noise source B lies within the specified area (in FIG. 4, the intersection range D of the directivities of microphones A1 and A2). If the source is judged to lie within the specified area, its audio signal is selected. If, as shown in FIG. 4, noise source B is judged not to lie within the intersection range D, the audio signal emitted by noise source B is not supplied to the voice recognition device 2.

[0029] Specifically, the judgment is made as follows. Sound is collected by microphones A1 and A2, and the power of each collected audio signal is measured by power measuring devices B1 and B2. The SNR measuring devices C1 and C2 then calculate the SNR: C1 obtains SNR1 = S1/N1 and C2 obtains SNR2 = S2/N2. This SNR information is supplied to the selection circuit 12. The comparison circuit 11 checks whether the following condition (2), a modification of condition (1), is satisfied: for example, 1 - Th < S1/S2 < 1 + Th, where 0 < Th << 1 (for example, Th = 0.1).

[0030] In this example, microphones A1 and A2 are at equal distances from user C, so in the absence of noise source B the power of user C's audio signal is approximately the same at both microphones and S1/S2 is approximately 1. Condition (2) is therefore satisfied, and the selection enable signal is supplied to the selection circuit 12. The selection circuit 12 detects the maximum of the SNR information supplied from the SNR measuring devices C1 and C2 and selects the corresponding input, either that of microphone A1 or that of microphone A2.

[0031] In FIG. 4, however, noise source B lies outside the intersection range D. Let S2 be the power at microphone A2 of the audio signal due to speech from noise source B, and S1 the power of that signal at microphone A1. Then S1 << S2, because noise source B lies outside the directivity range of microphone A1, so microphone A1 picks up almost none of its sound and S1 is close to 0. With S1 << S2, condition (2) is not satisfied. The comparison circuit 11 therefore does not supply the selection enable signal to the selection circuit 12 for the audio signal from noise source B, and that signal is not output to the voice recognition device 2. If a noise source lies within the directivity range of neither microphone A1 nor A2, condition (2) may be satisfied; even so, because the source is outside both directivity ranges, the power of the audio signals collected by microphones A1 and A2 is close to 0, and there is no concern that voice recognition will malfunction. In this way, the user is placed within the intersection of the directivity ranges of the two directional microphones, and audio signals from noise sources outside that intersection are not selected. A voice input device of this kind can therefore be used as the voice input section of voice recognition devices, audio equipment, and the like installed in manufacturing plants, aircraft, vehicles, and so on, and can contribute to improving the voice recognition rate.
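Condition (2), 1 - Th < S1/S2 < 1 + Th, and the cases discussed for it can be sketched as follows. The function name `satisfies_condition_2`, the handling of a non-positive S2, and all power values are assumptions for illustration; only the inequality and the example Th = 0.1 come from the description.

```python
def satisfies_condition_2(s1, s2, th=0.1):
    """Return True when 1 - Th < S1/S2 < 1 + Th."""
    if s2 <= 0:
        # Assumption: an undefined ratio is treated as a failed check.
        return False
    ratio = s1 / s2
    return 1 - th < ratio < 1 + th


# User C in the intersection range D: nearly equal power at A1 and A2.
print(satisfies_condition_2(1.0, 1.05))   # True  -> selection enabled

# Speaker at noise source B: A1 picks up almost nothing, so S1 << S2.
print(satisfies_condition_2(0.01, 1.0))   # False -> selection not enabled

# Source outside both directivity ranges: the ratio can still be near 1,
# but both powers are close to 0, as the description notes.
print(satisfies_condition_2(1e-6, 1e-6))  # True
```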

[0032] The above embodiment was described using a voice input device that supplies audio signals to a voice recognition device as an example, but the present invention is not limited to voice recognition devices; the voice input device can also be applied when supplying audio signals to other audio equipment.

[0033]

[Effects of the Invention] As described above, the present invention provides the following effect.

[0034] By having the determining means and the selection means, only the audio signal from a sound source at the desired position is selected, according to the maximum value of the power information or SNR information of the audio signals, and noise from sound sources at other positions is not selected, so the optimal audio signal can be obtained.

[Brief explanation of the drawing]

FIG. 1 is a block diagram of the voice input device according to this embodiment.

FIG. 2 is a circuit diagram of the comparison circuit in FIG. 1.

FIG. 3 is a circuit diagram of the selection circuit in FIG. 1.

FIG. 4 is a conceptual diagram of an example of operation of the voice input device.

[Explanation of symbols]

1: Voice input device
11: Comparison circuit (determining means)
12: Selection circuit (selection means)
A1 to An: Microphones
B1 to Bn: Power measuring devices
C1 to Cn: SNR measuring devices

Claims

[Claims]

1. A voice input device that collects audio signals using a plurality of directional microphones and obtains a desired audio signal from the collected audio signals, the voice input device comprising: determining means for determining whether the audio signals collected by the plurality of microphones were produced by a sound source at a desired position; and selection means for selecting, when the audio signal is determined to have been produced at the desired position, the desired audio signal according to the maximum value of signal-to-noise ratio information or power information of the audio signals.