JP4005203B2

JP4005203B2 - In-vehicle speech recognition device

Info

Publication number: JP4005203B2
Application number: JP02197598A
Authority: JP
Inventors: 和広崎山; 英樹北尾; 収岩田
Original assignee: Denso Ten Ltd
Current assignee: Denso Ten Ltd
Priority date: 1998-02-03
Filing date: 1998-02-03
Publication date: 2007-11-07
Anticipated expiration: 2018-02-03
Also published as: JPH11219193A

Description

【０００１】
【発明の属する技術分野】
本発明は、発話者の音声以外のノイズを低減して認識率を向上した車載用音声認識装置に関する。
【０００２】
【従来の技術】
図６は従来の車載用音声認識装置のシステム構成を示すブロック図である。以下、図に従って説明する。
車両等の運転中にナビゲーション機器やオーディオ機器等を操作するに際してスイッチ操作による運転者の負担を軽減するために、運転者（発話者）の発声した音声を認識して接続された機器に入力指示する音声認識装置がある。
【０００３】
１１は話者（発話者）の発声した音声を電気信号に変換する無指向性のマイクロフォンである。３１はマイクロフォン１１で検出された信号レベルを調整して音声認識部８に適切なレベルの音声信号を出力するためのゲイン（利得）調整部である。８はナビゲーション機器やオーディオ機器等の入力部に接続された音声認識部である。２はマイクロフォン１１の検出結果に基いてゲイン調整部３１を制御する制御部である。
【０００４】
音声認識率を向上するために、制御部２は音声認識部８に最適なレベルの信号が入力されるように、マイクロフォン１１で検出した話者の音声信号のレベルを基に、ゲイン調整部３１で調整すべきゲインを制御する。
【０００５】
【発明が解決しようとする課題】
車載用音声認識装置では、話者の音声以外に他の同乗者の会話や車両の走行に伴う騒音もマイクロフォンに入る。また、話者の体格、運転中の姿勢等により話者の方向、及び話者とマイクロフォンの距離が変化する。その結果、音声認識部に入力されるノイズや信号レベルが変化して、音声認識率が低下するという問題がある。
【０００６】
この対策として、話者の口元にマイクロフォンを固定する接話式のマイクロフォンを使用する方法もあるが、話者特に運転者には違和感を与える等の問題がある。
本発明は、話者の音声以外のノイズを低減して音声認識率を向上した車載用音声認識装置を提供することを目的とする。
【０００７】
【課題を解決するための手段】
上記目的を達成するために本発明は、発話者の方向を検出する方向検出手段と、発話者の音声を検出する音声検出手段と、前記音声検出手段の指向特性を前記方向検出手段により検出された発話者の方向において高めるように調整する指向特性調整手段と、前記音声検出手段により検出された発話者の音声を認識する音声認識手段を備えたことを特徴とするものである。
【０００８】
また、前記音声検出手段は、複数のマイクロフォンで構成され、前記方向検出手段は、前記複数のマイクロフォンにより検出された検出信号相互間の信号レベル差に基いて前記発話者の方向を検出するものであることを特徴とするものである。
また、前記音声検出手段は、複数のマイクロフォンで構成され、前記方向検出手段は、前記複数のマイクロフォンにより検出された検出信号相互間の遅延時間に基いて前記発話者の方向を検出するものであることを特徴とするものである。
【０００９】
また、発話者の位置を検出する位置検出手段を備え、前記方向検出手段は、前記音声検出手段の設置位置と前記位置検出手段により検出された発話者の位置に基いて前記発話者の方向を検出するものであることを特徴とするものである。
また、発話者の座席の設定状態を検出する座席設定状態検出手段を備え、前記位置検出手段は、前記座席設定状態検出手段により検出された発話者の座席の設定状態に基いて前記発話者の位置を検出するものであることを特徴とするものである。
【００１０】
また、前記音声検出手段は、複数のマイクロフォンで構成され、前記指向特性調整手段は、前記複数のマイクロフォンにより検出された検出信号の利得を調整して重み付けを行う利得調整手段であることを特徴とするものである。
また、発話者の音声を検出する前面に管状体が配設されたマイクロフォンと、前記マイクロフォンにより検出された発話者の音声を認識する音声認識手段を備えたことを特徴とするものである。
また、前記指向特性調整手段は、車両の少なくとも運転席または助手席の発話者の方向の指向性を選択的に高めることを特徴とするものである。
【００１１】
【実施例】
図１は本発明の第１の実施例の車載用音声認識装置のシステム構成を示すブロック図である。図２は複数のマイクロフォンによる話者位検出と指向性調整を示す図で、（ａ）はマイクロフォン（縦方向配置）と話者の位置関係を示す図、（ｂ）はマイクロフォン（横方向配置）と話者の位置関係を示す図である。以下、図に従って説明する。尚、本例は複数のマイクロフォンの出力状態から話者の方向を検出し、音声検出部１の指向性が話者の方向において最大になるようにゲインを調整するものである。
【００１２】
１は運転者や同乗者の音声を電気信号に変換する音声検出部（音声検出手段）で、所定の間隔で配置された複数の無指向性のマイクロフォン１１、１２、１３で構成される。３１はマイクロフォン１１、１２、１３の出力を個々に調整して音声検出部１全体として話者の方向に指向性をもたせるためのゲイン調整部で、電子ボリューム等で構成される。８はナビゲーション機器やオーディオ機器等の入力部に接続される音声認識部であり、この音声認識部８が認識した結果に応じて、ナビゲーション機器やオーディオ機器の動作が制御されることになる。２はマイクロフォン１１、１２、１３の出力レベルまたは遅延時間を基に話者の方向を検出し、各マイクロフォン１１、１２、１３の出力を調整するゲイン調整部３１を制御する制御部である。２１は音声検出部１により検出した話者の方向とゲイン調整部３１の調整値を対応させて記憶するメモリである。
【００１３】
先ず、マイクロフォン１１、１２、１３による話者の方向検出方法について述べる。図２（ａ）において話者８１の発声する音声は３つのマイクロフォン１１、１２、１３において信号レベルＶ１１、Ｖ１２、Ｖ１３と遅延時間ｔ１１、ｔ１２、ｔ１３が検出される。この場合、遅延時間は信号の立上りを検出して、いずれか１つのマイクロフォン（例えば中央のマイクロフォン１２）を基準にした相対的な遅延時間でもよい。マイクロフォンの出力特性が同じになるように予め調整されていると信号レベルは話者８１とマイクロフォン１１、１２、１３の距離に応じて低下する。また、遅延時間は話者８１とマイクロフォン１１、１２、１３の距離の差に比例して長くなる。従って、マイクロフォン１１、１２、１３の位置関係が判っておれば話者の方向が算出できる。
【００１４】
予め、話者８１の方向と各マイクロフォン１１、１２、１３からの信号レベルＶ１１、Ｖ１２、Ｖ１３及び遅延時間ｔ１１、ｔ１２、ｔ１３の関係をメモリ２１に記憶しておく。そして、実際にマイクロフォン１１、１２、１３で信号レベル及び遅延時間を検出して、メモリ２１に記憶されている信号レベル及び遅延時間と話者の方向の関係を使用して話者の方向を算出する。
【００１５】
次に、音声検出部１の指向性調整方法について述べる。３つのマイクロフォン１１、１２、１３を図２（ａ）のごとく話者に対して縦方向に配置して、各マイクロフォン１１、１２、１３の信号レベルに対する調整すべきゲインをＧ１１、Ｇ１２、Ｇ１３とし、マイクロフォン１１、１２、１３のゲインが同じになるように調整（Ｇ１１＝Ｇ１２＝Ｇ１３）すると、音声検出部１の指向性は話者８１の方向が強く、斜めまたは横方向が弱くなるようにできる。
【００１６】
また、３つのマイクロフォン１１、１２、１３を図２（ｂ）のごとく話者に対して横方向に配置して、マイクロフォン１１、１２、１３の順にゲインが高くなるように調整（Ｇ１１＜Ｇ１２＜Ｇ１３）すると、音声検出部１の指向性は話者８１ａの方向が強くなるようにできる。一方、マイクロフォン１３、１２、１１の順にゲインが高くなるように調整（Ｇ１３＜Ｇ１２＜Ｇ１１）すると、音声検出部１の指向性は話者８１ｂの方向が強くなるようにできる。さらに、マイクロフォン１２のゲインをマイクロフォン１１、１３のゲインよりも高くなるように調整（Ｇ１１≦Ｇ１２≧Ｇ１３）すると、音声検出部１の指向性は中央部の話者８１ｃの方向が強くなるようにできる。
【００１７】
予め、話者の方向とゲイン調整部３１において話者の方向の指向性が最大になるように調整すべきゲインＧ１１、Ｇ１２、Ｇ１３をメモリ２１に記憶させておき、話者の方向が検出できた時には、マイクロフォン１１、１２、１３の出力信号に対して話者の方向に対応したゲイン調整を行う。このようにして、話者の方向の音声検出部１の指向性を高め、その他の方向の指向性を低く抑えることにより同乗者の会話や車両からのノイズが低減された音声信号が音声認識部８に入力されるので、話者の音声の認識率が向上する。
【００１８】
尚、３つのマイクロフォン１１、１２、１３を１つの筐体内に一体として組合せたコンパクトなユニット（音声検出部１に相当）を構成し、車内のマップランプやルームランプ裏面に設置してもよく、また、３つのマイクロフォン１１、１２、１３を個々に離れた箇所（例えば、左右のフロントピラー内部とルームランプ裏面）に設置してもよい。３つのマイクロフォンを離して設置した方が話者の方向識別が容易になる。
【００１９】
また、図２（ｂ）のマイクロフォンの配置において、予め、３つのマイクロフォン１１、１２、１３からなる音声検出部１の指向性が運転席（話者８１ｂ）の方向において最大になるように調整すべき各ゲイン（Ｇ１１、Ｇ１２、Ｇ１３）と、助手席（話者８１ａ）の方向において最大になるように調整すべきゲインの２種類をメモリ２１に記憶しておくことにより、運転席、助手席のいずれの話者（８１ａ、８１ｂ）に対しても話者の方向を識別し、その話者に対して音声検出部１の検出信号にゲイン調整して、話者の方向に指向性を持たせることができる。つまり、１つの音声検出部１で運転席、助手席のいずれからでもナビゲーション機器やオーディオ機器に対して認識率のよい入力処理が行える。
【００２０】
以上のように本実施例では、複数のマイクロフォンにより話者の方向を検出して、その検出結果を基に音声検出部の指向性を話者の方向において最大になるように調整するので、話者の音声信号が大きく、同乗者の会話及び車両騒音等のノイズが低減でき、認識度の高い音声認識装置ができる。
図３は本発明の第２の実施例の車載用音声認識装置のシステム構成を示すブロック図である。図４は座席のスライド位置、座面高さ、背もたれ角を説明するための図である。以下、図に従って説明する。尚、本例は運転席の座席の設定状態から話者の方向を検出するものであり、話者の方向を検出した後の音声検出部１の指向性調整は第１の実施例と同様である。
【００２１】
４は車両の運転席（または助手席）の座席の設定状態を検出する座席状態検出部（座席設定状態検出手段）で、座席の前後方向の位置を検出するスライド位置センサ４１、座席の上下方向の位置を検出する座面高さセンサ４２、背もたれの角度を検出する背もたれ角センサ４３で構成される。尚、スライド位置センサ４１、座面高さセンサ４２は移動量（長さ）を検出するセンサで、例えば、スリットと光学センサの組合せで移動量に応じて発生するパルス数を計測する。背もたれ角センサ４３は角度を検出するセンサで、例えば、ボリューム等を使用して回転角に応じて変化する抵抗値（または電圧値）を計測する。また、座席の設定状態が電動で調整できるもの（電動シート）にあっては、特別にセンサ４１、４２、４３を設けず、座席の設定（移動）を行うモータの駆動量を計測しておき、その計測した駆動量で代用してもよい。２は座席状態検出部４の検出結果に基いて運転者（話者）の位置を検出し、音声検出部１と運転者の位置から話者の方向を検出する制御部である。また、制御部２は予め音声検出部１及び座席状態検出部４の検出結果と話者の方向が対応して記憶したメモリ２１を有する。７１は運転者用の座席で、座面部７１ａ、背もたれ部７１ｂ、ヘッドレスト部７１ｃで構成される。尚、音声検出部１、マイクロフォン１１、１２、１３、制御部２、メモリ２１、音声認識部８、ゲイン調整部３１は第１の実施例と名称、機能及び作用が同じであるため同一番号を付し説明は省略する。
【００２２】
予め、運転者が着席した状態で最適な座席の状態、つまり、座席のスライド位置Ｄ１、座面高さＨ１、背もたれ角θ１をスライド位置センサ４１、座面高さセンサ４２、背もたれ角センサ４３により検出して、車両の天井部７０ｂとフロントガラス７０ａの上部の間に設置された位置検出部１に対する話者の方向をメモリ２１に記憶しておく。
【００２３】
次に、運転者が座席に座った時に調整した座席の状態をスライド位置センサ４１、座面高さセンサ４２、背もたれ角センサ４３により検出して、メモリ２１に記憶された座席状態と方向の関係から話者の方向を検出する。
話者の方向が検出されると、第１の実施例と同様に各マイクロフォン１１、１２、１３に対応したゲインＧ１１、Ｇ１２、Ｇ１３で調整して話者の方向における指向性を最大にする。その結果、運転者の体格、姿勢が異なって座席位置及び背もたれ角度が変化しても、常に最適なマイク指向性が得られる。
【００２４】
以上のように本実施例では、座席位置により話者の方向を検出して、その検出結果を基に音声検出部の指向性を話者の方向において最大になるように調整するので、話者の音声信号が大きく、同乗者の会話及び車両騒音等のノイズが低減でき、認識度の高い音声認識装置ができる。
図５は本発明の第３の実施例の車載用音声認識装置のマイクロフォンの設置状態を説明するための図である。以下、図に従って説明する。尚、本例は無指向性のマイクロフォンを使用して話者の方向に音声検出部の指向性が最大になるようにするものである。
【００２５】
車両の内装材面７２ａにパイプ７５を介してマイクロフォン１１を設置することにより、パイプ７５の開口径φ及び長さＬで限定された方向の音がマイクロフォン１１により選択的に検出され、その他の方向の音は検出されないようにできる。パイプ７５の方向を予め話者の方向と一致するように調整しておくことにより同乗者の会話及びノイズが低減できる。
【００２６】
以上のように本実施例では、音声検出部の指向性を話者の方向において最大になるように設置するので、話者の音声信号が大きく、同乗者の会話及び車両騒音等のノイズが低減でき、認識度の高い音声認識装置ができる。
【００２７】
【発明の効果】
以上説明したように、本発明では、話者の音声以外のノイズを低減して音声認識率を向上した車載用音声認識装置が提供できる。
【図面の簡単な説明】
【図１】本発明の第１の実施例の車載用音声認識装置のシステム構成を示すブロック図である。
【図２】複数のマイクロフォンによる話者位検出と指向性調整を示す図である。
【図３】本発明の第２の実施例の車載用音声認識装置のシステム構成を示すブロック図である。
【図４】座席の座面高さ、スライド位置、背もたれ角を示す図である。
【図５】本発明の第３の実施例の車載用音声認識装置のマイクロフォンの設置状態を示す図である。
【図６】従来の車載用音声認識装置のシステム構成を示すブロック図である。
【符号の説明】
１・・・・・音声検出部、４３・・・・背もたれ角センサ、
１１、１２、１３・・・・マイクロフォン、２・・・・・制御部、
４・・・・・座席状態検出部、２１・・・・メモリ、
４１・・・・スライド位置センサ、３１・・・・ゲイン調整部、
４２・・・・座面高さセンサ、８・・・・・音声認識部。[0001]
BACKGROUND OF THE INVENTION
The present invention relates to an in-vehicle voice recognition apparatus that improves noise recognition by reducing noise other than the voice of a speaker.
[0002]
[Prior art]
FIG. 6 is a block diagram showing a system configuration of a conventional in-vehicle speech recognition apparatus. Hereinafter, it demonstrates according to a figure.
In order to reduce the burden on the driver due to the switch operation when operating navigation devices or audio devices while driving a vehicle, etc., it recognizes the voice spoken by the driver (speaker) and inputs instructions to the connected device There is a voice recognition device.
[0003]
Reference numeral 11 denotes an omnidirectional microphone that converts speech uttered by a speaker (speaker) into an electrical signal. Reference numeral 31 denotes a gain adjustment unit for adjusting the signal level detected by the microphone 11 and outputting an appropriate level of audio signal to the audio recognition unit 8. Reference numeral 8 denotes a voice recognition unit connected to an input unit such as a navigation device or an audio device. A control unit 2 controls the gain adjustment unit 31 based on the detection result of the microphone 11.
[0004]
In order to improve the speech recognition rate, the control unit 2 gains the gain adjustment unit 31 based on the level of the speech signal of the speaker detected by the microphone 11 so that a signal having an optimum level is input to the speech recognition unit 8. To control the gain to be adjusted.
[0005]
[Problems to be solved by the invention]
In the in-vehicle voice recognition device, in addition to the voice of the speaker, other passengers' conversations and noises associated with traveling of the vehicle also enter the microphone. Further, the direction of the speaker and the distance between the speaker and the microphone change depending on the physique of the speaker, the posture while driving, and the like. As a result, there is a problem that the noise and signal level input to the voice recognition unit change and the voice recognition rate is lowered.
[0006]
As a countermeasure, there is a method of using a close-talking microphone in which a microphone is fixed to the speaker's mouth, but there is a problem that the speaker, particularly the driver, feels uncomfortable.
An object of the present invention is to provide an in-vehicle speech recognition apparatus that improves noise recognition rate by reducing noise other than the speech of a speaker.
[0007]
[Means for Solving the Problems]
In order to achieve the above object, the present invention provides a direction detecting means for detecting the direction of a speaker, a voice detecting means for detecting a voice of a speaker, and a directivity characteristic of the voice detecting means detected by the direction detecting means. And a voice recognition means for recognizing the voice of the speaker detected by the voice detection means.
[0008]
Further, the voice detection means is composed of a plurality of microphones, and the direction detection means detects the direction of the speaker based on a signal level difference between detection signals detected by the plurality of microphones. It is characterized by being.
Further, the voice detection means is composed of a plurality of microphones, and the direction detection means detects the direction of the speaker based on a delay time between detection signals detected by the plurality of microphones. It is characterized by this.
[0009]
In addition, a position detection unit that detects the position of the speaker is provided, and the direction detection unit determines the direction of the speaker based on the installation position of the voice detection unit and the position of the speaker detected by the position detection unit. It is what is detected.
In addition, a seat setting state detection unit that detects a setting state of the speaker's seat is provided, and the position detection unit is configured to detect the speaker's seat based on the setting state of the speaker's seat detected by the seat setting state detection unit. It is characterized by detecting the position.
[0010]
Further, the voice detecting means is composed of a plurality of microphones, and the directivity adjusting means is a gain adjusting means for performing weighting by adjusting gains of detection signals detected by the plurality of microphones. To do.
Further, the present invention is characterized in that a microphone having a tubular body disposed on the front surface for detecting the voice of the speaker and voice recognition means for recognizing the voice of the speaker detected by the microphone are provided.
The directivity adjusting means selectively enhances the directivity of the direction of the speaker in at least the driver seat or the passenger seat of the vehicle.
[0011]
【Example】
FIG. 1 is a block diagram showing a system configuration of an in-vehicle speech recognition apparatus according to a first embodiment of the present invention. 2A and 2B are diagrams showing speaker position detection and directivity adjustment using a plurality of microphones. FIG. 2A is a diagram showing a positional relationship between a microphone (vertical arrangement) and a speaker. FIG. 2B is a microphone (horizontal arrangement). It is a figure which shows the positional relationship of a speaker. Hereinafter, it demonstrates according to a figure. In this example, the direction of the speaker is detected from the output states of a plurality of microphones, and the gain is adjusted so that the directivity of the voice detection unit 1 is maximized in the direction of the speaker.
[0012]
Reference numeral 1 denotes a voice detection unit (voice detection means) that converts a driver's or passenger's voice into an electrical signal, and includes a plurality of omnidirectional microphones 11, 12, and 13 arranged at predetermined intervals. Reference numeral 31 denotes a gain adjusting unit for individually adjusting the outputs of the microphones 11, 12, and 13 to provide directivity in the direction of the speaker as the entire sound detecting unit 1, and is configured by an electronic volume or the like. Reference numeral 8 denotes a voice recognition unit connected to an input unit such as a navigation device or an audio device. The operation of the navigation device or the audio device is controlled according to the result recognized by the voice recognition unit 8. A control unit 2 detects the direction of the speaker based on the output level or delay time of the microphones 11, 12, and 13 and controls the gain adjustment unit 31 that adjusts the output of each microphone 11, 12, and 13. A memory 21 stores the direction of the speaker detected by the voice detection unit 1 and the adjustment value of the gain adjustment unit 31 in association with each other.
[0013]
First, a method for detecting the direction of the speaker using the microphones 11, 12, and 13 will be described. 2A, signal levels V11, V12, and V13 and delay times t11, t12, and t13 are detected in the three microphones 11, 12, and 13 for the voice uttered by the speaker 81. In this case, the delay time may be a relative delay time based on any one microphone (for example, the center microphone 12) by detecting the rising edge of the signal. When the output characteristics of the microphones are adjusted in advance to be the same, the signal level decreases according to the distance between the speaker 81 and the microphones 11, 12, and 13. The delay time becomes longer in proportion to the difference between the distance between the speaker 81 and the microphones 11, 12, and 13. Therefore, if the positional relationship between the microphones 11, 12, and 13 is known, the direction of the speaker can be calculated.
[0014]
The relationship between the direction of the speaker 81, the signal levels V11, V12, V13 from the microphones 11, 12, 13 and the delay times t11, t12, t13 is stored in the memory 21 in advance. Then, the signal level and delay time are actually detected by the microphones 11, 12, and 13, and the speaker direction is calculated using the relationship between the signal level and delay time stored in the memory 21 and the speaker direction. To do.
[0015]
Next, the directivity adjustment method of the sound detection unit 1 will be described. As shown in FIG. 2A, the three microphones 11, 12, and 13 are arranged in the vertical direction with respect to the speaker, and the gains to be adjusted for the signal levels of the microphones 11, 12, and 13 are G11, G12, and G13. When the gains of the microphones 11, 12, and 13 are adjusted to be the same (G11 = G12 = G13), the directivity of the voice detector 1 is such that the direction of the speaker 81 is strong and the diagonal or lateral direction is weak. it can.
[0016]
Further, as shown in FIG. 2B, the three microphones 11, 12, and 13 are arranged laterally with respect to the speaker, and are adjusted so that the gain increases in the order of the microphones 11, 12, and 13 (G11 <G12 < G13) Then, the directivity of the voice detection unit 1 can be such that the direction of the speaker 81a becomes stronger. On the other hand, by adjusting the gains in the order of the microphones 13, 12, and 11 (G13 <G12 <G11), the directivity of the voice detection unit 1 can be increased in the direction of the speaker 81 b. Furthermore, when the gain of the microphone 12 is adjusted to be higher than the gains of the microphones 11 and 13 (G11 ≦ G12 ≧ G13), the directivity of the voice detection unit 1 is such that the direction of the speaker 81c in the center is stronger. it can.
[0017]
In advance, gains G11, G12, and G13 that should be adjusted so that the directivity of the speaker direction and the speaker direction in the gain adjusting unit 31 are maximized are stored in the memory 21, and the speaker direction can be detected. In the event that the output signal of the microphones 11, 12, 13 is adjusted, the gain adjustment corresponding to the direction of the speaker is performed. In this way, the voice signal in which the voice detection unit 1 in the direction of the speaker is enhanced and the directivity in the other direction is suppressed to reduce the noise from the passenger and the noise from the vehicle is the voice recognition unit. Thus, the recognition rate of the speaker's voice is improved.
[0018]
In addition, a compact unit (corresponding to the sound detection unit 1) in which three microphones 11, 12, and 13 are combined together in one housing may be configured and installed on the back of the map lamp or room lamp in the vehicle. Further, the three microphones 11, 12, and 13 may be installed at locations that are separated from each other (for example, inside the left and right front pillars and the back of the room lamp). If the three microphones are separated from each other, the direction of the speaker can be easily identified.
[0019]
Further, in the microphone arrangement of FIG. 2B, the directivity of the voice detection unit 1 composed of the three microphones 11, 12, and 13 is previously adjusted so as to be maximized in the direction of the driver seat (speaker 81b). The memory 21 stores two types of gains (G11, G12, G13) and gains that should be adjusted so as to be maximized in the direction of the passenger seat (speaker 81a). For any of the speakers (81a, 81b), the direction of the speaker is identified, and the gain is adjusted to the detection signal of the voice detection unit 1 for the speaker, so that the direction of the speaker has directivity. Can be made. That is, a single voice detection unit 1 can perform input processing with a high recognition rate for navigation devices and audio devices from either the driver's seat or the passenger seat.
[0020]
As described above, in this embodiment, the direction of the speaker is detected by a plurality of microphones, and the directivity of the voice detection unit is adjusted to be maximum in the direction of the speaker based on the detection result. The voice signal of the passenger is large, noise such as passengers' conversation and vehicle noise can be reduced, and a voice recognition device with a high degree of recognition can be achieved.
FIG. 3 is a block diagram showing a system configuration of the on-vehicle speech recognition apparatus according to the second embodiment of the present invention. FIG. 4 is a diagram for explaining a seat slide position, a seat surface height, and a backrest angle. Hereinafter, it demonstrates according to a figure. In this example, the direction of the speaker is detected from the setting state of the driver's seat, and the directivity adjustment of the voice detection unit 1 after detecting the direction of the speaker is the same as in the first embodiment. is there.
[0021]
4 is a seat state detection unit (seat setting state detection means) for detecting the setting state of the driver's seat (or passenger seat) of the vehicle, a slide position sensor 41 for detecting the position of the seat in the front-rear direction, and the vertical direction of the seat The seat surface height sensor 42 for detecting the position of the backrest and the backrest angle sensor 43 for detecting the angle of the backrest. The slide position sensor 41 and the seat surface height sensor 42 are sensors for detecting the movement amount (length). For example, a combination of a slit and an optical sensor measures the number of pulses generated according to the movement amount. The backrest angle sensor 43 is a sensor that detects an angle, and measures, for example, a resistance value (or voltage value) that changes according to a rotation angle by using a volume or the like. If the seat setting state can be adjusted electrically (electric seat), the sensor 41, 42, 43 is not provided, and the driving amount of the motor that sets (moves) the seat is measured. The measured drive amount may be substituted. Reference numeral 2 denotes a control unit that detects the position of the driver (speaker) based on the detection result of the seat state detection unit 4 and detects the direction of the speaker from the voice detection unit 1 and the position of the driver. The control unit 2 has a memory 21 in which the detection results of the voice detection unit 1 and the seat state detection unit 4 and the direction of the speaker are stored in advance. A driver's seat 71 includes a seat surface portion 71a, a backrest portion 71b, and a headrest portion 71c. Since the voice detection unit 1, microphones 11, 12, and 13, the control unit 2, the memory 21, the voice recognition unit 8, and the gain adjustment unit 31 have the same names, functions, and operations as the first embodiment, the same numbers are used. The description is omitted.
[0022]
The optimal seat state when the driver is seated in advance, that is, the seat slide position D1, the seat surface height H1, and the backrest angle θ1 are determined by the slide position sensor 41, the seat surface height sensor 42, and the backrest angle sensor 43. The direction of the speaker with respect to the position detection unit 1 installed between the ceiling 70b of the vehicle and the upper portion of the windshield 70a is stored in the memory 21.
[0023]
Next, the state of the seat adjusted when the driver is seated on the seat is detected by the slide position sensor 41, the seat surface height sensor 42, and the backrest angle sensor 43, and the relationship between the seat state and the direction stored in the memory 21 is detected. To detect the direction of the speaker.
When the direction of the speaker is detected, the directivity in the direction of the speaker is maximized by adjusting the gains G11, G12, and G13 corresponding to the microphones 11, 12, and 13, as in the first embodiment. As a result, even if the driver's physique and posture are different and the seat position and the backrest angle change, the optimum microphone directivity can always be obtained.
[0024]
As described above, in this embodiment, the direction of the speaker is detected based on the seat position, and the directivity of the voice detection unit is adjusted to be maximum in the direction of the speaker based on the detection result. Therefore, it is possible to reduce noises such as passengers' conversation and vehicle noise, and a speech recognition device with high recognition can be obtained.
FIG. 5 is a diagram for explaining the installation state of the microphone of the on-vehicle speech recognition apparatus according to the third embodiment of the present invention. Hereinafter, it demonstrates according to a figure. In this example, a non-directional microphone is used so that the directivity of the voice detection unit is maximized in the direction of the speaker.
[0025]
By installing the microphone 11 on the interior material surface 72a of the vehicle via the pipe 75, the sound in the direction limited by the opening diameter φ and the length L of the pipe 75 is selectively detected by the microphone 11, and the other directions Can be prevented from being detected. By adjusting the direction of the pipe 75 in advance so as to coincide with the direction of the speaker, the conversation and noise of the passenger can be reduced.
[0026]
As described above, in this embodiment, the voice detector is installed so that the directivity of the voice detection unit is maximized in the direction of the speaker, so that the speaker's voice signal is large and noise such as passenger conversation and vehicle noise is reduced. And a speech recognition device with a high degree of recognition.
[0027]
【The invention's effect】
As described above, the present invention can provide a vehicle-mounted speech recognition apparatus that improves noise recognition rate by reducing noise other than the speaker's speech.
[Brief description of the drawings]
FIG. 1 is a block diagram showing a system configuration of an in-vehicle speech recognition apparatus according to a first embodiment of the present invention.
FIG. 2 is a diagram illustrating speaker position detection and directivity adjustment using a plurality of microphones.
FIG. 3 is a block diagram showing a system configuration of an on-vehicle speech recognition apparatus according to a second embodiment of the present invention.
FIG. 4 is a diagram showing a seat surface height, a slide position, and a backrest angle of a seat.
FIG. 5 is a diagram showing a microphone installation state of the on-vehicle speech recognition apparatus according to the third embodiment of the present invention.
FIG. 6 is a block diagram showing a system configuration of a conventional in-vehicle speech recognition device.
[Explanation of symbols]
1... Voice detector 43... Back angle sensor,
11, 12, 13... Microphone, 2... Control unit,
4 ... Seat state detection unit, 21 ... Memory,
41... Slide position sensor 31.
42... Seat height sensor, 8.

Claims

Direction detecting means for detecting the direction of the speaker;
A plurality of voice detection means for detecting the voice of the speaker;
A directional characteristic adjusting unit that adjusts a directional characteristic of the voice detecting unit by adjusting a gain of detection signals detected by the plurality of voice detecting units to perform weighted synthesis ;
Storage means for storing the direction of the speaker in correspondence with the gain adjustment value for increasing the directivity of the voice detection means in the direction of the speaker;
Voice recognition means for recognizing the voice of the speaker detected by the voice detection means ,
The in-vehicle speech recognition apparatus characterized in that the directivity adjusting means adjusts the directivity of the voice detecting means based on a gain adjustment value corresponding to the direction of the speaker stored in the storage means .

The voice detection means is composed of a plurality of microphones,
The in-vehicle voice recognition according to claim 1, wherein the direction detecting means detects the direction of the speaker based on a signal level difference between detection signals detected by the plurality of microphones. apparatus.

The voice detection means is composed of a plurality of microphones,
2. The on-vehicle speech recognition apparatus according to claim 1, wherein the direction detecting means detects the direction of the speaker based on a delay time between detection signals detected by the plurality of microphones. .

Comprising position detecting means for detecting the position of the speaker;
The said direction detection means detects the direction of the said speaker based on the installation position of the said audio | voice detection means, and the position of the speaker detected by the said position detection means. In-car speech recognition device.

A seat setting state detecting means for detecting a setting state of a speaker's seat;
5. The in-vehicle voice according to claim 4, wherein the position detecting means detects the position of the speaker based on the seat setting state of the speaker detected by the seat setting state detecting means. Recognition device.

2. The on-vehicle speech recognition apparatus according to claim 1, wherein in the storage means, the direction of the speaker is at least a driver seat and a passenger seat.