JP2011102984A

JP2011102984A - Navigation device, voice recognition method, and program

Info

Publication number: JP2011102984A
Application number: JP2010266397A
Authority: JP
Inventors: Toru Yamamoto; 徹山元; Makoto Akaha; 誠赤羽; Yoshikazu Takahashi; 良和高橋; Hitoshi Okubo; 仁大久保; Eiji Yamamoto; 英二山本; Satoko Ikezawa; 聡子池澤
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2010-11-30
Filing date: 2010-11-30
Publication date: 2011-05-26

Abstract

<P>PROBLEM TO BE SOLVED: To direct the directional axis of a voice input microphone to a sound source. <P>SOLUTION: The navigation device includes: a microphone system which has a plurality of voice input microphones, and a voice signal generation part for adding delays to output signals from the plurality of microphones and adding together the signals having the delays added thereto to generate a voice signal, and in which a directional axis is changed by changing the delay amounts added to the output signals; a delay amount selection part for selecting, on the basis of the output level of a sample voice signal obtained by adding together the signals of sample delays added to the output signals from the plurality of microphones, one of a plurality of combinations of the delay amounts added to the output signals; a recognition part for recognizing the voice signal generated by using the selected delay amounts; and a control unit for controlling the operation of the device on the basis of the recognition result of the recognition part. <P>COPYRIGHT: (C)2011,JPO&INPIT

Description

本発明は車載機器に関し、特に音声認識機能を備えたカーナビゲーションシステムに関するものである。 The present invention relates to an in-vehicle device, and more particularly to a car navigation system having a voice recognition function.

ＧＰＳ(GlobalPositioningSystem：全地球測位システム)を利用したカーナビゲーションシステムが普及している。このカーナビゲーションシステムは、地図情報その他の情報を表示する表示部（一般には、モニタと呼ばれる）と、演算処理装置その他を収容する本体と、カーナビゲーションシステムへの制御情報を入力するためのリモコン等を備えている。最近、リモコンのほかに制御情報を入力する手段として、音声認識機能を有する音声入力装置を備えたカーナビゲーションシステムも実用化されている。この音声入力装置を備えたカーナビゲーションシステムには音声入力用のマイクロフォンが設けられることになる。従来、このマイクロフォンは、例えば、車室内のダッシュボード、ステアリングコラム、サンバイザ、Ａピラー部などに取り付けられていた。そして、マイクロフォンは、カーナビゲーションシステムの本体から引き出されたマイクロフォンケーブルに接続されていた。カーナビゲーションシステムの本体は、車室内ではなくトランクルーム内に設置されることが多い。したがって、音声入力装置を備えたカーナビゲーションシステムでは、トランクルームからマイクロフォンケーブルを引き出し、上述のように、車室内のダッシュボード上等にマイクロフォンを取り付けていた。したがって、必要となるマイクロフォンケーブルの長さも７ｍにも及んでいた。また、マイクロフォンケーブルの配線作業が煩雑であるという問題があった。 Car navigation systems using GPS (Global Positioning System) have become widespread. This car navigation system includes a display unit (generally called a monitor) that displays map information and other information, a main body that houses an arithmetic processing unit and the like, a remote controller for inputting control information to the car navigation system, and the like It has. Recently, a car navigation system including a voice input device having a voice recognition function has been put into practical use as a means for inputting control information in addition to a remote control. A car navigation system equipped with this voice input device is provided with a microphone for voice input. Conventionally, this microphone is attached to, for example, a dashboard, a steering column, a sun visor, an A-pillar portion, etc. in a vehicle interior. The microphone is connected to a microphone cable drawn from the main body of the car navigation system. The main body of a car navigation system is often installed in a trunk room, not in the passenger compartment. Therefore, in a car navigation system equipped with a voice input device, a microphone cable is pulled out from a trunk room, and a microphone is attached on a dashboard or the like in a vehicle cabin as described above. Therefore, the required length of the microphone cable has reached 7 m. In addition, there is a problem that the work of wiring the microphone cable is complicated.

カーナビゲーションシステムの音声入力装置には、音声入力用マイクロフォンの他に、ノイズ信号を入力するためのノイズマイクロフォンを設けることが提案、実用化されている。音声入力用マイクロフォンに入力される信号は音声信号とノイズ信号が混在した信号であり、一方、ノイズマイクロフォンに入力される信号は専らノイズ信号である。したがって、音声入力用マイクロフォンに入力される信号からノイズマイクロフォンに入力される信号を減ずることにより、音声信号のみを抽出することができる。ところが、音声入力用マイクロフォンとノイズマイクロフォンの２つのマイクロフォンを設けることはコストの点で不利である。したがって、１つのマイクロフォンでＳ／Ｎが良好となることが望ましい。 It has been proposed and put to practical use in a voice input device of a car navigation system, in addition to a voice input microphone, a noise microphone for inputting a noise signal. The signal input to the audio input microphone is a signal in which an audio signal and a noise signal are mixed, while the signal input to the noise microphone is exclusively a noise signal. Accordingly, only the audio signal can be extracted by subtracting the signal input to the noise microphone from the signal input to the audio input microphone. However, providing two microphones, a voice input microphone and a noise microphone, is disadvantageous in terms of cost. Therefore, it is desirable that the S / N is good with one microphone.

このような問題点を解消し得る音声入力装置が特開平１０−１１０８４号公報に開示されている。特開平１０−１１０８４号公報の音声入力装置は、遠隔制御を行うためのナビリモコンと、音声認識を用いて制御を行うために音声を入力するマイクロフォンとを有する車載用ナビゲーションシステムの音声入力装置であって、前記ナビリモコン内に前記マイクロフォンを設けることを特徴としている。特開平１０−１１０８４号公報に開示された音声入力システムは、カーナビゲーションシステムの制御信号に入力するためのワイヤレスリモコン内にマイクロフォンを設けているため、マイクロフォンケーブルのための配線作業を行う必要がない。しかも、ユーザが音声入力を行う際には、マイクロフォン内蔵のリモコンを口元に近づけて発声することができる。したがって、ノイズの影響を受けにくいという利点もある。 A voice input device capable of solving such problems is disclosed in Japanese Patent Laid-Open No. 10-11084. The voice input device disclosed in Japanese Patent Laid-Open No. 10-11084 is a voice input device for an in-vehicle navigation system having a navigation remote controller for performing remote control and a microphone for inputting voice for performing control using voice recognition. In the navigation remote controller, the microphone is provided. In the voice input system disclosed in Japanese Patent Laid-Open No. 10-11084, a microphone is provided in a wireless remote controller for inputting a control signal of a car navigation system, so that it is not necessary to perform wiring work for a microphone cable. . In addition, when the user performs voice input, the user can speak with the remote controller with a built-in microphone close to the mouth. Therefore, there is an advantage that it is hardly affected by noise.

特開平１０−１１０８４号公報Japanese Patent Laid-Open No. 10-11084

以上説明したように、ワイヤレスリモコンにマイクロフォンを内蔵する特開平１０−１１０８４号公報の提案は従来の問題点を解消することのできる有益なものである。ところが、マイクロフォン内蔵ワイヤレスリモコンに音声入力を行うためには、ユーザが当該ワイヤレスリモコンを持つ必要がある。音声入力を車両走行中に行う場合には、運転操作に支障をきたすおそれがある。したがって、本発明はマイクロフォンのための長いケーブルが不要で、かつマイクロフォンケーブル配線作業が不要かまたは簡便な車載機器、特にカーナビゲーションシステムの提供を課題とする。また本発明は、音声入力時に運転操作に支障をきたすことのない車載機器、特にカーナビゲーションシステムの提供を課題とする。 As described above, the proposal of Japanese Patent Application Laid-Open No. 10-11084 in which a microphone is incorporated in a wireless remote controller is useful for solving the conventional problems. However, in order to input voice to the wireless remote controller with built-in microphone, the user needs to have the wireless remote controller. When voice input is performed while the vehicle is running, there is a risk of impeding driving operations. Accordingly, it is an object of the present invention to provide an in-vehicle device, particularly a car navigation system, which does not require a long cable for a microphone and does not require a microphone cable wiring operation or is simple. It is another object of the present invention to provide an in-vehicle device, particularly a car navigation system, that does not hinder driving operation during voice input.

上記課題を解決するために、本発明のある観点によれば、音声入力用の複数のマイクロフォンと、上記複数のマイクロフォンからの出力信号のそれぞれにディレイを加え、上記ディレイが加えられたそれぞれの信号を加算した音声信号を生成する音声信号生成部と、を有し上記出力信号に加えられるディレイ量を変更することにより指向性軸が変更されるマイクロフォンシステムと、上記複数のマイクロフォンからの出力信号にそれぞれサンプルディレイを加えた信号を加算したサンプル音声信号の出力レベルに基づいて、上記出力信号のそれぞれに対して加えるディレイ量の複数の組合せの中から、一のディレイ量の組合せを選択するディレイ量選択部と、上記ディレイ量選択部により選択されたディレイ量の組合せを用いて生成された上記音声信号を認識する認識部と、上記認識部の認識結果に基づいて当該装置の動作を制御する制御部と、を有する、ナビゲーション装置が提供される。 In order to solve the above-described problem, according to an aspect of the present invention, a plurality of microphones for audio input and a signal added with the delay are added to each of output signals from the plurality of microphones. An audio signal generation unit that generates an audio signal added to the microphone system, the microphone system in which the directivity axis is changed by changing the delay amount added to the output signal, and the output signals from the plurality of microphones A delay amount that selects one delay amount combination from among a plurality of combinations of delay amounts to be added to each of the above output signals, based on the output level of the sample audio signal obtained by adding the signals to which the sample delay has been added. Generated by using a combination of the selection unit and the delay amount selected by the delay amount selection unit. It has a recognition unit for recognizing voice signals, and a control unit for controlling the operation of the device based on the recognition result of the recognition unit, the navigation apparatus is provided.

また、上記ディレイ量選択部は、上記出力レベルが最も高い上記ディレイ量の組合せを選択してもよい。 The delay amount selection unit may select a combination of the delay amounts having the highest output level.

また、上記ディレイ量決定部が決定したディレイ量の組合せを、上記音源である話者と対応づけて記憶する記憶部をさらに有し、上記音声信号生成部は、指定された話者と対応づけて上記記憶部に記憶されたディレイ量の組合せを用いて上記音声信号を生成してもよい。 A storage unit that stores the combination of the delay amounts determined by the delay amount determination unit in association with the speaker as the sound source; and the audio signal generation unit associates with the designated speaker. The audio signal may be generated using a combination of delay amounts stored in the storage unit.

また、上記ディレイ量選択部は、複数の上記ディレイ量の組合せについて上記出力レベルを検出し、上記出力レベルが最も高くなる上記ディレイ量の組合せを選択してもよい。 The delay amount selection unit may detect the output level for a plurality of combinations of the delay amounts, and select the combination of the delay amounts with the highest output level.

また、上記複数のマイクロフォンからの出力信号は、上記でディレイ量選択部を有する音源推定系統と、上記音声信号生成部、上記認識部、および上記制御部を有する出力信号系統とにそれぞれ入力され、上記ディレイ量選択部は、上記出力信号系統において上記制御部が上記認識結果に基づいた制御を行っている期間中、上記ディレイ量の組合せ選択処理を実行してもよい。 The output signals from the plurality of microphones are respectively input to the sound source estimation system having the delay amount selection unit and the output signal system having the audio signal generation unit, the recognition unit, and the control unit. The delay amount selection unit may execute the delay amount combination selection process during a period in which the control unit performs control based on the recognition result in the output signal system.

また、上記複数のマイクロフォンは、等間隔に配置されてもよい。 The plurality of microphones may be arranged at equal intervals.

また、上記課題を解決するために、本発明の別の観点によれば、複数のマイクロフォンにより音声を取得するステップと、上記複数のマイクロフォンからの出力信号にそれぞれサンプルディレイを加えるステップと、上記サンプルディレイが加えられた信号を加算したサンプル音声信号を生成するステップと、上記サンプル音声信号の出力レベルを検出するステップと、上記出力レベルに基づいて、上記出力信号の夫々に対して加えるディレイ量の組合せを選択するステップと、選択された上記ディレイ量の組合せに基づいて、上記複数のマイクロフォンからの出力信号のそれぞれにディレイを加えるステップと、上記ディレイが加えられたそれぞれの信号を加算することにより、音声信号を生成するステップと、上記音声信号を認識するステップと、上記認識の結果に基づいて当該装置の動作を制御するステップと、を含む、音声認識方法が提供される。 In order to solve the above-described problem, according to another aspect of the present invention, a step of acquiring sound by a plurality of microphones, a step of adding a sample delay to output signals from the plurality of microphones, and the sample A step of generating a sample audio signal obtained by adding the signals to which the delay has been added, a step of detecting an output level of the sample audio signal, and a delay amount to be added to each of the output signals based on the output level A step of selecting a combination, a step of adding a delay to each of the output signals from the plurality of microphones based on the selected combination of the delay amounts, and adding each signal to which the delay has been added A step of generating an audio signal and a step of recognizing the audio signal. And-up, based on the result of the recognition including the steps of controlling the operation of the apparatus, the speech recognition method is provided.

また、上記課題を解決するために、本発明の別の観点によれば、コンピュータを、音声入力用の複数のマイクロフォンと、上記複数のマイクロフォンからの出力信号のそれぞれにディレイを加え、上記ディレイが加えられたそれぞれの信号を加算した音声信号を生成する音声信号生成部と、を有し上記出力信号に加えられるディレイ量を変更することにより指向性軸が変更されるマイクロフォンシステムと、上記複数のマイクロフォンからの出力信号にそれぞれサンプルディレイを加えた信号を加算したサンプル音声信号の出力レベルに基づいて、上記出力信号のそれぞれに対して加えるディレイ量の複数の組合せの中から、一のディレイ量の組合せを選択するディレイ量選択部と、上記ディレイ量選択部により選択されたディレイ量の組合せを用いて生成された上記音声信号を認識する認識部と、上記認識部の認識結果に基づいて当該装置の動作を制御する制御部とを有する、ナビゲーション装置として機能させるためのプログラムが提供される。 In order to solve the above problems, according to another aspect of the present invention, a computer adds a delay to each of a plurality of microphones for voice input and an output signal from the plurality of microphones. An audio signal generation unit that generates an audio signal obtained by adding the added signals, and a microphone system in which a directivity axis is changed by changing a delay amount added to the output signal; Based on the output level of the sampled audio signal obtained by adding the signal obtained by adding the sample delay to the output signal from the microphone, one delay amount is selected from among a plurality of combinations of delay amounts added to each of the output signals. A combination of a delay amount selection unit for selecting a combination and a delay amount selected by the delay amount selection unit. A program for functioning as a navigation device is provided, which includes a recognition unit for recognizing the voice signal generated by using the control unit and a control unit for controlling the operation of the device based on the recognition result of the recognition unit. .

本発明者は、マイクロフォンケーブルの引き出しの対象としてモニタを利用することに着目した。カーナビゲーションシステムにおいて、モニタは、通常、ダッシュボード上に取り付けられている。したがって、モニタからマイクロフォンケーブルを引き出し、例えば、ダッシュボード上にマイクロフォンを取り付けたとしても、マイクロフォンケーブルは従来に比べて短くてすみ、かつ配線作業も極めて簡便である。しかも、モニタから引き出されたマイクロフォンは、ダッシュボードのほか、ステアリングコラム、サンバイザ、Ａピラー部といった従来と同様の個所に取り付けて使用することができるから、ユーザがマイクロフォンを手に持つ必要がない。また、マイクロフォンケーブル自体を無くすことも可能である。つまり、マイクロフォンをモニタを構成する筐体と一体化することによっても、前記課題を解決できることを知見した。この場合、マイクロフォンケーブルが存在しないことになるので、配線作業は全く不要となる。しかも、マイクロフォンケーブルが車室内に露出することがないため、配線が露出することによる見栄えの劣化という心配もなくなる。 The inventor of the present invention paid attention to the use of a monitor as a target for drawing out a microphone cable. In a car navigation system, a monitor is usually mounted on a dashboard. Therefore, even if the microphone cable is pulled out from the monitor and the microphone is mounted on the dashboard, for example, the microphone cable can be shorter than the conventional one and the wiring work is very simple. In addition to the dashboard, the microphone pulled out from the monitor can be mounted and used at the same location as the steering column, sun visor, A-pillar portion, and the user does not need to hold the microphone. It is also possible to eliminate the microphone cable itself. That is, it has been found that the above-mentioned problem can be solved by integrating the microphone with the housing constituting the monitor. In this case, since no microphone cable is present, no wiring work is required. In addition, since the microphone cable is not exposed in the vehicle interior, there is no need to worry about deterioration of appearance due to the exposed wiring.

本発明は以上のような利点を供えた車載機器であって、音声認識を用いて制御することのできる車載機器であって、地図情報その他の情報を表示するためのモニタと、音声入力用のマイクロフォンと、を備え、前記マイクロフォンが前記モニタに接続されることを特徴とする車載機器である。本発明の車載機器において、音声入力用のマイクロフォン（以下、単にマイクロフォンという）を前記モニタに接続する形態としては、マイクロフォン用のケーブルを介して前記モニタに接続する形態と、ケーブルを介さずにマイクロフォンを直接前記モニタに接続する形態がある。ケーブルを介して前記モニタに接続する形態によれば、トランクルームから車室内前部まで配線作業をしていた従来に比べて配線作業が著しく軽減される。また、ケーブルの長さも従来に比べて短くて済む。マイクロフォンを直接モニタに接続する形態によれば、配線作業自体が不要となる。しかも、ケーブルが不要となるため、従来のようにケーブルが車室内に露出することがないため、見栄えが悪い、といったユーザからの苦情を解決することができる。ケーブルを介してマイクロフォンをモニタに取り付けた場合、マイクロフォンを設置する位置は任意である。つまり、従来のように、ダッシュボード、ステアリングコラム、サンバイザ、Ａピラーといった個所にマイクロフォンを設置することができる。マイクロフォンを直接前記モニタに接続する形態としては、以下の２つの形態がある。一般にモニタは、液晶表示素子等の表示部、その他の部分を保持する筐体を備えているが、その筐体外側にマイクロフォンを取り付ける形態と、筐体内部にマイクロフォンを取り付ける（内蔵する）形態である。いずれの場合であっても、マイクロフォンはモニタに設置されることになる。 The present invention is an in-vehicle device that has the advantages as described above, and can be controlled using voice recognition, and includes a monitor for displaying map information and other information, and a voice input device. An in-vehicle device, wherein the microphone is connected to the monitor. In the in-vehicle device according to the present invention, a voice input microphone (hereinafter simply referred to as a microphone) is connected to the monitor as a form connected to the monitor via a microphone cable, or a microphone without a cable. Is directly connected to the monitor. According to the form of connecting to the monitor via a cable, the wiring work is significantly reduced as compared with the conventional case where the wiring work is performed from the trunk room to the front part of the vehicle interior. Also, the length of the cable can be shorter than the conventional one. According to the embodiment in which the microphone is directly connected to the monitor, the wiring work itself becomes unnecessary. In addition, since the cable is not necessary, the cable is not exposed to the vehicle interior as in the conventional case, and it is possible to solve a complaint from the user that the appearance is poor. When the microphone is attached to the monitor via a cable, the position where the microphone is installed is arbitrary. In other words, the microphone can be installed in places such as the dashboard, the steering column, the sun visor, and the A-pillar as in the past. There are the following two forms for directly connecting the microphone to the monitor. In general, a monitor includes a housing for holding a display unit such as a liquid crystal display element and other parts. The monitor is attached to the outside of the housing, and the microphone is attached (built in) to the inside of the housing. is there. In either case, the microphone will be installed on the monitor.

モニタに接続されるマイクロフォンの数は単数に限らず、複数とすることもできる。また、マイクロフォンはモニタに対して着脱自在とすることもできる。つまり、マイクロフォンのケーブルまたはマイクロフォン自体が、前記モニタに対して着脱自在に接続されるようにすることも本発明の範囲内である。マイクロフォンを直接モニタに接続する場合には、マイクロフォンをモニタに対して固定とすることもできるが、その向きを変更可能とすることもできる。マイクロフォンの指向性軸を話者に対して最適な向きとすることが望ましいからである。マイクロフォンの向きを変更する手段としては、ユーザによる手動の他、モータ等の駆動源を用いることができる。 The number of microphones connected to the monitor is not limited to one, but may be plural. Further, the microphone can be detachable from the monitor. That is, it is also within the scope of the present invention that the microphone cable or the microphone itself is detachably connected to the monitor. When the microphone is directly connected to the monitor, the microphone can be fixed with respect to the monitor, but the direction can be changed. This is because it is desirable to set the microphone directivity axis to an optimum direction with respect to the speaker. As means for changing the direction of the microphone, a driving source such as a motor can be used in addition to manual operation by the user.

本発明の車載機器の典型的な適用例はカーナビゲーションシステムである。したがって、本発明では入力された音声を認識するための認識部と、前記認識部で認識された結果を表示するためのモニタと、前記モニタに一体的に接続され、かつ前記音声を入力するためのマイクロフォンと、を備えたことを特徴とするカーナビゲーションシステムが提供される。また、カーオーディオにも適用することができるが、車載機器であれば用途は問わない。また、車室内において、マイクロフォンには音声認識のための音声の他に、エンジン音、その他のノイズが入力される。本発明のカーナビゲーションシステムにおいては、ノイズを抑制するためのノイズ抑制手段を設けることができる。ノイズ抑制手段としては、公知のノイズ抑制手段を広く適用することができるが、本発明において特徴的なノイズ抑制手段は、マイクロフォンをモニタに直接接続した場合に機能する。つまり、モニタ全体の回折効果によりマイクロフォンの感度が向上し、ノイズが抑制される。したがってこの場合には、モニタがノイズ抑制手段を構成することになる。ノイズを抑制する手法として、ノイズ集音用のマイクロフォンを音声入力用のマイクロフォンとは別途設けることが従来より行われている。本発明では、モニタの表裏いずれか一方の面に音声入力用マイクロフォンを設置し、またモニタの他方の面にノイズ入力用マイクロフォンを設置することができる。つまり本発明では、地図情報その他の情報を表示するためのモニタ本体と、前記モニタ本体の表裏いずれか一方の面に設置した音声入力用マイクロフォンと、前記モニタ本体の他方の面に設置したノイズ入力用マイクロフォンと、を備えたことを特徴とするモニタ装置が提供される。ノイズ入力用マイクロフォンは前述のノイズ抑制手段に該当するが、音声入力用マイクロフォンが設置されたモニタ本体の面の裏面に設置する点に特徴を有する。その利点については、後述する発明の実施の形態において説明する。 A typical application example of the vehicle-mounted device of the present invention is a car navigation system. Therefore, in the present invention, a recognition unit for recognizing the input voice, a monitor for displaying a result recognized by the recognition unit, and a single unit connected to the monitor for inputting the sound. And a car navigation system characterized by comprising the above microphone. Moreover, although it can apply also to a car audio, if it is in-vehicle equipment, a use will not ask | require. In the vehicle interior, engine sound and other noises are input to the microphone in addition to the voice for voice recognition. In the car navigation system of the present invention, noise suppression means for suppressing noise can be provided. As the noise suppression means, known noise suppression means can be widely applied, but the characteristic noise suppression means in the present invention functions when a microphone is directly connected to a monitor. That is, the sensitivity of the microphone is improved by the diffraction effect of the entire monitor, and noise is suppressed. Therefore, in this case, the monitor constitutes noise suppression means. As a technique for suppressing noise, it has been conventionally practiced to provide a noise collecting microphone separately from a voice input microphone. In the present invention, an audio input microphone can be installed on either the front or back side of the monitor, and a noise input microphone can be installed on the other side of the monitor. That is, in the present invention, a monitor main body for displaying map information and other information, a voice input microphone installed on one of the front and back surfaces of the monitor main body, and a noise input installed on the other surface of the monitor main body And a microphone for monitoring. The noise input microphone corresponds to the noise suppression means described above, but is characterized in that it is installed on the back surface of the monitor main body on which the voice input microphone is installed. The advantages will be described in the embodiments of the invention described later.

以上説明したように、本発明の車載機器によれば、マイクロフォンをモニタに接続したので、マイクロフォンのための長いケーブルが不要で、かつマイクロフォンフォンケーブル配線作業が不要かまたは簡便となる。 As described above, according to the in-vehicle device of the present invention, since the microphone is connected to the monitor, a long cable for the microphone is unnecessary, and the microphone cable wiring work is unnecessary or simple.

本発明の第１実施形態にかかるカーナビゲーションシステムのシステム構成を示すブロック図である。1 is a block diagram showing a system configuration of a car navigation system according to a first embodiment of the present invention. 本発明の第１実施形態にかかるカーナビゲーションシステムの認識部５の構成を示すブロック図である。It is a block diagram which shows the structure of the recognition part 5 of the car navigation system concerning 1st Embodiment of this invention. 本発明の第１実施形態にかかるカーナビゲーションシステムの認識部５における処理フローを示す図である。It is a figure which shows the processing flow in the recognition part 5 of the car navigation system concerning 1st Embodiment of this invention. 本発明の第１実施形態に係るモニタ６の正面図である。It is a front view of the monitor 6 which concerns on 1st Embodiment of this invention. 本発明の第２実施形態に係るモニタ１６０の正面図であって、マイクロフォン１４０が取り付けられた状態を示す図である。It is a front view of the monitor 160 which concerns on 2nd Embodiment of this invention, Comprising: It is a figure which shows the state in which the microphone 140 was attached. 本発明の第２実施形態に係るモニタ１６０の正面図であって、マイクロフォン１４０を取り外した状態を示す図である。It is a front view of the monitor 160 which concerns on 2nd Embodiment of this invention, Comprising: It is a figure which shows the state which removed the microphone 140. FIG. 本発明の第２実施形態に係るマイクロフォン１４０の構造を示す図である。It is a figure which shows the structure of the microphone 140 which concerns on 2nd Embodiment of this invention. 本発明の第２実施形態に係るマイクロフォン１４０の他の構造を示す図である。It is a figure which shows the other structure of the microphone 140 which concerns on 2nd Embodiment of this invention. 本発明の第３実施形態に係るモニタ２６０の正面図である。It is a front view of the monitor 260 which concerns on 3rd Embodiment of this invention. 本発明の第３実施形態に係るモニタ２６０の背面図である。It is a rear view of the monitor 260 which concerns on 3rd Embodiment of this invention. ノイズ抑制のための電気的な仕組みの１例を示す図である。It is a figure which shows one example of the electrical mechanism for noise suppression. 本発明の第４実施形態に係るモニタ３６０の正面図である。It is a front view of the monitor 360 which concerns on 4th Embodiment of this invention. 本発明の第４実施形態に係るモニタ３６０の平面図である。It is a top view of the monitor 360 which concerns on 4th Embodiment of this invention. 音源推定手段の１例を説明するための図である。It is a figure for demonstrating an example of a sound source estimation means. 本発明の第５実施形態に係るモニタ４６０の側面図である。It is a side view of the monitor 460 which concerns on 5th Embodiment of this invention. 本発明の第５実施形態に係るモニタ４６０の正面図である。It is a front view of the monitor 460 which concerns on 5th Embodiment of this invention.

＜第１実施形態＞
以下、添付図面に示す実施の形態に基づいてこの発明を詳細に説明する。図１は第１実施形態におけるカーナビゲーションシステムの全体的なシステム構成を説明するための図である。この図において、符号１はナビゲーション装置の全体を司る演算処理部、２は自らの位置を計測する測位部、３はカーナビゲーションシステムの操作手段としてのリモートコントローラ、４はリモートコントローラ３からの操作情報を受信する受信部、５は入力された音声を認識処理する認識部、６は地図情報その他の情報を表示するためのモニタである。 <First Embodiment>
Hereinafter, the present invention will be described in detail based on embodiments shown in the accompanying drawings. FIG. 1 is a diagram for explaining the overall system configuration of the car navigation system according to the first embodiment. In this figure, reference numeral 1 is an arithmetic processing unit that controls the entire navigation apparatus, 2 is a positioning unit that measures its own position, 3 is a remote controller as operation means of the car navigation system, and 4 is operation information from the remote controller 3. 5 is a recognition unit for recognizing input speech, and 6 is a monitor for displaying map information and other information.

ここで、演算処理部１は、その全体を制御するＣＰＵ７と、所定のプログラムが格納されたＲＯＭ８と、各種データを記憶するＲＡＭ９と、前記測位部２、受信部４、認識部５、モニタ６等との間で入出力を行うＩ／Ｏ回路部１０とを備えている。測位部２は、ＧＰＳ(GlobalPositioningSystem：全地球測位システム)により測位を行うもので、高度約２万ｋｍ上の複数(通常４つ以上)のＧＰＳ衛星から発信された１５７５.４２ＭＨｚの信号(電波)を図示しない受信アンテナで受信し、この信号に基づき測位計算を行い、現在位置を示す測位データを演算処理部１に転送する。また、このカーナビゲーションシステムの各種操作は、リモートコントローラ３により行うことができる。このリモートコントローラ３は、例えば操作メニューをモニタ６に表示させたり、モニタ６への電子地図の表示範囲を変更したり、モニタ６に表示された各種操作メニュー等においてカーソルを移動させたり、することができる。そして、このリモートコントローラ３による操作情報に基づく信号はワイヤレスで送信され、これが受信部４において受信された後、演算処理部１に伝達されてその操作内容が実行される。 Here, the arithmetic processing unit 1 includes a CPU 7 that controls the whole, a ROM 8 that stores predetermined programs, a RAM 9 that stores various data, the positioning unit 2, the receiving unit 4, the recognizing unit 5, and the monitor 6. Etc., and an I / O circuit unit 10 that performs input / output with the device. The positioning unit 2 performs positioning by GPS (Global Positioning System), and a 1575.42 MHz signal (radio wave) transmitted from a plurality of (usually four or more) GPS satellites at an altitude of about 20,000 km. Is received by a receiving antenna (not shown), positioning calculation is performed based on this signal, and positioning data indicating the current position is transferred to the arithmetic processing unit 1. Various operations of the car navigation system can be performed by the remote controller 3. For example, the remote controller 3 displays an operation menu on the monitor 6, changes the display range of the electronic map on the monitor 6, or moves the cursor on various operation menus displayed on the monitor 6. Can do. And the signal based on the operation information by this remote controller 3 is transmitted wirelessly, and after this is received by the receiving part 4, it is transmitted to the arithmetic processing part 1, and the operation content is performed.

モニタ６には音声入力用のマイクロフォン４０が接続してある。このマイクロフォン４０で入力された音声によっても、リモートコントローラ３と同様にカーナビゲーションシステムの各種操作を制御することができるし、その結果がモニタ６に表示される。例えば、行き先を音声で指示することにより、その行き先を含む地図情報をモニタ６に表示させることができる。マイクロフォン４０に入力された音声は、認識部５で認識処理がなされる。認識部５は図２に示すように、ＣＰＵ５１と、所定のプログラムが格納されたＲＯＭ５２と、各種データを記憶するＲＡＭ５３とから構成される。認識部５では図３に示すように認識処理が実行される。すなわち、入力された音声信号をアナログ値からデジタル値に変換する。変換された音声信号に対してノイズ抑制処理を施した後に、音響分析を行う。ノイズ抑制処理は公知の手法を適用することができる。音響分析においては、音声の特徴量を抽出する。その後、この抽出された特徴量と標準パターンとを比較することにより音声認識がなされる。認識された結果は、モニタ６に表示される。さらに、このカーナビゲーションシステムには、図１に示したように、地図情報その他の情報を記憶した記憶媒体１１が備えられている。 A microphone 40 for voice input is connected to the monitor 6. Various operations of the car navigation system can be controlled by the sound input from the microphone 40 as well as the remote controller 3, and the result is displayed on the monitor 6. For example, by designating the destination by voice, the map information including the destination can be displayed on the monitor 6. The voice input to the microphone 40 is recognized by the recognition unit 5. As shown in FIG. 2, the recognition unit 5 includes a CPU 51, a ROM 52 that stores a predetermined program, and a RAM 53 that stores various data. The recognition unit 5 performs recognition processing as shown in FIG. That is, the input audio signal is converted from an analog value to a digital value. An acoustic analysis is performed after noise suppression processing is performed on the converted audio signal. A known method can be applied to the noise suppression process. In acoustic analysis, a feature amount of speech is extracted. Thereafter, speech recognition is performed by comparing the extracted feature quantity with the standard pattern. The recognized result is displayed on the monitor 6. Furthermore, as shown in FIG. 1, the car navigation system includes a storage medium 11 that stores map information and other information.

図４は、モニタ６の正面図を示す。モニタ６は、地図情報その他を表示するための表示部６１と、表示部６１その他を収容するモニタ筐体６２とから構成される。モニタ６のモニタ筐体６２からは、マイクロフォン４０のマイクロフォンケーブル４１が引き出されている。本実施の形態においてマイクロフォンケーブル４１はモニタ筐体６２に固着させているが、着脱自在とすることもできる。つまり、モニタ筐体６２にコネクタを内蔵し、そのコネクタに対してマイクロフォンケーブル４１を着脱可能とする。モニタ６はコラム６３を介してダッシュボード、コンソールボックス等の車室内前部に取り付けられる。したがって、マイクロフォン４０を、従来のように、サンバイザ、あるいはダッシュボード上に取り付けたとすると、マイクロフォンケーブル４１の長さは１〜２ｍ程度あれば十分に足りる。なお、マイクロフォン４０を取り付ける位置を限定するものではなく、サンバイザ、ダッシュボードの他、車室内の任意の位置に取り付けることができる。もっとも、マイクロフォン４０に対する話者は、運転手または助手席に座っている者であろうから、車室内前部のいずれかの位置ということになろう。 FIG. 4 shows a front view of the monitor 6. The monitor 6 includes a display unit 61 for displaying map information and the like, and a monitor housing 62 that houses the display unit 61 and the like. From the monitor housing 62 of the monitor 6, the microphone cable 41 of the microphone 40 is drawn out. In the present embodiment, the microphone cable 41 is fixed to the monitor housing 62, but may be detachable. That is, the monitor housing 62 has a built-in connector, and the microphone cable 41 can be attached to and detached from the connector. The monitor 6 is attached to a front part of a vehicle interior such as a dashboard or a console box via a column 63. Therefore, if the microphone 40 is mounted on a sun visor or dashboard as in the prior art, the length of the microphone cable 41 of about 1 to 2 m is sufficient. In addition, the position where the microphone 40 is attached is not limited, and the microphone 40 can be attached at any position in the vehicle cabin other than the sun visor and the dashboard. However, since the speaker with respect to the microphone 40 may be a driver or a person sitting in the passenger seat, it will be at any position in the front part of the passenger compartment.

また、本実施の形態によれば、従来のようにトランクルームからマイクロフォンケーブルを車室内前部まで配線するのに比べて、モニタ６が車室内前部に配置されているため、マイクロフォンケーブル４１の配線作業は簡単である。本実施の形態において、マイクロフォン４０およびマイクロフォンケーブル４１は、従来公知のものを適用することができる。たとえば、マイクロフォン４０としては、単一指向性マイクロフォンおよび無指向性マイクロフォンがあるが、いずれでも使用することができる。ただし、単一指向性マイクロフォンの方が、ノイズの集音の可能性が低くなることから望ましい。また、本実施の形態ではマイクロフォン４０は単数としているが、複数取り付けてもよい。 Further, according to the present embodiment, the monitor 6 is disposed in the front part of the vehicle interior as compared with the conventional case where the microphone cable is wired from the trunk room to the front part of the vehicle interior. The work is simple. In the present embodiment, conventionally known microphones 40 and microphone cables 41 can be applied. For example, as the microphone 40, there are a unidirectional microphone and an omnidirectional microphone, either of which can be used. However, the unidirectional microphone is preferable because the possibility of noise collection is reduced. Moreover, although the microphone 40 is single in this Embodiment, you may attach multiple.

＜第２実施形態＞
本発明にかかる第２実施形態を図５および図６に基づき説明する。なお、第２実施形態にかかるカーナビゲーションシステムの基本構成は、第１実施形態と同様であるので、ここでは相違点を中心に説明する。図５に示すように、第２実施形態にかかるモニタ１６０は、地図情報その他を表示するための表示部１６１と、表示部１６１その他を収容するモニタ筐体１６２とから構成される。第２実施形態が第１実施形態と相違する点は、第１実施形態がマイクロフォン４０とモニタ筐体６２とがマイクロフォンケーブル４１を介して接続されているのに対して、第２実施形態はマイクロフォン１４０がモニタ筐体１６２に直接接続されている点である。また、モニタ筐体１６２にはマイクロフォン１４０と接続されるコネクタが内蔵されており、図６に示すようにマイクロフォン１４０はモニタ筐体１６２に着脱自在となっている。 Second Embodiment
A second embodiment according to the present invention will be described with reference to FIGS. Note that the basic configuration of the car navigation system according to the second embodiment is the same as that of the first embodiment, and therefore, differences will be mainly described here. As shown in FIG. 5, the monitor 160 according to the second embodiment includes a display unit 161 for displaying map information and the like, and a monitor housing 162 that houses the display unit 161 and the like. The difference between the second embodiment and the first embodiment is that the microphone 40 and the monitor housing 62 are connected via the microphone cable 41 in the first embodiment, whereas the microphone in the second embodiment is a microphone. 140 is directly connected to the monitor housing 162. The monitor housing 162 has a built-in connector for connecting to the microphone 140, and the microphone 140 is detachable from the monitor housing 162 as shown in FIG.

以上の第２実施形態によれば、マイクロフォン１４０がモニタ筐体１６２に直接接続されているので、マイクロフォンケーブルの配線作業が全く不要となる。また、マイクロフォンケーブルが車室内に露出することがない。車室内にマイクロフォンケーブルが露出すると見栄えが悪いと嫌うユーザもいるが、この第２実施形態は、このようなユーザの要望に応えることができる。また、この第２実施形態のようにモニタ１６０にマイクロフォン１４０を直接接続、つまり一体的に取り付けることにより、ノイズ抑制効果が期待できる。つまり、モニタ１６０全体の回折効果によりマイクロフォン１４０の感度が向上するためである。この回折効果による感度向上は、人間が小さい音を聞くときに耳の後ろに手のひらをあてることにより経験することができる。 According to the second embodiment described above, since the microphone 140 is directly connected to the monitor housing 162, the work of wiring the microphone cable is completely unnecessary. Further, the microphone cable is not exposed in the vehicle interior. Although some users dislike that the appearance of the microphone cable is poor when the microphone cable is exposed in the vehicle interior, the second embodiment can meet such user demands. In addition, a noise suppression effect can be expected by directly connecting the microphone 140 to the monitor 160, that is, as a single unit, as in the second embodiment. That is, the sensitivity of the microphone 140 is improved by the diffraction effect of the entire monitor 160. This improvement in sensitivity due to the diffraction effect can be experienced by placing a palm behind the ear when a human hears a small sound.

本第２実施形態において、マイクロフォン１４０は、図７に示すように、マイクロフォン本体１４１とマイクロフォン筐体１４２とから構成される。マイクロフォン本体１４１は単一指向性のマイクロフォンから構成されているが、この指向特性をマイクロフォン筐体１４２と組み合わせることにより制御し、音声を増幅しノイズを抑制している。単一指向性のマイクロフォン本体１４１は、正面の音響端子と背面の音響端子間の距離、つまり音響２端子間距離、を長く取ることにより指向性を向上することができる。図７はマイクロフォン１４０の正面図（左側）および側面図（右側）を示しているが、マイクロフォン筐体１４２は、貫通孔１４２ｈを有する断面がかまくら型をなしていることがわかる。マイクロフォン本体１４１は、この貫通孔１４２ｈの貫通方向の中間部に配置されている。マイクロフォン本体１４１単体の音響２端子間距離がｄ１であるのに対して、マイクロフォン筐体１４２と組み合わせることにより、マイクロフォン１４０としての音響２端子間距離ｄ２を、マイクロフォン本体１４１単体の音響２端子間距離ｄ１より大きくすることができる。市販されているマイクロフォン本体１４１は図７に示すように単純な円柱形であることが多いが、本実施の形態によれば、市販のマイクロフォン本体１４１を用い、かつモニタ１６０にマイクロフォン本体１４１を取り付けるためのマイクロフォン筐体１４２と組み合わせることにより、音響２端子間距離を大きくすることができるという利点がある。本実施の形態ではマイクロフォン筐体１４２をかまくら型としたが、本発明はこれに限定されない。マイクロフォン本体１４０をモニタ１６０に取り付ける機能と、音響２端子間距離を大きくすることができる機能を備えていれば、如何なる形態であっても本発明にとって有効である。図８にマイクロフォン１４０の他の構成例を示す。図８のマイクロフォン１４０は、図７に示したマイクロフォン１４０と基本的には同様の構成からなるが、マイクロフォン筐体１４２の上部に貫通孔１４２ｉを形成している。この貫通孔１４２ｉを空気が流通することにより、マイクロフォン１４０が日射などにより高熱になることを防止することができる。 In the second embodiment, the microphone 140 includes a microphone body 141 and a microphone casing 142 as shown in FIG. The microphone main body 141 is composed of a unidirectional microphone, and this directional characteristic is controlled in combination with the microphone casing 142 to amplify sound and suppress noise. The unidirectional microphone main body 141 can improve directivity by increasing the distance between the front acoustic terminal and the rear acoustic terminal, that is, the distance between the two acoustic terminals. FIG. 7 shows a front view (left side) and a side view (right side) of the microphone 140, and it can be seen that the microphone housing 142 has a cross section having a through hole 142h. The microphone main body 141 is disposed at an intermediate portion of the through hole 142h in the penetrating direction. The distance between the two sound terminals of the microphone main body 141 is d1, whereas the distance between the two sound terminals d2 as the microphone 140 is set to the distance between the two sound terminals of the microphone main body 141 by combining with the microphone housing 142. It can be larger than d1. The commercially available microphone main body 141 is often a simple columnar shape as shown in FIG. 7, but according to the present embodiment, the commercially available microphone main body 141 is used and the microphone main body 141 is attached to the monitor 160. In combination with the microphone housing 142, there is an advantage that the distance between the two sound terminals can be increased. In the present embodiment, the microphone casing 142 is a kamakura type, but the present invention is not limited to this. Any form is effective for the present invention as long as it has a function of attaching the microphone body 140 to the monitor 160 and a function of increasing the distance between the two sound terminals. FIG. 8 shows another configuration example of the microphone 140. The microphone 140 of FIG. 8 has basically the same configuration as the microphone 140 shown in FIG. 7, but has a through hole 142 i formed in the upper portion of the microphone casing 142. When air flows through the through-hole 142i, the microphone 140 can be prevented from becoming hot due to solar radiation or the like.

＜第３実施形態＞
本発明にかかる第３実施形態を図９および図１０に基づき説明する。なお、第３実施形態にかかるカーナビゲーションシステムの基本構成は、第１〜第２実施形態と同様であるので、ここでは相違点を中心に説明する。また、図９は第３実施形態にかかるモニタ２６０の正面図であり、図１０は同背面図である。図９に示すように、第３実施形態にかかるモニタ２６０は、地図情報その他を表示するための表示部２６１と、表示部２６１その他を収容するモニタ筐体２６２とから構成される。第３実施形態が第２実施形態と相違する点は、第２実施形態がマイクロフォン１４０がモニタ筐体１６２の外部に取り付けられているのに対して、第３実施形態はマイクロフォン２４０がモニタ筐体２６２内部に埋め込まれている点である。 <Third Embodiment>
A third embodiment according to the present invention will be described with reference to FIGS. Note that the basic configuration of the car navigation system according to the third embodiment is the same as that of the first and second embodiments, and therefore, differences will be mainly described here. FIG. 9 is a front view of a monitor 260 according to the third embodiment, and FIG. 10 is a rear view thereof. As shown in FIG. 9, the monitor 260 according to the third embodiment includes a display unit 261 for displaying map information and the like, and a monitor housing 262 for housing the display unit 261 and the like. The third embodiment differs from the second embodiment in that the microphone 140 is attached to the outside of the monitor housing 162 in the second embodiment, whereas the microphone 240 is a monitor housing in the third embodiment. It is the point embedded in 262.

以上の第３実施形態によれば、マイクロフォン２４０がモニタ筐体２６２に直接接続されている。したがって、第２実施形態と同様に、マイクロフォンケーブルの配線作業が全く不要となる、マイクロフォンケーブルが車室内に露出することがない、といった効果を有する。しかも、マイクロフォン２４０が、モニタ筐体２６２内部に埋め込まれているので、第２実施形態のようにマイクロフォン１４０がモニタ１６０の外部に突出することなくすっきりとした感じとなる。また、第３実施形態によっても、第２実施形態と同様に、マイクロフォン２４０をモニタ２６０に取り付けているので、モニタ２６０全体の回折効果によりマイクロフォン２４０の感度を向上することができる。 According to the third embodiment described above, the microphone 240 is directly connected to the monitor housing 262. Therefore, similarly to the second embodiment, there is an effect that the wiring work of the microphone cable is not required at all, and the microphone cable is not exposed in the vehicle interior. In addition, since the microphone 240 is embedded in the monitor housing 262, the microphone 140 does not protrude outside the monitor 160 as in the second embodiment, and the user feels neat. Also in the third embodiment, since the microphone 240 is attached to the monitor 260 as in the second embodiment, the sensitivity of the microphone 240 can be improved by the diffraction effect of the entire monitor 260.

第３実施形態では、図１０に示すように、モニタ２６０の背面側に、ノイズを集音するためのノイズマイクロフォン２４１をモニタ筐体２６２に内蔵させている。図９に示したマイクロフォン２４０は、音声のみならず雑音をも集音する。一方、ノイズマイクロフォン２４１は雑音を集音する。したがって、マイクロフォン２４０による「音声信号＋雑音信号」からノイズマイクロフォン２４１による「雑音信号」を減ずれば、音声信号のみを抽出することができる。このように一方のマイクロフォンを「音声信号＋雑音信号」用に、他方のマイクロフォンを「雑音信号」用に用い、「音声信号」のみを抽出する技術は既に知られている。しかし、第３実施形態のように、モニタ２６０の表裏面にそれぞれ「音声信号＋雑音信号」用のマイクロフォン２４０および「雑音信号」用のノイズマイクロフォン２４１を配置する構成は新規である。 In the third embodiment, as shown in FIG. 10, a noise microphone 241 for collecting noise is built in the monitor housing 262 on the back side of the monitor 260. The microphone 240 shown in FIG. 9 collects not only voice but also noise. On the other hand, the noise microphone 241 collects noise. Therefore, if the “noise signal” from the noise microphone 241 is subtracted from the “voice signal + noise signal” from the microphone 240, only the voice signal can be extracted. As described above, a technique of using only one microphone for “voice signal + noise signal” and the other microphone for “noise signal” and extracting only “voice signal” is already known. However, as in the third embodiment, the configuration in which the “sound signal + noise signal” microphone 240 and the “noise signal” noise microphone 241 are arranged on the front and back surfaces of the monitor 260 is novel.

マイクロフォン２４０とノイズマイクロフォン２４１との距離が離れていると、マイクロフォン２４０で集められる雑音信号とノイズマイクロフォン２４１で集められる雑音信号とが相違するため、「音声信号＋雑音信号」から「雑音信号」を減ずるという処理が現実的でなくなる。したがって、マイクロフォン２４０とノイズマイクロフォン２４１とは近接した位置に配置されることが望ましい。ところが、ノイズマイクロフォン２４１は、雑音信号がほしい信号であるから、話者の音声を集めにくい位置に配置されるべきである。つまり、この点を考慮すると、マイクロフォン２４０とノイズマイクロフォン２４１とは、あまり近い位置に配置することは望ましくない。しかるに、第３実施形態のように、モニタ２６０の背面側にノイズマイクロフォン２４１を配置すれば、話者からの音声はモニタ２６０により遮られる。つまり、ノイズマイクロフォン２４１は、話者の音声を集めにくい位置に配置されているということができる。しかも、ノイズマイクロフォン２４１は、マイクロフォン２４０と近接した位置に配置すべきであるという要求をも満足することができる。したがって、「音声信号＋雑音信号」から「雑音信号」を減ずるという処理が、有効なものとなる。 When the distance between the microphone 240 and the noise microphone 241 is large, the noise signal collected by the microphone 240 and the noise signal collected by the noise microphone 241 are different. Therefore, the “noise signal” is changed from “voice signal + noise signal”. The process of decreasing becomes unrealistic. Therefore, it is desirable that the microphone 240 and the noise microphone 241 are disposed at close positions. However, since the noise microphone 241 is a signal for which a noise signal is desired, it should be placed at a position where it is difficult to collect the voice of the speaker. That is, in consideration of this point, it is not desirable to arrange the microphone 240 and the noise microphone 241 at very close positions. However, if the noise microphone 241 is disposed on the back side of the monitor 260 as in the third embodiment, the sound from the speaker is blocked by the monitor 260. That is, it can be said that the noise microphone 241 is arranged at a position where it is difficult to collect the voice of the speaker. In addition, it is possible to satisfy the requirement that the noise microphone 241 should be disposed at a position close to the microphone 240. Therefore, the process of subtracting “noise signal” from “voice signal + noise signal” is effective.

第３実施形態ではマイクロフォン２４０およびノイズマイクロフォン２４１を各々２個ずつ設けた例を示したが、本発明はこれに限定されず、各々１個ずつ設けてもよいし、各々３個以上設けてもよい。また、マイクロフォン２４０およびノイズマイクロフォン２４１を配置する位置についても、第３実施形態ではモニタ２６０の上部両端としたが、これに限定されず、マイクロフォン２４０およびノイズマイクロフォン２４１の個数に応じて適宜定めることができる。ノイズ抑制は、以上の他に、例えば図１１に示すような電気的な仕組みによっても行うこともできる。図１１において、Ｍがマイクロフォン、Ａがアンプ、τが１サンプルのディレイを示しているが、このような公知の電気的なノイズ抑制手段を設けることができる。 In the third embodiment, an example in which two microphones 240 and two noise microphones 241 are provided has been described. However, the present invention is not limited to this, and one or two or more microphones may be provided. Good. Further, the positions where the microphones 240 and the noise microphones 241 are arranged are also the upper ends of the monitor 260 in the third embodiment, but the present invention is not limited to this, and may be appropriately determined according to the number of the microphones 240 and noise microphones 241. it can. In addition to the above, noise suppression can also be performed by an electrical mechanism as shown in FIG. In FIG. 11, M represents a microphone, A represents an amplifier, and τ represents a delay of one sample. However, such known electrical noise suppression means can be provided.

＜第４実施形態＞
本発明にかかる第４実施形態を図１２および図１３に基づき説明する。なお、第４実施形態にかかるカーナビゲーションシステムの基本構成は、第１〜第３実施形態と同様であるので、ここでは相違点を中心に説明する。また、図１２は第４実施形態にかかるモニタ３６０の正面図であり、図１３は平面図である。図１２に示すように、第４実施形態にかかるモニタ３６０は、地図情報その他を表示するための表示部３６１と、表示部３６１その他を収容するモニタ筐体３６２とから構成される。第４実施形態は、第２実施形態と同様に、モニタ筐体３６２にマイクロフォン３４０が直接接続されている。第２実施形態では、モニタ筐体１６２に取り付けられたマイクロフォン１４０は着脱自在ではあったものの、その向きを変えることはできなかった。ところが、第４実施形態では、マイクロフォン３４０の向きを変えることが可能に構成されている。つまり、図１３に示すように、マイクロフォン３４０は、モニタ３６０に対して回動可能に取り付けられている。したがって、話者、つまり音源に対してマイクロフォン３４０の指向性軸を向けることができる。 <Fourth embodiment>
A fourth embodiment according to the present invention will be described with reference to FIGS. Since the basic configuration of the car navigation system according to the fourth embodiment is the same as that of the first to third embodiments, the differences will be mainly described here. FIG. 12 is a front view of a monitor 360 according to the fourth embodiment, and FIG. 13 is a plan view. As shown in FIG. 12, the monitor 360 according to the fourth embodiment includes a display unit 361 for displaying map information and the like, and a monitor housing 362 for housing the display unit 361 and the like. In the fourth embodiment, similarly to the second embodiment, a microphone 340 is directly connected to the monitor housing 362. In the second embodiment, the microphone 140 attached to the monitor housing 162 was detachable, but the direction could not be changed. However, in the fourth embodiment, the direction of the microphone 340 can be changed. That is, as shown in FIG. 13, the microphone 340 is attached to the monitor 360 so as to be rotatable. Therefore, the directivity axis of the microphone 340 can be directed to the speaker, that is, the sound source.

マイクロフォン３４０の回動は、手動で行うこととしてもよいし、モータ等の駆動源を用いて行うようにしてもよい。第４実施形態のように複数のマイクロフォン３４０を設ける場合には、リンク機構その他を利用して複数のマイクロフォン３４０を同時に回動するようにすることができる。また、第４実施形態では、水平方向にマイクロフォン３４０の向きを変える例を示したが、鉛直方向に向きを変えることもできる。また、各話者に適したマイクロフォン３４０の向きを予め登録（プリセット）しておき、その話者がドライバとなった際に、プリセットしておいた向きにマイクロフォン３４０が向くようにすることもできる。例えば、第４実施形態によるナビゲーション装置を搭載した自動車を家族で使用する場合には、その家族で当該自動車を運転する者毎にマイクロフォン３４０の向きをプリセットしておく。そして、自分が運転する場合には、プリセットしておいたマイクロフォン３４０の向きを読み出し、その向きにマイクロフォン３４０の向きをセットさせる。 The microphone 340 may be rotated manually or using a driving source such as a motor. When a plurality of microphones 340 are provided as in the fourth embodiment, the plurality of microphones 340 can be rotated simultaneously using a link mechanism or the like. In the fourth embodiment, the example in which the direction of the microphone 340 is changed in the horizontal direction has been described. However, the direction can be changed in the vertical direction. Moreover, the direction of the microphone 340 suitable for each speaker can be registered (preset) in advance, and when the speaker becomes a driver, the microphone 340 can be directed in the preset direction. . For example, when a car equipped with the navigation device according to the fourth embodiment is used by a family, the direction of the microphone 340 is preset for each person who drives the car in the family. When the user is driving, the preset orientation of the microphone 340 is read, and the orientation of the microphone 340 is set to that orientation.

マイクロフォン３４０の向きのプリセットは、音源推定手段を用いることにより実現することができる。図１４に音源推定手段の１例を示すが、図１１に示したノイズ抑制手段を利用したものである。図１１に示したノイズ抑制手段は、τ（１サンプルディレイ）が各マイクロフォンＭに付加されているが、そのディレイ量をソフト的に増減させることでマイクロフォンシステムとしての指向性軸を変えることができるというものである。例えば、図１４に示すように４つのマイクロフォンＭ１〜Ｍ４が等間隔に配置されていたとする。出力信号系統とは別に音源推定系を設け、その中で逐次ディレイ量変更を行う。はじめに、マイクロフォンＭ１の出力には６サンプルディレイ、マイクロフォンＭ２には４サンプルディレイおよびマイクロフォンＭ３には２サンプルディレイを加算し、マイクロフォンＭ４にはディレイ無し、とし、それらの出力を加算して音声出力レベルを検出する。次に、マイクロフォンＭ１の出力にはディレイ無し、マイクロフォンＭ２には２サンプルディレイおよびマイクロフォンＭ３には４サンプルディレイおよびマイクロフォンＭ４には６サンプルディレイを加算、とし、それらの出力を加算して音声出力レベルを検出する。さらに、全てのマイクロフォンＭ１〜Ｍ４にディレイをかけない場合の音声出力レベルを検出する。これら３種類の音声出力レベルの中で、最もレベルの高いものを選び、そのときのディレイのかけかたを出力信号系統にコピーして音源推定処理を行う。マイクロフォン３４０のプリセットについては、上記の音源推定手段の他に、公知の如何なる手段を採用してもよい。 The preset of the direction of the microphone 340 can be realized by using sound source estimation means. FIG. 14 shows an example of the sound source estimation means, which uses the noise suppression means shown in FIG. In the noise suppression means shown in FIG. 11, τ (one sample delay) is added to each microphone M, but the directivity axis as the microphone system can be changed by increasing or decreasing the delay amount in software. That's it. For example, assume that four microphones M1 to M4 are arranged at equal intervals as shown in FIG. A sound source estimation system is provided separately from the output signal system, and the delay amount is sequentially changed therein. First, a 6-sample delay is added to the output of the microphone M1, a 4-sample delay is added to the microphone M2, and a 2-sample delay is added to the microphone M3, and no delay is added to the microphone M4. Is detected. Next, there is no delay in the output of the microphone M1, 2 sample delay is added to the microphone M2, 4 sample delay is added to the microphone M3, and 6 sample delay is added to the microphone M4. Is detected. Furthermore, the audio output level when no delay is applied to all the microphones M1 to M4 is detected. Among these three types of audio output levels, the one with the highest level is selected, and the method of delaying at that time is copied to the output signal system to perform sound source estimation processing. As for the preset of the microphone 340, any known means may be employed in addition to the sound source estimating means described above.

＜第５実施形態＞
本発明にかかる第５実施形態を図１５および図１６に基づき説明する。なお、第５実施形態にかかるナビゲーションシステムの基本構成は第１実施形態と同様であり、マイクロフォン４４０はマイクロフォンケーブル４４１を介してモニタ４６０に接続された構成となっている。図１５および図１６に示すように、第５実施形態にかかるモニタ４６０は地図情報その他を表示するための表示部４６１と、表示部４６１その他を収容するモニタ筐体４６２とから構成される。モニタ４６０は、コラム４７０を介して車室内前部の任意の位置に取り付けられる。一般的には、ダッシュボードの車体幅方向の中央部に取り付ける。コラム４７０からマイクロフォン４４０を固定するためのＬ字状かつ中空のアーム４８０が延設されている。アーム４８０の先端部にマイクロフォン４４０が取り付けられている。マイクロフォン４４０は、モニタ４６０の表面に近接配置されている。マイクロフォン４４０は、マイクロフォンケーブル４４１を介してモニタ４６０に接続されている。マイクロフォンケーブル４４１は、モニタ４６０の背面から引き出され、アーム４８０の中空部に配線されてマイクロフォン４４０に接続される。 <Fifth Embodiment>
A fifth embodiment according to the present invention will be described with reference to FIGS. 15 and 16. The basic configuration of the navigation system according to the fifth embodiment is the same as that of the first embodiment, and the microphone 440 is connected to the monitor 460 via the microphone cable 441. As shown in FIGS. 15 and 16, the monitor 460 according to the fifth embodiment includes a display unit 461 for displaying map information and the like, and a monitor housing 462 for housing the display unit 461 and the like. The monitor 460 is attached to an arbitrary position in the front part of the vehicle interior via the column 470. Generally, it is attached to the center of the dashboard in the vehicle width direction. An L-shaped and hollow arm 480 for fixing the microphone 440 is extended from the column 470. A microphone 440 is attached to the tip of the arm 480. The microphone 440 is disposed close to the surface of the monitor 460. The microphone 440 is connected to the monitor 460 via the microphone cable 441. The microphone cable 441 is pulled out from the back surface of the monitor 460, wired in the hollow portion of the arm 480, and connected to the microphone 440.

第２（第３，第４）実施形態のように、マイクロフォン１４０（２４０，３４０）をモニタ１６０（２６０，３６０）に直接接続した場合には、モニタ１６０（２６０，３６０）全体の回折効果によりマイクロフォン１４０（２４０，３４０）の感度を向上することができた。第５実施形態の場合には、マイクロフォン４４０をマイクロフォンケーブル４４１を介してモニタ４６０に接続されているが、アーム４８０を用いてマイクロフォン４４０をモニタ４６０の表面に近接配置している。したがって、モニタ４６０全体の回折効果により、マイクロフォン４４０の感度を向上することができる。 When the microphone 140 (240, 340) is directly connected to the monitor 160 (260, 360) as in the second (third, fourth) embodiment, the diffraction effect of the entire monitor 160 (260, 360) is caused. The sensitivity of the microphone 140 (240, 340) could be improved. In the case of the fifth embodiment, the microphone 440 is connected to the monitor 460 via the microphone cable 441, but the microphone 440 is disposed close to the surface of the monitor 460 using the arm 480. Therefore, the sensitivity of the microphone 440 can be improved by the diffraction effect of the entire monitor 460.

１…演算処理部、３…リモートコントローラ、５…認識部、６，１６０，２６０，３６０，４６０…モニタ、１１…記憶媒体、４０，１４０，２４０，３４０，４４０…マイクロフォン、４１，４４１…マイクロフォンケーブル、６１，１６１，２６１，３６１，４６１…表示部、６２，１６２，２６２，３６２，４６２…モニタ筐体 DESCRIPTION OF SYMBOLS 1 ... Arithmetic processing part, 3 ... Remote controller, 5 ... Recognition part, 6,160,260,360,460 ... Monitor, 11 ... Storage medium, 40,140,240,340,440 ... Microphone, 41,441 ... Microphone Cable, 61, 161, 261, 361, 461 ... display unit, 62, 162, 262, 362, 462 ... monitor housing

Claims

A plurality of microphones for voice input; and an audio signal generation unit that adds a delay to each of output signals from the plurality of microphones and generates an audio signal obtained by adding the signals to which the delay has been added. A microphone system in which the directivity axis is changed by changing the amount of delay applied to the output signal;
Based on the output level of the sample audio signal obtained by adding the signal obtained by adding the sample delay to the output signal from the plurality of microphones, one of the combinations of the delay amounts to be added to each of the output signals is A delay amount selection section for selecting a combination of delay amounts;
A recognition unit for recognizing the audio signal generated using the combination of delay amounts selected by the delay amount selection unit;
A control unit for controlling the operation of the apparatus based on the recognition result of the recognition unit;
A navigation device comprising:

The navigation device according to claim 1, wherein the delay amount selection unit selects the combination of the delay amounts having the highest output level.

A storage unit for storing a combination of delay amounts determined by the delay amount determination unit in association with a speaker as the sound source;
Further comprising
The navigation device according to claim 1, wherein the voice signal generation unit generates the voice signal using a combination of delay amounts stored in the storage unit in association with a designated speaker.

4. The delay amount selection unit according to claim 1, wherein the delay amount selection unit detects the output level for a plurality of combinations of the delay amounts, and selects the combination of the delay amounts that maximizes the output level. 5. Navigation device.

Output signals from the plurality of microphones are respectively input to the sound source estimation system having the delay amount selection unit and the output signal system having the audio signal generation unit, the recognition unit, and the control unit.
2. The navigation device according to claim 1, wherein the delay amount selection unit performs the delay amount combination selection process during a period in which the control unit performs control based on the recognition result in the output signal system.

The navigation device according to claim 1, wherein the plurality of microphones are arranged at equal intervals.

Acquiring sound with a plurality of microphones;
Adding a sample delay to each of output signals from the plurality of microphones;
Generating a sample audio signal obtained by adding the signal to which the sample delay has been added;
Detecting an output level of the sample audio signal;
Selecting a combination of delay amounts to be added to each of the output signals based on the output level;
Adding a delay to each of the output signals from the plurality of microphones based on the selected combination of the delay amounts;
Generating an audio signal by adding the respective signals to which the delay has been added; and
Recognizing the audio signal;
Controlling the operation of the device based on the recognition result;
A speech recognition method.

Computer
A plurality of microphones for voice input; and an audio signal generation unit that adds a delay to each of output signals from the plurality of microphones and generates an audio signal obtained by adding the signals to which the delay has been added. A microphone system in which the directivity axis is changed by changing the amount of delay applied to the output signal;
Based on the output level of the sample audio signal obtained by adding the signal obtained by adding the sample delay to the output signal from the plurality of microphones, one of the combinations of the delay amounts to be added to each of the output signals is A delay amount selection section for selecting a combination of delay amounts;
A recognition unit for recognizing the audio signal generated using the combination of delay amounts selected by the delay amount selection unit;
A control unit for controlling the operation of the apparatus based on the recognition result of the recognition unit;
A program for functioning as a navigation device.