JP2007065122A

JP2007065122A - Noise suppressing device of on-vehicle voice recognition device

Info

Publication number: JP2007065122A
Application number: JP2005248912A
Authority: JP
Inventors: Yoshifumi Iwata; 良文岩田; Kenichi Komuro; 健一小室; Yuichi Murakami; 裕一村上; Rikuo Hatano; 陸生波多野
Original assignee: Aisin Seiki Co Ltd
Current assignee: Aisin Corp
Priority date: 2005-08-30
Filing date: 2005-08-30
Publication date: 2007-03-15

Abstract

<P>PROBLEM TO BE SOLVED: To effectively suppress noise so that a voice recognition rate can be improved in various operation states of each device of a vehicle. <P>SOLUTION: The noise suppressing device for suppressing a voice signal which is input to a voice recognition device in order to recognize voice of a speaker in the vehicle, comprises: a microphone 1 provided in the vehicle in order to collect internal sound of the vehicle; a vehicle state detection section 3 for detecting a vehicle state which is an operation state of each device of the vehicle; a memory section 11 for storing the sound signal collected with the microphone 1 by relating it to the vehicle state when collected; a vehicle sound to vehicle state comparison memory section 13 for storing information in which the stored sound signal for each vehicle state, is averaged in a predetermined period; and a vehicle sound suppressing section 14 for suppressing the sound by subtracting the averaged information corresponding to the vehicle state when the sound signal is collected, from the sound signal collected with the microphone 1. <P>COPYRIGHT: (C)2007,JPO&INPIT

Description

本発明は、車両内の話者の音声から、車両の音を抑圧する雑音抑圧装置に関する。 The present invention relates to a noise suppression device that suppresses the sound of a vehicle from the voice of a speaker in the vehicle.

近年の車両においては、乗員の発する音声によって車両内の機器を制御する車載用音声認識装置が用いられる場合がある。しかし、車両そのものが発生する音響、または車両に備えられたオーディオ装置から発生する音響が雑音となって、乗員の発する音声に重畳されるので、正しく音声認識できない場合がある。そこで、音声認識の前処理として、雑音を抑圧する方法が提案されている。
特許文献１は、カーオーディオ装置から出力されるオーディオ信号と、発話者近傍に設けられた音声入力用マイクロフォンから検出される検出信号とから、音声入力用マイクロフォンに混入するカーオーディオ装置からの音楽信号を推定して、検出信号から除去する技術が記載されている。特許文献１の技術は、音声認識の前処理としての雑音低減方法であるが、自動車の走行音は低減されないため、停車中などの走行音が低い状態でしか対応できない可能性が高い。
特許文献２は、車両の走行状態に応じて発生する雑音の状態に適合するように、雑音を抑圧する方法が記載されている。特許文献２の技術では、雑音抑圧装置の減算制御部は、車両情報検出装置より得られる車両情報に基づいてスイッチの開閉を制御することで、音声用マイクに入力された話者の音声信号データから定常雑音、および非定常雑音を減算器によって減算除去するのを、車両の走行状態などに応じて制御する。車両の走行状態として、停車中・加減速中・エンジン停止・一定速度での走行中の４パターンに応じた雑音抑圧を行うようになっている。しかし、走行状態としてはこれら４パターンだけではなく多種の状態があり、また、走行状態に応じた車両の走行音も変化する。従って特許文献２で提示されるような固定したパターンでの雑音抑圧では、充分に雑音抑圧できない場合がある。
特許文献３は、音声検出部の後段にカットオフ周波数決定部とそれに制御されるハイパスフィルタを設ける方法が記載されている。カットオフ周波数決定部は音声信号の各帯域の騒音の主体を分析する。騒音の主体が周期性騒音ならば、その帯域をカットオフ周波数としてハイパスフィルタに設定する。これにより、周期性騒音の帯域が変化しても常にそれが低減される。ランダム騒音が主体である場合は所定のカットオフ周波数に設定する。これにより、人音声帯域をカットして認識率を低下させることがない、としている。
しかし、ハイパスフィルタのカットオフ周波数が最上限の場合には、話者にもよるが音声情報の一部も影響を受ける。またある騒音下では、この一部の音声情報を欠落させても認識率が上がるとあるが、文中に「人音声の認識において重要な母音は有声音であり、その有声音は１００〜３００Ｈｚの帯域に基本周波数を有する場合、周期音である。」と記載されているのに対して、実施例説明の中ではカットオフ周波数を４００Ｈｚにまで上げていることに矛盾がある。特許文献３の技術は、音声認識に必要な周波数帯域に重畳する雑音に対しては、有効ではないと考えられる。
特開２０００−２３１３９９号公報特開２０００−３２１０８０号公報特開２００１−２９６８８７号公報 In recent vehicles, an in-vehicle voice recognition device that controls equipment in the vehicle by voice generated by an occupant may be used. However, since the sound generated by the vehicle itself or the sound generated from the audio device provided in the vehicle becomes noise and is superimposed on the sound generated by the occupant, the sound may not be recognized correctly. Therefore, a method for suppressing noise has been proposed as preprocessing for speech recognition.
Patent Document 1 discloses a music signal from a car audio device mixed in a sound input microphone from an audio signal output from the car audio device and a detection signal detected from a sound input microphone provided in the vicinity of a speaker. A technique for estimating and removing this from the detection signal is described. The technique of Patent Document 1 is a noise reduction method as preprocessing for voice recognition. However, since the driving sound of an automobile is not reduced, there is a high possibility that it can be handled only when the driving sound is low, such as when the vehicle is stopped.
Patent Document 2 describes a method of suppressing noise so as to match the state of noise generated according to the running state of the vehicle. In the technique of Patent Document 2, the subtraction control unit of the noise suppression device controls the opening and closing of the switch based on the vehicle information obtained from the vehicle information detection device, so that the voice signal data of the speaker input to the voice microphone The subtractor removes stationary noise and non-stationary noise from the subtractor according to the traveling state of the vehicle. Noise suppression is performed in accordance with four patterns of the vehicle running state: stopping, acceleration / deceleration, engine stop, and running at a constant speed. However, the traveling state includes not only these four patterns but also various states, and the traveling sound of the vehicle changes according to the traveling state. Therefore, noise suppression with a fixed pattern as shown in Patent Document 2 may not be sufficient.
Patent Document 3 describes a method of providing a cut-off frequency determination unit and a high-pass filter controlled by the cut-off frequency determination unit after the voice detection unit. The cut-off frequency determination unit analyzes the main subject of noise in each band of the audio signal. If the main noise is periodic noise, the band is set as a cut-off frequency in the high-pass filter. Thereby, even if the band of periodic noise changes, it is always reduced. When random noise is the main component, a predetermined cutoff frequency is set. As a result, the human voice band is not cut to reduce the recognition rate.
However, when the cut-off frequency of the high-pass filter is the upper limit, a part of the voice information is also affected depending on the speaker. Under certain noise, the recognition rate may be increased even if some of the voice information is lost. However, in the sentence, “Vowels important for human speech recognition are voiced sounds, and the voiced sounds are 100 to 300 Hz. It is a periodic sound when it has a fundamental frequency in the band ”, whereas in the description of the embodiment, there is a contradiction in that the cutoff frequency is raised to 400 Hz. The technique of Patent Document 3 is not effective against noise superimposed on a frequency band necessary for speech recognition.
JP 2000-231399 A JP 2000-321080 A JP 2001-296877 A

本発明は、上記事情に鑑みてなされたものであり、車両の各装置のさまざまな作動状態において、音声認識率を向上するように、効果的に雑音を抑圧することを目的とする。 The present invention has been made in view of the above circumstances, and an object thereof is to effectively suppress noise so as to improve a speech recognition rate in various operating states of each device of a vehicle.

上記目的を達成するため、本発明の第１の観点に係る雑音抑圧装置は、車両内の話者の音声を認識するために、音声認識装置に入力される音声信号の雑音を抑圧する雑音抑圧装置であって、前記車両の内部の音響を収集するために該車両に備えられた音響信号収集手段と、前記車両の各装置の作動状態である車両状態を検出する車両状態検出手段と、前記音響信号収集手段で収集された音響信号を、収集されたときの前記車両状態に対応づけて記憶する記憶部と、前記車両状態ごとに記憶された前記音響信号を所定の期間で平均化した情報を、前記車両状態に対応づけて記憶する車両音−車両状態対比記憶部と、前記音響信号収集手段で収集された前記音響信号から、該音響信号が収集されたときの前記車両状態に対応した前記平均化された情報を差し引き抑圧する抑圧手段と、を備えることを特徴とする。 In order to achieve the above object, a noise suppression apparatus according to a first aspect of the present invention is a noise suppression apparatus that suppresses noise of a voice signal input to a voice recognition apparatus in order to recognize a voice of a speaker in a vehicle. An apparatus for collecting acoustic signals in the vehicle for collecting sound inside the vehicle, vehicle state detecting means for detecting a vehicle state as an operating state of each device of the vehicle, and A storage unit that stores the acoustic signal collected by the acoustic signal collecting unit in association with the vehicle state at the time of collection, and information obtained by averaging the acoustic signal stored for each vehicle state over a predetermined period Corresponding to the vehicle state when the acoustic signal is collected from the vehicle sound-vehicle state comparison storage unit that stores the vehicle signal in association with the vehicle state and the acoustic signal collected by the acoustic signal collecting unit. The averaged Characterized in that it comprises a suppressing means for suppressing subtracted distribution, the.

さらに、複数の前記音響信号収集手段と、前記音声認識装置で認識すべき特定の話者の位置と、前記複数の音響信号収集装手段それぞれとの距離の差に等しい位相差を有する音響信号のみを、前記音響信号から抽出する特定話者位相信号抽出手段とを備え、前記特定話者位相信号抽出手段で、前記複数の音響信号収集手段で収集した前記音響信号から、前記特定の話者と前記複数の音響信号収集手段それぞれとの距離の差に等しい位相差を有する成分を抽出することにより、前記特定の話者と前記複数の音響信号収集手段それぞれとの距離の差と異なる位相を有する音響成分を抑圧する、ことを特徴とする。 Further, only acoustic signals having a phase difference equal to a difference in distance between the plurality of acoustic signal collecting means, the position of a specific speaker to be recognized by the speech recognition apparatus, and the plurality of acoustic signal collecting means. Specific speaker phase signal extraction means for extracting from the acoustic signal, and the specific speaker phase signal extraction means, from the acoustic signals collected by the plurality of acoustic signal collection means, By extracting a component having a phase difference equal to the difference in distance from each of the plurality of acoustic signal collection means, the component has a phase different from the difference in distance between the specific speaker and each of the plurality of acoustic signal collection means. The acoustic component is suppressed.

前記車両にオーディオ装置が備えられている場合、該オーディオ装置の音響信号を前記音響信号収集装置で収集した音響信号から差し引いて、前記車両外部の音響信号とするオーディオ信号抑圧手段を備えることが好ましい。 In the case where the vehicle is provided with an audio device, it is preferable to include an audio signal suppression unit that subtracts the sound signal of the audio device from the sound signal collected by the sound signal collecting device to obtain an sound signal outside the vehicle. .

好ましくは、音響信号収集手段によって収集される音響信号を、周波数情報および／または時間軸情報としてデジタル化して処理することを特徴とする。 Preferably, the acoustic signal collected by the acoustic signal collecting means is digitized and processed as frequency information and / or time axis information.

前記車両の各装置の作動状態として、車両の速度、アクセル開度、機関回転数、加減速状態、変速機状態、ステアリング操舵角、ワイパー作動状態、エアコン作動状態、電動ファン作動状態、窓開閉状態、ホーン作動状態および前記車両に装着されたタイヤの種類のうちの任意の組み合わせを含むことができる。 As the operating state of each device of the vehicle, vehicle speed, accelerator opening, engine speed, acceleration / deceleration state, transmission state, steering steering angle, wiper operating state, air conditioner operating state, electric fan operating state, window opening / closing state Any combination of the horn operating state and the type of tire mounted on the vehicle may be included.

本発明の車載用音声認識装置の雑音抑圧装置によれば、車両の各装置のさまざまな作動状態に対応した音響情報データベース（車両音−車両状態対比記憶）を構築し、音響信号収集手段で収集された音響信号から、音響信号が収集されたときの車両の状態に対応した自車両の特徴ある車両走行音の成分を抑圧するので、車内の話者の音声のみを捉えることができる。その結果、車載用音声認識装置において音声認識率の向上が期待できる。 According to the noise suppression device of the on-vehicle speech recognition device of the present invention, an acoustic information database (vehicle sound-vehicle state comparison memory) corresponding to various operating states of each device of the vehicle is constructed and collected by the acoustic signal collecting means. Since the component of the vehicle running sound that is characteristic of the host vehicle corresponding to the state of the vehicle when the acoustic signal is collected is suppressed from the generated acoustic signal, only the voice of the speaker in the vehicle can be captured. As a result, an improvement in the speech recognition rate can be expected in the in-vehicle speech recognition apparatus.

（実施の形態１）
本発明に係る雑音抑圧装置３０の一実施の形態について、図１により説明する。図１は、本発明の実施の形態１に係る雑音抑圧装置３０を示すブロック図である。図１に示す雑音抑圧装置３０の各部の構成を説明する。マイクロフォン１は、車内の音響を収集する音響信号収集手段である。マイクロフォン１で収集された音響信号はマイクアンプ４で増幅され、Ａ／Ｄ変換器５でディジタル信号に変換されて、オーディオ信号抑圧部９に入力される。オーディオ信号入力部２は、車両に備えられたオーディオ装置（図示せず）のオーディオ信号を入力する。オーディオ信号は、Ａ／Ｄ変換器６でディジタル信号に変換されて、オーディオ信号調節部８に入力される。Ａ／Ｄ変換器５、６におけるサンプリング周波数は、音声認識するのに有効な最大周波数の数倍〜１０倍とする。例えば、音響を認識するための最大周波数が１０ｋＨｚである場合、サンプリング周波数を４０ｋＨｚ〜５０ｋＨｚ程度とする。 (Embodiment 1)
An embodiment of a noise suppression device 30 according to the present invention will be described with reference to FIG. FIG. 1 is a block diagram showing a noise suppression device 30 according to Embodiment 1 of the present invention. The configuration of each part of the noise suppression device 30 shown in FIG. 1 will be described. The microphone 1 is an acoustic signal collecting unit that collects sound in the vehicle. The acoustic signal collected by the microphone 1 is amplified by the microphone amplifier 4, converted into a digital signal by the A / D converter 5, and input to the audio signal suppression unit 9. The audio signal input unit 2 inputs an audio signal of an audio device (not shown) provided in the vehicle. The audio signal is converted into a digital signal by the A / D converter 6 and input to the audio signal adjusting unit 8. The sampling frequency in the A / D converters 5 and 6 is several to 10 times the maximum frequency effective for speech recognition. For example, when the maximum frequency for recognizing sound is 10 kHz, the sampling frequency is about 40 kHz to 50 kHz.

車両状態検出部３は、車両の各装置の作動状態を検出する。車両の各装置の作動状態としては、例えば、車両の速度、アクセル開度、機関回転数、加減速状態、変速機状態、ステアリング操舵角、ワイパー作動状態、エアコン作動状態、電動ファン作動状態、窓開閉状態、ホーン作動状態および車両に装着されたタイヤの種類などがある。車両の各装置の作動状態を検出するには、車両制御装置（図示せず）から作動状態情報を入力する。または、前記の各装置に検出器を備えて、その状態を検知する。あるいは、前記の車両の各装置の制御部から作動状態情報を入力する。車両状態検出部３で検出した車両状態情報は、オーディオ信号調節部８およびアドレス生成部１０に伝達される。 The vehicle state detection unit 3 detects the operation state of each device of the vehicle. The operating state of each device of the vehicle includes, for example, vehicle speed, accelerator opening, engine speed, acceleration / deceleration state, transmission state, steering steering angle, wiper operating state, air conditioner operating state, electric fan operating state, window There are open / closed states, horn operating states, and types of tires mounted on the vehicle. In order to detect the operating state of each device of the vehicle, operating state information is input from a vehicle control device (not shown). Alternatively, each device described above is provided with a detector to detect its state. Alternatively, operating state information is input from the control unit of each device of the vehicle. The vehicle state information detected by the vehicle state detection unit 3 is transmitted to the audio signal adjustment unit 8 and the address generation unit 10.

オーディオ信号調節部８は、例えばオーディオ信号の増幅度を帯域ごとに変化することができる可変帯域フィルタから構成され、車両状態に応じて、車外の音響信号から差し引き抑圧するオーディオ信号を調節する。例えば、窓の開閉状態によって音響の反射条件が変化し、車内のオーディオ装置の音響がマイクロフォン１に回り込む大きさが変化するので、窓の開閉状態に応じて差し引き抑圧するオーディオ信号の増幅度を調節する。オーディオ信号調節部８で調節されたオーディオ信号は、オーディオ信号抑圧部９に入力される。 The audio signal adjustment unit 8 is composed of, for example, a variable band filter that can change the amplification degree of the audio signal for each band, and adjusts the audio signal to be subtracted and suppressed from the acoustic signal outside the vehicle according to the vehicle state. For example, the acoustic reflection conditions change depending on the open / close state of the window, and the magnitude of the sound of the audio device in the car that wraps around the microphone 1 changes, so the amplification level of the audio signal to be subtracted and suppressed is adjusted according to the open / close state of the window To do. The audio signal adjusted by the audio signal adjusting unit 8 is input to the audio signal suppressing unit 9.

オーディオ信号抑圧部９は、音響信号の差分を求める減算回路とフーリエ変換回路から構成され、マイクロフォン１で収集された音響信号から、オーディオ信号調節部８で調節されたオーディオ信号を差し引き抑圧する。オーディオ信号の抑圧は、時間軸情報（音波波形）のまま差分を求めることができる。時間軸情報で音響信号からオーディオ信号を抑圧したのち、周波数情報とする。先に、音響信号とオーディオ信号をフーリエ変換して周波数情報とし、周波数情報で差分を求めてもよい。周波数情報とするには、音響信号およびオーディオ信号を一定の時間蓄積し、その一定の時間間隔ごとにフーリエ変換する。その後は、一定の時間ごとに処理を行う。 The audio signal suppression unit 9 includes a subtraction circuit for obtaining a difference between acoustic signals and a Fourier transform circuit, and subtracts and suppresses the audio signal adjusted by the audio signal adjustment unit 8 from the acoustic signal collected by the microphone 1. For the suppression of the audio signal, the difference can be obtained with the time axis information (sound waveform). After suppressing the audio signal from the acoustic signal with the time axis information, the frequency information is obtained. First, the acoustic signal and the audio signal may be Fourier-transformed to obtain frequency information, and the difference may be obtained from the frequency information. In order to obtain frequency information, an acoustic signal and an audio signal are accumulated for a certain period of time, and Fourier transform is performed at each certain time interval. Thereafter, processing is performed at regular intervals.

オーディオ信号が抑圧された音響信号は、記憶部１１に送られて記憶される。記憶部１１で記憶された音響信号は、平均化処理部１２で車両状態ごとに所定の期間にわたって平均化される。音響信号の平均化は周波数ごとに行う。車両音−車両状態対比記憶部１３は、平均化処理部１２で車両状態ごとに音響信号が所定の期間にわたって平均化された車両音を記憶している。 The acoustic signal in which the audio signal is suppressed is sent to and stored in the storage unit 11. The acoustic signal stored in the storage unit 11 is averaged over a predetermined period for each vehicle state by the averaging processing unit 12. The sound signal is averaged for each frequency. The vehicle sound-vehicle state comparison storage unit 13 stores the vehicle sound obtained by averaging the acoustic signals over a predetermined period for each vehicle state by the averaging processing unit 12.

平均化処理部１２は、そのときの車両状態に対応する平均化された車両音と現在の音響信号を重み付けして平均し、その車両状態に対応する平均化された車両音として、車両音−車両状態対比記憶部１３の記憶を更新する。例えばＮを正の整数として、平均化された音をＮ−１倍し、音響信号を加算してＮで除して新たな平均化された車両音とする。例えばＮが１００の場合、音響信号は最初に１／１００になり、以後、毎回９９／１００になって順次、平均化された車両音への寄与率が小さくなっていく。 The averaging processing unit 12 weights and averages the averaged vehicle sound corresponding to the vehicle state at that time and the current acoustic signal, and the vehicle sound − as the averaged vehicle sound corresponding to the vehicle state The storage in the vehicle state comparison storage unit 13 is updated. For example, assuming that N is a positive integer, the averaged sound is multiplied by N-1 and the sound signal is added and divided by N to obtain a new averaged vehicle sound. For example, when N is 100, the acoustic signal first becomes 1/100, and thereafter becomes 99/100 each time, and the contribution rate to the averaged vehicle sound decreases gradually.

平均化する方法として、車両状態が同じである所定の個数の音響信号の移動平均をとってもよい。平均化にあたって、現在の（オーディオ信号が抑圧された）音響信号と平均化された車両音の差を、現在の音響信号から差し引いて平均化してもよいが、同じ車両状態でも長期的には車両音が変化する可能性があるので、オーディオ信号が抑圧された音響信号のまま平均化してもよい。このようにすると、例えばタイヤが交換された場合や、季節変動あるいは長期的な変化にも追随することができる。 As an averaging method, a moving average of a predetermined number of acoustic signals having the same vehicle state may be taken. For averaging, the difference between the current acoustic signal (with the audio signal suppressed) and the averaged vehicle sound may be subtracted from the current acoustic signal, but it may be averaged even in the same vehicle condition in the long term. Since the sound may change, the audio signal may be averaged with the suppressed acoustic signal. In this way, for example, when a tire is replaced, it is possible to follow seasonal variations or long-term changes.

車両音−車両状態対比記憶部１３は、フラッシュメモリ、ハードディスク、ＤＶＤ（Digital Versatile Disc）、ＤＶＤ−ＲＡＭ（Digital Versatile Disc Random-Access Memory）、ＤＶＤ−ＲＷ（Digital Versatile Disc Rewritable）等の不揮発性メモリから構成される。 The vehicle sound-vehicle state comparison storage unit 13 is a non-volatile memory such as a flash memory, a hard disk, a DVD (Digital Versatile Disc), a DVD-RAM (Digital Versatile Disc Random-Access Memory), a DVD-RW (Digital Versatile Disc Rewritable). Consists of

図２は、車両音−車両状態対比記憶部１３の構造の例を示す図である。車両状態の各項目に対してビットが割り当てられており、各ビットの０または１の状態で決まるビットパターンに対応して、平均化された車両音が格納されているアドレス（車両音格納場所）が割り当てられている。車両状態情報で各ビットが決められ、そのビットパターンに一致する行の車両音格納場所を参照して、車両状態に対応した平均化された車両音を取り出すことができる。 FIG. 2 is a diagram illustrating an example of the structure of the vehicle sound-vehicle state comparison storage unit 13. A bit is assigned to each item of the vehicle state, and an address where the averaged vehicle sound is stored corresponding to the bit pattern determined by the state of 0 or 1 of each bit (vehicle sound storage location) Is assigned. Each bit is determined by the vehicle state information, and an averaged vehicle sound corresponding to the vehicle state can be extracted with reference to the vehicle sound storage location of the row that matches the bit pattern.

車両状態としては、車両の速度（車速）、変速機の段階、アクセル開度、機関回転数、加減速状態、操舵角、ワイパー作動状態、エアコン作動状態、電動ファン作動状態、窓の開度、警笛作動状態などがある。車速としては、例えば１０ｋｍ／ｈごとに１ずつ増加する数値とする。変速機の状態は、例えば前進５段、後退１段、ニュートラルおよびパーキングを含めて８段階なので、３ビットを割り当てる。同様にして、例えばアクセル開度に３ビット、機関回転数に４ビット、加減速状態に３ビットを割り当てる。ワイパー作動は、例えば停止、間欠、連続、高速の４段階として２ビットを割り当てる。エアコン作動状態は、ヒートポンプおよび熱交換機のファンの作動と室内ファンの作動に、例えば合計３ビットを割り当てる。電動ファン作動状態に例えば２ビットを割り当てる。窓の開度としては、例えば全体として全閉を含めて８段階として、３ビットを割り当てる。窓ごとに例えば２ビットを割り当ててもよい。警笛はＯＮ／ＯＦＦしかない場合は１ビットでよい。図２の例では、車両状態は合計３１ビットである。車両状態を構成する項目とそれぞれのビット数は、車両に応じて適宜、追加または変更することができる。例えば、雨滴センサの情報を追加して、降雨の強さを車両状態の１つに加えてもよい。 The vehicle state includes vehicle speed (vehicle speed), transmission stage, accelerator opening, engine speed, acceleration / deceleration, steering angle, wiper operation, air conditioner operation, electric fan operation, window opening, There is a horn operating state. For example, the vehicle speed is a numerical value that increases by 1 every 10 km / h. Since the state of the transmission is 8 stages including, for example, 5 forward stages, 1 reverse stage, neutral and parking, 3 bits are allocated. Similarly, for example, 3 bits are assigned to the accelerator opening, 4 bits are assigned to the engine speed, and 3 bits are assigned to the acceleration / deceleration state. In the wiper operation, for example, 2 bits are assigned as 4 stages of stop, intermittent, continuous, and high speed. In the air conditioner operating state, for example, a total of 3 bits are allocated to the operation of the fan of the heat pump and the heat exchanger and the operation of the indoor fan. For example, 2 bits are assigned to the operating state of the electric fan. As the opening of the window, for example, 3 bits are assigned in 8 stages including the fully closed state as a whole. For example, 2 bits may be allocated for each window. If the horn is only ON / OFF, 1 bit is sufficient. In the example of FIG. 2, the vehicle state is 31 bits in total. The items constituting the vehicle state and the number of bits can be added or changed as appropriate according to the vehicle. For example, raindrop sensor information may be added to add the strength of rain to one of the vehicle conditions.

アドレス生成部１０は、車両状態に対応する平均化された車両音が記憶された車両音−車両状態対比記憶部１３内のアドレスを生成し、車両音−車両状態対比記憶部１３に指示する。例えば、車両状態検出部３で検出された車両状態情報から、図２の車両状態に対応する３１ビットの情報を生成し、生成された３１ビットの情報に対応する車両音格納場所を、図２の構造で表されるデータから取り出す。 The address generation unit 10 generates an address in the vehicle sound-vehicle state comparison storage unit 13 in which the averaged vehicle sound corresponding to the vehicle state is stored, and instructs the vehicle sound-vehicle state comparison storage unit 13. For example, 31-bit information corresponding to the vehicle state of FIG. 2 is generated from the vehicle state information detected by the vehicle state detection unit 3, and the vehicle sound storage location corresponding to the generated 31-bit information is shown in FIG. Extract from the data represented by the structure.

アドレス生成部１０で指示された車両音格納場所に格納されている車両音は、車両音−車両状態対比記憶部１３から車両音抑圧部１４に入力される。また、記憶部１１に記憶されている音響信号が、車両音抑圧部１４に入力される。車両音抑圧部１４は、減算回路から構成され、音響信号から平均化された車両音を周波数成分ごとに減算する。すなわち、本実施の形態１では、車両音抑圧部１４は音響信号から、そのときの車両状態に対応する平均化された車両音を抑圧する。 The vehicle sound stored in the vehicle sound storage location instructed by the address generation unit 10 is input from the vehicle sound-vehicle state comparison storage unit 13 to the vehicle sound suppression unit 14. In addition, an acoustic signal stored in the storage unit 11 is input to the vehicle sound suppression unit 14. The vehicle sound suppression unit 14 includes a subtraction circuit, and subtracts the vehicle sound averaged from the acoustic signal for each frequency component. That is, in the first embodiment, the vehicle sound suppression unit 14 suppresses the averaged vehicle sound corresponding to the vehicle state at that time from the acoustic signal.

図３は、音響信号から平均化された車両音を抑圧する例を示す模式図である。図３の（ａ）は、車両音に音声が重畳している様子を示す。便宜的に車両音を示す下側実線と、重畳された音声を示す上側実線を分けているが、観測されるデータは上側の実線である。図３の例では、音響信号をフーリエ変換して周波数情報とした場合を示す。図３の（ｂ）は（ａ）のスペクトルから車両音のスペクトルを差し引いた（抑圧した）残りで、自車両以外の音響成分を表す。このように、音響信号から車両音の成分を抑圧して、車内の音声成分を取り出すことができる。厳密には、抑圧した音響信号に車外の音響、例えば近くを走行する車両の音が含まれているが、車両の遮音特性によって軽減されている。 FIG. 3 is a schematic diagram illustrating an example of suppressing vehicle sound averaged from an acoustic signal. FIG. 3A shows a state in which sound is superimposed on the vehicle sound. For convenience, the lower solid line indicating the vehicle sound and the upper solid line indicating the superimposed sound are separated, but the observed data is the upper solid line. In the example of FIG. 3, the case where an acoustic signal is Fourier-transformed into frequency information is shown. (B) of FIG. 3 shows the acoustic component other than the own vehicle, which is the remainder obtained by subtracting (suppressing) the spectrum of the vehicle sound from the spectrum of (a). In this manner, the vehicle sound component can be suppressed from the acoustic signal, and the sound component in the vehicle can be extracted. Strictly speaking, the suppressed acoustic signal includes sound outside the vehicle, for example, the sound of a vehicle traveling nearby, which is reduced by the sound insulation characteristics of the vehicle.

音響信号から平均化された車両音を抑圧するのは、一定の時間ごとに処理を行う。一定の時間間隔は、音声認識ができる程度の短い間隔とする。すなわち、音素と音素の変化を検出できる程度の短い時間間隔で処理を行う。例えば、１０ミリ秒以下、好ましくは５ミリ秒以下の時間間隔で処理を行う。 The suppression of the vehicle sound averaged from the acoustic signal is performed at regular intervals. The fixed time interval is set to a short interval that allows voice recognition. That is, processing is performed at a short time interval that can detect a change between phonemes and phonemes. For example, processing is performed at time intervals of 10 milliseconds or less, preferably 5 milliseconds or less.

車両音抑圧部１４で平均化された車両音が抑圧された音響信号は、音声認識装置１５に入力され、音声認識される。時間軸情報が必要な場合は、車両音抑圧部１４で車両音が抑圧された音響信号の周波数情報から、その周波数成分の周波数の信号波形を合成してもよい。その場合、元の音響信号の位相情報を用いて合成波形の位相を決定する。 The acoustic signal in which the vehicle sound averaged by the vehicle sound suppression unit 14 is suppressed is input to the speech recognition device 15 and recognized. When the time axis information is required, the signal waveform of the frequency component frequency may be synthesized from the frequency information of the acoustic signal whose vehicle sound is suppressed by the vehicle sound suppression unit 14. In that case, the phase of the composite waveform is determined using the phase information of the original acoustic signal.

雑音抑圧装置３０のうち、アドレス生成部１０、オーディオ信号抑圧部９、オーディオ信号調節部８、記憶部１１、平均化処理部１２および車両音抑圧部１４は、その全部または一部をＤＳＰ（Digital Signal Processor）で構成することができる。 Of the noise suppression device 30, the address generation unit 10, the audio signal suppression unit 9, the audio signal adjustment unit 8, the storage unit 11, the averaging processing unit 12, and the vehicle sound suppression unit 14 are all or part of a DSP (Digital (Signal Processor).

次に、図１の雑音抑圧装置３０の動作を、図４を参照して説明する。図４は、実施の形態１の雑音抑圧装置３０の動作を示すフローチャートである。 Next, the operation of the noise suppression device 30 of FIG. 1 will be described with reference to FIG. FIG. 4 is a flowchart showing the operation of the noise suppression apparatus 30 according to the first embodiment.

まず、マイクロフォン１で車内の音響信号を収集し、オーディオ信号入力部２でオーディオ信号を入力する（ステップＡ１）。同時に、車両状態検出部３でそのときの車両状態を入力する（ステップＡ２）。前述のとおり、音響信号とオーディオ信号をＡ／Ｄ変換器５、６でディジタル化し、オーディオ信号調節部８でオーディオ信号を車両状態に応じて調節して、オーディオ信号抑圧部９で音響信号からオーディオ信号を抑圧する（ステップＡ３）。オーディオ信号が抑圧された音響信号は、記憶部１１に記憶される。車両状態情報から、車両状態に対応する平均化された車両音が記憶された車両音−車両状態対比記憶部１３のアドレスを生成し、車両音−車両状態対比記憶部１３から車両状態に対応する平均化された車両音を参照する（ステップＡ４）。 First, acoustic signals in the vehicle are collected by the microphone 1, and an audio signal is input by the audio signal input unit 2 (step A1). At the same time, the vehicle state at that time is input by the vehicle state detector 3 (step A2). As described above, the audio signal and the audio signal are digitized by the A / D converters 5 and 6, the audio signal adjusting unit 8 adjusts the audio signal according to the vehicle state, and the audio signal suppressing unit 9 converts the audio signal from the audio signal to the audio signal. The signal is suppressed (step A3). The acoustic signal in which the audio signal is suppressed is stored in the storage unit 11. From the vehicle state information, an address of the vehicle sound-vehicle state comparison storage unit 13 in which the averaged vehicle sound corresponding to the vehicle state is stored is generated, and the address corresponding to the vehicle state is generated from the vehicle sound-vehicle state comparison storage unit 13. Reference is made to the averaged vehicle sound (step A4).

車両音抑圧部１４で、オーディオ信号が抑圧された音響情報から、車両状態に対応する平均化された車両音を抑圧し（ステップＡ５）、音声認識装置１５に、車両音を抑圧した音響信号を送る（ステップＡ６）。 The vehicle sound suppression unit 14 suppresses the averaged vehicle sound corresponding to the vehicle state from the acoustic information in which the audio signal is suppressed (step A5), and the sound recognition device 15 receives the acoustic signal that suppresses the vehicle sound. Send (step A6).

平均化処理部１２で、そのときの車両状態に対応する平均化された車両音と現在の車両音を重み付けして平均し、その車両状態に対応する平均化された車両音として、車両音−車両状態対比記憶部１３の記憶を更新する（ステップＡ７）。ここまでの処理は所定の時間間隔（例えば５ミリ秒）ごとに行われる。そのときの車両状態以外の平均化された車両音は、その回では更新されない。 The averaging processing unit 12 weights and averages the averaged vehicle sound corresponding to the vehicle state at that time and the current vehicle sound, and the vehicle sound − as the averaged vehicle sound corresponding to the vehicle state The storage of the vehicle state comparison storage unit 13 is updated (step A7). The processing so far is performed at predetermined time intervals (for example, 5 milliseconds). The averaged vehicle sound other than the vehicle state at that time is not updated at that time.

以上の結果、車両の各装置のさまざまな作動状態に対応した音響情報データベース（車両音−車両状態対比記憶）を構築し、音響信号収集手段で収集された音響信号から、音響信号が収集されたときの車両の状態に対応した自車両の特徴ある車両走行音の成分を抑圧するので、車内の話者の音声のみを捉えることができる。その結果、車載用音声認識装置において音声認識率の向上が期待できる。 As a result, an acoustic information database (vehicle sound-vehicle state comparison memory) corresponding to various operating states of each device of the vehicle was constructed, and acoustic signals were collected from the acoustic signals collected by the acoustic signal collecting means. Since the characteristic vehicle running sound component of the host vehicle corresponding to the state of the vehicle at the time is suppressed, only the voice of the speaker in the vehicle can be captured. As a result, an improvement in the speech recognition rate can be expected in the in-vehicle speech recognition apparatus.

（実施の形態２）
本発明の異なる実施の形態２について、図５乃至図７を参照して説明する。実施の形態２では、雑音抑圧装置は複数の音響信号収集手段を備え、複数の音響信号収集手段で収集された音響信号から、音声認識装置で認識すべき特定の話者の位置と、前記複数の音響信号収集手段それぞれとの距離の差に等しい位相差を有する音響信号のみを抽出する。図５は、本発明の実施の形態２に係る音響信号収集手段の配置の例を示す車両の平面図である。図５に示すように、音響信号収集手段として例えばマイクロフォン１ａ、１ｂを、運転者の前面の２カ所に設けることができる。 (Embodiment 2)
A second embodiment of the present invention will be described with reference to FIGS. In the second embodiment, the noise suppression device includes a plurality of acoustic signal collection units, and the position of the specific speaker to be recognized by the speech recognition device from the acoustic signals collected by the plurality of acoustic signal collection units, and the plurality of the plurality of acoustic signal collection units. Only acoustic signals having a phase difference equal to the difference in distance from each of the acoustic signal collecting means are extracted. FIG. 5 is a plan view of a vehicle showing an example of the arrangement of acoustic signal collecting means according to Embodiment 2 of the present invention. As shown in FIG. 5, for example, microphones 1 a and 1 b can be provided as acoustic signal collecting means at two locations on the front of the driver.

図６は、実施の形態２に係る雑音抑圧装置３０のブロック図である。実施の形態１と比較して、マイクロフォン１ａ、１ｂが複数個（２個）になっている。音響信号は、マイクロフォン１ａ、１ｂごとにマイクアンプ４ａ、４ｂで増幅され、Ａ／Ｄ変換器５ａ、５ｂでディジタル信号に変換される。そして、特定話者位相信号抽出部７が新たに設けられている。 FIG. 6 is a block diagram of the noise suppression apparatus 30 according to the second embodiment. Compared with the first embodiment, there are a plurality (two) of microphones 1a and 1b. The acoustic signal is amplified by the microphone amplifiers 4a and 4b for each of the microphones 1a and 1b, and converted into a digital signal by the A / D converters 5a and 5b. A specific speaker phase signal extraction unit 7 is newly provided.

特定話者位相信号抽出部７は、例えばＤＳＰで構成される。特定話者位相信号抽出部７は、マイクロフォン１ａから話者までの距離と、マイクロフォン１ｂから話者までの距離の差に相当する位相差を有する信号の成分を抽出する。例えば、マイクロフォン１ｂの信号を、マイクロフォン１ａと話者との距離と、マイクロフォン１ｂと話者との距離の差の音波の到達時間に相当する時間だけずらして、両者の信号の相関をとる。そして相関の高い部分を抽出する。このとき、マイクロフォン１ａ、１ｂと話者との距離の比に応じて、振幅を補正する。マイクロフォン１ａ、１ｂと音声認識すべき話者、例えば運転者との距離を等しい距離にすると、話者からマイクロフォン１ａ、１ｂまでの音声の到達時間が等しいので、音響信号の時間をずらす必要がない。 The specific speaker phase signal extraction unit 7 is configured by a DSP, for example. The specific speaker phase signal extraction unit 7 extracts a signal component having a phase difference corresponding to the difference between the distance from the microphone 1a to the speaker and the distance from the microphone 1b to the speaker. For example, the signal of the microphone 1b is shifted by the time corresponding to the arrival time of the sound wave, which is the difference between the distance between the microphone 1a and the speaker, and the distance between the microphone 1b and the speaker, and the correlation between the two signals is obtained. Then, a highly correlated part is extracted. At this time, the amplitude is corrected according to the ratio of the distance between the microphones 1a and 1b and the speaker. When the distances between the microphones 1a and 1b and the speakers to be recognized by voice, for example, the driver, are equal, the arrival times of the voices from the speakers to the microphones 1a and 1b are equal, so there is no need to shift the time of the acoustic signal. .

特定話者位相信号抽出部７で抽出された、マイクロフォン１ａから話者までの距離と、マイクロフォン１ｂから話者までの距離の差に相当する位相差を有する信号を、改めて音響信号として、オーディオ信号抑圧部９に入力する。オーディオ信号抑圧以降は、実施の形態１と同様に、オーディオ信号抑圧部９でオーディオ信号を抑圧し、車両音抑圧部１４で平均化された車両音を抑圧して、音声認識装置１５に送る。また、平均化処理部１２でそのときの車両状態に対応する平均化された車両音と現在の音響信号を重み付けして平均し、その車両状態に対応する平均化された車両音として、車両音−車両状態対比記憶部１３の記憶を更新する。本実施の形態２の場合、平均化された車両音は、マイクロフォン１ａ、１ｂで収集された音響信号のうち、マイクロフォン１ａ、１ｂから話者までの距離の差と同じ位相差を有する成分になっている。 A signal having a phase difference corresponding to the difference between the distance from the microphone 1a to the speaker and the distance from the microphone 1b to the speaker, extracted by the specific speaker phase signal extraction unit 7, is again used as an audio signal as an audio signal. Input to the suppression unit 9. After the audio signal suppression, the audio signal is suppressed by the audio signal suppression unit 9 and the vehicle sound averaged by the vehicle sound suppression unit 14 is suppressed and sent to the voice recognition device 15 as in the first embodiment. Further, the averaging processing unit 12 weights and averages the averaged vehicle sound corresponding to the vehicle state at that time and the current acoustic signal, and the vehicle sound is obtained as the averaged vehicle sound corresponding to the vehicle state. -Update the storage of the vehicle state comparison storage unit 13. In the case of the second embodiment, the averaged vehicle sound becomes a component having the same phase difference as the difference in distance from the microphones 1a, 1b to the speaker among the acoustic signals collected by the microphones 1a, 1b. ing.

つぎに、図６の雑音抑圧装置３０の動作を図７を参照して説明する。図７は、本実施の形態２の雑音抑圧装置３０の動作を説明するフローチャートである。図７において、ステップＢ１およびステップＢ２は、図２のステップＡ１およびＡ２と同様に、マイクロフォン１ａ、１ｂで車内の音響信号を収集し、オーディオ信号入力部２でオーディオ信号を入力する（ステップＢ１）。同時に、車両状態検出部３でそのときの車両状態を入力する（ステップＢ２）。実施の形態２では、オーディオ信号抑圧の前に特定話者位相信号を抽出する（ステップＢ３）。オーディオ信号抑圧以降のステップＢ４〜ステップＢ８はそれぞれ、図４のステップＡ３〜Ａ７と同様なので、説明を省略する。 Next, the operation of the noise suppression device 30 of FIG. 6 will be described with reference to FIG. FIG. 7 is a flowchart for explaining the operation of the noise suppression apparatus 30 according to the second embodiment. 7, step B1 and step B2 are similar to steps A1 and A2 of FIG. 2, in-vehicle acoustic signals are collected by the microphones 1a and 1b, and audio signals are input by the audio signal input unit 2 (step B1). . At the same time, the vehicle state at that time is input by the vehicle state detector 3 (step B2). In the second embodiment, a specific speaker phase signal is extracted before audio signal suppression (step B3). Steps B4 to B8 after audio signal suppression are the same as steps A3 to A7 in FIG.

本実施の形態では、複数のマイクロフォン（音響信号収集手段）１ａ、１ｂを備え、マイクロフォン１ａから話者までの距離と、マイクロフォン１ｂから話者までの距離の差に相当する位相差を有する信号の成分を抽出するので、それ以外の位相差を有する信号が抑圧され、不要な話者の音声と車両音が抑圧される。また、車外の音響のうち、異なる位相差の信号が抑圧されるので、実施の形態１と比較して、さらに雑音抑圧の効果がある。 In the present embodiment, a plurality of microphones (acoustic signal collecting means) 1a and 1b are provided, and a signal having a phase difference corresponding to the difference between the distance from the microphone 1a to the speaker and the distance from the microphone 1b to the speaker. Since the components are extracted, signals having other phase differences are suppressed, and unnecessary speaker's voice and vehicle sound are suppressed. Further, since signals with different phase differences are suppressed from the sound outside the vehicle, there is a further effect of noise suppression as compared with the first embodiment.

つぎに、本実施の形態２の変形例について説明する。図８は、実施の形態２の異なる例を示すブロック図である。図８の雑音抑圧装置３０では、図６の記憶部１１、平均化処理部１２および車両音抑圧手段１４が、マイクロコンピュータ２０で構成されている。 Next, a modification of the second embodiment will be described. FIG. 8 is a block diagram illustrating a different example of the second embodiment. In the noise suppression device 30 of FIG. 8, the storage unit 11, the averaging processing unit 12, and the vehicle sound suppression unit 14 of FIG.

マイクロコンピュータ２０は、制御部２１、主記憶部２２、外部記憶入出力部２３および入出力部２４から構成される。主記憶部２２、外部記憶入出力部２３および入出力部２４はいずれも内部バス２５を介して制御部２１に接続されている。図８の構成では、車両音−車両状態対比記憶部１３がマイクロコンピュータ２０の外部記憶部になっている。 The microcomputer 20 includes a control unit 21, a main storage unit 22, an external storage input / output unit 23 and an input / output unit 24. The main storage unit 22, the external storage input / output unit 23, and the input / output unit 24 are all connected to the control unit 21 via the internal bus 25. In the configuration of FIG. 8, the vehicle sound-vehicle state comparison storage unit 13 is an external storage unit of the microcomputer 20.

マイクロコンピュータの制御部２１はＣＰＵ（Central Processing Unit）等から構成され、外部記憶部に記憶されているプログラムに従って、音響信号を入力し、車両音の抑圧および平均化処理を実行する。 The control unit 21 of the microcomputer is composed of a CPU (Central Processing Unit) or the like, and receives an acoustic signal according to a program stored in an external storage unit, and executes vehicle sound suppression and averaging processing.

主記憶部２２はＲＡＭ（Random-Access Memory）等から構成され、制御部２１の作業領域として用いられる。 The main storage unit 22 includes a RAM (Random-Access Memory) and the like, and is used as a work area for the control unit 21.

車両音−車両情報対比記憶手段１３を含む外部記憶部は、フラッシュメモリ、ハードディスク、ＤＶＤ（Digital Versatile Disc）、ＤＶＤ−ＲＡＭ（Digital Versatile Disc Random-Access Memory）、ＤＶＤ−ＲＷ（Digital Versatile Disc Rewritable）等の不揮発性メモリから構成され、前記の処理を制御部２１に行わせるためのプログラムを予め記憶し、また、制御部２１の指示に従って、このプログラムやそのほかプログラムが利用するデータを制御部２１に供給し、制御部２１から供給されたデータを記憶する。外部記憶部は、車両音−車両状態対比記憶部１３を含み、制御部２１の指示に従って、車両状態に対応する平均化された車両音を供給し、また新たに平均化された車両音を更新して記憶する。 The external storage unit including the vehicle sound-vehicle information comparison storage means 13 includes a flash memory, a hard disk, a DVD (Digital Versatile Disc), a DVD-RAM (Digital Versatile Disc Random-Access Memory), and a DVD-RW (Digital Versatile Disc Rewritable). A program for causing the control unit 21 to perform the above processing is stored in advance, and in accordance with instructions from the control unit 21, the program and other data used by the program are stored in the control unit 21. The data supplied from the control unit 21 is stored. The external storage unit includes a vehicle sound-vehicle state comparison storage unit 13, and supplies an averaged vehicle sound corresponding to the vehicle state and updates a newly averaged vehicle sound according to an instruction from the control unit 21. And remember.

入出力部２４はシリアルインタフェース、パラレルインタフェース又はＬＡＮ（Local Area Network）インターフェースから構成されている。制御部２１は、入出力部２４を介して、オーディオ信号抑圧部９から音響信号を入力し、音声認識装置１５に送信するデータを出力する。 The input / output unit 24 includes a serial interface, a parallel interface, or a LAN (Local Area Network) interface. The control unit 21 inputs an acoustic signal from the audio signal suppression unit 9 via the input / output unit 24 and outputs data to be transmitted to the voice recognition device 15.

実施の形態２において、雑音抑圧装置３０のうち、特定話者位相信号抽出部７、オーディオ信号調節部８、オーディオ信号抑圧部９、アドレス生成部１０、車両音抑圧部１４は、その全部または一部をＤＳＰ（Digital Signal Processor）で構成することができる。すなわち、特定話者位相信号抽出部７、オーディオ信号調節部８、オーディオ信号抑圧部９、アドレス生成部１０、マイクロコンピュータ２０を含んで、ＤＳＰとしてもよい。 In the second embodiment, the specific speaker phase signal extraction unit 7, the audio signal adjustment unit 8, the audio signal suppression unit 9, the address generation unit 10, and the vehicle sound suppression unit 14 of the noise suppression device 30 are all or one of them. The unit can be configured by a DSP (Digital Signal Processor). That is, the DSP may include a specific speaker phase signal extraction unit 7, an audio signal adjustment unit 8, an audio signal suppression unit 9, an address generation unit 10, and a microcomputer 20.

マイクロコンピュータまたはＤＳＰを用いることによって、各処理をプログラムで記述するので、時間間隔の調整や位相差などのパラメータを変更することが可能になり、異なる車両に対応することが容易になる。 Since each process is described by a program by using a microcomputer or a DSP, parameters such as time interval adjustment and phase difference can be changed, and it becomes easy to deal with different vehicles.

本発明の実施の形態１に係る雑音抑圧装置のブロック図である。It is a block diagram of the noise suppression apparatus which concerns on Embodiment 1 of this invention. 車両音−車両状態対比記憶部の構造の例を示す図である。It is a figure which shows the example of the structure of a vehicle sound-vehicle state comparison memory | storage part. 音響信号から車両音を抑圧する様子の例を示す模式図である。It is a schematic diagram which shows the example of a mode that a vehicle sound is suppressed from an acoustic signal. 実施の形態１の動作を示すフローチャートである。3 is a flowchart showing the operation of the first embodiment. 実施の形態２に係る音響信号収集手段の配置の例を示す平面図である。6 is a plan view showing an example of arrangement of acoustic signal collecting means according to Embodiment 2. FIG. 実施の形態２に係る雑音抑圧装置のブロック図である。5 is a block diagram of a noise suppression device according to Embodiment 2. FIG. 実施の形態２の動作を示すフローチャートである。10 is a flowchart showing the operation of the second embodiment. 実施の形態２の異なる例を示すブロック図である。FIG. 10 is a block diagram illustrating a different example of the second embodiment.

Explanation of symbols

１、１ａ、１ｂマイクロフォン（音響信号収集手段）
２オーディオ信号入力部
３車両状態検出部
５、５ａ、５ｂＡ／Ｄ変換器
６Ａ／Ｄ変換器
７特定話者位相信号抽出部
９オーディオ信号抑圧部
１１記憶部
１２平均化処理部
１３車両音−車両状態対比記憶部
１４車両音抑圧部
１５音声認識装置
３０雑音抑圧装置 1, 1a, 1b microphone (acoustic signal collecting means)
2 Audio signal input section
3 Vehicle state detector 5, 5a, 5b A / D converter
6 A / D converter
7 Specific speaker phase signal extraction unit
9 Audio signal suppressor
11 Memory unit
12 Averaging processor
13 Vehicle sound-vehicle state comparison storage unit
14 Vehicle sound suppression part
15 Voice recognition device
30 Noise suppressor

Claims

A noise suppression device that suppresses noise in a voice signal input to a voice recognition device in order to recognize a speaker's voice in a vehicle,
Acoustic signal collecting means provided in the vehicle for collecting the sound inside the vehicle;
Vehicle state detecting means for detecting a vehicle state that is an operating state of each device of the vehicle;
A storage unit that stores the acoustic signals collected by the acoustic signal collecting unit in association with the vehicle state when collected;
A vehicle sound-vehicle state comparison storage unit that stores information obtained by averaging the acoustic signals stored for each vehicle state over a predetermined period in association with the vehicle state;
Suppression means for subtracting and suppressing the averaged information corresponding to the vehicle state when the acoustic signal is collected from the acoustic signal collected by the acoustic signal collecting means;
A noise suppression device comprising:

A plurality of acoustic signal collecting means;
A specific speaker that extracts only an acoustic signal having a phase difference equal to a difference in distance between a position of a specific speaker to be recognized by the speech recognition apparatus and each of the plurality of acoustic signal collecting means from the acoustic signal. Phase signal extraction means;
With
The specific speaker phase signal extraction unit has a phase difference equal to a difference in distance between the specific speaker and each of the plurality of acoustic signal collection units from the acoustic signal collected by the plurality of acoustic signal collection units. The noise suppression according to claim 1, wherein an acoustic component having a phase different from a difference in distance between the specific speaker and each of the plurality of acoustic signal collecting units is suppressed by extracting the component. apparatus.

When the vehicle includes an audio device, the vehicle further includes an audio signal suppression unit that suppresses the sound signal of the audio device from the sound signal collected by the sound signal collection device to obtain the sound signal inside the vehicle. Item 2. The noise suppression device according to Item 1.

The noise suppression device according to claim 1, wherein the acoustic information of the noise suppression device is digitized and processed as frequency information and / or time axis information.

As the operating state of each device of the vehicle, vehicle speed, accelerator opening, engine speed, acceleration / deceleration state, transmission state, steering steering angle, wiper operating state, air conditioner operating state, electric fan operating state, window opening / closing state The noise suppression device according to claim 1, comprising any combination of a horn operating state and a type of tire mounted on the vehicle.