JPH02254499A

JPH02254499A - Voice input device

Info

Publication number: JPH02254499A
Application number: JP1077776A
Authority: JP
Inventors: Eiji Horiba; 堀場　英二; Naomasa Miwa; 三輪　尚正; Kazuyuki Umebayashi; 梅林　和幸
Original assignee: Aisin Seiki Co Ltd
Current assignee: Aisin Corp
Priority date: 1989-03-29
Filing date: 1989-03-29
Publication date: 1990-10-15
Anticipated expiration: 2015-01-24
Also published as: JP3003037B2

Abstract

PURPOSE:To reduce other noise component than an object voice and to output a clear sound signal by analyzing a frequency characteristic of a sound signal, and adjusting a frequency characteristic of a variable filter means in accordance with a result of analysis. CONSTITUTION:A signal mainly composed of a voice detected by a front microphone 2 is inputted to a noise canceller 200 through an LPF 10, an A/D converter 20, a BPF 30 and an amplifier 40. On the other hand, a signal mainly composed of a noise detected by a rear microphone 6 is inputted to the canceller 200 through an LPF 50, an A/D converter 60, a BPF 70 and an amplifier 80. A CPU 90 analyzes a voice of a drive within a prescribed time, when an ignition switch IGS is turned on, and adjusts a frequency characteristic of the BPF 30 and 70. In such a way, an unnecessary noise component is eliminated, and a clear sound signal can be outputted to a navigation unit 400, etc.

Description

【発明の詳細な説明】［発明の目的］［産業上の利用分野］本発明は、音声入力装置に関し、例えば、音声認識によ
って車上機器を制御する場合に利用できる。DETAILED DESCRIPTION OF THE INVENTION [Object of the Invention] [Industrial Application Field] The present invention relates to a voice input device, and can be used, for example, when controlling on-vehicle equipment by voice recognition.

［従来の技術］例えば車輌上でドライバが各種車載機器を操作する場合
、操作可能なスイッチの数が多いと、操作すべきスイッ
チの位置を捜し、指をそのスイッチ上に位置決めするま
での確認を視覚的に行なう必要があるので、車輌の走行
中は安全性の点で問題がある。[Prior Art] For example, when a driver operates various in-vehicle devices in a vehicle, and there are a large number of operable switches, he has to search for the position of the switch to be operated and check until his finger is positioned on the switch. Since this must be done visually, there are safety issues when the vehicle is in motion.

そこで、視覚的な操作を不要にするために、音声認識装
置を用いて、車載機器を音声入力によって制御すること
が提案されている。Therefore, in order to eliminate the need for visual operations, it has been proposed to use a voice recognition device to control in-vehicle equipment through voice input.

ところが、特に車上においては、入力音声とともに認識
装置に入力される外部雑音のレベルが大きい場合があり
、音声認識に著しく不都合をきたす場合がある。外部雑
音としては、車外からの音。However, especially in a car, the level of external noise that is input to the recognition device together with the input voice may be large, which may significantly impede voice recognition. External noise is the sound from outside the car.

自軍のエンジン音、車内の音響装置からの音等々がある
。There are the engine sounds of your own troops, the sounds from the sound equipment inside the car, etc.

そこで、特公昭６３−２９７５５号公報の技術において
は、特定のスイッチが操作されろと、音声入力に備えて
、雑音のレベルを低減するために。Therefore, the technique disclosed in Japanese Patent Publication No. 63-29755 is used to reduce the noise level in preparation for voice input when a specific switch is operated.

音響装置の出力を下げ、！！、ガラスを閉じるようにし
ている。Lower the output of the sound device! ! , trying to close the glass.

［発明が解決しようとする課題］従来例（特公昭６３−２９７５５号公報）によれば、雑
音の影響を小さくし、音声認識装置の認識率を向上させ
ることができる。しがしながら。[Problems to be Solved by the Invention] According to the conventional example (Japanese Patent Publication No. 63-29755), it is possible to reduce the influence of noise and improve the recognition rate of the speech recognition device. While doing so.

音声認識を行なう度にドライバがスイッチを操作しなけ
ればならないし、窓ガラスが全閉になるまで待たなけれ
ばならないので、音声入力を開始できるまでに時間がか
かり、ドライバは煩わしい操作を強いられる。Each time voice recognition is performed, the driver must operate a switch and wait until the window glass is fully closed, so it takes time before voice input can begin, and the driver is forced to perform cumbersome operations.

本発明は、各種の音声入力装置において、目的とする音
声以外の雑音成分を低減し明瞭な音声信号を出力するこ
とを課題とする。SUMMARY OF THE INVENTION An object of the present invention is to output clear audio signals by reducing noise components other than target audio in various audio input devices.

［発明の構成］［課題を解決するための手段］上記課題を解決するために、本発明においては、音声を
電気信号に変換する、音声入力手段；前記音声入力手段
から出力される電気信号の周波数特性を調整する可変フ
ィルタ手段；スイッチ手段；前記音声入力手段から出力
される電気信号を分析してその周波数特性を検知する１
周波数特性検知手段；及び前記スイッチ手段の動作に応
答して、前記音声入力手段から出力される電気信号を前
記周波数特性検知手段で分析し、その分析結果に応じて
前記可変フィルタ手段の周波数特性を調整する、制御手
段；を設ける。[Structure of the Invention] [Means for Solving the Problems] In order to solve the above problems, the present invention provides a voice input means for converting voice into an electric signal; Variable filter means for adjusting frequency characteristics; Switch means; 1 for analyzing the electrical signal output from the audio input means and detecting its frequency characteristics;
frequency characteristic detection means; and in response to the operation of the switch means, the electrical signal output from the audio input means is analyzed by the frequency characteristic detection means, and the frequency characteristic of the variable filter means is determined according to the analysis result. A control means for adjusting is provided.

［作用］本発明においては、音声は、例えばマイクロホンなどの
音声入力手段によって電気信号に変換された後、可変フ
ィルタ手段を通り、出力される。[Function] In the present invention, audio is converted into an electrical signal by audio input means such as a microphone, and then passed through variable filter means and output.

可変フィルタ手段は、例えばバンドパスフィルタを構成
し、目的とする音声の周波数帯域以外の雑音成分を低減
する。また、可変フィルタ手段の周波数特性は、スイッ
チ手段の動作に応答して起動する調整モードにおいて、
制御手段により自動的に設定される。The variable filter means constitutes a bandpass filter, for example, and reduces noise components outside the frequency band of the target voice. Further, the frequency characteristics of the variable filter means are adjusted in the adjustment mode activated in response to the operation of the switch means.
Automatically set by the control means.

即ち、外部雑音が実質上存在しない状態（例えば自動車
のエンジンスタート前）で、実際の話者（例えばドライ
バ）がスイッチ手段を操作して、調整モードを起動した
後、その話者が音声を発すると、雑音を含まない音声成
分だけの周波数特性が分析され、その周波数特性と対応
するように、つまりその話者の音声帯域の信号だけを通
過させ他の周波数成分を低減するように、可変フィルタ
手段の周波数特性が調整され４゜音声認識などを行なう場合、雑音成分を低減するために
フィルタを設けることは従来でも行なわれている。しか
し、人間の発する音声の周波数特性には大きな個人差が
あるので、不特定話者の音声認識を行なう場合にはフィ
ルタの通過帯域を狭くすることはできず、従来の装置で
は、実際の話者の周波数帯域を外れる雑音成分を充分に
低減することができなかった。That is, when an actual speaker (e.g., a driver) activates the adjustment mode by operating a switch means in a state where there is substantially no external noise (e.g., before starting the engine of a car), the speaker makes a sound. Then, the frequency characteristics of only the voice components that do not include noise are analyzed, and a variable filter is applied so as to correspond to the frequency characteristics, that is, to pass only the signal in the voice band of the speaker and reduce other frequency components. When the frequency characteristics of the means are adjusted to perform 4° speech recognition, it has been conventionally practiced to provide a filter to reduce noise components. However, since there are large individual differences in the frequency characteristics of human speech, it is not possible to narrow the passband of the filter when performing speech recognition for unspecified speakers. It was not possible to sufficiently reduce noise components outside the user's frequency band.

それに対して本発明を用いれば、スイッチ手段の操作に
応答して、実際の話者の音声の周波数特性が測定され、
それに適合するように可変フィルタ手段の周波数特性が
自動的に調整されるので。In contrast, if the present invention is used, the frequency characteristics of the actual speaker's voice are measured in response to the operation of the switch means,
Since the frequency characteristics of the variable filter means are automatically adjusted to suit it.

フィルタの帯域を狭くすることができ１話者の周波数帯
域を外れろ周波数の雑音成分を大幅に低減することがで
きる。The band of the filter can be narrowed, and noise components at frequencies outside the frequency band of one speaker can be significantly reduced.

なお、本発明の音声入力装置は、音声認識に限らず、雑
音状況下で特定話者の音声信号だけを抽出する必要のあ
る様々な用途（例えば電話、放送。Note that the voice input device of the present invention is applicable not only to voice recognition, but also to various applications where it is necessary to extract only the voice signal of a specific speaker under noisy conditions (for example, telephone, broadcasting, etc.).

録音等々）においても利用できるのは言うまでもない。Needless to say, it can also be used for recording, etc.).

本発明の他の目的及び特徴は、以下の、図面を参照した
実施例説明により明らかになろう。Other objects and features of the present invention will become apparent from the following description of embodiments with reference to the drawings.

［実施例］第１図に、本発明を実施する装置を搭載した自動車の車
室内の外観を示す。この実施例においては、ナビゲーシ
ョン装置の操作を音声入力によって行なうシステムが備
わっている。[Example] Fig. 1 shows the appearance of the interior of an automobile equipped with a device implementing the present invention. This embodiment is equipped with a system for operating the navigation device through voice input.

第１図を参照すると、センタコンソール部分にナビゲー
ション装置のモニタテレビ３が組込まれており、その近
傍の隠れた位置に、ナビゲーション装置の本体であるナ
ビゲーシ目ンユニット４が設けられている。また、モニ
タテレビ３の上方の、ダツシュボード１上の略中央部（
車輌の左右方向に対する中央）に、マイクロホン２が組
込まれている。このマイクロホン２は、ドライバが発声
する音声を検知するために設けられている。Referring to FIG. 1, a monitor television 3 of a navigation device is incorporated in the center console portion, and a navigation eye unit 4, which is the main body of the navigation device, is provided in a hidden position near the monitor television 3. In addition, the approximate center of the dash board 1 above the monitor television 3 (
A microphone 2 is incorporated in the center of the vehicle in the left-right direction. This microphone 2 is provided to detect the voice uttered by the driver.

また、リアトレイ５上には、左端及び右端の互いに対称
な位置に、スピーカ７及び８が組込まれている。これら
のスピーカ７．８は、車上に備わったオーディオ装置及
びテレビ受像機の音信号を音響に変換して出力するため
に利用される。更にリアトレイ５上には、その左右方向
の中央部（スピーカ７と８に対して等距離の位置）に、
マイクロホン６が組込まれている。このマイクロホン６
は、主としてスピーカ７．８から出力される音響を、音
声認識の際のノイズとして検出するために設けられてい
る。Further, speakers 7 and 8 are installed on the rear tray 5 at mutually symmetrical positions on the left end and the right end. These speakers 7.8 are used to convert sound signals from an on-board audio device and a television receiver into sound and output the sound. Furthermore, on the rear tray 5, at the center in the left and right direction (at a position equidistant from the speakers 7 and 8),
A microphone 6 is incorporated. This microphone 6
is provided mainly to detect the sound output from the speaker 7.8 as noise during speech recognition.

即ち、前方のマイクロホン２は、ドライバの音声の他に
、スピーカ７．８から出る音響や様々な空気振動を同時
に検出してしまうので、マイクロホン２で検出される電
気信号には比較的大きなノイズ成分が含まれる場合が多
い。そこで、後方のマイクロホン６でノイズ成分の信号
を検出し、２つのマイクロホン２，６で検出した電気信
号を合成することにより、ノイズ成分のレベルが小さく
比較的明瞭度の高い音声信号を抽出するようにしている
。In other words, the front microphone 2 simultaneously detects the driver's voice, the sound coming from the speaker 7.8, and various air vibrations, so the electrical signal detected by the microphone 2 contains a relatively large noise component. is often included. Therefore, by detecting the noise component signal with the rear microphone 6 and synthesizing the electrical signals detected by the two microphones 2 and 6, an audio signal with a relatively high clarity and a low noise component level is extracted. I have to.

ところで、ステレオ音源の音響を再生する場合。By the way, when playing the sound of a stereo sound source.

２つのスピーカ７．８は、互いに異なる音響を発生する
。従って、２つのスピーカ７．８から出る各々のノイズ
の成分を、マイクロホン２で検出した信号から除去する
ためには、検出される各々のノイズ成分と、それらのマ
イクロホン２で検出されるノイズ成分との時間及び振幅
を、それぞれ−致させる必要がある。The two speakers 7.8 generate mutually different sounds. Therefore, in order to remove each noise component emitted from the two speakers 7.8 from the signal detected by the microphone 2, it is necessary to remove each detected noise component and the noise component detected by the microphone 2. It is necessary to match the time and amplitude of -, respectively.

しかしこの実施例では、前方のマイクロホン２が前方中
央部に配置してあり、ノイズ源であるスピーカ７．８は
車室内の互いに左右対称な位置に配置されているので、
左側のスピーカ７から出た音響がマイクロホン２で検出
されるまでの伝播遅延時間と、右側のスピーカ８から出
た音響がマイクロホン２で検出されるまでの伝播遅延時
間とは実質上一致する。しかも、後方のマイクロホン６
が２つのスピーカ７．８の中央部に配置しであるので、
左側のスピーカ７から出た音響がマイクロホン６で検出
されるまでの伝播遅延時間と、右側のスピーカ８から出
た音響がマイクロホン６で検出されるまでの伝播遅延時
間とが一致する。However, in this embodiment, the front microphone 2 is placed at the center of the front, and the speakers 7.8, which are noise sources, are placed at symmetrical positions in the vehicle interior.
The propagation delay time until the sound emitted from the left speaker 7 is detected by the microphone 2 and the propagation delay time until the sound emitted from the right speaker 8 is detected by the microphone 2 substantially match. Moreover, the rear microphone 6
is placed in the center of the two speakers 7 and 8, so
The propagation delay time until the sound emitted from the left speaker 7 is detected by the microphone 6 matches the propagation delay time until the sound emitted from the right speaker 8 is detected by the microphone 6.

従って、左側のスピーカ７から出た同一の音響が前方の
マイクロホン２で検出される時と、後方のマイクロホン
６で検出される時との時間差ＴｄＬと右側のスピーカ８
から出た同一の音響が前方のマイクロホン２で検出され
ろ時と、後方のマイクロホン６で検出される時との時間
差Ｔｄｒとが一致する。Therefore, the time difference TdL between when the same sound emitted from the left speaker 7 is detected by the front microphone 2 and when it is detected by the rear microphone 6,
The time difference Tdr between when the same sound emitted from the microphone 2 is detected by the front microphone 2 and when the same sound is detected by the rear microphone 6 is the same.

つまり、特定の時間差Ｔｄ　（＝Ｔｄｌ＝Ｔｄｒ）によ
って、マイクロホン６で検出した信号を遅延させてやれ
ば、遅延した信号のタイミングを、マイクロホン２で検
出した信号のノイズ成分のタイミングと一致させること
ができ、それらの差分を抽出することにより、左及び右
のいずれのスピーカから出たノイズについても、それを
低減して、音声成分を明瞭にすることができる。この場
合、ノイズ成分を検出するためのマイクロホン及びそれ
が検出した信号を処理する電気回路が１ｆｆｉだけで済
み、装置の構成が簡単になる。In other words, if the signal detected by microphone 6 is delayed by a specific time difference Td (=Tdl=Tdr), the timing of the delayed signal can be made to match the timing of the noise component of the signal detected by microphone 2. By extracting the difference between them, it is possible to reduce the noise emitted from both the left and right speakers and make the audio component clear. In this case, only 1ffi is required for the microphone for detecting the noise component and the electric circuit for processing the signal detected by the microphone, which simplifies the configuration of the device.

第２図に、第１図の自動車の搭載した音声入力装置の電
装部の構成を示す、第２図を参照して説明する。前方の
マイクロホン２で検出された音声を主体とする信号は、
ローパスフィルタ１０で周波数の高い成分（Ａ／Ｄ変換
のサンプリング周波数の半分以上の周波数）が除去され
た後、Ａ／Ｄ（アナログ／デジタル）変換器２０によっ
てデジタル信号に変換され、バンドパスフィルタ３０を
通り、増幅器（乗算器）４０を通って、ノイズキャンセ
ラ２００の一方の入力端子に印加される。また、後方の
マイクロホン６で検出されたノイズを主体とする信号は
、ローパスフィルタ５０で周波数の高い成分が除去され
た後、Ａ／Ｄ変換器６０によってデジタル信号に変換さ
れ、バンドパスフィルタ７０を通り、増幅器（乗算器）
８０を通ってノイズキャンセラ２００の他方の入力端子
に印加される。Description will be made with reference to FIG. 2, which shows the configuration of an electrical component of the voice input device mounted on the vehicle of FIG. 1. The signal mainly consisting of voice detected by the front microphone 2 is
After high-frequency components (frequency equal to or higher than half of the sampling frequency of A/D conversion) are removed by a low-pass filter 10, the signals are converted into digital signals by an A/D (analog/digital) converter 20, and then passed through a band-pass filter 30. The signal passes through the amplifier (multiplier) 40 and is applied to one input terminal of the noise canceller 200. Further, a signal mainly composed of noise detected by the rear microphone 6 is converted into a digital signal by an A/D converter 60 after high frequency components are removed by a low-pass filter 50, and then passed through a band-pass filter 70. street, amplifier (multiplier)
80 and is applied to the other input terminal of the noise canceller 200.

バンドパスフィルタ３０及び７０は１話者、即ちドライ
バの音声の周波数帯域と一致する周波数の信号成分だけ
を通過させるデジタルフィルタである。勿論１人間の音
声の周波数帯域には大きな個人差があるので、バンドパ
スフィルタ３０及び７０の特性を実際の話者の特性に適
合させる調整作業が必要になる。The bandpass filters 30 and 70 are digital filters that pass only signal components of frequencies that match the frequency band of the voice of one speaker, that is, the driver. Of course, since there are large individual differences in the frequency band of human speech, adjustment work is required to adapt the characteristics of the bandpass filters 30 and 70 to the characteristics of the actual speaker.

しかしこの実施例においては、バンドパスフィルタ３０
．７０の周波数特性の調整を自動的に行なうように構成
しであるので、通常の音声入力時における調整は不要に
なっている。However, in this embodiment, the bandpass filter 30
．． Since the frequency characteristics of 70 are configured to be adjusted automatically, there is no need for adjustment during normal audio input.

即ち、この装置に備わったマイクロコンピュータ（以下
、ＣＰＵと記載する）９０は、特定の条件の時にＡ／Ｄ
変換器２０から出力される信号の周波数特性を分析し、
その結果に応じてバンドパスフィルタ３０．７０の特性
を調整するようになっている。That is, a microcomputer (hereinafter referred to as CPU) 90 provided in this device controls the A/D under certain conditions.
Analyzing the frequency characteristics of the signal output from the converter 20,
The characteristics of the bandpass filters 30.70 are adjusted according to the results.

ＣＰＵ９０には、それに指示を与えるために。To the CPU 90, to give instructions to it.

イグニッションスイッチＩＧＳとキャンセルスイッチＣ
３が接続されている。Ignition switch IGS and cancel switch C
3 is connected.

ＣＰＵ９０の動作の概略を第３図に示す、第３図を参照
してＣＰＵ９０の動作を説明する。ステップ１では、イ
グニッションスイッチＩＧＳの電気接点がアクセサリ位
置ＡＣＣにあるか否かを識別する。ＩＧＳがＡＣＣ位置
になると、次の処理に進む。An outline of the operation of the CPU 90 is shown in FIG. 3, and the operation of the CPU 90 will be described with reference to FIG. Step 1 identifies whether the electrical contacts of the ignition switch IGS are in the accessory position ACC. When the IGS reaches the ACC position, the process proceeds to the next step.

ステップ２ではタイマｌをスタートし、ステップ３では
タイマ１の計数する時間ｔ１を参照する。In step 2, timer 1 is started, and in step 3, time t1 counted by timer 1 is referred to.

ｔｌが０．５秒になると、つまりイグニッションスイッ
チがＡＣＣ位置になってから０．５秒が経過すると、ス
テップ４に進む。When tl reaches 0.5 seconds, that is, when 0.5 seconds have passed since the ignition switch was placed in the ACC position, the process proceeds to step 4.

ステップ４では、タイマ２をスタートし次のステップ５
に進む。ステップ５では、Ａ／Ｄ変換器２０から一定の
サンプリング周期で出力されるデジタル信号を順次に入
力し、内部のメモリ上にストアする。ステップ５の処理
は、ｔ２．即ちタイマ２の計数値が２秒になるまで繰り
返される。つまり、２秒間の間に検出された音響情報の
デジタルデータが、メモリ上に蓄積される。In step 4, start timer 2 and proceed to step 5.
Proceed to. In step 5, digital signals outputted from the A/D converter 20 at a constant sampling period are sequentially input and stored on the internal memory. The processing in step 5 is performed at t2. That is, the process is repeated until the count value of timer 2 reaches 2 seconds. In other words, digital data of acoustic information detected for two seconds is stored in the memory.

ＣＰＵ９０がステップ５，６を実行する際中には、話者
となるドライバは、任意の言葉を発声する必要がある。While the CPU 90 executes steps 5 and 6, the driver, who is the speaker, needs to utter arbitrary words.

ドライバが２秒間の発声を行なうと、その音声波形の情
報が測定され、その結果がメモリ上に保存される。When the driver speaks for two seconds, information on the audio waveform is measured and the results are stored in memory.

ＣＰＵ９０は、ステップ７に進むと、メモリ上に蓄積さ
れた音声波形の情報について、公知の高速フーリエ変換
（ＦＦＴ）処理を実行する。その結果、測定したドライ
バの音声エネルギーの周波数分布のデータが得られる。Proceeding to step 7, the CPU 90 executes a known fast Fourier transform (FFT) process on the audio waveform information stored in the memory. As a result, data on the frequency distribution of the measured audio energy of the driver is obtained.

このデータを更に処理し、音声周波数帯域の上限周波数
と下限周波数を求める。具体的には、この実施例では、
エネルギー分布の最大値に対して所定比率以上のエネル
ギーが検知された全周波数範囲の上限及び下限を求めろ
ようにしている。This data is further processed to determine the upper limit frequency and lower limit frequency of the audio frequency band. Specifically, in this example,
The upper and lower limits of all frequency ranges in which energy of a predetermined ratio or higher is detected relative to the maximum value of the energy distribution are determined.

次のステップ８では、ステップ７で求めた周波数の上限
値及び下限値と一致する選択信号を生成する。In the next step 8, a selection signal matching the upper and lower frequency limits determined in step 7 is generated.

第２図を参照すると、ＣＰＵ９０に接続されたＲＯＭ　
１００には、バンドパスフィルタ３０及び７０の周波数
特性を決定する係数のデータが、周波数帯域毎に区分し
て、互いに異なるアドレス領域に予め記憶させである。Referring to FIG. 2, the ROM connected to the CPU 90
In 100, data of coefficients that determine the frequency characteristics of the bandpass filters 30 and 70 are stored in advance in different address areas, divided into frequency bands.

従って、ＣＰＵ９０は、ステップ７で求めた周波数帯域
に割り当てた係数データを記憶したメモリアドレスを選
択する信号ＳＥＬを生成し、それをＲＯＭ　１００のア
ドレス端子に印加する。Therefore, the CPU 90 generates a signal SEL for selecting the memory address storing the coefficient data assigned to the frequency band determined in step 7, and applies it to the address terminal of the ROM 100.

次のステップ９では、キャンセルスイッチＣ８の状態を
チエツクする。キャンセルスイッチＳＣがオンの場合に
は、ステップ２に戻って再び分析の処理を行なう。In the next step 9, the state of the cancel switch C8 is checked. If the cancel switch SC is on, the process returns to step 2 and the analysis process is performed again.

次のステップ１０では、イグニッションスイッチＩＧＳ
が、０８位置か否かを識別する。ＯＮ位置を検知した場
合には次にステップ１１に進む。In the next step 10, the ignition switch IGS
is at the 08 position. If the ON position is detected, the process proceeds to step 11.

ステップ１１では、バンドパスフィルタ３０及び７０を
制御して、ＲＯＭ　１００が出力する係数データＰｆを
各々のフィルタに係数としてラッチさせる。In step 11, the bandpass filters 30 and 70 are controlled to cause each filter to latch the coefficient data Pf output from the ROM 100 as a coefficient.

ステップ１２では、イグニッションスイッチ■ＧＳがＯ
ＦＦ位置か否かを識別する。○ＦＦ位置を検知すると、
次にステップ１に戻る。In step 12, the ignition switch ■GS is turned to O.
Identifies whether it is in the FF position or not. ○When the FF position is detected,
Then return to step 1.

つまり、自動車のエンジンが停止している状態（ＩＧＳ
がＯＦＦ位置）でイグニッションキーをＯＦＦ位置から
ＡＣＣ位置に動かすと、ドライバの声の周波数特性を分
析するモードに入り、ノイズの少ない状態でドライバの
音声のみの周波数特性を自動的に測定し、それに適合す
るように、バンドパスフィルタ３０．７０の特性が自動
的に調整される。In other words, the car engine is stopped (IGS
When the ignition key is moved from the OFF position to the ACC position, the system enters the mode that analyzes the frequency characteristics of the driver's voice. The characteristics of the bandpass filter 30.70 are automatically adjusted to suit.

なお、ＣＰＵ９０はその内部に不揮発性メモリを備えて
おり、ＣＰｔＪ９０が出力する選択信号ＳＥＬの状態は
、装置の電源がオフした場合でも保存されろ。この実施
例では一担１話者の特性にバンドパスフィルタの特性を
適合させた後は、話者、即ちドライバが変わらない限り
、再びその調整を行なう必要はない。Note that the CPU 90 includes a nonvolatile memory therein, and the state of the selection signal SEL output by the CPtJ 90 is saved even when the power of the device is turned off. In this embodiment, once the characteristics of the bandpass filter are adapted to the characteristics of a single speaker, there is no need to make the adjustment again unless the speaker, ie, the driver, changes.

つまり、バンドパスフィルタの調整が済んでいる場合に
は、ドライバは停止法服のエンジンをスタートする時に
通常の操作を行なえばよい。ドライバが、イグニッショ
ンキーの位置をＯＦＦ位置からＡＣＣ位置に切換えた後
、時間待ちをすることなく直ちにＯＮ位置に切換えれば
、ＣＰＵ９０の処理は、第３図のステップ１−２−３−
１３を通って１１に進むので１選択信号ＳＥＬの更新は
行なわれず、電源がオフする前と同一の選択信号ＳＥＬ
によって、バンドパスフィルタ３０．７０の特性は、以
前に調整した周波数帯域特性と同一の特性に設定される
。In other words, if the bandpass filter has been adjusted, the driver only needs to perform the normal operation when starting the engine of the stopped suit. If the driver switches the ignition key from the OFF position to the ACC position and then immediately switches it to the ON position without waiting, the process of the CPU 90 will be performed in step 1-2-3- of FIG.
13 and proceeds to 11, the 1 selection signal SEL is not updated, and the selection signal SEL is the same as before the power was turned off.
Accordingly, the characteristics of the bandpass filter 30.70 are set to the same characteristics as the previously adjusted frequency band characteristics.

再び第２図を参照すると、ノイズキャンセラ２００には
、遅延回路２１０．適応フィルタ２２０及び係数制御回
路２３０が何わっている。遅延回路２１０は、所定時間
（６、４ｍ５ｅｃ）入力信号を遅延した信号ｄｋを出力
する。適応フィルタ２２０は、１２７段の遅延要素（Ｚ
″″’）、１２８段の可変増幅要素（Ａ１−Ａ１２８：
乗算器）及び１２８段の加算要素を含んでおり、可変増
幅要素の各々に設定する係数を調整することにより、こ
のフィルタの特性を様々に変化させろことができる。Referring again to FIG. 2, the noise canceller 200 includes a delay circuit 210. The adaptive filter 220 and the coefficient control circuit 230 are different. The delay circuit 210 outputs a signal dk obtained by delaying the input signal for a predetermined period (6,4 m5ec). The adaptive filter 220 has 127 stages of delay elements (Z
""), 128 stage variable amplification element (A1-A128:
The filter includes a multiplier) and 128 stages of addition elements, and the characteristics of this filter can be varied in various ways by adjusting the coefficients set for each of the variable amplification elements.

係数制御回路２３０は、遅延回路２１０が出力する（、
１号ｄｋと適応フィルタ２２０が出力する信号ｙｋとの
差分ｅｋを入力し、その二乗平均値が最小になるような
係数群を生成し、それらを可変増幅要素の各々に印加す
る。この例では、係数制御回路２３０は、公知のＬＭＳ
アルゴリズムを実行するようになっている。The coefficient control circuit 230 outputs the output from the delay circuit 210 (,
The difference ek between the No. 1 dk and the signal yk output by the adaptive filter 220 is input, a group of coefficients whose root mean square value is minimized is generated, and these coefficients are applied to each of the variable amplification elements. In this example, the coefficient control circuit 230 is a known LMS.
It is designed to run an algorithm.

つまり、ドライバの音声を目的とする信号成分＜Ｓ）と
し、それ以外の音響を全てノイズ成分（Ｎ）とみなせば
、増幅器４０からノイズキャンセラ２００に印加される
電気信号には、ＳとＮの両方の成分が含まれ、増幅器８
０からノイズキャンセラ２００に印加される電気信号は
、主としてＮの成分で構成されるので９両者を合成した
結果を最小にすることは、Ｓの成分だけを抽出すること
を意味する。In other words, if we consider the driver's audio as the target signal component <S) and all other sounds as noise components (N), the electrical signal applied from the amplifier 40 to the noise canceller 200 contains both S and N. contains the components of amplifier 8
Since the electrical signal applied to the noise canceller 200 from 9 is mainly composed of N components, minimizing the result of combining both means extracting only the S component.

ノイズキャンセラ２００に入力される２つの電気信号に
含まれるノイズ成分は互いに時間及び振幅が異なるが、
その違いは各々の信号を遅延回路２１０及び適応フィル
タ２２０を通すことによってなくすることができる。Although the noise components included in the two electrical signals input to the noise canceller 200 are different in time and amplitude,
The difference can be eliminated by passing each signal through a delay circuit 210 and an adaptive filter 220.

ノイズキャンセラ２００によって出力される抽出された
音声信号は、音声認識ユニット３００に印加されろ、こ
の音声認識ユニット３００は、公知の認識アルゴリズム
を実行して、ドライバの発した音声の認識を行ない、更
に、認識した音声情報と予め定めた指令語（拡大、縮小
、オン、オフ。The extracted audio signal output by the noise canceller 200 is applied to a voice recognition unit 300, which executes a known recognition algorithm to recognize the voice uttered by the driver; Recognized voice information and predetermined command words (enlarge, reduce, on, off).

上、下、右、左等々）の各々との適合の有無を識別し、
適合した場合には、その指令語に対応する指令信号を、
ナビゲーションユニット４００に出力する。(top, bottom, right, left, etc.).
If it matches, the command signal corresponding to the command word is
Output to navigation unit 400.

従ってこの例では、ドライバの音声入力によって、ナビ
ゲーションユニット４００に指示を与えてそれを制御す
ることができる。Therefore, in this example, the driver's voice input can provide instructions to and control the navigation unit 400.

なお、イグニッションスイッチＩＧＳがＡＣＣ位置の場
合、アクセサリであるオーディオ装置にも電源が供給さ
れるので、それの電源スィッチがオンであれば、ドライ
バの音声の周波数帯域を測定するモードでも、オーディ
オ装置から出力される音響、即ちノイズが検出され、測
定に誤りを生じる恐れがある。従ってその場合には、次
のような構成に変更するのが望ましい。即ち、オーディ
オ装置の電源をオン／オフするリレーなどのスイッチを
設けるとともに、第３図のステップ４の中で。Note that when the ignition switch IGS is in the ACC position, power is also supplied to the accessory audio device, so if the power switch for that device is on, even in the mode that measures the frequency band of the driver's audio, power is supplied from the audio device. Output sound, ie noise, may be detected and cause errors in measurements. Therefore, in that case, it is desirable to change the configuration to the following. That is, in addition to providing a switch such as a relay to turn on/off the power of the audio device, in step 4 of FIG.

該スイッチをオフしてオーディオ装置の電源を遮断し、
ステップ７の中でスイッチをオンしてオーディオ装置の
電源供給を許可するように制御を変える。Turn off the switch to cut off the power to the audio device,
In step 7, the control is changed to turn on the switch and permit power supply to the audio device.

なお、上記実施例においては、バンドパスフィルタ３０
．７０の特性調整を行なうＣＰＵ９０とノイズキャンセ
ラ２００の適応フィルタの制御を行なう係数制御回路２
３０とをそれぞれ独立に設けたが、両方の処理を１つの
ＣＰＵで実行することも可能である。Note that in the above embodiment, the bandpass filter 30
．． a CPU 90 that adjusts the characteristics of the noise canceller 200; and a coefficient control circuit 2 that controls the adaptive filter of the noise canceller 200;
Although 30 and 30 are provided independently, it is also possible to execute both processes by one CPU.

また実施例においては、デジタル信号処理によってドラ
イバの音声の周波数分析及びバンドパスフィルタの調整
を行なっているが、これらの処理はアナログ電気回路に
おき換えても同様に行なうことができる。バンドパスフ
ィルタ３０．７０をアナログ回路におき換えろ場合には
１例えば、多数のコンデンサをアナログスイッチによっ
て切換えるように構成すれば、デジタル処理の場合と同
様にフィルタの周波数特性を切換えることができる。Further, in the embodiment, the frequency analysis of the driver's voice and the adjustment of the bandpass filter are performed by digital signal processing, but these processes can be performed in the same way by replacing them with analog electric circuits. In the case of replacing the bandpass filters 30 and 70 with analog circuits, for example, if a large number of capacitors are configured to be switched by analog switches, the frequency characteristics of the filters can be switched in the same way as in the case of digital processing.

更に、実施例では音声認識の場合を示したが、本発明は
、単純にアナログ信号を処理する用途に利用する場合で
あっても、同様に様々なノイズを含む音響信号の中から
特定話者の音声成分だけを抽出することができる。Furthermore, although the embodiment shows the case of speech recognition, even when the present invention is used for simply processing analog signals, it is possible to identify a specific speaker from acoustic signals containing various noises. It is possible to extract only the audio components.

［効果］以上のとおり本発明によれば、可変フィルタ手段の周波
数特性を実際の話者の周波数帯域に正確に一致させるこ
とができるので、フィルタの通過帯域幅を充分に狭くす
ることができ、不必要な周波数成分を全てノイズとして
除去し必要な音声成分だけを抽出することができる。し
かも、可変フィルタ手段の特性の調整は自動的に行なわ
れるので操作上の煩わしさが生じない。[Effects] As described above, according to the present invention, the frequency characteristics of the variable filter means can be made to accurately match the frequency band of the actual speaker, so the passband width of the filter can be sufficiently narrowed. All unnecessary frequency components can be removed as noise and only the necessary audio components can be extracted. Moreover, since the characteristics of the variable filter means are automatically adjusted, there is no need for operational hassle.

[Brief explanation of drawings]

第１図は、本発明の装置を搭載した自動車の車室内の外
観を示す斜視図である。第２図は、実施例の音声入力装置の構成を示すブロック
図である。第３図はＣＰＵ９０の動作の概略を示すフローチャート
である。２：マイクロホン（音声入力手段）３：モニタテレビ　　　７，８：スピーカ１０．５０：
ローバスフィルタ２０．６０：Ａ／Ｄ変換器３０．７０　：バンドパスフィルタ（可変フィルタ手段
）９０：マイクロコンピュータ（周波数特性検知手段、
制御手段）　　　　　　１００　：　ＲＯＭ２００：ノ
イズキャンセラ２１０：遅延回路　　　２２０：適応フィルタ２３０：
係数制御回路　３００：音声認識ユニット４００：ナビ
ゲーションユニットＩＧＳ：イグニッションスイッチ（スイッチ手段）Ｃ８
：キャンセルスイッチFIG. 1 is a perspective view showing the exterior of the interior of an automobile equipped with the device of the present invention. FIG. 2 is a block diagram showing the configuration of the voice input device of the embodiment. FIG. 3 is a flowchart showing an outline of the operation of the CPU 90. 2: Microphone (audio input means) 3: Monitor TV 7, 8: Speaker 10.50:
Low-pass filter 20.60: A/D converter 30.70: Band-pass filter (variable filter means) 90: Microcomputer (frequency characteristic detection means,
Control means) 100: ROM 200: Noise canceller 210: Delay circuit 220: Adaptive filter 230:
Coefficient control circuit 300: Voice recognition unit 400: Navigation unit IGS: Ignition switch (switch means) C8
:Cancel switch

Claims

[Claims]

(1) Audio input means that converts audio into an electrical signal; Variable filter means that adjusts the frequency characteristics of the electrical signal output from the audio input means; Switch means; Analyzes the electrical signal output from the audio input means frequency characteristic detection means for detecting the frequency characteristics thereof; and in response to the operation of the switch means, the frequency characteristic detection means analyzes the electrical signal output from the voice input means, and according to the analysis result. A voice input device comprising: control means for adjusting the frequency characteristics of the variable filter means.

(2) The control means starts the analysis operation of the frequency characteristic detection means when a first predetermined time has elapsed since the switch means became in the first state, and when a second predetermined time has elapsed. The voice input device according to claim 1, wherein the analysis is terminated at a certain time.

(3) The switch means is an ignition switch of the vehicle, and the control means starts an analysis operation of the frequency characteristic detection means in response to switching the switch means to the accessory position, and ends the analysis. 2. The audio input device according to claim 1, wherein if the switch means is previously switched to the on position, the analysis operation is stopped and the frequency characteristic of the variable filter means maintains the previously set state.

(4) The voice input device according to claim 1, wherein a voice recognition means is connected to the output terminal of the variable filter means, and the voice recognition means controls onboard equipment according to the recognized voice.