JPH04240897A

JPH04240897A - Speech recognizer

Info

Publication number: JPH04240897A
Application number: JP3023711A
Authority: JP
Inventors: Hirofumi Yajima; 弘文矢島
Original assignee: Clarion Co Ltd
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 1991-01-25
Filing date: 1991-01-25
Publication date: 1992-08-28

Abstract

PURPOSE:To improve the recognition rate by compensating the defects of a directional microphone and an ultra-directional microphone. CONSTITUTION:This voice recognition device is provided with the directional microphone 1, the ultra-directional microphone 12, speech recognition parts 51 and 52, registration memories 61 and 62, a CPU 8 which controls them, etc. The speech recognition parts compare voice data obtained from the microphones with voice data previously registered in the memories and obtain the code and similarity of the voice data having the highest similarity by the comparison. The CPU 8 controls speech recognition according to the code and similarity so as to output the recognition output and then the SN ratio is improved; and the direction deviation of the microphones to a sound source is prevented and the recognition ratio becomes high.

Description

[Detailed description of the invention]

【０００１】0001

【産業上の利用分野】本発明は、車載用等の音声認識装
置に関し、特に超指向性マイクロホンを使用し認識率の
向上を図った音声認識装置に係る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition device for use in a vehicle, and more particularly to a voice recognition device that uses a super-directional microphone to improve the recognition rate.

【０００２】0002

【従来の技術】従来のパターンマッチング方式の車載用
音声認識装置では、運転中の発声を目的とするために、
マイクロホンをハンズフリーマイクロホンとして口元か
ら５０ｃｍ以上離して設定する必要があった。[Prior Art] Conventional pattern-matching in-vehicle voice recognition devices use
It was necessary to set the microphone as a hands-free microphone at least 50 cm away from the mouth.

【０００３】0003

【発明が解決しようとする課題】ところが、前述のよう
に単なる指向性マイクロホンを使うとＳ／Ｎがとれない
場合があるため、高騒音環境下では認識率の低下が見ら
れた。またＳ／Ｎの向上のために超指向性マイクロホン
を使うと、発声者の運転者に座る位置がずれたり、頭の
位置がずれたりすることにより、かえってＳ／Ｎがとれ
ないことがあった。それによる認識率の低下を避けるこ
とができなかった。[Problems to be Solved by the Invention] However, as mentioned above, if a simple directional microphone is used, the S/N ratio may not be maintained, so a reduction in the recognition rate was observed in a high-noise environment. Additionally, when using a super-directional microphone to improve S/N, the speaker's sitting position or head position may shift, resulting in poor S/N. . As a result, a decrease in recognition rate could not be avoided.

【０００４】発明の目的　　本発明の目的は、指向性マイクロホン、超指向性マ
イクロホンの欠点を補い、認識率の向上を図ることがで
きる音声認識装置を提供することにある。OBJECTS OF THE INVENTION An object of the present invention is to provide a speech recognition device that can compensate for the drawbacks of directional microphones and super-directional microphones and improve the recognition rate.

【０００５】[0005]

【課題を解決するための手段】このような目的を達成す
るために、本願の第１の発明の音声認識装置は、音声源
近傍の異なる位置に複数個設けられ、音声源からの音声
と雑音との音情報を取り込む、少なくとも１つが超指向
性のマイクロホンと、このマイクロホンから取り込まれ
た音情報を音声データに変換する音データ変換部と、マ
イクロホンから取り込まれた音声の登録時の音声データ
をコードとともに記憶する登録音声データ記憶部と、マ
イクロホンのそれぞれから取り込まれた音声の認識時の
音声データを記憶部に記憶された音声データと比較し、
この比較により、類似度の最も高い音声データのコード
および類似度に基づいて認識動作を行なう音声認識部と
、この音声認識部を動作制御し、上記コードおよび類似
度に基づいて認識結果を出力する認識制御部とからなる
ことを特徴とする。[Means for Solving the Problems] In order to achieve such an object, a plurality of speech recognition devices according to the first invention of the present application are provided at different positions near a speech source, and are capable of distinguishing between speech from the speech source and noise. a microphone, at least one of which is super-directional, that captures sound information from the microphone; a sound data converter that converts the sound information captured from the microphone into audio data; and a sound data converter that converts the sound information captured from the microphone into audio data during registration. Comparing the voice data at the time of recognition of the voice captured from each of the registered voice data storage unit stored together with the code and the microphone with the voice data stored in the storage unit,
Based on this comparison, a voice recognition unit that performs a recognition operation based on the code and similarity of the voice data with the highest degree of similarity, and the operation of this voice recognition unit are controlled to output a recognition result based on the code and the degree of similarity. It is characterized by comprising a recognition control section.

【０００６】更に上記装置において、本願の第２の発明
は前記複数のマイクロホンのうち、少なくとも１つが音
声源の最も近い位置に設けられ、かつ、全てが超指向性
のマイクロホンであって、該位置のマイクロホンによっ
て取り込まれた登録時の音声データを前記各登録音声デ
ータ記憶部に転送し、該音声データにより前記各音声認
識部の認識動作を行なうことを特徴とする。Furthermore, in the above device, a second invention of the present application is such that at least one of the plurality of microphones is provided at a position closest to the audio source, and all of the microphones are super-directional microphones, The voice data at the time of registration captured by the microphone is transferred to each of the registered voice data storage sections, and the recognition operation of each of the voice recognition sections is performed using the voice data.

【０００７】[0007]

【作用】本発明の装置は、音声源近傍の異なる位置に、
複数個のマイクロホンを設け、それぞれに対応した音声
認識部で認識動作を行ない、それを基に、認識結果を出
力することにより、認識率の向上を図ることができる。[Operation] The device of the present invention provides
It is possible to improve the recognition rate by providing a plurality of microphones, performing a recognition operation with a voice recognition section corresponding to each microphone, and outputting a recognition result based on the recognition operation.

【０００８】[0008]

【実施例】以下、本発明による音声認識装置の実施例を
図面により詳細に説明する。図１は本発明による音声認
識装置の一実施例のシステム構成を示すもので、１１は
指向性マイクロホン、１２は超指向性マイクロホン、２
１，２２は増幅器、３１，３２はフィルタバンク、４１
，４２はＡ／Ｄコンバータ、５１，５２は認識部、６１
，６２は登録メモリ、７はモード選択スイッチ、８はＣ
ＰＵ、９１〜９３はデータバスを示す。なお、各マイク
ロホンで高認識率をあげられるように増幅器２１，２２
のゲインを調整している。DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the speech recognition apparatus according to the present invention will be explained in detail with reference to the drawings. FIG. 1 shows the system configuration of an embodiment of the speech recognition device according to the present invention, in which 11 is a directional microphone, 12 is a super-directional microphone, 2
1 and 22 are amplifiers, 31 and 32 are filter banks, and 41
, 42 is an A/D converter, 51, 52 is a recognition unit, 61
, 62 is a registration memory, 7 is a mode selection switch, 8 is a C
PU, 91-93 indicate data buses. In addition, amplifiers 21 and 22 are installed in order to increase the recognition rate of each microphone.
The gain is being adjusted.

【０００９】図２はＣＰＵ８での基本的動作を示すフロ
ーチャート、図３および図４はそれぞれ、ＣＰＵ８にお
ける音声データ登録時および音声認識時の動作を示すフ
ローチャートである。以下、図１〜図４を基に、音声デ
ータ登録時および音声認識時の動作を説明する。 ■音声データ登録時モード選択スイッチ７により、「登録モード」を選択し
（ステップ１０１〜１０２）、登録ルーチンに入る（ス
テップ１０３）。ＣＰＵ８はそれを判断して、音声認識
部５１，５２に登録動作のコマンドを送り（ステップ２
０１，２０２）、音声認識部５１，５２では、Ａ／Ｄコ
ンバータ４１，４２からの音声の入力を待つ。認識部５
１，５２には、音声トリガが内蔵されており、Ａ／Ｄコ
ンバータ４１，４２よりデータバスを介して入力された
、あるレベル以上のデータ間のみを、音声データとして
取り込む。FIG. 2 is a flowchart showing the basic operation of the CPU 8, and FIGS. 3 and 4 are flowcharts showing the operation of the CPU 8 at the time of voice data registration and voice recognition, respectively. Hereinafter, operations at the time of voice data registration and voice recognition will be explained based on FIGS. 1 to 4. (2) At the time of audio data registration, the "registration mode" is selected using the mode selection switch 7 (steps 101-102), and the registration routine is entered (step 103). The CPU 8 judges this and sends a registration operation command to the voice recognition units 51 and 52 (step 2).
01, 202), the voice recognition units 51, 52 wait for voice input from the A/D converters 41, 42. Recognition unit 5
1 and 52 have a built-in audio trigger, and only data between a certain level or higher input from the A/D converters 41 and 42 via the data bus is taken in as audio data.

【００１０】マイクロホン１１，１２、増幅器２１，２
２、フィルタバンク３１，３２、Ａ／Ｄコンバータ４１
，４２は常時動作しており、ユーザが発声した音声を、
マイクロホン１１，１２に入力し、増幅器２１，２２で
増幅し、フィルタバンク３１，３２にて周波数分析する
。その後、Ａ／Ｄコンバータ４１，４２で、ディジタル
データとなり、さらに、各認識部５１，５２に取り込ま
れる。Microphones 11, 12, amplifiers 21, 2
2, filter banks 31, 32, A/D converter 41
, 42 is always operating and transmits the voice uttered by the user.
The signal is input to microphones 11 and 12, amplified by amplifiers 21 and 22, and frequency analyzed by filter banks 31 and 32. Thereafter, the data is converted into digital data by A/D converters 41 and 42, and then taken into respective recognition units 51 and 52.

【００１１】認識部５１，５２に取り込まれた音声デー
タを、認識部のコントロールの下に登録メモリ６１，６
２に保存し、また、登録順に、コード番号を付加する。 ■音声認識時モード選択スイッチ７により、「認識モード」を選択し
（ステップ１０１，１０４）、認識ルーチンに入る（ス
テップ１０５）。The voice data taken into the recognition units 51 and 52 are stored in registration memories 61 and 6 under the control of the recognition units.
2 and add a code number in the order of registration. (2) At the time of voice recognition, the "recognition mode" is selected using the mode selection switch 7 (steps 101, 104), and the recognition routine is entered (step 105).

【００１２】ＣＰＵ８は、認識部５１，５２に認識動作
コマンドを送り（ステップ３０１，３０２）、認識部５
１，５２を音声入力待ちとする、ユーザが認識対象単語
を発声することにより、マイクロホン１１，１２に入力
された音声データを、■と同様にして、認識部５１，５
２に取り込み（ステップ３０３，３０４）、■で登録さ
れた音声データとのＤＰマッチングを認識部５で行ない
、登録データと一番類似度の高い登録データのコード番
号およびその類似度を（類似度が低すぎたならば、エラ
ー情報も）それぞれＣＰＵ８に返す。 ■音声認識結果出力方法２つの認識結果より、以下の方法で最終認識結果を求め
（ステップ３０５）、外部装置コントロール信号として
出力する（ステップ３０６）。（１）両方ともエラーの
とき→エラー。（２）２つが違うコード番号を返したと
き→類似度の高い方のコード番号とする。（３）両方と
も同じコード番号を返したとき→そのコード番号を認識
したとする。（４）片方がエラー、もう一方があるコー
ド番号を返したとき→そのコード番号を認識したとする
。The CPU 8 sends a recognition operation command to the recognition units 51 and 52 (steps 301 and 302), and the recognition unit 5
1 and 52 are waiting for voice input. When the user speaks a word to be recognized, the voice data input to the microphones 11 and 12 is sent to the recognition units 51 and 5 in the same manner as in ■.
2 (steps 303, 304), the recognition unit 5 performs DP matching with the voice data registered in If the value is too low, error information is also returned to the CPU 8. ■Voice recognition result output method A final recognition result is obtained from the two recognition results using the following method (step 305), and is output as an external device control signal (step 306). (1) When both are errors → error. (2) When two different code numbers are returned → the code number with higher similarity is selected. (3) When both return the same code number, it is assumed that the code number has been recognized. (4) When one side returns an error and the other side returns a certain code number, assume that the code number is recognized.

【００１３】図５は本発明による音声認識装置の他の実
施例のシステム構成を示すもので、１１〜１３は超指向
性マイクロホン、２１〜２３は増幅器、３１〜３３はバ
ンドパスフィルタ、４１〜４３はＡ／Ｄコンバータ、５
１〜５３は認識部、６１〜６３はデータメモリ、７０は
データ転送バス、７はモード選択スイッチ、８はＣＰＵ
、９１〜９３はデータバスである。FIG. 5 shows the system configuration of another embodiment of the speech recognition device according to the present invention, in which 11-13 are super-directional microphones, 21-23 are amplifiers, 31-33 are band-pass filters, and 41-13 are super-directional microphones, 21-23 are amplifiers, 31-33 are band-pass filters, 43 is an A/D converter, 5
1 to 53 are recognition units, 61 to 63 are data memories, 70 is a data transfer bus, 7 is a mode selection switch, and 8 is a CPU
, 91-93 are data buses.

【００１４】図６（ａ）および（ｂ）は、超指向性マイ
クロホン１１〜１３の配置の例を示すもので、ユーザは
、前もって、３つのマイクロホン１１〜１３の内の真中
のマイクロホン１２が、口元に向くようにしておく。ＣＰＵ８の基本的動作は図２と同じなので省略する。FIGS. 6(a) and 6(b) show an example of the arrangement of the superdirectional microphones 11 to 13, and the user has previously determined that the middle microphone 12 of the three microphones 11 to 13 is Keep it facing your mouth. The basic operation of the CPU 8 is the same as that in FIG. 2, so a description thereof will be omitted.

【００１５】図７および図８それぞれＣＰＵ８での登録
および認識動作の詳細を示すものである。以下、図５の
実施例の動作につき、図７、図８を参照しながら詳細に
説明する。 ■音声データ登録時モード選択スイッチ７により、「登録モード」を選択す
る。ＣＰＵ７は、それを判断して、音声認識部５２に登
録動作のコマンドを送り（ステップ４０１）認識部５２
では、Ａ／Ｄコンバータ４２から入力された、あるレベ
ル以上のデータ部分のみを音声データとして取り込む。FIGS. 7 and 8 show details of the registration and recognition operations in the CPU 8, respectively. The operation of the embodiment shown in FIG. 5 will be described in detail below with reference to FIGS. 7 and 8. ■When registering audio data, use the mode selection switch 7 to select "registration mode". The CPU 7 determines this and sends a registration operation command to the voice recognition unit 52 (step 401).
Then, only the data portion of a certain level or higher inputted from the A/D converter 42 is taken in as audio data.

【００１６】マイクロホン１２、増幅器２２、フィルタ
バンク３２、Ａ／Ｄコンバータ４２は常時動作しており
、ユーザが発声した音声を、マイクロホン１２に入力し
、増幅器２２で増幅し、バンドパスフィルタ３２で周波
数分析し、さらに、Ａ／Ｄコンバータ４２でディジタル
データとする。The microphone 12, the amplifier 22, the filter bank 32, and the A/D converter 42 are in constant operation, and the voice uttered by the user is input to the microphone 12, amplified by the amplifier 22, and frequency converted by the bandpass filter 32. The data is analyzed and further converted into digital data by the A/D converter 42.

【００１７】認識部５２では、データバス９２を介して
取り込まれた音声データを認識し、データバス９２を介
して登録メモリ６２に保存し、また、登録順に、コード
番号を付加することにより、データメモリ６２に登録す
る。The recognition unit 52 recognizes the audio data taken in via the data bus 92, stores it in the registration memory 62 via the data bus 92, and adds a code number to the data in the order of registration. It is registered in the memory 62.

【００１８】認識部５２から認識動作終了のステータス
情報を得た各認識部は、各データメモリ６１〜６３にコ
ントロール信号を送り、データメモリ６２のデータをそ
のまま他のデータメモリ６１，６３にデータ転送バス７
０を介して転送する。 ■音声認識時モード選択スイッチ７により、「認識モード」を選択す
る。ＣＰＵ８は、各認識部５１〜５３に認識動作コマン
ドを送り（ステップ５０１〜５０３）、各認識部は入力
待ちとなる。各マイクロホン１１〜１３に入力された音
声データを、■と同様にして、認識部５１〜５３に取り
込み、■で登録された音声データとのＤＰマッチングを
認識部５１〜５３で行ない、登録データと一番類似度の
高い登録データのコード番号およびその類似度を（類似
度が低すぎたならば、エラー情報も）それぞれＣＰＵ８
に返す。 ■音声認識結果出力方法ＣＰＵ８では、各認識部からの認識結果を基に、下記の
方法で、最終認識結果を求め（ステップ５０７）、外部
装置コントロール信号を出力する（ステップ５０８）。（１）全てがエラーのとき→エラー。（２）３つの内の
２つ以上が違うコード番号を返したとき→類似度の最も
高いコード番号とする。（３）全て同じコード番号を返
したとき→そのコード番号を認識したとする。（４）１
つまたは２つがエラー、残りが同じコード番号を返した
とき→そのコード番号を認識したとする。Each recognition unit that has obtained the status information indicating the end of the recognition operation from the recognition unit 52 sends a control signal to each data memory 61 to 63, and transfers the data in the data memory 62 as it is to the other data memories 61 and 63. bus 7
Transfer via 0. (2) Select "recognition mode" using the voice recognition mode selection switch 7. The CPU 8 sends a recognition operation command to each recognition unit 51 to 53 (steps 501 to 503), and each recognition unit waits for input. The voice data input to each of the microphones 11 to 13 is taken into the recognition units 51 to 53 in the same manner as in (■), and the recognition units 51 to 53 perform DP matching with the voice data registered in (■), and the registered data and The code number of the registered data with the highest degree of similarity and its degree of similarity (and error information if the degree of similarity is too low) are sent to the CPU 8.
Return to. ■Voice recognition result output method The CPU 8 determines the final recognition result based on the recognition results from each recognition section using the method described below (step 507), and outputs an external device control signal (step 508). (1) When everything is an error → Error. (2) When two or more of the three return different code numbers → select the code number with the highest degree of similarity. (3) When all the same code numbers are returned → That code number is recognized. (4)1
When one or two return an error and the remaining return the same code number, the code number is recognized.

【００１９】図９は本発明の音声認識装置の更に他の実
施例のシステム構成を示すもので、図５と同様にＡ／Ｄ
コンバータ４１〜４３、認識部５１〜５３、登録メモリ
６１〜６３はそれぞれデータバス９１〜９３を介して接
続されている。図５と異なる点は、登録メモリ６２から
の転送線がないことである。図９におけるＣＰＵ８の基
本動作および認識動作は図２および図３のフローチャー
トと同じである。FIG. 9 shows the system configuration of still another embodiment of the speech recognition device of the present invention.
Converters 41-43, recognition units 51-53, and registration memories 61-63 are connected via data buses 91-93, respectively. The difference from FIG. 5 is that there is no transfer line from the registration memory 62. The basic operation and recognition operation of the CPU 8 in FIG. 9 are the same as the flowcharts in FIGS. 2 and 3.

【００２０】また、図１０はＣＰＵ８の登録動作を示す
フローチャートである。以下、図９の実施例の動作につ
き、図１０を参照して説明する。 ■音声データ登録時モード選択スイッチ７により、「登録モード」を選択す
る。ＣＰＵ８はそれを判断して、音声認識部５１〜５３
に登録動作のコマンドを送り（ステップ６０１〜６０３
）、認識部５１〜５３では、Ａ／Ｄコンバータ４１〜４
３からの音声の入力を待つ。FIG. 10 is a flowchart showing the registration operation of the CPU 8. The operation of the embodiment shown in FIG. 9 will be described below with reference to FIG. 10. ■When registering audio data, use the mode selection switch 7 to select "registration mode". The CPU 8 determines this and uses the voice recognition units 51 to 53.
Send a registration operation command to (steps 601 to 603)
), in the recognition units 51 to 53, the A/D converters 41 to 4
Wait for audio input from 3.

【００２１】認識部５１〜５３には、音声トリガが内蔵
されており、Ａ／Ｄコンバータ４１〜４３よりデータバ
ス９１〜９３を介して入力された、あるレベル以上のデ
ータ部分のみを音声データとして取り込む。[0021] The recognition units 51 to 53 have built-in audio triggers, and only data portions of a certain level or higher inputted from the A/D converters 41 to 43 via the data buses 91 to 93 are used as audio data. take in.

【００２２】マイクロホン１１〜１３からＡ／Ｄコンバ
ータ４１〜４３までは常時動作しており、ユーザが発声
した音声を、マイクロホン１１〜１３に入力し、増幅器
２１〜２３で増幅し、フィルタバンク３１〜３３で周波
数分析し、その後、Ａ／Ｄコンバータ４１〜４３でディ
ジタルデータに変換する。The microphones 11 to 13 to the A/D converters 41 to 43 are always in operation, and the voice uttered by the user is inputted to the microphones 11 to 13, amplified by the amplifiers 21 to 23, and transmitted to the filter banks 31 to 43. 33 performs frequency analysis, and then A/D converters 41 to 43 convert it into digital data.

【００２３】認識部５１〜５３に取り込まれた音声デー
タを、データバス９１〜３を介して登録メモリ６１〜６
３に保存し、また、その時、登録順に、コード番号を付
加する。 ■音声認識時モード選択スイッチ７により、「認識モード」を選択す
る。ＣＰＵ８は、前述した実施例と同様に、各認識部５
１〜５３に認識動作コマンドを送り、各認識部５１〜５
３は、音声入力待ちとなる。ユーザが認識対象単語を発
声することにより、マイクロホン１１〜１３に入力され
た音声データを、■と同様にして認識部５１〜５３に取
り込み、■で登録された音声データとのＤＰマッチング
を認識部５１〜５３で行ない、登録データと一番類似度
の高い登録データのコード番号およびその類似度を（類
似度が低すぎたならば、エラー情報も）それぞれＣＰＵ
８に返す。 ■音声認識結果出力方法ＣＰＵ８では、各認識部からの認識結果を基に、下記の
方法で、最終認識結果を求め、外部装置コントロール信
号を出力する。（１）全てがエラーのとき→エラー。（２）３つの内の２つ以上が違うコード番号を返したと
き→類似度の最も高いコード番号とする。（３）全て同
じコード番号を返したとき→そのコード番号を認識した
とする。（４）１つまたは２つがエラー、残りが同じコ
ード番号を返したとき→そのコード番号を認識したとす
る。The audio data taken into the recognition units 51-53 is transferred to the registration memories 61-6 via data buses 91-3.
3, and at that time, a code number is added in the order of registration. (2) Select "recognition mode" using the voice recognition mode selection switch 7. The CPU 8 controls each recognition unit 5 as in the above-mentioned embodiment.
1 to 53, each recognition unit 51 to 5
3 waits for voice input. When the user utters the recognition target word, the voice data input to the microphones 11 to 13 is input into the recognition units 51 to 53 in the same manner as in ■, and the recognition unit performs DP matching with the voice data registered in ■. 51 to 53, the code number of the registered data with the highest degree of similarity to the registered data and its degree of similarity (and error information if the degree of similarity is too low) are sent to the CPU.
Return to 8. ■Voice recognition result output method The CPU 8 uses the following method to obtain the final recognition result based on the recognition results from each recognition section, and outputs an external device control signal. (1) When everything is an error → Error. (2) When two or more of the three return different code numbers → select the code number with the highest degree of similarity. (3) When all the same code numbers are returned → That code number is recognized. (4) When one or two return an error and the rest return the same code number, the code number is recognized.

【００２４】[0024]

【発明の効果】以上述べたように、本発明によれば、音
声認識装置において、超指向性マイクロホンの使用によ
り高Ｓ／Ｎを確保し、かつ音源との方向のずれは複数個
のマイクロホンを使うことで防止しているので、高認識
率を得ることができる。As described above, according to the present invention, in a speech recognition device, a high S/N is ensured by using a super-directional microphone, and the direction deviation with respect to the sound source can be reduced by using a plurality of microphones. Since it is prevented by using this method, a high recognition rate can be obtained.

[Brief explanation of the drawing]

【図１】本発明の音声認識装置の一実施例のシステム構
成図である。FIG. 1 is a system configuration diagram of an embodiment of a speech recognition device of the present invention.

【図２】図１のＣＰＵの基本動作のフローチャートであ
る。FIG. 2 is a flowchart of the basic operation of the CPU in FIG. 1;

【図３】図１のＣＰＵの登録および認識動作のフローチ
ャートである。FIG. 3 is a flowchart of registration and recognition operations of the CPU in FIG. 1;

【図４】図１のＣＰＵの登録および認識動作のフローチ
ャートである。FIG. 4 is a flowchart of registration and recognition operations of the CPU in FIG. 1;

【図５】本発明の音声認識装置の他の実施例のシステム
構成図である。FIG. 5 is a system configuration diagram of another embodiment of the speech recognition device of the present invention.

【図６】図５のマイクロホンの配置を示す図である。FIG. 6 is a diagram showing the arrangement of the microphones in FIG. 5;

【図７】図５のＣＰＵの登録および認識動作のフローチ
ャートである。FIG. 7 is a flowchart of registration and recognition operations of the CPU in FIG. 5;

【図８】図５のＣＰＵの登録および認識動作のフローチ
ャートである。FIG. 8 is a flowchart of registration and recognition operations of the CPU in FIG. 5;

【図９】本発明の音声認識装置の他の実施例のシステム
構成図である。FIG. 9 is a system configuration diagram of another embodiment of the speech recognition device of the present invention.

【図１０】図９のＣＰＵの登録動作のフローチャートで
ある。FIG. 10 is a flowchart of the registration operation of the CPU in FIG. 9;

[Explanation of symbols]

１１〜１３　　超指向性マイクロホン５１〜５３　　音声認識部６１〜６３　　登録メモリ７　　モード選択スイッチ８　　ＣＰＵ 11-13 Super directional microphone 51-53 Voice recognition section 61-63 Registration memory 7 Mode selection switch 8 CPU

Claims

[Claims]

1. A plurality of microphones, at least one of which is super-directional, is provided at different positions near a sound source and captures sound information of sound and noise from the sound source, and sound information captured from the microphone. a sound data converting unit that converts the sound into audio data; a registered audio data storage unit that stores the audio data at the time of registration of the audio captured from the microphones together with a code; The audio data is compared with the audio data stored in the storage unit, and by this comparison,
a voice recognition unit that performs a recognition operation based on the code and similarity of voice data having the highest degree of similarity; and a recognition control unit that controls the operation of the voice recognition unit and outputs a recognition result based on the code and the degree of similarity. A speech recognition device comprising:

2. At least one of the plurality of microphones is provided at a position closest to a sound source, and
All of the microphones are super-directional microphones, and the voice data at the time of registration captured by the microphone at the position is transferred to each of the registered voice data storage sections, and the recognition operation of each of the voice recognition sections is performed using the voice data. The speech recognition device according to claim 1, characterized in that: