JPS61130999A

JPS61130999A - Voice recognition equipment

Info

Publication number: JPS61130999A
Application number: JP59251902A
Authority: JP
Inventors: 浩榊原; 真一吉田; 井上　博富; 田所　富男; 土肥　治; 好高久間
Original assignee: Hitachi Techno Engineering Co Ltd; Hitachi Ltd; Kobe Steel Ltd
Current assignee: Hitachi Ltd; Kobe Steel Ltd; Hitachi Plant Technologies Ltd
Priority date: 1984-11-30
Filing date: 1984-11-30
Publication date: 1986-06-18
Also published as: JPH0438358B2

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】〔発明の利用分野〕一本発明は、人間の音声を認識する装置に係シ、特に１人
力された音声情報を、予め登録された音　　　゛声パタ
ーンと照合して認識する複数の音声認識手段を備えた音
声認識装置の改良に関する。[Detailed Description of the Invention] [Field of Application of the Invention] The present invention relates to a device that recognizes human speech, and in particular, a device that compares speech information input by one person with pre-registered speech patterns. The present invention relates to an improvement of a speech recognition device equipped with a plurality of speech recognition means.

[Background of the invention]

例えば、特開昭５１−２８７０１号公報に開示されてい
るように、現在の音声認識精度は、不特定話者では著し
く低下するので、話者毎の特徴を抽出した音声パターン
を予め登録しておき、これと″音声入力を照合して、そ
れらの特徴の一致により認識する手法が主流である。For example, as disclosed in Japanese Unexamined Patent Publication No. 51-28701, the current speech recognition accuracy is significantly degraded for unspecified speakers, so it is necessary to register speech patterns in advance that have extracted features for each speaker. The mainstream method is to compare this with the voice input and recognize it based on the matching of those features.

ところで、音声認識装置を、複数の話者で使用したい要
求が強く、この場合には、１台の音声−識装置に、複数
の話者（例えば２０名など）の各単語毎の音声パターン
を予め記憶させておく。そして、使用者（話者）の発生
した単語と、記憶された音声パターンとを照合し、最も
近い、あるいは、予定の誤差の範囲にある音声パターン
に対応する１葉であるものと認識する。By the way, there is a strong demand to use a speech recognition device with multiple speakers, and in this case, one speech recognition device can record the speech patterns for each word of multiple speakers (for example, 20 people). Memorize it in advance. Then, the words generated by the user (speaker) are compared with the stored speech pattern, and the word is recognized as the one word that corresponds to the speech pattern that is closest or within the expected error range.

しかし１、多数の話者の多数の単語を記憶するためには
、大きな記憶容量を必要とし、また、その中から一致す
る音声パターンを認識するためには、認識時間が長くな
るという欠点がある。However, 1. Memorizing many words from many speakers requires a large storage capacity, and recognizing matching speech patterns from among them requires a long recognition time. .

このため、各話者の音声パターンを記憶させたカセット
・メモリを各話者が所有し、音声認識装置を使用すると
きに、上記カセットをセットする手法も提案されている
。For this reason, a method has been proposed in which each speaker owns a cassette memory storing the speech patterns of each speaker, and the cassette is set when using the speech recognition device.

この手法によれば、メモリの総容量としては同じである
が、特定話者の音声パターンのみとの照合により、認識
精度及び認識速度が向上する利点がある。According to this method, although the total memory capacity is the same, there is an advantage that recognition accuracy and recognition speed are improved by matching only the voice pattern of a specific speaker.

しかしながら、カセット・メモリなどの紛失や置場の問
題が生じ、利用者にとって不便である。However, this is inconvenient for the user, as the cassette memory or the like may be lost or stored.

[Purpose of the invention]

本発明の目的は、複数の音声認識装置を複数の話者が使
用する場合において、認識精度や認識速度を損うことが
なく、また、着脱メモリなどを必要としない音声認識装
置を提供することである。An object of the present invention is to provide a speech recognition device that does not impair recognition accuracy or recognition speed when multiple speech recognition devices are used by multiple speakers, and does not require removable memory. It is.

[Summary of the invention]

本発明の特徴とするところは、複数の音声認識手段のう
ちの任意のものを使用する話者を識別する手段と１．＊
ｉ話者毎の音声パターンを記憶する　　　　　　）共通
の補助記憶手段と、識別された話者に対応する音声パタ
ーンを上記補助記憶手段から該当する音声認識手段の持
つ音声パターン記憶手段へ読出し格納する制御手段を設
けることである。The present invention is characterized by: 1. means for identifying a speaker using any one of a plurality of speech recognition means; *
Storing the voice pattern for each i speaker) A common auxiliary storage means, and control for reading and storing the voice pattern corresponding to the identified speaker from the auxiliary storage means into the voice pattern storage means of the corresponding voice recognition means. It is necessary to provide means.

本発明の望ましい実施態様においては、共通の補助記憶
手段には、（１）、話者識別用の音声パターン（話者交
代用語）と、（２）、作業用の音声パターン（命令語や
作業用語など）とを予め記憶させておく。そして、各音
声認識手段の使用直前には、話者識別用の音声パターン
を各音声認識手段内の音声パターン記憶手段に読出し格
納しておく。In a preferred embodiment of the present invention, the common auxiliary storage means contains (1) speech patterns for speaker identification (speaker change terms), and (2) speech patterns for tasks (command words and tasks). (terms, etc.) are memorized in advance. Immediately before each voice recognition means is used, a voice pattern for speaker identification is read out and stored in the voice pattern storage means in each voice recognition means.

この状態で、任意の音声認識手段を例えば、人太部が使
用を開始するとき、その氏名などの話者識別用音声パタ
ーンと符号する定められた話者交代用語を発声する。In this state, for example, when Nitabe starts using the arbitrary voice recognition means, he utters a predetermined speaker change term that is encoded with a voice pattern for identifying the speaker, such as his name.

これによシ、話者が識別されると、制御手段は、識別さ
れた話者に対応する作業用の音声パターンを共通の補助
記憶手段から読出し、対応する音声認識手段内の音声パ
ターン記憶手段へ格納する。Accordingly, when a speaker is identified, the control means reads out a working speech pattern corresponding to the identified speaker from the common auxiliary storage means, and reads out a working speech pattern corresponding to the identified speaker from the speech pattern storage means in the corresponding speech recognition means. Store it in

以後は、話者と１対１に対応した作業用の音声パターン
のみと、その話者の音声入力情報との照合の下に音声認
識が行われ、必要な作業が遂行される。Thereafter, voice recognition is performed by comparing only the voice pattern for work in one-on-one correspondence with the speaker and the voice input information of that speaker, and the necessary work is performed.

この結果、着脱メモリなどを必要とせず、複数の音声認
識手段を夫々異る話者が同時に使用することができ、各
音声認識手段の認識精度及び認識速度を損うことも奇い
。As a result, a plurality of voice recognition means can be used simultaneously by different speakers without requiring a removable memory, which may impair the recognition accuracy and recognition speed of each voice recognition means.

[Embodiments of the invention]

以下、音声入力記憶装置に本発明を適用した一実施例に
つき詳細に説明する。この実施例においては、音声によ
るガイダンス９．アンサーバックを行いつつ、作業者の
作業とその結果を入力して記憶する装置である。また、
最初に行うべき、あるいは新たな話者に対して行うべき
音声パターンの登録をも自由に行いうるものである。Hereinafter, an embodiment in which the present invention is applied to a voice input storage device will be described in detail. In this embodiment, audio guidance 9. This is a device that inputs and stores the worker's work and its results while providing answer back. Also,
It is also possible to freely register voice patterns that should be performed for the first time or for a new speaker.

第１図は本発明に係る音声情報入力記録装置の一実施例
の構成を示す。同図において、２台の音声認識装置１人
およびＩＢは音声入力用マイクロＡおよび６Ｂの音声信
号を増幅する増幅器１１Ａおよび１１Ｂ１音声信号をデ
ィジタル信号に変換するＡ／Ｄ変換器１２Ａおよび１２
ＢＳあらかじめ音声パターンを記憶しておく音声パター
ンメモリ１４人および１４Ｂ及び入力音声と音声パター
ンとを比較して音声認識をする音声認識制御回路１３Ａ
および１３Ｂによって構成されている一方、音声出力装
置２人および２Ｂは音声出ヵをするための音声を記憶し
ておく合成音声メモリ２２Ａおよび２２Ｂ、’Ｎ声認識
結果に応じて合成音声メモＩＪ　２２　Ａおよび２２Ｂ
の記憶内容を選別して出力する音声出力制御回路２１Ａ
および２１Ｂ１音声出力制御回路２１Ａおよび２１Ｂの
出力信号をアナログ信号に変換するＤ／Ａ変換器２３Ａ
および２３Ｂ１アナログ信号を増幅してスピーカ（′！
またけイヤホン）７Ａおよび７Ｂからアンサーバックの
音声を発生させる増幅器２４Ａおよび２４Ｂによって構
成されている。FIG. 1 shows the configuration of an embodiment of a voice information input and recording device according to the present invention. In the figure, two voice recognition devices, one person and IB, are amplifiers 11A and 11B that amplify the voice signals of voice input micros A and 6B, A/D converters 12A and 12 that convert the voice signals into digital signals.
BS 14 voice pattern memories and 14B that store voice patterns in advance; and a voice recognition control circuit 13A that performs voice recognition by comparing input voice and voice patterns.
On the other hand, the voice output devices 2 and 2B are composed of synthesized voice memories 22A and 22B that store voices for outputting voices, and a synthesized voice memo IJ 22 according to the result of 'N voice recognition. A and 22B
Audio output control circuit 21A that selects and outputs the memory contents of
and a D/A converter 23A that converts the output signals of the 21B1 audio output control circuits 21A and 21B into analog signals.
And the 23B1 analog signal is amplified and the speaker ('!
It is composed of amplifiers 24A and 24B that generate answerback sounds from the earphones 7A and 7B.

補助記憶装置８は、複数の音声認識袋ＮＩＡおよびＩＢ
に共通して使用されるもので、夫々の音声パターンメモ
リ１４Ａおよび１４Ｂへ格納すべき音声パターンを記憶
するものである。The auxiliary storage device 8 stores a plurality of voice recognition bags NIA and IB.
It is commonly used for storing voice patterns to be stored in the respective voice pattern memories 14A and 14B.

また制御回路３は音声認識装置ＩＡおよびＩＢの音声認
識制御回路１３Ａおよび１３Ｂを制御して音声認識結果
を取シ込んだシ、音声出力装置２Ａおよび２Ｂの音声出
力制御回路２１Ａおよび２１Ｂの制御をしてガイダンス
やアンサーバック音をスピーカ７Ａおよび７Ｂから出力
させたシ、音声認識装置ＩＡおよびＩＢの音声パターン
メモリー４Ａおよび１４Ｂの音声パターンを補助記憶装
置８に記憶させたシ、逆に補助記憶袋Ｒ８の音声パター
ンを音声認識装置」ＡおよびＩＢあるいは１人またはＩ
Ｂの音声パターンメモリー　４　Ａおよび１４Ｂに移し
換えたシ、表示器（またはプリンタ）５に制御状態や音
声認識結果などを表示（またはプリントアウト）したり
する制御用コンピュータである。この制御回路３は音声
の他にキーボード４によっても制御される。The control circuit 3 also controls the voice recognition control circuits 13A and 13B of the voice recognition devices IA and IB to receive voice recognition results, and controls the voice output control circuits 21A and 21B of the voice output devices 2A and 2B. In this case, guidance and answerback sounds are output from speakers 7A and 7B, and voice patterns from voice pattern memories 4A and 14B of voice recognition devices IA and IB are stored in auxiliary storage device 8. Voice recognition device A and IB or one person or I
This is a control computer that displays (or prints out) the control status, voice recognition results, etc. on the display (or printer) 5. This control circuit 3 is controlled not only by voice but also by a keyboard 4.

次に本発明の一実施例に使用する音声単語の一例を第２
図に示す。Next, an example of the audio words used in one embodiment of the present invention is shown in the second example.
As shown in the figure.

音声単語は、話者交代をするための話者交代用語語（話者識別用の音声パターン）と、作業をするための
作業用語ならびに作業に使用する命令語（作業用の音声
パターン）から成る。Spoken words consist of speaker change terms for changing speakers (speech patterns for speaker identification), working words for performing tasks, and command words used for tasks (sound patterns for tasks). .

まず、音声パターンの登録は、話者がマイクロＡまたは
６Ｂを使って音声単語を順次音声で読み上げることによ
って行なわれ、その音声は増幅器１１Ａまたは１１　Ｂ
、　Ａ／Ｄ変換器１２Ａまたは１２Ｂ１音声認識制御回
路１３Ａま九は１３Ｂを介して音声パターンメモリ１４
Ａまたは１４Ｂに記憶される。この音声パターンメモリ
１４Ａｆたは１４Ｂに記憶された音声パターンは補助記
憶装置８に話者毎に番地付けされて格納される。First, the voice pattern is registered by the speaker reading out spoken words in sequence using the Micro A or 6B, and the voice is transmitted to the amplifier 11A or 11B.
, A/D converter 12A or 12B1 voice recognition control circuit 13A and voice pattern memory 14 via 13B.
A or 14B. The voice patterns stored in the voice pattern memory 14Af or 14B are stored in the auxiliary storage device 8 with addresses assigned for each speaker.

音声パターンメモリ１４Ａおよび１４Ｂへの音声単語の
記憶の番地付けは、命令語と作業用語については話者共
通の同一番地とし、話者交代用語は話者毎に相異した番
地とする。そして話者交代モート責使用開始時や交代命
令があったとき）にすいては話者全員の話者交代用語の
音声パターンのみを、音声パターンメモリ１４Ａあるい
は１４Ｂに収納しておき、話者交代完了後の作業モード
では、上記交代モードで識別された１人の話者の命令語
と作業用語の音声パターンを音声パターンメモ１７１４
　Ａまたは１４Ｂに移して音声でデータの入力を行なう
。Regarding the address allocation for storing voice words in the voice pattern memories 14A and 14B, command words and working words are stored at the same address common to all speakers, and speaker change words are stored at different addresses for each speaker. Then, when starting to use the speaker change mode or when a change command is issued, only the voice patterns of the speaker change terms of all speakers are stored in the voice pattern memory 14A or 14B, In the work mode after completion, the voice pattern of command words and work words of one speaker identified in the above alternation mode is recorded in the voice pattern memo 1714.
Move to A or 14B and enter data by voice.

次に本発明による音声情報入力の一実施例を第３図を用
いて説明する。Next, an embodiment of audio information input according to the present invention will be described using FIG. 3.

スピーカ７Ａからの音声ガイダンス「氏名は？」に対し
、Ａ太部が、マイクロＡから音声で「Ａ太部」と発声す
ると、音声認識装置ＩＡの音声認識制御回路１３Ａによ
って音声パターンメモリ１４Ａに記憶されている話者交
代用の音声単語の中から、入力音声と一致する単語「Ａ
太部」を探し出して、その記憶番地あるいは対応するコ
ードを制御回路３に出力する。In response to the voice guidance "What's your name?" from the speaker 7A, A-taabe utters "A-taabe" from the micro A, and the voice recognition control circuit 13A of the speech recognition device IA stores it in the voice pattern memory 14A. Select the word “A” that matches the input voice from among the voice words for speaker change
The CPU 11 searches for the "fat part" and outputs the memory address or the corresponding code to the control circuit 3.

制御回路３は音声単語コードの入力によシデーメとして
取シ込んだシ表示器５に表示したシする他に音声出力制
御回路２１Ａにアンサーバックさとるための指令を発す
る。音声出力制御回路２１Ａは制御回路３のアンサーバ
ック指令により合成音声メモリ２２Ａ内の音声データを
出力してＤ／Ａ変換器２３Ａ１増幅器２４Ａを介してス
ピーカ７Ａから「Ａ太部」と発声させる。ここで、Ａ太
部がマイクロＡからｒＯＫＪと発声して入力すると、音
声認識袋ｆｆ１Ａの音声認識制御回路１３Ａによって音
声パターンメモリ１４　Ａの話者交代用単語の中から、
入力音声と一致する単語ｒＯＫＪを探し出してその番地
あるいはコードを制御回路３に出力する。制御回路３は
これにより、話著がＡ太部であることを識別し、補助記
憶装置８に記憶していたＡ太部の作業用の音声パターン
を音声パターンメモリ１４Ａに読出して格納し、Ａ太部
の作業モードにするとともに、音声出力装置２Ａを制御
してスピーカ７人から「作業は？」と音声ガイダンスを
発する。The control circuit 3 issues a command to the voice output control circuit 21A to obtain an answer back, in addition to displaying the words received as a message on the display 5 by inputting the voice word code. The audio output control circuit 21A outputs the audio data in the synthesized audio memory 22A in response to the answerback command from the control circuit 3, and causes the speaker 7A to utter "A fat section" via the D/A converter 23A1 and the amplifier 24A. Here, when A-taabe utters and inputs rOKJ from micro A, the voice recognition control circuit 13A of the voice recognition bag ff1A selects from among the words for speaker change in the voice pattern memory 14A.
The word rOKJ that matches the input voice is searched and its address or code is output to the control circuit 3. As a result, the control circuit 3 identifies that the speech is A-bold, reads out and stores the working audio pattern for A-bold that was stored in the auxiliary storage device 8 in the audio pattern memory 14A, and stores A-broad. At the same time as switching to the work mode shown in the thick section, the voice output device 2A is controlled to emit voice guidance from seven speakers asking, ``What about the work?''.

Ａ太部が「入庫」と音声入力すると、音声認識の結果「
品番は？」とスピーカ７Ａからガイダンスが返ってくる
ので、例えば「ｉ、２，３ｊと音で入力する七正しく認
識されればｌ−１，２，３Ｊとアンサーバックが返って
くる。次に「置場は？」のガイダンスに対しｒＡＪと音
声入力すると音声認識の結果ｒＡＪとアンサーバックが
返ってくる。When A-tabe inputs “in stock” by voice, the voice recognition results in “
What's the product number? ” is returned from the speaker 7A, so for example, if it is recognized correctly, the answerback will be returned as l-1, 2, 3J. If you input rAJ by voice in response to the guidance "?", the voice recognition will return rAJ and an answerback.

以上により、Ａ太部は、Ａ太部の音声で自分の作業用の
音声パターンを補助記憶装置８から音声認識装置ＩＡに
移した上で、自分の作業用音声パターンのみとの照合に
よる精度の高い、かつ高速の認識を用いて、「品番１２
３と置場Ａに入庫」というデータを入力したことになる
。As described above, A-taibe transfers his working voice pattern from the auxiliary storage device 8 to the speech recognition device IA using A-taibe's voice, and then checks the accuracy by comparing it only with his own working voice pattern. Using high-speed and high-speed recognition,
This means that the data "3 and storage at storage area A" has been input.

Ａ太部が作業を終了するときは、「交代」とマイクロＡ
から入力すると作業モードから話者交代モードに切り換
る。すなわち、制御回路３は、補助記憶装置８内の話者
交代用音声パターンを読出して、音声パターンメモ１７
１４　Ａへ格納する。When A-tabe finishes his work, he says "takeover" and micro-A.
If you input from , it will switch from work mode to speaker change mode. That is, the control circuit 3 reads out the voice pattern for speaker change in the auxiliary storage device 8 and stores it in the voice pattern memo 17.
14 Store in A.

以上はＡ太部がマイクロＡから音声入力した場合につい
て説明したが、Ａ太部がマイクロＢから音声入力した場
合も全く同様である。スピーカ７Ｂからの音声ガイダン
ス「氏名は？」に対して、Ａ太部が、マイクロＢから音
声で「Ａ太部」と発声すると、音声認識装置ＩＢの音声
認識制御回路）Ｌ３Ｂによって音声パターンメモＩＪ　
１４　Ｂに記憶されている音声単語の中から入力と一致
する単語である「Ａ太部」を探し出してその記憶番地あ
る　　　　　　）いはコードを制御回路３に出力する。The above description has been made for the case where the A-thick section inputs audio from the micro A, but the case where the A-thick section inputs the audio from the micro B is exactly the same. In response to the voice guidance "What's your name?" from the speaker 7B, A-taabe utters "A-taabe" from the micro B. Then, the voice recognition control circuit (L3B) of the voice recognition device IB uses the voice pattern memo IJ.
14 Search out the word "A thick part" that matches the input from the audio words stored in B, and output its memory address ( ) or code to the control circuit 3.

制御回路３の制御によって音声出力装置２Ｂの増幅器２
４Ｂを介してスピーカ７Ｂから「Ａ太部」と発声させる
。ここで、Ａ太部がマイクロＢから「ＯＫ」と発声して
入力すると、音声認識装置ＩＢの音声認識制御回路１３
ＢＫよって登録音声メモリ１４　Ｂの単語の中から入力
音声と一致する音声単語である「ＯＫ」を探し出してそ
の番地あるいはコードを制御回路３に出力する。これに
より、制御回路３は補助記憶装置８に記憶していたＡ太
部の作業用の音声パターンを音声パターンメモリ１４Ｂ
に移し換えて、Ａ太部の作業モードにするとともに、音
声出力装置２Ｂを制御してスピーカ７Ｂから「作業は？
」と音声ガイダンスを発する。１扶下マイクロＡからの
音声入力時と全く同様に作用する。The amplifier 2 of the audio output device 2B is controlled by the control circuit 3.
``A fat part'' is uttered from the speaker 7B via the speaker 4B. Here, when A fat part speaks and inputs "OK" from micro B, the voice recognition control circuit 13 of the voice recognition device IB
According to BK, the voice word "OK" that matches the input voice is searched from among the words in the registered voice memory 14B, and its address or code is output to the control circuit 3. As a result, the control circuit 3 transfers the audio pattern for the A-thick section stored in the auxiliary storage device 8 to the audio pattern memory 14B.
At the same time, the voice output device 2B is controlled and the speaker 7B asks, ``What about your work?''
”, says the voice guidance. 1 It works in exactly the same way as when inputting voice from Micro A.

今度はＢ太部がマイクロＡ（または６Ｂ）から「Ｂ太部
」と音声入力すると音声認識の結果、今）庫は音声パタ
ーンメモリ１４　Ａ　（または１４Ｂ）には補助記憶装
置８からＢ太部の作業用音声パターンが入り、Ｂ太部が
音声データ入力をすることができるようになる。Next time, when B thick inputs "B thick" from micro A (or 6B), as a result of voice recognition, the voice pattern memory 14 A (or 14B) has B thick from auxiliary storage device 8. The work voice pattern is entered, and the B bold section can now input voice data.

以下同様にして、１組の補助記憶装置８に記憶しておい
た話者交代用並びに複数話者毎の作業用の音声パターン
を複数の音声認識装置ＩＡおよびＩＢに導き出して自由
に音声で話者交代およびデータ入力をすることができる
。音声パターンの登録は１組の音声認識装置から行ない
補助記憶装置を介して他の音声認識装置に移し換えても
良く、また各音声認識装置からそれぞれ登録しても良い
。Thereafter, in the same way, the voice patterns for changing speakers and for working with multiple speakers stored in one set of auxiliary storage device 8 are derived to the multiple voice recognition devices IA and IB, and the voice patterns are freely spoken. Ability to change personnel and enter data. The voice pattern may be registered from one voice recognition device and transferred to another voice recognition device via an auxiliary storage device, or may be registered from each voice recognition device.

ここで補助記憶装置８は集積回路のＲＡＭやＲＯＭとし
ても良く、また、バブルカセット、カセットテープ、フ
ロッピーディスクなどとしても良い。但し、新た力話者
の音声パターンを自由に登録するためには、ＲＯＭ以外
の記憶手段を用いる。Here, the auxiliary storage device 8 may be an integrated circuit RAM or ROM, or may be a bubble cassette, cassette tape, floppy disk, or the like. However, in order to freely register the voice patterns of a new strong speaker, a storage means other than the ROM is used.

補助記憶装置８と登録音声メモＩＪ　１４　Ａまたは１
４Ｂ間の音声パターンの読出し格納は、音声入力による
他にキーボード４から行なうようにしても良い。さらに
、１音声認識結果を表示器５に表示して、音声出力装置
２人および２Ｂを省略しても複数の話者が複数の音声認
識装置から交代して音声情報を入力することができる。Auxiliary storage device 8 and registered voice memo IJ 14 A or 1
Reading and storing of the voice pattern between 4B may be performed from the keyboard 4 in addition to voice input. Furthermore, even if one voice recognition result is displayed on the display 5 and the two voice output devices and 2B are omitted, a plurality of speakers can alternately input voice information from a plurality of voice recognition devices.

＠４図は本発明に係る音声情報入力装置の他の一実施例
の構成を示したもので、第１図と同一符号のものは同一
機能を有する。同図において、無線機移動局３０Ａおよ
び３０ＢはマイクロＡおよび６Ｂの入力音声をアンテナ
３３Ａおよび３３Ｂから電波を発射する送信機３１Ａお
よび３１Ｂ１アンテナ３３Ａおよび３３Ｂから電波を受
信してスピーカ７Ａおよび７Ｂから音声ガイダンスやア
ンサーバックを発生させる受信機３２Ａおよび一３２Ｂ
によって構成されている。無線機固定局２０Ａおよび２
０Ｂは無線機移動局３０Ａおよび３０Ｂの電波をアンテ
ナ２３Ａおよび２３Ｂを介して受信して音声入出力装置
１０ＡおよびＩＯＢの音声認識装置ＩＡおよびＩＢに入
力する受信機２１Ａおよび２１Ｂ１音声入出力装置１０
ＡおよびＩＯＢの音声出力装置２人および２Ｂの出力音
声をアンテナ２３Ａおよび２３Ｂを介して無線機移動局
３０Ａおよび３０Ｂの受信機３２Ａおよび３２Ｂへ電波
を発射する送信機２２Ａおよび２２Ｂから構成されてい
る。音声パターンの登録はマイクロＡまたは６Ｂから話
者が音声単語を順次音声で読み上げることによって行な
わね、る。マイクロＡまたは６Ｂから入力された音声は
無線機移動局３０Ａまたは３０Ｂの送信機３１Ａまたは
３１Ｂからアンテナ３３Ａまたは３３Ｂを介して電波が
発射される。この電波はアンテナ２３Ａまたは２．３Ｂ
を介して無線機固定局２０Ａまたは２０Ｂの受信機２１
Ａｔたは２１Ｂで受信して音声認識装置ＩＡまたはＩＢ
の登録音声メモリに登録される。この登録音声メモリに
登録された音声パターンは補助記憶装置８に話者毎に番
地付けされて格納される。また、補助記憶装置８に格納
された音声パターンはキーボード４の操作あるいは音声
認識装置ＩＡまたはＩＢへの音声入力によって音声認識
結果ＩＡまたはＩＢそれぞれの音声パターンメモリに移
される。マイクロＡまたは６Ｂから話者の音声データが
入力されると無線機移動局３０Ａまたは３０Ｂの送信機
３１Ａまたは３１Ｂから電　　　　　　１波をとおして
無線機固定局２０Ａまたは２０Ｂの受信機２１Ａまたは
２１Ｂで受信し音声認識装置１人またはＩＢに入力され
る。音声認識結果のアンサーバックは音声出力装置２人
または２Ｂから発せられ送信機２２Ａまたは２２Ｂによ
って電波となって発射される。この電波は受信機３２Ａ
または３２３によって受信されスピーカ７Ａまたは７Ｂ
から発声される。話者はマイクロＡまたは６Ｂから音声
でデータを入力するとスピーカ７Ａまたは７Ｂからアン
サーバックあるいはガイダンスが発せられるのでこれを
開きながら音声でデータを入力する。@Figure 4 shows the configuration of another embodiment of the audio information input device according to the present invention, and the same reference numerals as in Figure 1 have the same functions. In the figure, radio mobile stations 30A and 30B receive input audio from micros A and 6B from transmitters 31A and 31B1 which emit radio waves from antennas 33A and 33B, receive radio waves from antennas 33A and 33B, and transmit audio from speakers 7A and 7B. Receivers 32A and 32B that generate guidance and answerback
It is made up of. Radio fixed station 20A and 2
0B is a receiver 21A and 21B1 which receives radio waves from radio mobile stations 30A and 30B via antennas 23A and 23B and inputs them to the voice input/output device 10A and the voice recognition devices IA and IB of IOB; voice input/output device 10;
The audio output device A and IOB consists of transmitters 22A and 22B that emit radio waves from the output audio of two people and 2B to receivers 32A and 32B of radio mobile stations 30A and 30B via antennas 23A and 23B. . The voice pattern is registered by the speaker reading out voice words sequentially from Micro A or 6B. The voice input from Micro A or 6B is emitted as a radio wave from transmitter 31A or 31B of radio mobile station 30A or 30B via antenna 33A or 33B. This radio wave is from antenna 23A or 2.3B
Receiver 21 of radio fixed station 20A or 20B via
At or 21B receives the speech recognition device IA or IB.
registered in the registered voice memory. The voice patterns registered in the registered voice memory are stored in the auxiliary storage device 8 with addresses assigned for each speaker. Further, the voice pattern stored in the auxiliary storage device 8 is transferred to the voice pattern memory of the voice recognition result IA or IB by operating the keyboard 4 or inputting voice to the voice recognition device IA or IB. When the speaker's voice data is input from Micro A or 6B, it is received by the receiver 21A or 21B of the radio fixed station 20A or 20B through one wave from the transmitter 31A or 31B of the radio mobile station 30A or 30B. and input into a voice recognition device or IB. The answer back of the voice recognition result is emitted from the voice output device 2 or 2B, and is emitted as a radio wave by the transmitter 22A or 22B. This radio wave is receiver 32A
or received by 323 and speaker 7A or 7B
is uttered from. When the speaker inputs data by voice from the micro A or 6B, an answerback or guidance is emitted from the speaker 7A or 7B, and the speaker inputs data by voice while opening the speaker.

以上の実施例では、１組の音声認識装置で音声パターン
の登録をすれば他の音声認識装置への音声パターンの登
録は発声することなく補助記憶装置を利用して行うこと
ができる。In the embodiments described above, once a voice pattern is registered in one set of voice recognition devices, the voice pattern can be registered in another voice recognition device using the auxiliary storage device without uttering a voice.

以上の実施例では、話者交代用の音声パターン、をも、
共通の補助記憶装Ｒ８に登録しておき、話、者交代モー
ドでのみ、各音声認識装置ＩＡ、ＩＢ内の音声パターン
メモ１Ｊ１４Ａ、１４Ｂへ格納するようにしている。し
かし、話者交代用の音声パターンは、常時、各音声パタ
ーンメモリー４Ａ。In the above embodiment, the voice pattern for speaker change is also
It is registered in the common auxiliary storage device R8, and is stored in the voice pattern memo 1J14A, 14B in each voice recognition device IA, IB only in the talk and person change modes. However, the voice pattern for speaker change is always stored in each voice pattern memory 4A.

１４Ｂが記憶しておくようにすることができる。14B may be stored.

この場合、各音声パターンメモリの他の番地に、作業用
の音声パターンのうち、識別された話者に対応するパタ
ーンが選択的に格納されることとなる。In this case, a pattern corresponding to the identified speaker among the working voice patterns is selectively stored in another address of each voice pattern memory.

また、話者の識別にも音声認識手段を利用するものにつ
き説明したが、これは話者別のコードを、キーボードそ
の他のいかなる入力手段によって入力するようにしても
よく、この場合には、制御口゛　路が簡単に話者を識別
できる。In addition, although we have described a method that uses voice recognition means to identify the speaker, it is also possible to input a code for each speaker using a keyboard or any other input means, and in this case, the control The speaker's mouth can be easily identified.

−〔発明の効果〕本発明によれば、複数の音声認識装置を複数の話者が自
由に使用でき、話者の識別によって該当話者の音声パタ
ーンを対応する音声認識装置の音声パターン記憶手段へ
格納することによシ、認識精度に優れた音声認識装置を
提供することができ：＾・６- [Effects of the Invention] According to the present invention, a plurality of speech recognition devices can be freely used by a plurality of speakers, and the speech pattern storage means of the speech recognition device corresponds to the speech pattern of the corresponding speaker by identifying the speaker. By storing the information in the memory, it is possible to provide a speech recognition device with excellent recognition accuracy.

[Brief explanation of the drawing]

第１図は本発明の一実施例を示す音声情報入力装置の構
成を示すシステム構成図、第２図は第１図に示した音声
情報入力装置に使用する音声単語の１例と、その記憶内
容を示す図、第３図は話者交代と作業の１例を示す音声
情報入力の手順図、第４図は本発明の他の一実施例を示
す他の音声情報入力装置の構成を示すシステム構成図で
ある。ＩＡ、ＩＢ・・・音声認識手段、２Ａ、２Ｂ・・・音声
出力装置、３・・・制御手段、４・・・キーボード、５
・・・表示器またはプリンタ、６Ａ、６Ｂ・・・マイク
、７Ａ。７Ｂ・・・スピーカ、８・・・補助記憶手段、ＩＯＡ。１０Ｂ・・・音声入出力装置、２０Ａ、２０Ｂ・・・無
線機固定局、３０Ａ、３０Ｂ・・・無線機移動局、躬　
２　口第　３０FIG. 1 is a system configuration diagram showing the configuration of a voice information input device according to an embodiment of the present invention, and FIG. 2 is an example of a voice word used in the voice information input device shown in FIG. 1 and its memory. 3 is a diagram showing the contents, FIG. 3 is a procedure diagram of voice information input showing an example of speaker change and work, and FIG. 4 is a diagram showing the configuration of another voice information input device showing another embodiment of the present invention. It is a system configuration diagram. IA, IB... Voice recognition means, 2A, 2B... Voice output device, 3... Control means, 4... Keyboard, 5
... Display or printer, 6A, 6B... Microphone, 7A. 7B...Speaker, 8...Auxiliary storage means, IOA. 10B... Audio input/output device, 20A, 20B... Radio fixed station, 30A, 30B... Radio mobile station,
2nd part 30th

Claims

[Scope of Claims] 1. A speech recognition device that matches speech input information with a pre-registered speech pattern, including a plurality of microphones, each of which has a speech pattern storage means provided corresponding to each microphone, and a speech recognition device that matches speech input information with a pre-registered speech pattern. a plurality of voice recognition means that recognize the voice by comparing the voice input from the microphone with the voice pattern; a common auxiliary storage means that stores the voice pattern of each of the plurality of speakers; and any of the microphones. means for identifying a specific speaker, and a voice pattern of the specific speaker among the voice patterns for each of a plurality of speakers stored in the auxiliary storage means, to the voice pattern storage means of the corresponding voice recognition means; A speech recognition device comprising a control means for reading and storing information. 2. The speaker identification means identifies the speaker by comparing the speech pattern for speaker identification stored in the speech pattern storage means of the speech recognition means with the speech input by the speaker. The speech recognition device according to item 1, which is a means. 3. The common auxiliary storage means also stores the voice pattern for speaker identification, and the control means stores the voice pattern for the specific speaker's work according to the speaker identification result by the voice recognition means. 3. The speech recognition device according to claim 2, wherein the speech pattern is read from the auxiliary storage means and stored in the speech pattern storage means. 4. The speech recognition device according to item 1, wherein the speaker identification means includes means for inputting a code for each speaker.