JPH041915B2

Info

Publication number: JPH041915B2
Application number: JP58008214A
Authority: JP
Inventors: Masanori Myatake
Original assignee: Sanyo Electric Co Ltd
Current assignee: Sanyo Electric Co Ltd
Priority date: 1983-01-20
Filing date: 1983-01-20
Publication date: 1992-01-14
Also published as: JPS59133599A

Description

DETAILED DESCRIPTION OF THE INVENTION

(a) Field of Industrial Application

The present invention relates to a speech recognition device that recognizes speech.

(b) Prior Art

FIG. 1 shows the configuration of a conventional speech recognition device. In the figure, 1 is a microphone that converts speech into an electrical speech signal; 2 is a speech pattern extraction circuit that extracts, from the speech signal obtained by the microphone 1, a speech pattern representing the features of the speech; 3 is an input speech pattern memory that stores the speech pattern obtained by the speech pattern extraction circuit 2; and 4 is a registered speech pattern memory that stores in advance, with numbers assigned, the speech patterns of the plural utterances to be recognized. 5 is a recognition processing section that performs pattern recognition: it compares the speech pattern in the input speech pattern memory 3 with each speech pattern in the registered speech pattern memory 4 and outputs the number of the most similar speech pattern in the registered speech pattern memory 4. In other words, the basic configuration consisting of the microphone 1 through the recognition processing section 5 performs recognition of the speech input to the microphone 1. In addition, 6 is a comparison circuit, 7 is a counter, 8 is a display, and 9 is a correction section. The comparison circuit 6 through the correction section 9 perform the processing for correcting each speech pattern in the registered speech pattern memory 4 into a more accurate pattern.
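The comparison performed by the recognition processing section 5 can be illustrated with a minimal Python sketch. The patent does not specify how a speech pattern is represented or how similarity is measured, so the feature-vector type `Pattern`, the squared Euclidean `distance`, and the function name `recognize` are all assumptions made for illustration.

```python
from typing import Dict, List

# Assumed representation: a speech pattern is a fixed-length feature vector.
Pattern = List[float]


def distance(a: Pattern, b: Pattern) -> float:
    """Squared Euclidean distance (an assumed similarity measure)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def recognize(input_pattern: Pattern, registered: Dict[int, Pattern]) -> int:
    """Return the number of the registered pattern most similar to the input."""
    return min(registered, key=lambda n: distance(input_pattern, registered[n]))


# Toy registered memory: three numbered patterns.
registered = {1: [0.0, 0.0], 2: [1.0, 0.0], 3: [0.0, 1.0]}
print(recognize([0.9, 0.1], registered))  # prints 2
```

Any other similarity measure could be substituted for `distance` without changing the structure of the lookup.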

Next, the operation of the conventional speech recognition device described above is explained, taking the recognition of spoken place names as an example.

First, in the registration mode, the speaker utters the place names one after another into the microphone 1. During this, the counter circuit 7 steps the number shown on the display 8 upward from "1". When the display shows "1", the speaker inputs the utterance "Tokyo" into the microphone 1. The speech pattern of "Tokyo" extracted by the pattern extraction circuit 2 is temporarily stored in the input speech pattern memory 3 and is then stored in the registered speech pattern memory 4 in association with the number "1". Next, when the display 8 steps to "2", the speaker inputs the utterance "Nagoya" into the microphone 1, and, as with "Tokyo", the speech pattern of "Nagoya" is stored in the registered speech pattern memory 4 in association with the number "2". In the same manner, as the speaker continues, the speech patterns of "Kyoto", "Osaka", "Okayama", "Hiroshima", and "Hakata" are stored in the registered speech pattern memory 4 in association with the numbers "3", "4", "5", "6", and "7", respectively.
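The registration sequence above can be sketched as follows. This is only an illustration: `toy_extract` is a hypothetical stand-in for the pattern extraction circuit 2 (it derives two numbers from the spelling of a word, not from audio), and the names `register` and `PLACE_NAMES` are likewise assumptions.

```python
from typing import Callable, Dict, List

Pattern = List[float]  # assumed representation of a speech pattern


def toy_extract(utterance: str) -> Pattern:
    """Hypothetical stand-in for pattern extraction circuit 2 (not a real feature extractor)."""
    vowels = sum(ch in "aiueo" for ch in utterance.lower())
    return [float(len(utterance)), float(vowels)]


def register(utterances: List[str], extract: Callable[[str], Pattern]) -> Dict[int, Pattern]:
    """Store one pattern per utterance, numbered in the order prompted by the counter."""
    memory: Dict[int, Pattern] = {}
    for number, utterance in enumerate(utterances, start=1):
        input_pattern = extract(utterance)  # held temporarily, like input memory 3
        memory[number] = input_pattern      # stored with its number, like memory 4
    return memory


PLACE_NAMES = ["Tokyo", "Nagoya", "Kyoto", "Osaka", "Okayama", "Hiroshima", "Hakata"]
memory = register(PLACE_NAMES, toy_extract)
print(sorted(memory))  # prints [1, 2, 3, 4, 5, 6, 7]
```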

However, in the registration mode described above, the speaker cannot always utter each word with accurate pronunciation and accent, so an erroneous speech pattern may be stored in the registered speech pattern memory 4. A correction mode is therefore provided for correcting such erroneous speech patterns. In this correction mode, the counter circuit 7 steps the number shown on the display 8 from "1" to "7" as in the registration mode, and when the display shows "1" the speaker inputs the utterance "Tokyo" into the microphone 1. The speech pattern of "Tokyo" extracted by the pattern extraction circuit 2 is temporarily stored in the input speech pattern memory 3. The recognition processing section 5 then performs pattern recognition of the "Tokyo" speech pattern in the input speech pattern memory 3 on the basis of the seven speech patterns in the registered speech pattern memory 4. If the input is correctly recognized as the "Tokyo" speech pattern associated with the number "1" in the registered speech pattern memory 4, the comparison circuit 6 detects that the number "1" output from the recognition processing section 5 matches the count value "1" of the counter circuit 7 and outputs a match signal S. The match signal S steps the count value of the counter circuit 7 and changes the number shown on the display 8 to "2". In the same manner, the speaker inputs into the microphone 1, one after another, the place name corresponding to the number shown on the display 8.

Suppose, for example, that the display 8 shows "4" and the speaker inputs the utterance "Osaka" into the microphone 1, but the recognition processing section 5 judges the input speech pattern to be most similar to the "Okayama" speech pattern associated with the number "5" in the registered speech pattern memory 4 and outputs the number "5". The comparison circuit 6, which compares this number "5" with the count value "4" of the counter circuit 7, then outputs a mismatch signal. This mismatch signal does not step the counter circuit 7 but instead activates the correction section 9. The correction section 9 averages the "Osaka" speech pattern just stored in the input speech pattern memory 3 with the "Osaka" speech pattern associated with the number "4" in the registered speech pattern memory 4 to obtain a new, averaged "Osaka" speech pattern, and replaces the "Osaka" speech pattern in the registered speech pattern memory 4 with this averaged pattern. The display 8 continues to show "4", so the speaker inputs the utterance "Osaka" into the microphone 1 again, and this correction of the "Osaka" speech pattern in the registered speech pattern memory 4 is repeated until the recognition processing section 5 correctly recognizes the utterance as "Osaka" and outputs the number "4". Once this correction is complete, the same correction processing moves on to the next utterance, "Okayama", in the registered speech pattern memory 4, and continues in sequence up to the last utterance, "Hakata".
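One utterance of the conventional correction mode can be sketched as below. The element-wise mean in `average`, the squared Euclidean `distance`, and the toy two-dimensional patterns are assumptions; the patent states only that the two patterns are averaged and does not give the arithmetic.

```python
from typing import Dict, List

Pattern = List[float]  # assumed representation of a speech pattern


def distance(a: Pattern, b: Pattern) -> float:
    """Squared Euclidean distance (assumed similarity measure)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def recognize(input_pattern: Pattern, registered: Dict[int, Pattern]) -> int:
    """Number of the most similar registered pattern (recognition processing section 5)."""
    return min(registered, key=lambda n: distance(input_pattern, registered[n]))


def average(a: Pattern, b: Pattern) -> Pattern:
    """Element-wise mean of two patterns (assumed form of the averaging)."""
    return [(x + y) / 2 for x, y in zip(a, b)]


def conventional_step(counter: int, input_pattern: Pattern,
                      registered: Dict[int, Pattern]) -> int:
    """Process one utterance; return the counter value for the next prompt."""
    recognized = recognize(input_pattern, registered)
    if recognized == counter:
        return counter + 1  # match signal S: step the counter
    # Mismatch: average the prompted entry with the input, keep the counter.
    registered[counter] = average(registered[counter], input_pattern)
    return counter


# Toy data: entry 4 stands for "Osaka", entry 5 for "Okayama".
registered = {4: [0.0, 0.0], 5: [3.5, 0.0]}
osaka = [2.0, 0.0]  # toy pattern of the speaker's "Osaka"
counter = 4
counter = conventional_step(counter, osaka, registered)  # misrecognized as 5
first = (counter, list(registered[4]))
counter = conventional_step(counter, osaka, registered)  # now recognized as 4
print(first, counter)  # prints (4, [1.0, 0.0]) 5
```

In this toy case a single averaging step is enough; the conventional device simply repeats the step until the prompted number is recognized.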

In the recognition mode, the speech pattern of the speech input to the microphone 1 is extracted by the pattern extraction circuit 2, stored in the input speech pattern memory 3, and recognized by the recognition processing section 5 on the basis of the speech patterns in the registered speech pattern memory 4. For example, if the recognition processing section 5 outputs the number "4", the speaker's utterance was "Osaka".

However, in such a conventional speech recognition device, if, for example, the pronunciation or accent of the utterance "Okayama" is inaccurate in the registration mode, or noise is mixed in when it is uttered, the "Okayama" speech pattern associated with the number "5" in the registered speech pattern memory 4 may end up resembling the "Osaka" speech pattern associated with the number "4". In that case, in the correction mode, the "Osaka" speech pattern in the registered speech pattern memory 4 is corrected first, following the order of the numbers. Even if the "Osaka" speech pattern associated with the number "4" is fairly accurate, the "Osaka" speech pattern uttered by the speaker may instead resemble the inaccurate "Okayama" speech pattern associated with the number "5" more closely. The recognition processing section 5 then outputs the number "5" and the "Osaka" speech pattern in the registered speech pattern memory 4 is corrected, but this correction cannot make the "Osaka" speech pattern sufficiently distinguishable from the "Okayama" speech pattern that erroneously resembles it. As a result, not only can the correction of the "Osaka" speech pattern in the registered speech pattern memory 4 never be completed, but the correction of all subsequent speech patterns also becomes impossible.

(c) Object of the Invention

The object of the present invention is to eliminate the drawback described above and to provide a speech recognition device that can reliably correct the speech patterns in the registered speech pattern memory.

(d) Structure of the Invention

The speech recognition device of the present invention comprises correction means for correcting each speech pattern in the registered speech pattern memory, and instruction means for instructing the speaker to utter the registered speech that the correction means is to correct. In the correction mode, when the speaker inputs the registered speech indicated by the instruction means and it is misrecognized as another registered speech, the correction means changes the registered speech to be corrected to the misrecognized registered speech, and the instruction means indicates to the speaker the new registered speech to which the correction means has changed.

(e) Embodiment

FIG. 2 shows an embodiment of the speech recognition device of the present invention. In the figure, elements that perform the same functions as those of the conventional device in FIG. 1 are given the same reference numerals as in FIG. 1. Reference numerals 6', 8', and 9' denote a comparison circuit, a display, and a correction section as in the conventional device, but, as described later, what they compare, display, and correct differs from the conventional device.

10 is a selection circuit that selects either the count value of the counter circuit 7 or the output number of the recognition processing section 5 and transmits the selected value to the comparison circuit 6', the display 8', and the correction section 9'. When it receives the match signal S from the comparison circuit 6', it outputs the count value of the counter circuit 7. Conversely, when it receives the mismatch signal from the comparison circuit 6', it outputs the output number from the recognition processing section 5, and when it next receives either the match signal S or the mismatch signal from the comparison circuit 6', its output returns to the count value of the counter circuit 7.

11 is a switching circuit that opens and closes the connection between the counter circuit 7 and the match signal S line that steps the counter circuit 7. It opens and closes in synchronization with the selection circuit 10, so that the match signal S is connected to the counter circuit 7 only while the selection circuit 10 is outputting the count value of the counter circuit 7.

Accordingly, the comparison circuit 6' outputs the match signal S when the output number from the recognition processing section 5 matches the output value from the selection circuit 10, and outputs the mismatch signal when they do not match. The match signal S places the switching circuit 11 in the connected state and thereby steps the counter circuit 7; the mismatch signal causes the correction section 9' to perform its correction operation; and both signals drive the selection operation of the selection circuit 10. The display 8' shows the value received from the selection circuit 10, that is, either the count value of the counter circuit 7 or the output number of the recognition processing section 5. When the correction section 9' receives the mismatch signal from the comparison circuit 6', it corrects the speech pattern in the registered speech pattern memory 4 that corresponds to the output number of the recognition processing section 5 obtained from the selection circuit 10.
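The behavior of the selection circuit 10 and the switching circuit 11 described above can be modeled as a small state machine. This is a sketch under assumptions: the class and method names are invented for illustration, and the idea that the selection circuit latches the recognition number until the next signal is one reading of the description, not something the patent states in circuit terms.

```python
from typing import Optional


class SelectionCircuit:
    """Toy model of selection circuit 10 together with the state of switching circuit 11."""

    def __init__(self) -> None:
        # Recognition number held after a mismatch; None means "pass the counter value".
        self.latched: Optional[int] = None

    def output(self, counter_value: int) -> int:
        """Value sent to the comparison circuit, the display, and the correction section."""
        return counter_value if self.latched is None else self.latched

    def gate_closed(self) -> bool:
        """Switching circuit 11 connects signal S to the counter only in pass-through state."""
        return self.latched is None

    def on_comparison(self, match: bool, recognized_number: int) -> None:
        """Update the selection when the comparison circuit emits a signal."""
        if self.latched is None and not match:
            self.latched = recognized_number  # mismatch: switch to the recognition number
        else:
            self.latched = None               # match, or any signal after a mismatch


selector = SelectionCircuit()
print(selector.output(4), selector.gate_closed())  # prints 4 True
selector.on_comparison(match=False, recognized_number=5)
print(selector.output(4), selector.gate_closed())  # prints 5 False
selector.on_comparison(match=False, recognized_number=3)
print(selector.output(4), selector.gate_closed())  # prints 4 True
```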

The operation of the speech recognition device of the present invention, configured as described above, in recognizing spoken place names is now described. The registration mode and the recognition mode operate in the same way as in the conventional device, so their description is omitted.

Accordingly, the correction mode is described below on the assumption that, as in the description of the conventional example, the utterances "Tokyo" through "Hakata" are stored in the registered speech pattern memory 4 in association with the numbers "1" through "7".

First, in the initial state, the count value "1" of the counter circuit 7 is forcibly input to the display 8' through the selection circuit 10, and the display 8' shows the number "1". When the speaker accordingly inputs the utterance "Tokyo", which is associated with the number "1", into the microphone 1, its speech pattern is extracted by the pattern extraction circuit 2, temporarily stored in the input speech pattern memory 3, and recognized by the recognition processing section 5. If the recognition processing section 5 correctly judges it to be the "Tokyo" speech pattern associated with the number "1" in the registered speech pattern memory 4, it outputs the number "1", and the comparison circuit 6' sends the match signal S to the counter circuit 7 through the closed switching circuit 11 and also to the selection circuit 10. The correction section 9' therefore performs no correction, and the selection circuit 10 selects the stepped count value "2" of the counter circuit 7 and has the display 8' show it. In this way, until the comparison circuit 6' outputs a mismatch signal, the number shown on the display 8' changes in numerical order, and the speaker inputs the place names into the microphone 1 one after another accordingly.

On the other hand, suppose, for example, that the display 8' shows "4" and the speaker inputs the corresponding utterance "Osaka" into the microphone 1, but the recognition processing section 5 judges the speech pattern to be most similar to the "Okayama" speech pattern, associated with the number "5" in the registered speech pattern memory 4, which has become inaccurate for some reason, and outputs the number "5". The comparison circuit 6' then outputs the mismatch signal. The mismatch signal opens the switching circuit 11, and the counter circuit 7 holds the count value "4". Meanwhile, the selection circuit 10 selects the number "5" from the recognition processing section 5 and has the display 8' show it, and the correction section 9' performs a correction operation. Specifically, the correction section 9' first replaces the "Osaka" speech pattern in the registered speech pattern memory 4, which is associated with the number that the selection circuit 10 was outputting immediately before its output changed to "5" (that is, the count value "4" held by the counter circuit 7), with an averaged speech pattern, in the same way as the correction section 9 of the conventional device in FIG. 1. The "Okayama" speech pattern in the registered speech pattern memory 4, associated with the number "5" obtained from the selection circuit 10, then becomes the new correction target of the correction section 9'.

When the speaker then inputs the utterance "Okayama" into the microphone 1 in accordance with the number "5" shown on the display 8', the correction section 9' corrects the "Okayama" speech pattern in the registered speech pattern memory 4 to its averaged pattern. At this point, however, the number output by the recognition processing section 5 as its recognition result is not necessarily "5"; because the "Okayama" speech pattern in the registered speech pattern memory 4 is inaccurate, the output number is often not "5". The comparison circuit 6' therefore outputs either the match signal S or the mismatch signal, and whichever signal it is, the switching circuit 11 closes and the selection circuit 10 again outputs the count value "4" that the counter circuit 7 has been holding. As a result, the correction target of the correction section 9' returns to the utterance "Osaka", the number shown on the display 8' changes from "5" to "4", and the speaker is instructed to input the utterance "Osaka" into the microphone 1 again.

In this way, the inaccurate "Okayama" speech pattern in the registered speech pattern memory 4, which is the cause of the misrecognition of the utterance "Osaka", is corrected first, and the correction processing for the "Osaka" speech pattern is then started again. Because the "Okayama" speech pattern in the registered speech pattern memory 4 has by now been corrected to the point where it can be sufficiently distinguished from the "Osaka" speech pattern, and because the "Osaka" speech pattern has itself already been corrected once, repeating this correction processing two or three times at most removes the influence of inaccurate speech patterns other than "Osaka" and allows the correction of the "Osaka" speech pattern to be completed. The comparison circuit 6' then outputs the match signal S, the processing moves promptly to the correction of the next utterance, "Okayama" (number "5"), and the correction processing continues in the same manner up to the last utterance, "Hakata" (number "7").
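The correction flow of the embodiment can be simulated with a short Python sketch. Everything numeric here is a toy assumption: patterns are two-element vectors, similarity is squared Euclidean distance, and averaging is an element-wise mean. The class name `CorrectionMode` and its attributes are invented for illustration, and correcting the detour target regardless of the recognition result is one reading of the description above.

```python
from typing import Dict, List, Optional

Pattern = List[float]  # assumed representation of a speech pattern


def distance(a: Pattern, b: Pattern) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def recognize(input_pattern: Pattern, registered: Dict[int, Pattern]) -> int:
    return min(registered, key=lambda n: distance(input_pattern, registered[n]))


def average(a: Pattern, b: Pattern) -> Pattern:
    return [(x + y) / 2 for x, y in zip(a, b)]


class CorrectionMode:
    """Toy model of the correction mode with counter 7, selector 10, and section 9'."""

    def __init__(self, registered: Dict[int, Pattern], start: int = 1) -> None:
        self.registered = registered
        self.counter = start
        self.detour: Optional[int] = None  # misrecognized number shown instead of the counter

    @property
    def prompt(self) -> int:
        """Number shown to the speaker (display 8')."""
        return self.counter if self.detour is None else self.detour

    def step(self, input_pattern: Pattern) -> None:
        if self.detour is not None:
            # Detour utterance: correct the misrecognized entry, then return to the counter.
            target = self.detour
            self.registered[target] = average(self.registered[target], input_pattern)
            self.detour = None
            return
        recognized = recognize(input_pattern, self.registered)
        if recognized == self.counter:
            self.counter += 1  # match signal S steps the counter
        else:
            # Mismatch: correct the prompted entry, then prompt the misrecognized one.
            self.registered[self.counter] = average(self.registered[self.counter], input_pattern)
            self.detour = recognized


# Entry 4 ("Osaka") is fairly accurate; entry 5 ("Okayama") was registered too close to it.
registered = {4: [1.0, 0.0], 5: [0.25, 0.0]}
utterances = {4: [0.0, 0.0], 5: [4.0, 0.0]}  # toy patterns the speaker actually produces
mode = CorrectionMode(registered, start=4)
prompts = []
while mode.counter <= 5:
    prompts.append(mode.prompt)
    mode.step(utterances[mode.prompt])
print(prompts)  # prints [4, 5, 4, 5]
```

In this toy run the prompt sequence mirrors the text: "Osaka" is misrecognized, "Okayama" is corrected on the detour, and "Osaka" then matches on the next attempt.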

In the above description, the correction section 9', which corrects a registered speech pattern with an averaged pattern, was shown as the correction means for correcting the registered speech patterns in the registered speech pattern memory 4. However, the correction means may instead simply replace the registered speech pattern with the input speech pattern.
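The two correction strategies mentioned here can be contrasted in a few lines. The function names and the element-wise mean are assumptions for illustration; the patent does not specify how the averaging is computed.

```python
from typing import List

Pattern = List[float]  # assumed representation of a speech pattern


def correct_by_averaging(registered: Pattern, new: Pattern) -> Pattern:
    """Blend the registered pattern with the new input (element-wise mean, assumed)."""
    return [(r + n) / 2 for r, n in zip(registered, new)]


def correct_by_replacement(registered: Pattern, new: Pattern) -> Pattern:
    """Discard the registered pattern and keep only the new input."""
    return list(new)


old, new = [1.0, 3.0], [3.0, 5.0]
print(correct_by_averaging(old, new))    # prints [2.0, 4.0]
print(correct_by_replacement(old, new))  # prints [3.0, 5.0]
```

Averaging retains part of the earlier registration, while replacement depends entirely on the latest utterance.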

Also, the display 8', which shows the number associated with an utterance, was used as the instruction means for telling the speaker which utterance to produce. However, the instruction means may instead display the utterance itself, or a speech synthesis circuit may be provided to output the instruction as speech.

(f) Effects of the Invention

As is clear from the above description, the speech recognition device of the present invention comprises correction means for correcting each speech pattern in the registered speech pattern memory, and instruction means for instructing the speaker to utter the registered speech that the correction means is to correct. In the correction mode, when the speaker inputs the registered speech indicated by the instruction means and it is misrecognized as another registered speech, the correction means changes the registered speech to be corrected to the misrecognized registered speech, and the instruction means indicates to the speaker the new registered speech to which the correction means has changed. Consequently, even when the speech pattern that was the original correction target is relatively accurate and another speech pattern erroneously resembles it for some reason, the original speech pattern can be corrected after the inaccurate other speech pattern has first been corrected. The original speech pattern can therefore be corrected accurately enough to be sufficiently distinguished from the other speech patterns, and the correction processing can be completed reliably and promptly. This can be expected to greatly improve the operability of this type of speech recognition device, and it also helps reduce misrecognition in the recognition mode.

[Brief Description of the Drawings]

FIG. 1 is a block diagram of a conventional speech recognition device, and FIG. 2 is a block diagram of an embodiment of the speech recognition device of the present invention. 1 is a microphone, 2 is a speech pattern extraction circuit, 3 is an input speech pattern memory, 4 is a registered speech pattern memory, 5 is a recognition processing section, 6 and 6' are comparison circuits, 7 is a counter circuit, 8 and 8' are displays, 9 and 9' are correction sections, 10 is a selection circuit, and 11 is a switching circuit.

Claims

[Scope of Claims]

1. A speech recognition device in which a recognition processing section performs pattern recognition of a speech pattern, obtained by a pattern extraction circuit from a speech signal produced by inputting speech into a microphone, on the basis of a plurality of speech patterns in a registered speech pattern memory, the device comprising:
instruction means for sequentially indicating to a speaker the registered speech to be re-uttered into the microphone in order to correct each speech pattern in the registered speech pattern memory; and
correction means for correcting the speech pattern of the registered speech indicated by the instruction means on the basis of the speech pattern extracted by the pattern extraction circuit from the speech re-uttered by the speaker into the microphone;
wherein, in a pattern correction mode, when the registered speech indicated by the instruction means differs from the recognition processing result for the speech that the speaker input into the microphone in accordance with that indication, the correction means corrects the speech pattern of the indicated registered speech on the basis of the input speech pattern at that time, and the instruction means indicates, as a new correction target, the registered speech of the registered speech pattern indicated by the recognition processing result.