JPH04134397A - Voice recognizing device - Google Patents

Voice recognizing device

Info

Publication number
JPH04134397A
JPH04134397A JP2258058A JP25805890A JPH04134397A JP H04134397 A JPH04134397 A JP H04134397A JP 2258058 A JP2258058 A JP 2258058A JP 25805890 A JP25805890 A JP 25805890A JP H04134397 A JPH04134397 A JP H04134397A
Authority
JP
Japan
Prior art keywords
recognition
memory
word
highest degree
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2258058A
Other languages
Japanese (ja)
Inventor
Etsuji Shuda
周田 悦治
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Holdings Corp
Original Assignee
Matsushita Electric Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co Ltd filed Critical Matsushita Electric Industrial Co Ltd
Priority to JP2258058A priority Critical patent/JPH04134397A/en
Publication of JPH04134397A publication Critical patent/JPH04134397A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To prevent the same recognition error from being repeated many times by outputting a recognition word having the highest degree of resemblance from a successive memory at the time of first recognition and outputting a recognition word having the highest degree of resemblance other than recognition words, which are already outputted as recognition results, at the time of n-th recognition. CONSTITUTION:When a control part 8 discriminates first voice recognition by a recognition end signal from a collator 4, an output switch 12 is connected to a contact (a) to output the recognition result having the highest degree of resemblance stored in a memory 1 from a recognizing device. The recognition result of first recognition is inputted to a register 10 and is compared with the recognition result having the highest degree of resemblance of second recognition, namely, contents of the memory 1 of a successive rewrite memory 6; and if they are different, the output switch 12 is connected to the contact (a). If they are equal as the comparison result, the output switch 12 is connected to a contact (b) to output the recognition result having the highest degree of resemblance different from that of first recognition. Thus, the same recognition result error is not outputted for rephrasing.

Description

【発明の詳細な説明】 産業上の利用分野 本発明は音声を用いて各種機器を制御したり、ビデオテ
ープレコーダーなどのタイマー予約などのデータを入力
する音声認識装置に関するものである。
DETAILED DESCRIPTION OF THE INVENTION Field of the Invention The present invention relates to a voice recognition device that uses voice to control various devices and input data such as timer reservations for video tape recorders and the like.

従来の技術 近年、音声認識技術の進歩に伴い、別の作業を行ないな
がら音声で機器を制御したり、操作としての簡便さから
ビデオテープレコーダーなとのタイマー予約データの入
力等に音声認識装置が用い始められてきた。
Conventional technology In recent years, with the advancement of voice recognition technology, voice recognition devices have become popular for controlling equipment by voice while performing other tasks, and for inputting timer reservation data for video tape recorders due to the ease of operation. It has started to be used.

以下、図面を参照しながら、上述した従来の音声認識装
置の一例について説明する。
An example of the conventional speech recognition device described above will be described below with reference to the drawings.

第2図は従来の音声認識装置の構成を示すものである。FIG. 2 shows the configuration of a conventional speech recognition device.

第2図において、11は音声を電気信号に変換するマイ
クロフォンである。12は増幅器である。13はアナロ
グ−デジタル変換器(以下、A−D変換器と称す)であ
る。14は照合器である。15は認識する単語のパター
ンを登録しておく認識単語メモリーである。16は認識
結果を格納する逐次書き替えメモリーである。17は認
識のスタートを行なう認識ボタンである。18は認識を
制御する制御部である。
In FIG. 2, 11 is a microphone that converts audio into electrical signals. 12 is an amplifier. 13 is an analog-to-digital converter (hereinafter referred to as an A-D converter). 14 is a collation device. 15 is a recognition word memory in which patterns of words to be recognized are registered. Reference numeral 16 denotes a sequentially rewritten memory for storing recognition results. 17 is a recognition button for starting recognition. 18 is a control unit that controls recognition.

以上のように構成された従来の音声認識装置について、
以下その動作を説明する。
Regarding the conventional speech recognition device configured as above,
The operation will be explained below.

まず、音声認識を行なうために認識ボタン17を操作す
ると、制御部18は、照合器14に音声認識開始制御を
行なう。発音された音声はマイクロフォン11で電気信
号に変換されて、増幅器12で照合に必要な振幅まで増
幅される。増幅されたアナログ電気信号はA−D変換器
13でデジタル信号に変換される。認識単語メモリー1
5は認識させようとする単語のパターンをデジタル信号
として登録しであるもので、あらかじめ登録しである読
みだし専用メモリーであってもよいし、使うときに利用
者の登録か可能な逐次書き替えメモリーであってもよい
。例えば、数字の0から9までを認識させる場合15の
認識単語メモリーにはOから9までのそれぞれの音声の
パターンをデジタル化したものを格納しである。照合器
14はA−D変換器13でデジタル化した音声入力と認
識単語メモリー15のデータを照合し類似度の結果を認
識結果逐次書き替えメモリー16に格納する。発音され
た音声の認識結果は類似度の最も高かったものが出力さ
れる。
First, when the recognition button 17 is operated to perform voice recognition, the control section 18 controls the collation device 14 to start voice recognition. The spoken voice is converted into an electrical signal by the microphone 11, and amplified by the amplifier 12 to the amplitude required for verification. The amplified analog electrical signal is converted into a digital signal by an A-D converter 13. Recognition word memory 1
5 is a device in which the pattern of the word to be recognized is registered as a digital signal, which may be registered in advance in a read-only memory, or may be rewritten sequentially by user registration at the time of use. It may be memory. For example, in order to recognize the numbers 0 to 9, the digitized sound patterns of each of the numbers 0 to 9 are stored in the recognition word memory 15. The collation device 14 collates the voice input digitized by the A/D converter 13 with the data in the recognition word memory 15, and stores the similarity result in the recognition result sequential rewriting memory 16. The recognition result of the pronounced voice with the highest degree of similarity is output.

発明が解決しようとする課題 しかしながら上記のような構成では音声認識の誤りを補
正することができないという問題点を持っている。音声
認識は100%の認識率を持つことは極めて困難であり
、一定の誤り率を念頭においた認識装置が必要である。
Problems to be Solved by the Invention However, the above configuration has a problem in that errors in speech recognition cannot be corrected. It is extremely difficult to achieve a 100% recognition rate in speech recognition, and a recognition device is required with a certain error rate in mind.

単語の中には9(きゅう)と10(しゅう)のように発
音の極めて似通ったものが存在し、利用者の発音の癖に
よっては何度も発音した内容と異なる認識結果に終わる
可能性があるという問題点を持っている。
Some words, such as 9 (kyu) and 10 (shuu), have extremely similar pronunciations, and depending on the user's pronunciation habits, the recognition result may differ from what has been pronounced many times. It has a certain problem.

本発明は上記問題点に鑑み、複数の音声の認識がされた
場合、最初に認識したものと異なる、類似度の高い単語
を認識結果として出力する手段を持つことで、何度も同
じ認識誤りを繰り返さないという使い勝手のよい音声認
識装置を提供するものである。
In view of the above-mentioned problems, the present invention has a means for outputting a highly similar word different from the first recognized word as a recognition result when multiple voices are recognized, thereby preventing the same recognition error from occurring over and over again. To provide a voice recognition device that is easy to use and does not repeat the process.

課題を解決するための手段 上記問題点を解決するために本発明の音声認識装置は入
力された音声と複数の認識単語を登録した認識単語メモ
リーとの照合度に応じて類似度の高さを格納する逐次メ
モリー手段と、認識の1回目は最も類似度の高い認識単
語を出力する認識結果出力手段と、n回目の認識では、
すでに認識結果として出力した認識単語以外の最も類似
度の高い認識単語を出力する認識結果出力手段と、1回
目の認識に戻すリセット手段を備えた構成を持つもので
ある。
Means for Solving the Problems In order to solve the above problems, the speech recognition device of the present invention determines the degree of similarity depending on the degree of matching between input speech and a recognition word memory in which a plurality of recognition words are registered. A sequential memory means for storing, a recognition result output means for outputting a recognized word with the highest similarity in the first recognition, and a recognition result output means in the nth recognition.
The recognition result output means outputs a recognition word with the highest degree of similarity other than the recognition words already output as recognition results, and the recognition result output means includes a reset means for returning to the first recognition.

作用 本発明は上記した構成によって、入力された音声と、複
数の認識単語を登録した認識単語メモリーとの照合度に
応して類似度の高さを格納する逐次メモリー手段をもち
、認識の1回目は最も類似度の高い認識単語を逐次メモ
リーから出力し、n回目の認識では、すでに認識結果と
して出力した認識単語以外の最も類似度の高い認識単語
を出力できるようになる。また再び−から認識を開始す
る時は、1回目の認識に戻すリセット手段によって認識
のスタートに戻せるようにできることとなる。
Effects The present invention has the above-described configuration, and has a sequential memory means for storing the degree of similarity according to the degree of matching between the input speech and the recognition word memory in which a plurality of recognition words are registered, In the n-th recognition, the recognition words with the highest degree of similarity are sequentially output from the memory, and in the n-th recognition, the recognition words with the highest degree of similarity other than the recognition words that have already been output as recognition results can be output. Furthermore, when starting the recognition from - again, the reset means for returning to the first recognition can be used to return to the start of the recognition.

実施例 以下、本発明の一実施例の音声認識装置について図面を
参照しながら説明する。
Embodiment Hereinafter, a speech recognition device according to an embodiment of the present invention will be described with reference to the drawings.

第1図は本発明の一実施例の音声認識装置の構成を示す
ブロック図である。
FIG. 1 is a block diagram showing the configuration of a speech recognition device according to an embodiment of the present invention.

第1図において、1は音声を電気信号に変換するマイク
ロフォンである。2は増幅器である。3はアナログ−デ
ジタル変換器(以下、A−D変換器と称す)である。4
は照合器である。5は認識する単語のパターンを登録し
ておく認識単語メモリーである。6は認識結果を格納す
る逐次書き替えメモリーである。7は認識のスタートを
行なう認識ボタン、である。8は認識を制御する制御部
である。9は認識の類似度の最も高い結果を出力するス
イッチである。10は認識の類似度の最も高い結果を一
時的に保持するレジスタである。
In FIG. 1, 1 is a microphone that converts audio into electrical signals. 2 is an amplifier. 3 is an analog-to-digital converter (hereinafter referred to as an A-D converter). 4
is a matcher. 5 is a recognition word memory in which patterns of words to be recognized are registered. 6 is a sequentially rewritten memory that stores recognition results. 7 is a recognition button for starting recognition. 8 is a control unit that controls recognition. 9 is a switch that outputs the result with the highest recognition similarity. 10 is a register that temporarily holds the result with the highest degree of recognition similarity.

11は比較器である。12は認識結果の出力切り替えス
イッチである。
11 is a comparator. 12 is a recognition result output changeover switch.

以上のように構成された音声認識装置について以下その
動作を説明する。
The operation of the speech recognition device configured as described above will be explained below.

まず、音声認識を行なうために認識ボタン7を操作する
と制御部8は照合器4に音声認識開始制御を行なう。発
音された音声はマイクロフォン1で電気信号に変換され
て増幅器2で照合に必要な振幅まで増幅される。増幅さ
れたアナログ電気信号はA−D変換器3でデジタル信号
に変換される。認識単語メモリー5は認識させようとす
る単語のパターンをデジタル信号として登録しであるも
ので、あらかじめ登録しである読みだし専用メモリーで
あってもよいし、使うときに利用者の登録が可能な逐次
書き替えメモリーであってもよい。例えば数字の0から
9までを認識させる場合、認識単語メモリー5には0か
ら9までのそれぞれの音声のパターンをデジタル化した
ものを格納しである。照合器4はA−D変換器3でデジ
タル化した音声入力と認識単語メモリー5のデータを照
合し、類似度の結果を認識結果逐次書き替えメモリー6
に格納する。類似度の最も高かった順に、順次逐次書き
替えメモリー6のメモリー1から順に符合化されて格納
される。
First, when the recognition button 7 is operated to perform voice recognition, the control section 8 controls the collation device 4 to start voice recognition. The voice produced is converted into an electrical signal by a microphone 1 and amplified by an amplifier 2 to the amplitude necessary for verification. The amplified analog electrical signal is converted into a digital signal by an A-D converter 3. The recognition word memory 5 registers the pattern of the word to be recognized as a digital signal, and may be a read-only memory that is pre-registered, or can be registered by the user at the time of use. It may be a sequentially rewritten memory. For example, when the numbers 0 to 9 are to be recognized, the recognition word memory 5 stores digitized speech patterns for each of the numbers 0 to 9. The collation device 4 collates the voice input digitized by the A-D converter 3 with the data in the recognition word memory 5, and stores the similarity results in the recognition result memory 6.
Store in. The data are encoded and stored in order from memory 1 of the rewriting memory 6 in order of highest similarity.

制御部8は初めての音声認識であるかを照合器4から認
識終了信号により判別し、1回目の認識であれば出力ス
イッチ12をaにたおし、メモリー1に格納された最も
類似度の高い認識結果を認識装置から出力する。また、
スイッチ9を閉じ、レジスタ10にメモリー1のデータ
を格納し再びスイッチ9を開放する。
The control unit 8 determines whether this is the first speech recognition based on the recognition end signal from the collation device 4, and if it is the first recognition, sets the output switch 12 to a, and selects the recognition with the highest degree of similarity stored in the memory 1. Output the results from the recognition device. Also,
The switch 9 is closed, the data of the memory 1 is stored in the register 10, and the switch 9 is opened again.

ここで、再び音声認識がされ、上記の過程と同様に逐次
書き替えメモリー6に2回目の認識結果が登録された時
、制御部8は2回目の認識であることを検知して、比較
器11を作動させる。1回目の認識結果はレジスタ10
に入っており2回目の認識の最も、類似度の高い認識単
語、すなわち逐次書き替えメモリー6のメモリー1の内
容と比較し異なっている場合は出力スイッチ12をaに
倒す。もし、比較結果が同一の場合は出力スイッチ12
をbに倒すことで、1回目と異なる類似度の高い認識結
果を出力することができる。レジスタ10の数を増やし
複数の過去の認識結果を登録するようにすればこの動作
をn回繰り返すことも可能である。また初期段階に戻す
には、制御部8において、回数の制限を設けるか、スイ
ッチ7か押されることで可能である。
Here, when voice recognition is performed again and the second recognition result is registered in the sequential rewriting memory 6 in the same way as in the above process, the control unit 8 detects that it is the second recognition and starts the comparator. 11 is activated. The first recognition result is in register 10
If the word is different from the recognized word with the highest degree of similarity in the second recognition, that is, the content of memory 1 of the sequential rewriting memory 6, the output switch 12 is turned to a. If the comparison results are the same, the output switch 12
By changing b to b, it is possible to output a recognition result with a high degree of similarity that is different from the first recognition result. This operation can be repeated n times by increasing the number of registers 10 and registering a plurality of past recognition results. Further, in order to return to the initial stage, it is possible to set a limit on the number of times in the control section 8 or by pressing the switch 7.

発明の効果 以上のように本発明によれば、入力された音声と、複数
の認識単語を登録した認識単語メモリーとの照合度に応
じて類似度の高さを格納する逐次メモリー手段と、認識
の1回目は最も類似度の高い認識単語を出力する認識結
果出力手段と、n回目の認識では、すでに認識結果とし
て出力した認識単語以外の最も類似度の高い認識単語を
出力する認識結果出力手段と、1回目の認識に戻すリセ
ット手段を備えた構成を持つことで言い直しを行なった
時に同じ認識結果の誤りを出力することがなくなるとい
った優れた効果を得ることができる。
Effects of the Invention As described above, according to the present invention, there is provided a sequential memory means for storing the degree of similarity according to the degree of matching between input speech and a recognition word memory in which a plurality of recognition words are registered; a recognition result output means for outputting a recognized word with the highest degree of similarity for the first recognition; and a recognition result output means for outputting a recognized word of the highest degree of similarity for the n-th recognition, other than the recognition word that has already been output as a recognition result. By having a configuration including a reset means for returning to the first recognition, it is possible to obtain an excellent effect that an error in the same recognition result will not be output when rewording is performed.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明の一実施例のタイマー予約装置を示すブ
ロック図、第2図は従来のタイマー予約装置のブロック
図である。 1・・・・・・マイクロフォン、2・・・・・・増幅器
、3・・・・・・A−D変換器、4・・・・・・照合器
、5・・・・・・認識単語メモリー 6・・・・・・認
識結果メモリー 7・・・・・・認識スイッチ、8・・
・・・・制御部、9・・・・・・スイッチ、10・・・
・・・レジスタ、11・・・・・・比較器、12・・・
・・・出カスインチ。
FIG. 1 is a block diagram showing a timer reservation device according to an embodiment of the present invention, and FIG. 2 is a block diagram of a conventional timer reservation device. 1...Microphone, 2...Amplifier, 3...A-D converter, 4...Verifier, 5...Recognized word Memory 6... Recognition result memory 7... Recognition switch, 8...
...Control unit, 9...Switch, 10...
...Register, 11...Comparator, 12...
... Out of stock.

Claims (1)

【特許請求の範囲】[Claims] 複数の単語の音声の認識を行なう音声認識装置であって
、入力された音声と複数の認識単語を登録した認識単語
メモリーとの照合度に応じて類似度の高さを格納する逐
次メモリー手段と、認識の1回目は最も類似度の高い認
識単語を出力する認識結果出力手段と、n回目の認識で
は、すでに認識結果として出力した認識単語以外の最も
類似度の高い認識単語を出力する認識結果出力手段と、
1回目の認識に戻すリセット手段を備えたことを特徴と
する音声認識装置。
A speech recognition device for recognizing speech of a plurality of words, comprising a sequential memory means for storing a degree of similarity according to a degree of matching between an input speech and a recognition word memory in which a plurality of recognized words are registered; , a recognition result output means that outputs the recognized word with the highest degree of similarity in the first recognition, and a recognition result output means which outputs the recognized word with the highest degree of similarity in the n-th recognition, other than the recognition word that has already been output as the recognition result. output means;
A speech recognition device comprising a reset means for returning to the first recognition.
JP2258058A 1990-09-26 1990-09-26 Voice recognizing device Pending JPH04134397A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2258058A JPH04134397A (en) 1990-09-26 1990-09-26 Voice recognizing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2258058A JPH04134397A (en) 1990-09-26 1990-09-26 Voice recognizing device

Publications (1)

Publication Number Publication Date
JPH04134397A true JPH04134397A (en) 1992-05-08

Family

ID=17314952

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2258058A Pending JPH04134397A (en) 1990-09-26 1990-09-26 Voice recognizing device

Country Status (1)

Country Link
JP (1) JPH04134397A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143487A (en) * 1997-11-11 1999-05-28 Osaka Gas Co Ltd Method and device for converting voice to character
US6564185B1 (en) 1998-09-08 2003-05-13 Seiko Epson Corporation Continuous speech recognition method and program medium with alternative choice selection to confirm individual words

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH11143487A (en) * 1997-11-11 1999-05-28 Osaka Gas Co Ltd Method and device for converting voice to character
US6564185B1 (en) 1998-09-08 2003-05-13 Seiko Epson Corporation Continuous speech recognition method and program medium with alternative choice selection to confirm individual words

Similar Documents

Publication Publication Date Title
JP3968133B2 (en) Speech recognition dialogue processing method and speech recognition dialogue apparatus
US6446039B1 (en) Speech recognition method, speech recognition device, and recording medium on which is recorded a speech recognition processing program
US6631348B1 (en) Dynamic speech recognition pattern switching for enhanced speech recognition accuracy
US7146317B2 (en) Speech recognition device with reference transformation means
JPH04134397A (en) Voice recognizing device
US20030040915A1 (en) Method for the voice-controlled initiation of actions by means of a limited circle of users, whereby said actions can be carried out in appliance
JP2820093B2 (en) Monosyllable recognition device
US6564185B1 (en) Continuous speech recognition method and program medium with alternative choice selection to confirm individual words
JPH04246695A (en) Voice recognition device
JP2000122678A (en) Controller for speech recogniging equipment
WO1994002936A1 (en) Voice recognition apparatus and method
JPH10510081A (en) Apparatus and voice control device for equipment
WO2000065575A1 (en) Voice recognition device for toys
JP2005148764A (en) Method and device for speech recognition interaction
JPH0488399A (en) Voice recognizer
KR20010026402A (en) Device and method for recognizing voice sound using nervous network
JPS6239899A (en) Conversation voice understanding system
KR100204240B1 (en) Voice recognition apparatus and method using memory card
JPH04301695A (en) Dictionary control system for speech recognition device
JPS58224397A (en) Large vocaburary word voice recognition system
JPS608898A (en) Voice recognition equipment
JPS59195299A (en) Sepecific speaker's voice recognition equipment
JPH07210193A (en) Voice conversation device
JPH0580794A (en) Speech recognition device
JP3695168B2 (en) Voice recognition device, voice input gain setting method, and storage medium storing input gain setting processing program