JPH01123299A

JPH01123299A - Voice dialing apparatus

Info

Publication number: JPH01123299A
Application number: JP62281829A
Authority: JP
Inventors: Shoji Kuriki; 章次栗木
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1987-11-06
Filing date: 1987-11-06
Publication date: 1989-05-16

Abstract

PURPOSE: To make the recognition rate higher for second speaking in the case of erroneous recognition for first speaking by removing an erroneously recognized word from the object of a dictionary in next speaking when a user indicates this erroneously recognized word in the recognition result and taking all words of the dictionary as the recognition object at the time of on-hook. CONSTITUTION: A dictionary mask part 8 masks only a specific word of a dictionary 6 as the recognition object to remove it from the recognition object. A receiver is hooked off, and first speaking is inputted and is recognized, and the recognition result is sent to a result control part 7. This part 7 not only informs the user of the result by display, voice, or the like but also prepares for automatic originating by an automatic originating part 10, and originating is successful in the case of correct recognition. The user depresses an erroneous recognition switch 9 in the case of erroneous recognition. In this case, originating is stopped, and the next candidate of the recognition result is shown. Several candidates are shown in this manner, and the user speaks again if they are all erroneously recognized. Thus, the name of a destination is correctly recognized with a high probability in next speaking though being erroneously recognized once.

Description

【発明の詳細な説明】技術分野本発明は、音声ダイアリング装置に関する。[Detailed description of the invention] Technical field The present invention relates to a voice dialing device.

従来技術音声ダイアリング装置は、使用者が発声した音声から認
識した結果を基に相手先のｉｔ話番号を自動的に発信す
るものであるが、認識率が１００パーセントであること
はなく、誤認識を生じる。そのため、通常、認識結果を
使用者に表示や音声出力を用いて示し、誤認識であれば
スイッチ等を用いて知らせてもらい、自動発信を中止す
ることになる。使用者は一度誤認識になったならば、も
う−度同じ相手先の名前を発声し、電話をかけようとす
る。しかし、続けて発声すると、大体同じような発声に
なるため、また、同じ誤認識結果になる可能性が高く、
使用者にとって非常に使いに、くいものとなる欠点があ
った。Conventional voice dialing devices automatically call the other party's IT phone number based on the result of recognition from the voice uttered by the user, but the recognition rate is not 100% and there may be errors. give rise to recognition. Therefore, the recognition result is usually shown to the user using a display or audio output, and if there is a misrecognition, the user is notified using a switch or the like, and the automatic call is canceled. Once the user is misrecognized, he/she tries to make a call by saying the same name of the other party again. However, if you continue to utter the same utterances, the utterances will be more or less the same, and there is a high possibility that you will get the same erroneous recognition result.
It had the disadvantage of being extremely difficult to use for users.

１−一旗本発明は、上述のごとき実情に鑑みてなされたもので、
特に、音声ダイアリング装置において。1-1 Flag The present invention was made in view of the above-mentioned circumstances.
Especially in voice dialing equipment.

−度目の発声で誤認識をした場合、二度目にはより認識
率を高くすることを目的としてなされたものである。- This was done with the aim of increasing the recognition rate even higher the second time, if there is a misrecognition on the first utterance.

摂成本発明は、上記目的を達成するために、受話器と、特徴
抽出部と、音声区間検出部と、認識部と。SUMMARY OF THE INVENTION In order to achieve the above object, the present invention includes a telephone receiver, a feature extraction section, a voice section detection section, and a recognition section.

結果出力制御部と、誤認識スイッチと、辞書と、辞書マ
スク部と、使用者に結果を知らせる手段と、オンフック
・オフフック検出部と、自動発信部を持つ音声ダイアリ
ング装置において、認識結果が使用者により誤認識であ
ると示された場合、次の発声では前に誤認識であると示
された単語を辞書の対象から外し、オンフックになれば
辞書の全てを認識対象にすることを特徴としたものであ
る。The recognition results are used in a voice dialing device that has a result output control section, an erroneous recognition switch, a dictionary, a dictionary mask section, a means for notifying the user of the results, an on-hook/off-hook detection section, and an automatic transmission section. If the user indicates that the word has been misrecognized by the user, the word that was previously misrecognized will be removed from the dictionary in the next utterance, and once on-hook, the entire dictionary will be included in the recognition target. This is what I did.

以下、本発明の実施例に基いて説明する。Hereinafter, the present invention will be explained based on examples.

音声ダイアリング装置の使用法として、受話器を持ち上
げてから次に受話器を置くまで若しくはダイアルを発信
するまでは、同一の相手先に電話をかける為に使用者が
発声していると考えられる。When using a voice dialing device, the user is considered to be speaking to make a call to the same destination from the time he or she picks up the handset until the time he or she hangs up the handset or dials the number.

そのため、−度誤認識と知らされた単語はその後の発声
の認識の対象外にした方が認識率が良くなる。大体、続
けて発声している場合、その発声は良く似ているので、
同じ誤認識結果になる可能性が高い０．そのためにも、
−度誤認識と知らされた単語を辞書から外せば、次の認
識率が非常に上がることになる。使用者が別の相手先に
掛ける場合には、受話器を置けばよいし、もちろん−回
自動発信すれば、辞書の対象外の単語は無くなり、全て
の単語が認識対象になる。Therefore, the recognition rate will be better if the words that have been notified of -degree misrecognition are excluded from subsequent recognition of utterances. Generally speaking, when they are uttered consecutively, the utterances are very similar, so
0, which is likely to result in the same false recognition result. For that reason,
- If words that have been misrecognized are removed from the dictionary, the next recognition rate will greatly increase. When the user wants to call another party, he or she can simply hang up the receiver, and of course, if the user automatically makes the - number of calls, there will be no words that are not included in the dictionary, and all words will be recognized.

第１図は、本発明の一実施例を説明するためのブロック
線図で１図中、１は受話器、２はフック。FIG. 1 is a block diagram for explaining one embodiment of the present invention, and in the figure, 1 is a telephone receiver, and 2 is a hook.

３は特徴抽出部、４は音声区間検出部、５は認識部、６
は辞書、７は結果制御部、８は辞書マスク部、９は誤認
識スイッチ、１０は自動発信部、１１は表示音声出力部
で、受話器１のマイクからの音声信号は特徴抽出部３と
、音声区間検出部４に入力され、それぞれのデータが認
識部５に送られる。また、受話器１からは、フック２の
状態が辞書マスク部８に送られる。辞書マスク部８では
認識対象となる辞書６を特定の単語のみマスクして認識
対象外にすることができる。受話器が持ち上げられたら
つまりオフフックになったならば、辞書マスク部８では
、先ず全ての単語を認識対象とする。そこで、−回目の
発声が入力されて認識が行われ、認識結果が結果制御部
７に送られる。3 is a feature extraction unit, 4 is a speech section detection unit, 5 is a recognition unit, 6
1 is a dictionary, 7 is a result control unit, 8 is a dictionary mask unit, 9 is an erroneous recognition switch, 10 is an automatic transmission unit, 11 is a display audio output unit, and the audio signal from the microphone of the receiver 1 is sent to a feature extraction unit 3. The data is input to the voice section detection section 4 and the respective data is sent to the recognition section 5. Further, the state of the hook 2 is sent from the receiver 1 to the dictionary mask section 8. The dictionary masking unit 8 can mask only specific words in the dictionary 6 to be recognized so as not to be recognized. When the handset is picked up, that is, when the handset goes off-hook, the dictionary mask section 8 first targets all words for recognition. Then, the -th utterance is input, recognition is performed, and the recognition result is sent to the result control section 7.

結果制御部７では使用者に結果を表示や音声出力などで
知らせると共に、自動発信部１０により自動発信の準備
を行う。もし、認識が正答ならば、発信は成功する。The result control section 7 notifies the user of the results through display or audio output, and the automatic transmission section 10 prepares for automatic transmission. If the recognition is correct, the transmission is successful.

誤認識の場合は使用者が誤認識スイッチ９を押す。この
場合は中止され、認識結果の次の候補が示される。この
ようにして何個かの候補が示され、それが全て誤認識で
あった場合は、もう−度発声を行うことになる。この場
合、次の認識において、以前に誤認識とされた単語は辞
書マスク部８において認識対象外の単語にされる。それ
は、使用者がどこかに電話しようとし、それがまだなさ
れていない場合、当然、同一の相手先に掛けるものであ
ることによる。そのため、せっかく正答でないと判って
いる単語をまた認識対象にする必要はない。このように
して認識対象を減らして認識を行い、使用者に結果を示
す。ここで正答がでれば、自動発信を行い、同時に辞書
マスク部をクリアして、次には全単語の認識ができる様
にする。しかし、この回でも誤認識したならば、もう−
度認識するときには、前回同様にこの回の誤認識単語も
対象外として、次の認識を行う。もし、途中で使用者の
気が変り別のところに掛けたくなったならば一度受話器
を置き、再び持ち上がればよい。そうすれば、オンフッ
クからオフフックとなり、このフック信号により辞書マ
スク部がクリアされる。In case of erroneous recognition, the user presses the erroneous recognition switch 9. In this case, the process is canceled and the next recognition result candidate is displayed. In this way, several candidates are shown, and if all of them are misrecognized, the utterance will be performed again. In this case, in the next recognition, the word that was previously misrecognized is made into a word that is not a recognition target in the dictionary masking unit 8. This is because if the user wants to call somewhere and has not done so already, he will naturally call the same number. Therefore, there is no need to use a word that is known to be incorrect as a recognition target again. In this way, recognition is performed by reducing the number of recognition targets, and the results are presented to the user. If a correct answer is given here, an automatic call is made and the dictionary mask section is cleared at the same time, allowing all words to be recognized next. However, if I misunderstood this time too, then...
When performing recognition, the next recognition is performed, excluding the erroneously recognized word this time, as in the previous time. If the user changes his or her mind and wants to place the call somewhere else, he or she can hang up the phone and pick it up again. Then, the hook goes from on-hook to off-hook, and the dictionary mask section is cleared by this hook signal.

効　　　果以上の説明から明らかなように、本発明によると相手先
の名前を誤認識しても、次の発声ではより高い確率で正
しく認識する音声ダイアリング装置が可能になる。Effects As is clear from the above explanation, according to the present invention, even if the name of the other party is misrecognized, it is possible to provide a voice dialing device that can correctly recognize the name of the other party with a higher probability in the next utterance.

[Brief explanation of the drawing]

第１図は、本発明の一実施例を説明するためのブロック
線図である。１・・・受話器、２・・・フック、３・・・特徴抽出部
、４・・・音声区間検出部、５・・・認識部、６・・・
辞書、７・・・結果制御部、８・・・辞書マスク部、９
・・・誤認識スイッチ、１０・・・自動発信部、１１・
・・表示音声出力部。FIG. 1 is a block diagram for explaining one embodiment of the present invention. DESCRIPTION OF SYMBOLS 1... Receiver, 2... Hook, 3... Feature extraction section, 4... Voice section detection section, 5... Recognition section, 6...
Dictionary, 7... Result control section, 8... Dictionary mask section, 9
... Misrecognition switch, 10... Automatic transmitter, 11.
...Display audio output section.

Claims

[Claims]

A telephone receiver, a feature extraction section, a voice section detection section, a recognition section, a result output control section, an erroneous recognition switch, a dictionary, a dictionary mask section, means for notifying the user of the results, and on-hook/off-hook detection. In a voice dialing device that has an automatic transmission section and an automatic transmission section, if the recognition result is indicated by the user as a misrecognition, the next time the user utters the word, the word that was previously indicated as an incorrect recognition is removed from the dictionary. A voice dialing device characterized in that when disconnected and on-hook, the entire dictionary is recognized.