JPH01123299A - Voice dialing apparatus - Google Patents

Voice dialing apparatus

Info

Publication number
JPH01123299A
Authority
JP
Japan
Prior art keywords
recognition
dictionary
section
user
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP62281829A
Other languages
Japanese (ja)
Inventor
Shoji Kuriki
章次 栗木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ricoh Co Ltd
Original Assignee
Ricoh Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ricoh Co Ltd
Priority to JP62281829A
Publication of JPH01123299A

Abstract

PURPOSE: To raise the recognition rate for a second utterance after the first utterance has been misrecognized. When the user indicates that a word in the recognition result was misrecognized, that word is removed from the dictionary's recognition targets for the next utterance; when the receiver goes on-hook, all words of the dictionary become recognition targets again.

CONSTITUTION: A dictionary mask part 8 masks specific words of a dictionary 6 to exclude them from recognition. The receiver is taken off-hook, the first utterance is input and recognized, and the recognition result is sent to a result control part 7. The result control part 7 informs the user of the result by display, voice, or the like, and also prepares for automatic dialing by an automatic originating part 10; if the recognition is correct, the call is placed. If the recognition is wrong, the user presses an erroneous recognition switch 9. Dialing is then stopped and the next candidate of the recognition result is shown. Several candidates are shown in this manner, and if all of them are wrong, the user speaks again. Thus, even if the name of the destination is misrecognized once, it is recognized correctly with high probability on the next utterance.

Description

[Detailed Description of the Invention]

Technical Field

The present invention relates to a voice dialing device.

Prior Art

A voice dialing device automatically dials the other party's telephone number based on the result of recognizing the user's speech. The recognition rate is never 100 percent, however, so misrecognitions occur. The recognition result is therefore normally presented to the user by display or voice output; if it is wrong, the user signals this with a switch or the like, and the automatic call is cancelled. After a misrecognition, the user says the same party's name again to retry the call. Consecutive utterances tend to sound much the same, however, so the same misrecognition is likely to recur, which makes the device very hard to use.

Object

The present invention was made in view of the circumstances described above. In particular, its object is to raise the recognition rate of a voice dialing device on the second utterance when the first utterance has been misrecognized.

Constitution

To achieve the above object, the present invention is a voice dialing device having a receiver, a feature extraction section, a voice section detection section, a recognition section, a result output control section, an erroneous recognition switch, a dictionary, a dictionary mask section, means for notifying the user of the result, an on-hook/off-hook detection section, and an automatic dialing section. It is characterized in that, when the user indicates that a recognition result is a misrecognition, the word so indicated is excluded from the dictionary's recognition targets for the next utterance, and that, when the device goes on-hook, the entire dictionary becomes the recognition target again.

The invention is described below on the basis of an embodiment.

In normal use of a voice dialing device, the user can be assumed to be speaking in order to call one and the same party from the time the receiver is lifted until it is next replaced or a number is dialed.

For that reason, once a word has been flagged as a misrecognition, excluding it from recognition of the subsequent utterances improves the recognition rate. Consecutive utterances generally sound very much alike, so the same misrecognition is likely to recur. Removing the flagged word from the dictionary therefore raises the recognition rate of the next attempt considerably. When the user wants to call a different party, the user need only replace the receiver; and, of course, once an automatic call has been placed, no words remain excluded and all words become recognition targets again.
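The effect of excluding a flagged word can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the patent does not say how the recognition section scores dictionary words, so the `recognize` function, the party names, and the similarity scores below are hypothetical.

```python
def recognize(scores, masked):
    """Rank dictionary words by score, skipping masked (excluded) words.

    scores: hypothetical similarity score per dictionary word.
    masked: words the user has already flagged as misrecognitions.
    """
    candidates = [word for word in scores if word not in masked]
    return sorted(candidates, key=lambda word: scores[word], reverse=True)


# Two consecutive utterances of the same name yield nearly identical scores,
# so without masking the same wrong word would win both times.
first_utterance = {"Sato": 0.71, "Kato": 0.74, "Saito": 0.60}
second_utterance = {"Sato": 0.72, "Kato": 0.73, "Saito": 0.61}

masked = set()
first_result = recognize(first_utterance, masked)[0]
masked.add(first_result)  # the user flags the first result as wrong
second_result = recognize(second_utterance, masked)[0]
```

In this toy data the wrong word wins both utterances by a small margin, so only the exclusion lets the intended name come out on top at the second attempt.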

FIG. 1 is a block diagram for explaining one embodiment of the present invention. In the figure, 1 is a receiver, 2 is a hook, 3 is a feature extraction section, 4 is a voice section detection section, 5 is a recognition section, 6 is a dictionary, 7 is a result control section, 8 is a dictionary mask section, 9 is an erroneous recognition switch, 10 is an automatic dialing section, and 11 is a display and voice output section. The voice signal from the microphone of the receiver 1 is input to the feature extraction section 3 and the voice section detection section 4, and the data from each is sent to the recognition section 5. The state of the hook 2 is sent from the receiver 1 to the dictionary mask section 8. The dictionary mask section 8 can mask specific words in the dictionary 6 so that only those words are excluded from recognition. When the receiver is lifted, that is, when it goes off-hook, the dictionary mask section 8 first makes all words recognition targets. The first utterance is then input and recognized, and the recognition result is sent to the result control section 7.

The result control section 7 notifies the user of the result by display, voice output, or the like, and at the same time has the automatic dialing section 10 prepare for automatic dialing. If the recognition is correct, the call is placed.

If the recognition is wrong, the user presses the erroneous recognition switch 9. The call is then cancelled and the next candidate from the recognition result is shown. Several candidates are shown in this way, and if all of them are wrong, the user speaks once more. In that next recognition, the words previously judged to be misrecognitions are excluded from recognition by the dictionary mask section 8. The reasoning is that a user who set out to place a call and has not yet succeeded will naturally still be trying to reach the same party, so there is no need to make words already known to be wrong recognition targets again. Recognition is thus performed with a reduced set of targets, and the result is shown to the user. If the correct answer is obtained, the call is dialed automatically and the dictionary mask section is cleared at the same time, so that all words can be recognized next time. If this attempt is also misrecognized, the word misrecognized this time is likewise excluded, as before, when the next recognition is performed. If the user has a change of mind partway through and wants to call someone else, the user need only replace the receiver and lift it again. The hook then goes from on-hook to off-hook, and this hook signal clears the dictionary mask section.
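The control flow just described can be sketched as a small state holder. This is only an illustrative sketch under stated assumptions: the class `VoiceDialSession`, its method names, the `max_candidates` limit, and the score dictionaries are hypothetical, and it assumes that every candidate rejected with the switch is masked, which is one reading of "words previously judged to be misrecognitions".

```python
class VoiceDialSession:
    """Hypothetical model of result control (7), dictionary mask (8), switch (9)."""

    def __init__(self, dictionary, max_candidates=2):
        self.dictionary = dict(dictionary)    # name -> telephone number
        self.max_candidates = max_candidates  # assumed limit on shown candidates
        self.masked = set()                   # words excluded from recognition
        self.candidates = []                  # ranked results of the last utterance
        self.off_hook = False

    def hook(self, off_hook):
        """Hook signal: any on-hook/off-hook transition clears the mask."""
        self.off_hook = off_hook
        self.masked.clear()
        self.candidates = []

    def utter(self, scores):
        """Recognize one utterance against the unmasked words only."""
        if not self.off_hook:
            raise RuntimeError("receiver is on-hook")
        active = [w for w in self.dictionary if w not in self.masked]
        ranked = sorted(active, key=lambda w: scores.get(w, 0.0), reverse=True)
        self.candidates = ranked[: self.max_candidates]
        return self.current()

    def current(self):
        """Candidate currently shown to the user, or None if none remain."""
        return self.candidates[0] if self.candidates else None

    def reject(self):
        """Erroneous recognition switch: mask the shown word, show the next one.

        Returns None when no candidates remain, meaning the user must speak again.
        """
        if self.candidates:
            self.masked.add(self.candidates.pop(0))
        return self.current()

    def confirm(self):
        """Correct result: return the number to dial and clear the mask."""
        number = self.dictionary[self.current()]
        self.masked.clear()
        self.candidates = []
        return number
```

A typical sequence would be `hook(True)`, `utter(...)`, zero or more `reject()` calls, a further `utter(...)` once `reject()` returns `None`, and finally `confirm()`. Clearing the mask in both `hook` and `confirm` mirrors the two reset conditions in the text: a hook signal and a completed automatic call.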

Effects

As is clear from the above description, the present invention makes possible a voice dialing device that, even after misrecognizing the other party's name, recognizes it correctly with higher probability on the next utterance.

[Brief Description of the Drawing]

FIG. 1 is a block diagram for explaining one embodiment of the present invention.

1: receiver, 2: hook, 3: feature extraction section, 4: voice section detection section, 5: recognition section, 6: dictionary, 7: result control section, 8: dictionary mask section, 9: erroneous recognition switch, 10: automatic dialing section, 11: display and voice output section.

Claims (1)

[Claims]

A voice dialing device having a receiver, a feature extraction section, a voice section detection section, a recognition section, a result output control section, an erroneous recognition switch, a dictionary, a dictionary mask section, means for notifying the user of the result, an on-hook/off-hook detection section, and an automatic dialing section, characterized in that, when the user indicates that a recognition result is a misrecognition, the word so indicated is excluded from the dictionary's recognition targets for the next utterance, and, when the device goes on-hook, the entire dictionary becomes the recognition target.
JP62281829A 1987-11-06 1987-11-06 Voice dialing apparatus Pending JPH01123299A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP62281829A JPH01123299A (en) 1987-11-06 1987-11-06 Voice dialing apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP62281829A JPH01123299A (en) 1987-11-06 1987-11-06 Voice dialing apparatus

Publications (1)

Publication Number Publication Date
JPH01123299A true JPH01123299A (en) 1989-05-16

Family

ID=17644584

Family Applications (1)

Application Number Title Priority Date Filing Date
JP62281829A Pending JPH01123299A (en) 1987-11-06 1987-11-06 Voice dialing apparatus

Country Status (1)

Country Link
JP (1) JPH01123299A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0856251A (en) * 1994-08-11 1996-02-27 Nec Corp Voice dialer
JP2019086599A (en) * 2017-11-03 2019-06-06 アルパイン株式会社 Voice recognition device
