JPH03159356A

JPH03159356A - Response method for audio response recognizing device

Info

Publication number: JPH03159356A
Application number: JP1297833A
Authority: JP
Inventors: Junji Kojima; 小島　順治
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1989-11-16
Filing date: 1989-11-16
Publication date: 1991-07-09

Abstract

PURPOSE:To efficiently progress service by automatically selecting the necessity of plural response guidance or a sign tone sending function corresponding to a user. CONSTITUTION:The audio response recognizing device is comprised of a network control part 1, a speech recognition part 2, an audio response part 3, and a main control part 4, and when an outgoing talephone number is inputted from the user, the degree of operating skill for a system of the user prepared as a table in the device is discriminated based on the telephone number. Then, the response guidance is selected based on discriminated degree of operating skill, and also, it is selected whether or not a sign tone to expedite the user to be easy to perform voice input based on procedure shown by the response guidance is sent. Thereby, it is possible to efficiently progress audio response service corresponding to a wide range of users.

Description

[Detailed description of the invention]

〔産業上の利用分野］本発明は、銀行の残高照会や振込み通知サービス等に使
用されている、利用者の音声を識別して自動的に応答し
ながらサービスを進める音声応答認識装置に係り，特に
、発信電話番号通知機能を持つネットワーク、例えば、
Ｉ　Ｓ　Ｄ　Ｎ　（Ｉｎｔｅｇｒａｔｅｄ　　Ｓｅｒｖ
ｉｃｅｓ　　Ｄｉｇｉｔａｌ　　Ｎｅｔｗｏｒｋ，以下
総合ディジタル通信網と記載する）網に接続可能なイン
タフェースを有する音声応答認識装置の応答方法に関す
るものである。〔従来の技術］不特定話者の音声認識技術を用いた音声応答認識装置は
、銀行の残高照会や振込み通知、または、証券会社の株
価照会等に使われ、広く商品化されている．従来のシステムでは、音声応答認識装置とのやり取りは
、サービス毎に固定された応答ガイダンスを用いている
．また、音声入力を正確なタイミングで行なうため、音
声入力契機に、発声を促す合図音を固定的に出力してい
る。第６図は、従来システムの音声入力契機におけるサービ
ス仕様を示す説明図である。ここでは、サービスコード入力の場合を示す。従来のシステムでは、音声応答認識装置は、ガイダンス
音声／合図音６ｌとして、一律に，この例のように、「
暗証番号をどうぞ」というメッセージ６３が用いられ、
合図音６４、６６が出力される．利用者は、入力音声６２として、それぞれの合図音の後
に所定の入力ｒＯＪ　６５．ＮＪ　６７を行なう。利用
者の入力を確認するメッセージ６８が「残高照会ですね
。Ｊと出され、処理が進められる。第７図は、従来システムの合図音出力と音声認識装置の
動作の関係を示すタイミングチャートである。まず、音声応答装置からガイダンス７ｌ　（「サービス
コールをどうぞ」）が出力された後、合図音７２（「ピ
ッ」）が出力され，その直後から、音声認識装置は音声
認識動作７４を開始する。音声認識動作７４は、ある一定の時間、例えば、坩語の
場合は２秒程度継続され、その時間内で発声された音声
入力７３を認識する。このようにして、音声認識装置は、利用者の処理要求を
受け付ける。一方、ＩＳＤＮ網等においては、ネットワークの機能と
して発信者の電話番号を相手に通知出来る。この発信者
の電話番号は、発信者そのものを表すものではないが、
一般家庭の場合には、概ね一時近似的に、固定した発信
者と考えることが出来る。ＩＳＤＮは、伝送路と交換器をデイジタル回線で結び、
音声、データ、画像等、各通信サービスを統一したイン
タフェースで提供する公衆デイジタル・ネットワークで
ある。１つの加入者回線で、電話やファクシミリ、ビデ
オテックス、データ通信などを統合して一元的に使える
ほか，テレビ電話等画像通信も可能にするサービスを提
供する。このようなＩＳＤＮの応用に関しては、日系ＢＰ社発行
「日系コミュニケーション別ｆｌｌ｝　　ＴＳＤｔＪ活
用の手引き』に詳しい、特に上文は、ＰＰ．ｌ８、およ
び、ＰＰ．１８９より抜粋したものである．[Industrial Application Field] The present invention relates to a voice response recognition device used for bank balance inquiry and transfer notification services, etc., which identifies the user's voice and automatically responds while proceeding with the service. In particular, networks with calling phone number notification functionality, e.g.
ISDN (Integrated Serv
The present invention relates to a response method of a voice response recognition device having an interface connectable to the ICES Digital Network (hereinafter referred to as integrated digital communication network). [Prior Art] Voice response recognition devices that use voice recognition technology for non-specific speakers are widely commercialized and are used for bank balance inquiries and transfer notifications, stock price inquiries at securities companies, etc. In conventional systems, interaction with voice response recognition devices uses response guidance that is fixed for each service. In addition, in order to perform voice input at accurate timing, a signal sound that prompts vocalization is fixedly output when the voice input is triggered. FIG. 6 is an explanatory diagram showing service specifications in response to voice input in a conventional system. Here, the case of inputting a service code is shown. In the conventional system, the voice response recognition device uniformly outputs the guidance voice/signal sound 6l as shown in this example.
Message 63 "Please enter your PIN number" is used,
Signal sounds 64 and 66 are output. The user inputs a predetermined input rOJ 65. after each signal sound as the input voice 62. Do NJ 67. A message 68 confirming the user's input is displayed as ``Balance Inquiry. First, the voice response device outputs the guidance 7l (“Please make a service call”), then the signal sound 72 (“beep”) is output, and immediately after that, the voice recognition device starts the voice recognition operation 74. do. The speech recognition operation 74 continues for a certain period of time, for example, about two seconds in the case of mundane, and recognizes the speech input 73 uttered within that period. In this way, the speech recognition device receives a processing request from a user. On the other hand, in ISDN networks and the like, the caller's telephone number can be notified to the other party as a network function. This caller's phone number does not represent the caller itself, but
In the case of a general household, the caller can be thought of as a fixed caller, temporarily and approximately. ISDN connects transmission lines and exchanges with digital lines,
It is a public digital network that provides various communication services such as voice, data, and images through a unified interface. In addition to integrating telephone, facsimile, videotex, and data communications into a single subscriber line, the service provides services that enable image communications such as videophone calls. Regarding the application of ISDN, please refer to ``Guidelines for Utilization of TSDtJ for Nikkei Communication'' published by Nikkei BP, especially the above text is excerpted from PP.18 and PP.189.

[Problem to be solved by the invention]

従来のシステムにおける音声応答認識装置は、サービス
毎に固定された応答ガイダンスを用いたり、発声を促す
合図音を固定的に出力している。しかしながら、利用者との応答ガイダンスや合図音送出
機能については、利用者のシステム習熟度や、その利用
者に期待される音声認識性能等により、フレキシブルに
対応することが望ましい。すなわち、従来のシステムでは、認識すべき音声の入力
契機を与えるため、合図音を用いているが、合図音出力
前のメッセージを、単にｒ暗証番号をどうぞＪのような
ものにすると、慣れない人は、合図音出力前に発声を開
始したり、合図音に重ねて発声したいする誤りが生じて
いた。また、合図音に続いて発声して欲しい旨の音声ガイダン
スを出力した後、合図音を出力すると、慣れている人に
は、冗長になるとの不都合が生じていた。さらに慣れた人には、合図音の退出そのものも不要とな
り、効率的な対応が望まれている。例えば、第６図および第７図で示したように、従来シス
テムにおいては、初心者には、うまくタイミングがつか
めないことがある。このような初心者には、「暗証番号
をビッという合図音に続いておっしゃって下さい」とい
う方が間違い無く入力できる。しかしながら、このメッセージは慣れている人には冗長
であり、もっと慣れている人には、合図音の出力も省略
した方が良いと考えられる，特に、照会系のサービスの
場合、通常、通話糾合は、発信者の負担となるので、冗
長なガイダンスや合図音出力は、回線の保留時間、を不
必要に増加させ、通話料の負担を強いるので望ましくな
いといえる，しかしながら，従来のシステムでは、応答ガイダンスは
、一通りしかなく、利用者の状況に応じて変更すること
は、不可能であった。また、複数のガイダンスを用意することも可能であるが
、どの利用者に、どのガイダンスを出力するかを、どの
ように決めるかが問題となっていた。つまり、従来のアナログ網では、どこから発信されたか
、受信側では分からないので、本人の識別情報を人力さ
せる機会をわざわざ設けないと利用者の識別が困難であ
る。そのような契機を設けること自体が保留時間の延長
につながるという矛盾をはらんでいた。このように、従来の装置では、慣れない利用者にとって
は、応答手順を間違え易く，また、慣れた利用者にとっ
ては、合図音の送出やガイダンス内容の一部は冗長なも
のとなっている。また、魚会系サービスの場合、通話料
金は、通常、発信者である利用者に課金される。そのた
め、焦駄なガイダンスを省き、回線の保留特開を短くす
ることが望まれている。本発明の目的は、これら従来技術の課題を解決し、操作
に慣れていない人や認識能力の劣る人には丁寧な手順を
、また、慣れている人や認識能ノノの良い人には簡素な
手順を自動的に選択して、幅広い利用者に応じて、効率
的にサービスを進めることが可能な音声応答認識装置の
応答方法を提供することである。［課題を解決するための手段〕上記目的を達戒するため、本発明の音声応答認識装置の
応答方法は、音声応答認識装置を、発信電話番号通知機
能を持つネットワークに接続し、利用者の発信電話番号
が入力されると、発信電話番号を認識し、この発信電話
番号に付加された操作習熟度を判別し、予め用意された
複数の応答ガイダンスの中から発信電話番号に対応する
応答ガイダンスを選択し、かつ、利用者の音声入力を促
す合図音の送出・否送出を発信電話番号に対応して選択
し、この選択結果に基づき応答することを特徴とする。〔作用Ｊ本発明において、音声応答認識装置には、予め、ｆｌｌ
用者のシステム操作の習熟度に合わせた複数の応答ガイ
ダンスが用意され、そして、ＩＳＤＮ網等に接続されて
いる。このＩＳＤＮ網等においては、ネットワークの機
能として発信者の電話番号を相手に通知出来る。利用者から発信電話番号が人力されると、音声応答認識
装置は、この７Ｉｔ話番号に基づき、利用者の、システ
ムの操作習熟度を判別する。この電話番号に基づく操作
習熟度は、予め、装置内にテーブルとして用意される。次に、音声応答認識装置は、判別された操作習熟度に対
応して、応答ガイダンスを選択すると共に、１１１用者
が応答ガイダンスの示す手順に基づき音声入力を行ない
易い様に促すための合図音を送出するか、否かを選択す
る。このような応答方法により、音声応答認識装置は、複数
の応答ガイダンスや合図訝送出機能の必要・不必要を、
利用者に応じて自動的に選択するので、利用者にとって
フレキシブルなマンマシンインタフェースとなる。〔実施例］以下本発明の実施例を、図面により許細に説明する。第１図は、本発明を施した音声応答認識装置の一実施例
を示すブロック図である。網制御部ｌ、音声認識部２、音声応答部３、および、主
制御部４から構成されている。網制御部ｌは、ＩＳＤＮ網に接続されている。このＩｓＤＮ網等においては、不ツトワークの機能とし
て発信者の電話番号を相手に通知出来る。網制御部ｌでは、発信機能の制御のほか、認識入力を適
切なタイミングで行なわせるための台図ａ出力部５や、
ネットワークから通知される発信者の電話番号を受信す
る発信者番号受信部６を有し、必要な情報の授受を主制
御部４と行なっている。音声認識部２は、入力された音声を分析し，予め蓄えら
れている標烟的な音声のパターンと比較し、入力された
言葉が何であるかを認識し、主制御部４に報告する。音声応答部３は、主制御部４の支持のもとに、所定のａ
声ガイダンスを出力する。主制御部４は、各部を制御するほか、サービスの進渉管
理、蓄積データの検索等の業務処理を行なっている．網制御部ｌに接続されているＩＳＤＮ網においては、ネ
ットワークの機能として発信者の電話番号を相手に通知
出来る。この発信者の電話番号は、発信者そのものを表
すものではないが、一般家庭の場合には、概ね一次近似
的に、固定した発信者と考えることが出来る。そこで、
この発信者の電話番号を受信した場合は，その発信者の
システムに対する習熟度を蓄えている情報により、シス
テムが判断して、適切な応答を選択することが可能とな
る．このような構成により、本音声応答認識装置は、慣れて
いない人や認識性能の劣る人には丁寧に、また、慣れて
いる人や認識能力の良い人には、効率的にサービスを進
めるため、複数の応答ガイダンスや合図音送出機能の必
要不必要を、利用者に応じて自動的に選択する。第２図は，第１図における音声応答認識装置の動作を示
すフローチャートである。本実施例では、習熟度テーブルを用いた場合の処理フロ
ーの例を示している。ここでは，発信考電話番号が通知された場合は、電話番
号と、その利用者のシステムに対する習熟度の対応をと
る。すなわち、利用者からの電話が入力されると、まず、発
信者のｍ話番号が登録されているか否かを確認し（ステ
ップ２０　１）　、登録されていなければ、その発信者
を習熟度「普通ノとみなす。登録されていれば、その電
話番号をキーとして、習熟度テーブルを検索し、発信者
の習熟度を参照する（ステップ２０２），ステップ２０２で、習熟度が「未熟」または「普通」と
判別された発信者に対しては、それぞれの習熟度に対応
した合図音の送出を行なう処理を選択する（ステップ２
０３）。ステップ２０２で、習熟度が「熟練」であると
判別された発信者に対しては、ステップ２０３の合図音
の送出を行なわず、直ちに、普通のガイダンスを提供す
る（ステップ２０５）。ステップ２０３の合図音の送出を選択した後、習熟度が
「未熟』と判別された発信者に対しては、丁寧なガイダ
ンス（ステップ２０４）を提供し、また、習熟度が「普
通」と判別された発信者に対しては、普通のガイダンス
（ステップ２０５）を提供する．このようにして、発信者の習熟度を自動的に判別し、発
信者の習熟度に最適なガイダンスを、最適な方法で提供
することが出来る。第３図は、第２図における習熟度参照のステップで用い
られる習熟度テーブルの例を示す説明図である。発信者電話番号３ｌに対応して、それぞれの電話利用者
の習熟度３２が示されている。ここでは、習熟度を３段階に分けている。そして、これ
らの習熟度は、システム管理者が，発信者（該当番号の
電話の使用者）のシステムの利用回数や、現在までの認
識傾向等の履歴情報をもとに決定する。習熟度ｒＱＪは、システム利用方法に関しては未熟練の
利用者を示し、習熟度「１』は、替通の熟練度の利用者
、そして、習熟度「２」の利用者は、熟練した利用者で
あることを示す。第４図は、第３図において習熟度がｒＯＪとされた利用
者向けのサービス仕槌を示す説明図である．音声応答認識装置から送出されるガイダンス音声／合図
音４ｌと、電話を介しての利用者の入力音声４２が、決
められた順序で応答しあうことにより，利用者の希望す
る処理が可能となる。まず、音声応答認識装置から「こちらは＊＊銀行です。よろしかったら、合図音に続いてサービスコードを一桁
づつおっしゃって下さい。」とのメッセージ４３が出さ
れる。このメッセージの後には，利用者からの音声は入
力されず、無人力ｒ０」４４である．メッセージ４３の後、音声応答認識装置から「ビッ』と
いう合図音４５が出され、この合図音を聞いた利用者は
、メッセージ４３の内容に従い、サービスコードｒｌＪ
４６を入力する。音声応答認識装置は、サービスコード
ＮＪ４６が入力されると、続いて、「残高照会ですね。よろしかったら、合図音に続いて、「ハイ」とおっしゃ
ってドさい。」とのメッセージ４７を出す。利用者は、「ビツＪという合図音４８を確認した後、「
はい」と返事４９を入力する。このように、システムの操作方法に慣れていない習熟度
の低い利用者には、丁寧なガイダンスが提供される。尚、普通の熟練度の利用者に対しては、第６図で示した
、従来のシステムからのガイダンスと同じものが提供さ
れる。第５図は、第３図において習熟度が「２」とされた利用
者向けのサービス仕様を示す説明図である。音声応答認識装置から送出されるのは、ガイダンス音声
５１のみで、合図音は省略されている。このガイダンス音声５１に、電話を介しての利用者の入
力音声４２が決められた順序で応答することにより、利
用者の希望する処理が可能となる。まず、音声応答認識装置から「こちらは＊＊銀行です。サービスコードなどうぞ。」との短いメッセージ５３が
出される。利用者が、すぐにサービスコードｒｏＩＪ５
４を入力すると、音声応答認２装置は、「残高照会です
ね」とサービスコードｒｏＩＪ５４に対応した処理内容
の確認メッセージ５５を出す。利用者は、「ハイ」と返
事５６を入力する。このようにして、システムの操作方法に慣れている習熟
度の高い利用者には、合図音を使用せず、冗長な手順を
除き、簡素なガイダンスが提供されるので、利用者は素
早い応答が出来る。尚、第３図から第５図までの説明は、一例を示したもの
であり、習熟度の段階やガイダンスの数は、どのような
サービスの、どのような場面かによって自由にシステム
を設計できる。また、ここでは、習熟度により、適切なサービス仕様を
選択する例を述べたが、習熟度に関係熊く、利用者が、
自分の好みに合わせてサービス仕様を予め登録しておき
、それを発信者番号により自動的に選択することも可能
である。以上、述べたように、本実施例によれば、利用者の発信
者電話番号が通知されたとき、その利用者の経験度等に
より、複数の応答ガイダンスから適切な応答ガイダンス
を選択したり、合図音の送出を選択することが可能とな
り、利用者に合ったマンマシンインタフェースを提供す
ることが出来る。［発明の効果］本発明によれば、操作に慣れていない人や認識能力の劣
る人には丁寧な手順を、また、慣れている人や認識能力
の良い人には簡素な手順を自動的に選択して、幅広い利
用者に応じて、効率的に音声応答サービスを進めること
が可能である。The voice response recognition device in the conventional system uses fixed response guidance for each service or outputs a fixed signal sound to encourage speaking. However, it is desirable to be able to respond flexibly to the user's response guidance and signal sound transmission function, depending on the user's system proficiency level, the voice recognition performance expected of the user, and so on. In other words, in the conventional system, a signal sound is used to give an opportunity to input the voice to be recognized, but if the message before the signal sound is simply ``R PIN number please J'', it is difficult to get used to it. People often make the mistake of starting to speak before the signal sound is output, or overlapping the sound with the signal sound. Further, if the signal sound is output after the audio guidance indicating that the user wants to speak after the signal sound is output, it becomes redundant for people who are used to the signal sound, which is inconvenient. Furthermore, for those who are accustomed to the system, it is no longer necessary to use the cue sound itself, and it is hoped that they will be able to respond more efficiently. For example, as shown in FIGS. 6 and 7, in the conventional system, beginners may not be able to properly grasp the timing. For such beginners, it is easier to input the password by saying, ``Please say your password after the beep.'' However, this message may be redundant for those who are used to it, and for those who are more accustomed to it, it may be better to omit the output of the signal sound.Especially in the case of inquiry-based services, calls are usually However, in the conventional system, redundant guidance and signal sound output are undesirable because they unnecessarily increase the holding time of the line and impose a burden on call charges.However, in conventional systems, There was only one type of response guidance, and it was impossible to change it according to the user's situation. Furthermore, although it is possible to prepare a plurality of guidances, the problem is how to decide which guidance should be output to which user. In other words, in conventional analog networks, since the receiving side does not know where the call is coming from, it is difficult to identify the user unless there is an opportunity for the user to enter the user's identification information manually. Providing such an opportunity was itself fraught with the contradiction that it would lead to an extension of the hold time. As described above, with conventional devices, inexperienced users tend to make mistakes in response procedures, and for experienced users, the sending of signal sounds and some of the guidance contents are redundant. Furthermore, in the case of Uokai-based services, the call fee is usually charged to the user who is the caller. Therefore, it is desired to eliminate wasteful guidance and shorten the length of time the line is held. The purpose of the present invention is to solve the problems of these conventional techniques, and to provide a detailed procedure for those who are not accustomed to operating it or those with poor recognition ability, while providing a simple procedure for those who are accustomed to operating it or those with good recognition ability. It is an object of the present invention to provide a response method of a voice response recognition device that can automatically select a suitable procedure and efficiently proceed with a service according to a wide range of users. [Means for Solving the Problems] In order to achieve the above object, the response method of the voice response recognition device of the present invention connects the voice response recognition device to a network having a caller telephone number notification function, and When a calling phone number is input, it recognizes the calling phone number, determines the operational proficiency level attached to this calling phone number, and selects a response guidance corresponding to the calling phone number from among multiple response guidance prepared in advance. The system is characterized in that it selects whether to send or not send a signal sound to prompt the user's voice input in accordance with the calling telephone number, and responds based on the selection result. [Function J] In the present invention, the voice response recognition device is provided with
A plurality of response guidances are prepared according to the user's level of proficiency in operating the system, and the system is connected to an ISDN network or the like. In ISDN networks and the like, the caller's telephone number can be notified to the other party as a network function. When the user manually enters the calling telephone number, the voice response recognition device determines the user's proficiency in operating the system based on this 7It call number. The operating proficiency level based on this telephone number is prepared in advance as a table within the device. Next, the voice response recognition device selects response guidance in accordance with the determined operating proficiency level, and also produces a signal sound to encourage the 111 user to easily perform voice input based on the procedure indicated by the response guidance. Select whether to send or not. Using this response method, the voice response recognition device can determine whether multiple response guidance or signal/question sending functions are necessary or not.
Since it is automatically selected according to the user, it becomes a flexible man-machine interface for the user. [Examples] Examples of the present invention will be described in detail below with reference to the drawings. FIG. 1 is a block diagram showing an embodiment of a voice response recognition device according to the present invention. It is composed of a network control section 1, a voice recognition section 2, a voice response section 3, and a main control section 4. The network control unit 1 is connected to the ISDN network. In this IsDN network, etc., the caller's telephone number can be notified to the other party as a network function. In addition to controlling the transmission function, the network control unit 1 also includes a platform a output unit 5 for performing recognition input at an appropriate timing,
It has a caller number receiving section 6 that receives the caller's telephone number notified from the network, and exchanges necessary information with the main control section 4. The speech recognition section 2 analyzes the input speech, compares it with standard speech patterns stored in advance, recognizes the input word, and reports the result to the main control section 4. The voice response section 3, under the support of the main control section 4,
Output voice guidance. The main control unit 4 not only controls each unit, but also performs business processes such as managing the progress of services and searching for stored data. In the ISDN network connected to the network control unit 1, the telephone number of the caller can be notified to the other party as a function of the network. Although this caller's telephone number does not represent the caller itself, in the case of an ordinary household, it can be considered as a fixed caller in a first-order approximation. Therefore,
When the system receives the caller's phone number, the system can determine the caller's familiarity with the system and select an appropriate response. With this configuration, this voice response recognition device can provide services politely to people who are not used to it or who have poor recognition performance, and efficiently to people who are used to it or have good recognition ability. , the necessity or necessity of multiple response guidance and signal sound transmission functions is automatically selected according to the user. FIG. 2 is a flowchart showing the operation of the voice response recognition device in FIG. This embodiment shows an example of a processing flow when a proficiency level table is used. Here, when a caller's telephone number is notified, correspondence is made between the telephone number and the user's level of familiarity with the system. That is, when a call from a user is input, it is first checked whether the caller's M-phone number is registered (step 201), and if it is not registered, the caller is assigned a proficiency level. If it is registered, the proficiency level table is searched using the phone number as a key, and the caller's proficiency level is referenced (step 202). For callers judged to be "fair", a process is selected to send a signal sound corresponding to their proficiency level (step 2).
03). For callers whose proficiency level is determined to be "expert" in step 202, normal guidance is immediately provided without transmitting the signal sound in step 203 (step 205). After selecting to send a signal sound in step 203, polite guidance (step 204) is provided to the caller whose proficiency level is determined to be "unskilled", and if the proficiency level is determined to be "average". For the caller who received the request, normal guidance (step 205) is provided. In this way, it is possible to automatically determine the caller's proficiency level and provide guidance that is optimal for the caller's proficiency level in an optimal manner. FIG. 3 is an explanatory diagram showing an example of a proficiency level table used in the proficiency level reference step in FIG. 2. The proficiency level 32 of each telephone user is shown corresponding to the caller's telephone number 3l. Here, the proficiency level is divided into three levels. These proficiency levels are determined by the system administrator based on historical information such as the number of times the caller (the user of the telephone with the corresponding number) has used the system and the recognition tendency to date. The proficiency level rQJ indicates a user who is unskilled in how to use the system, a proficiency level of "1" indicates a user who is proficient in exchange, and a user with a proficiency level of "2" indicates a skilled user. . FIG. 4 is an explanatory diagram showing a service provision for users whose proficiency level is rOJ in FIG. 3. The guidance voice/signal sound 4l sent from the voice response recognition device and the user's input voice 42 via the telephone respond to each other in a predetermined order, making it possible to perform the process desired by the user. . First, the voice response recognition device issues a message 43 saying, ``This is ** bank. If you agree, please say the service code one digit at a time following the signal tone.'' After this message, no voice is input from the user, and the message is unattended. After the message 43, the voice response recognition device emits a signal sound 45 called "beep", and the user who hears this signal sound receives the service code rlJ according to the content of the message 43.
Enter 46. When the service code NJ46 is input, the voice response recognition device then says, ``Balance inquiry.If you wish, please say ``Hi'' after the signal tone. ” message 47 is displayed. After the user confirms the signal sound 48 "Bit J",
Yes” and enter reply 49. In this way, detailed guidance is provided to less proficient users who are not familiar with how to operate the system. It should be noted that for users of average skill level, the same guidance as that provided by the conventional system shown in FIG. 6 is provided. FIG. 5 is an explanatory diagram showing service specifications for users whose proficiency level is "2" in FIG. 3. Only the guidance voice 51 is sent out from the voice response recognition device, and the signal sound is omitted. By responding to the guidance voice 51 with the user's input voice 42 via the telephone in a predetermined order, the process desired by the user becomes possible. First, the voice response recognition device issues a short message 53 saying, ``This is ** bank. Please provide the service code.'' The user immediately receives the service code roIJ5
4, the voice response recognition device 2 outputs a confirmation message 55 of the processing content corresponding to the service code roIJ54, saying, ``This is a balance inquiry.'' The user inputs the reply 56 as "yes". In this way, experienced users who are familiar with how to operate the system are provided with simple guidance, without cues and without redundant steps, allowing them to respond quickly. I can do it. The explanations shown in Figures 3 to 5 are just examples, and the system can be designed freely depending on the level of proficiency and the number of guidance depending on the type of service and the type of situation. . In addition, although we have described an example of selecting an appropriate service specification based on the user's proficiency level, the user
It is also possible to register service specifications in advance according to one's preferences and automatically select them based on the caller number. As described above, according to this embodiment, when a user's caller phone number is notified, an appropriate response guidance can be selected from a plurality of response guidances depending on the user's experience level, etc. It becomes possible to select the transmission of a signal sound, and it is possible to provide a man-machine interface that suits the user. [Effects of the Invention] According to the present invention, a careful procedure can be automatically performed for a person who is not accustomed to operating the device or a person with poor cognitive ability, and a simple procedure can be automatically performed for a person who is accustomed to operating the device or a person with a good cognitive ability. It is possible to efficiently provide voice response services to a wide range of users.

[Brief explanation of the drawing]

第１図は本発明を施した音声応答認識装置の一実施例置
を示すブロック図、第２図は第１図における音声応答認
識装置の動作を示すフローチャート、第３図は利用者の
習熟度参照用の習熟度テーブルの例を示す説明図、第４
図は習熟度が低い利用者向けのサービス仕様を示す説明
図、第５図は習熟度が高い利用者向けのサービス仕様を
示す説明図、第６図は従来システムの音声入力契機にお
けるサービス仕様を示す説明図、第７図は従来システム
の合図音出力と音声認識装置の動作の関係を示すタイミ
ングチャートである。１・網制御部，２：音声認識部，３，音声応答部，４：
主制御部，５：合図音送出部，６：発信者番号受信部，
３１：発信者電話番号，３２・習熟度，４ｌ：ガイダン
ス音声／合図音，４２．入力音声，４３：メッセージ，
４４：無入力ｒＱＪ　，４５：合図音，４６：サービス
コードｒｌＪ，４７：メッセージ，４８：合図音，４９
：返１￥，５１：ガイダンス音声，５２：入力音声，５
３：メッセージ，５４　サービスコードｒｏＩＪ，５５
メッセージ，５６，返事，６１：ガイダンスａ声／合図
音，６２・入力音声，６３．メッセージ，６４　合図音
，６５：人力ｒＯＪ，６６　　合図音，６７・人力ＮＪ
，６８：メッセージ，７ｌ：ガイダンス，７２：合図音，７３：音声入力，７　４　：音声認識動作。一のニ２熔藁葺第２図舅始２０４２０５第３図第４図第６図FIG. 1 is a block diagram showing an embodiment of the voice response recognition device according to the present invention, FIG. 2 is a flowchart showing the operation of the voice response recognition device in FIG. 1, and FIG. 3 is a user's proficiency level. Explanatory diagram showing an example of a proficiency table for reference, No. 4
Figure 5 is an explanatory diagram showing the service specifications for users with a low proficiency level, Figure 5 is an explanatory diagram showing the service specifications for users with a high proficiency level, and Figure 6 is an explanatory diagram showing the service specifications for the voice input trigger of the conventional system. The explanatory diagram shown in FIG. 7 is a timing chart showing the relationship between the signal sound output of the conventional system and the operation of the voice recognition device. 1. Network control unit, 2: Voice recognition unit, 3, Voice response unit, 4:
Main control unit, 5: Signal sound sending unit, 6: Caller number receiving unit,
31: Caller phone number, 32. Proficiency level, 4l: Guidance voice/signal sound, 42. Input voice, 43: Message,
44: No input rQJ, 45: Signal sound, 46: Service code rlJ, 47: Message, 48: Signal sound, 49
: Return 1 yen, 51: Guidance voice, 52: Input voice, 5
3: Message, 54 Service code roIJ, 55
Message, 56, Reply, 61: Guidance voice/signal sound, 62/Input voice, 63. Message, 64 Signal sound, 65: Human power rOJ, 66 Signal sound, 67 Human power NJ
, 68: Message, 7l: Guidance, 72: Signal sound, 73: Voice input, 74: Voice recognition operation. 1-2 2-melted straw thatching 2nd figure 204 205 3rd figure 4th figure 6th figure

Claims

[Claims]

(1) A voice response recognition device that uses voice recognition technology that recognizes voices of unspecified speakers to identify and automatically respond to a process specified by a user by voice via a telephone, the voice response recognition device is connected to a network that has a calling phone number notification function, and when the calling phone number from the user is input, it recognizes the calling phone number and calculates the operating proficiency level that has been added to the calling phone number in advance. selects the response guidance corresponding to the calling telephone number from among a plurality of response guidances prepared in advance based on the operating proficiency level, and sends/rejects a signal sound to prompt the user to input voice. A response method for a voice response recognition device, characterized in that the voice response recognition device selects a telephone number corresponding to the calling telephone number, and responds based on the selection result.