WO2019207597A1 - System and method of operating open ended interactive voice response in any spoken languages - Google Patents
System and method of operating open ended interactive voice response in any spoken languages Download PDFInfo
- Publication number
- WO2019207597A1 WO2019207597A1 PCT/IN2019/050325 IN2019050325W WO2019207597A1 WO 2019207597 A1 WO2019207597 A1 WO 2019207597A1 IN 2019050325 W IN2019050325 W IN 2019050325W WO 2019207597 A1 WO2019207597 A1 WO 2019207597A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- open ended
- interactive voice
- voice response
- users
- user
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/53—Processing of non-Latin text
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/39—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech synthesis
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2203/00—Aspects of automatic or semi-automatic exchanges
- H04M2203/20—Aspects of automatic or semi-automatic exchanges related to features of supplementary services
- H04M2203/2061—Language aspects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/42382—Text-based messaging services in telephone networks such as PSTN/ISDN, e.g. User-to-User Signalling or Short Message Service for fixed networks
Definitions
- the invention relates to the field of operating Interactive Voice Response (IVR). More particularly the invention relates to operating open ended IVR system.
- IVR Interactive Voice Response
- Interactive Voice Response is a system where a call, usually a GSM call or using other means of wireless communication for mobile devices, such as cellular or data networks is picked up by the system and a static menu is presented to the caller in a form of audio or text. Based on the user’s keypad input, a certain task is performed. Businesses set up Interactive Voice Response (IVR) to cut down cost and make serving customer effective before any human intervention is required.
- IVR Interactive Voice Response
- menus Normally when a user calls an entity, like a bank, the call is first received by an automated audio IVR system. The user is presented with some audio menus, the menu items are bound to the keys 0 - 9 of the keypad. Most menus provides instant service or access to an end point of a specific division in an entity. Bank instant services could be for instance checking balance, change pin of a card and so on, endpoint access of entity could get in touch with the loan department, contact the security department, etc. Menus are always specific to the entity and might contain services specific to the individual caller.
- a bank’s IVR system will only serve account menus to a caller who has previously registered an account with the phone number, whereas the IVR system of a pizza delivery service will only take in the order and place of delivery. IVR system may or may not use the caller’s information to process a request.
- the caller has to listen to a set of menus and perform certain keystrokes to reach an intended service or endpoint. These makes the process very time intensive. In addition the process is quite error prone as well, in case the caller makes a wrong input, the whole process might have to be repeated again. Moreover the menu of the IVR may be changed or updated as an impact, a regular caller will not be able to reach a desired service or endpoint just by remembering the keystrokes. Furthermore the endpoint or service may not fulfill the caller’s desired intent and may have to call back to the IVR again to perform the task, which is not at all user friendly and might frustrate the caller.
- the technique is not efficient enough, as the user can switch between one IVR to another, but the IVRs do not share data. Moreover, the users have to press menu buttons to reach the desired destination, which makes the system very cumbersome. Although that technology made multiple IVR systems accessible under one phone call, it was unable to make them interconnected.
- the visual IVR menu is specific to the destination and only the IVR of the destination dialed is displayed. These techniques therefore require each destination to set-up hardware, software and other facilities to be deployed for providing visual IVR servers. Moreover, the caller must be literate enough to be able to read and respond to the menu or else the caller has to listen through all the menu before proceeding.
- this technique requires a specifically configured device to interpret the codes sent as DTMF signals for generating the graphics. Moreover, an operator is required to present the graphics to the caller. Furthermore, specialized software and hardware are required at the operator to design and generate DTMF codes. Therefore, the technique faces various practical limitations.
- the audible menu scripts must be available in a particular format to enable the conversion.
- the audio menu scripts must be available or downloadable for the program to function.
- the audio menus scripts that are available can be converted to visual IVR menu scripts.
- the device of the caller must be designed or programmed to be able to display the visual IVR menu scripts.
- the effectiveness of providing the IVR in visual form is discussed in a technical paper titled ‘The Benefits of Augmenting Telephone Voice Menu Navigation with Visual Browsing and Search’ by Min Yin et al.
- the paper discusses a setup, where visual content of the IVR is sent from a service provider to a computer connected to a mobile phone.
- the technique discussed in the paper is limited to the visual content provided by the service provider's end, after the connection is established. Moreover, the providers are required to individually set up the hardware and services for providing visual content.
- An enhanced IVR system which lets the user perform multiple task across multiple services under one streamlined phone call with just voice input without using any menus.
- the user is authenticated via voice or passcode and the credentials are fetched.
- the user tells the system the intended tasks.
- the voice is converted to text using an automatic speech recognition (ASR) system which converts the users’ utterances to text.
- a natural language processing (NLP) engine corrects any mistakes the ASR made during the conversion in languages such as Austroasiatic, Austronesian, Dravidian, Indo-Aryan, Afroasiatic, Sino-Tibetan and Tai- Kadai.
- the logic system takes over and fetches a set of services needed by the tasks, checks if these tasks are valid, fetches required information and generates responses to be given to the user. Thereupon the response system takes charge, generates and presents responses through speech synthesis or in text form (e.g. through sms messages) to the user.
- the invention tackles a wide array from problems not dealt previously.
- the elimination of both acoustic and visual menus leads to more natural and time effective user interaction. There is not need to type in any information, since all data will be extracted from voice, thus less literate users are enabled to interact with the IVR system effortlessly.
- the interconnection across multiple services allows the user to perform similar tasks in a single phone all session.
- FIG. 1.0.0 is a diagrammatic view of the system organization for an exemplary embodiment of the presented invention.
- FIG. 2, 4, 5, 6 are segmented flow charts of the principal steps involved in using the presented invention.
- a caller (can be multiple) 101 can initiate a call 104 to the open ended IVR system 200.
- the caller 101 can make the call 104 using any cellular device 102 which could be a smartphone or feature phone and the call 104 will land on the system, which is running on cloud servers 200 using the cellular network 103 the GSM device 102 use.
- the caller 101 will be greeted with an audio message and asks to speak the desired intent.
- the system 200 receives the caller’s 101 intent, the system 200 fetches the required services 106 necessary to perform the intent via internet 105 based APIs (application programming interface).
- the system 200 checks if the caller 101 intent is executable and based on the intent the system 200 responds to the caller 101 in the following ways:
- the system 200 reports the caller 101 the informative intent requested in audio or text via SMS Gateway servers.
- the system 200 might ask the caller’s 101 consent to execute the requested task, presenting an audio format of the task to be executed.
- the system 200 might report the caller 101 error in the intent in audio form.
- the caller 101 might ask to execute another intent or end the call. If the caller 101 receives an error, the caller 101 will be asked to repeat the corrected intent. For example: A caller 101 calls the system 200 and the system 200 greets the caller 101 with an audio message and asks the caller 101 for the intent. The caller 101 speaks“Buy 2 dozen eggs from EGGY Store and pay the bill from my AB bank account and deliver the eggs to my home by EATS” the system 200 processes the intent and responds to the caller 101 in audio with‘’From EGGY Store purchase 24 eggs. An amount BDT 50 will be debated from your AB bank account number XXXXX by EGGY.
- the caller 101 responds with“YES” and the system 200 replies“Your request was successfully executed”. The caller 101 ends the call.
- the system 200 sends two receipts via SMS, one from EGGY store and another from EATS delivery service.
- the system may invoke a particular service multiple times as well based on the caller 101 needs.
- a caller 101 calls the system 200.
- the system 200 greets the caller 101 with an audio message and asks the caller 101 for the intent.
- the caller 101 speaks“Transfer BDT 5000 from my account in ABC bank to account number XXXXX in ABC Bank and pay my Electric bill ID XXXX with my ABC bank account”
- the system 200 processes the intent and responds to the caller 101 in audio with“From your account XXXX in ABC bank transfer BDT 5000 to account XXXXXX in ABC.
- An amount BDT 4200 will be deducted from your ABC bank account XXXX in refer to the Electric bill ID XXXX. Confirm task?”.
- the caller 101 responds with“YES” and the system 200 replies“Your request was successfully executed”.
- the caller 101 cuts the line.
- the system 200 sends two SMS logs, one for account transfer another for the electric bill payment.
- FIG. 2 a diagrammatic view of the open ended IVR system 200 for an exemplary embodiment of the presented invention is shown.
- the Call Landing server 201 handles the call 104 and routes the call 104 to the authentication system 300.
- the authentication system 300 identifies the caller 101 and fetches the caller 101 credentials.
- the audio speech of the caller 101 is converted to a machine readable intent by the intent handler system 400 and passed to the logic system 500 for further processing.
- the logic system 500 fetches the required services and viability of the intent and gives feedback to the response system 500.
- the response system converts the feedback into a human understandable format and passes the feedback to the caller 101 via the available channels.
- FIG. 3 a diagrammatic view of the open ended IVR system 200 for an embodiment of the present invention.
- FIG. 4 a diagrammatic view of the intent handler system 400 for an exemplary embodiment of the present invention is shown.
- the audio from the caller 202 is passed to the automatic speech recognition (ASR) system 401 which converts the audio speech to text 402.
- the text 402 is passed to the natural language processing system (NLP) 403 which corrects any error generated during audio to text conversion in languages such as Austroasiatic, Austronesian, Dravidian, Indo-Aryan, Afroasiatic, Sino-Tibetan and Tai-Kadai.
- the corrected text then is then passed to the contextual AI system, which reads the text and collects the tasks 406, the caller 101 wants to perform.
- the machine readable intent 406 is passed to the logic system 500 for further processing.
- Intent 406 can be one of the following 3 types:
- the logic system 500 retrieves the services required to perform the intent - 504. Then the logic system 500 performs operations to evaluate whether the intent isvalid and executable - 506. If the intent 507 requires only information, the data is retrieved and passed to the response system 600 or the intent waits for the caller’s 101 confirmation or the intent contains an error, which has to be corrected by the caller 101 by providing a new intent 406. Upon receiving the confirmation intent the tasks are executed and the result is passed on to the response system 600.
- FIG. 6 a diagrammatic view of the response system 600 for an exemplary embodiment of the presented invention is shown.
- the logic system generates 2 types of responses:
- the system needs a confirmation on the task list. 502
- a report is generated to be delivered to the caller 101. It can be formation or error. 503
- the response is converted to human readable text 504 and based on the logic system response 501.
- the human readable text 504 is either passed to the text to speech (TTS) system 505 or aSMS gateway server, before it is delivered to the caller 101.
- TTS text to speech
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
- Nitrogen Condensed Heterocyclic Rings (AREA)
- Coloring Foods And Improving Nutritive Qualities (AREA)
- Medicines Containing Plant Substances (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2019260038A AU2019260038A1 (en) | 2018-04-23 | 2019-04-22 | System and method of operating open ended interactive voice response in any spoken languages |
PH12020551761A PH12020551761A1 (en) | 2018-04-23 | 2020-10-22 | System and method of operating open ended interactive voice response in any spoken languages |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BD1192018 | 2018-04-23 | ||
BD119/2018 | 2018-04-23 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019207597A1 true WO2019207597A1 (en) | 2019-10-31 |
Family
ID=68295833
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IN2019/050325 WO2019207597A1 (en) | 2018-04-23 | 2019-04-22 | System and method of operating open ended interactive voice response in any spoken languages |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2019260038A1 (en) |
PH (1) | PH12020551761A1 (en) |
WO (1) | WO2019207597A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021135548A1 (en) * | 2020-06-05 | 2021-07-08 | 平安科技(深圳)有限公司 | Voice intent recognition method and device, computer equipment and storage medium |
CN113392847A (en) * | 2021-06-17 | 2021-09-14 | 拉萨搻若文化艺术产业开发有限公司 | OCR (optical character recognition) handheld scanning translation device and translation method for Tibetan Chinese and English |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113905135B (en) * | 2021-10-14 | 2023-10-20 | 天津车之家软件有限公司 | User intention recognition method and device of intelligent outbound robot |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6418199B1 (en) * | 1997-12-05 | 2002-07-09 | Jeffrey Perrone | Voice control of a server |
US6944592B1 (en) * | 1999-11-05 | 2005-09-13 | International Business Machines Corporation | Interactive voice response system |
US20170228367A1 (en) * | 2012-04-20 | 2017-08-10 | Maluuba Inc. | Conversational agent |
-
2019
- 2019-04-22 WO PCT/IN2019/050325 patent/WO2019207597A1/en active Application Filing
- 2019-04-22 AU AU2019260038A patent/AU2019260038A1/en not_active Abandoned
-
2020
- 2020-10-22 PH PH12020551761A patent/PH12020551761A1/en unknown
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6418199B1 (en) * | 1997-12-05 | 2002-07-09 | Jeffrey Perrone | Voice control of a server |
US6944592B1 (en) * | 1999-11-05 | 2005-09-13 | International Business Machines Corporation | Interactive voice response system |
US20170228367A1 (en) * | 2012-04-20 | 2017-08-10 | Maluuba Inc. | Conversational agent |
Non-Patent Citations (1)
Title |
---|
AMAZON: "Voice Shopping with Alexa", YOUTUBE, 18 November 2016 (2016-11-18), XP054979967, Retrieved from the Internet <URL:https://youtu.be/mCjvV3iFsuw> * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2021135548A1 (en) * | 2020-06-05 | 2021-07-08 | 平安科技(深圳)有限公司 | Voice intent recognition method and device, computer equipment and storage medium |
CN113392847A (en) * | 2021-06-17 | 2021-09-14 | 拉萨搻若文化艺术产业开发有限公司 | OCR (optical character recognition) handheld scanning translation device and translation method for Tibetan Chinese and English |
CN113392847B (en) * | 2021-06-17 | 2023-12-05 | 拉萨搻若文化艺术产业开发有限公司 | Tibetan Chinese-English three-language OCR handheld scanning translation device and translation method |
Also Published As
Publication number | Publication date |
---|---|
PH12020551761A1 (en) | 2021-07-12 |
AU2019260038A1 (en) | 2020-12-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2019236638B2 (en) | User authentication via mobile phone | |
US10009463B2 (en) | Multi-channel delivery platform | |
US8000454B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US8223931B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US8442563B2 (en) | Automated text-based messaging interaction using natural language understanding technologies | |
US10063701B2 (en) | Custom grammars builder platform | |
US7471786B2 (en) | Interactive voice response system with partial human monitoring | |
US20090144131A1 (en) | Advertising method and apparatus | |
WO2019207597A1 (en) | System and method of operating open ended interactive voice response in any spoken languages | |
US8625756B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
EP3138272B1 (en) | Voice call diversion to alternate communication method | |
US20160370952A1 (en) | Visual interactive voice response system | |
EP3047638B1 (en) | Multi-channel delivery platform | |
WO2010107649A1 (en) | Cross channel contact history management | |
US8731148B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US8867708B1 (en) | Systems and methods for visual presentation and selection of IVR menu | |
US9247055B2 (en) | Interactive voice response (IVR) routing system | |
US10672093B2 (en) | Delivery order relaying system using TTS and method therefor | |
US10404849B2 (en) | Launching a designated application using a set of signals | |
WO2011130077A1 (en) | System and method for intermediating between subscriber devices and communication service providers | |
US11743386B2 (en) | System and method of controlling and implementing a communication platform as a service | |
US20150312411A1 (en) | Method for directing a phone call to a web-based menu access point via a passive telephone access point | |
US20200322293A1 (en) | Information processing system and method | |
KR20090099924A (en) | Method and unit for interactive multimedia response to multiple call distribution | |
KR20120103278A (en) | Remote banking system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19791474 Country of ref document: EP Kind code of ref document: A1 |
|
DPE1 | Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101) | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2019260038 Country of ref document: AU Date of ref document: 20190422 Kind code of ref document: A |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19791474 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19791474 Country of ref document: EP Kind code of ref document: A1 |