EP1224797A1 - Procede et systeme d'interface vocale - Google Patents
Procede et systeme d'interface vocaleInfo
- Publication number
- EP1224797A1 EP1224797A1 EP00965546A EP00965546A EP1224797A1 EP 1224797 A1 EP1224797 A1 EP 1224797A1 EP 00965546 A EP00965546 A EP 00965546A EP 00965546 A EP00965546 A EP 00965546A EP 1224797 A1 EP1224797 A1 EP 1224797A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- caller
- service
- information
- business
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 61
- 230000001755 vocal effect Effects 0.000 title claims description 56
- 230000007704 transition Effects 0.000 claims description 17
- 230000004044 response Effects 0.000 claims description 16
- 230000014509 gene expression Effects 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 4
- 230000009471 action Effects 0.000 claims description 3
- 230000003993 interaction Effects 0.000 claims 3
- 230000000977 initiatory effect Effects 0.000 claims 1
- 239000008186 active pharmaceutical agent Substances 0.000 abstract description 2
- 238000010586 diagram Methods 0.000 description 36
- 230000002123 temporal effect Effects 0.000 description 6
- 238000012790 confirmation Methods 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000000763 evoking effect Effects 0.000 description 2
- 238000011272 standard treatment Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002360 explosive Substances 0.000 description 1
- 230000010006 flight Effects 0.000 description 1
- 229910000078 germane Inorganic materials 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/487—Arrangements for providing information services, e.g. recorded voice services or time announcements
- H04M3/493—Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
- H04M3/4936—Speech interaction details
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2242/00—Special services or facilities
- H04M2242/22—Automatic class or number identification arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/38—Graded-service arrangements, i.e. some subscribers prevented from establishing certain connections
- H04M3/382—Graded-service arrangements, i.e. some subscribers prevented from establishing certain connections using authorisation codes or passwords
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/42025—Calling or Called party identification service
- H04M3/42034—Calling party identification service
- H04M3/42059—Making use of the calling party identifier
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04Q—SELECTING
- H04Q3/00—Selecting arrangements
- H04Q3/72—Finding out and indicating number of calling subscriber
Definitions
- Web pages has for example, experienced explosive growth. Moreover, the
- VUI vocal user interface
- VUI implementations tend to be cumbersome
- the present invention comprises a VUI Speech Object Application
- Fig. A1 Depicts a conversational state diagram of an embodiment of a
- Fig. A2 Depicts a conversational state diagram of an embodiment of a
- Fig. A3 Depicts a conversational state diagram of an embodiment of a
- Fig. A4 Depicts a conversational state diagram of an embodiment of a
- Fig. B1 Depicts a conversational state diagram of an embodiment of a
- Fig. B2 Depicts a conversational state diagram of an alternate embodiment of a Traffic Condition Speech Object.
- Fig. C1 Depicts a conversational state diagram of an embodiment of a 1/26350
- Fig. C2 Depicts a conversational state diagram of an embodiment of a extended functionality of the Business Finder Speech Object.
- Fig. D1 Depicts a conversational state diagram of an embodiment of a Stock Information Speech Object.
- Fig. D2 Depicts a conversational state diagram of an embodiment of extended functionality of a Stock Information Speech Object.
- Fig. D3 Depicts a conversational state diagram of an embodiment of extended functionality of a Stock Information Speech Object.
- Fig. E1 Depicts a conversational state diagram of an embodiment of a Weather Speech Object.
- Fig. E2 Depicts a conversational state diagram of an embodiment of a List Speech Object for conveying weather information to the caller.
- Fig. F1 Depicts a conversational state diagram of an embodiment of a Address Locating Speech Object.
- Fig. F2 Depicts a conversational state diagram of an embodiment of a Address Disambiguation Speech Object.
- Fig- G1 Depicts a conversational state diagram of an embodiment of a Flight Finder Speech Object.
- Fig- G2 Depicts a conversational state diagram of an embodiment of a Flight Information Speech Object.
- Fig- G3 Depicts a conversational state diagram of an embodiment of a Itinerary Speech Object.
- Fig. H Depicts a conversational state diagram of an embodiment of a Driving Directions Speech Object.
- scaleable system architecture that includes at least one each of the following;
- Vocal User Interface Application Server
- Telephony Server a Telephony Server
- Speech Recognition Server a Text-to-Speech Server
- Media Server a Media Server
- API Application Program Interface
- the presently preferred backbone network comprises a TCP/IP
- a caller connects to the Telephony Server by dialing a telephone
- the Telephony Server includes a Telephone Network (PSTN).
- PSTN Telephone Network
- the Telephony Server includes a Telephone Network (PSTN).
- FIG. 1 depicts the Telephone Network Interface coupled
- PSTN Public Switched Telephone Network
- Telephone Network Interface further comprises speech signal processing
- the VUI Application Server comprises hardware under control of a VUI
- the VUI Application implements a vocally navigable Speech Object interface between the caller and the API of the independent Service-
- VUI Application further comprises distinct program
- speech grammars that are particularly germane to the
- a Traffic Condition Module a Traffic Condition Module
- a Business Finder Module a Stock Finder Module
- the Media Server comprises hardware under program control to store
- the Media Server conveys speech objects to the voice
- Speech Recognition Server comprises hardware under program control to
- VUI Application translates the caller's uttered Service- content
- Speech Objects that further comprise reused
- the caller may vocalize a primary specific navigable point or a
- Service-Database e.g. "Traffic Conditions Database”, “Home Menu”, or “Stock Information Database”
- the List Speech Object comprises a preamble that will convey
- Speech Object Further, selection of an item in the list or getting more
- disambiguation in accordance with the present invention comprises a method
- the first step is to convey the ambiguous
- caller is accomplished with appropriate utterance and speech grammar (e.g.
- Disambiguating Speech Object further creates dynamic speech grammars
- the Main Menu Speech Object comprises
- Diagram A depicts several possible transitions depending upon the
- the caller may utter a grammar associated with a
- Service-Database program module e.g. "Traffic” or with one of
- caller administrative program modules e.g. "Login”, “New Account”,
- the Main Menu document confirms the caller's choice (e.g.
- Figure A2 depicts a Login Speech Object (SOLogin A4) that permits
- PIN personal identification number
- Login Speech Object associates each caller's PIN with their telephone
- Figure A3 depicts a Passcode Speech Object to
- Figure A4 depicts a New Account Speech Object to
- the caller may at any time return to the Main Menu document by
- the caller having to retrace the same navigated path.
- the caller may utter several of the same navigational choices previously
- Figure B1 depicts a Traffic Conditions Program Module conversational
- Traffic Speech Object The caller can select the Traffic
- Speech Object from the above Main Menu Speech Object module by uttering
- the Traffic Speech Object coveys prompts that direct the caller to utterances that indicate a region of interest for traffic condition information.
- the Traffic Speech Object first processes the caller's area code
- the Traffic Speech Object prompts the caller to
- Traffic Module confirms a metro area associated with the caller's selected
- the Traffic Module confirms that it will search the traffic
- Service-Database e.g SO_GetMetroTraffic B2
- the Traffic Module will prompt the caller for a new metro area if the caller at this time cancels the pending search by uttering a "cancel"
- the caller can request additional information by uttering "that one.”
- the Traffic Module prompts the caller to optionally perform
- the Traffic Speech Object engages the
- Figure C depicts a Business Finder Service Program Module
- the presumption is conveyed to the caller (e.g.
- the caller is prompted (e.g. SO_Brand/Category
- the Business Finder Speech Object automatically first filters out matches that are more than a specified distance away (e.g. more that 50
- the caller is prompted whether a new search is desired.
- the Business Finder Speech Object further includes the ability for the
- Figure C2 depicts a conversational state diagram of this
- the caller may also select a business
- the Business Finder Speech Object audibly confirms the
- the Telephony Network Server initiates a telephone call to the Telephony Network Server
- Figure D1 depicts a Stock Information Program Module conversational
- a speech object audibly alerts the caller that the Stock Information
- the caller may opt out of the assumption by uttering a speech
- a stock information indicator e.g. company name, ticker symbol,
- the Stock Information Speech Object performs a search of the stock
- the caller may wish to receive detailed information about stocks or abbreviated information.
- Object recognizes both contextually global - non temporal utterances (e.g.
- Figure D2 depicts a conversational state diagram reflecting additional
- the Stock Sub-Module performs a search of the stock information
- Figure D3 depicts an example of a conversational state diagram for
- Figure E1 depicts the Weather Conditions Speech Object ("Weather
- the Weather Speech Object infers a city for the caller based upon
- the Weather Speech Object retrieves the weather information for
- Figure E2 depicts a List Speech Object for conveying the weather
- Figure F1 depicts the conversation state diagram of the Address
- Locating Program Module ("Address Speech Object").
- the Address Speech Object is ordinarily transitioned to from another Speech Object that needs to
- Landmarks are preassigned speech grammars that can be both global
- the Airport Finder Speech Object searches the caller private profile
- the Address Module will access the address associated with
- the Address Module prompts the caller
- Speech Object engages the caller in speech objects that enable the caller to
- Speech Object or alternatively, to begin searching from scratch. For example,
- the caller may change the street name, or the cross street name.
- FIG. G1 depicts the conversation state diagram of the Flight
- the caller is greeted and prompted to
- Flight Information Program Module (“Flight Information Speech Object").
- Figure G2 depicts the conversation state diagram of the Flight
- Flight Information Program Module (“Flight Information Speech Object"). Upon a transition to the Flight Information Speech Object, the caller is prompted to
- Flight Information Speech Object transitions to speech objects that prompt the
- Flight Information Speech Object will transition alternate speech
- Flight Information Speech Object will convey the flight status information
- the caller may also pick a flight without any specific information about
- Figure G3 depicts the Itinerary Speech Object
- Object includes speech objects that allow the caller to choose a flight
- Object will engage the caller in a speech object to determine the airline if it is
- the Itinerary Speech Object engages the caller in speech objects to
- Service-Database 60 conveys it to the caller (SOReadRoutes G17).
- the Driving Directions Speech Object determines point-to-point driving
- Speech Object can be evoked both as a stand-alone program module Speech Object or from another program module, such as the Business Finder Speech
- Speech Object If the Speech Object is evoked from another program module such as
- Speech Object contains speech objects to determine either
- the Driving Directions Speech Object interfaces with the API of an
- the caller may also receive the driving directions by
- the caller's driving directions includes a particularly long stretch of road
- the Driving Directions Speech Object has the
- a caller may use the Driving Directions Speech Object to determine directions to a particular location, save the driving
Landscapes
- Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
Abstract
L'invention concerne une application d'objet de langage d'interface d'utilisateur vocale (VUI), comprenant des objets de language de module de programme qui constituent une interface avec les interfaces de programme d'application (API) des bases de données-services, de façon à extraire une information désirée de l'appelant.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15696899P | 1999-10-01 | 1999-10-01 | |
US156968P | 1999-10-01 | ||
PCT/US2000/026935 WO2001026350A1 (fr) | 1999-10-01 | 2000-09-28 | Procede et systeme d'interface vocale |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1224797A1 true EP1224797A1 (fr) | 2002-07-24 |
Family
ID=22561826
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00965546A Withdrawn EP1224797A1 (fr) | 1999-10-01 | 2000-09-28 | Procede et systeme d'interface vocale |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1224797A1 (fr) |
AU (1) | AU7624800A (fr) |
WO (1) | WO2001026350A1 (fr) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002318132A (ja) | 2001-04-23 | 2002-10-31 | Hitachi Ltd | 音声対話型ナビゲーションシステムおよび移動端末装置および音声対話サーバ |
FR2827695A1 (fr) * | 2001-07-23 | 2003-01-24 | France Telecom | Portail de services de telecommunications comprenant un serveur avec reconnaissance vocale et equipement de navigation et de guidage utilisant ledit portail |
WO2017132660A1 (fr) * | 2016-01-29 | 2017-08-03 | Liquid Analytics, Inc. | Systèmes et procédés permettant un traitement dynamique de la gestion des flux |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0175503A1 (fr) * | 1984-09-06 | 1986-03-26 | BRITISH TELECOMMUNICATIONS public limited company | Procédé et dispositif pour dialogue interactif |
US5774860A (en) * | 1994-06-27 | 1998-06-30 | U S West Technologies, Inc. | Adaptive knowledge base of complex information through interactive voice dialogue |
IL129893A0 (en) * | 1996-11-28 | 2000-02-29 | British Telecomm | Interactive apparatus |
EP0922279A3 (fr) * | 1997-01-09 | 1999-09-01 | Koninklijke Philips Electronics N.V. | Procede et appareil pour executer un dialogue homme-machine sous la forme d'un discours bilateral sur la base d'une structure de dialogue modulaire |
EP0895396A3 (fr) * | 1997-07-03 | 2004-01-14 | Texas Instruments Incorporated | Systéme de dialogue parlé pour accés d'information |
EP1090495A1 (fr) * | 1999-04-21 | 2001-04-11 | Ranjeet Nabha | Procede et systeme permettant de fournir une information offerte sur internet sous une forme sonore |
-
2000
- 2000-09-28 AU AU76248/00A patent/AU7624800A/en not_active Abandoned
- 2000-09-28 EP EP00965546A patent/EP1224797A1/fr not_active Withdrawn
- 2000-09-28 WO PCT/US2000/026935 patent/WO2001026350A1/fr not_active Application Discontinuation
Non-Patent Citations (1)
Title |
---|
See references of WO0126350A1 * |
Also Published As
Publication number | Publication date |
---|---|
AU7624800A (en) | 2001-05-10 |
WO2001026350A1 (fr) | 2001-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20210201932A1 (en) | Method of and system for real time feedback in an incremental speech input interface | |
US7627096B2 (en) | System and method for independently recognizing and selecting actions and objects in a speech recognition system | |
US7450698B2 (en) | System and method of utilizing a hybrid semantic model for speech recognition | |
US8185539B1 (en) | Web site or directory search using speech recognition of letters | |
US6708150B1 (en) | Speech recognition apparatus and speech recognition navigation apparatus | |
US9202247B2 (en) | System and method utilizing voice search to locate a product in stores from a phone | |
EP2560158B1 (fr) | Système d'exploitation et procédé d'exploitation | |
US7376640B1 (en) | Method and system for searching an information retrieval system according to user-specified location information | |
KR100383352B1 (ko) | 음성작동서비스 | |
US6246986B1 (en) | User barge-in enablement in large vocabulary speech recognition systems | |
US20030191639A1 (en) | Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition | |
US20030115289A1 (en) | Navigation in a voice recognition system | |
US20030149566A1 (en) | System and method for a spoken language interface to a large database of changing records | |
US7689425B2 (en) | Quality of service call routing system using counselor and speech recognition engine and method thereof | |
US20020143548A1 (en) | Automated database assistance via telephone | |
US20020184023A1 (en) | Multi-context conversational environment system and method | |
US20040153322A1 (en) | Menu-based, speech actuated system with speak-ahead capability | |
EP2289231A1 (fr) | Système et procédé utilisant la recherche vocale pour localiser un produit dans un magasin à partir d'un téléphone | |
US8428241B2 (en) | Semi-supervised training of destination map for call handling applications | |
US20060020471A1 (en) | Method and apparatus for robustly locating user barge-ins in voice-activated command systems | |
TWI698756B (zh) | 查詢服務之系統與方法 | |
JP2004518195A (ja) | データベース言語モデルによる自動対話システム | |
US11056113B2 (en) | Conversation guidance method of speech recognition system | |
EP1224797A1 (fr) | Procede et systeme d'interface vocale | |
JP2003016087A (ja) | 自動セクタ情報システムを動作する方法及びシステム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20020412 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20040401 |