EP1224797A1 - Procede et systeme d'interface vocale - Google Patents

Procede et systeme d'interface vocale

Info

Publication number
EP1224797A1
EP1224797A1 EP00965546A EP00965546A EP1224797A1 EP 1224797 A1 EP1224797 A1 EP 1224797A1 EP 00965546 A EP00965546 A EP 00965546A EP 00965546 A EP00965546 A EP 00965546A EP 1224797 A1 EP1224797 A1 EP 1224797A1
Authority
EP
European Patent Office
Prior art keywords
caller
service
information
business
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP00965546A
Other languages
German (de)
English (en)
Inventor
C. Mikael Berner
Amol M. Joshi
Lisa M. Guerra
Kevin M. Stone
Steve T. Tran
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bevocal LLC
Original Assignee
Bevocal LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bevocal LLC filed Critical Bevocal LLC
Publication of EP1224797A1 publication Critical patent/EP1224797A1/fr
Withdrawn legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2242/00Special services or facilities
    • H04M2242/22Automatic class or number identification arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/38Graded-service arrangements, i.e. some subscribers prevented from establishing certain connections
    • H04M3/382Graded-service arrangements, i.e. some subscribers prevented from establishing certain connections using authorisation codes or passwords
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42025Calling or Called party identification service
    • H04M3/42034Calling party identification service
    • H04M3/42059Making use of the calling party identifier
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04QSELECTING
    • H04Q3/00Selecting arrangements
    • H04Q3/72Finding out and indicating number of calling subscriber

Definitions

  • Web pages has for example, experienced explosive growth. Moreover, the
  • VUI vocal user interface
  • VUI implementations tend to be cumbersome
  • the present invention comprises a VUI Speech Object Application
  • Fig. A1 Depicts a conversational state diagram of an embodiment of a
  • Fig. A2 Depicts a conversational state diagram of an embodiment of a
  • Fig. A3 Depicts a conversational state diagram of an embodiment of a
  • Fig. A4 Depicts a conversational state diagram of an embodiment of a
  • Fig. B1 Depicts a conversational state diagram of an embodiment of a
  • Fig. B2 Depicts a conversational state diagram of an alternate embodiment of a Traffic Condition Speech Object.
  • Fig. C1 Depicts a conversational state diagram of an embodiment of a 1/26350
  • Fig. C2 Depicts a conversational state diagram of an embodiment of a extended functionality of the Business Finder Speech Object.
  • Fig. D1 Depicts a conversational state diagram of an embodiment of a Stock Information Speech Object.
  • Fig. D2 Depicts a conversational state diagram of an embodiment of extended functionality of a Stock Information Speech Object.
  • Fig. D3 Depicts a conversational state diagram of an embodiment of extended functionality of a Stock Information Speech Object.
  • Fig. E1 Depicts a conversational state diagram of an embodiment of a Weather Speech Object.
  • Fig. E2 Depicts a conversational state diagram of an embodiment of a List Speech Object for conveying weather information to the caller.
  • Fig. F1 Depicts a conversational state diagram of an embodiment of a Address Locating Speech Object.
  • Fig. F2 Depicts a conversational state diagram of an embodiment of a Address Disambiguation Speech Object.
  • Fig- G1 Depicts a conversational state diagram of an embodiment of a Flight Finder Speech Object.
  • Fig- G2 Depicts a conversational state diagram of an embodiment of a Flight Information Speech Object.
  • Fig- G3 Depicts a conversational state diagram of an embodiment of a Itinerary Speech Object.
  • Fig. H Depicts a conversational state diagram of an embodiment of a Driving Directions Speech Object.
  • scaleable system architecture that includes at least one each of the following;
  • Vocal User Interface Application Server
  • Telephony Server a Telephony Server
  • Speech Recognition Server a Text-to-Speech Server
  • Media Server a Media Server
  • API Application Program Interface
  • the presently preferred backbone network comprises a TCP/IP
  • a caller connects to the Telephony Server by dialing a telephone
  • the Telephony Server includes a Telephone Network (PSTN).
  • PSTN Telephone Network
  • the Telephony Server includes a Telephone Network (PSTN).
  • FIG. 1 depicts the Telephone Network Interface coupled
  • PSTN Public Switched Telephone Network
  • Telephone Network Interface further comprises speech signal processing
  • the VUI Application Server comprises hardware under control of a VUI
  • the VUI Application implements a vocally navigable Speech Object interface between the caller and the API of the independent Service-
  • VUI Application further comprises distinct program
  • speech grammars that are particularly germane to the
  • a Traffic Condition Module a Traffic Condition Module
  • a Business Finder Module a Stock Finder Module
  • the Media Server comprises hardware under program control to store
  • the Media Server conveys speech objects to the voice
  • Speech Recognition Server comprises hardware under program control to
  • VUI Application translates the caller's uttered Service- content
  • Speech Objects that further comprise reused
  • the caller may vocalize a primary specific navigable point or a
  • Service-Database e.g. "Traffic Conditions Database”, “Home Menu”, or “Stock Information Database”
  • the List Speech Object comprises a preamble that will convey
  • Speech Object Further, selection of an item in the list or getting more
  • disambiguation in accordance with the present invention comprises a method
  • the first step is to convey the ambiguous
  • caller is accomplished with appropriate utterance and speech grammar (e.g.
  • Disambiguating Speech Object further creates dynamic speech grammars
  • the Main Menu Speech Object comprises
  • Diagram A depicts several possible transitions depending upon the
  • the caller may utter a grammar associated with a
  • Service-Database program module e.g. "Traffic” or with one of
  • caller administrative program modules e.g. "Login”, “New Account”,
  • the Main Menu document confirms the caller's choice (e.g.
  • Figure A2 depicts a Login Speech Object (SOLogin A4) that permits
  • PIN personal identification number
  • Login Speech Object associates each caller's PIN with their telephone
  • Figure A3 depicts a Passcode Speech Object to
  • Figure A4 depicts a New Account Speech Object to
  • the caller may at any time return to the Main Menu document by
  • the caller having to retrace the same navigated path.
  • the caller may utter several of the same navigational choices previously
  • Figure B1 depicts a Traffic Conditions Program Module conversational
  • Traffic Speech Object The caller can select the Traffic
  • Speech Object from the above Main Menu Speech Object module by uttering
  • the Traffic Speech Object coveys prompts that direct the caller to utterances that indicate a region of interest for traffic condition information.
  • the Traffic Speech Object first processes the caller's area code
  • the Traffic Speech Object prompts the caller to
  • Traffic Module confirms a metro area associated with the caller's selected
  • the Traffic Module confirms that it will search the traffic
  • Service-Database e.g SO_GetMetroTraffic B2
  • the Traffic Module will prompt the caller for a new metro area if the caller at this time cancels the pending search by uttering a "cancel"
  • the caller can request additional information by uttering "that one.”
  • the Traffic Module prompts the caller to optionally perform
  • the Traffic Speech Object engages the
  • Figure C depicts a Business Finder Service Program Module
  • the presumption is conveyed to the caller (e.g.
  • the caller is prompted (e.g. SO_Brand/Category
  • the Business Finder Speech Object automatically first filters out matches that are more than a specified distance away (e.g. more that 50
  • the caller is prompted whether a new search is desired.
  • the Business Finder Speech Object further includes the ability for the
  • Figure C2 depicts a conversational state diagram of this
  • the caller may also select a business
  • the Business Finder Speech Object audibly confirms the
  • the Telephony Network Server initiates a telephone call to the Telephony Network Server
  • Figure D1 depicts a Stock Information Program Module conversational
  • a speech object audibly alerts the caller that the Stock Information
  • the caller may opt out of the assumption by uttering a speech
  • a stock information indicator e.g. company name, ticker symbol,
  • the Stock Information Speech Object performs a search of the stock
  • the caller may wish to receive detailed information about stocks or abbreviated information.
  • Object recognizes both contextually global - non temporal utterances (e.g.
  • Figure D2 depicts a conversational state diagram reflecting additional
  • the Stock Sub-Module performs a search of the stock information
  • Figure D3 depicts an example of a conversational state diagram for
  • Figure E1 depicts the Weather Conditions Speech Object ("Weather
  • the Weather Speech Object infers a city for the caller based upon
  • the Weather Speech Object retrieves the weather information for
  • Figure E2 depicts a List Speech Object for conveying the weather
  • Figure F1 depicts the conversation state diagram of the Address
  • Locating Program Module ("Address Speech Object").
  • the Address Speech Object is ordinarily transitioned to from another Speech Object that needs to
  • Landmarks are preassigned speech grammars that can be both global
  • the Airport Finder Speech Object searches the caller private profile
  • the Address Module will access the address associated with
  • the Address Module prompts the caller
  • Speech Object engages the caller in speech objects that enable the caller to
  • Speech Object or alternatively, to begin searching from scratch. For example,
  • the caller may change the street name, or the cross street name.
  • FIG. G1 depicts the conversation state diagram of the Flight
  • the caller is greeted and prompted to
  • Flight Information Program Module (“Flight Information Speech Object").
  • Figure G2 depicts the conversation state diagram of the Flight
  • Flight Information Program Module (“Flight Information Speech Object"). Upon a transition to the Flight Information Speech Object, the caller is prompted to
  • Flight Information Speech Object transitions to speech objects that prompt the
  • Flight Information Speech Object will transition alternate speech
  • Flight Information Speech Object will convey the flight status information
  • the caller may also pick a flight without any specific information about
  • Figure G3 depicts the Itinerary Speech Object
  • Object includes speech objects that allow the caller to choose a flight
  • Object will engage the caller in a speech object to determine the airline if it is
  • the Itinerary Speech Object engages the caller in speech objects to
  • Service-Database 60 conveys it to the caller (SOReadRoutes G17).
  • the Driving Directions Speech Object determines point-to-point driving
  • Speech Object can be evoked both as a stand-alone program module Speech Object or from another program module, such as the Business Finder Speech
  • Speech Object If the Speech Object is evoked from another program module such as
  • Speech Object contains speech objects to determine either
  • the Driving Directions Speech Object interfaces with the API of an
  • the caller may also receive the driving directions by
  • the caller's driving directions includes a particularly long stretch of road
  • the Driving Directions Speech Object has the
  • a caller may use the Driving Directions Speech Object to determine directions to a particular location, save the driving

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

L'invention concerne une application d'objet de langage d'interface d'utilisateur vocale (VUI), comprenant des objets de language de module de programme qui constituent une interface avec les interfaces de programme d'application (API) des bases de données-services, de façon à extraire une information désirée de l'appelant.
EP00965546A 1999-10-01 2000-09-28 Procede et systeme d'interface vocale Withdrawn EP1224797A1 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15696899P 1999-10-01 1999-10-01
US156968P 1999-10-01
PCT/US2000/026935 WO2001026350A1 (fr) 1999-10-01 2000-09-28 Procede et systeme d'interface vocale

Publications (1)

Publication Number Publication Date
EP1224797A1 true EP1224797A1 (fr) 2002-07-24

Family

ID=22561826

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00965546A Withdrawn EP1224797A1 (fr) 1999-10-01 2000-09-28 Procede et systeme d'interface vocale

Country Status (3)

Country Link
EP (1) EP1224797A1 (fr)
AU (1) AU7624800A (fr)
WO (1) WO2001026350A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002318132A (ja) 2001-04-23 2002-10-31 Hitachi Ltd 音声対話型ナビゲーションシステムおよび移動端末装置および音声対話サーバ
FR2827695A1 (fr) * 2001-07-23 2003-01-24 France Telecom Portail de services de telecommunications comprenant un serveur avec reconnaissance vocale et equipement de navigation et de guidage utilisant ledit portail
WO2017132660A1 (fr) * 2016-01-29 2017-08-03 Liquid Analytics, Inc. Systèmes et procédés permettant un traitement dynamique de la gestion des flux

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0175503A1 (fr) * 1984-09-06 1986-03-26 BRITISH TELECOMMUNICATIONS public limited company Procédé et dispositif pour dialogue interactif
US5774860A (en) * 1994-06-27 1998-06-30 U S West Technologies, Inc. Adaptive knowledge base of complex information through interactive voice dialogue
IL129893A0 (en) * 1996-11-28 2000-02-29 British Telecomm Interactive apparatus
EP0922279A3 (fr) * 1997-01-09 1999-09-01 Koninklijke Philips Electronics N.V. Procede et appareil pour executer un dialogue homme-machine sous la forme d'un discours bilateral sur la base d'une structure de dialogue modulaire
EP0895396A3 (fr) * 1997-07-03 2004-01-14 Texas Instruments Incorporated Systéme de dialogue parlé pour accés d'information
EP1090495A1 (fr) * 1999-04-21 2001-04-11 Ranjeet Nabha Procede et systeme permettant de fournir une information offerte sur internet sous une forme sonore

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0126350A1 *

Also Published As

Publication number Publication date
AU7624800A (en) 2001-05-10
WO2001026350A1 (fr) 2001-04-12

Similar Documents

Publication Publication Date Title
US20210201932A1 (en) Method of and system for real time feedback in an incremental speech input interface
US7627096B2 (en) System and method for independently recognizing and selecting actions and objects in a speech recognition system
US7450698B2 (en) System and method of utilizing a hybrid semantic model for speech recognition
US8185539B1 (en) Web site or directory search using speech recognition of letters
US6708150B1 (en) Speech recognition apparatus and speech recognition navigation apparatus
US9202247B2 (en) System and method utilizing voice search to locate a product in stores from a phone
EP2560158B1 (fr) Système d'exploitation et procédé d'exploitation
US7376640B1 (en) Method and system for searching an information retrieval system according to user-specified location information
KR100383352B1 (ko) 음성작동서비스
US6246986B1 (en) User barge-in enablement in large vocabulary speech recognition systems
US20030191639A1 (en) Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition
US20030115289A1 (en) Navigation in a voice recognition system
US20030149566A1 (en) System and method for a spoken language interface to a large database of changing records
US7689425B2 (en) Quality of service call routing system using counselor and speech recognition engine and method thereof
US20020143548A1 (en) Automated database assistance via telephone
US20020184023A1 (en) Multi-context conversational environment system and method
US20040153322A1 (en) Menu-based, speech actuated system with speak-ahead capability
EP2289231A1 (fr) Système et procédé utilisant la recherche vocale pour localiser un produit dans un magasin à partir d'un téléphone
US8428241B2 (en) Semi-supervised training of destination map for call handling applications
US20060020471A1 (en) Method and apparatus for robustly locating user barge-ins in voice-activated command systems
TWI698756B (zh) 查詢服務之系統與方法
JP2004518195A (ja) データベース言語モデルによる自動対話システム
US11056113B2 (en) Conversation guidance method of speech recognition system
EP1224797A1 (fr) Procede et systeme d'interface vocale
JP2003016087A (ja) 自動セクタ情報システムを動作する方法及びシステム

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20020412

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20040401