CN1894658B - 用于话音激活网页的系统和方法 - Google Patents

用于话音激活网页的系统和方法 Download PDF

Info

Publication number
CN1894658B
CN1894658B CN2004800157375A CN200480015737A CN1894658B CN 1894658 B CN1894658 B CN 1894658B CN 2004800157375 A CN2004800157375 A CN 2004800157375A CN 200480015737 A CN200480015737 A CN 200480015737A CN 1894658 B CN1894658 B CN 1894658B
Authority
CN
China
Prior art keywords
window
new
data
voice request
data structure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2004800157375A
Other languages
English (en)
Chinese (zh)
Other versions
CN1894658A (zh
Inventor
迈克尔·L·查尼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Columbia University in the City of New York
Original Assignee
Columbia University in the City of New York
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Columbia University in the City of New York filed Critical Columbia University in the City of New York
Publication of CN1894658A publication Critical patent/CN1894658A/zh
Application granted granted Critical
Publication of CN1894658B publication Critical patent/CN1894658B/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193Formal grammars, e.g. finite state automata, context free grammars or word networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
CN2004800157375A 2003-06-06 2004-06-07 用于话音激活网页的系统和方法 Expired - Fee Related CN1894658B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US47681503P 2003-06-06 2003-06-06
US60/476,815 2003-06-06
PCT/US2004/018109 WO2004109471A2 (en) 2003-06-06 2004-06-07 System and method for voice activating web pages

Publications (2)

Publication Number Publication Date
CN1894658A CN1894658A (zh) 2007-01-10
CN1894658B true CN1894658B (zh) 2012-02-29

Family

ID=33511821

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2004800157375A Expired - Fee Related CN1894658B (zh) 2003-06-06 2004-06-07 用于话音激活网页的系统和方法

Country Status (5)

Country Link
US (1) US9202467B2 (enExample)
EP (1) EP1631899A4 (enExample)
JP (2) JP4891072B2 (enExample)
CN (1) CN1894658B (enExample)
WO (1) WO2004109471A2 (enExample)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050273487A1 (en) * 2004-06-04 2005-12-08 Comverse, Ltd. Automatic multimodal enabling of existing web content
US7827033B2 (en) * 2006-12-06 2010-11-02 Nuance Communications, Inc. Enabling grammars in web page frames
US8831950B2 (en) * 2008-04-07 2014-09-09 Nuance Communications, Inc. Automated voice enablement of a web page
US10157612B2 (en) 2012-08-02 2018-12-18 Nuance Communications, Inc. Methods and apparatus for voice-enabling a web application
US9400633B2 (en) 2012-08-02 2016-07-26 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US9292253B2 (en) 2012-08-02 2016-03-22 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US9292252B2 (en) 2012-08-02 2016-03-22 Nuance Communications, Inc. Methods and apparatus for voiced-enabling a web application
US9781262B2 (en) * 2012-08-02 2017-10-03 Nuance Communications, Inc. Methods and apparatus for voice-enabling a web application
US9547468B2 (en) * 2014-03-31 2017-01-17 Microsoft Technology Licensing, Llc Client-side personal voice web navigation
US10147421B2 (en) 2014-12-16 2018-12-04 Microcoft Technology Licensing, Llc Digital assistant voice input integration
CN113921008A (zh) * 2021-10-11 2022-01-11 杭州易知微科技有限公司 跨终端语音交互系统和方法

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6313854B1 (en) * 1998-10-16 2001-11-06 International Business Machines Corporation Display mechanism for HTML frames

Family Cites Families (58)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3286339B2 (ja) * 1992-03-25 2002-05-27 株式会社リコー ウインドウ画面制御装置
EP0607615B1 (en) * 1992-12-28 1999-09-15 Kabushiki Kaisha Toshiba Speech recognition interface system suitable for window systems and speech mail systems
JP3725566B2 (ja) * 1992-12-28 2005-12-14 株式会社東芝 音声認識インターフェース
US6965864B1 (en) * 1995-04-10 2005-11-15 Texas Instruments Incorporated Voice activated hypermedia systems using grammatical metadata
US5819220A (en) * 1996-09-30 1998-10-06 Hewlett-Packard Company Web triggered word set boosting for speech interfaces to the world wide web
US5796393A (en) * 1996-11-08 1998-08-18 Compuserve Incorporated System for intergrating an on-line service community with a foreign service
US5960399A (en) * 1996-12-24 1999-09-28 Gte Internetworking Incorporated Client/server speech processor/recognizer
US6456974B1 (en) 1997-01-06 2002-09-24 Texas Instruments Incorporated System and method for adding speech recognition capabilities to java
JPH10222337A (ja) * 1997-02-13 1998-08-21 Meidensha Corp コンピュータシステム
US6078886A (en) * 1997-04-14 2000-06-20 At&T Corporation System and method for providing remote automatic speech recognition services via a packet network
US6101472A (en) * 1997-04-16 2000-08-08 International Business Machines Corporation Data processing system and method for navigating a network using a voice command
US20020002458A1 (en) * 1997-10-22 2002-01-03 David E. Owen System and method for representing complex information auditorially
US6418439B1 (en) * 1997-11-12 2002-07-09 Ncr Corporation Computer system and computer implemented method for translation of information into multiple media variations
US6282512B1 (en) * 1998-02-05 2001-08-28 Texas Instruments Incorporated Enhancement of markup language pages to support spoken queries
AU3104599A (en) 1998-03-20 1999-10-11 Inroad, Inc. Voice controlled web browser
US6233559B1 (en) * 1998-04-01 2001-05-15 Motorola, Inc. Speech control of multiple applications using applets
US6115686A (en) * 1998-04-02 2000-09-05 Industrial Technology Research Institute Hyper text mark up language document to speech converter
US6411952B1 (en) * 1998-06-24 2002-06-25 Compaq Information Technologies Group, Lp Method for learning character patterns to interactively control the scope of a web crawler
US6269336B1 (en) 1998-07-24 2001-07-31 Motorola, Inc. Voice browser for interactive services and methods thereof
EP1099152A4 (en) 1998-07-24 2004-05-19 Motorola Inc MARKING LANGUAGE FOR INTERACTIVE SERVICES AND METHODS THEREFOR
US6177936B1 (en) * 1998-08-20 2001-01-23 International Business Machines Corporation Browser hierarchical contextual information for web pages
US6499013B1 (en) * 1998-09-09 2002-12-24 One Voice Technologies, Inc. Interactive user interface using speech recognition and natural language processing
US6324511B1 (en) * 1998-10-01 2001-11-27 Mindmaker, Inc. Method of and apparatus for multi-modal information presentation to computer users with dyslexia, reading disabilities or visual impairment
US7137126B1 (en) * 1998-10-02 2006-11-14 International Business Machines Corporation Conversational computing via conversational virtual machine
US6470317B1 (en) * 1998-10-02 2002-10-22 Motorola, Inc. Markup language to allow for billing of interactive services and methods thereof
US6587822B2 (en) * 1998-10-06 2003-07-01 Lucent Technologies Inc. Web-based platform for interactive voice response (IVR)
US6636831B1 (en) * 1999-04-09 2003-10-21 Inroad, Inc. System and process for voice-controlled information retrieval
US6604075B1 (en) * 1999-05-20 2003-08-05 Lucent Technologies Inc. Web-based voice dialog interface
US20020032564A1 (en) * 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
US6462757B1 (en) * 1999-06-30 2002-10-08 International Business Machines Corporation Method, system and computer program product for locating a window of a windows operating system in a computer system
US7590538B2 (en) * 1999-08-31 2009-09-15 Accenture Llp Voice recognition system for navigating on the internet
US6578000B1 (en) * 1999-09-03 2003-06-10 Cisco Technology, Inc. Browser-based arrangement for developing voice enabled web applications using extensible markup language documents
US6718308B1 (en) * 2000-02-22 2004-04-06 Daniel L. Nolting Media presentation system controlled by voice to text commands
US6636246B1 (en) * 2000-03-17 2003-10-21 Vizible.Com Inc. Three dimensional spatial user interface
AU2001242153A1 (en) * 2000-03-17 2001-09-24 Vizible.Com Inc. A three dimensional spatial user interface
US7366766B2 (en) * 2000-03-24 2008-04-29 Eliza Corporation Web-based speech recognition with scripting and semantic objects
US7334050B2 (en) * 2000-06-07 2008-02-19 Nvidia International, Inc. Voice applications and voice-based interface
FR2810125B1 (fr) * 2000-06-08 2004-04-30 Interactive Speech Technologie Systeme de commande vocale d'une page stockee sur un serveur et telechargeable en vue de sa visualisation sur un dispositif client
EP1172064A3 (en) * 2000-07-13 2003-01-15 Computer Motion, Inc. Method and apparatus for accessing medical data over a network
JP2002041276A (ja) * 2000-07-24 2002-02-08 Sony Corp 対話型操作支援システム及び対話型操作支援方法、並びに記憶媒体
US7315830B1 (en) * 2000-08-11 2008-01-01 Nexus Company, Ltd. Method, system and computer program product for ordering merchandise in a global computer network environment
US7065712B2 (en) * 2000-09-01 2006-06-20 Nippon Telegraph And Telephone Corp. Scheme for posting advertisements on comprehensive information viewing device
US20020077823A1 (en) * 2000-10-13 2002-06-20 Andrew Fox Software development systems and methods
US20070226640A1 (en) * 2000-11-15 2007-09-27 Holbrook David M Apparatus and methods for organizing and/or presenting data
ATE391986T1 (de) * 2000-11-23 2008-04-15 Ibm Sprachnavigation in webanwendungen
AU2002227096A1 (en) * 2000-12-01 2002-06-11 The Trustees Of Columbia University In The City Of New York A method and system for voice activating web pages
US6658414B2 (en) * 2001-03-06 2003-12-02 Topic Radio, Inc. Methods, systems, and computer program products for generating and providing access to end-user-definable voice portals
ITRM20010126A1 (it) * 2001-03-12 2002-09-12 Mediavoice S R L Metodo di abilitazione dell'interazione vocale di una pagina o di un sito web.
GB2378776A (en) * 2001-05-22 2003-02-19 Canon Kk Apparatus and method for managing a multi-modal interface in which the inputs feedback on each other
JP2003036403A (ja) * 2001-07-23 2003-02-07 Asahi Kasei Corp 契約違反者利用防止システム
US20020010715A1 (en) * 2001-07-26 2002-01-24 Garry Chinn System and method for browsing using a limited display device
US20030078973A1 (en) * 2001-09-25 2003-04-24 Przekop Michael V. Web-enabled system and method for on-demand distribution of transcript-synchronized video/audio records of legal proceedings to collaborative workgroups
US6985865B1 (en) * 2001-09-26 2006-01-10 Sprint Spectrum L.P. Method and system for enhanced response to voice commands in a voice command platform
US7036080B1 (en) * 2001-11-30 2006-04-25 Sap Labs, Inc. Method and apparatus for implementing a speech interface for a GUI
US7260530B2 (en) * 2002-02-15 2007-08-21 Bevocal, Inc. Enhanced go-back feature system and method for use in a voice portal
US20030197723A1 (en) * 2002-04-19 2003-10-23 Young Brian R. Methods, apparatus and computer program products for providing network device status information
US7032169B2 (en) * 2002-05-22 2006-04-18 International Business Machines Corporation Method and system for distributed coordination of multiple modalities of computer-user interaction
US7324633B2 (en) * 2002-05-30 2008-01-29 At&T Delaware Intellectual Property, Inc. Web integrated interactive voice response

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6313854B1 (en) * 1998-10-16 2001-11-06 International Business Machines Corporation Display mechanism for HTML frames

Also Published As

Publication number Publication date
US9202467B2 (en) 2015-12-01
JP2011227507A (ja) 2011-11-10
WO2004109471A2 (en) 2004-12-16
CN1894658A (zh) 2007-01-10
US20050143975A1 (en) 2005-06-30
JP4891072B2 (ja) 2012-03-07
WO2004109471A3 (en) 2005-06-09
EP1631899A2 (en) 2006-03-08
EP1631899A4 (en) 2007-07-18
JP2006527400A (ja) 2006-11-30

Similar Documents

Publication Publication Date Title
US7640163B2 (en) Method and system for voice activating web pages
US7146323B2 (en) Method and system for gathering information by voice input
US6029135A (en) Hypertext navigation system controlled by spoken words
US7650284B2 (en) Enabling voice click in a multimodal page
EP2494473B1 (en) Transforming components of a web page to voice prompts
US8055713B2 (en) Email application with user voice interface
US9083798B2 (en) Enabling voice selection of user preferences
CN100397340C (zh) 以对话为目的的应用抽象
US7212971B2 (en) Control apparatus for enabling a user to communicate by speech with a processor-controlled apparatus
KR101042119B1 (ko) 음성 이해 시스템, 및 컴퓨터 판독가능 기록 매체
KR101066741B1 (ko) 컴퓨터 시스템과 동적으로 상호작용하기 위한 컴퓨터 구현 방법, 시스템, 및 컴퓨터 판독가능 기록 매체
JP2011227507A (ja) ウェブ・ページを音声付勢するためのシステム及び方法
US7171361B2 (en) Idiom handling in voice service systems
US20100094635A1 (en) System for Voice-Based Interaction on Web Pages
US20050004800A1 (en) Combining use of a stepwise markup language and an object oriented development tool
JP2004310748A (ja) ユーザ入力に基づくデータの提示
US7653546B2 (en) Method and system for efficient voice-based programming
EP1209660B1 (en) Voice navigation in web applications
Griol et al. The VoiceApp system: Speech technologies to access the semantic web
EP1881685B1 (en) A method and system for voice activating web pages

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120229

Termination date: 20190607