HK1010597A1 - Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system - Google Patents

Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system

Info

Publication number
HK1010597A1
HK1010597A1 HK98111612A HK98111612A HK1010597A1 HK 1010597 A1 HK1010597 A1 HK 1010597A1 HK 98111612 A HK98111612 A HK 98111612A HK 98111612 A HK98111612 A HK 98111612A HK 1010597 A1 HK1010597 A1 HK 1010597A1
Authority
HK
Hong Kong
Prior art keywords
speech recognition
recognition system
large vocabulary
vocabulary speech
constraints
Prior art date
Application number
HK98111612A
Other languages
English (en)
Inventor
Michael S Phillips
John N Nguyen
Original Assignee
Speechworks Int Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Speechworks Int Inc filed Critical Speechworks Int Inc
Publication of HK1010597A1 publication Critical patent/HK1010597A1/xx

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
HK98111612A 1995-05-26 1998-10-29 Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system HK1010597A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US45144895A 1995-05-26 1995-05-26
PCT/US1996/006634 WO1996037881A2 (fr) 1995-05-26 1996-05-09 Appareil et procede permettant une adaptation dynamique d'un systeme de reconnaissance vocale a vocabulaire tres etendu, tenant compte de contraintes imposees par une base de donnees de ce systeme

Publications (1)

Publication Number Publication Date
HK1010597A1 true HK1010597A1 (en) 1999-06-25

Family

ID=23792258

Family Applications (1)

Application Number Title Priority Date Filing Date
HK98111612A HK1010597A1 (en) 1995-05-26 1998-10-29 Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system

Country Status (7)

Country Link
US (1) US6501833B2 (fr)
EP (2) EP0838073B1 (fr)
AU (1) AU5738296A (fr)
CA (1) CA2220004A1 (fr)
DE (1) DE69622565T2 (fr)
HK (1) HK1010597A1 (fr)
WO (1) WO1996037881A2 (fr)

Families Citing this family (89)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE59904741D1 (de) * 1998-05-11 2003-04-30 Siemens Ag Anordnung und verfahren zur erkennung eines vorgegebenen wortschatzes in gesprochener sprache durch einen rechner
US6519562B1 (en) * 1999-02-25 2003-02-11 Speechworks International, Inc. Dynamic semantic control of a speech recognition system
DE19942869A1 (de) * 1999-09-08 2001-03-15 Volkswagen Ag Verfahren und Einrichtung zum Betrieb einer sprachgesteuerten Einrichtung bei Kraftfahrzeugen
US6687689B1 (en) 2000-06-16 2004-02-03 Nusuara Technologies Sdn. Bhd. System and methods for document retrieval using natural language-based queries
EP1172802B1 (fr) * 2000-07-14 2007-08-08 Siemens Aktiengesellschaft Adaptation à un locuteur des transcriptions phonétiques d'un lexique de prononciation
US7275033B1 (en) 2000-09-30 2007-09-25 Intel Corporation Method and system for using rule-based knowledge to build a class-based domain specific statistical language model
US20040190688A1 (en) * 2003-03-31 2004-09-30 Timmins Timothy A. Communications methods and systems using voiceprints
US7103533B2 (en) * 2001-02-21 2006-09-05 International Business Machines Corporation Method for preserving contextual accuracy in an extendible speech recognition language model
US7698228B2 (en) 2001-04-27 2010-04-13 Accenture Llp Tracking purchases in a location-based services system
US7970648B2 (en) 2001-04-27 2011-06-28 Accenture Global Services Limited Advertising campaign and business listing management for a location-based services system
US7437295B2 (en) * 2001-04-27 2008-10-14 Accenture Llp Natural language processing for a location-based services system
US6944447B2 (en) * 2001-04-27 2005-09-13 Accenture Llp Location-based services
US6848542B2 (en) 2001-04-27 2005-02-01 Accenture Llp Method for passive mining of usage information in a location-based services system
US20030037053A1 (en) * 2001-08-09 2003-02-20 Zhong-Hua Wang Method and apparatus for automatically updating stock and mutual fund grammars in speech recognition systems
EP1306768A1 (fr) * 2001-10-26 2003-05-02 Sensoria Technology Limited Méthode et système d'apprentissage adaptatif et de reconnaissance de formes
US7398209B2 (en) * 2002-06-03 2008-07-08 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7693720B2 (en) 2002-07-15 2010-04-06 Voicebox Technologies, Inc. Mobile systems and methods for responding to natural language speech utterance
US20040167892A1 (en) * 2003-02-25 2004-08-26 Evan Kirshenbaum Apparatus and method for translating between different role-based vocabularies for multiple users
US7146319B2 (en) * 2003-03-31 2006-12-05 Novauris Technologies Ltd. Phonetically based speech recognition system and method
JP2004303148A (ja) * 2003-04-01 2004-10-28 Canon Inc 情報処理装置
GB2404040A (en) * 2003-07-16 2005-01-19 Canon Kk Lattice matching
US20050055197A1 (en) * 2003-08-14 2005-03-10 Sviatoslav Karavansky Linguographic method of compiling word dictionaries and lexicons for the memories of electronic speech-recognition devices
DE10359624A1 (de) * 2003-12-18 2005-07-21 Daimlerchrysler Ag Spracherkennung mit sprecherunabhängiger Vokabularerweiterung
US7403941B2 (en) * 2004-04-23 2008-07-22 Novauris Technologies Ltd. System, method and technique for searching structured databases
US20060009974A1 (en) * 2004-07-09 2006-01-12 Matsushita Electric Industrial Co., Ltd. Hands-free voice dialing for portable and remote devices
US20060036438A1 (en) * 2004-07-13 2006-02-16 Microsoft Corporation Efficient multimodal method to provide input to a computing device
US7742911B2 (en) * 2004-10-12 2010-06-22 At&T Intellectual Property Ii, L.P. Apparatus and method for spoken language understanding by using semantic role labeling
EP1803116B1 (fr) * 2004-10-19 2009-01-28 France Télécom Procede de reconnaissance vocale comprenant une etape d ' insertion de marqueurs temporels et systeme correspondant
US8942985B2 (en) 2004-11-16 2015-01-27 Microsoft Corporation Centralized method and system for clarifying voice commands
US7778821B2 (en) * 2004-11-24 2010-08-17 Microsoft Corporation Controlled manipulation of characters
GB2428853A (en) * 2005-07-22 2007-02-07 Novauris Technologies Ltd Speech recognition application specific dictionary
US7640160B2 (en) 2005-08-05 2009-12-29 Voicebox Technologies, Inc. Systems and methods for responding to natural language speech utterance
US7620549B2 (en) 2005-08-10 2009-11-17 Voicebox Technologies, Inc. System and method of supporting adaptive misrecognition in conversational speech
US7949529B2 (en) 2005-08-29 2011-05-24 Voicebox Technologies, Inc. Mobile systems and methods of supporting natural language human-machine interactions
EP1934971A4 (fr) 2005-08-31 2010-10-27 Voicebox Technologies Inc Amelioration de precision de parole dynamique
US9697230B2 (en) 2005-11-09 2017-07-04 Cxense Asa Methods and apparatus for dynamic presentation of advertising, factual, and informational content using enhanced metadata in search-driven media applications
US7801910B2 (en) * 2005-11-09 2010-09-21 Ramp Holdings, Inc. Method and apparatus for timed tagging of media content
US20070106685A1 (en) * 2005-11-09 2007-05-10 Podzinger Corp. Method and apparatus for updating speech recognition databases and reindexing audio and video content using the same
US20070118873A1 (en) * 2005-11-09 2007-05-24 Bbnt Solutions Llc Methods and apparatus for merging media content
US20070106646A1 (en) * 2005-11-09 2007-05-10 Bbnt Solutions Llc User-directed navigation of multimedia search results
US9697231B2 (en) * 2005-11-09 2017-07-04 Cxense Asa Methods and apparatus for providing virtual media channels based on media search
US7734460B2 (en) * 2005-12-20 2010-06-08 Microsoft Corporation Time asynchronous decoding for long-span trajectory model
US7877256B2 (en) * 2006-02-17 2011-01-25 Microsoft Corporation Time synchronous decoding for long-span hidden trajectory model
US7925975B2 (en) 2006-03-10 2011-04-12 Microsoft Corporation Searching for commands to execute in applications
US20070239444A1 (en) * 2006-03-29 2007-10-11 Motorola, Inc. Voice signal perturbation for speech recognition
US8214213B1 (en) * 2006-04-27 2012-07-03 At&T Intellectual Property Ii, L.P. Speech recognition based on pronunciation modeling
US7890328B1 (en) * 2006-09-07 2011-02-15 At&T Intellectual Property Ii, L.P. Enhanced accuracy for speech recognition grammars
US8073681B2 (en) 2006-10-16 2011-12-06 Voicebox Technologies, Inc. System and method for a cooperative conversational voice user interface
US7831431B2 (en) * 2006-10-31 2010-11-09 Honda Motor Co., Ltd. Voice recognition updates via remote broadcast signal
US7818176B2 (en) 2007-02-06 2010-10-19 Voicebox Technologies, Inc. System and method for selecting and presenting advertisements based on natural language processing of voice-based input
US7983913B2 (en) * 2007-07-31 2011-07-19 Microsoft Corporation Understanding spoken location information based on intersections
US7788095B2 (en) * 2007-11-18 2010-08-31 Nice Systems, Ltd. Method and apparatus for fast search in call-center monitoring
US8140335B2 (en) 2007-12-11 2012-03-20 Voicebox Technologies, Inc. System and method for providing a natural language voice user interface in an integrated voice navigation services environment
KR20090065102A (ko) * 2007-12-17 2009-06-22 한국전자통신연구원 어휘 디코딩 방법 및 장치
US8312022B2 (en) 2008-03-21 2012-11-13 Ramp Holdings, Inc. Search engine optimization
US20090245646A1 (en) * 2008-03-28 2009-10-01 Microsoft Corporation Online Handwriting Expression Recognition
US8536976B2 (en) * 2008-06-11 2013-09-17 Veritrix, Inc. Single-channel multi-factor authentication
US8589161B2 (en) 2008-05-27 2013-11-19 Voicebox Technologies, Inc. System and method for an integrated, multi-modal, multi-device natural language voice services environment
US9305548B2 (en) 2008-05-27 2016-04-05 Voicebox Technologies Corporation System and method for an integrated, multi-modal, multi-device natural language voice services environment
US8166297B2 (en) * 2008-07-02 2012-04-24 Veritrix, Inc. Systems and methods for controlling access to encrypted data stored on a mobile device
WO2010051342A1 (fr) * 2008-11-03 2010-05-06 Veritrix, Inc. Authentification d'utilisateur pour des réseaux sociaux
US20100166314A1 (en) * 2008-12-30 2010-07-01 Microsoft Corporation Segment Sequence-Based Handwritten Expression Recognition
US8326637B2 (en) 2009-02-20 2012-12-04 Voicebox Technologies, Inc. System and method for processing multi-modal device interactions in a natural language voice services environment
US9502025B2 (en) 2009-11-10 2016-11-22 Voicebox Technologies Corporation System and method for providing a natural language content dedication service
US9171541B2 (en) 2009-11-10 2015-10-27 Voicebox Technologies Corporation System and method for hybrid processing in a natural language voice services environment
US9045098B2 (en) * 2009-12-01 2015-06-02 Honda Motor Co., Ltd. Vocabulary dictionary recompile for in-vehicle audio system
US20110131040A1 (en) * 2009-12-01 2011-06-02 Honda Motor Co., Ltd Multi-mode speech recognition
US9263045B2 (en) 2011-05-17 2016-02-16 Microsoft Technology Licensing, Llc Multi-mode text input
US20120330880A1 (en) * 2011-06-23 2012-12-27 Microsoft Corporation Synthetic data generation
KR20130014893A (ko) * 2011-08-01 2013-02-12 한국전자통신연구원 음성 인식 장치 및 방법
US9640175B2 (en) * 2011-10-07 2017-05-02 Microsoft Technology Licensing, Llc Pronunciation learning from user correction
US9620111B1 (en) * 2012-05-01 2017-04-11 Amazon Technologies, Inc. Generation and maintenance of language model
US10957310B1 (en) 2012-07-23 2021-03-23 Soundhound, Inc. Integrated programming framework for speech and text understanding with meaning parsing
US11295730B1 (en) 2014-02-27 2022-04-05 Soundhound, Inc. Using phonetic variants in a local context to improve natural language understanding
CN107003996A (zh) 2014-09-16 2017-08-01 声钰科技 语音商务
US9898459B2 (en) 2014-09-16 2018-02-20 Voicebox Technologies Corporation Integration of domain information into state transitions of a finite state transducer for natural language processing
WO2016061309A1 (fr) 2014-10-15 2016-04-21 Voicebox Technologies Corporation Système et procédé pour fournir des réponses de suivi à des entrées préalables en langage naturel d'un utilisateur
US10431214B2 (en) 2014-11-26 2019-10-01 Voicebox Technologies Corporation System and method of determining a domain and/or an action related to a natural language input
US10614799B2 (en) 2014-11-26 2020-04-07 Voicebox Technologies Corporation System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance
US9384188B1 (en) 2015-01-27 2016-07-05 Microsoft Technology Licensing, Llc Transcription correction using multi-token structures
US9799327B1 (en) * 2016-02-26 2017-10-24 Google Inc. Speech recognition with attention-based recurrent neural networks
JP6744025B2 (ja) * 2016-06-21 2020-08-19 日本電気株式会社 作業支援システム、管理サーバ、携帯端末、作業支援方法およびプログラム
WO2018023106A1 (fr) 2016-07-29 2018-02-01 Erik SWART Système et procédé de désambiguïsation de demandes de traitement de langage naturel
CN108346073B (zh) * 2017-01-23 2021-11-02 北京京东尚科信息技术有限公司 一种语音购物方法和装置
US11568007B2 (en) * 2018-10-03 2023-01-31 Walmart Apollo, Llc Method and apparatus for parsing and representation of digital inquiry related natural language
US11282512B2 (en) * 2018-10-27 2022-03-22 Qualcomm Incorporated Automatic grammar augmentation for robust voice command recognition
US11227065B2 (en) 2018-11-06 2022-01-18 Microsoft Technology Licensing, Llc Static data masking
US11954719B2 (en) * 2019-05-30 2024-04-09 Ncr Voyix Corporation Personalized voice-based assistance
CN115859975B (zh) * 2023-02-07 2023-05-09 支付宝(杭州)信息技术有限公司 数据处理方法、装置及设备

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4156868A (en) * 1977-05-05 1979-05-29 Bell Telephone Laboratories, Incorporated Syntactic word recognizer
US4333152A (en) * 1979-02-05 1982-06-01 Best Robert M TV Movies that talk back
US4489434A (en) * 1981-10-05 1984-12-18 Exxon Corporation Speech recognition method and apparatus
US4481593A (en) * 1981-10-05 1984-11-06 Exxon Corporation Continuous speech recognition
US4956865A (en) * 1985-01-30 1990-09-11 Northern Telecom Limited Speech recognition
JPS61252596A (ja) * 1985-05-02 1986-11-10 株式会社日立製作所 文字音声通信方式及び装置
US4980918A (en) * 1985-05-09 1990-12-25 International Business Machines Corporation Speech recognition system with efficient storage and rapid assembly of phonological graphs
US4783803A (en) * 1985-11-12 1988-11-08 Dragon Systems, Inc. Speech recognition apparatus and method
DE3766124D1 (de) * 1986-02-15 1990-12-20 Smiths Industries Plc Verfahren und vorrichtung zur sprachverarbeitung.
US4837831A (en) * 1986-10-15 1989-06-06 Dragon Systems, Inc. Method for creating and using multiple-word sound models in speech recognition
US4829576A (en) * 1986-10-21 1989-05-09 Dragon Systems, Inc. Voice recognition system
JPH02195400A (ja) * 1989-01-24 1990-08-01 Canon Inc 音声認識装置
US5263117A (en) * 1989-10-26 1993-11-16 International Business Machines Corporation Method and apparatus for finding the best splits in a decision tree for a language model for a speech recognizer
US5202952A (en) * 1990-06-22 1993-04-13 Dragon Systems, Inc. Large-vocabulary continuous speech prefiltering and processing system
WO1992006436A2 (fr) * 1990-10-03 1992-04-16 Thinking Machines Corporation Systeme d'ordinateur parallele
JP2768561B2 (ja) * 1990-12-19 1998-06-25 富士通株式会社 ネットワーク変形装置および作成装置
US5268990A (en) * 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5212730A (en) * 1991-07-01 1993-05-18 Texas Instruments Incorporated Voice recognition of proper names using text-derived recognition models
US5283833A (en) * 1991-09-19 1994-02-01 At&T Bell Laboratories Method and apparatus for speech processing using morphology and rhyming
US5390278A (en) * 1991-10-08 1995-02-14 Bell Canada Phoneme based speech recognition
US5267345A (en) * 1992-02-10 1993-11-30 International Business Machines Corporation Speech recognition apparatus which predicts word classes from context and words from word classes
US5333275A (en) * 1992-06-23 1994-07-26 Wheatley Barbara J System and method for time aligning speech
US5325421A (en) * 1992-08-24 1994-06-28 At&T Bell Laboratories Voice directed communications system platform
US5428707A (en) * 1992-11-13 1995-06-27 Dragon Systems, Inc. Apparatus and methods for training speech recognition systems and their users and otherwise improving speech recognition performance
DE4397100T1 (de) * 1992-12-31 1995-11-23 Apple Computer Rekursive Grammatik mit endlicher Zustandsanzahl
US5457770A (en) * 1993-08-19 1995-10-10 Kabushiki Kaisha Meidensha Speaker independent speech recognition system and method using neural network and/or DP matching technique
US6125347A (en) * 1993-09-29 2000-09-26 L&H Applications Usa, Inc. System for controlling multiple user application programs by spoken input
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals

Also Published As

Publication number Publication date
US6501833B2 (en) 2002-12-31
DE69622565T2 (de) 2003-04-03
AU5738296A (en) 1996-12-11
EP0838073B1 (fr) 2002-07-24
EP1199707A2 (fr) 2002-04-24
EP0838073A2 (fr) 1998-04-29
CA2220004A1 (fr) 1996-11-28
WO1996037881A2 (fr) 1996-11-28
EP1199707A3 (fr) 2002-05-02
US20020048350A1 (en) 2002-04-25
WO1996037881A3 (fr) 1997-01-16
DE69622565D1 (de) 2002-08-29

Similar Documents

Publication Publication Date Title
HK1010597A1 (en) Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system
IL120622A0 (en) System and method for multimodal interactive speech and language training
AU3274395A (en) Method and system for continuous speech recognition using voting techniques
EP0740263A3 (fr) Méthode pour entraîner un système de reconnaissance avec des modèles de caractères
AU4313897A (en) Method and apparatus for processing the output of a speech recognition engine
NZ294659A (en) Method of and apparatus for generating a vocabulary from an input speech signal
EP0702351A3 (fr) Procédé et dispositif d'analyse des événements d'entrée audio dans un système de reconnaissance de parole
AU8633191A (en) Method and apparatus for speech recognition
DE69517705D1 (de) Verfahren und vorrichtung zur anpassung der grösse eines sprachmodells in einem spracherkennungssystem
GB2311640B (en) System and method for generating and using context dependent sub-syllable models to recognize a tonal language
GB9811553D0 (en) Method and apparatus for securely handling data in a database of biometrics and associated data
DE69625950T2 (de) Verfahren und Vorrichtung zur Spracherkennung und Übersetzungssystem
GB2331392B (en) A fast vocabulary independent method and apparatus for spotting words in speech
GB9704694D0 (en) System for recognizing spoken sounds from continuous speech and method of using same
FI922606A (fi) Puheentunnistusmenetelmä ja -järjestelmä puheella ohjattavaa puhelinta varten
EP0750293A3 (fr) Méthode pour dessiner des modèles de transition et méthode de reconnaissance de voix et appareil utilisant cette méthode
AU2169700A (en) A method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system
AU3274295A (en) Method and system for identifying spoken sounds in continuous speech by comparing classifier outputs
AU2797199A (en) Apparatus and method for providing speech input to a speech recognition system
EP0779609A3 (fr) Système d'adaptation des signaux de parole et de reconnaissance de la parole
EP0485315A3 (en) Method and apparatus for speech analysis and speech recognition
DE69616724T2 (de) Verfahren und System für die Spracherkennung
DE69613644D1 (de) Verfahren zur Erzeugung eines Sprachmodels und Spracherkennungsvorrichtung
HK1013879A1 (en) Speech recognition apparatus using neural network and learning method therefor
EP0482395A3 (en) Method and apparatus for generating models of spoken words based on a small number of utterances

Legal Events

Date Code Title Description
PF Patent in force
PC Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee)

Effective date: 20090509