TWI346322B - Method and medium for adaptive selection of vocabulary and acoustic models for speech recognition - Google Patents

Method and medium for adaptive selection of vocabulary and acoustic models for speech recognition

Info

Publication number
TWI346322B
TWI346322B TW092107596A TW92107596A TWI346322B TW I346322 B TWI346322 B TW I346322B TW 092107596 A TW092107596 A TW 092107596A TW 92107596 A TW92107596 A TW 92107596A TW I346322 B TWI346322 B TW I346322B
Authority
TW
Taiwan
Prior art keywords
vocabulary
medium
speech recognition
acoustic models
adaptive selection
Prior art date
Application number
TW092107596A
Other languages
English (en)
Other versions
TW200305140A (en
Inventor
Sam Mazza
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Publication of TW200305140A publication Critical patent/TW200305140A/zh
Application granted granted Critical
Publication of TWI346322B publication Critical patent/TWI346322B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
TW092107596A 2002-04-05 2003-04-03 Method and medium for adaptive selection of vocabulary and acoustic models for speech recognition TWI346322B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/115,936 US20030191639A1 (en) 2002-04-05 2002-04-05 Dynamic and adaptive selection of vocabulary and acoustic models based on a call context for speech recognition

Publications (2)

Publication Number Publication Date
TW200305140A TW200305140A (en) 2003-10-16
TWI346322B true TWI346322B (en) 2011-08-01

Family

ID=28673872

Family Applications (1)

Application Number Title Priority Date Filing Date
TW092107596A TWI346322B (en) 2002-04-05 2003-04-03 Method and medium for adaptive selection of vocabulary and acoustic models for speech recognition

Country Status (6)

Country Link
US (1) US20030191639A1 (zh)
EP (1) EP1497825A1 (zh)
CN (1) CN100407291C (zh)
AU (1) AU2003218398A1 (zh)
TW (1) TWI346322B (zh)
WO (1) WO2003088211A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI502582B (zh) * 2013-04-03 2015-10-01 Chung Han Interlingua Knowledge Co Ltd 服務點之語音客服系統

Families Citing this family (93)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060143007A1 (en) * 2000-07-24 2006-06-29 Koh V E User interaction with voice information services
US20050197405A1 (en) * 2000-11-07 2005-09-08 Li Chiang J. Treatment of hematologic tumors and cancers with beta-lapachone, a broad spectrum anti-cancer agent
US7389228B2 (en) * 2002-12-16 2008-06-17 International Business Machines Corporation Speaker adaptation of vocabulary for speech recognition
EP1599867B1 (en) * 2003-03-01 2008-02-13 Robert E. Coifman Improving the transcription accuracy of speech recognition software
CA2486128C (en) * 2003-10-30 2011-08-23 At&T Corp. System and method for using meta-data dependent language modeling for automatic speech recognition
CA2486125C (en) * 2003-10-30 2011-02-08 At&T Corp. A system and method of using meta-data in speech-processing
EP1687961A2 (en) * 2003-11-14 2006-08-09 Voice Signal Technologies Inc. Installing language modules in a mobile communication device
US20050113021A1 (en) * 2003-11-25 2005-05-26 G Squared, Llc Wireless communication system for media transmission, production, recording, reinforcement and monitoring in real-time
GB0328035D0 (en) * 2003-12-03 2004-01-07 British Telecomm Communications method and system
US8050918B2 (en) * 2003-12-11 2011-11-01 Nuance Communications, Inc. Quality evaluation tool for dynamic voice portals
US7660715B1 (en) * 2004-01-12 2010-02-09 Avaya Inc. Transparent monitoring and intervention to improve automatic adaptation of speech models
DE102004012148A1 (de) * 2004-03-12 2005-10-06 Siemens Ag Spracherkennung unter Berücksichtigung einer geografischen Position
US7873149B2 (en) 2004-06-01 2011-01-18 Verizon Business Global Llc Systems and methods for gathering information
US8392193B2 (en) * 2004-06-01 2013-03-05 Verizon Business Global Llc Systems and methods for performing speech recognition using constraint based processing
US8036893B2 (en) * 2004-07-22 2011-10-11 Nuance Communications, Inc. Method and system for identifying and correcting accent-induced speech recognition difficulties
US7783028B2 (en) * 2004-09-30 2010-08-24 International Business Machines Corporation System and method of using speech recognition at call centers to improve their efficiency and customer satisfaction
KR101221172B1 (ko) * 2005-02-03 2013-01-11 뉘앙스 커뮤니케이션즈, 인코포레이티드 이동 통신 장치의 음성 어휘를 자동으로 확장하는 방법 및장치
US7865362B2 (en) 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
US8200495B2 (en) * 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en) * 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7949533B2 (en) * 2005-02-04 2011-05-24 Vococollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US7895039B2 (en) 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US20060282265A1 (en) * 2005-06-10 2006-12-14 Steve Grobman Methods and apparatus to perform enhanced speech to text processing
US8654937B2 (en) * 2005-11-30 2014-02-18 International Business Machines Corporation System and method for call center agent quality assurance using biometric detection technologies
US9165557B2 (en) * 2006-02-06 2015-10-20 Nec Corporation Voice recognizing apparatus, voice recognizing method, and program for recognizing voice
US8762148B2 (en) * 2006-02-27 2014-06-24 Nec Corporation Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program
US7653543B1 (en) 2006-03-24 2010-01-26 Avaya Inc. Automatic signal adjustment based on intelligibility
US8510109B2 (en) 2007-08-22 2013-08-13 Canyon Ip Holdings Llc Continuous speech transcription performance indication
US8234120B2 (en) * 2006-07-26 2012-07-31 Nuance Communications, Inc. Performing a safety analysis for user-defined voice commands to ensure that the voice commands do not cause speech recognition ambiguities
CA2660960C (en) * 2006-08-15 2014-07-08 Intellisist, Inc. Managing a dynamic call flow during automated call processing
US7925508B1 (en) 2006-08-22 2011-04-12 Avaya Inc. Detection of extreme hypoglycemia or hyperglycemia based on automatic analysis of speech patterns
US7962342B1 (en) 2006-08-22 2011-06-14 Avaya Inc. Dynamic user interface for the temporarily impaired based on automatic analysis for speech patterns
US8938392B2 (en) * 2007-02-27 2015-01-20 Nuance Communications, Inc. Configuring a speech engine for a multimodal application based on location
US9208783B2 (en) 2007-02-27 2015-12-08 Nuance Communications, Inc. Altering behavior of a multimodal application based on location
US9973450B2 (en) 2007-09-17 2018-05-15 Amazon Technologies, Inc. Methods and systems for dynamically updating web service profile information by parsing transcribed message strings
TWI349266B (en) * 2007-04-13 2011-09-21 Qisda Corp Voice recognition system and method
US8041344B1 (en) 2007-06-26 2011-10-18 Avaya Inc. Cooling off period prior to sending dependent on user's state
US20130070911A1 (en) * 2007-07-22 2013-03-21 Daniel O'Sullivan Adaptive Accent Vocie Communications System (AAVCS)
US8255224B2 (en) 2008-03-07 2012-08-28 Google Inc. Voice recognition grammar selection based on context
US8571849B2 (en) * 2008-09-30 2013-10-29 At&T Intellectual Property I, L.P. System and method for enriching spoken language translation with prosodic information
JP5377430B2 (ja) * 2009-07-08 2013-12-25 本田技研工業株式会社 質問応答データベース拡張装置および質問応答データベース拡張方法
KR20110006004A (ko) * 2009-07-13 2011-01-20 삼성전자주식회사 결합인식단위 최적화 장치 및 그 방법
US8442827B2 (en) * 2010-06-18 2013-05-14 At&T Intellectual Property I, L.P. System and method for customized voice response
US8417530B1 (en) 2010-08-20 2013-04-09 Google Inc. Accent-influenced search results
US9704413B2 (en) 2011-03-25 2017-07-11 Educational Testing Service Non-scorable response filters for speech scoring systems
US9202465B2 (en) * 2011-03-25 2015-12-01 General Motors Llc Speech recognition dependent on text message content
US8990082B2 (en) * 2011-03-25 2015-03-24 Educational Testing Service Non-scorable response filters for speech scoring systems
US8914286B1 (en) * 2011-04-14 2014-12-16 Canyon IP Holdings, LLC Speech recognition with hierarchical networks
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9711167B2 (en) * 2012-03-13 2017-07-18 Nice Ltd. System and method for real-time speaker segmentation of audio interactions
US20130282844A1 (en) 2012-04-23 2013-10-24 Contact Solutions LLC Apparatus and methods for multi-mode asynchronous communication
US9635067B2 (en) 2012-04-23 2017-04-25 Verint Americas Inc. Tracing and asynchronous communication network and routing method
US20130325449A1 (en) 2012-05-31 2013-12-05 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10395672B2 (en) * 2012-05-31 2019-08-27 Elwha Llc Methods and systems for managing adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
US8843371B2 (en) * 2012-05-31 2014-09-23 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9305565B2 (en) 2012-05-31 2016-04-05 Elwha Llc Methods and systems for speech adaptation data
US9966064B2 (en) * 2012-07-18 2018-05-08 International Business Machines Corporation Dialect-specific acoustic language modeling and speech recognition
US9093072B2 (en) * 2012-07-20 2015-07-28 Microsoft Technology Licensing, Llc Speech and gesture recognition enhancement
US9734819B2 (en) * 2013-02-21 2017-08-15 Google Technology Holdings LLC Recognizing accented speech
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US9530103B2 (en) * 2013-04-04 2016-12-27 Cypress Semiconductor Corporation Combining of results from multiple decoders
US20140372118A1 (en) * 2013-06-17 2014-12-18 Speech Morphing Systems, Inc. Method and apparatus for exemplary chip architecture
US9305554B2 (en) * 2013-07-17 2016-04-05 Samsung Electronics Co., Ltd. Multi-level speech recognition
US9299340B2 (en) * 2013-10-07 2016-03-29 Honeywell International Inc. System and method for correcting accent induced speech in an aircraft cockpit utilizing a dynamic speech database
US10565984B2 (en) 2013-11-15 2020-02-18 Intel Corporation System and method for maintaining speech recognition dynamic dictionary
JP6080978B2 (ja) * 2013-11-20 2017-02-15 三菱電機株式会社 音声認識装置および音声認識方法
US20150149169A1 (en) * 2013-11-27 2015-05-28 At&T Intellectual Property I, L.P. Method and apparatus for providing mobile multimodal speech hearing aid
US11386886B2 (en) * 2014-01-28 2022-07-12 Lenovo (Singapore) Pte. Ltd. Adjusting speech recognition using contextual information
WO2015120263A1 (en) 2014-02-06 2015-08-13 Contact Solutions LLC Systems, apparatuses and methods for communication flow modification
CN103956169B (zh) * 2014-04-17 2017-07-21 北京搜狗科技发展有限公司 一种语音输入方法、装置和系统
US9858920B2 (en) * 2014-06-30 2018-01-02 GM Global Technology Operations LLC Adaptation methods and systems for speech systems
KR101619262B1 (ko) * 2014-11-14 2016-05-18 현대자동차 주식회사 음성인식 장치 및 방법
US9166881B1 (en) 2014-12-31 2015-10-20 Contact Solutions LLC Methods and apparatus for adaptive bandwidth-based communication management
US10325590B2 (en) * 2015-06-26 2019-06-18 Intel Corporation Language model modification for local speech recognition systems using remote sources
WO2017024248A1 (en) 2015-08-06 2017-02-09 Contact Solutions LLC Tracing and asynchronous communication network and routing method
US10008199B2 (en) 2015-08-22 2018-06-26 Toyota Motor Engineering & Manufacturing North America, Inc. Speech recognition system with abbreviated training
US10063647B2 (en) 2015-12-31 2018-08-28 Verint Americas Inc. Systems, apparatuses, and methods for intelligent network communication and engagement
US9972313B2 (en) * 2016-03-01 2018-05-15 Intel Corporation Intermediate scoring and rejection loopback for improved key phrase detection
CN106205622A (zh) * 2016-06-29 2016-12-07 联想(北京)有限公司 信息处理方法及电子设备
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments
EP3622506B1 (en) * 2017-05-08 2023-09-27 Telefonaktiebolaget LM Ericsson (publ) Asr adaptation
US20190019516A1 (en) * 2017-07-14 2019-01-17 Ford Global Technologies, Llc Speech recognition user macros for improving vehicle grammars
US10468019B1 (en) * 2017-10-27 2019-11-05 Kadho, Inc. System and method for automatic speech recognition using selection of speech models based on input characteristics
CN108198552B (zh) * 2018-01-18 2021-02-02 深圳市大疆创新科技有限公司 一种语音控制方法及视频眼镜
EP3575202A1 (en) * 2018-06-01 2019-12-04 GE Aviation Systems Limited Systems and methods for secure commands in vehicles
CN108777142A (zh) * 2018-06-05 2018-11-09 上海木木机器人技术有限公司 一种基于机场环境的语音交互识别方法及语音交互机器人
US10720149B2 (en) 2018-10-23 2020-07-21 Capital One Services, Llc Dynamic vocabulary customization in automated voice systems
CN109672786B (zh) * 2019-01-31 2021-08-20 北京蓦然认知科技有限公司 一种来电接听方法及装置
US10785171B2 (en) 2019-02-07 2020-09-22 Capital One Services, Llc Chat bot utilizing metaphors to both relay and obtain information
CN112788184A (zh) * 2021-01-18 2021-05-11 商客通尚景科技(上海)股份有限公司 根据语音输入连接呼叫中心的方法
US20240203412A1 (en) * 2022-12-16 2024-06-20 Amazon Technologies, Inc. Enterprise type models for voice interfaces

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2524472B2 (ja) * 1992-09-21 1996-08-14 インターナショナル・ビジネス・マシーンズ・コーポレイション 電話回線利用の音声認識システムを訓練する方法
US5666400A (en) * 1994-07-07 1997-09-09 Bell Atlantic Network Services, Inc. Intelligent recognition
JPH10513033A (ja) * 1995-11-17 1998-12-08 エイ・ティ・アンド・ティ・コーポレーション 電気通信網に基づく音声ダイヤル呼び出しのための自動語彙作成
US5897616A (en) * 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6125341A (en) * 1997-12-19 2000-09-26 Nortel Networks Corporation Speech recognition system and method
US6105063A (en) * 1998-05-05 2000-08-15 International Business Machines Corp. Client-server system for maintaining application preferences in a hierarchical data structure according to user and user group or terminal and terminal group contexts
US6614885B2 (en) * 1998-08-14 2003-09-02 Intervoice Limited Partnership System and method for operating a highly distributed interactive voice response system
US6442519B1 (en) * 1999-11-10 2002-08-27 International Business Machines Corp. Speaker model adaptation via network of similar users
GB2366033B (en) * 2000-02-29 2004-08-04 Ibm Method and apparatus for processing acquired data and contextual information and associating the same with available multimedia resources
US20020032591A1 (en) * 2000-09-08 2002-03-14 Agentai, Inc. Service request processing performed by artificial intelligence systems in conjunctiion with human intervention
US20020138274A1 (en) * 2001-03-26 2002-09-26 Sharma Sangita R. Server based adaption of acoustic models for client-based speech systems

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI502582B (zh) * 2013-04-03 2015-10-01 Chung Han Interlingua Knowledge Co Ltd 服務點之語音客服系統

Also Published As

Publication number Publication date
EP1497825A1 (en) 2005-01-19
TW200305140A (en) 2003-10-16
WO2003088211A1 (en) 2003-10-23
US20030191639A1 (en) 2003-10-09
AU2003218398A1 (en) 2003-10-27
CN1659624A (zh) 2005-08-24
CN100407291C (zh) 2008-07-30

Similar Documents

Publication Publication Date Title
TWI346322B (en) Method and medium for adaptive selection of vocabulary and acoustic models for speech recognition
AU2003235782A8 (en) System and method for speech recognition by multi-pass recognition generating refined context specific grammars
AU2003295628A1 (en) Method and apparatus for selective speech recognition
AU2003271083A1 (en) Language model creation/accumulation device, speech recognition device, language model creation method, and speech recognition method
AU2003295976A1 (en) Method and apparatus for selective distributed speech recognition
AU2003293119A1 (en) Method and apparatus for selective distributed speech recognition
AU2003280474A1 (en) Multi-phoneme streamer and knowledge representation speech recognition system and method
GB2333877B (en) Method of evaluating an utterance in a speech recognition system
GB0219870D0 (en) Speech synthesis apparatus and method
GB2407681B (en) Voice recognition system and method
EP1552721A4 (en) ULTRASONIC TRANSDUCERS WITH MICRO-MACHINING AND METHOD OF MANUFACTURE
AU2002367354A1 (en) Method and apparatus for multi-level distributed speech recognition
ZA200500792B (en) Distributed speech recognition with back-end voice activity detection apparatus and method
AU2003284654A1 (en) Speech synthesis method and speech synthesis device
AU2002364174A1 (en) System and method for speech recognition and transcription
GB2391680B (en) Adaptive learning of language models for speech recognition
AU2003278431A1 (en) Speech recognition device and method
DE502004002300D1 (de) Verfahren zur sprecherabhängigen spracherkennung und spracherkennungssystem
AU2003254273A1 (en) Acoustic modeling apparatus and method
AU2003237231A1 (en) Method and apparatus for differential compression of speaker models for speaker recognition
GB2390466B (en) Method for formation of speech recognition parameters
AU2003256852A1 (en) Speech recognition faciliation method and apparatus
GB2394589B (en) Speech recognition device and method
RU2002129029A (ru) Способ дикторонезависимого распознавания звуков речи
AU2003243027A1 (en) Language modeling method of speech recognition system

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees