DE69842190D1 - Extensible speech recognition system with audio feedback - Google Patents

Extensible speech recognition system with audio feedback

Info

Publication number
DE69842190D1
DE69842190D1 DE69842190T DE69842190T DE69842190D1 DE 69842190 D1 DE69842190 D1 DE 69842190D1 DE 69842190 T DE69842190 T DE 69842190T DE 69842190 T DE69842190 T DE 69842190T DE 69842190 D1 DE69842190 D1 DE 69842190D1
Authority
DE
Germany
Prior art keywords
expandable
speech recognition
recognition system
audio feedback
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69842190T
Other languages
English (en)
Inventor
Xuedong D Huang
Michael J Rozak
Li Jiang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-04-10
Filing date
1998-04-08
Publication date
2011-04-28
Application filed by Microsoft Corp

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0638 Interactive procedures

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Electrically Operated Instructional Devices (AREA)
DE69842190T 1997-04-10 1998-04-08 Extensible speech recognition system with audio feedback Expired - Lifetime DE69842190D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/833,916 US5933804A (en) 1997-04-10 1997-04-10 Extensible speech recognition system that provides a user with audio feedback

Publications (1)

Publication Number Publication Date
DE69842190D1 (de) 2011-04-28

Family

ID=25265611

Family Applications (2)

Application Number Title Priority Date Filing Date
DE69834553T Expired - Lifetime DE69834553T2 (de) 1997-04-10 1998-04-08 Extensible speech recognition system with audio feedback
DE69842190T Expired - Lifetime DE69842190D1 (de) 1997-04-10 1998-04-08 Extensible speech recognition system with audio feedback

Family Applications Before (1)

Application Number Title Priority Date Filing Date
DE69834553T Expired - Lifetime DE69834553T2 (de) 1997-04-10 1998-04-08 Extensible speech recognition system with audio feedback

Country Status (6)

Country Link
US (1) US5933804A (de)
EP (2) EP1693827B1 (de)
JP (1) JP4570176B2 (de)
CN (2) CN1196105C (de)
DE (2) DE69834553T2 (de)
WO (1) WO1998045834A1 (de)

Families Citing this family (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2219008C (en) * 1997-10-21 2002-11-19 Bell Canada A method and apparatus for improving the utility of speech recognition
US6078885A (en) * 1998-05-08 2000-06-20 At&T Corp Verbal, fully automatic dictionary updates by end-users of speech synthesis and recognition systems
US6163768A (en) * 1998-06-15 2000-12-19 Dragon Systems, Inc. Non-interactive enrollment in speech recognition
US6462616B1 (en) 1998-09-24 2002-10-08 Ericsson Inc. Embedded phonetic support and TTS play button in a contacts database
US6363342B2 (en) * 1998-12-18 2002-03-26 Matsushita Electric Industrial Co., Ltd. System for developing word-pronunciation pairs
US6324507B1 (en) 1999-02-10 2001-11-27 International Business Machines Corp. Speech recognition enrollment for non-readers and displayless devices
US7292980B1 (en) * 1999-04-30 2007-11-06 Lucent Technologies Inc. Graphical user interface and method for modifying pronunciations in text-to-speech and speech recognition systems
US6434521B1 (en) * 1999-06-24 2002-08-13 Speechworks International, Inc. Automatically determining words for updating in a pronunciation dictionary in a speech recognition system
EP1074973B1 (de) * 1999-06-30 2006-03-15 International Business Machines Corporation Method for expanding the vocabulary of a speech recognition system
ATE320650T1 (de) 1999-06-30 2006-04-15 Ibm Method for expanding the vocabulary of a speech recognition system
US7149690B2 (en) * 1999-09-09 2006-12-12 Lucent Technologies Inc. Method and apparatus for interactive language instruction
JP2002221980A (ja) * 2001-01-25 2002-08-09 Oki Electric Ind Co Ltd Text-to-speech conversion device
US7107215B2 (en) * 2001-04-16 2006-09-12 Sakhr Software Company Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study
DE10119677A1 (de) * 2001-04-20 2002-10-24 Philips Corp Intellectual Pty Method for determining database entries
US7493559B1 (en) * 2002-01-09 2009-02-17 Ricoh Co., Ltd. System and method for direct multi-modal annotation of objects
KR100467590B1 (ko) * 2002-06-28 2005-01-24 삼성전자주식회사 Apparatus and method for updating a pronunciation dictionary
DE10304229A1 (de) * 2003-01-28 2004-08-05 Deutsche Telekom Ag Communication system, communication terminal, and device for detecting erroneous text messages
US8577681B2 (en) * 2003-09-11 2013-11-05 Nuance Communications, Inc. Pronunciation discovery for spoken words
US20050114131A1 (en) * 2003-11-24 2005-05-26 Kirill Stoimenov Apparatus and method for voice-tagging lexicon
US7340395B2 (en) * 2004-04-23 2008-03-04 Sap Aktiengesellschaft Multiple speech recognition engines
US20050273337A1 (en) * 2004-06-02 2005-12-08 Adoram Erell Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
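WO2007097176A1 (ja) * 2006-02-23 2007-08-30 Nec Corporation Speech recognition dictionary creation support system, speech recognition dictionary creation support method, and program for speech recognition dictionary creation support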
US20080104537A1 (en) * 2006-10-30 2008-05-01 Sherryl Lee Lorraine Scott Method of improved viewing of visual objects on a display, and handheld electronic device
EP2126900B1 (de) 2007-02-06 2013-04-24 Nuance Communications Austria GmbH Method and system for creating entries in a speech recognition lexicon
US8484034B2 (en) * 2008-03-31 2013-07-09 Avaya Inc. Arrangement for creating and using a phonetic-alphabet representation of a name of a party to a call
US9077933B2 (en) 2008-05-14 2015-07-07 At&T Intellectual Property I, L.P. Methods and apparatus to generate relevance rankings for use by a program selector of a media presentation system
US9202460B2 (en) * 2008-05-14 2015-12-01 At&T Intellectual Property I, Lp Methods and apparatus to generate a speech recognition library
US8160881B2 (en) * 2008-12-15 2012-04-17 Microsoft Corporation Human-assisted pronunciation generation
JP5334178B2 (ja) * 2009-01-21 2013-11-06 クラリオン株式会社 Speech recognition device and data update method
CN101739459A (zh) * 2009-12-21 2010-06-16 中兴通讯股份有限公司 Method for adding to the word library of a mobile terminal, and mobile terminal
US9640175B2 (en) 2011-10-07 2017-05-02 Microsoft Technology Licensing, Llc Pronunciation learning from user correction
KR101179915B1 (ko) 2011-12-29 2012-09-06 주식회사 예스피치 Apparatus and method for refining utterance data in a speech recognition system that uses a statistical language model
US9721587B2 (en) * 2013-01-24 2017-08-01 Microsoft Technology Licensing, Llc Visual feedback for speech recognition system
US9779722B2 (en) * 2013-11-05 2017-10-03 GM Global Technology Operations LLC System for adapting speech recognition vocabulary
GB2524222B (en) * 2013-12-18 2018-07-18 Cirrus Logic Int Semiconductor Ltd Activating speech processing
US20150310851A1 (en) * 2014-04-24 2015-10-29 Ford Global Technologies, Llc Method and Apparatus for Extra-Vehicular Voice Recognition Training Including Vehicular Updating
US9613140B2 (en) * 2014-05-16 2017-04-04 International Business Machines Corporation Real-time audio dictionary updating system
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
CN104598791A (zh) * 2014-11-29 2015-05-06 深圳市金立通信设备有限公司 Voice unlocking method
CN104505089B (zh) * 2014-12-17 2018-05-18 福建网龙计算机网络信息技术有限公司 Spoken-language error correction method and device
US9787819B2 (en) * 2015-09-18 2017-10-10 Microsoft Technology Licensing, Llc Transcription of spoken communications
US10706210B2 (en) * 2016-08-31 2020-07-07 Nuance Communications, Inc. User interface for dictation application employing automatic speech recognition
US11170757B2 (en) * 2016-09-30 2021-11-09 T-Mobile Usa, Inc. Systems and methods for improved call handling
CN109635096B (zh) * 2018-12-20 2020-12-25 广东小天才科技有限公司 Dictation prompting method and electronic device
CN111081084B (zh) * 2019-07-11 2021-11-26 广东小天才科技有限公司 Method for broadcasting dictation content, and electronic device
US11676572B2 (en) * 2021-03-03 2023-06-13 Google Llc Instantaneous learning in text-to-speech during dialog

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4516260A (en) * 1978-04-28 1985-05-07 Texas Instruments Incorporated Electronic learning aid or game having synthesized speech
CH644246B (fr) * 1981-05-15 1900-01-01 Asulab Sa Voice-controlled word input device.
US4749353A (en) * 1982-05-13 1988-06-07 Texas Instruments Incorporated Talking electronic learning aid for improvement of spelling with operator-controlled word list
JPS6221199A (ja) * 1985-07-22 1987-01-29 株式会社東芝 Speech recognition device
JPS6287994A (ja) * 1985-10-14 1987-04-22 株式会社リコー Speech recognition dictionary update method
JPS63281196A (ja) * 1987-05-14 1988-11-17 沖電気工業株式会社 Speech recognition device
GB8817705D0 (en) * 1988-07-25 1988-09-01 British Telecomm Optical communications system
JPH0778183A (ja) * 1993-06-25 1995-03-20 Ricoh Co Ltd Database search system
US5623578A (en) * 1993-10-28 1997-04-22 Lucent Technologies Inc. Speech recognition system allows new vocabulary words to be added without requiring spoken samples of the words
JPH07306851A (ja) * 1994-05-12 1995-11-21 Ricoh Co Ltd Phonetic symbol editing device
US5681108A (en) * 1995-06-28 1997-10-28 Miller; Alan Golf scorekeeping system
US5737487A (en) * 1996-02-13 1998-04-07 Apple Computer, Inc. Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition
JPH09292255A (ja) * 1996-04-26 1997-11-11 Pioneer Electron Corp Navigation method and device

Also Published As

Publication number Publication date
CN1264468A (zh) 2000-08-23
EP1693827A2 (de) 2006-08-23
WO1998045834A1 (en) 1998-10-15
DE69834553T2 (de) 2007-04-26
EP0974141B1 (de) 2006-05-17
CN1280782C (zh) 2006-10-18
DE69834553D1 (de) 2006-06-22
US5933804A (en) 1999-08-03
CN1604187A (zh) 2005-04-06
EP1693827B1 (de) 2011-03-16
JP2002511154A (ja) 2002-04-09
CN1196105C (zh) 2005-04-06
EP0974141A1 (de) 2000-01-26
JP4570176B2 (ja) 2010-10-27
EP1693827A3 (de) 2007-05-30

Similar Documents

Publication Publication Date Title
DE69842190D1 (de) Extensible speech recognition system with audio feedback
DK0789901T3 (da) Speech recognition
DE69814589D1 (de) Speech recognition using multiple speech recognizers
NO974097D0 (no) Speech recognition
DE69432570D1 (de) Speech recognition
DE69727519D1 (de) Data network with voice control means
DE69735572D1 (de) DC-to-DC step-down voltage regulator
BR9203745A (pt) Speech recognition systems
DE69721349D1 (de) Speech coding
DK0731995T3 (da) Connector for reducing crosstalk
DE69815650D1 (de) Speech coder
FI954573A (fi) Recognition of speech in context
DE69718234D1 (de) Speech coder
DE69328275T2 (de) Speech recognition system
DE69427717D1 (de) Speech dialogue system
DE69524321T2 (de) Speech recognizer
DE69702261D1 (de) Speech coding
DE69330361D1 (de) Speech recognition system
NO985093D0 (no) Automatic speech recognition
DE69524002D1 (de) Speech coder
KR970060125U (ko) Speaker system
KR940025428U (ko) Voice recognition control system for audio equipment
KR970060123U (ko) Speaker case structure
KR970010341U (ko) Lighter with voice output function
BR7601539U (pt) Template ruler