DE10251112A1 - Verfahren und System zur Spracherkennung - Google Patents

Verfahren und System zur Spracherkennung Download PDF

Info

Publication number
DE10251112A1
DE10251112A1 DE10251112A DE10251112A DE10251112A1 DE 10251112 A1 DE10251112 A1 DE 10251112A1 DE 10251112 A DE10251112 A DE 10251112A DE 10251112 A DE10251112 A DE 10251112A DE 10251112 A1 DE10251112 A1 DE 10251112A1
Authority
DE
Germany
Prior art keywords
recognition
user
recognition result
output
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
DE10251112A
Other languages
German (de)
English (en)
Inventor
Albert R.R. Drs. Kooiman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Philips Intellectual Property and Standards GmbH
Original Assignee
Philips Intellectual Property and Standards GmbH
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Philips Intellectual Property and Standards GmbH filed Critical Philips Intellectual Property and Standards GmbH
Priority to DE10251112A priority Critical patent/DE10251112A1/de
Priority to AU2003274432A priority patent/AU2003274432A1/en
Priority to PCT/IB2003/004717 priority patent/WO2004042699A1/en
Priority to DE60325997T priority patent/DE60325997D1/de
Priority to US10/532,918 priority patent/US20050288922A1/en
Priority to EP03758411A priority patent/EP1561204B1/en
Priority to AT03758411T priority patent/ATE421748T1/de
Priority to JP2004549439A priority patent/JP4960596B2/ja
Priority to CNB2003801025097A priority patent/CN100524459C/zh
Publication of DE10251112A1 publication Critical patent/DE10251112A1/de
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Telephonic Communication Services (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Document Processing Apparatus (AREA)
  • Image Analysis (AREA)
DE10251112A 2002-11-02 2002-11-02 Verfahren und System zur Spracherkennung Withdrawn DE10251112A1 (de)

Priority Applications (9)

Application Number Priority Date Filing Date Title
DE10251112A DE10251112A1 (de) 2002-11-02 2002-11-02 Verfahren und System zur Spracherkennung
AU2003274432A AU2003274432A1 (en) 2002-11-02 2003-10-24 Method and system for speech recognition
PCT/IB2003/004717 WO2004042699A1 (en) 2002-11-02 2003-10-24 Method and system for speech recognition
DE60325997T DE60325997D1 (de) 2002-11-02 2003-10-24 Verfahren und anordnung zur spracherkennung
US10/532,918 US20050288922A1 (en) 2002-11-02 2003-10-24 Method and system for speech recognition
EP03758411A EP1561204B1 (en) 2002-11-02 2003-10-24 Method and system for speech recognition
AT03758411T ATE421748T1 (de) 2002-11-02 2003-10-24 Verfahren und anordnung zur spracherkennung
JP2004549439A JP4960596B2 (ja) 2002-11-02 2003-10-24 音声認識の方法およびシステム
CNB2003801025097A CN100524459C (zh) 2002-11-02 2003-10-24 用于语音识别的方法和系统

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
DE10251112A DE10251112A1 (de) 2002-11-02 2002-11-02 Verfahren und System zur Spracherkennung

Publications (1)

Publication Number Publication Date
DE10251112A1 true DE10251112A1 (de) 2004-05-19

Family

ID=32115142

Family Applications (2)

Application Number Title Priority Date Filing Date
DE10251112A Withdrawn DE10251112A1 (de) 2002-11-02 2002-11-02 Verfahren und System zur Spracherkennung
DE60325997T Expired - Lifetime DE60325997D1 (de) 2002-11-02 2003-10-24 Verfahren und anordnung zur spracherkennung

Family Applications After (1)

Application Number Title Priority Date Filing Date
DE60325997T Expired - Lifetime DE60325997D1 (de) 2002-11-02 2003-10-24 Verfahren und anordnung zur spracherkennung

Country Status (8)

Country Link
US (1) US20050288922A1 (enExample)
EP (1) EP1561204B1 (enExample)
JP (1) JP4960596B2 (enExample)
CN (1) CN100524459C (enExample)
AT (1) ATE421748T1 (enExample)
AU (1) AU2003274432A1 (enExample)
DE (2) DE10251112A1 (enExample)
WO (1) WO2004042699A1 (enExample)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004029873B3 (de) * 2004-06-16 2005-12-29 Deutsche Telekom Ag Verfahren und Vorrichtung zur intelligenten Eingabekorrektur für automatische Sprachdialogsysteme
DE102006058758A1 (de) * 2006-12-12 2008-06-19 Deutsche Telekom Ag Verfahren und Vorrichtung zum Steuern einer Telekommunikationsendeinrichtung

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7551727B2 (en) 2004-10-20 2009-06-23 Microsoft Corporation Unified messaging architecture
US7912186B2 (en) * 2004-10-20 2011-03-22 Microsoft Corporation Selectable state machine user interface system
JP4679254B2 (ja) * 2004-10-28 2011-04-27 富士通株式会社 対話システム、対話方法、及びコンピュータプログラム
US7941316B2 (en) * 2005-10-28 2011-05-10 Microsoft Corporation Combined speech and alternate input modality to a mobile device
US9128926B2 (en) * 2006-10-26 2015-09-08 Facebook, Inc. Simultaneous translation of open domain lectures and speeches
US8972268B2 (en) 2008-04-15 2015-03-03 Facebook, Inc. Enhanced speech-to-speech translation system and methods for adding a new word
US11222185B2 (en) 2006-10-26 2022-01-11 Meta Platforms, Inc. Lexicon development via shared translation database
US7987090B2 (en) * 2007-08-09 2011-07-26 Honda Motor Co., Ltd. Sound-source separation system
JP5610197B2 (ja) * 2010-05-25 2014-10-22 ソニー株式会社 検索装置、検索方法、及び、プログラム
CN102723080B (zh) * 2012-06-25 2014-06-11 惠州市德赛西威汽车电子有限公司 一种语音识别测试系统及方法
US10229676B2 (en) * 2012-10-05 2019-03-12 Avaya Inc. Phrase spotting systems and methods
CN102945671A (zh) * 2012-10-31 2013-02-27 四川长虹电器股份有限公司 语音识别方法
KR20140065897A (ko) * 2012-11-22 2014-05-30 삼성전자주식회사 전력 부하 모니터링 장치 및 방법
US9620115B2 (en) 2013-01-03 2017-04-11 Telenav, Inc. Content delivery system with barge-in mechanism and method of operation thereof
CN104618456A (zh) * 2015-01-13 2015-05-13 小米科技有限责任公司 信息发布方法及装置
US9773483B2 (en) * 2015-01-20 2017-09-26 Harman International Industries, Incorporated Automatic transcription of musical content and real-time musical accompaniment
KR102561711B1 (ko) * 2016-02-26 2023-08-01 삼성전자주식회사 컨텐트를 인식하는 방법 및 장치
DE102016115243A1 (de) * 2016-04-28 2017-11-02 Masoud Amri Programmieren in natürlicher Sprache
US11151986B1 (en) * 2018-09-21 2021-10-19 Amazon Technologies, Inc. Learning how to rewrite user-specific input for natural language understanding
KR102368193B1 (ko) * 2018-10-29 2022-03-02 어니컴 주식회사 음성합성을 이용한 음성인식기능 검증 방법 및 장치
CN110853639B (zh) * 2019-10-23 2023-09-01 天津讯飞极智科技有限公司 语音转写方法及相关装置
WO2022003822A1 (ja) * 2020-06-30 2022-01-06 日産自動車株式会社 情報処理装置及び情報処理方法
CN114974249B (zh) * 2021-02-20 2025-09-12 上海大唐移动通信设备有限公司 一种语音识别方法、装置及存储介质
CN113611328B (zh) * 2021-06-30 2025-04-11 公安部第一研究所 一种声纹识别语音评测方法及装置
CN114240455A (zh) * 2021-12-07 2022-03-25 山东远联信息科技有限公司 一种现场智能机器人的自助流程客服方法和系统

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2585547B2 (ja) * 1986-09-19 1997-02-26 株式会社日立製作所 音声入出力装置における入力音声の修正方法
JPH0351898A (ja) * 1989-07-20 1991-03-06 Sanyo Electric Co Ltd 音声認識装置
JPH0854894A (ja) * 1994-08-10 1996-02-27 Fujitsu Ten Ltd 音声処理装置
JPH09114482A (ja) * 1995-10-17 1997-05-02 Nippon Telegr & Teleph Corp <Ntt> 音声認識のための話者適応化方法
US5794189A (en) * 1995-11-13 1998-08-11 Dragon Systems, Inc. Continuous speech recognition
JPH10143503A (ja) * 1996-11-08 1998-05-29 Nec Corp 音声ワードプロセッサ
US6154526A (en) * 1996-12-04 2000-11-28 Intellivoice Communications, Inc. Data acquisition and error correcting speech recognition system
US5897616A (en) * 1997-06-11 1999-04-27 International Business Machines Corporation Apparatus and methods for speaker verification/identification/classification employing non-acoustic and/or acoustic models and databases
US6219628B1 (en) * 1997-08-18 2001-04-17 National Instruments Corporation System and method for configuring an instrument to perform measurement functions utilizing conversion of graphical programs into hardware implementations
JPH11338493A (ja) * 1998-05-26 1999-12-10 Sony Corp 情報処理装置および方法、並びに提供媒体
US6405170B1 (en) * 1998-09-22 2002-06-11 Speechworks International, Inc. Method and system of reviewing the behavior of an interactive speech recognition application
US6219638B1 (en) * 1998-11-03 2001-04-17 International Business Machines Corporation Telephone messaging and editing system
JP2000250587A (ja) * 1999-03-01 2000-09-14 Hitachi Ltd 音声認識装置及び音声認識翻訳装置
JP3980791B2 (ja) * 1999-05-03 2007-09-26 パイオニア株式会社 音声認識装置を備えたマンマシンシステム
DE50008703D1 (de) * 1999-06-10 2004-12-23 Infineon Technologies Ag Spracherkennungsverfahren und -einrichtung
JP2001005809A (ja) * 1999-06-25 2001-01-12 Toshiba Corp 文書作成装置、文書作成方法、及び文書作成プログラムが記録された記録媒体
CN1207664C (zh) * 1999-07-27 2005-06-22 国际商业机器公司 对语音识别结果中的错误进行校正的方法和语音识别系统
JP2001100786A (ja) * 1999-09-28 2001-04-13 Canon Inc 音声認識方法、装置及び記憶媒体
JP2003518266A (ja) * 1999-12-20 2003-06-03 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 音声認識システムのテキスト編集用音声再生
JP4465564B2 (ja) * 2000-02-28 2010-05-19 ソニー株式会社 音声認識装置および音声認識方法、並びに記録媒体
US7200555B1 (en) 2000-07-05 2007-04-03 International Business Machines Corporation Speech recognition correction for devices having limited or no display
US6856956B2 (en) * 2000-07-20 2005-02-15 Microsoft Corporation Method and apparatus for generating and displaying N-best alternatives in a speech recognition system
WO2002021510A1 (en) * 2000-09-08 2002-03-14 Koninklijke Philips Electronics N.V. Speech recognition method with a replace command
EP1189203B1 (en) * 2000-09-18 2006-05-17 L &amp; H Holdings USA, Inc. Homophone selection in speech recognition
WO2002080144A1 (en) * 2001-03-29 2002-10-10 Koninklijke Philips Electronics N.V. Text editing for recognized speech during synchronous playback
US6839667B2 (en) * 2001-05-16 2005-01-04 International Business Machines Corporation Method of speech recognition by presenting N-best word candidates
US6910012B2 (en) * 2001-05-16 2005-06-21 International Business Machines Corporation Method and system for speech recognition using phonetically similar word alternatives
US6963834B2 (en) * 2001-05-29 2005-11-08 International Business Machines Corporation Method of speech recognition using empirically determined word candidates
TW517221B (en) * 2001-08-24 2003-01-11 Ind Tech Res Inst Voice recognition system
US7260534B2 (en) * 2002-07-16 2007-08-21 International Business Machines Corporation Graphical user interface for determining speech recognition accuracy

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102004029873B3 (de) * 2004-06-16 2005-12-29 Deutsche Telekom Ag Verfahren und Vorrichtung zur intelligenten Eingabekorrektur für automatische Sprachdialogsysteme
DE102006058758A1 (de) * 2006-12-12 2008-06-19 Deutsche Telekom Ag Verfahren und Vorrichtung zum Steuern einer Telekommunikationsendeinrichtung
DE102006058758B4 (de) * 2006-12-12 2018-02-22 Deutsche Telekom Ag Verfahren und Vorrichtung zum Steuern einer Telekommunikationsendeinrichtung

Also Published As

Publication number Publication date
WO2004042699A1 (en) 2004-05-21
EP1561204B1 (en) 2009-01-21
JP4960596B2 (ja) 2012-06-27
JP2006505002A (ja) 2006-02-09
CN100524459C (zh) 2009-08-05
DE60325997D1 (de) 2009-03-12
CN1708783A (zh) 2005-12-14
AU2003274432A1 (en) 2004-06-07
EP1561204A1 (en) 2005-08-10
ATE421748T1 (de) 2009-02-15
US20050288922A1 (en) 2005-12-29

Similar Documents

Publication Publication Date Title
DE10251112A1 (de) Verfahren und System zur Spracherkennung
EP1927980B1 (de) Verfahren zur Klassifizierung der gesprochenen Sprache in Sprachdialogsystemen
DE60207742T2 (de) Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes
DE60222093T2 (de) Verfahren, modul, vorrichtung und server zur spracherkennung
DE69519328T2 (de) Verfahren und Anordnung für die Umwandlung von Sprache in Text
DE60124559T2 (de) Einrichtung und verfahren zur spracherkennung
EP1466317B1 (de) Betriebsverfahren eines automatischen spracherkenners zur sprecherunabhängigen spracherkennung von worten aus verschiedenen sprachen und automatischer spracherkenner
DE19510083C2 (de) Verfahren und Anordnung zur Spracherkennung bei Wortkomposita enthaltenden Sprachen
DE19956747C1 (de) Verfahren und Vorrichtung zur Spracherkennung sowie ein Telekommunikationssystem
DE60023736T2 (de) Verfahren und vorrichtung zur spracherkennung mit verschiedenen sprachmodellen
EP1590797B1 (de) Kommunikationssystem, kommunikationsendeinrichtung und vorrichtung zum erkennen fehlerbehafteter text-nachrichten
EP1077448B1 (de) Spracherkennung unter Berücksichtigung der Lautstärkeschwankungen
DE60020504T2 (de) Anpassung eines spracherkenners an korrigierte texte
EP2047668B1 (de) Verfahren, sprachdialogsystem und telekommunikationsendgerät zur multilingualen sprachausgabe
DE102010040553A1 (de) Spracherkennungsverfahren
EP1456837B1 (de) Verfahren und vorrichtung zur spracherkennung
EP3430615A1 (de) Fortbewegungsmittel, system und verfahren zur anpassung einer länge einer erlaubten sprechpause im rahmen einer spracheingabe
DE112006000322T5 (de) Audioerkennungssystem zur Erzeugung von Antwort-Audio unter Verwendung extrahierter Audiodaten
EP2034472B1 (de) Spracherkennungsverfahren und Spracherkennungsvorrichtung
DE60029456T2 (de) Verfahren zur Online-Anpassung von Aussprachewörterbüchern
EP2907048B1 (de) Kraftwagen mit einem sprachübersetzungssystem
DE60022976T2 (de) Spracherkennungseinrichtung mit transfermitteln
DE102010033117A1 (de) Spracherkennungsverfahren
Amir et al. Unresolved anger: Prosodic analysis and classification of speech from a therapeutic setting
EP2141692A1 (de) Automatisierte Sprachgesteuerte Unterstützung eines Benutzers

Legal Events

Date Code Title Description
8139 Disposal/non-payment of the annual fee