DE60020504D1 - Anpassung eines spracherkenners an korrigierte texte - Google Patents

Anpassung eines spracherkenners an korrigierte texte

Info

Publication number
DE60020504D1
DE60020504D1 DE60020504T DE60020504T DE60020504D1 DE 60020504 D1 DE60020504 D1 DE 60020504D1 DE 60020504 T DE60020504 T DE 60020504T DE 60020504 T DE60020504 T DE 60020504T DE 60020504 D1 DE60020504 D1 DE 60020504D1
Authority
DE
Germany
Prior art keywords
text information
indicator
adjusting
information
smi
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60020504T
Other languages
English (en)
Other versions
DE60020504T2 (de
Inventor
Heinrich Bartosik
Walter Mueller
Martin Schatz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Austria GmbH
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of DE60020504D1 publication Critical patent/DE60020504D1/de
Application granted granted Critical
Publication of DE60020504T2 publication Critical patent/DE60020504T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • G10L15/075Adaptation to the speaker supervised, i.e. under machine guidance
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
  • Telephonic Communication Services (AREA)
DE60020504T 1999-07-08 2000-06-30 Anpassung eines spracherkenners an korrigierte texte Expired - Lifetime DE60020504T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP99890232 1999-07-08
EP99890232 1999-07-08
PCT/EP2000/006167 WO2001004874A1 (en) 1999-07-08 2000-06-30 Adaptation of a speech recognizer from corrected text

Publications (2)

Publication Number Publication Date
DE60020504D1 true DE60020504D1 (de) 2005-07-07
DE60020504T2 DE60020504T2 (de) 2006-05-04

Family

ID=8243996

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60020504T Expired - Lifetime DE60020504T2 (de) 1999-07-08 2000-06-30 Anpassung eines spracherkenners an korrigierte texte

Country Status (6)

Country Link
US (1) US6725194B1 (de)
EP (1) EP1110204B1 (de)
JP (1) JP2003504674A (de)
AT (1) ATE297046T1 (de)
DE (1) DE60020504T2 (de)
WO (1) WO2001004874A1 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE306116T1 (de) * 1999-07-08 2005-10-15 Koninkl Philips Electronics Nv Spracherkennungseinrichtung mit transfermitteln
US7689416B1 (en) 1999-09-29 2010-03-30 Poirier Darrell A System for transferring personalize matter from one computer to another
JP2001100781A (ja) 1999-09-30 2001-04-13 Sony Corp 音声処理装置および音声処理方法、並びに記録媒体
ATE362271T1 (de) * 2000-07-31 2007-06-15 Eliza Corp Verfahren und system zur verbesserung der genauigkeit in einem spracherkennungssystem
US7418381B2 (en) * 2001-09-07 2008-08-26 Hewlett-Packard Development Company, L.P. Device for automatically translating and presenting voice messages as text messages
US20030158735A1 (en) * 2002-02-15 2003-08-21 Canon Kabushiki Kaisha Information processing apparatus and method with speech synthesis function
US8019602B2 (en) 2004-01-20 2011-09-13 Microsoft Corporation Automatic speech recognition learning using user corrections
US7590533B2 (en) * 2004-03-10 2009-09-15 Microsoft Corporation New-word pronunciation learning using a pronunciation graph
WO2008008730A2 (en) 2006-07-08 2008-01-17 Personics Holdings Inc. Personal audio assistant device and method
CN103714048B (zh) * 2012-09-29 2017-07-21 国际商业机器公司 用于校正文本的方法和系统
KR102009423B1 (ko) * 2012-10-08 2019-08-09 삼성전자주식회사 음성 인식을 이용한 미리 설정된 동작 모드의 수행 방법 및 장치
CN107086040B (zh) * 2017-06-23 2021-03-02 歌尔股份有限公司 语音识别能力测试方法和装置
US10943583B1 (en) * 2017-07-20 2021-03-09 Amazon Technologies, Inc. Creation of language models for speech recognition
US10600408B1 (en) * 2018-03-23 2020-03-24 Amazon Technologies, Inc. Content output management based on speech quality
US11393471B1 (en) * 2020-03-30 2022-07-19 Amazon Technologies, Inc. Multi-device output management based on speech characteristics

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4779209A (en) * 1982-11-03 1988-10-18 Wang Laboratories, Inc. Editing voice data
AT390685B (de) 1988-10-25 1990-06-11 Philips Nv System zur textverarbeitung
JPH08505959A (ja) * 1993-01-21 1996-06-25 アップル コンピューター インコーポレイテッド ベクトル量子化ベース音声符号化/複号化を用いたテキスト−音声合成システム
US5852801A (en) * 1995-10-04 1998-12-22 Apple Computer, Inc. Method and apparatus for automatically invoking a new word module for unrecognized user input
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
US5794189A (en) * 1995-11-13 1998-08-11 Dragon Systems, Inc. Continuous speech recognition
GB2302199B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US5857099A (en) * 1996-09-27 1999-01-05 Allvoice Computing Plc Speech-to-text dictation system with audio message capability
US6173259B1 (en) * 1997-03-27 2001-01-09 Speech Machines Plc Speech to text conversion
US6263308B1 (en) * 2000-03-20 2001-07-17 Microsoft Corporation Methods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process

Also Published As

Publication number Publication date
ATE297046T1 (de) 2005-06-15
US6725194B1 (en) 2004-04-20
JP2003504674A (ja) 2003-02-04
DE60020504T2 (de) 2006-05-04
EP1110204A1 (de) 2001-06-27
WO2001004874A1 (en) 2001-01-18
EP1110204B1 (de) 2005-06-01

Similar Documents

Publication Publication Date Title
DE60020504D1 (de) Anpassung eines spracherkenners an korrigierte texte
Kanthak et al. Context-dependent acoustic modeling using graphemes for large vocabulary speech recognition
US7490039B1 (en) Text to speech system and method having interactive spelling capabilities
CA2181205A1 (en) Discriminative Utterance Verification for Connected Digits Recognition
JP2006058899A (ja) 発話検索のためのラティス・ベースの検索システムおよび方法
ATE395685T1 (de) Spracherkennung durch wort-in-phrase-befehl
DE60004862D1 (de) Automatische bestimmung der genauigkeit eines aussprachewörterbuchs in einem spracherkennungssystem
ATE311650T1 (de) Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes
KR19990087935A (ko) 연속음성인식시에구두점들을자동으로발생시키기위한장치및방법
CA2493265A1 (en) System and method for augmenting spoken language understanding by correcting common errors in linguistic performance
DE60209103D1 (de) Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe
WO2007034478A3 (en) System and method for correcting speech
ATE319161T1 (de) Korrekturvorrichtung mit markierung von teilen eines erkannten textes
Zetterholm Voice imitation: a phonetic study of perceptual illusions and acoustic success
Greenberg et al. Linguistic dissection of switchboard-corpus automatic speech recognition systems
Che et al. Speaker recognition using HMM with experiments on the YOHO database
US6377921B1 (en) Identifying mismatches between assumed and actual pronunciations of words
Newman et al. Speaker verification through large vocabulary continuous speech recognition
Amdal et al. Joint pronunciation modelling of non-native speakers using data-driven methods.
Schuller et al. Comparing one and two-stage acoustic modeling in the recognition of emotion in speech
US20050234724A1 (en) System and method for improving text-to-speech software intelligibility through the detection of uncommon words and phrases
DE60022976D1 (de) Spracherkennungseinrichtung mit transfermitteln
Gauvain et al. Experiments with speaker verification over the telephone.
Schramm et al. Filled-pause modeling for medical transcriptions
KR20090109501A (ko) 언어학습용 리듬훈련 시스템 및 방법

Legal Events

Date Code Title Description
8364 No opposition during term of opposition
8327 Change in the person/name/address of the patent owner

Owner name: NUANCE COMMUNICATIONS AUSTRIA GMBH, WIEN, AT

8328 Change in the person/name/address of the agent

Representative=s name: VOSSIUS & PARTNER, 81675 MUENCHEN