ATE239966T1 - Application of reference data for speech recognition - Google Patents

Application of reference data for speech recognition

Info

Publication number
ATE239966T1
Authority
AT
Austria
Prior art keywords
speech recognition
application
reference data
recognition
currently valid
Prior art date
Application number
AT00123488T
Other languages
English (en)
Inventor
Stefan Dobler
Ralph Schleifer
Andreas Kiessling
Raymond Brueckner
Original Assignee
Ericsson Telefon Ab L M
Priority date
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M
Application granted
Publication of ATE239966T1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065 Adaptation
    • G10L15/07 Adaptation to the speaker
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0631 Creating reference templates; Clustering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0635 Training updating or merging of old and new templates; Mean values; Weighting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G10L2015/0635 Training updating or merging of old and new templates; Mean values; Weighting
    • G10L2015/0636 Threshold criteria for the updating

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Navigation (AREA)
  • Image Analysis (AREA)
  • Machine Translation (AREA)
  • Circuits Of Receivers In General (AREA)
  • Image Processing (AREA)
  • Electric Clocks (AREA)
AT00123488T 2000-11-07 2000-11-07 Application of reference data for speech recognition ATE239966T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP00123488A EP1205906B1 (de) 2000-11-07 2000-11-07 Application of reference data for speech recognition

Publications (1)

Publication Number Publication Date
ATE239966T1 (de) 2003-05-15

Family

ID=8170225

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00123488T ATE239966T1 (de) 2000-11-07 2000-11-07 Application of reference data for speech recognition

Country Status (6)

Country Link
US (1) US6961702B2 (de)
EP (1) EP1205906B1 (de)
AT (1) ATE239966T1 (de)
AU (1) AU2002220661A1 (de)
DE (1) DE60002584D1 (de)
WO (1) WO2002039427A1 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7676366B2 (en) * 2003-01-13 2010-03-09 Art Advanced Recognition Technologies Inc. Adaptation of symbols
US20060064177A1 (en) * 2004-09-17 2006-03-23 Nokia Corporation System and method for measuring confusion among words in an adaptive speech recognition system
US7831549B2 (en) * 2004-09-17 2010-11-09 Nokia Corporation Optimization of text-based training set selection for language processing modules
US20080208578A1 (en) * 2004-09-23 2008-08-28 Koninklijke Philips Electronics, N.V. Robust Speaker-Dependent Speech Recognition System
US7634406B2 (en) * 2004-12-10 2009-12-15 Microsoft Corporation System and method for identifying semantic intent from acoustic information
US7895039B2 (en) 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US7865362B2 (en) 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7949533B2 (en) 2005-02-04 2011-05-24 Vocollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US7827032B2 (en) * 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US8762148B2 (en) * 2006-02-27 2014-06-24 Nec Corporation Reference pattern adaptation apparatus, reference pattern adaptation method and reference pattern adaptation program
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9899026B2 (en) 2012-05-31 2018-02-20 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10395672B2 (en) 2012-05-31 2019-08-27 Elwha Llc Methods and systems for managing adaptation data
US20130325453A1 (en) 2012-05-31 2013-12-05 Elwha LLC, a limited liability company of the State of Delaware Methods and systems for speech adaptation data
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
US8843371B2 (en) 2012-05-31 2014-09-23 Elwha Llc Speech recognition adaptation systems based on adaptation data
EP2867889A4 (de) * 2012-06-29 2016-03-02 Elwha Llc Methods and systems for managing adaptation data
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS50155105A (de) * 1974-06-04 1975-12-15
US4720863A (en) * 1982-11-03 1988-01-19 Itt Defense Communications Method and apparatus for text-independent speaker recognition
US5127055A (en) * 1988-12-30 1992-06-30 Kurzweil Applied Intelligence, Inc. Speech recognition apparatus & method having dynamic reference pattern adaptation
JP2852298B2 (ja) * 1990-07-31 1999-01-27 NEC Corporation Standard pattern adaptation system
WO1995009416A1 (en) * 1993-09-30 1995-04-06 Apple Computer, Inc. Continuous reference adaptation in a pattern recognition system
JP3092491B2 (ja) * 1995-08-30 2000-09-25 NEC Corporation Pattern adaptation system using the minimum description length criterion
US5895447A (en) * 1996-02-02 1999-04-20 International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
US5842161A (en) * 1996-06-25 1998-11-24 Lucent Technologies Inc. Telecommunications instrument employing variable criteria speech recognition
US6260013B1 (en) * 1997-03-14 2001-07-10 Lernout & Hauspie Speech Products N.V. Speech recognition system employing discriminatively trained models
US6012027A (en) * 1997-05-27 2000-01-04 Ameritech Corporation Criteria for usable repetitions of an utterance during speech reference enrollment
DE69829187T2 (de) * 1998-12-17 2005-12-29 Sony International (Europe) Gmbh Semi-supervised speaker adaptation
EP1022724B8 (de) * 1999-01-20 2008-10-15 Sony Deutschland GmbH Speaker adaptation for confusable words
US6253181B1 (en) 1999-01-22 2001-06-26 Matsushita Electric Industrial Co., Ltd. Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers
JP2000221990A (ja) * 1999-01-28 2000-08-11 Ricoh Co Ltd Speech recognition device

Also Published As

Publication number Publication date
EP1205906B1 (de) 2003-05-07
EP1205906A1 (de) 2002-05-15
US20020069053A1 (en) 2002-06-06
US6961702B2 (en) 2005-11-01
DE60002584D1 (de) 2003-06-12
WO2002039427A1 (en) 2002-05-16
AU2002220661A1 (en) 2002-05-21

Similar Documents

Publication Publication Date Title
ATE239966T1 (de) Application of reference data for speech recognition
JP4536323B2 (ja) Speech-to-speech generation system and method
US7062440B2 (en) Monitoring text to speech output to effect control of barge-in
DE60128816D1 (de) Speech recognition method with replace command
ATE349056T1 (de) Language-independent voice-based user interface
US7191132B2 (en) Speech synthesis apparatus and method
ATE298918T1 (de) Voice-controlled portable terminal
DE69822179D1 (de) Method for learning patterns for speech or speaker recognition
ATE282881T1 (de) Vocoder-based speech recognizer
WO2002054033A3 (en) Hierarchical language models for speech recognition
WO2004090866A3 (en) Phonetically based speech recognition system and method
ATE257616T1 (de) Speech recognition method
DE69937176D1 (de) Segmentation method for extending the active vocabulary of speech recognizers
DE69623364D1 (de) Device for recognizing continuously spoken speech
ATE363120T1 (de) Audio dialogue system and voice-controlled browsing method
ATE261607T1 (de) Voice-controlled portable terminal
WO1996000962A3 (en) Method and device for adapting a speech recognition equipment for dialectal variations in a language
DE602004024172D1 (de) Automatic generation of a word pronunciation for speech recognition
DE50004296D1 (de) Playback method for voice-controlled systems with text-based speech synthesis
Stöber et al. Speech synthesis using multilevel selection and concatenation of units from large speech corpora
ES2169572T3 (es) Speech recognition method using a grammar
JP3575919B2 (ja) Text-to-speech conversion device
WO2000026901A3 (en) Performing spoken recorded actions
KR100281582B1 (ko) Speech recognition method that uses recognizer resources efficiently
KR100989500B1 (ko) Method for sharing speech recognition parameters

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties