DE60002584D1 - Anwendung von Referenzdaten für Spracherkennung - Google Patents

Anwendung von Referenzdaten für Spracherkennung

Info

Publication number
DE60002584D1
DE60002584D1 DE60002584T DE60002584T DE60002584D1 DE 60002584 D1 DE60002584 D1 DE 60002584D1 DE 60002584 T DE60002584 T DE 60002584T DE 60002584 T DE60002584 T DE 60002584T DE 60002584 D1 DE60002584 D1 DE 60002584D1
Authority
DE
Germany
Prior art keywords
speech recognition
reference data
recognition
currently valid
utterance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60002584T
Other languages
English (en)
Inventor
Stefan Dobler
Ralph Schleifer
Andreas Kiessling
Raymond Brueckner
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB filed Critical Telefonaktiebolaget LM Ericsson AB
Application granted granted Critical
Publication of DE60002584D1 publication Critical patent/DE60002584D1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting
    • G10L2015/0636Threshold criteria for the updating

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Navigation (AREA)
  • Image Analysis (AREA)
  • Machine Translation (AREA)
  • Electric Clocks (AREA)
  • Circuits Of Receivers In General (AREA)
  • Image Processing (AREA)
DE60002584T 2000-11-07 2000-11-07 Anwendung von Referenzdaten für Spracherkennung Expired - Lifetime DE60002584D1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP00123488A EP1205906B1 (de) 2000-11-07 2000-11-07 Anwendung von Referenzdaten für Spracherkennung

Publications (1)

Publication Number Publication Date
DE60002584D1 true DE60002584D1 (de) 2003-06-12

Family

ID=8170225

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60002584T Expired - Lifetime DE60002584D1 (de) 2000-11-07 2000-11-07 Anwendung von Referenzdaten für Spracherkennung

Country Status (6)

Country Link
US (1) US6961702B2 (de)
EP (1) EP1205906B1 (de)
AT (1) ATE239966T1 (de)
AU (1) AU2002220661A1 (de)
DE (1) DE60002584D1 (de)
WO (1) WO2002039427A1 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7676366B2 (en) * 2003-01-13 2010-03-09 Art Advanced Recognition Technologies Inc. Adaptation of symbols
US20060064177A1 (en) * 2004-09-17 2006-03-23 Nokia Corporation System and method for measuring confusion among words in an adaptive speech recognition system
US7831549B2 (en) * 2004-09-17 2010-11-09 Nokia Corporation Optimization of text-based training set selection for language processing modules
JP4943335B2 (ja) * 2004-09-23 2012-05-30 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 話者に依存しない堅牢な音声認識システム
US7634406B2 (en) * 2004-12-10 2009-12-15 Microsoft Corporation System and method for identifying semantic intent from acoustic information
US7949533B2 (en) 2005-02-04 2011-05-24 Vococollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US7895039B2 (en) 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en) * 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7865362B2 (en) 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
CN101390156B (zh) * 2006-02-27 2011-12-07 日本电气株式会社 标准模式适应装置、标准模式适应方法
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9305565B2 (en) 2012-05-31 2016-04-05 Elwha Llc Methods and systems for speech adaptation data
US9620128B2 (en) 2012-05-31 2017-04-11 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10395672B2 (en) 2012-05-31 2019-08-27 Elwha Llc Methods and systems for managing adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
US20130325449A1 (en) 2012-05-31 2013-12-05 Elwha Llc Speech recognition adaptation systems based on adaptation data
WO2014005055A2 (en) * 2012-06-29 2014-01-03 Elwha Llc Methods and systems for managing adaptation data
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS50155105A (de) * 1974-06-04 1975-12-15
US4720863A (en) * 1982-11-03 1988-01-19 Itt Defense Communications Method and apparatus for text-independent speaker recognition
US5127055A (en) * 1988-12-30 1992-06-30 Kurzweil Applied Intelligence, Inc. Speech recognition apparatus & method having dynamic reference pattern adaptation
JP2852298B2 (ja) * 1990-07-31 1999-01-27 日本電気株式会社 標準パターン適応化方式
WO1995009416A1 (en) * 1993-09-30 1995-04-06 Apple Computer, Inc. Continuous reference adaptation in a pattern recognition system
JP3092491B2 (ja) * 1995-08-30 2000-09-25 日本電気株式会社 記述長最小基準を用いたパターン適応化方式
US5895447A (en) * 1996-02-02 1999-04-20 International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
US5842161A (en) * 1996-06-25 1998-11-24 Lucent Technologies Inc. Telecommunications instrument employing variable criteria speech recognition
US6260013B1 (en) * 1997-03-14 2001-07-10 Lernout & Hauspie Speech Products N.V. Speech recognition system employing discriminatively trained models
US6012027A (en) * 1997-05-27 2000-01-04 Ameritech Corporation Criteria for usable repetitions of an utterance during speech reference enrollment
DE69829187T2 (de) * 1998-12-17 2005-12-29 Sony International (Europe) Gmbh Halbüberwachte Sprecheradaptation
DE69939151D1 (de) * 1999-01-20 2008-09-04 Sony Deutschland Gmbh Sprecheradaption für verwechselbare Wörter
US6253181B1 (en) 1999-01-22 2001-06-26 Matsushita Electric Industrial Co., Ltd. Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers
JP2000221990A (ja) * 1999-01-28 2000-08-11 Ricoh Co Ltd 音声認識装置

Also Published As

Publication number Publication date
AU2002220661A1 (en) 2002-05-21
EP1205906A1 (de) 2002-05-15
US6961702B2 (en) 2005-11-01
EP1205906B1 (de) 2003-05-07
ATE239966T1 (de) 2003-05-15
US20020069053A1 (en) 2002-06-06
WO2002039427A1 (en) 2002-05-16

Similar Documents

Publication Publication Date Title
DE60002584D1 (de) Anwendung von Referenzdaten für Spracherkennung
JP4536323B2 (ja) 音声−音声生成システムおよび方法
ATE349056T1 (de) Sprachunabhängige stimmbasierte benutzeroberfläche
DE60128816D1 (de) Spracherkennungsverfahren mit ersetzungsbefehl
DE69822179D1 (de) Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung
WO2002054033A3 (en) Hierarchical language models for speech recognition
ATE282881T1 (de) Vokoder basierter spracherkenner
WO2004090866A3 (en) Phonetically based speech recognition system and method
ATE257616T1 (de) Spracherkennungsverfahren
DE69937176D1 (de) Segmentierungsverfahren zur Erweiterung des aktiven Vokabulars von Spracherkennern
US20020184030A1 (en) Speech synthesis apparatus and method
EP1022722A3 (de) Sprecheradaptation auf der Basis von Stimm-Eigenvektoren
ATE496363T1 (de) Spracherkennungsvorrichtung mit markierung von erkannten textteilen
DE60209103D1 (de) Texteditierung von erkannter sprache bei gleichzeitiger wiedergabe
WO2001097213A8 (en) Speech recognition using utterance-level confidence estimates
ATE253763T1 (de) Verfahren zur spracherkennung
DE69623364D1 (de) Einrichtung zur Erkennung kontinuierlich gesprochener Sprache
ATE363120T1 (de) Audio-dialogsystem und sprachgesteuertes browsing-verfahren
DE60008893D1 (de) Sprachgesteuertes tragbares Endgerät
WO1996000962A3 (en) Method and device for adapting a speech recognition equipment for dialectal variations in a language
DE602004024172D1 (de) Automatische Erzeugung einer Wortaussprache für die Spracherkennung
ATE253762T1 (de) Wiedergabeverfahren für sprachgesteuerte systeme mit text-basierter sprachsynthese
KR20010087328A (ko) 문법적 제한사항을 갖는 라벨러를 이용한 구두 발언 거절
Cerrato et al. Duration and tonal characteristics of short expressions in Italian
ES2169572T3 (es) Procedimiento de reconocimiento de voz empleando una gramatica.

Legal Events

Date Code Title Description
8332 No legal effect for de