ATE239966T1 - Anwendung von referenzdaten für spracherkennung - Google Patents

Anwendung von referenzdaten für spracherkennung

Info

Publication number
ATE239966T1
ATE239966T1 AT00123488T AT00123488T ATE239966T1 AT E239966 T1 ATE239966 T1 AT E239966T1 AT 00123488 T AT00123488 T AT 00123488T AT 00123488 T AT00123488 T AT 00123488T AT E239966 T1 ATE239966 T1 AT E239966T1
Authority
AT
Austria
Prior art keywords
speech recognition
application
reference data
recognition
currently valid
Prior art date
Application number
AT00123488T
Other languages
English (en)
Inventor
Stefan Dobler
Ralph Schleifer
Andreas Kiessling
Raymond Brueckner
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Application granted granted Critical
Publication of ATE239966T1 publication Critical patent/ATE239966T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting
    • G10L2015/0636Threshold criteria for the updating

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Artificial Intelligence (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Image Analysis (AREA)
  • Navigation (AREA)
  • Electric Clocks (AREA)
  • Machine Translation (AREA)
  • Image Processing (AREA)
  • Circuits Of Receivers In General (AREA)
AT00123488T 2000-11-07 2000-11-07 Anwendung von referenzdaten für spracherkennung ATE239966T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP00123488A EP1205906B1 (de) 2000-11-07 2000-11-07 Anwendung von Referenzdaten für Spracherkennung

Publications (1)

Publication Number Publication Date
ATE239966T1 true ATE239966T1 (de) 2003-05-15

Family

ID=8170225

Family Applications (1)

Application Number Title Priority Date Filing Date
AT00123488T ATE239966T1 (de) 2000-11-07 2000-11-07 Anwendung von referenzdaten für spracherkennung

Country Status (6)

Country Link
US (1) US6961702B2 (de)
EP (1) EP1205906B1 (de)
AT (1) ATE239966T1 (de)
AU (1) AU2002220661A1 (de)
DE (1) DE60002584D1 (de)
WO (1) WO2002039427A1 (de)

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7676366B2 (en) * 2003-01-13 2010-03-09 Art Advanced Recognition Technologies Inc. Adaptation of symbols
US7831549B2 (en) * 2004-09-17 2010-11-09 Nokia Corporation Optimization of text-based training set selection for language processing modules
US20060064177A1 (en) * 2004-09-17 2006-03-23 Nokia Corporation System and method for measuring confusion among words in an adaptive speech recognition system
EP1794746A2 (de) * 2004-09-23 2007-06-13 Koninklijke Philips Electronics N.V. Verfahren zum trainieren eines robusten sprecherunabhängigen spracherkennungssystems mit sprecherabhängigen ausdrücken und robustes sprecherabhängiges spracherkennungssystem
US7634406B2 (en) * 2004-12-10 2009-12-15 Microsoft Corporation System and method for identifying semantic intent from acoustic information
US7895039B2 (en) 2005-02-04 2011-02-22 Vocollect, Inc. Methods and systems for optimizing model adaptation for a speech recognition system
US7949533B2 (en) 2005-02-04 2011-05-24 Vococollect, Inc. Methods and systems for assessing and improving the performance of a speech recognition system
US8200495B2 (en) 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7827032B2 (en) 2005-02-04 2010-11-02 Vocollect, Inc. Methods and systems for adapting a model for a speech recognition system
US7865362B2 (en) 2005-02-04 2011-01-04 Vocollect, Inc. Method and system for considering information about an expected response when performing speech recognition
WO2007105409A1 (ja) * 2006-02-27 2007-09-20 Nec Corporation 標準パタン適応装置、標準パタン適応方法および標準パタン適応プログラム
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US9305565B2 (en) 2012-05-31 2016-04-05 Elwha Llc Methods and systems for speech adaptation data
US10395672B2 (en) 2012-05-31 2019-08-27 Elwha Llc Methods and systems for managing adaptation data
US9495966B2 (en) 2012-05-31 2016-11-15 Elwha Llc Speech recognition adaptation systems based on adaptation data
US9620128B2 (en) 2012-05-31 2017-04-11 Elwha Llc Speech recognition adaptation systems based on adaptation data
US20130325449A1 (en) 2012-05-31 2013-12-05 Elwha Llc Speech recognition adaptation systems based on adaptation data
US10431235B2 (en) 2012-05-31 2019-10-01 Elwha Llc Methods and systems for speech adaptation data
CN104412322B (zh) * 2012-06-29 2019-01-18 埃尔瓦有限公司 用于管理适应数据的方法和系统
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS50155105A (de) * 1974-06-04 1975-12-15
US4720863A (en) * 1982-11-03 1988-01-19 Itt Defense Communications Method and apparatus for text-independent speaker recognition
US5127055A (en) * 1988-12-30 1992-06-30 Kurzweil Applied Intelligence, Inc. Speech recognition apparatus & method having dynamic reference pattern adaptation
JP2852298B2 (ja) * 1990-07-31 1999-01-27 日本電気株式会社 標準パターン適応化方式
WO1995009416A1 (en) * 1993-09-30 1995-04-06 Apple Computer, Inc. Continuous reference adaptation in a pattern recognition system
JP3092491B2 (ja) * 1995-08-30 2000-09-25 日本電気株式会社 記述長最小基準を用いたパターン適応化方式
US5895447A (en) * 1996-02-02 1999-04-20 International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
US5842161A (en) * 1996-06-25 1998-11-24 Lucent Technologies Inc. Telecommunications instrument employing variable criteria speech recognition
US6260013B1 (en) * 1997-03-14 2001-07-10 Lernout & Hauspie Speech Products N.V. Speech recognition system employing discriminatively trained models
US6012027A (en) * 1997-05-27 2000-01-04 Ameritech Corporation Criteria for usable repetitions of an utterance during speech reference enrollment
EP1011094B1 (de) * 1998-12-17 2005-03-02 Sony International (Europe) GmbH Halbüberwachte Sprecheradaptation
DE69939151D1 (de) * 1999-01-20 2008-09-04 Sony Deutschland Gmbh Sprecheradaption für verwechselbare Wörter
US6253181B1 (en) 1999-01-22 2001-06-26 Matsushita Electric Industrial Co., Ltd. Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers
JP2000221990A (ja) * 1999-01-28 2000-08-11 Ricoh Co Ltd 音声認識装置

Also Published As

Publication number Publication date
WO2002039427A1 (en) 2002-05-16
AU2002220661A1 (en) 2002-05-21
EP1205906B1 (de) 2003-05-07
DE60002584D1 (de) 2003-06-12
US6961702B2 (en) 2005-11-01
US20020069053A1 (en) 2002-06-06
EP1205906A1 (de) 2002-05-15

Similar Documents

Publication Publication Date Title
ATE239966T1 (de) Anwendung von referenzdaten für spracherkennung
US7461001B2 (en) Speech-to-speech generation system and method
US20240112678A1 (en) Voice dialogue system and method of understanding utterance intention
ATE349056T1 (de) Sprachunabhängige stimmbasierte benutzeroberfläche
FI20145179A7 (fi) Menetelmä ja laitteisto monitasoiseksi hajautetuksi puheentunnistukseksi
ATE364219T1 (de) Spracherkennungsverfahren mit ersetzungsbefehl
DE69822179D1 (de) Verfahren zum lernen von mustern für die sprach- oder die sprechererkennung
ATE282881T1 (de) Vokoder basierter spracherkenner
US20020184031A1 (en) Speech system barge-in control
ATE374421T1 (de) Segmentierungsverfahren zur erweiterung des aktiven vokabulars von spracherkennern
US20020184030A1 (en) Speech synthesis apparatus and method
WO2002054033A3 (en) Hierarchical language models for speech recognition
WO2004090866A3 (en) Phonetically based speech recognition system and method
WO2003019528A1 (en) Intonation generating method, speech synthesizing device by the method, and voice server
EP0847179A3 (de) System und Verfahren mit Sprachschnittstelle zu hyperlink Informationen
ATE253763T1 (de) Verfahren zur spracherkennung
DE69623364D1 (de) Einrichtung zur Erkennung kontinuierlich gesprochener Sprache
DE60008893D1 (de) Sprachgesteuertes tragbares Endgerät
DE602004024172D1 (de) Automatische Erzeugung einer Wortaussprache für die Spracherkennung
WO1996000962A3 (en) Method and device for adapting a speech recognition equipment for dialectal variations in a language
ATE253762T1 (de) Wiedergabeverfahren für sprachgesteuerte systeme mit text-basierter sprachsynthese
KR20010087328A (ko) 문법적 제한사항을 갖는 라벨러를 이용한 구두 발언 거절
WO2004008433A3 (en) System and method for mandarin chinese speech recognition using an optimized phone set
ES2169572T3 (es) Procedimiento de reconocimiento de voz empleando una gramatica.
JP3605011B2 (ja) 音声認識方法

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties