ATE398323T1 - Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache - Google Patents

Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache

Info

Publication number
ATE398323T1
ATE398323T1 AT01923898T AT01923898T ATE398323T1 AT E398323 T1 ATE398323 T1 AT E398323T1 AT 01923898 T AT01923898 T AT 01923898T AT 01923898 T AT01923898 T AT 01923898T AT E398323 T1 ATE398323 T1 AT E398323T1
Authority
AT
Austria
Prior art keywords
segment
correct
incorrect
state sequence
recognition
Prior art date
Application number
AT01923898T
Other languages
English (en)
Inventor
Girija Yegnanarayanan
Vladimir Sejnoha
Ramesh Sarukkai
Original Assignee
Lernout & Hauspie Speechprod
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lernout & Hauspie Speechprod filed Critical Lernout & Hauspie Speechprod
Application granted granted Critical
Publication of ATE398323T1 publication Critical patent/ATE398323T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs
    • G10L15/146Training of HMMs with insufficient amount of training data, e.g. state sharing, tying, deleted interpolation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • G10L15/144Training of HMMs

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Image Analysis (AREA)
  • Character Discrimination (AREA)
  • Document Processing Apparatus (AREA)
  • Pens And Brushes (AREA)
  • Display Devices Of Pinball Game Machines (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
  • Measuring Temperature Or Quantity Of Heat (AREA)
  • Electrophonic Musical Instruments (AREA)
AT01923898T 2000-04-05 2001-04-03 Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache ATE398323T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/543,202 US6490555B1 (en) 1997-03-14 2000-04-05 Discriminatively trained mixture models in continuous speech recognition

Publications (1)

Publication Number Publication Date
ATE398323T1 true ATE398323T1 (de) 2008-07-15

Family

ID=24167006

Family Applications (1)

Application Number Title Priority Date Filing Date
AT01923898T ATE398323T1 (de) 2000-04-05 2001-04-03 Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache

Country Status (7)

Country Link
US (1) US6490555B1 (de)
EP (1) EP1269464B1 (de)
JP (1) JP5134751B2 (de)
AT (1) ATE398323T1 (de)
AU (1) AU2001250579A1 (de)
DE (1) DE60134395D1 (de)
WO (1) WO2001075862A2 (de)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7020845B1 (en) * 1999-11-15 2006-03-28 Gottfurcht Elliot A Navigating internet content on a television using a simplified interface and a remote control
US7003455B1 (en) * 2000-10-16 2006-02-21 Microsoft Corporation Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech
DE10120513C1 (de) 2001-04-26 2003-01-09 Siemens Ag Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache
AUPR579601A0 (en) * 2001-06-19 2001-07-12 Syrinx Speech Systems Pty Limited On-line environmental and speaker model adaptation
US20040150676A1 (en) * 2002-03-25 2004-08-05 Gottfurcht Elliot A. Apparatus and method for simple wide-area network navigation
US7117148B2 (en) 2002-04-05 2006-10-03 Microsoft Corporation Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
FI121583B (fi) * 2002-07-05 2011-01-14 Syslore Oy Symbolijonon etsintä
US7752045B2 (en) * 2002-10-07 2010-07-06 Carnegie Mellon University Systems and methods for comparing speech elements
EP1450350A1 (de) * 2003-02-20 2004-08-25 Sony International (Europe) GmbH Verfahren zur Spracherkennung mittels Attributen
US20040186714A1 (en) * 2003-03-18 2004-09-23 Aurilab, Llc Speech recognition improvement through post-processsing
US20040193412A1 (en) * 2003-03-18 2004-09-30 Aurilab, Llc Non-linear score scrunching for more efficient comparison of hypotheses
US8019602B2 (en) * 2004-01-20 2011-09-13 Microsoft Corporation Automatic speech recognition learning using user corrections
GB0420464D0 (en) 2004-09-14 2004-10-20 Zentian Ltd A speech recognition circuit and method
EP1743897A1 (de) * 2005-07-15 2007-01-17 Gesellschaft für Biotechnologische Forschung mbH Aus Sorangium cellulosum erhältliche biologisch aktive Verbindungen
US20070083373A1 (en) * 2005-10-11 2007-04-12 Matsushita Electric Industrial Co., Ltd. Discriminative training of HMM models using maximum margin estimation for speech recognition
US8301449B2 (en) * 2006-10-16 2012-10-30 Microsoft Corporation Minimum classification error training with growth transformation optimization
US7885812B2 (en) * 2006-11-15 2011-02-08 Microsoft Corporation Joint training of feature extraction and acoustic model parameters for speech recognition
US20080147579A1 (en) * 2006-12-14 2008-06-19 Microsoft Corporation Discriminative training using boosted lasso
US7856351B2 (en) * 2007-01-19 2010-12-21 Microsoft Corporation Integrated speech recognition and semantic classification
US8423364B2 (en) * 2007-02-20 2013-04-16 Microsoft Corporation Generic framework for large-margin MCE training in speech recognition
JP5294086B2 (ja) * 2007-02-28 2013-09-18 日本電気株式会社 重み係数学習システム及び音声認識システム
US20080243503A1 (en) * 2007-03-30 2008-10-02 Microsoft Corporation Minimum divergence based discriminative training for pattern recognition
US8239332B2 (en) 2007-11-20 2012-08-07 Microsoft Corporation Constrained line search optimization for discriminative training of HMMS
US8843370B2 (en) * 2007-11-26 2014-09-23 Nuance Communications, Inc. Joint discriminative training of multiple speech recognizers
US8595004B2 (en) * 2007-12-18 2013-11-26 Nec Corporation Pronunciation variation rule extraction apparatus, pronunciation variation rule extraction method, and pronunciation variation rule extraction program
US9240184B1 (en) * 2012-11-15 2016-01-19 Google Inc. Frame-level combination of deep neural network and gaussian mixture models
US9817881B2 (en) * 2013-10-16 2017-11-14 Cypress Semiconductor Corporation Hidden markov model processing engine
WO2016167779A1 (en) * 2015-04-16 2016-10-20 Mitsubishi Electric Corporation Speech recognition device and rescoring device
CN111354344B (zh) * 2020-03-09 2023-08-22 第四范式(北京)技术有限公司 语音识别模型的训练方法、装置、电子设备及存储介质
CN114387959B (zh) * 2020-10-19 2024-10-11 北京爱语吧科技有限公司 一种基于语音的日语发音评测方法和系统

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4741036A (en) 1985-01-31 1988-04-26 International Business Machines Corporation Determination of phone weights for markov models in a speech recognition system
US5027406A (en) * 1988-12-06 1991-06-25 Dragon Systems, Inc. Method for interactive speech recognition and training
US5388183A (en) 1991-09-30 1995-02-07 Kurzwell Applied Intelligence, Inc. Speech recognition providing multiple outputs
US5280563A (en) 1991-12-20 1994-01-18 Kurzweil Applied Intelligence, Inc. Method of optimizing a composite speech recognition expert
DE69322894T2 (de) 1992-03-02 1999-07-29 At & T Corp., New York, N.Y. Lernverfahren und Gerät zur Spracherkennung
US5832430A (en) * 1994-12-29 1998-11-03 Lucent Technologies, Inc. Devices and methods for speech recognition of vocabulary words with simultaneous detection and verification
US5675706A (en) * 1995-03-31 1997-10-07 Lucent Technologies Inc. Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition
US5737489A (en) * 1995-09-15 1998-04-07 Lucent Technologies Inc. Discriminative utterance verification for connected digits recognition
US5895447A (en) * 1996-02-02 1999-04-20 International Business Machines Corporation Speech recognition using thresholded speaker class model selection or model adaptation
US5991720A (en) * 1996-05-06 1999-11-23 Matsushita Electric Industrial Co., Ltd. Speech recognition system employing multiple grammar networks
JPH10207485A (ja) * 1997-01-22 1998-08-07 Toshiba Corp 音声認識装置及び話者適応方法
US6122613A (en) * 1997-01-30 2000-09-19 Dragon Systems, Inc. Speech recognition using multiple recognizers (selectively) applied to the same input sample
US6292778B1 (en) * 1998-10-30 2001-09-18 Lucent Technologies Inc. Task-independent utterance verification with subword-based minimum verification error training
US7216079B1 (en) 1999-11-02 2007-05-08 Speechworks International, Inc. Method and apparatus for discriminative training of acoustic models of a speech recognition system

Also Published As

Publication number Publication date
DE60134395D1 (de) 2008-07-24
JP5134751B2 (ja) 2013-01-30
WO2001075862A2 (en) 2001-10-11
WO2001075862A3 (en) 2002-01-10
EP1269464B1 (de) 2008-06-11
US6490555B1 (en) 2002-12-03
EP1269464A2 (de) 2003-01-02
AU2001250579A1 (en) 2001-10-15
JP2004512544A (ja) 2004-04-22

Similar Documents

Publication Publication Date Title
ATE398323T1 (de) Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache
US8498857B2 (en) System and method for rapid prototyping of existing speech recognition solutions in different languages
Wang et al. Towards automatic assessment of spontaneous spoken English
WO2009025356A1 (ja) 音声認識装置および音声認識方法
CA2177638A1 (en) Utterance verification using word based minimum verification error training for recognizing a keyword string
WO2007034478A3 (en) System and method for correcting speech
CN101650942A (zh) 基于韵律短语的韵律结构生成方法
CN105261246A (zh) 一种基于大数据挖掘技术的英语口语纠错系统
Gallwitz et al. Integrated recognition of words and prosodic phrase boundaries
EP1460615A1 (de) Sprachverarbeitungseinrichtung und -verfahren, aufzeichnungsmedium und programm
US20020087317A1 (en) Computer-implemented dynamic pronunciation method and system
DE69916297D1 (de) Zwischen-wörter verbindung phonemische modelle
CN113053414B (zh) 一种发音评测方法及装置
Anzai et al. Recognition of utterances with grammatical mistakes based on optimization of language model towards interactive CALL systems
Sawada et al. Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014
Rayner et al. Supervised learning of response grammars in a spoken call system.
Yamashita et al. Automatic scoring for prosodic proficiency of English sentences spoken by Japanese based on utterance comparison
KR100308274B1 (ko) 가변어휘인식시스템
Svendsen Pronunciation modeling for speech technology
Hernández-Mena et al. Creating a grammar-based speech recognition parser for Mexican Spanish using HTK, compatible with CMU Sphinx-III system
Hagen et al. Data driven subword unit modeling for speech recognition and its application to interactive reading tutors.
Vicsi Thinking about the present and future of the complex speech recognition
Deville et al. Automatic detection and correction of pronunciation errors for foreign language learners: the demosthenes application.
Nouza et al. Methods and application of phonetic label alignment in speech processing tasks
JPH08314490A (ja) ワードスポッティング型音声認識方法と装置

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties