ATE398323T1 - Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache - Google Patents
Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender spracheInfo
- Publication number
- ATE398323T1 ATE398323T1 AT01923898T AT01923898T ATE398323T1 AT E398323 T1 ATE398323 T1 AT E398323T1 AT 01923898 T AT01923898 T AT 01923898T AT 01923898 T AT01923898 T AT 01923898T AT E398323 T1 ATE398323 T1 AT E398323T1
- Authority
- AT
- Austria
- Prior art keywords
- segment
- correct
- incorrect
- state sequence
- recognition
- Prior art date
Links
- 238000002864 sequence alignment Methods 0.000 abstract 3
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
- G10L15/146—Training of HMMs with insufficient amount of training data, e.g. state sharing, tying, deleted interpolation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
- Image Analysis (AREA)
- Character Discrimination (AREA)
- Document Processing Apparatus (AREA)
- Pens And Brushes (AREA)
- Display Devices Of Pinball Game Machines (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Measuring Temperature Or Quantity Of Heat (AREA)
- Electrophonic Musical Instruments (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/543,202 US6490555B1 (en) | 1997-03-14 | 2000-04-05 | Discriminatively trained mixture models in continuous speech recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE398323T1 true ATE398323T1 (de) | 2008-07-15 |
Family
ID=24167006
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT01923898T ATE398323T1 (de) | 2000-04-05 | 2001-04-03 | Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache |
Country Status (7)
Country | Link |
---|---|
US (1) | US6490555B1 (de) |
EP (1) | EP1269464B1 (de) |
JP (1) | JP5134751B2 (de) |
AT (1) | ATE398323T1 (de) |
AU (1) | AU2001250579A1 (de) |
DE (1) | DE60134395D1 (de) |
WO (1) | WO2001075862A2 (de) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7020845B1 (en) * | 1999-11-15 | 2006-03-28 | Gottfurcht Elliot A | Navigating internet content on a television using a simplified interface and a remote control |
US7003455B1 (en) * | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech |
DE10120513C1 (de) | 2001-04-26 | 2003-01-09 | Siemens Ag | Verfahren zur Bestimmung einer Folge von Lautbausteinen zum Synthetisieren eines Sprachsignals einer tonalen Sprache |
AUPR579601A0 (en) * | 2001-06-19 | 2001-07-12 | Syrinx Speech Systems Pty Limited | On-line environmental and speaker model adaptation |
US20040150676A1 (en) * | 2002-03-25 | 2004-08-05 | Gottfurcht Elliot A. | Apparatus and method for simple wide-area network navigation |
US7117148B2 (en) | 2002-04-05 | 2006-10-03 | Microsoft Corporation | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
FI121583B (fi) * | 2002-07-05 | 2011-01-14 | Syslore Oy | Symbolijonon etsintä |
US7752045B2 (en) * | 2002-10-07 | 2010-07-06 | Carnegie Mellon University | Systems and methods for comparing speech elements |
EP1450350A1 (de) * | 2003-02-20 | 2004-08-25 | Sony International (Europe) GmbH | Verfahren zur Spracherkennung mittels Attributen |
US20040186714A1 (en) * | 2003-03-18 | 2004-09-23 | Aurilab, Llc | Speech recognition improvement through post-processsing |
US20040193412A1 (en) * | 2003-03-18 | 2004-09-30 | Aurilab, Llc | Non-linear score scrunching for more efficient comparison of hypotheses |
US8019602B2 (en) * | 2004-01-20 | 2011-09-13 | Microsoft Corporation | Automatic speech recognition learning using user corrections |
GB0420464D0 (en) | 2004-09-14 | 2004-10-20 | Zentian Ltd | A speech recognition circuit and method |
EP1743897A1 (de) * | 2005-07-15 | 2007-01-17 | Gesellschaft für Biotechnologische Forschung mbH | Aus Sorangium cellulosum erhältliche biologisch aktive Verbindungen |
US20070083373A1 (en) * | 2005-10-11 | 2007-04-12 | Matsushita Electric Industrial Co., Ltd. | Discriminative training of HMM models using maximum margin estimation for speech recognition |
US8301449B2 (en) * | 2006-10-16 | 2012-10-30 | Microsoft Corporation | Minimum classification error training with growth transformation optimization |
US7885812B2 (en) * | 2006-11-15 | 2011-02-08 | Microsoft Corporation | Joint training of feature extraction and acoustic model parameters for speech recognition |
US20080147579A1 (en) * | 2006-12-14 | 2008-06-19 | Microsoft Corporation | Discriminative training using boosted lasso |
US7856351B2 (en) * | 2007-01-19 | 2010-12-21 | Microsoft Corporation | Integrated speech recognition and semantic classification |
US8423364B2 (en) * | 2007-02-20 | 2013-04-16 | Microsoft Corporation | Generic framework for large-margin MCE training in speech recognition |
JP5294086B2 (ja) * | 2007-02-28 | 2013-09-18 | 日本電気株式会社 | 重み係数学習システム及び音声認識システム |
US20080243503A1 (en) * | 2007-03-30 | 2008-10-02 | Microsoft Corporation | Minimum divergence based discriminative training for pattern recognition |
US8239332B2 (en) | 2007-11-20 | 2012-08-07 | Microsoft Corporation | Constrained line search optimization for discriminative training of HMMS |
US8843370B2 (en) * | 2007-11-26 | 2014-09-23 | Nuance Communications, Inc. | Joint discriminative training of multiple speech recognizers |
US8595004B2 (en) * | 2007-12-18 | 2013-11-26 | Nec Corporation | Pronunciation variation rule extraction apparatus, pronunciation variation rule extraction method, and pronunciation variation rule extraction program |
US9240184B1 (en) * | 2012-11-15 | 2016-01-19 | Google Inc. | Frame-level combination of deep neural network and gaussian mixture models |
US9817881B2 (en) * | 2013-10-16 | 2017-11-14 | Cypress Semiconductor Corporation | Hidden markov model processing engine |
WO2016167779A1 (en) * | 2015-04-16 | 2016-10-20 | Mitsubishi Electric Corporation | Speech recognition device and rescoring device |
CN111354344B (zh) * | 2020-03-09 | 2023-08-22 | 第四范式(北京)技术有限公司 | 语音识别模型的训练方法、装置、电子设备及存储介质 |
CN114387959B (zh) * | 2020-10-19 | 2024-10-11 | 北京爱语吧科技有限公司 | 一种基于语音的日语发音评测方法和系统 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4741036A (en) | 1985-01-31 | 1988-04-26 | International Business Machines Corporation | Determination of phone weights for markov models in a speech recognition system |
US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
US5388183A (en) | 1991-09-30 | 1995-02-07 | Kurzwell Applied Intelligence, Inc. | Speech recognition providing multiple outputs |
US5280563A (en) | 1991-12-20 | 1994-01-18 | Kurzweil Applied Intelligence, Inc. | Method of optimizing a composite speech recognition expert |
DE69322894T2 (de) | 1992-03-02 | 1999-07-29 | At & T Corp., New York, N.Y. | Lernverfahren und Gerät zur Spracherkennung |
US5832430A (en) * | 1994-12-29 | 1998-11-03 | Lucent Technologies, Inc. | Devices and methods for speech recognition of vocabulary words with simultaneous detection and verification |
US5675706A (en) * | 1995-03-31 | 1997-10-07 | Lucent Technologies Inc. | Vocabulary independent discriminative utterance verification for non-keyword rejection in subword based speech recognition |
US5737489A (en) * | 1995-09-15 | 1998-04-07 | Lucent Technologies Inc. | Discriminative utterance verification for connected digits recognition |
US5895447A (en) * | 1996-02-02 | 1999-04-20 | International Business Machines Corporation | Speech recognition using thresholded speaker class model selection or model adaptation |
US5991720A (en) * | 1996-05-06 | 1999-11-23 | Matsushita Electric Industrial Co., Ltd. | Speech recognition system employing multiple grammar networks |
JPH10207485A (ja) * | 1997-01-22 | 1998-08-07 | Toshiba Corp | 音声認識装置及び話者適応方法 |
US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
US6292778B1 (en) * | 1998-10-30 | 2001-09-18 | Lucent Technologies Inc. | Task-independent utterance verification with subword-based minimum verification error training |
US7216079B1 (en) | 1999-11-02 | 2007-05-08 | Speechworks International, Inc. | Method and apparatus for discriminative training of acoustic models of a speech recognition system |
-
2000
- 2000-04-05 US US09/543,202 patent/US6490555B1/en not_active Expired - Lifetime
-
2001
- 2001-04-03 AT AT01923898T patent/ATE398323T1/de not_active IP Right Cessation
- 2001-04-03 DE DE60134395T patent/DE60134395D1/de not_active Expired - Lifetime
- 2001-04-03 AU AU2001250579A patent/AU2001250579A1/en not_active Abandoned
- 2001-04-03 JP JP2001573458A patent/JP5134751B2/ja not_active Expired - Fee Related
- 2001-04-03 EP EP01923898A patent/EP1269464B1/de not_active Expired - Lifetime
- 2001-04-03 WO PCT/IB2001/000726 patent/WO2001075862A2/en active IP Right Grant
Also Published As
Publication number | Publication date |
---|---|
DE60134395D1 (de) | 2008-07-24 |
JP5134751B2 (ja) | 2013-01-30 |
WO2001075862A2 (en) | 2001-10-11 |
WO2001075862A3 (en) | 2002-01-10 |
EP1269464B1 (de) | 2008-06-11 |
US6490555B1 (en) | 2002-12-03 |
EP1269464A2 (de) | 2003-01-02 |
AU2001250579A1 (en) | 2001-10-15 |
JP2004512544A (ja) | 2004-04-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE398323T1 (de) | Diskriminatives trainieren von hidden markov modellen für die erkennung fliessender sprache | |
US8498857B2 (en) | System and method for rapid prototyping of existing speech recognition solutions in different languages | |
Wang et al. | Towards automatic assessment of spontaneous spoken English | |
WO2009025356A1 (ja) | 音声認識装置および音声認識方法 | |
CA2177638A1 (en) | Utterance verification using word based minimum verification error training for recognizing a keyword string | |
WO2007034478A3 (en) | System and method for correcting speech | |
CN101650942A (zh) | 基于韵律短语的韵律结构生成方法 | |
CN105261246A (zh) | 一种基于大数据挖掘技术的英语口语纠错系统 | |
Gallwitz et al. | Integrated recognition of words and prosodic phrase boundaries | |
EP1460615A1 (de) | Sprachverarbeitungseinrichtung und -verfahren, aufzeichnungsmedium und programm | |
US20020087317A1 (en) | Computer-implemented dynamic pronunciation method and system | |
DE69916297D1 (de) | Zwischen-wörter verbindung phonemische modelle | |
CN113053414B (zh) | 一种发音评测方法及装置 | |
Anzai et al. | Recognition of utterances with grammatical mistakes based on optimization of language model towards interactive CALL systems | |
Sawada et al. | Overview of NITECH HMM-based text-to-speech system for Blizzard Challenge 2014 | |
Rayner et al. | Supervised learning of response grammars in a spoken call system. | |
Yamashita et al. | Automatic scoring for prosodic proficiency of English sentences spoken by Japanese based on utterance comparison | |
KR100308274B1 (ko) | 가변어휘인식시스템 | |
Svendsen | Pronunciation modeling for speech technology | |
Hernández-Mena et al. | Creating a grammar-based speech recognition parser for Mexican Spanish using HTK, compatible with CMU Sphinx-III system | |
Hagen et al. | Data driven subword unit modeling for speech recognition and its application to interactive reading tutors. | |
Vicsi | Thinking about the present and future of the complex speech recognition | |
Deville et al. | Automatic detection and correction of pronunciation errors for foreign language learners: the demosthenes application. | |
Nouza et al. | Methods and application of phonetic label alignment in speech processing tasks | |
JPH08314490A (ja) | ワードスポッティング型音声認識方法と装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |