DE60018696T2 - Robust speech processing from noisy speech models - Google Patents
Robust speech processing from noisy speech models
- Publication number
- DE60018696T2
- Authority
- DE
- Germany
- Prior art keywords
- signal
- model
- speech
- processing
- models
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
| ID | Concept | Category | Score | Sections | Count |
|---|---|---|---|---|---|
| 238000012545 | processing | Methods | 0.000 | title, claims, description | 52 |
| 238000000034 | method | Methods | 0.000 | claims, description | 41 |
| 230000006870 | function | Effects | 0.000 | claims, description | 16 |
| 238000012549 | training | Methods | 0.000 | claims, description | 12 |
| 230000008569 | process | Effects | 0.000 | claims, description | 7 |
| 238000000926 | separation method | Methods | 0.000 | claims, description | 4 |
| 230000003044 | adaptive effect | Effects | 0.000 | claims | 1 |
| 238000012360 | testing method | Methods | 0.000 | description | 17 |
| 239000013598 | vector | Substances | 0.000 | description | 15 |
| 238000009826 | distribution | Methods | 0.000 | description | 10 |
| 238000002474 | experimental method | Methods | 0.000 | description | 9 |
| 238000013459 | approach | Methods | 0.000 | description | 7 |
| 238000010586 | diagram | Methods | 0.000 | description | 6 |
| 238000010183 | spectrum analysis | Methods | 0.000 | description | 6 |
| 238000007476 | Maximum Likelihood | Methods | 0.000 | description | 5 |
| 238000001514 | detection method | Methods | 0.000 | description | 5 |
| 238000013507 | mapping | Methods | 0.000 | description | 5 |
| 239000000463 | material | Substances | 0.000 | description | 4 |
| 239000000203 | mixture | Substances | 0.000 | description | 4 |
| 230000009466 | transformation | Effects | 0.000 | description | 4 |
| 240000003517 | Elaeocarpus dentatus | Species | 0.000 | description | 3 |
| 230000006866 | deterioration | Effects | 0.000 | description | 3 |
| 238000013518 | transcription | Methods | 0.000 | description | 3 |
| 230000035897 | transcription | Effects | 0.000 | description | 3 |
| 230000015556 | catabolic process | Effects | 0.000 | description | 2 |
| 230000008859 | change | Effects | 0.000 | description | 2 |
| 238000006731 | degradation reaction | Methods | 0.000 | description | 2 |
| 238000000605 | extraction | Methods | 0.000 | description | 2 |
| 238000005259 | measurement | Methods | 0.000 | description | 2 |
| 239000000243 | solution | Substances | 0.000 | description | 2 |
| 230000002411 | adverse | Effects | 0.000 | description | 1 |
| 230000005540 | biological transmission | Effects | 0.000 | description | 1 |
| 238000011161 | development | Methods | 0.000 | description | 1 |
| 230000000694 | effects | Effects | 0.000 | description | 1 |
| 230000006872 | improvement | Effects | 0.000 | description | 1 |
| 239000011159 | matrix material | Substances | 0.000 | description | 1 |
| 238000005457 | optimization | Methods | 0.000 | description | 1 |
| 238000003909 | pattern recognition | Methods | 0.000 | description | 1 |
| 238000003672 | processing method | Methods | 0.000 | description | 1 |
| 238000010079 | rubber tapping | Methods | 0.000 | description | 1 |
| 230000001131 | transforming effect | Effects | 0.000 | description | 1 |
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP99202136 | 1999-07-01 | | |
| EP99202136 | 1999-07-01 | | |
| PCT/EP2000/005963 WO2001003113A1 (en) | 1999-07-01 | 2000-06-27 | Robust speech processing from noisy speech models |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| DE60018696D1 (de) | 2005-04-21 |
| DE60018696T2 (de) | 2006-04-06 |
Family
ID=8240395
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE60018696T, Expired - Lifetime, DE60018696T2 (de) | 1999-07-01 | 2000-06-27 | Robust speech processing from noisy speech models |
Country Status (5)
| Country | Link |
|---|---|
| US (1) | US6865531B1 |
| EP (1) | EP1116219B1 |
| JP (1) | JP4818556B2 |
| DE (1) | DE60018696T2 |
| WO (1) | WO2001003113A1 |
Families Citing this family (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7587321B2 (en) * | 2001-05-08 | 2009-09-08 | Intel Corporation | Method, apparatus, and system for building context dependent models for a large vocabulary continuous speech recognition (LVCSR) system |
| US7174292B2 (en) | 2002-05-20 | 2007-02-06 | Microsoft Corporation | Method of determining uncertainty associated with acoustic distortion-based noise reduction |
| US7103540B2 (en) | 2002-05-20 | 2006-09-05 | Microsoft Corporation | Method of pattern recognition using noise reduction uncertainty |
| US7107210B2 (en) * | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
| WO2004049305A2 (en) * | 2002-11-21 | 2004-06-10 | Scansoft, Inc. | Discriminative training of hidden markov models for continuous speech recognition |
| US20040181409A1 (en) * | 2003-03-11 | 2004-09-16 | Yifan Gong | Speech recognition using model parameters dependent on acoustic environment |
| US8150688B2 (en) * | 2006-01-11 | 2012-04-03 | Nec Corporation | Voice recognizing apparatus, voice recognizing method, voice recognizing program, interference reducing apparatus, interference reducing method, and interference reducing program |
| JP5088701B2 (ja) * | 2006-05-31 | 2012-12-05 | NEC Corporation | Language model learning system, language model learning method, and language model learning program |
| US7885812B2 (en) * | 2006-11-15 | 2011-02-08 | Microsoft Corporation | Joint training of feature extraction and acoustic model parameters for speech recognition |
| US20080243503A1 (en) * | 2007-03-30 | 2008-10-02 | Microsoft Corporation | Minimum divergence based discriminative training for pattern recognition |
| US8275615B2 (en) * | 2007-07-13 | 2012-09-25 | International Business Machines Corporation | Model weighting, selection and hypotheses combination for automatic speech recognition and machine translation |
| US8160878B2 (en) * | 2008-09-16 | 2012-04-17 | Microsoft Corporation | Piecewise-based variable-parameter Hidden Markov Models and the training thereof |
| GB2464093B (en) * | 2008-09-29 | 2011-03-09 | Toshiba Res Europ Ltd | A speech recognition method |
| WO2012140248A2 (en) | 2011-04-13 | 2012-10-18 | Man Oil Group Ag | Liquid products and method for emulsifying oil, and use thereof in the treatment of oil contaminations |
| TWI475557B (zh) * | 2012-10-31 | 2015-03-01 | Acer Inc | Audio processing device |
| CN109346097B (zh) * | 2018-03-30 | 2023-07-14 | Shanghai University | Speech enhancement method based on Kullback-Leibler divergence |
Family Cites Families (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH04275600A (ja) * | 1991-03-01 | 1992-10-01 | Ricoh Co Ltd | Speech recognition device |
| JPH0566790A (ja) * | 1991-09-10 | 1993-03-19 | Oki Electric Ind Co Ltd | Speech recognition method |
| JP3098593B2 (ja) * | 1991-12-12 | 2000-10-16 | Hitachi, Ltd. | Speech recognition device |
| JPH06236196A (ja) * | 1993-02-08 | 1994-08-23 | Nippon Telegr & Teleph Corp <Ntt> | Speech recognition method and device |
| JPH06282297A (ja) * | 1993-03-26 | 1994-10-07 | Idou Tsushin Syst Kaihatsu Kk | Speech coding system |
| JP3102195B2 (ja) * | 1993-04-02 | 2000-10-23 | Mitsubishi Electric Corporation | Speech recognition device |
| DE4325404C2 (de) * | 1993-07-29 | 2002-04-11 | Tenovis Gmbh & Co Kg | Method for determining and classifying types of interfering noise |
| US5727124A (en) | 1994-06-21 | 1998-03-10 | Lucent Technologies, Inc. | Method of and apparatus for signal recognition that compensates for mismatching |
| JPH08110800A (ja) * | 1994-10-12 | 1996-04-30 | Fujitsu Ltd | High-efficiency speech coding system using the A-b-S method |
| JPH08320698A (ja) * | 1995-05-23 | 1996-12-03 | Clarion Co Ltd | Speech recognition device |
| US6067517A (en) * | 1996-02-02 | 2000-05-23 | International Business Machines Corporation | Transcription of speech data with segments from acoustically dissimilar environments |
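| JP3452443B2 (ja) * | 1996-03-25 | 2003-09-29 | Mitsubishi Electric Corporation | Speech recognition device under noise and speech recognition method under noise |
| JPH1063293A (ja) * | 1996-08-23 | 1998-03-06 | Kokusai Denshin Denwa Co Ltd <Kdd> | Telephone speech recognition device |
| JP3250604B2 (ja) * | 1996-09-20 | 2002-01-28 | Nippon Telegraph and Telephone Corporation | Speech recognition method and device |
| JP3587966B2 (ja) * | 1996-09-20 | 2004-11-10 | Nippon Telegraph and Telephone Corporation | Speech recognition method, device, and storage medium therefor |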
| US5960397A (en) * | 1997-05-27 | 1999-09-28 | At&T Corp | System and method of recognizing an acoustic environment to adapt a set of based recognition models to the current acoustic environment for subsequent speech recognition |
| US5970446A (en) * | 1997-11-25 | 1999-10-19 | At&T Corp | Selective noise/channel/coding models and recognizers for automatic speech recognition |
| CN1658282A (zh) * | 1997-12-24 | 2005-08-24 | Mitsubishi Electric Corporation | Speech encoding method, speech decoding method, speech encoding device, and speech decoding device |
| US6389393B1 (en) * | 1998-04-28 | 2002-05-14 | Texas Instruments Incorporated | Method of adapting speech recognition models for speaker, microphone, and noisy environment |
| US6327565B1 (en) * | 1998-04-30 | 2001-12-04 | Matsushita Electric Industrial Co., Ltd. | Speaker and environment adaptation based on eigenvoices |
| US6324510B1 (en) * | 1998-11-06 | 2001-11-27 | Lernout & Hauspie Speech Products N.V. | Method and apparatus of hierarchically organizing an acoustic model for speech recognition and adaptation of the model to unseen domains |
| US6275800B1 (en) * | 1999-02-23 | 2001-08-14 | Motorola, Inc. | Voice recognition system and method |
2000
- 2000-06-27: JP application JP2001508432A, patent JP4818556B2 (ja); status: not active, Expired - Lifetime
- 2000-06-27: US application US09/786,290, patent US6865531B1 (en); status: not active, Expired - Lifetime
- 2000-06-27: WO application PCT/EP2000/005963, patent WO2001003113A1 (en); status: not active, Ceased
- 2000-06-27: EP application EP00951309A, patent EP1116219B1 (en); status: not active, Expired - Lifetime
- 2000-06-27: DE application DE60018696T, patent DE60018696T2 (de); status: not active, Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| JP4818556B2 (ja) | 2011-11-16 |
| EP1116219B1 (en) | 2005-03-16 |
| WO2001003113A1 (en) | 2001-01-11 |
| EP1116219A1 (en) | 2001-07-18 |
| JP2003504653A (ja) | 2003-02-04 |
| US6865531B1 (en) | 2005-03-08 |
| DE60018696D1 (de) | 2005-04-21 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| DE69311303T2 (de) | | Speech training aid for children. |
| DE69022237T2 (de) | | Speech synthesis device based on the phonetic hidden Markov model. |
| DE69514382T2 (de) | | Speech recognition |
| DE69635655T2 (de) | | Speaker-adapted speech recognition |
| DE69816177T2 (de) | | Speech/pause discrimination by unsupervised adaptation of hidden Markov models |
| DE69800006T2 (de) | | Method for performing stochastic pattern matching for speaker verification |
| DE69818231T2 (de) | | Method for discriminative training of speech recognition models |
| DE69616568T2 (de) | | Pattern recognition |
| DE602004012909T2 (de) | | Method and apparatus for modeling a speech recognition system and estimating a word error rate based on a text |
| DE60018696T2 (de) | | Robust speech processing from noisy speech models |
| DE69519297T2 (de) | | Method and apparatus for speech recognition using optimized partial tying of probability mixtures |
| DE69226796T2 (de) | | Temporal decorrelation method for robust speaker recognition |
| DE69713452T2 (de) | | Method and system for selecting acoustic elements at run time for speech synthesis |
| DE69829187T2 (de) | | Semi-supervised speaker adaptation |
| DE69719236T2 (de) | | Method and system for speech recognition using hidden Markov models with continuous output probabilities |
| EP0925579B1 (de) | | Method for adapting a hidden Markov sound model in a speech recognition system |
| DE69225371T2 (de) | | Keyword recognition in continuous text using two hidden Markov models |
| DE69523219T2 (de) | | Adaptive learning method for pattern recognition |
| DE69524994T2 (de) | | Method and apparatus for signal recognition with compensation for mismatches |
| DE69832393T2 (de) | | Speech recognition system for recognizing continuous and isolated speech |
| EP1084490B1 (de) | | Arrangement and method for computer recognition of a predefined vocabulary in spoken language |
| WO1996029695A1 (de) | | Method and arrangement for speech recognition in languages containing compound words |
| EP1273003B1 (de) | | Method and device for determining prosodic markers |
| EP1264301B1 (de) | | Method for recognizing utterances of non-native speakers in a speech processing system |
| DE60318385T2 (de) | | Speech processing device and method, recording medium, and program |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 8364 | No opposition during term of opposition | |