DE60100637D1 - Method for noise adaptation in automatic speech recognition using transformed matrices - Google Patents
- Publication number
- DE60100637D1
- Authority
- DE
- Germany
- Prior art keywords
- speech recognition
- automatic speech
- noise adaptation
- transformed matrices
- matrices
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/551,001 US6529872B1 (en) | 2000-04-18 | 2000-04-18 | Method for noise adaptation in automatic speech recognition using transformed matrices |
US551001 | 2000-04-18 |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60100637D1 (de) | 2003-10-02 |
DE60100637T2 (de) | 2004-06-17 |
Family
ID=24199418
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE60100637T (Expired - Fee Related), DE60100637T2 (de) | 2000-04-18 | 2001-04-18 | Method for noise adaptation in automatic speech recognition using transformed matrices |
Country Status (4)
Country | Link |
---|---|
US (2) | US6529872B1 (de) |
EP (1) | EP1148471B1 (de) |
JP (1) | JP3848845B2 (de) |
DE (1) | DE60100637T2 (de) |
Families Citing this family (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7387253B1 (en) | 1996-09-03 | 2008-06-17 | Hand Held Products, Inc. | Optical reader system comprising local host processor and optical reader |
JP5105682B2 (ja) * | 2000-02-25 | 2012-12-26 | ニュアンス コミュニケーションズ オーストリア ゲーエムベーハー | Speech recognition device with reference transformation means |
US6631348B1 (en) * | 2000-08-08 | 2003-10-07 | Intel Corporation | Dynamic speech recognition pattern switching for enhanced speech recognition accuracy |
US7457750B2 (en) * | 2000-10-13 | 2008-11-25 | At&T Corp. | Systems and methods for dynamic re-configurable speech recognition |
US6876966B1 (en) * | 2000-10-16 | 2005-04-05 | Microsoft Corporation | Pattern recognition training method and apparatus using inserted noise followed by noise reduction |
US7003455B1 (en) | 2000-10-16 | 2006-02-21 | Microsoft Corporation | Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech |
US20020087306A1 (en) * | 2000-12-29 | 2002-07-04 | Lee Victor Wai Leung | Computer-implemented noise normalization method and system |
EP1229516A1 (de) * | 2001-01-26 | 2002-08-07 | Telefonaktiebolaget L M Ericsson (Publ) | Method, device, terminal and system for the automatic recognition of distorted speech data |
US7062433B2 (en) * | 2001-03-14 | 2006-06-13 | Texas Instruments Incorporated | Method of speech recognition with compensation for both channel distortion and background noise |
US6985858B2 (en) * | 2001-03-20 | 2006-01-10 | Microsoft Corporation | Method and apparatus for removing noise from feature vectors |
US6912497B2 (en) * | 2001-03-28 | 2005-06-28 | Texas Instruments Incorporated | Calibration of speech data acquisition path |
US7165028B2 (en) * | 2001-12-12 | 2007-01-16 | Texas Instruments Incorporated | Method of speech recognition resistant to convolutive distortion and additive distortion |
US7117148B2 (en) * | 2002-04-05 | 2006-10-03 | Microsoft Corporation | Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization |
GB2389217A (en) * | 2002-05-27 | 2003-12-03 | Canon Kk | Speech recognition system |
US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
JP4033299B2 (ja) * | 2003-03-12 | 2008-01-16 | 株式会社エヌ・ティ・ティ・ドコモ | Noise adaptation system for speech models, noise adaptation method, and speech recognition noise adaptation program |
JP4333369B2 (ja) * | 2004-01-07 | 2009-09-16 | 株式会社デンソー | Noise removal device, speech recognition device, and car navigation device |
US7729908B2 (en) * | 2005-03-04 | 2010-06-01 | Panasonic Corporation | Joint signal and model based noise matching noise robustness method for automatic speech recognition |
US7729909B2 (en) * | 2005-03-04 | 2010-06-01 | Panasonic Corporation | Block-diagonal covariance joint subspace tying and model compensation for noise robust automatic speech recognition |
US7693713B2 (en) * | 2005-06-17 | 2010-04-06 | Microsoft Corporation | Speech models generated using competitive training, asymmetric training, and data boosting |
US20070033027A1 (en) * | 2005-08-03 | 2007-02-08 | Texas Instruments, Incorporated | Systems and methods employing stochastic bias compensation and bayesian joint additive/convolutive compensation in automatic speech recognition |
US7584097B2 (en) * | 2005-08-03 | 2009-09-01 | Texas Instruments Incorporated | System and method for noisy automatic speech recognition employing joint compensation of additive and convolutive distortions |
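JP2007114413A (ja) * | 2005-10-19 | 2007-05-10 | Toshiba Corp | Speech/non-speech discrimination device, speech segment detection device, speech/non-speech discrimination method, speech segment detection method, speech/non-speech discrimination program, and speech segment detection program |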
US7877255B2 (en) * | 2006-03-31 | 2011-01-25 | Voice Signal Technologies, Inc. | Speech recognition using channel verification |
AU2006343470B2 (en) * | 2006-05-16 | 2012-07-19 | Loquendo S.P.A. | Intersession variability compensation for automatic extraction of information from voice |
JP4282704B2 (ja) * | 2006-09-27 | 2009-06-24 | 株式会社東芝 | Speech segment detection device and program |
US8180637B2 (en) * | 2007-12-03 | 2012-05-15 | Microsoft Corporation | High performance HMM adaptation with joint compensation of additive and convolutive distortions |
JP4950930B2 (ja) * | 2008-04-03 | 2012-06-13 | 株式会社東芝 | Device, method, and program for determining speech/non-speech |
US8214215B2 (en) * | 2008-09-24 | 2012-07-03 | Microsoft Corporation | Phase sensitive model adaptation for noisy speech recognition |
KR101239318B1 (ko) * | 2008-12-22 | 2013-03-05 | 한국전자통신연구원 | Sound quality enhancement device and speech recognition system and method |
US8433564B2 (en) * | 2009-07-02 | 2013-04-30 | Alon Konchitsky | Method for wind noise reduction |
KR20120054845A (ko) * | 2010-11-22 | 2012-05-31 | 삼성전자주식회사 | Speech recognition method for a robot |
JP5966689B2 (ja) * | 2012-07-04 | 2016-08-10 | 日本電気株式会社 | Acoustic model adaptation device, acoustic model adaptation method, and acoustic model adaptation program |
WO2014100236A1 (en) | 2012-12-19 | 2014-06-26 | Visa International Service Association | System and method for voice authentication |
US8949224B2 (en) | 2013-01-15 | 2015-02-03 | Amazon Technologies, Inc. | Efficient query processing using histograms in a columnar database |
CN103903630A (zh) * | 2014-03-18 | 2014-07-02 | 北京捷通华声语音技术有限公司 | Method and device for eliminating sparse noise |
JP6464650B2 (ja) * | 2014-10-03 | 2019-02-06 | 日本電気株式会社 | Speech processing device, speech processing method, and program |
CN106384588B (zh) * | 2016-09-08 | 2019-09-10 | 河海大学 | Joint compensation method for additive noise and short-time reverberation based on vector Taylor series |
JP6767326B2 (ja) * | 2017-09-08 | 2020-10-14 | 日本電信電話株式会社 | Sensor signal processing method, sensor signal processing device, and program |
CN110570845B (zh) * | 2019-08-15 | 2021-10-22 | 武汉理工大学 | Speech recognition method based on domain-invariant features |
US11335329B2 (en) * | 2019-08-28 | 2022-05-17 | Tata Consultancy Services Limited | Method and system for generating synthetic multi-conditioned data sets for robust automatic speech recognition |
CN113223505B (zh) * | 2021-04-30 | 2023-12-08 | 珠海格力电器股份有限公司 | Model training and data processing method, apparatus, electronic device, and storage medium |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5226092A (en) * | 1991-06-28 | 1993-07-06 | Digital Equipment Corporation | Method and apparatus for learning in a neural network |
US6026359A (en) * | 1996-09-20 | 2000-02-15 | Nippon Telegraph And Telephone Corporation | Scheme for model adaptation in pattern recognition based on Taylor expansion |
US6182270B1 (en) * | 1996-12-04 | 2001-01-30 | Lucent Technologies Inc. | Low-displacement rank preconditioners for simplified non-linear analysis of circuits and other devices |
US6154716A (en) * | 1998-07-29 | 2000-11-28 | Lucent Technologies - Inc. | System and method for simulating electronic circuits |
2000
- 2000-04-18: US application US09/551,001, patent US6529872B1 (en), not active (Expired - Lifetime)
- 2000-07-31: US application US09/628,376, patent US6691091B1 (en), not active (Expired - Lifetime)
2001
- 2001-04-18: DE application DE60100637T, patent DE60100637T2 (de), not active (Expired - Fee Related)
- 2001-04-18: EP application EP01303537A, patent EP1148471B1 (de), not active (Expired - Lifetime)
- 2001-04-18: JP application JP2001119722A, patent JP3848845B2 (ja), not active (Expired - Fee Related)
Also Published As
Publication number | Publication date |
---|---|
DE60100637T2 (de) | 2004-06-17 |
US6529872B1 (en) | 2003-03-04 |
JP3848845B2 (ja) | 2006-11-22 |
US6691091B1 (en) | 2004-02-10 |
JP2001356791A (ja) | 2001-12-26 |
EP1148471B1 (de) | 2003-08-27 |
EP1148471A1 (de) | 2001-10-24 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DE60100637D1 (de) | Method for noise adaptation in automatic speech recognition using transformed matrices | |
DE60024506D1 (de) | Method for multi-stage speech recognition using confidence measures | |
DE60316912D1 (de) | Method for speech recognition | |
DE69806645D1 (de) | Method and device for simultaneous speech coding and noise suppression | |
DE602004020572D1 (de) | Method and device for reducing latency in automatic speech recognition using multi-pass partial results | |
DE60120048D1 (de) | Method for selecting an object | |
DE60136901D1 (de) | Method for manufacturing a multifunctional acoustic device | |
DE60309822D1 (de) | Method and device for speech recognition | |
ATE339484T1 (de) | Method for producing Fischer-Tropsch waxes | |
DE602004022130D1 (de) | Method for character recognition | |
DE60108373D1 (de) | Method for detecting emotions in speech signals using speaker identification | |
DE60107308D1 (de) | Method for generating a watermark for audio signals | |
ATE300520T1 (de) | Method for producing amlodipine maleate | |
DE602004023364D1 (de) | Device and method for speech recognition | |
DE60028219D1 (de) | Method for speech recognition | |
DE60212725D1 (de) | Method for automatic speech recognition | |
DE60124884D1 (de) | Method for improving photomask geometry | |
DE60032776D1 (de) | Method for speech recognition | |
DE60134650D1 (de) | Method for producing honeycomb structures | |
DE602004028008D1 (de) | Method for statistical language modeling in speech recognition | |
DE602004014675D1 (de) | Method and device for speech recognition | |
DE60108104D1 (de) | Method for speaker identification | |
DE50109658D1 (de) | Device and method for voice control | |
DE69808339D1 (de) | Method for speech coding in the presence of background noise | |
DE60229315D1 (de) | Method and device for speech recognition | |
Legal Events
Code | Title | Date | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8339 | Ceased/non-payment of the annual fee | ||
8370 | Indication of lapse of patent is to be deleted | ||
8339 | Ceased/non-payment of the annual fee |