ATE362165T1 - Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme - Google Patents
Anpassung einer umgebungsfehlanpassung für spracherkennungssystemeInfo
- Publication number
- ATE362165T1 ATE362165T1 AT04770165T AT04770165T ATE362165T1 AT E362165 T1 ATE362165 T1 AT E362165T1 AT 04770165 T AT04770165 T AT 04770165T AT 04770165 T AT04770165 T AT 04770165T AT E362165 T1 ATE362165 T1 AT E362165T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- environmental
- misadaptation
- adjusting
- voice recognition
- Prior art date
Links
- 230000007613 environmental effect Effects 0.000 title abstract 4
- 238000000034 method Methods 0.000 abstract 2
- 239000013598 vector Substances 0.000 abstract 2
- 230000006978 adaptation Effects 0.000 abstract 1
- 238000004590 computer program Methods 0.000 abstract 1
- 238000001228 spectrum Methods 0.000 abstract 1
- 230000009466 transformation Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Complex Calculations (AREA)
- Telephonic Communication Services (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03103727 | 2003-10-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE362165T1 true ATE362165T1 (de) | 2007-06-15 |
Family
ID=34429460
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AT04770165T ATE362165T1 (de) | 2003-10-08 | 2004-10-05 | Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme |
Country Status (7)
Country | Link |
---|---|
US (1) | US20070124143A1 (de) |
EP (1) | EP1673761B1 (de) |
JP (1) | JP2007508577A (de) |
CN (1) | CN1864202A (de) |
AT (1) | ATE362165T1 (de) |
DE (1) | DE602004006429D1 (de) |
WO (1) | WO2005036525A1 (de) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7725316B2 (en) * | 2006-07-05 | 2010-05-25 | General Motors Llc | Applying speech recognition adaptation in an automated speech recognition system of a telematics-equipped vehicle |
EP2317730B1 (de) * | 2009-10-29 | 2015-08-12 | Unify GmbH & Co. KG | Verfahren und System zur automatischen Änderung oder Aktualisierung der Konfiguration oder Einstellung eines Kommunikationssystems |
GB2482874B (en) | 2010-08-16 | 2013-06-12 | Toshiba Res Europ Ltd | A speech processing system and method |
CN103458782B (zh) * | 2011-03-16 | 2016-02-17 | 皇家飞利浦有限公司 | 呼吸困难和水肿症状评估 |
US8972256B2 (en) * | 2011-10-17 | 2015-03-03 | Nuance Communications, Inc. | System and method for dynamic noise adaptation for robust automatic speech recognition |
US9338580B2 (en) * | 2011-10-21 | 2016-05-10 | Qualcomm Incorporated | Method and apparatus for packet loss rate-based codec adaptation |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5604839A (en) * | 1994-07-29 | 1997-02-18 | Microsoft Corporation | Method and system for improving speech recognition through front-end normalization of feature vectors |
JP2768274B2 (ja) * | 1994-09-08 | 1998-06-25 | 日本電気株式会社 | 音声認識装置 |
JPH10161692A (ja) * | 1996-12-03 | 1998-06-19 | Canon Inc | 音声認識装置及び音声認識方法 |
KR100304666B1 (ko) * | 1999-08-28 | 2001-11-01 | 윤종용 | 음성 향상 방법 |
US7072833B2 (en) * | 2000-06-02 | 2006-07-04 | Canon Kabushiki Kaisha | Speech processing system |
-
2004
- 2004-10-05 DE DE602004006429T patent/DE602004006429D1/de active Active
- 2004-10-05 CN CN200480029513.XA patent/CN1864202A/zh active Pending
- 2004-10-05 JP JP2006530972A patent/JP2007508577A/ja not_active Withdrawn
- 2004-10-05 AT AT04770165T patent/ATE362165T1/de not_active IP Right Cessation
- 2004-10-05 WO PCT/IB2004/051969 patent/WO2005036525A1/en active IP Right Grant
- 2004-10-05 US US10/574,447 patent/US20070124143A1/en not_active Abandoned
- 2004-10-05 EP EP04770165A patent/EP1673761B1/de not_active Not-in-force
Also Published As
Publication number | Publication date |
---|---|
JP2007508577A (ja) | 2007-04-05 |
US20070124143A1 (en) | 2007-05-31 |
EP1673761A1 (de) | 2006-06-28 |
EP1673761B1 (de) | 2007-05-09 |
WO2005036525A1 (en) | 2005-04-21 |
DE602004006429D1 (de) | 2007-06-21 |
CN1864202A (zh) | 2006-11-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kalinli et al. | Noise adaptive training for robust automatic speech recognition | |
US8731936B2 (en) | Energy-efficient unobtrusive identification of a speaker | |
CN103236260B (zh) | 语音识别系统 | |
US7895038B2 (en) | Signal enhancement via noise reduction for speech recognition | |
Reynolds | Channel robust speaker verification via feature mapping | |
Deng et al. | Large-vocabulary speech recognition under adverse acoustic environments. | |
US6721699B2 (en) | Method and system of Chinese speech pitch extraction | |
Cui et al. | Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR | |
US20050143997A1 (en) | Method and apparatus using spectral addition for speaker recognition | |
KR20060044629A (ko) | 신경 회로망을 이용한 음성 신호 분리 시스템 및 방법과음성 신호 강화 시스템 | |
CN1251194A (zh) | 识别系统 | |
CN107910008B (zh) | 一种用于个人设备的基于多声学模型的语音识别方法 | |
CN110827844B (zh) | 一种基于bp网络的噪声分类方法 | |
Espinoza-Cuadros et al. | Speaker de-identification system using autoencoders and adversarial training | |
CN110268471A (zh) | 具有嵌入式降噪的asr的方法和设备 | |
Lin et al. | DNN-based feature transformation for speech recognition using throat microphone | |
ATE362165T1 (de) | Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme | |
JP2003532162A (ja) | 雑音に影響された音声の認識のためのロバストなパラメータ | |
US10522135B2 (en) | System and method for segmenting audio files for transcription | |
Fan et al. | Acoustic analysis for speaker identification of whispered speech | |
CN101853262A (zh) | 基于交叉熵的音频指纹快速搜索方法 | |
Rahman et al. | Continuous bangla speech segmentation, classification and feature extraction | |
CN106981287A (zh) | 一种提高声纹识别速度的方法及系统 | |
Pardede et al. | On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise. | |
Gales | Acoustic modelling for speech recognition: Hidden Markov Models and beyond? |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
RER | Ceased as to paragraph 5 lit. 3 law introducing patent treaties |