ATE362165T1 - Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme - Google Patents

Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme

Info

Publication number
ATE362165T1
ATE362165T1 AT04770165T AT04770165T ATE362165T1 AT E362165 T1 ATE362165 T1 AT E362165T1 AT 04770165 T AT04770165 T AT 04770165T AT 04770165 T AT04770165 T AT 04770165T AT E362165 T1 ATE362165 T1 AT E362165T1
Authority
AT
Austria
Prior art keywords
speech
environmental
misadaptation
adjusting
voice recognition
Prior art date
Application number
AT04770165T
Other languages
English (en)
Inventor
Dieter Geller
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE362165T1 publication Critical patent/ATE362165T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Complex Calculations (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
AT04770165T 2003-10-08 2004-10-05 Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme ATE362165T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP03103727 2003-10-08

Publications (1)

Publication Number Publication Date
ATE362165T1 true ATE362165T1 (de) 2007-06-15

Family

ID=34429460

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04770165T ATE362165T1 (de) 2003-10-08 2004-10-05 Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme

Country Status (7)

Country Link
US (1) US20070124143A1 (de)
EP (1) EP1673761B1 (de)
JP (1) JP2007508577A (de)
CN (1) CN1864202A (de)
AT (1) ATE362165T1 (de)
DE (1) DE602004006429D1 (de)
WO (1) WO2005036525A1 (de)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7725316B2 (en) * 2006-07-05 2010-05-25 General Motors Llc Applying speech recognition adaptation in an automated speech recognition system of a telematics-equipped vehicle
EP2317730B1 (de) * 2009-10-29 2015-08-12 Unify GmbH & Co. KG Verfahren und System zur automatischen Änderung oder Aktualisierung der Konfiguration oder Einstellung eines Kommunikationssystems
GB2482874B (en) 2010-08-16 2013-06-12 Toshiba Res Europ Ltd A speech processing system and method
CN103458782B (zh) * 2011-03-16 2016-02-17 皇家飞利浦有限公司 呼吸困难和水肿症状评估
US8972256B2 (en) * 2011-10-17 2015-03-03 Nuance Communications, Inc. System and method for dynamic noise adaptation for robust automatic speech recognition
US9338580B2 (en) * 2011-10-21 2016-05-10 Qualcomm Incorporated Method and apparatus for packet loss rate-based codec adaptation

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5604839A (en) * 1994-07-29 1997-02-18 Microsoft Corporation Method and system for improving speech recognition through front-end normalization of feature vectors
JP2768274B2 (ja) * 1994-09-08 1998-06-25 日本電気株式会社 音声認識装置
JPH10161692A (ja) * 1996-12-03 1998-06-19 Canon Inc 音声認識装置及び音声認識方法
KR100304666B1 (ko) * 1999-08-28 2001-11-01 윤종용 음성 향상 방법
US7072833B2 (en) * 2000-06-02 2006-07-04 Canon Kabushiki Kaisha Speech processing system

Also Published As

Publication number Publication date
JP2007508577A (ja) 2007-04-05
US20070124143A1 (en) 2007-05-31
EP1673761A1 (de) 2006-06-28
EP1673761B1 (de) 2007-05-09
WO2005036525A1 (en) 2005-04-21
DE602004006429D1 (de) 2007-06-21
CN1864202A (zh) 2006-11-15

Similar Documents

Publication Publication Date Title
Kalinli et al. Noise adaptive training for robust automatic speech recognition
US8731936B2 (en) Energy-efficient unobtrusive identification of a speaker
CN103236260B (zh) 语音识别系统
US7895038B2 (en) Signal enhancement via noise reduction for speech recognition
Reynolds Channel robust speaker verification via feature mapping
Deng et al. Large-vocabulary speech recognition under adverse acoustic environments.
US6721699B2 (en) Method and system of Chinese speech pitch extraction
Cui et al. Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
US20050143997A1 (en) Method and apparatus using spectral addition for speaker recognition
KR20060044629A (ko) 신경 회로망을 이용한 음성 신호 분리 시스템 및 방법과음성 신호 강화 시스템
CN1251194A (zh) 识别系统
CN107910008B (zh) 一种用于个人设备的基于多声学模型的语音识别方法
CN110827844B (zh) 一种基于bp网络的噪声分类方法
Espinoza-Cuadros et al. Speaker de-identification system using autoencoders and adversarial training
CN110268471A (zh) 具有嵌入式降噪的asr的方法和设备
Lin et al. DNN-based feature transformation for speech recognition using throat microphone
ATE362165T1 (de) Anpassung einer umgebungsfehlanpassung für spracherkennungssysteme
JP2003532162A (ja) 雑音に影響された音声の認識のためのロバストなパラメータ
US10522135B2 (en) System and method for segmenting audio files for transcription
Fan et al. Acoustic analysis for speaker identification of whispered speech
CN101853262A (zh) 基于交叉熵的音频指纹快速搜索方法
Rahman et al. Continuous bangla speech segmentation, classification and feature extraction
CN106981287A (zh) 一种提高声纹识别速度的方法及系统
Pardede et al. On the Effect of the Implementation of Human Auditory Systems on Q-Log-Based Features for Robustness of Speech Recognition Against Noise.
Gales Acoustic modelling for speech recognition: Hidden Markov Models and beyond?

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties