KR100574594B1 - System and method for noise-compensated speech recognition - Google Patents

System and method for noise-compensated speech recognition

Info

Publication number
KR100574594B1
Authority
KR
South Korea
Prior art keywords
noise
speech
input signal
speech recognition
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
KR1020007008543A
Other languages
English (en)
Korean (ko)
Other versions
KR20010040669A (ko)
Inventor
Gilbert C. Sih
Ning Bi
Original Assignee
Qualcomm Incorporated
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Incorporated
Publication of KR20010040669A
Application granted
Publication of KR100574594B1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Artificial Intelligence (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
  • Noise Elimination (AREA)
KR1020007008543A 1998-02-04 1999-02-03 System and method for noise-compensated speech recognition Expired - Lifetime KR100574594B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/018,257 US6381569B1 (en) 1998-02-04 1998-02-04 Noise-compensated speech recognition templates
US09/018,257 1998-02-04

Publications (2)

Publication Number Publication Date
KR20010040669A (ko) 2001-05-15
KR100574594B1 (ko) 2006-04-28

Family

ID=21787025

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020007008543A Expired - Lifetime KR100574594B1 (ko) 1998-02-04 1999-02-03 System and method for noise-compensated speech recognition

Country Status (8)

Country Link
US (2) US6381569B1 (en)
EP (1) EP1058925B1 (en)
JP (1) JP4750271B2 (en)
KR (1) KR100574594B1 (en)
CN (1) CN1228761C (en)
AU (1) AU2577499A (en)
DE (1) DE69916255T2 (en)
WO (1) WO1999040571A1 (en)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6744887B1 (en) * 1999-10-05 2004-06-01 Zhone Technologies, Inc. Acoustic echo processing system
JP4590692B2 (ja) * 2000-06-28 2010-12-01 Panasonic Corporation Acoustic model creation apparatus and method
US6631348B1 (en) * 2000-08-08 2003-10-07 Intel Corporation Dynamic speech recognition pattern switching for enhanced speech recognition accuracy
JP4244514B2 (ja) * 2000-10-23 2009-03-25 Seiko Epson Corporation Speech recognition method and speech recognition apparatus
US6999926B2 (en) * 2000-11-16 2006-02-14 International Business Machines Corporation Unsupervised incremental adaptation using maximum likelihood spectral transformation
US7236929B2 (en) * 2001-05-09 2007-06-26 Plantronics, Inc. Echo suppression and speech detection techniques for telephony applications
JP4240878B2 (ja) * 2001-12-13 2009-03-18 Yoichi Ando Speech recognition method and speech recognition apparatus
JP3885002B2 (ja) * 2002-06-28 2007-02-21 Canon Inc. Information processing apparatus and method
US7340397B2 (en) * 2003-03-03 2008-03-04 International Business Machines Corporation Speech recognition optimization tool
US20050228673A1 (en) * 2004-03-30 2005-10-13 Nefian Ara V Techniques for separating and evaluating audio and video source data
DE102004049347A1 (de) * 2004-10-08 2006-04-20 Micronas Gmbh Circuit arrangement and method for audio signals containing speech
EP1854095A1 (en) * 2005-02-15 2007-11-14 BBN Technologies Corp. Speech analyzing system with adaptive noise codebook
US8219391B2 (en) * 2005-02-15 2012-07-10 Raytheon Bbn Technologies Corp. Speech analyzing system with speech codebook
CN1936829B (zh) * 2005-09-23 2010-05-26 Hong Fu Jin Precision Industry (Shenzhen) Co., Ltd. Sound output system and method
US7729911B2 (en) * 2005-09-27 2010-06-01 General Motors Llc Speech recognition method and system
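KR100751923B1 (ko) * 2005-11-11 2007-08-24 Korea University Industry-Academic Cooperation Foundation Energy feature compensation method and apparatus for speech recognition robust to noisy environments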
US20070118372A1 (en) * 2005-11-23 2007-05-24 General Electric Company System and method for generating closed captions
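CN100580774C (zh) * 2005-12-30 2010-01-13 Acer Inc. Method and apparatus for eliminating recording surges
CN100389421C (zh) * 2006-04-20 2008-05-21 Beijing Institute of Technology Method for rapidly constructing a speech database for keyword spotting tasks
JP5038403B2 (ja) * 2007-03-16 2012-10-03 Panasonic Corporation Speech analysis apparatus, speech analysis method, speech analysis program, and system integrated circuit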
US8868417B2 (en) * 2007-06-15 2014-10-21 Alon Konchitsky Handset intelligibility enhancement system using adaptive filters and signal buffers
US9343079B2 (en) 2007-06-15 2016-05-17 Alon Konchitsky Receiver intelligibility enhancement system
US8190440B2 (en) * 2008-02-29 2012-05-29 Broadcom Corporation Sub-band codec with native voice activity detection
US8615397B2 (en) * 2008-04-04 2013-12-24 Intuit Inc. Identifying audio content using distorted target patterns
US8433564B2 (en) * 2009-07-02 2013-04-30 Alon Konchitsky Method for wind noise reduction
DE102009059138A1 (de) 2009-12-19 2010-07-29 Daimler Ag Method and test system for testing a speech recognition system
US20120143604A1 (en) * 2010-12-07 2012-06-07 Rita Singh Method for Restoring Spectral Components in Denoised Speech Signals
US9143571B2 (en) * 2011-03-04 2015-09-22 Qualcomm Incorporated Method and apparatus for identifying mobile devices in similar sound environment
WO2013097239A1 (en) * 2011-12-31 2013-07-04 Thomson Licensing Method and device for presenting content
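CN103514878A (zh) * 2012-06-27 2014-01-15 Beijing Baidu Netcom Science and Technology Co., Ltd. Acoustic modeling method and apparatus, and speech recognition method and apparatus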
US9293148B2 (en) 2012-10-11 2016-03-22 International Business Machines Corporation Reducing noise in a shared media session
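CN103903616B (zh) * 2012-12-25 2017-12-29 Lenovo (Beijing) Co., Ltd. Information processing method and electronic device
CN103544953B (zh) * 2013-10-24 2016-01-20 Harbin Normal University Sound environment recognition method based on minimum-statistics features of background noise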
US9466310B2 (en) * 2013-12-20 2016-10-11 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Compensating for identifiable background content in a speech recognition device
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9299347B1 (en) * 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
CN108028048B (zh) 2015-06-30 2022-06-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method and device for associating noise and for analysis
US9786270B2 (en) 2015-07-09 2017-10-10 Google Inc. Generating acoustic models
CN105405447B (zh) * 2015-10-27 2019-05-24 Aerospace Life-Support Industries, Ltd. Method for masking breathing noise during voice transmission
US10229672B1 (en) 2015-12-31 2019-03-12 Google Llc Training acoustic models using connectionist temporal classification
US20180018973A1 (en) 2016-07-15 2018-01-18 Google Inc. Speaker verification
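CN106816154A (zh) * 2016-12-15 2017-06-09 Beijing Qingsun Technology Co., Ltd. Speech recognition control method for a lamp with intelligent noise reduction
KR102410820B1 (ko) * 2017-08-14 2022-06-20 Samsung Electronics Co., Ltd. Recognition method and apparatus using a neural network, and method and apparatus for training the neural network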
US10706840B2 (en) 2017-08-18 2020-07-07 Google Llc Encoder-decoder models for sequence to sequence mapping
US10762905B2 (en) * 2018-07-31 2020-09-01 Cirrus Logic, Inc. Speaker verification
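CN109256144B (zh) * 2018-11-20 2022-09-06 University of Science and Technology of China Speech enhancement method based on ensemble learning and noise-aware training
CN109841227B (zh) * 2019-03-11 2020-10-02 Nanjing University of Posts and Telecommunications Background noise removal method based on learning compensation
CN110808030B (zh) * 2019-11-22 2021-01-22 Gree Electric Appliances, Inc. of Zhuhai Voice wake-up method, system, storage medium, and electronic device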
EP3862782A1 (en) * 2020-02-04 2021-08-11 Infineon Technologies AG Apparatus and method for correcting an input signal
CN113409770B (zh) * 2020-11-25 2025-04-15 Tencent Technology (Shenzhen) Co., Ltd. Pronunciation feature processing method, apparatus, server, and medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4933973A (en) * 1988-02-29 1990-06-12 Itt Corporation Apparatus and methods for the selective addition of noise to templates employed in automatic speech recognition systems

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5095503A (en) 1989-12-20 1992-03-10 Motorola, Inc. Cellular telephone controller with synthesized voice feedback for directory number confirmation and call status
BR9206143A (pt) 1991-06-11 1995-01-03 Qualcomm Inc Methods for speech signal compression and for variable-rate coding of input frames, apparatus for compressing an acoustic signal into variable-rate data, variable-rate code-excited linear prediction (CELP) coder, and decoder for decoding coded frames
US5307405A (en) 1992-09-25 1994-04-26 Qualcomm Incorporated Network echo canceller
DE4340679A1 (de) 1993-11-30 1995-06-01 Detecon Gmbh Speech module for the acoustic playback of SAPI 3 messages (Short Message Service) in a mobile station (MS)
US5845246A (en) * 1995-02-28 1998-12-01 Voice Control Systems, Inc. Method for reducing database requirements for speech recognition systems
IL116103A0 (en) 1995-11-23 1996-01-31 Wireless Links International L Mobile data terminals with text to speech capability
US5778342A (en) * 1996-02-01 1998-07-07 Dspc Israel Ltd. Pattern recognition system and method
US5950123A (en) 1996-08-26 1999-09-07 Telefonaktiebolaget L M Cellular telephone network support of audible information delivery to visually impaired subscribers

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4933973A (en) * 1988-02-29 1990-06-12 Itt Corporation Apparatus and methods for the selective addition of noise to templates employed in automatic speech recognition systems

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Computer Speech and language, Vol.9 No.4, pp.289-307 Robust speech recognition in additive and convolutional noise using parallel model combination (1995.) *
Speech Communication, Vol.16 No.3, pp.261-291 Speech recognition in noisy environments: A survey (1995.) . *

Also Published As

Publication number Publication date
DE69916255T2 (de) 2005-04-14
WO1999040571A1 (en) 1999-08-12
KR20010040669A (ko) 2001-05-15
CN1296607A (zh) 2001-05-23
US6381569B1 (en) 2002-04-30
CN1228761C (zh) 2005-11-23
EP1058925A1 (en) 2000-12-13
DE69916255D1 (de) 2004-05-13
US20010001141A1 (en) 2001-05-10
HK1035600A1 (en) 2001-11-30
EP1058925B1 (en) 2004-04-07
AU2577499A (en) 1999-08-23
JP2002502993A (ja) 2002-01-29
JP4750271B2 (ja) 2011-08-17

Similar Documents

Publication Publication Date Title
KR100574594B1 (ko) System and method for noise-compensated speech recognition
EP1301922B1 (en) System and method for voice recognition with a plurality of voice recognition engines
US8731921B2 (en) Frame erasure concealment technique for a bitstream-based feature extractor
US6950796B2 (en) Speech recognition by dynamical noise model adaptation
US20100036659A1 (en) Noise-Reduction Processing of Speech Signals
US6182036B1 (en) Method of extracting features in a voice recognition system
KR20030014332A (ko) Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
US5579432A (en) Discriminating between stationary and non-stationary signals
KR100216018B1 (ko) Method and apparatus for encoding and decoding background sounds
Kuo et al. Speech classification embedded in adaptive codebook search for low bit-rate CELP coding
US20030046069A1 (en) Noise reduction system and method
US20050143987A1 (en) Bitstream-based feature extraction method for a front-end speech recognizer
Sorin et al. The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation
Kim et al. Performance improvement of a bitstream-based front-end for wireless speech recognition in adverse environments
JPH11327593A (ja) Speech recognition system
Wang et al. Improved Mandarin speech recognition by lattice rescoring with enhanced tone models
EP1521243A1 (en) Speech coding method applying noise reduction by modifying the codebook gain
HK1035600B (en) System and method for noise-compensated speech recognition
Hayashi et al. A subtractive-type speech enhancement using the perceptual frequency-weighting function
JP2001343984A (ja) Sound/silence determination apparatus, speech decoding apparatus, and speech decoding method
Hernando On the use of filter-bank energies driven from the autocorrelation sequence for noisy speech recognition.
JP2003513320A (ja) Elimination of noise from a speech signal
Hernando Pericás On the use of filter bank energies driven from the osa sequence for noisy speech recognition
JPH07239696A (ja) Speech recognition apparatus
HK1013881B (en) Discriminating between stationary and non-stationary signals

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20000804

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20040106

Comment text: Request for Examination of Application

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20051031

Patent event code: PE09021S01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20060224

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20060421

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20060424

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
PR1001 Payment of annual fee

Payment date: 20090331

Start annual number: 4

End annual number: 4

PR1001 Payment of annual fee

Payment date: 20100331

Start annual number: 5

End annual number: 5

PR1001 Payment of annual fee

Payment date: 20110330

Start annual number: 6

End annual number: 6

FPAY Annual fee payment

Payment date: 20120329

Year of fee payment: 7

PR1001 Payment of annual fee

Payment date: 20120329

Start annual number: 7

End annual number: 7

FPAY Annual fee payment

Payment date: 20130329

Year of fee payment: 8

PR1001 Payment of annual fee

Payment date: 20130329

Start annual number: 8

End annual number: 8

FPAY Annual fee payment

Payment date: 20160330

Year of fee payment: 11

PR1001 Payment of annual fee

Payment date: 20160330

Start annual number: 11

End annual number: 11

FPAY Annual fee payment

Payment date: 20170330

Year of fee payment: 12

PR1001 Payment of annual fee

Payment date: 20170330

Start annual number: 12

End annual number: 12

FPAY Annual fee payment

Payment date: 20180329

Year of fee payment: 13

PR1001 Payment of annual fee

Payment date: 20180329

Start annual number: 13

End annual number: 13

EXPY Expiration of term
PC1801 Expiration of term

Termination date: 20190803

Termination category: Expiration of duration