ES2570961T3 - Estimación de varianza de ruido para mejorar la calidad de voz - Google Patents

Estimación de varianza de ruido para mejorar la calidad de voz

Info

Publication number
ES2570961T3
ES2570961T3 ES08726859T ES08726859T ES2570961T3 ES 2570961 T3 ES2570961 T3 ES 2570961T3 ES 08726859 T ES08726859 T ES 08726859T ES 08726859 T ES08726859 T ES 08726859T ES 2570961 T3 ES2570961 T3 ES 2570961T3
Authority
ES
Spain
Prior art keywords
audio signal
noise components
amplitude
estimate
estimation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES08726859T
Other languages
English (en)
Inventor
Rongshan Yu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dolby Laboratories Licensing Corp
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corp filed Critical Dolby Laboratories Licensing Corp
Application granted granted Critical
Publication of ES2570961T3 publication Critical patent/ES2570961T3/es
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/12Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Monitoring And Testing Of Transmission In General (AREA)
  • Noise Elimination (AREA)
  • Telephone Function (AREA)

Abstract

Un procedimiento para obtener una estimación de varianza en componentes de ruido de una señal de audio formada por componentes de voz y de ruido, que comprende: obtener dicha estimación de varianza en componentes de ruido de una señal de audio a partir del promedio de estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, en el que las estimaciones de la amplitud de las componentes de ruido de la señal de audio que tienen valores mayores que un umbral se excluyen de o se ponderan con un valor bajo en el promedio de las estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, y en el que cada estimación de la amplitud de las componentes de ruido de la señal de audio es una función de una estimación de varianza en las componentes de ruido de la señal de audio, una estimación de varianza en las componentes de voz de la señal de audio y la amplitud de la señal de audio.
ES08726859T 2007-03-19 2008-03-14 Estimación de varianza de ruido para mejorar la calidad de voz Active ES2570961T3 (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US91896407P 2007-03-19 2007-03-19
PCT/US2008/003436 WO2008115435A1 (en) 2007-03-19 2008-03-14 Noise variance estimator for speech enhancement

Publications (1)

Publication Number Publication Date
ES2570961T3 true ES2570961T3 (es) 2016-05-23

Family

ID=39468801

Family Applications (1)

Application Number Title Priority Date Filing Date
ES08726859T Active ES2570961T3 (es) 2007-03-19 2008-03-14 Estimación de varianza de ruido para mejorar la calidad de voz

Country Status (8)

Country Link
US (1) US8280731B2 (es)
EP (2) EP3070714B1 (es)
JP (1) JP5186510B2 (es)
KR (1) KR101141033B1 (es)
CN (1) CN101647061B (es)
ES (1) ES2570961T3 (es)
TW (1) TWI420509B (es)
WO (1) WO2008115435A1 (es)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8949120B1 (en) 2006-05-25 2015-02-03 Audience, Inc. Adaptive noise cancelation
US8521530B1 (en) * 2008-06-30 2013-08-27 Audience, Inc. System and method for enhancing a monaural audio signal
KR101581885B1 (ko) * 2009-08-26 2016-01-04 삼성전자주식회사 복소 스펙트럼 잡음 제거 장치 및 방법
US20110178800A1 (en) * 2010-01-19 2011-07-21 Lloyd Watts Distortion Measurement for Noise Suppression System
US8798290B1 (en) 2010-04-21 2014-08-05 Audience, Inc. Systems and methods for adaptive signal equalization
US9558755B1 (en) 2010-05-20 2017-01-31 Knowles Electronics, Llc Noise suppression assisted automatic speech recognition
BR122021003884B1 (pt) 2010-08-12 2021-11-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. Reamostrar sinais de saída de codecs de áudio com base em qmf
JP5643686B2 (ja) * 2011-03-11 2014-12-17 株式会社東芝 音声判別装置、音声判別方法および音声判別プログラム
US9173025B2 (en) 2012-02-08 2015-10-27 Dolby Laboratories Licensing Corporation Combined suppression of noise, echo, and out-of-location signals
US9373341B2 (en) 2012-03-23 2016-06-21 Dolby Laboratories Licensing Corporation Method and system for bias corrected speech level determination
EP2828854B1 (en) 2012-03-23 2016-03-16 Dolby Laboratories Licensing Corporation Hierarchical active voice detection
JP6182895B2 (ja) * 2012-05-01 2017-08-23 株式会社リコー 処理装置、処理方法、プログラム及び処理システム
US9640194B1 (en) 2012-10-04 2017-05-02 Knowles Electronics, Llc Noise suppression for speech processing based on machine-learning mask estimation
US10306389B2 (en) 2013-03-13 2019-05-28 Kopin Corporation Head wearable acoustic system with noise canceling microphone geometry apparatuses and methods
US9257952B2 (en) 2013-03-13 2016-02-09 Kopin Corporation Apparatuses and methods for multi-channel signal compression during desired voice activity detection
US9536540B2 (en) 2013-07-19 2017-01-03 Knowles Electronics, Llc Speech signal separation and synthesis based on auditory scene analysis and speech modeling
CN103559887B (zh) * 2013-11-04 2016-08-17 深港产学研基地 用于语音增强系统的背景噪声估计方法
JP6361156B2 (ja) * 2014-02-10 2018-07-25 沖電気工業株式会社 雑音推定装置、方法及びプログラム
CN103824563A (zh) * 2014-02-21 2014-05-28 深圳市微纳集成电路与系统应用研究院 一种基于模块复用的助听器去噪装置和方法
CN103854662B (zh) * 2014-03-04 2017-03-15 中央军委装备发展部第六十三研究所 基于多域联合估计的自适应语音检测方法
DE112015003945T5 (de) 2014-08-28 2017-05-11 Knowles Electronics, Llc Mehrquellen-Rauschunterdrückung
RU2673390C1 (ru) * 2014-12-12 2018-11-26 Хуавэй Текнолоджиз Ко., Лтд. Устройство обработки сигналов для усиления речевого компонента в многоканальном звуковом сигнале
CN105810214B (zh) * 2014-12-31 2019-11-05 展讯通信(上海)有限公司 语音激活检测方法及装置
DK3118851T3 (da) * 2015-07-01 2021-02-22 Oticon As Forbedring af støjende tale baseret på statistiske tale- og støjmodeller
US11631421B2 (en) * 2015-10-18 2023-04-18 Solos Technology Limited Apparatuses and methods for enhanced speech recognition in variable environments
US20190137549A1 (en) * 2017-11-03 2019-05-09 Velodyne Lidar, Inc. Systems and methods for multi-tier centroid calculation
EP3573058B1 (en) * 2018-05-23 2021-02-24 Harman Becker Automotive Systems GmbH Dry sound and ambient sound separation
CN110164467B (zh) * 2018-12-18 2022-11-25 腾讯科技(深圳)有限公司 语音降噪的方法和装置、计算设备和计算机可读存储介质
CN110136738A (zh) * 2019-06-13 2019-08-16 苏州思必驰信息科技有限公司 噪声估计方法及装置
CN111613239B (zh) * 2020-05-29 2023-09-05 北京达佳互联信息技术有限公司 音频去噪方法和装置、服务器、存储介质

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5706395A (en) * 1995-04-19 1998-01-06 Texas Instruments Incorporated Adaptive weiner filtering using a dynamic suppression factor
SE506034C2 (sv) * 1996-02-01 1997-11-03 Ericsson Telefon Ab L M Förfarande och anordning för förbättring av parametrar representerande brusigt tal
US6415253B1 (en) * 1998-02-20 2002-07-02 Meta-C Corporation Method and apparatus for enhancing noise-corrupted speech
US6453285B1 (en) * 1998-08-21 2002-09-17 Polycom, Inc. Speech activity detector for use in noise reduction system, and methods therefor
US6289309B1 (en) 1998-12-16 2001-09-11 Sarnoff Corporation Noise spectrum tracking for speech enhancement
US6910011B1 (en) * 1999-08-16 2005-06-21 Haman Becker Automotive Systems - Wavemakers, Inc. Noisy acoustic signal enhancement
US6757395B1 (en) * 2000-01-12 2004-06-29 Sonic Innovations, Inc. Noise reduction apparatus and method
US6804640B1 (en) * 2000-02-29 2004-10-12 Nuance Communications Signal noise reduction using magnitude-domain spectral subtraction
JP3342864B2 (ja) * 2000-09-13 2002-11-11 株式会社エントロピーソフトウェア研究所 音声の類似度検出方法及びその検出値を用いた音声認識方法、並びに、振動波の類似度検出方法及びその検出値を用いた機械の異常判定方法、並びに、画像の類似度検出方法及びその検出値を用いた画像認識方法、並びに、立体の類似度検出方法及びその検出値を用いた立体認識方法、並びに、動画像の類似度検出方法及びその検出値を用いた動画像認識方法
JP4195267B2 (ja) * 2002-03-14 2008-12-10 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声認識装置、その音声認識方法及びプログラム
US20030187637A1 (en) * 2002-03-29 2003-10-02 At&T Automatic feature compensation based on decomposition of speech and noise
JP4989967B2 (ja) * 2003-07-11 2012-08-01 コクレア リミテッド ノイズ低減のための方法および装置
US7133825B2 (en) * 2003-11-28 2006-11-07 Skyworks Solutions, Inc. Computationally efficient background noise suppressor for speech coding and speech recognition
CA2454296A1 (en) * 2003-12-29 2005-06-29 Nokia Corporation Method and device for speech enhancement in the presence of background noise
US7492889B2 (en) 2004-04-23 2009-02-17 Acoustic Technologies, Inc. Noise suppression based on bark band wiener filtering and modified doblinger noise estimate
US7454332B2 (en) * 2004-06-15 2008-11-18 Microsoft Corporation Gain constrained noise suppression
US7742914B2 (en) * 2005-03-07 2010-06-22 Daniel A. Kosek Audio spectral noise reduction method and apparatus
EP1760696B1 (en) * 2005-09-03 2016-02-03 GN ReSound A/S Method and apparatus for improved estimation of non-stationary noise for speech enhancement
CN101802909B (zh) * 2007-09-12 2013-07-10 杜比实验室特许公司 通过噪声水平估计调整进行的语音增强

Also Published As

Publication number Publication date
CN101647061A (zh) 2010-02-10
EP2137728A1 (en) 2009-12-30
US20100100386A1 (en) 2010-04-22
KR20090122251A (ko) 2009-11-26
JP5186510B2 (ja) 2013-04-17
EP3070714A1 (en) 2016-09-21
JP2010521704A (ja) 2010-06-24
TWI420509B (zh) 2013-12-21
WO2008115435A1 (en) 2008-09-25
CN101647061B (zh) 2012-04-11
TW200844978A (en) 2008-11-16
KR101141033B1 (ko) 2012-05-03
EP3070714B1 (en) 2018-03-14
US8280731B2 (en) 2012-10-02
EP2137728B1 (en) 2016-03-09

Similar Documents

Publication Publication Date Title
ES2570961T3 (es) Estimación de varianza de ruido para mejorar la calidad de voz
LTPA2018510I1 (lt) Dioksa-biciklo[3.2.1]oktan-2,3,4-triolio dariniai
AR094279A1 (es) Agregado de ruido de confort para modelar el ruido de fondo a bajas tasas de bits
WO2012140510A3 (en) Devices and methods for monitoring intracranial pressure and additional intracranial hemodynamic parameters
BRPI1008710A2 (pt) películas nanoporosas e método de fabricação das mesmas.
ZA201903140B (en) Estimation of background noise in audio signals
MX2018000552A (es) Velocidad controlada de ruptura de espuma en limpiadores de superficies duras.
WO2014115115A3 (en) Determining apnea-hypopnia index ahi from speech
BR112016009563A2 (pt) Extensão de largura de banda de áudio através da inserção de ruído temporal préformado no domínio de frequência
DK3719801T3 (da) Estimering af baggrundsstøj i audiosignaler
EP2738763A3 (en) Speech enhancement apparatus and speech enhancement method
EA201791911A1 (ru) Применение трихотецен-превращающей алкогольдегидрогеназы, способ превращения трихотеценов и трихотецен-превращающая добавка
WO2013132342A3 (en) Voice signal enhancement
MX2016004528A (es) Estimacion de forma de ganancia para rastreo mejorado de caracteristicas temporales de banda-alta.
AR101320A1 (es) Método para estimar ruido en una señal de audio, estimador de ruido, codificador de audio, decodificador de audio, y sistema para transmitir señales de audio
WO2014107732A3 (en) Metal hydride alloy
CL2016003121A1 (es) Método y aparato para reconstruir un componente de ruido de una señal de voz/audio
WO2013132348A3 (en) Formant based speech reconstruction from noisy signals
DE112009002571T8 (de) Variable Rauschmaskierung während Phasen wesentlicher Stille
DK3118851T3 (da) Forbedring af støjende tale baseret på statistiske tale- og støjmodeller
WO2016100747A3 (en) Method and apparatus for estimating waveform onset time
MY160723A (en) An anticoagulant
Abebe Modification of Melamine Sponge for Efficient Absorption Oil from Oily Waste Water
WO2010017478A3 (en) Pak1 agonists and methods of use
FR2926465B1 (fr) Composition anti-virale notamment le sida