ES2570961T3 - Estimación de varianza de ruido para mejorar la calidad de voz - Google Patents
Estimación de varianza de ruido para mejorar la calidad de vozInfo
- Publication number
- ES2570961T3 ES2570961T3 ES08726859T ES08726859T ES2570961T3 ES 2570961 T3 ES2570961 T3 ES 2570961T3 ES 08726859 T ES08726859 T ES 08726859T ES 08726859 T ES08726859 T ES 08726859T ES 2570961 T3 ES2570961 T3 ES 2570961T3
- Authority
- ES
- Spain
- Prior art keywords
- audio signal
- noise components
- amplitude
- estimate
- estimation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 abstract 9
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/12—Speech classification or search using dynamic programming techniques, e.g. dynamic time warping [DTW]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Circuit For Audible Band Transducer (AREA)
- Monitoring And Testing Of Transmission In General (AREA)
- Noise Elimination (AREA)
- Telephone Function (AREA)
Abstract
Un procedimiento para obtener una estimación de varianza en componentes de ruido de una señal de audio formada por componentes de voz y de ruido, que comprende: obtener dicha estimación de varianza en componentes de ruido de una señal de audio a partir del promedio de estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, en el que las estimaciones de la amplitud de las componentes de ruido de la señal de audio que tienen valores mayores que un umbral se excluyen de o se ponderan con un valor bajo en el promedio de las estimaciones previas de la amplitud de las componentes de ruido de la señal de audio, y en el que cada estimación de la amplitud de las componentes de ruido de la señal de audio es una función de una estimación de varianza en las componentes de ruido de la señal de audio, una estimación de varianza en las componentes de voz de la señal de audio y la amplitud de la señal de audio.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US91896407P | 2007-03-19 | 2007-03-19 | |
PCT/US2008/003436 WO2008115435A1 (en) | 2007-03-19 | 2008-03-14 | Noise variance estimator for speech enhancement |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2570961T3 true ES2570961T3 (es) | 2016-05-23 |
Family
ID=39468801
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES08726859T Active ES2570961T3 (es) | 2007-03-19 | 2008-03-14 | Estimación de varianza de ruido para mejorar la calidad de voz |
Country Status (8)
Country | Link |
---|---|
US (1) | US8280731B2 (es) |
EP (2) | EP3070714B1 (es) |
JP (1) | JP5186510B2 (es) |
KR (1) | KR101141033B1 (es) |
CN (1) | CN101647061B (es) |
ES (1) | ES2570961T3 (es) |
TW (1) | TWI420509B (es) |
WO (1) | WO2008115435A1 (es) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US8521530B1 (en) * | 2008-06-30 | 2013-08-27 | Audience, Inc. | System and method for enhancing a monaural audio signal |
KR101581885B1 (ko) * | 2009-08-26 | 2016-01-04 | 삼성전자주식회사 | 복소 스펙트럼 잡음 제거 장치 및 방법 |
US20110178800A1 (en) * | 2010-01-19 | 2011-07-21 | Lloyd Watts | Distortion Measurement for Noise Suppression System |
US8798290B1 (en) | 2010-04-21 | 2014-08-05 | Audience, Inc. | Systems and methods for adaptive signal equalization |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
BR122021003884B1 (pt) | 2010-08-12 | 2021-11-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. | Reamostrar sinais de saída de codecs de áudio com base em qmf |
JP5643686B2 (ja) * | 2011-03-11 | 2014-12-17 | 株式会社東芝 | 音声判別装置、音声判別方法および音声判別プログラム |
US9173025B2 (en) | 2012-02-08 | 2015-10-27 | Dolby Laboratories Licensing Corporation | Combined suppression of noise, echo, and out-of-location signals |
US9373341B2 (en) | 2012-03-23 | 2016-06-21 | Dolby Laboratories Licensing Corporation | Method and system for bias corrected speech level determination |
EP2828854B1 (en) | 2012-03-23 | 2016-03-16 | Dolby Laboratories Licensing Corporation | Hierarchical active voice detection |
JP6182895B2 (ja) * | 2012-05-01 | 2017-08-23 | 株式会社リコー | 処理装置、処理方法、プログラム及び処理システム |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
US10306389B2 (en) | 2013-03-13 | 2019-05-28 | Kopin Corporation | Head wearable acoustic system with noise canceling microphone geometry apparatuses and methods |
US9257952B2 (en) | 2013-03-13 | 2016-02-09 | Kopin Corporation | Apparatuses and methods for multi-channel signal compression during desired voice activity detection |
US9536540B2 (en) | 2013-07-19 | 2017-01-03 | Knowles Electronics, Llc | Speech signal separation and synthesis based on auditory scene analysis and speech modeling |
CN103559887B (zh) * | 2013-11-04 | 2016-08-17 | 深港产学研基地 | 用于语音增强系统的背景噪声估计方法 |
JP6361156B2 (ja) * | 2014-02-10 | 2018-07-25 | 沖電気工業株式会社 | 雑音推定装置、方法及びプログラム |
CN103824563A (zh) * | 2014-02-21 | 2014-05-28 | 深圳市微纳集成电路与系统应用研究院 | 一种基于模块复用的助听器去噪装置和方法 |
CN103854662B (zh) * | 2014-03-04 | 2017-03-15 | 中央军委装备发展部第六十三研究所 | 基于多域联合估计的自适应语音检测方法 |
DE112015003945T5 (de) | 2014-08-28 | 2017-05-11 | Knowles Electronics, Llc | Mehrquellen-Rauschunterdrückung |
RU2673390C1 (ru) * | 2014-12-12 | 2018-11-26 | Хуавэй Текнолоджиз Ко., Лтд. | Устройство обработки сигналов для усиления речевого компонента в многоканальном звуковом сигнале |
CN105810214B (zh) * | 2014-12-31 | 2019-11-05 | 展讯通信(上海)有限公司 | 语音激活检测方法及装置 |
DK3118851T3 (da) * | 2015-07-01 | 2021-02-22 | Oticon As | Forbedring af støjende tale baseret på statistiske tale- og støjmodeller |
US11631421B2 (en) * | 2015-10-18 | 2023-04-18 | Solos Technology Limited | Apparatuses and methods for enhanced speech recognition in variable environments |
US20190137549A1 (en) * | 2017-11-03 | 2019-05-09 | Velodyne Lidar, Inc. | Systems and methods for multi-tier centroid calculation |
EP3573058B1 (en) * | 2018-05-23 | 2021-02-24 | Harman Becker Automotive Systems GmbH | Dry sound and ambient sound separation |
CN110164467B (zh) * | 2018-12-18 | 2022-11-25 | 腾讯科技(深圳)有限公司 | 语音降噪的方法和装置、计算设备和计算机可读存储介质 |
CN110136738A (zh) * | 2019-06-13 | 2019-08-16 | 苏州思必驰信息科技有限公司 | 噪声估计方法及装置 |
CN111613239B (zh) * | 2020-05-29 | 2023-09-05 | 北京达佳互联信息技术有限公司 | 音频去噪方法和装置、服务器、存储介质 |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5706395A (en) * | 1995-04-19 | 1998-01-06 | Texas Instruments Incorporated | Adaptive weiner filtering using a dynamic suppression factor |
SE506034C2 (sv) * | 1996-02-01 | 1997-11-03 | Ericsson Telefon Ab L M | Förfarande och anordning för förbättring av parametrar representerande brusigt tal |
US6415253B1 (en) * | 1998-02-20 | 2002-07-02 | Meta-C Corporation | Method and apparatus for enhancing noise-corrupted speech |
US6453285B1 (en) * | 1998-08-21 | 2002-09-17 | Polycom, Inc. | Speech activity detector for use in noise reduction system, and methods therefor |
US6289309B1 (en) | 1998-12-16 | 2001-09-11 | Sarnoff Corporation | Noise spectrum tracking for speech enhancement |
US6910011B1 (en) * | 1999-08-16 | 2005-06-21 | Haman Becker Automotive Systems - Wavemakers, Inc. | Noisy acoustic signal enhancement |
US6757395B1 (en) * | 2000-01-12 | 2004-06-29 | Sonic Innovations, Inc. | Noise reduction apparatus and method |
US6804640B1 (en) * | 2000-02-29 | 2004-10-12 | Nuance Communications | Signal noise reduction using magnitude-domain spectral subtraction |
JP3342864B2 (ja) * | 2000-09-13 | 2002-11-11 | 株式会社エントロピーソフトウェア研究所 | 音声の類似度検出方法及びその検出値を用いた音声認識方法、並びに、振動波の類似度検出方法及びその検出値を用いた機械の異常判定方法、並びに、画像の類似度検出方法及びその検出値を用いた画像認識方法、並びに、立体の類似度検出方法及びその検出値を用いた立体認識方法、並びに、動画像の類似度検出方法及びその検出値を用いた動画像認識方法 |
JP4195267B2 (ja) * | 2002-03-14 | 2008-12-10 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識装置、その音声認識方法及びプログラム |
US20030187637A1 (en) * | 2002-03-29 | 2003-10-02 | At&T | Automatic feature compensation based on decomposition of speech and noise |
JP4989967B2 (ja) * | 2003-07-11 | 2012-08-01 | コクレア リミテッド | ノイズ低減のための方法および装置 |
US7133825B2 (en) * | 2003-11-28 | 2006-11-07 | Skyworks Solutions, Inc. | Computationally efficient background noise suppressor for speech coding and speech recognition |
CA2454296A1 (en) * | 2003-12-29 | 2005-06-29 | Nokia Corporation | Method and device for speech enhancement in the presence of background noise |
US7492889B2 (en) | 2004-04-23 | 2009-02-17 | Acoustic Technologies, Inc. | Noise suppression based on bark band wiener filtering and modified doblinger noise estimate |
US7454332B2 (en) * | 2004-06-15 | 2008-11-18 | Microsoft Corporation | Gain constrained noise suppression |
US7742914B2 (en) * | 2005-03-07 | 2010-06-22 | Daniel A. Kosek | Audio spectral noise reduction method and apparatus |
EP1760696B1 (en) * | 2005-09-03 | 2016-02-03 | GN ReSound A/S | Method and apparatus for improved estimation of non-stationary noise for speech enhancement |
CN101802909B (zh) * | 2007-09-12 | 2013-07-10 | 杜比实验室特许公司 | 通过噪声水平估计调整进行的语音增强 |
-
2008
- 2008-03-14 US US12/531,690 patent/US8280731B2/en active Active
- 2008-03-14 KR KR1020097019499A patent/KR101141033B1/ko active IP Right Grant
- 2008-03-14 JP JP2009553646A patent/JP5186510B2/ja active Active
- 2008-03-14 WO PCT/US2008/003436 patent/WO2008115435A1/en active Application Filing
- 2008-03-14 ES ES08726859T patent/ES2570961T3/es active Active
- 2008-03-14 EP EP16151957.4A patent/EP3070714B1/en active Active
- 2008-03-14 TW TW097109065A patent/TWI420509B/zh active
- 2008-03-14 EP EP08726859.5A patent/EP2137728B1/en active Active
- 2008-03-14 CN CN2008800088867A patent/CN101647061B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
CN101647061A (zh) | 2010-02-10 |
EP2137728A1 (en) | 2009-12-30 |
US20100100386A1 (en) | 2010-04-22 |
KR20090122251A (ko) | 2009-11-26 |
JP5186510B2 (ja) | 2013-04-17 |
EP3070714A1 (en) | 2016-09-21 |
JP2010521704A (ja) | 2010-06-24 |
TWI420509B (zh) | 2013-12-21 |
WO2008115435A1 (en) | 2008-09-25 |
CN101647061B (zh) | 2012-04-11 |
TW200844978A (en) | 2008-11-16 |
KR101141033B1 (ko) | 2012-05-03 |
EP3070714B1 (en) | 2018-03-14 |
US8280731B2 (en) | 2012-10-02 |
EP2137728B1 (en) | 2016-03-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2570961T3 (es) | Estimación de varianza de ruido para mejorar la calidad de voz | |
LTPA2018510I1 (lt) | Dioksa-biciklo[3.2.1]oktan-2,3,4-triolio dariniai | |
AR094279A1 (es) | Agregado de ruido de confort para modelar el ruido de fondo a bajas tasas de bits | |
WO2012140510A3 (en) | Devices and methods for monitoring intracranial pressure and additional intracranial hemodynamic parameters | |
BRPI1008710A2 (pt) | películas nanoporosas e método de fabricação das mesmas. | |
ZA201903140B (en) | Estimation of background noise in audio signals | |
MX2018000552A (es) | Velocidad controlada de ruptura de espuma en limpiadores de superficies duras. | |
WO2014115115A3 (en) | Determining apnea-hypopnia index ahi from speech | |
BR112016009563A2 (pt) | Extensão de largura de banda de áudio através da inserção de ruído temporal préformado no domínio de frequência | |
DK3719801T3 (da) | Estimering af baggrundsstøj i audiosignaler | |
EP2738763A3 (en) | Speech enhancement apparatus and speech enhancement method | |
EA201791911A1 (ru) | Применение трихотецен-превращающей алкогольдегидрогеназы, способ превращения трихотеценов и трихотецен-превращающая добавка | |
WO2013132342A3 (en) | Voice signal enhancement | |
MX2016004528A (es) | Estimacion de forma de ganancia para rastreo mejorado de caracteristicas temporales de banda-alta. | |
AR101320A1 (es) | Método para estimar ruido en una señal de audio, estimador de ruido, codificador de audio, decodificador de audio, y sistema para transmitir señales de audio | |
WO2014107732A3 (en) | Metal hydride alloy | |
CL2016003121A1 (es) | Método y aparato para reconstruir un componente de ruido de una señal de voz/audio | |
WO2013132348A3 (en) | Formant based speech reconstruction from noisy signals | |
DE112009002571T8 (de) | Variable Rauschmaskierung während Phasen wesentlicher Stille | |
DK3118851T3 (da) | Forbedring af støjende tale baseret på statistiske tale- og støjmodeller | |
WO2016100747A3 (en) | Method and apparatus for estimating waveform onset time | |
MY160723A (en) | An anticoagulant | |
Abebe | Modification of Melamine Sponge for Efficient Absorption Oil from Oily Waste Water | |
WO2010017478A3 (en) | Pak1 agonists and methods of use | |
FR2926465B1 (fr) | Composition anti-virale notamment le sida |