CN106796802B - 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 - Google Patents
用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 Download PDFInfo
- Publication number
- CN106796802B CN106796802B CN201580047301.2A CN201580047301A CN106796802B CN 106796802 B CN106796802 B CN 106796802B CN 201580047301 A CN201580047301 A CN 201580047301A CN 106796802 B CN106796802 B CN 106796802B
- Authority
- CN
- China
- Prior art keywords
- noise
- speech signal
- signal
- estimated
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims description 26
- 238000012886 linear function Methods 0.000 claims abstract description 12
- 230000006870 function Effects 0.000 description 29
- 230000002238 attenuated effect Effects 0.000 description 9
- 230000002829 reductive effect Effects 0.000 description 7
- 238000004590 computer program Methods 0.000 description 6
- 230000005236 sound signal Effects 0.000 description 6
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000011946 reduction process Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- ZLIBICFPKPWGIZ-UHFFFAOYSA-N pyrimethanil Chemical compound CC1=CC(C)=NC(NC=2C=CC=CC=2)=N1 ZLIBICFPKPWGIZ-UHFFFAOYSA-N 0.000 description 1
- 239000010979 ruby Substances 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0264—Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02085—Periodic noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02163—Only one microphone
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Noise Elimination (AREA)
- Telephone Function (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
Claims (20)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462045367P | 2014-09-03 | 2014-09-03 | |
US62/045,367 | 2014-09-03 | ||
US14/829,052 | 2015-08-18 | ||
US14/829,052 US9940945B2 (en) | 2014-09-03 | 2015-08-18 | Method and apparatus for eliminating music noise via a nonlinear attenuation/gain function |
PCT/US2015/046979 WO2016036562A1 (en) | 2014-09-03 | 2015-08-26 | Method and apparatus for eliminating music noise via a nonlinear attenuation/gain function |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106796802A CN106796802A (zh) | 2017-05-31 |
CN106796802B true CN106796802B (zh) | 2021-06-18 |
Family
ID=55403207
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201580047301.2A Expired - Fee Related CN106796802B (zh) | 2014-09-03 | 2015-08-26 | 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 |
Country Status (4)
Country | Link |
---|---|
US (1) | US9940945B2 (zh) |
EP (1) | EP3195313A1 (zh) |
CN (1) | CN106796802B (zh) |
WO (1) | WO2016036562A1 (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101089952A (zh) * | 2006-06-15 | 2007-12-19 | 株式会社东芝 | 噪声抑制、平滑语音谱、提取语音特征、语音识别、及训练语音模型的方法和装置 |
CN101636648A (zh) * | 2007-03-19 | 2010-01-27 | 杜比实验室特许公司 | 采用感知模型的语音增强 |
CN101853665A (zh) * | 2009-06-18 | 2010-10-06 | 博石金(北京)信息技术有限公司 | 语音中噪声的消除方法 |
CN102402987A (zh) * | 2010-09-07 | 2012-04-04 | 索尼公司 | 噪声抑制装置、噪声抑制方法和程序 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020002455A1 (en) * | 1998-01-09 | 2002-01-03 | At&T Corporation | Core estimator and adaptive gains from signal to noise ratio in a hybrid speech enhancement system |
WO2005114656A1 (en) | 2004-05-14 | 2005-12-01 | Loquendo S.P.A. | Noise reduction for automatic speech recognition |
KR100821177B1 (ko) * | 2006-09-29 | 2008-04-14 | 한국전자통신연구원 | 통계적 모델에 기반한 선험적 음성 부재 확률 추정 방법 |
FR2908003B1 (fr) * | 2006-10-26 | 2009-04-03 | Parrot Sa | Procede de reduction de l'echo acoustique residuel apres supression d'echo dans un dispositif"mains libres" |
US8352257B2 (en) * | 2007-01-04 | 2013-01-08 | Qnx Software Systems Limited | Spectro-temporal varying approach for speech enhancement |
US8306817B2 (en) * | 2008-01-08 | 2012-11-06 | Microsoft Corporation | Speech recognition with non-linear noise reduction on Mel-frequency cepstra |
WO2010091077A1 (en) * | 2009-02-03 | 2010-08-12 | University Of Ottawa | Method and system for a multi-microphone noise reduction |
US9130643B2 (en) * | 2012-01-31 | 2015-09-08 | Broadcom Corporation | Systems and methods for enhancing audio quality of FM receivers |
JP6135106B2 (ja) * | 2012-11-29 | 2017-05-31 | 富士通株式会社 | 音声強調装置、音声強調方法及び音声強調用コンピュータプログラム |
US9437212B1 (en) * | 2013-12-16 | 2016-09-06 | Marvell International Ltd. | Systems and methods for suppressing noise in an audio signal for subbands in a frequency domain based on a closed-form solution |
-
2015
- 2015-08-18 US US14/829,052 patent/US9940945B2/en not_active Expired - Fee Related
- 2015-08-26 WO PCT/US2015/046979 patent/WO2016036562A1/en active Application Filing
- 2015-08-26 CN CN201580047301.2A patent/CN106796802B/zh not_active Expired - Fee Related
- 2015-08-26 EP EP15766266.9A patent/EP3195313A1/en not_active Withdrawn
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101089952A (zh) * | 2006-06-15 | 2007-12-19 | 株式会社东芝 | 噪声抑制、平滑语音谱、提取语音特征、语音识别、及训练语音模型的方法和装置 |
CN101636648A (zh) * | 2007-03-19 | 2010-01-27 | 杜比实验室特许公司 | 采用感知模型的语音增强 |
CN101853665A (zh) * | 2009-06-18 | 2010-10-06 | 博石金(北京)信息技术有限公司 | 语音中噪声的消除方法 |
CN102402987A (zh) * | 2010-09-07 | 2012-04-04 | 索尼公司 | 噪声抑制装置、噪声抑制方法和程序 |
Non-Patent Citations (4)
Title |
---|
Speech enhancement for non-stationarynoise environments;Israel Cohen等;《Signal Processing》;20011231;第2403-2418页 * |
Speech Enhancement Using a- Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator;YARIV EPHRAIM等;《IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING》;19841231;第32卷(第16期);第1109-1121页 * |
基于MMSE 先验信噪比估计的语音增强;陈俊等;《武汉大学学报(理学版)》;20051031;第51卷(第5期);第638-642页 * |
改进增益函数的MMSE语音增强算法;余建潮等;《计算机工程与设计》;20101231;第31卷(第14期);第3287-3293页 * |
Also Published As
Publication number | Publication date |
---|---|
CN106796802A (zh) | 2017-05-31 |
US9940945B2 (en) | 2018-04-10 |
EP3195313A1 (en) | 2017-07-26 |
WO2016036562A1 (en) | 2016-03-10 |
US20160064010A1 (en) | 2016-03-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9503813B2 (en) | System and method for dynamic residual noise shaping | |
KR101732208B1 (ko) | 오디오 녹음의 적응적 동적 범위 강화 | |
CN104637491A (zh) | 用于内部mmse计算的基于外部估计的snr的修改器 | |
JP2016531332A (ja) | 音声処理システム | |
US9418677B2 (en) | Noise suppressing device, noise suppressing method, and a non-transitory computer-readable recording medium storing noise suppressing program | |
CN104637493A (zh) | 改进噪声抑制性能的语音概率存在修改器 | |
RU2662693C2 (ru) | Устройство декодирования, устройство кодирования, способ декодирования и способ кодирования | |
US10096329B2 (en) | Enhancing intelligibility of speech content in an audio signal | |
JP5136378B2 (ja) | 音響処理方法 | |
EP2828853B1 (en) | Method and system for bias corrected speech level determination | |
JP2023536104A (ja) | 機械学習を用いたノイズ削減 | |
US10638227B2 (en) | Processing of an audio input signal | |
CN106796802B (zh) | 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 | |
US20110211711A1 (en) | Factor setting device and noise suppression apparatus | |
CN112309418B (zh) | 一种抑制风噪声的方法及装置 | |
JP6282925B2 (ja) | 音声強調装置、音声強調方法及びプログラム | |
WO2006055354A2 (en) | Adaptive time-based noise suppression | |
JP6816277B2 (ja) | 信号処理装置、制御方法、プログラム及び記憶媒体 | |
KR20210086499A (ko) | 음원의 음량 표준화 방법 및 장치 | |
KR20210055630A (ko) | 청각보조기기의 청각 보상 방법 | |
CN115866486A (zh) | 音箱调音模拟方法、装置、设备及存储介质 | |
CN116057626A (zh) | 使用机器学习的降噪 | |
KR20230121316A (ko) | 음향 처리 장치 | |
CN114360572A (zh) | 语音去噪方法、装置、电子设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200428 Address after: Singapore City Applicant after: Marvell Asia Pte. Ltd. Address before: Ford street, Grand Cayman, Cayman Islands Applicant before: Kaiwei international Co. Effective date of registration: 20200428 Address after: Ford street, Grand Cayman, Cayman Islands Applicant after: Kaiwei international Co. Address before: Hamilton, Bermuda Applicant before: Marvell International Ltd. Effective date of registration: 20200428 Address after: Hamilton, Bermuda Applicant after: Marvell International Ltd. Address before: Babado J San Mega Le Applicant before: MARVELL WORLD TRADE Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20210618 |