CN106796802A - 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 - Google Patents
用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 Download PDFInfo
- Publication number
- CN106796802A CN106796802A CN201580047301.2A CN201580047301A CN106796802A CN 106796802 A CN106796802 A CN 106796802A CN 201580047301 A CN201580047301 A CN 201580047301A CN 106796802 A CN106796802 A CN 106796802A
- Authority
- CN
- China
- Prior art keywords
- noise
- voice signal
- estimation
- amplitude
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims description 27
- 230000006870 function Effects 0.000 description 36
- 230000002238 attenuated effect Effects 0.000 description 6
- 238000004590 computer program Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000005236 sound signal Effects 0.000 description 5
- 238000004088 simulation Methods 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000005534 acoustic noise Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- ZLIBICFPKPWGIZ-UHFFFAOYSA-N pyrimethanil Chemical compound CC1=CC(C)=NC(NC=2C=CC=CC=2)=N1 ZLIBICFPKPWGIZ-UHFFFAOYSA-N 0.000 description 1
- 238000011946 reduction process Methods 0.000 description 1
- 239000010979 ruby Substances 0.000 description 1
- 229910001750 ruby Inorganic materials 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0264—Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02085—Periodic noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02163—Only one microphone
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Noise Elimination (AREA)
- Telephone Function (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
Claims (20)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201462045367P | 2014-09-03 | 2014-09-03 | |
US62/045,367 | 2014-09-03 | ||
US14/829,052 | 2015-08-18 | ||
US14/829,052 US9940945B2 (en) | 2014-09-03 | 2015-08-18 | Method and apparatus for eliminating music noise via a nonlinear attenuation/gain function |
PCT/US2015/046979 WO2016036562A1 (en) | 2014-09-03 | 2015-08-26 | Method and apparatus for eliminating music noise via a nonlinear attenuation/gain function |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106796802A true CN106796802A (zh) | 2017-05-31 |
CN106796802B CN106796802B (zh) | 2021-06-18 |
Family
ID=55403207
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201580047301.2A Expired - Fee Related CN106796802B (zh) | 2014-09-03 | 2015-08-26 | 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 |
Country Status (4)
Country | Link |
---|---|
US (1) | US9940945B2 (zh) |
EP (1) | EP3195313A1 (zh) |
CN (1) | CN106796802B (zh) |
WO (1) | WO2016036562A1 (zh) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101089952A (zh) * | 2006-06-15 | 2007-12-19 | 株式会社东芝 | 噪声抑制、平滑语音谱、提取语音特征、语音识别、及训练语音模型的方法和装置 |
CN101636648A (zh) * | 2007-03-19 | 2010-01-27 | 杜比实验室特许公司 | 采用感知模型的语音增强 |
CN101853665A (zh) * | 2009-06-18 | 2010-10-06 | 博石金(北京)信息技术有限公司 | 语音中噪声的消除方法 |
CN102402987A (zh) * | 2010-09-07 | 2012-04-04 | 索尼公司 | 噪声抑制装置、噪声抑制方法和程序 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020002455A1 (en) * | 1998-01-09 | 2002-01-03 | At&T Corporation | Core estimator and adaptive gains from signal to noise ratio in a hybrid speech enhancement system |
WO2005114656A1 (en) | 2004-05-14 | 2005-12-01 | Loquendo S.P.A. | Noise reduction for automatic speech recognition |
KR100821177B1 (ko) * | 2006-09-29 | 2008-04-14 | 한국전자통신연구원 | 통계적 모델에 기반한 선험적 음성 부재 확률 추정 방법 |
FR2908003B1 (fr) * | 2006-10-26 | 2009-04-03 | Parrot Sa | Procede de reduction de l'echo acoustique residuel apres supression d'echo dans un dispositif"mains libres" |
US8352257B2 (en) * | 2007-01-04 | 2013-01-08 | Qnx Software Systems Limited | Spectro-temporal varying approach for speech enhancement |
US8306817B2 (en) * | 2008-01-08 | 2012-11-06 | Microsoft Corporation | Speech recognition with non-linear noise reduction on Mel-frequency cepstra |
US8660281B2 (en) * | 2009-02-03 | 2014-02-25 | University Of Ottawa | Method and system for a multi-microphone noise reduction |
US9130643B2 (en) * | 2012-01-31 | 2015-09-08 | Broadcom Corporation | Systems and methods for enhancing audio quality of FM receivers |
JP6135106B2 (ja) * | 2012-11-29 | 2017-05-31 | 富士通株式会社 | 音声強調装置、音声強調方法及び音声強調用コンピュータプログラム |
US9437212B1 (en) * | 2013-12-16 | 2016-09-06 | Marvell International Ltd. | Systems and methods for suppressing noise in an audio signal for subbands in a frequency domain based on a closed-form solution |
-
2015
- 2015-08-18 US US14/829,052 patent/US9940945B2/en not_active Expired - Fee Related
- 2015-08-26 WO PCT/US2015/046979 patent/WO2016036562A1/en active Application Filing
- 2015-08-26 CN CN201580047301.2A patent/CN106796802B/zh not_active Expired - Fee Related
- 2015-08-26 EP EP15766266.9A patent/EP3195313A1/en not_active Withdrawn
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101089952A (zh) * | 2006-06-15 | 2007-12-19 | 株式会社东芝 | 噪声抑制、平滑语音谱、提取语音特征、语音识别、及训练语音模型的方法和装置 |
CN101636648A (zh) * | 2007-03-19 | 2010-01-27 | 杜比实验室特许公司 | 采用感知模型的语音增强 |
CN101853665A (zh) * | 2009-06-18 | 2010-10-06 | 博石金(北京)信息技术有限公司 | 语音中噪声的消除方法 |
CN102402987A (zh) * | 2010-09-07 | 2012-04-04 | 索尼公司 | 噪声抑制装置、噪声抑制方法和程序 |
Non-Patent Citations (4)
Title |
---|
ISRAEL COHEN等: "Speech enhancement for non-stationarynoise environments", 《SIGNAL PROCESSING》 * |
YARIV EPHRAIM等: "Speech Enhancement Using a- Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator", 《IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING》 * |
余建潮等: "改进增益函数的MMSE语音增强算法", 《计算机工程与设计》 * |
陈俊等: "基于MMSE 先验信噪比估计的语音增强", 《武汉大学学报(理学版)》 * |
Also Published As
Publication number | Publication date |
---|---|
US9940945B2 (en) | 2018-04-10 |
CN106796802B (zh) | 2021-06-18 |
EP3195313A1 (en) | 2017-07-26 |
WO2016036562A1 (en) | 2016-03-10 |
US20160064010A1 (en) | 2016-03-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10891931B2 (en) | Single-channel, binaural and multi-channel dereverberation | |
US10142763B2 (en) | Audio signal processing | |
JP6436934B2 (ja) | 動的閾値を用いた周波数帯域圧縮 | |
JP5722912B2 (ja) | 音響通信方法及び音響通信方法を実行させるためのプログラムを記録した記録媒体 | |
US10419849B2 (en) | FIR filter coefficient calculation for beam-forming filters | |
EP2551850A1 (en) | Methods and apparatuses for convolutive blind source separation | |
CN103039023A (zh) | 音频重放的自适应环境噪声补偿 | |
JP2008197284A (ja) | フィルタ係数算出装置、フィルタ係数算出方法、制御プログラム、コンピュータ読み取り可能な記録媒体、および、音声信号処理装置 | |
JP2012516646A (ja) | 臨界バンドに分けられたインパルス応答データから逆フィルタを決定する方法 | |
EP2980789A1 (en) | Apparatus and method for enhancing an audio signal, sound enhancing system | |
CN112712816A (zh) | 语音处理模型的训练方法和装置以及语音处理方法和装置 | |
Lehtonen et al. | Audibility of aliasing distortion in sawtooth signals and its implications for oscillator algorithm design | |
CN106796802A (zh) | 用于经由非线性衰减/增益函数来消除音乐噪声的方法和装置 | |
Favrot et al. | Perceptually motivated gain filter smoothing for noise suppression | |
Yang et al. | Environment-Aware Reconfigurable Noise Suppression | |
US11308975B2 (en) | Mixing device, mixing method, and non-transitory computer-readable recording medium | |
Axelson-Fisk | Caring More About EQ Than IQ: Automatic Equalizing of Audio Signals | |
Lopez et al. | Low Order IIR Parametric Loudspeaker Equalization, A Psychoacoustic Approach | |
CN115691522A (zh) | 一种重低音增强方法、系统、设备及存储介质 | |
KR100717154B1 (ko) | 멀티채널 오디오 신호의 다운믹스 방법 및 장치 | |
JP2015216492A (ja) | エコー抑圧装置 | |
CN117439844A (zh) | 一种带噪声消除的信道估计方法及系统 | |
Lakhdhar et al. | Iterative equalization of room transfer function using biquadratic filters | |
JP2005079781A (ja) | ブラインド信号分離方法、ブラインド信号分離プログラム及び記録媒体 | |
Jeon et al. | Complexity Reduction of Virtual Reverberation Filtering Based on Index-Based Convolution for Resource-Constrained Devices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200428 Address after: Singapore City Applicant after: Marvell Asia Pte. Ltd. Address before: Ford street, Grand Cayman, Cayman Islands Applicant before: Kaiwei international Co. Effective date of registration: 20200428 Address after: Ford street, Grand Cayman, Cayman Islands Applicant after: Kaiwei international Co. Address before: Hamilton, Bermuda Applicant before: Marvell International Ltd. Effective date of registration: 20200428 Address after: Hamilton, Bermuda Applicant after: Marvell International Ltd. Address before: Babado J San Mega Le Applicant before: MARVELL WORLD TRADE Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20210618 |