CN107045874A - A kind of Non-linear Speech Enhancement Method based on correlation - Google Patents
A kind of Non-linear Speech Enhancement Method based on correlation Download PDFInfo
- Publication number
- CN107045874A CN107045874A CN201610079921.0A CN201610079921A CN107045874A CN 107045874 A CN107045874 A CN 107045874A CN 201610079921 A CN201610079921 A CN 201610079921A CN 107045874 A CN107045874 A CN 107045874A
- Authority
- CN
- China
- Prior art keywords
- speech
- gain
- noise
- frequency spectrum
- noisy speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Abstract
The invention discloses a kind of Non-linear Speech Enhancement Method based on correlation, including:Step a:The noisy speech data pre-processed to voiceWith estimation noise dataFast Fourier Transform (FFT) is carried out, the frequency spectrum of noisy speech frame is obtainedWith the frequency spectrum of estimation noise frame;Step b:When fading gain is calculated noise, obtains fading gain,;Step c:The correlation calculations of noisy speech and noise, calculate the frequency spectrum for obtaining the noisy speech frameWith the frequency spectrum of estimation noise frameCross-correlation function,;Step d:Nonlinear attenuation gain is calculated, and obtains the nonlinear attenuation gain;Step e:Speech enhan-cement processing, by the fading gainWith nonlinear attenuation gain described in the step dCollective effect is in the frequency spectrum of the noisy speech frameOn, to realize the processing of speech enhan-cement, obtain pure speech signal spec-trum.The technical scheme that the present invention is provided can more thoroughly remove the noise contribution in Noisy Speech Signal, and can remove noise according to different application scene and ensure that the aspect of speech quality two carries out flexibly compromise application.
Description
Technical field
The invention belongs to speech communication technical field, more particularly to speech enhancement technique.
Background technology
In voice communication course, the voice that transmitting terminal session person sends can be introduced noise by from surrounding environment where it
Interference, such as handle official business room conditioning sound, the sound that the fan such as host computer is rotated etc..The voice received in receiving terminal is
No longer it is the clean speech of transmitting terminal session person, and is the introduction of the noisy speech of all kinds of noise jammings, makes receiving terminal listener
The speech recognition reduction of uppick.But in many occasions, especially during videoconference, speech recognition and voice
Quality need to preferably be ensured, therefore it is necessary to carry out enhancing to voice, and speech enhancement technique of coming in is rapidly developed.
It is to subtract the method for thought based on spectrum to have a class in existing sound enhancement method, and such method is by noisy speech frequency spectrum
The noise spectrum obtained with estimation makees difference and obtains enhanced speech signal spec-trum, and its algorithm complex is low, and amount of calculation is smaller, but
It has the disadvantage more serious using noise in speech signal residual after spectrum-subtraction progress speech enhan-cement.Equations of The Second Kind is based on adaptive filter
The speech enhancement technique of ripple algorithm, such algorithm can not fundamentally overcome the contradiction between convergence rate and steady-state error, and
Algorithm effect in the relatively low environment of signal to noise ratio is poor.3rd class is the speech enhan-cement side based on matrix decomposition or model learning class
Method, such method is more excellent for the removal effect of non-stationary burst noise, but such method be related to matrix decomposition, model training
Learn scheduling theory implementation process complicated, amount of calculation is higher by much than preceding two classes method.It is of the invention to disclose one kind newly based on above-mentioned
The speech enhancement technique of type is with place of overcoming the deficiencies in the prior art.
The content of the invention
Present invention aims at a kind of Non-linear Speech Enhancement Method based on correlation is provided, ensureing voice quality
Under the premise of solve the problems such as noise remove is not net, and preferable speech enhan-cement effect can be obtained under the relatively low scene of signal to noise ratio.
In order to realize foregoing invention purpose, technical scheme is as follows:A kind of non-linear voice based on correlation
Enhancement Method, mainly includes:Step a:The noisy speech data pre-processed to voiceWith estimation noise dataCarry out fast
Fast Fourier transformation, obtains the frequency spectrum of noisy speech frameWith the frequency spectrum of estimation noise frame;Step b:Noise when decays
Gain is calculated, and obtains fading gain,;
Step c:The correlation calculations of noisy speech and noise, calculate the frequency spectrum for obtaining noisy speech frameWith the frequency of estimation noise frame
SpectrumCross-correlation function,;
Step d:Nonlinear attenuation gain is calculated, and obtains nonlinear attenuation gain;Step e:Speech enhan-cement processing, will decline
Subtract gainWith nonlinear attenuation gain in step dCollective effect is in the frequency spectrum of noisy speech frame
On, to realize the processing of speech enhan-cement, obtain pure speech signal spec-trum。
It is preferred that, step f is also included behind step e, for inverse fast Fourier transform, to speech signal spec-trum
Known inverse fast Fourier transform is carried out, signal is converted back into time domain from frequency domain:。
It is preferred that, step b further comprises the steps:Step b1:Calculate posteriori SNR,;Step b2:Calculate signal to noise ratio and update coefficient,, whereinFor former frame noisy speech data, parameterCan
Appropriate value is chosen according to concrete application scene;Step b3:Calculate prior weight,;Step b4:Calculate prior weight ratio,;Step b5:Calculated and most preferably declined using hypergeometric distribution correlation computations formula
Subtract gain;Step b6:Calculate fading gain lower limit;Step b7:Calculating obtains fading gain。
It is preferred that, parameter described in step b2Conventional span is [0.05,0.30].ParameterIt can take
0.25。
It is preferred that, step b5 optimized attenuation gains,
Wherein,For known gamma functions,,For with
Natural constantFor the exponential function at bottom,WithRespectively 0 rank and 1 rank Bessel functions.
It is preferred that, fading gain lower limit in step b6,
WhereinFor with natural constantFor the exponential function at bottom.
It is preferred that, fading gain,
WhereinFor weight coefficient, appropriate value can be chosen according to application scenarios, conventional span is [0.60,0.90].
It is preferred that, nonlinear attenuation gainPass throughCalculating is obtained, its
In,Smaller value is taken to operate for conventional, i.e.,
。
It is preferred that, nonlinear attenuation gainCalculating process be:。
The present invention provides a kind of Non-linear Speech Enhancement Method based on correlation, can under the premise of relatively low amount of calculation,
Overcome the shortcomings of art methods, making an uproar in Noisy Speech Signal can be more thoroughly removed using technical scheme
Sound composition, and can remove noise according to different application scene and ensure that the aspect of speech quality two is flexibly compromised, and this hair
The technical scheme of bright offer can obtain preferable speech enhan-cement effect under the relatively low scene of signal to noise ratio, and amount of calculation is compared to existing
There is algorithm to have no larger lifting, it is easy to accomplish and applied to the scene of real-time Transmission.
Brief description of the drawings
Fig. 1 is the flow chart of Non-linear Speech Enhancement Method in the specific embodiment of the invention.
Embodiment
In order to make the purpose , technical scheme and advantage of the present invention be clearer, it is right below in conjunction with drawings and Examples
The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and
It is not used in the restriction present invention.
The present invention general principle be:A kind of Non-linear Speech Enhancement Method.This method utilizes Noisy Speech Signal and ginseng
The frequency domain information for examining noise signal calculates signal to noise ratio, and the fading gain value of each frequency range is obtained using the signal-to-noise ratio computation;Then again
The correlation of Noisy Speech Signal and reference noise signal is calculated, correlation carries out nonlinear adjustment to fading gain value accordingly,
Finally the fading gain after adjustment is multiplied with noisy speech frequency spectrum and obtains the clean speech for removing noise jamming.
Fig. 1 is the flow chart of Non-linear Speech Enhancement Method in the specific embodiment of the invention.With reference to Fig. 1, to this
The specific implementation step of inventive method is further described.
The present invention focuses on known noisy speechWith known estimation noiseOn the premise of, carry out at speech enhan-cement
Reason, and for noiseEstimation procedure do not described.
Step 1 voice is pre-processed:
To noisy speechWith estimation noiseSub-frame processing is carried out, band to be reinforced is obtained by adding window sub-frame processing and made an uproar
Speech dataWith estimation noise data:
Wherein,For window function, Hamming is used in this embodiment(Hamming)Window;Adding window sub-frame processing is digital letter
Commonly used and necessary process in number processing, processing limited quantity numeral letter can be read in data signal operation processing unit every time
Number, data signal is subjected to framing by the quantity that processing can be read every time using window function.
Step 2 Fast Fourier Transform (FFT):
The noisy speech obtained to windowing processWith estimation noiseKnown Fast Fourier Transform (FFT) is carried out, is obtained
The frequency spectrum of noisy speech frameWith the frequency spectrum of estimation noise frame:
WhereinFor known Fast Fourier Transform (FFT).
When fading gain is calculated step 3 noise:
In this step to noise when fading gain estimation use for reference Y. Ephraim and D. Malah in " Y. Ephraim
and D. Malah, "Speech enhancement using minimum mean-square error short-time
spectral amplitude estimator". IEEE Transactions on Acoustics, Speech and
The classical calculation proposed in Signal Processing, vol. ASSP-32, no. 6, pp. 1109-1121,1984. "
Method, and its algorithm is improved with simplifying, the calculating process is only sketched, details refer to above-mentioned original text:
1)Posteriori SNR is calculated first:
2)Then calculate signal to noise ratio and update coefficient:
WhereinFor former frame noisy speech data, parameterAppropriate value can be chosen according to concrete application scene, commonly used
Span be [0.05,0.30], in present exampleIt is chosen for 0.25;
3)Calculate prior weight:
1 is utilized in this step)It is middle to calculate obtained posteriori SNRWith 2)It is middle to calculate obtained renewal coefficient
The prior weight that weighted sum is estimated;
4)Utilize 3)Calculate what is obtainedCalculate prior weight ratio:
5)Optimized attenuation gain is calculated using hypergeometric distribution correlation computations formula:
Wherein,For known gamma functions,,
For the exponential function using natural constant the bottom of as,WithRespectively 0 rank and 1 rank Bessel functions, relevant Bessel correlations can
With reference to William J.Lentz " Bessel functions in Mie scattering calculations using
continued fractions”;
6)Calculate fading gain lower limit:
WhereinWith 5)Described in, it is with natural constantFor the exponential function at bottom, fading gain lower limitFor one on the occasion of for the optimized attenuation gain to trying to achieveLimited, if, then optimized attenuation gain is illustratedValue is too small, so that enhanced voice
Can be containing " music noise " phenomenon that rises and falls, therefore must utilizeIt is rightValue be any limitation as, refer to 7)
Middle calculating process;
7)Calculating obtains fading gain:
WhereinLarger Value Operations are taken for conventional, i.e.,
UtilizeIt is rightIt is any limitation as, and withIt is weighted summation and asks flat again
Side obtains fading gain;WhereinFor weight coefficient, appropriate value, conventional value can be chosen according to application scenarios
Scope is selection in [0.60,0.90], present exampleFor 0.75.
The correlation calculations of step 4 noisy speech and noise
Noisy Speech Signal is first calculated in the stepPower spectrumWith estimation noisePower spectrum, it is related in the stepSubscript represents the real part of phasor,Subscript represents the imaginary part of phasor:
Then Noisy Speech Signal is calculatedWith estimation noiseCrosspower spectrum:
Then Noisy Speech Signal is calculatedWith estimation noiseCross-correlation function:
。
It is contemplated that utilizing Noisy Speech SignalWith estimation noiseCorrelation strengthen speech enhan-cement effect
Really, noisy speech power spectrum is utilized in frequency domain in this step, estimating noise power spectrumAnd the two is mutual
Power spectrumCalculating obtains Noisy Speech SignalWith estimation noiseCross-correlation function.In language
In sound processing procedure, Noisy Speech SignalAnd estimated noise signalIt is considered as the gaussian variable of statistical iteration, cross-correlation
Function can characterize the degree of correlation of noisy speech and estimation noise in different frequency range, and cross-correlation function value is larger, illustrates band
Make an uproar voice and estimation Noise Correlation it is stronger, then it represents that less without phonetic element or phonetic element in noisy speech, noise contribution
Accounting is higher;Cross-correlation function value is smaller, illustrates that noisy speech and estimation Noise Correlation are weaker, then it represents that contain in noisy speech
There is more phonetic element, it is shown weaker correlation with estimation noise.
The gain of step 5 nonlinear attenuation is calculated
CalculateAverage:
In above formulaFor less thanThe integer of higher limit, according to different application scenarios,The selection of value can be different, such as
The occasion of low frequency is concentrated in noise,Smaller value can be chosen, and in the unknown occasion of noise characteristic,Can choose withHigher limit
The value of formed objects.Such as, sample rate is that frame length elects 10ms as in 16kHz, adding window preprocessing process, then the data point in a frame
Number is 160, carries out Fast Fourier Transform (FFT) using the folded mode of frame and tries to achieve cross-correlation function, thenSpan is
0 ..., 159, can be by if known noise integrated distribution is in low-frequency range 0Hz-4kHzValue is elected 79 as and asked for。
According to the correlation average of frequency range of interestDetermine whether to apply nonlinear attenuation gain to present frame,
ContrastWith relevance thresholdIf,, illustrate in frequency-domain segment of interest, when
Preceding number of speech frames is according to estimating that noise data correlation is smaller, and voice accounts for main component, to ensure that speech quality does not suffer damage,
Nonlinear attenuation gain is not applied then, by nonlinear attenuation gainIt is set to 1.0;If, illustrate in frequency-domain segment of interest, current speech frame data with estimation noise data correlation compared with
Greatly, noise contribution is accounted for mainly, for phonetic element enhancing effect is better achieved, and need to be applied nonlinear attenuation gain and further be removed
Noise, nonlinear attenuation gainPass throughCalculating is obtained, whereinFor
Conventional takes smaller value to operate, i.e.,
UseIt is for guarantee, it is ensured that nonlinear attenuation gainFor noisy speech
Play and decay and the effect of non-amplified.
To sum up,Calculating process is:
WhereinAppropriate value can be selected according to concrete application scene, the selection of the numerical value, which also can be considered, is removing noise
Compromise is made between interference and guarantee speech quality, ifChoose higher value, then according to above formula,It is set to
1.0 probability increase, the declines of nonlinear attenuation gain, noise can be while ensureing that speech quality is injury-free
Residual;IfChoose smaller value,The probability for being set to 1.0 reduces, and the effect of nonlinear attenuation gain increases
By force, noise jamming can be preferably removed, if butThe value of selection is too small, and nonlinear attenuation gain effect is excessive, meeting
Damage is caused to speech quality.ThereforeAppropriate value, conventional span need to be chosen according to concrete application scene
For [0.70,0.80], in present exampleValue is 0.735.
The processing of step 6 speech enhan-cement
Step 3 is calculated to obtained fading gainObtained nonlinear attenuation gain is calculated with step 5
Collective effect is in noisy speech frequency spectrumOn realize speech enhan-cement handle:
Noisy Speech Signal frequency spectrumIn the fading gain obtained using signal-to-noise ratio computationOn the basis of effect, enter one
Step carries out nonlinear attenuation gain process, preferably removes noise using this nonlinear attenuation gain, obtains purer language
Sound。
Step 7 inverse fast Fourier transform
To handling obtained speech signal spec-trum by enhancingKnown inverse fast Fourier transform is carried out, by signal
Time domain is converted back from frequency domain:, obtain enhanced time domain speech signal, wherein
For known inverse fast Fourier transform.
The upper only presently preferred embodiments of the present invention, is not intended to limit the invention, all spirit in the present invention
With any modifications, equivalent substitutions and improvements made within principle etc., it should be included in the scope of the protection.
Claims (10)
1. a kind of Non-linear Speech Enhancement Method based on correlation, it is characterised in that methods described mainly includes:
Step a:The noisy speech data pre-processed to voiceWith estimation noise dataCarry out Fast Fourier Transform (FFT),
Obtain the frequency spectrum of noisy speech frameWith the frequency spectrum of estimation noise frame;
Step b:When fading gain is calculated noise, obtains fading gain,;
Step c:The correlation calculations of noisy speech and noise, calculate the frequency spectrum for obtaining the noisy speech frameWith estimation noise
The frequency spectrum of frameCross-correlation function,;
Step d:Nonlinear attenuation gain is calculated, and obtains the nonlinear attenuation gain;
Step e:Speech enhan-cement processing, by the fading gainWith nonlinear attenuation gain described in the step dCollective effect is in the frequency spectrum of the noisy speech frameOn, to realize the processing of speech enhan-cement, obtain pure
Speech signal spec-trum。
2. according to the method described in claim 1, it is characterised in that also include step f behind the step e, for quick Fu
In leaf inverse transformation, to the speech signal spec-trumKnown inverse fast Fourier transform is carried out, signal is turned from frequency domain
Gain time domain:。
3. method according to claim 2, it is characterised in that the step b further comprises the steps:Step b1:
Calculate posteriori SNR,;Step b2:Calculate signal to noise ratio and update coefficient
,, whereinFor former frame noisy speech data, parameterCan
Appropriate value is chosen according to concrete application scene;Step b3:Calculate prior weight,;Step b4:Calculate prior weight ratio,;Step b5:Calculated and most preferably declined using hypergeometric distribution correlation computations formula
Subtract gain;Step b6:Calculate fading gain lower limit;Step b7:Calculating obtains the decay and increased
Benefit。
4. method according to claim 3, it is characterised in that parameter described in the step b2Conventional span
For [0.05,0.30].
5. method according to claim 4, it is characterised in that the parameter0.25 can be taken.
6. the method according to claim 4 or 5, it is characterised in that optimized attenuation gain described in the step b5, wherein,For known gamma functions,,For with natural constantFor the exponential function at bottom,With
Respectively 0 rank and 1 rank Bessel functions.
7. method according to claim 6, it is characterised in that fading gain lower limit described in the step b6, whereinFor with natural constantFor the index letter at bottom
Number.
8. method according to claim 7, it is characterised in that the fading gain, whereinFor weighting system
Number, can choose appropriate value according to application scenarios, and conventional span is [0.60,0.90].
9. method according to claim 8, it is characterised in that the nonlinear attenuation gainPass throughCalculating is obtained, wherein,Smaller value is taken to operate for conventional, i.e.,
。
10. method according to claim 9, it is characterised in that the nonlinear attenuation gainCalculating
Cheng Wei:。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610079921.0A CN107045874B (en) | 2016-02-05 | 2016-02-05 | Non-linear voice enhancement method based on correlation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610079921.0A CN107045874B (en) | 2016-02-05 | 2016-02-05 | Non-linear voice enhancement method based on correlation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107045874A true CN107045874A (en) | 2017-08-15 |
CN107045874B CN107045874B (en) | 2021-03-02 |
Family
ID=59542672
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610079921.0A Active CN107045874B (en) | 2016-02-05 | 2016-02-05 | Non-linear voice enhancement method based on correlation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107045874B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109410975A (en) * | 2018-10-31 | 2019-03-01 | 歌尔科技有限公司 | A kind of voice de-noising method, equipment and storage medium |
CN110335618A (en) * | 2019-06-06 | 2019-10-15 | 福建星网智慧软件有限公司 | A kind of method and computer equipment improving non-linear inhibition |
WO2019227589A1 (en) * | 2018-05-29 | 2019-12-05 | 平安科技(深圳)有限公司 | Speech enhancement method and apparatus, computer device, and storage medium |
CN114047010A (en) * | 2021-11-10 | 2022-02-15 | 宜昌达瑞机电科技有限公司 | Vibration and noise detection method and device for automobile air conditioner and computer equipment |
Citations (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001352594A (en) * | 2000-06-07 | 2001-12-21 | Sony Corp | Method and device for reducing wind sound |
US20040024596A1 (en) * | 2002-07-31 | 2004-02-05 | Carney Laurel H. | Noise reduction system |
CN1493073A (en) * | 2000-12-28 | 2004-04-28 | �ձ�������ʽ���� | Noise removing method and device |
US20050025263A1 (en) * | 2003-07-23 | 2005-02-03 | Gin-Der Wu | Nonlinear overlap method for time scaling |
US20080114593A1 (en) * | 2006-11-15 | 2008-05-15 | Microsoft Corporation | Noise suppressor for speech recognition |
CN101192411A (en) * | 2007-12-27 | 2008-06-04 | 北京中星微电子有限公司 | Large distance microphone array noise cancellation method and noise cancellation system |
KR20080075362A (en) * | 2007-02-12 | 2008-08-18 | 인하대학교 산학협력단 | A method for obtaining an estimated speech signal in noisy environments |
CN101430882A (en) * | 2008-12-22 | 2009-05-13 | 北京中星微电子有限公司 | Method and apparatus for restraining wind noise |
CN101510426A (en) * | 2009-03-23 | 2009-08-19 | 北京中星微电子有限公司 | Method and system for eliminating noise |
US20090254340A1 (en) * | 2008-04-07 | 2009-10-08 | Cambridge Silicon Radio Limited | Noise Reduction |
CN101582264A (en) * | 2009-06-12 | 2009-11-18 | 瑞声声学科技(深圳)有限公司 | Method and voice collecting system for speech enhancement |
CN101606191A (en) * | 2005-06-28 | 2009-12-16 | 微软公司 | Use many sensings voice of voice status model to strengthen |
CN101673550A (en) * | 2008-09-09 | 2010-03-17 | 联芯科技有限公司 | Spectral gain calculating method and device and noise suppression system |
CN101894563A (en) * | 2010-07-15 | 2010-11-24 | 瑞声声学科技(深圳)有限公司 | Voice enhancing method |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN102065190A (en) * | 2010-12-31 | 2011-05-18 | 杭州华三通信技术有限公司 | Method and device for eliminating echo |
CN102347028A (en) * | 2011-07-14 | 2012-02-08 | 瑞声声学科技(深圳)有限公司 | Double-microphone speech enhancer and speech enhancement method thereof |
CN102947685A (en) * | 2010-06-17 | 2013-02-27 | 杜比实验室特许公司 | Method and apparatus for reducing the effect of environmental noise on listeners |
US20130343558A1 (en) * | 2012-06-26 | 2013-12-26 | Parrot | Method for denoising an acoustic signal for a multi-microphone audio device operating in a noisy environment |
CN103730126A (en) * | 2012-10-16 | 2014-04-16 | 联芯科技有限公司 | Noise suppression method and noise suppressor |
CN104637491A (en) * | 2013-11-07 | 2015-05-20 | 大陆汽车系统公司 | Externally estimated SNR based modifiers for internal MMSE calculations |
CN104685903A (en) * | 2012-10-09 | 2015-06-03 | 皇家飞利浦有限公司 | Method and apparatus for audio interference estimation |
KR20150078831A (en) * | 2013-12-31 | 2015-07-08 | 서울대학교산학협력단 | Method and system forspeech enhancement using non negative matrix factorization and basis matrix update |
US20150318902A1 (en) * | 2012-11-27 | 2015-11-05 | Nec Corporation | Signal processing apparatus, signal processing method, and signal processing program |
-
2016
- 2016-02-05 CN CN201610079921.0A patent/CN107045874B/en active Active
Patent Citations (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2001352594A (en) * | 2000-06-07 | 2001-12-21 | Sony Corp | Method and device for reducing wind sound |
CN1493073A (en) * | 2000-12-28 | 2004-04-28 | �ձ�������ʽ���� | Noise removing method and device |
US20040024596A1 (en) * | 2002-07-31 | 2004-02-05 | Carney Laurel H. | Noise reduction system |
US20050025263A1 (en) * | 2003-07-23 | 2005-02-03 | Gin-Der Wu | Nonlinear overlap method for time scaling |
CN101606191A (en) * | 2005-06-28 | 2009-12-16 | 微软公司 | Use many sensings voice of voice status model to strengthen |
US20080114593A1 (en) * | 2006-11-15 | 2008-05-15 | Microsoft Corporation | Noise suppressor for speech recognition |
KR20080075362A (en) * | 2007-02-12 | 2008-08-18 | 인하대학교 산학협력단 | A method for obtaining an estimated speech signal in noisy environments |
CN101192411A (en) * | 2007-12-27 | 2008-06-04 | 北京中星微电子有限公司 | Large distance microphone array noise cancellation method and noise cancellation system |
US20090254340A1 (en) * | 2008-04-07 | 2009-10-08 | Cambridge Silicon Radio Limited | Noise Reduction |
CN101673550A (en) * | 2008-09-09 | 2010-03-17 | 联芯科技有限公司 | Spectral gain calculating method and device and noise suppression system |
CN101430882A (en) * | 2008-12-22 | 2009-05-13 | 北京中星微电子有限公司 | Method and apparatus for restraining wind noise |
CN101510426A (en) * | 2009-03-23 | 2009-08-19 | 北京中星微电子有限公司 | Method and system for eliminating noise |
CN101582264A (en) * | 2009-06-12 | 2009-11-18 | 瑞声声学科技(深圳)有限公司 | Method and voice collecting system for speech enhancement |
CN102947685A (en) * | 2010-06-17 | 2013-02-27 | 杜比实验室特许公司 | Method and apparatus for reducing the effect of environmental noise on listeners |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN101894563A (en) * | 2010-07-15 | 2010-11-24 | 瑞声声学科技(深圳)有限公司 | Voice enhancing method |
CN102065190A (en) * | 2010-12-31 | 2011-05-18 | 杭州华三通信技术有限公司 | Method and device for eliminating echo |
CN102347028A (en) * | 2011-07-14 | 2012-02-08 | 瑞声声学科技(深圳)有限公司 | Double-microphone speech enhancer and speech enhancement method thereof |
US20130343558A1 (en) * | 2012-06-26 | 2013-12-26 | Parrot | Method for denoising an acoustic signal for a multi-microphone audio device operating in a noisy environment |
CN104685903A (en) * | 2012-10-09 | 2015-06-03 | 皇家飞利浦有限公司 | Method and apparatus for audio interference estimation |
CN103730126A (en) * | 2012-10-16 | 2014-04-16 | 联芯科技有限公司 | Noise suppression method and noise suppressor |
US20150318902A1 (en) * | 2012-11-27 | 2015-11-05 | Nec Corporation | Signal processing apparatus, signal processing method, and signal processing program |
CN104637491A (en) * | 2013-11-07 | 2015-05-20 | 大陆汽车系统公司 | Externally estimated SNR based modifiers for internal MMSE calculations |
KR20150078831A (en) * | 2013-12-31 | 2015-07-08 | 서울대학교산학협력단 | Method and system forspeech enhancement using non negative matrix factorization and basis matrix update |
Non-Patent Citations (5)
Title |
---|
BALAZS FODOR ET AL: "MMSE speech enhancement under speech presence uncertainty assuming (generalized) gamma speech priors throughout", 《IWAENC 2012; INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT》 * |
MARTIN KRAWCZYK-BECKER ET AL: "MMSE-optimal combination of wiener filtering and harmonic model based speech enhancement in a general framework", 《2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS》 * |
Y EPHRAIM ET AL: "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator", 《IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING》 * |
Y. EPHRAIM ET AL: "Speech enhancement using a minimum mean square error short-time spectral amplitude estimator", 《IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING》 * |
张鑫琪等: "改进的最小均方误差语音增强算法的研究", 《声学技术》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2019227589A1 (en) * | 2018-05-29 | 2019-12-05 | 平安科技(深圳)有限公司 | Speech enhancement method and apparatus, computer device, and storage medium |
CN109410975A (en) * | 2018-10-31 | 2019-03-01 | 歌尔科技有限公司 | A kind of voice de-noising method, equipment and storage medium |
CN109410975B (en) * | 2018-10-31 | 2021-03-09 | 歌尔科技有限公司 | Voice noise reduction method, device and storage medium |
CN110335618A (en) * | 2019-06-06 | 2019-10-15 | 福建星网智慧软件有限公司 | A kind of method and computer equipment improving non-linear inhibition |
CN110335618B (en) * | 2019-06-06 | 2021-07-30 | 福建星网智慧软件有限公司 | Method for improving nonlinear echo suppression and computer equipment |
CN114047010A (en) * | 2021-11-10 | 2022-02-15 | 宜昌达瑞机电科技有限公司 | Vibration and noise detection method and device for automobile air conditioner and computer equipment |
Also Published As
Publication number | Publication date |
---|---|
CN107045874B (en) | 2021-03-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Martin et al. | Speech enhancement in the DFT domain using Laplacian speech priors | |
CN111653288B (en) | Target person voice enhancement method based on conditional variation self-encoder | |
CN104067339B (en) | Noise-suppressing device | |
CN108831499A (en) | Utilize the sound enhancement method of voice existing probability | |
US9520138B2 (en) | Adaptive modulation filtering for spectral feature enhancement | |
CN103632677A (en) | Method and device for processing voice signal with noise, and server | |
CN107045874A (en) | A kind of Non-linear Speech Enhancement Method based on correlation | |
CN105489226A (en) | Wiener filtering speech enhancement method for multi-taper spectrum estimation of pickup | |
Islam et al. | Speech enhancement based on student $ t $ modeling of Teager energy operated perceptual wavelet packet coefficients and a custom thresholding function | |
Wang et al. | Joint noise and mask aware training for DNN-based speech enhancement with sub-band features | |
CN103594093A (en) | Method for enhancing voice based on signal to noise ratio soft masking | |
CN110808057A (en) | Voice enhancement method for generating confrontation network based on constraint naive | |
Yang et al. | A noise reduction method based on LMS adaptive filter of audio signals | |
Ram et al. | Performance analysis of adaptive variational mode decomposition approach for speech enhancement | |
Garg | Speech enhancement using long short term memory with trained speech features and adaptive wiener filter | |
CN108711432A (en) | A kind of sound enhancement method of the perception gain function of single microphone | |
Rao et al. | Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration | |
Hong et al. | Independent component analysis based single channel speech enhancement | |
Pfeifenberger et al. | Blind source extraction based on a direction-dependent a-priori SNR. | |
Chen | Noise reduction of bird calls based on a combination of spectral subtraction, Wiener filtering, and Kalman filtering | |
Shankar et al. | Noise dependent super gaussian-coherence based dual microphone speech enhancement for hearing aid application using smartphone | |
Venkateswarlu et al. | Speech Enhancement in terms of Objective Quality Measures Based on Wavelet Hybrid Thresholding the Multitaper Spectrum | |
Saadoune et al. | MCRA noise estimation for KLT-VRE-based speech enhancement | |
Lu et al. | Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition | |
Rao et al. | Speech enhancement using combination of digital audio effects with Kalman filter |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |