CN101582264A - Method and voice collecting system for speech enhancement - Google Patents

Method and voice collecting system for speech enhancement Download PDF

Info

Publication number
CN101582264A
CN101582264A CNA2009101080587A CN200910108058A CN101582264A CN 101582264 A CN101582264 A CN 101582264A CN A2009101080587 A CNA2009101080587 A CN A2009101080587A CN 200910108058 A CN200910108058 A CN 200910108058A CN 101582264 A CN101582264 A CN 101582264A
Authority
CN
China
Prior art keywords
signal
voice
noise
frequency band
energy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2009101080587A
Other languages
Chinese (zh)
Inventor
叶利剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AAC Technologies Holdings Shenzhen Co Ltd
AAC Technologies Holdings Changzhou Co Ltd
Original Assignee
AAC Acoustic Technologies Shenzhen Co Ltd
AAC Acoustic Technologies Changzhou Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AAC Acoustic Technologies Shenzhen Co Ltd, AAC Acoustic Technologies Changzhou Co Ltd filed Critical AAC Acoustic Technologies Shenzhen Co Ltd
Priority to CNA2009101080587A priority Critical patent/CN101582264A/en
Publication of CN101582264A publication Critical patent/CN101582264A/en
Pending legal-status Critical Current

Links

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention provides a method and a voice collecting system for speech enhancement. The system comprises a microphone collecting device and a chip which integrates the method for the speech enhancement. The method for speech enhancement comprises the following steps of: carrying out frame division and pre-emphasis treatment on the noise-contained speech signal so as to be converted to frequency domains; dividing into a plurality of frequency bands; calculating the signal energies of each channel; calculating the estimation value of the posterior signal-to-noise-ratio and the prior signal-to-noise ratio of the current frame; judging whether updating the estimation value of the noise energy; calculating and processing the attenuation factor of each frequency band so as to obtain the speech signal with enhanced signal-to-noise ratio; and converting the processed speech signal into the time domain and outputting the speech signal.

Description

The voice acquisition system that method that voice strengthen and voice increase
[technical field]
The present invention relates to a kind of method of voice increase and the voice acquisition system of integrated this method.
[background technology]
Since the existence of a large amount of neighbourhood noises, voice acquisition system, and as the microphone microphone, the general signal to noise ratio (S/N ratio) of the voice signal that collects is not high enough.In order to collect the high voice signal of signal to noise ratio (S/N ratio), usually, gather voice signal in the certain limit by utilizing directional microphone, or the method for utilizing voice to strengthen promotes the signal to noise ratio (S/N ratio) of voice signal.
Calculated amount and storage space that existing relevant voice enhancement algorithm needs are all bigger than normal, and than higher, the area of the silicon that needs when making special chip is also bigger, thereby makes its cost also than higher, and noise reduction neither be very desirable to the requirement of hardware.
Therefore, be necessary to study the method that a kind of new voice strengthen, to reach good noise reduction.
[summary of the invention]
The technical matters that the present invention need solve provides a kind of method of voice increase of excellent noise reduction effect,
According to above-mentioned technical matters, designed the method that a kind of voice strengthen, it may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
(5), the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.
More excellent is, is treated to the decay gain factor that the Noisy Speech Signal of present frame be multiply by frequency band in the described step (7).
More excellent is to operate described step (8) and include:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
Another technical matters solved by the invention provides the voice acquisition system that a kind of voice increase, and it comprises the chip of the method that voice collection device, integrated as above predicate sound increase.
More excellent is that described chipset is formed in the voice collection device.
More excellent is that described voice collection device is the microphone microphone.
Compare with correlation technique, the method that voice of the present invention strengthen has realized real-time speech-enhancement system, voice collection device output be voice signal behind the direct noise reduction, and improved greatly noise alleviation, guaranteed the intelligibility of voice, especially to automobile noise, a class such as the street noise attenuating of additive noise stably is particularly outstanding.
[description of drawings]
Fig. 1 is the schematic flow sheet of the method for voice enhancing of the present invention;
[embodiment]
The invention will be further described below in conjunction with drawings and embodiments.
Main thought of the present invention is in the chip that a kind of voice enhancement algorithm is integrated in special use, and by the interface data transmission of this design chips with corresponding voice collection device, to form a real-time speech-enhancement system.Voice signal is directly handled by the voice enhancement algorithm in the chip by the voice collection device collection again, obtains the signal after signal to noise ratio (S/N ratio) strengthens, and output is for secondary use.
The voice acquisition system that voice of the present invention strengthen comprises: voice collection device, voice signal process chip, chip is integrated in this voice collection device.This voice collection device is the microphone microphone in the present embodiment, and the simulating signal of microphone collection also need be converted to digital signal, handles for chip.
The present invention is integrated in the method that the voice in the chip strengthen, and it may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal (this signal is a digital signal) carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
Estimated value to the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled, the Noisy Speech Signal of present frame be multiply by the decay gain factor of frequency band;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.Concrete steps (8) are:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
Be introduced below by specific embodiment, the sampling rate of the Noisy Speech Signal of the voice acquisition system input that these voice strengthen is 8kHZ again, and precision is 16.
At first, the Noisy Speech Signal in time domain being carried out the branch frame, is to be that unit is divided into some Noisy Speech Signals unit with the frame with Noisy Speech Signal.This Noisy Speech Signal unit is made up of sampled point, the present invention has chosen the sample frequency of 8KHz, needs according to the short-time spectrum analysis, frame length is generally set between 10~35ms, present embodiment divides frame with 32ms, and promptly a frame Noisy Speech Signal unit is provided with 256 sampled points, nature, any frame Noisy Speech Signal unit has certain frame length, and the frame length of arbitrary frame of the present invention is 256.
Voice signal behind the branch frame through a Hi-pass filter, is handled as pre-emphasis.Because the ground unrest in the voice signal is generally bigger at the low frequency part energy,, make noise reduction better so use can the decay deal of low frequency part of this Hi-pass filter.Its form is as follows:
H(z)=1-αz -1
The general value of α is between 0.75-0.95, and effect preferably can be obtained in α=0.9 here.
Because voice signal is stably in short-term, handle so can carry out the branch frame, but the branch frame can bring the discontinuous of frame signal boundary again and cause frequency to be revealed signal.So, to carry out Short Time Fourier Transform (STFT) for the voice signal behind minute frame.Short Time Fourier Transform can be understood as does Fourier transform again to the windowing of frame signal elder generation.The purpose of windowed function is exactly for when doing Short Time Fourier Transform, reduces the discontinuous of frame signal boundary and causes frequency to reveal, thereby reduce " blocking effect ".Here used a length to equal the Hamming window of 256 of frame lengths, it can effectively reduce the oscillation degree of Gibbs' effect.
Hamming window function is defined as follows:
win(n)={
0.54-0.46cos(2*π*n/M) 0≤n≤M-1
0 all the other n
}
Short Time Fourier Transform is as follows:
X ( m , k 1 ) = 2 M Σ n = 0 M - 1 win ( n - m ) × x ( m ) e - 2 πjk 1 n M 0≤k1≤M-1
Wherein, M=256 is the computational length of Fourier Tranform in short-term.M represents the m frame signal.
So just the Noisy Speech Signal s with present frame has transformed from the time domain to frequency domain.
The Noisy Speech Signal that transforms to behind the frequency domain comprises voice signal and noise signal, and this signal is that unit is divided into some frequency bands with the frame, and the voice signal at different frequency bands carries out different policing actions afterwards.
Below the following Noisy Speech Signal of 4kHz is carried out frequency band division, signal Processing is afterwards all carried out in each frequency band, so both can reduce computational complexity, can do different processing at different frequency bands again, obtains better voice and strengthens effect.
Signal among the present invention is divided into 23 frequency bands altogether, specifically sees Table 1.
23 frequency band division of table 1
Frequency band number Initial frequency (Hz) Cutoff frequency (Hz)
1 62.5 93.75
2 125 156.25
3 187.5 218.75
4 250 281.25
5 312.5 343.75
6 375 406.25
7 437.5 468.75
8 500 531.25
9 562.5 593.75
10 625 656.25
11 687.5 718.75
12 750 781.25
13 812.5 906.25
14 937.5 1062.5
15 1093.75 1250
16 1281.25 1468.75
17 1500 1718.75
18 1750 2000
19 2031.25 2312.5
20 2343.75 2687.5
21 2718.75 3125
22 3156.25 3687.5
23 3718.75 3968.75
The signal energy of each frequency band estimates, calculates and carries out smoothly with following formula:
E(m,k)=|X(m,k)| 2 0≤k≤N-1
Y(m,k)=αY(m-1,k)+(1-α)E(m,k) 0≤k≤N-1
Wherein, and Y (m represents the sequence number of present frame for m, the k) signal energy in each the frequency band interval of expression after level and smooth, and k represents the sequence number of current subband, and smoothing factor is represented in α=0.75, and N is the frequency band sum of choosing, promptly 23.
The signal energy in each the frequency band interval after level and smooth comprises speech energy and noise energy, here, obtain the initial estimated value of a noise energy earlier, the posteriority signal to noise ratio (S/N ratio) of removing to calculate each frequency band present frame according to the initial estimated value of signal energy and noise energy, and obtain the priori SNR estimation value of present frame by the priori snr computation of former frame.By the priori SNR estimation value that obtains present frame is adjudicated again, judges whether it is noise:
If judgement is "No", it promptly not noise, then the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori snr computation of former frame, recycle is adjudicated present frame, judge whether it is noise, whether the estimated value of noise energy needs to upgrade.
If judgement is noise for "Yes", according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band, continue next step;
Calculate the formula of the posteriority signal to noise ratio (S/N ratio) of current frame signal, as follows:
SNR post ( m , k ) = Y ( m , k ) V ( k )
Wherein V (k) represents the energy value of the noise signal of current estimation.
Based on the priori SNR estimation formula of Ephraim and Malah, the formula of the priori SNR estimation value of calculating present frame is as follows then:
Figure A20091010805800101
Among the present invention, the judgement of the noise energy of each frequency band has adopted the voice activation based on the priori signal to noise ratio (S/N ratio) to detect (VAD) method with renewal.Judge at first whether present frame is pure noise signal.
VAD ( m ) = Σ k = 1 N [ γ ( m , k ) ζ ( m , k ) 1 + ζ ( m , k ) - lg ( 1 + ζ ( m , k ) ) ]
Wherein γ (m, k)=min[SNR Post(m, k), 40],
Figure A20091010805800103
VAD (m) is judged, and carry out noise and upgrade, as follows:
V ( m , k ) = &mu;V ( m - 1 , k ) + ( 1 - &mu; ) E ( m , k ) VAD ( m ) < &eta; V ( m - 1 , k ) VAD ( m ) &GreaterEqual; &eta;
Wherein η is that noise upgrades the judgement factor, gets η=0.01 among the present invention.
μ is a smoothing factor, gets μ=0.9 here.
Next, calculating the decay gain factor of each frequency band.Based on the priori SNR estimation value that previous calculations draws, take different strategies.For the big frequency band of signal to noise ratio (S/N ratio), can think voice signal, adopt the method for spectral substraction to obtain decay factor, for the little frequency band of signal to noise ratio (S/N ratio), think noise signal, it is carried out to a certain degree decay.Its concrete formula is as follows.
Figure A20091010805800105
Wherein, a, b, c are respectively different constants.
Consider that noise mainly concentrates on lower frequency band,, get different a, b, c therefore for medium and low frequency section and high frequency.
Among the present invention for the frequency band of k≤18, i.e. the following signal of 2kHz, a=10, b=5.5, c=8
For the frequency band of k>18, i.e. the above signal of 2kHz, a=5, b=4.8, c=5
Obtain the gain factor of decaying, (m k), multiply by it, and what obtain is exactly voice signal after this frequency band signal to noise ratio (S/N ratio) strengthens with the Noisy Speech Signal X of each frequency band of present frame again.
S ^ ( k ) = q ( k ) * X ( k ) 0≤k≤N-1
Wherein, N=23 is the frequency band sum,
Figure A20091010805800112
It is the voice signal estimated value after k frequency band strengthens.
At last, from the frequency domain transform to the time domain, the processing of postemphasising becomes output signal with the voice signal after the signal to noise ratio (S/N ratio) enhancing after handling.It is operating as:
The first step: inverse fast fourier transform (FFT) transforms to time domain to the voice signal of frequency domain, the time domain voice signal after being enhanced.
The conversion of time domain realizes with general contrary discrete Fourier transform (IDFT).
s ( m , n ) = 1 2 * &Sigma; n = 0 M - 1 S ^ ( k ) e j 2 &pi;nk / M 0≤k≤M-1
Wherein, M=256 is frame length.S is the voice signal that transforms to after full range band after the time domain strengthens.
Second step: the processing of postemphasising.
With the pre-emphasis of front handle opposite, here with signal by a low-pass filter, farthest reduce original signal.The frequency response of wave filter is as follows;
H(z)=1+αz -1
The coefficient here is corresponding with the processing of front pre-emphasis, gets α=0.9.
Compare with correlation technique, the method that voice of the present invention strengthen has realized real-time speech-enhancement system, voice collection device output be voice signal behind the direct noise reduction, saved the cost of other use respective algorithms, and improved the intelligibility that noise alleviation, signal to noise ratio (S/N ratio) has been improved, has guaranteed voice greatly, especially to automobile noise, a class such as the street noise attenuating of additive noise stably is particularly outstanding.
Above-described only is embodiments of the present invention, should be pointed out that for the person of ordinary skill of the art at this, under the prerequisite that does not break away from the invention design, can also make improvement, but these all belongs to protection scope of the present invention.

Claims (6)

1, a kind of method of voice enhancing is characterized in that, may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
(5), the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.
2, the method that strengthens according to the described voice of claim 1 is characterized in that, is treated to the decay gain factor that the Noisy Speech Signal of present frame be multiply by frequency band in the described step (7).
3, the method that strengthens according to the described voice of claim 1 is characterized in that: operate described step (8) and include:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
4, a kind of voice acquisition system of voice increase is characterized in that, comprising: the chip of the method that voice collection device, integrated voice according to claim 1 increase.
5, voice acquisition system according to claim 4 is characterized in that: described chipset is formed in the voice collection device.
6, according to claim 4 or 5 described voice acquisition systems, it is characterized in that: described voice collection device is the microphone microphone.
CNA2009101080587A 2009-06-12 2009-06-12 Method and voice collecting system for speech enhancement Pending CN101582264A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2009101080587A CN101582264A (en) 2009-06-12 2009-06-12 Method and voice collecting system for speech enhancement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2009101080587A CN101582264A (en) 2009-06-12 2009-06-12 Method and voice collecting system for speech enhancement

Publications (1)

Publication Number Publication Date
CN101582264A true CN101582264A (en) 2009-11-18

Family

ID=41364387

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2009101080587A Pending CN101582264A (en) 2009-06-12 2009-06-12 Method and voice collecting system for speech enhancement

Country Status (1)

Country Link
CN (1) CN101582264A (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101894563A (en) * 2010-07-15 2010-11-24 瑞声声学科技(深圳)有限公司 Voice enhancing method
CN101976565A (en) * 2010-07-09 2011-02-16 瑞声声学科技(深圳)有限公司 Dual-microphone-based speech enhancement device and method
CN101976566A (en) * 2010-07-09 2011-02-16 瑞声声学科技(深圳)有限公司 Voice enhancement method and device using same
CN102074241A (en) * 2011-01-07 2011-05-25 蔡镇滨 Method for realizing voice reduction through rapid voice waveform repairing
CN101916567B (en) * 2009-11-23 2012-02-01 瑞声声学科技(深圳)有限公司 Speech enhancement method applied to dual-microphone system
WO2012094952A1 (en) * 2011-01-10 2012-07-19 华为技术有限公司 Signal processing method and device
CN103280225A (en) * 2013-05-24 2013-09-04 广州海格通信集团股份有限公司 Low-complexity silence detection method
CN103426433A (en) * 2012-05-14 2013-12-04 宏达国际电子股份有限公司 Noise cancellation method
CN103871421A (en) * 2014-03-21 2014-06-18 厦门莱亚特医疗器械有限公司 Self-adaptive denoising method and system based on sub-band noise analysis
CN103915099A (en) * 2012-12-29 2014-07-09 北京百度网讯科技有限公司 Speech pitch period detection method and device
CN104867498A (en) * 2014-12-26 2015-08-26 深圳市微纳集成电路与系统应用研究院 Mobile communication terminal and voice enhancement method and module thereof
CN105280195A (en) * 2015-11-04 2016-01-27 腾讯科技(深圳)有限公司 Method and device for processing speech signal
CN105679330A (en) * 2016-03-16 2016-06-15 南京工程学院 Digital hearing aid noise reduction method based on improved sub-band signal-to-noise ratio estimation
CN106297818A (en) * 2016-09-12 2017-01-04 广州酷狗计算机科技有限公司 The method and apparatus of noisy speech signal is removed in a kind of acquisition
CN106328155A (en) * 2016-09-13 2017-01-11 广东顺德中山大学卡内基梅隆大学国际联合研究院 Speech enhancement method of correcting priori signal-to-noise ratio overestimation
CN106601267A (en) * 2016-11-30 2017-04-26 武汉船舶通信研究所 Ultra-short wave FM modulation-based speech enhancement method
CN107045874A (en) * 2016-02-05 2017-08-15 深圳市潮流网络技术有限公司 A kind of Non-linear Speech Enhancement Method based on correlation
CN107437417A (en) * 2017-08-02 2017-12-05 中国科学院自动化研究所 Based on speech data Enhancement Method and device in Recognition with Recurrent Neural Network speech recognition
CN107895582A (en) * 2017-10-16 2018-04-10 中国电子科技集团公司第二十八研究所 Towards the speaker adaptation speech-emotion recognition method in multi-source information field
WO2019024008A1 (en) * 2017-08-02 2019-02-07 中国科学院自动化研究所 Voice data enhancing method and device in voice recognition based on recurrent neural network
CN110634500A (en) * 2019-10-14 2019-12-31 达闼科技成都有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN111128213A (en) * 2019-12-10 2020-05-08 展讯通信(上海)有限公司 Noise suppression method and system for processing in different frequency bands
CN111354365A (en) * 2020-03-10 2020-06-30 苏宁云计算有限公司 Pure voice data sampling rate identification method, device and system
CN113168843A (en) * 2018-11-21 2021-07-23 深圳市欢太科技有限公司 Audio processing method and device, storage medium and electronic equipment
CN113613112A (en) * 2021-09-23 2021-11-05 三星半导体(中国)研究开发有限公司 Method and electronic device for suppressing wind noise of microphone

Cited By (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101916567B (en) * 2009-11-23 2012-02-01 瑞声声学科技(深圳)有限公司 Speech enhancement method applied to dual-microphone system
CN101976565A (en) * 2010-07-09 2011-02-16 瑞声声学科技(深圳)有限公司 Dual-microphone-based speech enhancement device and method
CN101976566A (en) * 2010-07-09 2011-02-16 瑞声声学科技(深圳)有限公司 Voice enhancement method and device using same
CN101976566B (en) * 2010-07-09 2012-05-02 瑞声声学科技(深圳)有限公司 Voice enhancement method and device using same
CN101894563B (en) * 2010-07-15 2013-03-20 瑞声声学科技(深圳)有限公司 Voice enhancing method
CN101894563A (en) * 2010-07-15 2010-11-24 瑞声声学科技(深圳)有限公司 Voice enhancing method
CN102074241B (en) * 2011-01-07 2012-03-28 蔡镇滨 Method for realizing voice reduction through rapid voice waveform repairing
CN102074241A (en) * 2011-01-07 2011-05-25 蔡镇滨 Method for realizing voice reduction through rapid voice waveform repairing
WO2012094952A1 (en) * 2011-01-10 2012-07-19 华为技术有限公司 Signal processing method and device
US9996503B2 (en) 2011-01-10 2018-06-12 Huawei Technologies Co., Ltd. Signal processing method and device
US9792257B2 (en) 2011-01-10 2017-10-17 Huawei Technologies Co., Ltd. Audio signal processing method and encoder
US9519619B2 (en) 2011-01-10 2016-12-13 Huawei Technologies Co., Ltd. Data processing method and device for processing speech signal or audio signal
US9280984B2 (en) 2012-05-14 2016-03-08 Htc Corporation Noise cancellation method
CN103426433A (en) * 2012-05-14 2013-12-04 宏达国际电子股份有限公司 Noise cancellation method
US9711164B2 (en) 2012-05-14 2017-07-18 Htc Corporation Noise cancellation method
CN103426433B (en) * 2012-05-14 2016-05-04 宏达国际电子股份有限公司 Noise cancellation method
CN103915099A (en) * 2012-12-29 2014-07-09 北京百度网讯科技有限公司 Speech pitch period detection method and device
CN103915099B (en) * 2012-12-29 2016-12-28 北京百度网讯科技有限公司 Voice fundamental periodicity detection methods and device
CN103280225B (en) * 2013-05-24 2015-07-01 广州海格通信集团股份有限公司 Low-complexity silence detection method
CN103280225A (en) * 2013-05-24 2013-09-04 广州海格通信集团股份有限公司 Low-complexity silence detection method
CN103871421B (en) * 2014-03-21 2018-02-02 厦门莱亚特医疗器械有限公司 A kind of self-adaptation noise reduction method and system based on subband noise analysis
CN103871421A (en) * 2014-03-21 2014-06-18 厦门莱亚特医疗器械有限公司 Self-adaptive denoising method and system based on sub-band noise analysis
CN104867498A (en) * 2014-12-26 2015-08-26 深圳市微纳集成电路与系统应用研究院 Mobile communication terminal and voice enhancement method and module thereof
CN105280195B (en) * 2015-11-04 2018-12-28 腾讯科技(深圳)有限公司 The processing method and processing device of voice signal
US10586551B2 (en) 2015-11-04 2020-03-10 Tencent Technology (Shenzhen) Company Limited Speech signal processing method and apparatus
US10924614B2 (en) 2015-11-04 2021-02-16 Tencent Technology (Shenzhen) Company Limited Speech signal processing method and apparatus
CN105280195A (en) * 2015-11-04 2016-01-27 腾讯科技(深圳)有限公司 Method and device for processing speech signal
CN107045874A (en) * 2016-02-05 2017-08-15 深圳市潮流网络技术有限公司 A kind of Non-linear Speech Enhancement Method based on correlation
CN105679330A (en) * 2016-03-16 2016-06-15 南京工程学院 Digital hearing aid noise reduction method based on improved sub-band signal-to-noise ratio estimation
CN105679330B (en) * 2016-03-16 2019-11-29 南京工程学院 Based on the digital deaf-aid noise-reduction method for improving subband signal-to-noise ratio (SNR) estimation
CN106297818B (en) * 2016-09-12 2019-09-13 广州酷狗计算机科技有限公司 It is a kind of to obtain the method and apparatus for removing noisy speech signal
CN106297818A (en) * 2016-09-12 2017-01-04 广州酷狗计算机科技有限公司 The method and apparatus of noisy speech signal is removed in a kind of acquisition
CN106328155A (en) * 2016-09-13 2017-01-11 广东顺德中山大学卡内基梅隆大学国际联合研究院 Speech enhancement method of correcting priori signal-to-noise ratio overestimation
CN106601267A (en) * 2016-11-30 2017-04-26 武汉船舶通信研究所 Ultra-short wave FM modulation-based speech enhancement method
CN106601267B (en) * 2016-11-30 2019-12-06 武汉船舶通信研究所 Voice enhancement method based on ultrashort wave FM modulation
WO2019024008A1 (en) * 2017-08-02 2019-02-07 中国科学院自动化研究所 Voice data enhancing method and device in voice recognition based on recurrent neural network
CN107437417A (en) * 2017-08-02 2017-12-05 中国科学院自动化研究所 Based on speech data Enhancement Method and device in Recognition with Recurrent Neural Network speech recognition
CN107895582A (en) * 2017-10-16 2018-04-10 中国电子科技集团公司第二十八研究所 Towards the speaker adaptation speech-emotion recognition method in multi-source information field
CN113168843B (en) * 2018-11-21 2022-04-22 深圳市欢太科技有限公司 Audio processing method and device, storage medium and electronic equipment
CN113168843A (en) * 2018-11-21 2021-07-23 深圳市欢太科技有限公司 Audio processing method and device, storage medium and electronic equipment
CN110634500A (en) * 2019-10-14 2019-12-31 达闼科技成都有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN110634500B (en) * 2019-10-14 2022-05-31 达闼机器人股份有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN111128213A (en) * 2019-12-10 2020-05-08 展讯通信(上海)有限公司 Noise suppression method and system for processing in different frequency bands
CN111128213B (en) * 2019-12-10 2022-09-27 展讯通信(上海)有限公司 Noise suppression method and system for processing in different frequency bands
CN111354365A (en) * 2020-03-10 2020-06-30 苏宁云计算有限公司 Pure voice data sampling rate identification method, device and system
CN111354365B (en) * 2020-03-10 2023-10-31 苏宁云计算有限公司 Pure voice data sampling rate identification method, device and system
CN113613112A (en) * 2021-09-23 2021-11-05 三星半导体(中国)研究开发有限公司 Method and electronic device for suppressing wind noise of microphone
CN113613112B (en) * 2021-09-23 2024-03-29 三星半导体(中国)研究开发有限公司 Method for suppressing wind noise of microphone and electronic device

Similar Documents

Publication Publication Date Title
CN101582264A (en) Method and voice collecting system for speech enhancement
CN101599274B (en) Method for speech enhancement
CN101894563B (en) Voice enhancing method
CN101976566B (en) Voice enhancement method and device using same
CN102074245B (en) Dual-microphone-based speech enhancement device and speech enhancement method
CN102074246B (en) Dual-microphone based speech enhancement device and method
CN101916567B (en) Speech enhancement method applied to dual-microphone system
CN101763858A (en) Method for processing double-microphone signal
CN101477800A (en) Voice enhancing process
EP2151822B1 (en) Apparatus and method for processing and audio signal for speech enhancement using a feature extraction
CN101430882B (en) Method and apparatus for restraining wind noise
CN102792373B (en) Noise suppression device
EP2164066B1 (en) Noise spectrum tracking in noisy acoustical signals
CN100543842C (en) Realize the method that ground unrest suppresses based on multiple statistics model and least mean-square error
CN102347027A (en) Double-microphone speech enhancer and speech enhancement method thereof
WO2021114733A1 (en) Noise suppression method for processing at different frequency bands, and system thereof
CN101976565A (en) Dual-microphone-based speech enhancement device and method
US20040064307A1 (en) Noise reduction method and device
CN106340292A (en) Voice enhancement method based on continuous noise estimation
CN103440872B (en) The denoising method of transient state noise
TW201248613A (en) System and method for monaural audio processing based preserving speech information
Kesarkar et al. Feature extraction for speech recognition
CN105390142A (en) Digital hearing aid voice noise elimination method
CN103578466B (en) Based on the voice non-voice detection method of Fourier Transform of Fractional Order
CN101853665A (en) Method for eliminating noise in voice

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20091118