CN101582264A - Method and voice collecting system for speech enhancement - Google Patents
Method and voice collecting system for speech enhancement Download PDFInfo
- Publication number
- CN101582264A CN101582264A CNA2009101080587A CN200910108058A CN101582264A CN 101582264 A CN101582264 A CN 101582264A CN A2009101080587 A CNA2009101080587 A CN A2009101080587A CN 200910108058 A CN200910108058 A CN 200910108058A CN 101582264 A CN101582264 A CN 101582264A
- Authority
- CN
- China
- Prior art keywords
- signal
- voice
- noise
- frequency band
- energy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention provides a method and a voice collecting system for speech enhancement. The system comprises a microphone collecting device and a chip which integrates the method for the speech enhancement. The method for speech enhancement comprises the following steps of: carrying out frame division and pre-emphasis treatment on the noise-contained speech signal so as to be converted to frequency domains; dividing into a plurality of frequency bands; calculating the signal energies of each channel; calculating the estimation value of the posterior signal-to-noise-ratio and the prior signal-to-noise ratio of the current frame; judging whether updating the estimation value of the noise energy; calculating and processing the attenuation factor of each frequency band so as to obtain the speech signal with enhanced signal-to-noise ratio; and converting the processed speech signal into the time domain and outputting the speech signal.
Description
[technical field]
The present invention relates to a kind of method of voice increase and the voice acquisition system of integrated this method.
[background technology]
Since the existence of a large amount of neighbourhood noises, voice acquisition system, and as the microphone microphone, the general signal to noise ratio (S/N ratio) of the voice signal that collects is not high enough.In order to collect the high voice signal of signal to noise ratio (S/N ratio), usually, gather voice signal in the certain limit by utilizing directional microphone, or the method for utilizing voice to strengthen promotes the signal to noise ratio (S/N ratio) of voice signal.
Calculated amount and storage space that existing relevant voice enhancement algorithm needs are all bigger than normal, and than higher, the area of the silicon that needs when making special chip is also bigger, thereby makes its cost also than higher, and noise reduction neither be very desirable to the requirement of hardware.
Therefore, be necessary to study the method that a kind of new voice strengthen, to reach good noise reduction.
[summary of the invention]
The technical matters that the present invention need solve provides a kind of method of voice increase of excellent noise reduction effect,
According to above-mentioned technical matters, designed the method that a kind of voice strengthen, it may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
(5), the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.
More excellent is, is treated to the decay gain factor that the Noisy Speech Signal of present frame be multiply by frequency band in the described step (7).
More excellent is to operate described step (8) and include:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
Another technical matters solved by the invention provides the voice acquisition system that a kind of voice increase, and it comprises the chip of the method that voice collection device, integrated as above predicate sound increase.
More excellent is that described chipset is formed in the voice collection device.
More excellent is that described voice collection device is the microphone microphone.
Compare with correlation technique, the method that voice of the present invention strengthen has realized real-time speech-enhancement system, voice collection device output be voice signal behind the direct noise reduction, and improved greatly noise alleviation, guaranteed the intelligibility of voice, especially to automobile noise, a class such as the street noise attenuating of additive noise stably is particularly outstanding.
[description of drawings]
Fig. 1 is the schematic flow sheet of the method for voice enhancing of the present invention;
[embodiment]
The invention will be further described below in conjunction with drawings and embodiments.
Main thought of the present invention is in the chip that a kind of voice enhancement algorithm is integrated in special use, and by the interface data transmission of this design chips with corresponding voice collection device, to form a real-time speech-enhancement system.Voice signal is directly handled by the voice enhancement algorithm in the chip by the voice collection device collection again, obtains the signal after signal to noise ratio (S/N ratio) strengthens, and output is for secondary use.
The voice acquisition system that voice of the present invention strengthen comprises: voice collection device, voice signal process chip, chip is integrated in this voice collection device.This voice collection device is the microphone microphone in the present embodiment, and the simulating signal of microphone collection also need be converted to digital signal, handles for chip.
The present invention is integrated in the method that the voice in the chip strengthen, and it may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal (this signal is a digital signal) carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
Estimated value to the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled, the Noisy Speech Signal of present frame be multiply by the decay gain factor of frequency band;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.Concrete steps (8) are:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
Be introduced below by specific embodiment, the sampling rate of the Noisy Speech Signal of the voice acquisition system input that these voice strengthen is 8kHZ again, and precision is 16.
At first, the Noisy Speech Signal in time domain being carried out the branch frame, is to be that unit is divided into some Noisy Speech Signals unit with the frame with Noisy Speech Signal.This Noisy Speech Signal unit is made up of sampled point, the present invention has chosen the sample frequency of 8KHz, needs according to the short-time spectrum analysis, frame length is generally set between 10~35ms, present embodiment divides frame with 32ms, and promptly a frame Noisy Speech Signal unit is provided with 256 sampled points, nature, any frame Noisy Speech Signal unit has certain frame length, and the frame length of arbitrary frame of the present invention is 256.
Voice signal behind the branch frame through a Hi-pass filter, is handled as pre-emphasis.Because the ground unrest in the voice signal is generally bigger at the low frequency part energy,, make noise reduction better so use can the decay deal of low frequency part of this Hi-pass filter.Its form is as follows:
H(z)=1-αz
-1
The general value of α is between 0.75-0.95, and effect preferably can be obtained in α=0.9 here.
Because voice signal is stably in short-term, handle so can carry out the branch frame, but the branch frame can bring the discontinuous of frame signal boundary again and cause frequency to be revealed signal.So, to carry out Short Time Fourier Transform (STFT) for the voice signal behind minute frame.Short Time Fourier Transform can be understood as does Fourier transform again to the windowing of frame signal elder generation.The purpose of windowed function is exactly for when doing Short Time Fourier Transform, reduces the discontinuous of frame signal boundary and causes frequency to reveal, thereby reduce " blocking effect ".Here used a length to equal the Hamming window of 256 of frame lengths, it can effectively reduce the oscillation degree of Gibbs' effect.
Hamming window function is defined as follows:
win(n)={
0.54-0.46cos(2*π*n/M) 0≤n≤M-1
0 all the other n
}
Short Time Fourier Transform is as follows:
Wherein, M=256 is the computational length of Fourier Tranform in short-term.M represents the m frame signal.
So just the Noisy Speech Signal s with present frame has transformed from the time domain to frequency domain.
The Noisy Speech Signal that transforms to behind the frequency domain comprises voice signal and noise signal, and this signal is that unit is divided into some frequency bands with the frame, and the voice signal at different frequency bands carries out different policing actions afterwards.
Below the following Noisy Speech Signal of 4kHz is carried out frequency band division, signal Processing is afterwards all carried out in each frequency band, so both can reduce computational complexity, can do different processing at different frequency bands again, obtains better voice and strengthens effect.
Signal among the present invention is divided into 23 frequency bands altogether, specifically sees Table 1.
23 frequency band division of table 1
Frequency band number | Initial frequency (Hz) | Cutoff frequency (Hz) |
1 | 62.5 | 93.75 |
2 | 125 | 156.25 |
3 | 187.5 | 218.75 |
4 | 250 | 281.25 |
5 | 312.5 | 343.75 |
6 | 375 | 406.25 |
7 | 437.5 | 468.75 |
8 | 500 | 531.25 |
9 | 562.5 | 593.75 |
10 | 625 | 656.25 |
11 | 687.5 | 718.75 |
12 | 750 | 781.25 |
13 | 812.5 | 906.25 |
14 | 937.5 | 1062.5 |
15 | 1093.75 | 1250 |
16 | 1281.25 | 1468.75 |
17 | 1500 | 1718.75 |
18 | 1750 | 2000 |
19 | 2031.25 | 2312.5 |
20 | 2343.75 | 2687.5 |
21 | 2718.75 | 3125 |
22 | 3156.25 | 3687.5 |
23 | 3718.75 | 3968.75 |
The signal energy of each frequency band estimates, calculates and carries out smoothly with following formula:
E(m,k)=|X(m,k)|
2 0≤k≤N-1
Y(m,k)=αY(m-1,k)+(1-α)E(m,k) 0≤k≤N-1
Wherein, and Y (m represents the sequence number of present frame for m, the k) signal energy in each the frequency band interval of expression after level and smooth, and k represents the sequence number of current subband, and smoothing factor is represented in α=0.75, and N is the frequency band sum of choosing, promptly 23.
The signal energy in each the frequency band interval after level and smooth comprises speech energy and noise energy, here, obtain the initial estimated value of a noise energy earlier, the posteriority signal to noise ratio (S/N ratio) of removing to calculate each frequency band present frame according to the initial estimated value of signal energy and noise energy, and obtain the priori SNR estimation value of present frame by the priori snr computation of former frame.By the priori SNR estimation value that obtains present frame is adjudicated again, judges whether it is noise:
If judgement is "No", it promptly not noise, then the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori snr computation of former frame, recycle is adjudicated present frame, judge whether it is noise, whether the estimated value of noise energy needs to upgrade.
If judgement is noise for "Yes", according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band, continue next step;
Calculate the formula of the posteriority signal to noise ratio (S/N ratio) of current frame signal, as follows:
Wherein V (k) represents the energy value of the noise signal of current estimation.
Based on the priori SNR estimation formula of Ephraim and Malah, the formula of the priori SNR estimation value of calculating present frame is as follows then:
Among the present invention, the judgement of the noise energy of each frequency band has adopted the voice activation based on the priori signal to noise ratio (S/N ratio) to detect (VAD) method with renewal.Judge at first whether present frame is pure noise signal.
VAD (m) is judged, and carry out noise and upgrade, as follows:
Wherein η is that noise upgrades the judgement factor, gets η=0.01 among the present invention.
μ is a smoothing factor, gets μ=0.9 here.
Next, calculating the decay gain factor of each frequency band.Based on the priori SNR estimation value that previous calculations draws, take different strategies.For the big frequency band of signal to noise ratio (S/N ratio), can think voice signal, adopt the method for spectral substraction to obtain decay factor, for the little frequency band of signal to noise ratio (S/N ratio), think noise signal, it is carried out to a certain degree decay.Its concrete formula is as follows.
Wherein, a, b, c are respectively different constants.
Consider that noise mainly concentrates on lower frequency band,, get different a, b, c therefore for medium and low frequency section and high frequency.
Among the present invention for the frequency band of k≤18, i.e. the following signal of 2kHz, a=10, b=5.5, c=8
For the frequency band of k>18, i.e. the above signal of 2kHz, a=5, b=4.8, c=5
Obtain the gain factor of decaying, (m k), multiply by it, and what obtain is exactly voice signal after this frequency band signal to noise ratio (S/N ratio) strengthens with the Noisy Speech Signal X of each frequency band of present frame again.
Wherein, N=23 is the frequency band sum,
It is the voice signal estimated value after k frequency band strengthens.
At last, from the frequency domain transform to the time domain, the processing of postemphasising becomes output signal with the voice signal after the signal to noise ratio (S/N ratio) enhancing after handling.It is operating as:
The first step: inverse fast fourier transform (FFT) transforms to time domain to the voice signal of frequency domain, the time domain voice signal after being enhanced.
The conversion of time domain realizes with general contrary discrete Fourier transform (IDFT).
Wherein, M=256 is frame length.S is the voice signal that transforms to after full range band after the time domain strengthens.
Second step: the processing of postemphasising.
With the pre-emphasis of front handle opposite, here with signal by a low-pass filter, farthest reduce original signal.The frequency response of wave filter is as follows;
H(z)=1+αz
-1
The coefficient here is corresponding with the processing of front pre-emphasis, gets α=0.9.
Compare with correlation technique, the method that voice of the present invention strengthen has realized real-time speech-enhancement system, voice collection device output be voice signal behind the direct noise reduction, saved the cost of other use respective algorithms, and improved the intelligibility that noise alleviation, signal to noise ratio (S/N ratio) has been improved, has guaranteed voice greatly, especially to automobile noise, a class such as the street noise attenuating of additive noise stably is particularly outstanding.
Above-described only is embodiments of the present invention, should be pointed out that for the person of ordinary skill of the art at this, under the prerequisite that does not break away from the invention design, can also make improvement, but these all belongs to protection scope of the present invention.
Claims (6)
1, a kind of method of voice enhancing is characterized in that, may further comprise the steps:
(1), voice collection device being collected Noisy Speech Signal carries out branch frame, pre-emphasis processing, arrives frequency domain through Short Time Fourier Transform again with chip;
(2), the Noisy Speech Signal that will transform to behind the frequency domain is divided into some frequency bands, calculate each frequency band energy again and carry out level and smooth, obtain the signal energy in each frequency band after level and smooth, described signal energy comprises speech energy and noise energy, and obtains the initial estimate of described noise energy;
(3), by the initial estimate of signal energy and noise energy, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
(4), present frame is adjudicated, judge whether it is noise, otherwise execution in step (5), be then to carry out (6) by the priori SNR estimation value that obtains;
(5), the estimated value of the noise energy of each frequency band is upgraded, the estimated value of the current renewal by signal energy and noise energy again, calculate the posteriority signal to noise ratio (S/N ratio) of each frequency band present frame, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame, continue execution in step (4) to adjudicate again;
(6), according to the priori SNR estimation value that obtains, calculate the decay gain factor of each frequency band;
(7), with the decay gain factor that obtains, the signal spectrum that is divided into each frequency band is handled;
(8), the frequency-region signal after will handling transforms to time domain, the processing of postemphasising becomes output signal.
2, the method that strengthens according to the described voice of claim 1 is characterized in that, is treated to the decay gain factor that the Noisy Speech Signal of present frame be multiply by frequency band in the described step (7).
3, the method that strengthens according to the described voice of claim 1 is characterized in that: operate described step (8) and include:
(81), by inverse fast fourier transform frequency-region signal is transformed to time domain, the time domain voice signal after being enhanced;
(82), increase the weight of to handle by low-pass filter.
4, a kind of voice acquisition system of voice increase is characterized in that, comprising: the chip of the method that voice collection device, integrated voice according to claim 1 increase.
5, voice acquisition system according to claim 4 is characterized in that: described chipset is formed in the voice collection device.
6, according to claim 4 or 5 described voice acquisition systems, it is characterized in that: described voice collection device is the microphone microphone.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2009101080587A CN101582264A (en) | 2009-06-12 | 2009-06-12 | Method and voice collecting system for speech enhancement |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2009101080587A CN101582264A (en) | 2009-06-12 | 2009-06-12 | Method and voice collecting system for speech enhancement |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101582264A true CN101582264A (en) | 2009-11-18 |
Family
ID=41364387
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2009101080587A Pending CN101582264A (en) | 2009-06-12 | 2009-06-12 | Method and voice collecting system for speech enhancement |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101582264A (en) |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101894563A (en) * | 2010-07-15 | 2010-11-24 | 瑞声声学科技(深圳)有限公司 | Voice enhancing method |
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
CN102074241A (en) * | 2011-01-07 | 2011-05-25 | 蔡镇滨 | Method for realizing voice reduction through rapid voice waveform repairing |
CN101916567B (en) * | 2009-11-23 | 2012-02-01 | 瑞声声学科技(深圳)有限公司 | Speech enhancement method applied to dual-microphone system |
WO2012094952A1 (en) * | 2011-01-10 | 2012-07-19 | 华为技术有限公司 | Signal processing method and device |
CN103280225A (en) * | 2013-05-24 | 2013-09-04 | 广州海格通信集团股份有限公司 | Low-complexity silence detection method |
CN103426433A (en) * | 2012-05-14 | 2013-12-04 | 宏达国际电子股份有限公司 | Noise cancellation method |
CN103871421A (en) * | 2014-03-21 | 2014-06-18 | 厦门莱亚特医疗器械有限公司 | Self-adaptive denoising method and system based on sub-band noise analysis |
CN103915099A (en) * | 2012-12-29 | 2014-07-09 | 北京百度网讯科技有限公司 | Speech pitch period detection method and device |
CN104867498A (en) * | 2014-12-26 | 2015-08-26 | 深圳市微纳集成电路与系统应用研究院 | Mobile communication terminal and voice enhancement method and module thereof |
CN105280195A (en) * | 2015-11-04 | 2016-01-27 | 腾讯科技(深圳)有限公司 | Method and device for processing speech signal |
CN105679330A (en) * | 2016-03-16 | 2016-06-15 | 南京工程学院 | Digital hearing aid noise reduction method based on improved sub-band signal-to-noise ratio estimation |
CN106297818A (en) * | 2016-09-12 | 2017-01-04 | 广州酷狗计算机科技有限公司 | The method and apparatus of noisy speech signal is removed in a kind of acquisition |
CN106328155A (en) * | 2016-09-13 | 2017-01-11 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Speech enhancement method of correcting priori signal-to-noise ratio overestimation |
CN106601267A (en) * | 2016-11-30 | 2017-04-26 | 武汉船舶通信研究所 | Ultra-short wave FM modulation-based speech enhancement method |
CN107045874A (en) * | 2016-02-05 | 2017-08-15 | 深圳市潮流网络技术有限公司 | A kind of Non-linear Speech Enhancement Method based on correlation |
CN107437417A (en) * | 2017-08-02 | 2017-12-05 | 中国科学院自动化研究所 | Based on speech data Enhancement Method and device in Recognition with Recurrent Neural Network speech recognition |
CN107895582A (en) * | 2017-10-16 | 2018-04-10 | 中国电子科技集团公司第二十八研究所 | Towards the speaker adaptation speech-emotion recognition method in multi-source information field |
WO2019024008A1 (en) * | 2017-08-02 | 2019-02-07 | 中国科学院自动化研究所 | Voice data enhancing method and device in voice recognition based on recurrent neural network |
CN110634500A (en) * | 2019-10-14 | 2019-12-31 | 达闼科技成都有限公司 | Method for calculating prior signal-to-noise ratio, electronic device and storage medium |
CN111128213A (en) * | 2019-12-10 | 2020-05-08 | 展讯通信(上海)有限公司 | Noise suppression method and system for processing in different frequency bands |
CN111354365A (en) * | 2020-03-10 | 2020-06-30 | 苏宁云计算有限公司 | Pure voice data sampling rate identification method, device and system |
CN113168843A (en) * | 2018-11-21 | 2021-07-23 | 深圳市欢太科技有限公司 | Audio processing method and device, storage medium and electronic equipment |
CN113613112A (en) * | 2021-09-23 | 2021-11-05 | 三星半导体(中国)研究开发有限公司 | Method and electronic device for suppressing wind noise of microphone |
-
2009
- 2009-06-12 CN CNA2009101080587A patent/CN101582264A/en active Pending
Cited By (48)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101916567B (en) * | 2009-11-23 | 2012-02-01 | 瑞声声学科技(深圳)有限公司 | Speech enhancement method applied to dual-microphone system |
CN101976565A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN101976566A (en) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
CN101976566B (en) * | 2010-07-09 | 2012-05-02 | 瑞声声学科技(深圳)有限公司 | Voice enhancement method and device using same |
CN101894563B (en) * | 2010-07-15 | 2013-03-20 | 瑞声声学科技(深圳)有限公司 | Voice enhancing method |
CN101894563A (en) * | 2010-07-15 | 2010-11-24 | 瑞声声学科技(深圳)有限公司 | Voice enhancing method |
CN102074241B (en) * | 2011-01-07 | 2012-03-28 | 蔡镇滨 | Method for realizing voice reduction through rapid voice waveform repairing |
CN102074241A (en) * | 2011-01-07 | 2011-05-25 | 蔡镇滨 | Method for realizing voice reduction through rapid voice waveform repairing |
WO2012094952A1 (en) * | 2011-01-10 | 2012-07-19 | 华为技术有限公司 | Signal processing method and device |
US9996503B2 (en) | 2011-01-10 | 2018-06-12 | Huawei Technologies Co., Ltd. | Signal processing method and device |
US9792257B2 (en) | 2011-01-10 | 2017-10-17 | Huawei Technologies Co., Ltd. | Audio signal processing method and encoder |
US9519619B2 (en) | 2011-01-10 | 2016-12-13 | Huawei Technologies Co., Ltd. | Data processing method and device for processing speech signal or audio signal |
US9280984B2 (en) | 2012-05-14 | 2016-03-08 | Htc Corporation | Noise cancellation method |
CN103426433A (en) * | 2012-05-14 | 2013-12-04 | 宏达国际电子股份有限公司 | Noise cancellation method |
US9711164B2 (en) | 2012-05-14 | 2017-07-18 | Htc Corporation | Noise cancellation method |
CN103426433B (en) * | 2012-05-14 | 2016-05-04 | 宏达国际电子股份有限公司 | Noise cancellation method |
CN103915099A (en) * | 2012-12-29 | 2014-07-09 | 北京百度网讯科技有限公司 | Speech pitch period detection method and device |
CN103915099B (en) * | 2012-12-29 | 2016-12-28 | 北京百度网讯科技有限公司 | Voice fundamental periodicity detection methods and device |
CN103280225B (en) * | 2013-05-24 | 2015-07-01 | 广州海格通信集团股份有限公司 | Low-complexity silence detection method |
CN103280225A (en) * | 2013-05-24 | 2013-09-04 | 广州海格通信集团股份有限公司 | Low-complexity silence detection method |
CN103871421B (en) * | 2014-03-21 | 2018-02-02 | 厦门莱亚特医疗器械有限公司 | A kind of self-adaptation noise reduction method and system based on subband noise analysis |
CN103871421A (en) * | 2014-03-21 | 2014-06-18 | 厦门莱亚特医疗器械有限公司 | Self-adaptive denoising method and system based on sub-band noise analysis |
CN104867498A (en) * | 2014-12-26 | 2015-08-26 | 深圳市微纳集成电路与系统应用研究院 | Mobile communication terminal and voice enhancement method and module thereof |
CN105280195B (en) * | 2015-11-04 | 2018-12-28 | 腾讯科技(深圳)有限公司 | The processing method and processing device of voice signal |
US10586551B2 (en) | 2015-11-04 | 2020-03-10 | Tencent Technology (Shenzhen) Company Limited | Speech signal processing method and apparatus |
US10924614B2 (en) | 2015-11-04 | 2021-02-16 | Tencent Technology (Shenzhen) Company Limited | Speech signal processing method and apparatus |
CN105280195A (en) * | 2015-11-04 | 2016-01-27 | 腾讯科技(深圳)有限公司 | Method and device for processing speech signal |
CN107045874A (en) * | 2016-02-05 | 2017-08-15 | 深圳市潮流网络技术有限公司 | A kind of Non-linear Speech Enhancement Method based on correlation |
CN105679330A (en) * | 2016-03-16 | 2016-06-15 | 南京工程学院 | Digital hearing aid noise reduction method based on improved sub-band signal-to-noise ratio estimation |
CN105679330B (en) * | 2016-03-16 | 2019-11-29 | 南京工程学院 | Based on the digital deaf-aid noise-reduction method for improving subband signal-to-noise ratio (SNR) estimation |
CN106297818B (en) * | 2016-09-12 | 2019-09-13 | 广州酷狗计算机科技有限公司 | It is a kind of to obtain the method and apparatus for removing noisy speech signal |
CN106297818A (en) * | 2016-09-12 | 2017-01-04 | 广州酷狗计算机科技有限公司 | The method and apparatus of noisy speech signal is removed in a kind of acquisition |
CN106328155A (en) * | 2016-09-13 | 2017-01-11 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Speech enhancement method of correcting priori signal-to-noise ratio overestimation |
CN106601267A (en) * | 2016-11-30 | 2017-04-26 | 武汉船舶通信研究所 | Ultra-short wave FM modulation-based speech enhancement method |
CN106601267B (en) * | 2016-11-30 | 2019-12-06 | 武汉船舶通信研究所 | Voice enhancement method based on ultrashort wave FM modulation |
WO2019024008A1 (en) * | 2017-08-02 | 2019-02-07 | 中国科学院自动化研究所 | Voice data enhancing method and device in voice recognition based on recurrent neural network |
CN107437417A (en) * | 2017-08-02 | 2017-12-05 | 中国科学院自动化研究所 | Based on speech data Enhancement Method and device in Recognition with Recurrent Neural Network speech recognition |
CN107895582A (en) * | 2017-10-16 | 2018-04-10 | 中国电子科技集团公司第二十八研究所 | Towards the speaker adaptation speech-emotion recognition method in multi-source information field |
CN113168843B (en) * | 2018-11-21 | 2022-04-22 | 深圳市欢太科技有限公司 | Audio processing method and device, storage medium and electronic equipment |
CN113168843A (en) * | 2018-11-21 | 2021-07-23 | 深圳市欢太科技有限公司 | Audio processing method and device, storage medium and electronic equipment |
CN110634500A (en) * | 2019-10-14 | 2019-12-31 | 达闼科技成都有限公司 | Method for calculating prior signal-to-noise ratio, electronic device and storage medium |
CN110634500B (en) * | 2019-10-14 | 2022-05-31 | 达闼机器人股份有限公司 | Method for calculating prior signal-to-noise ratio, electronic device and storage medium |
CN111128213A (en) * | 2019-12-10 | 2020-05-08 | 展讯通信(上海)有限公司 | Noise suppression method and system for processing in different frequency bands |
CN111128213B (en) * | 2019-12-10 | 2022-09-27 | 展讯通信(上海)有限公司 | Noise suppression method and system for processing in different frequency bands |
CN111354365A (en) * | 2020-03-10 | 2020-06-30 | 苏宁云计算有限公司 | Pure voice data sampling rate identification method, device and system |
CN111354365B (en) * | 2020-03-10 | 2023-10-31 | 苏宁云计算有限公司 | Pure voice data sampling rate identification method, device and system |
CN113613112A (en) * | 2021-09-23 | 2021-11-05 | 三星半导体(中国)研究开发有限公司 | Method and electronic device for suppressing wind noise of microphone |
CN113613112B (en) * | 2021-09-23 | 2024-03-29 | 三星半导体(中国)研究开发有限公司 | Method for suppressing wind noise of microphone and electronic device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101582264A (en) | Method and voice collecting system for speech enhancement | |
CN101599274B (en) | Method for speech enhancement | |
CN101894563B (en) | Voice enhancing method | |
CN101976566B (en) | Voice enhancement method and device using same | |
CN102074245B (en) | Dual-microphone-based speech enhancement device and speech enhancement method | |
CN102074246B (en) | Dual-microphone based speech enhancement device and method | |
CN101916567B (en) | Speech enhancement method applied to dual-microphone system | |
CN101763858A (en) | Method for processing double-microphone signal | |
CN101477800A (en) | Voice enhancing process | |
EP2151822B1 (en) | Apparatus and method for processing and audio signal for speech enhancement using a feature extraction | |
CN101430882B (en) | Method and apparatus for restraining wind noise | |
CN102792373B (en) | Noise suppression device | |
EP2164066B1 (en) | Noise spectrum tracking in noisy acoustical signals | |
CN100543842C (en) | Realize the method that ground unrest suppresses based on multiple statistics model and least mean-square error | |
CN102347027A (en) | Double-microphone speech enhancer and speech enhancement method thereof | |
WO2021114733A1 (en) | Noise suppression method for processing at different frequency bands, and system thereof | |
CN101976565A (en) | Dual-microphone-based speech enhancement device and method | |
US20040064307A1 (en) | Noise reduction method and device | |
CN106340292A (en) | Voice enhancement method based on continuous noise estimation | |
CN103440872B (en) | The denoising method of transient state noise | |
TW201248613A (en) | System and method for monaural audio processing based preserving speech information | |
Kesarkar et al. | Feature extraction for speech recognition | |
CN105390142A (en) | Digital hearing aid voice noise elimination method | |
CN103578466B (en) | Based on the voice non-voice detection method of Fourier Transform of Fractional Order | |
CN101853665A (en) | Method for eliminating noise in voice |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20091118 |