CN101916567A - Speech enhancement method applied to dual-microphone system - Google Patents

Speech enhancement method applied to dual-microphone system Download PDF

Info

Publication number
CN101916567A
CN101916567A CN2009101098256A CN200910109825A CN101916567A CN 101916567 A CN101916567 A CN 101916567A CN 2009101098256 A CN2009101098256 A CN 2009101098256A CN 200910109825 A CN200910109825 A CN 200910109825A CN 101916567 A CN101916567 A CN 101916567A
Authority
CN
China
Prior art keywords
signal
frame
noise
microphone
present frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2009101098256A
Other languages
Chinese (zh)
Other versions
CN101916567B (en
Inventor
叶利剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AAC Technologies Holdings Nanjing Co Ltd
Original Assignee
Ruisheng Acoustic Technology Changzhou Co ltd
AAC Acoustic Technologies Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ruisheng Acoustic Technology Changzhou Co ltd, AAC Acoustic Technologies Shenzhen Co Ltd filed Critical Ruisheng Acoustic Technology Changzhou Co ltd
Priority to CN2009101098256A priority Critical patent/CN101916567B/en
Publication of CN101916567A publication Critical patent/CN101916567A/en
Application granted granted Critical
Publication of CN101916567B publication Critical patent/CN101916567B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a speech enhancement method applied to a dual-microphone system. The dual-microphone system comprises a microphone array module, a control module, a processing module and an output module, wherein the microphone array module consists of a first microphone and a second microphone; the control module is used for receiving a signal transmitted by the microphone array module and controlling a microphone array; the processing module is used for receiving data transmitted by the control module; and the output module is used for outputting the processed data output by the processing module. The speech enhancement method comprises the following step that: 1, the first microphone and the second microphone receive time domain speech signals with noise (S1) and (S2) and transmit the signals to the control module, and the control module performs framing and pre-emphasis processing on the time domain speech signals with noise (S1) and (S2) and converts the time domain speech signals with noise (S1) and (S2) into frequency domain signals with noise (X1) and (X2) through short-time Fourier transform. Therefore, the attenuation on related noise is improved.

Description

Be applied to the sound enhancement method of dual-microphone system
[technical field]
The present invention relates to a kind of sound enhancement method, relate in particular to the method that a kind of voice that are applied to dual-microphone system strengthen.
[background technology]
Because the existence of a large amount of neighbourhood noises, the general signal to noise ratio (S/N ratio) of the voice signal that the microphone of communication apparatus such as mobile phone collects is not high enough, particularly in high-noise environments such as street automobile, needs to improve volume the other side is not heard.So need promote the signal to noise ratio (S/N ratio) of input voice by the method that voice strengthen, improve communication quality.
Yet traditional single channel sound enhancement method noise reduction is limited, and can cause big distortion to voice.Use two-microphone array, can introduce the phase information of voice and noise signal, the effect of raising squelch also reduces the distortion of voice.
[summary of the invention]
The technical problem to be solved in the present invention provides a kind of sound enhancement method.
For solving the problems of the technologies described above, technical scheme provided by the invention is:
A kind of sound enhancement method that is applied to dual-microphone system, this dual-microphone system comprises
The microphone array module that constitutes by first microphone and second microphone;
Be used to receive signal that this microphone array module sends and the control module of controlling this microphone array;
Be used to receive the processing module of the data that control module sends;
Be used for output module that the data of processing module output are exported after treatment;
This method comprises the steps:
Step 1. first microphone and second microphone receive time domain Noisy Speech Signal (S1) respectively, (S2) after, send control module to, by control module to time domain Noisy Speech Signal (S1), (S2) carry out respectively branch frame, pre-emphasis handle, through Short Time Fourier Transform with time domain Noisy Speech Signal (S1), (S2) be not transformed into frequency domain signal with noise (X1), (X2); Make when wherein dividing frame between the time domain signals with noise of adjacent two frames the aliasing part is arranged;
Step 2. receives frequency domain Noisy Speech Signal (X1) by processing module, (X2), and obtains frequency domain Noisy Speech Signal (X1) respectively, (X2) auto-power spectrum and frequency domain Noisy Speech Signal (X1), and cross-power spectrum (X2),
By the priori snr value of processing module, obtain present frame frequency domain Noisy Speech Signal (X1) or decay (X2) gain according to the present frame that obtains; Gain with the above-mentioned decay that obtains by processing module, multiply by the auto-power spectrum of first microphone or the second microphone frequency domain signal with noise, the auto-power spectrum of the clean speech estimated signal after obtaining handling;
Decay gain by former frame obtains present frame frequency domain signal with noise (X1), noise cross-power frequency spectrum (X2);
Obtain present frame frequency domain signal with noise (X1) by processing module by the noise cross-power spectrum that obtains, posteriority signal to noise ratio (S/N ratio) (X2), and obtain the priori snr value of present frame, export to output module;
Frequency-region signal after step 3. will be handled by output module transforms to time domain, and the processing of postemphasising becomes output signal.
Beneficial effect of the present invention is, use two-microphone array, calculate the cross-power spectrum that dual microphone receives signal, suppress uncorrelated noise, and by from the signal cross-power spectrum, deducting the noise cross-power spectrum, and the calculating of priori SNR estimation value, having improved decay for correlation noise, performance is better than existing single channel voice enhancement algorithm.
[description of drawings]
Fig. 1 is the structured flowchart of the system of sound enhancement method application of the present invention;
Fig. 2 is the schematic flow sheet of voice enhancement algorithm of the present invention.
[embodiment]
The present invention will be further described below in conjunction with accompanying drawing.
A kind of sound enhancement method that is applied to dual-microphone system, referring to Fig. 1, this dual-microphone system comprises,
The microphone array module that constitutes by first microphone and second microphone;
Be used to receive signal that this microphone array module sends and the control module of controlling this microphone array;
Be used to receive the processing module of the data that control module sends;
Be used for output module that the data of processing module output are exported after treatment;
Comprise the steps:
Step 1. first microphone and second microphone receive time domain Noisy Speech Signal S1 respectively, behind the S2, send control module to, by control module to time domain Noisy Speech Signal S1, S2 carry out respectively branch frame, pre-emphasis handle, through Short Time Fourier Transform with time domain Noisy Speech Signal S1, S2 is transformed into frequency domain signal with noise X1, X2 respectively; Make when wherein dividing frame between the time domain signals with noise of adjacent two frames the aliasing part is arranged;
Step 2. receives frequency domain Noisy Speech Signal X1 by processing module, X2, and obtain frequency domain Noisy Speech Signal X1 respectively, and X2 auto-power spectrum and frequency domain Noisy Speech Signal X1, the cross-power spectrum of X2,
By the priori snr value of processing module, obtain the decay gain of present frame frequency domain Noisy Speech Signal X1 or X2 according to the present frame that obtains; Gain with the above-mentioned decay that obtains by processing module, multiply by the auto-power spectrum of first microphone or the second microphone frequency domain signal with noise, the auto-power spectrum of the clean speech estimated signal after obtaining handling;
Decay gain by former frame obtains present frame frequency domain signal with noise X1, the noise cross-power frequency spectrum of X2;
Obtain present frame frequency domain signal with noise X1 by processing module by the noise cross-power spectrum that obtains, the posteriority signal to noise ratio (S/N ratio) of X2, and obtain the priori snr value of present frame, export to output module;
Frequency-region signal after step 3. will be handled by output module transforms to time domain, and the processing of postemphasising becomes output signal.
Concrete, referring to Fig. 2, the voice enhancement algorithm basic step among the present invention is as follows:
1. the Noisy Speech Signal that dual microphone is received carries out the branch frame respectively, and pre-emphasis is handled, and arrives frequency domain through Short Time Fourier Transform;
2. the respectively auto-power spectrum and the cross-power spectrum of two Noisy Speech Signals of computational transformation behind the frequency domain, and the noise cross-power spectrum that obtains estimating by the decay gain of former frame signal;
3. calculate the posteriority signal to noise ratio (S/N ratio) of current frame signal by signal and the noise cross-power spectrum that obtains previously, and obtain the priori SNR estimation value of present frame by the priori SNR estimation value of former frame;
4. according to the priori SNR estimation value that obtains, calculate the decay gain of current frame signal;
5. with the decay that obtains gain, one road microphone signal frequency spectrum is wherein handled;
6. the frequency-region signal after will handling transforms to time domain, and the processing of postemphasising becomes output signal.
In the case introduction of following mask body, the sampling rate of the noisy speech signal of speech-enhancement system input is 16kHZ, and resolution is 16.
To the time domain Noisy Speech Signal x1 that two microphones receive, x2 carries out the branch frame.Be meant with Noisy Speech Signal to be that unit is divided into some signals with noise unit with the frame.Described signals with noise unit is made up of sampled point, chosen the sample frequency of 16kHz among the present invention, needs according to the short-time spectrum analysis, frame length is generally set between 10~35ms, present embodiment is divided frame with 16ms, and promptly a frame signals with noise unit is provided with 256 sampled points, naturally, any frame signals with noise unit has certain frame length, and the frame length of arbitrary frame is 256 among the present invention.
For the blocking effect between the signals with noise unit that prevents adjacent two frames, when minute frame, to make between the signals with noise unit of adjacent two frames certain aliasing part is arranged, that is, it is former frame section data data that D data are arranged in these frame data, and wherein aliasing partly is described below:
s i(n)=d i(m,D+n) 0≤n<L,i=1,2
S wherein iExpression input tape noisy speech signal, i gets 1 and 2 and represents two paths of signals respectively
d i(m,n)=d i(m-1,L+n) 0≤n<D
Wherein, d i256 point sampling signals of expression present frame, because the length of any frame is 256, Duplication is 75%, so the sampled point number D=192 of lap.Distance L=256-192=64 that first sampled point of the signals with noise unit of consecutive frame is separated by.
Can have 50%~75% Duplication between the signals with noise unit of adjacent two frames of the present invention.Present embodiment is chosen between the signals with noise unit of adjacent two frames has 75% Duplication, promptly consistent with the Noisy Speech Signal unit of 75% (192 point) after the signals with noise unit of preceding 75% (192 point) of this frame and the former frame.
Two paths of signals behind the branch frame passes through same Hi-pass filter respectively, handles as pre-emphasis.Because ground unrest is generally bigger at the low frequency part energy in the voice signal,, make the enhancing better effects if so use can the decay component of low frequency part of described Hi-pass filter.Its form is as follows:
H(z)=1-αz -1
The general value of α is between 0.75-0.95, and effect preferably can be obtained in α=0.9 here.
Because voice signal is stably in short-term, is feasible so signal is carried out the processing of branch frame, but divides frame can bring the discontinuous frequency that causes of frame signal boundary to reveal again.So will carry out Short Time Fourier Transform (STFT) here.Short Time Fourier Transform can be understood as does Fourier transform again to the windowing of frame signal elder generation.The purpose of windowed function is exactly for when doing Short Time Fourier Transform, reduces the discontinuous frequency that causes of frame signal boundary and reveals.Here used a length to equal the Hamming window of 256 of frame lengths, it can effectively reduce the oscillation degree of Gibbs' effect.
Hamming window function is defined as follows:
win(n)={
0.54-0.46cos(2*π*n/M) 0≤n≤M-1
0 all the other n
}
Then Short Time Fourier Transform is as follows
X i ( f , m ) = 2 M Σ n = 0 M - 1 win ( n - m ) × x i ( m ) e - 2 πjf n m 0 ≤ k 1 ≤ M - 1
Wherein, M=256 is the computational length of Fourier Tranform in short-term.M represents the m frame signal.I=1,2 expression two paths of signals.
So just with the Noisy Speech Signal s of present frame iTransform from the time domain to and be frequency domain signal X i
Then use following formula to calculate the auto-power spectrum and the cross-power spectrum of two-way Noisy Speech Signal, consider the continuity between frame and the frame simultaneously, carry out level and smooth each signal energy spectrum:
P XiXj ( f , m ) = X i ( f , m ) X j * ( f , m ) m = 1 λ x P XiXj ( f , m - 1 ) + ( 1 - λ x ) X i ( f , m ) X j * ( f , m ) m > 1
Wherein, m represents the sequence number of present frame, and f represents through different Frequency point after the FFT conversion, λ x=0.6 is smoothing factor.
P XiXjThe energy spectrum of the signal of expression after smoothly.
Work as i=j=1, expression microphone 1 receives the auto-power spectrum of signal;
Work as i=j=2, expression microphone 2 receives the auto-power spectrum of signal;
As i=1, j=2, expression microphone 1 and microphone 2 receive the cross-power spectrum of Noisy Speech Signal.
Cross-power spectrum to the present frame noise signal is estimated
P N 1 N 2 ( f , m ) = λ n P N 1 N 2 ( f , m - 1 ) + ( 1 - λ n ) ( 1 - q ( f , m - 1 ) ) X 1 ( f , m ) X 2 * ( f , m )
Wherein, λ n=0.9 is smoothing factor, and (f m-1) represents the decay factor that former frame calculates to q.
Then, calculate the posteriority SNR estimation of current frame signal by the Noisy Speech Signal cross-power spectrum that obtains previously and noise signal cross-power spectrum, as follows
SNR post ( f , m ) = | X 1 ( f , m ) X 2 * ( f , m ) | | P N 1 N 2 ( f , m ) |
Based on the priori SNR estimation formula of Ephraim and Malah, calculate the priori SNR estimation value of present frame then
Figure G2009101098256D00061
Next, the decay calculating of gain factor.Here the priori SNR estimation value that draws based on previous calculations is taked different strategies.
For the big frequency of signal to noise ratio (S/N ratio), can think voice signal, adopt the method for spectral substraction to obtain decay factor, for the little frequency of signal to noise ratio (S/N ratio), think noise signal, it is carried out to a certain degree decay.Select SNR among the present invention Prior>1.5 is criterion, is higher than 1.5 the voice signal of thinking, is lower than 1.5 and can thinks noise.
Concrete gain reduction factor computing formula is as follows,
Figure G2009101098256D00062
Noisy Speech Signal X with microphone 1 present frame 1(f m), multiply by the decay gain factor of the corresponding frequencies point that obtains previously, and what obtain is exactly voice signal after the enhancing of this frequency.
S ^ ( f , m ) = q ( f , m ) × X 1 ( f , m ) , 0 ≤ k ≤ N - 1
Wherein,
Figure G2009101098256D00064
Be the voice signal estimated value after f frequency enhancing of m frame.
The operation that last spatial transform and output carry out has:
The first step: inverse fast fourier transform (FFT) transforms to time domain to the speech manual of frequency domain, the time domain voice signal after being enhanced.
The conversion of time domain realizes with general contrary discrete Fourier transform (IDFT).
s ( m , n ) = 1 2 * Σ n = 0 M - 1 S ^ ( f ) e j 2 πnf / M , 0 ≤ k ≤ M - 1
Wherein, M=256 is frame length.S is the voice signal that transforms to after full range band after the time domain strengthens.
Second step: the processing of postemphasising.
With the pre-emphasis of front handle opposite, here with signal by a low-pass filter, farthest reduce original signal.The frequency response of wave filter is as follows;
H(z)=1+αz -1
The coefficient here is corresponding with the processing of front pre-emphasis, gets α=0.9.
The 3rd step: the lap of the consecutive frame of the voice signal after will strengthening carries out the phase add operation.
Concrete lap addition can be represented with following method.
s &prime; ( n ) = s ( m , n ) + s ( m - 1 , n + L ) 0 &le; n < M - L s ( m , n ) M - L &le; n < M
L=64 is the distance that adjacent frame signal begins to locate, and M=256 is frame length.The final output signal after the phase add operation is finished in s ' representative.
Compare with correlation technique, it is the voice of being correlated with and the hypothesis of uncorrelated noise signal that dual microphone voice enhancement algorithm that the present invention proposes has effectively utilized that dual microphone receives, suppressed the interference of uncorrelated noise in the voice signal, simultaneously by from the signal cross-power spectrum, deducting the noise cross-power spectrum, and the calculating of priori SNR estimation value, improved decay for correlation noise.
The above only is a better embodiment of the present invention; protection scope of the present invention is not exceeded with above-mentioned embodiment; as long as the equivalence that those of ordinary skills do according to disclosed content is modified or changed, all should include in the protection domain of putting down in writing in claims.

Claims (6)

1. sound enhancement method that is applied to dual-microphone system, this dual-microphone system comprises
The microphone array module that constitutes by first microphone and second microphone;
Be used to receive signal that this microphone array module sends and the control module of controlling this microphone array;
Be used to receive the processing module of the data that control module sends;
Be used for output module that the data of processing module output are exported after treatment;
It is characterized in that comprising the steps:
Step 1. first microphone and second microphone receive time domain Noisy Speech Signal (S1) respectively, (S2) after, send control module to, by control module to time domain Noisy Speech Signal (S1), (S2) carry out respectively branch frame, pre-emphasis handle, through Short Time Fourier Transform with time domain Noisy Speech Signal (S1), (S2) be not transformed into frequency domain signal with noise (X1), (X2); Make when wherein dividing frame between the time domain signals with noise of adjacent two frames the aliasing part is arranged;
Step 2. receives frequency domain Noisy Speech Signal (X1) by processing module, (X2), and obtains frequency domain Noisy Speech Signal (X1) respectively, (X2) auto-power spectrum and frequency domain Noisy Speech Signal (X1), and cross-power spectrum (X2),
By the priori snr value of processing module, obtain present frame frequency domain Noisy Speech Signal (X1) or decay (X2) gain according to the present frame that obtains; Gain with the above-mentioned decay that obtains by processing module, multiply by the auto-power spectrum of first microphone or the second microphone frequency domain signal with noise, the auto-power spectrum of the clean speech estimated signal after obtaining handling;
Decay gain by former frame obtains present frame frequency domain signal with noise (X1), noise cross-power frequency spectrum (X2);
Obtain present frame frequency domain signal with noise (X1) by processing module by the noise cross-power spectrum that obtains, posteriority signal to noise ratio (S/N ratio) (X2), and obtain the priori snr value of present frame, export to output module;
Frequency-region signal after step 3. will be handled by output module transforms to time domain, and the processing of postemphasising becomes output signal.
2. the sound enhancement method that is applied to dual-microphone system according to claim 1 is characterized in that: in the step 1, the frame length when dividing frame is set between 10~35ms; Described pre-emphasis is treated to the time domain Noisy Speech Signal (S1) through undue frame, the component of the low frequency part that (S2) decays respectively; Between the time domain signals with noise of adjacent two frames lap is arranged, its Duplication is 50%~75%.
3. the sound enhancement method that is applied to dual-microphone system according to claim 2, it is characterized in that: aliasing partly is described below:
s i(n)=d i(m,D+n) 0≤n<L,i=1,2
S wherein iExpression time domain Noisy Speech Signal,
d i(m,n)=d i(m-1,L+n) 0≤n<D
Wherein, d iThe sampled signal of expression present frame, D represents the sampled point number of lap; L represents the distance that first sampled point of the time domain Noisy Speech Signal of consecutive frame is separated by, and m represents current frame number, and n represents a certain point data in the present frame.
4. the sound enhancement method that is applied to dual-microphone system according to claim 2 is characterized in that: time domain Noisy Speech Signal (S1), and (S2) decay the respectively branch metering method of low frequency part is:
H(z)=1-αz -1
Wherein, the general value of α is between 0.75-0.95, and H (z) represents transport function,
5. the sound enhancement method that is applied to dual-microphone system according to claim 1 is characterized in that:
In the step 1, the method for revealing for the discontinuous frequency that causes that reduces the frame signal boundary in the Short Time Fourier Transform process is: to the windowing of frame signal elder generation,
win(n)={
0.54-0.46cos(2*π*n/M) 0≤n≤M-1
0 all the other n
}
Win (n) represents employed Hamming window, and M represents in short-term that the computational length n of Fourier Tranform represents wherein a certain data point:
The method of Short Time Fourier Transform is:
X i ( f , m ) = 2 M &Sigma; n = 0 M - 1 win ( n - m ) &times; x i ( m ) e - 2 &pi;jf n M 0≤k1≤M-1
Wherein, X i(f, m) representation transformation is to the two paths of signals of frequency domain; M represents the m frame signal; The wherein a certain time domain data point of frame length n representative of Fourier transform is done in the M representative, and f represents a certain Frequency point of frequency domain; I gets 1 or 2.
6. the sound enhancement method that is applied to dual-microphone system according to claim 1 is characterized in that:
In the step 2, obtain frequency domain signal with noise X1, X2 auto-power spectrum and frequency domain signal with noise X1, the method for the cross-power spectrum of X2 is:
P XiXj ( f , m ) = X i ( f , m ) X j * ( f , m ) m = 1 &lambda; x P XiXj ( f , m - 1 ) + ( 1 - &lambda; x ) X i ( f , m ) X j * ( f , m ) m > 1
Wherein, m represents the sequence number of present frame, and f represents through different Frequency point behind the Fourier Tranform in short-term, λ x=0.6 is smoothing factor;
P XiXjThe energy spectrum of the signal of expression after smoothly; X i(f, m) representation transformation is to the two paths of signals of frequency domain; M represents the m frame signal; X j *(f, m) representation signal is got conjugation;
Work as i=j=1, represent the auto-power spectrum of the frequency domain signal with noise X1 that first microphone receives;
Work as i=j=2, represent the auto-power spectrum of the frequency domain signal with noise X2 signal that second microphone receives;
As i=1, j=2, represent that first microphone and second microphone receive frequency domain signal with noise X1, the cross-power spectrum of X2;
The method that obtains present frame noise cross-power frequency spectrum is
Wherein, λ n=0.9 is smoothing factor, and (f m-1) represents the decay gain factor that former frame calculates and stores, P to q N1N2(m represents the sequence number of present frame for f, the m) cross-power spectrum of expression present frame, and f represents different Frequency points;
The method that obtains the posteriority signal to noise ratio (S/N ratio) of current frame signal is:
SNR post ( f , m ) = | X 1 ( f , m ) X 2 * ( f , m ) | | P N 1 N 2 ( f , m ) |
SNR wherein Post(m represents the sequence number of present frame for f, m) the posteriority signal to noise ratio (S/N ratio) of expression present frame, and f represents different Frequency points.
The method that obtains the priori signal to noise ratio (S/N ratio) of present frame is:
Wherein
Figure F2009101098256C00035
The priori signal to noise ratio (S/N ratio) of expression present frame, m represents the sequence number of present frame, and f represents different Frequency points, and (f m-1) represents the decay gain factor that former frame calculates and stores, P to q N1N2(f, the m) cross-power spectrum of expression present frame, SNR Post(α is a smoothing factor for f, m) the posteriority signal to noise ratio (S/N ratio) of expression present frame, and general value is 0.7~0.9.
The decay gain obtains method and is:
Figure F2009101098256C00041
Wherein q (f, m) expression decay gain factor,
Figure F2009101098256C00042
The priori signal to noise ratio (S/N ratio) of expression present frame, m represents the sequence number of present frame, f represents different Frequency points, P NiNj(f, m) (mutually) power spectrum certainly of expression present frame, SNR Post(f, m) the posteriority signal to noise ratio (S/N ratio) of expression present frame.
In the step 2, (f m), multiply by the decay gain factor of the corresponding frequencies point that former frame obtains, and what obtain is exactly voice signal after the enhancing of this frequency with the Noisy Speech Signal X1 of the first microphone present frame
S ^ ( f , m ) = q ( f , m ) &times; X 1 ( f , m ) 0≤k≤N-1
Wherein,
Figure F2009101098256C00044
Be the voice signal estimated value after f frequency enhancing of m frame, q (f, m) expression decay gain factor.
CN2009101098256A 2009-11-23 2009-11-23 Speech enhancement method applied to dual-microphone system Expired - Fee Related CN101916567B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009101098256A CN101916567B (en) 2009-11-23 2009-11-23 Speech enhancement method applied to dual-microphone system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009101098256A CN101916567B (en) 2009-11-23 2009-11-23 Speech enhancement method applied to dual-microphone system

Publications (2)

Publication Number Publication Date
CN101916567A true CN101916567A (en) 2010-12-15
CN101916567B CN101916567B (en) 2012-02-01

Family

ID=43324060

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009101098256A Expired - Fee Related CN101916567B (en) 2009-11-23 2009-11-23 Speech enhancement method applied to dual-microphone system

Country Status (1)

Country Link
CN (1) CN101916567B (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102855880A (en) * 2011-06-20 2013-01-02 鹦鹉股份有限公司 De-noising method for multi-microphone audio equipment
CN103219012A (en) * 2013-04-23 2013-07-24 中国人民解放军总后勤部军需装备研究所 Double-microphone noise elimination method and device based on sound source distance
CN103854657A (en) * 2012-12-05 2014-06-11 华为技术有限公司 Interference signal elimination processing method and device
WO2015085693A1 (en) * 2013-12-13 2015-06-18 惠州Tcl移动通信有限公司 Method and system for automatically switching contextual mode based on ambient noise magnitude
CN103854657B (en) * 2012-12-05 2016-11-30 华为技术有限公司 Eliminate the processing method and processing device of interference signal
CN106297817A (en) * 2015-06-09 2017-01-04 中国科学院声学研究所 A kind of sound enhancement method based on binaural information
CN106340305A (en) * 2016-11-04 2017-01-18 北京声智科技有限公司 Self-calibration far-field voice interaction device and far-field voice self-calibration method
CN107484080A (en) * 2016-05-30 2017-12-15 奥迪康有限公司 The method of apparatus for processing audio and signal to noise ratio for estimation voice signal
CN107680609A (en) * 2017-09-12 2018-02-09 桂林电子科技大学 A kind of double-channel pronunciation Enhancement Method based on noise power spectral density
CN107969164A (en) * 2014-11-12 2018-04-27 美国思睿逻辑有限公司 Differentiate between adaptability channel and scale wave filter again
CN108074580A (en) * 2016-11-17 2018-05-25 杭州海康威视数字技术股份有限公司 A kind of noise cancellation method and device
CN108269585A (en) * 2013-04-05 2018-07-10 杜比实验室特许公司 The companding device and method of quantizing noise are reduced using advanced spectrum continuation
CN108447500A (en) * 2018-04-27 2018-08-24 深圳市沃特沃德股份有限公司 The method and apparatus of speech enhan-cement
CN110556125A (en) * 2019-10-15 2019-12-10 出门问问信息科技有限公司 Feature extraction method and device based on voice signal and computer storage medium
CN110634500A (en) * 2019-10-14 2019-12-31 达闼科技成都有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN112185407A (en) * 2020-09-24 2021-01-05 广州朗国电子科技有限公司 Dual-MIC input environmental sound suppression method and device, storage medium and equipment
CN113380266A (en) * 2021-05-28 2021-09-10 中国电子科技集团公司第三研究所 Miniature double-microphone voice enhancement method and miniature double-microphone
CN113660578A (en) * 2021-08-16 2021-11-16 世邦通信股份有限公司 Double-microphone directional pickup method and device with adjustable pickup angle range
CN115831145A (en) * 2023-02-16 2023-03-21 之江实验室 Double-microphone speech enhancement method and system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7447630B2 (en) * 2003-11-26 2008-11-04 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement
CN1809105B (en) * 2006-01-13 2010-05-12 北京中星微电子有限公司 Dual-microphone speech enhancement method and system applicable to mini-type mobile communication devices
CN101488981A (en) * 2008-01-18 2009-07-22 上海晨兴电子科技有限公司 Mobile phone construction and method for enhancing voice call quality of mobile phone
CN101477800A (en) * 2008-12-31 2009-07-08 瑞声声学科技(深圳)有限公司 Voice enhancing process
CN101582264A (en) * 2009-06-12 2009-11-18 瑞声声学科技(深圳)有限公司 Method and voice collecting system for speech enhancement

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102855880B (en) * 2011-06-20 2016-09-28 鹦鹉汽车股份有限公司 Denoising method for multi-microphone audio equipment
CN102855880A (en) * 2011-06-20 2013-01-02 鹦鹉股份有限公司 De-noising method for multi-microphone audio equipment
CN103854657A (en) * 2012-12-05 2014-06-11 华为技术有限公司 Interference signal elimination processing method and device
CN103854657B (en) * 2012-12-05 2016-11-30 华为技术有限公司 Eliminate the processing method and processing device of interference signal
CN108269585A (en) * 2013-04-05 2018-07-10 杜比实验室特许公司 The companding device and method of quantizing noise are reduced using advanced spectrum continuation
US11423923B2 (en) 2013-04-05 2022-08-23 Dolby Laboratories Licensing Corporation Companding system and method to reduce quantization noise using advanced spectral extension
CN103219012A (en) * 2013-04-23 2013-07-24 中国人民解放军总后勤部军需装备研究所 Double-microphone noise elimination method and device based on sound source distance
WO2015085693A1 (en) * 2013-12-13 2015-06-18 惠州Tcl移动通信有限公司 Method and system for automatically switching contextual mode based on ambient noise magnitude
CN107969164B (en) * 2014-11-12 2020-07-17 美国思睿逻辑有限公司 Adaptive inter-channel discrimination rescaling filter
CN107969164A (en) * 2014-11-12 2018-04-27 美国思睿逻辑有限公司 Differentiate between adaptability channel and scale wave filter again
CN106297817A (en) * 2015-06-09 2017-01-04 中国科学院声学研究所 A kind of sound enhancement method based on binaural information
CN106297817B (en) * 2015-06-09 2019-07-09 中国科学院声学研究所 A kind of sound enhancement method based on binaural information
CN107484080B (en) * 2016-05-30 2021-07-16 奥迪康有限公司 Audio processing apparatus and method for estimating signal-to-noise ratio of sound signal
CN107484080A (en) * 2016-05-30 2017-12-15 奥迪康有限公司 The method of apparatus for processing audio and signal to noise ratio for estimation voice signal
CN106340305B (en) * 2016-11-04 2024-03-19 北京声智科技有限公司 Self-calibration far-field voice interaction device and far-field voice self-calibration method
CN106340305A (en) * 2016-11-04 2017-01-18 北京声智科技有限公司 Self-calibration far-field voice interaction device and far-field voice self-calibration method
CN108074580B (en) * 2016-11-17 2021-04-02 杭州海康威视数字技术股份有限公司 Noise elimination method and device
CN108074580A (en) * 2016-11-17 2018-05-25 杭州海康威视数字技术股份有限公司 A kind of noise cancellation method and device
CN107680609A (en) * 2017-09-12 2018-02-09 桂林电子科技大学 A kind of double-channel pronunciation Enhancement Method based on noise power spectral density
CN108447500B (en) * 2018-04-27 2020-08-18 深圳市沃特沃德股份有限公司 Method and device for speech enhancement
CN108447500A (en) * 2018-04-27 2018-08-24 深圳市沃特沃德股份有限公司 The method and apparatus of speech enhan-cement
CN110634500B (en) * 2019-10-14 2022-05-31 达闼机器人股份有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN110634500A (en) * 2019-10-14 2019-12-31 达闼科技成都有限公司 Method for calculating prior signal-to-noise ratio, electronic device and storage medium
CN110556125A (en) * 2019-10-15 2019-12-10 出门问问信息科技有限公司 Feature extraction method and device based on voice signal and computer storage medium
CN112185407A (en) * 2020-09-24 2021-01-05 广州朗国电子科技有限公司 Dual-MIC input environmental sound suppression method and device, storage medium and equipment
CN113380266A (en) * 2021-05-28 2021-09-10 中国电子科技集团公司第三研究所 Miniature double-microphone voice enhancement method and miniature double-microphone
CN113380266B (en) * 2021-05-28 2022-06-28 中国电子科技集团公司第三研究所 Miniature dual-microphone speech enhancement method and miniature dual-microphone
CN113660578A (en) * 2021-08-16 2021-11-16 世邦通信股份有限公司 Double-microphone directional pickup method and device with adjustable pickup angle range
CN113660578B (en) * 2021-08-16 2023-11-28 世邦通信股份有限公司 Directional pickup method and device with adjustable pickup angle range for double microphones
CN115831145A (en) * 2023-02-16 2023-03-21 之江实验室 Double-microphone speech enhancement method and system

Also Published As

Publication number Publication date
CN101916567B (en) 2012-02-01

Similar Documents

Publication Publication Date Title
CN101916567B (en) Speech enhancement method applied to dual-microphone system
CN101976565A (en) Dual-microphone-based speech enhancement device and method
CN101894563B (en) Voice enhancing method
US8010355B2 (en) Low complexity noise reduction method
CN102074246B (en) Dual-microphone based speech enhancement device and method
US8712074B2 (en) Noise spectrum tracking in noisy acoustical signals
US7162420B2 (en) System and method for noise reduction having first and second adaptive filters
JP4210521B2 (en) Noise reduction method and apparatus
CN102347028A (en) Double-microphone speech enhancer and speech enhancement method thereof
CN102347027A (en) Double-microphone speech enhancer and speech enhancement method thereof
CN101763858A (en) Method for processing double-microphone signal
CN102074245A (en) Dual-microphone-based speech enhancement device and speech enhancement method
US9818424B2 (en) Method and apparatus for suppression of unwanted audio signals
CN101027719B (en) Noise suppressor
CN111554315B (en) Single-channel voice enhancement method and device, storage medium and terminal
CN101582264A (en) Method and voice collecting system for speech enhancement
CN104103277A (en) Time frequency mask-based single acoustic vector sensor (AVS) target voice enhancement method
US20050251389A1 (en) Method and apparatus for noise reduction
CN103632675A (en) Noise estimation for use with noise reduction and echo cancellation in personal communication
CN102576538A (en) A method and an apparatus for processing an audio signal
CN101904182A (en) The enhanced blind source separation algorithm that is used for the mixture of height correlation
CN105788607A (en) Speech enhancement method applied to dual-microphone array
TW201248613A (en) System and method for monaural audio processing based preserving speech information
CN101894561A (en) Wavelet transform and variable-step least mean square algorithm-based voice denoising method
CN105390142A (en) Digital hearing aid voice noise elimination method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: AAC TECHNOLOGIES (NANJING) CO., LTD.

Free format text: FORMER OWNER: AAC ACOUSTIC TECHNOLOGIES (SHENZHEN) CO., LTD.

Effective date: 20121130

Free format text: FORMER OWNER: AAC ACOUSTIC TECHNOLOGIES (CHANGZHOU) CO., LTD.

Effective date: 20121130

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518057 SHENZHEN, GUANGDONG PROVINCE TO: 210093 NANJING, JIANGSU PROVINCE

TR01 Transfer of patent right

Effective date of registration: 20121130

Address after: 210093 Gulou District, Qingdao, Jiangsu,, Nanjing University, Drum Tower, University of science and Technology Park, No. 32, entrepreneurship center, No. 401

Patentee after: AAC TECHNOLOGIES (NANJING) Co.,Ltd.

Address before: 518057 Nanshan District province high tech Industrial Park, Shenzhen, North West New Road, No. 18

Patentee before: AAC ACOUSTIC TECHNOLOGIES (SHENZHEN) Co.,Ltd.

Patentee before: AAC Acoustic Technologies (Changzhou) Co.,Ltd.

DD01 Delivery of document by public notice

Addressee: Zhang Yong

Document name: payment instructions

DD01 Delivery of document by public notice
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120201

CF01 Termination of patent right due to non-payment of annual fee