CN103295582A - Noise suppression method and system - Google Patents

Noise suppression method and system Download PDF

Info

Publication number
CN103295582A
CN103295582A CN2012100538941A CN201210053894A CN103295582A CN 103295582 A CN103295582 A CN 103295582A CN 2012100538941 A CN2012100538941 A CN 2012100538941A CN 201210053894 A CN201210053894 A CN 201210053894A CN 103295582 A CN103295582 A CN 103295582A
Authority
CN
China
Prior art keywords
ratio
noise
signal
module
speech signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012100538941A
Other languages
Chinese (zh)
Other versions
CN103295582B (en
Inventor
谢单辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Li Ke Semiconductor Technology Co., Ltd.
Original Assignee
Leadcore Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Leadcore Technology Co Ltd filed Critical Leadcore Technology Co Ltd
Priority to CN201210053894.1A priority Critical patent/CN103295582B/en
Publication of CN103295582A publication Critical patent/CN103295582A/en
Application granted granted Critical
Publication of CN103295582B publication Critical patent/CN103295582B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Noise Elimination (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to the field of voice signal processing, and discloses a noise suppression method and system. According to the noise suppression method, the prior signal to noise ratio and the posterior signal to noise ratio of a noisy voice signal are calculated, the estimated value of a clean voice signal in the short-term spectrum amplitude is calculated according to a maximum posterior MAP standard, the ratio of the estimated value and the noisy voice signal is calculated, an attenuation factor is obtained, and then inverse fast Fourier transform IFFT treatment is conducted on the noisy voice signal according to the attenuation factor obtained through calculation, so that calculated amount is greatly reduced during noise suppression. The noise suppression method and system is more suitable for practical application, and performance is also improved. Furthermore, when the attenuation factor is calculated, the calculation of the ratio of the first order and zero order of a first kind of the modified Bessel function is involved, and at the moment, the ratio is obtained through the table look-up method so that storage space can be further reduced; or the approximate calculation method is adopted, no storage table is required, and calculation is more simplified.

Description

Noise suppressing method and system thereof
Technical field
The present invention relates to field of voice signal, particularly noise suppressing method and system thereof.
Background technology
Single Mike's voice strengthen the disposal route that (being squelch) adopts the frequency domain decay usually, comprise that Fourier transform (FFT) analysis, noise estimate, calculate modules such as elder generation/posteriority signal to noise ratio (S/N ratio), calculating decay factor, IFFT be comprehensive, and roughly flow process as shown in Figure 1.
The Noisy Speech Signal Y (w) that Mike collects can be expressed as Y (w)=X (w)+N (w), and wherein X (w) is the clean speech signal, and N (w) is acoustic noise signal; Use X, N, Y represent the short-term spectrum amplitude (Short-Time Spectral Amplitude is called for short " STSA ") of corresponding signal, i.e. X=|X (w) respectively |, N=|N (w) |, Y=|Y (w) |.Noisy Speech Signal is transformed into frequency domain through fft analysis from time domain earlier; The noise estimation module then is to estimate the variance λ of current noise as far as possible exactly N=E{|N (w) | 2; Then calculate priori signal to noise ratio (S/N ratio) ξ and posteriority signal to noise ratio (S/N ratio) γ; Calculate decay factor then, carry out IFFT at last and comprehensively obtain voice signal through squelch.Wherein, priori signal to noise ratio (S/N ratio) ξ is the variance ratio of voice X (w) and noise N (w)
Figure BDA0000140307810000011
Posteriority signal to noise ratio (S/N ratio) γ is the ratio of the current signals with noise spectrum energy Y (w) that receives and noise N (w) variance γ = | Y ( w ) | 2 E { | N ( w ) | 2 } = Y 2 λ N .
Decay factor G is the clean speech signal estimated shared ratio in Noisy Speech Signal, like this and the Noisy Speech Signal of importing multiply each other and namely obtain result, decay factor is determining the performance of algorithm.Its computing method are core places of whole noise suppression algorithm, the final effect of algorithm will directly be influenced, least mean-square error commonly used (Minimum Mean Square Error is called for short " MMSE ") and least mean-square error (log-MMSE) the inhibiting factor algorithm of taking the logarithm calculate.
Adopting MMSE to calculate decay factor can be represented by the formula:
G MMSE = π 2 υ γ exp ( - υ 2 ) [ ( 1 + υ ) I 1 ( υ 2 ) + υI 0 ( υ 2 ) ] - - - ( 1 )
Adopting Log-MMSE to calculate decay factor can be represented by the formula:
G Log _ MMSE = ξ 1 + ξ exp { 1 2 ∫ υ ∞ e - t t dt } - - - ( 2 )
In formula 1 and the formula 2, exp{} represents exponential function, I vExpression v rank first kind modified Bessel function,
Figure BDA0000140307810000023
By as can be seen top, the calculated amount of MMSE and Log-MMSE all is very big.The concrete analysis of its calculated amount is as follows:
1. for adopting MMSE to calculate decay factor, need at least 1 evolution, 2 divisions, 1 exp exponent arithmetic, several times multiplication and additive operation, and calculate I 1And I 0Wherein calculate I 1And I 0Usually adopt the method for tabling look-up, need be respectively them storage space is provided.In addition, because the domain of walker of Bessel's function is very big, cause in fixed-point algorithm, realizing very difficult.Be illustrated in figure 2 as first-order bessel function I 1(x) and zero Bessel function I 0(x) the numerical simulation figure in [20,20] dB scope, its numerical range is [10 -3, 10 44].
2. for adopting Log-MMSE to calculate decay factor, direct integral can't calculate, in the mathematical software matrix experiment chamber (Matrix Laboratory, be called for short " Matlab ") a Function e xpint is arranged, can be used for the integral part of exp index inside in the calculating formula 2:
expint ( υ ) = ∫ υ ∞ e - t t dt - - - ( 3 )
Approximate formula commonly used in terminal is:
expint ( &upsi; ) = - 2.31 log 10 ( &upsi; ) - 0.6 &upsi; < 0.1 - 1.544 log 10 ( &upsi; ) + 0.166 0.1 &le; &upsi; &le; 1 10 - ( 0.52 &upsi; + 0.26 ) &upsi; > 1 - - - ( 4 )
Approximation method in calculating in the general employing formula 4, but total calculated amount is also very big, comprises exponent arithmetic, the log logarithm operation of exp and 10.
Summary of the invention
The object of the present invention is to provide a kind of noise suppressing method and system thereof, make in squelch, to have significantly reduced calculated amount, be more suitable for practical application, and on performance, also promote to some extent.
For solving the problems of the technologies described above, embodiments of the present invention provide a kind of noise suppressing method, comprise following steps:
After the priori signal to noise ratio (S/N ratio) that calculates Noisy Speech Signal and posteriority signal to noise ratio (S/N ratio), ask for the estimated value of clean speech signal and the ratio of Noisy Speech Signal, obtain decay factor G; Wherein, described estimated value is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating;
According to described G described Noisy Speech Signal being carried out contrary fast fourier transform IFFT handles.
Embodiments of the present invention also provide a kind of noise suppressing system, comprise: snr computation module, decay factor are asked for module, IFFT processing module;
Described snr computation module is used for calculating priori signal to noise ratio (S/N ratio) and the posteriority signal to noise ratio (S/N ratio) of Noisy Speech Signal, and the described priori signal to noise ratio (S/N ratio) that will calculate and posteriority signal to noise ratio (S/N ratio) are exported to described decay factor and asked for module;
Described decay factor is asked for module and is used for when receiving described priori signal to noise ratio (S/N ratio) and posteriority signal to noise ratio (S/N ratio),, ask for the estimated value of clean speech signal and the ratio of Noisy Speech Signal, obtain decay factor G; Wherein, described estimated value is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating;
Described IFFT processing module is used for asking for the G that module obtains according to described decay factor, described Noisy Speech Signal is carried out contrary fast fourier transform IFFT handle.
Embodiment of the present invention in terms of existing technologies, when calculating decay factor, the estimated value of the clean speech signal short-term spectrum amplitude of calculating with maximum a posteriori MAP criterion, the estimated value of required clean speech signal short-term spectrum amplitude when calculating decay factor.Make in squelch, to have significantly reduced calculated amount, be more suitable for practical application, and on performance, also promote to some extent.
Preferably, ask for described ratio according to following formula, obtain decay factor G:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma;
Wherein, ξ represents described priori signal to noise ratio (S/N ratio); γ represents described posteriority signal to noise ratio (S/N ratio); ζ represents the variable of Bessel's function; r I(ζ) ratio of expression first kind modified Bessel function single order and zeroth order.As can be seen, calculate major part that described G adopts be operating as evolution, division, square, simple operation such as addition, do not need exponent arithmetic and power operation, and ratio r INumerical range (ζ) is [0,1], is convenient to very much the fixed point computing.
Preferably, store all possible ζ and corresponding r in advance IMapping table (ζ); Ask for described ratio, when obtaining decay factor G, calculating described ζ earlier, in described mapping table, searching the r corresponding with this ζ according to described ζ then I(ζ); At last with described ξ, described γ and the described r that finds I(ζ), the described formula of substitution obtains described G.Because that storage in advance is all possible ζ and corresponding r IMapping table (ζ) needs to store all values of first kind modified Bessel function single order and zeroth order in the prior art, and the size of storage list can be reduced half, i.e. space storage data reduce half.
Preferably, asking for described ratio, when obtaining decay factor G, calculating described ζ earlier, judging that then whether described ζ is greater than 1; If described ζ is greater than 1, then according to formula
Figure BDA0000140307810000042
Calculate described r I(ζ); If described ζ is less than or equal to 1, then according to formula r I(ζ)=0.5 ζ calculates described r I(ζ); At last with the described r of described ξ, described γ and calculating I(ζ) the described formula of substitution G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma; In, obtain described G.As seen r ICalculating (ζ) is very simple, and does not need storage list, further saves storage space.
Preferably, when calculating described ζ, with the product of described ξ and described γ, open radical sign and multiply by the numerical value that obtains after 2 as described ζ.Simple evolution, multiplication can calculate, and realizes simple.
Description of drawings
Fig. 1 is the process flow diagram of noise suppressing method in the prior art;
Fig. 2 is first-order bessel function I 1(x) and zero Bessel function I 0(x) the numerical simulation figure in [20,20] dB scope;
Fig. 3 is the process flow diagram of first embodiment of the invention noise suppressing method;
Fig. 4 is the corresponding relation figure of the ratio of the independent variable of Bessel's function and first kind modified Bessel function single order and zeroth order;
Fig. 5 is the process flow diagram of noise suppressing method second embodiment of the invention;
Fig. 6 is the process flow diagram according to the 3rd embodiment noise suppressing method of the present invention;
Fig. 7 is the approximate treatment value of the ratio of first kind modified Bessel function single order and zeroth order in the 3rd embodiment of the present invention and the comparison diagram of actual value;
Fig. 8 adopts MAP, MMSE and log-MMSE criterion to calculate the objective performance comparison diagram that decay factor is carried out the voice signal of squelch;
Fig. 9 does not carry out the voice signal of squelch and adopts MAP, MMSE and the log-MMSE criterion is calculated the subjective performance comparison diagram that decay factor is carried out the voice signal of squelch;
Figure 10 is the structural drawing according to the 5th embodiment noise suppressing system of the present invention;
Figure 11 is the structural drawing according to the 6th embodiment noise suppressing system of the present invention.
Embodiment
For making the purpose, technical solutions and advantages of the present invention clearer, below in conjunction with accompanying drawing the embodiments of the present invention are explained in detail.Yet, persons of ordinary skill in the art may appreciate that in each embodiment of the present invention, in order to make the reader understand the application better many ins and outs have been proposed.But, even without these ins and outs with based on many variations and the modification of following each embodiment, also can realize each claim of the application technical scheme required for protection.
First embodiment of the present invention relates to a kind of noise suppressing method.Idiographic flow comprises following steps as shown in Figure 3:
Step 301 is carried out fast fourier transform FFT to Noisy Speech Signal.
Step 302 is carried out noise to the Noisy Speech Signal after the FFT conversion and is estimated.
Step 303, priori signal to noise ratio (S/N ratio) and the posteriority signal to noise ratio (S/N ratio) of calculating Noisy Speech Signal.
Step 301 is same as the prior art to step 303, does not repeat them here.
In step 304, ask for the estimated value of clean speech signal and the ratio of Noisy Speech Signal, obtain decay factor G; Wherein, the estimated value of clean speech signal is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating.That is to say, with the estimated value of the clean speech signal short-term spectrum amplitude that calculates according to maximum a posteriori MAP criterion, required clean speech signal estimated value when asking for decay factor G.
Specifically, in the present embodiment, obtain decay factor G according to following formula:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma; - - - ( 5 )
Wherein, ξ represents the priori signal to noise ratio (S/N ratio); γ represents the posteriority signal to noise ratio (S/N ratio); ζ represents the variable of Bessel's function; r I(ζ) ratio of expression first kind modified Bessel function single order and zeroth order.Derivation to above-mentioned formula (5) describes below:
According to multiple Gauss model, the distribution function that can obtain the short-term spectrum amplitude X of clean speech signal is
p ( X ) = 2 X &lambda; X exp ( - X 2 &lambda; X ) X &GreaterEqual; 0 0 , otherwise - - - ( 6 )
In the formula, λ XVariance for X.
And the short-term spectrum amplitude of clean speech signal is under the condition of X, and Noisy Speech Signal is that the distribution function of Y (W) is
p ( Y ( w ) | X ) = 1 &pi;&lambda; N exp ( - Y 2 + X 2 &lambda; N ) I 0 2 XY &lambda; N - - - ( 7 )
In the formula, Y is the short-term spectrum amplitude of Noisy Speech Signal, λ NBe the variance of noise, I 0It is zeroth order first kind modified Bessel function.
According to maximum a posteriori MAP criterion, the estimated value of calculating the short-term spectrum amplitude X of clean speech signal is
X ^ = arg max X p ( X | Y ( w ) ) = arg max X p ( Y ( w ) | X ) p ( X ) p ( Y ( w ) ) - - - ( 8 )
Because log is monotonic quantity, therefore, the maximal value of calculating posterior probability p (X|Y (w)) can be by calculating its logarithm log{p (X|Y (w)) } maximal value simplify calculating.
Definition:
J=log{p(X|Y(w))}=log{p(Y(w)|X)}+log{p(X)}-log{p(Y(w))} (9)
Distribution function 6 and the above-mentioned formula 9 of 7 substitutions can be got:
J = - Y 2 + X 2 &lambda; N - X 2 &lambda; N + log X + log [ I 0 ( 2 YX &lambda; N ) ] + cons tan t - - - ( 10 )
In the formula, constant is constant.
Make the independent variable of Bessel's function be
Figure BDA0000140307810000074
Ask the local derviation of the X of J, and utilize the characteristic of first kind correction Bezier (Bessel) function:
&PartialD; &PartialD; &zeta; I n ( &zeta; ) = n &zeta; I n ( &zeta; ) + I n + 1 ( &zeta; ) n=0,1,2.... (12)
Obtain at last:
&PartialD; J &PartialD; X = - 2 X &lambda; X - 2 X &lambda; N + 1 X + 2 Y &lambda; N I 1 ( &zeta; ) I 0 ( &zeta; ) - - - ( 13 )
Make formula 13 equal zero, obtain about the super quadratic equation of X as follows through distortion:
0 = X 2 - Y&lambda; X ( &lambda; X + &lambda; N ) I 1 ( &zeta; ) I 0 ( &zeta; ) X - 1 2 &lambda; X &lambda; N &lambda; X + &lambda; N - - - ( 14 )
Because the variable of Bessel's function
Figure BDA0000140307810000082
Comprise unknown variable X, therefore above-mentioned super quadratic equation does not have exact solution.In the present embodiment, the variable in the hypothesis Bessel's function is known earlier, and on this basis, according to the root formula of quadratic equation, two solutions that obtain formula 14 are
X 1,2 = Y&lambda; X 2 ( &lambda; X + &lambda; N ) I 1 ( &zeta; ) I 0 ( &zeta; ) &PlusMinus; ( Y&lambda; X 2 ( &lambda; X + &lambda; N ) I 1 ( &zeta; ) I 0 ( &zeta; ) ) 2 + &lambda; X &lambda; N 2 ( &lambda; X + &lambda; N ) - - - ( 15 )
With the solution that obtains, divided by the amplitude Y of signals with noise:
G = X 1,2 Y - - - ( 16 )
Remove impossible negative value solution, the expression that finally obtains G is:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma;
In the formula
Figure BDA0000140307810000086
Ratio for first kind modified Bessel function single order and zeroth order.
After asking for decay factor G, enter step 105, carry out contrary fast fourier transform IFFT according to the Noisy Speech Signal of the decay factor G that calculates and handle.This step is same as the prior art, does not repeat them here.
In terms of existing technologies, first embodiment of the present invention calculates the estimated value of clean speech signal short-term spectrum amplitude according to maximum a posteriori MAP criterion in squelch, ask for the ratio of this estimated value and Noisy Speech Signal, obtain the computing formula of decay factor, by this computing formula as can be seen, calculate major part that decay factor G adopts be operating as evolution, division, square, simple operation such as addition, do not need exponent arithmetic and power operation, and as shown in Figure 4, ratio r INumerical range (ζ) is [0,1], is convenient to very much the fixed point computing.Therefore in squelch, significantly reduce calculated amount, be more suitable for practical application.
In addition, it will be understood by those skilled in the art that the concrete numerical value of wishing to get decay factor G, need to calculate earlier the ratio r of priori signal to noise ratio (S/N ratio) ξ, posteriority signal to noise ratio (S/N ratio) γ and first kind modified Bessel function single order and zeroth order I(ζ); And to calculate r I(ζ), then need to calculate earlier the independent variable of Bessel's function
Figure BDA0000140307810000091
Calculate for further simplifying, can obtain the estimated value of ζ according to priori signal to noise ratio (S/N ratio) ξ and posteriority signal to noise ratio (S/N ratio) γ:
&zeta; ^ &ap; 2 E [ X 2 ] &lambda; N Y 2 &lambda; N = 2 &xi;Y - - - ( 17 )
Because priori signal to noise ratio (S/N ratio) ξ and posteriority signal to noise ratio (S/N ratio) γ calculate in step before, therefore simple evolution, multiplication can calculate the independent variable ζ of Bessel's function, realizes very simple.
Second embodiment of the present invention relates to a kind of noise suppressing method.Second embodiment has been done further improvement on the basis of first embodiment, and its improvements mainly are: in second embodiment of the invention, store all possible ζ and corresponding r in advance IMapping table (ζ); Asking for ratio r IIn the time of (ζ), calculate the independent variable ζ of Bessel's function earlier, in mapping table, search the r corresponding with this ζ according to ζ then I(ζ); At last with ξ, γ and r I(ζ), substitution formula (5) calculates G, and idiographic flow as shown in Figure 5.Because that storage in advance is all possible ζ and corresponding r IMapping table (ζ) therefore after calculating ζ, can directly find corresponding r rapidly by this mapping table I(ζ), and need to store all values of first kind modified Bessel function single order and zeroth order in the prior art, the size of the storage list of second embodiment of the present invention can be reduced half, i.e. space storage data reduce half.
The 3rd embodiment of the present invention relates to a kind of noise suppressing method.The 3rd embodiment has been done further improvement on the basis of first embodiment, its improvements mainly are: in third embodiment of the invention, after calculating the independent variable ζ of Bessel's function, judge that whether ζ is greater than 1, if ζ is greater than 1, then according to formula
Figure BDA0000140307810000093
Calculate r I(ζ); If ζ is less than or equal to 1, then according to formula r I(ζ)=0.5 ζ calculates r I(ζ); At last with ξ, γ and r IIn (ζ) the substitution formula (5), calculate G, idiographic flow as shown in Figure 6.In the present embodiment, ratio r ICalculating (ζ) is very simple, and does not need storage list, can further save storage space.And, use formula
Figure BDA0000140307810000101
With formula r I(ζ)=0.5 r that calculates of ζ I(ζ), with the r of reality I(ζ) compare, almost can ignore in the error that performance exists.As shown in Figure 7, solid line is represented r among the figure IActual value (ζ), long dot-and-dash line represents to adopt formula
Figure BDA0000140307810000102
The r that calculates I(ζ) approximate value, short dot-and-dash line represents to adopt formula r I(ζ)=0.5 ζThe r that calculates I(ζ) approximate value, (1,0.5) is long dot-and-dash line and short dashdotted joint.As can be seen from the figure, the r that in most of zone, calculates according to formula I(ζ) approximate value curve and true ratio curve fit tightly, and have some little errors near 0dB, its maximum error<0.07, and exist the scope of error very narrow, can accept in the scope.
In terms of existing technologies, the 3rd embodiment of the present invention is when reducing calculated amount, and it is main, objective performance also promotes to some extent.Objective performance refers to adopt certain objective algorithm to come assess performance, adopt segmental signal-to-noise ratio to promote (Segmental SNR Improvement) as evaluation criterion in the present embodiment, as shown in Figure 8, horizontal ordinate is represented input signal-to-noise ratio among the figure, ordinate is represented the amplitude that segmental signal-to-noise ratio promotes, as seen from the figure, adopt the MAP criterion than adopting MMSE and log-MMSE criterion to calculate all liftings to some extent of objective performance that decay factor is carried out the voice signal of squelch.Subjective performance refers to use subjective people to go to listen and handles back sound, adopt International Telecommunications Union's telecommunication standards group (International Telecommunication Union-Telecommunication Standardization Sector in the present embodiment, abbreviation " ITU-T ") P.862 Perceptual Evaluation of Speech Quality standard (Perceptual Evaluation of Speech Quality, be called for short " PESQ ") as evaluation criterion, as shown in Figure 9, horizontal ordinate is represented input signal-to-noise ratio among the figure, ordinate is represented the voice signal after squelch is carried out the P.862PESQ result of standard evaluation, as seen from the figure, the voice signal that adopts MAP criterion calculating decay factor to carry out squelch carries out squelch than employing MMSE and log-MMSE criterion, and all liftings to some extent of objective performance of not carrying out the voice signal of squelch.When carrying out performance simulation in the present embodiment, adopt 30 sentence (15 male voices among the received pronunciation storehouse NOIZEUS, 15 female voices), sampling rate is 8KHz, 16bit quantizes, with white noise on their aliasings, and be divided into 0dB, 5dB, 10dB and four grades of 15dB, the processing frame length is 20ms, Duplication is 50%, and Hamming window (Hamming) adopts directly decision-making (Decision-Direct) algorithm estimation priori signal to noise ratio (S/N ratio), noise variance adopts the simplest hard decision tracking mode to ask for, and the judgement mode adopts log-likelihood ratio.
Need to prove, above the step of the whole bag of tricks divide, just clear in order to describe, can merge into a step during realization or some step is split, be decomposed into a plurality of steps, as long as comprise identical logical relation, all in the protection domain of this patent; To adding inessential modification in the algorithm or in the flow process or introduce inessential design, but the core design that does not change its algorithm and flow process is all in the protection domain of this patent.
Four embodiment of the invention relates to a kind of noise suppressing system, comprises: the fast fourier transform module, noise estimation module, snr computation module, decay factor are asked for module, IFFT processing module.
Wherein the fast fourier transform module is used for Noisy Speech Signal is carried out fast fourier transform FFT, and the Noisy Speech Signal after the FFT conversion is exported to the noise estimation module.
The noise estimation module is used for that the Noisy Speech Signal after the FFT conversion is carried out noise and estimates.
The snr computation module is used for calculating priori signal to noise ratio (S/N ratio) and the posteriority signal to noise ratio (S/N ratio) of Noisy Speech Signal, and the priori signal to noise ratio (S/N ratio) calculated and posteriority signal to noise ratio (S/N ratio) is exported to decay factor ask for module.
Decay factor is asked for module and is used for asking for the estimated value of clean speech signal and the ratio of Noisy Speech Signal when receiving priori signal to noise ratio (S/N ratio) and posteriority signal to noise ratio (S/N ratio), obtains decay factor G; Wherein, the estimated value of clean speech signal is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating.In the present embodiment, the computing formula of decay factor G is:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma;
Wherein, ξ represents the priori signal to noise ratio (S/N ratio); γ represents the posteriority signal to noise ratio (S/N ratio); ζ represents the variable of Bessel's function; r I(ζ) ratio of expression first kind modified Bessel function single order and zeroth order.
The IFFT processing module is used for asking for the G that module obtains according to decay factor, Noisy Speech Signal is carried out contrary fast fourier transform IFFT handle.
Be not difficult to find that present embodiment is the system embodiment corresponding with first embodiment, present embodiment can with the enforcement of working in coordination of first embodiment.The correlation technique details of mentioning in first embodiment is still effective in the present embodiment, in order to reduce repetition, repeats no more here.Correspondingly, the correlation technique details of mentioning in the present embodiment also can be applicable in first embodiment.
What deserves to be mentioned is that each involved in present embodiment module is logic module, in actual applications, a logical block can be a physical location, also can be the part of a physical location, can also realize with the combination of a plurality of physical locations.In addition, for outstanding innovation part of the present invention, will not introduce not too close unit with solving technical matters relation proposed by the invention in the present embodiment, but this does not show the unit that does not have other in the present embodiment.
Fifth embodiment of the invention relates to a kind of noise suppressing system, as shown in figure 10.The 5th embodiment has been done further improvement on the basis of the 4th embodiment, and its improvements mainly are: in fifth embodiment of the invention, decay factor is asked for module and is comprised following submodule:
Sub module stored is used for storing in advance all possible ζ and corresponding r IMapping table (ζ);
The independent variable calculating sub module is used for calculating ζ;
Search submodule, be used at sub module stored stored relation table, search the r corresponding with this ζ according to ζ I(ζ);
The G calculating sub module is used for ξ, γ and r I(ζ), substitution formula 5 calculates G.
In addition, it will be understood by those skilled in the art that the independent variable calculating sub module when calculating ζ, can obtain ζ after opening radical sign and multiply by 2 with the product of ξ and γ.
Because second embodiment is corresponding mutually with present embodiment, thus present embodiment can with the enforcement of working in coordination of second embodiment.The correlation technique details of mentioning in second embodiment is still effective in the present embodiment, and the technique effect that can reach in second embodiment can be realized in the present embodiment too, in order to reduce repetition, repeats no more here.Correspondingly, the correlation technique details of mentioning in the present embodiment also can be applicable in second embodiment.
Sixth embodiment of the invention relates to a kind of noise suppressing system, as shown in figure 11.The 6th embodiment has been done further improvement on the basis of four embodiments, and its improvements mainly are: in sixth embodiment of the invention, decay factor is asked for module and is comprised following submodule:
The independent variable calculating sub module is used for calculating ζ;
Judge submodule, be used for judging that whether ζ is greater than 1;
r I(ζ) calculating sub module is used for judging that submodule judgement ζ is greater than 1 o'clock, according to formula
Figure BDA0000140307810000131
Calculate r I(ζ); Judging that submodule judgement ζ is less than or equal at 1 o'clock, according to formula r I(ζ)=0.5 ζ calculates r I(ζ);
The G calculating sub module is used for ξ, γ and r I(ζ), in the substitution formula 5, calculate G.
In addition, it will be understood by those skilled in the art that the independent variable calculating sub module when calculating ζ, can obtain ζ after opening radical sign and multiply by 2 with the product of ξ and γ.
Because the 3rd embodiment is corresponding mutually with present embodiment, thus present embodiment can with the enforcement of working in coordination of the 3rd embodiment.The correlation technique details of mentioning in the 3rd embodiment is still effective in the present embodiment, and the technique effect that can reach in the 3rd embodiment can be realized in the present embodiment too, in order to reduce repetition, repeats no more here.Correspondingly, the correlation technique details of mentioning in the present embodiment also can be applicable in the 3rd embodiment.
Persons of ordinary skill in the art may appreciate that the respective embodiments described above are to realize specific embodiments of the invention, and in actual applications, can do various changes to it in the form and details, and without departing from the spirit and scope of the present invention.

Claims (12)

1. a noise suppressing method is characterized in that, comprises following steps:
After the priori signal to noise ratio (S/N ratio) that calculates Noisy Speech Signal and posteriority signal to noise ratio (S/N ratio), ask for the estimated value of clean speech signal and the ratio of Noisy Speech Signal, obtain decay factor G; Wherein, described estimated value is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating;
According to described G described Noisy Speech Signal being carried out contrary fast fourier transform IFFT handles.
2. noise suppressing method according to claim 1 is characterized in that, asks for described ratio according to following formula, obtains decay factor G:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma;
Wherein, ξ represents described priori signal to noise ratio (S/N ratio); γ represents described posteriority signal to noise ratio (S/N ratio); ζ represents the variable of Bessel's function; r I(ζ) ratio of expression first kind modified Bessel function single order and zeroth order.
3. noise suppressing method according to claim 2 is characterized in that, also comprises following steps:
Store all possible ζ and corresponding r in advance IMapping table (ζ);
Ask for described ratio, obtain comprising following substep in the step of decay factor G:
Calculate described ζ;
In described mapping table, search the r corresponding with this ζ according to described ζ I(ζ);
With described ξ, described γ and the described r that finds I(ζ), the described formula of substitution obtains described G.
4. noise suppressing method according to claim 2 is characterized in that, asks for described ratio, obtains comprising following substep in the step of decay factor G:
Calculate described ζ;
If described ζ is greater than 1, then according to formula
Figure FDA0000140307800000021
Calculate described r I(ζ); If described ζ is less than or equal to 1, then according to formula r I(ζ)=0.5 ζ calculates described r I(ζ);
Described r with described ξ, described γ and calculating I(ζ) the described formula of substitution G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma; In, obtain described G.
5. according to claim 3 or 4 described noise suppressing methods, it is characterized in that, when calculating described ζ, with the product of described ξ and described γ, open radical sign and multiply by the numerical value that obtains after 2 as described ζ.
6. noise suppressing method according to claim 1 is characterized in that, also comprises following steps:
Before the priori signal to noise ratio (S/N ratio) and posteriority signal to noise ratio (S/N ratio) of calculating Noisy Speech Signal, Noisy Speech Signal is carried out fast fourier transform FFT;
Noisy Speech Signal after described FFT conversion is carried out noise to be estimated.
7. a noise suppressing system is characterized in that, comprises: snr computation module, decay factor are asked for module, IFFT processing module;
Described snr computation module is used for calculating priori signal to noise ratio (S/N ratio) and the posteriority signal to noise ratio (S/N ratio) of Noisy Speech Signal, and the described priori signal to noise ratio (S/N ratio) that will calculate and posteriority signal to noise ratio (S/N ratio) are exported to described decay factor and asked for module;
Described decay factor is asked for module and is used for asking for the estimated value of clean speech signal and the ratio of Noisy Speech Signal when receiving described priori signal to noise ratio (S/N ratio) and posteriority signal to noise ratio (S/N ratio), obtains decay factor G; Wherein, described estimated value is the estimated value according to the clean speech signal short-term spectrum amplitude of maximum a posteriori MAP criterion calculating;
Described IFFT processing module is used for asking for the G that module obtains according to described decay factor, described Noisy Speech Signal is carried out contrary fast fourier transform IFFT handle.
8. noise suppressing system according to claim 7 is characterized in that, described decay factor is asked for module and asked for described ratio according to following formula, obtains decay factor G:
G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma;
Wherein, ξ represents described priori signal to noise ratio (S/N ratio); γ represents described posteriority signal to noise ratio (S/N ratio); ζ represents the variable of Bessel's function; r I(ζ) ratio of expression first kind modified Bessel function single order and zeroth order.
9. noise suppressing system according to claim 8 is characterized in that, described decay factor is asked for module and comprised following submodule:
Sub module stored is used for storing in advance all possible ζ and corresponding r IMapping table (ζ);
The independent variable calculating sub module is used for calculating described ζ;
Search submodule, be used for the described mapping table in described sub module stored storage, search the r corresponding with this ζ according to the described ζ that calculates I(ζ);
The G calculating sub module is used for described ξ, described γ and described r I(ζ), the described formula of substitution obtains described G.
10. noise suppressing system according to claim 8 is characterized in that, described decay factor is asked for module and comprised following submodule:
The independent variable calculating sub module is used for calculating described ζ;
Judge submodule, be used for judging that whether the described ζ that calculates is greater than 1;
r I(ζ) calculating sub module is used for judging that at described judgement submodule described ζ is greater than 1 o'clock, according to formula Calculate described r I(ζ); Judge that at described judgement submodule described ζ is less than or equal at 1 o'clock, according to formula r I(ζ)=0.5 ζ calculates described r I(ζ);
The G calculating sub module is used for described ξ, described γ and described r I(ζ), substitution formula G = &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) + { &xi; 2 ( 1 + &xi; ) r I ( &zeta; ) } 2 + &xi; 2 ( 1 + &xi; ) &gamma; In, obtain described G.
11., it is characterized in that described independent variable calculating sub module with the product of described ξ and described γ, obtains described ζ after opening radical sign and multiply by 2 according to claim 9 or 10 described noise suppressing systems when calculating described ζ.
12. noise suppressing system according to claim 7 is characterized in that, this system also comprises: fast fourier transform module and noise estimation module;
Described fast fourier transform module is used for Noisy Speech Signal is carried out fast fourier transform FFT, and the Noisy Speech Signal after described FFT conversion is exported to the noise estimation module;
Described noise estimation module is used for that the Noisy Speech Signal after described FFT conversion is carried out noise and estimates.
CN201210053894.1A 2012-03-02 2012-03-02 Noise suppressing method and system thereof Active CN103295582B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210053894.1A CN103295582B (en) 2012-03-02 2012-03-02 Noise suppressing method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210053894.1A CN103295582B (en) 2012-03-02 2012-03-02 Noise suppressing method and system thereof

Publications (2)

Publication Number Publication Date
CN103295582A true CN103295582A (en) 2013-09-11
CN103295582B CN103295582B (en) 2016-04-20

Family

ID=49096337

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210053894.1A Active CN103295582B (en) 2012-03-02 2012-03-02 Noise suppressing method and system thereof

Country Status (1)

Country Link
CN (1) CN103295582B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646648A (en) * 2013-11-19 2014-03-19 清华大学 Noise power estimation method
CN105280193A (en) * 2015-07-20 2016-01-27 广东顺德中山大学卡内基梅隆大学国际联合研究院 Prior signal-to-noise ratio estimating method based on MMSE error criterion
CN105448304A (en) * 2015-12-01 2016-03-30 珠海市杰理科技有限公司 Voice signal noise spectrum estimation method and apparatus, and noise reduction processing method
CN107833579A (en) * 2017-10-30 2018-03-23 广州酷狗计算机科技有限公司 Noise cancellation method, device and computer-readable recording medium
CN108428456A (en) * 2018-03-29 2018-08-21 浙江凯池电子科技有限公司 Voice de-noising algorithm
CN109087657A (en) * 2018-10-17 2018-12-25 成都天奥信息科技有限公司 A kind of sound enhancement method applied to ultrashort wave radio set
CN112908344A (en) * 2021-01-22 2021-06-04 广州大学 Intelligent recognition method, device, equipment and medium for bird song
CN114176623A (en) * 2021-12-21 2022-03-15 深圳大学 Sound noise reduction method, system, noise reduction device and computer readable storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1465160A2 (en) * 2003-03-31 2004-10-06 Microsoft Corporation Method of noise estimation using incremental bayesian learning
US20090067642A1 (en) * 2007-08-13 2009-03-12 Markus Buck Noise reduction through spatial selectivity and filtering
CN101627428A (en) * 2007-03-06 2010-01-13 日本电气株式会社 Noise suppression method, device, and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1465160A2 (en) * 2003-03-31 2004-10-06 Microsoft Corporation Method of noise estimation using incremental bayesian learning
CN101627428A (en) * 2007-03-06 2010-01-13 日本电气株式会社 Noise suppression method, device, and program
US20090067642A1 (en) * 2007-08-13 2009-03-12 Markus Buck Noise reduction through spatial selectivity and filtering

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646648B (en) * 2013-11-19 2016-03-23 清华大学 A kind of noise power estimation method
CN103646648A (en) * 2013-11-19 2014-03-19 清华大学 Noise power estimation method
CN105280193B (en) * 2015-07-20 2022-11-08 广东顺德中山大学卡内基梅隆大学国际联合研究院 Priori signal-to-noise ratio estimation method based on MMSE error criterion
CN105280193A (en) * 2015-07-20 2016-01-27 广东顺德中山大学卡内基梅隆大学国际联合研究院 Prior signal-to-noise ratio estimating method based on MMSE error criterion
CN105448304A (en) * 2015-12-01 2016-03-30 珠海市杰理科技有限公司 Voice signal noise spectrum estimation method and apparatus, and noise reduction processing method
CN105448304B (en) * 2015-12-01 2019-01-15 珠海市杰理科技股份有限公司 Pronunciation signal noise spectrum estimating method, device and noise reduction process method
CN107833579A (en) * 2017-10-30 2018-03-23 广州酷狗计算机科技有限公司 Noise cancellation method, device and computer-readable recording medium
CN108428456A (en) * 2018-03-29 2018-08-21 浙江凯池电子科技有限公司 Voice de-noising algorithm
CN109087657A (en) * 2018-10-17 2018-12-25 成都天奥信息科技有限公司 A kind of sound enhancement method applied to ultrashort wave radio set
CN112908344A (en) * 2021-01-22 2021-06-04 广州大学 Intelligent recognition method, device, equipment and medium for bird song
CN112908344B (en) * 2021-01-22 2023-08-08 广州大学 Intelligent bird song recognition method, device, equipment and medium
CN114176623A (en) * 2021-12-21 2022-03-15 深圳大学 Sound noise reduction method, system, noise reduction device and computer readable storage medium
CN114176623B (en) * 2021-12-21 2023-09-12 深圳大学 Sound noise reduction method, system, noise reduction device and computer readable storage medium

Also Published As

Publication number Publication date
CN103295582B (en) 2016-04-20

Similar Documents

Publication Publication Date Title
CN103295582B (en) Noise suppressing method and system thereof
Davis et al. Statistical voice activity detection using low-variance spectrum estimation and an adaptive threshold
CN102750956B (en) Method and device for removing reverberation of single channel voice
CN100543842C (en) Realize the method that ground unrest suppresses based on multiple statistics model and least mean-square error
KR100310030B1 (en) A noisy speech parameter enhancement method and apparatus
EP2788980B1 (en) Harmonicity-based single-channel speech quality estimation
CN101976566B (en) Voice enhancement method and device using same
EP3040991B1 (en) Voice activation detection method and device
US7783481B2 (en) Noise reduction apparatus and noise reducing method
CN113724725B (en) Bluetooth audio squeal detection suppression method, device, medium and Bluetooth device
US9548064B2 (en) Noise estimation apparatus of obtaining suitable estimated value about sub-band noise power and noise estimating method
KR20130117750A (en) Monaural noise suppression based on computational auditory scene analysis
CN101083640A (en) Low complexity noise reduction method
Borowicz et al. Signal subspace approach for psychoacoustically motivated speech enhancement
US9414157B2 (en) Method and device for reducing voice reverberation based on double microphones
EP3118852B1 (en) Method and device for detecting audio signal
CN103632677A (en) Method and device for processing voice signal with noise, and server
CN108074582A (en) A kind of noise suppressed signal-noise ratio estimation method and user terminal
CN103440872A (en) Transient state noise removing method
Vu et al. Implementation of the MFCC front-end for low-cost speech recognition systems
US20230267947A1 (en) Noise reduction using machine learning
Jin et al. Speech enhancement using harmonic emphasis and adaptive comb filtering
JP4551817B2 (en) Noise level estimation method and apparatus
CN114530160A (en) Model training method, echo cancellation method, system, device and storage medium
KR101971268B1 (en) Audio coding method and related apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20170421

Address after: 201206 China (Shanghai) free trade zone, the moon Road, No. 3, building fourth, room B412, level 1258

Patentee after: Shanghai Li Ke Semiconductor Technology Co., Ltd.

Address before: 201206 Pudong New Area Mingyue Road, Shanghai, No. 1258

Patentee before: Leadcore Technology Co., Ltd.

TR01 Transfer of patent right