CN102740214A - Howling suppression method based on feedback signal spectrum estimation - Google Patents

Howling suppression method based on feedback signal spectrum estimation Download PDF

Info

Publication number
CN102740214A
CN102740214A CN2011100821974A CN201110082197A CN102740214A CN 102740214 A CN102740214 A CN 102740214A CN 2011100821974 A CN2011100821974 A CN 2011100821974A CN 201110082197 A CN201110082197 A CN 201110082197A CN 102740214 A CN102740214 A CN 102740214A
Authority
CN
China
Prior art keywords
signal
spectrum
feedback
microphone
pitched sounds
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011100821974A
Other languages
Chinese (zh)
Other versions
CN102740214B (en
Inventor
杨飞然
吴鸣
杨军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
In New Science And Technology Co Ltd (suzhou)
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CN201110082197.4A priority Critical patent/CN102740214B/en
Publication of CN102740214A publication Critical patent/CN102740214A/en
Application granted granted Critical
Publication of CN102740214B publication Critical patent/CN102740214B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to a howling suppression method based on feedback signal spectrum estimation. The method comprises the following steps: firstly, carrying out framing, windowing and Fourier transform on a signal collected by a microphone and a loudspeaker output signal respectively so as to obtain a corresponding frequency domain signal; then, according to the microphone frequency domain signal and the loudspeaker output frequency domain signal, calculating a cross-power spectrum between the microphone and the loudspeaker output signal and a power spectrum of the loudspeaker output signal; dividing the cross-power spectrum by the power spectrum so as to obtain estimation of a feedback path transfer function amplitude spectrum; using the obtained estimation of the feedback path transfer function spectrum amplitude to calculate a current frame feedback signal amplitude spectrum; and then using the microphone signal amplitude spectrum and the obtained feedback signal amplitude spectrum to calculate a gain function; finally, according to the obtained gain function, carrying out correction on the microphone frequency domain signal so as to realize howling suppression. By using the method of the invention, convergence and tracking speeds are fast, robustness is good and algorithm complexity is low, which is good for real-time realization.

Description

A kind of inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation
Technical field
The inhibition field that the present invention relates to utter long and high-pitched sounds, particularly a kind of inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation.
Background technology
The document that is suppressed at of uttering long and high-pitched sounds also is called as " acoustic feedback inhibition ", " acoustic feedback control ", " feedback is eliminated " etc., and what they were expressed is with a kind of meaning.As shown in Figure 1, Fig. 1 is the generation schematic diagram of uttering long and high-pitched sounds.S (n) representes near-end speech, and this is to need amplifying signal; D (n) expression feeds back to microphone from the sound that loud speaker sends through feedback path 102 once more, the feedback signal of being picked up by microphone.D (n) is not the signal that expectation receives, and need curb it.Forward direction processing module 104 is used for accomplishing the functions such as inhibition and automatic gain control of uttering long and high-pitched sounds.Input signal s (n) is G (z)/(1-G (z) F (z)) to the transfer function between the output signal y (n), if satisfy (a) simultaneously | G (z) F (z) | >=1; (b) ∠ G (z) F (z)=2 π n, n ∈ N; Then this system will become unstable, on the Frequency point that satisfies condition, will utter long and high-pitched sounds.Using one of public address system purpose is exactly in order to improve system gain, and the wearer of hearing aids also hopes and can be amplified to desired level to sound, the amount of gain of sound reinforcement system of having uttered long and high-pitched sounds down phenomenon limits.Therefore, must take certain measure elimination feedback to avoid the generation of uttering long and high-pitched sounds.In the public sound reinforcement system because teller's position may change frequently.For example: teacher gives lessons in the public address system of usefulness in the classroom; Microphone is placed on teacher's the clothes and the position of teacher's health when giving lessons is not fixed; This just causes the continuous variation of feedback path; Corresponding solution must be considered this point and can follow the tracks of this variation fast, otherwise will occur uttering long and high-pitched sounds.
The inhibition of uttering long and high-pitched sounds is the research topic of a hot topic.Present solution mainly contains:
(1) shift frequency method.The shift frequency method reaches the purpose of uttering long and high-pitched sounds and suppressing through the destruction phase condition that produces of uttering long and high-pitched sounds.Yet uttering long and high-pitched sounds of this method suppresses effect and bad, usually controlled uttering long and high-pitched sounds of a Frequency point, and system can produce at other Frequency point again and utter long and high-pitched sounds.And the shift frequency method has noticeable influence to voice quality, and research and real system test shows use shift frequency method can obtain the gain of 6dB at most.
(2) trapper method.The trapper method is at first sought the spectrum position that feedback possibly take place or take place, and these strong excessively frequency contents in the signal is attenuated to reach the feedback inhibition effect then.But in real system, might there be a plurality of feedback frequency points.Therefore, trapper method effect is limited.The use trapper method of having reported can provide the gain of 3dB-8dB.
(3) sef-adapting filter method.The operation principle of sef-adapting filter method is again from the come in transmission characteristic of a such propagation ducts of microphone after utilizing filter to come simulated sound to send through loud speaker; Make that the signal that comes out from filter is consistent with the feedback signal that microphone collects, from the input signal of microphone, cut this part signal then and reach the purpose that elimination is fed back.As shown in Figure 2, Fig. 2 is the inhibition system block diagram of uttering long and high-pitched sounds based on adaptive filter algorithm.The parameter of adaptive-filtering unit 206 must be adjusted to the actual transmission paths characteristic approaching, and the just more approaching actual feedback signal of the signal that its simulation obtains just can obtain the inhibition effect of better uttering long and high-pitched sounds.This method is the same with the method principle that echo cancelltion is adopted; But different with echo cancelltion is under the feedback occasion; System always is in the state that is called as " both-end intercommunication "; Thereby influenced the convergence rate of sef-adapting filter, and the filter coefficient that estimates has partially, this method is difficult to be applicable to that feedback path is always in situation about constantly changing.
In addition, the sef-adapting filter method is divided into discontinuous the utter long and high-pitched sounds inhibition and the inhibition of uttering long and high-pitched sounds continuously.The discontinuous major defect that suppresses of uttering long and high-pitched sounds is that this method need be interrupted normal input speech signal, thereby the intelligibility of voice is affected, thereby is can not be received in a lot of occasions.Suppress to use continuous input signal to come estimated feedback signal and utter long and high-pitched sounds continuously, but convergence rate is very slow.
In sum, the present inhibition method of uttering long and high-pitched sounds robustness is bad, when feedback path changes, all occurs of short duration uttering long and high-pitched sounds probably.
Summary of the invention
The objective of the invention is to,, propose a kind of inhibition method of uttering long and high-pitched sounds, thereby it is good to reach robustness, convergence and the fast purpose of tracking velocity based on the feedback signal spectrum estimation in order to address the above problem.
For realizing the foregoing invention purpose, the present invention proposes a kind of inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation, and these method concrete steps comprise:
Step 1): the signal x (n) that respectively microphone is gathered and speaker output signal y (n) carry out branch frame, windowing and Fourier transform obtain the microphone frequency domain signal X (i, k) with loud speaker export frequency-region signal Y (i, k);
Step 2): (i, k) (i k) calculates crosspower spectrum φ between microphone and the speaker output signal to the microphone frequency domain signal X that obtains according to described step 1) with loud speaker output frequency-region signal Y Yx(i is k) with the power spectrum φ of speaker output signal Yy(i, k); And with crosspower spectrum φ Yx(i is k) divided by power spectrum φ Yy(i k) obtains the estimation of feedback path transfer function amplitude spectrum
Figure BDA0000053433440000021
Step 3): exploiting the said step 2) obtained in feedback path of transfer function of amplitude spectrum from the estimated
Figure BDA0000053433440000022
calculate get the current frame feedback signal amplitude spectrum of
Step 4): utilize the microphone signal amplitude spectrum | X (i; K) | and feedback signal amplitude spectrum
Figure BDA0000053433440000024
the calculating acquisition gain function G of described step 3) acquisition (i, k);
Step 5): (i k) comes that (i k) revises and realizes uttering long and high-pitched sounds inhibition to the microphone frequency domain signal X to the gain function G that obtains according to described step 4).
This method also further comprises:
Step 6): the said step 5) to obtain a corrected frequency domain signal after processing the microphone to obtain a final output signal
Figure BDA0000053433440000031
and the output signal
Figure BDA0000053433440000032
de-correlation processing; and
Step 7): described step 6) process result is carried out gain-adjusted.
Described step 2) concrete steps of acquisition feedback path transfer function amplitude spectrum comprise:
21): adopt the level and smooth method of single order autoregression, obtain crosspower spectrum φ according to formula (1) Yx(i, k);
φ yx(i,k)=α Cφ yx(i-1,k)+(1-α C)|Y *(i,k)X(i,k)| (1)
Wherein, α CBe smoothing factor, 0≤α C<1;
22): adopt the level and smooth method of single order autoregression, obtain power spectrum φ according to formula (2) Yy(i, k);
φ yy(i,k)=α Rφ yy(i-1,k)+(1-α R)Y *(i,k)Y(i,k) (2)
Wherein, α RBe smoothing factor, 0≤α R<1;
23): utilize crosspower spectrum φ Yx(i is k) with power spectrum φ Yy(i k) obtains feedback path transfer function amplitude spectrum according to formula (3)
Figure BDA0000053433440000033
| F ^ ′ ( i , k ) | = φ yx ( i , k ) φ yy ( i , k ) + δ - - - ( 3 )
Wherein, δ is that to avoid the denominator of formula (3) be 0 to a very little positive number.
Described step 3) is calculated according to formula (4) and is obtained present frame feedback signal amplitude spectrum
| X ^ ( i , k ) | = β | F ^ ′ ( i , k ) | | X ( i , k ) | - - - ( 4 )
Wherein, β is an adjustable parameter, carries out this parameter of choose reasonable according to actual conditions.
The method of described step 4) calculated gains function comprises: based on priori signal feedback ratio method with based on check back signal feedback ratio method.
Described step 4) based on check back signal feedback ratio method according to formula (5) calculate to obtain gain function G (i, k);
G ( i , k ) = [ max ( | X ( i , k ) | α - | X ^ ( i , k ) | α , 0 ) | X ( i , k ) | α ] 1 α - - - ( 5 )
Wherein, α>0.For reducing computation complexity, α gets 1 or 2.
(i k) revises and realizes uttering long and high-pitched sounds inhibition described step 5) to the microphone frequency domain signal X according to formula (6);
S ^ ( i , k ) = X ( i , k ) G ( i , k ) - - - ( 6 ) .
Described step 7) adopts the gain of regulating digital signal or the gain of regulating the analogue system power amplifier.The invention has the advantages that compared with prior art, the present invention has good robustness; Initial convergence speed is fast and when feedback path changes, can follow the tracks of this variation rapidly and can not utter long and high-pitched sounds; Because all operations of the present invention all is that frequency domain carries out, therefore has very low computation complexity.
Description of drawings
Fig. 1 is the generation schematic diagram of uttering long and high-pitched sounds;
Fig. 2 is traditional inhibition system block diagram of uttering long and high-pitched sounds based on adaptive filter algorithm;
Fig. 3 is a feedback path estimating system block diagram of the present invention;
Fig. 4 is the inhibition method of the uttering long and high-pitched sounds structured flowchart based on the feedback signal spectrum estimation of the present invention;
Fig. 5 is the block diagram of feedback spectrum estimation module 410 of the present invention;
Fig. 6 is a feedback path impulse response figure of actual measurement;
Fig. 7 is the amplitude spectrum of actual feedback path transfer function and the amplitude spectrum comparison diagram of the feedback path transfer function that the present invention estimates out.
Embodiment
Below in conjunction with accompanying drawing and specific embodiment the present invention is carried out detailed explanation.
It is good to the objective of the invention is to have proposed a kind of robustness, and convergence and tracking velocity are fast, when effectively inhibition is uttered long and high-pitched sounds, guarantees the inhibition method of uttering long and high-pitched sounds that voice quality is not suffered a loss.
In order to achieve the above object, the technical scheme taked of the present invention is following:
A kind of inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation, concrete steps comprise:
Step 1): respectively microphone signal and the speaker output signal that collects carried out windowing process and Fourier transform, time-domain signal is transformed to frequency domain;
Step 2): utilize the crosspower spectrum of level and smooth microphone signal and speaker output signal to obtain the amplitude spectrum of feedback path, and utilize the amplitude spectrum of feedback path to calculate the amplitude spectrum of present frame feedback signal divided by the power spectrum of level and smooth speaker output signal;
Step 3): utilize microphone signal amplitude spectrum and feedback signal amplitude spectrum calculated gains function;
Step 4): the microphone signal frequency spectrum is revised;
Step 5): revised microphone signal frequency spectrum is carried out inverse Fourier transform obtain time domain output;
Step 6): optional) output to step 5) applies a de-correlation block;
Step 7): optional) output to step 6) applies a gain.
In technique scheme, further, the pattern that adopts frame to handle described in the step 1) is carried out signal processing, all handle and program all to be based on frame be unit.Need carry out windowing process to the frame microphone signal that receives and the output signal of loud speaker, use FFT to transform to frequency domain then.
In technique scheme, further, step 2) at first needs the amplitude spectrum of estimation feedback path transfer function described in, and utilize its to come further to calculate amplitude spectrum of present frame feedback signal.The crosspower spectrum absolute value that the amplitude spectrum of feedback path transfer function utilizes level and smooth microphone signal and speaker output signal obtains divided by the power spectrum of level and smooth speaker output signal.
In above-mentioned technical scheme, further, the said gain function of step 3) is based on that the check back signal feedback ratio calculates.
In above-mentioned technical scheme, further, step 4) only is meant the frequency spectrum correction of microphone its amplitude spectrum is revised, and keeps its phase spectrum constant.
In above-mentioned technical scheme, further, step 5) is revised microphone frequency spectrum to be carried out inverse Fourier transform obtain time-domain signal, and the output signal with former frame carries out overlapping stack as the output signal of uttering long and high-pitched sounds and suppressing then.
In above-mentioned technical scheme, further, the described de-correlation block of step 6) is with the correlation that removes between the near end signal that speaker output signal and microphone receive.
In above-mentioned technical scheme, further, the described gain of step 7) both can be added in the power amplifier end that also can be added in analogue system on the digital signal of digital system.
As shown in Figure 4, Fig. 4 is the inhibition method of the uttering long and high-pitched sounds structured flowchart based on the feedback signal spectrum estimation of the present invention.Wherein, s (n) representes near end signal, and d (n) representes feedback signal, the signal that x (n) expression microphone collects, the signal of y (n) expression loud speaker output.
Step 1): respectively microphone signal and the speaker output signal that collects carried out windowing process and Fourier transform, time-domain signal is transformed to frequency domain.
The microphone signal x (n) that collects is carried out behind the serial to parallel conversion through after windowing module 402 and the Fourier transform module 404 time-domain signal being transformed to frequency domain; Speaker output signal y (n) is carried out behind the serial to parallel conversion through after windowing module 402 and the Fourier transform module 404 time-domain signal being transformed to frequency domain.The i frame discrete Fourier transform of y (n) be designated as Y (i, k), the i frame discrete Fourier transform of x (n) be designated as X (i, k).
Step 2): the crosspower spectrum that utilizes level and smooth microphone signal x (n) and speaker output signal y (n) obtains the amplitude spectrum of feedback path divided by the power spectrum of level and smooth speaker output signal y (n), and utilizes the amplitude spectrum of feedback path to calculate the amplitude spectrum of present frame feedback signal.
As shown in Figure 3, Fig. 3 is a feedback path estimating system block diagram of the present invention.The signal that microphone picks up is x (n)=d (n)+s (n), and d (n) is that speaker output signal y (n) obtains through feedback path transfer function F (z) filtering.We are objective definition function E{e 2(n) }=E{|x (n)-y (n) * f (n) | 2, * representes convolution here, the time-domain pulse response of f (n) expression F (z).Minimizing Wei Na that this target function can obtain time domain, to separate form following:
f ^ ( n ) = R yy - 1 r yx - - - ( 1 )
Wherein, R YyThe autocorrelation matrix of expression y (n), r YxThe cross-correlation matrix of expression y (n) and x (n).(1) is transformed to frequency domain to be obtained
F ′ ( i , k ) = P yx ( i , k ) P yy ( i , k ) - - - ( 2 )
Here P Yx(i, the k) crosspower spectrum of expression x (n) and y (n), P Yy(i, k) power spectrum of expression y (n).If the delay long enough between input signal s (n) and output signal y (n) just can well carry out decorrelation to the such stationary signal in short-term of voice, can think that promptly correlation between s (n) and the y (n) has become very a little less than, P so Ys(i, k) ≈ 0, so P Yx(i, k)=P Ys(i, k)+P Yd(i, k) ≈ P Yd(i, k), thereby (2) formula of utilization can effectivelyly estimate the frequency spectrum of feedback path transfer function.The present invention is actual utilize be F ' (i, amplitude spectrum k) | F ' (i, k) |.As shown in Figure 5, Fig. 5 is the block diagram of feedback spectrum estimation module 410 of the present invention.
Crosspower spectrum smooth unit 502 is calculated the crosspower spectrum absolute value of x (n) and y (n), and carries out smoothly,
φ yx(i,k)=α Cφ yx(i-1,k)+(1-α C)|Y *(i,k)X(i,k)| (3)
Here use * to represent complex conjugate.We have adopted signal period figure have been carried out autoregression smoothly as the estimation of power spectrum or crosspower spectrum, and this mainly is because this method is simple and easy the realization, yet this patent is not limited to this method, and the method for other any estimated power spectrums can be used.
Power spectrum smooth unit 504 is calculated the power spectrum of y (n) and is carried out smoothly,
φ yy(i,k)=α Rφ yy(i-1,k)+(1-α R)Y *(i,k)Y(i,k) (4)
Here α CAnd α RBe smoothing factor, its value satisfies 0≤α C<1 and 0≤α R<1.
Feedback transfer function estimation unit 506 calculates feedback path transfer function amplitude spectrum, is expressed as:
| F ^ ′ ( i , k ) | = φ yx ( i , k ) φ yy ( i , k ) + δ - - - ( 5 )
δ is a positive number that absolute value is very little in the formula (5), and being used for the divisor of the formula that prevents (5) is 0.
As shown in Figure 6, Fig. 6 is the feedback path impulse response figure of an actual measurement.As shown in Figure 7, Fig. 7 is the amplitude spectrum of actual feedback path transfer function and the amplitude spectrum comparison diagram of the feedback path transfer function that the present invention estimates out.The amplitude spectrum that can find out the feedback path transfer function that the present invention estimates can well approach the amplitude spectrum of real feedback path transfer function.
Present frame feedback signal amplitude spectrum computing unit 508 is used for estimating the spectrum component of present frame feedback signal, is expressed as
| X ^ ( i , k ) | = β | F ^ ′ ( i , k ) | | Y ( i , k ) | - - - ( 6 )
In the formula (6), β is a parameter that the user is adjustable, if the amplitude spectrum of feedback signal is crossed and to have been estimated then need make β<1, otherwise need be made β>1 if the amplitude spectrum of feedback signal owes to have estimated then.Select big compensating factor just can obtain bigger amount of gain but relative meeting the near-end speech quality is had certain influence; The voice quality of selecting little compensating factor to obtain still may reduce the amount of gain of sound reinforcement system, should reasonably accept or reject according to actual conditions.
In sum, we have just estimated the amplitude spectrum of feedback signal through type (6), have accomplished the function of feedback signal spectrum estimation module 410.
Step 3): utilize microphone signal amplitude spectrum and feedback signal amplitude spectrum calculated gains function;
The gain function of gain calculation module 412 can utilize spectrum-subtraction to obtain, and its expression formula is:
G ( i , k ) = [ max ( | X ( i , k ) | α - | X ^ ( i , k ) | α , 0 ) | X ( i , k ) | α ] 1 α - - - ( 7 )
α representes a factor of spectrum-subtraction in the following formula, and α=1 is based on the amplitude spectrum subtraction, α=2nd, and the power spectrum subtraction is the Wiener filtering method, α also can select other numerical value, α>0.Generally select α=1 or α=2 in order to reduce computation complexity in the reality.
Though the selection of gain function also has other form, as adopted in the document [1] gain function based on the priori signal to noise ratio ([1] Y.Ephraim and D.Malah, " Speech enhancement using a minimum mean squareerror short-time spectral amplitude estimator; " IEEE Trans.on Acoust.; Speech, SignalProcessing, vol.ASSP-32; Pp.1109-1121; Dec.1984), find to use posterior signal feedback ratio can obtain better result through testing us, thereby the present invention recommends use formula (7) to come the calculated gains function for the inhibition system that utters long and high-pitched sounds.
Step 4) is revised the microphone signal frequency spectrum.
The input/output relation of frequency spectrum correcting module 414 is:
S ^ ( i , k ) = X ( i , k ) G ( i , k ) - - - ( 8 )
The microphone signal frequency spectrum after the feedback signal has been eliminated in
Figure BDA0000053433440000074
expression.
Step 5) is carried out inverse Fourier transform to the microphone signal frequency spectrum and is obtained output.
416 pairs of present frames of inverse Fourier transform module
Figure BDA0000053433440000081
carry out inverse Fourier transform and obtain present frame output, carry out overlapping stack with former frame output then and obtain final output signal
Figure BDA0000053433440000082
Step 6) is optional) output of step 5) is applied a de-correlation block 418.
Like step 2) delay between said input signal s (n) and output signal y (n) is 3 times frame length, promptly corresponding 50% overlapping.This delay generally can be carried out decorrelation to the such stationary signal in short-term of voice.Because voice signal also has correlation when long, i.e. the time-delay of 3 times of frame lengths sometimes still is not enough to remove fully its correlation, and this may cause the distortion of signal.Remove the correlation between s (n) and the y (n) for further raising voice quality need increase a de-correlation block, a lot of classical decorrelation methods are suggested, can be referring to document [2] (M.Ali; " Stereophonic acoustic echo cancellation system using time-varying all-pass filteringfor signal decorrelation, " in Proceedings of the ICASSP ' 98, vol.6; Pp.3689-3692, Seattle, USA; May 1998) and document [3] ([3] J.Benesty; D.R.Morgan, and M.M.Sondhi, " A betterunderstanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation; " IEEE Trans.Speech Audio Processing; Vol.6, pp.156-165, Mar.1998).Need to prove that choosing of de-correlation must be prerequisite not damage voice quality.
Step 7) is optional) output of step 6) is increased a gain module 420.
System gain can be regulated through dual mode; The one, regulate the gain of digital signal; Promptly in gain module 420,
Figure BDA0000053433440000083
applied gain g, also can realize through the gain of regulating analogue system such as power amplifier.
Should be noted that the inhibition method of uttering long and high-pitched sounds described in the invention can use multiple mode to realize, the for example combination of hardware, software or hardware and software.Hardware platform can be FPGA, PLD or other application-specific integrated circuit ASICs.Software platform comprises DSP, ARM or other microprocessors.The combination of software and hardware for example part of module realizes that with dsp software part of module such as FFT realize with hardware accelerator.
It should be noted last that above embodiment is only unrestricted in order to technical scheme of the present invention to be described.Although the present invention is specified with reference to embodiment; Those of ordinary skill in the art is to be understood that; Technical scheme of the present invention is made amendment or is equal to replacement, do not break away from the spirit and the scope of technical scheme of the present invention, it all should be encompassed in the middle of the claim scope of the present invention.

Claims (8)

1. inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation, these method concrete steps comprise:
Step 1): the signal x (n) that respectively microphone is gathered with raise produce device output signal y (n) carry out branch frame, windowing and Fourier transform obtain the microphone frequency domain signal X (i, k) with raise produce device export frequency-region signal Y (i, k);
Step 2): (i, k) (i k) calculates crosspower spectrum φ between microphone and the speaker output signal to the microphone frequency domain signal X that obtains according to described step 1) with loud speaker output frequency-region signal Y Yx(i is k) with the power spectrum φ of speaker output signal Yy(i, k); And with crosspower spectrum φ Yx(i is k) divided by power spectrum φ Yy(i k) obtains the estimation of feedback path transfer function amplitude spectrum
Step 3): using said step 2) to obtain the transfer function of the feedback path estimate of the amplitude spectrum
Figure FDA0000053433430000012
calculate the feedback signal amplitude spectrum of the current frame
Figure FDA0000053433430000013
Step 4): utilize the microphone signal amplitude spectrum | X (i; K) | and feedback signal amplitude spectrum
Figure FDA0000053433430000014
the calculating acquisition gain function G of described step 3) acquisition (i, k);
Step 5): (i k) comes that (i k) revises and realizes uttering long and high-pitched sounds inhibition to the microphone frequency domain signal X to the gain function G that obtains according to described step 4).
2. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 1 is characterized in that this method also further comprises:
Step 6): the revised microphone frequency-region signal to described step 5) obtains is handled the final output signal of acquisition, and output signal
Figure FDA0000053433430000016
is carried out decorrelation handle; With
Step 7): described step 6) process result is carried out gain-adjusted.
3. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 1 is characterized in that described step 2) concrete steps that obtain feedback path transfer function amplitude spectrum comprise:
21): adopt the level and smooth method of single order autoregression, obtain crosspower spectrum φ according to formula (1) Yx(i, k);
φ yx(i,k)=α Cφ yx(i-1,k)+(1-α C)|Y *(i,k)X(i,k)| (1)
Wherein, α CBe smoothing factor, 0≤α C<1;
22): adopt the level and smooth method of single order autoregression, obtain power spectrum φ according to formula (2) Yy(i, k);
φ yy(i,k)=α Rφ yy(i-1,k)+(1-α R)Y *(i,k)Y(i,k) (2)
Wherein, α RBe smoothing factor, 0≤α R<1;
23): utilize crosspower spectrum φ Yx(i is k) with power spectrum φ Yy(i, k) root pick formula (3) obtains feedback path transfer function amplitude spectrum
| F ^ ′ ( i , k ) | = φ yx ( i , k ) φ yy ( i , k ) + δ - - - ( 3 ) .
4. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 1; It is characterized in that described step 3) is calculated according to formula (4) and obtained present frame feedback signal amplitude spectrum
Figure FDA0000053433430000021
| X ^ ( i , k ) | = β | F ^ ′ ( i , k ) | | X ( i , k ) | - - - ( 4 )
Wherein, β is an adjustable parameter, carries out this parameter of choose reasonable according to actual conditions.
5. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 1 is characterized in that the method for described step 4) calculated gains function comprises: based on priori signal feedback ratio method with based on check back signal feedback ratio method.
6. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 5 is characterized in that, described step 4) based on check back signal feedback ratio method according to formula (5) calculate to obtain gain function G (i, k);
G ( i , k ) = [ max ( | X ( i , k ) | α - | X ^ ( i , k ) | α , 0 ) | X ( i , k ) | α ] 1 α - - - ( 5 )
Wherein, α>0.
7. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 1 is characterized in that, described step 5) according to formula (6) to the microphone frequency domain signal X (i, k) revise and realize uttering long and high-pitched sounds inhibition:
S ^ ( i , k ) = X ( i , k ) G ( i , k ) - - - ( 6 ) .
8. the inhibition method of uttering long and high-pitched sounds based on the feedback signal spectrum estimation according to claim 2 is characterized in that, described step 7) adopts the gain of regulating digital signal or the gain of regulating the analogue system power amplifier.
CN201110082197.4A 2011-04-01 2011-04-01 Howling suppression method based on feedback signal spectrum estimation Active CN102740214B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110082197.4A CN102740214B (en) 2011-04-01 2011-04-01 Howling suppression method based on feedback signal spectrum estimation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110082197.4A CN102740214B (en) 2011-04-01 2011-04-01 Howling suppression method based on feedback signal spectrum estimation

Publications (2)

Publication Number Publication Date
CN102740214A true CN102740214A (en) 2012-10-17
CN102740214B CN102740214B (en) 2014-09-10

Family

ID=46994837

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110082197.4A Active CN102740214B (en) 2011-04-01 2011-04-01 Howling suppression method based on feedback signal spectrum estimation

Country Status (1)

Country Link
CN (1) CN102740214B (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103338419A (en) * 2013-06-29 2013-10-02 青岛歌尔声学科技有限公司 Method and device for removing headset scream
CN103905958A (en) * 2014-04-21 2014-07-02 杭州百控科技有限公司 Audio processing device and method
CN106303118A (en) * 2015-06-05 2017-01-04 福建凯米网络科技有限公司 Intelligent terminal realizes the method for microphone function, audio frequency playing method, equipment and system
CN106303878A (en) * 2015-05-22 2017-01-04 成都鼎桥通信技术有限公司 One is uttered long and high-pitched sounds and is detected and suppressing method
CN106303827A (en) * 2016-08-19 2017-01-04 宁波中荣声学科技有限公司 The anti-circuit of uttering long and high-pitched sounds of mike
CN106558316A (en) * 2016-11-09 2017-04-05 天津大学 It is a kind of based on it is long when signal special frequency band rate of change detection method of uttering long and high-pitched sounds
CN107257528A (en) * 2017-06-14 2017-10-17 山东浪潮云服务信息科技有限公司 A kind of detection method of uttering long and high-pitched sounds based on weighted spectral entropy
CN110265042A (en) * 2019-05-31 2019-09-20 歌尔科技有限公司 Audio signal processing method, device and equipment
CN111149370A (en) * 2017-09-29 2020-05-12 杜比实验室特许公司 Howling detection in a conferencing system
CN112309364A (en) * 2020-11-04 2021-02-02 广州市立锐升电子有限公司 Method, system and chip for realizing DSP multichannel squeal reduction processing
CN112349295A (en) * 2020-10-20 2021-02-09 浙江大华技术股份有限公司 Howling detection method and device
CN113611276A (en) * 2021-07-08 2021-11-05 北京小唱科技有限公司 Acoustic feedback suppression method, apparatus and storage medium
CN113838474A (en) * 2021-11-25 2021-12-24 全时云商务服务股份有限公司 Communication system howling suppression method and device
CN114401399A (en) * 2022-03-28 2022-04-26 广州迈聆信息科技有限公司 Audio bidirectional delay estimation method and device, conference terminal and storage medium
CN118016042A (en) * 2024-04-09 2024-05-10 成都启英泰伦科技有限公司 Howling suppression method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1533665A (en) * 2001-07-20 2004-09-29 皇家菲利浦电子有限公司 Echo canceller having nonlinear echo suppressor for harmonics calculations
CN101339769A (en) * 2007-07-03 2009-01-07 富士通株式会社 Echo suppressor and echo suppressing method
EP2086249A2 (en) * 2008-01-31 2009-08-05 Yamaha Corporation Howling suppression apparatus and computer readable recording medium
US20090232297A1 (en) * 2008-03-12 2009-09-17 Oki Electric Industry Co., Ltd. Echo canceler and echo canceling method
CN101778183A (en) * 2009-01-13 2010-07-14 华为终端有限公司 Method and device for suppressing residual echo

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1533665A (en) * 2001-07-20 2004-09-29 皇家菲利浦电子有限公司 Echo canceller having nonlinear echo suppressor for harmonics calculations
CN101339769A (en) * 2007-07-03 2009-01-07 富士通株式会社 Echo suppressor and echo suppressing method
EP2086249A2 (en) * 2008-01-31 2009-08-05 Yamaha Corporation Howling suppression apparatus and computer readable recording medium
US20090232297A1 (en) * 2008-03-12 2009-09-17 Oki Electric Industry Co., Ltd. Echo canceler and echo canceling method
CN101778183A (en) * 2009-01-13 2010-07-14 华为终端有限公司 Method and device for suppressing residual echo

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
吴鸣: ""采用随机相位技术的啸叫抑制方法"", 《电声技术》 *

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103338419B (en) * 2013-06-29 2016-07-06 青岛歌尔声学科技有限公司 A kind of eliminate method and the device that earphone is uttered long and high-pitched sounds
CN103338419A (en) * 2013-06-29 2013-10-02 青岛歌尔声学科技有限公司 Method and device for removing headset scream
CN103905958A (en) * 2014-04-21 2014-07-02 杭州百控科技有限公司 Audio processing device and method
CN106303878A (en) * 2015-05-22 2017-01-04 成都鼎桥通信技术有限公司 One is uttered long and high-pitched sounds and is detected and suppressing method
CN106303118A (en) * 2015-06-05 2017-01-04 福建凯米网络科技有限公司 Intelligent terminal realizes the method for microphone function, audio frequency playing method, equipment and system
CN106303827B (en) * 2016-08-19 2019-10-25 宁波中荣声学科技有限公司 The anti-circuit of uttering long and high-pitched sounds of microphone
CN106303827A (en) * 2016-08-19 2017-01-04 宁波中荣声学科技有限公司 The anti-circuit of uttering long and high-pitched sounds of mike
CN106558316A (en) * 2016-11-09 2017-04-05 天津大学 It is a kind of based on it is long when signal special frequency band rate of change detection method of uttering long and high-pitched sounds
CN107257528A (en) * 2017-06-14 2017-10-17 山东浪潮云服务信息科技有限公司 A kind of detection method of uttering long and high-pitched sounds based on weighted spectral entropy
US11277518B2 (en) 2017-09-29 2022-03-15 Dolby Laboratories Licensing Corporation Howl detection in conference systems
US11677879B2 (en) 2017-09-29 2023-06-13 Dolby Laboratories Licensing Corporation Howl detection in conference systems
CN111149370A (en) * 2017-09-29 2020-05-12 杜比实验室特许公司 Howling detection in a conferencing system
CN111149370B (en) * 2017-09-29 2021-10-01 杜比实验室特许公司 Howling detection in a conferencing system
CN110265042A (en) * 2019-05-31 2019-09-20 歌尔科技有限公司 Audio signal processing method, device and equipment
US11694700B2 (en) 2019-05-31 2023-07-04 Goertek Inc. Method, apparatus and device for processing sound signal
WO2020237954A1 (en) * 2019-05-31 2020-12-03 歌尔股份有限公司 Sound signal processing method, apparatus and device
CN112349295B (en) * 2020-10-20 2023-03-31 浙江大华技术股份有限公司 Howling detection method and device
CN112349295A (en) * 2020-10-20 2021-02-09 浙江大华技术股份有限公司 Howling detection method and device
CN112309364A (en) * 2020-11-04 2021-02-02 广州市立锐升电子有限公司 Method, system and chip for realizing DSP multichannel squeal reduction processing
CN113611276A (en) * 2021-07-08 2021-11-05 北京小唱科技有限公司 Acoustic feedback suppression method, apparatus and storage medium
CN113611276B (en) * 2021-07-08 2024-06-11 北京小唱科技有限公司 Acoustic feedback suppression method, apparatus and storage medium
CN113838474A (en) * 2021-11-25 2021-12-24 全时云商务服务股份有限公司 Communication system howling suppression method and device
CN113838474B (en) * 2021-11-25 2022-02-18 全时云商务服务股份有限公司 Communication system howling suppression method and device
CN114401399B (en) * 2022-03-28 2022-08-09 广州迈聆信息科技有限公司 Audio bidirectional delay estimation method and device, conference terminal and storage medium
CN114401399A (en) * 2022-03-28 2022-04-26 广州迈聆信息科技有限公司 Audio bidirectional delay estimation method and device, conference terminal and storage medium
CN118016042A (en) * 2024-04-09 2024-05-10 成都启英泰伦科技有限公司 Howling suppression method and device
CN118016042B (en) * 2024-04-09 2024-05-31 成都启英泰伦科技有限公司 Howling suppression method and device

Also Published As

Publication number Publication date
CN102740214B (en) 2014-09-10

Similar Documents

Publication Publication Date Title
CN102740214B (en) Howling suppression method based on feedback signal spectrum estimation
CN109727604B (en) Frequency domain echo cancellation method for speech recognition front end and computer storage medium
CN102739886B (en) Stereo echo offset method based on echo spectrum estimation and speech existence probability
CN108600894B (en) Earphone self-adaptive active noise control system and method
JP5675848B2 (en) Adaptive noise suppression by level cue
US8355511B2 (en) System and method for envelope-based acoustic echo cancellation
WO2015196729A1 (en) Microphone array speech enhancement method and device
JP4161628B2 (en) Echo suppression method and apparatus
US8010355B2 (en) Low complexity noise reduction method
CN102347028A (en) Double-microphone speech enhancer and speech enhancement method thereof
US20120197638A1 (en) Method and Device for Noise Reduction Control Using Microphone Array
CN101763858A (en) Method for processing double-microphone signal
CN101901602B (en) Method for reducing noise by using hearing threshold of impaired hearing
JP4957810B2 (en) Sound processing apparatus, sound processing method, and sound processing program
KR20030040399A (en) Voice enhancement system
JP6250147B2 (en) Hearing aid system signal processing method and hearing aid system
CN102740189B (en) Acoustic feedback inhibition method based on time reversal
EP3662650A1 (en) Multi-channel residual echo suppression
CN109727605A (en) Handle the method and system of voice signal
CN103929704B (en) The method and system that a kind of adaptive acoustic feedback based on transform domain is eliminated
EP2490218B1 (en) Method for interference suppression
JP3673727B2 (en) Reverberation elimination method, apparatus thereof, program thereof, and recording medium thereof
CN111785289B (en) Residual echo cancellation method and device
Seidel et al. Efficient deep acoustic echo suppression with condition-aware training
EP3531719B1 (en) Dereverberation device and hearing aid

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20180313

Address after: 215000 401 Room 401, No. 158, Jinfeng Road, science and Technology City, high tech Zone, Jiangsu Province

Patentee after: In the new science and Technology Co. Ltd. (Suzhou)

Address before: 100190 Beijing City, North Fourth Ring Road West, No. 21, No.

Patentee before: Institute of acoustics, Chinese Academy of Sciences