CN106161820A - A kind of interchannel decorrelation method for stereo acoustic echo canceler - Google Patents

A kind of interchannel decorrelation method for stereo acoustic echo canceler Download PDF

Info

Publication number
CN106161820A
CN106161820A CN201510180915.XA CN201510180915A CN106161820A CN 106161820 A CN106161820 A CN 106161820A CN 201510180915 A CN201510180915 A CN 201510180915A CN 106161820 A CN106161820 A CN 106161820A
Authority
CN
China
Prior art keywords
frequency
value
carries out
signal
amplitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510180915.XA
Other languages
Chinese (zh)
Other versions
CN106161820B (en
Inventor
杨鹤飞
郑成诗
李晓东
彭任华
厉剑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CN201510180915.XA priority Critical patent/CN106161820B/en
Publication of CN106161820A publication Critical patent/CN106161820A/en
Application granted granted Critical
Publication of CN106161820B publication Critical patent/CN106161820B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)

Abstract

The invention provides a kind of interchannel decorrelation method for stereo acoustic echo canceler, including: step 1) far-end microphone pickup acoustic reference signal x1(n) and x2(n);Frequency spectrum X is obtained by Fourier transformation1(k) and X2(k);Step 2) to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation;Time domain reference signal x after being processed1d(n) and x2d(n);Step 3) to time domain reference signal x1N () carries out low frequency decorrelative transformation, construct three sinusoidal signals;Step 4) by time domain addition, above three sinusoidal signal is joined x1dIn (n);Obtain x1′(n);x2' (n) is x2d(n);Step 5) to step 4) the signal x that obtains1' (n) and x2' (n) carries out adaptive channel identification by adaptive algorithm, it is achieved echo cancellor.The method of the present invention effectively reduces the Inter-channel Correlation of boombox signal, improves the imbalance performance of stereo echo canceller system.

Description

A kind of interchannel decorrelation method for stereo acoustic echo canceler
Technical field
The present invention relates to acoustic echo technology for eliminating field, particularly to a kind of for stereo acoustic echo canceler Interchannel decorrelation method.
Background technology
In hands-free communication system, there is acoustic echo due to the closed loop transmission of acoustical signal.Such issues that at hands The scenes such as machine communication, TeleConference Bridge, video conferencing system generally exist.Acoustic echo can reduce system Communication quality, it is therefore desirable to eliminated, general employing adaptive channel identification algorithm (list of references [1]: Benesty J,Kellermann K et al.A perspective on stereophonic echo cancellation.Berlin,Heidelberg: Springer-Verlag Berlin Heidelberg, 2011:10.) realize the elimination of echo.In stereo communication system, There is strong Inter-channel Correlation in the boombox signal in near-end room, this causes adaptive channel identification to be calculated The convergency value of method is not unique, so that stereo acoustic echo eliminates the relatively big (list of references of wave filter imbalance of system [2]: Benesty J, Morgan D R et al.A better understanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation.IEEE Trans.Speech and Audio Processing,1998,6(2):156-165.)。
In order to improve imbalance performance, need in advance two passage Near end speaker signal to be carried out decorrelation pretreatment;With Time, in order to ensure communication quality, respective handling should not introduce obvious distortion.Have some the most both at home and abroad can use In the interchannel decorrelation method that stereo acoustic echo eliminates;Principle be in primary signal add a part other Signal or directly primary signal is converted;The most representational is classical non-linear transform algorithm (ginseng Examine document [2]), the method adds the nonlinear transformation composition of a part of primary signal in primary signal.These tradition The defect of method is that quality distortion is big, and decorrelation effect is the strongest, causes quality distortion the biggest.In order to go phase Closing and obtain more preferable compromise between effect and quality distortion, in recent years, domestic and international researcher starts with psychoacoustics Effect carries out interchannel decorrelative transformation, to obtain less quality distortion;Auditory masking effect on psychoacoustics Should (list of references [3]: Valin J M.Perceptually-motivated nonlinear channel decorrelation for stereo acoustic echo cancellation,In Proc.Joint Workshop Hands-Free Speech Communication and Microphone Arrays, Trento, May 2008:188-191.) and fundamental frequency loss effect (list of references [4]: Cecchi S, Romoli L et al.A combined psychoacoustic approach for stereo acoustic echo cancellation.IEEE Trans.Audio,Speech and Language Processing,2011, 19 (6): 1530-1539.) it is used for carrying out low frequency decorrelative transformation.
Compared with adaptive channel discrimination method in early days, method based on psychoacoustics effect can be less at quality distortion Obtain more preferable decorrelation effect simultaneously.But require higher occasion for eliminating echo, current based on psychology sound The method learning effect can not meet higher quality distortion requirement, and the calculating process of the method is complex.
Summary of the invention
It is an object of the invention to overcome the method being currently based on psychoacoustics effect above-mentioned present on elimination echo Defect, it is proposed that a kind of interchannel decorrelation method for stereo acoustic echo canceler, the method can be effective Reduce the Inter-channel Correlation of boombox signal, improve the imbalance performance of stereo echo canceller system, it Ensure that less quality distortion, and realize simple, computation complexity is low.
In order to realize foregoing invention purpose, the present invention proposes a kind of interchannel for stereo acoustic echo canceler and goes Correlation technique, described method includes:
Step 1) far-end microphone pick-up of acoustic signals, obtain stereo reference signal x1(n) and x2(n);Simultaneously to x1(n) And x2N () carries out Fourier transformation, obtain frequency spectrum X1(k) and X2(k);
Step 2) to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation;Time domain reference signal after being processed x1d(n) and x2d(n);
Step 3) to three-dimensional acoustic reference signal x1N () carries out low frequency decorrelative transformation, construct three sinusoidal signals: s1(n)、s2(n) and s3(n);
Step 4) pass through time domain addition by described three sinusoidal signals s1(n)、s2(n) and s3N () joins x1dIn (n); Obtain x '1(n);x′2N () is x2d(n);Described x '1(n) and x '2N () is stereo after Whole frequency band decorrelative transformation Signal;
Step 5) to step 4) the signal x ' that obtains1(n) and x '2N () carries out adaptive channel by adaptive algorithm and distinguishes Know, it is achieved echo cancellor.
In technique scheme, described step 2) particularly as follows:
Simultaneously to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation:
X 1 d ( k ) = X 1 ( k ) · [ ( 1 - β ( k ) ) + β ( k ) · e iΘ ] X 2 d ( k ) = X 2 ( k ) · [ ( 1 - β ( k ) ) + β ( k ) · e - iΘ ] - - - ( 5 )
Wherein, β (k) is nonlinear weight coefficient, it is contemplated that the pass of the phase sensitive following frequency of human auditory system System, β (k) value is:
&beta; ( k ) = 0.5 , f ( k ) &le; 2 kHz 0.5 + 0.4 &CenterDot; &CenterDot; f ( k ) - 2000 f s / 2 - 2000 , 2 kHz < f ( k ) < f s / 2 - - - ( 6 )
Wherein, f (k) is the linear frequency value corresponding to discrete frequency indices k,NFFT is The length of Fourier transformation;fsSample frequency for signal;β (k) is at fsMaximum is taken at/2;
Judge that analog frequency f (k) corresponding to k is more than f0Whether set up, described f0It is high frequency or low for division signals The marginal value of frequency;If a determination be made that negative, Θ value is set to 0, otherwise, Θ value is set to sinusoidal change The value changed;
To frequency spectrum X1d(k) and X2dK () carries out inverse Fourier transform, obtain the time domain reference after high frequency decorrelative transformation Signal x1d(n) and x2d(n)。
In technique scheme, described f0Value be: f0=1kHz.
In technique scheme, described step 3) specifically include:
Step 3-1) calculate reference signal x1The weighted auto-correlation function ψ of (n)w(τ);
x1N the average magnitude difference function φ (τ) of () is defined as:
φ (τ)=E{ | x1(n)-x1(n+τ)|} (7)
Wherein, τ is time delay, and E{} represents and seeks desired operation;
x1N the auto-correlation function of () is calculated as follows:
ψ (τ)=E{x1(n)x1(n+τ)}. (8)
Weighted auto-correlation function is:
ψw(τ)=ψ (τ)/[φ (τ)+ε], (9)
Wherein ε is to prevent quotient from dissipating the fixed value introduced, ε=1;
Step 3-2) calculate fundamental frequency cycles estimated value τ0
τ0={ τ: ψw(τ)=max (ψw(τ));fs/350≤τ≤fs/60} (10)
Wherein, max () expression takes maximum;
Step 3-3) recalculate fundamental frequency estimated value f0
To with maximum ψw0Adjacent 3 ψ centered by)w(τ) value carries out interpolation, then searches again for maximum and obtains Precision higher fundamental frequency cycles estimated value T0, then fundamental frequency value is:
f0=1/T0 (11)
Step 3-4) retrieval reference signal frequency spectrum X1K (), carries out amplitude weighting and obtains the amplitude of three low order harmonics compositions AiAnd first phase
k0Represent the frequency indices corresponding to fundamental frequency, then two, three harmonics correspond respectively to 2k0And 3k0;Utilize three Individual frequency indices value retrieval X1K (), respectively obtains fundamental frequency and the amplitude of two, three harmonics and first phase, carries out amplitude and add Power obtains:
Wherein, A1And σ1For fundamental frequency amplitude, first phase and the amplitude weighting factor, A2And σ2Humorous for secondary Amplitude, first phase and the amplitude weighting factor, the A of frequency3And σ3For adding for the amplitude of three harmonics, first phase and amplitude Weight factor;Angle{ } represent plural number taken phase function, σi(i=1,2,3) is in [0,1] interior value, it is preferred that σ1=0.1, σ2=0.2, σ3=0.3;
Step 3-5) construct three sinusoidal signals s1(n)、s2(n) and s3(n):
Wherein, fi=i f0, i=1,2,3.
In technique scheme, described adaptive algorithm is: least fibre method, normalized least fibre method or frequency Territory least fibre method.
It is an advantage of the current invention that:
1, the method for the present invention utilizes the nonlinear Phase Modulation of time-varying to tradition nonlinear transformation decorrelation at high band Method improves, thus obtains more preferable high frequency decorrelation effect;
2, the method for the present invention composes advantageous effect based on psychoacoustics, Implantation Energy near three low order harmonics compositions More weak sinusoidal signal, to carry out low frequency decorrelative transformation, which ensure that less quality distortion;
3, the method for the present invention utilizes time domain addition to inject in primary signal by sinusoidal signal, makes algorithm have relatively low Operand;
4, the energy size of the sinusoidal signal that the method for the present invention is injected by regulation is lost with tonequality in decorrelation effect Compromise in various degree is carried out between Zhen, therefore can be according to the requirement of concrete application scenarios by sine when reality is applied The energy of signal is arranged to different value, such as, for those occasions less demanding to tonequality, can increase and be injected The energy of sinusoidal signal to obtain more preferable decorrelation effect.
Accompanying drawing explanation
Fig. 1 is that stereo acoustic echo eliminates model general illustration;
Fig. 2 is the present invention schematic flow sheet for the interchannel decorrelation method of stereo acoustic echo canceler;
Fig. 3 is the interchannel phase obtained of the method using the present invention in example 1 and existing non-linear transform algorithm Responsibility number comparison diagram;
Fig. 4 is the imbalance performance obtained of method and the existing non-linear transform algorithm using the present invention in example 1 Comparison diagram;
Fig. 5 is the voice quality obtained of method and the existing non-linear transform algorithm using the present invention in example 1 The comparison diagram of perception assessment.
Detailed description of the invention
As it is shown in figure 1, in hands-free communication system, the acoustical signal of far-end speaker is by the stereomicrophone of far-end Device receives, and obtains stereo reference signal x of two passages1(n) and x2(n).Stereo reference signal x1(n) and x2(n) Through decorrelation pretreatment, the signal x after being processed1' (n) and x2'(n).Then by line transmission to near-end, x1' (n) and x2' (n) again by near-end stereophonic loud-speaker system play.Two microphone channel signals of near-end are respectively d1(n) and d2(n), the signal that i.e. near-end microphone receives, it is divided into two parts: a part is x1' (n) and x2' (n) by After Near end speaker plays back, its acoustical signal is propagated in proximal void, reflection in near-end room and dissipating After penetrating, arrive the signal of near-end microphone;Another part is the background noise in near-end scene and voice, i.e. Near end signal.Near-end microphone signal d1(n) and d2N the Part I signal in () can be returned to again far by line transmission End, and reset by far-end loudspeaker in far-end scene.So, the speaker in far-end scene just hears The sound of a part oneself, here it is need the acoustic echo eliminated.
If the signal vector corresponding to two channel speakers signals is respectively as follows:
x1' (n)=[x1′(n-L+1) x1′(n-L+2) ... x1′(n)]
x2' (n)=[x2′(n-L+1) x2′(n-L+2) ... x2′(n)]
The acoustic echo presented in far-end scene mainly due to caused by the acoustic propagation in near-end scene, because of This utilizes adaptive channel identification to carry out acoustic echo elimination in near-end scene.h12(n) and h22N () represents near respectively In end scene, two loudspeaker channel are to the room impulse response of left microphone,WithIt is adaptive the most respectively Answer passage identification algorithm to h12(n) and h22The estimation of (n);h21(n) and h11N () represents respectively in near-end scene and raises for two Sound device passage to the room impulse response of right microphone,WithIt is adaptive channel identification algorithm the most respectively To h21(n) and h11The estimation of (n).
Adaptive algorithm is passed through
e 1 ( n ) = d 1 ( n ) - [ h ^ 21 ( n ) * x 2 &prime; ( n ) + h ^ 11 ( n ) * x 1 &prime; ( n ) ] - - - ( 1 )
e 2 ( n ) = d 2 ( n ) - [ h ^ 22 ( n ) * x 2 &prime; ( n ) + h ^ 12 ( n ) * x 1 &prime; ( n ) ] - - - ( 2 )
Obtain the residue signal e after echo cancellor1(n) and e2(n);e1(n) and e2(n) stereo by far-end scene Speaker is play, and so can eliminate the acoustic echo occurred in far-end scene.
Wave filter imbalance is defined as:
&eta; ( n ) = 20 &CenterDot; lg &Sigma; i = 1 2 &Sigma; j = 1 2 | | h ij ( n ) - h ^ ij ( n ) | | 2 &Sigma; i = 1 2 &Sigma; j = 1 2 | | h ij ( n ) | | 2 - - - ( 3 )
Wherein, | | | |2Represent 2 norms.
If there is no decorrelation pretreatment, bigger wave filter imbalance value η (n) will be there is.It is to say, self adaptation is led to There is bigger wave filter estimation difference in road identification algorithm.Especially physical pulse response changes when, greatly Wave filter imbalance will cause adaptive tracing to slow even cannot accurately following the tracks of it to change, so that echo disappears Except effect severe exacerbation.Its reason is, if not having decorrelation pretreatment, then
x 1 &prime; ( n ) = x 1 ( n ) x 2 &prime; ( n ) = x 2 ( n ) - - - ( 4 )
Due to stereo reference signal x1(n) and x2N () comes from the same speaker in far-end scene, between them There is the strongest Inter-channel Correlation.This strong correlation will make the convergency value of adaptive channel identification algorithm the most only One, the sef-adapting filter converged to during actual applicationMay and be not equal to desired by us The physics transmission function h obtainedij(n), i=1,2;J=1,2, thus cause big wave filter to be lacked of proper care.
In order to solve this problem, improve imbalance performance, generally in reference signal x1(n) and x2N () is delivered to near-end field Before scape, it is carried out decorrelation pretreatment, thus effectively reduce wave filter imbalance, improve stereo acoustic echo and disappear Except performance.
The present invention will be further described in detail with specific embodiment below in conjunction with the accompanying drawings.
As in figure 2 it is shown, a kind of interchannel decorrelation method in stereo acoustic echo eliminates, concrete steps As follows:
Step 1) far-end microphone pick-up of acoustic signals, obtain stereo reference signal x1(n) and x2(n);Simultaneously to x1(n) And x2N () carries out Fourier transformation (FFT), obtain frequency spectrum X1(k) and X2(k);
Step 2) to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation;Time domain reference signal after being processed x1d(n) and x2d(n);Specifically include:
Simultaneously to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation:
X 1 d ( k ) = X 1 ( k ) &CenterDot; [ ( 1 - &beta; ( k ) ) + &beta; ( k ) &CenterDot; e i&Theta; ] X 2 d ( k ) = X 2 ( k ) &CenterDot; [ ( 1 - &beta; ( k ) ) + &beta; ( k ) &CenterDot; e - i&Theta; ] - - - ( 5 )
Wherein, β (k) is nonlinear weight coefficient, it is contemplated that the pass of the phase sensitive following frequency of human auditory system System, β (k) value is:
&beta; ( k ) = 0.5 , f ( k ) &le; 2 kHz 0.5 + 0.4 &CenterDot; f ( k ) - 2000 f s / 2 - 2000 , 2 kHz < f ( k ) < f s / 2 - - - ( 6 )
Wherein, f (k) is the linear frequency value corresponding to discrete frequency indices k,NFFT is The length of Fourier transformation;fsSample frequency for signal;β (k) is at fsMaximum is taken at/2;
Judge that analog frequency f (k) corresponding to k is more than f0Whether set up, described f0For division signals be high frequency still The marginal value of low frequency;Preferably, f0=1kHz;If a determination be made that negative, Θ value is set to 0, otherwise, Θ value is set to the value of sinusoidal variations;
To frequency spectrum X1d(k) and X2dK () carries out inverse Fourier transform (IFFT), obtain after high frequency decorrelative transformation time Territory reference signal x1d(n) and x2d(n)。
Step 3) to three-dimensional acoustic reference signal x1N () carries out low frequency decorrelative transformation, construct three sinusoidal signals; Specifically include:
Step 3-1) calculate stereo reference signal x1The weighted auto-correlation function ψ of (n)w(τ);
x1N average magnitude difference function (AMDF) φ (τ) of () is defined as:
φ (τ)=E{ | x1(n)-x1(n+τ)|} (7)
Wherein, τ is time delay, and E{} represents and seeks desired operation;
x1N the auto-correlation function of () is calculated as follows:
ψ (τ)=E{x1(n)x1(n+τ)}. (8)
Weighted auto-correlation function is:
ψw(τ)=ψ (τ)/[φ (τ)+ε], (9)
Wherein ε is to prevent quotient from dissipating the fixed value introduced, preferred value ε=1.
Step 3-2) calculate fundamental frequency cycles estimated value τ0
Weighted auto-correlation function ψw(τ) maximum ψw,maxCorresponding time delay number τ0It is the estimation of fundamental frequency cycles Value;The span of fundamental frequency is defined to [60,350] Hz, and this is equivalent to limit τ0Hunting zone, it may be assumed that
τ0={ τ: ψw(τ)=max (ψw(τ));fs/350≤τ≤fs/60} (10)
Wherein, max () expression takes maximum.
Step 3-3) recalculate fundamental frequency estimated value f0
In order to improve the accuracy of detection of fundamental frequency, further to maximum ψw0Adjacent 3 ψ centered by)w(τ) value Carry out interpolation, then search again for maximum and obtain precision higher fundamental frequency cycles estimated value T0, then fundamental frequency value is:
f0=1/T0 (11)
Step 3-4) retrieval reference signal frequency spectrum X1K (), carries out amplitude weighting and obtains the amplitude of three low order harmonics compositions AiAnd first phase
k0Represent the frequency indices corresponding to fundamental frequency, then two, three harmonics correspond respectively to 2k0And 3k0;Utilize three Individual frequency indices value retrieval X1K (), respectively obtains fundamental frequency and the amplitude of two, three harmonics and first phase, carries out amplitude and add Power obtains:
Wherein, A1And σ1For fundamental frequency amplitude, first phase and the amplitude weighting factor, A2And σ2Humorous for secondary Amplitude, first phase and the amplitude weighting factor, the A of frequency3And σ3For adding for the amplitude of three harmonics, first phase and amplitude Weight factor;Angle{ } represent plural number taken phase function, σi(i=1,2,3) is in [0,1] interior value, it is preferred that σ1=0.1, σ2=0.2, σ3=0.3.
Step 3-5) construct three sinusoidal signals s1(n)、s2(n) and s3(n):
Wherein, fi=i f0, i=1,2,3.
Step 4) pass through time domain addition by described three sinusoidal signals s1(n)、s2(n) and s3N () joins x1dIn (n); Obtain x '1(n);x′2N () is x2d(n);x′1(n) and x '2N () is the stereophonic signal after Whole frequency band decorrelative transformation:
x 1 &prime; ( n ) = x 1 d ( n ) + s 1 ( n ) + s 2 ( n ) + s 3 ( n ) x 2 &prime; ( n ) = x 2 d ( n ) - - - ( 14 )
Step 5) to step 4) the signal x ' that obtains1(n) and x '2N () carries out adaptive by normalized least fibre method Answer passage identification, it is achieved echo cancellor;
Within the n-th sampling period, utilize the sef-adapting filter in the n-th sampling periodTo x '1(n) and x′2N () filters, obtain the echo signal y at microphonej(n), it may be assumed that
y j ( n ) = &Sigma; i = 1 2 h ^ ij ( n ) * x i &prime; ( n ) - - - ( 15 )
Again at microphone signal djN echo signal is eliminated by (), obtain residue signal ej(n):
ej(n)=dj(n)-yj(n) (16)
Then sef-adapting filter is updated, obtains the sef-adapting filter in (n+1)th sampling period h ^ ij ( n + 1 ) :
h ^ ij ( n + 1 ) = h ^ ij ( n ) + &mu; &Sigma; i = 1 2 &Sigma; m = n - L + 1 n | x i &prime; ( m ) | 2 e ( n ) x i &prime; ( n ) - - - ( 17 )
Wherein, xi' (n)=[xi′(n-L+1) xi′(n-L+2) ... xi′(n)]。
For the next sampling period, then continue to carry out adaptive tracing from formula (17) to formula (19), until signal ended.
Described normalized least fibre method (NLMS) can be other adaptive algorithm: least fibre method (LMS) Or frequency domain least fibre method (FBLMS).
Below in conjunction with example, the present invention will be further described.
Example 1: near-end room a size of 4 × 3 × 3m3, two Near end speaker lay respectively at 1,2,1.2}m and {3,2,1.2}m;Investigation is positioned at that { the near-end microphone at 1.8,1,1.2}m, the acoustical signal it picked up is regarded as closely End microphone signal d (n).The white Gaussian noise simulation near end signal using signal to noise ratio to be 30dB.Sample rate takes 16 KHz, utilizes NLMS algorithm to carry out adaptive channel identification.By the method for the present invention and traditional nonlinear transformation Decorrelation method (NLT) carries out performance comparison.
The interchannel coherence factor of two channel signals x1 (n) and x2 (n) (Inter-Channel Coherence Coefficient, ICCC) it is defined as:
ICCC ( k ) = | E { X 1 * ( k ) &CenterDot; X 2 ( k ) } | E { | X 1 ( k ) | 2 } &CenterDot; E { | X 2 ( k ) | 2 } - - - ( 18 )
ICCC is in [0,1] interior value, and its value is the least, and inter-channel coherence is the most weak, and decorrelation effect is the best.Such as Fig. 4 Shown in, in the result of three methods, the ICCC value of two signals after being processed by the method for the present invention is Little.
In imbalance learning curve, reach stable state the shortest, represent that convergence rate is the fastest;Its value is the least, identification essence Spend the highest.As Fig. 4 shows, in the result of three methods, the signal after being processed by the method for the present invention is lost Adjust best performance.
Perceptual evaluation of speech quality (Perceptual Evaluation of Speech Quality, PESQ) is to multiple human ear Auditory properties models, by subjectivity tonequality objective quantification.PESQ is value between-0.5 to 4.5, and its value is the biggest, table Show that distortion is the least.As it is shown in figure 5, in the result of three methods, after being processed by the method for the present invention The PESQ value of signal is bigger than the PESQ value of signal after NLT process.

Claims (5)

1., for an interchannel decorrelation method for stereo acoustic echo canceler, described method includes:
Step 1) far-end microphone pick-up of acoustic signals, obtain stereo reference signal x1(n) and x2(n);Simultaneously to x1(n) And x2N () carries out Fourier transformation, obtain frequency spectrum X1(k) and X2(k);
Step 2) to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation;Time domain reference signal after being processed x1d(n) and x2d(n);
Step 3) to three-dimensional acoustic reference signal x1N () carries out low frequency decorrelative transformation, construct three sinusoidal signals: s1(n)、s2(n) and s3(n);
Step 4) pass through time domain addition by described three sinusoidal signals s1(n)、s2(n) and s3N () joins x1dIn (n); Obtain x1′(n);x2' (n) is x2d(n);x1' (n) and x2' (n) is the stereophonic signal after Whole frequency band decorrelative transformation;
Step 5) to step 4) the signal x that obtains1' (n) and x2' (n) carries out adaptive channel by adaptive algorithm and distinguishes Know, it is achieved echo cancellor.
Interchannel decorrelation method for stereo acoustic echo canceler the most according to claim 1, it is special Levy and be, described step 2) particularly as follows:
Simultaneously to frequency spectrum X1(k) and X2K () carries out high frequency decorrelative transformation:
X 1 d ( k ) = X 1 ( k ) &CenterDot; [ ( 1 - &beta; ( k ) ) + &beta; ( k ) &CenterDot; e i&Theta; ] X 2 d ( k ) = X 2 ( k ) &CenterDot; [ ( 1 - &beta; ( k ) ) + &beta; ( k ) &CenterDot; e i&Theta; ] - - - ( 5 )
Wherein, β (k) is nonlinear weight coefficient, it is contemplated that the pass of the phase sensitive following frequency of human auditory system System, β (k) value is:
&beta; ( k ) = 0.5 , f ( k ) &le; 2 kHz 0.5 + 0.4 &CenterDot; f ( k ) - 2000 f s / 2 - 2000 , 2 kHz < f ( k ) < f s / 2 - - - ( 6 )
Wherein, f (k) is the linear frequency value corresponding to discrete frequency indices k,NFFT is The length of Fourier transformation;fsSample frequency for signal;β (k) is at fsMaximum is taken at/2;
Judge that analog frequency f (k) corresponding to k is more than f0Whether set up, described f0It is high frequency or low for division signals The marginal value of frequency;If a determination be made that negative, Θ value is set to 0, otherwise, Θ value is set to sinusoidal change The value changed;
To frequency spectrum X1d(k) and X2dK () carries out inverse Fourier transform, obtain the time domain reference after high frequency decorrelative transformation Signal x1d(n) and x2d(n)。
Interchannel decorrelation method for stereo acoustic echo canceler the most according to claim 2, it is special Levy and be, described f0Value be: f0=1kHz.
Interchannel decorrelation method for stereo acoustic echo canceler the most according to claim 2, it is special Levy and be, described step 3) specifically include:
Step 3-1) calculate reference signal x1The weighted auto-correlation function ψ of (n)w(τ);
x1N the average magnitude difference function φ (τ) of () is defined as:
φ (τ)=E{ | x1(n)-x1(n+τ)|} (7)
Wherein, τ is time delay, and E{} represents and seeks desired operation;
x1N the auto-correlation function of () is calculated as follows:
ψ (τ)=E{x1(n)x1(n+τ)}. (8)
Weighted auto-correlation function is:
ψw(τ)=ψ (τ)/[φ (τ)+ε], (9)
Wherein ε is to prevent quotient from dissipating the fixed value introduced, ε=1;
Step 3-2) calculate fundamental frequency cycles estimated value τ0
τ0={ τ: ψw(τ)=max (ψw(τ));fs/350≤τ≤fs/60} (10)
Wherein, max () expression takes maximum;
Step 3-3) recalculate fundamental frequency estimated value f0
To with maximum ψw0Adjacent 3 ψ centered by)w(τ) value carries out interpolation, then searches again for maximum and obtains Precision higher fundamental frequency cycles estimated value T0, then fundamental frequency value is:
f0=1/T0 (11)
Step 3-4) retrieval reference signal frequency spectrum X1K (), carries out amplitude weighting and obtains the amplitude of three low order harmonics compositions AiAnd first phase
k0Represent the frequency indices corresponding to fundamental frequency, then two, three harmonics correspond respectively to 2k0And 3k0;Utilize three Individual frequency indices value retrieval X1K (), respectively obtains fundamental frequency and the amplitude of two, three harmonics and first phase, carries out amplitude and add Power obtains:
Wherein, A1And σ1For fundamental frequency amplitude, first phase and the amplitude weighting factor, A2And σ2Humorous for secondary Amplitude, first phase and the amplitude weighting factor, the A of frequency3And σ3For adding for the amplitude of three harmonics, first phase and amplitude Weight factor;Angle{ } represent plural number taken phase function, σi(i=1,2,3) is in [0,1] interior value, it is preferred that σ1=0.1, σ2=0.2, σ3=0.3;
Step 3-5) construct three sinusoidal signals s1(n)、s2(n) and s3(n):
Wherein, fi=i f0, i=1,2,3.
Interchannel decorrelation method for stereo acoustic echo canceler the most according to claim 4, it is special Levying and be, described adaptive algorithm is: least fibre method, normalized least fibre method or frequency domain least fibre method.
CN201510180915.XA 2015-04-16 2015-04-16 A kind of interchannel decorrelation method for stereo acoustic echo canceler Active CN106161820B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510180915.XA CN106161820B (en) 2015-04-16 2015-04-16 A kind of interchannel decorrelation method for stereo acoustic echo canceler

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510180915.XA CN106161820B (en) 2015-04-16 2015-04-16 A kind of interchannel decorrelation method for stereo acoustic echo canceler

Publications (2)

Publication Number Publication Date
CN106161820A true CN106161820A (en) 2016-11-23
CN106161820B CN106161820B (en) 2019-04-23

Family

ID=58058019

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510180915.XA Active CN106161820B (en) 2015-04-16 2015-04-16 A kind of interchannel decorrelation method for stereo acoustic echo canceler

Country Status (1)

Country Link
CN (1) CN106161820B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107452398A (en) * 2017-08-09 2017-12-08 深圳创维数字技术有限公司 Echo acquisition methods, electronic equipment and computer-readable recording medium
CN109509482A (en) * 2018-12-12 2019-03-22 北京达佳互联信息技术有限公司 Echo cancel method, echo cancelling device, electronic equipment and readable medium
CN111726464A (en) * 2020-06-29 2020-09-29 珠海全志科技股份有限公司 Multichannel echo filtering method, filtering device and readable storage medium
CN111755020A (en) * 2020-08-07 2020-10-09 南京时保联信息科技有限公司 Stereo echo cancellation method
CN111756943A (en) * 2020-06-05 2020-10-09 中原工学院 Self-adaptive stereo acoustic echo cancellation method for sub-band with increased complex value and proportional value

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101010723A (en) * 2004-08-25 2007-08-01 杜比实验室特许公司 Multichannel decorrelation in spatial audio coding
US20120207325A1 (en) * 2011-02-10 2012-08-16 Dolby Laboratories Licensing Corporation Multi-Channel Wind Noise Suppression System and Method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101010723A (en) * 2004-08-25 2007-08-01 杜比实验室特许公司 Multichannel decorrelation in spatial audio coding
US20120207325A1 (en) * 2011-02-10 2012-08-16 Dolby Laboratories Licensing Corporation Multi-Channel Wind Noise Suppression System and Method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
杨鹤飞等:: ""一种基于谱优势的混合立体声声学回声抵消方法"", 《声学技术》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107452398A (en) * 2017-08-09 2017-12-08 深圳创维数字技术有限公司 Echo acquisition methods, electronic equipment and computer-readable recording medium
CN107452398B (en) * 2017-08-09 2021-03-16 深圳创维数字技术有限公司 Echo acquisition method, electronic device and computer readable storage medium
CN109509482A (en) * 2018-12-12 2019-03-22 北京达佳互联信息技术有限公司 Echo cancel method, echo cancelling device, electronic equipment and readable medium
CN109509482B (en) * 2018-12-12 2022-03-25 北京达佳互联信息技术有限公司 Echo cancellation method, echo cancellation device, electronic apparatus, and readable medium
CN111756943A (en) * 2020-06-05 2020-10-09 中原工学院 Self-adaptive stereo acoustic echo cancellation method for sub-band with increased complex value and proportional value
CN111726464A (en) * 2020-06-29 2020-09-29 珠海全志科技股份有限公司 Multichannel echo filtering method, filtering device and readable storage medium
CN111726464B (en) * 2020-06-29 2021-04-20 珠海全志科技股份有限公司 Multichannel echo filtering method, filtering device and readable storage medium
CN111755020A (en) * 2020-08-07 2020-10-09 南京时保联信息科技有限公司 Stereo echo cancellation method
CN111755020B (en) * 2020-08-07 2023-02-28 南京时保联信息科技有限公司 Stereo echo cancellation method

Also Published As

Publication number Publication date
CN106161820B (en) 2019-04-23

Similar Documents

Publication Publication Date Title
Sridhar et al. ICASSP 2021 acoustic echo cancellation challenge: Datasets, testing framework, and results
CN107123430B (en) Echo cancel method, device, meeting plate and computer storage medium
CN111292759B (en) Stereo echo cancellation method and system based on neural network
CN105957520B (en) A kind of voice status detection method suitable for echo cancelling system
CN102804747B (en) Multichannel echo canceller
CN106161820A (en) A kind of interchannel decorrelation method for stereo acoustic echo canceler
US6249581B1 (en) Spectrum-based adaptive canceller of acoustic echoes arising in hands-free audio
Cutler et al. INTERSPEECH 2021 Acoustic Echo Cancellation Challenge.
CN101888455B (en) Self-adaptive echo counteracting method for frequency domain
CN106251877A (en) Voice Sounnd source direction method of estimation and device
CN103369162B (en) A kind of listener&#39;s echo self adaptive elimination method of low complex degree
Zhang et al. FT-LSTM based complex network for joint acoustic echo cancellation and speech enhancement
CN104883462B (en) A kind of sef-adapting filter and filtering method for eliminating acoustic echo
CN105869651A (en) Two-channel beam forming speech enhancement method based on noise mixed coherence
Schwartz et al. Joint estimation of late reverberant and speech power spectral densities in noisy environments using Frobenius norm
US10937418B1 (en) Echo cancellation by acoustic playback estimation
Albu et al. Proportionate algorithms for two-microphone active feedback cancellation
Ivry et al. Nonlinear acoustic echo cancellation with deep learning
CN111312269A (en) Rapid echo cancellation method in intelligent loudspeaker box
US11380312B1 (en) Residual echo suppression for keyword detection
CN107071196B (en) A kind of adaptive echo cancellation method
Lee et al. Stereophonic acoustic echo suppression incorporating spectro-temporal correlations
CN103370741B (en) Process audio signal
US11386911B1 (en) Dereverberation and noise reduction
JP2003309493A (en) Method, device and program for reducing echo

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant