CN102147458B - Method and device for estimating direction of arrival (DOA) of broadband sound source - Google Patents

Method and device for estimating direction of arrival (DOA) of broadband sound source Download PDF

Info

Publication number
CN102147458B
CN102147458B CN 201010608798 CN201010608798A CN102147458B CN 102147458 B CN102147458 B CN 102147458B CN 201010608798 CN201010608798 CN 201010608798 CN 201010608798 A CN201010608798 A CN 201010608798A CN 102147458 B CN102147458 B CN 102147458B
Authority
CN
China
Prior art keywords
voice data
data
mould
coherence function
microphone
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN 201010608798
Other languages
Chinese (zh)
Other versions
CN102147458A (en
Inventor
冯大航
鲍明
管鲁阳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CN 201010608798 priority Critical patent/CN102147458B/en
Publication of CN102147458A publication Critical patent/CN102147458A/en
Application granted granted Critical
Publication of CN102147458B publication Critical patent/CN102147458B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Circuit For Audible Band Transducer (AREA)

Abstract

The invention relates to a method and device for estimating the direction of arrival (DOA) of a broadband sound source. The method comprises the following steps: selecting one section of sound data X(n)=[x1(n),...,xP(n)]T from sound data collected from a microphone array, wherein n is a subset of Z*; evenly dividing the selected sound data X(n) into L frames; carrying out N-point quick Fourier transform on each frame of sound data Xl (n); presenting the frequency domain of the obtained sound data as follows: Xl(k)=[xl1(k),...,xlP(k)]T, wherein k=1,..., N/2; then, calculating the modulus square value eta (k) of a coherence function between two microphones in the microphone array; randomly selecting the modulus square values of q coherence functions from the eta (k); and estimating the DOA only by using a frequency band corresponding to the modulus squares of the q coherence functions. According to the method and device, the computation amount of an algorithm is greatly lowered on the premise of ensuring that the precision is hardly lowered.

Description

A kind of Wave arrival direction estimating method and device thereof for broadband sound source
Technical field
The present invention relates to the Bearing technical field, particularly a kind of Wave arrival direction estimating method and device thereof for broadband sound source.
Background technology
Bearing has important application in fields such as sonar, radar, navigation and wireless sensor networks.And sensor array is the major way of Bearing.
In the Bearing technology, an important technological problems is how broadband sound source to be positioned.For this respect a lot of algorithms has been arranged.Broadband MUSIC algorithm be first with broadband signal by being fourier transformed on the frequency domain, then each frequency band is processed with the MUSIC method of arrowband, the direction that at last result of each frequency band is comprehensively obtained sound source is estimated.Be incoherent subspace method (ISM, Incoherent Sub-space Method).This class methods operand is little, but low precision, and can only process noncoherent signal; Maximum likelihood (ML, Maximum Likelihood) algorithm for estimating is the match class algorithm of representative, and is good gradual with it, applicability flexibly, and computation complexity that can be low has obtained extensive concern.For broadband signal, the people such as Yao professor Kung have proposed AML (the Approximated Maximum Likelihood) DOA estimation algorithm for broadband sound source.This class arithmetic accuracy is high, can process coherent signal, but operand is very large.Simultaneously, can also process with the method for focussing matrix for the coherent signal source.At first needing to utilize low complexity algorithm that Sounnd source direction is carried out one estimates, by this angle configuration focussing matrix of estimating, by focussing matrix on the signal focus to of the different frequency bands centre frequency, then it is processed as a narrow band signal, the shortcoming of this method is to know the angle of estimating of sound source, and the selection of estimating angle is very large for the Algorithm Performance impact.
Summary of the invention
The object of the invention is to, at the operand that guarantees greatly to reduce in the situation that precision does not reduce substantially algorithm.
For achieving the above object, a kind of Wave arrival direction estimating method for broadband sound source is proposed, the method concrete steps comprise:
Step 1): from the voice data that microphone array gathers, choose one section voice data X (n)=[x 1(n) ..., x P(n)] T, wherein, P represents the number of microphone in the microphone array, P 〉=2; This section voice data X (n) evenly is divided into the L frame, the data X of every frame l(n) length is N, and the data length of X (n) is N * L, l=1 ..., L; N ∈ Z *
To each frame voice data X l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data l(k)=[x L1(k) ..., x LP(k)] T, k=1 wherein, 2 ..., N/2;
Step 2): according to described step 1 X that) obtains l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to following formula (1);
η ( k ) = | γ i , j ( k ) | 2 = | Φ i , j ( k ) | 2 Φ i , i ( k ) Φ j , j ( k ) - - - ( 1 )
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ I, j(k) be the cross-power spectrum of i microphone and j microphone, Φ I, i(k) be the auto-power spectrum of i microphone,
Figure BSA00000401124000023
Φ J, j(k) be the auto-power spectrum of j microphone,
Figure BSA00000401124000024
Step 3): according to described step 2 η (k) that) obtains select at random q coherence function mould square;
The voice data X of square correspondence of q the coherence function mould of step 4): according to described step 3) selecting l(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
Described step 1) also comprises: preserve the continuous L frame voice data that described microphone array gathers till current; And preserve from behind the up-to-date 1 frame voice data of described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
Described step 2) in, two microphones are two microphones of space length maximum in the microphone array.
Described step 2) in, between two microphones continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
Described step 4) in, according to described step 3) the L frame voice data X of square correspondence of q coherence function mould selecting L(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
Described step 3) η (k) size that obtains in, more described step 2) is selected a front q η (k) value, i.e. η (k successively from the beginning of maximum 1) 〉=η (k 2) 〉=... 〉=η (k q) 〉=η (k m), wherein, k qThe expression band number, [k 1..., k q] ∈ [1,2 ...., N/2],
Figure BSA00000401124000031
For achieving the above object, propose a kind of direction of arrival estimation unit for broadband sound source, it is characterized in that this device comprises: the mould of Fourier transform module, coherence function square module and direction of arrival estimation module;
Described Fourier transform module comprises: data selection unit, data framing unit and Fourier transform unit; Be used for described data selection unit from the voice data that microphone array gathers, choose one section voice data X (n)=[x 1(n) ..., x P(n)] T, the voice data X (n) that described data selection unit is selected in described data framing unit evenly is divided into the L frame, and described Fourier transform unit is to each frame voice data X l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data l(k)=[x L1(k) ..., x LP(k)] TAnd the voice data of described frequency domain is exported to the mould square module of described coherence function; Wherein, P represents the number of microphone in the microphone array, P 〉=2; The data X of every frame l(n) length is N, l=1 ..., L, the data length of X (n) is N * L; N ∈ Z *K=1,2 ..., N/2;
The mould of described coherence function square module is used for the X that obtains according to described Fourier transform module l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to formula (2);
η ( k ) = | γ i , j ( k ) | 2 = | Φ i , j ( k ) | 2 Φ i , i ( k ) Φ j , j ( k ) - - - ( 2 )
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ I, j(k) be the cross-power spectrum of i microphone and j microphone,
Figure BSA00000401124000033
Φ I, i(k) be the auto-power spectrum of i microphone,
Figure BSA00000401124000034
Φ J, j(k) be the auto-power spectrum of j microphone,
Figure BSA00000401124000035
Described direction of arrival estimation module, the mould that is used for the coherence function that obtains according to the mould square module from described coherence function square is selected the voice data X of square correspondence of q coherence function mould at random l(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
In the mould of the described coherence function square module, two microphones are two microphones of space length maximum in the microphone array.
Described device also comprises the voice data cache module; Described voice data cache module be used for to be preserved the continuous L frame voice data that microphone array gathers till current; And preserve and to receive from behind the up-to-date 1 frame voice data of microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving; And export the voice data of buffer memory to described Fourier transform module.
Between two microphones that the mould of described coherence function square module will be chosen continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
The L frame voice data X of square correspondence of q the coherence function mould that described direction of arrival estimation module obtains according to described comparison module L(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
This device also comprises a comparison module, for η (k) size that the mould square module according to more described coherence function obtains, selects successively a front q η (k) value, i.e. η (k from the beginning of maximum 1) 〉=η (k 2) 〉=... 〉=η (k q) 〉=η (k m), wherein, k qThe expression band number, [k 1..., k q] ∈ [1,2 ...., N/2],
Figure BSA00000401124000041
Square export the mould of the coherence function picked out to described direction of arrival estimation module.
The invention has the advantages that the present invention is computation reduction greatly.Any algorithm that this shortcut calculation of while is discussed applicable to the background technology part.Take the AML algorithm as example, because the operand of AML algorithm is maximum, can draw in the suitable situation of directional effect by checking, greatly reduce operand.
Description of drawings
Fig. 1 is the space structure synoptic diagram of microphone array;
Fig. 2 is a kind of Wave arrival direction estimating method process flow diagram for broadband sound source that the present invention proposes;
Fig. 3 be in the l-G simulation test in the white noise situation embodiment of the invention scheme and Whole frequency band scheme carry out the as a result comparison diagram that direction of arrival is estimated;
Fig. 4 be in the l-G simulation test in the coloured noise situation embodiment of the invention scheme and Whole frequency band scheme carry out the as a result comparison diagram that direction of arrival is estimated;
Fig. 5 is that embodiment of the invention scheme and Whole frequency band scheme are carried out the as a result comparison diagram that direction of arrival is estimated in the field test.
Embodiment
The present invention is further described below in conjunction with drawings and Examples.
Be provided with R omnidirectional (omni-directional) microphone, p microphone receives signal and is:
x p ( n ) = Σ m = 1 M s 0 ( m ) ( n - t p ( m ) ) + w p ( n ) - - - ( 1 )
Wherein: n=0 ..., L-1, L are a frame signal sampled point number; P=1 ..., R, R are the number of microphone in the receiving array; M=1 ..., M, M are sound source quantity (M<R). Be source signal, Be the time delay that represents with sampling number.w p(n) be noise component in the array received signal.Wherein, time delay
Figure BSA00000401124000051
Geometric relationship according to formation and target sound source position obtains.For the unit of the R under far field condition circle battle array, namely R microphone equidistantly is arranged on the circumference, and the time delay that represents with sampling number of p microphone is:
t p ( m ) = f s r c cos ( 2 π ( p - 1 ) R - θ m ) - - - ( 2 )
Wherein, f sBe the signal sampling frequency, r is circle battle array radius, and c is the velocity of wave in the isotropic medium, θ mIt is the position angle, far field in m source.
Do to received signal the FFT conversion that N is ordered, get the frequency domain signal model:
X(k)=D(Θ,k)S 0(k)+η(k) (3)
In the formula, k=0 ..., N/2; Frequency domain array output signal: X (k)=[X 1(k) ..., X R(k)] TThe frequency spectrum of M sound source:
Figure BSA00000401124000053
Direction matrix (Steering Matrix) is: D (Θ, k)=[d (1)(k) ..., d (M)(k)]; Direction vector (Steering Vector) is:
Figure BSA00000401124000055
Figure BSA00000401124000056
η (k) is noise spectrum.
Noise spectrum vector η (k) is that variance is L σ 2Zero-mean plural number white Gaussian noise.Be any independent identically distributed stochastic variable even notice w (n), according to central limit theorem, all can know η (k) Gaussian distributed by inference.
Definition S (k)=D (k) S 0(k) (4)
Obtain NR/2 dimension space-time-frequency vector:
X=G(θ)+ξ 1 (5)
Wherein, G (θ)=[S (0) T..., S (N/2) T] T, and have:
R ξ = E [ ξξ H ] = L σ 2 I R 0 σ 2 I R L 0 σ 2 I R - - - ( 6 )
Wherein, superscript HRepresent the conjugate transpose of complex matrix.
Suppose that initial unknown parameter space is:
Θ = [ θ s T , S 0 ( 1 ) T , . . . , S 0 ( M ) T ] T - - - ( 7 )
Wherein, source side parallactic angle
Figure BSA00000401124000059
The source frequency spectrum
Figure BSA000004011240000510
So maximum likelihood function is arranged:
f ( Θ ) = 1 ( 2 π ) NR / 2 / 2 | R ξ | 1 / 2 exp { - 1 2 [ X - G ( Θ ) ] H R ξ - 1 [ X - G ( Θ ) ] } - - - ( 8 )
This likelihood function of logarithmetics also omits constant term and gets:
L ( Θ ) = - | | X - G ( Θ ) | | 2 R ξ - - - ( 9 )
Can obtain optimization criterion:
max Θ L ( Θ ) = min Θ Σ k = 0 N / 2 | | X ( k ) - D ( k ) S 0 ( k ) | | 2 R ξ ( k )
= min Θ Σ k = 0 N / 2 | | X ( k ) - D ( k ) S 0 ( k ) | | 2 σ 2 - - - ( 10 )
This is equivalent to all k function, finds Again because σ 2Be constant, f (k) can be abbreviated as:
f(k)=||X(k)-D(k)S 0(k)|| 2 (11)
The minimizing source signal vector S that f (k) is got 0(k) certain satisfied Therefore for any source being positioned with minimum residual error, the estimation of source vector is given:
Figure BSA00000401124000066
Wherein,
Figure BSA00000401124000067
It is the pseudo inverse matrix (pseudo-inverse) of direction matrix D (k).The definition vertical plan The direction of arrival of AML algorithm is estimated and can be obtained by following formula so.
θ s = max θ s Σ k = 0 N / 2 | | P ( k , θ s ) X ( k ) | | 2 - - - ( 13 )
Although voice signal is the broadband, signal generally all can not be white, and generally all there are some harmonics structures in it, and namely signal mainly concentrates on some frequency range; Or because noise is coloured, and the signal of some frequency band has been fallen into oblivion by noise fully, therefore when calculating, the information on these frequency bands is redundant, even can affect last result.So it is necessary adopting certain method to choose some frequency band, among the present invention program, with the standard of the coherence function on the frequency band as the selection frequency band.
Coherence function (coherence) is used for describing the correlativity between two groups of data.Among the present invention program, coherence function is defined as:
γ xy = G xy ( f ) G xx ( f ) G yy ( f ) - - - ( 14 )
Wherein, x represents two groups of different data, G with y Xy(f) be the cross-spectrum of x and y, G Xx(f) and G Yy(f) be respectively the certainly spectrum of x and y.G xy(f)=E{X(f)Y *(f)},G xx(f)=E{X(f)X *(f)},G yy(f)=E{Y(f)Y *(f)}。
Amplitude square coherence function (MSC) is coherence function γ XyMould square:
| γ xy ( f ) | 2 = | G xy ( f ) | 2 G xx ( f ) G yy ( f ) - - - ( 15 )
Thus, visible coherence function refers to all is computing on the frequency domain.
The value of coherence function is between 0 to 1, that is:
0≤|γ xy| 2≤1 (16)
Represent that when value is 0 the frequency spectrum on frequency range x and the y is fully uncorrelated, value is the frequency spectrum complete dependence on 1 expression frequency range x and the y.
In specific situation, have: signal to noise ratio (S/N ratio)
Figure BSA00000401124000072
This shows that MSC can be used for the size of the signal to noise ratio (S/N ratio) on certain frequency band of gauge signal.When | γ Xy(k) | 2When larger, show that signal useful on k the frequency band is more; Otherwise, illustrate that then noise accounts for major part on the k frequency band.
The space structure synoptic diagram of microphone array, as shown in Figure 1.Use the microphone array of cross formation shown in Figure 1 to carry out the collection of wind noise cancellation signal, 12 pieces of B﹠amp of circle 101 expressions; K 4189 type standard microphones, and be furnished with the spherical sponges fan housing that diameter is 10cm.102 expressions are used for fixing the support of these microphones.The aperture of upper three 4 element array of cross battle array is respectively 0.2 meter, 0.6 meter, and 1.2 meters.Adopt SONY SIR-1000 type 16 Channel Synchronous digital audio tapes, sample frequency is 48kHz.During data analysis, by low-pass filtering, normalization, down-sampledly analyze to 1kHz.
Use measured data, the spatial coherence of statistic array wind noise.Adopt following formula to calculate the interchannel average correlation coefficient:
γ ‾ ( k ) = 2 R ( R - 1 ) Σ i = 1 R Σ j = i + 1 R | γ ij ( k ) | - - - ( 17 )
Wherein, k=0 ..., N/2; R is port number, γ Ij(k) be the cross-correlation coefficient of i passage and j passage,
Figure BSA00000401124000074
Be average correlation coefficient.
Table 1 wind noise interchannel average correlation coefficient
Figure BSA00000401124000075
Table 1 explanation, different from the spatial coherence of low frequency wind noise, the wind noise interchannel related coefficient at medium-high frequency place is little, and spatial coherence is faint, and the array wind noise can think that substantially interchannel is separate.
The present invention program is after having gathered voice data, according to coefficient of coherence voice data is screened, filter out the voice data that carries the more frequency range of useful signal, and abandon the voice data of other frequency ranges, so, just reduce direction of arrival and estimated required data volume, thereby reduced the operand that follow-up direction of arrival is estimated.
A kind of Wave arrival direction estimating method process flow diagram for broadband sound source that the present invention proposes; As shown in Figure 2, comprise the steps:
Step 201: from the voice data that the microphone array that is comprised of P microphone gathers, select one section voice data x (n)=[x 1(n) ..., x P(n)] T, wherein, n ∈ Z *
Step 202: selected voice data x (n) evenly is divided into L section x l(n), l=1 ..., L, every section data x l(n) length is N, and the data data length of x (n) is N * L.
Step 203: to each section voice data x l(n) do N point quick Fourier conversion (FFT), the frequency domain representation that obtains voice data is X l(k)=[X L1(k) ..., X LP(k)] T, wherein, k=1,2 ..., N/2.
Step 204: square (MSC) that calculates the coherence function mould between two microphones in the microphone array
Figure BSA00000401124000081
Wherein, i, j ∈ [1,2 ..., P], Φ I, j(k) be the cross-power spectrum of i microphone and j microphone, Φ I, i(k) and Φ J, j(k) be respectively the auto-power spectrum of i microphone and j microphone:
Φ i , i ( k ) = Σ l = 1 L | X li ( k ) | 2 ; Φ j , j ( k ) = Σ l = 1 L | X lj ( k ) | 2 ; Φ i , j ( k ) = Σ l = 1 L X li ( k ) X lj * ( k ) .
Preferably, described two microphones can be two microphones of space length maximum in the microphone array.
Step 205: from square η (k) of coherence function mould, pick out the square value of larger q coherence function mould, i.e. η (k 1) 〉=η (k 2) 〉=... 〉=η (k q) 〉=η (k m), wherein, k qBe band number, [k 1..., k q] ∈ [1,2 ...., N/2],
Figure BSA00000401124000085
Step 206: the voice data X that uses square correspondence of q the coherence function mould of picking out l(k) carrying out direction of arrival estimates.Wherein, k ∈ [k 1..., k q]; Take the AML algorithm as example, then the computing formula of angle by (18) as can be known,
θ s = max θ s Σ k = k 1 k q | | P ( k , θ s ) X l ( k ) | | 2 - - - ( 18 )
Because the calculating of coherence function needs more data, namely N * L is larger, therefore may affect the real-time of method.In the calculating of coherence function, before having preserved continuously the data of (L-1) frame as data buffer storage, according to N data of current acquisition ease up deposit N * (L-1) individual data are asked for MSC.And obtain after the new frame data at every turn, all abandon the data of former frame, upgrade whole data buffer storage.Be x such as the total data in the system l(n), l=1 ..., L, wherein x L(n) be frame data of current acquisition, and x l(n), l=1 ..., L-1 is former data.Use x l(n), l=1 ..., L calculates MSC, and only uses x L(n) carry out angle estimation.
The present invention also proposes a kind of device of estimating for the direction of arrival of broadband sound source, comprising:
Fourier transform module is used for the voice data of the microphone array collection of P microphone composition is carried out fast fourier transform, obtains the frequency domain representation of voice data, and described frequency domain voice data is exported to the mould square module of coherence function;
The mould of coherence function square module be used for to be calculated the mould square of the coherence function of the voice data between two microphones of microphone array, and square exports the mould of the coherence function that calculates to comparison module;
Comparison module is used for square picking out a larger q value from the mould of coherence function;
The direction of arrival estimation module, the frequency band of voice data that is used for the mould square correspondence of the coherence function picked out according to comparison module carries out direction of arrival and estimates.
Preferably, the mould of described coherence function square module be used for to be calculated the mould square of the coherence function of voice data between microphone array middle distance two microphones farthest, and square exports the mould of the coherence function that calculates to comparison module.
Preferably, described Fourier transform module comprises:
Data selection unit, being used for from the voice data selection length of the microphone array collection that is comprised of P microphone is that one section voice data of N * L is as pending voice data x (n)=[x 1(n) ..., x P(n)] T, wherein, n ∈ Z *
The data framing unit is used for the selected pending voice data x of described data selection unit (n) is divided into the L section, and wherein arbitrary section voice data is x l(n), l=1 ..., L, and the voice data after the segmentation exported; And
Fourier transformation unit is used for each section voice data x to the output of described data framing unit l(n) do the conversion of N point quick Fourier, the frequency domain representation that obtains voice data is X l(k)=[X L1(k) ..., X LP(k)] T, wherein, k=1,2 ..., N/2.
Preferably, the mould of coherence function square module is calculated the mould of the coherence function between i microphone of microphone array and j the microphone and square is:
Figure BSA00000401124000091
Wherein, i, j ∈ [1,2 ..., P], Φ I, j(k) be the cross-power spectrum of i microphone and j microphone, Φ I, i(k) and Φ J, j(k) be respectively the auto-power spectrum of i microphone and j microphone.
Preferably, described comparison module is picked out the mould square value of a larger q coherence function, i.e. η (k from the mould square η (k) of N/2 coherence function 1) 〉=η (k 2) 〉=... 〉=η (k q), k qThe expression band number, k=1,2 ..., N/2, [k 1..., k q] ∈ [1,2 ...., N/2].
Preferably, the frequency band of the voice data of the mould square correspondence of the coherence function picked out according to comparison module of described DOA estimation module carries out direction of arrival and estimates.
Preferably, this device further comprises:
The voice data cache module is used for preserving the continuous L frame voice data that described microphone array gathers till current; And after receiving the 1 frame voice data up-to-date from described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
The capable fast fourier transform of voice data that described Fourier transform module is used for the voice data cache module is stored obtains the frequency domain representation of voice data, and described frequency domain voice data is exported to the mould square module of coherence function.
Below result by emulation experiment and experiment on the spot the present invention program's technique effect is described.In l-G simulation test, the microphone group pattern is the circle battle array, and array number is 4, and the radius of circle battle array is 0.2m, and sound source is white noise, and number is 1, and incident angle is 70 °.The AML algorithm of use Whole frequency band and the AML arithmetic result that frequency band is chosen contrast respectively as shown in Figure 3 and Figure 4, and wherein, Fig. 3 is the situation of white noise, and Fig. 4 is the situation of coloured noise.Wherein coloured noise is to be produced by the FIR wave filter of white noise by 50 rank.
As can be seen from Figure 3, when background was the condition of white noise, the AML algorithm of Whole frequency band was better than the AML algorithm that frequency band is chosen under the low signal-to-noise ratio; The deviation of these two kinds of algorithms is close under the high s/n ratio all trends towards zero.As can be seen from Figure 4, when background is coloured noise, it has not been optimum that low signal-to-noise ratio uses the AML algorithm of Whole frequency band, as can be seen from the figure under different signal to noise ratio (S/N ratio)s, should choose different frequency band number, this is that signal to noise ratio (S/N ratio) can be very large on some frequency band because background is coloured noise, and signal to noise ratio (S/N ratio) can be very little on some frequency band, therefore choose too much frequency band and can make the low frequency band of these signal to noise ratio (S/N ratio)s also participate in computing, thereby the deviation of algorithm is increased; Under the high s/n ratio, the deviation of two kinds of algorithms also all levels off to 0.
In the actual experiment, array is the circle battle array in the open air, and array number is 4, and circle battle array radius is 0.2m, and sound source is that tank moves in a circle around array.Experimental result as shown in Figure 5, icon " * " is the result of Whole frequency band search, icon " o " is the result that 20 frequency bands only getting the coefficient of coherence maximum calculate gained, as can be seen from the figure, both bearing accuracies are substantially the same, but after using coefficient of coherence to select frequency band, and operand is original 1/15.
Through the above description of the embodiments, those skilled in the art can be well understood to the present invention and can realize by the mode that software adds essential hardware platform, can certainly all implement by hardware, but the former is better embodiment in a lot of situation.Based on such understanding, technical scheme of the present invention is to can embodying with the form of software product in whole or in part that background technology contributes, this computer software product can be stored in the storage medium, such as ROM/RAM, magnetic disc, CD etc., comprise that some instructions are with so that computer equipment is carried out the described method of some part of each embodiment of the present invention or embodiment.Computer equipment is such as personal computer, server, perhaps network equipment etc.
It should be noted last that above embodiment is only unrestricted in order to technical scheme of the present invention to be described.Although with reference to embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that, technical scheme of the present invention is made amendment or is equal to replacement, do not break away from the spirit and scope of technical solution of the present invention, it all should be encompassed in the middle of the claim scope of the present invention.

Claims (12)

1. Wave arrival direction estimating method for broadband sound source, the method concrete steps comprise:
Step 1): from the voice data that microphone array gathers, choose one section voice data X (n)=[x 1(n) ..., x P(n)] T, wherein, P represents the number of microphone in the microphone array, P 〉=2; This section voice data X (n) evenly is divided into the L frame, the data X of every frame l(n) length is N, and the data length of X (n) is N * L, l=1 ..., L; N ∈ Z *
To each frame voice data X l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data l(k)=[x L1(k) ..., x LP(k)] T, k=1 wherein, 2 ..., N/2;
Step 2): the X that obtains according to described step 1) l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to following formula (1);
η ( k ) = | γ i , j ( k ) | 2 = | Φ i , j ( k ) | 2 Φ i , i ( k ) Φ j , j ( k ) - - - ( 1 )
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ I, j(k) be the cross-power spectrum of i microphone and j microphone,
Figure FDA00002275865200012
Φ I, i(k) be the auto-power spectrum of i microphone, Φ i , i ( k ) = Σ l = 1 L | x li ( k ) | 2 ; Φ J, j(k) be the auto-power spectrum of j microphone, Φ j , j ( k ) = Σ l = 1 L | x lj | 2 ;
Step 3): according to described step 2) η (k) that obtains from the beginning of maximum select successively q coherence function mould square;
Step 4): the voice data X of square correspondence of q the coherence function mould of selecting according to described step 3) l(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
2. the Wave arrival direction estimating method for broadband sound source according to claim 1 is characterized in that, described step 1) also comprises: preserve the continuous L frame voice data that described microphone array gathers till current; And preserve from behind the up-to-date 1 frame voice data of described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
3. the Wave arrival direction estimating method for broadband sound source according to claim 1 is characterized in that, described step 2) in, two microphones are two microphones of space length maximum in the microphone array.
4. the Wave arrival direction estimating method for broadband sound source according to claim 2, it is characterized in that, described step 2) in, between two microphones continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
5. the Wave arrival direction estimating method for broadband sound source according to claim 4 is characterized in that, in the described step 4), and the L frame voice data X of square correspondence of q the coherence function mould of selecting according to described step 3) L(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
6. the Wave arrival direction estimating method for broadband sound source according to claim 1, it is characterized in that in the described step 3), more described step 2) in η (k) size that obtains, select successively a front q η (k) value, i.e. η (k from the beginning of maximum 1) 〉=η (k 2) 〉=... 〉=η (k q) 〉=η (k m), wherein, k qThe expression band number, [k 1..., k q∈ [1,2 ...., N/2],
Figure FDA00002275865200021
7. the direction of arrival estimation unit for broadband sound source is characterized in that this device comprises: the mould of Fourier transform module, coherence function square module and direction of arrival estimation module;
Described Fourier transform module comprises: data selection unit, data framing unit and Fourier transform unit; Described data selection unit is chosen one section voice data X (n)=[x from the voice data that microphone array gathers 1(n) ..., x P(n)] T, the voice data X (n) that described data selection unit is selected in described data framing unit evenly is divided into the L frame, and described Fourier transform unit is to each frame voice data X l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data l(k)=[x L1(k) ..., x LP(k)] TAnd described frequency domain voice data is exported to the mould square module of described coherence function; Wherein, P represents the number of microphone in the microphone array, P 〉=2; The data X of every frame l(n) length is N, l=1 ..., L, the data length of X (n) is N * L; N ∈ Z *K=1,2 ..., N/2;
The mould of described coherence function square module is used for the X that obtains according to described Fourier transform module l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to formula (2);
η ( k ) = | γ i , j ( k ) | 2 = | Φ i , j ( k ) | 2 Φ i , i ( k ) Φ j , j ( k ) - - - ( 2 )
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ I, j(k) be the cross-power spectrum of i microphone and j microphone,
Figure FDA00002275865200023
Φ I, i(k) be the auto-power spectrum of i microphone,
Φ i , i ( k ) = Σ l = 1 L | X lj ( k ) | ; 2 Φ J, j(k) be the auto-power spectrum of j microphone, Φ j , j ( k ) = Σ l = 1 L | X lj ( k ) | 2 ;
Described direction of arrival estimation module, the mould square that is used for the coherence function that obtains according to the mould square module from described coherence function is selected the voice data X of square correspondence of q coherence function mould successively from the beginning of maximum l(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q], k qExpression band number, and [k 1..., k q] ∈ [1 ..., N/2].
8. the direction of arrival estimation unit for broadband sound source according to claim 7 is characterized in that, in the mould of the described coherence function square module, two microphones are two microphones of space length maximum in the microphone array.
9. the direction of arrival estimation unit for broadband sound source according to claim 7 is characterized in that described device also comprises the voice data cache module;
Described voice data cache module be used for to be preserved the continuous L frame voice data that microphone array gathers till current; And preserve and to receive from behind the up-to-date 1 frame voice data of microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving; And export the voice data of buffer memory to described Fourier transform module.
10. the direction of arrival estimation unit for broadband sound source according to claim 9, it is characterized in that, between two microphones that the mould of described coherence function square module will be chosen continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
11. the direction of arrival estimation unit for broadband sound source according to claim 10 is characterized in that, the L frame voice data X of square correspondence of q the coherence function mould that described direction of arrival estimation module obtains according to comparison module L(k) carry out direction of arrival and estimate, wherein, k ∈ [k 1..., k q].
12. the direction of arrival estimation unit for broadband sound source according to claim 7, it is characterized in that, this device also comprises a comparison module, be used for η (k) size that the mould square module according to more described coherence function obtains, select successively a front q η (k) value, i.e. η (k from the beginning of maximum 1) 〉=η (k 2) 〉=... 〉=η (k q) 〉=η (k m), wherein, k qThe expression band number, [k 1..., k q] ∈ [1,2 ...., N/2],
Figure FDA00002275865200033
Square export the mould of the coherence function picked out to described direction of arrival estimation module.
CN 201010608798 2010-12-17 2010-12-17 Method and device for estimating direction of arrival (DOA) of broadband sound source Expired - Fee Related CN102147458B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010608798 CN102147458B (en) 2010-12-17 2010-12-17 Method and device for estimating direction of arrival (DOA) of broadband sound source

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010608798 CN102147458B (en) 2010-12-17 2010-12-17 Method and device for estimating direction of arrival (DOA) of broadband sound source

Publications (2)

Publication Number Publication Date
CN102147458A CN102147458A (en) 2011-08-10
CN102147458B true CN102147458B (en) 2013-03-13

Family

ID=44421826

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010608798 Expired - Fee Related CN102147458B (en) 2010-12-17 2010-12-17 Method and device for estimating direction of arrival (DOA) of broadband sound source

Country Status (1)

Country Link
CN (1) CN102147458B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103217671B (en) * 2013-04-10 2014-09-17 哈尔滨工程大学 Multi-input and multi-output fast estimation method for radar receiving and transmitting angles under color-noise environment
CN104811886B (en) * 2015-04-10 2018-04-17 西安电子科技大学 Microphone array direction-finding method based on phase difference measurement
CN104811867B (en) * 2015-04-29 2017-11-21 西安电子科技大学 Microphone array airspace filter method based on array virtual extended
CN106054132B (en) * 2016-06-06 2018-06-08 西北工业大学 A kind of ISM methods based on the selection of effective subband and detection statistic weighting
CN106019214B (en) * 2016-07-08 2018-11-27 天津大学 Wide-band coherent signal source DOA estimation method
CN106500820B (en) * 2016-10-13 2019-08-20 华南理工大学 A kind of sonic velocity measurement method and device for estimating two-dimensional direction-of-arrival
CN106526533B (en) * 2016-11-14 2019-03-05 中国科学院上海微系统与信息技术研究所 A kind of micropore diameter MEMS acoustic matrix sensor and its application method
CN108710101A (en) * 2018-04-10 2018-10-26 贵州理工学院 A kind of DOA algorithm for estimating obtaining coherent signal subspace using orthogonality
CN113219399B (en) * 2020-08-05 2023-03-10 哈尔滨工业大学(威海) Far-field narrow-band radio signal direction-of-arrival estimation method based on full real value calculation
CN112485761B (en) * 2021-02-03 2021-04-09 成都启英泰伦科技有限公司 Sound source positioning method based on double microphones
CN115497501B (en) * 2022-11-18 2023-04-07 国网山东省电力公司济南供电公司 SW-MUSIC based transformer fault voiceprint positioning method and system

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3932620A1 (en) * 1989-09-29 1991-04-11 Mantel Juval LOCATION SYSTEM FOR SOUND IMPULSES
US6198693B1 (en) * 1998-04-13 2001-03-06 Andrea Electronics Corporation System and method for finding the direction of a wave source using an array of sensors
JP2001166025A (en) * 1999-12-14 2001-06-22 Matsushita Electric Ind Co Ltd Sound source direction estimating method, sound collection method and device
CN1875292A (en) * 2003-09-04 2006-12-06 声纳技术地图有限公司 Resolving directional information in sonar arrays
US8233353B2 (en) * 2007-01-26 2012-07-31 Microsoft Corporation Multi-sensor sound source localization
CN101470187B (en) * 2007-12-26 2011-12-28 中国科学院声学研究所 High-precision direction finding method used for linear array

Also Published As

Publication number Publication date
CN102147458A (en) 2011-08-10

Similar Documents

Publication Publication Date Title
CN102147458B (en) Method and device for estimating direction of arrival (DOA) of broadband sound source
CN104076331B (en) A kind of sound localization method of seven yuan of microphone arrays
CN103308889B (en) Passive sound source two-dimensional DOA (direction of arrival) estimation method under complex environment
Blandin et al. Multi-source TDOA estimation in reverberant audio using angular spectra and clustering
CN110491403A (en) Processing method, device, medium and the speech enabled equipment of audio signal
Pavlidi et al. 3D localization of multiple sound sources with intensity vector estimates in single source zones
CN111123192B (en) Two-dimensional DOA positioning method based on circular array and virtual extension
CN102411138A (en) Method for positioning sound source by robot
CN105301563B (en) A kind of double sound source localization method that least square method is converted based on consistent focusing
Salvati et al. Acoustic source localization from multirotor UAVs
WO2020024816A1 (en) Audio signal processing method and apparatus, device, and storage medium
US10670694B1 (en) System and method for hybrid-weighted, multi-gridded acoustic source location
Grobler et al. Sound based localization and identification in industrial environments
CN111798869B (en) Sound source positioning method based on double microphone arrays
CN108549052A (en) A kind of humorous domain puppet sound intensity sound localization method of circle of time-frequency-spatial domain joint weighting
CN106526563B (en) A kind of penton product battle array multi-target DOA estimation method based on cross-correlation virtual array
Hosseini et al. Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function
Li et al. Sound source localization for mobile robot based on time difference feature and space grid matching
Hu et al. Decoupled direction-of-arrival estimations using relative harmonic coefficients
Suo et al. Single acoustic vector sensor DOA enhanced by unsaturated bistable stochastic resonance with linear amplitude response constrained
Tourbabin et al. Speaker localization by humanoid robots in reverberant environments
Astapov et al. Simplified acoustic localization by linear arrays for wireless sensor networks
CN110838303B (en) Voice sound source positioning method using microphone array
Georgiou et al. Robust maximum likelihood source localization: The case for sub-Gaussian versus Gaussian
Nikunen et al. Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20130313

CF01 Termination of patent right due to non-payment of annual fee