CN102147458B - Method and device for estimating direction of arrival (DOA) of broadband sound source - Google Patents
Method and device for estimating direction of arrival (DOA) of broadband sound source Download PDFInfo
- Publication number
- CN102147458B CN102147458B CN 201010608798 CN201010608798A CN102147458B CN 102147458 B CN102147458 B CN 102147458B CN 201010608798 CN201010608798 CN 201010608798 CN 201010608798 A CN201010608798 A CN 201010608798A CN 102147458 B CN102147458 B CN 102147458B
- Authority
- CN
- China
- Prior art keywords
- voice data
- data
- mould
- coherence function
- microphone
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 230000006870 function Effects 0.000 claims abstract description 70
- 238000001228 spectrum Methods 0.000 claims description 24
- 230000014509 gene expression Effects 0.000 claims description 8
- 238000006243 chemical reaction Methods 0.000 claims description 7
- 238000009432 framing Methods 0.000 claims description 6
- 238000012217 deletion Methods 0.000 claims description 5
- 230000037430 deletion Effects 0.000 claims description 5
- 238000004422 calculation algorithm Methods 0.000 abstract description 21
- 239000011159 matrix material Substances 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 5
- 238000007476 Maximum Likelihood Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 3
- 238000005070 sampling Methods 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000001427 coherent effect Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Landscapes
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention relates to a method and device for estimating the direction of arrival (DOA) of a broadband sound source. The method comprises the following steps: selecting one section of sound data X(n)=[x1(n),...,xP(n)]T from sound data collected from a microphone array, wherein n is a subset of Z*; evenly dividing the selected sound data X(n) into L frames; carrying out N-point quick Fourier transform on each frame of sound data Xl (n); presenting the frequency domain of the obtained sound data as follows: Xl(k)=[xl1(k),...,xlP(k)]T, wherein k=1,..., N/2; then, calculating the modulus square value eta (k) of a coherence function between two microphones in the microphone array; randomly selecting the modulus square values of q coherence functions from the eta (k); and estimating the DOA only by using a frequency band corresponding to the modulus squares of the q coherence functions. According to the method and device, the computation amount of an algorithm is greatly lowered on the premise of ensuring that the precision is hardly lowered.
Description
Technical field
The present invention relates to the Bearing technical field, particularly a kind of Wave arrival direction estimating method and device thereof for broadband sound source.
Background technology
Bearing has important application in fields such as sonar, radar, navigation and wireless sensor networks.And sensor array is the major way of Bearing.
In the Bearing technology, an important technological problems is how broadband sound source to be positioned.For this respect a lot of algorithms has been arranged.Broadband MUSIC algorithm be first with broadband signal by being fourier transformed on the frequency domain, then each frequency band is processed with the MUSIC method of arrowband, the direction that at last result of each frequency band is comprehensively obtained sound source is estimated.Be incoherent subspace method (ISM, Incoherent Sub-space Method).This class methods operand is little, but low precision, and can only process noncoherent signal; Maximum likelihood (ML, Maximum Likelihood) algorithm for estimating is the match class algorithm of representative, and is good gradual with it, applicability flexibly, and computation complexity that can be low has obtained extensive concern.For broadband signal, the people such as Yao professor Kung have proposed AML (the Approximated Maximum Likelihood) DOA estimation algorithm for broadband sound source.This class arithmetic accuracy is high, can process coherent signal, but operand is very large.Simultaneously, can also process with the method for focussing matrix for the coherent signal source.At first needing to utilize low complexity algorithm that Sounnd source direction is carried out one estimates, by this angle configuration focussing matrix of estimating, by focussing matrix on the signal focus to of the different frequency bands centre frequency, then it is processed as a narrow band signal, the shortcoming of this method is to know the angle of estimating of sound source, and the selection of estimating angle is very large for the Algorithm Performance impact.
Summary of the invention
The object of the invention is to, at the operand that guarantees greatly to reduce in the situation that precision does not reduce substantially algorithm.
For achieving the above object, a kind of Wave arrival direction estimating method for broadband sound source is proposed, the method concrete steps comprise:
Step 1): from the voice data that microphone array gathers, choose one section voice data X (n)=[x
1(n) ..., x
P(n)]
T, wherein, P represents the number of microphone in the microphone array, P 〉=2; This section voice data X (n) evenly is divided into the L frame, the data X of every frame
l(n) length is N, and the data length of X (n) is N * L, l=1 ..., L; N ∈ Z
*
To each frame voice data X
l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data
l(k)=[x
L1(k) ..., x
LP(k)]
T, k=1 wherein, 2 ..., N/2;
Step 2): according to described step 1 X that) obtains
l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to following formula (1);
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone,
Φ
I, i(k) be the auto-power spectrum of i microphone,
Φ
J, j(k) be the auto-power spectrum of j microphone,
Step 3): according to described step 2 η (k) that) obtains select at random q coherence function mould square;
The voice data X of square correspondence of q the coherence function mould of step 4): according to described step 3) selecting
l(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
Described step 1) also comprises: preserve the continuous L frame voice data that described microphone array gathers till current; And preserve from behind the up-to-date 1 frame voice data of described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
Described step 2) in, two microphones are two microphones of space length maximum in the microphone array.
Described step 2) in, between two microphones continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
Described step 4) in, according to described step 3) the L frame voice data X of square correspondence of q coherence function mould selecting
L(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
Described step 3) η (k) size that obtains in, more described step 2) is selected a front q η (k) value, i.e. η (k successively from the beginning of maximum
1) 〉=η (k
2) 〉=... 〉=η (k
q) 〉=η (k
m), wherein, k
qThe expression band number, [k
1..., k
q] ∈ [1,2 ...., N/2],
For achieving the above object, propose a kind of direction of arrival estimation unit for broadband sound source, it is characterized in that this device comprises: the mould of Fourier transform module, coherence function square module and direction of arrival estimation module;
Described Fourier transform module comprises: data selection unit, data framing unit and Fourier transform unit; Be used for described data selection unit from the voice data that microphone array gathers, choose one section voice data X (n)=[x
1(n) ..., x
P(n)]
T, the voice data X (n) that described data selection unit is selected in described data framing unit evenly is divided into the L frame, and described Fourier transform unit is to each frame voice data X
l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data
l(k)=[x
L1(k) ..., x
LP(k)]
TAnd the voice data of described frequency domain is exported to the mould square module of described coherence function; Wherein, P represents the number of microphone in the microphone array, P 〉=2; The data X of every frame
l(n) length is N, l=1 ..., L, the data length of X (n) is N * L; N ∈ Z
*K=1,2 ..., N/2;
The mould of described coherence function square module is used for the X that obtains according to described Fourier transform module
l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to formula (2);
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone,
Φ
I, i(k) be the auto-power spectrum of i microphone,
Φ
J, j(k) be the auto-power spectrum of j microphone,
Described direction of arrival estimation module, the mould that is used for the coherence function that obtains according to the mould square module from described coherence function square is selected the voice data X of square correspondence of q coherence function mould at random
l(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
In the mould of the described coherence function square module, two microphones are two microphones of space length maximum in the microphone array.
Described device also comprises the voice data cache module; Described voice data cache module be used for to be preserved the continuous L frame voice data that microphone array gathers till current; And preserve and to receive from behind the up-to-date 1 frame voice data of microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving; And export the voice data of buffer memory to described Fourier transform module.
Between two microphones that the mould of described coherence function square module will be chosen continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
The L frame voice data X of square correspondence of q the coherence function mould that described direction of arrival estimation module obtains according to described comparison module
L(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
This device also comprises a comparison module, for η (k) size that the mould square module according to more described coherence function obtains, selects successively a front q η (k) value, i.e. η (k from the beginning of maximum
1) 〉=η (k
2) 〉=... 〉=η (k
q) 〉=η (k
m), wherein, k
qThe expression band number, [k
1..., k
q] ∈ [1,2 ...., N/2],
Square export the mould of the coherence function picked out to described direction of arrival estimation module.
The invention has the advantages that the present invention is computation reduction greatly.Any algorithm that this shortcut calculation of while is discussed applicable to the background technology part.Take the AML algorithm as example, because the operand of AML algorithm is maximum, can draw in the suitable situation of directional effect by checking, greatly reduce operand.
Description of drawings
Fig. 1 is the space structure synoptic diagram of microphone array;
Fig. 2 is a kind of Wave arrival direction estimating method process flow diagram for broadband sound source that the present invention proposes;
Fig. 3 be in the l-G simulation test in the white noise situation embodiment of the invention scheme and Whole frequency band scheme carry out the as a result comparison diagram that direction of arrival is estimated;
Fig. 4 be in the l-G simulation test in the coloured noise situation embodiment of the invention scheme and Whole frequency band scheme carry out the as a result comparison diagram that direction of arrival is estimated;
Fig. 5 is that embodiment of the invention scheme and Whole frequency band scheme are carried out the as a result comparison diagram that direction of arrival is estimated in the field test.
Embodiment
The present invention is further described below in conjunction with drawings and Examples.
Be provided with R omnidirectional (omni-directional) microphone, p microphone receives signal and is:
Wherein: n=0 ..., L-1, L are a frame signal sampled point number; P=1 ..., R, R are the number of microphone in the receiving array; M=1 ..., M, M are sound source quantity (M<R).
Be source signal,
Be the time delay that represents with sampling number.w
p(n) be noise component in the array received signal.Wherein, time delay
Geometric relationship according to formation and target sound source position obtains.For the unit of the R under far field condition circle battle array, namely R microphone equidistantly is arranged on the circumference, and the time delay that represents with sampling number of p microphone is:
Wherein, f
sBe the signal sampling frequency, r is circle battle array radius, and c is the velocity of wave in the isotropic medium, θ
mIt is the position angle, far field in m source.
Do to received signal the FFT conversion that N is ordered, get the frequency domain signal model:
X(k)=D(Θ,k)S
0(k)+η(k) (3)
In the formula, k=0 ..., N/2; Frequency domain array output signal: X (k)=[X
1(k) ..., X
R(k)]
TThe frequency spectrum of M sound source:
Direction matrix (Steering Matrix) is: D (Θ, k)=[d
(1)(k) ..., d
(M)(k)]; Direction vector (Steering Vector) is:
η (k) is noise spectrum.
Noise spectrum vector η (k) is that variance is L σ
2Zero-mean plural number white Gaussian noise.Be any independent identically distributed stochastic variable even notice w (n), according to central limit theorem, all can know η (k) Gaussian distributed by inference.
Definition S (k)=D (k) S
0(k) (4)
Obtain NR/2 dimension space-time-frequency vector:
X=G(θ)+ξ
1 (5)
Wherein, G (θ)=[S (0)
T..., S (N/2)
T]
T, and have:
Wherein, superscript
HRepresent the conjugate transpose of complex matrix.
Suppose that initial unknown parameter space is:
So maximum likelihood function is arranged:
This likelihood function of logarithmetics also omits constant term and gets:
Can obtain optimization criterion:
This is equivalent to all k function, finds
Again because σ
2Be constant, f (k) can be abbreviated as:
f(k)=||X(k)-D(k)S
0(k)||
2 (11)
The minimizing source signal vector S that f (k) is got
0(k) certain satisfied
Therefore for any source being positioned with minimum residual error, the estimation of source vector is given:
Wherein,
It is the pseudo inverse matrix (pseudo-inverse) of direction matrix D (k).The definition vertical plan
The direction of arrival of AML algorithm is estimated and can be obtained by following formula so.
Although voice signal is the broadband, signal generally all can not be white, and generally all there are some harmonics structures in it, and namely signal mainly concentrates on some frequency range; Or because noise is coloured, and the signal of some frequency band has been fallen into oblivion by noise fully, therefore when calculating, the information on these frequency bands is redundant, even can affect last result.So it is necessary adopting certain method to choose some frequency band, among the present invention program, with the standard of the coherence function on the frequency band as the selection frequency band.
Coherence function (coherence) is used for describing the correlativity between two groups of data.Among the present invention program, coherence function is defined as:
Wherein, x represents two groups of different data, G with y
Xy(f) be the cross-spectrum of x and y, G
Xx(f) and G
Yy(f) be respectively the certainly spectrum of x and y.G
xy(f)=E{X(f)Y
*(f)},G
xx(f)=E{X(f)X
*(f)},G
yy(f)=E{Y(f)Y
*(f)}。
Amplitude square coherence function (MSC) is coherence function γ
XyMould square:
Thus, visible coherence function refers to all is computing on the frequency domain.
The value of coherence function is between 0 to 1, that is:
0≤|γ
xy|
2≤1 (16)
Represent that when value is 0 the frequency spectrum on frequency range x and the y is fully uncorrelated, value is the frequency spectrum complete dependence on 1 expression frequency range x and the y.
In specific situation, have: signal to noise ratio (S/N ratio)
This shows that MSC can be used for the size of the signal to noise ratio (S/N ratio) on certain frequency band of gauge signal.When | γ
Xy(k) |
2When larger, show that signal useful on k the frequency band is more; Otherwise, illustrate that then noise accounts for major part on the k frequency band.
The space structure synoptic diagram of microphone array, as shown in Figure 1.Use the microphone array of cross formation shown in Figure 1 to carry out the collection of wind noise cancellation signal, 12 pieces of B﹠amp of circle 101 expressions; K 4189 type standard microphones, and be furnished with the spherical sponges fan housing that diameter is 10cm.102 expressions are used for fixing the support of these microphones.The aperture of upper three 4 element array of cross battle array is respectively 0.2 meter, 0.6 meter, and 1.2 meters.Adopt SONY SIR-1000 type 16 Channel Synchronous digital audio tapes, sample frequency is 48kHz.During data analysis, by low-pass filtering, normalization, down-sampledly analyze to 1kHz.
Use measured data, the spatial coherence of statistic array wind noise.Adopt following formula to calculate the interchannel average correlation coefficient:
Wherein, k=0 ..., N/2; R is port number, γ
Ij(k) be the cross-correlation coefficient of i passage and j passage,
Be average correlation coefficient.
Table 1 wind noise interchannel average correlation coefficient
Table 1 explanation, different from the spatial coherence of low frequency wind noise, the wind noise interchannel related coefficient at medium-high frequency place is little, and spatial coherence is faint, and the array wind noise can think that substantially interchannel is separate.
The present invention program is after having gathered voice data, according to coefficient of coherence voice data is screened, filter out the voice data that carries the more frequency range of useful signal, and abandon the voice data of other frequency ranges, so, just reduce direction of arrival and estimated required data volume, thereby reduced the operand that follow-up direction of arrival is estimated.
A kind of Wave arrival direction estimating method process flow diagram for broadband sound source that the present invention proposes; As shown in Figure 2, comprise the steps:
Step 201: from the voice data that the microphone array that is comprised of P microphone gathers, select one section voice data x (n)=[x
1(n) ..., x
P(n)]
T, wherein, n ∈ Z
*
Step 202: selected voice data x (n) evenly is divided into L section x
l(n), l=1 ..., L, every section data x
l(n) length is N, and the data data length of x (n) is N * L.
Step 203: to each section voice data x
l(n) do N point quick Fourier conversion (FFT), the frequency domain representation that obtains voice data is X
l(k)=[X
L1(k) ..., X
LP(k)]
T, wherein, k=1,2 ..., N/2.
Step 204: square (MSC) that calculates the coherence function mould between two microphones in the microphone array
Wherein, i, j ∈ [1,2 ..., P], Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone, Φ
I, i(k) and Φ
J, j(k) be respectively the auto-power spectrum of i microphone and j microphone:
Preferably, described two microphones can be two microphones of space length maximum in the microphone array.
Step 205: from square η (k) of coherence function mould, pick out the square value of larger q coherence function mould, i.e. η (k
1) 〉=η (k
2) 〉=... 〉=η (k
q) 〉=η (k
m), wherein, k
qBe band number, [k
1..., k
q] ∈ [1,2 ...., N/2],
Step 206: the voice data X that uses square correspondence of q the coherence function mould of picking out
l(k) carrying out direction of arrival estimates.Wherein, k ∈ [k
1..., k
q]; Take the AML algorithm as example, then the computing formula of angle by (18) as can be known,
Because the calculating of coherence function needs more data, namely N * L is larger, therefore may affect the real-time of method.In the calculating of coherence function, before having preserved continuously the data of (L-1) frame as data buffer storage, according to N data of current acquisition ease up deposit N * (L-1) individual data are asked for MSC.And obtain after the new frame data at every turn, all abandon the data of former frame, upgrade whole data buffer storage.Be x such as the total data in the system
l(n), l=1 ..., L, wherein x
L(n) be frame data of current acquisition, and x
l(n), l=1 ..., L-1 is former data.Use x
l(n), l=1 ..., L calculates MSC, and only uses x
L(n) carry out angle estimation.
The present invention also proposes a kind of device of estimating for the direction of arrival of broadband sound source, comprising:
Fourier transform module is used for the voice data of the microphone array collection of P microphone composition is carried out fast fourier transform, obtains the frequency domain representation of voice data, and described frequency domain voice data is exported to the mould square module of coherence function;
The mould of coherence function square module be used for to be calculated the mould square of the coherence function of the voice data between two microphones of microphone array, and square exports the mould of the coherence function that calculates to comparison module;
Comparison module is used for square picking out a larger q value from the mould of coherence function;
The direction of arrival estimation module, the frequency band of voice data that is used for the mould square correspondence of the coherence function picked out according to comparison module carries out direction of arrival and estimates.
Preferably, the mould of described coherence function square module be used for to be calculated the mould square of the coherence function of voice data between microphone array middle distance two microphones farthest, and square exports the mould of the coherence function that calculates to comparison module.
Preferably, described Fourier transform module comprises:
Data selection unit, being used for from the voice data selection length of the microphone array collection that is comprised of P microphone is that one section voice data of N * L is as pending voice data x (n)=[x
1(n) ..., x
P(n)]
T, wherein, n ∈ Z
*
The data framing unit is used for the selected pending voice data x of described data selection unit (n) is divided into the L section, and wherein arbitrary section voice data is x
l(n), l=1 ..., L, and the voice data after the segmentation exported; And
Fourier transformation unit is used for each section voice data x to the output of described data framing unit
l(n) do the conversion of N point quick Fourier, the frequency domain representation that obtains voice data is X
l(k)=[X
L1(k) ..., X
LP(k)]
T, wherein, k=1,2 ..., N/2.
Preferably, the mould of coherence function square module is calculated the mould of the coherence function between i microphone of microphone array and j the microphone and square is:
Wherein, i, j ∈ [1,2 ..., P], Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone, Φ
I, i(k) and Φ
J, j(k) be respectively the auto-power spectrum of i microphone and j microphone.
Preferably, described comparison module is picked out the mould square value of a larger q coherence function, i.e. η (k from the mould square η (k) of N/2 coherence function
1) 〉=η (k
2) 〉=... 〉=η (k
q), k
qThe expression band number, k=1,2 ..., N/2, [k
1..., k
q] ∈ [1,2 ...., N/2].
Preferably, the frequency band of the voice data of the mould square correspondence of the coherence function picked out according to comparison module of described DOA estimation module carries out direction of arrival and estimates.
Preferably, this device further comprises:
The voice data cache module is used for preserving the continuous L frame voice data that described microphone array gathers till current; And after receiving the 1 frame voice data up-to-date from described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
The capable fast fourier transform of voice data that described Fourier transform module is used for the voice data cache module is stored obtains the frequency domain representation of voice data, and described frequency domain voice data is exported to the mould square module of coherence function.
Below result by emulation experiment and experiment on the spot the present invention program's technique effect is described.In l-G simulation test, the microphone group pattern is the circle battle array, and array number is 4, and the radius of circle battle array is 0.2m, and sound source is white noise, and number is 1, and incident angle is 70 °.The AML algorithm of use Whole frequency band and the AML arithmetic result that frequency band is chosen contrast respectively as shown in Figure 3 and Figure 4, and wherein, Fig. 3 is the situation of white noise, and Fig. 4 is the situation of coloured noise.Wherein coloured noise is to be produced by the FIR wave filter of white noise by 50 rank.
As can be seen from Figure 3, when background was the condition of white noise, the AML algorithm of Whole frequency band was better than the AML algorithm that frequency band is chosen under the low signal-to-noise ratio; The deviation of these two kinds of algorithms is close under the high s/n ratio all trends towards zero.As can be seen from Figure 4, when background is coloured noise, it has not been optimum that low signal-to-noise ratio uses the AML algorithm of Whole frequency band, as can be seen from the figure under different signal to noise ratio (S/N ratio)s, should choose different frequency band number, this is that signal to noise ratio (S/N ratio) can be very large on some frequency band because background is coloured noise, and signal to noise ratio (S/N ratio) can be very little on some frequency band, therefore choose too much frequency band and can make the low frequency band of these signal to noise ratio (S/N ratio)s also participate in computing, thereby the deviation of algorithm is increased; Under the high s/n ratio, the deviation of two kinds of algorithms also all levels off to 0.
In the actual experiment, array is the circle battle array in the open air, and array number is 4, and circle battle array radius is 0.2m, and sound source is that tank moves in a circle around array.Experimental result as shown in Figure 5, icon " * " is the result of Whole frequency band search, icon " o " is the result that 20 frequency bands only getting the coefficient of coherence maximum calculate gained, as can be seen from the figure, both bearing accuracies are substantially the same, but after using coefficient of coherence to select frequency band, and operand is original 1/15.
Through the above description of the embodiments, those skilled in the art can be well understood to the present invention and can realize by the mode that software adds essential hardware platform, can certainly all implement by hardware, but the former is better embodiment in a lot of situation.Based on such understanding, technical scheme of the present invention is to can embodying with the form of software product in whole or in part that background technology contributes, this computer software product can be stored in the storage medium, such as ROM/RAM, magnetic disc, CD etc., comprise that some instructions are with so that computer equipment is carried out the described method of some part of each embodiment of the present invention or embodiment.Computer equipment is such as personal computer, server, perhaps network equipment etc.
It should be noted last that above embodiment is only unrestricted in order to technical scheme of the present invention to be described.Although with reference to embodiment the present invention is had been described in detail, those of ordinary skill in the art is to be understood that, technical scheme of the present invention is made amendment or is equal to replacement, do not break away from the spirit and scope of technical solution of the present invention, it all should be encompassed in the middle of the claim scope of the present invention.
Claims (12)
1. Wave arrival direction estimating method for broadband sound source, the method concrete steps comprise:
Step 1): from the voice data that microphone array gathers, choose one section voice data X (n)=[x
1(n) ..., x
P(n)]
T, wherein, P represents the number of microphone in the microphone array, P 〉=2; This section voice data X (n) evenly is divided into the L frame, the data X of every frame
l(n) length is N, and the data length of X (n) is N * L, l=1 ..., L; N ∈ Z
*
To each frame voice data X
l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data
l(k)=[x
L1(k) ..., x
LP(k)]
T, k=1 wherein, 2 ..., N/2;
Step 2): the X that obtains according to described step 1)
l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to following formula (1);
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone,
Φ
I, i(k) be the auto-power spectrum of i microphone,
Φ
J, j(k) be the auto-power spectrum of j microphone,
Step 3): according to described step 2) η (k) that obtains from the beginning of maximum select successively q coherence function mould square;
Step 4): the voice data X of square correspondence of q the coherence function mould of selecting according to described step 3)
l(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
2. the Wave arrival direction estimating method for broadband sound source according to claim 1 is characterized in that, described step 1) also comprises: preserve the continuous L frame voice data that described microphone array gathers till current; And preserve from behind the up-to-date 1 frame voice data of described microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving.
3. the Wave arrival direction estimating method for broadband sound source according to claim 1 is characterized in that, described step 2) in, two microphones are two microphones of space length maximum in the microphone array.
4. the Wave arrival direction estimating method for broadband sound source according to claim 2, it is characterized in that, described step 2) in, between two microphones continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
5. the Wave arrival direction estimating method for broadband sound source according to claim 4 is characterized in that, in the described step 4), and the L frame voice data X of square correspondence of q the coherence function mould of selecting according to described step 3)
L(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
6. the Wave arrival direction estimating method for broadband sound source according to claim 1, it is characterized in that in the described step 3), more described step 2) in η (k) size that obtains, select successively a front q η (k) value, i.e. η (k from the beginning of maximum
1) 〉=η (k
2) 〉=... 〉=η (k
q) 〉=η (k
m), wherein, k
qThe expression band number, [k
1..., k
q∈ [1,2 ...., N/2],
7. the direction of arrival estimation unit for broadband sound source is characterized in that this device comprises: the mould of Fourier transform module, coherence function square module and direction of arrival estimation module;
Described Fourier transform module comprises: data selection unit, data framing unit and Fourier transform unit; Described data selection unit is chosen one section voice data X (n)=[x from the voice data that microphone array gathers
1(n) ..., x
P(n)]
T, the voice data X (n) that described data selection unit is selected in described data framing unit evenly is divided into the L frame, and described Fourier transform unit is to each frame voice data X
l(n) do the conversion of N point quick Fourier, obtain the frequency domain representation X of voice data
l(k)=[x
L1(k) ..., x
LP(k)]
TAnd described frequency domain voice data is exported to the mould square module of described coherence function; Wherein, P represents the number of microphone in the microphone array, P 〉=2; The data X of every frame
l(n) length is N, l=1 ..., L, the data length of X (n) is N * L; N ∈ Z
*K=1,2 ..., N/2;
The mould of described coherence function square module is used for the X that obtains according to described Fourier transform module
l(k) calculate the mould square η (k) of coherence function between two microphones in the microphone array according to formula (2);
Wherein, k=1,2 ..., N/2; I, j ∈ [1,2 ..., P]; Φ
I, j(k) be the cross-power spectrum of i microphone and j microphone,
Φ
I, i(k) be the auto-power spectrum of i microphone,
Described direction of arrival estimation module, the mould square that is used for the coherence function that obtains according to the mould square module from described coherence function is selected the voice data X of square correspondence of q coherence function mould successively from the beginning of maximum
l(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q], k
qExpression band number, and [k
1..., k
q] ∈ [1 ..., N/2].
8. the direction of arrival estimation unit for broadband sound source according to claim 7 is characterized in that, in the mould of the described coherence function square module, two microphones are two microphones of space length maximum in the microphone array.
9. the direction of arrival estimation unit for broadband sound source according to claim 7 is characterized in that described device also comprises the voice data cache module;
Described voice data cache module be used for to be preserved the continuous L frame voice data that microphone array gathers till current; And preserve and to receive from behind the up-to-date 1 frame voice data of microphone array, with the earliest 1 frame voice data deletion in the continuous L frame voice data of preserving; And export the voice data of buffer memory to described Fourier transform module.
10. the direction of arrival estimation unit for broadband sound source according to claim 9, it is characterized in that, between two microphones that the mould of described coherence function square module will be chosen continuously the data of (L-1) frame ask for the mould square of coherence function as data buffer storage according to N data of the current acquisition L frame individual data of the N that deposits * (L-1) of easing up.
11. the direction of arrival estimation unit for broadband sound source according to claim 10 is characterized in that, the L frame voice data X of square correspondence of q the coherence function mould that described direction of arrival estimation module obtains according to comparison module
L(k) carry out direction of arrival and estimate, wherein, k ∈ [k
1..., k
q].
12. the direction of arrival estimation unit for broadband sound source according to claim 7, it is characterized in that, this device also comprises a comparison module, be used for η (k) size that the mould square module according to more described coherence function obtains, select successively a front q η (k) value, i.e. η (k from the beginning of maximum
1) 〉=η (k
2) 〉=... 〉=η (k
q) 〉=η (k
m), wherein, k
qThe expression band number, [k
1..., k
q] ∈ [1,2 ...., N/2],
Square export the mould of the coherence function picked out to described direction of arrival estimation module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010608798 CN102147458B (en) | 2010-12-17 | 2010-12-17 | Method and device for estimating direction of arrival (DOA) of broadband sound source |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010608798 CN102147458B (en) | 2010-12-17 | 2010-12-17 | Method and device for estimating direction of arrival (DOA) of broadband sound source |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102147458A CN102147458A (en) | 2011-08-10 |
CN102147458B true CN102147458B (en) | 2013-03-13 |
Family
ID=44421826
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010608798 Expired - Fee Related CN102147458B (en) | 2010-12-17 | 2010-12-17 | Method and device for estimating direction of arrival (DOA) of broadband sound source |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102147458B (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103217671B (en) * | 2013-04-10 | 2014-09-17 | 哈尔滨工程大学 | Multi-input and multi-output fast estimation method for radar receiving and transmitting angles under color-noise environment |
CN104811886B (en) * | 2015-04-10 | 2018-04-17 | 西安电子科技大学 | Microphone array direction-finding method based on phase difference measurement |
CN104811867B (en) * | 2015-04-29 | 2017-11-21 | 西安电子科技大学 | Microphone array airspace filter method based on array virtual extended |
CN106054132B (en) * | 2016-06-06 | 2018-06-08 | 西北工业大学 | A kind of ISM methods based on the selection of effective subband and detection statistic weighting |
CN106019214B (en) * | 2016-07-08 | 2018-11-27 | 天津大学 | Wide-band coherent signal source DOA estimation method |
CN106500820B (en) * | 2016-10-13 | 2019-08-20 | 华南理工大学 | A kind of sonic velocity measurement method and device for estimating two-dimensional direction-of-arrival |
CN106526533B (en) * | 2016-11-14 | 2019-03-05 | 中国科学院上海微系统与信息技术研究所 | A kind of micropore diameter MEMS acoustic matrix sensor and its application method |
CN108710101A (en) * | 2018-04-10 | 2018-10-26 | 贵州理工学院 | A kind of DOA algorithm for estimating obtaining coherent signal subspace using orthogonality |
CN113219399B (en) * | 2020-08-05 | 2023-03-10 | 哈尔滨工业大学(威海) | Far-field narrow-band radio signal direction-of-arrival estimation method based on full real value calculation |
CN112485761B (en) * | 2021-02-03 | 2021-04-09 | 成都启英泰伦科技有限公司 | Sound source positioning method based on double microphones |
CN115497501B (en) * | 2022-11-18 | 2023-04-07 | 国网山东省电力公司济南供电公司 | SW-MUSIC based transformer fault voiceprint positioning method and system |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3932620A1 (en) * | 1989-09-29 | 1991-04-11 | Mantel Juval | LOCATION SYSTEM FOR SOUND IMPULSES |
US6198693B1 (en) * | 1998-04-13 | 2001-03-06 | Andrea Electronics Corporation | System and method for finding the direction of a wave source using an array of sensors |
JP2001166025A (en) * | 1999-12-14 | 2001-06-22 | Matsushita Electric Ind Co Ltd | Sound source direction estimating method, sound collection method and device |
CN1875292A (en) * | 2003-09-04 | 2006-12-06 | 声纳技术地图有限公司 | Resolving directional information in sonar arrays |
US8233353B2 (en) * | 2007-01-26 | 2012-07-31 | Microsoft Corporation | Multi-sensor sound source localization |
CN101470187B (en) * | 2007-12-26 | 2011-12-28 | 中国科学院声学研究所 | High-precision direction finding method used for linear array |
-
2010
- 2010-12-17 CN CN 201010608798 patent/CN102147458B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN102147458A (en) | 2011-08-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102147458B (en) | Method and device for estimating direction of arrival (DOA) of broadband sound source | |
CN104076331B (en) | A kind of sound localization method of seven yuan of microphone arrays | |
CN103308889B (en) | Passive sound source two-dimensional DOA (direction of arrival) estimation method under complex environment | |
Blandin et al. | Multi-source TDOA estimation in reverberant audio using angular spectra and clustering | |
CN110491403A (en) | Processing method, device, medium and the speech enabled equipment of audio signal | |
Pavlidi et al. | 3D localization of multiple sound sources with intensity vector estimates in single source zones | |
CN111123192B (en) | Two-dimensional DOA positioning method based on circular array and virtual extension | |
CN102411138A (en) | Method for positioning sound source by robot | |
CN105301563B (en) | A kind of double sound source localization method that least square method is converted based on consistent focusing | |
Salvati et al. | Acoustic source localization from multirotor UAVs | |
WO2020024816A1 (en) | Audio signal processing method and apparatus, device, and storage medium | |
US10670694B1 (en) | System and method for hybrid-weighted, multi-gridded acoustic source location | |
Grobler et al. | Sound based localization and identification in industrial environments | |
CN111798869B (en) | Sound source positioning method based on double microphone arrays | |
CN108549052A (en) | A kind of humorous domain puppet sound intensity sound localization method of circle of time-frequency-spatial domain joint weighting | |
CN106526563B (en) | A kind of penton product battle array multi-target DOA estimation method based on cross-correlation virtual array | |
Hosseini et al. | Time difference of arrival estimation of sound source using cross correlation and modified maximum likelihood weighting function | |
Li et al. | Sound source localization for mobile robot based on time difference feature and space grid matching | |
Hu et al. | Decoupled direction-of-arrival estimations using relative harmonic coefficients | |
Suo et al. | Single acoustic vector sensor DOA enhanced by unsaturated bistable stochastic resonance with linear amplitude response constrained | |
Tourbabin et al. | Speaker localization by humanoid robots in reverberant environments | |
Astapov et al. | Simplified acoustic localization by linear arrays for wireless sensor networks | |
CN110838303B (en) | Voice sound source positioning method using microphone array | |
Georgiou et al. | Robust maximum likelihood source localization: The case for sub-Gaussian versus Gaussian | |
Nikunen et al. | Time-difference of arrival model for spherical microphone arrays and application to direction of arrival estimation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130313 |
|
CF01 | Termination of patent right due to non-payment of annual fee |