CN113687306A

CN113687306A - Multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification method

Info

Publication number: CN113687306A
Application number: CN202110580351.4A
Authority: CN
Inventors: 褚志刚; 杨咏馨; 杨洋
Original assignee: Chongqing University
Current assignee: Chongqing University
Priority date: 2021-05-26
Filing date: 2021-05-26
Publication date: 2021-11-23
Anticipated expiration: 2041-05-26
Also published as: CN113687306B

Abstract

The invention discloses a multi-frequency synchronization two-dimensional off-network compressed beam forming sound source identification method, which comprises the following steps: 1) establishing a multi-frequency sound pressure signal measurement model; 2) establishing a maximum likelihood estimation model with the sound source position and the source intensity as parameters; 3) performing preliminary estimation on the sound source position to obtain a preliminary estimation coordinate of the sound source position; 4) carrying out local optimization on the preliminary estimation coordinates of the sound source position by utilizing a Newton method to obtain optimized estimation coordinates of the sound source position; 5) carrying out global cyclic optimization on all the recognized sound source position coordinates by utilizing a Newton method; 6) performing orthogonal solution on all the recognized sound source intensities, and updating residual errors; the method provided by the invention can effectively overcome the problem of base mismatch, is compatible with a planar array with randomly arranged microphones, is suitable for both steady-state and unsteady-state sound sources, and has high spatial resolution and strong anti-noise interference capability.

Description

Multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification method

Technical Field

The invention relates to the field of sound source identification, in particular to a multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification method.

Background

The beam forming sound source identification technology based on the remote measurement in the microphone array is widely applied to the fields of automobiles, airplanes, high-speed trains, wind power generation and the like. The two-dimensional compressed beam forming method using the planar microphone array and the compressed sensing theory is concerned with due to the advantages of wide sound source identification space range, applicability to coherent and incoherent sound sources, clear imaging and the like, and is a current research hotspot. The traditional compressed beam forming method discretizes a sound source imaging area, establishes an underdetermined equation set between source intensity distribution and microphone measurement sound pressure, and solves to obtain position and intensity estimation of a sound source. However, this method has a fundamental mismatch problem and a problem of poor sound source identification performance at small separations or low signal-to-noise ratios. The base mismatch problem refers to: the sound source recognition performance is degraded when the sound source is deviated from the grid points. Although the encryption grid can alleviate the problem to some extent, the cost is increased, and the excessive encryption grid causes excessive coherence of the sensing matrix column (steering vector), which causes algebraic aliasing and degrades the sound source identification performance. Poor sound source recognition performance at small separations or low signal-to-noise ratios is also associated with the perceptual matrix array.

In order to solve the problem of base mismatch, many scholars make efforts, and three methods, namely a non-grid method, a dynamic grid method and an off-grid method are proposed to effectively solve the problem of base mismatch. To improve the sound source identification performance under small separation or low signal-to-noise ratio conditions, many scholars have attempted to use a multi-snapshot approach. The multi-snapshot method is based on time invariant hypothesis sampling of sound source signals to obtain a plurality of array signal data blocks (snapshots), all the snapshots share the same sound source sparsity, and source intensity distribution is obtained through joint sparse recovery processing. The joint processing effectively promotes the common sparsity in space and time, improves the anti-noise interference capability, and increases the spatial resolution, the sound source position estimation and the source intensity quantization precision due to the increase of the microphone sound pressure matrix rank. Whether conventional fixed-grid, non-grid, or off-grid compressed beamforming, corresponding multi-snapshot versions have been developed. However, the multi-shot method is limited by the time-invariant assumption, and is only applicable to a stationary sound source, and is not applicable to an unsteady sound source.

In summary, there has not been a method that can simultaneously overcome the above two problems, and can be applied to a planar array with arbitrarily arranged microphones and can also be applied to various sound sources (steady state, unsteady state).

Disclosure of Invention

The invention aims to provide a multi-frequency synchronization two-dimensional off-network compressed beam forming sound source identification method, which comprises the following steps:

1) and establishing a multi-frequency sound pressure signal measurement model.

The m-th microphone has the coordinate of

The coordinates of the s-th sound source are

Wherein the content of the first and second substances,

the x-axis coordinate and the y-axis coordinate of the mth microphone.

The x-axis coordinate and the y-axis coordinate of the s-th sound source are shown. h is the distance between the sound source plane and the array plane. M is 1,2, 3 …, M. S is 1,2, 3 …, S. Theoretical sound pressure at the location of the mth microphone

As follows:

in the form of matrix

Matrix array

The superscript T denotes transpose. f. of_lThe frequency corresponding to the ith frequency point. q. q.s_s,lIndicating the intensity of the s-th sound source at the l-th frequency point.

Is an imaginary unit. And c represents the speed of sound. d_m,sIndicating the distance of the mth microphone from the mth sound source. I | · | purple wind₂Representing the two-norm of the vector.

The multi-frequency sound pressure signal measurement model is as follows:

p＝A(Ω_S)q_S+n (2)

in the formula (I), the compound is shown in the specification,

and a source intensity vector formed by the source intensities of S sound sources at each frequency.

The source intensity vector for all frequencies of the s-th source.

Representing the microphone measurement noise vector. n is_·,l＝[n_1,l,n_2,l,...,n_M,l]Representing the measured noise vector of the microphone at the point corresponding to the l-th frequency point.

Representing the transfer matrix between S sound sources to all microphones at all L frequencies.

A coordinate matrix composed of coordinates of all sound source positions,

a complex set is represented.

Representing a set of real numbers. Wherein the transfer matrix between the s-th sound source to all microphones at all L frequencies

As follows:

in the formula, the frequency f_lIs first ofTransfer vector between s sound sources to all microphones

2) And establishing a maximum likelihood estimation model with the sound source position and the source intensity as parameters.

The step of establishing a maximum likelihood estimation model with the sound source position and the source intensity as parameters comprises the following steps:

2.1) minimizing the residual error of the multi-frequency Acoustic pressure Signal measurement model

Obtaining maximum likelihood estimates of sound source position and source intensity

Namely:

in the formula (I), the compound is shown in the specification,

the representation takes the real part of the complex number. The superscript H denotes the hermitian conjugate transpose operator.

2.2) establishing a likelihood function

Namely:

2.3) Source intensity vector q formed by the Source intensities of S Sound sources at respective frequencies_SFor variables, a least squares solution is established, i.e.:

q_S＝(A(Ω_S)^HA(Ω_S))^-1A(Ω_S)^Hp (6)

2.4) establishing a maximum likelihood ratio test cost function

Namely:

2.5) establishing an estimation equation of the sound source position coordinates, namely:

in the formula (I), the compound is shown in the specification,

coordinates are estimated for the sound source position.

3) And performing preliminary estimation on the sound source position to obtain a preliminary estimation coordinate of the sound source position.

The step of preliminary estimating the sound source position includes:

3.1) spatially discretizing the sound source detection into N grid points. N > M. Wherein the coordinates of the nth grid point are

Set of grid point coordinates

Wherein, the matrix

3.2) establishing a perceptual matrix of all frequency correspondences between the N grid points to the M microphones

Wherein, the sensing matrix corresponding to all frequencies from the nth grid point to the M microphones

3.3) the actual measured microphone sound pressure vector p can be expressed as:

p＝A(Ω_G)q_G+n (9)

in the formula (I), the compound is shown in the specification,

and a source intensity vector formed by source intensities of all frequencies of the N grid point positions.

Source strength vector for all frequencies of nth grid point

Representing the frequency f at the nth grid point_lThe source is strong.

3.4) updating the likelihood function, the maximum likelihood ratio test cost function and the least square function into the identification formula of the single sound source to obtain the function

Sum function

Namely:

3.5) from the set Ω_GIn selecting a maximum likelihood ratio test cost function

Maximum mesh node as sound source positionPreliminary estimated coordinates, denoted

Preliminary estimation coordinates of sound source position

As follows:

in the formula (I), the compound is shown in the specification,

representing the residual vector of the microphone sound pressure signal obtained after the first s-1 sound source estimates.

And

respectively the estimated coordinates and source strength of the jth sound source.

3.6) preliminary estimation of coordinates of the sound source position

Substituting the following formula to obtain the source intensity estimation vector of all frequencies of the s sound source, namely:

in the formula (I), the compound is shown in the specification,

vectors are estimated for the source strengths of all frequencies of the s-th source.

4) And carrying out local optimization on the preliminary estimation coordinates of the sound source position by utilizing a Newton method to obtain the optimized estimation coordinates of the sound source position.

The method for carrying out local optimization on the initial estimation coordinates of the sound source position by utilizing the Newton method comprises the following steps:

4.1) establishing a two-dimensional Newton optimization function, namely:

in the formula (I), the compound is shown in the specification,

and

respectively representing a jacobian matrix and a hessian matrix.

The estimated coordinates are optimized for the sound source position.

Jacobian matrix

Heisen matrix

Respectively as follows:

wherein the parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Respectively as follows:

in the formula, A_fRepresenting a transfer matrix at a frequency f;

4.2) optimizing and estimating coordinates of sound source position

Substituting the obtained value into the following formula to obtain the sound source intensity optimized estimation

Namely:

4.3) updating the microphone pressure residual vector p_rNamely:

in the formula (I), the compound is shown in the specification,

the transfer matrix at all frequencies between the jth source to all M microphones.

5) And carrying out global cyclic optimization on the coordinates of all the identified sound source positions by utilizing a Newton method to obtain global optimization estimated coordinates of all the identified sound source positions.

The method for carrying out global cyclic optimization on all the recognized sound source position coordinates by utilizing the Newton method comprises the following steps:

5.1) establishing a source intensity estimation vector

Transmission matrix

And harmonic source location estimation matrices

5.2) carrying out the t-th iterative optimization on the kth sound source, wherein the steps are as follows:

5.2.1) computing the transition residual vector that preserves the kth source component at the t-1 th iteration

Namely:

in the formula, t is initially 1.

5.2.2) residual transition

Substituting into the following formula to obtain the transition source intensity

Namely:

5.2.3) calculating the global optimization estimation coordinates of the sound source position

Namely:

updating microphone sound pressure residual vector

Namely:

5.3) determining the residual energy change of two continuous cycles

If yes, entering step 6), otherwise, making t equal to t +1, and returning to step 5.2);

6) the steps of solving all the identified sound source intensities orthogonally and updating the residual errors include:

6.1) optimizing the estimation coordinate set of the sound source position

Substituting into the following formula to obtain the sound source intensity estimation

Namely:

6.2) updating the microphone pressure residual vector p_rNamely:

6.3) judging whether k is more than or equal to S, if so, ending the circulation and outputting a set

And

otherwise, let k be k +1 and return to step 3.5).

The technical effect of the invention is undoubted, the method firstly establishes a maximum likelihood estimation model taking the sound source position and the intensity of each frequency source as parameters, and then utilizes the NOMP algorithm and the block sparse thought to form a BNOMP algorithm for solving to obtain the estimation of the sound source position and the intensity of each frequency source. The method provided by the invention can effectively overcome the problem of base mismatching, is compatible with a planar array with randomly arranged microphones, and is suitable for both steady-state and unsteady-state sound sources. In addition, the performance of the test piece is analyzed based on simulation and verification tests, and the result shows that: the method has high spatial resolution and strong anti-noise interference capability due to suppression of mutual interference of sound sources by global loop feedback optimization, promotion of multi-frequency joint sparse constraint on sparsity and suppression of noise interference.

Drawings

FIG. 1 is a planar microphone array measurement model;

fig. 2 shows the result of gaussian pulse signal identification (SNR 20 dB);

FIG. 2(a) is a sound source localization cloud diagram obtained by superimposing the energy of the estimated positions of the 1000-and 5000-Hz sound source according to the conventional orthogonal matching pursuit compressed beam forming method with single frequency processing;

FIG. 2(b) is a sound source localization cloud diagram obtained by superimposing the energy of the estimated positions of the 1000-5000Hz sound source according to the conventional single-frequency-processing Newton orthogonal matching tracking compressed beam forming method;

FIG. 2(c) is a sound source localization cloud diagram of an orthogonal matching pursuit compression beamforming method of multi-frequency synchronization processing;

FIG. 2(d) is a sound source localization cloud of the present invention;

fig. 2(e) shows the source strength quantization result of the conventional single-frequency orthogonal matching pursuit compressed beam forming method at each frequency;

FIG. 2(f) is a diagram illustrating the source strength quantization results of the conventional single-frequency Newton orthogonal matching pursuit compression beamforming method at each frequency;

FIG. 2(g) is a diagram illustrating the source-intensive quantization result of the conventional orthogonal matching pursuit compressed beamforming method with multi-frequency synchronization processing;

FIG. 2(h) is the source intensity quantization result at each frequency according to the present invention;

fig. 3 shows the result of gaussian pulse signal identification (SNR 5 dB);

FIG. 3(a) is a sound source localization cloud diagram obtained by superimposing the energy of the estimated positions of the 1000-and 5000-Hz sound source according to the conventional orthogonal matching pursuit compressed beam forming method with single frequency processing;

FIG. 3(b) is a sound source localization cloud chart obtained by superimposing the energy of the estimated positions of the 1000-5000Hz sound source according to the conventional single-frequency-processing Newton orthogonal matching tracking compressed beam forming method;

FIG. 3(c) is a sound source localization cloud diagram of the orthogonal matching pursuit compression beam forming method of multi-frequency synchronization processing;

FIG. 3(d) is a sound source localization cloud of the present invention;

fig. 3(e) shows the source strength quantization result of the conventional single-frequency orthogonal matching pursuit compressed beamforming method at each frequency;

FIG. 3(f) is a diagram illustrating the source strength quantization results of the conventional single-frequency Newton orthogonal matching pursuit compressed beamforming method at each frequency;

FIG. 3(g) is a diagram illustrating the source-intensive quantization result of the conventional orthogonal matching pursuit compressed beamforming method with multi-frequency synchronization processing;

FIG. 3(h) is the source intensity quantization result at each frequency according to the present invention;

fig. 4 shows the test arrangement and the amplitude spectra of the measured signals of the microphones (a) the test arrangement and (b) the amplitude spectra of the measured signals of the microphones.

Fig. 5 shows the test results.

FIG. 5(a) is a sound source localization cloud diagram obtained by superimposing the energy of the estimated positions of the 1000-and 5000-Hz sound source according to the conventional orthogonal matching pursuit compressed beam forming method with single frequency processing;

FIG. 5(b) is a sound source localization cloud chart obtained by superimposing the energy of the estimated positions of the 1000-5000Hz sound source according to the conventional single-frequency-processing Newton orthogonal matching tracking compressed beam forming method;

FIG. 5(c) is a sound source localization cloud diagram of the orthogonal matching pursuit compression beamforming method of multi-frequency synchronization processing;

FIG. 5(d) is a sound source localization cloud of the present invention;

Detailed Description

The present invention is further illustrated by the following examples, but it should not be construed that the scope of the above-described subject matter is limited to the following examples. Various substitutions and alterations can be made without departing from the technical idea of the invention and the scope of the invention is covered by the present invention according to the common technical knowledge and the conventional means in the field.

Example 1:

referring to fig. 1 to 5, the multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification method includes the following steps:

1) and establishing a multi-frequency sound pressure signal measurement model.

The m-th microphone has the coordinate of

The coordinates of the s-th sound source are

Wherein the content of the first and second substances,

the x-axis coordinate and the y-axis coordinate of the mth microphone.

As follows:

in the form of matrix

Matrix array

The multi-frequency sound pressure signal measurement model is as follows:

p＝A(Ω_S)q_S+n (2)

in the formula (I), the compound is shown in the specification,

The source intensity vector for all frequencies of the s-th source.

A coordinate matrix composed of coordinates of all sound source positions,

a complex set is represented.

As follows:

in the formula, the frequency f_lTransfer vector between the s-th sound source to all microphones

Namely:

in the formula (I), the compound is shown in the specification,

2.2) establishing a likelihood function

Namely:

q_S＝(A(Ω_S)^HA(Ω_S))-¹A(Ω_S)^Hp (6)

2.4) establishing a maximum likelihood ratio test cost function

Namely:

in the formula (I), the compound is shown in the specification,

coordinates are estimated for the sound source position.

The step of preliminary estimating the sound source position includes:

Set of grid point coordinates

Wherein, the matrix

p＝A(Ω_G)q_G+n (9)

in the formula (I), the compound is shown in the specification,

Source strength vector for all frequencies of nth grid point

Representing the frequency f at the nth grid point_lThe source is strong.

Sum function

Namely:

The largest grid node is taken as the initial estimation coordinate of the sound source position and is recorded as

Preliminary estimation coordinates of sound source position

As follows:

in the formula (I), the compound is shown in the specification,

And

3.6) preliminary estimation of coordinates of the sound source position

in the formula (I), the compound is shown in the specification,

4.1) establishing a two-dimensional Newton optimization function, namely:

in the formula (I), the compound is shown in the specification,

and

respectively representing a jacobian matrix and a hessian matrix.

The estimated coordinates are optimized for the sound source position.

Jacobian matrix

Heisen matrix

Respectively as follows:

wherein the parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Respectively as follows:

in the formula, A_fRepresenting a transfer matrix at a frequency f;

4.2) optimizing and estimating coordinates of sound source position

Namely:

4.3) updating the microphone pressure residual vector p_rNamely:

in the formula (I), the compound is shown in the specification,

5.1) establishing a source intensity estimation vector

Transmission matrix

And harmonic source location estimation matrices

Namely:

in the formula, t is initially 1.

5.2.2) residual transition

Namely:

Namely:

the arrow in the formula indicates that the value of the left symbol is updated to the right expression value.

5.2.4) updating microphone pressure residual vector

Namely:

3) determining the residual energy change of two consecutive cycles

6. the steps of solving all the identified sound source intensities orthogonally and updating the residual errors include:

6.1) optimizing the estimation coordinate set of the sound source position

Substituting into the following formula to obtain sound source intensityDegree estimation

Namely:

6.2) updating the microphone pressure residual vector p_rNamely:

And

otherwise, let k be k +1 and return to step 3.5).

Example 2:

a multi-frequency synchronization two-dimensional off-network compressed beam forming sound source identification method comprises the following steps:

1) near-field measurements were performed on S broadband sound sources using a random microphone array with M microphones, the layout being shown in fig. 1, where "·" represents a microphone and "·" represents a point sound source. Let the cartesian coordinate system origin be at the array center and the XOY plane coincide with the array plane, assuming that the sound source plane is parallel to the array plane and at a distance h. The m-th microphone has the coordinate of

And order

The coordinates of the s-th sound source are

And order

The superscript "T" denotes the transpose operator. Considering a plurality of frequencies, the ith frequency point is denoted as f_lThe corresponding frequency f at the mth microphone position_lTheoretical sound pressure p of_m,lCan be expressed as:

in the formula, q_s,lThe intensity of the ith frequency of the s-th sound source (expressed by the sound pressure generated by the sound source at 1m from it),

is an imaginary unit, c represents the speed of sound, d_m,sRepresents the distance between the mth microphone and the mth sound source, | · | | survival₂Representing the two-norm of the vector. From equation (1), the frequency f can be constructed_lTransfer vector between the s-th sound source to all microphones

(ii) a Further, the transfer matrix between the s-th sound source to all microphones at all L frequencies may be constructed

The transfer matrix between all S sound sources to all M microphones at all L frequencies can be expressed as:

wherein the content of the first and second substances,

a coordinate matrix composed of all S sound source position coordinates,

the complex set is represented as a complex set,

representing a set of real numbers, superscript "^ML×1、^ML×SL"etc. represent matrix dimensions.

In consideration of noise interference, sound pressure components of sound pressure signals received by all M microphones at L frequencies form microphone sound pressure quantities in actual measurement

(wherein,

indicating that all microphones are at frequency f_lMeasured sound pressure, p_m,lIndicating the m-th microphone at frequency f_lThe actual measured sound pressure signal) can be expressed as:

p＝A(Ω_S)q_S+n (2)

wherein the content of the first and second substances,

a source intensity vector representing the source intensities of all S sound sources at the respective frequencies,

the source intensity vectors for all frequencies of the s-th source,

representing the microphone measurement noise vector. Defining a frequency of f_lSignal to noise ratio of time is SNR_l＝20log₁₀(||p_·,l-n_·,l||₂/||n_·,l||₂)。

2) Mathematical model building

By minimizing the residual error of the multi-frequency measurement model (equation (2)))

Maximum likelihood estimates of sound source position and source intensity can be obtained:

in the formula (I), the compound is shown in the specification,

representing the real part of the complex number, superscript "^H"denotes the Hermite conjugate transpose operator. Defining the likelihood function as:

synchronous solving is carried out so that

Maximum omega_SAnd q is_SDifficult to realize, so only q_SAs variables, their least squares solution can be obtained:

q_S＝(A(Ω_S)^HA(Ω_S))^-1A(Ω_S)^Hp (5)

substituting equation (6) to obtain the maximum likelihood ratio test cost function

Estimation of the coordinates of the sound source position

Comprises the following steps:

estimating coordinates

Substituting equation (7) can obtain the corresponding source strength estimation

3) Preliminary estimation of sound source position coordinates

Direct simultaneous estimation in the entire two-dimensional continuous plane makes all sound sources with the largest cost function difficult to implement. Therefore, the solution process is simplified in two ways: (1) discretizing the continuous plane to obtain a rough sound source position through preliminary estimation; (2) all sound sources are solved one by one (steps 2, 3, 4, 5 below all describe the identification and updating for the s-th sound source).

For this purpose, the target sound source area is discretized into N (N > M) grid points, and the coordinates of the nth grid point are

Order to

Constructing a set of grid point coordinates

Combining the sensing matrices of all L frequencies from all N grid points to all M microphones

(wherein,

then, the actual measured microphone sound pressure vector p can be expressed as:

p＝A(Ω_G)q_G+n (7)

wherein the content of the first and second substances,

a source intensity vector formed by the source intensities of all L frequencies representing all N grid point positions (where,

the source intensity vectors for all L frequencies at the nth grid point,

representing the frequency f at the nth grid point_lStrong source of time).

In addition, to solve all sound sources one by one, the likelihood function, the cost function, and the least square function for source intensity estimation are respectively degenerated into identification formulas for a single sound source (s-th sound source as an example):

through two simplifications, the problem then turns into: from set omega based on multi-frequency synchronous networked compressed beamforming model (equation (4))_GIn selecting a maximum likelihood ratio test cost function

Preliminary estimation of maximum mesh node as sound source position coordinate

I.e. from the block containing N a priori blocks A (r)_Gn) N1.., N, a perceptual matrix a (Ω)_G) In search of cost function

Maximum one prior block:

estimating the position coordinates of the sound source:

wherein the content of the first and second substances,

And

respectively the estimated coordinates and source strength of the jth sound source. Preliminarily estimating the coordinates of the sound source position

Substituting into formula (10) can obtain the source intensity estimation vector corresponding to all frequencies of the sound source

4) The Newton method carries out local optimization on the position coordinate of the s-th sound source which is roughly estimated, and carries out Newton optimization on the initial estimation in a local continuous two-dimensional plane, so that the likelihood function can be further converged to the current local maximum value, and more accurate sound source position and intensity estimation can be further obtained. Specifically, two-dimensional newton optimization is expressed as follows:

wherein the content of the first and second substances,

and

the jacobian matrix and the hessian matrix respectively have the following specific expressions:

wherein the content of the first and second substances,

obtaining an optimized estimate of the location coordinates of a sound source

The sound source intensity estimates may then be updated in sequence according to equation (10)

By using

Updating microphone sound pressure residual vector p_r。

5) Global circulation Newton optimization for improving positioning precision

Global loop optimization provides feedback on all identified sound sources according to the current residuals, giving them the opportunity to re-optimize.Initialization based on all identified sound sources

And

(where

k

1, 2.. and s are the identified sound source indices), a sound source position estimation matrix is constructed

Source strength estimation vector

And a transfer matrix

Cyclically subjecting it to Newton optimization in turn and updating the set

Where t represents the number of cycles. The process of optimizing the kth sound source in the t-th cycle is described as follows:

to optimize the kth source, a transition residual vector is first defined that retains the last kth source component by subtracting the removed component

The projection on the corresponding basis of the kth sound source is obtained back by adding, i.e.

Residual of transition

Substituting equation (10) can obtain the transition source intensity

At this time, the optimized sound source position can be obtained by further using the Newton's optimization formula (14)

Source strengths can be updated using equation (10)

And updating the residual

And the above-mentioned set

Circularly and sequentially optimizing each identified sound source until residual energy changes of two continuous cycles

6) Source-strong orthogonal solution

Because the group formed by the atoms corresponding to each sound source is not a group of orthogonal groups, the obtained residual error always has a component which is not orthogonal to the atoms corresponding to the optimized identified sound source. To remove these components, an orthogonal solution update of all frequency source intensities of the identified sound source is required. Based on the set formed by terminating after undergoing T times of global loop optimization in the step (4)

The source intensity of all frequency components of all identified sound sources can be updated using equation (5) and using equation

Updating residual p_r。

All S sound source position estimates can be obtained through S times of four-step circulation

And all frequency source strength estimation thereof

Example 3:

simulation and experimental verification of the multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification method comprise the following steps:

1) in order to analyze and verify the sound source identification performance of the method (marked as BNOMP-CB), simulation tests are carried out on the method, and the method is compared with the existing orthogonal matching pursuit compressed beam forming method (marked as OMP-CB) of single-frequency processing, the Newton orthogonal matching pursuit compressed beam forming method (marked as NOMP-CB) and the orthogonal matching pursuit compressed beam forming method (marked as BOMP-CB) of multi-frequency synchronous processing.

The measurement array is the Bruel&

(

Denmark) 36-channel sector wheel array with a diameter of 0.65m and an average microphone spacing of 0.1m, geometrically arranged as in fig. 1. The array is parallel to the sound source plane and at a distance of 1m, the sound source plane is divided into grids with a pitch of 0.05m, and the number of grid points is 21 × 21 ═ 441. Gaussian pulse sound sources are used as sound sources to be identified, and the sound source identification performance of the four methods is contrastively analyzed. Suppose that there are three Gaussian pulse sound sources S1, S2 and S3 at (-0.04, -0.2) m, (0.08, -0.2) m and (-0.2,0.32) m, respectively, with half-peak bandwidths of 0.0001S and a peak time interval of 0.001S between the sources. The sound pressure signals at each microphone of the array are subjected to Fast Fourier Transform (FFT) to obtain Fourier spectra of the signals, wherein the sampling frequency is 102400Hz and the frequency resolution is 50 Hz. FIG. 2 shows the sound source identification results obtained by processing 81 frequencies from 1000Hz to 5000Hz with the above method when the SNR is 20 dB. Fig. 3 shows the recognition result at a corresponding signal-to-noise ratio of 5 dB. In the two groups of results, subgraphs (a) - (d) are cloud images for each method, and subgraphs (e) - (h) are results of each method for the respective frequency source of all sound sources.

For sound source S3, fig. 2(a) -2(d) show: the OMP-CB method has relatively dispersed positioning at each frequency and the worst positioning precision; the NOMP-CB method has relatively centralized frequency positioning and higher positioning precision; the BOMP-CB method sound source is positioned to nearby grid points, namely the positioning precision of the grid (base) is limited due to the existence of mismatch of the constraint bases; and the BNOMP-CB obtains accurate positioning. The source intensity curves in fig. 2(e) -2(h) show: the 4 methods can accurately quantize the intensity of each frequency source of the sound source S3. For the two sound sources S1 and S2 that are closer together, the OMP-CB method only achieves correct quantization at frequencies above 4500Hz, which is related to the fact that the OMP-CB method correctly separates the sound sources S1 and S2 at frequencies above 4500 Hz; the NOMP-CB method can accurately separate and correctly quantize the two sound sources when the frequency is higher than 2500 Hz; the BOMP-CB method does not obtain accurate quantification of the source intensities of the two sound sources within the frequency band of interest of 1000-; the BNOMP-CB method realizes the accurate separation and positioning of the two sound sources and obtains the accurate source strength estimation of the two sound sources in the whole frequency band of the concerned frequency band of 1000-plus 5000 Hz. In summary, the BNOMP-CB method has a spatial resolution far superior to that of the other 3 methods, has good applicability to transient sound source identification, and can accurately locate a sound source and obtain accurate quantification of the intensity of each frequency source of the sound source.

Comparing the recognition results of the methods in the case of the signal-to-noise ratio of 20dB in FIG. 2 and the recognition results of the methods in the case of the signal-to-noise ratio of 5dB in FIG. 3, it can be seen that the positioning results of the sound source at each frequency by the OMP-CB and NOMP-CB methods become more dispersed as the signal-to-noise ratio decreases; the positioning results obtained by the BOMP-CB and BNOMP-CB methods are unchanged and slightly changed. In addition, due to the introduction of strong noise, the source intensity quantization accuracy of all sound sources is reduced by the 4 methods. As can be seen from the positioning and quantization results of the sound sources S1 and S2, the NOMP-CB method still fails to accurately separate the two sound sources when the frequency is higher than 2500Hz when the signal-to-noise ratio is 5dB, but obtains more accurate strong source quantization when the frequency is close to 4500Hz, and the quantization curve presents a rule similar to that of the OMP-CB method. This phenomenon indicates that strong noise interference reduces the spatial resolution of the NOMP-CB method. While the quantization cases of the BOMP-CB and BNOMP-CB methods are similar to each other at a signal-to-noise ratio of 20dB except for the increase of fluctuation, the introduction of the multi-frequency synchronization processing makes the sound source identification performance of the two methods relatively robust to noise interference, and therefore, the proposed BNOMP-CB method also enjoys good sound source identification performance at a low signal-to-noise ratio.

2) Verification of comparative experiment

Three small speaker sources emitting random noise (Abramtek Type M5, Shenzhen, China) were tested for identification outdoors. The array and sound source arrangement is shown in FIG. 4(a), where the array is a Bruel with a 0.65m diameter microphone average pitch of 0.1m&

(Type 8608,

Denmark)36 channel planar microphone array with three loudspeakers arranged in a plane parallel to and 1m from the array plane. During the test, a PULSE data acquisition and analysis system (Bruel) is adopted&

Type 3660C,

Denmark) synchronously acquires sound pressure signals at 36 microphones, and the sampling frequency is 16384 Hz. And performing Fast Fourier Transform (FFT) on the sound pressure signal acquired by each microphone by using BKConnect to obtain a Fourier spectrum of the signal, wherein the frequency resolution is 50 Hz. Fig. 4(b) shows the amplitude spectrum of each microphone measurement signal.

The method comprises the following steps of adopting an orthogonal matching pursuit compression beam forming method (marked as OMP-CB) of single frequency processing, a Newton orthogonal matching pursuit compression beam forming method (marked as NOMP-CB) of single frequency processing, an orthogonal matching pursuit compression beam forming method (marked as BOMP-CB) of multi-frequency synchronous processing and the method invention (marked as BNOMP-CB) to carry out positioning identification on a small loudspeaker sound source, and comparing identification results:

fig. 5 is a sound source localization cloud image at [1000,1050., 5000] Hz (L ═ 81) broadband. As can be seen from fig. 5(a) (b), the respective frequency localization results of the OMP-CB method are scattered, while the localization results obtained by the NOMP-CB method are slightly more concentrated than those of the OMP-CB method, but both methods have low localization accuracy at many frequencies and estimate more false sound sources far from the sound source position. The BOMP-CB and BNOMP-CB methods are viewed reversely, and both methods can correctly position each sound source. In particular, for the left two sources, which are relatively close together, the OMP-CB and NOMP-CB methods fail to separate them correctly at many frequencies, whereas both the BOMP-CB and BNOMP-CB methods succeed in separating the two sources and obtaining accurate position estimates. In addition, the positioning results of the BOMP-CB and the BNOMP-CB on the left lower sound source and the right upper sound source are compared, the BNOMP-CB effectively overcomes the problem of base mismatching due to the introduction of Newton optimization, and positioning superior to that of the BOMP-CB is obtained. In conclusion, test cases show that the proposed BNOMP method has good sound source identification performance due to the overcoming of Newton optimization to the base mismatching problem and the promotion of multi-frequency joint sparse constraint to sparsity.

Claims

1. The method for identifying the sound source formed by the multi-frequency synchronous two-dimensional off-network compressed wave beams is characterized by comprising the following steps of:

1) and establishing the multi-frequency sound pressure signal measurement model.

2) Establishing a maximum likelihood estimation model with the sound source position and the source intensity as parameters;

3) performing preliminary estimation on the sound source position to obtain a preliminary estimation coordinate of the sound source position;

4) carrying out local optimization on the preliminary estimation coordinates of the sound source position by utilizing a Newton method to obtain optimized estimation coordinates of the sound source position;

5) carrying out global cyclic optimization on all the recognized sound source position coordinates by utilizing a Newton method to obtain global optimization estimation coordinates of all the recognized sound source positions;

6) and carrying out orthogonal solving on all the recognized sound source intensities, and updating the residual error according to the obtained sound source intensities.

2. The method of claim 1, wherein the multi-frequency synchronous two-dimensional off-network compressed beam forming sound source identification model is as follows:

p＝A(Ω_S)q_S+n (1)

in the formula (I), the compound is shown in the specification,

source intensity vectors representing source intensities of the S sound sources at each frequency;

source intensity vectors of all frequencies of the s-th sound source;

representing a microphone measurement noise vector; n is_·,l＝[n_1,l,n_2,l,...,n_M,l]Representing a measured noise vector corresponding to the microphone at the l-th frequency point;

a transfer matrix representing all L frequencies between S sound sources to all microphones;

a coordinate matrix composed of coordinates of all sound source positions,

representing a complex set;

representing a set of real numbers; p is the actually measured microphone sound pressure vector;

wherein the mth microphone has the coordinate of

The coordinates of the s-th sound source are

Wherein the content of the first and second substances,

the x-axis coordinate and the y-axis coordinate of the mth microphone;

the x-axis coordinate and the y-axis coordinate of the s-th sound source are obtained; h is the distance between the sound source plane and the array plane; m is 1,2, 3 …, M; s is 1,2, 3 …, S; m, S are the number of microphones and sound sources, respectively;

distance d between mth microphone and s sound source_m,sAs follows:

in the formula, vector

Vector

Superscript T denotes transpose; i | · | purple wind₂A two-norm representation of a vector;

a transfer matrix representing all L frequencies between the s-th source to all microphones is shown below:

in the formula (I), the compound is shown in the specification,

is a frequency f_lTransmission of the s-th sound source to all microphonesVector increment quantity

3. The method of claim 2, wherein the step of establishing a maximum likelihood estimation model with the sound source position and the source intensity as parameters comprises:

1) minimizing residual errors of a multi-frequency acoustic pressure signal measurement model

Namely:

in the formula (I), the compound is shown in the specification,

representing the real part of the complex number; superscript H represents the hermitian conjugate transpose operator;

2) establishing a likelihood function

Namely:

3) source intensity vector q formed by source intensities of S sound sources at various frequencies_SFor variables, a least squares solution is established, i.e.:

q_S＝(A(Ω_S)^HA(Ω_S))^-1A(Ω_S)^Hp (6)

4) establishing a maximum likelihood ratio test cost function

Namely:

5) establishing a sound source position coordinate estimation equation, namely:

in the formula (I), the compound is shown in the specification,

coordinates are estimated for the sound source position.

4. The method of claim 3, wherein the step of performing the preliminary estimation of the sound source location comprises:

1) dispersing a sound source detection space into N grid points; n > M; wherein the coordinates of the nth grid point are

Set of grid point coordinates

Wherein, the vector

2) Establishing a sensing matrix of all frequency correspondences between N grid points and M microphones

3) The actual measured microphone sound pressure vector p is represented as follows:

p＝A(Ω_G)q_G+n (9)

in the formula (I), the compound is shown in the specification,

a source intensity vector formed by source intensities of all frequencies of N grid point positions is represented;

source strength vector for all frequencies of nth grid point

Representing the frequency f at the nth grid point_lThe source of time is strong;

4) updating likelihood function, maximum likelihood ratio test cost function and least square function as identification formula for single sound source to obtain function

Sum function

Namely:

5) from the set omega_GIn selecting a maximum likelihood ratio test cost function

Preliminary estimation coordinates of sound source position

As follows:

in the formula (I), the compound is shown in the specification,

a residual vector representing the microphone sound pressure signal obtained after the first s-1 sound source estimation;

and

6) Preliminarily estimating coordinates of sound source position

in the formula (I), the compound is shown in the specification,

5. The method of claim 1, wherein the step of locally optimizing the preliminary estimated coordinates of the sound source location by newton's method comprises:

1) establishing a two-dimensional Newton optimization function, namely:

in the formula (I), the compound is shown in the specification,

and

respectively representing a Jacobian matrix and a Hessian matrix;

optimizing and estimating coordinates for the sound source position;

jacobian matrix

Heisen matrix

Respectively as follows:

wherein the parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Parameter item

Respectively as follows:

in the formula, A_fRepresenting a transfer matrix at a frequency f;

2) optimizing and estimating coordinates of sound source position

Namely:

3) updating microphone sound pressure residual vector p_rNamely:

in the formula (I), the compound is shown in the specification,

6. The method of claim 1, wherein the step of performing a global loop optimization on all the identified sound source position coordinates by using newton's method comprises:

1) establishing a source strength estimation vector

Transmission matrix

And harmonic source location estimation matrices

2) And (3) carrying out the t iterative optimization on the kth sound source, wherein the steps are as follows:

2.1) computing the transition residual vector of the kth sound source component at the time of retaining the t-1 th iteration

Namely:

in the formula, the initial value of t is 1;

2.2) residual transition

Namely:

2.3) calculating the global optimization estimated coordinates of the sound source position

Namely:

2.4) updating microphone pressure residual vector

Namely:

3) determining the residual energy change of two consecutive cycles

And if so, performing orthogonal solution on all the recognized sound source intensities and updating the residual error, otherwise, making t equal to t +1, and returning to the step 2).

7. The method of claim 1, wherein the method for recognizing a sound source by multi-frequency synchronous two-dimensional off-network compressed beam forming comprises: the steps of solving all the identified sound source intensities orthogonally and updating the residual errors include:

1) optimizing and estimating coordinate set of sound source position

Namely:

2) updating microphone sound pressure residual vector p_rNamely:

3) judgmentIf the K is greater than or equal to S, if so, ending the circulation and outputting the set

And

otherwise, let k be k +1 and from the set Ω_GIn selecting a maximum likelihood ratio test cost function

And the largest grid node is used as a preliminary estimation coordinate of the sound source position.