CN112666520A - Method and system for positioning time-frequency spectrum sound source with adjustable response - Google Patents

Method and system for positioning time-frequency spectrum sound source with adjustable response Download PDF

Info

Publication number
CN112666520A
CN112666520A CN202011492942.8A CN202011492942A CN112666520A CN 112666520 A CN112666520 A CN 112666520A CN 202011492942 A CN202011492942 A CN 202011492942A CN 112666520 A CN112666520 A CN 112666520A
Authority
CN
China
Prior art keywords
frequency
positioning
representing
time
energy distribution
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011492942.8A
Other languages
Chinese (zh)
Other versions
CN112666520B (en
Inventor
聂鹏飞
贾彩琴
刘宾
王黎明
韩焱
余尚江
陈晋央
周会娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
North University of China
Original Assignee
North University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North University of China filed Critical North University of China
Priority to CN202011492942.8A priority Critical patent/CN112666520B/en
Publication of CN112666520A publication Critical patent/CN112666520A/en
Application granted granted Critical
Publication of CN112666520B publication Critical patent/CN112666520B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)

Abstract

The invention provides a method and a system for positioning a time-frequency spectrum sound source with adjustable response.

Description

Method and system for positioning time-frequency spectrum sound source with adjustable response
Technical Field
The invention relates to the technical field of sound source positioning, in particular to a method and a system for positioning a sound source with a response-adjustable time-frequency spectrum.
Background
The sound source positioning is widely applied to many fields, and the current sound source positioning methods can be divided into two types in terms of positioning principle, wherein one type is a two-step positioning method based on arrival time/arrival time difference estimation, and the other type is a one-step positioning method based on beam forming. The positioning accuracy of the two-step positioning method is limited by the time difference estimation accuracy and the positioning algorithm performance. High accuracy moveout/moveout estimation requires high quality data and therefore the two step positioning method is poor in interference rejection. The one-step positioning method can make full use of time, energy and frequency information of signals, and is strong in anti-interference energy, high in positioning accuracy and wide in application range. However, the above methods only consider the time information of the array signal, and do not consider the constraint of the frequency information on the positioning, which results in inaccurate positioning. Although the existing one-step positioning method, such as SRP-PHAT, has a certain anti-noise performance, the method utilizes the full-band information of the signal, and due to noise interference, a large number of local extrema exist in the energy distribution of the positioning space, which results in the problems of reduced sound source positioning accuracy, low convergence speed in the optimization solution process, and the like.
Disclosure of Invention
The invention aims to provide a method and a system for positioning a time-frequency spectrum sound source with adjustable response, which are used for overcoming the technical defects of low positioning precision and low positioning speed in the conventional positioning method and improving the positioning precision and the positioning speed.
In order to achieve the purpose, the invention provides the following scheme:
a method for tunable response time-frequency spectrum sound source localization, the localization method comprising the steps of:
calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array;
performing time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient;
obtaining positioning space energy distribution containing signal frequency information by utilizing a delay and sum beam forming principle according to the time-frequency spectrum of each cross-correlation coefficient;
determining a positioning characteristic vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in a frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning characteristic matrix;
and clustering and weighting fusion are carried out on the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result.
Optionally, the obtaining, according to the time-frequency spectrum of each cross-correlation coefficient, a positioning space energy distribution including signal frequency information by using a principle of delay-sum beam forming specifically includes:
according to the time-frequency spectrum of each cross-correlation coefficient, the positioning space energy distribution containing signal frequency information is obtained by utilizing the principle of delay summation beam forming, and the positioning space energy distribution is as follows:
Figure BDA0002841259090000021
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure BDA0002841259090000028
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure BDA0002841259090000029
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
Optionally, the determining, in a constrained maximum likelihood estimation manner, a positioning feature vector corresponding to a maximum value of energy distribution in a positioning space of vibration signals with different frequencies within a frequency constraint range, and constructing a positioning feature matrix specifically includes:
solving formula by using constrained maximum likelihood estimation mode
Figure BDA0002841259090000022
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM];
Wherein the content of the first and second substances,
Figure BDA0002841259090000023
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amRepresenting the frequency f of the vibration signalmThe corresponding location feature vector is then used to locate,
Figure BDA0002841259090000024
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure BDA0002841259090000025
Figure BDA0002841259090000026
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure BDA0002841259090000027
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
Optionally, the clustering and weighting fusion of the localization feature vectors in the localization feature matrix is performed to obtain a sound source localization result, and the method specifically includes:
taking the positioning feature vector with the vibration signal frequency closest to the main frequency in the positioning feature matrix as a main clustering center;
performing two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class in which the main clustering center is located as a data fusion sample set;
using formulas
Figure BDA0002841259090000031
Performing weighted fusion on each positioning feature vector in the data fusion sample set to obtain a source positioning result;
where Φ represents the source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure BDA0002841259090000032
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
A tunable response time-frequency spectral sound source localization system, the localization system comprising:
the cross-correlation coefficient calculation module is used for calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array;
the time-frequency transformation module is used for respectively carrying out time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient;
the positioning space energy distribution determining module is used for obtaining positioning space energy distribution containing signal frequency information by utilizing the principle of delay summation beam forming according to the cross-correlation coefficient after each time frequency transformation;
the positioning feature matrix construction module is used for determining a positioning feature vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in the frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning feature matrix;
and the sound source positioning module is used for clustering and weighting the positioning characteristic vectors in the positioning characteristic matrix to obtain a sound source positioning result.
Optionally, the positioning spatial energy distribution determining module specifically includes:
and the positioning space energy distribution determining submodule is used for obtaining positioning space energy distribution containing signal frequency information according to the time-frequency spectrum of each cross-correlation coefficient by utilizing the principle of delay summation beam forming, and comprises the following steps:
Figure BDA0002841259090000033
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure BDA0002841259090000034
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure BDA0002841259090000049
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
Optionally, the module for constructing the location feature matrix specifically includes:
a positioning feature matrix construction submodule for solving the formula by adopting a constraint maximum likelihood estimation mode
Figure BDA0002841259090000041
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM];
Wherein the content of the first and second substances,
Figure BDA0002841259090000042
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amIndicating vibrationFrequency f of the signalmThe corresponding location feature vector is then used to locate,
Figure BDA0002841259090000043
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure BDA0002841259090000044
Figure BDA0002841259090000045
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure BDA0002841259090000046
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
Optionally, the sound source positioning module specifically includes:
the main clustering center determining submodule is used for taking the positioning feature vector with the vibration signal frequency closest to the main frequency in the positioning feature matrix as a main clustering center;
the data fusion sample set acquisition submodule is used for carrying out two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class where the main clustering center is located as a data fusion sample set;
a weighted fusion submodule for utilizing the formula
Figure BDA0002841259090000047
Performing weighted fusion on each positioning feature vector in the data fusion sample set to obtain a source positioning result;
where Φ represents the source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure BDA0002841259090000048
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
Compared with the prior art, the invention has the beneficial effects that:
the invention provides a method and a system for positioning a time-frequency spectrum sound source with adjustable response, wherein the positioning method comprises the following steps: calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array; performing time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient; obtaining positioning space energy distribution containing signal frequency information by utilizing a delay and sum beam forming principle according to the time-frequency spectrum of each cross-correlation coefficient; determining a positioning characteristic vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in a frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning characteristic matrix; and clustering and weighting fusion are carried out on the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result. According to the invention, the positioning space energy distribution containing signal frequency information is obtained by performing time-frequency transformation on the cross-correlation coefficient and utilizing the principle of delay summation beam forming, so that the obtained positioning space energy distribution has frequency information, then the maximum likelihood estimation is restricted, a positioning characteristic matrix is constructed, the technical defect of inaccurate positioning caused by the fact that the frequency information restriction is not considered is overcome, and the positioning precision and speed are further improved by adopting a clustering and weighting fusion mode.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.
FIG. 1 is a flow chart of a method for time-frequency-spectrum sound source localization with adjustable response according to the present invention;
fig. 2 is a schematic diagram of a method for locating a tunable response time-frequency spectrum sound source according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention aims to provide a method and a system for positioning a time-frequency spectrum sound source with adjustable response, which are used for overcoming the technical defects of low positioning precision and low positioning speed in the conventional positioning method and improving the positioning precision and the positioning speed.
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
As shown in fig. 1 and 2, the present invention provides a method for localization of a time-frequency-spectrum sound source with adjustable response, the localization method comprising the steps of:
step 101, calculating the cross-correlation coefficient of the vibration signals collected by every two sensors in the sensor array.
Collecting vibration data generated by sound source by array sensor, calculating cross correlation R of signals of sensor pair (i, j)i,j(τ),
Figure BDA0002841259090000061
Where j denotes the jth sensor, t denotes the time of acquisition of the signal, τ denotes the time delay, si(t) represents the signal collected by the ith sensor, sj(t + τ) represents the acquired signal delay signal of the jth sensor.
And 102, respectively carrying out time-frequency transformation on each cross correlation coefficient to obtain a time-frequency spectrum of the cross correlation coefficient.
The SRP-PHAT (weighted controlled response power sound source positioning of phase transformation) method which can be obtained by the formula (1) is the positioning method with the best anti-noise and anti-multipath interference performance at present, and the positioning principle is as follows:
Figure BDA0002841259090000062
where P (x) represents the localization spatial energy distribution,
Figure BDA0002841259090000063
is the time difference between the calculated spatial position x to the ith and jth sensors, wherein x represents the spatial position of the sound source, | · | | survival2Is the 2 norm (distance) of the vector and v is the wave propagation velocity. Maximizing the spatial position corresponding to P (x)
Figure BDA0002841259090000064
Is the location of the sound source. Although the SRP-PHAT method has certain anti-noise performance, the method utilizes the full-frequency-band information of signals, and a large number of local extrema exist in P (x) due to noise interference, so that the positioning accuracy of a sound source is reduced, the convergence speed in the optimization solving process is low, and the like.
In order to overcome the defects of the SRP-PHAT method, the invention firstly carries out the treatment on Ri,j(tau) performing time-frequency transformation, and then obtaining an adjustable spatial response spectrum by utilizing delay-sum beam forming in a time-frequency domain. Ri,j(τ) time-frequency transform with S-transform:
Figure BDA0002841259090000065
wherein the content of the first and second substances,
Figure BDA0002841259090000066
representing a signal sj(t) synchronized pressureThe S-scaling transformation is carried out,
Figure BDA0002841259090000067
representing the cross-correlation Ri,j(τ) time-frequency transformation.
And 103, obtaining positioning space energy distribution containing signal frequency information by utilizing a delay and sum beam forming principle according to the time-frequency spectrum of each cross-correlation coefficient.
Step 103, obtaining the positioning space energy distribution containing the signal frequency information by using the principle of delay and sum beam forming according to the cross-correlation coefficient after each time-frequency transformation, specifically including: according to the cross-correlation coefficient after each time frequency transformation, the positioning space energy distribution containing signal frequency information is obtained by utilizing the principle of delay summation beam forming, and the positioning space energy distribution is as follows:
Figure BDA0002841259090000071
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure BDA0002841259090000072
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure BDA0002841259090000073
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
Specifically, let the position of the ith sensor be xi=[xi,yi,zi]TFrom the principle of delay-sum beamforming, one can obtain:
Figure BDA0002841259090000074
the tunable spatial response spectrum represented by equation (3) contains not only the spatial position variable x but also a function of the signal frequency f. Thus, better positioning performance can be obtained by restricting the frequency f of the signal.
And 104, determining a positioning feature vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in the frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning feature matrix.
Step 104, determining a positioning feature vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in the frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning feature matrix, which specifically comprises the following steps: solving formula by using constrained maximum likelihood estimation mode
Figure BDA0002841259090000075
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM](ii) a Wherein the content of the first and second substances,
Figure BDA0002841259090000076
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amRepresenting the frequency f of the vibration signalmThe corresponding location feature vector is then used to locate,
Figure BDA0002841259090000077
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure BDA0002841259090000081
Figure BDA0002841259090000082
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure BDA0002841259090000083
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
Specifically, when the sensor array collects signals, the sensor array is often influenced by the surrounding environment, and particularly for complex environments, the signal-to-noise ratio of the collected signals is low. Meanwhile, the acquired array signals are often in a main frequency band, and not all frequency bands are useful information. Therefore, better positioning results can be obtained by constraining the signal frequency f so that only the adjustable spatial response spectral energy of the signal main frequency band is considered in the formula (3) maximization process. Carrying out spectrum analysis on the array signal to obtain the frequency band range [ f ] of the array signalmin,fmax]And main frequency f0It is used as a constraint to solve the maximum likelihood estimate of P (x, f). The maximization problem can be converted into a constraint optimization problem
Figure BDA0002841259090000084
fiRepresents that is ofmin,fmax]At a certain frequency of the range, the above-mentioned constrained optimization problem can be solved by group intelligent optimization methods such as genetic algorithm, etc., so as to obtain a series of positioning results corresponding to the frequency band range and their corresponding characteristics, which are expressed as
A=[a1,a2,...,ai,...aM] (5)
Wherein a isiIs derived from the frequency f within the frequency bandiIts corresponding positioning space position
Figure BDA0002841259090000085
And its corresponding spatially tunable response spectral energy EiConstructed vectors, i.e.
Figure BDA0002841259090000086
Wherein the content of the first and second substances,
Figure BDA0002841259090000087
the above method obtains a series of samples, possibly of the sound source position, which contain both spatial position information and frequency and energy information. This information is in fact the key information available for positioning. The acquisition of the information can provide a basis for subsequent high-precision fusion positioning.
And 105, clustering and weighting the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result.
Step 105, performing clustering and weighted fusion on the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result, which specifically includes: taking the positioning feature vector with the vibration signal frequency closest to the main frequency in the positioning feature matrix as a main clustering center; performing two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class in which the main clustering center is located as a data fusion sample set; using formulas
Figure BDA0002841259090000091
Performing weighted fusion on each positioning feature vector in the data fusion sample set to obtain a source positioning result; where Φ represents the sound source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure BDA0002841259090000092
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
Specifically, step 104 has already obtained the key information for determining the positioning performance, but still cannot simply obtain high precision therefromAnd as a result of the accurate positioning, data mining is required to realize high-precision positioning. Although the positioning result within the signal frequency band is obtained in step 104, the frequencies of the signals collected by different sensors, which can represent the positioning performance, are not completely consistent due to the difference of the propagation paths of the waves to the different sensors. Meanwhile, the spatial position corresponding to the maximum spatial response spectral energy due to noise interference may not be the best positioning result. Based on the above two points, the invention firstly searches the A middle and dominant frequency f0Closest sample
Figure BDA0002841259090000093
(fmClosest to f0) As a clustering center of the dynamic clustering, the obtained sample set A is subjected to two-classification dynamic clustering, and the removed clustering center is not amKeep the cluster center as amThe class (denoted as B) is used as a sample for data fusion. Calculate each sample a in BnAnd amInverse beta of the Euclidean distance of (1)n
Figure BDA0002841259090000094
Wherein d isnm=||an-am||2,anE.g. B. From betanConstructing a weighting factor w for a weighted fusionn
Figure BDA0002841259090000095
Using the constructed weighting coefficients, for sample a in BnAnd performing weighted fusion to obtain a final positioning vector phi.
Figure BDA0002841259090000096
The spatial position corresponding to phi is the final positioning result.
The invention also provides a system for positioning a spectral sound source in adjustable response, which comprises:
the cross-correlation coefficient calculation module is used for calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array;
the time-frequency transformation module is used for respectively carrying out time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient;
the positioning space energy distribution determining module is used for obtaining positioning space energy distribution containing signal frequency information by utilizing the principle of delay summation beam forming according to the time-frequency spectrum of each cross-correlation coefficient;
the positioning spatial energy distribution determining module specifically includes: and the positioning space energy distribution determining submodule is used for obtaining positioning space energy distribution containing signal frequency information according to the cross-correlation coefficient after each time-frequency transformation by utilizing the principle of delay summation beam forming:
Figure BDA0002841259090000101
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure BDA0002841259090000102
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure BDA0002841259090000103
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
And the positioning feature matrix construction module is used for determining a positioning feature vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in the frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning feature matrix.
The positioning feature matrix building module specifically comprises: a positioning feature matrix construction submodule for solving the formula by adopting a constraint maximum likelihood estimation mode
Figure BDA0002841259090000104
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM];
Wherein the content of the first and second substances,
Figure BDA0002841259090000105
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amRepresenting the frequency f of the vibration signalmThe corresponding location feature vector is then used to locate,
Figure BDA0002841259090000106
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure BDA0002841259090000107
Figure BDA0002841259090000108
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure BDA0002841259090000109
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
And the sound source positioning module is used for clustering and weighting the positioning characteristic vectors in the positioning characteristic matrix to obtain a sound source positioning result.
The sound source positioning module specifically comprises: a main clustering center determining submodule for determining the location feature matrixThe positioning feature vector with the vibration signal frequency closest to the main frequency is used as a main clustering center; the data fusion sample set acquisition submodule is used for carrying out two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class where the main clustering center is located as a data fusion sample set; a weighted fusion submodule for utilizing the formula
Figure BDA0002841259090000111
Performing weighted fusion on each positioning feature vector in the data fusion sample set to obtain a source positioning result; where Φ represents the source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure BDA0002841259090000112
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
Compared with the prior art, the invention has the beneficial effects that:
the invention provides a method and a system for positioning a time-frequency spectrum sound source with adjustable response, wherein the positioning method comprises the following steps: calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array; performing time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient; obtaining positioning space energy distribution containing signal frequency information by utilizing a delay and sum beam forming principle according to the time-frequency spectrum of each cross-correlation coefficient; determining a positioning characteristic vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in a frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning characteristic matrix; and clustering and weighting fusion are carried out on the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result. According to the invention, the positioning space energy distribution containing signal frequency information is obtained by performing time-frequency transformation on the cross-correlation coefficient and utilizing the principle of delay summation beam forming, so that the obtained positioning space energy distribution has frequency information, then the maximum likelihood estimation is restricted, a positioning characteristic matrix is constructed, the technical defect of inaccurate positioning caused by the fact that the frequency information restriction is not considered is overcome, and the positioning precision and speed are further improved by adopting a clustering and weighting fusion mode.
The equivalent embodiments in the present specification are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts between the equivalent embodiments can be referred to each other.
The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In summary, this summary should not be construed to limit the present invention.

Claims (8)

1. A method for locating a time-frequency-spectrum sound source with adjustable response is characterized by comprising the following steps:
calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array;
performing time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient;
obtaining positioning space energy distribution containing signal frequency information by utilizing a delay and sum beam forming principle according to the time-frequency spectrum of each cross-correlation coefficient;
determining a positioning characteristic vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in a frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning characteristic matrix;
and clustering and weighting fusion are carried out on the positioning feature vectors in the positioning feature matrix to obtain a sound source positioning result.
2. The method according to claim 1, wherein the obtaining a localization spatial energy distribution containing signal frequency information according to the time-frequency spectrum of each cross-correlation coefficient by using the principle of delay-sum beam forming specifically comprises:
according to the time-frequency spectrum of each cross-correlation coefficient, the positioning space energy distribution containing signal frequency information is obtained by utilizing the principle of delay summation beam forming, and the positioning space energy distribution is as follows:
Figure FDA0002841259080000011
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure FDA0002841259080000012
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure FDA0002841259080000013
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
3. The method for positioning an adjustable-response time-frequency spectrum sound source according to claim 2, wherein the determining a positioning feature vector corresponding to a maximum value of energy distribution in a positioning space of vibration signals with different frequencies within a frequency constraint range by using a constraint maximum likelihood estimation method to construct a positioning feature matrix specifically comprises:
solving formula by using constrained maximum likelihood estimation mode
Figure FDA0002841259080000014
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM];
Wherein the content of the first and second substances,
Figure FDA0002841259080000021
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amRepresenting the frequency f of the vibration signalmThe corresponding location feature vector is then used to locate,
Figure FDA0002841259080000022
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure FDA0002841259080000023
Figure FDA0002841259080000024
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure FDA0002841259080000025
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
4. The method for positioning a sound source with an adjustable response time-frequency spectrum according to claim 1, wherein the clustering and weighted fusion of the positioning feature vectors in the positioning feature matrix is performed to obtain a sound source positioning result, and specifically comprises:
taking the positioning feature vector with the vibration signal frequency closest to the main frequency in the positioning feature matrix as a main clustering center;
performing two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class in which the main clustering center is located as a data fusion sample set;
using formulas
Figure FDA0002841259080000026
Carrying out weighted fusion on each positioning feature vector in the data fusion sample set to obtain a sound source positioning result;
where Φ represents the sound source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure FDA0002841259080000027
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
5. A system for tunable response-time spectral sound source localization, the localization system comprising:
the cross-correlation coefficient calculation module is used for calculating the cross-correlation coefficient of the vibration signals acquired by every two sensors in the sensor array;
the time-frequency transformation module is used for respectively carrying out time-frequency transformation on each cross-correlation coefficient to obtain a time-frequency spectrum of the cross-correlation coefficient;
the positioning space energy distribution determining module is used for obtaining positioning space energy distribution containing signal frequency information by utilizing the principle of delay summation beam forming according to the time-frequency spectrum of each cross-correlation coefficient;
the positioning feature matrix construction module is used for determining a positioning feature vector corresponding to the maximum value of the energy distribution of the positioning space of the vibration signals with different frequencies in the frequency constraint range by adopting a constraint maximum likelihood estimation mode, and constructing a positioning feature matrix;
and the sound source positioning module is used for clustering and weighting the positioning characteristic vectors in the positioning characteristic matrix to obtain a sound source positioning result.
6. The system for tunable response-time spectral sound source localization according to claim 5, wherein the localization spatial energy distribution determining module specifically comprises:
and the positioning space energy distribution determining submodule is used for obtaining positioning space energy distribution containing signal frequency information according to the cross-correlation coefficient after each time-frequency transformation by utilizing the principle of delay summation beam forming:
Figure FDA0002841259080000031
where P (x, f) denotes the localized spatial energy distribution, τ (x) denotes the time difference between the spatial position x to the ith and jth sensors, f denotes the vibration signal frequency,
Figure FDA0002841259080000032
represents the cross-correlation coefficient R after the (i, j) th time-frequency transformationi,jThe time-frequency transform function value of (1); si(t) represents the vibration signal collected by the ith sensor,
Figure FDA0002841259080000033
representing the vibration signal s collected by the jth sensorj(t + τ (x)) with t representing the time at which the ith sensor acquired the vibration signal and N representing the number of sensors in the sensor array.
7. The system for spectral sound source localization according to claim 5, wherein the localization feature matrix constructing module specifically comprises:
a positioning feature matrix construction submodule for solving the formula by adopting a constraint maximum likelihood estimation mode
Figure FDA0002841259080000034
Obtaining a positioning feature matrix as A ═ a1,a2,...,am,...aM];
Wherein the content of the first and second substances,
Figure FDA0002841259080000035
representing the frequency of the vibration signal as fmPosition vector, P (x, f), of the maximum of the temporal localization spatial energy distributionm) Representing the frequency of the vibration signal as fmTemporal localization spatial energy distribution, amRepresenting the frequency f of the vibration signalmThe corresponding location feature vector is then used to locate,
Figure FDA0002841259080000036
Emrepresenting the frequency f of the vibration signalmThe corresponding spatially tunable response spectral energy,
Figure FDA0002841259080000037
Figure FDA0002841259080000038
representing the frequency of the vibration signal as fmLocation vector in temporal localization spatial energy distribution
Figure FDA0002841259080000039
Energy of (f) of (d)min,fmax]Indicating the frequency band range, fminRepresenting the minimum effective frequency, fmaxRepresenting the maximum effective frequency.
8. The system for tunable response-time spectral sound source localization according to claim 5, wherein the sound source localization module specifically comprises:
the main clustering center determining submodule is used for taking the positioning feature vector with the vibration signal frequency closest to the main frequency in the positioning feature matrix as a main clustering center;
the data fusion sample set acquisition submodule is used for carrying out two-classification dynamic clustering on the positioning feature vector in the positioning feature matrix by using the main clustering center to obtain a class where the main clustering center is located as a data fusion sample set;
a weighted fusion submodule for utilizing the formula
Figure FDA0002841259080000041
Performing weighted fusion on each positioning feature vector in the data fusion sample set to obtain a source positioning result;
where Φ represents the source localization result, wnRepresenting the nth location feature vector a in the data fusion sample setnThe weight of (a) is determined,
Figure FDA0002841259080000042
βnrepresenting the inverse of the Euclidean distance, β, of the nth location feature vector in the data fusion sample set from the primary cluster centerlAnd L represents the quantity of the positioning feature vectors in the data fusion sample set.
CN202011492942.8A 2020-12-17 2020-12-17 Method and system for positioning time-frequency spectrum sound source with adjustable response Active CN112666520B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011492942.8A CN112666520B (en) 2020-12-17 2020-12-17 Method and system for positioning time-frequency spectrum sound source with adjustable response

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011492942.8A CN112666520B (en) 2020-12-17 2020-12-17 Method and system for positioning time-frequency spectrum sound source with adjustable response

Publications (2)

Publication Number Publication Date
CN112666520A true CN112666520A (en) 2021-04-16
CN112666520B CN112666520B (en) 2022-09-13

Family

ID=75404485

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011492942.8A Active CN112666520B (en) 2020-12-17 2020-12-17 Method and system for positioning time-frequency spectrum sound source with adjustable response

Country Status (1)

Country Link
CN (1) CN112666520B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113884986A (en) * 2021-12-03 2022-01-04 杭州兆华电子有限公司 Beam focusing enhanced strong impact signal space-time domain joint detection method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090149132A1 (en) * 2007-12-10 2009-06-11 Trueposition, Inc. Detection of Time of Arrival of CDMA Signals in a Wireless Location System
CN104747912A (en) * 2015-04-23 2015-07-01 重庆邮电大学 Fluid conveying pipe leakage acoustic emission time-frequency positioning method
CN106959433A (en) * 2017-05-09 2017-07-18 上海微小卫星工程中心 STFT IRT method for parameter estimation based on RLBI
US20200029157A1 (en) * 2017-12-26 2020-01-23 Aselsan Elektronik Sanayi Ve Ticaret Anonim Sirketi Method for acoustic detection of shooter location
CN111175698A (en) * 2020-01-18 2020-05-19 国网山东省电力公司菏泽供电公司 Transformer noise source positioning method, system and device based on sound and vibration combination

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090149132A1 (en) * 2007-12-10 2009-06-11 Trueposition, Inc. Detection of Time of Arrival of CDMA Signals in a Wireless Location System
CN101821645A (en) * 2007-12-10 2010-09-01 真实定位公司 Detect the time of arrival of the CDMA signal in the wireless location system
CN104747912A (en) * 2015-04-23 2015-07-01 重庆邮电大学 Fluid conveying pipe leakage acoustic emission time-frequency positioning method
CN106959433A (en) * 2017-05-09 2017-07-18 上海微小卫星工程中心 STFT IRT method for parameter estimation based on RLBI
US20200029157A1 (en) * 2017-12-26 2020-01-23 Aselsan Elektronik Sanayi Ve Ticaret Anonim Sirketi Method for acoustic detection of shooter location
CN111175698A (en) * 2020-01-18 2020-05-19 国网山东省电力公司菏泽供电公司 Transformer noise source positioning method, system and device based on sound and vibration combination

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BIN LIU,ET AL: "Steered sample algorithm for acoustic source localization", 《PLOS ONE》 *
PENGFEI NIE,ET AL: "SRP-PHAR Combined Velocity Scanning for Locating the Shallow Underground Acoustic Source", 《IEEE ACESS》 *
THOMAS PADOIS: "Acoustic source localization based on the generalized cross-correlation and the generalized mean with few microphones", 《ACOUSTICAL SOCIETY OF AMERICA》 *
张帅等: "S变换时频域滤波方法在主动源资料处理中的应用研究", 《地震研究》 *
郑晓亮等: "基于延迟求和的输气管道泄漏声波定位方法", 《仪器仪表学报》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113884986A (en) * 2021-12-03 2022-01-04 杭州兆华电子有限公司 Beam focusing enhanced strong impact signal space-time domain joint detection method and system

Also Published As

Publication number Publication date
CN112666520B (en) 2022-09-13

Similar Documents

Publication Publication Date Title
Dvorkind et al. Time difference of arrival estimation of speech source in a noisy and reverberant environment
CN107644650B (en) Improved sound source positioning method based on progressive serial orthogonalization blind source separation algorithm and implementation system thereof
CN109839612A (en) Sounnd source direction estimation method based on time-frequency masking and deep neural network
CN112565119B (en) Broadband DOA estimation method based on time-varying mixed signal blind separation
CN109557504B (en) Method for positioning near-field narrow-band signal source
CN111798869A (en) Sound source positioning method based on double microphone arrays
Ahmad et al. Wideband DOA estimation based on incoherent signal subspace method
CN112666520B (en) Method and system for positioning time-frequency spectrum sound source with adjustable response
Durrani et al. Eigenfilter approaches to adaptive array processing
Stoica et al. Direction-of-arrival estimation of an amplitude-distorted wavefront
CN103837858A (en) Far field direction of arrival estimation method applied to plane array and system thereof
Hu et al. Decoupled direction-of-arrival estimations using relative harmonic coefficients
Weiss et al. Direction-of-arrival estimation using MODE with interpolated arrays
CN110212966A (en) Mutual coupling of antenna bearing calibration based on importance resampling under the conditions of a kind of coherent source
Niu et al. Mode separation with one hydrophone in shallow water: A sparse Bayesian learning approach based on phase speed
Nesta et al. Enhanced multidimensional spatial functions for unambiguous localization of multiple sparse acoustic sources
Jo et al. Robust localization of early reflections in a room using semi real-valued EB-ESPRIT with three recurrence relations and laplacian constraint
Wang et al. Off-grid doa estimation based on alternating iterative weighted least squares for acoustic vector hydrophone array
CN110940999A (en) Self-adaptive unscented Kalman filtering method based on error model
CN113640891B (en) Singular spectrum analysis-based transient electromagnetic detection data noise filtering method
CN101645701B (en) Time delay estimation method based on filter bank and system thereof
Hu et al. Evaluation and comparison of three source direction-of-arrival estimators using relative harmonic coefficients
CN110361696B (en) Closed space sound source positioning method based on time reversal technology
Wang et al. Sparse spectrum fitting algorithm using signal covariance matrix reconstruction and weighted sparse constraint
Bella et al. A new sparse blind source separation method for determined linear convolutive mixtures in time-frequency domain

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Nie Pengfei

Inventor after: Jia Caiqin

Inventor after: Liu Bin

Inventor after: Wang Liming

Inventor after: Han Yan

Inventor before: Nie Pengfei

Inventor before: Jia Caiqin

Inventor before: Liu Bin

Inventor before: Wang Liming

Inventor before: Han Yan

Inventor before: Yu Shangjiang

Inventor before: Chen Jinyang

Inventor before: Zhou Huijuan

GR01 Patent grant
GR01 Patent grant