CN109188362B - Microphone array sound source positioning signal processing method - Google Patents
Microphone array sound source positioning signal processing method Download PDFInfo
- Publication number
- CN109188362B CN109188362B CN201811019390.1A CN201811019390A CN109188362B CN 109188362 B CN109188362 B CN 109188362B CN 201811019390 A CN201811019390 A CN 201811019390A CN 109188362 B CN109188362 B CN 109188362B
- Authority
- CN
- China
- Prior art keywords
- grid point
- microphone
- sound source
- value
- calculating
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S5/00—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
- G01S5/18—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
- G01S5/22—Position of source determined by co-ordinating a plurality of position lines defined by path-difference measurements
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Radar, Positioning & Navigation (AREA)
- Remote Sensing (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
The invention provides a microphone array sound source positioning method, which comprises the following steps: step 1) dividing the estimated sound source position into Q grid points in the measurement space, wherein the three-dimensional coordinate of each grid point isSampling the M microphone signals to calculate a grid pointTime delay differences to two different microphone signals; step 2) collecting current frame data of M microphone channels, and calculating time delay values of microphone pairs; calculating a weighted value w of the q-th grid point based on the delay value and the delay difference of the step 1)q(ii) a Then, the SRP-PHAT value p of the q grid point is calculatedqFinding w in Q grid pointsqpqGrid point corresponding to the maximum value ofThereby obtaining the grid point coordinates of the frame data corresponding to the estimated sound source positionThe invention can solve the problem that the positioning accuracy of the SRP-PHAT method in the prior art is seriously and rapidly reduced under the influence of environmental noise and reverberation conditions.
Description
Technical Field
The invention belongs to the technical field of audio signal processing and array signal processing, and particularly relates to a microphone array sound source positioning signal processing method.
Background
Current microphone array localization algorithms fall broadly into three broad categories, namely time difference of arrival (TDOA) -based localization, controlled response power (SRP), and high resolution spectral estimation-based algorithms. Algorithms based on high-resolution spectral estimation were initially applied to the localization of narrowband sources and were later increasingly referenced by numerous scholars' transformations to the broadband source localization problem. When the method is expanded to broadband signal estimation, the signal frequency needs to be divided into a plurality of sub-bands in the frequency domain, or frequency focusing is carried out to convert the frequency into a narrow-band signal processing mode. The algorithm has high positioning resolution, but the algorithm operation amount is greatly increased due to the conversion from a broadband to a narrow band, and the performance is sharply reduced in practice because the number of sound sources is unknown and the noise environment does not meet the ideal Gaussian white noise condition.
The core of a time difference of arrival (TDOA) -based localization algorithm is the accurate estimation of acoustic propagation delay, which is generally obtained by performing cross-correlation or generalized cross-correlation on signals between microphones. Finally, the position of the sound source is determined by applying a geometric algorithm. The directional algorithm based on the arrival time difference has relatively small computation amount, good real-time performance and low hardware cost, so the method attracts attention and becomes a method widely adopted in sound source orientation. In the method, whether the time delay estimation value is accurate or not determines whether the sound source positioning is accurate or not, and the environmental noise and the indoor reverberation have certain influence on the accuracy.
The SRP method divides the space into a grid, each grid has a hypothetical sound source, the time delay difference from each hypothetical sound source to a pair of microphones at a designated position can be calculated, the cross-correlation values corresponding to the time delay differences of all the microphones are summed to obtain the response power, and the hypothetical sound source position corresponding to the maximum value of the response power is the estimated value of the real sound source position. The sound source positioning method (SRP-PHAT) combining controllable response power and phase transformation combines the inherent robustness and short-time analysis characteristics of the controllable response power method with the insensitivity of the phase transformation method to the signal surrounding environment in time delay estimation, so that a sound source positioning system has certain noise resistance and reverberation resistance. However, the SRP-PHAT method has a sharp performance degradation in a severe environment (large noise interference and serious reverberation effect).
Disclosure of Invention
The invention aims to solve the problem that the positioning accuracy of the SRP-PHAT method in the prior art is seriously and rapidly reduced under the influence of environmental noise and reverberation conditions.
In order to achieve the above object, the present invention discloses a microphone array sound source localization signal processing method, which includes:
step 1) dividing the estimated sound source position into Q grid points in the measurement space, wherein the three-dimensional coordinate of each grid point isSampling the M microphone signals to calculate a grid pointTime delay differences to two different microphone signals;
step 2) collecting current frame data of M microphone channels, and calculating time delay values of microphone pairs; calculating a weighted value w of the q-th grid point based on the delay value and the delay difference of the step 1)q(ii) a Then, the SRP-PHAT value p of the q grid point is calculatedqFinding w in Q grid pointsqpqGrid point corresponding to the maximum value ofThereby obtaining the grid point coordinates of the frame data corresponding to the estimated sound source position
As a modification of the above method, the step 1) includes:
step 1-1) setting a microphone array consisting of M microphones to be distributed in a three-dimensional space, wherein the coordinates of each microphone are
Step 1-2) dividing all possible positions of a sound source into Q grid points in a measurement space, wherein the three-dimensional coordinates of the grid points are
Step 1-3) each microphone corresponds to a channel, and the sampling frequency of the signal is set as fsEach frame has a sampling length of L per channel and a sampling signal of x per channeli1(n), i1 ═ 1, …, M, n ═ 1, …, L; the number of Fourier transform points is equal to 2L-1;
Where i2 is 1, …, M, i2 ≠ i1, and c is the speed of sound.
As a modification of the above method, the step 2) includes:
step 2-1) calculating each microphone channel signal x respectivelyi1(n), i1 ═ 1, …, M, n ═ 1, …, 2L-1 point fast fourier transform of L to obtain Xi1(k),i1=1,…,M,k=1,…,2L-1;
Step 2-2) calculating the phase transformation PHAT cross-correlation value R of the i1 th and i2 th microphone channelsi1i2(l):
Wherein, Xi1(k) Is the i1 th channel receiving signal xi1(n), i1 ═ 1, …, M, n ═ 1, …, frequency domain representation of L, the number of points calculated by the fast fourier transform FFT is 2L-1; xi2(k) Is the i2 th channel receiving signal xi2(n), i2 ═ 1, …, M, n ═ 1, …, the frequency domain representation of L,is Xi2(k) Conjugation of (1); i Xi1(k) I is Xi1(k) The magnitude of (d); 1, …, L;
step 2-3) according to Ri1i2(l) Calculate the firstTime delay value between i1 and i2 microphone channels
Step 2-4) calculating Delta taui1i2(q) andthe standard deviation between them obtains the weighted value w of each grid pointq:
Step 2-5) calculating a controllable response power-phase transformation SRP-PHAT value p of each grid pointq;
Step 2-6) calculating the weighted controllable response power-phase transformation SRP-PHAT value w of the qth grid pointqpqAt Q number of wqpqFind the maximum value among them, according to wqpqGet the corresponding grid point
Step 2-7) according to wqpqGrid point corresponding to maximum valueObtaining the sound source position corresponding to the frame data
The invention has the advantages that:
1. the invention discloses a microphone array sound source positioning signal processing method, which adopts the technical scheme of weighted SRP-PHAT sound source positioning signal processing, and uses the reciprocal of the standard difference between the time delay estimated by a PHAT cross-correlation value and the correct time delay value corresponding to a search point as the weighted value of the SRP-PHAT value to calculate the response power of a space grid point;
2, the relative time delay value of the sound source position and the microphone is more similar to the time delay value obtained by calculation of the PHAT cross-correlation method, and the response power value is larger;
3. the invention can solve the problem that the positioning accuracy of the SRP-PHAT method in the prior art is seriously and rapidly reduced under the influence of environmental noise and reverberation conditions.
Drawings
FIG. 1 is a flow chart of a signal processing method according to the present invention.
Detailed Description
The invention is described in detail below with reference to the figures and specific embodiments.
Setting a microphone array composed of M microphones distributed in a three-dimensional space, wherein the coordinates of each microphone areAccording to the requirement of the system on estimation precision, all possible positions of the sound source can be simplified into grid points of a three-dimensional grid in a measurement space. Suppose a total division into Q grid points with coordinates ofLet the sampling rate of the signal be fsThe length of each frame per channel sample is L.
The weighted SRP-PHAT sound source positioning method determines the estimation value of the sound source position by searching the position with the maximum weighted SRP-PHAT value in the grid
wherein the PHAT cross-correlation value Ri1i2(Δτi1i2(q)) the calculation formula is as follows:
wherein, Xi1(k) Is the i1 th channel receiving signal xi1(n), i1 ═ 1, …, M, n ═ 1, …, frequency domain representation of L, the number of points FFT calculated is 2L-1; xi2(k) Is the i2 th channel receiving signal xi2(n), i2 ═ 1, …, M, n ═ 1, …, the frequency domain representation of L,is Xi2(k) Conjugation of (1); i Xi1(k) I is Xi1(k) The magnitude of (d); 1, …, L;
wherein, Δ τi1i2(q) is the grid pointThe delay difference to the i1 th and i2 th channels is calculated as:
where i2 is 1, …, M, i2 ≠ i1, and c is the speed of sound.
Weighted value wqThe calculation formula of (a) is as follows:
examples
Setting a microphone array composed of M microphones distributed in a three-dimensional space, wherein the coordinates of each microphone areAccording to the requirement of the system on estimation precision, all possible positions of the sound source can be simplified into grid points of a three-dimensional grid in a measurement space. Suppose a total division into Q grid points with coordinates ofEach microphone corresponds to a channel, and the sampling frequency of the signal is set as fsAnd the sampling length per channel of each frame is L and is marked as xi1(n), i1 ═ 1, …, M, n ═ 1, …, L. The number of Fourier transform points is equal to 2L-1.
As shown in fig. 1, the signal processing method disclosed by the present invention specifically comprises the following steps:
step 1) calculating the time delay difference from each grid point to the position of the microphone by using a formula (4) according to the position coordinates of the microphone and the coordinates of the searched grid points, and storing for later use. This step is performed only once;
and 2) processing each frame of data to obtain the estimation of the frame of data on the position of the sound source.
The specific steps of each frame of data processing are as follows:
step 2-1) calculating each channel signal x respectivelyi1(n), 2L-1 point Fast Fourier Transform (FFT) of L, i1 ═ 1, …, M, n ═ 1, …, and X is obtainedi1(k),i1=1,…,M,k=1,…,2L-1;
Step 2-2) calculating the PHAT cross-correlation value R of the signals of all channel microphones according to the formula (3)i1i2(l);
Step 2-3) the PHAT cross-correlation value R is used according to the formula (6)i1i2(τ) calculating delay estimates between all channel pairs
Step 2-4) calculating Delta tau according to formula (5)i1i2(q) andthe standard deviation between them obtains the weighted value w of each grid pointq;
Step 2-5) calculating the SRP-PHAT value p of each grid point according to the formula (2)q;
Step 2-6) calculating weighted SRP-PHAT values p of all grid points according to formula (1)qFinding out the grid point corresponding to the maximum value
Step 2-7) according to wqpqGrid point corresponding to maximum valueObtaining the sound source position corresponding to the frame data
The invention discloses a weighted SRP-PHAT microphone array sound source positioning signal processing method, which uses the reciprocal of the standard difference between the time delay estimated by a PHAT cross-correlation value and the correct time delay value corresponding to a search point as the weighted value of an SRP-PHAT value to calculate the response power of a space grid point. The guiding idea is that if the grid point is the correct sound source position, the relative time delay value of the grid point and the microphone pair is closer to the time delay value calculated by the PHAT cross-correlation method, and the response power value of the point is larger. By adopting the method, the accuracy of sound source positioning can be further improved.
Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention and are not limited. Although the present invention has been described in detail with reference to the embodiments, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted without departing from the spirit and scope of the invention as defined in the appended claims.
Claims (3)
1. A microphone array sound source localization method, comprising:
step 1) dividing the estimated sound source position into Q grid points in the measurement space, wherein the three-dimensional coordinate of each grid point isSampling the M microphone signals to calculate a grid pointTime delay differences to two different microphone signals;
step 2) collecting current frame data of M microphone channels, and calculating time delay values of microphone pairs; calculating a weighted value w of the q-th grid point based on the delay value and the delay difference of the step 1)q(ii) a Then, the SRP-PHAT value p of the q grid point is calculatedqFinding w in Q grid pointsqpqGrid point corresponding to the maximum value ofThereby obtaining the grid point coordinates of the frame data corresponding to the estimated sound source position
2. The microphone array sound source localization method according to claim 1, wherein the step 1) includes:
step 1-1) setting a microphone array consisting of M microphones to be distributed in a three-dimensional space, wherein the coordinates of each microphone are
Step 1-2) dividing all possible positions of a sound source into Q grid points in a measurement space, wherein the three-dimensional coordinates of the grid points are
Step 1-3) each microphone corresponds to a channel, and the sampling frequency of the signal is set as fsEach frame has a sampling length of L per channel and a sampling signal of x per channeli1(n), i1 ═ 1, …, M, n ═ 1, …, L; the number of Fourier transform points is equal to 2L-1;
Where i2 is 1, …, M, i2 ≠ i1, and c is the speed of sound.
3. The microphone array sound source localization method according to claim 2, wherein the step 2) includes:
step 2-1) calculating each microphone channel signal x respectivelyi1(n), i1 ═ 1, …, M, n ═ 1, …, 2L-1 point fast fourier transform of L to obtain Xi1(k),i1=1,…,M,k=1,…,2L-1;
Step 2-2) calculating the phase transformation PHAT cross-correlation value R of the i1 th and i2 th microphone channelsi1i2(l):
Wherein, Xi1(k) Is the i1 th channel receiving signal xi1(n), i1 ═ 1, …, M, n ═ 1, …, frequency domain representation of L, the number of points calculated by the fast fourier transform FFT is 2L-1; xi2(k) Is the i2 th channel receiving signal xi2(n), i2 ═ 1, …, M, n ═ 1, …, the frequency domain representation of L,is Xi2(k) Conjugation of (1); i Xi1(k) I is Xi1(k) The magnitude of (d); 1, …, L;
step 2-3) according to Ri1i2(l) Calculating a time delay value between the i1 th and i2 th microphone channels
Step 2-4) calculating Delta taui1i2(q) andthe standard deviation between them obtains the weighted value w of each grid pointq:
Step 2-5) calculating a controllable response power-phase transformation SRP-PHAT value p of each grid pointq;
Step 2-6) calculating the weighted controllable response power-phase transformation SRP-PHAT value w of the qth grid pointqpqAt Q number of wqpqFind the maximum value among them, according to wqpqGet the corresponding grid point
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811019390.1A CN109188362B (en) | 2018-09-03 | 2018-09-03 | Microphone array sound source positioning signal processing method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811019390.1A CN109188362B (en) | 2018-09-03 | 2018-09-03 | Microphone array sound source positioning signal processing method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109188362A CN109188362A (en) | 2019-01-11 |
CN109188362B true CN109188362B (en) | 2020-09-08 |
Family
ID=64917807
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811019390.1A Active CN109188362B (en) | 2018-09-03 | 2018-09-03 | Microphone array sound source positioning signal processing method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109188362B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110082725B (en) * | 2019-03-12 | 2023-02-28 | 西安电子科技大学 | Microphone array-based sound source positioning time delay estimation method and sound source positioning system |
CN111445920B (en) * | 2020-03-19 | 2023-05-16 | 西安声联科技有限公司 | Multi-sound source voice signal real-time separation method, device and pickup |
CN111650559B (en) * | 2020-06-12 | 2022-11-01 | 深圳市裂石影音科技有限公司 | Real-time processing two-dimensional sound source positioning method |
CN112379330B (en) * | 2020-11-27 | 2023-03-10 | 浙江同善人工智能技术有限公司 | Multi-robot cooperative 3D sound source identification and positioning method |
CN113470682B (en) * | 2021-06-16 | 2023-11-24 | 中科上声(苏州)电子有限公司 | Method, device and storage medium for estimating speaker azimuth by microphone array |
CN116047413B (en) * | 2023-03-31 | 2023-06-23 | 长沙东玛克信息科技有限公司 | Audio accurate positioning method under closed reverberation environment |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101762806B (en) * | 2010-01-27 | 2013-03-13 | 华为终端有限公司 | Sound source locating method and apparatus thereof |
KR101767925B1 (en) * | 2012-07-26 | 2017-08-18 | 한화테크윈 주식회사 | Apparatus and method for estimating location of sound source |
CN105044675B (en) * | 2015-07-16 | 2017-09-08 | 南京航空航天大学 | A kind of Fast implementation of SRP auditory localizations |
WO2017129239A1 (en) * | 2016-01-27 | 2017-08-03 | Nokia Technologies Oy | System and apparatus for tracking moving audio sources |
CN107102296B (en) * | 2017-04-27 | 2020-04-14 | 大连理工大学 | Sound source positioning system based on distributed microphone array |
-
2018
- 2018-09-03 CN CN201811019390.1A patent/CN109188362B/en active Active
Non-Patent Citations (4)
Title |
---|
Multichannel Audio Processing for Speaker Localization, Separation and Enhancement;Amparo Martí Guerola;《Universitat Politècnica de València doctoral thesis》;20131231;全文 * |
Speaker Localization and Detection in Videoconferencing Environments Using a Modified SRP-PHAT Algorithm;A. Marti,et al;《Waves》;20111231;全文 * |
SRP-PHAT的改进算法综述;袁晓坤,等;《电声技术》;20121031;第36卷(第10期);全文 * |
基于分布式麦克风阵列的声源定位算法;蔡卫平,等;《计算机应用与软件》;20140531;第31卷(第5期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN109188362A (en) | 2019-01-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109188362B (en) | Microphone array sound source positioning signal processing method | |
CN104076331B (en) | A kind of sound localization method of seven yuan of microphone arrays | |
CN110082725B (en) | Microphone array-based sound source positioning time delay estimation method and sound source positioning system | |
CN107102296B (en) | Sound source positioning system based on distributed microphone array | |
WO2020042708A1 (en) | Time-frequency masking and deep neural network-based sound source direction estimation method | |
CN104898091B (en) | Microphone array self calibration sonic location system based on iteration optimization algorithms | |
CN108375763B (en) | Frequency division positioning method applied to multi-sound-source environment | |
CN107644650B (en) | Improved sound source positioning method based on progressive serial orthogonalization blind source separation algorithm and implementation system thereof | |
CN103308889A (en) | Passive sound source two-dimensional DOA (direction of arrival) estimation method under complex environment | |
CN111123192A (en) | Two-dimensional DOA positioning method based on circular array and virtual extension | |
CN110534126B (en) | Sound source positioning and voice enhancement method and system based on fixed beam forming | |
CN108957403B (en) | Gaussian fitting envelope time delay estimation method and system based on generalized cross correlation | |
CN111798869B (en) | Sound source positioning method based on double microphone arrays | |
CN107167770A (en) | A kind of microphone array sound source locating device under the conditions of reverberation | |
CN105607042A (en) | Method for locating sound source through microphone array time delay estimation | |
CN109031261B (en) | Time difference estimation method and device | |
CN109212481A (en) | A method of auditory localization is carried out using microphone array | |
CN104811886A (en) | Phase difference measurement-based microphone array direction finding method | |
Dang et al. | A feature-based data association method for multiple acoustic source localization in a distributed microphone array | |
CN206114888U (en) | Pronunciation sound source goniometer system | |
CN103837858A (en) | Far field direction of arrival estimation method applied to plane array and system thereof | |
KR20090128221A (en) | Method for sound source localization and system thereof | |
CN110007276B (en) | Sound source positioning method and system | |
Liu et al. | Research on acoustic source localization using time difference of arrival measurements | |
CN103778288A (en) | Ant colony optimization-based near field sound source localization method under non-uniform array noise condition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |