CN113419216B - Multi-sound source positioning method suitable for reverberant environment - Google Patents

Multi-sound source positioning method suitable for reverberant environment

Info

Publication number
CN113419216B
CN113419216B CN202110684270.9A CN202110684270A CN113419216B CN 113419216 B CN113419216 B CN 113419216B CN 202110684270 A CN202110684270 A CN 202110684270A CN 113419216 B CN113419216 B CN 113419216B
Authority
CN
China
Prior art keywords
sound source
coordinates
group
microphone
coordinate
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110684270.9A
Other languages
Chinese (zh)
Other versions
CN113419216A (en)
Inventor
胡秋岑
吴礼福
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Information Science and Technology
Original Assignee
Nanjing University of Information Science and Technology

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G01S5/20 Position of source determined by a plurality of spaced direction-finders
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/23 Clustering techniques
    • G06F18/232 Non-hierarchical techniques
    • G06F18/2321 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213 Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00 Reducing energy consumption in communication networks
    • Y02D30/70 Reducing energy consumption in communication networks in wireless communication networks

Abstract

The invention provides a multi-sound source localization method suitable for reverberant environments. The method comprises: grouping all coordinates of the whole search area and calculating the center coordinates of each group; collecting voice signals with a 16-microphone array; locating one sound source with a multi-sound source localization algorithm based on two-level search space clustering (TL-SSC) and removing the coordinates near that sound source from the search area; and repeating this operation until all sound sources are located. The method addresses real-time localization of multiple sound sources under reverberation. Compared with other multi-sound source localization methods, it requires fewer microphones, improves computational efficiency while maintaining high localization accuracy, and meets the real-time requirements of mobile robot applications.

Description

Multi-sound source positioning method suitable for reverberant environment
Technical Field
The invention relates to the technical field of sound source localization, and in particular to a multi-sound source localization method suitable for reverberant environments.
Background
Multi-sound source localization is in wide demand in real-time systems such as video conferencing, speech recognition, and mobile service robots, and is a research hotspot in acoustic signal processing. For example, when a mobile robot provides real-time intelligent service, a multi-sound source localization method determines the position of the voice and guides the robot to complete the service. Existing sound source localization methods fall into three main types: subspace-based methods, methods based on steerable beamforming, and methods based on time difference of arrival (TDOA).
Subspace-based methods receive signals at each microphone, exploit the orthogonality of the signal subspace and the noise subspace to construct a spatial spectrum function, and search for spectral peaks to obtain the sound source direction. They achieve high localization accuracy, but they place high demands on the stability of the source signal and perform poorly in small spaces.
Steerable-beamforming methods change the direction in which the microphone array receives signals and select the direction with the maximum received power as the sound source direction. The principle is simple and the computational load is small, but noise robustness is poor, ambient noise information must be acquired in advance, and real-time localization is difficult to guarantee.
TDOA-based methods determine the sound source position from the differences in sound path from the source to each microphone. Their computational complexity is generally lower than that of the other two types, their localization accuracy is relatively high, and real-time requirements are easy to meet.
Disclosure of Invention
The prior art falls short in applications that place high demands on the real-time performance of the algorithm, such as indoor mobile robots: in a small space, computational efficiency must be improved while localization accuracy is maintained as far as possible. To address this problem, the invention provides a multi-sound source localization method suitable for reverberant environments. The method is based on two-level search space clustering (Two-Levels Search Space Clustering, TL-SSC) and uses 16 microphones. It improves the computational efficiency of the system through coordinate grouping, real-time two-level search, cluster screening, and threshold judgment, and it achieves real-time localization of multiple sound sources by means of time difference of arrival (TDOA) estimation.
To achieve the above purpose, the invention adopts the following technical scheme:
A multi-sound source localization method suitable for reverberant environments comprises the following steps:
S1, collecting the coordinates in the whole search area, grouping them, and calculating the center coordinates of each group;
S2, collecting voice signals with a microphone array;
S3, using the multi-sound source localization algorithm based on two-level search space clustering, determining the candidate groups of one sound source position by calculating the power at the center coordinates of each group, locating the sound source position among all coordinates contained in the candidate groups, and removing the coordinates near that sound source from the search area;
repeating step S3 until all sound source positions are located.
To further optimize the technical scheme, the following specific measures are adopted:
Further, the grouping in step S1 is based on a membership indicator $p(q_i \in z_j)$ and a group error $e(q_i, z_j)$:
if the i-th coordinate $q_i$ belongs to the j-th group $z_j$, then $p(q_i \in z_j) = 1$; if the i-th coordinate $q_i$ does not belong to the j-th group $z_j$, then $p(q_i \in z_j) = 0$;
where $I$ is the total number of coordinates in the whole search area, $J$ is the current number of groups, and $z_j$ is the set of all coordinates in the j-th group. $J$ starts at 1 and is incremented by 1 until the stopping condition involving the threshold $\theta_t$ holds. As $i$ and $j$ change, $p(q_i \in z_j)$, $e(q_i, z_j)$, and the center coordinates of $z_j$ are calculated by the K-means algorithm;
where $e(q_i, z_j)$ is the group error, defined as the sum of the sound path differences over all microphone pairs:

$$e(q_i, z_j) = \sum_{k=1}^{M-1} \sum_{l=k+1}^{M} \left| \tau_{kl}(q_i) - \tau_{kl}(z_j) \right|$$

Here $\tau_{kl}(q_i)$ is the TDOA value between microphone $k$ and microphone $l$ for position $q_i$, $\tau_{kl}(z_j)$ is the TDOA value between microphone $k$ and microphone $l$ for the center coordinates of group $z_j$, $M$ is the number of microphones, and $\theta_t$ is a threshold.
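As an illustration only, the group error and the membership assignment can be sketched in Python as follows. The sketch assumes that the "sound path difference" is the absolute difference of TDOA values, that the speed of sound is 343 m/s, and that positions are Cartesian tuples in meters. The function names (`tdoa`, `group_error`, `assign_to_groups`) are hypothetical and do not come from the patent, and the stopping condition on $\theta_t$ is deliberately left out.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value (not stated in the patent)


def tdoa(point, mic_k, mic_l, c=SPEED_OF_SOUND):
    """TDOA in seconds between microphones k and l for a source at `point`."""
    return (math.dist(point, mic_k) - math.dist(point, mic_l)) / c


def group_error(q, center, mics, c=SPEED_OF_SOUND):
    """Sum over all microphone pairs of |tdoa(q) - tdoa(center)| (assumed form)."""
    total = 0.0
    for k in range(len(mics)):
        for l in range(k + 1, len(mics)):
            total += abs(tdoa(q, mics[k], mics[l], c)
                         - tdoa(center, mics[k], mics[l], c))
    return total


def assign_to_groups(coords, centers, mics):
    """For each coordinate, return the index j of the group with the smallest
    error, i.e. the j for which p(q_i in z_j) = 1 in a K-means assignment step."""
    return [
        min(range(len(centers)), key=lambda j: group_error(q, centers[j], mics))
        for q in coords
    ]
```

Measuring the error in TDOA space rather than in Euclidean space groups together coordinates that the array can hardly tell apart, which is what lets a single center stand in for its whole group during the first-level search.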
Further, the threshold $\theta_t$ is defined in terms of the wavelength $\lambda$, the speed of sound $c$, and the sampling rate $f$; in sound source localization, the value of $\theta_t$ is determined by the maximum frequency of the speech signal.
Further, the microphone array collects voice signals with 16 microphones; the array as a whole is cylindrical, with 8 microphones uniformly distributed along each of the upper and lower rims.
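A minimal sketch of such an array geometry is given below. The radius and height are placeholder values chosen for illustration, since the patent text does not state the array dimensions, and the function name `cylindrical_array` is hypothetical.

```python
import math


def cylindrical_array(radius=0.1, height=0.1, per_ring=8):
    """Return microphone positions (x, y, z) in meters for two rings of
    `per_ring` microphones on the upper and lower rims of a cylinder
    centered at the origin. Dimensions are illustrative assumptions."""
    mics = []
    for z in (height / 2.0, -height / 2.0):  # upper rim, then lower rim
        for n in range(per_ring):
            angle = 2.0 * math.pi * n / per_ring  # uniform angular spacing
            mics.append((radius * math.cos(angle), radius * math.sin(angle), z))
    return mics
```

With the default arguments the function returns 16 positions, 8 on each rim.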
Further, in the time domain, the output used to calculate the coordinate power is:

$$y(t, q) = \sum_{m=1}^{M} g_m(t) * x_m(t + \tau_{m,q})$$

where $y(t, q)$ is the output value for coordinate position $q$ at time $t$, $g_m(t)$ is the impulse response of the filter at the m-th microphone, $*$ denotes convolution, $x_m(t + \tau_{m,q})$ is the signal received by the m-th microphone at time $t + \tau_{m,q}$, and $\tau_{m,q}$ is the signal propagation time from coordinate position $q$ to the m-th microphone. In the frequency domain, the formula is expressed as:

$$Y(\omega, q) = \sum_{m=1}^{M} G_m(\omega) X_m(\omega) e^{j\omega\tau_{m,q}}$$

where $Y(\omega, q)$ is the output value for coordinate position $q$ at frequency $\omega$, $X_m(\omega)$ is the Fourier transform of the m-th microphone signal, and $G_m(\omega)$ is the frequency-domain system function of the filter at the m-th microphone;
based on the frequency-domain formula, the power output value $P(q)$ of coordinate position $q$ is obtained as:

$$P(q) = \int_{-\infty}^{\infty} |Y(\omega, q)|^2 \, d\omega = \sum_{l=1}^{M} \sum_{k=1}^{M} \int_{-\infty}^{\infty} G_l(\omega) X_l(\omega) G_k^*(\omega) X_k^*(\omega) e^{j\omega(\tau_{l,q} - \tau_{k,q})} \, d\omega$$

where $G_l(\omega)$ is the frequency-domain system function of the filter at the l-th microphone, $X_l(\omega)$ is the Fourier transform of the l-th microphone signal, $G_k^*(\omega)$ is the conjugate of the frequency-domain system function of the filter at the k-th microphone, $X_k^*(\omega)$ is the conjugate of the Fourier transform of the k-th microphone signal, and $\tau_{k,q}$ is the signal propagation time from coordinate position $q$ to the k-th microphone;
writing $\Psi_{lk}(\omega) = G_l(\omega) G_k^*(\omega)$, the power becomes:

$$P(q) = \sum_{l=1}^{M} \sum_{k=1}^{M} \int_{-\infty}^{\infty} \Psi_{lk}(\omega) X_l(\omega) X_k^*(\omega) e^{j\omega(\tau_{l,q} - \tau_{k,q})} \, d\omega, \qquad \Psi_{lk}(\omega) = \frac{1}{|X_l(\omega) X_k^*(\omega)|}$$

where $\Psi_{lk}(\omega)$ is the PHAT weighting coefficient between the l-th microphone signal and the k-th microphone signal;
after the power at the center coordinates of each group is calculated, the candidate groups are determined and the sound source is located among all coordinates contained in the candidate groups; the sound source position $\hat{q}_s$ is the coordinate corresponding to the maximum power value, namely:

$$\hat{q}_s = \arg\max_{q} P(q)$$
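The PHAT-weighted steered power and the final argmax can be sketched for a single frame as follows. This is a hedged illustration, not the patented implementation: it replaces the integral over $\omega$ by a sum over DFT bins, uses a naive DFT to stay dependency-free, and sums only over microphone pairs with $l < k$ (the diagonal terms are constant and the mirrored pairs contribute the same real part, so the maximizer is unchanged). The names `dft`, `srp_phat_power`, and `locate` are hypothetical.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform, adequate for short frames."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * f * t / n) for t in range(n))
            for f in range(n)]


def srp_phat_power(spectra, delays, fs):
    """PHAT-weighted steered power for one candidate position.

    spectra: per-microphone DFTs of one frame (equal length N)
    delays:  propagation times tau_{m,q} in seconds to each microphone
    fs:      sampling rate in Hz
    """
    n = len(spectra[0])
    power = 0.0
    for l in range(len(spectra)):
        for k in range(l + 1, len(spectra)):
            for f in range(n):
                cross = spectra[l][f] * spectra[k][f].conjugate()
                mag = abs(cross)
                if mag < 1e-12:
                    continue  # PHAT weight undefined for an empty bin
                # Map the bin index to a signed angular frequency.
                freq = f if f <= n // 2 else f - n
                omega = 2.0 * math.pi * freq * fs / n
                steer = cmath.exp(1j * omega * (delays[l] - delays[k]))
                power += (cross / mag * steer).real
    return power


def locate(spectra, positions, mics, fs, c=343.0):
    """Return the candidate position with the maximum power (argmax of P(q))."""
    def power_at(q):
        return srp_phat_power(spectra, [math.dist(q, m) / c for m in mics], fs)
    return max(positions, key=power_at)
```

In the two-level scheme, `srp_phat_power` would first be evaluated only at group centers and later at every coordinate inside the candidate groups; a production system would use an FFT rather than the naive transform shown here.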
Further, the candidate groups of a sound source are determined as follows: the groups are searched according to the calculated power at their center coordinates, and the group with the maximum power value is selected as the first candidate group; when the v-th of the remaining groups is judged, it is selected as a candidate group if:

$$|\theta_v - \theta_c| \le \theta_1, \qquad |\varphi_v - \varphi_c| \le \varphi_1$$

The judgment of candidate groups terminates once the number $u$ of candidate groups reaches a set number or all groups have been judged;
where $(X_b, Y_b, Z_b)$ are the center coordinates of the b-th of the existing candidate groups in the Cartesian coordinate system, $(X_{cc}, Y_{cc}, Z_{cc})$ are the average coordinates obtained by averaging the center coordinates of all current candidate groups in the Cartesian coordinate system, $\theta_c$ and $\varphi_c$ are the azimuth and elevation of these average coordinates, $\theta_v$ and $\varphi_v$ are the azimuth and elevation of the center coordinates of the group currently being judged, $\theta_1$ is the azimuth threshold, and $\varphi_1$ is the elevation threshold.
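The screening rule can be sketched as below. Two points are assumptions of this sketch rather than statements of the patent: the remaining groups are judged in descending order of center power, and azimuth differences are wrapped to the range $[0, \pi]$. The names `azimuth_elevation`, `angle_diff`, and `select_candidates` are hypothetical.

```python
import math


def azimuth_elevation(p):
    """Azimuth and elevation (radians) of a Cartesian point (x, y, z)."""
    x, y, z = p
    return math.atan2(y, x), math.atan2(z, math.hypot(x, y))


def angle_diff(a, b):
    """Smallest absolute difference between two angles (assumed wrap-around)."""
    d = abs(a - b) % (2.0 * math.pi)
    return min(d, 2.0 * math.pi - d)


def select_candidates(centers, powers, az_thresh, el_thresh, max_groups):
    """Return indices of candidate groups for one sound source.

    centers: group center coordinates; powers: power at each center.
    The strongest group is taken first; each further group is kept only if
    its direction is close to the mean of the centers chosen so far.
    """
    order = sorted(range(len(centers)), key=lambda j: powers[j], reverse=True)
    chosen = [order[0]]
    for v in order[1:]:
        if len(chosen) >= max_groups:
            break
        mean = tuple(sum(centers[b][d] for b in chosen) / len(chosen)
                     for d in range(3))
        az_c, el_c = azimuth_elevation(mean)
        az_v, el_v = azimuth_elevation(centers[v])
        if angle_diff(az_v, az_c) <= az_thresh and abs(el_v - el_c) <= el_thresh:
            chosen.append(v)
    return chosen
```

Because the mean is recomputed after every accepted group, the acceptance window follows the cluster of candidates instead of staying fixed on the first group.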
Further, the coordinates near the sound source are removed from the search area as follows:
a region $\Omega$ is defined; for every group contained in $\Omega$, the power of the coordinate positions in the group is uniformly reduced to a low power value $E_l$, and the groups contained in $\Omega$ are not considered in the subsequent steps that locate other sound source positions;
the coordinates $(r, \theta, \varphi)$ within the region $\Omega$ in the spherical coordinate system satisfy:

$$|\theta - \theta_s| \le \theta_2, \qquad |\varphi - \varphi_s| \le \varphi_2$$

where $\theta$ is the azimuth of the coordinates, $\varphi$ is the elevation of the coordinates, $r$ is the distance of the coordinates from the origin of the coordinate system, $\theta_s$ and $\varphi_s$ are the azimuth and elevation of the sound source located immediately before the one currently being located, $\theta_2$ is the azimuth threshold, and $\varphi_2$ is the elevation threshold.
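A possible sketch of this removal step is shown below. It tests membership in $\Omega$ at the group centers, which is an assumption (the patent speaks of groups "contained in" the region without fixing the test), and it wraps azimuth differences. The names `to_spherical`, `in_region`, and `suppress_region` are hypothetical.

```python
import math


def to_spherical(p):
    """Convert (x, y, z) to (r, azimuth, elevation) with angles in radians."""
    x, y, z = p
    r = math.sqrt(x * x + y * y + z * z)
    return r, math.atan2(y, x), math.atan2(z, math.hypot(x, y))


def in_region(p, source, az_thresh, el_thresh):
    """True if point p lies inside the angular window around `source`."""
    _, az, el = to_spherical(p)
    _, az_s, el_s = to_spherical(source)
    d = abs(az - az_s) % (2.0 * math.pi)
    d = min(d, 2.0 * math.pi - d)  # assumed wrap-around of azimuth
    return d <= az_thresh and abs(el - el_s) <= el_thresh


def suppress_region(centers, powers, source, az_thresh, el_thresh, low_power):
    """Assign `low_power` to every group whose center lies in the region and
    return the new power list together with the set of removed group indices."""
    new_powers = list(powers)
    removed = set()
    for j, center in enumerate(centers):
        if in_region(center, source, az_thresh, el_thresh):
            new_powers[j] = low_power
            removed.add(j)
    return new_powers, removed
```

The region depends only on direction, not on the distance $r$, so groups at any range along the direction of the located source are suppressed together.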
The beneficial effects of the invention are as follows:
1. Through cluster screening, threshold judgment, and related steps, the method selects a suitable removal region and suitable candidate-group screening conditions, so that the TL-SSC algorithm can be applied to multi-sound source systems.
2. Compared with other multi-sound source localization methods, the method requires fewer microphones, improves computational efficiency while maintaining high localization accuracy, and meets the real-time requirements of mobile robot applications.
Drawings
FIG. 1 is a schematic diagram of the microphone and sound source distribution of the invention; in the figure, pentagrams represent microphones and dots represent sound sources.
Detailed Description
The invention will now be described in further detail with reference to the accompanying drawings.
The localization of two sound sources among the plurality of sound sources is described as an example.
First, the primary coordinates in the whole search area are grouped and the center coordinates of each group are calculated. The grouping uses the following quantities: $I$ is the total number of primary coordinates in the whole search area, $J$ is the current number of groups, $M$ is the number of microphones, and $\theta_t$ is a threshold. $z_j$ is the set of all coordinates in the j-th group; if the i-th coordinate $q_i$ belongs to the j-th group $z_j$, then $p(q_i \in z_j) = 1$, and otherwise it is 0. $e(q_i, z_j)$ is the group error, defined as the sum of the sound path differences over all microphone pairs:

$$e(q_i, z_j) = \sum_{k=1}^{M-1} \sum_{l=k+1}^{M} \left| \tau_{kl}(q_i) - \tau_{kl}(z_j) \right|$$

where $\tau_{kl}(q_i)$ is the TDOA value between microphone $k$ and microphone $l$ for position $q_i$, and $\tau_{kl}(z_j)$ is the corresponding TDOA value for the center coordinates of group $z_j$. $J$ starts at 1 and is incremented by 1 each time until the stopping condition involving the threshold $\theta_t$ holds. As $J$ increases, $p(q_i \in z_j)$, $e(q_i, z_j)$, and the center coordinates of $z_j$ are recalculated by the K-means algorithm.
The threshold $\theta_t$ is defined in terms of the wavelength $\lambda$, the speed of sound $c$, and the sampling rate $f$; in sound source localization, the value of $\theta_t$ is determined by the maximum frequency of the speech signal.
Next, the candidate groups of one of the sound sources (taken, without loss of generality, as the first sound source to be localized) are screened and the source is localized. Based on the grouping result, the power at the center coordinates of each group is calculated and the first-level search is carried out: the group whose center coordinates have the maximum power value is selected as the first candidate group. Suppose that $u_1$ groups have already been selected as candidate groups in the first-level search. To pick out the groups that belong to the same sound source as the existing candidate groups, the $v_1$-th of the remaining groups is selected as a candidate group if:

$$|\theta_{v_1} - \theta_{c_1}| \le \theta_1, \qquad |\varphi_{v_1} - \varphi_{c_1}| \le \varphi_1$$

Screening continues until $u_1$ reaches a set number $n$ or all groups have been judged. Here $(X_{b_1}, Y_{b_1}, Z_{b_1})$ are the center coordinates of the $b_1$-th of the existing candidate groups in the Cartesian coordinate system, $(X_{c_1}, Y_{c_1}, Z_{c_1})$ are the average coordinates obtained by averaging the center coordinates of all current candidate groups in the Cartesian coordinate system, $\theta_{c_1}$ and $\varphi_{c_1}$ are the azimuth and elevation of these average coordinates, $\theta_{v_1}$ and $\varphi_{v_1}$ are the azimuth and elevation of the center coordinates of the group currently being judged, $\theta_1$ is the azimuth threshold, and $\varphi_1$ is the elevation threshold.
Using the formula

$$P(q) = \sum_{l=1}^{M} \sum_{k=1}^{M} \int_{-\infty}^{\infty} \Psi_{lk}(\omega) X_l(\omega) X_k^*(\omega) e^{j\omega(\tau_{l,q} - \tau_{k,q})} \, d\omega$$

with the PHAT weighting coefficient $\Psi_{lk}(\omega)$, the power $P(q)$ of each primary coordinate in all candidate groups is calculated and the second-level search for the first sound source is carried out: the position with the maximum output power among all these coordinates is determined to be the position of the first sound source.
The groups near the located sound source are then removed by assigning them a lower power value, which reduces the influence of the coordinates around that sound source on the later localization of the second sound source. A region $\Omega$ is defined whose coordinates $(r, \theta, \varphi)$ in the spherical coordinate system satisfy:

$$|\theta - \theta_s| \le \theta_2, \qquad |\varphi - \varphi_s| \le \varphi_2$$

where $\theta_2$ and $\varphi_2$ are thresholds set for azimuth and elevation respectively, and $\theta_s$ and $\varphi_s$ are the azimuth and elevation of the coordinates of the first sound source, which has already been located. The $P(q)$ values of the groups contained in the region $\Omega$ are uniformly set to a low power value $E_l$, and these groups are not considered in the next step. No power reduction or removal is applied before the first sound source is located; the reduction and removal of the power of neighboring groups take effect from the second located sound source onward.
The second sound source is located according to the modified power distribution. Since it cannot be guaranteed that the region $\Omega$ contains every group near the first sound source that may affect the localization of the second sound source, the screening method used for the first sound source is applied again. In the first-level search, the group with the largest power value among the groups not contained in $\Omega$ is selected as the first candidate group of the second sound source, and the remaining groups in the lookup table are then screened, which reduces the chance of including groups whose power is contributed mainly by the first sound source. Suppose that $u_2$ groups have already been selected as candidate groups of the second sound source; the $v_2$-th of the remaining groups is then selected as a candidate group if:

$$|\theta_{v_2} - \theta_{c_2}| \le \theta_1, \qquad |\varphi_{v_2} - \varphi_{c_2}| \le \varphi_1$$

where $(X_{v_2}, Y_{v_2}, Z_{v_2})$ are the center coordinates of the $v_2$-th group, $(X_{b_2}, Y_{b_2}, Z_{b_2})$ are the center coordinates of the $b_2$-th of the existing candidate groups in the Cartesian coordinate system, $(X_{c_2}, Y_{c_2}, Z_{c_2})$ are the average coordinates obtained by averaging the center coordinates of all current candidate groups in the Cartesian coordinate system, $\theta_{c_2}$ and $\varphi_{c_2}$ are the azimuth and elevation of these average coordinates, $\theta_{v_2}$ and $\varphi_{v_2}$ are the azimuth and elevation of the center coordinates of the group currently being judged, $\theta_1$ is the azimuth threshold, and $\varphi_1$ is the elevation threshold.
Screening stops once the number $u_2$ of candidate groups reaches a set value $n$ or all groups have been judged. The power $P(q)$ of every primary coordinate in the candidate groups is then calculated and sorted, and the primary coordinate with the maximum power value is taken as the coordinates of the second sound source.
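Putting the walkthrough together, the loop over sound sources might be sketched as follows. This is a simplified illustration under several assumptions: the power function is supplied by the caller, groups are judged in descending order of center power, region membership is tested at group centers, and removed groups are simply skipped instead of being assigned the low power value $E_l$. All names (`az_el`, `near`, `localize`) are hypothetical.

```python
import math


def az_el(p):
    """Azimuth and elevation (radians) of a Cartesian point."""
    x, y, z = p
    return math.atan2(y, x), math.atan2(z, math.hypot(x, y))


def near(p, ref, az_thresh, el_thresh):
    """True if p is within the angular thresholds of ref."""
    az_p, el_p = az_el(p)
    az_r, el_r = az_el(ref)
    d = abs(az_p - az_r) % (2.0 * math.pi)
    return min(d, 2.0 * math.pi - d) <= az_thresh and abs(el_p - el_r) <= el_thresh


def localize(groups, power, n_sources, az1, el1, az2, el2, max_candidates):
    """Two-level search repeated once per source.

    groups: list of (center, [coordinates]); power: callable q -> P(q).
    az1/el1 screen candidate groups; az2/el2 define the removal region.
    """
    excluded = set()
    found = []
    for _ in range(n_sources):
        active = [j for j in range(len(groups)) if j not in excluded]
        if not active:
            break
        # First level: rank the remaining groups by power at their centers.
        active.sort(key=lambda j: power(groups[j][0]), reverse=True)
        chosen = [active[0]]
        for v in active[1:]:
            if len(chosen) >= max_candidates:
                break
            mean = tuple(sum(groups[b][0][d] for b in chosen) / len(chosen)
                         for d in range(3))
            if near(groups[v][0], mean, az1, el1):
                chosen.append(v)
        # Second level: exhaustive search inside the candidate groups only.
        best = max((q for j in chosen for q in groups[j][1]), key=power)
        found.append(best)
        # Remove the groups around the located source before the next pass.
        excluded.update(j for j in range(len(groups))
                        if near(groups[j][0], best, az2, el2))
    return found
```

The saving comes from the second level: the full-resolution power is evaluated only for coordinates inside at most `max_candidates` groups per source, rather than for every coordinate in the search area.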
It should be noted that terms such as "upper", "lower", "left", "right", "front", and "rear" are used for descriptive purposes only and are not intended to limit the scope in which the invention may be practiced; changes or adjustments to their relative relationships that do not materially alter the technical content are also regarded as within that scope.
The above is only a preferred embodiment of the invention. The protection scope of the invention is not limited to the above embodiment; all technical solutions that fall under the concept of the invention belong to its protection scope. Modifications and adaptations made without departing from the principles of the invention are likewise regarded as within its protection scope.

Claims (5)

1. A multi-sound source localization method suitable for a reverberant environment, comprising the following steps:
S1, collecting the coordinates in the whole search area, grouping them, and calculating the center coordinates of each group;
S2, collecting voice signals with a microphone array;
S3, using the multi-sound source localization algorithm based on two-level search space clustering, determining the candidate groups of one sound source position by calculating the power at the center coordinates of each group, locating the sound source position among all coordinates contained in the candidate groups, and removing the coordinates near that sound source from the search area;
repeating the above operation until all sound source positions are located;
wherein the candidate groups of a sound source are determined as follows: the groups are searched according to the calculated power at their center coordinates, and the group with the maximum power value is selected as the first candidate group; when the v-th of the remaining groups is judged, it is selected as a candidate group if:

$$|\theta_v - \theta_c| \le \theta_1, \qquad |\varphi_v - \varphi_c| \le \varphi_1$$

the judgment of candidate groups terminates once the number $u$ of candidate groups reaches a set number or all groups have been judged;
where $(X_b, Y_b, Z_b)$ are the center coordinates of the b-th of the existing candidate groups in the Cartesian coordinate system, $(X_{cc}, Y_{cc}, Z_{cc})$ are the average coordinates obtained by averaging the center coordinates of all current candidate groups in the Cartesian coordinate system, $\theta_c$ and $\varphi_c$ are the azimuth and elevation of these average coordinates, $\theta_v$ and $\varphi_v$ are the azimuth and elevation of the center coordinates of the group currently being judged, $\theta_1$ is the azimuth threshold, and $\varphi_1$ is the elevation threshold;
the coordinates near the sound source are removed from the search area as follows:
a region $\Omega$ is defined; for every group contained in $\Omega$, the power of the coordinate positions in the group is uniformly reduced to a low power value $E_l$, and the groups contained in $\Omega$ are not considered in the subsequent steps that locate other sound source positions;
the coordinates $(r, \theta, \varphi)$ within the region $\Omega$ in the spherical coordinate system satisfy:

$$|\theta - \theta_s| \le \theta_2, \qquad |\varphi - \varphi_s| \le \varphi_2$$

where $\theta$ is the azimuth of the coordinates, $\varphi$ is the elevation of the coordinates, $r$ is the distance of the coordinates from the origin of the coordinate system, $\theta_s$ and $\varphi_s$ are the azimuth and elevation of the sound source located immediately before the one currently being located, $\theta_2$ is the azimuth threshold, and $\varphi_2$ is the elevation threshold.
2. The multi-sound source localization method suitable for a reverberant environment according to claim 1, wherein the grouping in step S1 is based on a membership indicator $p(q_i \in z_j)$ and a group error $e(q_i, z_j)$:
if the i-th coordinate $q_i$ belongs to the j-th group $z_j$, then $p(q_i \in z_j) = 1$; if the i-th coordinate $q_i$ does not belong to the j-th group $z_j$, then $p(q_i \in z_j) = 0$;
where $I$ is the total number of coordinates in the whole search area, $J$ is the current number of groups, and $z_j$ is the set of all coordinates in the j-th group; $J$ starts at 1 and is incremented by 1 until the stopping condition involving the threshold $\theta_t$ holds; as $i$ and $j$ change, $p(q_i \in z_j)$, $e(q_i, z_j)$, and the center coordinates of $z_j$ are calculated by the K-means algorithm;
where $e(q_i, z_j)$ is the group error, defined as the sum of the sound path differences over all microphone pairs:

$$e(q_i, z_j) = \sum_{k=1}^{M-1} \sum_{l=k+1}^{M} \left| \tau_{kl}(q_i) - \tau_{kl}(z_j) \right|$$

$\tau_{kl}(q_i)$ is the TDOA value between microphone $k$ and microphone $l$ for position $q_i$, $\tau_{kl}(z_j)$ is the TDOA value between microphone $k$ and microphone $l$ for the center coordinates of group $z_j$, $M$ is the number of microphones, and $\theta_t$ is a threshold.
3. The multi-sound source localization method suitable for a reverberant environment according to claim 2, wherein the threshold $\theta_t$ is defined in terms of the wavelength $\lambda$, the speed of sound $c$, and the sampling rate $f$; in sound source localization, the value of $\theta_t$ is determined by the maximum frequency of the speech signal.
4. The multi-sound source localization method suitable for a reverberant environment according to claim 3, wherein the microphone array collects the voice signals with 16 microphones, the array as a whole is cylindrical, and 8 microphones are uniformly distributed along each of the upper and lower rims.
5. The multi-sound source localization method suitable for a reverberant environment according to claim 2, wherein,
in the frequency domain, the formula for calculating the coordinate power is expressed as:

$$Y(\omega, q) = \sum_{m=1}^{M} G_m(\omega) X_m(\omega) e^{j\omega\tau_{m,q}}$$

where $Y(\omega, q)$ is the output value for coordinate position $q$ at frequency $\omega$, $X_m(\omega)$ is the Fourier transform of the m-th microphone signal, and $G_m(\omega)$ is the frequency-domain system function of the filter at the m-th microphone;
based on the frequency-domain formula, the power output value $P(q)$ of coordinate position $q$ is obtained as:

$$P(q) = \int_{-\infty}^{\infty} |Y(\omega, q)|^2 \, d\omega = \sum_{l=1}^{M} \sum_{k=1}^{M} \int_{-\infty}^{\infty} G_l(\omega) X_l(\omega) G_k^*(\omega) X_k^*(\omega) e^{j\omega(\tau_{l,q} - \tau_{k,q})} \, d\omega$$

where $G_l(\omega)$ is the frequency-domain system function of the filter at the l-th microphone, $X_l(\omega)$ is the Fourier transform of the l-th microphone signal, $G_k^*(\omega)$ is the conjugate of the frequency-domain system function of the filter at the k-th microphone, $X_k^*(\omega)$ is the conjugate of the Fourier transform of the k-th microphone signal, and $\tau_{k,q}$ is the signal propagation time from coordinate position $q$ to the k-th microphone;
writing $\Psi_{lk}(\omega) = G_l(\omega) G_k^*(\omega)$, the power becomes:

$$P(q) = \sum_{l=1}^{M} \sum_{k=1}^{M} \int_{-\infty}^{\infty} \Psi_{lk}(\omega) X_l(\omega) X_k^*(\omega) e^{j\omega(\tau_{l,q} - \tau_{k,q})} \, d\omega, \qquad \Psi_{lk}(\omega) = \frac{1}{|X_l(\omega) X_k^*(\omega)|}$$

where $\Psi_{lk}(\omega)$ is the PHAT weighting coefficient between the l-th microphone signal and the k-th microphone signal;
after the power at the center coordinates of each group is calculated, the candidate groups are determined and the sound source is located among all coordinates contained in the candidate groups; the sound source position $\hat{q}_s$ is the coordinate corresponding to the maximum power value, namely:

$$\hat{q}_s = \arg\max_{q} P(q)$$
CN202110684270.9A 2021-06-21 2021-06-21 Multi-sound source positioning method suitable for reverberant environment Active CN113419216B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110684270.9A CN113419216B (en) 2021-06-21 2021-06-21 Multi-sound source positioning method suitable for reverberant environment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110684270.9A CN113419216B (en) 2021-06-21 2021-06-21 Multi-sound source positioning method suitable for reverberant environment

Publications (2)

Publication Number Publication Date
CN113419216A CN113419216A (en) 2021-09-21
CN113419216B true CN113419216B (en) 2023-10-31

Family

ID=77789393

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110684270.9A Active CN113419216B (en) 2021-06-21 2021-06-21 Multi-sound source positioning method suitable for reverberant environment

Country Status (1)

Country Link
CN (1) CN113419216B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115662383B (en) * 2022-12-22 2023-04-14 杭州爱华智能科技有限公司 Method and system for deleting main sound source, method, system and device for identifying multiple sound sources
CN117828405B (en) * 2024-02-23 2024-05-07 兰州交通大学 Signal positioning method based on intelligent frequency spectrum sensing

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106093864A (en) * 2016-06-03 2016-11-09 清华大学 A kind of microphone array sound source space real-time location method
US9554208B1 (en) * 2014-03-28 2017-01-24 Marvell International Ltd. Concurrent sound source localization of multiple speakers
CN106940439A (en) * 2017-03-01 2017-07-11 西安电子科技大学 K mean cluster weighting sound localization method based on wireless acoustic sensor network
CN108198568A (en) * 2017-12-26 2018-06-22 太原理工大学 A kind of method and system of more auditory localizations
CN110443371A (en) * 2019-06-25 2019-11-12 深圳欧克曼技术有限公司 A kind of artificial intelligence device and method
CN111352075A (en) * 2018-12-20 2020-06-30 中国科学院声学研究所 Underwater multi-sound-source positioning method and system based on deep learning
CN111474521A (en) * 2020-04-09 2020-07-31 南京理工大学 Sound source positioning method based on microphone array in multipath environment
CN111489753A (en) * 2020-06-24 2020-08-04 深圳市友杰智新科技有限公司 Anti-noise sound source positioning method and device and computer equipment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9554208B1 (en) * 2014-03-28 2017-01-24 Marvell International Ltd. Concurrent sound source localization of multiple speakers
CN106093864A (en) * 2016-06-03 2016-11-09 清华大学 A kind of microphone array sound source space real-time location method
CN106940439A (en) * 2017-03-01 2017-07-11 西安电子科技大学 K mean cluster weighting sound localization method based on wireless acoustic sensor network
CN108198568A (en) * 2017-12-26 2018-06-22 太原理工大学 A kind of method and system of more auditory localizations
CN111352075A (en) * 2018-12-20 2020-06-30 中国科学院声学研究所 Underwater multi-sound-source positioning method and system based on deep learning
CN110443371A (en) * 2019-06-25 2019-11-12 深圳欧克曼技术有限公司 A kind of artificial intelligence device and method
CN111474521A (en) * 2020-04-09 2020-07-31 南京理工大学 Sound source positioning method based on microphone array in multipath environment
CN111489753A (en) * 2020-06-24 2020-08-04 深圳市友杰智新科技有限公司 Anti-noise sound source positioning method and device and computer equipment

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
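Fast sound source localization using two-level search space clustering; YOOK D; IEEE Transactions on Cybernetics; Vol. 46, No. 1; abstract, sections 1-5, figures 1-5 *
MULTIPLE SOUND SOURCE LOCALIZATION BASED ON TDOA CLUSTERING AND MULTI-PATH MATCHING PURSUIT. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2018, full text. *
Multi-source sound localization using the competitive k-means clustering. 2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010). 2010, full text. *
Far-field multi-sound-source localization based on a three-line-intersection spherical microphone array; 庄启雷; 黄青华; Journal of Shanghai University (Natural Science Edition) (No. 02); full text *
Research on wideband multi-sound-source localization based on multi-array data fusion; 滕鹏晓; 杨亦春; 李晓东; 田静; Journal of Applied Acoustics (应用声学) (No. 03); full text *
Multi-sound-source localization method based on subband steered response power; 倪志莲; 蔡卫平; 张怡典; Computer Engineering and Applications (No. 24); full text *
Microphone-based multi-sound-source localization using a complex exponential transform of phase difference; 赵小燕; 汤捷; 周琳; 吴镇扬; Journal of Southeast University (Natural Science Edition) (No. 02); full text *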

Also Published As

Publication number Publication date
CN113419216A (en) 2021-09-21

Similar Documents

Publication Publication Date Title
CN107102296B (en) Sound source positioning system based on distributed microphone array
CN113419216B (en) Multi-sound source positioning method suitable for reverberant environment
CN108375763B (en) Frequency division positioning method applied to multi-sound-source environment
CN110488223A (en) A kind of sound localization method
CN108696932A (en) It is a kind of using CSI multipaths and the outdoor fingerprint positioning method of machine learning
CN104142492A (en) SRP-PHAT multi-source spatial positioning method
CN112904279B (en) Sound source positioning method based on convolutional neural network and subband SRP-PHAT spatial spectrum
CN108731886A (en) A kind of more leakage point acoustic fix ranging methods of water supply line based on iteration recursion
CN101620270A (en) Wireless location method based on cluster-fusion
CN110534126A (en) A kind of auditory localization and sound enhancement method and system based on fixed beam formation
CN107167770A (en) A kind of microphone array sound source locating device under the conditions of reverberation
CN111856402B (en) Signal processing method and device, storage medium and electronic device
CN109884591A (en) A kind of multi-rotor unmanned aerial vehicle acoustical signal Enhancement Method based on microphone array
CN109212481A (en) A method of auditory localization is carried out using microphone array
Badawy et al. Decision tree approach to estimate user location in WLAN based on location fingerprinting
Luo et al. A new passive source localization method using AOA-GROA-TDOA in wireless sensor array networks and its Cramér-Rao bound analysis
CN109633554A (en) Moving sound based on probabilistic data association reaches delay time estimation method
CN116559778B (en) Vehicle whistle positioning method and system based on deep learning
CN110208736B (en) Non-circular signal uniform array direction-of-arrival angle estimation method based on fourth-order cumulant
CN1402586A (en) Method and device for estimating position by signal arrival time
Himawan et al. Clustering of ad-hoc microphone arrays for robust blind beamforming
CN114269014B (en) Large-scale MIMO dynamic environment fingerprint positioning method based on domain adaptive network
Jia et al. Acoustic passive localization algorithm based on wireless sensor networks
Shi et al. CSI-based fingerprinting for indoor localization with multi-scale convolutional neural network
CN113189544A (en) Multi-sound-source positioning method for removing outliers by weighting of activity intensity vectors

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant