CN113419216B - Multi-sound source positioning method suitable for reverberant environment - Google Patents
Multi-sound source positioning method suitable for reverberant environment Download PDFInfo
- Publication number
- CN113419216B CN113419216B CN202110684270.9A CN202110684270A CN113419216B CN 113419216 B CN113419216 B CN 113419216B CN 202110684270 A CN202110684270 A CN 202110684270A CN 113419216 B CN113419216 B CN 113419216B
- Authority
- CN
- China
- Prior art keywords
- sound source
- coordinates
- group
- microphone
- coordinate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 230000004807 localization Effects 0.000 claims abstract description 22
- 238000005070 sampling Methods 0.000 claims description 3
- 230000008859 change Effects 0.000 claims description 2
- 230000008569 process Effects 0.000 claims description 2
- 238000004364 calculation method Methods 0.000 abstract description 5
- 238000012216 screening Methods 0.000 description 5
- 238000012935 Averaging Methods 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S5/00—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
- G01S5/18—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
- G01S5/20—Position of source determined by a plurality of spaced direction-finders
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Abstract
The invention provides a multi-sound source localization method suitable for a reverberation environment, which comprises the steps of grouping all coordinates of a whole search area and calculating the center coordinates of each group; collecting a voice signal using a 16 microphone array; positioning a certain sound source by using a multi-sound source positioning algorithm of double-layer search space clustering (TL-SSC), and removing coordinates near the sound source in a search area; this operation is repeated until all sound sources are located. The method solves the problem of real-time localization of multiple sound sources under the reverberation condition, and compared with other multi-sound source localization methods, the method has the advantages that the number of the required microphones is small, the calculation efficiency can be improved under the condition of keeping high localization precision, and the real-time requirement of mobile robot application is met.
Description
Technical Field
The invention relates to the technical field of sound source localization, in particular to a multi-sound source localization method suitable for a reverberation environment.
Background
The multi-sound source localization has a wide demand in real-time systems such as video conferences, voice recognition, mobile service robots and the like, and is one of research hotspots in the field of acoustic signal processing. For example, when the mobile robot performs real-time intelligent service, the voice position is determined by a multi-sound source positioning method, and the robot is guided to complete the service. The existing sound source localization methods mainly comprise three types: positioning method based on subspace, positioning method based on controllable beam forming and positioning method based on arrival time delay. The subspace-based positioning method receives signals through each microphone, utilizes orthogonality of the signal subspace and the noise subspace, constructs a spatial spectrum function and searches a spectrum peak to obtain a sound source direction, has high positioning accuracy, has higher requirements on stability of sound source signals, and has poor positioning effect in a small space. The positioning method based on controllable beam forming selects the direction corresponding to the maximum receiving power value as the sound source direction by changing the direction of the receiving signals of the microphone array, and has simple principle and small calculated amount, but has poor noise resistance, environmental noise information needs to be acquired in advance, and positioning instantaneity is difficult to ensure. The positioning method based on the arrival time delay utilizes the sound path difference from the sound source to each microphone to determine the sound source position, the calculation complexity is generally smaller than that of the two methods, the positioning precision is higher, and the positioning instantaneity is easy to meet.
Disclosure of Invention
Aiming at the application occasions with higher requirements on algorithm real-time performance, such as an indoor mobile robot, in the prior art, namely the problem that the calculation efficiency is improved and the space is improved while the accuracy is maintained as much as possible in a small space, the invention provides a multi-sound-source positioning method suitable for a reverberation environment, wherein the number of microphones adopted is 16 by the multi-sound-source positioning method based on double-layer search space clustering (Two-Levels Search Space Clustering, TL-SSC), the calculation efficiency of a system is improved by grouping coordinates, carrying out real-time double-layer search, clustering screening, threshold judgment and the like, and the real-time positioning of the multi-sound-source is realized by utilizing the estimation of arrival time delay (Time Difference of Arrival, TDOA).
In order to achieve the above purpose, the present invention adopts the following technical scheme:
a multi-sound source localization method suitable for reverberant environments comprises the following steps:
s1, collecting coordinates in the whole search area, grouping, and calculating the center coordinates of each group;
s2, collecting voice signals by using a microphone array;
s3, determining a candidate subgroup of a certain sound source position by adopting a multi-sound source positioning algorithm of double-layer search space clustering in a mode of calculating the central coordinate power of each subgroup, positioning the sound source position in all coordinates contained in the candidate subgroup, and removing coordinates near the sound source in a search area;
repeating the step S3 until all the sound source positions are positioned.
In order to optimize the technical scheme, the specific measures adopted further comprise:
further, the grouping in step S1 is based on:
if the ith coordinate q i Belonging to the j-th group z j Then p (q i ∈z j ) Has a value of 1; if the ith coordinate q i Not belonging to the j-th group z j ,p(q i ∈z j ) The value of (2) is 0;
wherein I represents the total number of coordinates in the whole search area, J represents the current group number, z j Represented as all coordinate sets in the j-th group; wherein the initial value of J is 1, and sequentially adding 1 until the formulaIt holds that with the change of i and j, p (q i ∈z j )、e(q i ,z j ) And z j The center coordinates of the (B) are calculated through a K-means algorithm;
wherein e (q) i ,z j ) Representation groupError convergence, defined as the difference in sound path between all microphone pairsAnd (3) summing; />Representing the distance position q between microphone k and microphone l i TDOA value of>Representing the distance set z between microphone k and microphone l j TDOA value of center coordinates, M represents the number of microphones, θ t Represented as a threshold.
Further, the threshold value theta t The definition is as follows:
where λ represents the wavelength, c is the speed of sound, and f is the sampling rate; in sound source localization, θ t The value is determined by the maximum frequency of the speech signal.
Further, the microphone array collects voice signals by adopting 16 microphones, the whole microphone array is a cylinder, and 8 microphones are uniformly distributed on the upper and lower outlines respectively.
Further, in the time domain, the mode of calculating the coordinate power is specifically:
wherein y (t, q) represents the output value of the coordinate position q at time t, g m (t) represents the impulse response of the filter at the mth microphone, x m (t+τ m,q ) Representing the mth microphone at time t+τ m,q Received signal τ m,q Representing signal propagation times for coordinate position q through the mth microphone; in the frequency domain, the formula for calculating the coordinate power is expressed as:
wherein Y (ω, q) is expressed as an output value of the coordinate position q at the frequency ω, X m (omega) Fourier transform denoted as mth microphone signal, G m (ω) represents the frequency domain system function of the filter at the mth microphone;
based on a formula for calculating coordinate power in a frequency domain, a power output value P (q) of the coordinate position q is obtained as follows:
wherein G is l (omega) is expressed as a frequency domain system function of the filter at the first microphone, X l (ω) is denoted as the fourier transform of the first microphone signal,expressed as the conjugate of the frequency domain system function of the filter at the kth microphone,/i>Represented as the conjugate of the fourier transform of the kth microphone signal, τ k,q Representing signal propagation times for coordinate position q through the kth microphone;
in the method, in the process of the invention, a PHAT weighting coefficient between the ith microphone signal and the kth microphone signal;
after calculating the central coordinate power of each group, determining a candidate group, locating a sound source position in all coordinates contained in the candidate group, and determining the sound source position as the maximum power valueCorresponding coordinates, namely:
further, the specific way to determine a certain sound source candidate group is: searching according to the result of calculating the central coordinate power value of each subgroup, selecting the subgroup corresponding to the maximum power value as a first candidate subgroup, and when judging the v-th subgroup in the rest subgroups, selecting the subgroup as the candidate subgroup under the following conditions:
|θ v -θ c |≤θ 1 ,
terminating the judgment of the candidate subgroup after the number u of the candidate subgroups reaches a certain number or all subgroups are judged;
wherein (X) b ,Y b ,Z b ) Expressed as the center coordinates of the b-th subgroup of the existing candidate subgroups in the Cartesian coordinate system, (X) cc ,Y cc ,Z cc ) Expressed as average coordinates, theta, of all the current candidate group center coordinates averaged in a Cartesian coordinate system c For the direction of the average coordinates after averaging the central coordinates of all the current candidate subgroupsThe angle of the corner of the plate,for the elevation angle of the average coordinate averaged over the center coordinates of all the candidate subgroups at present, θ v Azimuth angle expressed as the central coordinate position of the current group to be discriminated, +.>Elevation angle, θ, expressed as center coordinates of the current group to be discriminated 1 Expressed as azimuth threshold, ++>Represented as elevation threshold.
Further, the specific content of removing the coordinates near the sound source in the search area is:
providing a region omega, uniformly reducing the power of the coordinate positions in the group and giving a power value E to the group contained in the region omega l Meanwhile, the small groups contained in the region omega are not considered in the subsequent step of positioning other sound source positions;
coordinates within a region Ω in a spherical coordinate systemThe requirements are as follows:
|θ-θ s |≤θ 2 ,
where, θ is expressed as azimuth of the coordinates,expressed as elevation angle of the coordinates, r represents distance of the coordinates from origin of the coordinate system, θ s Azimuth angle, expressed as the last sound source coordinate position of the currently located sound source, +>Elevation angle, θ, expressed as the last sound source coordinate position of the currently located sound source 2 Expressed as azimuth threshold, ++>Represented as elevation threshold.
The beneficial effects of the invention are as follows:
1. according to the multi-sound source localization method suitable for the reverberation environment, through the steps of clustering screening, threshold judgment and the like, a proper removal area and candidate group screening conditions are selected, so that the TL-SSC algorithm can be applied to a multi-sound source system.
2. Compared with other multi-sound source localization methods, the multi-sound source localization method suitable for the reverberation environment has the advantages that the number of required microphones is small, the calculation efficiency can be improved under the condition that high localization accuracy is kept, and the real-time requirement of mobile robot application is met.
Drawings
FIG. 1 is a schematic diagram of a microphone and sound source distribution of the present invention; in the figure: pentagram represents a microphone and dot represents a sound source.
Detailed Description
The invention will now be described in further detail with reference to the accompanying drawings.
The localization of two sound sources among the plurality of sound sources is described as an example.
Grouping the primary coordinates in the whole search area, and calculating the center coordinates of each group, wherein the grouping basis is as follows:
wherein I represents the total number of primary coordinates in the whole search area, J represents the current group number, M represents the number of microphones, and θ t Represented as a threshold. z j Expressed as all coordinate sets in the j-th group, if the i-th coordinate q i Belonging to the j-th group z j Then p (q i ∈z j ) 1 is shown in the specification; otherwise, 0.e (q) i ,z j ) Representing a group error, defined as the difference in sound path between all microphone pairsAnd (3) summing; />Representing the distance position q between microphone k and microphone l i Is to determine the TDOA value of->Values. The initial value of J is 1, and the increment is 1 each time until the formula
This is true. As J increases, p (q i ∈z j ),e(q i ,z j ) And z j Will be calculated by the K-means algorithm. The threshold θ is defined as:
where λ represents the wavelength, c is the speed of sound, and f is the sampling rate; in sound source localization, θ t The value is determined by the maximum frequency of the speech signal.
Candidate subgroups of one of the sound sources (the sound source that is not the first to be localized) are screened out and localized. Calculating the power value corresponding to each subgroup according to the obtained grouping result, carrying out first layer search, and selecting the subgroup corresponding to the maximum power value of the central coordinate asA first candidate group. Suppose that u is already in the first tier of searches 1 The subgroup is selected as candidate subgroup, and the v-th subgroup is left to select the subgroup belonging to the same sound source as the existing candidate subgroup 1 When the group is judged, the conditions for selecting the group as a candidate group are as follows:
up to u 1 Up to a certain number n or all subgroups are discriminated. Wherein the method comprises the steps ofExpressed as b in the existing candidate group in Cartesian coordinate system 1 Center coordinates of the groups, (X c1 ,Y c1 ,Z c1 ) Expressed as average coordinates, theta, of all the current candidate group center coordinates averaged in a Cartesian coordinate system c1 For the azimuth angle of the average coordinates averaged over all the current candidate group center coordinates +.>To average sitting after averaging the central coordinates of all the current candidate groupsElevation angle of target->Azimuth angle expressed as the central coordinate position of the current group to be discriminated, +.>Elevation angle, θ, expressed as center coordinates of the current group to be discriminated 1 Expressed as azimuth threshold, ++>Represented as elevation threshold. Using the formula
And calculating the power P (q) of each primary coordinate in all candidate groups, performing second-layer search on the first sound source, searching the position with the maximum output power in all coordinates, and determining the position as the position of the first sound source.
The influence of coordinates around the sound source on the positioning of a second sound source at a later stage is reduced by removing the group near the sound source by giving the group a lower power value. Is provided with a region omega and omega internal coordinates in a spherical coordinate systemThe method meets the following conditions:
|θ-θ s |≤θ 2 ,
wherein θ is 2 And (3) withThreshold values respectively set in azimuth and elevation angle, θ s And->The azimuth and elevation of the first sound source coordinate which is already positioned are respectively, and the P (q) value of the small group contained in the region omega is uniformly given with a low power value E l The subgroup it contains is not considered in the next step. If the currently located sound source is the first located sound source, the coordinate power of the adjacent group is not reduced and removed for the first sound source, but the coordinate power of the adjacent group is reduced and removed from the second located sound source.
The second sound source is positioned according to the modified power distribution. Since it cannot be guaranteed that the region Ω contains all the subgroups near the first sound source that may affect the positioning of the second sound source, the screening method used when positioning the first sound source is still adopted, that is, in the first layer search, the subgroup with the largest power value is selected from the subgroups not contained in the region Ω as the first candidate subgroup of the second sound source, and then the remaining subgroups in the lookup table are screened, so as to reduce the possibility of mixing in the subgroups with the first sound source as the main power contributing sound source. Suppose that u is already present 2 The subgroup is selected as candidate subgroup in the second sound source, and then v is determined in the rest subgroups 2 The conditions for whether the individual subgroups are candidate subgroups are:
wherein, the liquid crystal display device comprises a liquid crystal display device,represents the v th 2 Center coordinates of the groups, wherein>Expressed as b in the existing candidate group in Cartesian coordinate system 2 Center coordinates of the groups, (X c2 ,Y c2 ,Z c2 ) Expressed as average coordinates, theta, of all the current candidate group center coordinates averaged in a Cartesian coordinate system c2 For the azimuth angle of the average coordinates averaged over all the current candidate group center coordinates +.>For the elevation angle of the average coordinates averaged over the center coordinates of all candidate subgroups at present +.>Azimuth angle expressed as the central coordinate position of the current group to be discriminated, +.>Elevation angle, θ, expressed as center coordinates of the current group to be discriminated 1 Expressed as azimuth threshold, ++>Represented as elevation threshold.
When candidate group u 2 After the number of the (B) reaches a certain value n or all subgroups are judged, screening is stopped. Calculating the corresponding power P (q) values of all primary coordinates in the candidate group, sorting, and selecting one corresponding to the maximum power valueThe stage coordinates are the coordinates of the second sound source.
It should be noted that the terms like "upper", "lower", "left", "right", "front", "rear", and the like are also used for descriptive purposes only and are not intended to limit the scope of the invention in which the invention may be practiced, but rather the relative relationship of the terms may be altered or modified without materially altering the teachings of the invention.
The above is only a preferred embodiment of the present invention, and the protection scope of the present invention is not limited to the above examples, and all technical solutions belonging to the concept of the present invention belong to the protection scope of the present invention. It should be noted that modifications and adaptations to the invention without departing from the principles thereof are intended to be within the scope of the invention as set forth in the following claims.
Claims (5)
1. A multi-sound source localization method suitable for a reverberant environment, comprising the steps of:
s1, collecting coordinates in the whole search area, grouping, and calculating the center coordinates of each group;
s2, collecting voice signals by using a microphone array;
s3, determining a candidate subgroup of a certain sound source position by adopting a multi-sound source positioning algorithm of double-layer search space clustering in a mode of calculating the central coordinate power of each subgroup, positioning the sound source position in all coordinates contained in the candidate subgroup, and removing coordinates near the sound source in a search area;
repeating the above operation until all sound source positions are positioned;
the specific method for determining a certain sound source candidate group is as follows: searching according to the result of calculating the central coordinate power value of each subgroup, selecting the subgroup corresponding to the maximum power value as a first candidate subgroup, and when judging the v-th subgroup in the rest subgroups, selecting the subgroup as the candidate subgroup under the following conditions:
|θ v -θ c |≤θ 1 ,
terminating the judgment of the candidate subgroup after the number u of the candidate subgroups reaches a certain number or all subgroups are judged;
wherein (X) b ,Y b ,Z b ) Expressed as the center coordinates of the b-th subgroup of the existing candidate subgroups in the Cartesian coordinate system, (X) cc ,Y cc ,Z cc ) Expressed as average coordinates, theta, of all the current candidate group center coordinates averaged in a Cartesian coordinate system c To average the azimuth of the average coordinates of the current candidate group center coordinates,for the elevation angle of the average coordinate averaged over the center coordinates of all the candidate subgroups at present, θ v Azimuth angle expressed as the central coordinate position of the current group to be discriminated, +.>Elevation angle, θ, expressed as center coordinates of the current group to be discriminated 1 Represented as an azimuth threshold value,represented as elevation threshold;
the specific contents of removing the coordinates near the sound source in the search area are as follows:
providing a region omega, uniformly reducing the power of the coordinate positions in the group and giving a power value E to the group contained in the region omega l Meanwhile, the small groups contained in the region omega are not considered in the subsequent step of positioning other sound source positions;
coordinates within a region Ω in a spherical coordinate systemThe requirements are as follows:
|θ-θ s |≤θ 2 ,
where, θ is expressed as azimuth of the coordinates,expressed as elevation angle of the coordinates, r represents distance of the coordinates from origin of the coordinate system, θ s Azimuth angle, expressed as the last sound source coordinate position of the currently located sound source, +>Elevation angle, θ, expressed as the last sound source coordinate position of the currently located sound source 2 Expressed as azimuth threshold, ++>Represented as elevation threshold.
2. A multi-sound source localization method for reverberant environments according to claim 1, wherein the grouping in step S1 is based on:
if the ith coordinate q i Belonging to the j-th group z j Then p (q i ∈z j ) Has a value of 1; if the ith coordinate q i Not belonging to the j-th group z j ,p(q i ∈z j ) The value of (2) is 0;
wherein I represents the total number of coordinates in the whole search area, J represents the current group number, z j Represented as all coordinate sets in the j-th group; wherein the initial value of J is 1, and sequentially adding 1 until the formulaIt holds that with the change of i and j, p (q i ∈z j )、e(q i ,z j ) And z j The center coordinates of the (B) are calculated through a K-means algorithm;
wherein e (q) i ,z j ) Representing a group error, defined as the difference in sound path between all microphone pairsAnd (3) summing;representing the distance position q between microphone k and microphone l i TDOA value of>Representing the distance set z between microphone k and microphone l j TDOA value of center coordinates, M represents the number of microphones, θ t Represented as a threshold.
3. A multi-sound source localization method for reverberant environments according to claim 2, wherein the threshold θ t The definition is as follows:
where λ represents the wavelength, c is the speed of sound, and f is the sampling rate; in sound source localization, θ t The value is determined by the maximum frequency of the speech signal.
4. A multi-sound source localization method for reverberant environments according to claim 3, wherein the microphone array uses 16 microphones to collect the voice signals, the microphone array is cylindrical as a whole, and 8 microphones are uniformly distributed on the upper and lower contours.
5. A multi-sound source localization method for reverberant environments according to claim 2, wherein,
in the frequency domain, the formula for calculating the coordinate power is expressed as:
wherein Y (ω, q) is expressed as an output value of the coordinate position q at the frequency ω, X m (omega) Fourier transform denoted as mth microphone signal, G m (ω) represents the frequency domain system function of the filter at the mth microphone;
based on a formula for calculating coordinate power in a frequency domain, a power output value P (q) of the coordinate position q is obtained as follows:
wherein G is l (omega) is expressed as a frequency domain system function of the filter at the first microphone, X l (ω) is denoted as the fourier transform of the first microphone signal,denoted as kthConjugation of the frequency domain system function of the filter at the microphone,/->Represented as the conjugate of the fourier transform of the kth microphone signal, τ k,q Representing signal propagation times for coordinate position q through the kth microphone;
in the method, in the process of the invention, a PHAT weighting coefficient between the ith microphone signal and the kth microphone signal;
after calculating the central coordinate power of each group, determining a candidate group, locating a sound source position in all coordinates contained in the candidate group, and determining the sound source position as the maximum power valueCorresponding coordinates, namely:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110684270.9A CN113419216B (en) | 2021-06-21 | 2021-06-21 | Multi-sound source positioning method suitable for reverberant environment |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110684270.9A CN113419216B (en) | 2021-06-21 | 2021-06-21 | Multi-sound source positioning method suitable for reverberant environment |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113419216A CN113419216A (en) | 2021-09-21 |
CN113419216B true CN113419216B (en) | 2023-10-31 |
Family
ID=77789393
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110684270.9A Active CN113419216B (en) | 2021-06-21 | 2021-06-21 | Multi-sound source positioning method suitable for reverberant environment |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113419216B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115662383B (en) * | 2022-12-22 | 2023-04-14 | 杭州爱华智能科技有限公司 | Method and system for deleting main sound source, method, system and device for identifying multiple sound sources |
CN117828405B (en) * | 2024-02-23 | 2024-05-07 | 兰州交通大学 | Signal positioning method based on intelligent frequency spectrum sensing |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106093864A (en) * | 2016-06-03 | 2016-11-09 | 清华大学 | A kind of microphone array sound source space real-time location method |
US9554208B1 (en) * | 2014-03-28 | 2017-01-24 | Marvell International Ltd. | Concurrent sound source localization of multiple speakers |
CN106940439A (en) * | 2017-03-01 | 2017-07-11 | 西安电子科技大学 | K mean cluster weighting sound localization method based on wireless acoustic sensor network |
CN108198568A (en) * | 2017-12-26 | 2018-06-22 | 太原理工大学 | A kind of method and system of more auditory localizations |
CN110443371A (en) * | 2019-06-25 | 2019-11-12 | 深圳欧克曼技术有限公司 | A kind of artificial intelligence device and method |
CN111352075A (en) * | 2018-12-20 | 2020-06-30 | 中国科学院声学研究所 | Underwater multi-sound-source positioning method and system based on deep learning |
CN111474521A (en) * | 2020-04-09 | 2020-07-31 | 南京理工大学 | Sound source positioning method based on microphone array in multipath environment |
CN111489753A (en) * | 2020-06-24 | 2020-08-04 | 深圳市友杰智新科技有限公司 | Anti-noise sound source positioning method and device and computer equipment |
-
2021
- 2021-06-21 CN CN202110684270.9A patent/CN113419216B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9554208B1 (en) * | 2014-03-28 | 2017-01-24 | Marvell International Ltd. | Concurrent sound source localization of multiple speakers |
CN106093864A (en) * | 2016-06-03 | 2016-11-09 | 清华大学 | A kind of microphone array sound source space real-time location method |
CN106940439A (en) * | 2017-03-01 | 2017-07-11 | 西安电子科技大学 | K mean cluster weighting sound localization method based on wireless acoustic sensor network |
CN108198568A (en) * | 2017-12-26 | 2018-06-22 | 太原理工大学 | A kind of method and system of more auditory localizations |
CN111352075A (en) * | 2018-12-20 | 2020-06-30 | 中国科学院声学研究所 | Underwater multi-sound-source positioning method and system based on deep learning |
CN110443371A (en) * | 2019-06-25 | 2019-11-12 | 深圳欧克曼技术有限公司 | A kind of artificial intelligence device and method |
CN111474521A (en) * | 2020-04-09 | 2020-07-31 | 南京理工大学 | Sound source positioning method based on microphone array in multipath environment |
CN111489753A (en) * | 2020-06-24 | 2020-08-04 | 深圳市友杰智新科技有限公司 | Anti-noise sound source positioning method and device and computer equipment |
Non-Patent Citations (7)
Title |
---|
Fast sound source localization using two-level search space clustering;YOOK D;《IEEE Transactions on Cybernetics》;第第46卷卷(第第1期期);摘要、第1-5章,图1-5 * |
MULTIPLE SOUND SOURCE LOCALIZATION BASED ON TDOA CLUSTERING AND MULTI-PATH MATCHING PURSUIT.《IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)》.2018,全文. * |
Multi-source sound localization using the competitive k-means clustering.《2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010)》.2010,全文. * |
基于三线交点球麦克风阵列的远场多声源定位;庄启雷;黄青华;;上海大学学报(自然科学版)(第02期);全文 * |
基于多阵列数据融合的宽带多声源定位研究;滕鹏晓;杨亦春;李晓东;田静;;应用声学(第03期);全文 * |
基于子带可控响应功率的多声源定位方法;倪志莲;蔡卫平;张怡典;;计算机工程与应用(第24期);全文 * |
基于相位差复指数变换的传声器多声源定位;赵小燕;汤捷;周琳;吴镇扬;;东南大学学报(自然科学版)(第02期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN113419216A (en) | 2021-09-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107102296B (en) | Sound source positioning system based on distributed microphone array | |
CN113419216B (en) | Multi-sound source positioning method suitable for reverberant environment | |
CN108375763B (en) | Frequency division positioning method applied to multi-sound-source environment | |
CN110488223A (en) | A kind of sound localization method | |
CN108696932A (en) | It is a kind of using CSI multipaths and the outdoor fingerprint positioning method of machine learning | |
CN104142492A (en) | SRP-PHAT multi-source spatial positioning method | |
CN112904279B (en) | Sound source positioning method based on convolutional neural network and subband SRP-PHAT spatial spectrum | |
CN108731886A (en) | A kind of more leakage point acoustic fix ranging methods of water supply line based on iteration recursion | |
CN101620270A (en) | Wireless location method based on cluster-fusion | |
CN110534126A (en) | A kind of auditory localization and sound enhancement method and system based on fixed beam formation | |
CN107167770A (en) | A kind of microphone array sound source locating device under the conditions of reverberation | |
CN111856402B (en) | Signal processing method and device, storage medium and electronic device | |
CN109884591A (en) | A kind of multi-rotor unmanned aerial vehicle acoustical signal Enhancement Method based on microphone array | |
CN109212481A (en) | A method of auditory localization is carried out using microphone array | |
Badawy et al. | Decision tree approach to estimate user location in WLAN based on location fingerprinting | |
Luo et al. | A new passive source localization method using AOA-GROA-TDOA in wireless sensor array networks and its Cramér-Rao bound analysis | |
CN109633554A (en) | Moving sound based on probabilistic data association reaches delay time estimation method | |
CN116559778B (en) | Vehicle whistle positioning method and system based on deep learning | |
CN110208736B (en) | Non-circular signal uniform array direction-of-arrival angle estimation method based on fourth-order cumulant | |
CN1402586A (en) | Method and device for estimating position by signal arrival time | |
Himawan et al. | Clustering of ad-hoc microphone arrays for robust blind beamforming | |
CN114269014B (en) | Large-scale MIMO dynamic environment fingerprint positioning method based on domain adaptive network | |
Jia et al. | Acoustic passive localization algorithm based on wireless sensor networks | |
Shi et al. | CSI-based fingerprinting for indoor localization with multi-scale convolutional neural network | |
CN113189544A (en) | Multi-sound-source positioning method for removing outliers by weighting of activity intensity vectors |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |