WO2023173514A1 - Sound source positioning method under strong multipath interference condition - Google Patents

Sound source positioning method under strong multipath interference condition Download PDF

Info

Publication number
WO2023173514A1
WO2023173514A1 PCT/CN2022/085493 CN2022085493W WO2023173514A1 WO 2023173514 A1 WO2023173514 A1 WO 2023173514A1 CN 2022085493 W CN2022085493 W CN 2022085493W WO 2023173514 A1 WO2023173514 A1 WO 2023173514A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound source
microphone array
microphone
sound
method under
Prior art date
Application number
PCT/CN2022/085493
Other languages
French (fr)
Chinese (zh)
Inventor
俞李斌
王智
林俊光
席玉章
孙士恩
董益华
马聪
Original Assignee
浙江浙能技术研究院有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 浙江浙能技术研究院有限公司 filed Critical 浙江浙能技术研究院有限公司
Publication of WO2023173514A1 publication Critical patent/WO2023173514A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G01S5/20Position of source determined by a plurality of spaced direction-finders
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01MTESTING STATIC OR DYNAMIC BALANCE OF MACHINES OR STRUCTURES; TESTING OF STRUCTURES OR APPARATUS, NOT OTHERWISE PROVIDED FOR
    • G01M3/00Investigating fluid-tightness of structures
    • G01M3/02Investigating fluid-tightness of structures by using fluid or vacuum
    • G01M3/04Investigating fluid-tightness of structures by using fluid or vacuum by detecting the presence of fluid at the leakage point
    • G01M3/24Investigating fluid-tightness of structures by using fluid or vacuum by detecting the presence of fluid at the leakage point using infrasonic, sonic, or ultrasonic vibrations
    • G01M3/243Investigating fluid-tightness of structures by using fluid or vacuum by detecting the presence of fluid at the leakage point using infrasonic, sonic, or ultrasonic vibrations for pipes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/15Correlation function computation including computation of convolution operations

Definitions

  • the invention relates to a sound source positioning method under strong multipath interference conditions, and belongs to the field of sound source positioning.
  • Sound source localization has very important application value.
  • the leak point can be located based on the noise emitted by the leak point, so that the leak point can be discovered early and repaired.
  • the sound propagation environment is complex, so the microphone array will be affected by multipath phenomena in the process of receiving sound signals, resulting in errors in determining the direction of the sound source that are difficult to eliminate.
  • the technical problem to be solved by the present invention is to overcome the shortcomings of the existing technology and provide a sound source positioning method under strong multipath interference conditions.
  • the present invention adopts the following technical solutions:
  • a sound source localization method under strong multipath interference conditions including the following steps:
  • Step S1 Build a microphone array
  • Step S3 Calculate the signal gain ratio of the microphone array in the sound propagation direction corresponding to ⁇ p ;
  • Step S4 Select the reconstruction scheme of the microphone array through the signal gain ratio, repeat steps S1 and S2 according to the reconstruction scheme to change ⁇ p , and set the threshold ⁇ . If the change of ⁇ p is less than ⁇ , then according to the changed The sound propagation direction corresponding to ⁇ p is used to determine the sound source position. If the change in ⁇ p is not less than ⁇ , continue to repeat steps S3, S1 and S2 in sequence until the change in ⁇ p is less than ⁇ .
  • the characteristic frequency detected by the microphone array is then used to predict the direction of the sound source.
  • a reconstruction plan is generated based on the signal gain ratio in the predicted direction, and the microphone array is reconstructed based on the reconstruction plan, and the above process is repeated. , gradually reducing the impact of the multipath effect on the numerical value of the characteristic frequency detected by the microphone array, and at the same time reducing the impact on the judgment of the direction of the sound source. Taking advantage of the positive correlation between the above two effects, when the change in the characteristic frequency before and after reconstruction is less than the threshold, It can be considered that the multipath effect can be ignored, so the predicted direction obtained based on the characteristic frequency at this time can be used as the sound source direction.
  • q is 2 or 3 or 4 or 5, and the value of q remains unchanged before and after steps S1 and S2 are repeated.
  • the microphone array of the present invention includes N microphones, one of which is a central microphone, and the other microphones are peripheral microphones and are arranged sequentially along the circumferential direction of the central microphone.
  • the spherical coordinates are established with the central microphone as the origin, and any point in the spherical coordinates is connected to the central microphone.
  • the elevation angle of the line in spherical coordinates is And the horizontal angle is ⁇
  • the distance between this point and the j-th peripheral microphone is r j
  • the elevation angle of the line connecting this point and the j-th peripheral microphone in spherical coordinates is And the horizontal angle is ⁇ j
  • set the function in v is the speed of sound
  • i is the imaginary unit
  • the elevation angle of the sound propagation direction corresponding to ⁇ p in spherical coordinates is And the horizontal angle is ⁇ p
  • the peripheral microphones are evenly distributed along the circumferential direction of the central microphone. At this time, the distance between two adjacent peripheral microphones is d min .
  • the position movement distance of the jth peripheral microphone before and after the microphone array is reconstructed is ⁇ d j . ⁇ d j ⁇ d min .
  • the present invention uses a multi-objective optimization algorithm of quantum particle swarm to generate a reconstruction plan for the microphone array to ensure that the change in ⁇ p is reduced after each reconstruction of the microphone array.
  • step S3 of the present invention the signal gain ratio is calculated for the sounds of multiple characteristic frequencies in their respective propagation directions and the weighted average is calculated.
  • step S4 the reconstruction scheme is selected based on the weighted average of the signal gain ratio.
  • Figure 1 is a flow chart of a sound source localization method under strong multipath interference conditions according to Embodiment 1 of the present invention.
  • the sound source emits sound with a certain characteristic frequency. Due to the influence of the multipath effect, in addition to receiving the sound directly emitted by the sound source, the microphone array will also receive a certain degree of noise. This part of the noise will interact with the sound directly emitted by the sound source. Destructive or constructive, therefore, when the microphone array performs data processing on the received sound signal, it will not only misestimate the value of the characteristic frequency at the sound source, but also misjudge the direction of the sound source. This is the main reason why multipath phenomenon interferes with sound source localization.
  • the sound source localization method under interference conditions includes the following steps:
  • Step S1 Construct a microphone array, in which there are a total of N microphones in the microphone array.
  • One microphone is the central microphone, and the other microphones are peripheral microphones and are arranged sequentially along the circumferential direction of the central microphone. That is, the total number of peripheral microphones is N-1. All microphones are always in the same plane;
  • the peripheral microphones are evenly distributed along the circumferential direction of the central microphone. At this time, the distance between two adjacent peripheral microphones is d min ;
  • Step S2 Set a reference direction and use the current microphone array to receive the sound signal from an unknown sound source.
  • the microphone array processes ⁇ p through the MUSIC algorithm, and the corresponding characteristic frequency can be obtained.
  • the angle D p between the sound propagation direction and the reference direction.
  • the characteristic frequency obtained by the first constructed microphone array is The characteristic frequency is The sound signal is filtered and retained Sound signals within the neighborhood can be calculated
  • the corresponding sound propagation direction, the angle between the sound propagation direction and the reference direction is can represent Corresponding sound propagation direction, perform the above steps multiple times on the sound signal received by the microphone array to obtain respectively
  • the angle between the corresponding sound propagation direction and the reference direction Due to the multipath effect, There is a certain difference between them. Since the sound source position is unique, therefore The difference between them won’t be too big;
  • Step S3 For one of the ⁇ p s , the signal gain ratio T p in the corresponding sound propagation direction (referred to as the D p direction) can be solved.
  • the signal gain ratio T p represents the sound intensity gradient in the D p direction.
  • the signal gain ratio The larger the value, the more concentrated the signal intensity is in the D p direction.
  • the less obvious the influence of the multipath effect is in the D p direction.
  • the smaller the signal gain ratio is, the more dispersed the signal intensity is in the D p direction.
  • the influence of multipath effect is more obvious in the D p direction;
  • Step S4 Generate a reconstruction plan for the microphone array with the signal gain ratio as the optimization target, and repeat steps S1 and S2 according to the reconstruction plan to change ⁇ p ;
  • the positions of the central microphone and the reference direction will not change, and the number of microphones will not change. Only the positions of the peripheral microphones will change;
  • the microphone array constructed for the second time As an example, the sound signal is re-collected at this time, and the current characteristic frequency is obtained: Then perform filtering separately to obtain the angle between the corresponding sound propagation direction and the reference direction. and One-to-one correspondence between them, and There is a one-to-one correspondence between them. Since the arrangement of the microphone array changes between the first build and the second build, the corresponding and not equal, and Also not the same;
  • the characteristic frequency of the sound signal collected by the microphone array constructed for the gth time is Then perform filtering separately to obtain the angle between the corresponding sound propagation direction and the reference direction.
  • the signal gain ratio in the direction is In the second constructed microphone array, exist The signal gain ratio in the direction is By analogy, in the microphone array constructed for the gth time, exist The signal gain ratio in the direction is
  • the meaning of generating the reconstruction scheme of the microphone array with the signal gain ratio as the optimization goal is to ensure that Its meaning indicates that the multipath effect is The influence in the direction is compared with the is smaller in the direction, the sound signal is in More concentrated in direction, The direction is closer to the direction of the actual sound source.
  • the specific generation method of the reconstruction plan will be explained later in this embodiment.
  • the direction can be used as the actual sound propagation direction
  • the numerical value can be used as the angle between the connection line between the sound source and the center microphone of the microphone array constructed at the g+1th time and the reference direction, thereby obtaining the sound source direction position information (ie, the sound source positioning in this embodiment).
  • the spherical coordinates are established with the center microphone as the origin.
  • the elevation angle of any point in the spherical coordinates connected to the center microphone in the spherical coordinates is And the horizontal angle is ⁇ , and the distance between this point and the j-th peripheral microphone is The elevation angle of the line connecting this point and the jth peripheral microphone in spherical coordinates is And the horizontal angle is set function in v is the speed of sound, i is the imaginary unit, Represents the sum of the normalized signal strengths sent to the center microphone and each peripheral microphone at any point in the space.
  • Corresponding sound propagation direction (abbreviation direction) in spherical coordinates the elevation angle is And the horizontal angle is but signal gain ratio in direction Similarly, in the microphone array constructed at the g+1th time, it can be solved
  • the multi-objective optimization algorithm of quantum particle swarm is used to generate the reconstruction plan of the microphone array.
  • the particles corresponding to the microphone array are P 1, 1.
  • M-1 particles are randomly generated, which can be recorded as P 2, 1 , P 3, 1, .... .., P M,1 , which can be known through the detection of particle P 1,1 and
  • P 1,1 , P 2,1 ,..., P M,1 can be obtained respectively through calculation.
  • the signal gain ratio in the direction is T 1,1 , T 2,1 ,..., TM,1 . (at this time That is T 1,1 ).
  • P 1,1 , P 2,1 ,..., PM ,1 to generate P 1,2 , P 2,2 ,..., PM ,2 .
  • Select a particle from P 1,2 , P 2,2 ,..., PM,2 as the reconstruction plan, so that the form of the microphone array meets the reconstruction plan when the microphone array is constructed for the second time Then the microphone array constructed for the second time can be learned through actual inspection. and
  • P 1,2 , P 2,2 ,..., P M,2 can be obtained respectively through calculation.
  • the signal gain ratio in the direction is T 1, 2 , T 2, 2 ,..., T M,2 , and so on.
  • P 1, g , P has been obtained 2, g ,..., P M,g , at this time it can be known through detection and Correspondingly, through calculation we can obtain P 1,g ,P 2,g ,...,P M,g in
  • the signal gain ratio in the direction is T 1,g , T 2,g ,..., T M,g , and then calculated to obtain P 1,g+1 , P 2,g+1 ,... ., P M,g+1 and select a particle from it as the reconstruction plan to construct the g+1th microphone array.
  • the particle P m,1 (1 ⁇ m ⁇ M) continuously evolves and iterates through calculation, and generates particles P m,2 , P m,3 ,..., then in the gth construction
  • the particle P corresponding to max(T m, 1 , T m, 2 ,..., T m, g ) is the historical maximum.
  • Optimal coordinate P m, best , at this time max (T 1, 1 , T 1, 2 ,..., T 1,g , T 2, 1, T 2 , 2 ,...,
  • the particle P corresponding to T 2,g ,...,TM ,1 , TM,2 ,..., TM,g ) is the global historical optimal coordinate G best . It can be found that as the value of g increases, the value of P m, best may change or remain unchanged, but the total number of P m, best is always M.
  • ⁇ 1 and ⁇ 2 are random numbers greater than 0 and less than 1, and ⁇ is the expansion coefficient.
  • the historical optimal coordinates P m, best and the global historical optimal coordinates G best are used to iteratively update the particles to ensure that as the microphone array As the number of reconstructions increases, the entire particle evolves in the direction of gradually increasing signal gain ratio.
  • its corresponding particle is P f, g , using P f, g+1 as the construction plan for the g+1th microphone array can minimize the impact of the multipath effect on the reconstructed microphone array.
  • By controlling the value of ⁇ , we avoid generating P 1,g +1,P 2,g +1,... through P 1,g,P 2,g , ...,PM,g .
  • the microphone array can obtain greater The signal gain ratio also avoids ⁇ being too small, which results in too many microphone array reconstructions, so that the sound source direction can be obtained as quickly as possible.
  • adjusting ⁇ essentially the iteration step size
  • the particle distance ⁇ to obtain the reconstruction plan can maximize the increase in the signal gain ratio before and after the microphone array is reconstructed, and ensure After the microphone array is reconstructed, the signal gain ratio will not decrease, that is, the particles will diverge.
  • the position movement distance of the jth peripheral microphone before and after the microphone array reconstruction is ⁇ d j , ⁇ d j ⁇ d min .
  • P 1,g ,P 2,g ,...,P M,g are called particles of the gth generation.
  • Particles with the same g are called particles of the same generation, and the characteristic frequencies of particles of the same generation are the same.
  • Which specific particle to select as the reconstruction solution for the microphone array from particles of the same generation is judged by the signal gain ratio.
  • the particle with the largest signal gain ratio in the propagation direction corresponding to a single characteristic frequency sound is used as the reconstruction plan.
  • the signal gain ratio of a single particle in the corresponding propagation direction of each characteristic frequency sound is solved and then the average value is obtained, and then the average signal gain ratio of a single particle is obtained, and then The particle with the largest signal gain ratio to the average value is used as the reconstruction plan.
  • the average value of the signal gain ratio in this embodiment is a weighted average.
  • the smaller the value of the characteristic frequency the smaller the weight of the signal gain ratio of the sound signal in the corresponding propagation direction when calculating the weighted average.
  • the reason is that the multipath effect has a relatively small impact on the low-frequency part of the sound signal, while the high-frequency part has a significant impact. Therefore, adding weight to the signal gain ratio corresponding to the higher characteristic frequency, and then realizing particle iteration, can make the average signal gain ratio Value increases more quickly.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

A sound source positioning method under a strong multipath interference condition, comprising the following steps: constructing a microphone array (S1); collecting a sound source signal by means of the microphone array, thereby determining a characteristic frequency (ω p) of the sound source signal and the sound propagation direction; calculating a signal gain ratio; selecting a reconstruction scheme of the microphone array according to the signal gain ratio, and changing the characteristic frequency (ω p) according to the reconstruction scheme; using the characteristic frequency (ω p) detected by the microphone array to predict the sound source direction, generating a reconstruction scheme according to the signal gain ratio in the predicted direction, reconstructing the microphone array according to the reconstruction scheme, repeating the aforementioned process to gradually reduce the influence of multipath effects on the characteristic frequency (ω p) value detected by the microphone array while mitigating the influence on the judgment of the sound source direction, and when a change of the characteristic frequency (ω p) before and after reconstruction is smaller than a threshold value, using a predicted direction obtained according to the characteristic frequency (ω p) as a sound source direction.

Description

一种强多径干扰条件下的声源定位方法A sound source localization method under strong multipath interference conditions 技术领域Technical field
本发明涉及一种强多径干扰条件下的声源定位方法,属于声源定位领域。The invention relates to a sound source positioning method under strong multipath interference conditions, and belongs to the field of sound source positioning.
背景技术Background technique
声源定位具有十分重要的应用价值,例如在管道泄漏场景下,根据泄漏点发出的噪声能够对泄漏点进行定位,从而及早发现泄漏点并进行维修。但是声音传播环境是复杂的,因此麦克风阵列在接收声音信号的过程中会受到多径现象的影响,导致对声源的方向判断产生难以消除的误差。Sound source localization has very important application value. For example, in a pipeline leak scenario, the leak point can be located based on the noise emitted by the leak point, so that the leak point can be discovered early and repaired. However, the sound propagation environment is complex, so the microphone array will be affected by multipath phenomena in the process of receiving sound signals, resulting in errors in determining the direction of the sound source that are difficult to eliminate.
发明内容Contents of the invention
本发明所要解决的技术问题在于克服现有技术的不足而提供一种强多径干扰条件下的声源定位方法。The technical problem to be solved by the present invention is to overcome the shortcomings of the existing technology and provide a sound source positioning method under strong multipath interference conditions.
解决上述技术问题,本发明采用如下技术方案:To solve the above technical problems, the present invention adopts the following technical solutions:
一种强多径干扰条件下的声源定位方法,包括如下步骤:A sound source localization method under strong multipath interference conditions, including the following steps:
步骤S1:构建麦克风阵列;Step S1: Build a microphone array;
步骤S2:麦克风阵列采集声源信号,从而确定声源信号的特征频率ω p,p=1,2,......,q,对声源信号进行滤波,以保留ω p邻域范围内的声音信号,继而得到ω p对应的声音传播方向; Step S2: The microphone array collects the sound source signal to determine the characteristic frequency ω p of the sound source signal, p = 1, 2,..., q, and filter the sound source signal to retain the ω p neighborhood range. The sound signal within ω p is then obtained, and the sound propagation direction corresponding to ω p is obtained;
步骤S3:计算麦克风阵列在ω p对应的声音传播方向上的信号增益比; Step S3: Calculate the signal gain ratio of the microphone array in the sound propagation direction corresponding to ω p ;
步骤S4:通过信号增益比选择麦克风阵列的重构方案,依据重构方案重复步骤S1和S2,以使ω p发生改变,设定阈值δ,若ω p的变化量小于δ,则依据变化后的ω p对应的声音传播方向对声源位置进行判断,若ω p的变化量不小于δ,则继续依次重复步骤S3、S1和S2,直至ω p的变化量小于δ。 Step S4: Select the reconstruction scheme of the microphone array through the signal gain ratio, repeat steps S1 and S2 according to the reconstruction scheme to change ω p , and set the threshold δ. If the change of ω p is less than δ, then according to the changed The sound propagation direction corresponding to ω p is used to determine the sound source position. If the change in ω p is not less than δ, continue to repeat steps S3, S1 and S2 in sequence until the change in ω p is less than δ.
本发明的有益效果为:The beneficial effects of the present invention are:
麦克风阵列检测得到的特征频率,进而对声源方向获得了一个预判,通过该预判方向上的信号增益比生成重构方案,并依据该重构方案对麦克风阵列进行重构,重复上述过程,逐渐降低多径效应对麦克风阵列检测到的特征频率数值影响,同时还降低了对声源方向判断的影响,利用上述两种影响呈正相关的特性,待重构前后特征频率变化小于阈值时,可以认为多径影响已经可以忽略,故而可以将此时根据特征频率获得的预判方向作为声源方向。The characteristic frequency detected by the microphone array is then used to predict the direction of the sound source. A reconstruction plan is generated based on the signal gain ratio in the predicted direction, and the microphone array is reconstructed based on the reconstruction plan, and the above process is repeated. , gradually reducing the impact of the multipath effect on the numerical value of the characteristic frequency detected by the microphone array, and at the same time reducing the impact on the judgment of the direction of the sound source. Taking advantage of the positive correlation between the above two effects, when the change in the characteristic frequency before and after reconstruction is less than the threshold, It can be considered that the multipath effect can be ignored, so the predicted direction obtained based on the characteristic frequency at this time can be used as the sound source direction.
本发明q为2或3或4或5,步骤S1和S2重复前后q的取值不变。In the present invention, q is 2 or 3 or 4 or 5, and the value of q remains unchanged before and after steps S1 and S2 are repeated.
本发明麦克风阵列包括N个麦克风,其中一个麦克风为中心麦克风,其他麦克风为外周麦克风并沿中心麦克风的周向依次排布,以中心麦克风为原点建立球坐标,球坐标中任意一点与中心麦克风连线在球坐标中仰角为
Figure PCTCN2022085493-appb-000001
且水平角为θ,该点与第j个外周麦克风之间间距为r j,该点与第j个外周麦克风的连线在球坐标中仰角为
Figure PCTCN2022085493-appb-000002
且水平角为θ j,设定函数
Figure PCTCN2022085493-appb-000003
其中
Figure PCTCN2022085493-appb-000004
v为声速,i为虚数单位,ω p对应的声音传播方向在球坐标中仰角为
Figure PCTCN2022085493-appb-000005
且水平角为θ p,则在ω p对应的声音传播方向上的信号增益比
Figure PCTCN2022085493-appb-000006
The microphone array of the present invention includes N microphones, one of which is a central microphone, and the other microphones are peripheral microphones and are arranged sequentially along the circumferential direction of the central microphone. The spherical coordinates are established with the central microphone as the origin, and any point in the spherical coordinates is connected to the central microphone. The elevation angle of the line in spherical coordinates is
Figure PCTCN2022085493-appb-000001
And the horizontal angle is θ, the distance between this point and the j-th peripheral microphone is r j , and the elevation angle of the line connecting this point and the j-th peripheral microphone in spherical coordinates is
Figure PCTCN2022085493-appb-000002
And the horizontal angle is θ j , set the function
Figure PCTCN2022085493-appb-000003
in
Figure PCTCN2022085493-appb-000004
v is the speed of sound, i is the imaginary unit, and the elevation angle of the sound propagation direction corresponding to ω p in spherical coordinates is
Figure PCTCN2022085493-appb-000005
And the horizontal angle is θ p , then the signal gain ratio in the sound propagation direction corresponding to ω p
Figure PCTCN2022085493-appb-000006
本发明首次构建麦克风阵列时外周麦克风沿中心麦克风周向均匀分布,此时相邻两个外周麦克风的间距为d min,第j个外周麦克风在麦克风阵列重构前后的位置移动距离为Δd j,Δd j<d minWhen the microphone array is constructed for the first time in the present invention, the peripheral microphones are evenly distributed along the circumferential direction of the central microphone. At this time, the distance between two adjacent peripheral microphones is d min . The position movement distance of the jth peripheral microphone before and after the microphone array is reconstructed is Δd j . Δd j <d min .
本发明采用量子粒子群的多目标优化算法生成麦克风阵列的重构方案,以确保麦克风阵列每次重构后ω p的变化量减少。 The present invention uses a multi-objective optimization algorithm of quantum particle swarm to generate a reconstruction plan for the microphone array to ensure that the change in ω p is reduced after each reconstruction of the microphone array.
本发明步骤S3中对多个特征频率的声音在各自传播方向上计算信号增益比 并计算加权平均值,步骤S4中通过信号增益比的加权平均值选择重构方案。In step S3 of the present invention, the signal gain ratio is calculated for the sounds of multiple characteristic frequencies in their respective propagation directions and the weighted average is calculated. In step S4, the reconstruction scheme is selected based on the weighted average of the signal gain ratio.
本发明将每一种麦克风阵列的排列方式作为一种粒子,当前粒子和最优粒子间距为γ,若γ<1,则膨胀系数α=0.5+γ,若γ=1,则α=1.8。The present invention regards each microphone array arrangement as a kind of particle, and the distance between the current particle and the optimal particle is γ. If γ<1, then the expansion coefficient α=0.5+γ, and if γ=1, then α=1.8.
本发明的其他特点和优点将会在下面的具体实施方式、附图中详细的揭露。Other features and advantages of the present invention will be disclosed in detail in the following detailed description and drawings.
【附图说明】[Picture description]
下面结合附图对本发明做进一步的说明:The present invention will be further described below in conjunction with the accompanying drawings:
图1为本发明实施例1的强多径干扰条件下的声源定位方法的流程图。Figure 1 is a flow chart of a sound source localization method under strong multipath interference conditions according to Embodiment 1 of the present invention.
具体实施方式Detailed ways
下面结合本发明实施例的附图对本发明实施例的技术方案进行解释和说明,但下述实施例仅为本发明的优选实施例,并非全部。基于实施方式中的实施例,本领域技术人员在没有做出创造性劳动的前提下所获得其他实施例,都属于本发明的保护范围。The technical solutions of the embodiments of the present invention will be explained and described below with reference to the accompanying drawings of the embodiments of the present invention. However, the following embodiments are only preferred embodiments of the present invention and are not exhaustive. Based on the examples in the implementation mode, other embodiments obtained by those skilled in the art without any creative work shall fall within the protection scope of the present invention.
在下文描述中,出现诸如术语“内”、“外”、“上”、“下”、“左”、“右”等指示方位或者位置关系仅是为了方便描述实施例和简化描述,而不是指示或暗示所指的装置或者元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。In the following description, the occurrence of terms such as "inside", "outside", "upper", "lower", "left", "right", etc. to indicate the orientation or positional relationship are only for the convenience of describing the embodiments and simplifying the description, rather than Any indication or implication that the referred device or element must have a specific orientation, be constructed and operate in a specific orientation should not be construed as a limitation on the invention.
实施例1:Example 1:
声源发出具有一定特征频率的声音,由于多径效应的影响,麦克风阵列除了接收声源直接发出的声音以外,还会接收到一定程度的噪音,这部分噪音会与声源直接发出的声音发生相消或者相长,由此,麦克风阵列在对接收到的声音信号进行数据处理过程中,不仅会对声源处特征频率的数值发生错估,而且还会对声源方向发生误判。这就是多径现象对声源定位干扰的主要原因。The sound source emits sound with a certain characteristic frequency. Due to the influence of the multipath effect, in addition to receiving the sound directly emitted by the sound source, the microphone array will also receive a certain degree of noise. This part of the noise will interact with the sound directly emitted by the sound source. Destructive or constructive, therefore, when the microphone array performs data processing on the received sound signal, it will not only misestimate the value of the characteristic frequency at the sound source, but also misjudge the direction of the sound source. This is the main reason why multipath phenomenon interferes with sound source localization.
由此可以发现一个特殊的现象,即,特征频率错估和声源方向误判二者是同时发生的,且二者程度呈现正相关特性,基于此,本实施例提供了一种强多径干扰条件下的声源定位方法,包括如下步骤:From this, a special phenomenon can be found, that is, the misestimation of the characteristic frequency and the misjudgment of the sound source direction occur at the same time, and the degree of the two shows a positive correlation characteristic. Based on this, this embodiment provides a strong multipath The sound source localization method under interference conditions includes the following steps:
步骤S1:构建麦克风阵列,其中麦克风阵列中总计共有N个麦克风,其中一个麦克风为中心麦克风,其他麦克风为外周麦克风并沿中心麦克风的周向依 次排布,即外周麦克风的总数为N-1,所有的麦克风始终处于同一个平面中;Step S1: Construct a microphone array, in which there are a total of N microphones in the microphone array. One microphone is the central microphone, and the other microphones are peripheral microphones and are arranged sequentially along the circumferential direction of the central microphone. That is, the total number of peripheral microphones is N-1. All microphones are always in the same plane;
第一次构建的麦克风阵列中,外周麦克风沿中心麦克风周向均匀分布,此时相邻两个外周麦克风的间距为d minIn the microphone array constructed for the first time, the peripheral microphones are evenly distributed along the circumferential direction of the central microphone. At this time, the distance between two adjacent peripheral microphones is d min ;
步骤S2:设定一个基准方向,利用当前的麦克风阵列接收一位置未知的声源发出的声音信号,麦克风阵列可以以此确定声源信号的特征频率ω p,p=1,2,......,q,特征频率为声音强度较大的频率,因此常规性的,在对不同的特征频率进行排序的时候,保证ω 1、ω 2......ω q对应的声音强度逐渐降低,在多径效应的影响下,声源处声音信号的特征频率数值和麦克风阵列接收到的声音信号特征频率数值会因此频率偏移而有所区别,但是两个声音信号中特征频率数量依然是相同的,即,麦克风阵列接收到的声音信号中每个特征频率都能在声源处的声音信号中找到对应的特征频率,麦克风阵列通过MUSIC算法对ω p进行处理,可以得到对应的声音传播方向和基准方向之间的夹角D pStep S2: Set a reference direction and use the current microphone array to receive the sound signal from an unknown sound source. The microphone array can use this to determine the characteristic frequency ω p of the sound source signal, p=1,2,... ..., q, the characteristic frequency is the frequency with greater sound intensity, so conventionally, when sorting different characteristic frequencies, the sound intensity corresponding to ω 1 , ω 2 ...ω q is guaranteed Gradually decrease, under the influence of the multipath effect, the characteristic frequency value of the sound signal at the sound source and the characteristic frequency value of the sound signal received by the microphone array will be different due to the frequency offset, but the number of characteristic frequencies in the two sound signals It is still the same, that is, each characteristic frequency in the sound signal received by the microphone array can find the corresponding characteristic frequency in the sound signal at the sound source. The microphone array processes ω p through the MUSIC algorithm, and the corresponding characteristic frequency can be obtained. The angle D p between the sound propagation direction and the reference direction.
为了便于后续进行描述,首次构建的麦克风阵列获取的特征频率为
Figure PCTCN2022085493-appb-000007
对特征频率为
Figure PCTCN2022085493-appb-000008
的声音信号进行滤波,保留
Figure PCTCN2022085493-appb-000009
邻域范围内的声音信号,可以计算得到
Figure PCTCN2022085493-appb-000010
对应的声音传播方向,该声音传播方向与基准方向之间的夹角为
Figure PCTCN2022085493-appb-000011
即可代表
Figure PCTCN2022085493-appb-000012
对应的声音传播方向,对麦克风阵列接收到的声音信号多次进行上述步骤,可以分别获得
Figure PCTCN2022085493-appb-000013
对应的声音传播方向和基准方向之间的夹角
Figure PCTCN2022085493-appb-000014
由于多径效应的影响,
Figure PCTCN2022085493-appb-000015
相互之间存在一定差值,由于声源位置唯一,因此
Figure PCTCN2022085493-appb-000016
之间的差值不会太大;
In order to facilitate subsequent description, the characteristic frequency obtained by the first constructed microphone array is
Figure PCTCN2022085493-appb-000007
The characteristic frequency is
Figure PCTCN2022085493-appb-000008
The sound signal is filtered and retained
Figure PCTCN2022085493-appb-000009
Sound signals within the neighborhood can be calculated
Figure PCTCN2022085493-appb-000010
The corresponding sound propagation direction, the angle between the sound propagation direction and the reference direction is
Figure PCTCN2022085493-appb-000011
can represent
Figure PCTCN2022085493-appb-000012
Corresponding sound propagation direction, perform the above steps multiple times on the sound signal received by the microphone array to obtain respectively
Figure PCTCN2022085493-appb-000013
The angle between the corresponding sound propagation direction and the reference direction
Figure PCTCN2022085493-appb-000014
Due to the multipath effect,
Figure PCTCN2022085493-appb-000015
There is a certain difference between them. Since the sound source position is unique, therefore
Figure PCTCN2022085493-appb-000016
The difference between them won’t be too big;
在一些特殊环境下,由于声音传播至麦克风阵列时经过不同次数的反射,因此针对
Figure PCTCN2022085493-appb-000017
可能会获得多个
Figure PCTCN2022085493-appb-000018
且不同
Figure PCTCN2022085493-appb-000019
之间差值较大,例如若
Figure PCTCN2022085493-appb-000020
获得了三个相互之间数值差距较大的
Figure PCTCN2022085493-appb-000021
同时
Figure PCTCN2022085493-appb-000022
对应的
Figure PCTCN2022085493-appb-000023
仅有一个,那么通过比较
Figure PCTCN2022085493-appb-000024
和不 同
Figure PCTCN2022085493-appb-000025
之间的差值,可以剔除明显错误的
Figure PCTCN2022085493-appb-000026
以使得
Figure PCTCN2022085493-appb-000027
Figure PCTCN2022085493-appb-000028
之间形成一一对应关系;
In some special environments, since sound undergoes different times of reflection when propagating to the microphone array, the
Figure PCTCN2022085493-appb-000017
May get multiple
Figure PCTCN2022085493-appb-000018
and different
Figure PCTCN2022085493-appb-000019
The difference between them is large, for example if
Figure PCTCN2022085493-appb-000020
Three values with large numerical differences between each other were obtained.
Figure PCTCN2022085493-appb-000021
at the same time
Figure PCTCN2022085493-appb-000022
corresponding
Figure PCTCN2022085493-appb-000023
There is only one, then by comparison
Figure PCTCN2022085493-appb-000024
different from
Figure PCTCN2022085493-appb-000025
The difference between them can eliminate obvious errors
Figure PCTCN2022085493-appb-000026
so that
Figure PCTCN2022085493-appb-000027
and
Figure PCTCN2022085493-appb-000028
There is a one-to-one correspondence between them;
步骤S3:针对其中一个ω p,可以求解其对应声音传播方向(简称D p方向)上的信号增益比T p,信号增益比T p表示的是在D p方向上声音强度梯度,信号增益比越大,表明信号强度在D p方向上越集中,相应的,在D p方向上多径效应的影响越不明显,与之相对,信号增益比越小,表明信号强度在D p方向上越分散,在D p方向上多径效应的影响越明显; Step S3: For one of the ω p s , the signal gain ratio T p in the corresponding sound propagation direction (referred to as the D p direction) can be solved. The signal gain ratio T p represents the sound intensity gradient in the D p direction. The signal gain ratio The larger the value, the more concentrated the signal intensity is in the D p direction. Correspondingly, the less obvious the influence of the multipath effect is in the D p direction. In contrast, the smaller the signal gain ratio is, the more dispersed the signal intensity is in the D p direction. The influence of multipath effect is more obvious in the D p direction;
步骤S4:以信号增益比为优化目标生成麦克风阵列的重构方案,依据重构方案重复步骤S1和S2,以使ω p发生改变; Step S4: Generate a reconstruction plan for the microphone array with the signal gain ratio as the optimization target, and repeat steps S1 and S2 according to the reconstruction plan to change ω p ;
麦克风阵列重构前后,中心麦克风以及基准方向的位置不会发生变化,麦克风数量也不会发生变化,仅对外周麦克风的位置进行改变;Before and after the microphone array is reconstructed, the positions of the central microphone and the reference direction will not change, and the number of microphones will not change. Only the positions of the peripheral microphones will change;
以第二次构建的麦克风阵列为例,此时重新采集声音信号,获得当前特征频率为
Figure PCTCN2022085493-appb-000029
而后分别进行滤波,可以分别得到对应的声音传播方向和基准方向之间的夹角
Figure PCTCN2022085493-appb-000030
Figure PCTCN2022085493-appb-000031
之间一一对应,
Figure PCTCN2022085493-appb-000032
Figure PCTCN2022085493-appb-000033
之间一一对应,由于麦克风阵列的排布方式在第一次构建和第二次构建时发生变化,相应的
Figure PCTCN2022085493-appb-000034
Figure PCTCN2022085493-appb-000035
不相等,
Figure PCTCN2022085493-appb-000036
Figure PCTCN2022085493-appb-000037
也不相同;
Taking the microphone array constructed for the second time as an example, the sound signal is re-collected at this time, and the current characteristic frequency is obtained:
Figure PCTCN2022085493-appb-000029
Then perform filtering separately to obtain the angle between the corresponding sound propagation direction and the reference direction.
Figure PCTCN2022085493-appb-000030
and
Figure PCTCN2022085493-appb-000031
One-to-one correspondence between them,
Figure PCTCN2022085493-appb-000032
and
Figure PCTCN2022085493-appb-000033
There is a one-to-one correspondence between them. Since the arrangement of the microphone array changes between the first build and the second build, the corresponding
Figure PCTCN2022085493-appb-000034
and
Figure PCTCN2022085493-appb-000035
not equal,
Figure PCTCN2022085493-appb-000036
and
Figure PCTCN2022085493-appb-000037
Also not the same;
以此类推,若后续麦克风阵列继续发生重构,则第g次构建的麦克风阵列采集到的声音信号中特征频率为
Figure PCTCN2022085493-appb-000038
而后分别进行滤波,可以分别得到对应的声音传播方向和基准方向之间的夹角
Figure PCTCN2022085493-appb-000039
By analogy, if subsequent microphone arrays continue to be reconstructed, the characteristic frequency of the sound signal collected by the microphone array constructed for the gth time is
Figure PCTCN2022085493-appb-000038
Then perform filtering separately to obtain the angle between the corresponding sound propagation direction and the reference direction.
Figure PCTCN2022085493-appb-000039
在第一次构建的麦克风阵列中,
Figure PCTCN2022085493-appb-000040
Figure PCTCN2022085493-appb-000041
方向上的信号增益比为
Figure PCTCN2022085493-appb-000042
在第二次构建的麦克风阵列中,
Figure PCTCN2022085493-appb-000043
Figure PCTCN2022085493-appb-000044
方向上的信号增益比为
Figure PCTCN2022085493-appb-000045
以此类推,在第g次 构建的麦克风阵列中,
Figure PCTCN2022085493-appb-000046
Figure PCTCN2022085493-appb-000047
方向上的信号增益比为
Figure PCTCN2022085493-appb-000048
In the first microphone array built,
Figure PCTCN2022085493-appb-000040
exist
Figure PCTCN2022085493-appb-000041
The signal gain ratio in the direction is
Figure PCTCN2022085493-appb-000042
In the second constructed microphone array,
Figure PCTCN2022085493-appb-000043
exist
Figure PCTCN2022085493-appb-000044
The signal gain ratio in the direction is
Figure PCTCN2022085493-appb-000045
By analogy, in the microphone array constructed for the gth time,
Figure PCTCN2022085493-appb-000046
exist
Figure PCTCN2022085493-appb-000047
The signal gain ratio in the direction is
Figure PCTCN2022085493-appb-000048
以信号增益比为优化目标生成麦克风阵列的重构方案的含义即为,需要保证
Figure PCTCN2022085493-appb-000049
其含义表示多径效应在
Figure PCTCN2022085493-appb-000050
方向上的影响相较在
Figure PCTCN2022085493-appb-000051
方向上更小,声音信号在
Figure PCTCN2022085493-appb-000052
方向上更为集中,
Figure PCTCN2022085493-appb-000053
方向也就更贴近实际声源的方向。关于重构方案的具体生成方式会在本实施例后续进行展开说明。
The meaning of generating the reconstruction scheme of the microphone array with the signal gain ratio as the optimization goal is to ensure that
Figure PCTCN2022085493-appb-000049
Its meaning indicates that the multipath effect is
Figure PCTCN2022085493-appb-000050
The influence in the direction is compared with the
Figure PCTCN2022085493-appb-000051
is smaller in the direction, the sound signal is in
Figure PCTCN2022085493-appb-000052
More concentrated in direction,
Figure PCTCN2022085493-appb-000053
The direction is closer to the direction of the actual sound source. The specific generation method of the reconstruction plan will be explained later in this embodiment.
选定序列数p,并且设定阈值δ,针对第g+1次构建的麦克风阵列,如果
Figure PCTCN2022085493-appb-000054
则表明因多径效应产生的特征频率测量误差已经足够小,依据本实施例第二段描述的特征频率错估和声源方向误判程度正相关这一结论,
Figure PCTCN2022085493-appb-000055
方向可以作为实际声音传播方向,
Figure PCTCN2022085493-appb-000056
数值可以作为声源和第g+1次构建的麦克风阵列的中心麦克风连线与基准方向之间夹角,由此获得声源方向位置信息(即本实施例中的声源定位)。
Select the sequence number p and set the threshold δ. For the microphone array constructed for the g+1th time, if
Figure PCTCN2022085493-appb-000054
It shows that the characteristic frequency measurement error caused by the multipath effect is small enough. According to the conclusion that the characteristic frequency misestimation described in the second paragraph of this embodiment is positively correlated with the degree of misjudgment of the sound source direction,
Figure PCTCN2022085493-appb-000055
The direction can be used as the actual sound propagation direction,
Figure PCTCN2022085493-appb-000056
The numerical value can be used as the angle between the connection line between the sound source and the center microphone of the microphone array constructed at the g+1th time and the reference direction, thereby obtaining the sound source direction position information (ie, the sound source positioning in this embodiment).
若ω p的变化量不小于δ,即
Figure PCTCN2022085493-appb-000057
则继续依次重复步骤S3、S1和S2,继续进行麦克风阵列的重构,直至ω p的变化量小于δ,即
Figure PCTCN2022085493-appb-000058
If the change of ω p is not less than δ, that is
Figure PCTCN2022085493-appb-000057
Then continue to repeat steps S3, S1 and S2 in sequence, and continue to reconstruct the microphone array until the change in ω p is less than δ, that is,
Figure PCTCN2022085493-appb-000058
针对第g次构建的麦克风阵列中,以中心麦克风为原点建立球坐标,球坐标中任意一点与中心麦克风连线在球坐标中仰角为
Figure PCTCN2022085493-appb-000059
且水平角为θ,该点与第j个外周麦克风之间间距为
Figure PCTCN2022085493-appb-000060
该点与第j个外周麦克风的连线在球坐标中仰角为
Figure PCTCN2022085493-appb-000061
且水平角为
Figure PCTCN2022085493-appb-000062
设定函数
Figure PCTCN2022085493-appb-000063
其中
Figure PCTCN2022085493-appb-000064
v为声速,i为虚数单位,
Figure PCTCN2022085493-appb-000065
表示空间中任意一点发送至中心麦克风以及各个外周麦克风的归一化信号强度之和,在此基础上,
Figure PCTCN2022085493-appb-000066
对应的声音传播方向(简称
Figure PCTCN2022085493-appb-000067
方向)在球坐标中仰角为
Figure PCTCN2022085493-appb-000068
且水平角为
Figure PCTCN2022085493-appb-000069
Figure PCTCN2022085493-appb-000070
方向上的信 号增益比
Figure PCTCN2022085493-appb-000071
类似的,在第g+1次构建的麦克风阵列中可以求解得到
Figure PCTCN2022085493-appb-000072
For the microphone array constructed for the gth time, the spherical coordinates are established with the center microphone as the origin. The elevation angle of any point in the spherical coordinates connected to the center microphone in the spherical coordinates is
Figure PCTCN2022085493-appb-000059
And the horizontal angle is θ, and the distance between this point and the j-th peripheral microphone is
Figure PCTCN2022085493-appb-000060
The elevation angle of the line connecting this point and the jth peripheral microphone in spherical coordinates is
Figure PCTCN2022085493-appb-000061
And the horizontal angle is
Figure PCTCN2022085493-appb-000062
set function
Figure PCTCN2022085493-appb-000063
in
Figure PCTCN2022085493-appb-000064
v is the speed of sound, i is the imaginary unit,
Figure PCTCN2022085493-appb-000065
Represents the sum of the normalized signal strengths sent to the center microphone and each peripheral microphone at any point in the space. On this basis,
Figure PCTCN2022085493-appb-000066
Corresponding sound propagation direction (abbreviation
Figure PCTCN2022085493-appb-000067
direction) in spherical coordinates the elevation angle is
Figure PCTCN2022085493-appb-000068
And the horizontal angle is
Figure PCTCN2022085493-appb-000069
but
Figure PCTCN2022085493-appb-000070
signal gain ratio in direction
Figure PCTCN2022085493-appb-000071
Similarly, in the microphone array constructed at the g+1th time, it can be solved
Figure PCTCN2022085493-appb-000072
由于信号增益比采用的是归一化后的
Figure PCTCN2022085493-appb-000073
进行计算,因此即使声源处声音强度大小在实时变化,
Figure PCTCN2022085493-appb-000074
Figure PCTCN2022085493-appb-000075
也具有可比较性。
Since the signal gain ratio is normalized
Figure PCTCN2022085493-appb-000073
Calculation is performed, so even if the sound intensity at the sound source changes in real time,
Figure PCTCN2022085493-appb-000074
and
Figure PCTCN2022085493-appb-000075
Also comparable.
为了保证
Figure PCTCN2022085493-appb-000076
麦克风阵列在第g次构建完成后,采用量子粒子群的多目标优化算法生成麦克风阵列的重构方案。
to ensure that
Figure PCTCN2022085493-appb-000076
After the microphone array is constructed for the gth time, the multi-objective optimization algorithm of quantum particle swarm is used to generate the reconstruction plan of the microphone array.
量子粒子群的多目标优化算法核心思想是将每一种麦克风阵列的排布方式记为一个粒子P,该粒子中第j个外周麦克风的坐标为(x j,y j),则P=P(x 1,y 1,x 2,y 2,......,x N-1,y N-1)。第一次麦克风阵列构建完成后,此时麦克风阵列对应的粒子为P 1,1,此时随机生成M-1个粒子,可以分别记为P 2,1,P 3,1,......,P M,1,通过粒子P 1,1检测可以获知
Figure PCTCN2022085493-appb-000077
Figure PCTCN2022085493-appb-000078
此时通过计算可以分别获得P 1,1,P 2,1,......,P M,1
Figure PCTCN2022085493-appb-000079
方向上的信号增益比T 1,1,T 2,1,......,T M,1。(此时
Figure PCTCN2022085493-appb-000080
即为T 1,1)。通过对P 1,1,P 2,1,......,P M,1进行计算,以生成P 1,2,P 2,2,......,P M,2,从P 1,2,P 2,2,......,P M,2中挑选一个粒子作为重构方案,从而在第二次构建麦克风阵列时使麦克风阵列的形式满足重构方案,而后第二次构建的麦克风阵列通过实际检测,可以获知
Figure PCTCN2022085493-appb-000081
Figure PCTCN2022085493-appb-000082
此时通过计算可以分别获得P 1,2,P 2,2,......,P M,2
Figure PCTCN2022085493-appb-000083
方向上的信号增益比T 1,2,T 2,2,......,T M,2,以此类推,第g次构建麦克风阵列结束后,已经获得了P 1,g,P 2,g,......,P M,g,此时通过检测可以获知
Figure PCTCN2022085493-appb-000084
Figure PCTCN2022085493-appb-000085
相应的,通过计算可以获得P 1,g,P 2,g,......,P M,g
Figure PCTCN2022085493-appb-000086
方向上的信号增益比T 1,g,T 2,g,......,T M,g,而后计算得到P 1,g+1,P 2,g+1,......,P M,g+1并从中挑选一个粒子作为重构方案,从而进行第g+1次麦克风阵列构建。
The core idea of the multi-objective optimization algorithm of quantum particle swarm is to record each microphone array arrangement as a particle P, and the coordinates of the j-th peripheral microphone in the particle are (x j , y j ), then P=P (x 1 , y 1 , x 2 , y 2 ,..., x N-1 , y N-1 ). After the first microphone array is constructed, the particles corresponding to the microphone array are P 1, 1. At this time, M-1 particles are randomly generated, which can be recorded as P 2, 1 , P 3, 1, .... .., P M,1 , which can be known through the detection of particle P 1,1
Figure PCTCN2022085493-appb-000077
and
Figure PCTCN2022085493-appb-000078
At this time, P 1,1 , P 2,1 ,..., P M,1 can be obtained respectively through calculation.
Figure PCTCN2022085493-appb-000079
The signal gain ratio in the direction is T 1,1 , T 2,1 ,..., TM,1 . (at this time
Figure PCTCN2022085493-appb-000080
That is T 1,1 ). By calculating P 1,1 , P 2,1 ,..., PM ,1 to generate P 1,2 , P 2,2 ,..., PM ,2 , Select a particle from P 1,2 , P 2,2 ,..., PM,2 as the reconstruction plan, so that the form of the microphone array meets the reconstruction plan when the microphone array is constructed for the second time, Then the microphone array constructed for the second time can be learned through actual inspection.
Figure PCTCN2022085493-appb-000081
and
Figure PCTCN2022085493-appb-000082
At this time, P 1,2 , P 2,2 ,..., P M,2 can be obtained respectively through calculation.
Figure PCTCN2022085493-appb-000083
The signal gain ratio in the direction is T 1, 2 , T 2, 2 ,..., T M,2 , and so on. After the g-th construction of the microphone array is completed, P 1, g , P has been obtained 2, g ,..., P M,g , at this time it can be known through detection
Figure PCTCN2022085493-appb-000084
and
Figure PCTCN2022085493-appb-000085
Correspondingly, through calculation we can obtain P 1,g ,P 2,g ,...,P M,g in
Figure PCTCN2022085493-appb-000086
The signal gain ratio in the direction is T 1,g , T 2,g ,..., T M,g , and then calculated to obtain P 1,g+1 , P 2,g+1 ,... ., P M,g+1 and select a particle from it as the reconstruction plan to construct the g+1th microphone array.
通过上述描述可以发现,粒子P m,1(1≤m≤M)经过计算不断演化迭代,依次生成粒子P m,2、P m,3、......,那么在第g次构建麦克风阵列结束后,且在第g+1次构建麦克风阵列开始前,max(T m,1,T m,2,......,T m,g)对应的粒子P即为历史最优坐标P m,best,此时max(T 1,1,T 1,2,......,T 1,g,T 2,1,T 2,2,......,T 2,g,......,T M,1,T M,2,......,T M,g)对应的粒子P为全局历史最优坐标G best。可以发现,随着g数值的增加,P m,best的数值可能变化,也可能不变,但是P m,best的总数始终为M。 From the above description, it can be found that the particle P m,1 (1≤m≤M) continuously evolves and iterates through calculation, and generates particles P m,2 , P m,3 ,..., then in the gth construction After the microphone array ends and before the g+1th microphone array construction begins, the particle P corresponding to max(T m, 1 , T m, 2 ,..., T m, g ) is the historical maximum. Optimal coordinate P m, best , at this time max (T 1, 1 , T 1, 2 ,..., T 1,g , T 2, 1, T 2 , 2 ,..., The particle P corresponding to T 2,g ,...,TM ,1 , TM,2 ,..., TM,g ) is the global historical optimal coordinate G best . It can be found that as the value of g increases, the value of P m, best may change or remain unchanged, but the total number of P m, best is always M.
其中,
Figure PCTCN2022085493-appb-000087
其中κ 1和κ 2为大于0且小于1的随机数,α为膨胀系数,利用历史最优坐标P m,best和全局历史最优坐标G best进行粒子的迭代更新,切实保证随着麦克风阵列重构次数的增加,整个粒子朝向信号增益比逐渐增大的方向进化。在此基础上,找到T 1,g,T 2,g,......,T M,g中的最大值T f,g,M≥f≥1,其对应的粒子为P f,g,将P f,g+1作为第g+1次麦克风阵列的构建方案,就能最大限度降低多径效应对重建后麦克风阵列的影响。
in,
Figure PCTCN2022085493-appb-000087
Among them, κ 1 and κ 2 are random numbers greater than 0 and less than 1, and α is the expansion coefficient. The historical optimal coordinates P m, best and the global historical optimal coordinates G best are used to iteratively update the particles to ensure that as the microphone array As the number of reconstructions increases, the entire particle evolves in the direction of gradually increasing signal gain ratio. On this basis, find the maximum value T f, g in T 1, g , T 2, g ,..., T M , g , M≥f≥1, and its corresponding particle is P f, g , using P f, g+1 as the construction plan for the g+1th microphone array can minimize the impact of the multipath effect on the reconstructed microphone array.
通常而言,α为一个随机数,但是在本实施例中α为一个函数。具体的,T 1,g,T 2,g,......,T M,g中的最大值T f,g,则P f,g为最优粒子,P m,g为当前粒子,由前述可知,粒子可以用其内部麦克风的坐标进行表示,故而P f,g和P m,g的间距
Figure PCTCN2022085493-appb-000088
若γ<1,则膨胀系数α=0.5+γ,若γ=1,则α=1.8。通过控制α的数值,避免通过P 1,g,P 2,g,......,P M,g生成P 1,g+1,P 2,g+1,......,P M,g+1时因α过大导致随机性过大,继而生成第g+1次麦克风阵列构建方案并对麦克风阵列重构后,麦克风阵列相较第g次构建能够获得更大的信号增益比,同时也避免α过小,导致麦克风阵列重构次数过多,从而能 够尽快获得声源方向。在γ<1的情况下,依据粒子间距γ对α(本质上为迭代步长)进行调节从而获取重构方案,能够使麦克风阵列重构前后信号增益比的增幅获得最大程度的提升,并且保证麦克风阵列重构后信号增益比不会产生下降,即粒子发散。
Generally speaking, α is a random number, but in this embodiment α is a function. Specifically, the maximum value T f,g in T 1,g ,T 2,g ,...,T M, g , then P f,g is the optimal particle, and P m,g is the current particle. , it can be seen from the above that the particle can be represented by the coordinates of its internal microphone, so the distance between P f, g and P m, g
Figure PCTCN2022085493-appb-000088
If γ<1, then the expansion coefficient α=0.5+γ, if γ=1, then α=1.8. By controlling the value of α, we avoid generating P 1,g +1,P 2,g +1,... through P 1,g,P 2,g , ...,PM,g . , when P M,g+1 is too large, the randomness is too large, and then the g+1th microphone array construction plan is generated and the microphone array is reconstructed. Compared with the gth construction, the microphone array can obtain greater The signal gain ratio also avoids α being too small, which results in too many microphone array reconstructions, so that the sound source direction can be obtained as quickly as possible. In the case of γ < 1, adjusting α (essentially the iteration step size) according to the particle distance γ to obtain the reconstruction plan can maximize the increase in the signal gain ratio before and after the microphone array is reconstructed, and ensure After the microphone array is reconstructed, the signal gain ratio will not decrease, that is, the particles will diverge.
实施例2:Example 2:
为了保证P 1,1,P 2,1,......,P M,1迭代更新时相互之间不会产生过多影响,需要对粒子中外周麦克风的位置变化量进行限制。第j个外周麦克风在麦克风阵列重构前后的位置移动距离为Δd j,Δd j<d minIn order to ensure that P 1,1 , P 2,1 ,..., P M,1 do not have too much influence on each other when iteratively updating, it is necessary to limit the position change of the peripheral microphone in the particle. The position movement distance of the jth peripheral microphone before and after the microphone array reconstruction is Δd j , Δd j <d min .
实施例3:Example 3:
P 1,g,P 2,g,......,P M,g称为第g代粒子,g相同的粒子称为同代粒子,同代粒子的特征频率相同。从同代粒子中选取具体哪一个粒子作为麦克风阵列的重构方案是通过信号增益比进行判断的。在实施例1中,同一代粒子中,将在单一特征频率声音对应的传播方向上信号增益比最大的粒子作为重构方案。与之相对,本实施例中针对同一代粒子,会对单个粒子在每个特征频率声音对应传播方向上的信号增益比进行求解然后求解平均值,进而获得单个粒子的信号增益比平均值,而后将信号增益比平均值最大的粒子作为重构方案。 P 1,g ,P 2,g ,...,P M,g are called particles of the gth generation. Particles with the same g are called particles of the same generation, and the characteristic frequencies of particles of the same generation are the same. Which specific particle to select as the reconstruction solution for the microphone array from particles of the same generation is judged by the signal gain ratio. In Embodiment 1, among particles of the same generation, the particle with the largest signal gain ratio in the propagation direction corresponding to a single characteristic frequency sound is used as the reconstruction plan. In contrast, in this embodiment, for particles of the same generation, the signal gain ratio of a single particle in the corresponding propagation direction of each characteristic frequency sound is solved and then the average value is obtained, and then the average signal gain ratio of a single particle is obtained, and then The particle with the largest signal gain ratio to the average value is used as the reconstruction plan.
值得注意的是,若采用信号增益比平均值作为重构方案的选择标准,则q的数值不宜过大,否则会导致粒子发散,即无法保证粒子迭代更新后信号增益比能够增加,因此q一般而言为2或3或4或5。It is worth noting that if the average signal gain ratio is used as the selection criterion for the reconstruction scheme, the value of q should not be too large, otherwise it will cause the particles to diverge, that is, there is no guarantee that the signal gain ratio can increase after the particles are iteratively updated, so q is generally In terms of 2 or 3 or 4 or 5.
特别的,本实施例信号增益比的平均值为加权平均值,特征频率的数值越小,则其声音信号在对应的传播方向上信号增益比在计算加权平均值时获得的权重越小,其原因在于多径影响对声音信号中的低频部分影响相对较小,而高 频部分影响明显,因此对较高特征频率对应的信号增益比增加权重,继而实现粒子迭代,能够使信号增益比的平均值获得更为快速的提升。In particular, the average value of the signal gain ratio in this embodiment is a weighted average. The smaller the value of the characteristic frequency, the smaller the weight of the signal gain ratio of the sound signal in the corresponding propagation direction when calculating the weighted average. The reason is that the multipath effect has a relatively small impact on the low-frequency part of the sound signal, while the high-frequency part has a significant impact. Therefore, adding weight to the signal gain ratio corresponding to the higher characteristic frequency, and then realizing particle iteration, can make the average signal gain ratio Value increases more quickly.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,熟悉该本领域的技术人员应该明白本发明包括但不限于附图和上面具体实施方式中描述的内容。任何不偏离本发明的功能和结构原理的修改都将包括在权利要求书的范围中。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Those skilled in the art should understand that the present invention includes, but is not limited to, the drawings and those described in the above specific embodiments. content. Any modifications that do not depart from the functional and structural principles of the invention are intended to be included in the scope of the claims.

Claims (7)

  1. 一种强多径干扰条件下的声源定位方法,其特征在于,包括如下步骤:A sound source localization method under strong multipath interference conditions, which is characterized by including the following steps:
    步骤S1:构建麦克风阵列;Step S1: Build a microphone array;
    步骤S2:麦克风阵列采集声源信号,从而确定声源信号的特征频率ω p,p=1,2,......,q,对声源信号进行滤波,以保留ω p邻域范围内的声音信号,继而得到ω p对应的声音传播方向; Step S2: The microphone array collects the sound source signal to determine the characteristic frequency ω p of the sound source signal, p = 1, 2,..., q, and filter the sound source signal to retain the ω p neighborhood range. The sound signal within ω p is then obtained, and the sound propagation direction corresponding to ω p is obtained;
    步骤S3:计算麦克风阵列在ω p对应的声音传播方向上的信号增益比; Step S3: Calculate the signal gain ratio of the microphone array in the sound propagation direction corresponding to ω p ;
    步骤S4:通过信号增益比选择麦克风阵列的重构方案,依据重构方案重复步骤S1和S2,以使ω p发生改变,设定阈值δ,若ω p的变化量小于δ,则依据变化后的ω p对应的声音传播方向对声源位置进行判断,若ω p的变化量不小于δ,则继续依次重复步骤S3、S1和S2,直至ω p的变化量小于δ。 Step S4: Select the reconstruction scheme of the microphone array through the signal gain ratio, repeat steps S1 and S2 according to the reconstruction scheme to change ω p , and set the threshold δ. If the change of ω p is less than δ, then according to the changed The sound propagation direction corresponding to ω p is used to determine the sound source position. If the change in ω p is not less than δ, continue to repeat steps S3, S1 and S2 in sequence until the change in ω p is less than δ.
  2. 根据权利要求1所述的强多径干扰条件下的声源定位方法,其特征在于,q为2或3或4或5,步骤S1和S2重复前后q的取值不变。The sound source localization method under strong multipath interference conditions according to claim 1, characterized in that q is 2 or 3 or 4 or 5, and the value of q remains unchanged before and after steps S1 and S2 are repeated.
  3. 根据权利要求1所述的强多径干扰条件下的声源定位方法,其特征在于,麦克风阵列包括N个麦克风,其中一个麦克风为中心麦克风,其他麦克风为外周麦克风并沿中心麦克风的周向依次排布,以中心麦克风为原点建立球坐标,球坐标中任意一点与中心麦克风连线在球坐标中仰角为
    Figure PCTCN2022085493-appb-100001
    且水平角为θ,该点与第j个外周麦克风之间间距为r j,该点与第j个外周麦克风的连线在球坐标中仰角为
    Figure PCTCN2022085493-appb-100002
    且水平角为θ j,设定函数
    Figure PCTCN2022085493-appb-100003
    其中
    Figure PCTCN2022085493-appb-100004
    v为声速,i为虚数单位,ω p对应的声音传播方向在球坐标中仰角为
    Figure PCTCN2022085493-appb-100005
    且水平角为θ p,则在ω p对应的声音传播方向上的信号增益比
    Figure PCTCN2022085493-appb-100006
    The sound source localization method under strong multipath interference conditions according to claim 1, characterized in that the microphone array includes N microphones, one of which is a central microphone, and the other microphones are peripheral microphones in sequence along the circumferential direction of the central microphone. Arrange, use the center microphone as the origin to establish spherical coordinates. The elevation angle of any point in the spherical coordinates connected to the center microphone in the spherical coordinates is
    Figure PCTCN2022085493-appb-100001
    And the horizontal angle is θ, the distance between this point and the j-th peripheral microphone is r j , and the elevation angle of the line connecting this point and the j-th peripheral microphone in spherical coordinates is
    Figure PCTCN2022085493-appb-100002
    And the horizontal angle is θ j , set the function
    Figure PCTCN2022085493-appb-100003
    in
    Figure PCTCN2022085493-appb-100004
    v is the speed of sound, i is the imaginary unit, and the elevation angle of the sound propagation direction corresponding to ω p in spherical coordinates is
    Figure PCTCN2022085493-appb-100005
    And the horizontal angle is θ p , then the signal gain ratio in the sound propagation direction corresponding to ω p
    Figure PCTCN2022085493-appb-100006
  4. 根据权利要求3所述的强多径干扰条件下的声源定位方法,其特征在于,首次构建麦克风阵列时外周麦克风沿中心麦克风周向均匀分布,此时相邻两个外 周麦克风的间距为d min,第j个外周麦克风在麦克风阵列重构前后的位置移动距离为Δd j,Δd j<d minThe sound source localization method under strong multipath interference conditions according to claim 3, characterized in that when the microphone array is first constructed, the peripheral microphones are evenly distributed along the circumferential direction of the central microphone. At this time, the distance between two adjacent peripheral microphones is d. min , the position movement distance of the j-th peripheral microphone before and after the microphone array reconstruction is Δd j , Δd j <d min .
  5. 根据权利要求3所述的强多径干扰条件下的声源定位方法,其特征在于,采用量子粒子群的多目标优化算法生成麦克风阵列的重构方案,以确保麦克风阵列每次重构后ω p的变化量减少。 The sound source localization method under strong multipath interference conditions according to claim 3, characterized in that the multi-objective optimization algorithm of quantum particle swarm is used to generate the reconstruction plan of the microphone array to ensure that the microphone array is reconstructed after each reconstruction. The change in p decreases.
  6. 根据权利要求5所述的强多径干扰条件下的声源定位方法,其特征在于,步骤S3中对多个特征频率的声音在各自传播方向上计算信号增益比并计算加权平均值,步骤S4中通过信号增益比的加权平均值选择重构方案。The sound source localization method under strong multipath interference conditions according to claim 5, characterized in that in step S3, the signal gain ratio is calculated for the sounds of multiple characteristic frequencies in their respective propagation directions and the weighted average is calculated, and the step S4 The reconstruction scheme is selected by a weighted average of signal gain ratios.
  7. 根据权利要求6所述的强多径干扰条件下的声源定位方法,其特征在于,将每一种麦克风阵列的排列方式作为一种粒子,当前粒子和最优粒子间距为γ,若γ<1,则膨胀系数α=0.5+γ,若γ=1,则α=1.8。The sound source localization method under strong multipath interference conditions according to claim 6, characterized in that each microphone array arrangement is regarded as a particle, and the distance between the current particle and the optimal particle is γ, if γ< 1, then the expansion coefficient α=0.5+γ, if γ=1, then α=1.8.
PCT/CN2022/085493 2022-03-16 2022-04-07 Sound source positioning method under strong multipath interference condition WO2023173514A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210266205.9 2022-03-16
CN202210266205.9A CN114624652B (en) 2022-03-16 2022-03-16 Sound source positioning method under strong multipath interference condition

Publications (1)

Publication Number Publication Date
WO2023173514A1 true WO2023173514A1 (en) 2023-09-21

Family

ID=81901477

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/085493 WO2023173514A1 (en) 2022-03-16 2022-04-07 Sound source positioning method under strong multipath interference condition

Country Status (2)

Country Link
CN (1) CN114624652B (en)
WO (1) WO2023173514A1 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140192999A1 (en) * 2013-01-08 2014-07-10 Stmicroelectronics S.R.L. Method and apparatus for localization of an acoustic source and acoustic beamforming
CN105068048A (en) * 2015-08-14 2015-11-18 南京信息工程大学 Distributed microphone array sound source positioning method based on space sparsity
US20190028803A1 (en) * 2014-12-05 2019-01-24 Stages Llc Active noise control and customized audio system
CN109709520A (en) * 2018-12-29 2019-05-03 桂林电子科技大学 Indoor positioning based on microphone array and build drawing method
CN110111807A (en) * 2019-04-27 2019-08-09 南京理工大学 A kind of indoor sound source based on microphone array follows and Enhancement Method
CN113238189A (en) * 2021-05-24 2021-08-10 清华大学 Sound source identification method and system based on array measurement and sparse prior information
CN113721194A (en) * 2021-07-30 2021-11-30 南京师范大学 MWCS-based near-field speech signal three-dimensional positioning algorithm

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69939272D1 (en) * 1998-11-16 2008-09-18 Univ Illinois BINAURAL SIGNAL PROCESSING TECHNIQUES
US8767975B2 (en) * 2007-06-21 2014-07-01 Bose Corporation Sound discrimination method and apparatus
CN104036787B (en) * 2014-03-05 2017-12-01 佛山科学技术学院 A kind of speech monitoring method and equipment, and laser reflection sound localization method
CN105137394B (en) * 2015-08-24 2017-09-12 江苏中海达海洋信息技术有限公司 Ultra-short baseline acoustic positioning system and its location algorithm based on Maximum-likelihood estimation
JP6982966B2 (en) * 2017-03-14 2021-12-17 大成建設株式会社 Sound source exploration device
EP3682651B1 (en) * 2017-09-12 2023-11-08 Whisper.ai, LLC Low latency audio enhancement
CN108983149B (en) * 2018-07-30 2022-02-22 中国空气动力研究与发展中心低速空气动力研究所 Rotary microphone sound source positioning method
CN110196438A (en) * 2019-06-17 2019-09-03 广东顺德雷舜信息科技有限公司 Localization method, positioning device and terminal device
US11237241B2 (en) * 2019-10-10 2022-02-01 Uatc, Llc Microphone array for sound source detection and location
CN110907893B (en) * 2019-12-05 2023-06-02 扬州大学 Super-resolution sound source positioning method suitable for ball microphone array
CN111429939B (en) * 2020-02-20 2023-06-09 西安声联科技有限公司 Sound signal separation method of double sound sources and pickup
CN112782685B (en) * 2020-05-22 2022-10-18 上海交通大学 Multi-sound-source positioning and sound reconstruction method and system based on MIMO radar
CN112179482B (en) * 2020-09-27 2022-08-16 中车长春轨道客车股份有限公司 Method and system for identifying noise source in railway vehicle
CN112702669B (en) * 2020-12-21 2023-04-07 西安讯飞超脑信息科技有限公司 Sound pickup apparatus, method, apparatus, system, and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140192999A1 (en) * 2013-01-08 2014-07-10 Stmicroelectronics S.R.L. Method and apparatus for localization of an acoustic source and acoustic beamforming
US20190028803A1 (en) * 2014-12-05 2019-01-24 Stages Llc Active noise control and customized audio system
CN105068048A (en) * 2015-08-14 2015-11-18 南京信息工程大学 Distributed microphone array sound source positioning method based on space sparsity
CN109709520A (en) * 2018-12-29 2019-05-03 桂林电子科技大学 Indoor positioning based on microphone array and build drawing method
CN110111807A (en) * 2019-04-27 2019-08-09 南京理工大学 A kind of indoor sound source based on microphone array follows and Enhancement Method
CN113238189A (en) * 2021-05-24 2021-08-10 清华大学 Sound source identification method and system based on array measurement and sparse prior information
CN113721194A (en) * 2021-07-30 2021-11-30 南京师范大学 MWCS-based near-field speech signal three-dimensional positioning algorithm

Also Published As

Publication number Publication date
CN114624652A (en) 2022-06-14
CN114624652B (en) 2022-09-30

Similar Documents

Publication Publication Date Title
WO2020192721A1 (en) Voice awakening method and apparatus, and device and medium
CN106537501B (en) Reverberation estimator
US9584939B2 (en) Arrangement and method for measuring the direct sound radiated by acoustical sources
CN113884986B (en) Beam focusing enhanced strong impact signal space-time domain joint detection method and system
CN107202976B (en) Low-complexity distributed microphone array sound source positioning system
WO2022247202A1 (en) Sound source identification method and system based on array measurement and sparse prior information
CN113573322B (en) Multi-target area sensor network coverage optimization method based on improved genetic algorithm
WO2023173514A1 (en) Sound source positioning method under strong multipath interference condition
WO2023165631A1 (en) Channel parameter estimation method for intelligent-surface-assisted wireless communication
CN109146922B (en) Forward-looking sonar underwater target tracking method based on adaptive particle swarm optimization
CN110441319A (en) A kind of detection method and device of open defect
Noel et al. Improved maximum likelihood estimation of target position in wireless sensor networks using particle swarm optimization
TWI429885B (en) Method for visualizing sound source energy distribution in reverberant environment
CN114048681A (en) DOA estimation method, system, storage medium and device based on self-selection neural network
WO2023217082A1 (en) Sound source positioning method and system, medium, device, and apparatus
CN111624251B (en) Intelligent lamb wave defect positioning method applied to large plate-shell structure
SongGong et al. Indoor multi-speaker localization based on Bayesian nonparametrics in the circular harmonic domain
CN116309921A (en) Delay summation acoustic imaging parallel acceleration method based on CUDA technology
CN115201821B (en) Small target detection method based on strong target imaging cancellation
Ribeiro et al. Region-to-region kernel interpolation of acoustic transfer function with directional weighting
Laoudias et al. ftTRACK: fault-tolerant target tracking in binary sensor networks
Felsheim et al. Head orientation estimation from multiple microphone arrays
WO2023279249A1 (en) Non-line-of-sight imaging via neural transient field
Lin et al. Anti spatial aliasing HOA encoding method based on aliasing projection matrix
Gao et al. Multizone sound reproduction with adaptive control of scattering effects

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22931542

Country of ref document: EP

Kind code of ref document: A1