WO2020199351A1 - Sound source locating method, device and storage medium - Google Patents

Sound source locating method, device and storage medium Download PDF

Info

Publication number
WO2020199351A1
WO2020199351A1 PCT/CN2019/091280 CN2019091280W WO2020199351A1 WO 2020199351 A1 WO2020199351 A1 WO 2020199351A1 CN 2019091280 W CN2019091280 W CN 2019091280W WO 2020199351 A1 WO2020199351 A1 WO 2020199351A1
Authority
WO
WIPO (PCT)
Prior art keywords
position information
microphone
sound source
array
distributed microphone
Prior art date
Application number
PCT/CN2019/091280
Other languages
French (fr)
Chinese (zh)
Inventor
关海欣
丁少为
Original Assignee
北京云知声信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京云知声信息技术有限公司 filed Critical 北京云知声信息技术有限公司
Publication of WO2020199351A1 publication Critical patent/WO2020199351A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G01S5/22Position of source determined by co-ordinating a plurality of position lines defined by path-difference measurements

Definitions

  • the present invention relates to the technical field of target positioning, in particular to a sound source positioning method, device and storage medium.
  • Sound source localization refers to collecting the sound signal emitted by a target object and performing a specific algorithm operation on the sound signal to determine the position of the target object relative to the sound collection device.
  • the sound collection device usually collects the sound signal of the target through a microphone array, and the microphone array includes an array composed of a number of single microphones distributed in a specific arrangement.
  • the microphone array can also be used to further obtain the sound signal.
  • the signal difference between different microphones in the microphone array, and the signal difference can further improve the accuracy of the position calculation of the target object, which is an effect that cannot be achieved by using a single microphone to collect sound signals.
  • current sound source localization algorithms based on sound signals collected by microphone arrays mainly include localization algorithms based on time delay estimation, namely TDE, localization algorithms based on high-resolution spectrum estimation, and localization algorithms based on sparse representation; among them, based on
  • the calculation core of the positioning algorithm of time delay estimation lies in the accurate estimation of the sound wave propagation time delay, which is generally obtained by performing cross-correlation processing on the sound signals collected between different microphones in the microphone array; in order to further obtain the sound source
  • the position information can also be calculated by simple delay summation, geometric calculation, or controllable power response search for the results obtained by the cross-correlation processing.
  • the above-mentioned algorithm is easier to implement, and its computational complexity is small and it is convenient for real-time processing. Therefore, it is widely used in actual calculations.
  • the prior art already has a 360-degree omnidirectional positioning method based on the combination of a microphone array and a corresponding algorithm, when the target has a flat-frame shape with a narrow frame, the existing sound source positioning method is difficult to obtain relatively Accurate positioning results, because in this case, it is difficult for the microphone array to distinguish the front and rear directions of the target, which brings great difficulty to the accurate calculation of the target with a narrow frame plate shape through the method of sound source positioning.
  • the existing microphone arrays In the sound source localization method based on the microphone array, the existing microphone arrays, such as linear microphone arrays or differential microphone arrays, cannot accurately distinguish the front and back of the target when the target has a narrow frame flat shape. Most of the existing smart voice interaction devices, such as mobile phones, tablets, or flat-panel TVs, have this narrow-border tablet shape, which will severely limit the application of sound source localization methods to this type of smart voice interaction devices.
  • the traditional sound source localization method is used in the similar intelligent voice interactive equipment to locate, and the positioning results obtained will not meet the basic accuracy requirements.
  • the existing arrangement of linear microphone arrays or differential microphone arrays cannot efficiently obtain the sound-related signals of a target with a narrow-frame flat plate shape. Such sound-related signals, such as differences in sound intensity or sound along different azimuths The changing trend, etc., have important computational significance for the subsequent positioning algorithm processing. It can be seen that the existing microphone array arrangement cannot effectively improve the positioning accuracy of the sound source positioning method.
  • the present invention provides a sound source localization method, device and storage medium.
  • the method is to arrange a microphone array with a specific shape distribution, wherein the microphone array includes first linear distributions arranged perpendicular to each other.
  • the microphone set and the second linear distributed microphone set and then based on the first linear distributed microphone set and the second linear distributed microphone set, the first position information and the second position information of the target to be located are obtained respectively, and finally according to the The first position information and the second position information calculate 360° full-plane positioning information about the target to be positioned.
  • the first set of linearly distributed microphones and the second set of linearly distributed microphones divide the full-planar space in which the target to be positioned is located into two half-planar spaces, and then obtain the two half-planar spaces in sequence.
  • the spatial position information of the target to be located, and the full-planar position information of the target to be located is comprehensively calculated according to the corresponding spatial position information in the two half-planar spaces, which can overcome the inability of traditional sound source localization methods to distinguish narrow borders
  • the defects in the front and rear directions of the flat-shaped object thereby effectively improving the sound source localization accuracy of the target with a narrow-frame flat-shaped object.
  • a sound source localization method includes the following steps:
  • Step (1) arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
  • Step (2) Calculate first position information about the target based on the voice signal collected by the first linear distributed microphone set;
  • Step (3) based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the second position information about the target object according to the first position information and the second position information. State the 360° full plane positioning information of the target;
  • step (1) arranging the microphone array in a T-shaped distribution specifically includes: arranging a number of microphones at a predetermined interval along a first direction to form the first line-shaped distributed microphone set, and At least one microphone is arranged in a second direction perpendicular to the first direction to form the second linear distributed microphone set.
  • step (1) all microphones in the second linear distributed microphone set are located on the same side of the first linear distributed microphone set, and the second direction passes through the first linear distributed microphone set.
  • One of the microphones in a line-type distributed microphone set is located on the same side of the first linear distributed microphone set, and the second direction passes through the first linear distributed microphone set.
  • step (2) calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating it based on a time delay estimation algorithm The first position information, wherein the first position information is 0°-180° half-plane position information of the target based on a corresponding setting direction of the second linear distributed microphone set.
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes: performing voice signals corresponding to each microphone in the first linearly distributed microphone set. Cross-correlation processing, and performing controllable power response search processing on the mutual results obtained by the cross-correlation processing, so as to obtain the first position information by calculation.
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes: performing voice signals corresponding to each microphone in the first linearly distributed microphone set.
  • Generalized cross-correlation function processing wherein the generalized cross-correlation function also introduces a weighting function on the cross-power spectral density between different microphones, and then calculates the first step according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function One location information.
  • step (3) calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and the one in the second linear distributed microphone set A microphone forms a small-pitch microphone differential array, and then the second position information is calculated based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm.
  • step (3) calculating the second position information based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed The beamformer simultaneously obtains the first-order differential beam pattern corresponding to the fixed beamformer, and calculates the second position information based on the first-order differential beam pattern.
  • step (3) calculating the second position information based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm specifically includes: taking the small-pitch microphone differential array as a fixed beam forming And design the first beam weight and the second beam weight that are different for the fixed beamformer at the same time, and calculate the first output signal energy sum corresponding to the first beam weight and the second beam weight The second output signal energy is calculated according to the larger one of the first output signal energy and the second output signal energy.
  • calculating the energy of the first output signal specifically includes, for the first beam weight, taking the expected direction of the first beam weight as the front and taking the zero of the first beam weight.
  • the trap direction is backward, and the input signals of the small-pitch microphone differential array are weighted and summed to obtain the first output signal energy; or, calculating the second output signal energy specifically includes: Beam weights, taking the expected direction of the second beam weights as the rear and the null direction of the second beam weights as the front, performing weighted summation processing on the input signals of the small-pitch microphone differential array, Thus, the energy of the second output signal is obtained.
  • a sound source localization device including:
  • An array arrangement module for arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
  • the first collection module is configured to calculate the first position information about the target based on the voice signal collected by the first linear distributed microphone set;
  • the second collection module is configured to calculate second position information about the target based on the voice signal collected by the second linear distributed microphone set, and determine according to the first position information and the second position information 360° full plane positioning information about the target.
  • a sound source localization device including:
  • a memory for storing processor executable instructions
  • the processor is configured to execute the steps of the method described in any embodiment of the first aspect.
  • a computer-readable storage medium having computer instructions stored thereon, characterized in that, when the instructions are executed by a processor, the steps of the method described in any of the embodiments of the first aspect are implemented .
  • the sound source localization method of the present invention divides the full-planar space where the target object to be located is located into two halves through the first linear distributed microphone set and the second linear distributed microphone set arranged perpendicular to each other. Plane space, and then sequentially obtain the spatial position information of the target to be positioned in the two half-plane spaces, and comprehensively calculate the full-planar position of the target to be positioned according to the corresponding spatial position information in the two half-plane spaces
  • the traditional sound source localization method can overcome the defect that the traditional sound source localization method cannot distinguish the front and rear directions of the flat panel shape with a narrow frame, thereby effectively improving the sound source localization accuracy of the target object with the flat frame shape of the narrow frame.
  • Fig. 1 is a schematic flowchart of a sound source localization method provided by the present invention
  • FIG. 2 is a schematic diagram of the distribution of microphone arrays in a sound source localization method provided by the present invention
  • Figure 3 is a block diagram of a sound source localization device in an embodiment of the present invention.
  • Fig. 4 is a block diagram of a sound source localization device in an embodiment of the present invention.
  • the sound source localization method is to arrange a microphone array with a specific shape distribution near the target object to be located.
  • the microphone array can divide the accessory space area of the target object to be located into a front area, a rear area, and a left area.
  • the side area and the right area are four different space area parts.
  • the division of the above four different space area parts can be sequentially in different half-plane spaces. This can effectively overcome the inability to distinguish the front-to-rear direction of the target object with a narrow frame and flat plate shape in the existing sound source localization method.
  • the basic idea of the location detection of the sound source location method of the present invention is to collect different voice signals through two different linearly distributed microphone sets in the microphone array, and calculate the corresponding different voice signals based on the different voice signals.
  • Position information, and the different position information is different position data about the target, so that the full-plane positioning information about the target can be determined according to the different position information.
  • several microphones in the microphone array can mutually form a T-shaped array distribution form.
  • the sound source localization method may include the following steps:
  • Step (1) arranging a T-shaped distributed microphone array composed of several microphones.
  • the T-shaped distributed microphone array includes a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other.
  • arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along a first direction to form the first linearly distributed microphone set, which is aligned with the first direction. At least one microphone is arranged in the second vertical direction to form the second linear distributed microphone set.
  • the first linear distributed microphone set may include at least three microphones
  • the second linear distributed microphone set may include at least two microphones; further, the first linear distributed microphone set and the second linear distributed microphone set are mutually There may be one or several microphones in common.
  • all the microphones in the second linear distributed microphone set may be located on the same side of the first linear distributed microphone set, and the second direction passes through the first linear distributed microphone One of the microphones in the collection.
  • all the microphones in the first linear distributed microphone set can be evenly distributed on both sides of the second linear distributed microphone set with the axis of the second linear distributed microphone set as the axis of symmetry.
  • the distance between all adjacent two microphones in the first linear distributed microphone set or the second linear distributed microphone set may be equal, so as to ensure the first linear distributed microphone diversity set or the second linear distributed microphone set Different microphones in the microphone set have the same receiving performance in the sound field distribution.
  • Step (2) based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target.
  • calculating the first position information may specifically include: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on a time delay estimation algorithm. Position information; where the first position information is based on the 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set. Further, in the process of calculating the first position, each microphone in the first linearly distributed microphone set is controlled to receive the sound signal generated by the target in real time, and then the sound signal received by each microphone is estimated to pass the time delay. The algorithm calculates the first position information.
  • calculating the first position information based on the time delay estimation algorithm may specifically include performing cross-correlation processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, and Performing controllable power response search processing on the mutual results obtained by the cross-correlation processing, thereby calculating the first position information.
  • calculating the first position information based on the time delay estimation algorithm may specifically include: performing a generalized cross-correlation function on the voice signal corresponding to each microphone in the first linearly distributed microphone set Processing, wherein the generalized cross-correlation function also introduces a weighting function on the cross power spectral density between different microphones, and then calculates the first position information according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function.
  • the time delay estimation algorithm can be a time delay estimation algorithm based on a generalized cross-correlation function, and the time delay estimation algorithm also introduces a weighting function during the implementation process, which adjusts the cross power spectral density of the sound signal , So as to optimize the calculation accuracy of the time delay estimation algorithm.
  • the generalized cross-correlation function corresponding to the time delay estimation algorithm can have a variety of different variants, and the time delay estimation algorithm based on the generalized cross-correlation function can be specifically generalized cross-correlation- Phase transformation method (GCC-PHAT), the generalized cross-correlation-phase transformation method itself has certain anti-noise and anti-reverberation capabilities, so the use of this algorithm can reduce to a certain extent each of the first linear distribution microphone sets The noise and/or reverberation existing in the voice signal of the microphone causes interference to the first position information.
  • GCC-PHAT generalized cross-correlation- Phase transformation method
  • the time delay estimation algorithm based on the generalized cross-correlation function estimates the time delay value based on the peak value of the cross-correlation function between the sound signals collected by two microphones. This is because in the same sound source positioning system, The sound signal received by each microphone in the microphone array set comes from the same sound source, which makes the channel signals corresponding to different microphones have a strong correlation, so that by calculating the difference between the channel signals corresponding to each two microphones The correlation function can determine the time delay value between the corresponding channel signals of the two corresponding microphones, and finally the position information corresponding to the sound source can be calculated with the time delay value.
  • Step (3) based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target object, and determine the 360° of the target object according to the first position information and the second position information Full plane positioning information.
  • calculating the second position information may specifically include selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a The small-pitch microphone differential array is then based on the small-pitch microphone differential array and combined with the corresponding differential array algorithm to calculate the second position information.
  • calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm may specifically include: using the small-pitch microphone differential array as a fixed beamformer, At the same time, the first-order differential beam pattern corresponding to the fixed beamformer is acquired, and the second position information is calculated based on the first-order differential beam pattern.
  • calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm may specifically include using the small-pitch microphone differential array as a fixed beamformer, and designing To obtain different first beam weights and second beam weights for the fixed beamformer, and calculate the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight, The second position information is calculated according to the larger one of the first output signal energy and the second output signal energy.
  • calculating the energy of the first output signal may specifically include, for the first beam weight, taking the expected direction of the first beam weight as the front and taking the first beam weight as the forward direction.
  • FIG. 2 is a schematic diagram of the distribution of the microphone array in a sound source localization method according to an embodiment of the present invention.
  • Figure 2 only schematically shows the distribution of several microphones in the microphone array.
  • the distribution form of the microphone array of the present invention is not limited to the situation shown in Figure 2, the distribution of the microphone array of the present invention The form can have other different distribution forms, which will not be listed here.
  • the microphone array includes a first linear microphone set composed of three microphones M1, M2, M3 arranged in a horizontal direction, and two microphones M2, M4 arranged in a vertical direction.
  • the second line microphone collection is a first linear microphone set composed of three microphones M1, M2, M3 arranged in a horizontal direction, and two microphones M2, M4 arranged in a vertical direction.
  • the distances between two adjacent microphones in the microphones M1, M2, and M3 are equal; preferably, the microphones M1 and M3 are symmetrically distributed on the microphone with the line on which the microphones M2 and M4 are located as the axis. Both sides of M2 and M4.
  • the sound source localization process of the microphone array shown in FIG. 2 is the same as the real-time process of the sound source localization method introduced above, and will not be further described here.
  • the sound source localization method is to arrange a microphone array with a specific shape distribution, wherein the microphone array includes a first linear distribution microphone set and a second linear distribution microphone set arranged perpendicular to each other, and then based on The first line-shaped distributed microphone set and the second line-shaped distributed microphone set respectively obtain the first position information and the second position information about the target to be located, and finally calculate the information about the target object according to the first position information and the second position information.
  • the 360° full-plane positioning information of the target to be located can overcome the defect that the traditional sound source localization method cannot distinguish the front and back directions of the flat panel shape with a narrow frame, thereby effectively improving the sound source localization of the target with a flat frame shape with a narrow frame Accuracy.
  • Fig. 3 is a schematic structural diagram of a sound source localization device provided by an embodiment of the present invention.
  • the device can be implemented as part or all of an electronic device through software, hardware, or a combination of both.
  • the sound source localization device includes an array arrangement module 301, a first collection module 302 and a second collection module 303.
  • the array arrangement module 301 is used to arrange a T-shaped distributed microphone array composed of several microphones.
  • the T-shaped distributed microphone array includes a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
  • the acquisition module 302 is used to calculate the first position information about the target based on the voice signal collected by the first linear distributed microphone set;
  • the second acquisition module 303 is used to calculate the voice signal based on the voice signal collected by the second linear distributed microphone set
  • the second position information about the target, and the 360° full-plane positioning information about the target is determined according to the first position information and the second position information.
  • the embodiment of the present invention also provides a sound source localization device, which includes:
  • a memory for storing processor executable instructions
  • the processor is configured to execute:
  • Step (1) arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
  • Step (2) based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target;
  • Step (3) based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the 360° full-plane positioning information about the target according to the first position information and the second position information ;
  • the above-mentioned processor may also be configured to:
  • arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along a first direction to form a first linear distributed microphone set, and arranging at least one microphone array in a second direction perpendicular to the first direction.
  • One microphone to form a second linear distribution microphone set One microphone to form a second linear distribution microphone set.
  • the above-mentioned processor may also be configured to:
  • step (1) the microphones in the second linear distributed microphone set are all located on the same side of the first linear distributed microphone set, and the second direction passes through one of the microphones in the first linear distributed microphone set.
  • the above-mentioned processor may also be configured to:
  • step (2) calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on the time delay estimation algorithm, where the first position The information is the 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set.
  • the above-mentioned processor may also be configured to:
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes performing cross-correlation processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, and performing cross-correlation processing on the cross-correlation processing. As a result, a controllable power response search process is performed, and the first position information is calculated.
  • the above-mentioned processor may also be configured to:
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes performing generalized cross-correlation function processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, where the generalized cross-correlation function is also The weighting function of the cross power spectral density between different microphones is introduced, and the first position information is calculated according to the generalized cross-correlation phase transformation algorithm of the generalized cross-correlation function.
  • the above-mentioned processor may also be configured to:
  • step (3) calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a small-pitch microphone differential array, and then based on The micro-pitch microphone differential array is combined with the corresponding differential array algorithm to calculate the second position information.
  • the above-mentioned processor may also be configured to:
  • step (3) calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed beamformer, and at the same time obtaining the corresponding fixed beamformer The first-order differential beam pattern, and the second position information is calculated based on the first-order differential beam pattern.
  • the above-mentioned processor may also be configured to:
  • step (3) calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes taking the small-pitch microphone differential array as the fixed beamformer, and designing different information about the fixed beamformer.
  • the first beam weight and the second beam weight are calculated, and the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight are calculated, and then according to the first output signal energy and the second output signal energy The larger one of the output signal energy calculates the second position information.
  • the above-mentioned processor may also be configured to:
  • Calculating the energy of the first output signal specifically includes, for the first beam weight, taking the desired direction of the first beam weight as the front and the null direction of the first beam weight as the back, for the input signal of the differential microphone array with fine pitch Perform weighted summation processing to obtain the energy of the first output signal; or, calculating the energy of the second output signal specifically includes, for the second beam weight, taking the expected direction of the second beam weight as the rear and taking the second beam weight
  • the null direction of is forward, and the input signal of the differential array of small-pitch microphones is weighted and summed to obtain the second output signal energy.
  • Fig. 4 is a structural block diagram of a sound source localization device provided by an embodiment of the present invention.
  • the device 40 may be provided as a server.
  • the device 40 includes a processing component 402, which further includes one or more processors, and a memory resource represented by the memory 404, for storing instructions executable by the processing component 402, such as application programs.
  • the application program stored in the memory 404 may include one or more modules each corresponding to a set of instructions.
  • the processing component 402 is configured to execute instructions to perform the above-mentioned method.
  • the device 40 may also include a power component 406 configured to perform power management of the device 40, a wired or wireless network interface 408 configured to connect the device 40 to a network, and an input/output (I/O) interface 410.
  • the device 40 can operate based on an operating system stored in the memory 404, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
  • the embodiments of the present disclosure also provide a non-transitory computer-readable storage medium.
  • the instructions in the storage medium are executed by the processor of the device 40, the device 40 can execute the above sound source localization method.
  • the method includes:
  • Step (1) arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
  • Step (2) based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target;
  • Step (3) based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the 360° full-plane positioning information about the target according to the first position information and the second position information ;
  • step (1) arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along the first direction to form a first linearly distributed microphone set, which is perpendicular to the first direction. At least one microphone is arranged in the second direction to form a second linear distributed microphone set.
  • step (1) the microphones in the second linearly distributed microphone set are all located on the same side of the first linearly distributed microphone set, and the second direction passes through one of the first linearly distributed microphone sets. A microphone.
  • step (2) calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on the delay estimation algorithm , Wherein the first position information is 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set.
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes performing cross-correlation processing on the voice signal corresponding to each microphone in the first linear distributed microphone set, and The mutual results obtained by the cross-correlation processing are subjected to controllable power response search processing, thereby calculating the first position information.
  • step (2) calculating the first position information based on the time delay estimation algorithm specifically includes performing generalized cross-correlation function processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, Among them, the generalized cross-correlation function also introduces a weighting function on the cross-power spectral density between different microphones, and then calculates the first position information according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function.
  • step (3) calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a small distance
  • the microphone differential array is based on the small-pitch microphone differential array and combined with the corresponding differential array algorithm to calculate the second position information.
  • step (3) calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed beamformer and simultaneously acquiring The first-order differential beam pattern corresponding to the fixed beamformer, and the second position information is calculated based on the first-order differential beam pattern.
  • step (3) calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes using the small-pitch microphone differential array as a fixed beamformer, and at the same time designing
  • the fixed beamformer has different first beam weights and second beam weights, and calculates the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight, and then according to the first The larger one of the energy of the output signal and the energy of the second output signal calculates the second position information.
  • calculating the energy of the first output signal specifically includes, for the first beam weight, taking the desired direction of the first beam weight as the front and taking the null direction of the first beam weight as the back, and for the small spacing
  • the input signal of the microphone differential array is weighted and summed to obtain the first output signal energy
  • calculating the second output signal energy specifically includes, for the second beam weight, taking the expected direction of the second beam weight as the rear and Taking the null direction of the second beam weight as the front, weighting and summation processing is performed on the input signal of the small-pitch microphone differential array to obtain the second output signal energy.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)

Abstract

A sound source locating method, a device and a storage medium. The sound source locating method comprises: arranging a microphone array which has a specific shape distribution, the microphone array including a microphone set in a first linear distribution and a microphone set in a second linear distribution, which are arranged perpendicular to each other; then, on the basis of the microphone set in a first linear distribution and the microphone set in a second linear distribution, respectively acquiring first position information and second position information of a target to be located; and finally, calculating, according to the first position information and the second position information, 360-degree full-plane locating information of the target to be located.

Description

一种声源定位方法、装置及存储介质Sound source localization method, device and storage medium 技术领域Technical field
本发明涉及目标物定位的技术领域,特别涉及一种声源定位方法、装置及存储介质。The present invention relates to the technical field of target positioning, in particular to a sound source positioning method, device and storage medium.
背景技术Background technique
声源定位是指通过采集目标物发出的声音信号,并对该声音信号进行特定的算法运算,从而确定该目标物相对于声音采集装置的位置。为了提高声音采集装置采集声音信号的准确性,该声音采集装置通常是通过一麦克风阵列来收集该目标物的声音信号,而该麦克风阵列包括若干个单麦克风按照特定的排列分布而组成的阵列,由于目标物发出的声音信号是以该目标物为球心并向外发散传播的球面声波,若声音采集装置只通过单个麦克风来采集声音信号,这会导致采集到的声音信号发生缺失和不完整的情况,而通过麦克风阵列来采集声音信号,则能够最大限度地获得该声音信号的完整信息,并且由于目标物与声音采集装置之间存在相对间距,采用麦克风阵列还能够进一步获得该声音信号关于麦克风阵列中不同麦克风之间的信号差异,而该信号差异能够进一步提高该目标物的位置计算准确性,这是采用单个麦克风采集声音信号所不能达到的效果。Sound source localization refers to collecting the sound signal emitted by a target object and performing a specific algorithm operation on the sound signal to determine the position of the target object relative to the sound collection device. In order to improve the accuracy of the sound signal collected by the sound collection device, the sound collection device usually collects the sound signal of the target through a microphone array, and the microphone array includes an array composed of a number of single microphones distributed in a specific arrangement. Since the sound signal emitted by the target is a spherical sound wave with the target as the center of the sphere and diverging and spreading outward, if the sound collection device only collects the sound signal through a single microphone, this will cause the collected sound signal to be missing and incomplete If the sound signal is collected by the microphone array, the complete information of the sound signal can be obtained to the greatest extent, and due to the relative distance between the target and the sound collection device, the microphone array can also be used to further obtain the sound signal. The signal difference between different microphones in the microphone array, and the signal difference can further improve the accuracy of the position calculation of the target object, which is an effect that cannot be achieved by using a single microphone to collect sound signals.
此外,目前基于麦克风阵列采集的声音信号而进行的声源定位算法主要包括基于时延估计的定位算法,即TDE,基于高分辨率谱估计的定位算法,基于稀疏表示的定位算法;其中,基于时延估计的定位算法的计算核心在于对声波传播时延的准确估计,其一般是通过对麦克风阵列中不同麦克风之间采集到的声音信号进行互相关处理后而得到;为了进一步获得声源的位置信息,还可对该互相关处理得到的结果进行简单的延时求和、几何计算或者可控功率响应搜 索等算法计算,而上述的算法较容易实现,其运算量小以及便于实时处理,因此在实际计算中被广泛应用。In addition, current sound source localization algorithms based on sound signals collected by microphone arrays mainly include localization algorithms based on time delay estimation, namely TDE, localization algorithms based on high-resolution spectrum estimation, and localization algorithms based on sparse representation; among them, based on The calculation core of the positioning algorithm of time delay estimation lies in the accurate estimation of the sound wave propagation time delay, which is generally obtained by performing cross-correlation processing on the sound signals collected between different microphones in the microphone array; in order to further obtain the sound source The position information can also be calculated by simple delay summation, geometric calculation, or controllable power response search for the results obtained by the cross-correlation processing. The above-mentioned algorithm is easier to implement, and its computational complexity is small and it is convenient for real-time processing. Therefore, it is widely used in actual calculations.
虽然,现有技术已经存在基于麦克风阵列和相应算法相结合以对目标物进行360度的全方位定位手段,但是当目标物具有窄边框平板形状时,现有的声源定位手段很难获得较为准确的定位结果,这是因为在此情况下麦克风阵列很难区分目标物的前后方向,这对于需要通过声源定位的方法来准确计算具有窄边框平板形状的目标物带来极大的难度。Although the prior art already has a 360-degree omnidirectional positioning method based on the combination of a microphone array and a corresponding algorithm, when the target has a flat-frame shape with a narrow frame, the existing sound source positioning method is difficult to obtain relatively Accurate positioning results, because in this case, it is difficult for the microphone array to distinguish the front and rear directions of the target, which brings great difficulty to the accurate calculation of the target with a narrow frame plate shape through the method of sound source positioning.
发明内容Summary of the invention
在基于麦克风阵列的声源定位方法中,现有的麦克风阵列,比如线性麦克风阵列或者差分麦克风阵列,都无法在目标物具有窄边框平板形状的情况下准确地区分目标物的前方和后方,由于现有的大多数智能语音交互设备,例如手机、平板电脑或者平板电视等都具有该窄边框平板形状,这将严重地限制声源定位方法应用于这类型的智能语音交互设备中,若在此类智能语音交互设备中使用传统的声源定位方法进行定位,其得到的定位结果都会达不到基本的精度要求。另外,现有的线性麦克风阵列或者差分麦克风阵列的布置方式都不能高效地获取具有窄边框平板形状的目标物的声音相关信号,这类声音相关信号,比如声音强度差异或者声音沿不同方位角的变化趋势等,对于后续的定位算法处理具有重要的计算意义,可见现有的麦克风阵列布置方式并不能有效地提高声源定位方法的定位精度。In the sound source localization method based on the microphone array, the existing microphone arrays, such as linear microphone arrays or differential microphone arrays, cannot accurately distinguish the front and back of the target when the target has a narrow frame flat shape. Most of the existing smart voice interaction devices, such as mobile phones, tablets, or flat-panel TVs, have this narrow-border tablet shape, which will severely limit the application of sound source localization methods to this type of smart voice interaction devices. The traditional sound source localization method is used in the similar intelligent voice interactive equipment to locate, and the positioning results obtained will not meet the basic accuracy requirements. In addition, the existing arrangement of linear microphone arrays or differential microphone arrays cannot efficiently obtain the sound-related signals of a target with a narrow-frame flat plate shape. Such sound-related signals, such as differences in sound intensity or sound along different azimuths The changing trend, etc., have important computational significance for the subsequent positioning algorithm processing. It can be seen that the existing microphone array arrangement cannot effectively improve the positioning accuracy of the sound source positioning method.
针对现有技术存在的缺陷,本发明提供一种声源定位方法、装置及存储介质,该方法是通过布置具有特定形状分布的麦克风阵列,其中该麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合,接着基于该第一线型分布麦克风集合和第二线型分布麦克风集合,分别获取关于待定位目标物的第一位置信息和第二位置信息,最后再根据该第一位置信息和第二位置信息计算出关于该待定位目标物的360°全平面定位信息。实际上,该第 一线型分布麦克风集合和第二线型分布麦克风集合将该待定位目标物所处的全平面空间划分为两个半平面空间,然后依次在该两个半平面空间内获取该待定位目标物的空间位置信息,并根据这两个半平面空间内对应的空间位置信息综合计算出该待定位目标物的全平面位置信息,这样能够克服传统声源定位方法无法区分具有窄边框平板形状物体前后方向的缺陷,从而有效地提高对具有窄边框平板形状的目标物的声源定位精度。In view of the defects in the prior art, the present invention provides a sound source localization method, device and storage medium. The method is to arrange a microphone array with a specific shape distribution, wherein the microphone array includes first linear distributions arranged perpendicular to each other. The microphone set and the second linear distributed microphone set, and then based on the first linear distributed microphone set and the second linear distributed microphone set, the first position information and the second position information of the target to be located are obtained respectively, and finally according to the The first position information and the second position information calculate 360° full-plane positioning information about the target to be positioned. In fact, the first set of linearly distributed microphones and the second set of linearly distributed microphones divide the full-planar space in which the target to be positioned is located into two half-planar spaces, and then obtain the two half-planar spaces in sequence. The spatial position information of the target to be located, and the full-planar position information of the target to be located is comprehensively calculated according to the corresponding spatial position information in the two half-planar spaces, which can overcome the inability of traditional sound source localization methods to distinguish narrow borders The defects in the front and rear directions of the flat-shaped object, thereby effectively improving the sound source localization accuracy of the target with a narrow-frame flat-shaped object.
根据本发明实施例的第一方面,提供一种声源定位方法,所述声源定位方法包括如下步骤:According to a first aspect of the embodiments of the present invention, a sound source localization method is provided. The sound source localization method includes the following steps:
步骤(1),布置由若干麦克风组成的呈T型分布的麦克风阵列,所述T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;Step (1), arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
步骤(2),基于所述第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;Step (2): Calculate first position information about the target based on the voice signal collected by the first linear distributed microphone set;
步骤(3),基于所述第二线型分布麦克风集合采集的语音信号,计算出关于所述目标物的第二位置信息,并根据所述第一位置信息和所述第二位置信息确定关于所述目标物的360°全平面定位信息;Step (3), based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the second position information about the target object according to the first position information and the second position information. State the 360° full plane positioning information of the target;
在一个实施例中,在步骤(1)中,布置所述呈T型分布的麦克风阵列具体包括,沿第一方向相隔预定间距设置若干麦克风以形成所述第一线型分布麦克风集合,沿与所述第一方向垂直的第二方向设置至少一个麦克风以形成所述第二线型分布麦克风集合。In one embodiment, in step (1), arranging the microphone array in a T-shaped distribution specifically includes: arranging a number of microphones at a predetermined interval along a first direction to form the first line-shaped distributed microphone set, and At least one microphone is arranged in a second direction perpendicular to the first direction to form the second linear distributed microphone set.
在一个实施例中,在步骤(1)中,所述第二线型分布麦克风集合中的所有麦克风均位于所述第一线型分布麦克风集合的同一侧,所述第二方向穿过所述第一线型分布麦克风集合中的其中一个麦克风。In one embodiment, in step (1), all microphones in the second linear distributed microphone set are located on the same side of the first linear distributed microphone set, and the second direction passes through the first linear distributed microphone set. One of the microphones in a line-type distributed microphone set.
在一个实施例中,在步骤(2)中,计算所述第一位置信息具体包括,先获取所述第一线型分布麦克风集合中每一麦克风的语音信号,再基于时延估计算法计算得到所述第一位置信息,其中,所述第一位置信息是以所述第二线型分 布麦克风集合对应设置方向为基准的、关于所述目标物的0°-180°的半平面位置信息。In one embodiment, in step (2), calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating it based on a time delay estimation algorithm The first position information, wherein the first position information is 0°-180° half-plane position information of the target based on a corresponding setting direction of the second linear distributed microphone set.
在一个实施例中,在步骤(2)中,基于所述时延估计算法计算得到所述第一位置信息具体包括,对所述第一线型分布麦克风集合中每一个麦克风对应的语音信号进行互相关处理,并对所述互相关处理得到的互相结果进行可控功率响应搜索处理,从而计算得到所述第一位置信息。In one embodiment, in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes: performing voice signals corresponding to each microphone in the first linearly distributed microphone set. Cross-correlation processing, and performing controllable power response search processing on the mutual results obtained by the cross-correlation processing, so as to obtain the first position information by calculation.
在一个实施例中,在步骤(2)中,基于所述时延估计算法计算得到所述第一位置信息具体包括,对所述第一线型分布麦克风集合中每一个麦克风对应的语音信号进行广义互相关函数处理,其中所述广义互相关函数还引入了关于不同麦克风之间互功率谱密度的加权函数,再根据关于所述广义互相关函数的广义互相关相位变换算法计算得到所述第一位置信息。In one embodiment, in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes: performing voice signals corresponding to each microphone in the first linearly distributed microphone set. Generalized cross-correlation function processing, wherein the generalized cross-correlation function also introduces a weighting function on the cross-power spectral density between different microphones, and then calculates the first step according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function One location information.
在一个实施例中,在步骤(3)中,计算所述第二位置信息具体包括,选取所述第一线型分布麦克风集合中的其中一个麦克风与所述第二线型分布麦克风集合中的其中一个麦克风组成一小间距麦克风差分阵列,再基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算出所述第二位置信息。In one embodiment, in step (3), calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and the one in the second linear distributed microphone set A microphone forms a small-pitch microphone differential array, and then the second position information is calculated based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm.
在一个实施例中,在步骤(3)中,基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算所述第二位置信息具体包括,将所述小间距麦克分差分阵列作为一固定波束形成器,同时获取所述固定波束形成器对应的一阶差分波束模式,并基于所述一阶差分波束模式计算出所述第二位置信息。In one embodiment, in step (3), calculating the second position information based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed The beamformer simultaneously obtains the first-order differential beam pattern corresponding to the fixed beamformer, and calculates the second position information based on the first-order differential beam pattern.
在一个实施例中,在步骤(3)中,基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算所述第二位置信息具体包括,以所述小间距麦克风差分阵列为固定波束形成器,同时设计出关于该固定波束形成器不同的第一波束权值和第二波束权值,并计算所述第一波束权值和所述第二波束权值对应的第一输出信号能量和第二输出信号能量,再根据所述第一输出信号能量和所述第二输出信号能量中较大的一者计算出所述第二位置信息。In one embodiment, in step (3), calculating the second position information based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm specifically includes: taking the small-pitch microphone differential array as a fixed beam forming And design the first beam weight and the second beam weight that are different for the fixed beamformer at the same time, and calculate the first output signal energy sum corresponding to the first beam weight and the second beam weight The second output signal energy is calculated according to the larger one of the first output signal energy and the second output signal energy.
在一个实施例中,计算所述第一输出信号能量具体包括,针对所述第一波 束权值,以所述第一波束权值的期望方向为前方和以所述第一波束权值的零陷方向为后方,对所述小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到所述第一输出信号能量;或者,计算所述第二输出信号能量具体包括,针对所述第二波束权值,以所述第二波束权值的期望方向为后方和以所述第二波束权值的零陷方向为前方,对所述小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到所述第二输出信号能量。In an embodiment, calculating the energy of the first output signal specifically includes, for the first beam weight, taking the expected direction of the first beam weight as the front and taking the zero of the first beam weight. The trap direction is backward, and the input signals of the small-pitch microphone differential array are weighted and summed to obtain the first output signal energy; or, calculating the second output signal energy specifically includes: Beam weights, taking the expected direction of the second beam weights as the rear and the null direction of the second beam weights as the front, performing weighted summation processing on the input signals of the small-pitch microphone differential array, Thus, the energy of the second output signal is obtained.
根据本发明实施例的第二方面,提供一种声源定位装置,包括:According to a second aspect of the embodiments of the present invention, there is provided a sound source localization device, including:
阵列布置模块,用于布置由若干麦克风组成的呈T型分布的麦克风阵列,所述T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;An array arrangement module for arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
第一采集模块,用于基于所述第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;The first collection module is configured to calculate the first position information about the target based on the voice signal collected by the first linear distributed microphone set;
第二采集模块,用于基于所述第二线型分布麦克风集合采集的语音信号,计算出关于所述目标物的第二位置信息,并根据所述第一位置信息和所述第二位置信息确定关于所述目标物的360°全平面定位信息。The second collection module is configured to calculate second position information about the target based on the voice signal collected by the second linear distributed microphone set, and determine according to the first position information and the second position information 360° full plane positioning information about the target.
根据本发明实施例的第三方面,提供一种声源定位装置,包括:According to a third aspect of the embodiments of the present invention, there is provided a sound source localization device, including:
处理器;processor;
用于存储处理器可执行指令的存储器;A memory for storing processor executable instructions;
其中,所述处理器被配置为执行第一方面任一实施例所述方法的步骤。Wherein, the processor is configured to execute the steps of the method described in any embodiment of the first aspect.
根据本发明实施例的第四方面,提供一种计算机可读存储介质,其上存储有计算机指令,其特征在于,该指令被处理器执行时实现第一方面任一实施例所述方法的步骤。According to a fourth aspect of the embodiments of the present invention, there is provided a computer-readable storage medium having computer instructions stored thereon, characterized in that, when the instructions are executed by a processor, the steps of the method described in any of the embodiments of the first aspect are implemented .
相比于现有技术,本发明的声源定位方法通过相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合将该待定位目标物所处的全平面空间划分为两个半平面空间,然后依次在该两个半平面空间内获取该待定位目标物的空间位置信息,并根据这两个半平面空间内对应的空间位置信息综合计算 出该待定位目标物的全平面位置信息,这样能够克服传统声源定位方法无法区分具有窄边框平板形状前后方向的缺陷,从而有效地提高对具有窄边框平板形状的目标物的声源定位精度。Compared with the prior art, the sound source localization method of the present invention divides the full-planar space where the target object to be located is located into two halves through the first linear distributed microphone set and the second linear distributed microphone set arranged perpendicular to each other. Plane space, and then sequentially obtain the spatial position information of the target to be positioned in the two half-plane spaces, and comprehensively calculate the full-planar position of the target to be positioned according to the corresponding spatial position information in the two half-plane spaces In this way, the traditional sound source localization method can overcome the defect that the traditional sound source localization method cannot distinguish the front and rear directions of the flat panel shape with a narrow frame, thereby effectively improving the sound source localization accuracy of the target object with the flat frame shape of the narrow frame.
本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在所写的说明书、权利要求书、以及附图中所特别指出的结构来实现和获得。Other features and advantages of the present invention will be described in the following description, and partly become obvious from the description, or understood by implementing the present invention. The purpose and other advantages of the present invention can be realized and obtained by the structures specifically pointed out in the written description, claims, and drawings.
下面通过附图和实施例,对本发明的技术方案做进一步的详细描述。The technical solutions of the present invention will be further described in detail below through the accompanying drawings and embodiments.
附图说明Description of the drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1为本发明提供的一种声源定位方法的流程示意图;Fig. 1 is a schematic flowchart of a sound source localization method provided by the present invention;
图2为本发明提供的一种声源定位方法中麦克风阵列的分布示意图;2 is a schematic diagram of the distribution of microphone arrays in a sound source localization method provided by the present invention;
图3为本发明实施例中一种声源定位装置的模块图;Figure 3 is a block diagram of a sound source localization device in an embodiment of the present invention;
图4为本发明实施例中一种声源定位装置的框图。Fig. 4 is a block diagram of a sound source localization device in an embodiment of the present invention.
具体实施方式detailed description
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.
参阅图1,为本发明实施例提供的一种声源定位方法的流程示意图。该声源定位方法是通过在待定位的目标物附近设置具有特定形状分布的麦克风阵 列,该麦克风阵列能够将该待定位的目标物所处位置的附件空间区域划分为前方区域、后方区域、左侧区域和右侧区域这四个不同的空间区域部分,当该待定位的目标物具有窄边框平板形状时,上述四个不同空间区域部分的划分能够在不同半平面空间内依次对该待定位的目标物进行定位检测,这能够有效地克服现有的声源定位方法中无法区分辨别具有窄边框平板形状的目标物前后方向的不足。实际上,本发明的声源定位方法的定位检测基本构思为通过该麦克风阵列中的两个不同线型分布麦克风集合分别采集不同的语音信号,并基于该不同的语音信号分别计算出对应的不同位置信息,而该不同位置信息是关于该目标物的不同位置数据,这样根据该不同位置信息就能够确定关于该目标物的全平面定位信息。优选地,该麦克风阵列中的若干麦克风可相互形成T型的阵列分布形式。Refer to FIG. 1, which is a schematic flowchart of a sound source localization method provided by an embodiment of the present invention. The sound source localization method is to arrange a microphone array with a specific shape distribution near the target object to be located. The microphone array can divide the accessory space area of the target object to be located into a front area, a rear area, and a left area. The side area and the right area are four different space area parts. When the target to be positioned has a narrow frame flat plate shape, the division of the above four different space area parts can be sequentially in different half-plane spaces. This can effectively overcome the inability to distinguish the front-to-rear direction of the target object with a narrow frame and flat plate shape in the existing sound source localization method. In fact, the basic idea of the location detection of the sound source location method of the present invention is to collect different voice signals through two different linearly distributed microphone sets in the microphone array, and calculate the corresponding different voice signals based on the different voice signals. Position information, and the different position information is different position data about the target, so that the full-plane positioning information about the target can be determined according to the different position information. Preferably, several microphones in the microphone array can mutually form a T-shaped array distribution form.
具体而言,该声源定位方法可包括如下步骤:Specifically, the sound source localization method may include the following steps:
步骤(1),布置由若干麦克风组成的呈T型分布的麦克风阵列,该T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合。Step (1): arranging a T-shaped distributed microphone array composed of several microphones. The T-shaped distributed microphone array includes a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other.
优选地,在该步骤(1)中,布置该呈T型分布的麦克风阵列具体包括,沿第一方向相隔预定间距设置若干麦克风以形成该第一线型分布麦克风集合,沿与该第一方向垂直的第二方向设置至少一个麦克风以形成该第二线型分布麦克风集合。其中,该第一线型分布麦克风集合可包括至少三个麦克风,该第二线型分布麦克风集合可包括至少两个麦克风;进一步,该第一线型分布麦克风集合和该第二线型分布麦克风集合相互之间可存在公用的一个或者若干个麦克风。Preferably, in the step (1), arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along a first direction to form the first linearly distributed microphone set, which is aligned with the first direction. At least one microphone is arranged in the second vertical direction to form the second linear distributed microphone set. Wherein, the first linear distributed microphone set may include at least three microphones, and the second linear distributed microphone set may include at least two microphones; further, the first linear distributed microphone set and the second linear distributed microphone set are mutually There may be one or several microphones in common.
优选地,在该步骤(1)中,该第二线型分布麦克风集合中的所有麦克风可均位于该第一线型分布麦克风集合的同一侧,该第二方向穿过该第一线型分布麦克风集合中的其中一个麦克风。其中,该第一线型分布麦克风集合中的所有麦克风可以该第二线型分布麦克风集合所在的轴线为对称轴均匀地地分布在 该第二线型分布麦克风集合的两侧。进一步,该第一线型分布麦克风集合或者第二线型分布麦克风集合中的所有相邻的两个麦克风之间的间距可相等,这样能够保证该第一线型分布麦克分集合或者第二线型分布麦克风集合中不同麦克风在声场分布中的接收性能是一致的。Preferably, in the step (1), all the microphones in the second linear distributed microphone set may be located on the same side of the first linear distributed microphone set, and the second direction passes through the first linear distributed microphone One of the microphones in the collection. Wherein, all the microphones in the first linear distributed microphone set can be evenly distributed on both sides of the second linear distributed microphone set with the axis of the second linear distributed microphone set as the axis of symmetry. Further, the distance between all adjacent two microphones in the first linear distributed microphone set or the second linear distributed microphone set may be equal, so as to ensure the first linear distributed microphone diversity set or the second linear distributed microphone set Different microphones in the microphone set have the same receiving performance in the sound field distribution.
步骤(2),基于该第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息。Step (2), based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target.
优选地,在该步骤(2)中,计算该第一位置信息可具体包括,先获取该第一线型分布麦克风集合中每一麦克风的语音信号,再基于时延估计算法计算得到该第一位置信息;其中,该第一位置信息是以该第二线型分布麦克风集合对应设置方向为基准的、关于该目标物的0°-180°的半平面位置信息。进一步,在计算该第一位置的过程中,控制该第一线型分布麦克风集合中的每一个麦克风对目标物实时产生的声音信号进行接收,再根据每一个麦克风接收的声音信号通过时延估计算法计算出该第一位置信息。Preferably, in this step (2), calculating the first position information may specifically include: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on a time delay estimation algorithm. Position information; where the first position information is based on the 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set. Further, in the process of calculating the first position, each microphone in the first linearly distributed microphone set is controlled to receive the sound signal generated by the target in real time, and then the sound signal received by each microphone is estimated to pass the time delay. The algorithm calculates the first position information.
优选地,在该步骤(2)中,基于该时延估计算法计算得到第一位置信息可具体包括,对该第一线型分布麦克风集合中每一个麦克风对应的语音信号进行互相关处理,并对该互相关处理得到的互相结果进行可控功率响应搜索处理,从而计算得到该第一位置信息。Preferably, in this step (2), calculating the first position information based on the time delay estimation algorithm may specifically include performing cross-correlation processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, and Performing controllable power response search processing on the mutual results obtained by the cross-correlation processing, thereby calculating the first position information.
优选地,在该步骤(2)中,基于该时延估计算法计算得到该第一位置信息可具体包括,对该第一线型分布麦克风集合中每一个麦克风对应的语音信号进行广义互相关函数处理,其中该广义互相关函数还引入了关于不同麦克风之间互功率谱密度的加权函数,再根据关于该广义互相关函数的广义互相关相位变换算法计算得到该第一位置信息。Preferably, in this step (2), calculating the first position information based on the time delay estimation algorithm may specifically include: performing a generalized cross-correlation function on the voice signal corresponding to each microphone in the first linearly distributed microphone set Processing, wherein the generalized cross-correlation function also introduces a weighting function on the cross power spectral density between different microphones, and then calculates the first position information according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function.
其中,该时延估计算法可为基于广义互相关函数的时延估计算法,并且该时延估计算法在实施过程中还引入了一个加权函数,该加权函数对声音信号的互功率谱密度进行调整,从而优选该时延估计算法的计算精度。优选地,根据该加权函数的不同类型,该时延估计算法对应的广义互相关函数可相应地具有 多种不同的变形,该基于广义互相关函数的时延估计算法可具体为广义互相关-相位变换法(GCC-PHAT),该广义互相关-相位变换法本身具有一定的抗噪声和抗混响能力,故采用该算法能够在一定程度上降低该第一线型分布麦克风集合中每一麦克风的语音信号存在的噪声和/或混响对该第一位置信息引起的干扰。简单来说,该基于广义互相关函数的时延估计算法是根据两个麦克风采集的声音信号之间的互相关函数峰值在进行时延值的估计,这是由于在同一声源定位系统中,麦克风阵列集合中每一个麦克风接收到的声音信号都是来自同一声源,这使得不同麦克风对应的通道信号之间存在较强的相关性,这样通过计算每两个麦克风对应的通道信号之间的相关函数,就能够确定对应的两个麦克风对应通道信号之间的时延值,最后再跟该时延值就能够计算出声源对应的位置信息。Among them, the time delay estimation algorithm can be a time delay estimation algorithm based on a generalized cross-correlation function, and the time delay estimation algorithm also introduces a weighting function during the implementation process, which adjusts the cross power spectral density of the sound signal , So as to optimize the calculation accuracy of the time delay estimation algorithm. Preferably, according to the different types of the weighting function, the generalized cross-correlation function corresponding to the time delay estimation algorithm can have a variety of different variants, and the time delay estimation algorithm based on the generalized cross-correlation function can be specifically generalized cross-correlation- Phase transformation method (GCC-PHAT), the generalized cross-correlation-phase transformation method itself has certain anti-noise and anti-reverberation capabilities, so the use of this algorithm can reduce to a certain extent each of the first linear distribution microphone sets The noise and/or reverberation existing in the voice signal of the microphone causes interference to the first position information. Simply put, the time delay estimation algorithm based on the generalized cross-correlation function estimates the time delay value based on the peak value of the cross-correlation function between the sound signals collected by two microphones. This is because in the same sound source positioning system, The sound signal received by each microphone in the microphone array set comes from the same sound source, which makes the channel signals corresponding to different microphones have a strong correlation, so that by calculating the difference between the channel signals corresponding to each two microphones The correlation function can determine the time delay value between the corresponding channel signals of the two corresponding microphones, and finally the position information corresponding to the sound source can be calculated with the time delay value.
步骤(3),基于该第二线型分布麦克风集合采集的语音信号,计算出关于该目标物的第二位置信息,并根据该第一位置信息和第二位置信息确定关于该目标物的360°全平面定位信息。Step (3), based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target object, and determine the 360° of the target object according to the first position information and the second position information Full plane positioning information.
优选地,在该步骤(3)中,计算该第二位置信息可具体包括,选取该第一线型分布麦克风集合中的其中一个麦克风与该第二线型分布麦克风集合中的其中一个麦克风组成一小间距麦克风差分阵列,再基于该小间距麦克风差分阵列并结合相应的差分阵列算法计算出该第二位置信息。Preferably, in the step (3), calculating the second position information may specifically include selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a The small-pitch microphone differential array is then based on the small-pitch microphone differential array and combined with the corresponding differential array algorithm to calculate the second position information.
优选地,在该步骤(3)中,基于该小间距麦克风差分阵列并结合相应的差分阵列算法计算该第二位置信息可具体包括,将该小间距麦克分差分阵列作为一固定波束形成器,同时获取该固定波束形成器对应的一阶差分波束模式,并基于该一阶差分波束模式计算出该第二位置信息。Preferably, in this step (3), calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm may specifically include: using the small-pitch microphone differential array as a fixed beamformer, At the same time, the first-order differential beam pattern corresponding to the fixed beamformer is acquired, and the second position information is calculated based on the first-order differential beam pattern.
优选地,在该步骤(3)中,基于该小间距麦克风差分阵列并结合相应的差分阵列算法计算该第二位置信息可具体包括,以该小间距麦克风差分阵列为固定波束形成器,同时设计出关于该固定波束形成器不同的第一波束权值和第二波束权值,并计算该第一波束权值和该第二波束权值对应的第一输出信号能量 和第二输出信号能量,再根据该第一输出信号能量和第二输出信号能量中较大的一者计算出该第二位置信息。Preferably, in this step (3), calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm may specifically include using the small-pitch microphone differential array as a fixed beamformer, and designing To obtain different first beam weights and second beam weights for the fixed beamformer, and calculate the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight, The second position information is calculated according to the larger one of the first output signal energy and the second output signal energy.
优选地,在该步骤(3)中,计算该第一输出信号能量可具体包括,针对该第一波束权值,以该第一波束权值的期望方向为前方和以第一波束权值的零陷方向为后方,对该小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到该第一输出信号能量;或者,计算该第二输出信号能量可具体包括,针对该第二波束权值,以该第二波束权值的期望方向为后方和以该第二波束权值的零陷方向为前方,对小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到该第二输出信号能量。Preferably, in this step (3), calculating the energy of the first output signal may specifically include, for the first beam weight, taking the expected direction of the first beam weight as the front and taking the first beam weight as the forward direction. The nulling direction is the rear, and the input signal of the small-pitch microphone differential array is weighted and summed to obtain the first output signal energy; or, calculating the second output signal energy may specifically include, for the second beam weight Value, taking the expected direction of the second beam weight as the back and the null direction of the second beam weight as the front, and performing weighted summation processing on the input signal of the differential microphone array with small spacing to obtain the second output Signal energy.
参阅图2,为本发明实施例提供的一种声源定位方法中麦克风阵列的分布示意图。其中,该图2只是示意性地展示出该麦克风阵列中若干麦克风的分布情况,当本发明的麦克风阵列的分布形式并不是只局限于该图2所示的情况,本发明的麦克风阵列的分布形式可具有其他不同的分布形式,这里并不再一一地进行列举。从该图2可见,该麦克风阵列包括由沿水平方向布置的三个麦克风M1、M2、M3共同组成的第一线型麦克风集合,以及由沿竖直方向布置的两个麦克风M2、M4共同组成的第二线型麦克风集合。优选地,在该麦克风M1、M2、M3中相邻的两个麦克风之间的间距均相等;优选地,该麦克风M1和M3以该麦克风M2和M4所在的直线为轴线对称地分布在该麦克风M2和M4的两侧。相应地,该图2所示的麦克风阵列的声源定位过程与前述介绍的声源定位方法的实时过程是相同的,这里就不再做进一步的累述。Refer to FIG. 2, which is a schematic diagram of the distribution of the microphone array in a sound source localization method according to an embodiment of the present invention. Wherein, Figure 2 only schematically shows the distribution of several microphones in the microphone array. When the distribution form of the microphone array of the present invention is not limited to the situation shown in Figure 2, the distribution of the microphone array of the present invention The form can have other different distribution forms, which will not be listed here. As can be seen from Figure 2, the microphone array includes a first linear microphone set composed of three microphones M1, M2, M3 arranged in a horizontal direction, and two microphones M2, M4 arranged in a vertical direction. The second line microphone collection. Preferably, the distances between two adjacent microphones in the microphones M1, M2, and M3 are equal; preferably, the microphones M1 and M3 are symmetrically distributed on the microphone with the line on which the microphones M2 and M4 are located as the axis. Both sides of M2 and M4. Correspondingly, the sound source localization process of the microphone array shown in FIG. 2 is the same as the real-time process of the sound source localization method introduced above, and will not be further described here.
从上述实施例可以看出,该声源定位方法是通过布置具有特定形状分布的麦克风阵列,其中该麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合,接着基于该第一线型分布麦克风集合和第二线型分布麦克风集合,分别获取关于待定位目标物的第一位置信息和第二位置信息,最后再根据该第一位置信息和第二位置信息计算出关于该待定位目标物的360°全平面定位信息,这样能够克服传统声源定位方法无法区分具有窄边框平板形 状前后方向的缺陷,从而有效地提高对具有窄边框平板形状的目标物的声源定位精度。It can be seen from the above embodiments that the sound source localization method is to arrange a microphone array with a specific shape distribution, wherein the microphone array includes a first linear distribution microphone set and a second linear distribution microphone set arranged perpendicular to each other, and then based on The first line-shaped distributed microphone set and the second line-shaped distributed microphone set respectively obtain the first position information and the second position information about the target to be located, and finally calculate the information about the target object according to the first position information and the second position information. The 360° full-plane positioning information of the target to be located can overcome the defect that the traditional sound source localization method cannot distinguish the front and back directions of the flat panel shape with a narrow frame, thereby effectively improving the sound source localization of the target with a flat frame shape with a narrow frame Accuracy.
下述为本发明装置实施例,可以用于执行本发明方法实施例。The following are device embodiments of the present invention, which can be used to implement the method embodiments of the present invention.
图3为本发明实施例提供的一种声源定位装置的结构示意图,该装置可通过软件、硬件或者两者的结合实现成为电子设备的部分或者全部。根据图3所示,声源定位装置包括阵列布置模块301、第一采集模块302和第二采集模块303。其中,阵列布置模块301用于布置由若干麦克风组成的呈T型分布的麦克风阵列,T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;第一采集模块302用于基于第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;第二采集模块303用于基于第二线型分布麦克风集合采集的语音信号,计算出关于目标物的第二位置信息,并根据第一位置信息和第二位置信息确定关于目标物的360°全平面定位信息。Fig. 3 is a schematic structural diagram of a sound source localization device provided by an embodiment of the present invention. The device can be implemented as part or all of an electronic device through software, hardware, or a combination of both. According to FIG. 3, the sound source localization device includes an array arrangement module 301, a first collection module 302 and a second collection module 303. Wherein, the array arrangement module 301 is used to arrange a T-shaped distributed microphone array composed of several microphones. The T-shaped distributed microphone array includes a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other; The acquisition module 302 is used to calculate the first position information about the target based on the voice signal collected by the first linear distributed microphone set; the second acquisition module 303 is used to calculate the voice signal based on the voice signal collected by the second linear distributed microphone set The second position information about the target, and the 360° full-plane positioning information about the target is determined according to the first position information and the second position information.
关于上述实施例中的装置,其各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the device in the above-mentioned embodiment, the specific operation mode of each module thereof has been described in detail in the embodiment of the method, and detailed description will not be given here.
本发明实施例还提供一种声源定位的装置,该装置包括:The embodiment of the present invention also provides a sound source localization device, which includes:
处理器;processor;
用于存储处理器可执行指令的存储器;A memory for storing processor executable instructions;
其中,处理器被配置为执行:Among them, the processor is configured to execute:
步骤(1),布置由若干麦克风组成的呈T型分布的麦克风阵列,T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;Step (1), arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
步骤(2),基于第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;Step (2), based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target;
步骤(3),基于第二线型分布麦克风集合采集的语音信号,计算出关于目标物的第二位置信息,并根据第一位置信息和第二位置信息确定关于目标物的 360°全平面定位信息;Step (3), based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the 360° full-plane positioning information about the target according to the first position information and the second position information ;
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(1)中,布置呈T型分布的麦克风阵列具体包括,沿第一方向相隔预定间距设置若干麦克风以形成第一线型分布麦克风集合,沿与第一方向垂直的第二方向设置至少一个麦克风以形成第二线型分布麦克风集合。In step (1), arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along a first direction to form a first linear distributed microphone set, and arranging at least one microphone array in a second direction perpendicular to the first direction. One microphone to form a second linear distribution microphone set.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(1)中,第二线型分布麦克风集合中的麦克风均位于第一线型分布麦克风集合的同一侧,第二方向穿过第一线型分布麦克风集合中的其中一个麦克风。In step (1), the microphones in the second linear distributed microphone set are all located on the same side of the first linear distributed microphone set, and the second direction passes through one of the microphones in the first linear distributed microphone set.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(2)中,计算第一位置信息具体包括,先获取第一线型分布麦克风集合中每一麦克风的语音信号,再基于时延估计算法计算得到第一位置信息,其中,第一位置信息是以第二线型分布麦克风集合对应设置方向为基准的、关于目标物的0°-180°的半平面位置信息。In step (2), calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on the time delay estimation algorithm, where the first position The information is the 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(2)中,基于时延估计算法计算得到第一位置信息具体包括,对第一线型分布麦克风集合中每一个麦克风对应的语音信号进行互相关处理,并对互相关处理得到的互相结果进行可控功率响应搜索处理,从而计算得到第一位置信息。In step (2), calculating the first position information based on the time delay estimation algorithm specifically includes performing cross-correlation processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, and performing cross-correlation processing on the cross-correlation processing. As a result, a controllable power response search process is performed, and the first position information is calculated.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(2)中,基于时延估计算法计算得到第一位置信息具体包括,对第一线型分布麦克风集合中每一个麦克风对应的语音信号进行广义互相关函数处理,其中广义互相关函数还引入了关于不同麦克风之间互功率谱密度的加权函数,再根据关于广义互相关函数的广义互相关相位变换算法计算得到第一位置信息。In step (2), calculating the first position information based on the time delay estimation algorithm specifically includes performing generalized cross-correlation function processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, where the generalized cross-correlation function is also The weighting function of the cross power spectral density between different microphones is introduced, and the first position information is calculated according to the generalized cross-correlation phase transformation algorithm of the generalized cross-correlation function.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(3)中,计算第二位置信息具体包括,选取第一线型分布麦克风集合中的其中一个麦克风与第二线型分布麦克风集合中的其中一个麦克风组成一小间距麦克风差分阵列,再基于小间距麦克风差分阵列并结合相应的差分阵列算法计算出第二位置信息。In step (3), calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a small-pitch microphone differential array, and then based on The micro-pitch microphone differential array is combined with the corresponding differential array algorithm to calculate the second position information.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(3)中,基于小间距麦克风差分阵列并结合相应的差分阵列算法计算第二位置信息具体包括,将小间距麦克分差分阵列作为一固定波束形成器,同时获取固定波束形成器对应的一阶差分波束模式,并基于一阶差分波束模式计算出第二位置信息。In step (3), calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed beamformer, and at the same time obtaining the corresponding fixed beamformer The first-order differential beam pattern, and the second position information is calculated based on the first-order differential beam pattern.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
在步骤(3)中,基于小间距麦克风差分阵列并结合相应的差分阵列算法计算第二位置信息具体包括,以小间距麦克风差分阵列为固定波束形成器,同时设计出关于该固定波束形成器不同的第一波束权值和第二波束权值,并计算第一波束权值和第二波束权值对应的第一输出信号能量和第二输出信号能量,再根据第一输出信号能量和第二输出信号能量中较大的一者计算出第二位置信息。In step (3), calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes taking the small-pitch microphone differential array as the fixed beamformer, and designing different information about the fixed beamformer. The first beam weight and the second beam weight are calculated, and the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight are calculated, and then according to the first output signal energy and the second output signal energy The larger one of the output signal energy calculates the second position information.
在一个实施例中,上述处理器还可被配置为:In an embodiment, the above-mentioned processor may also be configured to:
计算第一输出信号能量具体包括,针对第一波束权值,以第一波束权值的期望方向为前方和以第一波束权值的零陷方向为后方,对小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到第一输出信号能量;或者,计算第二输出信号能量具体包括,针对第二波束权值,以第二波束权值的期望方向为后方和以第二波束权值的零陷方向为前方,对小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到第二输出信号能量。Calculating the energy of the first output signal specifically includes, for the first beam weight, taking the desired direction of the first beam weight as the front and the null direction of the first beam weight as the back, for the input signal of the differential microphone array with fine pitch Perform weighted summation processing to obtain the energy of the first output signal; or, calculating the energy of the second output signal specifically includes, for the second beam weight, taking the expected direction of the second beam weight as the rear and taking the second beam weight The null direction of is forward, and the input signal of the differential array of small-pitch microphones is weighted and summed to obtain the second output signal energy.
图4为本发明实施例提供的一种声源定位装置的结构框图。例如,装置40可被提供为一服务器。装置40包括处理组件402,其进一步包括一个或多个处理器,以及由存储器404所代表的存储器资源,用于存储可由处理组件402的 执行的指令,例如应用程序。存储器404中存储的应用程序可以包括一个或一个以上的每一个对应于一组指令的模块。此外,处理组件402被配置为执行指令,以执行上述方法。Fig. 4 is a structural block diagram of a sound source localization device provided by an embodiment of the present invention. For example, the device 40 may be provided as a server. The device 40 includes a processing component 402, which further includes one or more processors, and a memory resource represented by the memory 404, for storing instructions executable by the processing component 402, such as application programs. The application program stored in the memory 404 may include one or more modules each corresponding to a set of instructions. In addition, the processing component 402 is configured to execute instructions to perform the above-mentioned method.
装置40还可以包括一个电源组件406被配置为执行装置40的电源管理,一个有线或无线网络接口408被配置为将装置40连接到网络,和一个输入/输出(I/O)接口410。装置40可以操作基于存储在存储器404的操作系统,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM或类似。The device 40 may also include a power component 406 configured to perform power management of the device 40, a wired or wireless network interface 408 configured to connect the device 40 to a network, and an input/output (I/O) interface 410. The device 40 can operate based on an operating system stored in the memory 404, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
本公开实施例还提供一种非临时性计算机可读存储介质,当存储介质中的指令由装置40的处理器执行时,使得装置40能够执行上述声源定位方法,该方法包括:The embodiments of the present disclosure also provide a non-transitory computer-readable storage medium. When the instructions in the storage medium are executed by the processor of the device 40, the device 40 can execute the above sound source localization method. The method includes:
步骤(1),布置由若干麦克风组成的呈T型分布的麦克风阵列,T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;Step (1), arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
步骤(2),基于第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;Step (2), based on the voice signal collected by the first linear distributed microphone set, calculate the first position information about the target;
步骤(3),基于第二线型分布麦克风集合采集的语音信号,计算出关于目标物的第二位置信息,并根据第一位置信息和第二位置信息确定关于目标物的360°全平面定位信息;Step (3), based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the 360° full-plane positioning information about the target according to the first position information and the second position information ;
在一个实施例中,在步骤(1)中,布置呈T型分布的麦克风阵列具体包括,沿第一方向相隔预定间距设置若干麦克风以形成第一线型分布麦克风集合,沿与第一方向垂直的第二方向设置至少一个麦克风以形成第二线型分布麦克风集合。In one embodiment, in step (1), arranging the microphone array in a T-shaped distribution specifically includes: arranging a plurality of microphones at a predetermined interval along the first direction to form a first linearly distributed microphone set, which is perpendicular to the first direction. At least one microphone is arranged in the second direction to form a second linear distributed microphone set.
在一个实施例中,在步骤(1)中,第二线型分布麦克风集合中的麦克风均位于第一线型分布麦克风集合的同一侧,第二方向穿过第一线型分布麦克风集合中的其中一个麦克风。In one embodiment, in step (1), the microphones in the second linearly distributed microphone set are all located on the same side of the first linearly distributed microphone set, and the second direction passes through one of the first linearly distributed microphone sets. A microphone.
在一个实施例中,在步骤(2)中,计算第一位置信息具体包括,先获取第一线型分布麦克风集合中每一麦克风的语音信号,再基于时延估计算法计算得到第一位置信息,其中,第一位置信息是以第二线型分布麦克风集合对应设置方向为基准的、关于目标物的0°-180°的半平面位置信息。In one embodiment, in step (2), calculating the first position information specifically includes: first obtaining the voice signal of each microphone in the first linearly distributed microphone set, and then calculating the first position information based on the delay estimation algorithm , Wherein the first position information is 0°-180° half-plane position information of the target based on the corresponding setting direction of the second linear distributed microphone set.
在一个实施例中,在步骤(2)中,基于时延估计算法计算得到第一位置信息具体包括,对第一线型分布麦克风集合中每一个麦克风对应的语音信号进行互相关处理,并对互相关处理得到的互相结果进行可控功率响应搜索处理,从而计算得到第一位置信息。In one embodiment, in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes performing cross-correlation processing on the voice signal corresponding to each microphone in the first linear distributed microphone set, and The mutual results obtained by the cross-correlation processing are subjected to controllable power response search processing, thereby calculating the first position information.
在一个实施例中,在步骤(2)中,基于时延估计算法计算得到第一位置信息具体包括,对第一线型分布麦克风集合中每一个麦克风对应的语音信号进行广义互相关函数处理,其中广义互相关函数还引入了关于不同麦克风之间互功率谱密度的加权函数,再根据关于广义互相关函数的广义互相关相位变换算法计算得到第一位置信息。In one embodiment, in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes performing generalized cross-correlation function processing on the voice signal corresponding to each microphone in the first linearly distributed microphone set, Among them, the generalized cross-correlation function also introduces a weighting function on the cross-power spectral density between different microphones, and then calculates the first position information according to the generalized cross-correlation phase transformation algorithm on the generalized cross-correlation function.
在一个实施例中,在步骤(3)中,计算第二位置信息具体包括,选取第一线型分布麦克风集合中的其中一个麦克风与第二线型分布麦克风集合中的其中一个麦克风组成一小间距麦克风差分阵列,再基于小间距麦克风差分阵列并结合相应的差分阵列算法计算出第二位置信息。In one embodiment, in step (3), calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and one of the microphones in the second linear distributed microphone set to form a small distance The microphone differential array is based on the small-pitch microphone differential array and combined with the corresponding differential array algorithm to calculate the second position information.
在一个实施例中,在步骤(3)中,基于小间距麦克风差分阵列并结合相应的差分阵列算法计算第二位置信息具体包括,将小间距麦克分差分阵列作为一固定波束形成器,同时获取固定波束形成器对应的一阶差分波束模式,并基于一阶差分波束模式计算出第二位置信息。In one embodiment, in step (3), calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm specifically includes: using the small-pitch microphone differential array as a fixed beamformer and simultaneously acquiring The first-order differential beam pattern corresponding to the fixed beamformer, and the second position information is calculated based on the first-order differential beam pattern.
在一个实施例中,在步骤(3)中,基于小间距麦克风差分阵列并结合相应的差分阵列算法计算第二位置信息具体包括,以小间距麦克风差分阵列为固定波束形成器,同时设计出关于该固定波束形成器不同的第一波束权值和第二波束权值,并计算第一波束权值和第二波束权值对应的第一输出信号能量和第二输出信号能量,再根据第一输出信号能量和第二输出信号能量中较大的一者计 算出第二位置信息。In one embodiment, in step (3), calculating the second position information based on the small-pitch microphone differential array combined with the corresponding differential array algorithm specifically includes using the small-pitch microphone differential array as a fixed beamformer, and at the same time designing The fixed beamformer has different first beam weights and second beam weights, and calculates the first output signal energy and the second output signal energy corresponding to the first beam weight and the second beam weight, and then according to the first The larger one of the energy of the output signal and the energy of the second output signal calculates the second position information.
在一个实施例中,计算第一输出信号能量具体包括,针对第一波束权值,以第一波束权值的期望方向为前方和以第一波束权值的零陷方向为后方,对小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到第一输出信号能量;或者,计算第二输出信号能量具体包括,针对第二波束权值,以第二波束权值的期望方向为后方和以第二波束权值的零陷方向为前方,对小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到第二输出信号能量。In one embodiment, calculating the energy of the first output signal specifically includes, for the first beam weight, taking the desired direction of the first beam weight as the front and taking the null direction of the first beam weight as the back, and for the small spacing The input signal of the microphone differential array is weighted and summed to obtain the first output signal energy; or, calculating the second output signal energy specifically includes, for the second beam weight, taking the expected direction of the second beam weight as the rear and Taking the null direction of the second beam weight as the front, weighting and summation processing is performed on the input signal of the small-pitch microphone differential array to obtain the second output signal energy.
显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. In this way, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to include these modifications and variations.

Claims (13)

  1. 一种声源定位方法,其特征在于,所述声源定位方法包括如下步骤:A sound source localization method, characterized in that the sound source localization method includes the following steps:
    步骤(1),布置由若干麦克风组成的呈T型分布的麦克风阵列,所述T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;Step (1), arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
    步骤(2),基于所述第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;Step (2): Calculate first position information about the target based on the voice signal collected by the first linear distributed microphone set;
    步骤(3),基于所述第二线型分布麦克风集合采集的语音信号,计算出关于所述目标物的第二位置信息,并根据所述第一位置信息和所述第二位置信息确定关于所述目标物的360°全平面定位信息。Step (3), based on the voice signal collected by the second linear distributed microphone set, calculate the second position information about the target, and determine the second position information about the target object according to the first position information and the second position information. The 360° full plane positioning information of the target.
  2. 如权利要求1所述的声源定位方法,其特征在于:在步骤(1)中,布置所述呈T型分布的麦克风阵列具体包括,沿第一方向相隔预定间距设置若干麦克风以形成所述第一线型分布麦克风集合,沿与所述第一方向垂直的第二方向设置至少一个麦克风以形成所述第二线型分布麦克风集合。The sound source localization method according to claim 1, characterized in that: in step (1), arranging the microphone array in a T-shaped distribution specifically includes arranging a plurality of microphones at a predetermined interval along a first direction to form the A first linear distributed microphone set, and at least one microphone is arranged along a second direction perpendicular to the first direction to form the second linear distributed microphone set.
  3. 如权利要求2所述的声源定位方法,其特征在于:在步骤(1)中,所述第二线型分布麦克风集合中的所有麦克风均位于所述第一线型分布麦克风集合的同一侧,所述第二方向穿过所述第一线型分布麦克风集合中的其中一个麦克风。The sound source localization method according to claim 2, wherein in step (1), all microphones in the second linear distributed microphone set are located on the same side of the first linear distributed microphone set, The second direction passes through one of the microphones in the first linear distributed microphone set.
  4. 如权利要求1所述的声源定位方法,其特征在于:在步骤(2)中,计算所述第一位置信息具体包括,先获取所述第一线型分布麦克风集合中每一麦克风的语音信号,再基于时延估计算法计算得到所述第一位置信息,其中,所述第一位置信息是以所述第二线型分布麦克风集合对应设置方向为基准的、关于所述目标物的0°-180°的半平面位置信息。The sound source localization method according to claim 1, wherein in step (2), calculating the first position information specifically includes first obtaining the voice of each microphone in the first linearly distributed microphone set Signal, the first position information is calculated based on the time delay estimation algorithm, wherein the first position information is based on the corresponding setting direction of the second linearly distributed microphone set and the 0° of the target object -180° half-plane position information.
  5. 如权利要求4所述的声源定位方法,其特征在于:在步骤(2)中,基 于所述时延估计算法计算得到所述第一位置信息具体包括,对所述第一线型分布麦克风集合中每一个麦克风对应的语音信号进行互相关处理,并对所述互相关处理得到的互相结果进行可控功率响应搜索处理,从而计算得到所述第一位置信息。The sound source localization method according to claim 4, wherein in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes: The voice signal corresponding to each microphone in the set is subjected to cross-correlation processing, and a controllable power response search processing is performed on the mutual result obtained by the cross-correlation processing, so as to calculate the first position information.
  6. 如权利要求4所述的声源定位方法,其特征在于:在步骤(2)中,基于所述时延估计算法计算得到所述第一位置信息具体包括,对所述第一线型分布麦克风集合中每一个麦克风对应的语音信号进行广义互相关函数处理,其中所述广义互相关函数还引入了关于不同麦克风之间互功率谱密度的加权函数,再根据关于所述广义互相关函数的广义互相关相位变换算法计算得到所述第一位置信息。The sound source localization method according to claim 4, wherein in step (2), calculating the first position information based on the time delay estimation algorithm specifically includes: The speech signal corresponding to each microphone in the set is processed by the generalized cross-correlation function, wherein the generalized cross-correlation function also introduces a weighting function on the cross power spectral density between different microphones, and then according to the generalized cross-correlation function The first position information is calculated by a cross-correlation phase transformation algorithm.
  7. 如权利要求1所述的声源定位方法,其特征在于:在步骤(3)中,计算所述第二位置信息具体包括,选取所述第一线型分布麦克风集合中的其中一个麦克风与所述第二线型分布麦克风集合中的其中一个麦克风组成一小间距麦克风差分阵列,再基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算出所述第二位置信息。The sound source localization method according to claim 1, wherein in step (3), calculating the second position information specifically includes selecting one of the microphones in the first linear distributed microphone set and all One of the microphones in the second linear distributed microphone set forms a small-pitch microphone differential array, and then the second position information is calculated based on the small-pitch microphone differential array in combination with a corresponding differential array algorithm.
  8. 如权利要求7所述的声源定位方法,其特征在于:在步骤(3)中,基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算所述第二位置信息具体包括,将所述小间距麦克分差分阵列作为一固定波束形成器,同时获取所述固定波束形成器对应的一阶差分波束模式,并基于所述一阶差分波束模式计算出所述第二位置信息。7. The sound source localization method according to claim 7, characterized in that: in step (3), calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm specifically includes: The fine-pitch mic sub-differential array is used as a fixed beamformer, and the first-order differential beam pattern corresponding to the fixed beamformer is simultaneously acquired, and the second position information is calculated based on the first-order differential beam pattern.
  9. 如权利要求7所述的声源定位方法,其特征在于:在步骤(3)中,基于所述小间距麦克风差分阵列并结合相应的差分阵列算法计算所述第二位置信息具体包括,以所述小间距麦克风差分阵列为固定波束形成器,同时设计出关于该固定波束形成器不同的第一波束权值和第二波束权值,并计算所述第一波束权值和所述第二波束权值对应的第一输出信号能量和第二输出信号能量,再根据所述第一输出信号能量和所述第二输 出信号能量中较大的一者计算出所述第二位置信息。The sound source localization method according to claim 7, characterized in that: in step (3), calculating the second position information based on the small-pitch microphone differential array in combination with the corresponding differential array algorithm specifically includes: The small-pitch microphone differential array is a fixed beamformer. At the same time, different first beam weights and second beam weights for the fixed beamformer are designed, and the first beam weights and the second beams are calculated The energy of the first output signal and the energy of the second output signal corresponding to the weight is calculated according to the larger one of the energy of the first output signal and the energy of the second output signal.
  10. 如权利要求9所述的声源定位方法,其特征在于:计算所述第一输出信号能量具体包括,针对所述第一波束权值,以所述第一波束权值的期望方向为前方和以所述第一波束权值的零陷方向为后方,对所述小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到所述第一输出信号能量;或者,计算所述第二输出信号能量具体包括,针对所述第二波束权值,以所述第二波束权值的期望方向为后方和以所述第二波束权值的零陷方向为前方,对所述小间距麦克风差分阵列的输入信号进行加权求和处理,从而得到所述第二输出信号能量。The sound source localization method according to claim 9, characterized in that: calculating the energy of the first output signal specifically includes, for the first beam weight, taking the expected direction of the first beam weight as forward and Taking the null direction of the first beam weight as the back, perform weighted summation processing on the input signals of the small-pitch microphone differential array to obtain the first output signal energy; or, calculate the second output The signal energy specifically includes, for the second beam weight, taking the expected direction of the second beam weight as the back and the null direction of the second beam weight as the front, and the difference to the small-pitch microphone The input signal of the array is subjected to weighted summation processing to obtain the energy of the second output signal.
  11. 一种声源定位装置,其特征在于,包括:A sound source localization device, characterized in that it comprises:
    阵列布置模块,用于布置由若干麦克风组成的呈T型分布的麦克风阵列,所述T型分布的麦克风阵列包括相互垂直设置的第一线型分布麦克风集合和第二线型分布麦克风集合;An array arrangement module for arranging a T-shaped distributed microphone array composed of several microphones, the T-shaped distributed microphone array including a first linear distributed microphone set and a second linear distributed microphone set arranged perpendicular to each other;
    第一采集模块,用于基于所述第一线型分布麦克风集合采集的语音信号,计算出关于目标物的第一位置信息;The first collection module is configured to calculate the first position information about the target based on the voice signal collected by the first linear distributed microphone set;
    第二采集模块,用于基于所述第二线型分布麦克风集合采集的语音信号,计算出关于所述目标物的第二位置信息,并根据所述第一位置信息和所述第二位置信息确定关于所述目标物的360°全平面定位信息。The second collection module is configured to calculate second position information about the target based on the voice signal collected by the second linear distributed microphone set, and determine according to the first position information and the second position information 360° full plane positioning information about the target.
  12. 一种声源定位装置,其特征在于,包括:A sound source localization device, characterized in that it comprises:
    处理器;processor;
    用于存储处理器可执行指令的存储器;A memory for storing processor executable instructions;
    其中,所述处理器被配置为执行权利要求1-10任一项所述方法的步骤。Wherein, the processor is configured to perform the steps of the method according to any one of claims 1-10.
  13. 一种计算机可读存储介质,其上存储有计算机指令,其特征在于,该指令被处理器执行时实现权利要求1-10任一项所述方法的步骤。A computer-readable storage medium having computer instructions stored thereon, wherein the instructions implement the steps of the method according to any one of claims 1-10 when the instructions are executed by a processor.
PCT/CN2019/091280 2019-04-01 2019-06-14 Sound source locating method, device and storage medium WO2020199351A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910258214.1A CN110095755B (en) 2019-04-01 2019-04-01 Sound source positioning method
CN201910258214.1 2019-04-01

Publications (1)

Publication Number Publication Date
WO2020199351A1 true WO2020199351A1 (en) 2020-10-08

Family

ID=67444122

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/091280 WO2020199351A1 (en) 2019-04-01 2019-06-14 Sound source locating method, device and storage medium

Country Status (2)

Country Link
CN (1) CN110095755B (en)
WO (1) WO2020199351A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110554357B (en) * 2019-09-12 2022-01-18 思必驰科技股份有限公司 Sound source positioning method and device
CN113949967A (en) * 2020-07-16 2022-01-18 华为技术有限公司 Conference voice enhancement method, device and system
CN113470683A (en) * 2021-06-25 2021-10-01 深圳市联洲国际技术有限公司 Signal output method, device, equipment and storage medium of microphone array

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100220552A1 (en) * 2009-02-27 2010-09-02 Honda Motors Method and apparatus for estimating sound source
CN102411138A (en) * 2011-07-13 2012-04-11 北京大学 Method for positioning sound source by robot
CN103832905A (en) * 2012-11-20 2014-06-04 日立电梯(中国)有限公司 Position detection device for elevator cab
CN106501773A (en) * 2016-12-23 2017-03-15 上海语知义信息技术有限公司 Sounnd source direction localization method based on difference array

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5857026A (en) * 1996-03-26 1999-01-05 Scheiber; Peter Space-mapping sound system
JP2006162461A (en) * 2004-12-08 2006-06-22 Institute Of Physical & Chemical Research Apparatus and method for detecting location of sound source
JP2007088807A (en) * 2005-09-22 2007-04-05 Yokogawa Electric Corp Method and device for presenting sound image
CN106125048B (en) * 2016-07-11 2019-05-24 浙江大华技术股份有限公司 A kind of sound localization method and device
JP2009002929A (en) * 2007-05-18 2009-01-08 Ono Sokki Co Ltd Sound source searching method, device and program
CN101782644A (en) * 2010-02-12 2010-07-21 黑龙江科技学院 Method for measuring plane two-dimensional vector sound intensity and measuring probe with adjustable measurement frequency band
CN102164328B (en) * 2010-12-29 2013-12-11 中国科学院声学研究所 Audio input system used in home environment based on microphone array
CN102305925A (en) * 2011-07-22 2012-01-04 北京大学 Robot continuous sound source positioning method
CN103235287B (en) * 2013-04-17 2015-05-20 华北电力大学(保定) Sound source localization camera shooting tracking device
CN104464739B (en) * 2013-09-18 2017-08-11 华为技术有限公司 Acoustic signal processing method and device, Difference Beam forming method and device
CN104076331B (en) * 2014-06-18 2016-04-13 南京信息工程大学 A kind of sound localization method of seven yuan of microphone arrays
CN104237849B (en) * 2014-09-26 2017-04-12 哈尔滨工业大学 Bi-pentabasic cross-array passive acoustic location integrating method
CN106019232B (en) * 2016-05-11 2018-07-10 北京地平线信息技术有限公司 Sonic location system and method
CN106028227B (en) * 2016-07-08 2019-05-24 乐鑫信息科技(上海)股份有限公司 Distributed microphone array and its applicable sonic location system
CN106098075B (en) * 2016-08-08 2018-02-02 腾讯科技(深圳)有限公司 Audio collection method and apparatus based on microphone array
CN106448722B (en) * 2016-09-14 2019-01-18 讯飞智元信息科技有限公司 The way of recording, device and system
CN107976651B (en) * 2016-10-21 2020-12-25 杭州海康威视数字技术股份有限公司 Sound source positioning method and device based on microphone array
CN106723736B (en) * 2016-12-23 2018-07-24 南京医科大学 The autonomous missing intelligent crutch of broad sense cross correlation algorithm is weighted based on phse conversion
WO2018119910A1 (en) * 2016-12-29 2018-07-05 清华大学 Method and apparatus for locating moving sound source
CN106872944B (en) * 2017-02-27 2020-05-05 海尔优家智能科技(北京)有限公司 Sound source positioning method and device based on microphone array
CN106842131B (en) * 2017-03-17 2019-10-18 浙江宇视科技有限公司 Microphone array sound localization method and device
CN107202976B (en) * 2017-05-15 2020-08-14 大连理工大学 Low-complexity distributed microphone array sound source positioning system
CN107422305B (en) * 2017-06-06 2020-03-13 歌尔股份有限公司 Microphone array sound source positioning method and device
CN108254721A (en) * 2018-04-13 2018-07-06 歌尔科技有限公司 A kind of positioning sound source by robot and robot
CN108828525A (en) * 2018-06-25 2018-11-16 大连大学 The more sonic location systems of distributed microphone array in reverberation room
CN108986832B (en) * 2018-07-12 2020-12-15 北京大学深圳研究生院 Binaural voice dereverberation method and device based on voice occurrence probability and consistency

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100220552A1 (en) * 2009-02-27 2010-09-02 Honda Motors Method and apparatus for estimating sound source
CN102411138A (en) * 2011-07-13 2012-04-11 北京大学 Method for positioning sound source by robot
CN103832905A (en) * 2012-11-20 2014-06-04 日立电梯(中国)有限公司 Position detection device for elevator cab
CN106501773A (en) * 2016-12-23 2017-03-15 上海语知义信息技术有限公司 Sounnd source direction localization method based on difference array

Also Published As

Publication number Publication date
CN110095755A (en) 2019-08-06
CN110095755B (en) 2021-03-12

Similar Documents

Publication Publication Date Title
WO2020199351A1 (en) Sound source locating method, device and storage medium
Svaizer et al. Acoustic source location in a three-dimensional space using crosspower spectrum phase
US20210314701A1 (en) Multiple-source tracking and voice activity detections for planar microphone arrays
CN103308889B (en) Passive sound source two-dimensional DOA (direction of arrival) estimation method under complex environment
Gazzah et al. Optimum ambiguity-free directional and omnidirectional planar antenna arrays for DOA estimation
CN111123192B (en) Two-dimensional DOA positioning method based on circular array and virtual extension
CN103620438A (en) Systems and methods for blind localization of correlated sources
CN108020812B (en) Two-dimensional DOA estimation method based on special three-parallel line array structure
CN109343003B (en) Method for identifying sound source formed by fast iterative shrinking wave beams
Padois et al. Time domain localization technique with sparsity constraint for imaging acoustic sources
CN110827846A (en) Speech noise reduction method and device adopting weighted superposition synthesis beam
CN110736976B (en) Method for estimating performance of sonar beam former of any array
CN111812581A (en) Spherical array sound source direction of arrival estimation method based on atomic norm
Huang et al. One-dimensional MUSIC-type algorithm for spherical microphone arrays
US9578433B2 (en) Method for self-calibration of a set of sensors, in particular microphones, and corresponding system
Niwa et al. Diffused sensing for sharp directive beamforming
CN111830465B (en) Two-dimensional Newton orthogonal matching pursuit compressed beam forming method
Niwa et al. Optimal microphone array observation for clear recording of distant sound sources
KR20090128221A (en) Method for sound source localization and system thereof
Morgenstern et al. Joint spherical beam forming for directional analysis of reflections in rooms
CN113593596A (en) Robust self-adaptive beam forming directional pickup method based on subarray division
Astapov et al. Simplified acoustic localization by linear arrays for wireless sensor networks
CN116301195A (en) Function beam optimization method and device
Yang et al. An improved functional beamforming algorithm for far-field multi-sound source localization based on Hilbert curve
Bianchi et al. A plenacoustic approach to acoustic signal extraction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19923345

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19923345

Country of ref document: EP

Kind code of ref document: A1