CN112540346A

CN112540346A - Sound source positioning method based on signal-to-noise ratio weight optimization updating

Info

Publication number: CN112540346A
Application number: CN202011438854.XA
Authority: CN
Inventors: 赵国伟; 曹冰; 张政; 赵锐; 赵磊; 郝璐璐
Original assignee: Datong Power Supply Co of State Grid Shanxi Electric Power Co Ltd
Current assignee: Datong Power Supply Co of State Grid Shanxi Electric Power Co Ltd
Priority date: 2020-12-07
Filing date: 2020-12-07
Publication date: 2021-03-23

Abstract

The application discloses sound source positioning method based on signal-to-noise ratio weight optimization updating, which is used for carrying out sound source positioning on an element which makes sound in equipment to be detected, and comprises the following steps: step 1, amplifying a first sound signal which is collected and emitted by a sound source according to a preset proportion, and performing gain adjustment to generate a second sound signal; step 2, when the second sound signal is judged not to be an abnormal noise signal, sequentially calculating the current environment signal-to-noise ratio corresponding to the second sound signal and the signal-to-noise ratio related weight mean value of the second sound signal, and calculating the transmission delay of the second sound signal corresponding to the reference sound sensor according to the signal-to-noise ratio related weight mean value; and 3, calculating the positioning information of the sound source according to the transmission time delay and the position coordinates of the sound sensor, wherein the positioning information is used for positioning the sound source. Through the technical scheme in this application, can accurately capture the specific positional information at device sound production position fast, realize the intelligent real-time location of sound source.

Description

Sound source positioning method based on signal-to-noise ratio weight optimization updating

Technical Field

The application relates to the technical field of sound source positioning, in particular to a sound source positioning method based on signal-to-noise ratio weight optimization updating.

Background

Various sounds such as noise, abnormal sound and the like are common in daily life and industrial production, the original sound positioning function of human ears is only used for solving the basic problems of life and survival, and the positioning precision is very limited. Therefore, the application of the sound source positioning technology is very important, and the product can quickly position a noise source or an abnormal sound position, so that an engineer is helped to quickly solve the problem, and the working efficiency is greatly improved.

Nowadays, microphone arrays are widely applied to the field of sound signal processing such as speech enhancement and speech recognition, sound source localization is a technology for determining a spatial source position of sound, and is one of key technologies of array signal processing based on microphone arrays, and is a hot research topic of society.

In the prior art, a traditional time delay estimation algorithm is usually adopted for sound source positioning, since reverberation is severe in an indoor environment and noise is strong in an outdoor environment, positioning performance is greatly affected, and a traditional sound source positioning method is poor in positioning performance in an environment with high reverberation and strong noise and low signal-to-noise ratio (SNR), and cannot meet requirements of various actual scenes.

Disclosure of Invention

The purpose of this application lies in: the specific position information of the sounding part of the device is captured quickly and accurately, and the intelligent real-time positioning of the sound source is realized.

The technical scheme of the application is as follows: the utility model provides a sound source localization method based on that SNR weight optimizes the renewal, this method is used for treating the sound source localization of the component that makes sound in the equipment that will detect, and the method includes: step 1, amplifying a first sound signal which is collected and emitted by a sound source according to a preset proportion, and performing gain adjustment to generate a second sound signal, wherein the first sound signal is collected by a plurality of sound sensors; step 2, when the second sound signal is judged not to be an abnormal noise signal, sequentially calculating a current environment signal-to-noise ratio corresponding to the second sound signal and a signal-to-noise ratio related weight mean value of the second sound signal, and calculating transmission delay of the second sound signal corresponding to a reference sound sensor according to the signal-to-noise ratio related weight mean value, wherein the reference sound sensor comprises a first sound sensor and a second sound sensor; and 3, calculating the positioning information of the sound source according to the transmission time delay and the position coordinates of the sound sensor, wherein the positioning information is used for positioning the sound source.

In any one of the above technical solutions, further, in step 2, the method for calculating the current environment signal-to-noise ratio corresponding to the second sound signal specifically includes:

step 201, selecting any two sound sensors as reference sound sensors, and calculating two corresponding short-time stationary signals through filtering and windowing and framing processing according to a second sound signal corresponding to the reference sound sensors;

step 202, when the short-time energy and the short-time zero crossing rate of the two short-time stationary signals are judged to be larger than the preset threshold value, a second sound signal corresponding to the two short-time stationary signals is judged to be an abnormal noise signal, the abnormal noise signal is discarded, the step 201 is executed again, and otherwise, the step 203 is executed;

step 203, when the second sound signal corresponding to the two short-time stationary signals is judged not to be the abnormal noise signal, calculating the current environment signal-to-noise ratio, wherein the calculation formula of the current environment signal-to-noise ratio is as follows:

SNR(λ)＝aSNR(λ-1)+(1-a)SNR_0

wherein λ is the frame number of the second sound signal after framing, a is a smoothing factor, SNR (λ -1) is the current environment SNR of the second sound signal of the previous frame, and SNR _0 is the energy ratio of the second sound signal of the λ -th frame to the second sound signal of the λ -1-th frame.

In any one of the above technical solutions, further, in step 2, the snr dependent weight at least includes: the method for calculating the SNR (signal-to-noise ratio) of the current environment, the cross-power spectral density function of a stationary signal, the weighting function of the SNR, the cross-correlation function of a sound signal and the related weight of the SNR specifically comprises the following steps: step 211, performing fast fourier transform on two short-time stationary signals of the second sound signal corresponding to any two sound sensorsTransforming and calculating a cross-power spectral density function R₁₂(λ, w); step 212, determining an adjustment factor according to the signal-to-noise ratio of the current environment, and according to a cross-power spectral density function R₁₂(lambda, w), calculating a signal-to-noise ratio weighting function

Step 213, calculating a cross-power spectral density function R₁₂(lambda, w) and signal-to-noise ratio weighting function

And inverse fourier transforming the product to generate the acoustic signal cross-correlation function r₁₂(λ,w)。

In any one of the above technical solutions, further, calculating a transmission delay of the second sound signal corresponding to the reference sound sensor specifically includes: step 221, orderly grouping the sound sensors in pairs, and recording the sound sensors as reference sound sensors; step 222, for each group of reference sound sensors, the sound signal cross-correlation function r₁₂Carrying out peak value detection on the value sequence of (lambda, w) to obtain a sampling point corresponding to the maximum discrete value point; step 223, multiplying the sampling point by the sampling point interval time to generate the predicted time delay of the second sound signal corresponding to the reference sound sensor; and 224, repeatedly executing the steps 221 to 224 for each group of reference sound sensors by adopting a loop traversal algorithm, calculating the average value of the predicted time delay, and recording the average value as the transmission time delay.

In any of the above technical solutions, further, the positioning information of the sound source at least includes: the azimuth angle of the sound source is theta, the pitch angle of the sound source is mu, and the distance from the sound source to each sound sensor is.

The beneficial effect of this application is:

according to the technical scheme, the sound source positioning method and device are combined with corresponding hardware equipment and deployed in an actual application scene, specific position information of a sound production part of the device can be captured quickly and accurately, intelligent real-time positioning of the sound source is achieved, and the sound source positioning method and device have good performance and popularization and application prospects.

The five-element microphone array is arranged to collect sound, the sound source signal microprocessor is used for carrying out secondary amplification operation on sound signals, the signal-to-noise ratio weight function combining the sound signal power value provided by the application is reused, then reverberation and noise of the current environment can be self-adapted, and finally the time delay and position characteristics are combined to carry out specific calculation of the azimuth coordinate on a target sound source, so that sound source positioning is realized. The disclosed method can meet the practical requirements on positioning precision and reasoning speed, and realizes the application on the front-end artificial intelligent embedded equipment.

Compared with the traditional sound source positioning algorithm at present, the method combines hardware modularization equipment to process signals, combines machine learning, mode recognition and other related algorithms to perform related analysis and processing on collected sound signal streams, innovatively provides a signal-to-noise ratio weight function combined with a sound signal power value and a sound source positioning method combining the signal-to-noise ratio weight and a time delay characteristic, breaks through a conventional thought, has obvious performance characteristics of noise resistance and reverberation resistance, obviously improves experimental performance, and mainly embodies the following two aspects:

(1) the microphone has strong sound pickup capability. According to the method, the sound source signal microprocessor is used for amplifying the sound signal for multiple times, so that the output voltage of the microphone is multiplied, and the input sound signal stream can meet the requirement of post signal processing.

(2) The anti-noise and anti-reverberation effect is obvious. According to the method, the traditional time delay estimation algorithm is not used for sound source positioning, the method is correspondingly adjusted through the change of the current signal to noise ratio, the signal to noise ratio weight which is continuously optimized and updated at different moments is obtained, functions such as signal to noise ratio weighting are related to the power value of the sound signal, and the accuracy and stability of the functions and the weight value are improved.

Drawings

The advantages of the above and/or additional aspects of the present application will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:

FIG. 1 is a schematic flow diagram of a sound source localization method based on SNR weight optimization update according to an embodiment of the present application;

FIG. 2 is a schematic diagram of a five-element microphone array structure according to one embodiment of the present application;

FIG. 3 is a schematic diagram of a weight function calculation process according to one embodiment of the present application.

Detailed Description

In order that the above objects, features and advantages of the present application can be more clearly understood, the present application will be described in further detail with reference to the accompanying drawings and detailed description. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.

In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application, however, the present application may be practiced in other ways than those described herein, and therefore the scope of the present application is not limited by the specific embodiments disclosed below.

In the embodiment, a five-element microphone array is used for sound collection, and is firstly transmitted into a sound source signal microprocessor during collection, wherein the sound source signal microprocessor comprises an integrated operational amplifier, a gain adjustment potentiometer and other modules, the sound signals are subjected to secondary amplification through microprocessing, and then the five-channel sound signals are subjected to digital conversion through an AD collection module, so that subsequent input sound signals are obtained. Then, the embodiment provides a signal-to-noise ratio weighting function combined with a sound signal power value, which can be adjusted correspondingly with the change of the current signal-to-noise ratio, and performs final calculation of a sound source position coordinate by combining a signal-to-noise ratio weighting average value, a microphone array time delay and basic position information.

The experimental results show that: the method in the embodiment is combined with corresponding hardware equipment, is deployed in an actual application scene, can quickly and accurately capture specific position information of a sounding part of the device, realizes intelligent real-time positioning of a sound source, and has good performance and popularization and application prospects for positioning of the sound source in indoor and outdoor environments.

As shown in fig. 1, the present embodiment provides a sound source localization method based on snr weight optimization update, which is used for performing sound source localization on an acoustic element in a device to be detected, and includes:

step 1, amplifying a first sound signal which is collected and emitted by a sound source according to a preset proportion, performing gain adjustment, and generating a second sound signal, wherein the first sound signal is collected by a plurality of sound sensors, and an element of the sound is recorded as the sound source.

It should be noted that the output voltage range of the conventional omnidirectional type condenser microphone is about 0mV-60mV, and thus weak electrical signals cannot meet the work of signal processing in the later period at all, so that the experimental effect is not obvious, and the positioning accuracy is low. Therefore, it is necessary to perform a process of scaling up the signal collected by the microphone.

In this embodiment, microphones are used as sound sensors and form a five-element microphone array, as shown in fig. 2, a three-dimensional rectangular coordinate system is established with the geometric center of the device to be detected as an origin, the microphones in the five-element microphone array are respectively marked as numbers 1, 2,3,4, and 5, and the coordinates of the five microphones 1, 2,3,4, and 5 are S in sequence₁(0,0,l)、S₂(l,0,0)、S₃(0,l,0)、S₄(-l,0,0) and S₅(0, -l,0), wherein the value of the length l can be obtained by artificial measurement as the distance from the microphone to the geometric center of the device to be detected.

The equipment to be detected can make a normal sound in the operation process, but when some element or some elements in the equipment to be detected are abnormal, the equipment to be detected can make an abnormal sound. At this time, the element can be used as a sound source, and the anomaly detection of the equipment to be detected is realized by positioning the sound source.

Therefore, the five-element microphone array is used for collecting the sound (including normal sound and abnormal sound) of the equipment to be detected and then entering a sound source signal microprocessor, wherein the sound source signal microprocessor comprises modules such as an integrated operational signal amplifier, a gain adjustment potentiometer and the like, the omnidirectional type capacitive microphone and all signal processing components are known commercial devices, and the connection method between all the modules and the components can be mastered by those skilled in the art.

This embodiment adopts the comparatively stable integrated operational signal amplifier of performance, carries out the secondary amplification according to predetermineeing the proportion to the first sound signal of gathering, introduces gain adjustment potentiometre module simultaneously for when debugging in the later stage, reduce the precision error that produces because the hardware characteristic is different. The processor may scale up the sound signal and use it as the second sound signal for subsequent operations.

And 2, when the second sound signal is judged not to be an abnormal noise signal (namely the current second sound signal is a normal sound signal), calculating a current environment signal-to-noise ratio corresponding to the second sound signal, calculating a signal-to-noise ratio related weight mean value of the second sound signal according to the current environment signal-to-noise ratio, and calculating transmission delay of the second sound signal corresponding to a reference sound sensor according to the signal-to-noise ratio related weight mean value, wherein the reference sound sensor comprises a first sound sensor and a second sound sensor.

In this embodiment, the microphone 1 is set as the first sound sensor, and the microphone 5 is set as the second sound sensor.

Specifically, as shown in fig. 3, in this embodiment, on the basis of the conventional adaptive snr weight optimization method, the power characteristics of the sound signal are blended, so that the weight function deeply extracts the characteristic information of the sound signal. Obtaining an enhanced five-channel second sound signal through a five-element microphone array and a sound source signal microprocessor, preprocessing the sound signal and calculating a series of related formulas to determine the signal-to-noise ratio related weights of the current environment signal-to-noise ratio, the cross power spectral density function of steady signals, the signal-to-noise ratio weighting function, the sound signal cross-correlation function and the like. And finally calculating the time delay characteristics of the five microphones according to the obtained signal-to-noise ratio related weight, thereby calculating the subsequent sound source azimuth coordinate.

The embodiment shows a method for calculating a current environmental signal-to-noise ratio corresponding to a second sound signal when it is determined that the second sound signal is not an abnormal noise signal, specifically including:

step 201, selecting any two sound sensors as the reference sound sensors, and calculating two corresponding short-time stationary signals through filtering and windowing and framing processing according to a second sound signal corresponding to the reference sound sensors.

Any two microphone channels in the five-element microphone array are used as reference sound sensors, and band-pass filtering processing is carried out on second sound signals which are collected by the reference sound sensors and subjected to amplification and gain adjustment, so that two paths of sound signals subjected to band-pass filtering are obtained. And then performing windowing and framing processing on the two paths of band-pass filtered sound signals to obtain two paths of short-time stable signals.

The sound signals collected by the two microphones are assumed to be:

x₁(t)＝a₁s₁(t)+n₁(t)

x₂(t)＝a₂s₁(t+D)+n₂(t)

wherein a is₁、a₂Is a sound attenuation factor, since the sound source is a near-field signal, it can be considered that a₁、a₂The value 1, D is the time delay of the sound signal to the selected two microphones, n₁(t)、n₂(t) is the noise signal received by the selected two microphones at time t.

And then, the selected second sound signal is subjected to band-pass filtering processing, so that the noises of a low frequency band and a high frequency band are eliminated, and two paths of sound signals subjected to band-pass filtering are provided for subsequent processing.

After the two paths of band-pass filtered sound signals are obtained, a Hamming window function is used for framing the two paths of band-pass filtered sound signals to obtain two paths of short-time stable signals, and a frame-to-frame overlapping method is generally adopted for windowing and framing. The two short-time steady signals obtained are:

s₁(λ,n)＝x₁×(n+d×(λ-1)×N)×o(n)

s₂(λ,n)＝x₂×(n+d×(λ-1)×N)×o(n)

where o (N) is a Hamming window function, N is the length of the window function o (N), d is a shift parameter between adjacent frames, λ is the number of frames, and N is the sound sequence length.

Step 202, when the short-time energy and the short-time zero crossing rate of the two short-time stationary signals are judged to be larger than the preset threshold, the second sound signal corresponding to the two short-time stationary signals is judged to be an abnormal noise signal, the abnormal noise signal is discarded, the step 201 is executed again, and otherwise, the step 203 is executed.

Specifically, the first sound signal collected by the five-element microphone array includes a normal sound signal and a background noise signal, and if the sound source does not produce sound, the collected sound signal is only the background noise signal (i.e., the normal noise signal). Specifically, when both the short-time energy (energy of the sound signal for a short period of time) and the short-time zero-crossing rate (the number of times the signal waveform crosses the horizontal axis (zero level) per unit time) of the detected two-way short-time stationary signal are larger than a preset threshold, it is determined that the current sound signal is an abnormal noise signal.

Step 203, when it is determined that the second sound signal corresponding to the two short-time stationary signals is not an abnormal noise signal, calculating the current environment signal-to-noise ratio, where a calculation formula of the current environment signal-to-noise ratio is:

SNR(λ)＝aSNR(λ-1)+(1-a)SNR_0

Further, when the second sound signal corresponding to the two-path short-time stationary signal is judged to be an abnormal noise signal, the current environment signal-to-noise ratio of the current frame second sound signal is updated according to the current environment signal-to-noise ratio of the previous frame second sound signal.

In this embodiment, the snr dependent weights at least include: SNR (lambda) of current environment signal to noise ratio and cross-power spectral density function R of stationary signal₁₂(lambda, w), signal-to-noise ratio weighting function

Sound signal cross correlation function r₁₂(λ,w)。

The embodiment also shows a method for calculating the signal-to-noise ratio related weight, which specifically includes:

step 211, performing fast fourier transform on two paths of short-time stationary signals of the second sound signals corresponding to any two sound sensors, and calculating the cross-power spectral density function R₁₂(λ, w), the corresponding calculation formula is:

in the formula, S₁(lambda, w) is a finite-length sequence with the time window function length of N of the second sound signal of the lambda frame corresponding to the first path of sound sensor, and S₁(lambda, w) is a finite long sequence with the time window function length N of the second sound signal of the lambda frame corresponding to the second path of sound sensor,

is S₂And (lambda, w), wherein the first and second sound sensors are any two sound sensors in the five-element microphone array.

Step 212, determining an adjustment factor according to the signal-to-noise ratio of the current environment, and determining a cross-power spectral density function R according to the cross-power spectral density function₁₂(λ, w) calculating the signal-to-noise ratio weighting function

Specifically, in order to resist larger noise and reverberation, and in consideration of the fact that the energy of the second sound signal after being amplified in time is still small, in this embodiment, an adjustment factor ρ in a direct relationship with the current environmental signal-to-noise ratio SNR (λ) is introduced, a value of the adjustment factor ρ is obtained through multiple experimental tests in the sound source environment, the value depends on the current signal-to-noise ratio SNR (λ), different signal-to-noise ratios SNR (λ), the adjustment factor ρ takes different values, and the higher the SNR (λ), the larger the value of the adjustment factor ρ is, which is a specific value taking manner:

when SNR (lambda) is less than or equal to 10dB, the value range of the adjusting factor rho is more than or equal to 0.25 and less than or equal to 0.55;

when SNR (lambda) is less than 10dB and less than or equal to 30dB, the value range of the adjusting factor rho is more than 0.55 and less than or equal to 0.85;

when 30dB is less than SNR (lambda), the value range of the adjusting factor rho is 0.85< rho is less than or equal to 0.95.

The additive noise is an independent noise that interferes with a desired signal such as thermal noise, and the additive noise is added to the signal, and exists regardless of the presence or absence of the signal. Therefore, the SNR weighting function takes into account additive noise

The calculation formula of (2) is as follows:

wherein rho is an adjusting factor in direct proportion to the SNR (lambda) of the current environment signal-to-noise ratio₁(w) and phi₂And (w) is the autocorrelation function of the two paths of second sound signals.

SNR weighting function without considering additive noise

The calculation formula of (2) is as follows:

In this embodiment, in order to ensure the accuracy of sound source localization, additive noise in the actual environment is considered, and therefore, the snr weighting function

The calculation formula of (2) is as follows:

step 213, calculating the cross-power spectral density function R₁₂(λ, w) and said signal-to-noise ratio weighting function

And inverse fourier transforming said product to generate said acoustic signal cross-correlation function r₁₂(λ, w), the corresponding calculation formula is:

in the formula, α means the number of times of accumulation calculation for each frame length in the time domain of the selected window function.

In this embodiment, calculating the transmission delay of the second sound signal corresponding to the reference sound sensor specifically includes:

step 221, orderly grouping the sound sensors in pairs, and recording the sound sensors as the reference sound sensors;

step 222, for each group of reference sound sensors, the sound signal cross-correlation function r₁₂Carrying out peak value detection on the value sequence of (lambda, w) to obtain a sampling point corresponding to the maximum discrete value point;

step 223, multiplying the sampling point by the sampling point interval time to generate the predicted time delay of the second sound signal corresponding to the reference sound sensor;

and 224, repeatedly executing the steps 221 to 224 for each group of reference sound sensors by adopting a loop traversal algorithm, calculating the average value of the predicted time delay, and recording the average value as the transmission time delay.

Specifically, the microphones of the five-element microphone array are divided into numbers 1, 2,3,4 and 5, and the coordinates of the five microphones 1, 2,3,4 and 5 are taken as S₁(0,0,l)、S₂(l,0,0)、S₃(0,l,0)、S₄(-l,0,0) and S₅(0,-l,0)。

And (3) grouping each 2 of the five microphones into ten groups, and circularly traversing each group and finally obtaining the transmission delay through the steps 221 to 224.

Setting the microphone 1 corresponding to the first sound sensor and the microphone 5 corresponding to the second sound sensor, and calculating the transmission time delay tau of the sound source reaching the microphones 2,3,4 and 5 relative to the microphone 1 by adopting a traversing mode through the process_δ1(δ 2,3,4,5), corresponding to a difference in acoustic path length Δ r_δ1And satisfies the following conditions: Δ r_δ1＝c·τ_δ1(δ — 2,3,4,5), where c is the speed of sound, fixed constant, and c 340. Correspondingly, the time of sound source arriving at the microphones 1, 2,3,4 is relative to the transmission time delay tau of No. 5_δ′(δ′＝1,2,3,4)。

And 3, positioning the sound source according to the transmission time delay and the position coordinates of the sound sensor.

The azimuth angle of the sound source is set to theta, the pitch angle of the sound source is set to mu, and the distances from the sound source to each microphone (sound sensor) are respectively set to z₁、z₂、z₃、z₄And z₅. The process of locating the sound source is the above-mentioned location information (azimuth angle theta, pitch angle mu, and distance z)₁、z₂、z₃、z₄And z₅) And solving the calculation process, wherein the corresponding calculation formula is as follows:

in the formula, z_iIs representative of the distance a sound source (where the sound emitting element is located) reaches each microphone (sound sensor).

The technical solution of the present application is described in detail above with reference to the accompanying drawings, and the present application provides a sound source localization method based on snr weight optimization updating, which is used for performing sound source localization on an element emitting sound in a device to be detected, and includes: step 1, amplifying a first sound signal which is collected and emitted by a sound source according to a preset proportion, and performing gain adjustment to generate a second sound signal; step 2, when the second sound signal is judged not to be an abnormal noise signal, sequentially calculating the current environment signal-to-noise ratio corresponding to the second sound signal and the signal-to-noise ratio related weight mean value of the second sound signal, and calculating the transmission delay of the second sound signal corresponding to the reference sound sensor according to the signal-to-noise ratio related weight mean value; and 3, calculating the positioning information of the sound source according to the transmission time delay and the position coordinates of the sound sensor, wherein the positioning information is used for positioning the sound source. Through the technical scheme in this application, can accurately capture the specific positional information at device sound production position fast, realize the intelligent real-time location of sound source.

The steps in the present application may be sequentially adjusted, combined, and subtracted according to actual requirements.

The units in the device can be merged, divided and deleted according to actual requirements.

Although the present application has been disclosed in detail with reference to the accompanying drawings, it is to be understood that such description is merely illustrative and not restrictive of the application of the present application. The scope of the present application is defined by the appended claims and may include various modifications, adaptations, and equivalents of the invention without departing from the scope and spirit of the application.

Claims

1. A sound source localization method based on snr weight optimization update, wherein the method is used for sound source localization of an acoustic element in a device to be detected, the method comprises:

step 1, amplifying a first sound signal which is collected and emitted by a sound source according to a preset proportion, and performing gain adjustment to generate a second sound signal, wherein the first sound signal is collected by a plurality of sound sensors;

step 2, when the second sound signal is judged not to be an abnormal noise signal, sequentially calculating a current environment signal-to-noise ratio corresponding to the second sound signal and a signal-to-noise ratio related weight mean value of the second sound signal, and calculating transmission delay of the second sound signal corresponding to a reference sound sensor according to the signal-to-noise ratio related weight mean value, wherein the reference sound sensor comprises a first sound sensor and a second sound sensor;

and 3, calculating the positioning information of the sound source according to the transmission time delay and the position coordinates of the sound sensor, wherein the positioning information is used for positioning the sound source.

2. The sound source localization method based on snr weight optimization updating of claim 1, wherein in the step 2, the method for calculating the current environmental snr corresponding to the second sound signal specifically includes:

step 201, selecting any two sound sensors as the reference sound sensors, and calculating two corresponding short-time stationary signals through filtering and windowing and framing processing according to a second sound signal corresponding to the reference sound sensors;

step 202, when the short-time energy and the short-time zero crossing rate of the two short-time stationary signals are judged to be larger than a preset threshold value, a second sound signal corresponding to the two short-time stationary signals is judged to be an abnormal noise signal, the abnormal noise signal is discarded, the step 201 is executed again, and otherwise, the step 203 is executed;

SNR(λ)＝aSNR(λ-1)+(1-a)SNR_0

3. The sound source localization method based on snr weight optimization updating of claim 1, wherein in the step 2, the snr related weight at least comprises: the method for calculating the signal-to-noise ratio related weight comprises the following steps of SNR (signal-to-noise ratio) of the current environment, a cross-power spectral density function of a stationary signal, a signal-to-noise ratio weighting function and a sound signal cross-correlation function, and specifically comprises the following steps:

step 211, two short circuits of the second sound signals corresponding to any two sound sensorsPerforming fast Fourier transform on the time stationary signal, and calculating the cross-power spectral density function R₁₂(λ,w)；

And inverse fourier transforming said product to generate said acoustic signal cross-correlation function r₁₂(λ,w)。

4. The sound source localization method according to any one of claims 1 to 3, wherein the calculating of the transmission delay of the second sound signal corresponding to the reference sound sensor specifically includes:

5. The sound source localization method based on snr weight optimization updating of any of claims 1 to 3, wherein the localization information of the sound source at least comprises: the azimuth angle of the sound source is theta, the pitch angle of the sound source is mu, and the distance from the sound source to each sound sensor is obtained.