CN102854494A - Sound source locating method and device - Google Patents
Sound source locating method and device Download PDFInfo
- Publication number
- CN102854494A CN102854494A CN2012102810199A CN201210281019A CN102854494A CN 102854494 A CN102854494 A CN 102854494A CN 2012102810199 A CN2012102810199 A CN 2012102810199A CN 201210281019 A CN201210281019 A CN 201210281019A CN 102854494 A CN102854494 A CN 102854494A
- Authority
- CN
- China
- Prior art keywords
- sound
- signal
- source signal
- function
- ratio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Circuit For Audible Band Transducer (AREA)
Abstract
The invention is suitable for the technical field of sound processing and provides a sound source locating method and device. The method comprises the steps of: collecting sound source signals by utilizing a microphone array and preprocessing the sound source signals collected by any two microphones; confirming a cross-power spectral density function of the two sound source signals; confirming a weighting function adjusted along with the variation of the present signal to noise ratio; confirming a sequence of values of the cross-correlation function of the two sound source signals according to the cross-power spectral density function and the weighting function; confirming the time delay of the sound source singles to two microphones according to the maximum value of the cross-correlation function; and locating the sound source positions according to the permutation distribution of the microphone array and the time delay of the sound source signals to the any two microphones. According to the method and the device, the adopted weighting function can be correspondingly adjusted along with the variation of the present signal to noise ratio to ensure that under the environment that the signal to noise ratio of a sound source is changed, the time delay of the sound source can be accurately obtained through correspondingly adjusting the weighting function, and therefore, the sound source locating accuracy is improved.
Description
Technical field
The invention belongs to the acoustic processing technical field, relate in particular to a kind of sound localization method and device.
Background technology
In video conference, security protection or in some industrial application, usually need sound source is positioned, but in some scenarios, because the uncertainty of outside sound source environment, voice signal is subject to outside noise and disturbs, so that signal to noise ratio (S/N ratio) changes, in the existing auditory localization technology, obtain one group of voice data by microphone array, estimate through carrying out time delay with phase tranformation broad sense intercorrelation method (PHAT-GCC) after the pre-service again, according to the arranged distribution of microphone in time delay result and the microphone array, can determine the position of sound source by geometric model.Because in the existing PHAT-GCC method, because the signal to noise ratio (S/N ratio) of sound-source signal may change with environment, when signal energy is less, the denominator that carries out the weighting function of frequency domain weighting can go to zero, so that the value of weighting function becomes very large, the time delay resultant error of obtaining like this is also larger, and also can there be very large error in the sound source position of orienting at last.
Summary of the invention
In view of the above problems, the object of the present invention is to provide a kind of sound localization method, be intended to solve in the existing auditory localization technology because the signal to noise ratio (S/N ratio) of sound-source signal when changing, it is very large that the value of weighting function may become, so that the very large technical matters of auditory localization resultant error.
The present invention is achieved in that a kind of sound localization method, comprises the steps:
Microphone array gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
Determine the cross-spectral density function through described pretreated two-way sound-source signal;
Determine to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
Determine the value sequence of the cross correlation function of described two-way sound-source signal according to described cross-spectral density function and weighting function, and determine that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones;
According to the time delay that arranged distribution and the described sound-source signal of microphone array arrives wherein said two microphones, localization of sound source position.
A further object of the present invention is to provide a kind of sound source locating device, comprising:
Microphone array gathers pretreatment unit, is used for microphone array and gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
The cross-spectral density determining unit is used for definite cross-spectral density function through described pretreated two-way sound-source signal;
The weighting function determining unit is used for determining to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
The time delay determining unit is used for determining according to described cross-spectral density function and weighting function the value sequence of the cross correlation function of described two-way sound-source signal, and determines that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones;
The auditory localization unit is for the time delay that arranged distribution and described sound-source signal according to microphone array arrive wherein said two microphones, localization of sound source position.
The invention has the beneficial effects as follows: because sound localization method provided by the invention and the device the weighting function that adopts can make corresponding adjustment with the variation of current signal to noise ratio (S/N ratio), so that because the impact of the factors such as ground unrest, reverberation, under the environment that the sound source signal to noise ratio (S/N ratio) changes, by corresponding adjustment weighting function, also but the time delay of Obtaining Accurate voice signal has improved the auditory localization precision.
Description of drawings
Fig. 1 is the process flow diagram of the sound localization method that provides of first embodiment of the invention;
Fig. 2 is the process flow diagram of the sound localization method that provides of second embodiment of the invention;
Fig. 3 is the block diagram of the sound source locating device that provides of third embodiment of the invention;
Fig. 4 is the block diagram of the sound source locating device that provides of fourth embodiment of the invention.
Embodiment
In order to make purpose of the present invention, technical scheme and advantage clearer, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein only in order to explain the present invention, is not intended to limit the present invention.
For technical solutions according to the invention are described, describe below by specific embodiment.
Embodiment one:
Fig. 1 shows the flow process of the sound localization method that first embodiment of the invention provides, and only shows for convenience of explanation the part relevant with the embodiment of the invention.
The sound localization method that the embodiment of the invention provides comprises:
Step S101, microphone array gather sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service.
Microphone array is the microphone set that a plurality of microphones are arranged according to certain way, in the auditory localization technology, be usually used in the sound-source signal collection, can obtain one group of sound-source signal, in this step, appoint and to get the sound-source signal that two microphones wherein collect and carry out pre-service, comprise filtering and minute frame etc.
Step S102, definite cross-spectral density function through described pretreated two-way sound-source signal;
Step S103, definite weighting function of adjusting that changes with current signal to noise ratio (S/N ratio);
Step S104, determine the value sequence of the cross correlation function of described two-way sound-source signal according to described cross-spectral density function and weighting function, and determine that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones (mistiming).
Step S102-S104 provides definite sound-source signal to arrive the process of the time delay of two microphones, the degree of accuracy of determining time delay has determined the degree of accuracy of auditory localization, general time delay determines that method is: cross-spectral density function and the weighting function of at first determining the two-way sound-source signal, product according to described cross-spectral density function and weighting function carries out the value sequence that inverse fourier transform obtains the cross correlation function of two paths of signals again, determines described time delay according to the maximal value of described cross correlation function.But existing weighting function can't be followed the variation of current signal to noise ratio (S/N ratio) and be changed, this weighting function can't be resisted larger ground unrest and reverberation, and when the voice signal ability hour, the value of described weighting function is very large, follow-up postponing a meeting or conference when definite produces very large error.And in embodiments of the present invention, the determined weighting function of step S103 changes with current signal to noise ratio (S/N ratio) makes corresponding adjustment, so that the functional value of weighting function can hour not become very large because of the speech signal energy, and then guarantee the really degree of accuracy of fixed response time.
Step S105, the time delay that arrives wherein said two microphones according to arranged distribution and the described sound-source signal of microphone array, the localization of sound source position.
The principle of auditory localization technology is by determining that sound-source signal arrives the time delay of two microphones, and according to the particular location of described microphone, determine the sound source particular location by geometric model, present embodiment determined higher accuracy the time delay, can pass through the accurate localization of sound source of geometric analysis method position, concrete localization method is identical with existing auditory localization technology, repeats no more herein.
The key distinction of the embodiment of the invention and existing auditory localization technology is, the weighting function that present embodiment provides changes and corresponding adjustment with current signal to noise ratio (S/N ratio), so that violent the change can not occur because current signal to noise ratio (S/N ratio) changes in the functional value of weighting function, the degree of accuracy of last so definite time delay value is guaranteed, and then has improved the auditory localization degree of accuracy.
Embodiment two:
Fig. 2 shows the flow process of the sound localization method that second embodiment of the invention provides, and only shows for convenience of explanation the part relevant with the embodiment of the invention.
The sound localization method that the embodiment of the invention provides comprises:
Step S201, microphone array gather sound-source signal;
Step S202, the sound-source signal of any two microphone collections in the described microphone array is carried out bandpass filtering, obtain the sound-source signal behind the two-way bandpass filtering;
Step S203, the sound-source signal behind the described two-way bandpass filtering is carried out windowing divide frame to process, obtain in short-term stationary signal of two-way.
Above-mentioned steps S201-S03 as step S101 among the embodiment one a kind of specifically preferred embodiment.
In step S201, suppose that the sound-source signal that described two microphones collect is respectively:
x
1(t)=a
1s
1(t)+n
1(t) (1)
x
2(t)=a
2s
1(t+D)+n
2(t) (2)
Wherein, a
1, a
2Be the sound attenuating factor, owing to be that sound source is near-field signals, can think a
1, a
2Be that 1, D is the time delay that sound-source signal arrives described two microphones, n
1(t), n
2(t) be described two noise signals that microphone receives.
In step S202, the sound-source signal that microphone is collected carries out bandpass filtering, with the noise filtering of low-frequency range and high band, for subsequent treatment provides sound-source signal behind the two-way bandpass filtering.
In step S203, as a kind of implementation, the sound-source signal after using Hamming window function to described two-way bandpass filtering divides frame, obtains in short-term stationary signal of two-way, and windowing divides frame generally to adopt the overlapping method of frame and frame.Two-way in short-term stationary signal is:
s
1(λ,n)=x
1(n+d(λ-1)N)w(n) (3)
s
2(λ,n)=x
2(n+d(λ-1)N)w(n) (4)
Wherein w (n) is Hamming window function, and N is the length of window function w (n), and d is the shift parameters between the consecutive frame, and λ is frame number.
Step S204, by end-point detection judge described two-way in short-term stationary signal whether be voice signal; It is execution in step 205; No execution in step 207.
Step S205, determine current signal to noise ratio (S/N ratio), current signal to noise ratio (S/N ratio) is: SNR (λ)=aSNR (λ-1)+(1-a) SNR_0, wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio), the priori signal to noise ratio (S/N ratio) that SNR_0 tries to achieve for the energy ratio of using current speech signal frame and last non-speech audio frame, a is smoothing factor;
Step S206, to described two-way in short-term stationary signal carry out Fast Fourier Transform (FFT), determine again the in short-term cross-spectral density function of stationary signal of described two-way;
Step S207, give up in short-term stationary signal of described two-way, upgrade signal to noise ratio snr (λ)=SNR (λ-1), wherein SNR (λ-1) be previous frame sound-source signal signal to noise ratio (S/N ratio), and this flow process finishes, and enters the next frame processing;
Above-mentioned steps S204-S207 is that one kind of step S102 is specifically preferred embodiment among the embodiment one.Among the step S204 by end-point detection judge two-way in short-term stationary signal whether be voice signal, in the present embodiment, the sound-source signal that microphone collects comprises voice signal and the ambient noise signal of sound source, if described sound source is not during sounding, the sound-source signal that described microphone collects only is ambient noise signal, concrete, when detecting the in short-term short-time energy of stationary signal of described two-way (energy of a short time period of sound signal) and short-time zero-crossing rate (signal waveform is passed the number of times of transverse axis (zero level) in the unit interval) all greater than corresponding threshold value, can judge that current sound-source signal is voice signal.
When the voice signal λ frame after determining minute frame is non-speech audio, current signal to noise ratio (S/N ratio) then
SNR(λ)=SNR(λ-1) (8)
When the voice signal λ frame after determining minute frame is voice signal, current signal to noise ratio (S/N ratio) then
SNR(λ)=aSNR(λ-1)+(1-a)SNR_0 (9)
Wherein, SNR (λ-1) is the signal to noise ratio (S/N ratio) of previous frame, and SNR_0 is the energy ratio of current speech signal frame and last non-speech audio frame, and a is smoothing factor.
When definite current sound-source signal is voice signal, to described two-way in short-term stationary signal carry out Fast Fourier Transform (FFT), determine again the in short-term cross-spectral density function of stationary signal of described two-way.Concrete, the two-way voice signal in formula (3) and the formula (4) is carried out Fast Fourier Transform (FFT), have
Therefore, can be in the hope of the cross-spectral density function of described two-way voice signal:
Wherein, s
1(λ, n) and s
2(λ, n) is the finite length sequence of N for length, obtains S through after the Fourier transform
1(λ, k) and S
2(λ, k),
Be S
2The conjugate function of (λ, k).
When definite current two-way when stationary signal is non-speech audio in short-term, give up in short-term stationary signal of described two-way.When detecting described two-way in short-term steadily for non-speech audio, there is no need to carry out follow-up computing this moment again, so it is steady in short-term to give up described two-way among the step S207, has just reduced so to a certain extent calculated amount.
Step S208, determine weighting function according to described current signal to noise ratio (S/N ratio)
Perhaps
φ wherein
12(w) be the cross-spectral density function of sound-source signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (λ),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way voice signal.
Above-mentioned steps S208 is that one kind of step S103 specifically preferred embodiment at first needs to determine signal to noise ratio (S/N ratio) among the embodiment one, determines weighting function according to described signal to noise ratio (S/N ratio) again.
After determining current signal to noise ratio (S/N ratio), determine again corresponding with it weighting function.In step S208, if do not consider additive noise in the actual environment, weighting function described in the present embodiment is:
If the consideration additive noise, weighting function described in the present embodiment is:
Wherein, φ
12(w) be the cross-spectral density function of voice signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (k),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way voice signal.
In the prior art, traditional weighting function of frequency domain is
This weighting function can't be resisted larger noise and reverberation impact in the practical application, and when speech signal energy hour, the weighting function denominator approaches zero, thereby produces larger error.And in embodiments of the present invention, will be suc as formula the weighting function shown in (10) or the formula (11), be associated with current signal to noise ratio (S/N ratio), wherein ρ is the regulatory factor proportional with current signal to noise ratio snr (λ), the value of ρ is to draw by the many experiments test at the sound source environment, this value relies on current signal to noise ratio snr (λ), different SNR (λ), ρ gets different values, SNR (λ) is higher, and the value of ρ is just larger, as a kind of concrete value mode, when SNR (λ)≤10dB, the span of ρ is 0.3≤ρ≤0.55; When 10dB<SNR (λ)≤25dB, the span of ρ is 0.55<ρ≤0.75; When 25dB<SNR (λ), the span of ρ is 0.75<ρ≤0.85.
For formula (10), if current signal to noise ratio (S/N ratio) is smaller, namely the energy comparison of voice signal is little, at this moment φ
12(w) smaller, if ρ gets 0.5, so weighting function
Functional value compare with existing weighting function, weighted value is much smaller, can reduce to a certain extent error; For formula (11), further contemplate additive noise, also comprise coherence function in the denominator term of weighting function
Shown in the signal value size of the size of related function and voice signal irrelevant, the functional value that has further guaranteed weighting function can big ups and downs, have reduced error.
Step S209, the product of described cross-spectral density function and weighting function obtained the value sequence of the cross correlation function of described two-way sound-source signal through inverse Fourier transform;
Step S210, the value sequence of described cross correlation function is carried out peak value detect, obtain sample point corresponding to maximum of points, and determine that described sound-source signal arrives the time delay of described two microphones interval time according to sample point.
Above-mentioned steps S209-S210 is that one kind of step S104 is specifically preferred embodiment among the embodiment one.
In step S209, to the cross-spectral density function R in the formula (7)
12Weighting function in (λ, k) and formula (10) or the formula (11)
Product carry out inverse Fourier transform, obtain the cross correlation function of described two-way voice signal:
In step S210, to described cross correlation function r
12(λ, n) carries out peak value and detects, and gets the wherein corresponding sample point of maximum discrete value, and the described sample point that obtains and sample point are multiplied each other interval time, can obtain the time delay of described two-way sound-source signal.
Step S211, the time delay that arrives wherein said two microphones according to arranged distribution and the described sound-source signal of microphone array, the localization of sound source position, this flow process finishes, and enters next frame and processes.
After obtaining time delay value, can determine the sound source particular location according to the aggregation model of microphone position in the microphone array again.
One kind of step S105 specifically preferred embodiment among the step S211 embodiment one.
The embodiment of the invention has been listed concrete preferred implementation step to step wherein on the basis of embodiment one, can realize that sound source accurately locates.
Embodiment three:
Fig. 3 shows the structure of the sound source locating device that third embodiment of the invention provides, and only shows for convenience of explanation the part relevant with the embodiment of the invention.
The sound source locating device that the embodiment of the invention provides comprises:
Microphone array gathers pretreatment unit 301, is used for microphone array and gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
Cross-spectral density determining unit 302 is used for definite cross-spectral density function through described pretreated two-way sound-source signal;
Weighting function determining unit 303 is used for determining to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
Time delay determining unit 304, be used for determining according to described cross-spectral density function and weighting function the value sequence of the cross correlation function of described two-way sound-source signal, and determine that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones;
The functional unit 301-305 that present embodiment provides respectively correspondence has realized step S101-S105 among the embodiment one, wherein, microphone array gathers that pretreatment unit 301 gathers sound-source signals and to after the two-way sound-source signal pre-service wherein, cross-spectral density determining unit 302 and weighting function determining unit 303 are determined respectively cross-spectral density function and weighting function, described weighting function can change with current signal to noise ratio (S/N ratio) makes corresponding adjustment, so that the value of weighting function can acute variation, time delay determining unit 304 according to described cross-spectral density function and weighting function determine sound-source signal arrive described two microphones the time delay, auditory localization unit 305 again can the localization of sound source position according to arranged distribution and the described time delay of microphone array.Weighting function determining unit 303 determined weighting functions are followed the variation of current signal to noise ratio (S/N ratio) and are changed in the sound source locating device that example of the present invention provides, and this is so that the time delay result's who obtains degree of accuracy is higher, thereby can improve the auditory localization degree of accuracy.
Embodiment four:
Fig. 4 shows the structure of the sound source locating device that fourth embodiment of the invention provides, and only shows for convenience of explanation the part relevant with the embodiment of the invention.
The sound source locating device that the embodiment of the invention provides comprises:
Microphone array gathers pretreatment unit 401, is used for microphone array and gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
Cross-spectral density determining unit 402 is used for definite cross-spectral density function through described pretreated two-way sound-source signal;
Weighting function determining unit 403 is used for determining to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
Time delay determining unit 404, be used for determining according to described cross-spectral density function and weighting function the value sequence of the cross correlation function of described two-way sound-source signal, and determine that according to the maximal value of described cross correlation function source sound arrives the time delay of described two microphones;
Wherein, described microphone array collection pretreatment unit 401 comprises:
Microphone array acquisition module 4011 is used for microphone array and gathers sound-source signal;
Bandpass filtering modules block 4012 is used for the sound-source signal of any two the microphone collections of described microphone array is carried out bandpass filtering, the sound-source signal behind the two-way bandpass filtering;
Divide frame processing module 4013, carry out windowing for the sound-source signal to described two-way process bandpass filtering and divide frame to process, obtain in short-term stationary signal of two-way.
Wherein, described cross-spectral density determining unit 402 comprises:
Current signal to noise ratio (S/N ratio) determination module 4022, be used for when judgement is, determine current signal to noise ratio (S/N ratio), described current signal to noise ratio (S/N ratio) is: SNR (λ)=aSNR (λ-1)+(1-a) SNR_0, wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio), the priori signal to noise ratio (S/N ratio) that SNR_0 tries to achieve for the energy ratio of using current speech signal frame and last non-speech audio frame, a is smoothing factor;
Cross-spectral density determination module 4023, to described two-way in short-term stationary signal carry out Fast Fourier Transform (FFT), determine again the in short-term cross-spectral density function of stationary signal of described two-way;
Signal is given up module 4024, is used for giving up in short-term stationary signal of described two-way, and upgrading signal to noise ratio snr (λ)=SNR (λ-1) when judgement is no, and wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio).
Wherein, weighting function determining unit 403 comprises:
Weighting function determination module 4031 is used for determining that according to described current signal to noise ratio (S/N ratio) weighting function is
Perhaps
φ wherein
12(w) be the cross-spectral density function of sound-source signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (λ-1),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way sound-source signal.
Wherein, described time delay determining unit 404 comprises:
Cross correlation function acquisition module 4041 is for the value sequence that the product of described cross-spectral density function and weighting function is obtained the cross correlation function of described two-way sound-source signal through inverse Fourier transform;
Time delay determination module 4042 is used for the value sequence of described cross correlation function is carried out the peak value detection, obtains sample point corresponding to maximum of points, and determines that described sound-source signal arrives the time delay of described two microphones interval time according to sample point.
The embodiment of the invention is on the basis of example three, provided the wherein concrete preferred structure of functional unit, corresponding each step that realizes among the embodiment two, concrete, after microphone array acquisition module 4011 collects sound-source signal, again by 4013 pairs of bandpass filtering modules block 4012 and minute frame processing modules wherein arbitrarily the two-way sound-source signal carry out pre-service, when phonetic decision module 4021 detects current sound-source signal and is voice signal, current signal to noise ratio (S/N ratio) determination module 4022 is determined current signal to noise ratio (S/N ratio), and by 4023 pairs of described two-way of cross-spectral density determination module in short-term stationary signal carry out Fourier transform, determine again the in short-term cross-spectral density function of stationary signal of described two-way, otherwise give up module 4024 by signal and give up in short-term stationary signal of described two-way, and the renewal signal to noise ratio (S/N ratio), can save unnecessary calculation procedure like this.Current signal to noise ratio (S/N ratio) is determined weighting function by weighting function determination module 4031 according to described current signal to noise ratio (S/N ratio) after determining again, as a kind of implementation, and described weighting function
Perhaps
φ wherein
12(w) be the cross-spectral density function of sound-source signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (λ-1),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way sound-source signal, wherein the value of ρ is to draw by the many experiments test at the sound source environment, value can be with reference to embodiment two, and this value relies on current signal to noise ratio snr (λ), different SNR (λ), ρ gets different values, SNR (λ) is higher, and the value of ρ is just larger, as SNR (λ) when diminishing, the ρ value is followed and is diminished, so the functional value of the weighting function in the embodiment of the invention acute variation can not occur.Cross correlation function acquisition module 4041 obtains the product of described cross-spectral density function and weighting function the cross correlation function of described two-way sound-source signal through inverse Fourier transform, the value sequence of 4042 pairs of described cross correlation functions of time delay determination module carries out peak value and detects, obtain sample point corresponding to maximum of points, and described sample point be multiply by the time in sampling interval, can obtain the time delay that described sound-source signal arrives described two microphones, after time delay was determined, auditory localization unit 405 can accurately navigate to the position of sound source accordingly with the arranged distribution of microphone array.
The present embodiment correspondence has realized each step among the embodiment two, provides concrete cross-spectral density function and weighting function to determine mode, can realize that sound source accurately locates.
To sum up, the sound localization method that the embodiment of the invention provides and device are compared with existing auditory localization technology, can improve the auditory localization precision.
One of ordinary skill in the art will appreciate that, realize that all or part of step in above-described embodiment method is to come the relevant hardware of instruction to finish by program, described program can be in being stored in a computer read/write memory medium, described storage medium is such as ROM/RAM, disk, CD etc.
The above only is preferred embodiment of the present invention, not in order to limiting the present invention, all any modifications of doing within the spirit and principles in the present invention, is equal to and replaces and improvement etc., all should be included within protection scope of the present invention.
Claims (11)
1. a sound localization method is characterized in that, described method comprises:
Microphone array gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
Determine the cross-spectral density function through described pretreated two-way sound-source signal;
Determine to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
Determine the value sequence of the cross correlation function of described two-way sound-source signal according to described cross-spectral density function and weighting function, and determine that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones;
According to the time delay that arranged distribution and the described sound-source signal of microphone array arrives wherein said two microphones, localization of sound source position.
2. method as claimed in claim 1 is characterized in that, described microphone array gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-treatment step, specifically comprises:
Microphone array gathers sound-source signal;
Sound-source signal to any two microphone collections in the described microphone array carries out bandpass filtering, obtains the sound-source signal behind the two-way bandpass filtering;
Described two-way is carried out windowing through the sound-source signal of bandpass filtering divide frame to process, obtain in short-term stationary signal of two-way.
3. method as claimed in claim 2 is characterized in that the cross-spectral density function step of the described pretreated two-way sound-source signal of described definite process specifically comprises:
By end-point detection judge described two-way in short-term stationary signal whether be voice signal;
When judgement is, determine current signal to noise ratio (S/N ratio), described current signal to noise ratio (S/N ratio) is: SNR (λ)=aSNR (λ-1)+(1-a) SNR_0, wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio), the priori signal to noise ratio (S/N ratio) that SNR_0 tries to achieve for the energy ratio of using current speech signal frame and last non-speech audio frame, a is smoothing factor;
To described two-way in short-term stationary signal carry out Fast Fourier Transform (FFT), determine again the in short-term cross-spectral density function of stationary signal of described two-way;
When judgement is no, give up in short-term stationary signal of described two-way, and upgrade signal to noise ratio snr (λ)=SNR (λ-1), wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio).
4. method as claimed in claim 3 is characterized in that, when the short-time energy of described sound-source signal and short-time zero-crossing rate during all greater than corresponding threshold value, can judge that current sound-source signal is voice signal.
5. method as claimed in claim 4 is characterized in that, describedly determines to change the weighting function step of adjusting with current signal to noise ratio (S/N ratio), specifically comprises:
Determine weighting function according to described current signal to noise ratio (S/N ratio)
Perhaps
φ wherein
12(w) be the cross-spectral density function of sound-source signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (λ),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way sound-source signal.
6. method as claimed in claim 5, it is characterized in that, the described value sequence of determining the cross correlation function of described two-way sound-source signal according to described cross-spectral density function and weighting function, and determine that according to the maximal value of described cross correlation function sound-source signal arrives the time delay step of described two microphones, specifically comprises:
The product of described cross-spectral density function and weighting function is obtained the value sequence of the cross correlation function of described two-way sound-source signal through inverse Fourier transform;
Value sequence to described cross correlation function carries out the peak value detection, obtains sample point corresponding to maximum of points, and determines that described sound-source signal arrives the time delay of described two microphones interval time according to sample point.
7. a sound source locating device is characterized in that, described device comprises:
Microphone array gathers pretreatment unit, is used for microphone array and gathers sound-source signal, and the sound-source signal of wherein any two microphone collections is carried out pre-service;
The cross-spectral density determining unit is used for definite cross-spectral density function through described pretreated two-way sound-source signal;
The weighting function determining unit is used for determining to change the weighting function of adjusting with current signal to noise ratio (S/N ratio);
The time delay determining unit is used for determining according to described cross-spectral density function and weighting function the value sequence of the cross correlation function of described two-way sound-source signal, and determines that according to the maximal value of described cross correlation function sound-source signal arrives the time delay of described two microphones;
The auditory localization unit is for the time delay that arranged distribution and described sound-source signal according to microphone array arrive wherein said two microphones, localization of sound source position.
8. install as claimed in claim 7, it is characterized in that, described microphone array gathers pretreatment unit and comprises:
The microphone array acquisition module is used for microphone array and gathers sound-source signal;
Bandpass filtering modules block is used for the sound-source signal of any two the microphone collections of described microphone array is carried out bandpass filtering, the sound-source signal behind the two-way bandpass filtering;
Divide the frame processing module, carry out windowing for the sound-source signal to described two-way process bandpass filtering and divide frame to process, obtain in short-term stationary signal of two-way.
9. install as claimed in claim 8, it is characterized in that described cross-spectral density determining unit comprises:
The phonetic decision module is used for judging by end-point detection whether described pretreated present frame sound-source signal is voice signal;
Current signal to noise ratio (S/N ratio) determination module, be used for when judgement is, determine current signal to noise ratio (S/N ratio), described current signal to noise ratio (S/N ratio) is: SNR (λ)=aSNR (λ-1)+(1-a) SNR_0, wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio), the priori signal to noise ratio (S/N ratio) that SNR_0 tries to achieve for the energy ratio of using current speech signal frame and last non-speech audio frame, a is smoothing factor;
The cross-spectral density determination module, to described two-way in short-term stationary signal carry out Fast Fourier Transform (FFT), determine again the in short-term cross-spectral density function of stationary signal of described two-way;
Signal is given up module, is used for giving up in short-term stationary signal of described two-way, and upgrading signal to noise ratio snr (λ)=SNR (λ-1) when judgement is no, and wherein SNR (λ-1) is previous frame sound-source signal signal to noise ratio (S/N ratio).
10. install as claimed in claim 9, it is characterized in that described weighting function determining unit comprises:
The weighting function determination module is used for determining that according to described current signal to noise ratio (S/N ratio) weighting function is
Perhaps
φ wherein
12(w) be the cross-spectral density function of sound-source signal, ρ is the regulatory factor proportional with current signal to noise ratio snr (λ-1),
Be coherence function, wherein φ
1(w) and φ
2(w) be the autocorrelation function of described two-way voice signal.
11. install as claimed in claim 10, it is characterized in that described time delay determining unit comprises:
The cross correlation function acquisition module is for the value sequence that the product of described cross-spectral density function and weighting function is obtained the cross correlation function of described two-way sound-source signal through inverse Fourier transform;
The time delay determination module is used for that described cross correlation function is carried out peak value and detects, and obtains sample point corresponding to maximum of points, and determines that described sound-source signal arrives the time delay of described two microphones interval time according to sample point.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210281019.9A CN102854494B (en) | 2012-08-08 | 2012-08-08 | A kind of sound localization method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210281019.9A CN102854494B (en) | 2012-08-08 | 2012-08-08 | A kind of sound localization method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102854494A true CN102854494A (en) | 2013-01-02 |
CN102854494B CN102854494B (en) | 2015-09-09 |
Family
ID=47401242
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210281019.9A Expired - Fee Related CN102854494B (en) | 2012-08-08 | 2012-08-08 | A kind of sound localization method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102854494B (en) |
Cited By (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103364761A (en) * | 2013-07-12 | 2013-10-23 | 哈尔滨工业大学 | Positioning system of indoor sound source and method using positioning system to position indoor sound source |
CN103630148A (en) * | 2013-11-01 | 2014-03-12 | 中国科学院物理研究所 | Signal sampling averaging device and signal sampling averaging method |
CN104422922A (en) * | 2013-08-19 | 2015-03-18 | 中兴通讯股份有限公司 | Method and device for realizing sound source localization by utilizing mobile terminal |
CN104535965A (en) * | 2014-12-29 | 2015-04-22 | 江苏科技大学 | Parallelized sound source positioning system based on embedded GPU system and method |
CN104700842A (en) * | 2015-02-13 | 2015-06-10 | 广州市百果园网络科技有限公司 | Sound signal time delay estimation method and device |
CN105467364A (en) * | 2015-11-20 | 2016-04-06 | 百度在线网络技术(北京)有限公司 | Method and apparatus for localizing target sound source |
CN105575387A (en) * | 2015-12-25 | 2016-05-11 | 重庆邮电大学 | Sound source localization method based on acoustic bionic cochlea basal membrane |
CN106162431A (en) * | 2015-04-02 | 2016-11-23 | 钰太芯微电子科技(上海)有限公司 | The beam positioning system of giant-screen mobile terminal |
CN106296854A (en) * | 2016-08-12 | 2017-01-04 | 上海电机学院 | A kind of classroom based on microphone array roll calling system |
CN106375902A (en) * | 2015-07-22 | 2017-02-01 | 哈曼国际工业有限公司 | Audio enhancement via opportunistic use of microphones |
CN106488358A (en) * | 2015-09-09 | 2017-03-08 | 上海其高电子科技有限公司 | Optimize sound field imaging localization method and system |
CN106970356A (en) * | 2016-01-14 | 2017-07-21 | 芋头科技(杭州)有限公司 | Auditory localization tracking under a kind of complex environment |
CN107144820A (en) * | 2017-06-21 | 2017-09-08 | 歌尔股份有限公司 | Sound localization method and device |
CN107159435A (en) * | 2017-05-25 | 2017-09-15 | 洛阳语音云创新研究院 | Method and device for adjusting working state of mill |
CN107199572A (en) * | 2017-06-16 | 2017-09-26 | 山东大学 | A kind of robot system and method based on intelligent auditory localization and Voice command |
CN107202976A (en) * | 2017-05-15 | 2017-09-26 | 大连理工大学 | The distributed microphone array sound source localization system of low complex degree |
CN107202385A (en) * | 2017-06-22 | 2017-09-26 | 广东美的制冷设备有限公司 | Sound wave mosquito repelling function method, device and air conditioner |
CN107271963A (en) * | 2017-06-22 | 2017-10-20 | 广东美的制冷设备有限公司 | The method and apparatus and air conditioner of auditory localization |
CN107329114A (en) * | 2017-06-21 | 2017-11-07 | 歌尔股份有限公司 | Sound localization method and device |
CN107894595A (en) * | 2017-11-06 | 2018-04-10 | 上海航天测控通信研究所 | A kind of delay time estimation method under non-gaussian SaS impulsive noise environments |
CN108152788A (en) * | 2017-12-22 | 2018-06-12 | 西安Tcl软件开发有限公司 | Sound-source follow-up method, sound-source follow-up equipment and computer readable storage medium |
CN108198568A (en) * | 2017-12-26 | 2018-06-22 | 太原理工大学 | A kind of method and system of more auditory localizations |
CN108269581A (en) * | 2017-01-04 | 2018-07-10 | 中国科学院声学研究所 | A kind of dual microphone time delay estimation method based on coherence in frequency domain function |
CN108332063A (en) * | 2018-01-29 | 2018-07-27 | 中国科学院声学研究所 | A kind of pipeline leakage positioning method based on cross-correlation |
CN108549052A (en) * | 2018-03-20 | 2018-09-18 | 南京航空航天大学 | A kind of humorous domain puppet sound intensity sound localization method of circle of time-frequency-spatial domain joint weighting |
CN108549113A (en) * | 2018-04-12 | 2018-09-18 | 俞度立 | A kind of method for testing performance and device of wave detector |
WO2018218747A1 (en) * | 2017-06-01 | 2018-12-06 | 深圳大学 | Indoor positioning method and system |
CN108957392A (en) * | 2018-04-16 | 2018-12-07 | 深圳市沃特沃德股份有限公司 | Sounnd source direction estimation method and device |
CN109490833A (en) * | 2018-10-30 | 2019-03-19 | 重庆大学 | A kind of quick identification of sound source method of the GCC inversion model of modified propogator matrix |
CN109611703A (en) * | 2018-10-19 | 2019-04-12 | 宁波市鄞州利帆灯饰有限公司 | A kind of LED light being easily installed |
CN109618273A (en) * | 2018-12-29 | 2019-04-12 | 北京声智科技有限公司 | The device and method of microphone quality inspection |
CN109778485A (en) * | 2017-11-10 | 2019-05-21 | 青岛海尔滚筒洗衣机有限公司 | A kind of device for clothing processing control method and system |
CN110133596A (en) * | 2019-05-13 | 2019-08-16 | 南京林业大学 | A kind of array sound source localization method based on frequency point signal-to-noise ratio and biasing soft-decision |
CN110136732A (en) * | 2019-05-17 | 2019-08-16 | 湖南琅音信息科技有限公司 | Two-channel intelligent acoustic signal processing method, system and audio frequency apparatus |
CN110221250A (en) * | 2019-06-27 | 2019-09-10 | 中国科学院西安光学精密机械研究所 | A kind of abnormal sound localization method and positioning device |
CN110221246A (en) * | 2019-05-20 | 2019-09-10 | 北京航空航天大学 | A kind of unmanned plane localization method based on the fusion of multi-source direction finding message |
CN110310651A (en) * | 2018-03-25 | 2019-10-08 | 深圳市麦吉通科技有限公司 | Adaptive voice processing method, mobile terminal and the storage medium of Wave beam forming |
CN110488223A (en) * | 2019-07-05 | 2019-11-22 | 东北电力大学 | A kind of sound localization method |
CN110600039A (en) * | 2019-09-27 | 2019-12-20 | 百度在线网络技术(北京)有限公司 | Speaker attribute determination method and device, electronic equipment and readable storage medium |
CN110726972A (en) * | 2019-10-21 | 2020-01-24 | 南京南大电子智慧型服务机器人研究院有限公司 | Voice sound source positioning method using microphone array under interference and high reverberation environment |
CN111120223A (en) * | 2019-12-16 | 2020-05-08 | 大连赛听科技有限公司 | Blade fault monitoring method and device based on double arrays |
CN112394324A (en) * | 2020-10-21 | 2021-02-23 | 西安合谱声学科技有限公司 | Microphone array-based remote sound source positioning method and system |
CN112540346A (en) * | 2020-12-07 | 2021-03-23 | 国网山西省电力公司大同供电公司 | Sound source positioning method based on signal-to-noise ratio weight optimization updating |
CN113466793A (en) * | 2021-06-11 | 2021-10-01 | 五邑大学 | Sound source positioning method and device based on microphone array and storage medium |
CN113820662A (en) * | 2021-08-02 | 2021-12-21 | 华南师范大学 | Sound source direction positioning detection method |
CN114325214A (en) * | 2021-11-18 | 2022-04-12 | 国网辽宁省电力有限公司电力科学研究院 | Electric power online monitoring method based on microphone array sound source positioning technology |
WO2022135131A1 (en) * | 2020-12-23 | 2022-06-30 | 北京有竹居网络技术有限公司 | Sound source positioning method and apparatus, and electronic device |
CN114720942A (en) * | 2021-01-06 | 2022-07-08 | 漳州立达信光电子科技有限公司 | Sound source positioning method, device and equipment based on microphone array |
CN116312447A (en) * | 2023-02-09 | 2023-06-23 | 杭州兆华电子股份有限公司 | Directional noise elimination method and system |
CN116609726A (en) * | 2023-05-11 | 2023-08-18 | 钉钉(中国)信息技术有限公司 | Sound source positioning method and device |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101762806A (en) * | 2010-01-27 | 2010-06-30 | 华为终端有限公司 | Sound source locating method and apparatus thereof |
CN102411138A (en) * | 2011-07-13 | 2012-04-11 | 北京大学 | Method for positioning sound source by robot |
CN102438189A (en) * | 2011-08-30 | 2012-05-02 | 东南大学 | Dual-channel acoustic signal-based sound source localization method |
-
2012
- 2012-08-08 CN CN201210281019.9A patent/CN102854494B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101762806A (en) * | 2010-01-27 | 2010-06-30 | 华为终端有限公司 | Sound source locating method and apparatus thereof |
CN102411138A (en) * | 2011-07-13 | 2012-04-11 | 北京大学 | Method for positioning sound source by robot |
CN102438189A (en) * | 2011-08-30 | 2012-05-02 | 东南大学 | Dual-channel acoustic signal-based sound source localization method |
Non-Patent Citations (2)
Title |
---|
杜要峰 等: "一种修正的近场声源定位时延估计方法", 《电声基础》 * |
杨祥清 等: "基于麦克风阵列的三维声源定位算法及其实现", 《声学技术》 * |
Cited By (65)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103364761A (en) * | 2013-07-12 | 2013-10-23 | 哈尔滨工业大学 | Positioning system of indoor sound source and method using positioning system to position indoor sound source |
CN104422922A (en) * | 2013-08-19 | 2015-03-18 | 中兴通讯股份有限公司 | Method and device for realizing sound source localization by utilizing mobile terminal |
CN103630148A (en) * | 2013-11-01 | 2014-03-12 | 中国科学院物理研究所 | Signal sampling averaging device and signal sampling averaging method |
CN103630148B (en) * | 2013-11-01 | 2016-03-02 | 中国科学院物理研究所 | Sample of signal averaging device and sample of signal averaging method |
CN104535965A (en) * | 2014-12-29 | 2015-04-22 | 江苏科技大学 | Parallelized sound source positioning system based on embedded GPU system and method |
CN104700842B (en) * | 2015-02-13 | 2018-05-08 | 广州市百果园信息技术有限公司 | The delay time estimation method and device of voice signal |
CN104700842A (en) * | 2015-02-13 | 2015-06-10 | 广州市百果园网络科技有限公司 | Sound signal time delay estimation method and device |
CN106162431A (en) * | 2015-04-02 | 2016-11-23 | 钰太芯微电子科技(上海)有限公司 | The beam positioning system of giant-screen mobile terminal |
CN106375902A (en) * | 2015-07-22 | 2017-02-01 | 哈曼国际工业有限公司 | Audio enhancement via opportunistic use of microphones |
CN106375902B (en) * | 2015-07-22 | 2020-07-21 | 哈曼国际工业有限公司 | Audio enhancement through opportunistic use of microphones |
CN106488358A (en) * | 2015-09-09 | 2017-03-08 | 上海其高电子科技有限公司 | Optimize sound field imaging localization method and system |
CN106488358B (en) * | 2015-09-09 | 2019-07-19 | 上海其高电子科技有限公司 | Optimize sound field imaging localization method and system |
CN105467364B (en) * | 2015-11-20 | 2019-03-29 | 百度在线网络技术(北京)有限公司 | A kind of method and apparatus positioning target sound source |
CN105467364A (en) * | 2015-11-20 | 2016-04-06 | 百度在线网络技术(北京)有限公司 | Method and apparatus for localizing target sound source |
CN105575387A (en) * | 2015-12-25 | 2016-05-11 | 重庆邮电大学 | Sound source localization method based on acoustic bionic cochlea basal membrane |
CN106970356A (en) * | 2016-01-14 | 2017-07-21 | 芋头科技(杭州)有限公司 | Auditory localization tracking under a kind of complex environment |
CN106296854A (en) * | 2016-08-12 | 2017-01-04 | 上海电机学院 | A kind of classroom based on microphone array roll calling system |
CN108269581A (en) * | 2017-01-04 | 2018-07-10 | 中国科学院声学研究所 | A kind of dual microphone time delay estimation method based on coherence in frequency domain function |
CN108269581B (en) * | 2017-01-04 | 2021-06-08 | 中国科学院声学研究所 | Double-microphone time delay difference estimation method based on frequency domain coherent function |
CN107202976A (en) * | 2017-05-15 | 2017-09-26 | 大连理工大学 | The distributed microphone array sound source localization system of low complex degree |
CN107159435A (en) * | 2017-05-25 | 2017-09-15 | 洛阳语音云创新研究院 | Method and device for adjusting working state of mill |
WO2018218747A1 (en) * | 2017-06-01 | 2018-12-06 | 深圳大学 | Indoor positioning method and system |
CN107199572B (en) * | 2017-06-16 | 2020-02-14 | 山东大学 | Robot system and method based on intelligent sound source positioning and voice control |
CN107199572A (en) * | 2017-06-16 | 2017-09-26 | 山东大学 | A kind of robot system and method based on intelligent auditory localization and Voice command |
CN107329114A (en) * | 2017-06-21 | 2017-11-07 | 歌尔股份有限公司 | Sound localization method and device |
CN107144820A (en) * | 2017-06-21 | 2017-09-08 | 歌尔股份有限公司 | Sound localization method and device |
CN107202385B (en) * | 2017-06-22 | 2020-08-25 | 广东美的制冷设备有限公司 | Sound wave mosquito repelling method and device and air conditioner |
CN107271963A (en) * | 2017-06-22 | 2017-10-20 | 广东美的制冷设备有限公司 | The method and apparatus and air conditioner of auditory localization |
CN107202385A (en) * | 2017-06-22 | 2017-09-26 | 广东美的制冷设备有限公司 | Sound wave mosquito repelling function method, device and air conditioner |
CN107894595A (en) * | 2017-11-06 | 2018-04-10 | 上海航天测控通信研究所 | A kind of delay time estimation method under non-gaussian SaS impulsive noise environments |
CN109778485A (en) * | 2017-11-10 | 2019-05-21 | 青岛海尔滚筒洗衣机有限公司 | A kind of device for clothing processing control method and system |
CN109778485B (en) * | 2017-11-10 | 2022-08-05 | 青岛海尔洗涤电器有限公司 | Control method and system for clothes processing device |
CN108152788A (en) * | 2017-12-22 | 2018-06-12 | 西安Tcl软件开发有限公司 | Sound-source follow-up method, sound-source follow-up equipment and computer readable storage medium |
CN108198568B (en) * | 2017-12-26 | 2020-10-16 | 太原理工大学 | Method and system for positioning multiple sound sources |
CN108198568A (en) * | 2017-12-26 | 2018-06-22 | 太原理工大学 | A kind of method and system of more auditory localizations |
CN108332063A (en) * | 2018-01-29 | 2018-07-27 | 中国科学院声学研究所 | A kind of pipeline leakage positioning method based on cross-correlation |
CN108549052A (en) * | 2018-03-20 | 2018-09-18 | 南京航空航天大学 | A kind of humorous domain puppet sound intensity sound localization method of circle of time-frequency-spatial domain joint weighting |
CN108549052B (en) * | 2018-03-20 | 2021-04-13 | 南京航空航天大学 | Time-frequency-space domain combined weighted circular harmonic domain pseudo-sound strong sound source positioning method |
CN110310651B (en) * | 2018-03-25 | 2021-11-19 | 深圳市麦吉通科技有限公司 | Adaptive voice processing method for beam forming, mobile terminal and storage medium |
CN110310651A (en) * | 2018-03-25 | 2019-10-08 | 深圳市麦吉通科技有限公司 | Adaptive voice processing method, mobile terminal and the storage medium of Wave beam forming |
CN108549113A (en) * | 2018-04-12 | 2018-09-18 | 俞度立 | A kind of method for testing performance and device of wave detector |
CN108957392A (en) * | 2018-04-16 | 2018-12-07 | 深圳市沃特沃德股份有限公司 | Sounnd source direction estimation method and device |
CN109611703A (en) * | 2018-10-19 | 2019-04-12 | 宁波市鄞州利帆灯饰有限公司 | A kind of LED light being easily installed |
CN109490833A (en) * | 2018-10-30 | 2019-03-19 | 重庆大学 | A kind of quick identification of sound source method of the GCC inversion model of modified propogator matrix |
CN109618273A (en) * | 2018-12-29 | 2019-04-12 | 北京声智科技有限公司 | The device and method of microphone quality inspection |
CN110133596A (en) * | 2019-05-13 | 2019-08-16 | 南京林业大学 | A kind of array sound source localization method based on frequency point signal-to-noise ratio and biasing soft-decision |
CN110136732A (en) * | 2019-05-17 | 2019-08-16 | 湖南琅音信息科技有限公司 | Two-channel intelligent acoustic signal processing method, system and audio frequency apparatus |
CN110221246A (en) * | 2019-05-20 | 2019-09-10 | 北京航空航天大学 | A kind of unmanned plane localization method based on the fusion of multi-source direction finding message |
CN110221250A (en) * | 2019-06-27 | 2019-09-10 | 中国科学院西安光学精密机械研究所 | A kind of abnormal sound localization method and positioning device |
CN110488223A (en) * | 2019-07-05 | 2019-11-22 | 东北电力大学 | A kind of sound localization method |
CN110600039A (en) * | 2019-09-27 | 2019-12-20 | 百度在线网络技术(北京)有限公司 | Speaker attribute determination method and device, electronic equipment and readable storage medium |
CN110600039B (en) * | 2019-09-27 | 2022-05-20 | 百度在线网络技术(北京)有限公司 | Method and device for determining speaker attribute, electronic equipment and readable storage medium |
CN110726972A (en) * | 2019-10-21 | 2020-01-24 | 南京南大电子智慧型服务机器人研究院有限公司 | Voice sound source positioning method using microphone array under interference and high reverberation environment |
CN111120223A (en) * | 2019-12-16 | 2020-05-08 | 大连赛听科技有限公司 | Blade fault monitoring method and device based on double arrays |
CN112394324A (en) * | 2020-10-21 | 2021-02-23 | 西安合谱声学科技有限公司 | Microphone array-based remote sound source positioning method and system |
CN112540346A (en) * | 2020-12-07 | 2021-03-23 | 国网山西省电力公司大同供电公司 | Sound source positioning method based on signal-to-noise ratio weight optimization updating |
WO2022135131A1 (en) * | 2020-12-23 | 2022-06-30 | 北京有竹居网络技术有限公司 | Sound source positioning method and apparatus, and electronic device |
CN114720942A (en) * | 2021-01-06 | 2022-07-08 | 漳州立达信光电子科技有限公司 | Sound source positioning method, device and equipment based on microphone array |
CN113466793A (en) * | 2021-06-11 | 2021-10-01 | 五邑大学 | Sound source positioning method and device based on microphone array and storage medium |
CN113466793B (en) * | 2021-06-11 | 2023-10-17 | 五邑大学 | Sound source positioning method and device based on microphone array and storage medium |
CN113820662A (en) * | 2021-08-02 | 2021-12-21 | 华南师范大学 | Sound source direction positioning detection method |
CN114325214A (en) * | 2021-11-18 | 2022-04-12 | 国网辽宁省电力有限公司电力科学研究院 | Electric power online monitoring method based on microphone array sound source positioning technology |
CN116312447A (en) * | 2023-02-09 | 2023-06-23 | 杭州兆华电子股份有限公司 | Directional noise elimination method and system |
CN116312447B (en) * | 2023-02-09 | 2023-11-10 | 杭州兆华电子股份有限公司 | Directional noise elimination method and system |
CN116609726A (en) * | 2023-05-11 | 2023-08-18 | 钉钉(中国)信息技术有限公司 | Sound source positioning method and device |
Also Published As
Publication number | Publication date |
---|---|
CN102854494B (en) | 2015-09-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102854494B (en) | A kind of sound localization method and device | |
CN108731886B (en) | A kind of more leakage point acoustic fix ranging methods of water supply line based on iteration recursion | |
TW201448616A (en) | Method and apparatus for determining directions of uncorrelated sound sources in a Higher Order Ambisonics representation of a sound field | |
CN102411138A (en) | Method for positioning sound source by robot | |
CN109509465A (en) | Processing method, component, equipment and the medium of voice signal | |
US20220051685A1 (en) | Method for transforming audio signal, device, and storage medium | |
CN103440872A (en) | Transient state noise removing method | |
CN103730110A (en) | Method and device for detecting voice endpoint | |
CN107527624B (en) | Voiceprint recognition method and device | |
Zhang et al. | Speech endpoint detection algorithm with low signal-to-noise based on improved conventional spectral entropy | |
CN117746905A (en) | Human activity influence assessment method and system based on time-frequency persistence analysis | |
Lathoud et al. | A sector-based, frequency-domain approach to detection and localization of multiple speakers | |
Mitre et al. | Accurate and efficient fundamental frequency determination from precise partial estimates | |
CN113270118B (en) | Voice activity detection method and device, storage medium and electronic equipment | |
CN108074588B (en) | Pitch calculation method and pitch calculation device | |
CN111192569B (en) | Double-microphone voice feature extraction method and device, computer equipment and storage medium | |
Unoki et al. | MTF-based power envelope restoration in noisy reverberant environments | |
CN113156373B (en) | Sound source positioning method, digital signal processing device and audio system | |
Li et al. | Binaural sound localization based on detection of multi-band zero-crossing points | |
Miao et al. | A Novel Tracker of Adaptive Directional Ridge Separation and Prediction for Detecting Whistles | |
Biswas et al. | FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector | |
Mao et al. | An improved accumulated cross-power spectrum phase method for time delay estimation | |
Bai et al. | A Joint Line Spectrum Detection Scheme with Stochastic Resonance Theory | |
Sun et al. | Secure Voice Processing Systems for Driverless Vehicles | |
KAZAZIS | TEMPO ESTIMATION ON MULTIPLE METRICAL LEVELS WITH SEQUENCY FLUX AND MULTIRESOLUTION ANALYSIS |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20150909 |