CN114639398B - Broadband DOA estimation method based on microphone array - Google Patents

Broadband DOA estimation method based on microphone array Download PDF

Info

Publication number
CN114639398B
CN114639398B CN202210240262.XA CN202210240262A CN114639398B CN 114639398 B CN114639398 B CN 114639398B CN 202210240262 A CN202210240262 A CN 202210240262A CN 114639398 B CN114639398 B CN 114639398B
Authority
CN
China
Prior art keywords
frequency
doa
signal
microphone array
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210240262.XA
Other languages
Chinese (zh)
Other versions
CN114639398A (en
Inventor
黄际彦
慕方方
周杨
李汉君
王珍
马敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN202210240262.XA priority Critical patent/CN114639398B/en
Publication of CN114639398A publication Critical patent/CN114639398A/en
Application granted granted Critical
Publication of CN114639398B publication Critical patent/CN114639398B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Abstract

The invention belongs to the field of array signal processing, and particularly relates to a broadband DOA estimation method based on a microphone array. The invention firstly utilizes a voice endpoint detection algorithm to preprocess voice signals received by a microphone array, extracts a voice section, then only processes the voice section, can save processing time, uses a correlation method to detect voice endpoints, then converts a preprocessed time domain signal into a frequency domain, selects corresponding frequencies according to a set threshold, then utilizes a narrow-band DOA estimation technology to calculate output signals corresponding to the microphone array under the current frequency in the selected frequencies, searches DOA corresponding to the peak value of the output signals, counts and clusters DOA under each frequency, and finally obtains a clustering result, namely a DOA estimated result.

Description

Broadband DOA estimation method based on microphone array
Technical Field
The invention belongs to the field of array signal processing, and particularly relates to a broadband DOA estimation method based on a microphone array.
Background
DOA estimation is an important basis in the field of microphone array signal processing, and the technology is intensively studied at home and abroad, and excellent technical results are achieved. At present, the narrowband technology is mature, but as the environment is more and more complex, the frequency range of the signal is wider and wider, and the broadband signal has more complex information compared with the narrowband signal, so that the research on DOA estimation of the broadband signal is necessary. Considering a complex sound source environment, information on a plurality of sound sources in a short time may sometimes occur. When the DOA estimation is performed using the delay-and-sum algorithm (Bartlett) and the classical MUSIC algorithm in a signal source that is very short and possibly has not all speech signals, it is found that not all signal sources can be estimated.
Disclosure of Invention
The invention aims at providing a broadband DOA estimation method based on a microphone array aiming at two different problems of short-time double-sound-source broadband signals and clustering results caused by human interference.
The invention mainly comprises the following steps: first, a voice signal received by a microphone array is preprocessed by a voice endpoint detection algorithm. The voice endpoint detection is to detect the presence segment, extract the presence segment, and then only process the presence segment, so that the processing time can be saved. The invention uses correlation to detect the voice end point. Then, the preprocessed time domain signal is converted to a frequency domain, and a corresponding frequency is selected according to a set threshold. And then, obtaining output signals corresponding to the microphone array at the current frequency in the selected frequency by utilizing a narrow-band DOA estimation technology. And searching an angle corresponding to the peak value of the output signal, counting and clustering DOA under each frequency, and obtaining a final clustering result, namely a DOA estimated result.
The specific technical scheme of the invention is as follows:
a wideband DOA estimation method based on a microphone array is characterized in that frequency information in wideband signals is utilized to count estimated angles in frequency, and the resolution of high-frequency DOA is higher. The method comprises the following steps:
s1, voice signal preprocessing
For a piece of audio data it may not contain speech components at all times, where only noise signals may be present at a certain time. Voice endpoint detection techniques are used to improve this problem. The voice endpoint detection is to detect the presence segment, extract the presence segment, and then only process the presence segment, so that the processing time can be saved. Common endpoint detection methods are: a double threshold method, a correlation method and a spectral entropy method. The present invention uses correlation methods for endpoint detection.
Voice signal x n Short-time autocorrelation function R of (m) n (k) Is calculated as
Figure BDA0003541208580000021
Where K represents the maximum delay point number.
In order to avoid influence caused by absolute energy in the voice endpoint detection process, normalization processing is performed on the autocorrelation function as follows:
R n (k)=R n (k)/R n (0)(0≤k≤K)
when the voice endpoint detection is carried out, firstly, the autocorrelation is carried out according to the noise section of the current voice data, and two critical values T are determined according to the autocorrelation value 1 、T 2 . The audio data is subjected to framing processing, and when the maximum value of the autocorrelation function of the initial frame data is larger than the maximum value of the autocorrelation function of the end frame data is smaller than the maximum value of the autocorrelation function of the end frame data, the section has a speaking section. The result of voice endpoint detection for a segment of audio data using the autocorrelation function maximum is shown in fig. 4.
S2, frequency selection
The frequency range of the voice signal is f min =300 Hz to f max =3400hz, and this band is divided into N bands. The preprocessed voice signal in the time domain is converted into the frequency domain. The frequency with higher energy is selected. Setting a threshold value by using the data received by the array element 1, wherein the set threshold value T is as follows:
Figure BDA0003541208580000022
wherein Y is i Representing the corresponding amplitude value at the ith frequency band received by the array element 1;
Figure BDA0003541208580000023
represents the average value of Y. And selecting frequencies which are larger than the frequency corresponding to the T according to the threshold value T on the corresponding frequency band, namely selecting frequencies with higher energy.
S3, DOA estimation
An angular scan is performed at each frequency obtained in the second step, with the scan angle ranging from 0 degrees to 180 degrees, once every 0.5 degrees. The method comprises the following steps:
w=e -2πfτ
wherein w represents a weight vector; f represents the frequency point selected in the step 2; τ represents the delay of the microphone element relative to the reference element.
The output of the microphone array at each selected frequency bin is obtained as:
Y out =|w′Y max |
Y max representing the values of all array elements on the corresponding frequency domains under the selected frequency points; w' represents the transpose of the weighting vector w. Then searching the peak value of the output signal, wherein the corresponding angle at the peak value of the signal is the DOA angle under each frequency point, and counting the DOA corresponding to each frequency point to obtain a data set D θ
S4, clustering
In the clustering algorithm, DOA estimated in the third step is taken as a data set D θ . Firstly, the number of clusters in the data set is found, and the method is that the number range of clusters is set to be 1, min (100, length (D) θ ))](length(D θ ) For dataset D θ The data size of the cluster), and then the fastest cluster number of which the sum of the distances between all points and the centroid between the classes is reduced is obtained in the range, namely the cluster number L to be found. Then, with the angular accuracy, a threshold value is set. Removal of D θ The angle exceeding the threshold value is obtained to obtain the remaining angle D θ ' as a new dataset, D will finally θ ' Kmeans clustering was performed.
The angle measurement precision is affected by noise, and is:
Figure BDA0003541208580000031
in θ B Representing the beam width; SNR is expressed as signal-to-noise ratio. Consider the input signal as:
X(t)=X s (t)+X n (t)
wherein X is s (t) represents a target signal; x is X n (t) the noise signal is
Figure BDA0003541208580000032
/>
Figure BDA0003541208580000033
The output of the beamforming is
y(t)=W H X(t)=W H X s (t)+W H X n (t)
Wherein W represents a weight vector; the superscript "H" denotes the conjugate transpose. The output power is
E[|y(t)| 2 ]=W H R s W+W H R n W
In which W is H R s W is the signal power; w (W) H R n W is the noise power, then the signal to noise ratio is
Figure BDA0003541208580000041
Utilizing the obtained clustering number L to D θ ' Kmeans clustering is carried out to obtain a new clustering result.
Compared with the traditional DOA estimation method, the method utilizes the information of all the frequency points and counts the DOA estimated by each frequency point. As can be seen from fig. 2,3 and 5, the present invention improves the performance of DOA estimation, improves the DOA angular resolution, and can correctly obtain the incoming wave direction of the double sound sources. The algorithm not only correctly judges the number of clusters, but also combines the angle measurement precision to improve the precision of DOA estimation results.
Drawings
FIG. 1 is a schematic illustration of sound source propagation;
FIG. 2 is a graph of DOA estimation results of a delay-add algorithm;
FIG. 3 is a graph of the DOA estimation results of the MUSIC algorithm;
FIG. 4 is a schematic diagram of voice endpoint detection; a step of
FIG. 5 is a graph of the results after 700HZ-710HZ DOA estimation
FIG. 6 is a graph of DOA estimation results for wideband signals based on a microphone array
FIG. 7 is an initial clustering result graph
FIG. 8 is a graph showing the relationship between the number of clusters and the distance between classes
FIG. 9 is a graph showing DOA estimation results after angle screening
Detailed Description
The technical scheme of the invention is further described in detail:
establishing a microphone array signal model
A microphone array signal refers to a signal received by a microphone array that may obtain more information than a single microphone array element. For far-field sources, the sound-to-microphone elements are transmitted in the form of plane waves. The sound source propagation schematic diagram is shown in fig. 1, taking an array element 1 as a reference array element, and coordinates of an array element 2 relative to the array element 1 are as follows:
r=[x,y,z] T
assuming that the included angle between the sound source and the axis is theta, the propagation speed of the sound source in the air is c, and the propagation direction of the signal
Figure BDA0003541208580000051
The method comprises the following steps:
Figure BDA0003541208580000052
if the signal received by the array element 1 is s (t), the signal received by the array element 2 is relative to the signal received by the array element 1
Figure BDA0003541208580000053
By analogy, let the number of microphone array elements be M, and the M (m=1, 2,3 …, M) th array element position be r m . When the microphone array receives K signals in total, the signals received by the m-th array element are: />
Figure BDA0003541208580000054
Wherein n is m (t) Representing the interference signal received by the mth array element,
Figure BDA0003541208580000055
the method comprises the following steps:
Figure BDA0003541208580000056
if the time when the kth signal reaches the array element 1 is 0, taking the array element 1 as a reference, the time delay tau of the kth signal received by the kth array element mk
Figure BDA0003541208580000057
The matrix form of the signals received by the microphone array is:
Figure BDA0003541208580000058
the conventional array signal processing technology basically assumes that M array elements exist in a marc array under the condition of narrowband signals, and the incident signal s k (t) can be expressed in complex envelope form as:
Figure BDA0003541208580000061
wherein ω represents a frequency; u (u) k (t) represents amplitude;
Figure BDA0003541208580000062
representing the phase. Simultaneously satisfies the following formula:
Figure BDA0003541208580000063
substituting the above into the incident signal s k (t) the form of the available complex envelope can be deduced:
Figure BDA0003541208580000064
the signals received by the m-th array element are as follows:
Figure BDA0003541208580000065
wherein alpha is mk Representing the attenuation of sound source k to the mth array element; n is n m (t) represents noise of the mth array element at the moment; τ mk Representing the time delay for the kth sound source to reach the mth microphone. Based on attenuation alpha in far field conditions mk For 1, the matrix expression for obtaining the receiving signals of the array elements at the moment t is as follows:
Figure BDA0003541208580000066
writing the above as a vector form
X=AS+N
Wherein X represents signals received by M array elements; a is a dimensional array direction matrix; s represents a source, and the size of the source is K1 dimension matrix; n represents an interference signal, and the size is m×1 dimensional matrix.
The frequency domain model of the broadband signal can effectively solve the problem that the array flow pattern in the broadband signal model changes because of the non-single frequency, thereby causing the change of the subspace of the signal. For a certain array, let it now be assumed that the incident signal is a wideband signal, and according to equation (2-7), the signal received by the mth array element is:
Figure BDA0003541208580000071
fourier transforming the above can result in:
Figure BDA0003541208580000072
from the above equation, fourier transform converts the time delay in the time domain to a phase shift in the frequency domain, which can be converted into:
Figure BDA0003541208580000073
from the narrowband signal model and the wideband signal model, it is known that the narrowband model is a time-domain based expression, while the wideband signal model is a frequency-domain based expression.
The peak value of the autocorrelation function of the speech segment is much larger than the peak value of the autocorrelation function of the speech segment, so that the endpoint detection of the speech signal can be done based on the maximum value of the autocorrelation function. When the voice endpoint detection is carried out, firstly, the autocorrelation is carried out according to the noise section of the current voice data, and two critical values T are determined according to the autocorrelation value 1 、T 2 . And carrying out framing processing on the audio data, and when the maximum value of the autocorrelation function of the initial frame data is larger than that of the final frame data and the maximum value of the autocorrelation function of the final frame data is smaller than that of the final frame data, the section belongs to the talking section.
Examples
Two sections of audios are selected, two-second signals are intercepted, then the two sections of audios form signals actually received by each array element, the actual incidence angles of the two sections of audios are 70 degrees and 110 degrees, and the distances from the origin of the array elements are 8 meters and 5 meters respectively. After which a gaussian white noise signal is added. In estimating multiple targets, a delay-and-add algorithm is used to estimate DOA. Wherein the scanning angle is set to 0 ° to 180 °, and the calculation is performed every 0.5 °. Firstly, calculating the delay of each array element relative to a reference array element, then carrying out corresponding delay on signals received by each array element, adding the signals after the delay, and then solving the power corresponding to the signals. The power at each angle of the division is then normalized and logarithmically processed. The DOA estimation results using delay-and-add are shown in FIG. 2. The DOA estimation is carried out by using a broadband MUSIC, firstly, signals received by a microphone array are transferred from a time domain to a frequency domain, the frequency bands are divided, then, the frequency bands with higher energy are selected from the divided frequency bands, a covariance matrix is calculated on the selected frequency bands, then, eigenvalue decomposition is carried out on the covariance matrix to obtain a signal subspace and a noise subspace, a steering vector of the microphone array is constructed, a space spectrum is obtained by using a construction function of the broadband MUSIC algorithm, and in the power spectrum, the angle corresponding to a peak value is the incoming wave direction of a target sound source. The DOA estimation result of the wideband MUSIC algorithm is shown in figure 3. The invention selects the frequency range 1025HZ-1030HZ to draw every 0.5 degrees, and the result is shown in figure 5. After preliminary clustering is carried out on the multiple results, the results are shown in fig. 7, and the final clustering effect is shown in fig. 9.
The invention can utilize the audio information in a short time to carry out correct DOA estimation, and can be seen from the result graph that the invention improves the DOA estimation performance and can correctly obtain the incoming wave direction of the double sound sources. After DOA estimation, the obtained angles are clustered by using a Kmeans algorithm, the clustering result is the incoming wave direction of the target signal, and the Kmeans clustered result is 108.6032 degrees and 71.4290 degrees. The number of Kmeans clusters is considered to be artificially decided. This interference will result in different clustering results, and in some cases, the user cannot determine the number of clusters, the present invention improves the clustering for this problem, and as can be seen from the result fig. 8, the present invention correctly determines the number of clusters, and the final clustering results are 109.8939 ° and 70.0956 ° as shown in fig. 9. It can be seen that combining angular accuracy improves the accuracy of the DOA estimation.

Claims (1)

1. A wideband DOA estimation method based on a microphone array, comprising the steps of:
s1, preprocessing a voice signal received by a microphone array, specifically extracting a voice section in the voice signal;
s2, dividing the frequency range of the voice signal into N frequency bands, converting the extracted voice signal from a time domain to a frequency domain, and selecting the frequency domain signal according to energy, wherein the method comprises the following steps: the threshold value is set according to the data received by the array element 1:
Figure QLYQS_1
wherein Y is i Representing the corresponding amplitude at the i-th frequency band received by element 1,
Figure QLYQS_2
representing the average value of the whole array element receiving signals Y, and selecting frequencies which are larger than T and correspond to the T according to a threshold T on the corresponding frequency band;
s3, carrying out angle scanning on the frequency selected in the step S2 from 0 degree to 180 degrees, and scanning once every 0.5 degrees to obtain:
w=e -2πfτ
wherein w represents a weighting vector, f represents the frequency point selected in S2, and τ represents the time delay of the microphone array element relative to the reference array element;
obtaining output signals Y of the microphone array at each selected frequency point out The method comprises the following steps:
Y out =|w′Y max |
wherein Y is max Representing the values of all array elements in the corresponding frequency domain at the selected frequency point, w' representing the transpose of the weighting vector w, and then finding the output signal Y out The corresponding angle at the peak of the signal is DOA angle under each frequency point, and the DOA corresponding to each frequency point is counted to obtain a data set D θ
S4, adopting the obtained data set D θ Clustering is carried out to obtain DOA estimation results, specifically:
s41, obtaining the number of clusters: the number of clusters is set to be 1, min (100, length (D) θ ))],length(D θ ) For dataset D θ The data size of the cluster is calculated under the range of the cluster number, namely the cluster number L to be found is obtained, the threshold value is set by using the angle measurement precision, and D is removed θ The angle exceeding the threshold value, the remaining angle is obtained as a new data set D' θ The method comprises the steps of carrying out a first treatment on the surface of the Wherein the angle measurement precision is:
Figure QLYQS_3
wherein θ B Representing beam width, SNR is expressed as signal to noise ratio;
s42, for new data set D' θ And carrying out Kmeans clustering to obtain a new clustering result, namely a DOA estimation result.
CN202210240262.XA 2022-03-10 2022-03-10 Broadband DOA estimation method based on microphone array Active CN114639398B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210240262.XA CN114639398B (en) 2022-03-10 2022-03-10 Broadband DOA estimation method based on microphone array

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210240262.XA CN114639398B (en) 2022-03-10 2022-03-10 Broadband DOA estimation method based on microphone array

Publications (2)

Publication Number Publication Date
CN114639398A CN114639398A (en) 2022-06-17
CN114639398B true CN114639398B (en) 2023-05-26

Family

ID=81947613

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210240262.XA Active CN114639398B (en) 2022-03-10 2022-03-10 Broadband DOA estimation method based on microphone array

Country Status (1)

Country Link
CN (1) CN114639398B (en)

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
CN102918588A (en) * 2010-03-29 2013-02-06 弗兰霍菲尔运输应用研究公司 A spatial audio processor and a method for providing spatial parameters based on an acoustic input signal
JP2014137226A (en) * 2013-01-15 2014-07-28 Advanced Telecommunication Research Institute International Mobile object, and system and method for creating acoustic source map
WO2014147442A1 (en) * 2013-03-20 2014-09-25 Nokia Corporation Spatial audio apparatus
CN106251877A (en) * 2016-08-11 2016-12-21 珠海全志科技股份有限公司 Voice Sounnd source direction method of estimation and device
CN106504763A (en) * 2015-12-22 2017-03-15 电子科技大学 Based on blind source separating and the microphone array multiple target sound enhancement method of spectrum-subtraction
CN107797096A (en) * 2017-10-20 2018-03-13 电子科技大学 A kind of detection localization method of blowing a whistle based on microphone face battle array
CN108447493A (en) * 2018-04-03 2018-08-24 西安交通大学 Frequency domain convolution blind source separating frequency-division section multiple centroid clustering order method
CN108519593A (en) * 2018-03-22 2018-09-11 电子科技大学 A kind of asynchronous tracking method based on single station two-frequency CW radar
CN108538306A (en) * 2017-12-29 2018-09-14 北京声智科技有限公司 Improve the method and device of speech ciphering equipment DOA estimations
CN108899044A (en) * 2018-07-27 2018-11-27 苏州思必驰信息科技有限公司 Audio signal processing method and device
CN109283487A (en) * 2018-05-09 2019-01-29 南京信息工程大学 MUSIC-DOA method based on the response of support vector machines controlled power
CN109669159A (en) * 2019-02-21 2019-04-23 深圳市友杰智新科技有限公司 Auditory localization tracking device and method based on microphone partition ring array
CN113096684A (en) * 2021-06-07 2021-07-09 成都启英泰伦科技有限公司 Target voice extraction method based on double-microphone array
WO2022042864A1 (en) * 2020-08-31 2022-03-03 Proactivaudio Gmbh Method and apparatus for measuring directions of arrival of multiple sound sources

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9357293B2 (en) * 2012-05-16 2016-05-31 Siemens Aktiengesellschaft Methods and systems for Doppler recognition aided method (DREAM) for source localization and separation

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101447190A (en) * 2008-06-25 2009-06-03 北京大学深圳研究生院 Voice enhancement method employing combination of nesting-subarray-based post filtering and spectrum-subtraction
CN102918588A (en) * 2010-03-29 2013-02-06 弗兰霍菲尔运输应用研究公司 A spatial audio processor and a method for providing spatial parameters based on an acoustic input signal
JP2014137226A (en) * 2013-01-15 2014-07-28 Advanced Telecommunication Research Institute International Mobile object, and system and method for creating acoustic source map
WO2014147442A1 (en) * 2013-03-20 2014-09-25 Nokia Corporation Spatial audio apparatus
CN106504763A (en) * 2015-12-22 2017-03-15 电子科技大学 Based on blind source separating and the microphone array multiple target sound enhancement method of spectrum-subtraction
CN106251877A (en) * 2016-08-11 2016-12-21 珠海全志科技股份有限公司 Voice Sounnd source direction method of estimation and device
CN107797096A (en) * 2017-10-20 2018-03-13 电子科技大学 A kind of detection localization method of blowing a whistle based on microphone face battle array
CN108538306A (en) * 2017-12-29 2018-09-14 北京声智科技有限公司 Improve the method and device of speech ciphering equipment DOA estimations
CN108519593A (en) * 2018-03-22 2018-09-11 电子科技大学 A kind of asynchronous tracking method based on single station two-frequency CW radar
CN108447493A (en) * 2018-04-03 2018-08-24 西安交通大学 Frequency domain convolution blind source separating frequency-division section multiple centroid clustering order method
CN109283487A (en) * 2018-05-09 2019-01-29 南京信息工程大学 MUSIC-DOA method based on the response of support vector machines controlled power
CN108899044A (en) * 2018-07-27 2018-11-27 苏州思必驰信息科技有限公司 Audio signal processing method and device
CN109669159A (en) * 2019-02-21 2019-04-23 深圳市友杰智新科技有限公司 Auditory localization tracking device and method based on microphone partition ring array
WO2022042864A1 (en) * 2020-08-31 2022-03-03 Proactivaudio Gmbh Method and apparatus for measuring directions of arrival of multiple sound sources
CN113096684A (en) * 2021-06-07 2021-07-09 成都启英泰伦科技有限公司 Target voice extraction method based on double-microphone array
CN113889138A (en) * 2021-06-07 2022-01-04 成都启英泰伦科技有限公司 Target voice extraction method based on double-microphone array

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
Adaptive state updating particle filter tracking algorithm based on cubic spline interpolation;Zhou Y, Mu F F, Hu J F;《2021 International Conference on Electronic Information Engineering and Computer Science》;484-488 *
Coherent signals direction-of-arrival estimation using a spherical microphone array: Frequency smoothing approach;Khaykin, D., & Rafaely, B;《2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics》;221-224 *
Direction of arrival estimation of reflections from room impulse responses using a spherical microphone array;Tervo, S., & Politis, A;《IEEE/ACM Transactions on Audio, Speech, and Language Processing》;第23卷(第10期);1539-1551 *
Effectiveness of the Wideband Minimum Variance Direction Finding Method in Nonuniform Noise;Wen, F., Wan, Q., & Huang, J;《In 2012 IEEE 12th International Conference on Computer and Information Technology》;277-280 *
Generalized analyticaltechnique for the synthesisof unequally spaced arrays with linear, planar, cylindrical or spherical geometry;KumarB P,BrannerGR;《IEEE Transactions onAntennas & Propagation》;第53卷(第2期);621-634 *
基于 Root-ISM 算法的宽带非相干信号 DOA 估计;郭亚萍等;《电子科技》;第28卷(第6期);38-44 *
基于麦克风阵列的语音增强与干扰抑制算法;王义圆;张曦文;周贻能;黄际彦;;《电声技术》;第42卷(第02期);1-5 *
相干语音信号的两阵元DOA估计新算法;殷晴青;何培宇;;《信号处理》;第29卷(第09期);1170-1174 *

Also Published As

Publication number Publication date
CN114639398A (en) 2022-06-17

Similar Documents

Publication Publication Date Title
CN108375763B (en) Frequency division positioning method applied to multi-sound-source environment
CN107644650B (en) Improved sound source positioning method based on progressive serial orthogonalization blind source separation algorithm and implementation system thereof
CN111044973B (en) MVDR target sound source directional pickup method for microphone matrix
Moore et al. Direction of arrival estimation using pseudo-intensity vectors with direct-path dominance test
CN111239680B (en) Direction-of-arrival estimation method based on differential array
CN103426440A (en) Voice endpoint detection device and voice endpoint detection method utilizing energy spectrum entropy spatial information
JP2019503107A (en) Acoustic signal processing apparatus and method for improving acoustic signals
CN106782590A (en) Based on microphone array Beamforming Method under reverberant ambiance
CN102204281A (en) A system and method for producing a directional output signal
CN103907152A (en) A method and a system for noise suppressing an audio signal
CN112485761B (en) Sound source positioning method based on double microphones
CN108198568A (en) A kind of method and system of more auditory localizations
JP2002530966A (en) Binaural signal processing technology
CN113096684A (en) Target voice extraction method based on double-microphone array
CN110830870A (en) Earphone wearer voice activity detection system based on microphone technology
CN114639398B (en) Broadband DOA estimation method based on microphone array
CN109901114B (en) Time delay estimation method suitable for sound source positioning
Gao et al. A modified frequency weighted MUSIC algorithm for multiple sound sources localization
Guo et al. A Robust Dual-Microphone Speech Source Localization Algorithm for Reverberant Environments.
WO2023108864A1 (en) Regional pickup method and system for miniature microphone array device
Hu et al. Robust binaural sound localisation with temporal attention
Xu et al. Adaptive speech enhancement algorithm based on first-order differential microphone array
CN113189544A (en) Multi-sound-source positioning method for removing outliers by weighting of activity intensity vectors
CN113160843A (en) Particle vibration velocity sensor microarray-based interference voice suppression method and device
Shen et al. A modified cross power-spectrum phase method based on microphone array for acoustic source localization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant