CN113933779A - Unknown sound source number DOA estimation method based on S transformation - Google Patents

Unknown sound source number DOA estimation method based on S transformation Download PDF

Info

Publication number
CN113933779A
CN113933779A CN202111206753.4A CN202111206753A CN113933779A CN 113933779 A CN113933779 A CN 113933779A CN 202111206753 A CN202111206753 A CN 202111206753A CN 113933779 A CN113933779 A CN 113933779A
Authority
CN
China
Prior art keywords
transformation
signal
array
frequency
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111206753.4A
Other languages
Chinese (zh)
Inventor
钟舜聪
黎昕婷
钟剑锋
许令鸿
吴生源
谢宇昆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN202111206753.4A priority Critical patent/CN113933779A/en
Publication of CN113933779A publication Critical patent/CN113933779A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S3/00Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
    • G01S3/80Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
    • G01S3/802Systems for determining direction or deviation from predetermined direction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/14Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Algebra (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)

Abstract

The invention relates to a DOA estimation method of the number of unknown sound sources based on S transformation. The method establishes a time-frequency domain array signal model which has consistency with the traditional array model form by performing S transformation on voice signals, introduces multi-resolution analysis, and has the advantages that the self-adaptive time-frequency window of wavelet transformation breaks away from the limitation of wavelet transformation allowable conditions and the like; and then, according to the characteristic that the power spectrum matrix of the frequency band at different moments is in a joint diagonalization structure, the model is applied to the traditional method to obtain a new space spectrum, the space spectrum is based on an S transformation result, the problem that the number of signal sources needs to be preset in the traditional MUSIC is solved, and the DOA estimation precision is better.

Description

Unknown sound source number DOA estimation method based on S transformation
Technical Field
The invention belongs to the field of voice signal positioning, and particularly relates to a DOA (direction of arrival) estimation method for the number of unknown sound sources based on S (direction of arrival) transformation.
Background
Direction of arrival (DOA) estimation based on sensor arrays has been a research hotspot in the fields of sonar, radar, communication, speech processing and the like, and the requirement of modern technology development makes it no longer limited to traditional narrow-band signal processing. The speech signal is used as a broadband non-stationary signal, and the traditional narrowband signal processing cannot realize high-precision positioning of the sound source position.
The broadband signal DOA estimation algorithm is mainly divided into two types in principle: non-coherent signal subspace approach and coherent signal subspace approach. The former uses the idea of frequency decomposition to decompose a broadband source signal into a series of narrowband signals and then perform DOA estimation. The latter is based on the principle of frequency focusing, wherein the most typical method is a bilateral correlation transformation method, signals obtained by decomposition are transformed to a uniform reference frequency point through a focusing matrix, then DOA estimation is carried out in a narrow-band mode, the coherent signal subspace method also needs to estimate the angle of a signal source, and the dependence of the focusing matrix on the estimated angle causes deviation of a final DOA estimation result.
For broadband non-stationary signals, researchers also try to research from the time-frequency domain, and the time-frequency information of the broadband signals is fully utilized to obtain more accurate DOA estimation results. Common time-frequency analysis methods include short-time fourier transform, S-transform, and wavelet transform, which are capable of multi-resolution analysis due to variable window functions. The S transformation is a time-frequency analysis method between short-time Fourier transformation and wavelet transformation, overcomes the defect that the short-time Fourier transformation cannot adjust the frequency of an analysis window, introduces multi-resolution analysis, and has the advantages that the self-adaptive time-frequency window of the wavelet transformation and basic wavelets are out of the limit of allowable conditions, and the like.
Disclosure of Invention
The present invention is to solve the above problems in the prior art, and provide a method for estimating the number of unknown sound sources DOA based on S transform.
In order to achieve the purpose, the technical scheme of the invention is as follows: a DOA (direction of arrival) estimation method for the number of unknown sound sources based on S transformation comprises the following steps:
1) in a sound field, arranging a microphone array which is uniformly and linearly arranged, and synchronously acquiring a section of voice signal by using a data acquisition device to obtain an array receiving signal;
2) s transformation is carried out on the signals after S transformation parameters are set according to the effective frequency band of the array received signals, and S transformation results are obtained;
3) constructing a time-frequency array data model under S transformation according to the S transformation result obtained in the step 2) to obtain a time-frequency domain guide vector;
4) performing correlation operation on the S transformation result according to the time-frequency array data model obtained in the step 3) to obtain an autocorrelation spectrum covariance matrix of the array signal;
5) constructing target direction-of-arrival estimation under a noise condition according to the time-frequency domain steering vector in the step 3) and the autocorrelation spectrum covariance matrix of the array signal obtained in the step 4);
6) solving the target direction of arrival estimation minimization function obtained in the step 5) to obtain a space spectrum estimation result;
7) and setting a search range and search stepping, and obtaining the positions of all spectrum peaks through spectrum search to obtain the maximum estimation value of the DOA.
In an embodiment of the present invention, in step 2), S-transform is performed on the array received signal to obtain an S-transform result, where the S-transform step is:
firstly, defining the received signal of the m-th array element in the array as:
Figure BDA0003305694700000021
p is the number of signal sources, sp(t) is the incident signal of P broadband signal sources, taumpIs the time delay, n, caused by the arrival of the p-th signal at the m-th array elementm(t) isAdditive white Gaussian noise of m array elements;
for signal xm(t) S-transform, the result ST of the transformmCan be expressed as:
Figure BDA0003305694700000022
τ is a time factor, fi(I ═ 1,2, …, I) is the frequency component, and the above equation is written as a fourier spectrum:
Figure BDA0003305694700000023
Figure BDA0003305694700000024
in order to be a component of the signal,
Figure BDA0003305694700000025
is a noise component.
In an embodiment of the present invention, the step 3) of constructing a time-frequency array data model under S transform to obtain a time-frequency domain steering vector includes:
assuming that the center frequency and the bandwidth of the S transform frequency window satisfy the narrow-band signal condition, the time-frequency array signal model of the M array element receiving signals is expressed as follows:
Figure BDA0003305694700000026
Figure BDA0003305694700000027
is a signal sp(t), P is 1,2, …, S transform result vector of P;
Figure BDA0003305694700000028
is a noise vector;
H(θ,fi)=[h(θ1,fi),h(θ2,fi),…,h(θP,fi)]an array direction matrix with dimension of M multiplied by P;
the steering vector of the array model data is represented as:
h(θp,fi)=[1,exp(-j2πfiτ1p),…,exp(-j2πfiτ(M-1)p)]T
in an embodiment of the present invention, the step 4) of performing correlation operation on the S transformation result to obtain an autocorrelation spectrum covariance matrix of the array signal includes:
ST (tau, f) is calculated according to the principle of classical multiple signal classification algorithmi) The covariance matrix of (a) is:
RY=E{ST(τ,fi)·STH(τ,fi)}=H(θ,fi)RX(τ,fi)HH(θ,fi)+σi 2I
RX(τ,fi) Is composed of
Figure BDA0003305694700000031
The covariance matrix of (a) is determined,
Figure BDA0003305694700000032
for transformed noise variance, σiIs a matrix singular value;
since the received data is finite in length in practice, the maximum likelihood estimate of the data covariance matrix is expressed as:
Figure BDA0003305694700000033
in an embodiment of the present invention, in step 5), a target direction of arrival estimation under a noise condition is constructed according to a time-frequency domain steering vector and an autocorrelation spectrum covariance matrix of an array signal, and the steps are as follows:
for the nth (1, 2, P) signal source, a vector b is definedn(f) Satisfy the requirement of
Figure BDA0003305694700000034
Regardless of the noise term, there is:
Figure BDA0003305694700000035
wherein
Figure BDA0003305694700000036
Under noisy conditions, the target direction-of-arrival estimate, i.e., the minimization function, is expressed as
Figure BDA0003305694700000037
Wherein d (f) is satisfied
Figure BDA0003305694700000038
In an embodiment of the present invention, in step 6), a minimization function of the estimation of the target direction of arrival is solved, and the obtained spatial spectrum estimation result is:
s transform to get
Figure BDA0003305694700000039
Substituting the data covariance result based on the S transformation for the time continuum processing result, the spectral estimation formula is:
Figure BDA0003305694700000041
compared with the prior art, the invention has the following beneficial effects:
the invention firstly carries out S transformation on voice signals to obtain a multi-resolution time-frequency spectrum matrix, thereby establishing a time-frequency domain array signal model which has consistency with the traditional array model form, overcoming the defect that the frequency of an analysis window cannot be adjusted by short-time Fourier transformation, introducing multi-resolution analysis, and having the advantages that the self-adaptive time-frequency window of wavelet transformation breaks away from the limitation of wavelet transformation tolerance conditions and the like; and then, according to the characteristic that the power spectrum matrix of the frequency band at different moments is in a joint diagonalization structure, applying the model to a traditional method to obtain a new space spectrum, wherein the space spectrum is based on an S transformation result, and the problem that the number of signal sources needs to be preset in the traditional MUSIC is solved.
The invention provides a DOA estimation method of the number of unknown sound sources based on S transformation, aiming at the problems of high calculation complexity, large calculation amount, the need of pre-estimating the number of signal sources and the like in the DOA estimation method. Firstly, S transformation is carried out on voice signals, a time-frequency domain array signal model which is consistent with the traditional array model form is established, multi-resolution analysis is introduced, and the method has the advantages that the self-adaptive time-frequency window of wavelet transformation breaks away from the limitation of wavelet transformation tolerance conditions and the like; and then, according to the characteristic that the power spectrum matrix of the frequency band at different moments is in a joint diagonalization structure, applying the model to a traditional method to obtain a new space spectrum, wherein the space spectrum is based on an S transformation result, and the problem that the number of signal sources needs to be preset in the traditional MUSIC is solved. The method can improve the DOA estimation resolution, has better estimation precision and has stronger practicability in the actual processing of voice signal sound source positioning.
Drawings
Fig. 1 is a flow chart of the present invention and fig. 1 is a programming flow chart of the positioning method of the present invention.
Fig. 2 shows the positioning result of one-dimensional wideband signal DOA estimation in the embodiment of the present invention.
Fig. 3 shows the positioning result of the DOA estimation of the two-dimensional speech signal in the example of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be described in detail and clearly with reference to the accompanying drawings. As shown in fig. 1, the present invention provides a method for estimating the number DOA of unknown sound sources based on S transform, comprising the following steps:
1) in a sound field, arranging a microphone array which is uniformly and linearly arranged, and synchronously acquiring a section of voice signal by using a data acquisition device to obtain an array receiving signal;
2) s transformation is carried out on the signals after S transformation parameters are set according to the effective frequency band of the received signals, and S transformation results are obtained; the method comprises the following specific steps:
firstly, defining the received signal of the m-th array element in the array as:
Figure BDA0003305694700000051
p is the number of signal sources, sp(t) is the incident signal of P broadband signal sources, taumpIs the time delay, n, caused by the arrival of the p-th signal at the m-th array elementm(t) is additive white gaussian noise for the mth array element;
for signal xm(t) S-transform, the result ST of the transformmCan be expressed as:
Figure BDA0003305694700000052
τ is a time factor, fi(I ═ 1,2, …, I) is the frequency component, and the above equation is written as a fourier spectrum:
Figure BDA0003305694700000053
Figure BDA0003305694700000054
in order to be a component of the signal,
Figure BDA0003305694700000055
is a noise component;
3) constructing a time-frequency array data model under S transformation according to the S transformation result obtained in the step 2) to obtain a time-frequency domain guide vector;
assuming that the center frequency and the bandwidth of the S transform frequency window satisfy the narrow-band signal condition, the time-frequency array signal model of the M array element receiving signals is expressed as follows:
Figure BDA0003305694700000056
Figure BDA0003305694700000057
is a signal sp(t), P is 1,2, …, S transform result vector of P;
Figure BDA0003305694700000058
is a noise vector;
H(θ,fi)=[h(θ1,fi),h(θ2,fi),…,h(θP,fi)]an array direction matrix with dimension of M multiplied by P;
the steering vector of the array model data is represented as:
h(θp,fi)=[1,exp(-j2πfiτ1p),…,exp(-j2πfiτ(M-1)p)]T
4) calculating the S transformation result to obtain an autocorrelation spectrum covariance matrix of the array signal;
ST (tau, f) is calculated according to the principle of classical multiple signal classification algorithmi) The covariance matrix of (a) is:
RY=E{ST(τ,fi)·STH(τ,fi)}=H(θ,fi)RX(τ,fi)HH(θ,fi)+σi 2I
RX(τ,fi) Is composed of
Figure BDA0003305694700000061
The covariance matrix of (a) is determined,
Figure BDA0003305694700000062
for transformed noise variance, σiIs a matrix singular value;
since the received data is finite in length in practice, the maximum likelihood estimate of the data covariance matrix is expressed as:
Figure BDA0003305694700000063
5) constructing target direction-of-arrival estimation under a noise condition;
for the nth (1, 2, P) signal source, a vector b is definedn(f) Satisfy the requirement of
Figure BDA0003305694700000064
Regardless of the noise term, there is:
Figure BDA0003305694700000065
wherein
Figure BDA0003305694700000066
Under noisy conditions, the target direction-of-arrival estimate, i.e., the minimization function, is expressed as
Figure BDA0003305694700000067
Wherein d (f) is satisfied
Figure BDA0003305694700000068
6) Solving a target direction of arrival estimation minimization function to obtain a space spectrum estimation result;
s transform to get
Figure BDA0003305694700000069
Substituting the data covariance result based on the S transformation for the time continuum processing result, the spectral estimation formula is:
Figure BDA00033056947000000610
7) and setting a search range and search stepping, and obtaining the positions of all spectrum peaks through spectrum search to obtain the maximum estimation value of the DOA.
In this example, the following experimental conditions were constructed: a16-element uniform linear array is adopted to construct two incoherent linear frequency modulation signals with the signal-to-noise ratio of 0dB and the frequency range of 165Hz to 300Hz, the incidence angles of the signals are set to be-20 degrees and 20 degrees respectively, the sampling frequency is 4kHz, and the total length of the signals is 1024 data points.
As shown in fig. 2, accurate DOA estimation results can be obtained at the double peak.
Under the same condition, the real voice signals are adopted to carry out two-dimensional sound source positioning verification, the reverberation condition is not considered, and the azimuth angles and the pitch angles of the two voice sound sources are respectively set to be 40 degrees, 40 degrees and-20 degrees.
As can be seen from fig. 3, this method can also obtain accurate positioning results in two-dimensional positioning.
The above examples are to be construed as merely illustrative and not limitative of the remainder of the disclosure. After reading the description of the invention, the skilled person can make various changes or modifications to the invention, and these equivalent changes and modifications also fall into the scope of the invention defined by the claims.

Claims (6)

1. A DOA (direction of arrival) estimation method for the number of unknown sound sources based on S transformation is characterized by comprising the following steps of:
1) in a sound field, arranging a microphone array which is uniformly and linearly arranged, and synchronously acquiring a section of voice signal by using a data acquisition device to obtain an array receiving signal;
2) s transformation is carried out on the signals after S transformation parameters are set according to the effective frequency band of the array received signals, and S transformation results are obtained;
3) constructing a time-frequency array data model under S transformation according to the S transformation result obtained in the step 2) to obtain a time-frequency domain guide vector;
4) performing correlation operation on the S transformation result according to the time-frequency array data model obtained in the step 3) to obtain an autocorrelation spectrum covariance matrix of the array signal;
5) constructing target direction-of-arrival estimation under a noise condition according to the time-frequency domain steering vector in the step 3) and the autocorrelation spectrum covariance matrix of the array signal obtained in the step 4);
6) solving the target direction of arrival estimation minimization function obtained in the step 5) to obtain a space spectrum estimation result;
7) and setting a search range and search stepping, and obtaining the positions of all spectrum peaks through spectrum search to obtain the maximum estimation value of the DOA.
2. The method for estimating the number of unknown acoustic sources DOA based on S transform as claimed in claim 1, wherein the S transform is performed on the array received signals in step 2) to obtain S transform results, and the S transform step is:
firstly, defining the received signal of the m-th array element in the array as:
Figure FDA0003305694690000011
p is the number of signal sources, sp(t) is the incident signal of P broadband signal sources, taumpIs the time delay, n, caused by the arrival of the p-th signal at the m-th array elementm(t) is additive white gaussian noise for the mth array element;
for signal xm(t) S-transform, the result ST of the transformmCan be expressed as:
Figure FDA0003305694690000012
τ is a time factor, fi(I ═ 1,2, …, I) is the frequency component, and the above equation is written as a fourier spectrum:
Figure FDA0003305694690000013
Figure FDA0003305694690000014
in order to be a component of the signal,
Figure FDA0003305694690000015
is a noise component.
3. The method for estimating the number of unknown acoustic sources DOA based on S transformation as claimed in claim 2, wherein the step 3) of constructing a time-frequency array data model under S transformation, and the step of obtaining the time-frequency domain steering vector comprises:
assuming that the center frequency and the bandwidth of the S transform frequency window satisfy the narrow-band signal condition, the time-frequency array signal model of the M array element receiving signals is expressed as follows:
Figure FDA0003305694690000021
Figure FDA0003305694690000022
is a signal sp(t), P is 1,2, …, S transform result vector of P;
Figure FDA0003305694690000023
is a noise vector;
H(θ,fi)=[h(θ1,fi),h(θ2,fi),…,h(θP,fi)]an array direction matrix with dimension of M multiplied by P;
the steering vector of the array model data is represented as:
h(θp,fi)=[1,exp(-j2πfiτ1p),…,exp(-j2πfiτ(M-1)p)]T
4. the method for estimating the number of unknown acoustic sources DOA based on S transformation as claimed in claim 3, wherein the step 4) of performing correlation operation on the S transformation result to obtain the autocorrelation spectrum covariance matrix of the array signal comprises:
ST (tau, f) is calculated according to the principle of classical multiple signal classification algorithmi) The covariance matrix of (a) is:
Figure FDA0003305694690000024
RX(τ,fi) Is Xfi(τ) a covariance matrix of (τ),
Figure FDA0003305694690000025
for transformed noise variance, σiIs a matrix singular value;
since the received data is finite in length in practice, the maximum likelihood estimate of the data covariance matrix is expressed as:
Figure FDA0003305694690000026
5. the method for estimating the number of unknown acoustic sources DOA based on S transformation as claimed in claim 4, wherein in step 5), the target direction of arrival estimation under the noise condition is constructed according to the time-frequency domain steering vector and the autocorrelation spectrum covariance matrix of the array signal, and the steps are as follows:
for the nth (1, 2, P) signal source, a vector b is definedn(f) Satisfy the requirement of
Figure FDA0003305694690000027
Regardless of the noise term, there is:
Figure FDA0003305694690000031
wherein d isn(f)=RXn(f)hHn,f)bn(f);
Under noisy conditions, the target direction-of-arrival estimate, i.e., the minimization function, is expressed as
Figure FDA0003305694690000032
Wherein d (f) is satisfied
Figure FDA0003305694690000033
6. The method for estimating the number of unknown acoustic sources DOA based on S transformation as recited in claim 5, wherein the objective direction of arrival estimation minimization function is solved in step 6), and the obtained spatial spectrum estimation result is:
s transform to get
Figure FDA0003305694690000034
Substituting the data covariance result based on the S transformation for the time continuum processing result, the spectral estimation formula is:
Figure FDA0003305694690000035
CN202111206753.4A 2021-10-15 2021-10-15 Unknown sound source number DOA estimation method based on S transformation Pending CN113933779A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111206753.4A CN113933779A (en) 2021-10-15 2021-10-15 Unknown sound source number DOA estimation method based on S transformation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111206753.4A CN113933779A (en) 2021-10-15 2021-10-15 Unknown sound source number DOA estimation method based on S transformation

Publications (1)

Publication Number Publication Date
CN113933779A true CN113933779A (en) 2022-01-14

Family

ID=79279841

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111206753.4A Pending CN113933779A (en) 2021-10-15 2021-10-15 Unknown sound source number DOA estimation method based on S transformation

Country Status (1)

Country Link
CN (1) CN113933779A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103091661A (en) * 2013-02-01 2013-05-08 西安科技大学 Broadband signal arriving direction estimation method based on iteration spectral reconfiguration
CN109633558A (en) * 2018-10-25 2019-04-16 上海无线电设备研究所 A kind of DOA estimation algorithm based on polarization time-frequency distributions
WO2021139208A1 (en) * 2020-01-08 2021-07-15 华南理工大学 One-dimensional doa estimation method based on combined signals at specific frequencies

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103091661A (en) * 2013-02-01 2013-05-08 西安科技大学 Broadband signal arriving direction estimation method based on iteration spectral reconfiguration
CN109633558A (en) * 2018-10-25 2019-04-16 上海无线电设备研究所 A kind of DOA estimation algorithm based on polarization time-frequency distributions
WO2021139208A1 (en) * 2020-01-08 2021-07-15 华南理工大学 One-dimensional doa estimation method based on combined signals at specific frequencies

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
田潇潇;刘爱军;李春鹏;: "基于改进S变换的高频地波雷达阵列信号DOA估计", 中国海洋大学学报(自然科学版), no. 02, 15 February 2017 (2017-02-15), pages 55 - 60 *

Similar Documents

Publication Publication Date Title
Diaz-Guerra et al. Robust sound source tracking using SRP-PHAT and 3D convolutional neural networks
Erdogan et al. Improved mvdr beamforming using single-channel mask prediction networks.
CN108375763B (en) Frequency division positioning method applied to multi-sound-source environment
CN109490822B (en) Voice DOA estimation method based on ResNet
CN111239680B (en) Direction-of-arrival estimation method based on differential array
Aroudi et al. Dbnet: Doa-driven beamforming network for end-to-end reverberant sound source separation
Nikunen et al. Multichannel audio separation by direction of arrival based spatial covariance model and non-negative matrix factorization
Ahmad et al. Wideband DOA estimation based on incoherent signal subspace method
Bai et al. Time difference of arrival (TDOA)-based acoustic source localization and signal extraction for intelligent audio classification
Chakrabarty et al. Multi-scale aggregation of phase information for complexity reduction of CNN based DOA estimation
CN110111802A (en) Adaptive dereverberation method based on Kalman filtering
Salvati et al. Two-microphone end-to-end speaker joint identification and localization via convolutional neural networks
Yamaoka et al. CNN-based virtual microphone signal estimation for MPDR beamforming in underdetermined situations
Fu et al. Blind DOA estimation in a reverberant environment based on hybrid initialized multichannel deep 2-D convolutional NMF with feedback mechanism
CN110838303B (en) Voice sound source positioning method using microphone array
Huang et al. A regression approach to speech source localization exploiting deep neural network
Zhao et al. Investigating co-prime microphone arrays for speech direction of arrival estimation
Maazaoui et al. Adaptive blind source separation with HRTFs beamforming preprocessing
CN113933779A (en) Unknown sound source number DOA estimation method based on S transformation
Karimian-Azari et al. Fast joint DOA and pitch estimation using a broadband MVDR beamformer
CN109541572B (en) Subspace orientation estimation method based on linear environment noise model
CN109658944B (en) Helicopter acoustic signal enhancement method and device
Grondin et al. A study of the complexity and accuracy of direction of arrival estimation methods based on GCC-PHAT for a pair of close microphones
Aroudi et al. DBNET: DOA-driven beamforming network for end-to-end farfield sound source separation
Firoozabadi et al. Combination of nested microphone array and subband processing for multiple simultaneous speaker localization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination