CN108364659B - Frequency domain convolution blind signal separation method based on multi-objective optimization - Google Patents
Frequency domain convolution blind signal separation method based on multi-objective optimization Download PDFInfo
- Publication number
- CN108364659B CN108364659B CN201810112970.9A CN201810112970A CN108364659B CN 108364659 B CN108364659 B CN 108364659B CN 201810112970 A CN201810112970 A CN 201810112970A CN 108364659 B CN108364659 B CN 108364659B
- Authority
- CN
- China
- Prior art keywords
- matrix
- target
- vector
- separation
- signal
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000926 separation method Methods 0.000 title claims abstract description 54
- 238000005457 optimization Methods 0.000 title claims abstract description 29
- 239000011159 matrix material Substances 0.000 claims abstract description 156
- 239000013598 vector Substances 0.000 claims description 48
- 238000000034 method Methods 0.000 claims description 36
- 238000000354 decomposition reaction Methods 0.000 claims description 6
- 230000021615 conjugation Effects 0.000 claims description 3
- 230000003595 spectral effect Effects 0.000 claims description 3
- 238000000513 principal component analysis Methods 0.000 claims description 2
- 230000015556 catabolic process Effects 0.000 abstract description 5
- 238000006731 degradation reaction Methods 0.000 abstract description 5
- 238000004891 communication Methods 0.000 abstract description 3
- 238000004422 calculation algorithm Methods 0.000 description 8
- 238000004088 simulation Methods 0.000 description 7
- 102100031786 Adiponectin Human genes 0.000 description 6
- 101000775469 Homo sapiens Adiponectin Proteins 0.000 description 6
- 238000010586 diagram Methods 0.000 description 6
- 235000012149 noodles Nutrition 0.000 description 6
- 238000012545 processing Methods 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000002087 whitening effect Effects 0.000 description 3
- 230000007547 defect Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 230000006866 deterioration Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Computational Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Multimedia (AREA)
- Algebra (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Complex Calculations (AREA)
Abstract
The invention provides a frequency domain convolution blind signal separation method based on multi-objective optimization, which is used for solving the problem that the convergence to a degradation solution is easy in the prior art, and can realize the frequency domain convolution blind signal separation with source signals smaller than the number of observation signals, and the realization steps are as follows: obtaining a set of target matrices(ii) a Constructing a diagonalized matrix B (ω)k) (ii) a Constructing a non-orthogonal joint diagonalization multi-objective optimization model; using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrixSeparation matrix W (omega) on each frequency pointk) Carrying out estimation; and acquiring an estimated value of the time domain source signal. The invention has high reliability and wide application range, and can be applied to blind separation of convolution mixed signals such as voice signals, communication signals and the like under an overdetermined condition.
Description
Technical Field
The invention belongs to the technical field of blind signal processing, relates to a frequency domain convolution blind signal separation method, in particular to a frequency domain convolution blind signal separation method based on multi-objective optimization joint diagonalization, and can be applied to blind separation of convolution mixed signals such as voice signals, communication signals and the like under an overdetermined condition.
Background
The objective optimization problem generally refers to obtaining an optimal solution of an objective function through a certain optimization algorithm. When the optimized objective function is one, it is called Single-object Optimization (SOP). When there are two or more optimized objective functions, it is called Multi-objective Optimization (MOP). Unlike the solution of single-objective optimization which is a finite solution, the solution of multi-objective optimization is usually a set of equilibrium solutions.
In signal processing problems such as wireless communication, radar, sonar and the like, a problem of recovering a source signal from a plurality of observation signals often exists, and a blind signal separation technology provides a potential solution for the problems. Early studies of the blind signal separation problem focused on relatively simple transient mixing situations, but in practical applications, such as the "cocktail party" problem, the observed mixed speech signal was actually a convolutional mixed speech signal, taking into account the multipath effects of sound propagation.
The existing blind separation method of the convolution mixed speech signal is mainly divided into a frequency domain method and a time domain method, the time domain method generally adopts a method of performing joint block diagonalization on a correlation matrix to estimate a separation matrix, and the method has the defects of large calculation amount and often causing the problem of high-dimensional joint block diagonalization, for example, the calculation becomes difficult under the high-order convolution mixing (severe reverberation environment).
In the frequency domain, a method for performing joint diagonalization estimation on a power spectral density matrix to obtain a separation matrix is generally adopted, and the method has the problems of easiness in convergence to a degenerate solution, requirement on a mixed matrix to be a square matrix, uncertain sequencing and the like. This greatly limits the application of this method to the separation of convolved blind signals. Joint diagonalization algorithms are further classified into orthogonal joint diagonalization algorithms and non-orthogonal joint diagonalization algorithms, the orthogonal joint diagonalization algorithms require that a separation matrix must be an orthogonal matrix, and although the separation matrix can meet an orthogonality condition through whitening processing in many cases, the whitening processing introduces additional errors, which results in poor separation performance. In order to avoid the separation performance deterioration caused by the error introduced by the whitening process, a non-orthogonal joint diagonalization algorithm which does not require the separation matrix to be an orthogonal matrix is frequently used at present.
The application research of the non-orthogonal joint diagonalization algorithm is still in a primary stage, the existing NOODLES method, QDIAG method and ACDC method have the problems that convergence to a degenerate solution is easy to happen, and the reliability of separation is poor. Although the J-Di method, the FFDIAG method and the Jacobilike method avoid the problem of easy convergence to a degradation solution, the frequency domain convolution blind signal separation with the same number of source signals and observation signals can be realized only because the separation matrix is limited to a square matrix, and the application range is limited.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, provides a frequency domain convolution blind signal separation method based on multi-objective optimization, is used for solving the problem that convergence to a degradation solution is easy in the prior art, and can realize frequency domain convolution blind signal separation with source signals smaller than the number of observation signals.
The technical idea of the invention is as follows: transforming the observed time domain convolution mixed signal into a transient mixed model of a frequency domain, estimating a separation matrix of each frequency point by using a multi-objective optimization non-orthogonal joint diagonalization algorithm, recovering a source signal in the frequency domain by using the separation matrix, and obtaining a source signal time domain waveform after Fourier inversion, wherein the specific implementation steps are as follows:
(1a) M sensors receive observed signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWherein N is more than or equal to 1, M is more than or equal to N, M represents a sensor serial number, and M is 1.
(1b) Dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices Wherein, R (k, q) represents a target matrix on the kth frequency point of the qth section of the observed signal sub vectorK represents the sequence number of the target matrix calculated by each observation signal sub-vector, q represents the sequence number of the observation signal sub-vector, and K represents the number of the target matrix calculated by each observation signal sub-vector;
(2) constructing a diagonalized matrix B (ω)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matricesThe kth frequency point;
(3) constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
wherein, bnRepresents the diagonalized matrix B (ω)k) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
(4) using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrixSeparation matrix W (omega) on each frequency pointk) And (3) estimating:
(4a) setting a set of target matricesThe diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting a condition number threshold to be psi and an iteration stop condition threshold to be lambda, and making k equal to 1, where I represents an N × N dimensional identity matrix, [ ·]TA transpose operation representing a matrix;
(4b)for the target matrix setSeparation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If yes, executing step (4b.5), otherwise executing step (4 b.4);
(4b.4) computing the matrix pair (Qn) And the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And performing step (4 b.7);
(4b.5) calculating the intermediate matrix C:
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If yes, executing step (4b.1), otherwise executing step (4 b.9);
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WH(ωk-1) And executing the step (4b), otherwise executing the step (5);
(5) obtaining an estimated value of a time domain source signal:
(5a) calculating the estimated value of the source signal vector on the kth frequency point of the qth section
Wherein x (k, q) represents an observed signal vector on the kth frequency point of the qth section;
(5b) to pairAnd performing inverse Fourier transform to obtain a time domain source signal estimation value, and realizing the separation of the frequency domain convolution blind signals.
Compared with the prior art, the invention has the following advantages:
(1) when the separation matrix is estimated, the multi-objective optimization non-orthogonal joint diagonalization model is adopted, the condition number of the diagonalization matrix is considered, the problem that convergence to a degradation solution is easy to occur is avoided, and compared with the prior art, the reliability of the separation of the convolution blind signals is improved.
(2) The method changes the constraint of the diagonalization matrix into the constraint of the product of the conjugate transpose of the diagonalization matrix and the diagonalization matrix, eliminates the limitation that the separation matrix is a square matrix, can realize the frequency domain convolution blind signal separation of the source signals which are less than or equal to the number of the observation signals, and has wider application range while avoiding easy convergence to a degradation solution compared with the prior art.
Drawings
FIG. 1 is a flow chart of an implementation of the present invention;
FIG. 2 is a pair of sheets of paper of the present inventionSeparation matrix W (omega) on each frequency pointk) A flow chart for performing the estimation;
FIG. 3(a) is a waveform diagram of 3 source signals used in the simulation of the present invention;
FIG. 3(b) is a diagram of a signal waveform recovered by the NOODLES method;
FIG. 3(c) is a diagram of a signal waveform recovered by the ACDC method;
fig. 3(d) is a diagram of a signal waveform recovered by the method of the present invention.
Detailed Description
The invention is described in further detail below with reference to the following figures and specific examples:
the embodiment is based on a cocktail party scene, and the speech of the conversation of three persons is separated by using the invention according to the conversation contents of the three persons received from the 4 microphone sensors. In this example, the sensor is a microphone and the received convolved aliased signal is a speech signal.
Referring to fig. 1, a frequency domain convolution blind signal separation method based on multi-objective optimization includes the following steps:
(1a) M electrical signal sensors receive observation signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWhere N ≧ 1, and M ≧ N, M denotes a sensor number, M ═ 1.., M, in the present embodiment, M ═ 4, N ═ 3;
(1b) dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices Wherein, R (K, Q) represents a target matrix at the kth frequency point in the qth segment of the observed signal sub-vector, K represents a target matrix number calculated by each observed signal sub-vector, Q represents an observed signal sub-vector number, and K represents the number of target matrices calculated by each observed signal sub-vector, in this embodiment, x (t) is divided into 20 observed signal sub-vectors:
(1b.1) to the observed Signal xm(t) performing short-time fourier transform to obtain a frequency-domain mixed signal vector x (K, Q), where K denotes a target matrix number calculated for each observation signal sub-vector, Q denotes an observation signal sub-vector number, and K is 1,..,. K, Q is 1,..,. Q, and K denotes a number of fourier transform frequency points, which is equal to the number of target matrices calculated for each observation signal sub-vector, and in this embodiment, K is 256:
(1b.11) calculating a K-point discrete windowed Fourier transform:
wherein, the subscript i represents the frame number, the superscript m represents the Fourier transform of the mth path of observation signal,a q-th section representing the m-th observation signal, the window function length is K, the window function slides along the forward direction, the sliding distance of adjacent frames is d ═ 1-mu) K, mu is an overlapping factor of two adjacent frames, and mu is generally 50%;
(1b.12) combining the frame spectra of the m-th mixed signal into a vectorWherein I represents the frame number of the q-th section of discrete windowed Fourier transform;
(1b.13) Using vector xm(k, q) constructing a frequency domain mixed signal vector x (k, q) ═ x1(k,q),...,xM(k,q)]T;
(1b.2) calculating the Power spectral Density matrix Rx(k,q):
Rx(k,q)=E[x(k,q)xH(k,q)];
(1b.3) estimation of noise variance σ by principal component analysis2;
(1b.4) calculating a target matrix R (k, q) on the kth frequency point of the qth section of the observation signal sub vector, wherein R (k, q) is Rx(k,q)-σ2I, wherein I represents an identity matrix.
Step 2) constructing a diagonalized matrix B (omega)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matricesThe kth frequency point;
step 3), constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
wherein, bnPresentation pairThe matrix B (ω) is formed by anglek) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
step 4) utilizing a non-orthogonal joint diagonalization multi-target optimization model to perform target matrix setSeparation matrix W (omega) on each frequency pointk) The estimation is carried out, and the implementation process of the estimation is shown in fig. 2:
(4a) setting a set of target matricesThe diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting the condition number threshold to be psi, the order of magnitude of psi typically being 103The iteration stop condition threshold is set to λ, which is typically of the order of 10-2Let k be 1, where I represents an N × N-dimensional identity matrix [ ·]TA transpose operation representing a matrix;
(4b) for the target matrix setSeparation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If psi is true, the Hessian matrix Q is determinednIs ill-conditioned, step 4b.5) is performed, otherwise step 4b.4) is performed), hessian matrix QnCondition number of (K) (Q)n) The calculation is performed as follows:
wherein [ ·]-1Expressing the inversion operation of the matrix, and expressing the norm operation by | DEG |;
(4b.4) computing the matrix pair (Qn) And the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And step 4b.7) is executed, wherein, the feature vector corresponding to the maximum generalized eigenvalue is obtained by the following steps:
for matrix pair (Qn) Performing generalized eigenvalue decomposition to obtain a matrix pair (Qn) And a matrix V consisting of a diagonal matrix D of generalized eigenvalues and eigenvectors corresponding to the generalized eigenvalues, and the first column of V being a matrix pair: (Qn) The maximum generalized eigenvalue corresponding eigenvector, the generalized eigenvalue decomposition formula is:
wherein, eig (·) represents generalized eigenvalue decomposition operation;
(4b.5) calculating the intermediate matrix C:
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If λ is true, if yes, step (4b.1) is performed, otherwise step (4b.9) is performed, where B (ω) is considered at the first iteration0) Is a zero matrix;
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WH(ωk-1) And step (4B) is performed, otherwise step (5), B (ω), is performedk)=WH(ωk-1)=B(ωk-1) Taking the iteration result of the diagonalized matrix B as the initial value of the next iteration, wherein the step aims to solve the sequencing problem which can occur in frequency domain separation;
step 5) obtaining an estimated value of the time domain source signal:
(5a) calculating the source signal direction of the kth frequency point of the qth sectionQuantity estimation
Wherein x (k, q) represents an observed signal vector at the kth frequency point of the qth segment, and W (omega)k) Representing a separation matrix;
(5b) to pairAnd performing inverse Fourier transform to obtain a time domain source signal estimation value, and realizing the separation of the frequency domain convolution blind signals.
The technical effects of the present invention will be further explained by simulation experiments.
1. Simulation conditions and contents:
simulation conditions are as follows: MATLAB (R2013a), Intel (R) core (TM) i7-2600CPU 6503.40 GHz, Window 7 Professional.
Simulation content: the source signals are N-3 sinusoidal signals with different frequencies, Gaussian white noise is superposed on the sinusoidal signals, the SNR (signal to noise ratio) is set to be 10dB, and an 8-tap FIR (finite impulse response) filter is used for establishing a convolution mixed model. 20000 sample points are acquired from the three source signals, and mixed signals are acquired by using M ═ 4 receiving sensors, wherein elements of the mixed matrix A are randomly generated and obey standard normal distribution. The performance of the blind signal separation method is measured by a signal-to-interference ratio (SIR), the larger the SIR, the better the blind separation performance, and the SIR is defined as:
wherein, G (ω)k)=W(ωk)A(ωk) In order to be a frequency domain global transformation matrix,gnj(ωk) Is a matrix G (omega)k) Row n and column j.
2. And (3) simulation results:
the waveform diagrams of the 3 source signals used in the simulation of the present invention are shown in fig. 3 (a). The method of the present invention (JD-NS) is now compared to two other methods, one of which is an alternating column update diagonalization (ACDC) based method and the other of which is a non-orthonormal Jacobian approximation joint diagonalization (NOODLES) method. The three recovered source signals separated by the NOODLES method are shown in FIG. 3(b), and the three recovered source signals separated by the ACDC method are shown in FIG. 3 (c).
It can be seen that the three signals recovered using the ACDC method are all similar to the second source signal in the source signal waveform diagram, indicating that the ACDC method has in fact converged to a degenerate solution. The method of the invention can effectively recover all source signals, and the recovered source signals do not contain components of other source signals, which shows better separation effect.
Table 1 summarizes the SIR performance of the source signal recovered by the method of the present invention and the nodles method, when SNR is 10dB, 100 independent experiments are performed. It can be seen that the method of the present invention is superior to the NOODLES method in terms of SIR performance of three recovered source signals, and has higher reliability of convolution blind signal separation than the NOODLES method.
TABLE 1
Claims (4)
1. A frequency domain convolution blind signal separation method based on multi-objective optimization is characterized by comprising the following steps:
(1a) M sensors receive observed signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWherein N is more than or equal to 1, M is more than or equal to N, and M represents transmissionA sensor number, M ═ 1.., M;
(1b) dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices Wherein R (K, q) represents a target matrix on a kth frequency point of a qth section of an observation signal sub-vector, K represents a target matrix serial number calculated by each observation signal sub-vector, q represents an observation signal sub-vector serial number, and K represents the number of the target matrices calculated by each observation signal sub-vector;
(2) constructing a diagonalized matrix B (ω)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matricesThe kth frequency point;
(3) constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
wherein, bnRepresents the diagonalized matrix B (ω)k) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
(4) using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrixSeparation matrix W (omega) on each frequency pointk) And (3) estimating:
(4a) setting a set of target matricesThe diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting a condition number threshold to be psi and an iteration stop condition threshold to be lambda, and making k equal to 1, where I represents an N × N dimensional identity matrix, [ ·]TA transpose operation representing a matrix;
(4b) for the target matrix setSeparation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If yes, executing step 4b.5), otherwise executing step 4 b.4);
(4b.4) computing the matrix pairsAnd the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And step 4b.7 is performed);
(4b.5) calculating the intermediate matrix C:
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If yes, executing step (4b.1), otherwise executing step (4 b.9);
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WH(ωk-1) And executing the step (4b), otherwise executing the step (5);
(5) obtaining an estimated value of a time domain source signal:
(5a) calculating the estimated value of the source signal vector on the kth frequency point of the qth section
Wherein x (k, q) represents an observed signal vector on the kth frequency point of the qth section;
2. The frequency-domain convolution blind signal separation method based on multi-objective optimization according to claim 1, wherein R (k, q) in step (1b) is calculated by:
(1b.1) to the observed Signal xm(t) performing short-time Fourier transform to obtain a frequency domain mixed signal vector x (K, Q), wherein K represents a target matrix serial number calculated by each observation signal sub-vector, Q represents an observation signal sub-vector serial number, and K is 1, a.
(1b.2) calculating the Power spectral Density matrix Rx(k,q):
Rx(k,q)=E[x(k,q)xH(k,q)];
(1b.3) estimation of noise variance σ by principal component analysis2;
(1b.4) calculating a target matrix R (k, q) on the kth frequency point of the qth section of the observation signal sub vector, wherein R (k, q) is Rx(k,q)-σ2I, wherein I represents an identity matrix.
3. The method for frequency-domain blind convolutional signal separation based on multi-objective optimization of claim 1, wherein said step (4b.3) of computing hessian matrix QnCondition number of (K) (Q)n) The following is calculated:
wherein [ ·]-1The inverse operation of the matrix is represented, and | | · | | represents the norm operation.
4. The method for separating frequency-domain convolution blind signals based on multi-objective optimization according to claim 1, wherein the obtaining step of the eigenvector corresponding to the maximum generalized eigenvalue in step (4b.4) is:
to matrix pairCarrying out generalized eigenvalue decomposition to obtain matrix pairsA diagonal matrix D formed by generalized eigenvalues and a matrix V formed by eigenvectors corresponding to the generalized eigenvalues, and the first column of V is taken as a matrix pairThe maximum generalized eigenvalue corresponding eigenvector, the generalized eigenvalue decomposition formula is:
wherein, eig (·) represents generalized eigenvalue decomposition operation.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810112970.9A CN108364659B (en) | 2018-02-05 | 2018-02-05 | Frequency domain convolution blind signal separation method based on multi-objective optimization |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810112970.9A CN108364659B (en) | 2018-02-05 | 2018-02-05 | Frequency domain convolution blind signal separation method based on multi-objective optimization |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108364659A CN108364659A (en) | 2018-08-03 |
CN108364659B true CN108364659B (en) | 2021-06-01 |
Family
ID=63004763
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810112970.9A Active CN108364659B (en) | 2018-02-05 | 2018-02-05 | Frequency domain convolution blind signal separation method based on multi-objective optimization |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108364659B (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110890098B (en) * | 2018-09-07 | 2022-05-10 | 南京地平线机器人技术有限公司 | Blind signal separation method and device and electronic equipment |
CN109616138B (en) * | 2018-12-27 | 2020-05-19 | 山东大学 | Voice signal blind separation method based on segmented frequency point selection and binaural hearing aid system |
CN110236589B (en) * | 2019-06-03 | 2022-04-29 | 苏州美糯爱医疗科技有限公司 | Real-time heart-lung sound automatic separation method of electronic stethoscope |
CN111106866B (en) * | 2019-12-13 | 2021-09-21 | 南京理工大学 | Satellite-borne AIS/ADS-B collision signal separation method based on hessian matrix pre-estimation |
CN111179960B (en) * | 2020-03-06 | 2022-10-18 | 北京小米松果电子有限公司 | Audio signal processing method and device and storage medium |
CN111415676B (en) * | 2020-03-10 | 2022-10-18 | 山东大学 | Blind source separation method and system based on separation matrix initialization frequency point selection |
TWI809390B (en) * | 2021-03-01 | 2023-07-21 | 新加坡商台達電子國際(新加坡)私人有限公司 | Method and audio processing system for blind source separation without sampling rate mismatch estimation |
CN115372902B (en) * | 2022-08-05 | 2023-12-01 | 中国人民解放军战略支援部队信息工程大学 | TDOA bias reduction positioning method based on underwater multi-base sonar |
CN116866116B (en) * | 2023-07-13 | 2024-02-27 | 中国人民解放军战略支援部队航天工程大学 | Time-delay mixed linear blind separation method |
CN116866123B (en) * | 2023-07-13 | 2024-04-30 | 中国人民解放军战略支援部队航天工程大学 | Convolution blind separation method without orthogonal limitation |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104934041A (en) * | 2015-05-07 | 2015-09-23 | 西安电子科技大学 | Convolutive blind signal separation method based on multi-target optimization joint block diagonalization |
CN105446941A (en) * | 2015-11-11 | 2016-03-30 | 西安电子科技大学 | Joint zero diagonalization based time-frequency domain blind signal separation method |
-
2018
- 2018-02-05 CN CN201810112970.9A patent/CN108364659B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104934041A (en) * | 2015-05-07 | 2015-09-23 | 西安电子科技大学 | Convolutive blind signal separation method based on multi-target optimization joint block diagonalization |
CN105446941A (en) * | 2015-11-11 | 2016-03-30 | 西安电子科技大学 | Joint zero diagonalization based time-frequency domain blind signal separation method |
Non-Patent Citations (3)
Title |
---|
A Novel Joint Block Diagonalization Algorithm for Convolutive BSS with Limited Constraint;ZHANG Wei Tao;《中国科技论文在线》;20151231;全文 * |
Nonorthogonal Approximate Joint Diagonalization;Guoxu Zhou,Shengli Xie,Zuyuan Yang,Jun Zhang;《IEEE Trans. Neural Networks》;20091130;第20卷(第11期);全文 * |
非对称非正交快速联合对角化算法;张伟涛,楼顺天,张延良;《自动化学报》;20100630;第36卷(第6期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN108364659A (en) | 2018-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108364659B (en) | Frequency domain convolution blind signal separation method based on multi-objective optimization | |
CN108986838B (en) | Self-adaptive voice separation method based on sound source positioning | |
JP4268524B2 (en) | Blind signal separation | |
CN109243483B (en) | Method for separating convolution blind source of noisy frequency domain | |
US7603401B2 (en) | Method and system for on-line blind source separation | |
CN110010148B (en) | Low-complexity frequency domain blind separation method and system | |
CN109616138B (en) | Voice signal blind separation method based on segmented frequency point selection and binaural hearing aid system | |
Wang et al. | A region-growing permutation alignment approach in frequency-domain blind source separation of speech mixtures | |
Aichner et al. | Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming | |
CN111986695A (en) | Non-overlapping sub-band division fast independent vector analysis voice blind separation method and system | |
CN111816200B (en) | Multi-channel speech enhancement method based on time-frequency domain binary mask | |
Aichner et al. | On-line time-domain blind source separation of nonstationary convolved signals | |
Douglas | Blind separation of acoustic signals | |
WO2001017109A1 (en) | Method and system for on-line blind source separation | |
CN112201276B (en) | TC-ResNet network-based microphone array voice separation method | |
Jo et al. | Robust Blind Multichannel Identification based on a Phase Constraint and Different ℓ p-norm Constraints | |
Houda et al. | Blind audio source separation: state-of-art | |
Mallis et al. | Convolutive audio source separation using robust ICA and an intelligent evolving permutation ambiguity solution | |
CN106249204B (en) | Multichannel delay time estimation method based on robust adaptive blind identification | |
Mazur et al. | Using the scaling ambiguity for filter shortening in convolutive blind source separation | |
Mazur et al. | A new clustering approach for solving the permutation problem in convolutive blind source separation | |
Markhi et al. | An improved cyclic beamforming method for signal DOA estimation | |
Girona | Underdetermined Blind Separation of Delayed Sound Sources in the Frequency Domain | |
Lin et al. | A survey of semi-blind ICA for speech separation in frequency domain | |
Pukenas | Three-mode biomedical signal denoising in the local phase space based on a tensor approach |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |