CN108364659B - Frequency domain convolution blind signal separation method based on multi-objective optimization - Google Patents

Frequency domain convolution blind signal separation method based on multi-objective optimization Download PDF

Info

Publication number
CN108364659B
CN108364659B CN201810112970.9A CN201810112970A CN108364659B CN 108364659 B CN108364659 B CN 108364659B CN 201810112970 A CN201810112970 A CN 201810112970A CN 108364659 B CN108364659 B CN 108364659B
Authority
CN
China
Prior art keywords
matrix
target
vector
separation
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810112970.9A
Other languages
Chinese (zh)
Other versions
CN108364659A (en
Inventor
张伟涛
孙瑾铃
李扬
楼顺天
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xidian University
Original Assignee
Xidian University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xidian University filed Critical Xidian University
Priority to CN201810112970.9A priority Critical patent/CN108364659B/en
Publication of CN108364659A publication Critical patent/CN108364659A/en
Application granted granted Critical
Publication of CN108364659B publication Critical patent/CN108364659B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Multimedia (AREA)
  • Algebra (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Quality & Reliability (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Complex Calculations (AREA)

Abstract

The invention provides a frequency domain convolution blind signal separation method based on multi-objective optimization, which is used for solving the problem that the convergence to a degradation solution is easy in the prior art, and can realize the frequency domain convolution blind signal separation with source signals smaller than the number of observation signals, and the realization steps are as follows: obtaining a set of target matrices
Figure DDA0001569809470000011
(ii) a Constructing a diagonalized matrix B (ω)k) (ii) a Constructing a non-orthogonal joint diagonalization multi-objective optimization model; using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrix
Figure DDA0001569809470000012
Separation matrix W (omega) on each frequency pointk) Carrying out estimation; and acquiring an estimated value of the time domain source signal. The invention has high reliability and wide application range, and can be applied to blind separation of convolution mixed signals such as voice signals, communication signals and the like under an overdetermined condition.

Description

Frequency domain convolution blind signal separation method based on multi-objective optimization
Technical Field
The invention belongs to the technical field of blind signal processing, relates to a frequency domain convolution blind signal separation method, in particular to a frequency domain convolution blind signal separation method based on multi-objective optimization joint diagonalization, and can be applied to blind separation of convolution mixed signals such as voice signals, communication signals and the like under an overdetermined condition.
Background
The objective optimization problem generally refers to obtaining an optimal solution of an objective function through a certain optimization algorithm. When the optimized objective function is one, it is called Single-object Optimization (SOP). When there are two or more optimized objective functions, it is called Multi-objective Optimization (MOP). Unlike the solution of single-objective optimization which is a finite solution, the solution of multi-objective optimization is usually a set of equilibrium solutions.
In signal processing problems such as wireless communication, radar, sonar and the like, a problem of recovering a source signal from a plurality of observation signals often exists, and a blind signal separation technology provides a potential solution for the problems. Early studies of the blind signal separation problem focused on relatively simple transient mixing situations, but in practical applications, such as the "cocktail party" problem, the observed mixed speech signal was actually a convolutional mixed speech signal, taking into account the multipath effects of sound propagation.
The existing blind separation method of the convolution mixed speech signal is mainly divided into a frequency domain method and a time domain method, the time domain method generally adopts a method of performing joint block diagonalization on a correlation matrix to estimate a separation matrix, and the method has the defects of large calculation amount and often causing the problem of high-dimensional joint block diagonalization, for example, the calculation becomes difficult under the high-order convolution mixing (severe reverberation environment).
In the frequency domain, a method for performing joint diagonalization estimation on a power spectral density matrix to obtain a separation matrix is generally adopted, and the method has the problems of easiness in convergence to a degenerate solution, requirement on a mixed matrix to be a square matrix, uncertain sequencing and the like. This greatly limits the application of this method to the separation of convolved blind signals. Joint diagonalization algorithms are further classified into orthogonal joint diagonalization algorithms and non-orthogonal joint diagonalization algorithms, the orthogonal joint diagonalization algorithms require that a separation matrix must be an orthogonal matrix, and although the separation matrix can meet an orthogonality condition through whitening processing in many cases, the whitening processing introduces additional errors, which results in poor separation performance. In order to avoid the separation performance deterioration caused by the error introduced by the whitening process, a non-orthogonal joint diagonalization algorithm which does not require the separation matrix to be an orthogonal matrix is frequently used at present.
The application research of the non-orthogonal joint diagonalization algorithm is still in a primary stage, the existing NOODLES method, QDIAG method and ACDC method have the problems that convergence to a degenerate solution is easy to happen, and the reliability of separation is poor. Although the J-Di method, the FFDIAG method and the Jacobilike method avoid the problem of easy convergence to a degradation solution, the frequency domain convolution blind signal separation with the same number of source signals and observation signals can be realized only because the separation matrix is limited to a square matrix, and the application range is limited.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, provides a frequency domain convolution blind signal separation method based on multi-objective optimization, is used for solving the problem that convergence to a degradation solution is easy in the prior art, and can realize frequency domain convolution blind signal separation with source signals smaller than the number of observation signals.
The technical idea of the invention is as follows: transforming the observed time domain convolution mixed signal into a transient mixed model of a frequency domain, estimating a separation matrix of each frequency point by using a multi-objective optimization non-orthogonal joint diagonalization algorithm, recovering a source signal in the frequency domain by using the separation matrix, and obtaining a source signal time domain waveform after Fourier inversion, wherein the specific implementation steps are as follows:
(1) obtaining a set of target matrices
Figure GDA0002984466060000021
(1a) M sensors receive observed signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWherein N is more than or equal to 1, M is more than or equal to N, M represents a sensor serial number, and M is 1.
(1b) Dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices
Figure GDA0002984466060000022
Figure GDA0002984466060000023
Wherein, R (k, q) represents a target matrix on the kth frequency point of the qth section of the observed signal sub vectorK represents the sequence number of the target matrix calculated by each observation signal sub-vector, q represents the sequence number of the observation signal sub-vector, and K represents the number of the target matrix calculated by each observation signal sub-vector;
(2) constructing a diagonalized matrix B (ω)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matrices
Figure GDA0002984466060000024
The kth frequency point;
(3) constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
Figure GDA0002984466060000031
wherein, bnRepresents the diagonalized matrix B (ω)k) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
(4) using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrix
Figure GDA0002984466060000036
Separation matrix W (omega) on each frequency pointk) And (3) estimating:
(4a) setting a set of target matrices
Figure GDA0002984466060000037
The diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting a condition number threshold to be psi and an iteration stop condition threshold to be lambda, and making k equal to 1, where I represents an N × N dimensional identity matrix, [ ·]TA transpose operation representing a matrix;
(4b)for the target matrix set
Figure GDA0002984466060000038
Separation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
(4b.2) calculating the Hessian matrix QnAnd orthogonal projection matrix
Figure GDA0002984466060000032
Figure GDA0002984466060000033
Figure GDA0002984466060000034
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If yes, executing step (4b.5), otherwise executing step (4 b.4);
(4b.4) computing the matrix pair (
Figure GDA0002984466060000035
Qn) And the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And performing step (4 b.7);
(4b.5) calculating the intermediate matrix C:
Figure GDA0002984466060000041
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If yes, executing step (4b.1), otherwise executing step (4 b.9);
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WHk-1) And executing the step (4b), otherwise executing the step (5);
(5) obtaining an estimated value of a time domain source signal:
(5a) calculating the estimated value of the source signal vector on the kth frequency point of the qth section
Figure GDA0002984466060000044
Figure GDA0002984466060000042
Wherein x (k, q) represents an observed signal vector on the kth frequency point of the qth section;
(5b) to pair
Figure GDA0002984466060000043
And performing inverse Fourier transform to obtain a time domain source signal estimation value, and realizing the separation of the frequency domain convolution blind signals.
Compared with the prior art, the invention has the following advantages:
(1) when the separation matrix is estimated, the multi-objective optimization non-orthogonal joint diagonalization model is adopted, the condition number of the diagonalization matrix is considered, the problem that convergence to a degradation solution is easy to occur is avoided, and compared with the prior art, the reliability of the separation of the convolution blind signals is improved.
(2) The method changes the constraint of the diagonalization matrix into the constraint of the product of the conjugate transpose of the diagonalization matrix and the diagonalization matrix, eliminates the limitation that the separation matrix is a square matrix, can realize the frequency domain convolution blind signal separation of the source signals which are less than or equal to the number of the observation signals, and has wider application range while avoiding easy convergence to a degradation solution compared with the prior art.
Drawings
FIG. 1 is a flow chart of an implementation of the present invention;
FIG. 2 is a pair of sheets of paper of the present invention
Figure GDA0002984466060000051
Separation matrix W (omega) on each frequency pointk) A flow chart for performing the estimation;
FIG. 3(a) is a waveform diagram of 3 source signals used in the simulation of the present invention;
FIG. 3(b) is a diagram of a signal waveform recovered by the NOODLES method;
FIG. 3(c) is a diagram of a signal waveform recovered by the ACDC method;
fig. 3(d) is a diagram of a signal waveform recovered by the method of the present invention.
Detailed Description
The invention is described in further detail below with reference to the following figures and specific examples:
the embodiment is based on a cocktail party scene, and the speech of the conversation of three persons is separated by using the invention according to the conversation contents of the three persons received from the 4 microphone sensors. In this example, the sensor is a microphone and the received convolved aliased signal is a speech signal.
Referring to fig. 1, a frequency domain convolution blind signal separation method based on multi-objective optimization includes the following steps:
step 1) obtaining a target matrix set
Figure GDA0002984466060000054
(1a) M electrical signal sensors receive observation signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWhere N ≧ 1, and M ≧ N, M denotes a sensor number, M ═ 1.., M, in the present embodiment, M ═ 4, N ═ 3;
(1b) dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices
Figure GDA0002984466060000052
Figure GDA0002984466060000053
Wherein, R (K, Q) represents a target matrix at the kth frequency point in the qth segment of the observed signal sub-vector, K represents a target matrix number calculated by each observed signal sub-vector, Q represents an observed signal sub-vector number, and K represents the number of target matrices calculated by each observed signal sub-vector, in this embodiment, x (t) is divided into 20 observed signal sub-vectors:
(1b.1) to the observed Signal xm(t) performing short-time fourier transform to obtain a frequency-domain mixed signal vector x (K, Q), where K denotes a target matrix number calculated for each observation signal sub-vector, Q denotes an observation signal sub-vector number, and K is 1,..,. K, Q is 1,..,. Q, and K denotes a number of fourier transform frequency points, which is equal to the number of target matrices calculated for each observation signal sub-vector, and in this embodiment, K is 256:
(1b.11) calculating a K-point discrete windowed Fourier transform:
Figure GDA0002984466060000061
wherein, the subscript i represents the frame number, the superscript m represents the Fourier transform of the mth path of observation signal,
Figure GDA0002984466060000062
a q-th section representing the m-th observation signal, the window function length is K, the window function slides along the forward direction, the sliding distance of adjacent frames is d ═ 1-mu) K, mu is an overlapping factor of two adjacent frames, and mu is generally 50%;
(1b.12) combining the frame spectra of the m-th mixed signal into a vector
Figure GDA0002984466060000063
Wherein I represents the frame number of the q-th section of discrete windowed Fourier transform;
(1b.13) Using vector xm(k, q) constructing a frequency domain mixed signal vector x (k, q) ═ x1(k,q),...,xM(k,q)]T
(1b.2) calculating the Power spectral Density matrix Rx(k,q):
Rx(k,q)=E[x(k,q)xH(k,q)];
(1b.3) estimation of noise variance σ by principal component analysis2
(1b.4) calculating a target matrix R (k, q) on the kth frequency point of the qth section of the observation signal sub vector, wherein R (k, q) is Rx(k,q)-σ2I, wherein I represents an identity matrix.
Step 2) constructing a diagonalized matrix B (omega)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matrices
Figure GDA0002984466060000065
The kth frequency point;
step 3), constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
Figure GDA0002984466060000064
wherein, bnPresentation pairThe matrix B (ω) is formed by anglek) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
step 4) utilizing a non-orthogonal joint diagonalization multi-target optimization model to perform target matrix set
Figure GDA0002984466060000078
Separation matrix W (omega) on each frequency pointk) The estimation is carried out, and the implementation process of the estimation is shown in fig. 2:
(4a) setting a set of target matrices
Figure GDA0002984466060000079
The diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting the condition number threshold to be psi, the order of magnitude of psi typically being 103The iteration stop condition threshold is set to λ, which is typically of the order of 10-2Let k be 1, where I represents an N × N-dimensional identity matrix [ ·]TA transpose operation representing a matrix;
(4b) for the target matrix set
Figure GDA00029844660600000710
Separation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
(4b.2) calculating the Hessian matrix QnAnd orthogonal projection matrix
Figure GDA0002984466060000071
Figure GDA0002984466060000072
Figure GDA0002984466060000073
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If psi is true, the Hessian matrix Q is determinednIs ill-conditioned, step 4b.5) is performed, otherwise step 4b.4) is performed), hessian matrix QnCondition number of (K) (Q)n) The calculation is performed as follows:
Figure GDA0002984466060000074
wherein [ ·]-1Expressing the inversion operation of the matrix, and expressing the norm operation by | DEG |;
(4b.4) computing the matrix pair (
Figure GDA0002984466060000075
Qn) And the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And step 4b.7) is executed, wherein, the feature vector corresponding to the maximum generalized eigenvalue is obtained by the following steps:
for matrix pair (
Figure GDA0002984466060000076
Qn) Performing generalized eigenvalue decomposition to obtain a matrix pair (
Figure GDA0002984466060000077
Qn) And a matrix V consisting of a diagonal matrix D of generalized eigenvalues and eigenvectors corresponding to the generalized eigenvalues, and the first column of V being a matrix pair: (
Figure GDA0002984466060000081
Qn) The maximum generalized eigenvalue corresponding eigenvector, the generalized eigenvalue decomposition formula is:
Figure GDA0002984466060000082
wherein, eig (·) represents generalized eigenvalue decomposition operation;
(4b.5) calculating the intermediate matrix C:
Figure GDA0002984466060000083
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If λ is true, if yes, step (4b.1) is performed, otherwise step (4b.9) is performed, where B (ω) is considered at the first iteration0) Is a zero matrix;
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WHk-1) And step (4B) is performed, otherwise step (5), B (ω), is performedk)=WHk-1)=B(ωk-1) Taking the iteration result of the diagonalized matrix B as the initial value of the next iteration, wherein the step aims to solve the sequencing problem which can occur in frequency domain separation;
step 5) obtaining an estimated value of the time domain source signal:
(5a) calculating the source signal direction of the kth frequency point of the qth sectionQuantity estimation
Figure GDA0002984466060000086
Figure GDA0002984466060000084
Wherein x (k, q) represents an observed signal vector at the kth frequency point of the qth segment, and W (omega)k) Representing a separation matrix;
(5b) to pair
Figure GDA0002984466060000085
And performing inverse Fourier transform to obtain a time domain source signal estimation value, and realizing the separation of the frequency domain convolution blind signals.
The technical effects of the present invention will be further explained by simulation experiments.
1. Simulation conditions and contents:
simulation conditions are as follows: MATLAB (R2013a), Intel (R) core (TM) i7-2600CPU 6503.40 GHz, Window 7 Professional.
Simulation content: the source signals are N-3 sinusoidal signals with different frequencies, Gaussian white noise is superposed on the sinusoidal signals, the SNR (signal to noise ratio) is set to be 10dB, and an 8-tap FIR (finite impulse response) filter is used for establishing a convolution mixed model. 20000 sample points are acquired from the three source signals, and mixed signals are acquired by using M ═ 4 receiving sensors, wherein elements of the mixed matrix A are randomly generated and obey standard normal distribution. The performance of the blind signal separation method is measured by a signal-to-interference ratio (SIR), the larger the SIR, the better the blind separation performance, and the SIR is defined as:
Figure GDA0002984466060000091
wherein, G (ω)k)=W(ωk)A(ωk) In order to be a frequency domain global transformation matrix,
Figure GDA0002984466060000092
gnjk) Is a matrix G (omega)k) Row n and column j.
2. And (3) simulation results:
the waveform diagrams of the 3 source signals used in the simulation of the present invention are shown in fig. 3 (a). The method of the present invention (JD-NS) is now compared to two other methods, one of which is an alternating column update diagonalization (ACDC) based method and the other of which is a non-orthonormal Jacobian approximation joint diagonalization (NOODLES) method. The three recovered source signals separated by the NOODLES method are shown in FIG. 3(b), and the three recovered source signals separated by the ACDC method are shown in FIG. 3 (c).
It can be seen that the three signals recovered using the ACDC method are all similar to the second source signal in the source signal waveform diagram, indicating that the ACDC method has in fact converged to a degenerate solution. The method of the invention can effectively recover all source signals, and the recovered source signals do not contain components of other source signals, which shows better separation effect.
Table 1 summarizes the SIR performance of the source signal recovered by the method of the present invention and the nodles method, when SNR is 10dB, 100 independent experiments are performed. It can be seen that the method of the present invention is superior to the NOODLES method in terms of SIR performance of three recovered source signals, and has higher reliability of convolution blind signal separation than the NOODLES method.
TABLE 1
Figure GDA0002984466060000093

Claims (4)

1. A frequency domain convolution blind signal separation method based on multi-objective optimization is characterized by comprising the following steps:
(1) obtaining a set of target matrices
Figure FDA0002984466050000012
(1a) M sensors receive observed signals x from N source signal sensorsm(t) forming an observed signal vector x (t), x (t) x [ x ]1(t),...,xM(t)]TWherein N is more than or equal to 1, M is more than or equal to N, and M represents transmissionA sensor number, M ═ 1.., M;
(1b) dividing x (t) to obtain Q observation signal sub-vectors, and calculating a target matrix through each observation signal sub-vector to obtain a target matrix set consisting of Q multiplied by K target matrices
Figure FDA0002984466050000013
Figure FDA0002984466050000015
Wherein R (K, q) represents a target matrix on a kth frequency point of a qth section of an observation signal sub-vector, K represents a target matrix serial number calculated by each observation signal sub-vector, q represents an observation signal sub-vector serial number, and K represents the number of the target matrices calculated by each observation signal sub-vector;
(2) constructing a diagonalized matrix B (ω)k):
Constructing a diagonalized matrix B (ω) with dimension M Nk) Wherein, ω iskRepresenting a set of target matrices
Figure FDA0002984466050000016
The kth frequency point;
(3) constructing a non-orthogonal joint diagonalization multi-objective optimization model:
using R (k, q) and B (ω)k) Constructing a non-orthogonal joint diagonalization multi-objective optimization model:
Figure FDA0002984466050000011
wherein, bnRepresents the diagonalized matrix B (ω)k) Min represents the minimize operation, max represents the maximize operation, Off (-) represents the diagonal operation of the nulling matrix, (·)HRepresenting the complex conjugate operation of the matrix, det (-) represents the determinant operation of the matrix;
(4) using a non-orthogonal joint diagonalization multi-target optimization model to set a target matrix
Figure FDA0002984466050000014
Separation matrix W (omega) on each frequency pointk) And (3) estimating:
(4a) setting a set of target matrices
Figure FDA0002984466050000024
The diagonalized matrix for the first bin has an initial value of B (ω)1)=[I,0]TSetting a condition number threshold to be psi and an iteration stop condition threshold to be lambda, and making k equal to 1, where I represents an N × N dimensional identity matrix, [ ·]TA transpose operation representing a matrix;
(4b) for the target matrix set
Figure FDA0002984466050000025
Separation matrix W (omega) on the k-th frequency pointk) And (3) estimating:
(4b.1) making n 1;
(4b.2) calculating the Hessian matrix QnAnd orthogonal projection matrix
Figure FDA0002984466050000026
Figure FDA0002984466050000021
Figure FDA0002984466050000022
Wherein, BnRepresents the diagonalized matrix B (ω)k) Deleting the matrix formed by the residual column vectors after the nth column, wherein I represents a unit matrix [ ·]-1An inversion operation of the representation matrix;
(4b.3) calculating the Hessian matrix QnCondition number of (K) (Q)n) And determining kappa (Q)n) If yes, executing step 4b.5), otherwise executing step 4 b.4);
(4b.4) computing the matrix pairs
Figure FDA0002984466050000027
And the eigenvector corresponding to the largest generalized eigenvalue is taken as the diagonalized matrix B (omega)k) And step 4b.7 is performed);
(4b.5) calculating the intermediate matrix C:
Figure FDA0002984466050000023
wherein, U0Representation matrix QnThe eigenvector matrix corresponding to the M-N +1 minimum eigenvalues;
(4b.6) computing the diagonalized matrix B (ω)k) N column vector bnVector value of (d):
bn=U0w
wherein w represents the eigenvector corresponding to the maximum eigenvalue of the intermediate matrix C;
(4b.7) making N equal to N +1, and judging whether N is less than or equal to N, if so, executing the step (4b.2), otherwise, executing the step (4 b.8);
(4b.8) calculating a cost function J (B (ω)k) And | J (B (ω)k))-J(B(ωk-1) If yes, executing step (4b.1), otherwise executing step (4 b.9);
(4b.9) diagonal matrix B (ω)k) Taking complex conjugation to obtain a separation matrix W (omega)k);
(4c) Let K be K +1, and judge whether K ≦ K holds, if yes, let B (ω ≦ K, let B (ω) bek)=WHk-1) And executing the step (4b), otherwise executing the step (5);
(5) obtaining an estimated value of a time domain source signal:
(5a) calculating the estimated value of the source signal vector on the kth frequency point of the qth section
Figure FDA0002984466050000031
Figure FDA0002984466050000032
Wherein x (k, q) represents an observed signal vector on the kth frequency point of the qth section;
(5b) to pair
Figure FDA0002984466050000033
And performing inverse Fourier transform to obtain a time domain source signal estimation value, and realizing the separation of the frequency domain convolution blind signals.
2. The frequency-domain convolution blind signal separation method based on multi-objective optimization according to claim 1, wherein R (k, q) in step (1b) is calculated by:
(1b.1) to the observed Signal xm(t) performing short-time Fourier transform to obtain a frequency domain mixed signal vector x (K, Q), wherein K represents a target matrix serial number calculated by each observation signal sub-vector, Q represents an observation signal sub-vector serial number, and K is 1, a.
(1b.2) calculating the Power spectral Density matrix Rx(k,q):
Rx(k,q)=E[x(k,q)xH(k,q)];
(1b.3) estimation of noise variance σ by principal component analysis2
(1b.4) calculating a target matrix R (k, q) on the kth frequency point of the qth section of the observation signal sub vector, wherein R (k, q) is Rx(k,q)-σ2I, wherein I represents an identity matrix.
3. The method for frequency-domain blind convolutional signal separation based on multi-objective optimization of claim 1, wherein said step (4b.3) of computing hessian matrix QnCondition number of (K) (Q)n) The following is calculated:
Figure FDA0002984466050000041
wherein [ ·]-1The inverse operation of the matrix is represented, and | | · | | represents the norm operation.
4. The method for separating frequency-domain convolution blind signals based on multi-objective optimization according to claim 1, wherein the obtaining step of the eigenvector corresponding to the maximum generalized eigenvalue in step (4b.4) is:
to matrix pair
Figure FDA0002984466050000042
Carrying out generalized eigenvalue decomposition to obtain matrix pairs
Figure FDA0002984466050000043
A diagonal matrix D formed by generalized eigenvalues and a matrix V formed by eigenvectors corresponding to the generalized eigenvalues, and the first column of V is taken as a matrix pair
Figure FDA0002984466050000044
The maximum generalized eigenvalue corresponding eigenvector, the generalized eigenvalue decomposition formula is:
Figure FDA0002984466050000045
wherein, eig (·) represents generalized eigenvalue decomposition operation.
CN201810112970.9A 2018-02-05 2018-02-05 Frequency domain convolution blind signal separation method based on multi-objective optimization Active CN108364659B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810112970.9A CN108364659B (en) 2018-02-05 2018-02-05 Frequency domain convolution blind signal separation method based on multi-objective optimization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810112970.9A CN108364659B (en) 2018-02-05 2018-02-05 Frequency domain convolution blind signal separation method based on multi-objective optimization

Publications (2)

Publication Number Publication Date
CN108364659A CN108364659A (en) 2018-08-03
CN108364659B true CN108364659B (en) 2021-06-01

Family

ID=63004763

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810112970.9A Active CN108364659B (en) 2018-02-05 2018-02-05 Frequency domain convolution blind signal separation method based on multi-objective optimization

Country Status (1)

Country Link
CN (1) CN108364659B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110890098B (en) * 2018-09-07 2022-05-10 南京地平线机器人技术有限公司 Blind signal separation method and device and electronic equipment
CN109616138B (en) * 2018-12-27 2020-05-19 山东大学 Voice signal blind separation method based on segmented frequency point selection and binaural hearing aid system
CN110236589B (en) * 2019-06-03 2022-04-29 苏州美糯爱医疗科技有限公司 Real-time heart-lung sound automatic separation method of electronic stethoscope
CN111106866B (en) * 2019-12-13 2021-09-21 南京理工大学 Satellite-borne AIS/ADS-B collision signal separation method based on hessian matrix pre-estimation
CN111179960B (en) * 2020-03-06 2022-10-18 北京小米松果电子有限公司 Audio signal processing method and device and storage medium
CN111415676B (en) * 2020-03-10 2022-10-18 山东大学 Blind source separation method and system based on separation matrix initialization frequency point selection
TWI809390B (en) * 2021-03-01 2023-07-21 新加坡商台達電子國際(新加坡)私人有限公司 Method and audio processing system for blind source separation without sampling rate mismatch estimation
CN115372902B (en) * 2022-08-05 2023-12-01 中国人民解放军战略支援部队信息工程大学 TDOA bias reduction positioning method based on underwater multi-base sonar
CN116866116B (en) * 2023-07-13 2024-02-27 中国人民解放军战略支援部队航天工程大学 Time-delay mixed linear blind separation method
CN116866123B (en) * 2023-07-13 2024-04-30 中国人民解放军战略支援部队航天工程大学 Convolution blind separation method without orthogonal limitation

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104934041A (en) * 2015-05-07 2015-09-23 西安电子科技大学 Convolutive blind signal separation method based on multi-target optimization joint block diagonalization
CN105446941A (en) * 2015-11-11 2016-03-30 西安电子科技大学 Joint zero diagonalization based time-frequency domain blind signal separation method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104934041A (en) * 2015-05-07 2015-09-23 西安电子科技大学 Convolutive blind signal separation method based on multi-target optimization joint block diagonalization
CN105446941A (en) * 2015-11-11 2016-03-30 西安电子科技大学 Joint zero diagonalization based time-frequency domain blind signal separation method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
A Novel Joint Block Diagonalization Algorithm for Convolutive BSS with Limited Constraint;ZHANG Wei Tao;《中国科技论文在线》;20151231;全文 *
Nonorthogonal Approximate Joint Diagonalization;Guoxu Zhou,Shengli Xie,Zuyuan Yang,Jun Zhang;《IEEE Trans. Neural Networks》;20091130;第20卷(第11期);全文 *
非对称非正交快速联合对角化算法;张伟涛,楼顺天,张延良;《自动化学报》;20100630;第36卷(第6期);全文 *

Also Published As

Publication number Publication date
CN108364659A (en) 2018-08-03

Similar Documents

Publication Publication Date Title
CN108364659B (en) Frequency domain convolution blind signal separation method based on multi-objective optimization
CN108986838B (en) Self-adaptive voice separation method based on sound source positioning
JP4268524B2 (en) Blind signal separation
CN109243483B (en) Method for separating convolution blind source of noisy frequency domain
US7603401B2 (en) Method and system for on-line blind source separation
CN110010148B (en) Low-complexity frequency domain blind separation method and system
CN109616138B (en) Voice signal blind separation method based on segmented frequency point selection and binaural hearing aid system
Wang et al. A region-growing permutation alignment approach in frequency-domain blind source separation of speech mixtures
Aichner et al. Time domain blind source separation of non-stationary convolved signals by utilizing geometric beamforming
CN111986695A (en) Non-overlapping sub-band division fast independent vector analysis voice blind separation method and system
CN111816200B (en) Multi-channel speech enhancement method based on time-frequency domain binary mask
Aichner et al. On-line time-domain blind source separation of nonstationary convolved signals
Douglas Blind separation of acoustic signals
WO2001017109A1 (en) Method and system for on-line blind source separation
CN112201276B (en) TC-ResNet network-based microphone array voice separation method
Jo et al. Robust Blind Multichannel Identification based on a Phase Constraint and Different ℓ p-norm Constraints
Houda et al. Blind audio source separation: state-of-art
Mallis et al. Convolutive audio source separation using robust ICA and an intelligent evolving permutation ambiguity solution
CN106249204B (en) Multichannel delay time estimation method based on robust adaptive blind identification
Mazur et al. Using the scaling ambiguity for filter shortening in convolutive blind source separation
Mazur et al. A new clustering approach for solving the permutation problem in convolutive blind source separation
Markhi et al. An improved cyclic beamforming method for signal DOA estimation
Girona Underdetermined Blind Separation of Delayed Sound Sources in the Frequency Domain
Lin et al. A survey of semi-blind ICA for speech separation in frequency domain
Pukenas Three-mode biomedical signal denoising in the local phase space based on a tensor approach

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant