CN105550716A - Underdetermined blind source separation method applying multiple constraints - Google Patents

Underdetermined blind source separation method applying multiple constraints Download PDF

Info

Publication number
CN105550716A
CN105550716A CN201610046103.0A CN201610046103A CN105550716A CN 105550716 A CN105550716 A CN 105550716A CN 201610046103 A CN201610046103 A CN 201610046103A CN 105550716 A CN105550716 A CN 105550716A
Authority
CN
China
Prior art keywords
signal
matrix
constraint
separation
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610046103.0A
Other languages
Chinese (zh)
Inventor
王敏
王艳芳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu University of Science and Technology
Original Assignee
Jiangsu University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu University of Science and Technology filed Critical Jiangsu University of Science and Technology
Priority to CN201610046103.0A priority Critical patent/CN105550716A/en
Publication of CN105550716A publication Critical patent/CN105550716A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2134Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on separation criteria, e.g. independent component analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/213Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
    • G06F18/2134Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on separation criteria, e.g. independent component analysis
    • G06F18/21343Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on separation criteria, e.g. independent component analysis using decorrelation or non-stationarity, e.g. minimising lagged cross-correlations
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/02Preprocessing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Radio Transmission System (AREA)

Abstract

The invention discloses an underdetermined blind source separation method applying multiple constraints, which is improved on the basis of a traditional blind source separation method to realize a better signal separating effect. The method disclosed by the invention aims to separate mixed signals and comprises the steps of firstly carrying out equalization and whitening to improve the robustness of initial conditions, carrying out multi-constraint limit on a non-negative matrix factorization algorithm and carrying out optimization on an objective function and further improving the signal separating property through a feedback mechanism finally. Therefore, the method disclosed by the invention has the advantages of good factor interpretability and high separated signal purity.

Description

underdetermined blind source separation method applying multiple constraints
Technical Field
The invention relates to an underdetermined blind source separation method applying multiple constraints, and belongs to the technical field of signal processing.
Background
BSS (blind source separation) is a technique for recovering a source signal only from a signal received by a sensor when the source signal and a mixing method are unknown. The blind source separation when the number of the sensors is less than that of the source signals is called underdetermined blind source separation, is a hot research problem in the field of blind signal processing in recent years, and has wide application prospects in the aspects of voice signal processing, wireless communication, digital image processing and the like.
In the conventional blind signal separation method, the ICA algorithm is developed based on a Principal Component Analysis (PCA) method, and is only applicable to an overdetermined or positive definite hybrid model, and certain assumptions and constraints are necessary. The first non-Negative Matrix Factorization (NMF) proposed by Lee and Seung is an effective method to solve the underdetermined hybrid model. One of the most useful properties of NMF is that the decomposition yields a low rank matrix that expresses significant data with a small number of elements, facilitates physical interpretation, and does not require statistical independence between signals. The blind source separation method is a combination of an objective function and an optimization algorithm, i.e. a determination of the source signal obtained by optimizing the objective function. If constraints are added to the NMF and the objective function is optimized, the decomposition quality is better than the original NMF.
At present, most researches solve the underdetermined blind separation problem by using the sparsity of signals and based on a sparse component analysis algorithm, M.Zibulivsky et al propose a two-step method to estimate a mixing matrix and a source signal by using sparse component analysis, and the quality of the mixed matrix estimation directly influences the subsequent signal separation effect; based on sparse representation, Bofill estimates a mixing matrix and source signals respectively by using a clustering method and a shortest path method, successfully separates six source signals from two observed mixed signals, but has strong requirements on the sparsity of the signals; aiming at the problem that the basic NMF algorithm cannot solve blind source separation under an underdetermined condition, Cichocki et al propose a multilayer NMF algorithm to realize extremely sparse blind separation of each layer of decomposed signals, but the algorithm is relatively complex to realize; the yellow break utilizes a non-negative matrix factorization algorithm (IS-NMF) based on board bin-vegetarian vine divergence to separate and study single-channel music signals, the sparsity of the signals after time-frequency domain transformation IS still not ideal, the estimation error of a mixed matrix IS large, the performance of the whole algorithm IS affected, the sparsity requirement and the algorithm complexity of source signals are reduced, and the separation precision IS improved, so that the yellow break IS a research direction with theoretical significance and economic value.
Disclosure of Invention
The purpose of the invention is as follows: aiming at the problems and the defects in the prior art, the invention provides an underdetermined blind source separation method applying multiple constraints.
The technical scheme is as follows: an underdetermined blind source separation method applying multiple constraints comprises the following steps:
step 1, sampling and quantizing a source signal to be separated to obtain an initial observation signal, and preprocessing the initial observation signal, wherein the method comprises the following steps: carrying out averaging and whitening treatment to obtain an initial mixing matrix;
step 2, firstly carrying out sparse constraint, minimum correlation constraint and determinant constraint processing on the initial mixed matrix obtained in the step 1, and then carrying out optimization processing on a target function based on reconstruction errors to obtain an estimation and initial separation signal of the mixed matrix;
and 3, performing feedback processing on the initial separation signal to obtain a final separation signal.
The step 1 specifically comprises:
step 1-1, sampling and quantizing an original observation signal to obtain a discrete signal s (k), and a source signal s (k) ═ s1(k),s2(k),…,sM(k)]TObtaining an initial observation signal X (k) ═ x through a nonsingular mixing matrix A1(k),x2(k),…,xM(k)]T: x (k) ═ as (k); wherein s isM(k) Is the Mth component, x, of the source signal S (k)M(k) Is the Mth component of the observed signal X (K), K is a time sequence, the superscript T represents the conjugate transpose, M is a positive integer, A is an M × M-dimensional matrix;
step 1-2, sending the observation signal X (k) obtained in the step 1-1 into a preprocessing filter for STFT processing to obtain a frequency spectrum matrix, then solving a power spectrum for the frequency spectrum matrix, and outputting a power spectrum signal matrix Z (k). The method specifically comprises the following steps: the observation signals X (k) are first subjected to a centering pretreatment, i.e.Where E represents the mathematical expectation, followed by a centering pre-processing resultWhitening treatment is carried out to obtain an output power spectrum signal matrix Z (k) of the preprocessing filter: z (k) vx (k), where V is the whitening matrix, making the obtained initialization conditions more robust.
The step 2 specifically comprises the following steps:
the basic idea of NMF is to put the matrix V ═ V under non-negative constraints1,v2,…,vN]∈Rm×NDecomposed into two matrices W ═ 2w1,w2,…,wN]∈Rm×nAnd H ═ H1,h2,…,hN]∈Rn×NThe decomposition result satisfies V ≈ WH as much as possible, wherein elements in W, H are not negative. The mathematical model is represented as:
V=WH
in general, the matrix dimensions described above need to satisfy the relation (m + N) N < mN.
Step 2, firstly, performing sparse constraint, minimum correlation constraint and determinant constraint processing on the initial mixing matrix, specifically as follows:
and 2-1, when the non-negative matrix decomposition is directly used for solving the underdetermined blind signal separation, the decomposition result is not unique, and the source signal cannot be separated correctly. In order to achieve the purposes of good separation performance and unique result under an underdetermined condition, triple constraints are required to be adopted in the non-negative matrix decomposition process, and meanwhile, an optimization function F (V | | WH) is selected as follows by combining a target function with the minimum reconstruction error:
F ( V | | W H ) = &part; D D ( V | | W H ) + &part; &phi; v o l ( &phi; ( W ) ) + &part; J J ( H ) + &part; R C ( H )
wherein, W i k , H k j > 0 , &Sigma; i W i k = 1 , 1 &le; k &le; n , 1 &le; i &le; m , 1 &le; j &le; N
D ( V | | W H ) = &Sigma; i j ( V i j l o g V i j ( W H ) i j - V i j + ( W H ) i j )
in the formula
D (V | | WH) is a distance function of V and WH, namely a reconstruction error;
vol (Φ (W)) is a determinant criteria constraint of the mixing matrix W, and represents the volume of Φ (W), where W is the square matrix, vol (Φ (W)) ═ det (W)) |; when W is not a square matrix, vol (phi (W)) - | det (WW)T)|;
J (H) is a sparsity constraint separating signal H, and
c (H) is the minimum correlation constraint for the split signal H;
are constraint parameters in order to guarantee algorithm convergence.
The NMF problem can then be converted into an optimization problem, namely
minD (V | | WH), for any W, H,
s.tW,H≥0
the dispersion D (V | | WH) is non-increasing under the following iterations:
H a &mu; &LeftArrow; H a &mu; &Sigma; i W i a V i &mu; / ( W H ) i &mu; &Sigma; k W k a W i a &LeftArrow; W i a &Sigma; i H a &mu; V i &mu; / ( W H ) i &mu; &Sigma; k H a &mu;
the dispersion no longer changes if and only if W, H are stable points.
We first derive HThe iterative formula of (2) is that for the target function D (V | | WH), the negative gradient direction is used as descending analysis, and the undetermined learning rate is set to be ηBy selecting a suitable algorithmThen, an iterative formula for H can be obtained, and since W and H have symmetry, an overlap with respect to W can be similarly foundA generation formula.
Step 2-2 and step 2, the objective function based on the reconstruction error is optimized, specifically as follows:
the non-negative matrix factorization with sparse constraint is to add constraint conditions on the original objective function to acquire the decomposition information which is as sparse as possible. On the basis of the step 2-1, W and H both refer to algebraic expressions corresponding to the iterative formulas of W and H obtained in the step 2-1, and the constraint condition of adding 1-norm to the non-negative vector H is considered, namely minimizing H (| H |)1The conversion into matrix form is to minimize the sufficient sparseness of the matrix HAnd obtaining a new iteration rule.
The iterative rule is proved to be convergent by mathematical correlation definition and lemma, when the iterative rule is used for solving W and H, the objective function G (V | | | WH) is non-increasing, and the condition that the objective function is not changed is that if and only if W and H are stable points of the objective function.
When the iteration is carried out according to the new iteration rules of W and H until F (V | | | WH) is not larger than a certain small threshold value, the algorithm convergence can be judged, and the correct W and H are obtained.
Under the condition that the signal and mixing matrix are not negative, decomposing an observation signal matrix X according to the formula X-WH, wherein W is the estimation of the mixing matrix A, and H is the estimation of a source signal matrix S, thereby realizing the blind source separation of the mixed signals.
The steps of the NMF underdetermined blind source separation algorithm proposed above are as follows:
a. randomly selecting a group of W and H obtained in the step 1 for initialization;
b. optimizing an objective function based on the reconstruction error;
c. and updating the learning process, namely setting a small threshold, iterating according to an updating rule, and stopping updating when the optimization function F (V | | WH) is smaller than the threshold. And performing inverse STFT processing on W and H to obtain a unique group of mixing matrix W and an estimation matrix H of the source signal.
The step 3 specifically comprises:
in the signal separation process, the idea of a feedback mechanism is to eliminate the most thoroughly separated signals on the basis of the estimated source signals, so as to form a new mixed signal for signal separation, and the separation process is repeated to realize the separation of all signals. Then, in case that the source signal is unknown, in order to determine the purity of the estimated signal, the correlation coefficients between each source signal component estimated by the feedback system and all the mixed signals are summed, and the signal component corresponding to the minimum value is determined as the purest signal, i.e. the most thoroughly separated signal. Two signals h1And h2The formula for calculating the correlation coefficient therebetween is defined as follows:
r h 1 h 2 = &Sigma; i = 1 N ( h 1 i - h &OverBar; 1 ) ( h 2 i - h &OverBar; 2 ) &Sigma; i = 1 N ( h 1 i - h &OverBar; 1 ) 2 &Sigma; i = 1 N ( h 2 i - h &OverBar; 2 ) 2
wherein h is1iAnd h2iAre respectively a signal h1And h2And (ii) a value ofAndrespectively represent signals h1And h2Corresponding average value. If it isIt means that they are not relevant. The judgment basis of the feedback mechanism is the comparison of the absolute values of the correlation coefficients of the signals.
As described above, the signal mixing matrix is subjected to multiple constraint constraints to optimize the reconstruction error objective function, and the mixing matrix is estimated, thereby enhancing the interpretability and physical significance of the decomposition factor.
The method comprises the following specific steps:
and 3-1, updating to obtain W ═ W according to the new iteration rule of W and H obtained in the step 21,w2,…,wn]And H ═ H1,h2,…,hn]TLet p be n;
step 3-2, judging the number of source signals: if the number p of the source signals is less than 2, obtaining a final mixing matrix W and a final source signal H, and finishing the algorithm; otherwise, turning to the next step, and continuing to separate signals;
step 3-3, selecting the source signal with the best separation effect: comparing each separated source signal hi(i ═ 1,2, …, p) and the sum of the absolute values of the correlation coefficients of the original observed signal V, i.e. the sumWherein v isjFor the jth component of the observed signal V, the minimum value, i.e. the valueCorresponding to htI.e. the signal with the best separation effect;
and 3-4, forming a new mixed signal: removing source signal h from mixed signal used in this separationtTo form a new mixed signal V', i.e. V ═ V-wthtAnd let p be p-1;
step 3-5, according to the new mixed signal V', sequentially updating according to the iteration rule formula in the step 3-1 to obtain W ═ W1,w2,…,wn]And H ═ H1,h2,…,hn]T(ii) a And returning to the step 3-2.
As described above, the separation accuracy of the blind source signals is improved by introducing a feedback mechanism to the initial separation result to separate the separation signals one by one.
Has the advantages that: compared with the prior art, the invention has the following advantages:
1. the observation signals are subjected to averaging and whitening pretreatment, the robustness of initial conditions is enhanced, and a foundation is laid for improving the separation precision in subsequent steps.
2. The minimum reconstruction error objective function based on Kullback-Leibler is optimized by carrying out multiple constraint limitation on the signal mixing matrix, and the estimation of the mixing matrix is carried out, so that the uniqueness of a blind source signal separation result is ensured, and the sparsity, interpretability and physical significance of a decomposition factor are enhanced.
3. And a feedback mechanism is introduced into the initial separation result to separate the separation signals one by one, so that the separation precision of the blind source signals is improved.
Drawings
FIG. 1 is a flow chart of blind source signal separation;
fig. 2 is a feedback mechanism flow diagram.
Detailed Description
The present invention is further illustrated by the following examples, which are intended to be purely exemplary and are not intended to limit the scope of the invention, as various equivalent modifications of the invention will occur to those skilled in the art upon reading the present disclosure and fall within the scope of the appended claims.
As shown in fig. 1, the underdetermined blind source separation method applying multiple constraints includes the following steps:
example (b): the method is suitable for the voice signal and the model, but not limited to the voice signal and the model.
The linear instantaneous mixture model of the underdetermined blind speech signal separation problem can be expressed as:
x(t)=As(t)+n(t)(1)
wherein x (t) ═ x1(t),x2(t),…,xM(t)]TDenotes an observed signal vector of size M × 1 at time t, a ═ a1,a2,…,aN]∈RM×N(M < N) is a mixing matrix, aiI-th column vector representing a hybrid matrix, s (t) ═ s1(t),s2(t),…,sN(t)]TThe source signal vector with the size of N × 1 at time t is shown, and N (t) represents additive gaussian noise, therefore, the formula (1) can also be expressed as:
x ( t ) = A s ( t ) = &Sigma; i = 1 N a i s i + n ( t ) - - - ( 2 )
for the convenience of analysis, the influence of noise is not considered temporarily, so that the linear instantaneous mixed model of noise-free under-determined blind speech signal separation can be expressed as:
x ( t ) = A s ( t ) = &Sigma; i = 1 N a i s i - - - ( 3 )
in sparse component analysis, the sparsity of the signal is mainly utilized for the hybrid matrix estimation. Taking time domain signal analysis as an example, assuming that only the ith source signal exists at time t, equation (3) is degenerated as:
x(t)=aisi(t)(4)
step 1-1, sampling and quantizing an original observation time domain signal to obtain a discrete signal s (k), and a source signal s (k) ═ s1(k),s2(k),…,sM(k)]TObtaining an observation signal X (k) ═ x through a nonsingular mixing matrix A1(k),x2(k),…,xM(k)]T: x (k) ═ as (k); wherein s isM(k) Is the Mth component, x, of the source signal S (k)M(k) Is the Mth component of the observed signal X (K), K is a time sequence, the superscript T represents the conjugate transpose, M is a positive integer, A is an M × M-dimensional matrix;
step 1-2, sending the observation voice signal X (k) obtained in the step 1-1 into a preprocessing filter for STFT processing to obtain a frequency spectrum matrix, then solving a power spectrum of the frequency spectrum matrix, and outputting a power spectrum signal matrix Z (k). The method specifically comprises the following steps: the observation signals X (k) are first subjected to a centering pretreatment, i.e.Where E represents the mathematical expectation, followed by a centering pre-processing resultWhitening treatment is carried out to obtain an output power spectrum signal matrix Z (k) of the preprocessing filter: z (k) vx (k), where V is the whitening matrix, making the obtained initialization conditions more robust.
The step 2 specifically comprises the following steps:
the basic idea of NMF is to put the matrix V ═ V under non-negative constraints1,v2,…,vN]∈Rm×NDecomposed into two matrices W ═ W1,w2,…,wN]∈Rm×nAnd H ═ H1,h2,…,hN]∈Rn×NBy multiplying the decomposition results as much as possible
V=WH(5)
In general, the matrix dimensions described above need to satisfy the relation (m + N) N < mN. Specifically, in the present invention, W denotes a speech signal mixing matrix, and H denotes a speech source signal matrix.
Step 2-1, because W and H have non-negative limits, generally the decomposition for V cannot be strictly satisfied, but only the decomposition for V has errors. Since the decomposition does not eliminate the error, we can only make the decomposition error as small as possible, define an appropriate objective function, and find the decomposition with V as accurate as possible by minimizing the objective function. Two objective functions were given by d.d.lee, h.s.seung et al, one of which is based on Kullback-Leibler divergence, defined as follows:
D ( V | | W H ) = &Sigma; i j ( V i j l o g V i j ( W H ) i j - V i j + ( W H ) i j ) - - - ( 6 )
step 2, firstly, performing sparse constraint, minimum correlation constraint and determinant constraint processing on the initial mixing matrix, specifically as follows:
when the non-negative matrix decomposition is directly used for solving the underdetermined blind signal separation, the decomposition result is not unique, and the source signals cannot be separated correctly. In order to achieve the purposes of good separation performance and unique result under underdetermined conditions, the invention adopts determinant criterion as the unique constraint condition of W and adopts l1The norm is used as sparsity constraint of H, triple constraint of minimum correlation constraint of H is adopted, and an optimization function F (V | | WH) is selected as follows by combining an objective function of minimum reconstruction error based on Kullback-Leibler divergence:
F ( V | | W H ) = &part; D D ( V | | W H ) + &part; &phi; v o l ( &phi; ( W ) ) + &part; J J ( H ) + &part; R C ( H ) - - - ( 7 )
wherein, W i k , H k j > 0 , &Sigma; i W i k = 1 , 1 &le; k &le; n , 1 &le; i &le; m , 1 &le; j &le; N
D ( V | | W H ) = &Sigma; i j ( V i j l o g V i j ( W H ) i j - V i j + ( W H ) i j ) - - - ( 8 )
in the formula
D (V | | WH) is a distance function of V and WH, namely a reconstruction error;
vol (Φ (W)) is a determinant criteria constraint of the mixing matrix W, and represents the volume of Φ (W), where W is the square matrix, vol (Φ (W)) ═ det (W)) |; when W is not a square matrix, vol (phi (W)) - | det (WW)T) And the latter corresponds to the case of underdetermination (i.e. the number of speech signals is greater than the number of sensors).
J (H) is a sparsity constraint separating signal H, and
c (H) is the minimum correlation constraint of the separated signal H, so that each component of the separated voice signal does not contain the information of other signal components as much as possible, C ( H ) = 1 2 &lsqb; &Sigma; i = 1 n l o g ( ( HH T ) ) i i - l o g | HH T | &rsqb; ;
are constraint parameters in order to guarantee the NMF algorithm convergence.
The NMF problem can then be converted into an optimization problem, namely
minD (V | | WH), for any W, H,
s.tW,H≥0
the dispersion D (V | | WH) is non-increasing under alternating iterations of the multiplicative update rule:
H a &mu; &LeftArrow; H a &mu; &Sigma; i W i a V i &mu; / ( W H ) i &mu; &Sigma; k W k a W i a &LeftArrow; W i a &Sigma; i H a &mu; V i &mu; / ( W H ) i &mu; &Sigma; k H a &mu; - - - ( 9 )
the dispersion no longer changes if and only if W, H are stable points.
We first derive HThe iterative formula of (2) is that for the target function D (V | | WH), the negative gradient direction is used as descending analysis, and the undetermined learning rate is set to be ηUsing the steepest descent method to calculate
&part; D ( V | | W H ) &part; H a &mu; = &part; &Sigma; i j ( V i j l o g V i j ( W H ) i j - V i j + ( W H ) i j ) &part; H a &mu; = &Sigma; i W i a - &Sigma; i V i &mu; ( W H ) i &mu; W i a - - - ( 10 )
Thus, the
H a &mu; &LeftArrow; H a &mu; + &eta; a &mu; ( &Sigma; i V i &mu; ( W H ) i &mu; W i a - &Sigma; i W i a ) - - - ( 11 )
Selecting a learning rate ofWhere the equilibrium parameters are expressed with very small values.
Then, iterative formula (11) with respect to H can be obtained, and since W and H have symmetry, iterative formula with respect to W can be similarly found.
Step 2-2 and step 2, the objective function based on the reconstruction error is optimized, specifically as follows:
the non-negative matrix factorization with sparse constraint is to add constraint conditions on the original objective function to acquire the decomposition information which is as sparse as possible. On the basis of the step 2-1, W and H both refer to algebraic expressions corresponding to the iterative formulas of W and H obtained in the step 2-1, and the constraint condition of adding 1-norm to the non-negative vector H is considered, namely minimizing H (| H |)1The conversion into matrix form is such thatThe sufficient sparseness of the matrix H should be minimizedAdding this constraint, the objective function becomes:
D ( V | | W H ) = 1 2 &Sigma; i , j ( V i j - ( W H ) i j ) 2 - 1 2 &beta; &Sigma; i , j W i j 2 + &lambda; &Sigma; i , j H i j - - - ( 12 )
the above formula λ is not less than 0, which is transformed into the optimization problem as follows:
min G ( V | | W H ) s . t . W , H &GreaterEqual; 0 , &Sigma; i W i j = 1 , 1 &le; j &le; r - - - ( 13 )
obviously, the iteration rule of W is not changed, but passes through the formula
H a j t &LeftArrow; H a j t - 1 - &part; a j &part; G ( V | | W H ) &part; H a j - - - ( 14 )
The iteration rule of G (V | | WH) with respect to H can be solved.
&part; G ( V | | W H ) &part; H a j = &part; ( 1 2 &Sigma; i , j ( V i j - ( W H ) i j ) 2 - 1 2 &beta; &Sigma; i , j W i j 2 + &lambda; &Sigma; i , j H i j ) &part; H a j = ( W T W H ) a j - ( W T V ) a j + &lambda; - - - ( 15 )
Is the step size in the direction of the negative gradient, taken
&part; a j = H a j ( W T W H ) a j + &lambda; - - - ( 16 )
Substituting equations (15) and (16) into (3.14) yields the iterative equation for H:
H a j &LeftArrow; H a j ( W T V ) a j ( W T W H ) a j + &lambda; - - - ( 17 )
finally, a new iteration rule is obtained as follows:
W i a &LeftArrow; W i a ( VH T ) i a ( WHH T ) i a - &beta;W i a - - - ( 18 )
H a j &LeftArrow; H a j ( W T V ) a j ( W T W H ) a j + &lambda; - - - ( 19 )
the iterative rules (18), (19) can be proven to be convergent by mathematical correlation definitions and lemmas[1]When solving for W and H using the iteration rules (18), (19), the objective function G (V | | | WH) is non-increasing, and the condition that the objective function no longer changes is if and only if W and H are their stable points.
However, because the iterative formula contains a minus sign, the new iterative rule cannot guarantee that the result is non-negative, and when the denominator is zero, the new iterative rule loses meaning, and a small adjusting parameter which is 1.0 × 10 is introduced for the reason-9And normalizing each column of the base matrix W to obtain
W i a &LeftArrow; W i a m a x { ( VH T ) i a , &epsiv; } ( WHH T ) i a - &beta;W i a + &epsiv; - - - ( 20 )
H a j &LeftArrow; H a j m a x { ( W T V ) a j , &epsiv; } ( W T W H ) a j + &lambda; + &epsiv; - - - ( 21 )
A base matrix WiaNormalization is as follows:
W i a &LeftArrow; W i a &Sigma; i W i a - - - ( 22 )
when the formulas (20) and (21) are iterated until F (V | | | WH) is not larger than a certain small threshold, the convergence of the algorithm can be judged, and the correct W and H are obtained.
Under the condition that the signal and mixing matrix is not negative, the formula (3) X is AS observation signal matrix X is decomposed according to the formula (4), namely X is WH, so that W is the estimation of the mixing matrix A, and H is the estimation of the source signal matrix S, thereby realizing the blind source separation of the mixed signals.
The steps of the NMF underdetermined blind speech signal separation algorithm based on K-L divergence are as follows:
a. randomly selecting a group of W and H obtained in the step 1 for initialization, and performing sparse constraint, minimum correlation constraint and determinant constraint processing;
b. further optimizing the target function based on the reconstruction error to realize the reconstruction of the source speech signal frequency spectrum matrix and obtain the frequency spectrum matrixes W and H of the reconstructed source speech signal (wherein W and H both refer to algebraic expressions corresponding to the iterative formulas of W and H obtained in the step 2-1);
c. and updating the learning process, namely setting a small threshold, iterating according to an updating rule as shown in formulas (20) and (21), after each iteration, setting the negative elements of W and H to zero, and normalizing the column vector of W as shown in a formula (22). When the optimization function F (V | | | WH) is less than the threshold, the algorithm terminates, stopping updating. And performing inverse STFT processing on W and H to obtain a unique group of voice signal mixing matrix W and a voice source signal estimation matrix H.
The step 3 specifically comprises:
in the signal separation process, the idea of a feedback mechanism is to eliminate the most thoroughly separated signals on the basis of the estimated source signals, so as to form a new mixed signal for signal separation, and the separation process is repeated to realize the separation of all signals. A flow chart of the feedback mechanism concept, as shown in fig. 2. Then, in case that the source signal is unknown, in order to determine the purity of the estimated signal, the correlation coefficients between each source signal component estimated by the feedback system and all the mixed signals are summed, and the signal component corresponding to the minimum value is determined as the purest signal, i.e. the most thoroughly separated signal. Two signals h1And h2The formula for calculating the correlation coefficient therebetween is defined as follows:
r h 1 h 2 = &Sigma; i = 1 N ( h 1 i - h &OverBar; 1 ) ( h 2 i - h &OverBar; 2 ) &Sigma; i = 1 N ( h 1 i - h &OverBar; 1 ) 2 &Sigma; i = 1 N ( h 2 i - h &OverBar; 2 ) 2 - - - ( 23 )
wherein h is1iAnd h2iAre respectively a signal h1And h2And (ii) a value ofAndrespectively represent signals h1And h2Corresponding average value. If it isIt means that they are not relevant. The judgment basis of the feedback mechanism is the comparison of the absolute values of the correlation coefficients of the signals.
As described above, the signal mixing matrix is subjected to multiple constraint constraints to optimize the reconstruction error objective function, and the mixing matrix is estimated, thereby enhancing the interpretability and physical significance of the decomposition factor.
The method comprises the following specific steps:
and 3-1, updating to obtain W ═ W according to the iteration rule formulas (20) and (21) of W and H1,w2,…,wn]And H ═ H1,h2,…,hn]TLet p be n;
step 3-2, judging the number of source audio signals: if the number p of the source speech signals is less than 2, obtaining a final speech signal mixing matrix W and an estimation matrix H of the speech source signals, and finishing the algorithm; otherwise, turning to the next step, and continuing to separate the voice signals;
step 3-3, selecting the source signal with the best separation effect: comparing each separated source signal hi(i-1, 2, …, p) and the absolute value of the correlation coefficient of the original observed speech signal V, i.e. the sumWherein v isjFor observing the jth component of the speech signal V, the minimum value, i.e. the valueCorresponding to htNamely the voice signal with the best separation effect;
and 3-4, forming a new mixed voice signal: removing source signal h from mixed speech signal used in this separationtTo form a new mixed speech signal V', i.e. V ═ V-wthtAnd let p be p-1;
and 3-5, updating the W-W according to the new mixed voice signal V' and the iteration rule formulas (20) and (21) in sequence to obtain W-W1,w2,…,wn]And H ═ H1,h2,…,hn]T(ii) a And returning to the step 3-2.
As described above, the separated voice signals are separated one by introducing a feedback mechanism to the initial separation result, so that the separation accuracy of the blind voice signals is improved.
Reference documents:
[1]LeeDD,SeungHS.Algorithmsfornon-negativematrixfactorization[J].AdvancesinNeuralInformationProcessingSystems,2001,13:556-562.

Claims (5)

1. An underdetermined blind source separation method applying multiple constraints, characterized by comprising the following steps:
step 1, sampling and quantizing a source signal to be separated to obtain an initial observation signal, and preprocessing the initial observation signal, wherein the method comprises the following steps: carrying out averaging and whitening treatment to obtain an initial mixing matrix;
step 2, firstly carrying out sparse constraint, minimum correlation constraint and determinant constraint processing on the initial mixed matrix obtained in the step 1, and then carrying out optimization processing on a target function based on reconstruction errors to obtain an estimation and initial separation signal of the mixed matrix;
and 3, performing feedback processing on the initial separation signal to obtain a final separation signal.
2. The method for multiple-constraint underdetermined blind source separation as defined in claim 1, wherein the step 1 specifically comprises:
step 1-1, sampling and quantizing an original observation signal to obtain a discrete signal s (k), and a source signal s (k) ═ s1(k),s2(k),…,sM(k)]TObtaining an initial observation signal X (k) ═ x through a nonsingular mixing matrix A1(k),x2(k),…,xM(k)]T: x (k) ═ as (k); wherein s isM(k) Is the Mth component, x, of the source signal S (k)M(k) Is the Mth component of the observed signal X (K), K is a time sequence, the superscript T represents the conjugate transpose, M is a positive integer, A is an M × M-dimensional matrix;
step 1-2, sending the observation signal X (k) obtained in the step 1-1 into a preprocessing filter for STFT processing to obtain a frequency spectrum matrix, then solving a power spectrum for the frequency spectrum matrix, and outputting a power spectrum signal matrix Z (k). The method specifically comprises the following steps: the observation signals X (k) are first subjected to a centering pretreatment, i.e.Where E represents the mathematical expectation, followed by a centering pre-processing resultWhitening treatment is carried out to obtain an output power spectrum signal matrix Z (k) of the preprocessing filter: z (k) vx (k), where V is the whitening matrix, making the obtained initialization conditions more robust.
3. The method for multiple-constraint underdetermined blind source separation according to claim 1, wherein the initial mixture matrix in step 2 is first subjected to sparsity constraint, minimum correlation constraint and determinant constraint, specifically as follows:
and 2-1, when the non-negative matrix decomposition is directly used for solving the underdetermined blind signal separation, the decomposition result is not unique, and the source signal cannot be separated correctly. In order to achieve the purposes of good separation performance and unique result under an underdetermined condition, triple constraints are required to be adopted in the non-negative matrix decomposition process, and meanwhile, an optimization function F (V | | WH) is selected as follows by combining a target function with the minimum reconstruction error:
F ( V | | W H ) = &part; D D ( V | | W H ) + &part; &phi; v o l ( &phi; ( W ) ) + &part; J J ( H ) + &part; R C ( H )
wherein, W i k , H k j > 0 , &Sigma; i W i k = 1 , 1 &le; k &le; n , 1 &le; i &le; m , 1 &le; j &le; N
D ( V | | W H ) = &Sigma; i j ( V i j l o g V i j ( W H ) i j - V i j + ( W H ) i j )
in the formula
D (V | | WH) is a distance function of V and WH, namely a reconstruction error;
vol (Φ (W)) is a determinant criteria constraint of the mixing matrix W, and represents the volume of Φ (W), where W is the square matrix, vol (Φ (W)) ═ det (W)) |;when W is not a square matrix, vol (phi (W)) - | det (WW)T)|;
J (H) is a sparsity constraint separating signal H, and
c (H) is the minimum correlation constraint for the split signal H;
is a constraint parameter for the purpose of ensuring algorithm convergence;
the NMF problem can then be converted into an optimization problem, namely
minD (V | | WH), for any W, H,
s.tW,H≥0
the dispersion D (V | | WH) is non-increasing under the following iterations:
H a &mu; &LeftArrow; H a &mu; &Sigma; i W i a V i &mu; / ( W H ) i &mu; &Sigma; k W k a W i a &LeftArrow; W i a &Sigma; i H a &mu; V i &mu; / ( W H ) i &mu; &Sigma; k H a &mu;
if and only if W, H are stable points, the dispersion no longer changes;
we first derive HThe iterative formula of (2) is that for the target function D (V | | WH), the negative gradient direction is used as descending analysis, and the undetermined learning rate is set to be ηTo obtainThen, an iterative formula about H can be obtained, and since W and H have symmetry, an iterative formula about W can be similarly solved;
step 2-2 and step 2, the objective function based on the reconstruction error is optimized, specifically as follows:
on the basis of the step 2-1, the constraint condition that the non-negative vector h is added with 1-norm is considered, namely the calculation of the minimum | | h | | survival rate1The conversion into matrix form is to minimize the sufficient sparseness of the matrix HObtaining a new iteration rule; when the iteration rule is used for solving W and H, the objective function G (V | | | WH) is non-increasing, and the condition that the objective function is not changed is that if and only if W and H are stable points of the objective function;
when the iteration is carried out according to the new iteration rules of W and H until F (V | | | WH) is not larger than a certain very small threshold value, the algorithm convergence can be judged, and correct W and H are obtained;
under the condition that the signal and mixing matrix are not negative, decomposing an observation signal matrix X according to the formula X-WH, wherein W is the estimation of the mixing matrix A, and H is the estimation of a source signal matrix S, thereby realizing the blind source separation of the mixed signals.
4. The method for multiple-constraint underdetermined blind source separation as defined in claim 1, wherein the step 3 specifically comprises the steps of:
and 3-1, updating to obtain W ═ W according to the new iteration rule of W and H obtained in the step 21,w2,…,wn]And H ═ H1,h2,…,hn]TLet p be n;
step 3-2, judging the number of source signals: if the number p of the source signals is less than 2, obtaining a final mixing matrix W and a final source signal H, and finishing the algorithm; otherwise, turning to the next step, and continuing to separate signals;
step 3-3, selecting the source signal with the best separation effect: comparing each separated source signal hi(i ═ 1,2, …, p) and the sum of the absolute values of the correlation coefficients of the original observed signal V, i.e. the sumWherein v isjFor the jth component of the observed signal V, the minimum value, i.e. the valueCorresponding to htI.e. the signal with the best separation effect;
and 3-4, forming a new mixed signal: removing source signal h from mixed signal used in this separationtTo form a new mixed signal V', i.e. V ═ V-wthtAnd let p be p-1;
step 3-5, according to the new mixed signal V', sequentially updating according to the iteration rule formula in the step 3-1 to obtain W ═ W1,w2,…,wn]And H ═ H1,h2,…,hn]T(ii) a And returning to the step 3-2.
5. A multi-constrained underdetermined blind source separation method as defined in claim 3,
the new iteration rule is as follows:
W i a &LeftArrow; W i a ( VH T ) i a ( WHH T ) i a - &beta;W i a
H a j &LeftArrow; H a j ( W T V ) a j ( W T W H ) a j + &lambda;
when W and H are solved using the new iteration rule, the objective function G (V | | | WH) is non-increasing, and the condition that the objective function no longer changes is if and only if W and H are their stable points.
CN201610046103.0A 2016-01-22 2016-01-22 Underdetermined blind source separation method applying multiple constraints Pending CN105550716A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610046103.0A CN105550716A (en) 2016-01-22 2016-01-22 Underdetermined blind source separation method applying multiple constraints

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610046103.0A CN105550716A (en) 2016-01-22 2016-01-22 Underdetermined blind source separation method applying multiple constraints

Publications (1)

Publication Number Publication Date
CN105550716A true CN105550716A (en) 2016-05-04

Family

ID=55829899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610046103.0A Pending CN105550716A (en) 2016-01-22 2016-01-22 Underdetermined blind source separation method applying multiple constraints

Country Status (1)

Country Link
CN (1) CN105550716A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106448694A (en) * 2016-09-08 2017-02-22 哈尔滨工程大学 Time-frequency single source point extraction method in underdetermined blind source separation based on compound angle detection
CN106898361A (en) * 2017-03-16 2017-06-27 杭州电子科技大学 Single channel blind source separation method based on feedback variation Mode Decomposition
CN107403046A (en) * 2017-07-27 2017-11-28 长安大学 A kind of linear restriction method of estimation decomposed based on state space
CN107610710A (en) * 2017-09-29 2018-01-19 武汉大学 A kind of audio coding and coding/decoding method towards Multi-audio-frequency object
CN108922557A (en) * 2018-06-14 2018-11-30 北京联合大学 A kind of the multi-person speech separation method and system of chat robots
CN109084743A (en) * 2018-08-24 2018-12-25 中国科学院光电技术研究所 Method for separating target information output by fiber-optic gyroscope and disturbance signal of photoelectric tracking system
CN111652328A (en) * 2020-08-04 2020-09-11 北京英迈琪科技有限公司 Non-negative blind source separation method and system
CN112131899A (en) * 2020-09-28 2020-12-25 四川轻化工大学 Anti-collision method of RFID system in underdetermined state
US11862141B2 (en) 2019-03-27 2024-01-02 Sony Group Corporation Signal processing device and signal processing method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104573593A (en) * 2015-01-20 2015-04-29 江西理工大学 Underdetermined blind source separation RFID anti-collision method based on frame slots
CN104866790A (en) * 2015-06-02 2015-08-26 江西理工大学 Underdetermined blind separation self-adaption tree form grouping anti-collision method for RFID system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104573593A (en) * 2015-01-20 2015-04-29 江西理工大学 Underdetermined blind source separation RFID anti-collision method based on frame slots
CN104866790A (en) * 2015-06-02 2015-08-26 江西理工大学 Underdetermined blind separation self-adaption tree form grouping anti-collision method for RFID system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
史习智: "《盲信号处理:理论与实践》", 31 March 2008 *
黄雯雯: "基于非负矩阵分解的盲信号分离方法研究", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106448694A (en) * 2016-09-08 2017-02-22 哈尔滨工程大学 Time-frequency single source point extraction method in underdetermined blind source separation based on compound angle detection
CN106448694B (en) * 2016-09-08 2019-09-27 哈尔滨工程大学 A kind of deficient time-frequency list source point extracting method determined in blind source separating based on the detection of multiple angle
CN106898361A (en) * 2017-03-16 2017-06-27 杭州电子科技大学 Single channel blind source separation method based on feedback variation Mode Decomposition
CN106898361B (en) * 2017-03-16 2020-05-26 杭州电子科技大学 Single-channel blind source separation method based on feedback variational mode decomposition
CN107403046A (en) * 2017-07-27 2017-11-28 长安大学 A kind of linear restriction method of estimation decomposed based on state space
CN107610710A (en) * 2017-09-29 2018-01-19 武汉大学 A kind of audio coding and coding/decoding method towards Multi-audio-frequency object
CN108922557A (en) * 2018-06-14 2018-11-30 北京联合大学 A kind of the multi-person speech separation method and system of chat robots
CN109084743A (en) * 2018-08-24 2018-12-25 中国科学院光电技术研究所 Method for separating target information output by fiber-optic gyroscope and disturbance signal of photoelectric tracking system
CN109084743B (en) * 2018-08-24 2022-04-19 中国科学院光电技术研究所 Method for separating target information output by fiber-optic gyroscope and disturbance signal of photoelectric tracking system
US11862141B2 (en) 2019-03-27 2024-01-02 Sony Group Corporation Signal processing device and signal processing method
CN111652328A (en) * 2020-08-04 2020-09-11 北京英迈琪科技有限公司 Non-negative blind source separation method and system
CN112131899A (en) * 2020-09-28 2020-12-25 四川轻化工大学 Anti-collision method of RFID system in underdetermined state

Similar Documents

Publication Publication Date Title
CN105550716A (en) Underdetermined blind source separation method applying multiple constraints
CN110634502B (en) Single-channel voice separation algorithm based on deep neural network
Sun et al. Alternating direction method of multipliers for non-negative matrix factorization with the beta-divergence
Tan et al. Nonlinear blind source separation using higher order statistics and a genetic algorithm
CN102945670B (en) Multi-environment characteristic compensation method for voice recognition system
CN107562784A (en) Short text classification method based on ResLCNN models
CN105206270A (en) Isolated digit speech recognition classification system and method combining principal component analysis (PCA) with restricted Boltzmann machine (RBM)
CN111653288A (en) Target person voice enhancement method based on conditional variation self-encoder
CN104598972A (en) Quick training method of large-scale data recurrent neutral network (RNN)
CN105355198A (en) Multiple self-adaption based model compensation type speech recognition method
Shi et al. Independent component analysis
CN111816200B (en) Multi-channel speech enhancement method based on time-frequency domain binary mask
CN107248414A (en) A kind of sound enhancement method and device based on multiframe frequency spectrum and Non-negative Matrix Factorization
Févotte et al. Sparse linear regression in unions of bases via Bayesian variable selection
Ji et al. Recurrent neural network-based dictionary learning for compressive speech sensing
CN113674756B (en) Frequency domain blind source separation method based on short-time Fourier transform and BP neural network
CN116310125A (en) Three-dimensional space mineral reconstruction method based on two-dimensional energy spectrum information
Zhang et al. A new two-stage approach to underdetermined blind source separation using sparse representation
CN109490840A (en) Based on the noise reduction and reconstructing method for improving the sparse radar target HRRP from encoding model
CN107221321A (en) A kind of phonetics transfer method being used between any source and target voice
Gabrielson et al. ICA with orthogonality constraint: identifiability and a new efficient algorithm
CN104064195A (en) Multidimensional blind separation method in noise environment
Yang et al. Natural gradient descent for training multi-layer perceptrons
Aihara et al. Semi-non-negative matrix factorization using alternating direction method of multipliers for voice conversion
Ji et al. Structured dictionary learning for compressive speech sensing

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160504

RJ01 Rejection of invention patent application after publication