CN114822584A

CN114822584A - Transmission device signal separation method based on integral improved generalized cross-correlation

Info

Publication number: CN114822584A
Application number: CN202210439737.8A
Authority: CN
Inventors: 李旭; 栾峰; 王涛; 吴艳; 韩月娇
Original assignee: Northeastern University China
Current assignee: Northeastern University China
Priority date: 2022-04-25
Filing date: 2022-04-25
Publication date: 2022-07-29

Abstract

The invention discloses a transmission device signal separation method based on integral improved generalized cross-correlation, which is a new blind source separation method combining a generalized cross-correlation algorithm and a non-negative matrix decomposition algorithm to separate sound signals of different transmission devices. Combining a generalized cross-correlation algorithm with a nonnegative matrix decomposition algorithm, obtaining arrival time difference by using the generalized cross-correlation algorithm, and judging the number of sources; combining non-negative matrix decomposition to obtain information of which source the specific dictionary atom comes from, thereby providing factual basis for generating mask matrixes of different sources; the generalized cross-correlation is improved by using an integral method, and the accuracy of the estimation of the arrival time difference is improved; a new non-negative matrix factorization initialization method is designed, and the time for calculating the non-negative matrix factorization is reduced. The method solves the problem that other blind source separation methods depend on an ideal mathematical model or on a training neural network.

Description

Transmission device signal separation method based on integral improved generalized cross-correlation

Technical Field

The invention belongs to the technical field of transmission device signal separation methods, and relates to a transmission device signal separation method based on integral improvement generalized cross-correlation.

Background

The transmission device transmits the power of the power device to equipment such as a working mechanism and the like, and the figure of the transmission device can be seen on various machines or vehicles, wherein mechanisms such as a bearing and the like which mainly do circular motion are more. Because the working environment of the equipment is possibly very harsh, the failure rate of the equipment is improved, the normal operation of the operation is influenced, and in order to ensure that the machine operates normally, the performance of the machine needs to be monitored so as to find problems in time and maintain in time to recover the work. Specific sensors can be used to acquire the sound signal or vibration curve of the transmission device, but in some special scenes, the sensors cannot be added, and a non-contact mode is needed for monitoring. In a scenario with multiple actuators, the acoustic signals can be separated using blind source separation techniques, and the acoustic signals from multiple actuators can be separated for condition monitoring. Blind source separation enables the waveform of a source signal to be recovered from an observed mixed signal without determining the mixed signal mixing process and the source signal. The "blind" of blind source separation has two main points: the source signal is unknown and the transmission channel parameters of the signal are also unknown.

The origin of the blind source separation technology can trace back to 80 years in the 20 th century, pioneering work is mainly completed by Jutten and Herault, in a conference about a neural network held in the United states, an H-J learning algorithm in the blind source separation is proposed based on a feedback neural network model, the separation of two aliasing source signals is completed, the uncertainty of the quantity and the channel of the source signals can be solved, and a special CMOS chip is designed to realize the algorithm. At present, numerous scholars at home and abroad have a very deep research on the blind source separation technology. For example, Independent Component Analysis (ICA) method, the separation of the source signals can be accomplished as long as the mutual independence between the individual signals of the mixed signal is restored. On the basis of ICA theory, a large number of excellent blind source separation algorithms emerge: the deconvolution problem can be converted into the instantaneous problem by oversampling, and then the separation is carried out by the traditional ICA method; the wavelet decomposition of the signal can be regularized by adopting ICA to search for independent characteristics to carry out wavelet ICA; the constrained optimization problem can be solved using a constrained ICA algorithm with an adaptive solution like newton's learning. In addition, FastICA is provided based on the ICA principle and the non-Gaussian maximum fixed point algorithm, and the blind source separation problem can be converted into the estimation of a density function and then researched. However, the above algorithms all assume that sound follows a certain distribution and conforms to a mathematical model under an ideal situation, but it is difficult to satisfy the assumption in an actual scene, and thus the robustness exhibited by the above methods is not strong enough. In addition, the blind source separation method based on deep learning depends on pre-training, cannot be applied immediately, and has the advantages that the separation effect tests the generalization performance of the network and the stability is not good.

Disclosure of Invention

To solve the above technical problem, the present invention provides a method for separating transmission signals based on integral-improved generalized cross-correlation.

The invention provides a transmission device signal separation method based on integral improved generalized cross-correlation, which comprises the following steps:

step 1: collecting original dual-channel audio mixed signals and preprocessing the mixed signals;

step 2: performing time-frequency analysis on the dual-channel audio mixed signal to obtain time-frequency information of the mixed signal, wherein the time-frequency information comprises an amplitude spectrum and an angle spectrum;

and step 3: estimating time delay by using an integral improved generalized cross-correlation algorithm to obtain time delay of different sound sources;

and 4, step 4: carrying out non-negative matrix decomposition on the magnitude spectrum to obtain a dictionary matrix and a coefficient matrix;

and 5: combining an integral improvement generalized cross-correlation algorithm with a non-negative matrix decomposition algorithm to generate a mask matrix;

step 6: multiplying the mask matrix and the coefficient matrix element by element to obtain a separated coefficient matrix;

and 7: carrying out inverse nonnegative matrix decomposition, and multiplying the dictionary matrix and the separated coefficient matrix to obtain the amplitude spectrums of the different sound sources after separation;

and 8: and combining the amplitude spectrum and the angle spectrum of the different separated sound sources, and performing inverse short-time Fourier transform to obtain time domain information of the different separated sound sources, so as to complete separation.

In the transmission device signal separation method based on integral improved generalized cross-correlation of the present invention, the step 1 specifically is:

step 1.1: the method comprises the following steps of collecting original audio mixed signals by using a dual-channel microphone array, placing different sound sources in different directions during collection, and enabling the different sound sources to emit sound at the same time to simulate the sound emitted by different transmission devices;

step 1.2: in order to improve the signal-to-noise ratio of the mixed signal and further improve the quality of separation, the original audio mixed signal is preprocessed, and a polynomial least square method is adopted to eliminate a trend term error.

In the transmission device signal separation method based on integral improved generalized cross-correlation of the present invention, the step 2 specifically is:

step 2.1: performing discrete short-time Fourier transform on the audio mixed signal:

wherein f is frequency, t is time, STFT (f, t) is time frequency information, k is temporary variable required by integral operation, x () is input signal, g () is window function, and Hamming window is specifically adopted;

step 2.2: decomposing the time-frequency information by using the following formula:

wherein, V _ft Is an amplitude spectrum divided into an amplitude spectrum V of a left channel _lft Amplitude spectrum V of the right channel _rft ；φ _ft Is an angular spectrum.

In the transmission device signal separation method based on integral improved generalized cross-correlation of the present invention, the step 3 specifically is:

step 3.1: the basic generalized cross-correlation algorithm is defined as follows:

wherein tau is time delay,

For cross power spectrum, psi _ft As a frequency weighting function, G _τt Is a cross-correlation function;

step 3.2: the integration method is used to improve the basic generalized cross-correlation algorithm, and the cross-correlation function G is obtained by the formula (3) _τt Front, cross power spectrum

Integration along the time t axis: in particular, a specified window length is selected, in the cross-power spectrum

The sliding window algorithm is carried out on each line of the window, the mean value in the window is calculated, the value is assigned to the element at the center of the window, and then the cross-correlation function G is carried out _τt Calculating (1);

step 3.3: the obtained cross-correlation function G _τt Summing along the t-axis, the cross-correlation function becomes a one-dimensional delay profile

Finding out using peak detection algorithm

The time delay corresponding to the abscissa of the peak value is the calculated time delay tau _S (

s

1,2.. n), s represents different sound sources, and n represents the number of sound sources.

In the transmission device signal separation method based on integral improved generalized cross-correlation of the present invention, the step 4 specifically is:

step 4.1: initialization of the dictionary matrix: from the magnitude spectrum V _ft The method comprises the following steps of selecting a plurality of column vectors with the maximum infinite norm, averaging the column vectors to be used as columns of a dictionary matrix, wherein the infinite norm of the columns is defined as follows:

wherein | vec | purple light _∞ I.e. infinite norm, v, of vector vec _i Is an element of the vector, len is the length of the vector;

step 4.2: initializing the coefficient matrix by a random initialization mode;

step 4.3: dictionary matrix W _fd Sum coefficient matrix H _dt After initialization, iteration is carried out for a plurality of times by adopting the following iteration formula, and a decomposed dictionary matrix and a decomposed coefficient matrix are solved:

wherein the content of the first and second substances,

the dictionary matrix obtained for the mth iteration is calculated,

the coefficient matrix obtained for the mth iteration is calculated.

In the transmission device signal separation method based on integral improved generalized cross-correlation of the present invention, the step 5 specifically is:

step 5.1: the dictionary matrix generated using the non-negative matrix factorization defines a new frequency weighting function as follows:

wherein the content of the first and second substances,

is a new frequency weighting function;

step 5.2: weighting the new frequency function

Substituting into the integral improvement generalized cross-correlation algorithm definition formula in the step 3, the objective of combining the integral improvement generalized cross-correlation algorithm with the non-negative matrix decomposition is realized, and the following formula is obtained:

wherein the content of the first and second substances,

is a new cross-correlation function;

step 5.3: the meaning of the mask matrix is that different sound sources correspond to different time delays τ _S Respectively substituting the time delays of different sound sources into

Attributing the dictionary atom to

The source with the largest value sets the element of the specified position of the mask matrix of the sound source to 1, otherwise to 0, and the definition of the mask matrix is shown as follows:

wherein M is _dt For the mask matrix, s denotes different sound sources.

In the transmission signal separation method based on integral improved generalized cross-correlation of the present invention, the step 8 specifically is:

combining the amplitude spectrums of the different sources obtained in the step 7 with the angle spectrums obtained in the step 2, and performing short-time Fourier inverse transformation to obtain time domain information of the different separated sources, so that the information of different sound sources is finally separated, sound signals of different transmission devices are separated, and the following formula is an inverse short-time Fourier transformation formula:

wherein the content of the first and second substances,

i.e. sound signals representing separate different actuators,

for amplitude spectra of different sound sources, phi _ft Is an angular spectrum.

The invention discloses a transmission device signal separation method based on integral improved generalized cross-correlation, which at least has the following beneficial effects:

(1) the integration improvement generalized cross-correlation algorithm is combined with a non-negative matrix decomposition algorithm to carry out the sound signal separation of the multiple transmission devices by a blind source separation method, so that the precision is high, the calculation speed is high, and the robustness is strong;

(2) an integration method is used for enhancing a generalized cross-correlation algorithm, so that the result of the generalized cross-correlation is more accurate, and the error of time delay estimation is smaller;

(3) when the dictionary matrix and the coefficient matrix are solved, a new initialization method of a non-negative matrix decomposition algorithm is adopted, the decomposition speed is increased, and the separation effect is improved;

(4) the sound generated by different transmission devices can be separated without additionally arranging a sensor on the transmission device, so that the sound-separating device is suitable for being applied in a scene that the sensor cannot be additionally arranged, and the sound of the transmission device can be acquired in a non-contact manner.

Drawings

FIG. 1 is a flow chart of a transmission signal separation method of the present invention based on integral-improved generalized cross-correlation;

FIG. 2 is a flow chart of the integral-improved generalized cross-correlation algorithm of the present invention;

FIG. 3a is a graph comparing the time domain waveforms of the source signal 1 and the source signal 1 separated by the method of the present invention in example 1;

FIG. 3b is a graph comparing the time-frequency spectrum of the source signal 1 in example 1 and the source signal 1 separated by the method of the present invention;

FIG. 3c is a comparison of the time domain waveforms of the source signal 2 and the source signal 2 isolated by the method of the present invention in example 1;

FIG. 3d is a comparison graph of the time-frequency spectrum of the source signal 2 in example 1 and the source signal 2 separated by the method of the present invention;

FIG. 4a is a comparison graph of the time domain waveforms of the source signal 1 separated by the ICA method and the source signal 1 in example 1;

FIG. 4b is a graph comparing the time-frequency spectrum of the source signal 1 separated by the ICA method and the source signal 1 in example 1;

FIG. 4c is a graph comparing time domain waveforms of the source signal 2 and the source signal 2 separated by the ICA method in example 1;

FIG. 4d is a comparison graph of the time-frequency spectrum of the source signal 2 separated by the ICA method and the source signal 2 in example 1;

FIG. 5 is a graph comparing experimental results on two sets of sound source data;

fig. 6 is a graph comparing experimental effects on a three-sound source data set.

Detailed Description

As shown in fig. 1, the invention relates to a method for separating transmission signals based on integral improved generalized cross-correlation, which comprises the following steps:

step 1: the method comprises the following steps of collecting original dual-channel audio mixed signals and preprocessing the mixed signals, and specifically comprises the following steps:

Step 2: performing time-frequency analysis on the dual-channel audio mixed signal to obtain time-frequency information of the mixed signal, wherein the time-frequency information comprises an amplitude spectrum and an angle spectrum, and the step 2 specifically comprises the following steps:

And step 3: the time delay is estimated by using an integral improved generalized cross-correlation algorithm to obtain the time delays of different sound sources, and the flow of the integral improved generalized cross-correlation algorithm is shown in fig. 2. The method specifically comprises the following steps:

wherein tau is time delay,

The sliding window algorithm is carried out on each line of the image data, the mean value in the window is calculated, the value is assigned to the element at the center of the window, and then the cross-correlation function G is carried out _τt Calculating (1);

Finding out using peak detection algorithm

s

And 4, step 4: carrying out nonnegative matrix decomposition on the magnitude spectrum to obtain a dictionary matrix and a coefficient matrix, and specifically comprising the following steps:

step 4.1: initialization of the dictionary matrix: from the magnitude spectrum V _ft Selecting a plurality of column vectors with the maximum infinite norm, averaging the column vectors to be used as columns of a dictionary matrixThe infinite norm of a column is defined as follows:

step 4.2: initializing the coefficient matrix by a random initialization mode;

wherein the content of the first and second substances,

the dictionary matrix obtained for the mth iteration is calculated,

the coefficient matrix obtained for the mth iteration is calculated.

And 5: combining an integral improvement generalized cross-correlation algorithm with a non-negative matrix decomposition algorithm to generate a mask matrix, which specifically comprises the following steps:

step 5.1: the dictionary matrix generated using the non-negative matrix decomposition defines a new frequency weighting function as follows:

wherein the content of the first and second substances,

is a new frequency weighting function;

step 5.2: weighting the new frequency function

wherein the content of the first and second substances,

is a new cross-correlation function;

Attributing the dictionary atom to

wherein M is _dt For the mask matrix, s denotes different sound sources.

and 7: carrying out inverse nonnegative matrix decomposition, and multiplying the dictionary matrix and the separated coefficient matrix to obtain the amplitude spectrums of the different separated sound sources;

and 8: combining the amplitude spectrum and the angle spectrum of the different separated sound sources, and performing inverse short-time Fourier transform to obtain time domain information of the different separated sound sources, so as to complete separation; the method specifically comprises the following steps:

wherein the content of the first and second substances,

i.e. sound signals representing separate different actuators,

The present invention is further illustrated by the following examples.

Example 1:

the method selects the double-sound-source audio mixed signal acquired on site in a laboratory, and the output data is a single-channel audio signal after separation.

Firstly, the audio mixed signal is collected, a microphone array and two loudspeakers are used, and the loudspeakers simulate the sound emitted by the transmission device, namely simulate the scene that the two transmission devices work simultaneously. And enabling the loudspeaker to play audio simultaneously, and acquiring by the microphone array. And then, setting parameters of the program, wherein the method adopts short-time Fourier transform to perform time-frequency analysis, the width of a window function is set to be 1024, and the overlapping proportion of windows is 87.5%. The method adopts a generalized cross-correlation algorithm to estimate the arrival time difference, and divides the arrival time difference into 128 intervals; setting the size of the dictionary atoms in the nonnegative matrix decomposition, namely the length of the dictionary matrix column to be 128, and solving the upper limit of the iteration times of the nonnegative matrix decomposition by an iterative method to be 100 times; the distance between the microphones is set to 8cm as a real example.

Fig. 3a is a time domain waveform comparison graph of the source signal 1 in example 1 and the source signal 1 separated by the method of the present invention: FIG. 3b is a graph comparing the time-frequency spectrum of the source signal 1 in example 1 and the source signal 1 separated by the method of the present invention; fig. 3c is a time domain waveform comparison of the source signal 2 of example 1 and the source signal 2 isolated by the method of the present invention: FIG. 3d is a comparison graph of the time-frequency spectrum of the source signal 2 in example 1 and the source signal 2 separated by the method of the present invention;

3a-3d, it can be seen that the transmission signal separation method based on the improved generalized cross-correlation provided by the present invention can effectively separate a mixed signal into several clean source signals, and has excellent separation capability in both time domain and frequency domain, and better robustness.

Further, the audio data set used is kept unchanged, and an ICA method is adopted for comparison experiments, and fig. 4a is a time domain waveform comparison graph of the source signal 1 separated by the source signal 1 and the ICA method in example 1; FIG. 4b is a graph comparing the time-frequency spectrum of the source signal 1 separated by the ICA method and the source signal 1 in example 1; FIG. 4c is a comparison graph of the time domain waveforms of the source signal 2 and the separated source signal 2 by the ICA method in example 1; fig. 4d is a time-frequency map comparing the source signal 2 separated by the ICA method and the source signal 2 in example 1.

The experimental results of fig. 4a-4d show that the result of separating the mixed signals by the ICA method is poor, the signal-to-noise ratio is low, and the effect is inferior to that of the transmission signal separation method based on the improved generalized cross-correlation provided by the invention. Compared with an ICA method, the blind source separation method formed by combining the improved generalized cross-correlation with the non-negative matrix decomposition method can effectively solve the signal separation problem of the transmission device, and has great practical significance and application value.

Example 2:

in order to further prove the advancement of the transmission device signal separation method based on the improved generalized cross-correlation, the multi-sound-source mixed signal is collected for 1.5 hours and is divided into two audio data sets according to different sources (two sound sources or three sound sources), and each audio data set comprises 270 segments of two-channel audio signals of 10 seconds. The method, the ICA method and the PCA principal component analysis method are respectively tested on the two audio data, and the test results are compared. The BSS-EVAL evaluation tool kit widely applied in the field of blind Source separation is used for quantitative evaluation of the method, and three indexes are used, namely an artifact rate (SAR), a Distortion rate (SDR) and an interference rate (SIR).

According to fig. 5 and 6, it can be seen that the method provided by the present invention has excellent performance on various evaluation indexes on two audio data sets, and on different audio data sets, each index SAR, SDR, SIR is superior to the other two methods. By combining the performances of two data sets, the multi-transmission device signal separation method based on the improved generalized cross-correlation has the best performance and the best robustness in three methods.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the scope of the present invention, which is defined by the appended claims.

Claims

1. A method for separating transmission device signals based on integral improved generalized cross-correlation is characterized by comprising the following steps:

2. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 1, wherein the step 1 is specifically as follows:

3. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 1, wherein the step 2 is specifically as follows:

4. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 3, wherein the step 3 is specifically as follows:

wherein tau is time delay,

Finding out using peak detection algorithm

The time delay corresponding to the abscissa of the peak value is the calculated time delay tau _S (s 1,2.. n), s represents different sound sources, and n represents the number of sound sources.

5. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 4, wherein the step 4 is specifically as follows:

step 4.2: initializing the coefficient matrix by a random initialization mode;

wherein the content of the first and second substances,

the dictionary matrix obtained for the mth iteration is calculated,

the coefficient matrix obtained for the mth iteration is calculated.

6. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 5, wherein the step 5 is specifically as follows:

wherein the content of the first and second substances,

is a new frequency weighting function;

step 5.2: weighting the new frequency function

Integral modified generalized cross-correlation algorithm definition substituted into step 3The formula realizes the aim of combining the integral improvement generalized cross-correlation algorithm with the non-negative matrix decomposition, and obtains the following formula:

wherein the content of the first and second substances,

is a new cross-correlation function;

Attributing the dictionary atom to

wherein M is _dt For the mask matrix, s denotes different sound sources.

7. The method for separating the signals of the transmission device based on the integral improved generalized cross-correlation as claimed in claim 5, wherein the step 8 is specifically as follows:

wherein the content of the first and second substances,

i.e. sound signals representing separate different actuators,