WO2015060375A1

WO2015060375A1 - Biological sound signal processing device, biological sound signal processing method, and biological sound signal processing program

Info

Publication number: WO2015060375A1
Application number: PCT/JP2014/078182
Authority: WO
Inventors: 智弥酒井; 千弥善安; 末治宮原
Original assignee: 国立大学法人長崎大学
Priority date: 2013-10-23
Filing date: 2014-10-23
Publication date: 2015-04-30
Also published as: JPWO2015060375A1

Abstract

[Problem] To make it possible to more accurately distinguish continuous rhonchi from discontinuous rales in lung sounds of humans, etc. [Solution] The biological sound detection signal processing device (90) comprises a robust principal component analysis unit (40), a continuous sound processing unit (20), and a discontinuous sound processing unit (30). The robust principal component analysis unit (40) receives original biological sounds with an input section (12), performs Fourier transforms in a Fourier transform section (14), and performs robust principal component analysis of original sound matrices generated by a matrix-generating section (16). When a sparse matrix obtained from robust principal component analysis is processed with the continuous sound processing unit (20), continuous biological sounds are obtained from the original sounds. When a low rank matrix obtained from robust principal component analysis is processed with the discontinuous sound processing unit (30), biological sounds in which continuous biological sounds have been excluded from the original sounds are obtained.

Description

Biological sound signal processing apparatus, biological sound signal processing method, and biological sound signal processing program

The present invention relates to a biological sound signal processing device, a biological sound signal processing method, and a biological sound signal processing program for processing biological sounds such as lung sounds.

In recent years, diagnosis support devices have been under development that convert lung sound signals (pulmonary sound signals) obtained by electronic stethoscopes into digital data, analyze the data, and use the analysis results for diagnosis (sound diagnosis).

Lung sounds are roughly divided into breath sounds and adventitious sounds, which are abnormal sounds. Adventitious sounds are further divided into rales and other sounds, and rales are further divided into intermittent rales and continuous rales. Intermittent rales include coarse crackles and fine crackles, and continuous rales include wheezes (whistle sounds) and rhonchi (snoring sounds).

A method is known in which lung sounds are classified into normal breath sounds and continuous rales using fast Fourier transform and its inverse transform (see, for example, Patent Document 1). In this method, first, an amplitude spectrum and a power spectrum are calculated by performing a fast Fourier transform (FFT) on a time waveform of a lung sound. Next, an inverse FFT process is performed on the amplitude spectrum at a point where the local dispersion value of the power spectrum exceeds the threshold value. In this way, normal breath sounds and continuous rales can be distinguished.

A technique for separating breath sounds and intermittent rales from lung sounds is also known (see, for example, Non-Patent Document 1). In this technique, lung sounds are separated based on a sparse representation that expresses the lung sound as simply as possible as the sum of a Fourier-synthesized signal f (t) and a wavelet-synthesized signal w (t). In this sparse representation, f (t) is classified as breath sound and w (t) as intermittent rales.

Patent Document 1: JP 2004-357758 A

The method of separating lung sounds into normal breath sounds and continuous rales using the fast Fourier transform and its inverse is a kind of filtering in which frequency bands are selected adaptively. For this reason, when the two kinds of sound share the same frequency components, they cannot be distinguished. In addition, the signal treated as normal breath sound may include intermittent rales, which are abnormal sounds.

Also, in the method of separating lung sounds based on a sparse representation that expresses the lung sound as simply as possible as the sum of a Fourier-synthesized signal and a wavelet-synthesized signal, if the original lung sound signal contains continuous rales, the continuous rales are not necessarily classified into either f (t) or w (t). For this reason, the method cannot support analysis of continuous rales.

Thus, in the conventional technology, lung sound signals including various abnormal sounds cannot be accurately classified.

Therefore, an object of the present invention is to enable more accurate separation of continuous and intermittent rales from lung sounds of humans and the like.

To achieve the above object, the present invention provides a biological sound signal processing apparatus comprising: a robust principal component analysis unit that decomposes an original sound matrix representing an original sound spectrogram of a biological sound signal into a sparse matrix and a low rank matrix by robust principal component analysis; a continuous sound processing unit that converts the sparse matrix to obtain continuous biological sound from the biological sound signal; and a discontinuous sound processing unit that converts the low rank matrix to obtain biological sound in which continuous biological sound has been excluded from the biological sound signal.

The present invention also provides a biological sound signal processing method comprising: a robust principal component analysis step of decomposing an original sound matrix representing an original sound spectrogram of a biological sound signal into a sparse matrix and a low rank matrix by robust principal component analysis; a second step of converting the sparse matrix to obtain continuous biological sound from the biological sound signal; and a third step of converting the low rank matrix to obtain biological sound in which continuous biological sound has been excluded from the biological sound signal.

The present invention also provides a biological sound signal processing program that causes a computer to function as: robust principal component analysis means for decomposing an original sound matrix representing an original sound spectrogram of a biological sound signal into a sparse matrix and a low rank matrix by robust principal component analysis; continuous sound processing means for converting the sparse matrix to obtain continuous biological sound from the biological sound signal; and discontinuous sound processing means for converting the low rank matrix to obtain biological sound in which continuous biological sound has been excluded from the biological sound signal.

According to the present invention, continuous rales and intermittent rales can be separated more accurately from the lung sounds of humans and the like.

FIG. 1 is a block diagram of a first embodiment of the biological sound signal processing apparatus according to the present invention.
FIG. 2 is a flowchart of the biological sound signal processing method in the first embodiment.
FIG. 3 is a graph of the original sound processed in the first embodiment.
FIG. 4 is a diagram in which the values of the elements of the matrix holding the original sound spectrogram, obtained by a short-time Fourier transform of the original sound signal in the first embodiment, are represented by shading.
FIG. 5 is a diagram in which the values of the elements of the low rank matrix obtained by the robust principal component analysis in the first embodiment are represented by shading.
FIG. 6 is a diagram in which the values of the elements of the sparse matrix obtained by the robust principal component analysis in the first embodiment are represented by shading.
FIG. 7 is a graph of the lung sound signal obtained by an inverse Fourier transform of the sparse matrix in the first embodiment.
FIG. 8 is a graph of the lung sound signal obtained by an inverse Fourier transform of the low rank matrix in the first embodiment.
FIG. 9 is a graph of the Fourier component extracted from the lung sound signal obtained by an inverse Fourier transform of the low rank matrix in the first embodiment.
FIG. 10 is a graph of the wavelet component extracted from the lung sound signal obtained by an inverse Fourier transform of the low rank matrix in the first embodiment.
FIG. 11 is a block diagram of a second embodiment of the biological sound signal processing apparatus according to the present invention.
FIG. 12 is a flowchart of the biological sound signal processing method in the second embodiment.
FIG. 13 is a block diagram of a third embodiment of the biological sound signal processing apparatus according to the present invention.
FIG. 14 is a flowchart of the biological sound signal processing method in the third embodiment.

Several embodiments of the biological sound signal processing apparatus according to the present invention will be described with reference to the drawings. These embodiments are merely examples, and the present invention is not limited to them. The same or similar components are denoted by the same reference numerals, and redundant description is omitted.

[First Embodiment]
FIG. 1 is a block diagram of a first embodiment of a biological sound signal processing apparatus according to the present invention.

The biological sound signal processing device 90 includes a preliminary processing unit 10, a robust principal component analysis unit 40, a sparse matrix storage unit 22, a low rank matrix storage unit 32, a continuous sound processing unit 20, and a discontinuous sound processing unit 30. The biological sound signal processing device 90 is constructed on, for example, one computer. It may also be constructed on a plurality of computers connected by a network.

The preliminary processing unit 10 includes an input unit 12, a Fourier transform unit 14, and a matrix generation unit 16. The continuous sound processing unit 20 includes a continuous sound spectrogram generation unit 24 and a first inverse Fourier transform unit 26. The discontinuous sound processing unit 30 includes a discontinuous sound spectrogram generation unit 34, a second inverse Fourier transform unit 36, and a signal extraction unit 38.

The preliminary processing unit 10 generates an original sound matrix that represents an original sound spectrogram of a biological sound. The biological sound is detected by a biological sound detection device such as an electronic stethoscope (not shown), and is supplied to the biological sound signal processing device 90 as an electrical signal.

The robust principal component analysis unit 40 performs a robust principal component analysis on the original sound matrix generated by the preliminary processing unit 10 to obtain a sparse matrix and a low rank matrix. The sparse matrix is stored in the sparse matrix storage unit 22. The low rank matrix is stored in the low rank matrix storage unit 32.

The continuous sound processing unit 20 processes the sparse matrix stored in the sparse matrix storage unit 22 to generate the continuous sound contained in the original sound. The discontinuous sound processing unit 30 processes the low rank matrix stored in the low rank matrix storage unit 32 to generate the discontinuous sound contained in the original sound.

The continuous sound and the discontinuous sound generated by the continuous sound processing unit 20 and the discontinuous sound processing unit 30 are, for example, D/A converted and output from a speaker (not shown). The waveforms of the continuous sound and the discontinuous sound may be displayed on a display. Alternatively, the continuous sound signal and the discontinuous sound signal may be transmitted to an external device, which then performs abnormality detection or the like.

Next, a biological sound signal processing method using this biological sound signal processing apparatus 90 will be described.

FIG. 2 is a flowchart of the biological sound signal processing method in the present embodiment.

First, the input unit 12 captures a signal representing a biological sound detected by a biological sound detection device such as an electronic stethoscope (not shown). The detected biological sound is called the original sound, and the signal that electrically represents the original sound is referred to as the original sound signal s (t). The biological sound is, for example, a human lung sound. When the original sound signal is an analog signal, the input unit 12 performs A/D conversion to convert the original sound signal into digital data.

FIG. 3 is a graph of lung sound signals to be processed in the present embodiment. In FIG. 3, the horizontal axis represents elapsed time (seconds), and the vertical axis represents signal intensity (amplitude).

In this embodiment, the lung sound recorded on the 60th track of the CD in the appendix of Non-Patent Document 2 is used as the original sound.

Next, the Fourier transform unit 14 performs a short-time Fourier transform on the original sound signal s (t) to obtain an original sound spectrogram, that is, a representation of the signal by complex quantities in the time-frequency domain (hereinafter referred to as a spectrogram) (step 1).

More specifically, the original sound spectrogram S (ω, t) is obtained by subjecting the original sound signal s (t) multiplied by the time window function to a discrete Fourier transform. The original sound spectrogram S (ω, t) has a complex value and represents the amplitude and phase of the component of the angular frequency ω constituting the signal at time t representing the position of the time window function.

Time t takes discrete values spaced by the time width Δt2 by which the time window function is shifted. The shift width Δt2 is assumed not to exceed the time window width Δt1, that is, Δt2 ≤ Δt1. Further, the angular frequency ω is discretized at an interval proportional to the reciprocal of the time window width Δt1.

Thereafter, the matrix generation unit 16 creates an original sound matrix D whose elements are the amplitudes | S (ω, t) | of the original sound spectrogram S (ω, t) (step 2). The row number i and the column number j of the original sound matrix D correspond to the i-th angular frequency ω_i and the j-th time t_j. The element D_ij in the i-th row and j-th column of the matrix D is the absolute value of the complex number S (ω_i, t_j) constituting the original sound spectrogram S (ω, t).
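Steps 1 and 2 can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the Hamming window, the window length of 256 samples (Δt1), the shift of 64 samples (Δt2), and the synthetic 200 Hz test tone are all assumptions chosen for the example.

```python
import numpy as np

def stft(s, win_len, hop):
    """Short-time Fourier transform of a real signal s.

    win_len plays the role of the window width (delta t1) and hop the
    shift width (delta t2), both in samples. Returns a complex array of
    shape (win_len // 2 + 1, n_frames): rows index the angular
    frequency omega_i, columns index the time t_j.
    """
    assert 0 < hop <= win_len
    window = np.hamming(win_len)  # window choice is an assumption
    n_frames = 1 + (len(s) - win_len) // hop
    frames = np.stack(
        [s[j * hop : j * hop + win_len] * window for j in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames, axis=0)

# Hypothetical test signal: a 200 Hz tone sampled at 4 kHz for one second.
fs = 4000
t = np.arange(fs) / fs
s = np.sin(2 * np.pi * 200 * t)

S = stft(s, win_len=256, hop=64)  # complex original sound spectrogram S(omega, t)
D = np.abs(S)                     # original sound matrix, D_ij = |S(omega_i, t_j)|
theta = np.angle(S)               # arguments theta_ij, kept for later resynthesis
```

Only the amplitude matrix `D` is passed on to the robust principal component analysis in this embodiment; the arguments `theta` are retained so that a complex spectrogram can be rebuilt afterwards.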

FIG. 4 is a diagram in which the values of the elements of the original sound matrix having the values of the original sound spectrogram obtained by performing a short-time Fourier transform on the original sound signal in this embodiment are represented by shading.

In step 3, the robust principal component analysis unit 40 decomposes the original sound matrix D into the sum of a low rank matrix A and a sparse matrix E. Here, following Non-Patent Document 3, such a matrix decomposition is called robust principal component analysis. As an algorithm for decomposing a matrix into the sum of a low rank matrix and a sparse matrix, Non-Patent Document 4, for example, proposes a fast-converging algorithm that improves on the augmented Lagrangian method.

In normal principal component analysis, a given matrix is approximated by a low rank matrix. The low rank matrix is constructed by the product of eigenvectors associated with only the main eigenvalues (or singular values) of a given matrix.

On the other hand, in the robust principal component analysis used in the present embodiment, a given original sound matrix D is approximated by a low rank matrix A while allowing only some of its elements to be modified. The original sound matrix D is decomposed into the sum of a low rank matrix A and a sparse matrix E representing the correction amounts. In doing so, both the rank of the matrix A and the number of corrected elements (the number of non-zero elements of the matrix E) are made as small as possible. The uniqueness of the solution in this case is disclosed in Non-Patent Document 3.
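The decomposition D = A + E can be sketched as below. This is a hedged illustration rather than the algorithm of Non-Patent Document 4: it assumes the commonly used convex relaxation (nuclear norm of A plus λ times the sum of absolute values of E) solved by a simple inexact augmented Lagrangian iteration. The function names, the default λ = 1/sqrt(max(m, n)), the initial penalty `mu`, and the growth factor `rho` are assumptions, and the test matrix is synthetic.

```python
import numpy as np

def soft_threshold(X, tau):
    """Element-wise shrinkage: moves every entry toward zero by tau."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)

def singular_value_threshold(X, tau):
    """Shrinks the singular values of X by tau, which lowers its rank."""
    U, sing, Vh = np.linalg.svd(X, full_matrices=False)
    return (U * soft_threshold(sing, tau)) @ Vh

def rpca(D, lam=None, tol=1e-7, max_iter=500):
    """Split D into a low rank part A and a sparse part E with D ~ A + E."""
    m, n = D.shape
    lam = 1.0 / np.sqrt(max(m, n)) if lam is None else lam  # assumed default
    fro = np.linalg.norm(D)       # Frobenius norm, used for the stop test
    spec = np.linalg.norm(D, 2)   # largest singular value
    Y = D / max(spec, np.abs(D).max() / lam)  # Lagrange multiplier estimate
    mu, rho = 1.25 / spec, 1.5                # assumed penalty schedule
    A = np.zeros_like(D)
    E = np.zeros_like(D)
    for _ in range(max_iter):
        A = singular_value_threshold(D - E + Y / mu, 1.0 / mu)
        E = soft_threshold(D - A + Y / mu, lam / mu)
        residual = D - A - E
        Y = Y + mu * residual
        mu *= rho
        if np.linalg.norm(residual) <= tol * fro:
            break
    return A, E

# Synthetic check: a rank-2 matrix plus about 3 % large isolated corrections.
rng = np.random.default_rng(0)
L0 = rng.standard_normal((80, 2)) @ rng.standard_normal((2, 80))
S0 = np.where(rng.random((80, 80)) < 0.03,
              rng.choice([-5.0, 5.0], size=(80, 80)), 0.0)
D = L0 + S0
A, E = rpca(D)
```

In the embodiment, `D` would be the amplitude matrix of the original sound spectrogram, `A` would be stored in the low rank matrix storage unit 32, and `E` in the sparse matrix storage unit 22.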

FIG. 5 is a diagram in which the values of the elements of the low rank matrix obtained by the robust principal component analysis in the present embodiment are represented by shading. FIG. 6 is a diagram in which the values of the elements of the sparse matrix obtained by the robust principal component analysis in the present embodiment are represented by shading.

The low rank matrix A holds the components of the matrix D that can be easily constructed from products of the principal eigenvectors. Therefore, as shown in FIG. 5, the rows or columns of the low rank matrix A tend to exhibit patterns in which similar structures appear repeatedly, such as the vertical and horizontal stripe patterns visible in FIG. 4, in which the values of the elements of the original sound matrix are represented by shading.

On the other hand, the sparse matrix E has the components excluded to approximate the original sound matrix D by such a low rank matrix A as elements. Therefore, as shown in FIG. 6, the sparse matrix E exhibits an arbitrary curved or spotted pattern having no regularity such as vertical stripes and horizontal stripes.

In step 4, a continuous sound spectrogram E (ω, t) corresponding to the sparse matrix E obtained in step 3 is generated. The continuous sound spectrogram generation unit 24 reads the sparse matrix E from the sparse matrix storage unit 22 and generates the continuous sound spectrogram E (ω, t). Each complex number E (ω_i, t_j) constituting the continuous sound spectrogram E (ω, t) has the corresponding element of the sparse matrix as its amplitude and the argument of the original sound spectrogram as its argument. That is, the complex number E (ω_i, t_j) is obtained from the element E_ij in the i-th row and j-th column of the matrix E and the argument θ_ij of the complex number S (ω_i, t_j) constituting the original sound spectrogram S (ω, t) by the following equation, in which the i multiplying the sine term is the imaginary unit.

$$E(\omega_i, t_j) = E_{ij}\,(\cos\theta_{ij} + \mathrm{i}\,\sin\theta_{ij})$$
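The equation above amounts to reattaching the phase of the original spectrogram to a real-valued matrix. A minimal sketch follows; the helper name `attach_phase` and the 2 × 2 toy values are hypothetical and serve only to illustrate the arithmetic.

```python
import numpy as np

def attach_phase(magnitude, S):
    """Complex spectrogram whose amplitude is `magnitude` and whose
    argument is the argument of the original spectrogram S."""
    theta = np.angle(S)  # theta_ij, the argument of S(omega_i, t_j)
    return magnitude * (np.cos(theta) + 1j * np.sin(theta))

# Toy values, not taken from the embodiment.
S = np.array([[3 + 4j, -2j], [1 + 0j, -1 - 1j]])  # original sound spectrogram
E = np.array([[1.0, 0.0], [0.5, 2.0]])            # elements E_ij of the sparse matrix
E_spec = attach_phase(E, S)                       # continuous sound spectrogram
```

The same helper applies unchanged to the low rank matrix A when the discontinuous sound spectrogram is generated.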

In step 5, the first inverse Fourier transform unit 26 performs a short-time inverse Fourier transform on the continuous sound spectrogram E (ω, t) to obtain a lung sound signal e (t). More specifically, the complex spectrum E (ω, t_j) at each time t_j of the continuous sound spectrogram E (ω, t) is subjected to an inverse Fourier transform to obtain the lung sound signal within the time window function at each time t_j. The lung sound signal e (t) is then obtained by averaging these lung sound signals according to the overlap of the time window functions.

FIG. 7 is a graph of the lung sound signal e (t) obtained by performing an inverse Fourier transform on the sparse matrix in the present embodiment.

FIG. 7 shows the lung sound signal e (t), which is output from the first inverse Fourier transform unit 26 as digital data, displayed in the manner of an analog signal. When the lung sound signal e (t) is D/A converted by a biological sound output unit (not shown) and reproduced as a lung sound by a speaker (not shown) or the like, the continuous sound is heard.
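The short-time inverse transform with averaging over overlapping windows can be sketched as follows. This is one plausible reading of the averaging step, stated as an assumption: each column is inverse transformed, the resulting windowed segments are summed at their time positions, and the sum is divided by the accumulated window weight. The Hamming window, window length, shift, and random test signal are illustrative choices.

```python
import numpy as np

def stft(s, window, hop):
    """Forward short-time Fourier transform (rows: frequency, columns: time)."""
    win_len = len(window)
    n_frames = 1 + (len(s) - win_len) // hop
    frames = np.stack(
        [s[j * hop : j * hop + win_len] * window for j in range(n_frames)],
        axis=1,
    )
    return np.fft.rfft(frames, axis=0)

def istft(S, window, hop):
    """Inverse transform of each column, then overlap-weighted averaging."""
    win_len = len(window)
    n_frames = S.shape[1]
    length = win_len + hop * (n_frames - 1)
    numerator = np.zeros(length)    # sum of the windowed segments
    weight = np.zeros(length)       # sum of the overlapping window values
    segments = np.fft.irfft(S, n=win_len, axis=0)
    for j in range(n_frames):
        span = slice(j * hop, j * hop + win_len)
        numerator[span] += segments[:, j]
        weight[span] += window
    return numerator / np.maximum(weight, 1e-12)

# Round trip on a hypothetical random signal.
rng = np.random.default_rng(0)
s = rng.standard_normal(1000)
window = np.hamming(128)            # strictly positive, so weight never vanishes
S = stft(s, window, hop=32)
rec = istft(S, window, hop=32)      # covers the first 992 samples of s
```

When the spectrogram passed to `istft` is the unmodified transform of the signal, the original samples are recovered; applied to the continuous sound spectrogram, the same routine yields e (t).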

In the present embodiment, the original sound signal s (t) used as the input to the processing is a lung sound containing continuous rales (a high-pitched whistle sound), taken from the 60th track of the CD in the appendix of Non-Patent Document 2. The continuous rales exhibit a curved pattern in FIG. 4, which represents the spectrogram of the original sound signal s (t), and are clearly separated into the spectrogram represented by the sparse matrix E shown in FIG. 6. From this, it can be seen that the continuous sound is well separated as the lung sound signal e (t) of step 5 by the processing from step 1 to step 5 described above.

The continuous rales used in this example are high-pitched whistle sounds, but low-pitched snoring sounds can also be separated. As in the present embodiment, if the spectrogram of the sounds other than continuous rales (breath sounds and intermittent rales) is easily represented by the low rank matrix A, the spectrogram of the continuous rales is separated, in the robust principal component analysis of step 3, as the component (sparse matrix E) excluded in order to approximate the matrix D by the low rank matrix A.

In step 6, the discontinuous sound spectrogram generation unit 34 generates the discontinuous sound spectrogram A (ω, t) corresponding to the low rank matrix A separated in step 3. The generation method is the same as in step 4. That is, each complex number A (ω_i, t_j) constituting the discontinuous sound spectrogram A (ω, t) has the corresponding element of the low rank matrix as its amplitude and the argument of the original sound spectrogram as its argument. It is obtained from the element A_ij in the i-th row and j-th column of the matrix A and the argument θ_ij of the complex number S (ω_i, t_j) constituting the original sound spectrogram S (ω, t) by the following equation, in which the i multiplying the sine term is the imaginary unit.

$$A(\omega_i, t_j) = A_{ij}\,(\cos\theta_{ij} + \mathrm{i}\,\sin\theta_{ij})$$

In step 7, the second inverse Fourier transform unit 36 performs a short-time inverse Fourier transform on the discontinuous sound spectrogram A (ω, t) to obtain a lung sound signal a (t). The method is the same as in step 5. That is, the complex spectrum A (ω, t_j) at each time t_j of the discontinuous sound spectrogram A (ω, t) is subjected to an inverse Fourier transform to obtain the lung sound signal within the time window function at each time t_j. The lung sound signal a (t) is then obtained by averaging these lung sound signals according to the overlap of the time window functions.

FIG. 8 is a graph of the lung sound signal a (t) obtained by performing inverse Fourier transform on the low rank matrix A in the present embodiment.

FIG. 8 shows the lung sound signal output from the second inverse Fourier transform unit 36 as digital data by simulating and displaying it as an analog signal. When the lung sound signal a (t) is D / A converted by a biological sound output unit (not shown) and reproduced as a lung sound by a speaker or the like (not shown), a sound obtained by removing the continuous sound from the original sound signal s (t) is obtained.

In step 8, the signal extraction unit 38 further extracts the lung sound signal f (t) and the lung sound signal w (t) from the lung sound signal a (t) by the method of Non-Patent Document 1. The method of Non-Patent Document 1 extracts lung sounds based on a sparse representation. Here, the lung sound signal a (t) is expressed as the sum of a Fourier-synthesized signal f (t) and a wavelet-synthesized signal w (t). In doing so, both the number of non-zero Fourier components and the number of non-zero wavelet components are made as small as possible. An algorithm for this purpose is disclosed, for example, in Non-Patent Document 1.
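The criterion of step 8 can be written compactly as below. This is an illustrative formulation only: the synthesis matrices \Phi_F (Fourier) and \Phi_W (wavelet) and the coefficient vectors c_F and c_W are notation introduced here, and adding the two counts with equal weight is an assumption. The actual algorithm is the one given in Non-Patent Document 1.

```latex
\min_{c_F,\, c_W} \; \|c_F\|_0 + \|c_W\|_0
\quad \text{subject to} \quad
a = \Phi_F c_F + \Phi_W c_W,
\qquad f = \Phi_F c_F, \quad w = \Phi_W c_W
```

Here \|\cdot\|_0 counts the non-zero entries of a vector, a is the sampled lung sound signal a (t), and f and w are the sampled signals f (t) and w (t).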

FIG. 9 is a graph of the lung sound signal f (t), which is the Fourier component extracted from the lung sound signal a (t) in the present embodiment.

FIG. 9 shows the lung sound signal f (t) output from the signal extraction unit 38 as digital data by simulating it as an analog signal. When the lung sound signal f (t) is D / A converted by a biological sound output unit (not shown) and reproduced as a lung sound by a speaker (not shown), a respiratory sound in the original sound signal s (t) is output.

FIG. 10 is a graph of the lung sound signal w (t), which is the wavelet component extracted from the lung sound signal a (t) in the present embodiment.

FIG. 10 shows the lung sound signal w (t), which is output from the signal extraction unit 38 as digital data, displayed in the manner of an analog signal. When the lung sound signal w (t) is D/A converted by a biological sound output unit (not shown) and reproduced as a lung sound by a speaker (not shown) or the like, the intermittent rales in the original sound signal s (t) are output.

In the present embodiment, the original sound signal s (t) is continuous rale data (a high-pitched whistle sound) taken from the 60th track of the CD in the appendix of Non-Patent Document 2. However, it can be confirmed from the waveform of the original sound signal s (t) shown in FIG. 3 that intermittent sounds are mixed in with the continuous rales. Further, the spectrogram of the original sound signal s (t) shown in FIG. 4 confirms the presence, in addition to the continuous rales that exhibit a curved pattern, of intermittent sounds that exhibit a vertical stripe pattern (intermittent rales) and of sound that persists in a low frequency band (breath sound). These sounds other than the continuous rales are clearly separated into the spectrogram represented by the low rank matrix A shown in FIG. 5. From this, it can be seen that the breath sounds and the intermittent rales are extracted satisfactorily by the processing of steps 1 to 3 and steps 6 to 8.

Thus, according to this embodiment, it is possible to more accurately classify respiratory sounds, continuous rales, and intermittent rales from lung sounds of humans and the like.

In the present embodiment, breath sounds, continuous rales, and intermittent rales are separated and extracted from human lung sounds. When a biological sound contains sounds similar to continuous rales and intermittent rales, those sounds can be separated and extracted in the same way as in the present embodiment.

[Second Embodiment]
FIG. 11 is a block diagram of a second embodiment of the biological sound signal processing device according to the present invention. FIG. 12 is a flowchart of the biological sound signal processing method in the present embodiment.

This embodiment differs from the first embodiment in that complex matrices are used throughout the processing. For this reason, the matrix generation unit 16 of the first embodiment is not present in the biological sound signal processing device of this embodiment.

In the present embodiment, step 2 of the first embodiment is omitted, and in step 3 the robust principal component analysis unit 40 performs the robust principal component analysis directly on the original sound spectrogram S (ω, t) to obtain the low rank matrix A and the sparse matrix E. Steps 5 and 7, which obtain the separated lung sound signals, and step 8 and the subsequent steps are the same as in the first embodiment.

The present embodiment omits the steps of the first embodiment that convert the complex matrix into a real matrix and convert it back into a complex matrix after the robust principal component analysis. Because a matrix of complex numbers representing both amplitude and phase is separated, the separation performance can be improved compared with the first embodiment, in which only the amplitude is separated. Note that, because the matrix subjected to the robust principal component analysis is a complex matrix, the calculation time is slightly longer than in the first embodiment. However, if an appropriate solution method is used, the approach is sufficiently practical.
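If the robust principal component analysis is solved by a shrinkage-based iteration (an assumption; this document does not specify the solver), handling a complex matrix mainly requires replacing the real-valued element-wise shrinkage with one that reduces the modulus and leaves the argument unchanged. The function name `shrink_complex` is hypothetical.

```python
import numpy as np

def shrink_complex(X, tau):
    """Element-wise shrinkage for complex (or real) arrays.

    Each entry keeps its argument (phase) while its modulus is reduced
    by tau; entries whose modulus is at most tau become zero.
    """
    mag = np.abs(X)
    scale = np.maximum(mag - tau, 0.0) / np.maximum(mag, 1e-300)
    return X * scale

z = shrink_complex(np.array([3 + 4j, 0.1j, 0j]), 1.0)  # moduli 5, 0.1, 0
x = shrink_complex(np.array([-3.0, 0.5]), 1.0)         # real input also works
```

The entry 3 + 4i has modulus 5, so shrinking by 1 scales it by 4/5 and preserves its direction in the complex plane, which is how phase information survives the decomposition in this kind of solver.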

[Third Embodiment]
FIG. 13 is a block diagram of a third embodiment of the biological sound signal processing device according to the present invention. FIG. 14 is a flowchart of the biological sound signal processing method in the present embodiment.

This embodiment is different from the first embodiment in that short-time cosine transform and short-time inverse cosine transform are used instead of short-time Fourier transform and short-time inverse Fourier transform. In the biological sound signal processing device of the present embodiment, a short-time cosine transform unit 41 is provided instead of the short-time Fourier transform unit 14 in the first embodiment.

This embodiment also differs from the first embodiment in that real matrices are used throughout the processing. For this reason, the matrix generation unit 16 of the first embodiment is not present in the biological sound signal processing device of this embodiment.

In the present embodiment, step 2 of the first embodiment is omitted, and in step 3 the robust principal component analysis unit 50 performs the robust principal component analysis directly on the real-valued original sound spectrogram S (ω, t) obtained by the short-time cosine transform unit 41 to obtain the low rank matrix A and the sparse matrix E. Further, in the biological sound signal processing device of the present embodiment, a short-time inverse cosine transform unit 52 is provided instead of the first inverse Fourier transform unit 26 of the first embodiment, and a short-time inverse cosine transform unit 53 is provided instead of the second inverse Fourier transform unit 36 of the first embodiment.

The cosine transform has the advantage that it can be processed in half the storage area compared to the Fourier transform of the real signal. However, since the process assumes that the signal is an even function, the separation performance may be slightly inferior to that of the other embodiments.
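A short-time cosine transform can be sketched as below. The specific variant (an orthonormal DCT-II built as an explicit matrix), the Hamming window, and the frame sizes are assumptions made for illustration; this document does not state which cosine transform is used. The storage comparison is made against the full complex spectrum returned by `np.fft.fft`.

```python
import numpy as np

def dct_matrix(N):
    """Orthonormal DCT-II matrix C: C @ x transforms, C.T @ X inverts."""
    k = np.arange(N)[:, None]   # frequency index
    n = np.arange(N)[None, :]   # sample index
    C = np.sqrt(2.0 / N) * np.cos(np.pi * (n + 0.5) * k / N)
    C[0, :] /= np.sqrt(2.0)
    return C

def frame_signal(s, window, hop):
    """Windowed frames as columns (rows: samples in the window)."""
    win_len = len(window)
    n_frames = 1 + (len(s) - win_len) // hop
    return np.stack(
        [s[j * hop : j * hop + win_len] * window for j in range(n_frames)],
        axis=1,
    )

rng = np.random.default_rng(0)
s = rng.standard_normal(512)          # hypothetical test signal
window = np.hamming(64)
frames = frame_signal(s, window, hop=16)

C = dct_matrix(64)
X = C @ frames                        # real-valued spectrogram, shape (64, 29)
frames_back = C.T @ X                 # short-time inverse cosine transform
S_full = np.fft.fft(frames, axis=0)   # full complex spectrum, same shape
```

Because `X` is real, it can be passed to a real-valued robust principal component analysis without the amplitude and phase handling of the first embodiment, and it occupies half the bytes of `S_full`.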

In any of the first, second, and third embodiments, a matrix U, a matrix K, and a matrix V are obtained from the low rank matrix A by the singular value decomposition A = U K V^T. If the size of the low rank matrix A is m × n and its rank is r, the matrix K is an r-th order diagonal square matrix with the singular values on its diagonal, and the matrices U and V are an m × r matrix and an n × r matrix composed of the r left singular vectors and the r right singular vectors, respectively. Note that V^T is the transpose of V when the matrix V is a real matrix, and the conjugate transpose of V when the matrix V is a complex matrix.

The left singular vector is a basis on which the column vector of the matrix A can be synthesized. The column vector of the low rank matrix A represents the instantaneous frequency spectrum of sounds other than continuous biological sounds, particularly breathing sounds and intermittent sounds. Thus, the left singular vector is the basis for constructing the instantaneous frequency spectrum of these sounds. The right singular vector indicates the breakdown of the instantaneous frequency spectrum at an arbitrary time. That is, the component of the j-th right singular vector represents how much the base of the j-th left singular vector appears at each time.

Therefore, the singular value decomposition of the low rank matrix A yields a matrix U composed of the basis of the instantaneous frequency spectrum and a matrix V representing the breakdown. Since sounds of the same type have similar breakdowns of the instantaneous frequency spectrum, the left and right singular vectors can be applied to classification that identifies breath sounds and intermittent sounds of the same type.
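The decomposition A = U K V^T with the shapes described above can be sketched as follows. The 6 × 8 rank-2 matrix is a synthetic stand-in for the low rank matrix A, and the relative threshold used to count non-zero singular values is an assumption.

```python
import numpy as np

# Synthetic stand-in for the low rank matrix A: m = 6, n = 8, rank 2.
rng = np.random.default_rng(0)
A = rng.standard_normal((6, 2)) @ rng.standard_normal((2, 8))

U_all, sing, Vh_all = np.linalg.svd(A, full_matrices=False)
r = int(np.sum(sing > 1e-10 * sing[0]))  # numerical rank (threshold is an assumption)

U = U_all[:, :r]             # m x r, left singular vectors: spectral basis
K = np.diag(sing[:r])        # r x r, singular values on the diagonal
V = Vh_all[:r, :].conj().T   # n x r, right singular vectors: weight at each time

A_rebuilt = U @ K @ V.conj().T
```

Each column of `U` is one basis spectrum, and the matching column of `V` records how strongly that basis appears at each time t_j, which is the information a classifier of breath sounds and intermittent sounds could use.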

Note that singular value decomposition is a matrix decomposition method for obtaining the principal components of the set of row vectors and the set of column vectors constituting a matrix. Singular value decomposition yields singular values representing the sizes of the principal components and orthonormal bases representing their directions. However, the singular vectors associated with a singular value of zero are not uniquely determined. Corresponding left and right singular vectors are also arbitrary up to sign, or up to multiplication by a complex number of modulus 1. Nevertheless, the low rank matrix A obtained in the present embodiment is uniquely determined, because it is composed of the non-zero singular values and the left and right singular vectors associated with them.

DESCRIPTION OF SYMBOLS 10 ... Preliminary processing part, 12 ... Input part, 14 ... Fourier transform part, 16 ... Matrix generation part, 20 ... Continuous sound processing part, 22 ... Sparse matrix storage part, 24 ... Continuous sound spectrogram generation part, 26 ... First inverse Fourier transform part, 30 ... Discontinuous sound processing part, 32 ... Low rank matrix storage part, 34 ... Discontinuous sound spectrogram generation part, 36 ... Second inverse Fourier transform part, 38 ... Signal extraction part, 40 ... Robust principal component analysis unit, 41 ... Cosine transform unit, 50 ... Robust principal component analysis unit, 52 ... Inverse cosine transform unit, 53 ... Inverse cosine transform unit, 90 ... Biological sound signal processing device

Claims

1. A biological sound signal processing apparatus comprising:
a robust principal component analysis unit that decomposes an original sound matrix representing an original sound spectrogram of a biological sound signal into a sparse matrix and a low rank matrix by robust principal component analysis;
a continuous sound processing unit that converts the sparse matrix to obtain a continuous biological sound from the biological sound signal; and
a discontinuous sound processing unit that converts the low rank matrix to obtain a biological sound in which continuous biological sounds have been removed from the biological sound signal.
2. The biological sound signal processing apparatus according to claim 1, further comprising a Fourier transform unit that obtains the original sound matrix by performing a short-time Fourier transform on the biological sound signal,
wherein the discontinuous sound processing unit includes means for generating a discontinuous sound spectrogram from the low rank matrix, means for generating a discontinuous sound signal by performing a short-time inverse Fourier transform on the discontinuous sound spectrogram, and a signal extraction unit that extracts a Fourier transform signal and a wavelet signal from the discontinuous sound signal.
3. The biological sound processing apparatus according to claim 1 or 2, further comprising a Fourier transform unit that obtains the original sound matrix by performing a short-time Fourier transform on the biological sound signal,
wherein the continuous sound processing unit includes means for generating a continuous sound spectrogram from the sparse matrix, and means for generating a continuous sound signal by performing a short-time inverse Fourier transform on the continuous sound spectrogram.
4. The biological sound signal processing apparatus according to claim 1, further comprising:
a Fourier transform unit that obtains the original sound spectrogram by performing a short-time Fourier transform on the biological sound signal; and
a matrix generation unit that generates the original sound matrix, whose elements are the absolute values of the elements of the original sound spectrogram, with the discretized angular frequency as the row number and the discretized time as the column number.
5. The biological sound signal processing device according to claim 1, wherein the discontinuous sound processing unit includes means for generating a discontinuous sound spectrogram composed of complex numbers having the elements of the low rank matrix as amplitudes and the arguments of the original sound spectrogram as arguments, means for generating a discontinuous sound signal by performing a short-time inverse Fourier transform on the discontinuous sound spectrogram, and signal extracting means for extracting a Fourier transform signal and a wavelet signal from the discontinuous sound signal.
6. The biological sound processing apparatus according to claim 4, wherein the continuous sound processing unit includes means for generating a continuous sound spectrogram composed of complex numbers having the elements of the sparse matrix as amplitudes and the arguments of the original sound spectrogram as arguments, and means for generating a continuous sound signal by performing a short-time inverse Fourier transform on the continuous sound spectrogram.
7. The biological sound signal processing apparatus according to claim 1, further comprising a cosine transform unit that obtains the original sound matrix by performing a short-time cosine transform on the biological sound signal,
wherein the discontinuous sound processing unit includes means for generating a discontinuous sound spectrogram from the low rank matrix, means for generating a discontinuous sound signal by performing a short-time inverse cosine transform on the discontinuous sound spectrogram, and a signal extraction unit that extracts a Fourier transform signal and a wavelet signal from the discontinuous sound signal.
8. The biological sound processing apparatus according to claim 1 or 7, further comprising a cosine transform unit that obtains the original sound matrix by performing a short-time cosine transform on the biological sound signal,
wherein the continuous sound processing unit includes means for generating a continuous sound spectrogram from the sparse matrix, and means for generating a continuous sound signal by performing a short-time inverse cosine transform on the continuous sound spectrogram.
9. The biological sound signal processing apparatus according to claim 1, wherein the discontinuous sound processing unit decomposes the low rank matrix into a product of a singular value matrix and two orthogonal matrices sandwiching the singular value matrix, and extracts a specific discontinuous sound by extracting, from the two orthogonal matrices, a portion that matches a characteristic of the specific discontinuous sound.
10. A biological sound signal processing method comprising:
a robust principal component analysis step of decomposing an original sound matrix representing an original sound spectrogram of a biological sound into a sparse matrix and a low rank matrix by robust principal component analysis;
a second step of converting the sparse matrix to obtain a continuous biological sound from the biological sound signal; and
a third step of converting the low rank matrix to obtain a biological sound in which continuous biological sounds have been excluded from the biological sound signal.
11. A biological sound signal processing program for causing a computer to function as:
robust principal component analysis means for decomposing an original sound matrix representing an original sound spectrogram of a biological sound into a sparse matrix and a low rank matrix by robust principal component analysis;
continuous sound processing means for converting the sparse matrix to obtain a continuous biological sound from the biological sound signal; and
discontinuous sound processing means for converting the low rank matrix to obtain a biological sound in which continuous biological sounds have been excluded from the biological sound signal.