WO2022177045A1

WO2022177045A1 - Device and method for classifying presence of heart disease

Info

Publication number: WO2022177045A1
Application number: PCT/KR2021/002630
Authority: WO
Inventors: 장혁재; 홍영택; 장영걸; 이지나; 맹신희
Original assignee: 연세대학교 산학협력단; 주식회사 온택트헬스
Priority date: 2021-02-18
Filing date: 2021-03-03
Publication date: 2022-08-25
Also published as: KR102352859B1; KR20220118302A

Abstract

Provided is a heart disease classification device, comprising: a model generator for generating a CNN-based heart disease classification model comprising a convolution block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer, receives input data based on a phonocardiogram of a preset length and classifies a heart disease; an input data generator for generating the input data from the phonocardiogram through a Mel spectrum-based feature extraction technique; and a heart disease classifier for inputting the input data to the heart disease classification model and classifying the heart disease.

Description

Apparatus and method for classifying the presence or absence of heart disease

The present invention relates to an apparatus and method for classifying the presence or absence of a heart disease.

Heart sound, which can be easily heard through a stethoscope, is the first basic test method performed in diagnosing heart disease. A phonocardiogram (PCG) is a recording of heart sounds during several cardiac cycles through an electronic stethoscope, and is used for visualization of auscultation and for diagnosing heart disease.

Referring to FIG. 6 , the heart sound diagram is divided into four states: S1, Systole, S2, and Diastole, and the four states are sequentially connected to form one cycle. As shown in (a) of FIG. 6 , there is little noise in the signal in a normal heart tone, but in the heart tone of a patient with heart disease as in FIG. 6 (b), a lot of noise is mixed throughout the signal.

In this way, if the heart sound chart is used, it is possible to analyze the abnormality of the sound of the valve opening and closing by the heartbeat, thereby enabling early diagnosis of valve-related diseases. Although it has the advantage of being able to easily analyze heartbeat in a non-invasive method, it is a formal procedure because it requires specialized training to perform accurate analysis and the opinions of different specialists may vary.

Recently, as precision inspection methods such as computed tomography, magnetic resonance imaging, and ultrasound imaging have become the standard, the utilization of PCG is gradually decreasing.

With the recent development of artificial intelligence technology, various techniques for PCG analysis have been proposed. Representatively, there are a method using a one-dimensional convolutional neural network (CNN) and a method using a hidden markov model (HMM) for a heart sound signal. Other methods include analysis using an artificial neural network based on long short-term memory (LSTM), and a 2D CNN method that analyzes the spectrum of a signal and analyzes it in the form of an image.

In this regard, Korean Patent Registration No. 10- 1524226 discloses a method and apparatus for determining heart disease using a neural network.

An object of the present invention is to provide a heart disease classification apparatus and method for classifying the presence or absence of heart disease through a one-dimensional convolutional neural network (CNN)-based heart disease classification model suitable for PCG feature analysis.

Another object of the present invention is to provide a heart disease classification apparatus and method for generating input data by converting a PCG signal into a meaningful phonetic feature based on a Mel-Frequency Cepstral Coefficient (MFCC).

However, the technical problems to be achieved by the present embodiment are not limited to the technical problems described above, and other technical problems may exist.

As a technical means for achieving the above-described technical problem, an embodiment of the present invention includes a convolutional block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer, and a phonocardiogram (PCG) of a preset length. )-based input data and a model generator that generates a convolutional neural network (CNN)-based heart disease classification model that classifies heart disease, and the input from the heart sound diagram through a Mel spectrum-based feature extraction technique It is possible to provide a heart disease classification apparatus including an input data generating unit for generating data and a heart disease classifying unit for classifying the heart disease by inputting the input data into the heart disease classification model.

In addition, another embodiment of the present invention includes a convolutional block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer, and a CNN-based method that receives heart sound-based input data of a preset length and classifies heart diseases. creating a heart disease classification model of A method for classifying heart disease may be provided.

The above-described problem solving means are merely exemplary, and should not be construed as limiting the present invention. In addition to the exemplary embodiments described above, there may be additional embodiments described in the drawings and detailed description.

According to any one of the above-described problem solving means of the present invention, it is possible to significantly reduce computation time and network complexity by modeling signal changes over time using only a one-dimensional CNN without using a Recurrent Neural Network (RNN)-based model. have.

1 is a block diagram of a heart disease classification apparatus according to an embodiment of the present invention.

2 is a block diagram of a heart disease classification apparatus according to an embodiment of the present invention.

3 is a diagram illustrating a network structure of a CNN-based heart disease classification model according to an embodiment of the present invention.

4 is a block diagram of a convolutional block layer according to an embodiment of the present invention.

5 is a flowchart of a heart disease classification method according to an embodiment of the present invention.

6 is a diagram illustrating a normal heart tone and an abnormal heart tone.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily carry out the present invention. However, the present invention may be embodied in several different forms and is not limited to the embodiments described herein. And in order to clearly explain the present invention in the drawings, parts irrelevant to the description are omitted, and similar reference numerals are attached to similar parts throughout the specification.

Throughout the specification, when a part is "connected" with another part, this includes not only the case of being "directly connected" but also the case of being "electrically connected" with another element interposed therebetween. . Also, when a part "includes" a certain component, it means that other components may be further included, rather than excluding other components, unless otherwise stated, and one or more other features However, it is to be understood that the existence or addition of numbers, steps, operations, components, parts, or combinations thereof is not precluded in advance.

In this specification, a "part" includes a unit realized by hardware, a unit realized by software, and a unit realized using both. In addition, one unit may be implemented using two or more hardware, and two or more units may be implemented by one hardware.

Some of the operations or functions described as being performed by the terminal or device in this specification may be instead performed by a server connected to the terminal or device. Similarly, some of the operations or functions described as being performed by the server may also be performed in a terminal or device connected to the corresponding server.

Hereinafter, an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

1 is a block diagram of a heart disease classification apparatus according to an embodiment of the present invention, FIG. 2 is a block diagram of a heart disease classification apparatus according to an embodiment of the present invention, and FIG. 3 is an embodiment of the present invention It is a diagram showing a network structure of a CNN-based heart disease classification model according to the present invention, and FIG. 4 is a configuration diagram of a convolutional block layer according to an embodiment of the present invention.

1 and 2 , the heart disease classification apparatus 1 may include a model generation unit 100 , an input data generation unit 110 , and a heart disease classification unit 120 . The input data generator 110 may include a preprocessor 112 and a feature extractor 114 .

The heart disease classification apparatus 1 may perform the classification 240 after segmenting 220 on the received heart sound signal 210 .

For example, the heart disease classification apparatus 1 may generate a label of the heart sound signal 210 through the segmentation 220 .

Specifically, the heart disease classification apparatus 1 may perform label analysis after downsampling to 1 Khz to generate a label, and then upsampling again to perform learning.

According to the present disclosure, it is possible to improve the performance of the network by encapsulating data based on the label for the segmentation 220 of the heart sound signal. Through this, it is assumed that a higher level of learning is possible because it is possible to check which part of the data was viewed and diagnosed a disease by learning the disease diagnosis network.

Also, the heart disease classification apparatus 1 may remove noise and outliers in the signal through the data preprocessing 230 .

In addition, the heart disease classification apparatus 1 may separate the preprocessed data in units of 3 cycles, and generate input data of the network through MFCC analysis.

An example of the heart disease classification apparatus 1 may include not only a personal computer such as a desktop or a notebook computer, but also a mobile terminal capable of wired/wireless communication. A mobile terminal is a wireless communication device that guarantees portability and mobility, and includes not only smartphones, tablet PCs, and wearable devices, but also Bluetooth (BLE, Bluetooth Low Energy), NFC, RFID, Ultrasonic, infrared, and Wi-Fi ( WiFi) and Li-Fi (LiFi) may include various devices equipped with a communication module. However, the heart disease classification device 1 is not limited to the form shown in FIG. 1 or those exemplified above.

The model generator 100 may receive input data based on heart sound and generate a CNN-based heart disease classification model for classifying heart diseases.

Referring to FIG. 3 , the heart disease classification model 320 may include a convolutional block layer 332 , a one-dimensional pooling layer 334 , a global pooling layer 342 , and a fully connected layer 350 .

For example, the heart disease classification model 320 includes four first sub-layers 330 including a convolutional block layer 332 and a one-dimensional pooling layer 334 , and a convolutional block layer 332 and global pooling. It may include one second sub-layer 340 including a layer 342 . Here, the heart disease classification model 320 may be configured to be connected in the order of the first sub-layer 330 , the second sub-layer 340 , and the fully connected layer 350 .

That is, the heart disease classification model 320 includes a global pooling layer 342 between the last convolutional block layer 332 and the fully connected layer 350 . In this case, the global pooling layer 342 may improve the performance of the network by calculating it according to the actual effective length rather than calculating the average for the entire size.

Referring to FIG. 4 , the convolution block layer 332 may include two sets of one-dimensional convolution operations and activation functions. In this case, the activation function may be a Rectified Linear Unit (ReLU).

Here, the number of filters in the convolutional block layer 332 is doubled starting from 32, so that 512 can be maximally.

The hyperparameters of the heart disease classification model 320 may be depth (the number of layers) 5, the number of filters 32, and the batch size 64.

The model generator 100 may train the heart disease classification model to classify heart diseases by inputting heart sound-based learning data to the heart disease classification model.

The input data generator 110 may generate the input data from the heart sound diagram through a Mel spectrum-based feature extraction technique.

Also, the input data generator 110 may generate input data of a preset length by applying zero padding to the effective length of the input data.

For example, since the heartbeat cycle may be different for each sample, the input data generator 110 applies zero padding according to the maximum size of the cycle of the heart sound according to the reversible heartbeat range of a human so that the length is 720 in all. can do. In this case, the processed input data may be in the form of final 13×720 data.

The global pooling layer of the heart disease classification model may extract an effective feature of the input data by calculating an average of the valid lengths of the input data.

The preprocessor 112 may determine a range of a signal of a heart tone of a preset number of cycles based on a Butterworth filter.

Most of the heart sound information is included in the low frequency band, and the noise increases as it goes up the high frequency band. Accordingly, the preprocessor 112 may determine the signal range by using the Butterworth filter to effectively extract the heart sound.

In addition, the preprocessor 112 may correct noise (bouncing value) in the signal of the heart tone through the spike removal method.

Here, the pre-processor 112 may divide the heart sound chart into three periods of heart tone data that may include sufficient information by dividing the heart sound chart in order to improve accuracy through the data ensemble. Here, for the division of the heart tone, a division technique based on a Hidden Markov Model (HMM) may be used.

The feature extractor 114 may generate input data by extracting a preset number of feature components from the heart sound diagram based on MFCC as a Mel spectrum-based feature extraction technique.

MFCC is a feature extraction technique that reflects the way humans hear sounds, and the feature extraction unit 114 may use 13 Mel coefficient values for heart tone analysis in a frequency band of less than 700 Hz.

That is, in the present application, the signal of the heart sound diagram is converted into meaningful phonetic features by the MFCC method and used as input data.

The heart disease classification unit 120 may classify the heart disease by inputting the input data into the heart disease classification model.

For example, the heart disease classification unit 120 converts the input data to normal, aortic stenosis, mitral regurgitation, aortic regurgitation, and mitral valve stenosis ( Mitral stenosis) and patency of the ductus arteriosus (Patent ductus arteriosus) can be classified as either.

The present applicant conducted an experiment comparing the performance of the Example according to the present application and the comparative example using 3240 heart sound data publicly provided in PhysioNet Challenge 2016.

To measure the performance of the network, the depth of the layer, the number of filters, and the batch size were adjusted and the results were compared. Binary cross entropy was used as a loss function for error update, and Acc (Accuracy), Ppv (Positive Predictive Value), Se (Sensitivity), Sp (Specificity), and MAcc (Modified Accuracy) were used as indicators to evaluate the performance of the model. ) was used. For the same comparison with PhysioNet Challenge 2016, Sensitivity, Specificity, and MAcc were defined as follows.

As a result of the performance comparison, as shown in Table 1 below, it was confirmed that the hyperparameter (4/32/64) in the present application had the highest value in a number of indicators such as accuracy, sensitivity, and specificity. In Table 1, N is the number of layers, F is the number of filters, and B is the batch size.

N/F/BN/F/B	ACCACC	SeSe	Sp Sp	PPVPPV		MAccMAcc
4/16/644/16/64	0.940.94	0.720.72	0.980.98	0.830.83	0.850.85
4/32/644/32/64	0.940.94	0.870.87	0.950.95	0.730.73	0.910.91
5/32/645/32/64	0.950.95	0.870.87	0.970.97	0.820.82	0.920.92

In addition, as shown in Table 2 below, the network of the heart disease classification model of the present application (the present example) showed excellent heart disease diagnosis performance compared to other techniques proposed in the PhysioNet Challenge 2016. Here, Potes' technique was a technique using CNN and Adaboost, one of the ensemble techniques, Zabihi's technique is an ensemble form of several support vector machines, and Kay & agarwal's technique is a technique learned using a regularized neural network. In addition, the regularized neural network is a technique to prevent overfitting by regulating weights and to have characteristics suitable for generalization.

	SeSe	SpSp	MAccMAcc
Potes et al.Potes et al.	0.940.94	0.780.78	0.860.86
Zabihi et al.Zabihi et al.	0.870.87	0.850.85	0.860.86
Kay & AgarwalKay & Agarwal	0.870.87	0.830.83	0.850.85
본원의 실시예Examples of the present application	0.870.87	0.970.97	0.920.92

5 is a flowchart of a heart disease classification method according to an embodiment of the present invention. The heart disease classification method according to the embodiment shown in FIG. 5 includes the steps of time-series processing in the heart disease classification apparatus shown in FIG. 1 . Therefore, even if omitted below, it is also applied to the heart disease classification method performed according to the embodiment shown in FIG. 5 .

In step S500 , the heart disease classification apparatus may receive heart sound-based input data and generate a CNN-based heart disease classification model that classifies heart diseases. Here, the heart disease classification model may include a convolutional block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer.

In step S510, the heart disease classification apparatus may generate input data from the heart sound diagram through a Mel spectrum-based feature extraction technique.

In step S520 , the heart disease classification apparatus may classify the heart disease by inputting the input data into the heart disease classification model.

The heart disease classification method described with reference to FIG. 5 may be implemented in the form of a computer program stored in the medium, or may be implemented in the form of a recording medium including instructions executable by a computer, such as a program module executed by a computer. . Computer-readable media can be any available media that can be accessed by a computer and includes both volatile and nonvolatile media, removable and non-removable media. Also, computer-readable media may include computer storage media. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.

The foregoing description of the present invention is for illustration, and those of ordinary skill in the art to which the present invention pertains can understand that it can be easily modified into other specific forms without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are illustrative in all respects and not restrictive. For example, each component described as a single type may be implemented in a distributed manner, and likewise components described as distributed may also be implemented in a combined form.

The scope of the present invention is indicated by the following claims rather than the above detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalents should be interpreted as being included in the scope of the present invention. do.

Claims

In the device for classifying the presence or absence of heart disease,

Convolutional Neural Network (CNN) that includes a convolutional block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer, receives input data based on a phonocardiogram (PCG) of a preset length and classifies heart diseases a model generating unit that generates a heart disease classification model based on;

an input data generator generating the input data from the heart sound diagram through a Mel spectrum-based feature extraction technique; and

A heart disease classification unit for classifying the heart disease by inputting the input data into the heart disease classification model

That comprising a, heart disease classification device.
The method of claim 1,

The heart disease classification model includes four first sub-layers including the convolutional block layer and the one-dimensional pooling layer, and one second sub-layer including the convolutional block layer and the global pooling layer. Phosphorus, heart disease classification device.
3. The method of claim 2,

The heart disease classification model is configured to be connected in order of the first sub-layer, the second sub-layer, and the fully connected layer.
The method of claim 1,

The convolution block layer includes two sets of one-dimensional convolution operations and activation functions.
5. The method of claim 4,

The activation function is relu (Rectified Linear Unit, ReLU) will, heart disease classification device.
3. The method of claim 2,

The input data generation unit,

a pre-processing unit for determining a range of a signal of the heart tone with a predetermined number of cycles based on a Butterworth filter; and

A feature extraction unit that generates the input data by extracting a preset number of feature components from the heart sound diagram based on a Mel-Frequency Cepstral Coefficient (MFCC) as the Mel spectrum-based feature extraction technique

That comprising a, heart disease classification device.
7. The method of claim 6,

The input data generating unit applies zero padding to an effective length of the input data to generate the input data of a preset length.
8. The method of claim 7,

The global pooling layer calculates an average of the effective lengths of the input data to extract effective features of the input data.
In the method of classifying the presence or absence of heart disease,

Generating a CNN-based heart disease classification model that includes a convolutional block layer, a one-dimensional pooling layer, a global pooling layer, and a fully connected layer, and receives input data based on a heart sound of a preset length and classifies heart diseases ;

generating the input data from the heart sound diagram through a Mel spectrum-based feature extraction technique; and

classifying the heart disease by inputting the input data into the heart disease classification model

That comprising a, heart disease classification method.
10. The method of claim 9,

The heart disease classification model includes four first sub-layers including the convolutional block layer and the one-dimensional pooling layer, and one second sub-layer including the convolutional block layer and the global pooling layer. Phosphorus and heart disease classification method.
11. The method of claim 10,

Wherein the heart disease classification model is configured to be connected in the order of the first sub-layer, the second sub-layer and the fully connected layer.
10. The method of claim 9,

Wherein the convolution block layer includes two sets of one-dimensional convolution operation and an activation function.
13. The method of claim 12,

Wherein the activity function is relu, heart disease classification method.
11. The method of claim 10,

The step of generating the input data includes:

determining a range of the signal of the heart tone of a preset number of cycles based on a Butterworth filter; and

As the Mel spectrum-based feature extraction technique, extracting a preset number of feature components from the heart sound diagram based on MFCC to generate the input data.
15. The method of claim 14,

The step of generating the input data includes:

The method further comprising generating the input data of a preset length by applying zero padding to the effective length of the input data.
16. The method of claim 15,

The global pooling layer is to calculate an average of the effective length of the input data, heart disease classification method.