CN110689089A

CN110689089A - Active incremental training method for deep learning of multi-class medical image classification

Info

Publication number: CN110689089A
Application number: CN201910968185.8A
Authority: CN
Inventors: 段贵多; 黄添喜; 刘江明
Original assignee: Guangdong Institute Of Electronic And Information Engineering University Of Electronic Science And Technology Of China; University of Electronic Science and Technology of China
Current assignee: Guangdong Institute Of Electronic And Information Engineering University Of Electronic Science And Technology Of China; University of Electronic Science and Technology of China
Priority date: 2019-10-12
Filing date: 2019-10-12
Publication date: 2020-01-14

Abstract

The invention discloses an active increment training method for deep learning of multi-class medical image classification, which comprises the following steps: 1. performing primary data cleaning and preprocessing on the medical image data set; 2. randomly selecting initial data, and performing initial training on the network model; 3. testing the rest samples in the data set to obtain the correspondence between the predicted score and the lesion category; 4. performing cross expansion on the residual samples in the data set, and actively screening candidate samples; 5. performing further data set cleaning; 6. performing incremental training on the model; 7. and (5) testing the model after the incremental training, finishing the training if the accuracy rate is stable, and otherwise, repeating the steps from 4 to 7. The invention improves the AIFT (active enhancement learning method), and solves the problems of difficult medical image classification, low training efficiency and the like caused by data imbalance; the problem of poor application effect of deep learning in the field of pathological change classification is solved, and the auxiliary effect of diagnosing the state of an illness of a doctor is improved.

Description

Active incremental training method for deep learning of multi-class medical image classification

Technical Field

The invention relates to a medical image classification training method, in particular to an active increment training method for deep learning of multi-class medical image classification.

Background

With the emergence of new imaging technology and equipment in medicine and the development of computer technology, the medical image processing technology has increasingly influenced medical research and clinical practice, and has received high attention from scholars at home and abroad. In recent years, with the rise of Deep learning (also called Feature learning) methods, Feature learning has received a great deal of attention in the field of machine learning research. Deep learning utilizes a deep neural network to automatically learn effective feature representation in a data-driven mode, rapidly subverts a research framework based on artificial features in a plurality of machine learning related application fields, and becomes a new research paradigm. High-tech companies such as google, microsoft, IBM, hundredth and the like, which have big data, successively invest a large amount of resources to carry out the research and development of deep learning technology, and remarkable progress is made in the fields of voice, image, natural language, online advertisement and the like.

In recent years, an increasing patient population has led researchers to realize an efficient, high-quality and certain interpretable system for automatically classifying disease patterns, and rapid innovation and development of computer performance and deep learning techniques have provided the possibility for researchers to realize the system. However, in the medical field, the images themselves have strong specialization and specificity, and the task of classifying medical images through a computer also becomes a challenge.

Compared with other natural image data sets, the medical image data set has small labeled data amount and extremely unbalanced data among categories, the training of the model by using the data set is time-consuming and labor-consuming work, the result of the model has serious tendency, and serious misdiagnosis results can be caused in the classification task of the medical image. The active incremental learning method can effectively relieve the adverse effect of low data set quality on model performance, so that the active incremental learning method is more and more widely applied to the field of medical image classification.

Currently, active incremental learning and deep learning are combined as an emerging research direction to study a few. Wang and Shang et al, who are perhaps the earliest researchers combining active learning with deep learning, are based on a stack limit line Boltzmann machine and a stack auto-encoder. Stack et al apply active learning to improve the performance of convolutional neural networks for CAPTCHA recognition. Yang et al propose an active learning framework that reduces the annotation effort by judiciously suggesting the most efficient segmentation of the annotation region based on uncertainty and similarity information provided by a fully convolutional network. Their computational methods are very expensive because they require a set of models to be trained from scratch by the bootstrapping algorithm to compute their uncertainty measures from the divergence of these models, and the maximum set coverage problem must be solved to determine the most representative regions.

The difference between the AIFT algorithm (active-innovative fine-tuning) proposed by Zhou et al is that the above method is repeatedly retrained from scratch in each step, and the AIFT algorithm continuously fine-tunes (fine-tune) the convolutional neural network model in an incremental manner, thereby significantly improving the model training efficiency and reducing the labeling cost. And has high requirements on equipment, but some incremental training methods cannot achieve good effect in a glycogenopathy classification model due to the particularity of medical images such as fundus image data sets.

Disclosure of Invention

In order to solve the above problems, an object of the present invention is to provide an active incremental training method for deep learning multi-class medical image classification, which is easy to classify medical images and has high training efficiency.

The invention realizes the purpose through the following technical scheme:

an active incremental training method for deep learning multi-category medical image classification, comprising the following steps:

step 1, carrying out primary data cleaning and preprocessing on a medical image data set;

step 2, randomly selecting initial data, and performing initial training on the network model;

step 3, testing the rest samples in the data set on the model trained in the step 2 to obtain a prediction probability, and obtaining a prediction score of the sample by using a ranking decoding method on the output probability to realize the correspondence between the prediction score and the lesion category;

step 4, performing cross expansion on the residual samples in the data set, and actively screening candidate samples according to the test result in the step 3;

step 5, setting K values according to data proportions among different categories and carrying out further data set cleaning on the candidate samples obtained by screening in the step 4;

step 6, performing incremental training on the model, and adding a distillation item in a loss function of the incremental training;

and 7, testing the model after the incremental training, wherein the testing method is the same as that in the step 3, testing the accuracy of the model, finishing the training if the accuracy is stable, and otherwise, repeating the steps 4 to 7.

Preferably, the step 1 comprises the following steps:

step 1.1, data cleaning: in the medical image data set, low-quality samples which influence model training due to illumination, misoperation and the like in the data set are removed, and the number of samples in the fundus image data set before and after cleaning is as follows:

	number of original samples	Number of samples after washing	Post-cleaning data ratio
				Level 0	65741	63367	72.61％
Level 1	6618	6206	7.11％
				Stage 2	14846	13155	15.07％
Grade 3	2301	2087	2.39％
				4 stage	2177	1914	2.21％
Total of	91653	87269	100％

；

Step 1.2, image preprocessing: in medical image data set, in order to reduce the influence of different factors such as illumination, a bottom camera and the like on the image during data acquisition, green channel extraction and contrast enhancement operations are sequentially carried out on the image.

Preferably, the step 1.2 comprises the following steps:

step 1.2.1, green channel extraction: extracting a green channel from the color medical image, wherein the medical image is composed of red, green and blue channels, and the green channel contains more information;

step 1.2.2, contrast enhancement is carried out, average brightness is kept, and an image with better quality can be generated;

step 1.2.3, central region extraction: the black background of the image is removed, and the central part of the image is extracted.

Preferably, the step 2 comprises the following steps:

2.1, randomly selecting initial data, selecting a certain number of samples as an initial training set of the model according to the original proportion of samples among the categories in the data set, randomly selecting 5000 samples in the medical image data set, and respectively selecting 3600, 350, 750, 150 and 150 samples in 5 categories;

step 2.2, performing initial training on the model: and taking 5000 samples as a training set, and carrying out initial training on the network model.

Preferably, the step 3 comprises the following steps:

step 3.1, testing the model, testing samples which do not participate in training, wherein each sample can obtain a 5-dimensional probability vector P:

P＝[P₀，P₁，P₂，P₃，P₄]；

and 3.2, decoding the probability vector to obtain a prediction score S, and setting a ranking decoding method in lesion classification as follows:

S＝0.1*P₀+0.15*P₁+0.2*P₂+0.25*P₃+0.3*P₄；

step 3.3, obtaining a prediction score: based on the data ratio of the data set, S is converted into a continuous lesion level prediction result with a decimal between 0 and 4 in the lesion classification prediction, the smallest 72% of samples in S are predicted to be 0 to 0.4, the next 7% are 0.5 to 1.4, 15% are 1.5 to 2.4, 3% are 2.5 to 3.4, and the remaining 3% are 3.5 to 4.

Preferably, the step 4 comprises the following steps:

step 4.1, cross expansion of data: performing cross expansion on data in the data set, obtaining 5 amplification samples at five positions, namely the upper position, the lower position, the left position, the right position and the middle position of the medical image, turning the image, and performing the same operation to obtain 10 samples in total;

step 4.2, calculating the error between the true label of the candidate sample and the prediction result: obtaining an error between a prediction result and a real label according to the real label of the medical image and a sample prediction score S obtained by previous calculation:

wherein y is a real label of the sample, i is a sample serial number, and j is an amplified sample serial number of the same sample;

4.3, calculating the diversity of a group of expansion samples obtained after the cross expansion, and calculating the prediction probability P obtained through a classification network to obtain:

wherein k is the class in which the sample is predicted;

and 4.4, calculating the value of the sample, and calculating the value p of the sample through the error and diversity of a group of expansion samples obtained before:

and 4.5, screening a certain number of candidate samples as a training set of incremental training, and selecting n expansion sample groups with the highest value as the training set according to p obtained by calculating each group of expansion samples.

Preferably, the step 5 comprises the following steps:

step 5.1, further cleaning noise data of the selected training set, calculating the mean value and the variance of the prediction score S of each group of samples, and rejecting samples which are more than or less than 1 standard deviation of the mean value as noise samples;

step 5.2, selecting the number of the expansion samples according to the K value, setting the K value according to the number of each type of samples, wherein the K value is the maximum number selected in each group of expansion samples, if the number of the remaining samples after cleaning in the step 5.1 is less than the K value, all the samples are selected, otherwise, the K samples are randomly selected, and the K value in the medical image data set for lesion classification is set as shown in the following table:

categories	Total number of data	Data ratio	Value of K
				Grade 0 (No DR)	63367	72.61％	3
Grade 1 (mild NPDR)	6206	7.11％	7
				Grade 2 (moderate NPDR)	13155	15.07％	4
Grade 3 (Severe NPDR)	2087	2.39％	10
				Grade 4 (PDR)	1914	2.21％	10
Total of	87269	100％	-

。

Preferably, the step 6 comprises the following steps:

6.1, during incremental training, adding distillation terms to the model loss function, wherein the distillation terms are the loss functions of the models trained for the last two times, so as to reduce possible catastrophic forgetting of the models after multiple incremental training, wherein lambda is₁Is set to 0.4, lambda₂Set to 0.1:

loss_fine-tune＝loss_new+λ₁loss₁+λ₂loss₂

therein, loss_fine-tuneIs a loss function of incremental training, loss_newIs the loss function of origin, loss₁Is the loss function in the last training, loss₂Is the loss function during the last training;

and 6.2, performing incremental training on the model by taking the data finally obtained in the step 5.2 as a training set of incremental training and taking the loss function obtained in the step 6.1 as a loss function of model training.

Preferably, the step 7 comprises the following steps:

7.1, testing the accuracy of the model result after the incremental training;

7.2, if the difference of the prediction accuracy of the model in the last 3 times is less than 0.5%, determining that the model is stable, and ending the model training;

and 7.3, if the accuracy of the model is not stable, repeating the steps 4 to 7 until the model is stable.

The invention has the beneficial effects that:

aiming at various characteristics of medical images, the invention improves the AIFT (active learning enhancement method), uses a multi-task model to replace the traditional classification network model, and aims to solve the problems of difficult classification of the medical images, low training efficiency and the like caused by unbalanced data; by extracting the global features and the local focus information of the medical image respectively, the information in the image with serious lesion degree can be fully utilized, and the problem of poor application effect of deep learning in the field of lesion classification caused by insufficient and unbalanced data volume is solved; in the process of extracting the local focus information, the invention outputs the labeling result of the local focus information in the medical image, displays 4 key focuses influencing the lesion classification result for a doctor, solves the problem of low interpretable degree of the deep learning model result, and improves the auxiliary effect on the doctor to diagnose the disease condition.

Drawings

FIG. 1 is a general flow diagram of the active incremental training method for deep learning multi-class medical image classification according to the present invention;

FIG. 2 is a flowchart of step 1 of the active incremental training method for deep learning multi-class medical image classification according to the present invention;

fig. 3 is an expanded fundus image set obtained by cross expansion in step 4.1 of the active incremental training method for deep learning multi-class medical image classification according to the embodiment of the present invention.

Detailed Description

The invention is further illustrated by the following examples and figures:

example (b):

the present invention will be described specifically below, taking an active increment training method for classification of a diabetic fundus image as an example; diabetes mellitus is short for diabetic retinopathy, is an eye complication caused by diabetes mellitus, and is also an eye disease with high blindness rate.

As shown in fig. 1, the active incremental training method for deep learning multi-class medical image classification according to the present invention includes the following seven steps:

step 1, carrying out primary data cleaning and pretreatment on an eye fundus image data set; as shown in fig. 2, this step includes the following steps:

step 1.1, data cleaning: in the fundus image dataset, removing low-quality samples which can influence model training in the dataset;

step 1.2, image preprocessing: in the fundus image data set, sequentially carrying out green channel extraction and contrast enhancement on the image; the method comprises the following steps:

step 1.2.1, green channel extraction: extracting a green channel from the color fundus image;

step 1.2.2, contrast enhancement is carried out, and average brightness is kept;

Step 2, randomly selecting initial data, and performing initial training on the network model; the method comprises the following steps:

2.1, randomly selecting initial data, selecting a certain number of samples as an initial training set of the model according to the original proportion of the samples among the categories in the data set, randomly selecting 5000 samples in the fundus image data set, and respectively selecting 3600, 350, 750, 150 and 150 samples in 5 categories;

Step 3, testing other samples in the data set on the model trained in the step 2 to obtain a prediction probability, and obtaining a prediction score of the sample by using a ranking decoding method on the output probability to realize the correspondence between the prediction score and the diabetes mellitus lesion category; the method comprises the following steps:

P＝[P₀，P₁，P₂，P₃，P₄]；

and 3.2, decoding the probability vector to obtain a prediction score S, wherein in the classification of the diabetes mellitus lesion, the set ranking decoding method comprises the following steps:

S＝0.1*P₀+0.15*P₁+0.2*P₂+0.25*P₃+0.3*P₄；

step 3.3, obtaining a prediction score: based on the data ratio of the data set, S is converted into a continuous lesion level prediction result, which is a decimal between 0 and 4 in the classification prediction of diabetes mellitus lesions, the smallest 72% of samples in S are predicted to be 0 to 0.4, the next 7% is 0.5 to 1.4, 15% is 1.5 to 2.4, 3% is 2.5 to 3.4, and the remaining 3% is 3.5 to 4.

Step 4, performing cross expansion on the residual samples in the data set, and actively screening candidate samples according to the test result in the step 3; the method comprises the following steps:

step 4.1, cross expansion of data: performing cross expansion on data in the data set, obtaining 5 amplified samples at five positions, namely the upper position, the lower position, the left position, the right position and the middle position of the fundus image, turning the image, and performing the same operation to obtain 10 samples in total;

step 4.2, calculating the error between the true label of the candidate sample and the prediction result: obtaining an error between a prediction result and a real label according to the real label of the fundus image and a sample prediction score S obtained by calculation:

wherein k is the class in which the sample is predicted;

Step 5, setting K values according to data proportions among different categories and carrying out further data set cleaning on the candidate samples obtained by screening in the step 4; the method comprises the following steps:

step 5.2, selecting the number of the expansion samples according to the K value, setting the K value according to the number of each type of samples, wherein the K value is the maximum number selected in each group of expansion samples, if the number of the residual samples after cleaning in the step 5.1 is less than the K value, all the expansion samples are selected, otherwise, the K samples are randomly selected, and the K value in the fundus image data set classified by the glycogenopathy lesion is set as shown in the following table:

。

Step 6, performing incremental training on the model, and adding a distillation item in a loss function of the incremental training; the method comprises the following steps:

loss_fine-tune＝loss_new+λ₁loss₁+λ₂loss₂

Step 7, testing the model after the incremental training, wherein the testing method is the same as that in the step 3, testing the accuracy of the model, ending the training if the accuracy is stable, and otherwise, repeating the steps 4 to 7; the method comprises the following steps:

7.1, testing the accuracy of the model result after the incremental training;

The embodiment provides an improved AIFT algorithm based on a multi-task network aiming at the characteristics of the classification task of the diabetes mellitus lesion and the data balance problem of a data set, and according to experimental detection, the improved AIFT method can effectively reduce the time required by training, improves the training efficiency, and is more suitable for the classification task of the diabetes mellitus lesion than the AIFT method; and the multi-task network slightly improves the performance of the diabetes mellitus lesion classification task due to the addition of the regression task in the classification network, so that the network model designed in this chapter can obtain a better effect in a shorter training time. However, for the classification task of diabetic retinopathy with extremely high requirements on accuracy and sensitivity, the effect of the multitask model trained and completed by using the improved AIFT algorithm still cannot completely meet the requirement.

The above embodiments are only preferred embodiments of the present invention, and are not intended to limit the technical solutions of the present invention, so long as the technical solutions can be realized on the basis of the above embodiments without creative efforts, which should be considered to fall within the protection scope of the patent of the present invention.

Claims

1. An active incremental training method for deep learning multi-category medical image classification is characterized in that: the method comprises the following steps:

2. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 1 comprises the following steps:

step 1.1, data cleaning: in the medical image data set, low-quality samples which can influence model training in the data set are removed;

step 1.2, image preprocessing: in the medical image data set, the green channel extraction and contrast enhancement operations are sequentially performed on the image.

3. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 2, wherein: the step 1.2 comprises the following steps:

step 1.2.1, green channel extraction: extracting a green channel from the color medical image;

4. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 2 comprises the following steps:

5. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 3 comprises the following steps:

P＝[P₀，P₁，P₂，P₃，P₄]；

S＝0.1*P₀+0.15*P₁+0.2*P₂+0.25*P₃+0.3*P₄；

6. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 4 comprises the following steps:

wherein k is the class in which the sample is predicted;

7. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 5 comprises the following steps:

。

8. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 7, wherein: the step 6 comprises the following steps:

loss_fine-tune＝loss_new+λ₁loss₁+λ₂loss₂

9. The active incremental training method for deep learning multi-class medical image classification as claimed in claim 1, wherein: the step 7 comprises the following steps:

7.1, testing the accuracy of the model result after the incremental training;