CN113160119A

CN113160119A - Diabetic retinopathy image classification method based on deep learning

Info

Publication number: CN113160119A
Application number: CN202110159376.7A
Authority: CN
Inventors: 邱士军; 谭欣; 梁艺; 覃春红; 李伊凡
Original assignee: First Affiliated Hospital of Guangzhou University of Chinese Medicine
Current assignee: First Affiliated Hospital of Guangzhou University of Chinese Medicine
Priority date: 2021-02-04
Filing date: 2021-02-04
Publication date: 2021-07-23

Abstract

The invention relates to a diabetic retinopathy image classification method based on deep learning, which improves the quality of an input network image after carrying out data set preprocessing operations such as normalization, image preprocessing, data set expansion and the like on a data set, and trains the processed data set by utilizing a deep learning neural network model to generate a diabetic retinopathy diagnosis model. Compared with other methods, the network model has higher recognition rate and proper algorithm complexity, can reduce the phenomenon of misdiagnosis caused by human factors, can greatly shorten the time for diagnosing the diabetic retinopathy, and has important significance for early prevention and treatment of patients with the diabetes mellitus.

Description

Diabetic retinopathy image classification method based on deep learning

Technical Field

The invention relates to the technical field of deep learning, in particular to a diabetic retinopathy image classification method based on deep learning.

Background

Diabetic Retinopathy (DR), also known as diabetes mellitus, is an ocular manifestation of diabetes mellitus and is a leading cause of visual impairment and blindness in the world today. Researches prove that hyperglycemia, hypertension, hyperlipidemia and diabetes are important risk factors for diabetic retinopathy. According to WHO data statistics, the number of Chinese diabetics reaches 1.1 hundred million, and the Chinese diabetics are the countries with the largest number of diabetics. The prevalence of diabetic retinopathy in the diabetic population is 65%, which means that 6700 million people may suffer from diabetes mellitus in China. If the diabetes mellitus is not treated in time, the vision of the patient is sharply reduced after the diabetes mellitus develops to the later stage, and finally the patient is blinded, so that the normal life of the patient is seriously influenced. According to studies, the longer the period of diabetes, the higher the incidence of diabetic retinopathy, but the 90% lower the incidence of vision loss or blindness in patients with diabetes mellitus if effective early diagnosis and treatment is performed.

The traditional method for clinically diagnosing diabetic retinopathy is to perform comprehensive ophthalmologic detection including visual sensitivity, slit lamp, mydriasis and the like, and has a complex detection process and needs to consume certain manpower and material resources. Since the diabetic retinopathy has various focus types, if the fundus images can be automatically classified in the early screening process, a large amount of diagnosis time can be saved for clinicians, so that the research of realizing the diagnosis of the diabetes retinopathy by means of a machine learning and deep learning method is paid attention to by people.

At present, the precision of classifying the fundus images of patients by using a machine learning method is high, but most of machine learning algorithms at present need high-quality images as data sets, and classification and labeling of the fundus images by experienced ophthalmologists are needed when image data are acquired, so that a large amount of preparation work is still needed in the early stage of model building. Meanwhile, the method for diagnosing diabetic retinopathy by using deep learning is also rapidly developed, and most of the existing work is to indirectly detect the pathological changes by segmenting blood vessels and optic discs in fundus pictures. With the continuous increase of the scale and the depth of the deep learning network model, the accuracy of the deep learning network model in an image recognition task is rapidly improved, but the small medical image data set is the main reason for poor recognition effect of the deep learning method in the field.

At present, the detection of diabetic retinopathy mainly depends on the manual judgment of doctors, the work is time-consuming and labor-consuming, and the requirements on the related experience of the doctors are high. In the face of a large number of diabetics, the detection efficiency is urgently needed to be greatly improved, and the detection time is shortened.

Disclosure of Invention

The technical problem to be solved by the present invention is to provide a diabetic retinopathy image classification method based on deep learning, aiming at the defects of the prior art.

The technical scheme adopted by the invention for solving the technical problems is as follows: a diabetic retinopathy image classification method based on deep learning is constructed, and comprises the following steps:

acquiring diabetic retinopathy image data, preprocessing the image, and dividing a training set and a test set;

combining a ResNet neural network model and a DenseNet neural network model, constructing a deep neural network model, inputting diabetic retinopathy image data serving as a training set into the deep neural network model for training, adjusting network parameters and functions until an output result is an accurate classification result, inputting the diabetic retinopathy image data serving as a test set into the trained deep neural network model after training is finished, and verifying the accuracy of the deep neural network model;

and inputting the data of the diabetic retinopathy image shot in real time into the trained deep neural network model, and classifying the input diabetic retinopathy image by using the output result through calculation of the network model.

In the step of collecting the diabetic retinopathy image data, extracting high-resolution color images of two eyes of a plurality of diabetic patients, dividing the images into five grades of normal, mild, moderate, severe and proliferative diabetic retinopathy according to the degree of the retinopathy, and marking the grades by 0-4, wherein 0 corresponds to a normal health state, and 4 is the most serious state.

The image preprocessing mode at least comprises the operations of image denoising, image filtering, feature enhancement, normalization and data amplification on the marked high-resolution color image data of the two eyes of the diabetic patient.

In the step of carrying out image filtering on the high-resolution color image data of both eyes of the diabetic patient, the characteristics of the eyeground blood spots in the diabetic retinopathy image are enhanced in an edge detection algorithm mode, so that the accuracy of model identification is improved.

Wherein, in the step of performing feature enhancement on the high-resolution color image data of both eyes of the diabetic patient, the method comprises the following steps:

subtracting a Gaussian blur (GaussianBlur) image from an original image of a high-resolution color image of both eyes of a diabetic patient to obtain the difference between the two images;

and obtaining the final characteristic-enhanced diabetic patient binocular high-resolution color image by using GaussianBlur algorithm.

Wherein in the step of normalizing the diabetic patient binocular high resolution color image data, the data enhanced diabetic patient binocular high resolution color image is fixed to a resolution of 512x 512.

In the step of data amplification of the high-resolution color image data of both eyes of the diabetic patient, the images are zoomed, rotated, overturned and changed in brightness on the basis of not changing the disease grade of the original data set image, so that the quantity of the images in the diabetic retinopathy image data sets of all the disease grades is equal.

The deep neural network model for image classification is combined with the characteristics of a ResNet neural network model and a DenseNet neural network model, and comprises 5 blocks including a Dense Block representing a Dense Block and a Resnet Block representing a residual Block, and a transition layer is arranged between each Block for convolution nuclear pooling; the model input is pictures with the resolution of 512x512, and the pictures are classified through a Softmax classifier;

inputting a training set into a model, performing convolution operation on a picture, wherein the size of a convolution kernel is 7x7, the step length is 2, the length and the width of an output after the convolution operation are both 256, and the length and the width of an output feature map after a pooling layer are 128x128 after the maximum pooling operation; after convolutional encoding, the sample image is input into a dense block followed by a conversion layer for performing convolution kernel average pooling, and after passing through the dense block and the conversion layer, the length and width of the model output picture are both 16x 16; and further adopting a method of adding a residual block, wherein after the residual block, the output dimensionality of the model is 8x8x1024, performing average pooling on the output, then connecting the output with a layer of 1024 neurons, and finally classifying through a sofmax classifier to output a lesion classification result of the diabetic retinopathy image.

The classification effect of various identification networks is objectively evaluated by adopting two indexes of model accuracy and space complexity:

the indexes for evaluating the classification effect of the model have accuracy and average accuracy, and the calculation formulas are respectively as follows:

wherein m is_correctRepresents the number of correctly classified samples, m_totalRepresenting the number of samples tested in total; n is the number of model classification categories, Accuracy_iRepresenting the recognition accuracy of the ith category;

the quality of the model is evaluated by the spatial complexity of the model, the spatial complexity of the model determines the number of parameters of the model, the more the model parameters are, the larger the data size required by the training model is, and when the training data size is not enough to match the model parameters, the overfitting problem of the training model can be caused; the model space complexity is composed of two parts, namely the total parameter number of the model and the space occupation of each layer of output characteristic diagram, and the calculation formula is as follows:

d, K, M respectively represents the convolution layer number, the characteristic diagram size and the convolution kernel size of the convolution neural network, n represents the convolution layer label of the convolution neural network, C represents the output channel number of the convolution layer and is equal to the output channel number of the last convolution layer, the first term in the formula represents the total parameter number of the neural network model, and the second term represents the space occupation of the output characteristic diagram.

The invention provides a diabetic retinopathy image classification method based on deep learning, which improves the quality of an input network image after carrying out data set preprocessing operations such as normalization, image preprocessing, data set expansion and the like on a data set, and trains the processed data set by using a deep learning neural network model to generate a diabetic retinopathy diagnosis model. Compared with other methods, the network model has higher recognition rate and proper algorithm complexity, can reduce the phenomenon of misdiagnosis caused by human factors, can greatly shorten the time for diagnosing the diabetic retinopathy, and has important significance for early prevention and treatment of patients with the diabetes mellitus.

Drawings

The invention will be further described with reference to the accompanying drawings and examples, in which:

FIG. 1 is a schematic flow chart of a method for classifying images of diabetic retinopathy based on deep learning according to the present invention.

FIG. 2 is a schematic diagram of an architecture of a deep neural network model in the method for classifying diabetic retinopathy image based on deep learning according to the present invention.

Detailed Description

For a more clear understanding of the technical features, objects and effects of the present invention, embodiments of the present invention will now be described in detail with reference to the accompanying drawings.

As shown in fig. 1, the present invention provides a method for classifying diabetic retinopathy image based on deep learning, comprising:

In the step of collecting the diabetic retinopathy image data, the high-resolution color images of the two eyes of a plurality of diabetic patients are extracted, and the fundus image resolutions of the left eye and the fundus image resolution of the right eye are the same. The retinopathy is divided into five grades of normal, mild, moderate, severe and proliferative type diabetic retinopathy according to the degree of the retinopathy, and the pathological changes are mainly characterized by microaneurysms, hard exudation, cotton wool spots, bleeding spots and the like. And is labeled with a rating of 0-4, where 0 corresponds to a normal health state and 4 is the most severe state.

However, training tests show that the edge detection algorithm recognizes dust or other optical problems on the lens as blood spots, and the healthy original image is processed to recognize the proliferative diabetic retinopathy. Therefore, in order to improve the accuracy of model identification, it is necessary to avoid using the image subjected to the filter processing.

Bleeding points and hard exudate characteristics in the image become clear and visible, microaneurysm characteristics are amplified, and useful characteristics can be maximally learned by the classification model so as to help the model obtain a better training result.

The data set is analyzed, and images from the training and testing data sets are found to have different resolutions, aspect ratios, colors, and various cropping modes, and some data images have very poor quality and are out of focus. Training the data set image by using the deep convolutional neural network requires a fixed input dimension, so that the size of the fundus image needs to be adjusted to enable all data sets to have a fixed resolution. If the resolution of the image is too low, the detailed features of the fundus image are lost, and if the resolution is too high, the training cost of the model is increased, and the image is fixed to the resolution of 512x512 through experiments, so that the detailed features of the fundus image can be ensured on one hand, and the training speed of the model can be increased on the other hand.

In the step of data amplification of the high-resolution color image data of both eyes of the diabetic patient, the images are zoomed, rotated, overturned and changed in brightness on the basis of not changing the disease grade of the original data set image, so that the quantity of the images in the diabetic retinopathy image data sets of all the disease grades is equal. After data amplification, the influence on the categories with a large number of samples is small, and the categories with a small number are expanded and filled up to the categories with a large number.

As shown in fig. 2, the deep neural network model for image classification combines the characteristics of the ResNet neural network model and the DenseNet neural network model, and includes 5 blocks including a density Block representing a Dense Block and a ResNet Block representing a residual Block, and a transition layer is arranged between each Block for convolution kernel pooling; the model input is pictures with the resolution of 512x512, and the pictures are classified through a Softmax classifier;

The network model structure is shown in table 1:

TABLE 1 network model architecture

While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims

1. A diabetic retinopathy image classification method based on deep learning is characterized by comprising the following steps:

2. The deep learning-based diabetic retinopathy image classification method according to claim 1, wherein in the step of collecting diabetic retinopathy image data, binocular high resolution color images of a plurality of diabetic patients are extracted, classified into five grades of normal, mild, moderate, severe and proliferative diabetic retinopathy according to the degree of retinopathy, and the grades are identified by 0 to 4, wherein 0 corresponds to a normal healthy state and 4 is the most severe state.

3. The deep learning-based diabetic retinopathy image classification method according to claim 2, wherein the image preprocessing at least comprises the operations of image denoising, image filtering, feature enhancement, normalization and data amplification of the labeled diabetic patient binocular high-resolution color image data.

4. The diabetic retinopathy image classification method based on deep learning of claim 3, wherein in the step of image filtering the high-resolution color image data of both eyes of the diabetic patient, the characteristics of the fundus blood spots in the diabetic retinopathy image are enhanced by adopting an edge detection algorithm so as to improve the accuracy of model identification.

5. The diabetic retinopathy image classification method based on deep learning of claim 3, wherein the step of performing feature enhancement on the high-resolution color image data of both eyes of the diabetic patient comprises the steps of:

6. The deep learning-based diabetic retinopathy image classification method according to claim 5, wherein in the step of normalizing the diabetic patient's binocular high-resolution color image data, the data-enhanced diabetic patient's binocular high-resolution color image is fixed to a resolution of 512x 512.

7. The method for classifying diabetic retinopathy based on deep learning of claim 3, wherein in the step of data expansion of the diabetic patient's binocular high resolution color image data, the operations of scaling, rotating, flipping and changing brightness are performed on the images without changing the disease level of the original data set image, so that the number of images in the diabetic retinopathy image data sets of each disease level is equal.

8. The deep learning-based diabetic retinopathy image classification method according to claim 1, characterized in that the deep neural network model for image classification combines the characteristics of the ResNet neural network model and the DenseNet neural network model, and comprises 5 blocks including density Block representing Dense blocks and ResNet Block representing residual blocks, and a transition layer is arranged between each Block for convolution nuclear pooling; the model input is pictures with the resolution of 512x512, and the pictures are classified through a Softmax classifier;

inputting a training set into a model, performing convolution operation on a picture, wherein the size of a convolution kernel is 7x7, the step length is 2, the length and the width of an output after the convolution operation are both 256, and the length and the width of an output feature map after a pooling layer are 128x128 after the maximum pooling operation; after convolutional encoding, the sample image is input into a dense block followed by a conversion layer for performing convolution kernel average pooling, and after passing through the dense block and the conversion layer, the length and width of the model output picture are both 16x 16; further adopting a method of adding a residual block, after the residual block, the output dimension of the model is 8x8x1024, performing average pooling on the output, then connecting the output with a layer of 1024 neurons, and finally passing through sofmax

And classifying by the classifier, and outputting a lesion classification result of the diabetic retinopathy image.

9. The diabetic retinopathy image classification method based on deep learning of claim 1 is characterized in that two indexes of model accuracy and spatial complexity are adopted to objectively evaluate the classification effect of various identification networks: