CN109726743A - A kind of retina OCT image classification method based on Three dimensional convolution neural network - Google Patents

A kind of retina OCT image classification method based on Three dimensional convolution neural network Download PDF

Info

Publication number
CN109726743A
CN109726743A CN201811516615.4A CN201811516615A CN109726743A CN 109726743 A CN109726743 A CN 109726743A CN 201811516615 A CN201811516615 A CN 201811516615A CN 109726743 A CN109726743 A CN 109726743A
Authority
CN
China
Prior art keywords
training
layer
convolutional layer
model
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811516615.4A
Other languages
Chinese (zh)
Other versions
CN109726743B (en
Inventor
陈新建
冯爽朗
朱伟芳
赵鹤鸣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou University
Original Assignee
Suzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou University filed Critical Suzhou University
Priority to CN201811516615.4A priority Critical patent/CN109726743B/en
Publication of CN109726743A publication Critical patent/CN109726743A/en
Application granted granted Critical
Publication of CN109726743B publication Critical patent/CN109726743B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The retina OCT image classification method based on Three dimensional convolution neural network that the invention discloses a kind of, comprising the following steps: acquired image is simultaneously divided into training set and test set by (a) acquisition image, is pre-processed to image;(b) VinceptionC3D network structure is built, VinceptionC3D network is the improvement based on C3D convolutional neural networks, the Vinception module of fusion multi-channel feature is added on the basis of C3D network, and batch standardized method is applied in original C3D network, (c) training and test of model: using the pre-training model of C3D as the pre-training model of VinceptionC3D, trained VinceptionC3D model is obtained using the network after the data training load pre-training model in training set, after model training, with test set test model.The present invention can carry out whole classification to three-dimensional retina OCT image, improve efficiency and lay the foundation with analysis for the segmentation of subsequent retina OCT image.

Description

A kind of retina OCT image classification method based on Three dimensional convolution neural network
Technical field
The invention belongs to retinal images classification methods, and in particular to a kind of retina based on Three dimensional convolution neural network OCT image classification method.
Background technique
Retina is by the outwardly extending optic nerve distal tissues of brain, and its structure is complicated, fine, fragile and be metabolized prosperous It contains.Its blood vessel belongs to terminal vascular system, and histanoxia caused by any pathologic destruction and angiemphraxis etc. can cause Tissue necrosis loses the function of its impression and conduction light stimulus.
Spectral domain optical coherence tomography (OCT) is to detect light through eye subject refractive media up to retina, passes through acquisition Thickness and range information that intraocular different tissues interface reflection provides simultaneously are reduced into image and data to work, it has also become The indispensable clinical examination technology of ophthalmology, can provide three-dimensional retinal images.There are three types of adopt common retina OCT image Integrated mode: macula lutea center, disk center, and the big visual field including the first two region.In order to realize it is high performance from Whether normally dynamicization computer-aided diagnosis system is acquired region and identification to retina OCT image first and divides Class be it is very necessary, the segmentation of subsequent retina OCT image and classification can be made more efficient.
Currently, existing retina OCT image sorting algorithm is classified based on two dimensional image, such as The OCT image classification method based on HOG operator and support vector machines that Srinivasan et al. is proposed, can be used for age related The classification of macular degeneration and diabetic macular edema lesion retina;Venhuizen et al. propose based on machine learning Age-related macular degeneration lesion automatic grading system;The diabetic keratopathy view based on transfer learning that Chakraborty et al. is proposed The swollen automatic classification with Local Electroretinogram of film water, but these algorithms cannot make full use of three-dimensional space feature to believe Breath, single image are easily affected by noise, and lead to classification error.The existing Three dimensional convolution image applied to medical image Sorter network has parallel supplement to be hoisted, between shortage characteristic pattern in classification accuracy, and class categories are relatively single, special It is not normal/improper for macula lutea center of classifying simultaneously, the three kinds of scan patterns progress of disk center and the big visual field Classification work have not been reported.
Convolutional neural networks (Convolutional Neural Networks, CNN) are a kind of comprising convolution or correlation The feedforward neural network for calculating and having depth structure, is one of representative algorithm of deep learning.Three dimensional convolution neural network is For carrying out video Activity recognition, medical domain often uses Three dimensional convolution neural network, existing Three dimensional convolution network knot Structure as shown in figure 4, have 8 3D convolutional layers, 5 pond 3D layers, 2 full articulamentums, behind connect a softmax output layer, The step-length of all 3D convolution kernels is all 1, and the size of convolution kernel is 3 × 3 × 3.All maximum pond layers are in addition to first pond The core for changing layer is except 1 × 2 × 2, and the core of other pond layers is 2 × 2 × 2.
Summary of the invention
A kind of retina based on Three dimensional convolution neural network in order to overcome the deficiencies in the prior art, the present invention provides OCT image classification method.
In order to solve the above-mentioned technical problem, the present invention adopts the following technical scheme:
A kind of retina OCT image classification method based on Three dimensional convolution neural network, comprising the following steps: (a) acquires image And acquired image is divided into training set, verifying collection and test set, image is pre-processed;(b) it builds VinceptionC3D network structure, VinceptionC3D network is the improvement based on C3D convolutional neural networks, in C3D network On the basis of be added to the Vinception module of fusion multi-channel feature, and a batch standardized method has been applied to original In C3D network, (c) training and test of model: using the pre-training model of C3D as the pre-training mould of VinceptionC3D Type obtains trained VinceptionC3D model using the network after the data training load pre-training model in training set, After model training, with test set test model.
Preferably, including to macula lutea center, disk center, the big visual field to the pretreatment of image in step (a) Retina three-dimensional OCT image carries out down-sampling and subtracts average value processing, carries out Random Level overturning to data and carrys out EDS extended data set.
Preferably, in step (b) in VinceptionC3D network structure successively include 3D convolutional layer 1a, pond layer 1, 3D convolutional layer 2a, pond layer 2,3D convolutional layer 3a, 3D convolutional layer 3b, pond layer 3, Vinception module 1, pond layer 4, Vinception module 2, pond layer 5, full articulamentum 6, full articulamentum 7 and softmax output layer.
Preferably, being activated behind each convolutional layer with Relu function, Relu activation front is also all added to Normalization layer is criticized, for fixing the mean value and variance of every layer of input signal.
Preferably, Vinception module includes the convolutional layer c1, a convolutional layer c2 of two superpositions, a convolutional layer C3 and concat layers, convolutional layer c1, convolutional layer c2, convolutional layer c3 are set side by side, convolutional layer c1, convolutional layer c2, convolutional layer c3 It is connect afterwards with concat layers, convolutional layer c4 is connected with after concat layers.
Preferably, two Vinception modules receive one layer of characteristic pattern transmitted, by Vinception module Multiscale Fusion after, export the characteristic pattern of semantic relative abundance, then pond is followed by next layer.
Preferably, the convolution kernel of convolutional layer c1 is 3 × 3 × 3, the convolution kernel of convolutional layer c2 is 3 × 3 × 3, convolutional layer c3 Convolution kernel be 1 × 1 × 1, the convolution kernel of convolutional layer c4 is 1 × 1 × 1.
Preferably, the training and test of step (c) model method particularly includes: load is in video image first The pre-training model trained on Sports-1M data set uses Adam optimization algorithm, setting on the basis of pre-training model The initial learning rate of characteristic extraction part, the initial learning rate of output layer, minimum batch, highest the number of iterations, by the figure of training set As being sequentially inputted to be trained in model, after the every iteration n times of training set, is verified with verifying collection, obtained after the completion of training Final training pattern is simultaneously tested with test set;Cross validation is rolled over to assess training pattern using K on entire data set Performance.
Preferably, the initial learning rate of characteristic extraction part is 1e-4 in step (c), the initial learning rate of output layer is 1e-3, minimum batch are set as 30, and the number of iterations is 6000 times, every iteration 10 times, primary with verifying collection verifying, when test The performance of training pattern is assessed on entire data set using 3 folding cross validations.
The beneficial effects of the present invention are:
1, the big visual field, disk center, three kinds of macula lutea center retina OCT scan are directed to present invention firstly provides a kind of Normal/improper three-dimensional automatic classification method --- the VinceptionC3D neural network of image, can be to three-dimensional retina OCT Image carries out whole classification, improves efficiency and lays the foundation with analysis for the segmentation of subsequent retina OCT image.
2, the network architecture of a multiple dimensioned three-dimensional feature figure of fusion is proposed in the present invention, and it is named as On the one hand the local sparsity structure of optimization can be carried out approximate, another aspect with approximate intensive ingredient by Vinception module Each layer of convolution nucleus number can be increased, this makes the width for increasing network, but does not consume computing resource excessively again, simultaneously The multiple dimensioned feature of this network integration, allows next layer network to possess the abstract characteristics of different scale.
3,1 × 1 × 1 Three dimensional convolution layer with across channel interaction and information integration is placed on by Vinception module Concat layers below, not only can use the pre-training model Sports-1M trained on the video images, but also can merge more The feature in channel reduces number of parameters, prevents over-fitting occur during training and test.
4, there are 4 3D convolutional layers in overall model structure of the invention, 5 pond 3D layers connect below in 4 3D convolution It is 2 Vinception modules, the two Vinception modules receive one layer of characteristic pattern transmitted, by Vinception After the Multiscale Fusion of layer, the characteristic pattern of semantic relative abundance is exported, then pond is followed by next layer, and pond is in order to obtain more Wide receptive field, Vinception module are put behind, and can be merged multiple dimensioned three-dimensional feature figure high-rise in this way, be avoided More image detail features are lost in high level.Vinception module is only applied in high level, and parameter can be prevented excessive.
5, in the present invention, activation primitive uses Relu function, behind each convolutional layer, before Relu active coating all It is added to batch normalization layer, to fix the mean value and variance of every layer of input signal, so that training data and test data meet together Distribution, overcomes that network convergence rate is slow and gradient disperse problem.
Detailed description of the invention
Fig. 1 is the schematic network structure of VinceptionC3D in the present invention;
Fig. 2 is the schematic network structure of Vinception module in the present invention;
Fig. 3 is the position view for the three-dimensional retinal images classified in the present invention;
Fig. 4 is the schematic network structure of Three dimensional convolution neural network in the prior art.
Specific embodiment
Present invention will be further described below with reference to the accompanying drawings and specific embodiments:
A kind of retina OCT image classification method based on Three dimensional convolution neural network, comprising the following steps: (a) acquires image And acquired image is divided into training set, verifying collection and test set, image is pre-processed;(b) it builds VinceptionC3D network structure, VinceptionC3D network is the improvement based on C3D convolutional neural networks, in C3D network On the basis of be added to the Vinception module of fusion multi-channel feature, and a batch standardized method has been applied to original In C3D network, (c) training and test of model: using the pre-training model of C3D as the pre-training mould of VinceptionC3D Type obtains trained VinceptionC3D model using the network after the data training load pre-training model in training set, After model training, with the excellent performance to evaluate the invention of test set test model.
It include to macula lutea center, disk center, big visual field retina three-dimensional to the pretreatment of image in step (a) OCT image carries out down-sampling and subtracts average value processing, carries out Random Level overturning to data and carrys out EDS extended data set.
It successively include 3D convolutional layer 1a, pond layer 1,3D convolutional layer in VinceptionC3D network structure in step (b) 2a, pond layer 2,3D convolutional layer 3a, 3D convolutional layer 3b, pond layer 3, Vinception module 1, pond layer 4, Vinception mould Block 2, pond layer 5, full articulamentum 6, full articulamentum 7 and softmax output layer.
It is activated behind each convolutional layer with Relu function, Relu activation front is also all added to batch standardization Layer, for fixing the mean value and variance of every layer of input signal.
Vinception module include two superposition convolutional layer c1, a convolutional layer c2, a convolutional layer c3 and Concat layers, convolutional layer c1, convolutional layer c2, convolutional layer c3 are set side by side, after convolutional layer c1, convolutional layer c2, convolutional layer c3 with Concat layers connect, and are connected with convolutional layer c4 after concat layers.
Two Vinception modules receive one layer of characteristic pattern transmitted, and multiple dimensioned by Vinception module is melted After conjunction, the characteristic pattern of semantic relative abundance is exported, then pond is followed by next layer.
The convolution kernel of convolutional layer c1 is 3 × 3 × 3, and the convolution kernel of convolutional layer c2 is 3 × 3 × 3, the convolution kernel of convolutional layer c3 It is 1 × 1 × 1, the convolution kernel of convolutional layer c4 is 1 × 1 × 1.
The training of step (c) model and test method particularly includes: load is in video image Sports-1M data set first On the pre-training model trained, Adam optimization algorithm is used on the basis of pre-training model, at the beginning of characteristic extraction part is set The image of training set is sequentially inputted to mould by beginning learning rate, the initial learning rate of output layer, minimum batch, highest the number of iterations Be trained in type, after every iteration n times, with verifying collection verified, training after the completion of obtain final training pattern and with survey Examination collection is tested;Cross validation is rolled over to assess the performance of training pattern using K on entire data set.
In step (c), the initial learning rate of characteristic extraction part is 1e-4, and the initial learning rate of output layer is 1e-3, minimum Batch is set as 30, and the number of iterations is 6000 times, every iteration 10 times, primary with verifying collection verifying, in entire data set when test The upper performance that training pattern is assessed using 3 folding cross validations.
Embodiment 1
(1) acquisition and pretreatment of image:
According to the needs of clinical ophthalmology, retina OCT image often has 3 kinds of scan patterns: macula lutea center, disk center with And the big visual field, as shown in figure 3, (a) is retina entirety OCT image, the big view of visual representation on the color picture image of eyeground in figure Wild region, disk area, the position of macula lutea central area correspond respectively to (b) in figure, (c), (d).It can will adopt The three-dimensional retina OCT image collected is divided into 6 classes: improper big field-of-view image (ANW), normal big field-of-view image (NW), just Normal macula lutea center image (ANM), normal macula image (NM), improper disk images (ANO), normal optic nerve cream Head image (NO).
Used in this example from 873 of 671 subjects three-dimensional retina OCT images as data set, for training with Assessment, wherein improper big field-of-view image (ANW) 24, normal big field-of-view image (NW) 37, improper macula lutea center image (ANM) 427, normal macula center image (NM) 131, improper disk center image (ANO) 121, normally Disk center image (NO) 133.Since the three-dimensional retina OCT image of acquisition has different resolution ratio, have Different scan settings and resolution ratio.For uniform size size, and the calculating cost of learning tasks is reduced, all three-dimensional OCT Image is all down sampled to 96 × 96 × 16 using the method for bilinear interpolation, then carries out subtracting average value processing to improve difference degree.
In order to equilibrium data and prevent over-fitting, pair unless 5 class little data of normal macula center image are carried out with 30% The mode of probability level overturning realize data extending.
(2) VinceptionC3D network structure is built:
The model of the present embodiment is to carry out on Three dimensional convolution neural network C3D model improved, and Fig. 4 illustrates original C3D volumes Product neural network structure.The network is the Three dimensional convolution neural network of a progress video Activity recognition, has 8 3D convolution Layer, 5 pond 3D layers, 2 full articulamentums, behind connect a softmax output layer, the step-length of all 3D convolution kernels is all 1, The size of convolution kernel is 3 × 3 × 3.All maximum pond layers in addition to the core of first pond layer be 1 × 2 × 2 other than, other The core of pond layer is 2 × 2 × 2, in order to prematurely merge depth information, but also the length of available 16 frame so as to It prepares for subsequent down-sampling.
The network architecture of a multiple dimensioned three-dimensional feature figure of fusion is proposed in the present embodiment, as shown in Figure 1, and ordering it Entitled Vinception module, this module have used for reference the wind of inception module in two dimensional image sorter network GoogLeNet Lattice.Inception module can the local sparsity structure of optimization with approximate intensive ingredient come approximate, on the other hand can increase Add each layer of convolution nucleus number, this makes the width for increasing network, but does not consume computing resource excessively again, while this net Network has merged multiple dimensioned feature, and next layer network is allowed to possess the abstract characteristics of different scale.The present embodiment is proposed Vinception module effect it is as follows:
(1) Vinception module uses Three dimensional convolution, as shown in Fig. 2, its interlaminar action is respectively: two 3 × 3 × 3 volumes Lamination superposition, is equivalent to one 5 × 5 × 5 convolutional layer, available wider array of receptive field, independent one 3 × 3 × 3 convolution The available relatively small receptive field of layer, similarly 1 × 1 × 1 convolutional layer obtains smaller receptive field, different semantic features Parallel Fusion can increase the understanding to Feature Semantics, so that the Feature Semantics finally exported are more abundant, add again after concat One 1 × 1 × 1 convolutional layer is added, such benefit is can to increase non-linear layer, so that network expresses more samples, together When reduce number of parameters.In this example, the step-length of above-mentioned each convolutional layer is 1 × 1 × 1.
It (2) is that Vinception module will have and interact across channel and information is whole with original inception module difference 1 × 1 × 1 Three dimensional convolution layer of cooperation be placed on concat layers below, both can use trained on the video images it is pre- Training pattern Sports-1M, and the feature of multichannel can be merged, number of parameters is reduced, is prevented in training and the process tested In there is over-fitting.
Fig. 1 illustrates the overall model structure of the present embodiment, which has 4 3D convolutional layers, the volume of this 4 3D convolutional layers Product core is 3 × 3 × 3,5 pond 3D layers, and the core of first pond layer is 1 × 2 × 2, and the core of remaining pond layer is all 2 × 2 × 2, the step-length of first pond layer is 1 × 2 × 2, and the step-length of remaining pond layer is all 2 × 2 × 2;It is connect behind 4 3D convolution Be 2 described Vinception modules above, the two Vinception modules receive one layer of characteristic pattern transmitted, pass through After crossing Vinception layers of Multiscale Fusion, the characteristic pattern of semantic relative abundance is exported, then pond is followed by next layer, Chi Hua It is that wider array of receptive field, Vinception module put behind and be in order to obtain, multiple dimensioned three can be merged high-rise in this way Dimensional feature figure avoids and loses more image detail features in high level.Parameter is excessive in order to prevent simultaneously, only in high level Using.
In the present embodiment, activation primitive uses Relu function, behind each convolutional layer, before Relu active coating all It is added to batch normalization layer, to fix the mean value and variance of every layer of input signal, so that training data and test data meet together Distribution, overcomes that network convergence rate is slow and gradient disperse problem.The class categories number that full articulamentum exports below is 6, net The three-dimensional OCT retinal images size of network input is 96 × 96 × 16, and the content of arrow meaning is to pass through each Three dimensional convolution layer Later and the 3-D image size of Chi Huahou.
(3) training and test of model:
The performance of the present embodiment is assessed using 3 folding cross validations on entire data set.Load is in video image first The pre-training model trained on Sports-1M data set uses Adam optimization algorithm, feature on the basis of pre-training model Extracting the initial learning rate in part is 1e-4, and the initial learning rate of output layer is 1e-3, and minimum batch is set as 30, at 3 Iteration 6000 times on Tesla K40m GPU, every iteration 10 times verifyings are primary, obtain final training pattern and carry out test set Test.
(4) experimental result:
Initial three-dimensional convolutional neural networks are assessed respectively and compared to the present embodiment using 3 folding cross validations in entire data set VinceptionC3D network proposed in C3D and the present embodiment.The corresponding confusion matrix of experimental result as shown in table 1, table 2, TransferC3D indicates that initial three-dimensional convolutional neural networks C3D, VinceptionC3D indicate the present embodiment.It can be seen that original Three dimensional convolution neural network C3D classification accuracy is 92.09%, and after improvement namely the classification accuracy of the present embodiment is reachable 94.04%。
Table 1, table 2 illustrate the confusion matrix of 6 classification (ANW, NW, ANM, NM, ANO, NO) of both models. Classification accuracy of all categories in the present embodiment in addition to a kind of classification be equal to improve before accuracy rate, other classifications are better than Classification accuracy before improvement.In our data set, ANO OCT image is mainly from glaucoma patient.In being with ONH The retinopathy image and normal picture of the heart are much like, this can lead to wrong classification rate relatively high between ANO and NO, phase Instead, other retinas are easier to be distinguished from each other.Obscuring for ANO and NO contributes to a biggish share to whole error rate.It is comprehensive From the point of view of closing, the present embodiment can carry out the classification of three-dimensional retina OCT image in this task, can regard to individual three-dimensional in test Nethike embrane OCT image makes good judgement, and obtains one better than the good performance before improving, and is more conducive to three-dimensional view The classification of film OCT image.
The confusion matrix of 1 initial three-dimensional convolutional neural networks C3D of table
The confusion matrix of table 2VinceptionC3D
So far, a kind of to be directed to the big visual field, disk center, three kinds of macula lutea center retina OCT scan image is normal/non-just Normal three-dimensional automatic classification method --- VinceptionC3D neural network has been carried out and is verified.It is in an experiment Performance is better than original C3D Three dimensional convolution neural network, and the invention, which can make three-dimensional retina OCT image, more preferably to be judged, from For another aspect, the network structure of the present embodiment is simultaneously uncomplicated, and number of parameters is few, and Vinception module is utilized and makes Network structure more efficiently and robust, facilitates the classification and detection of three-dimensional retina OCT image, substantially increases three-dimensional view The screening efficiency of film OCT image.The present embodiment combines the acquisition and pretreatment, VinceptionC3D network model of image Build with training and test, make the research of subsequent retinal disease, such as layering segmentation, lesion segmentation, registration obtain very greatly It helps.
The technical means disclosed in the embodiments of the present invention is not limited only to technological means disclosed in above embodiment, further includes Technical solution consisting of any combination of the above technical features.It should be pointed out that for those skilled in the art For, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications are also considered as Protection scope of the present invention.

Claims (9)

1. a kind of retina OCT image classification method based on Three dimensional convolution neural network, which is characterized in that including following step Suddenly :(a) acquired image is simultaneously divided into training set, verifying collection and test set by acquisition image, is pre-processed to image;(b) VinceptionC3D network structure is built, VinceptionC3D network is the improvement based on C3D convolutional neural networks, in C3D It is added to the Vinception module of fusion multi-channel feature on the basis of network, and batch standardized method has been applied to original In beginning C3D network, (c) training and test of model: using the pre-training model of C3D as the pre-training of VinceptionC3D Model obtains trained VinceptionC3D mould using the network after the data training load pre-training model in training set Type, after model training, with test set test model.
2. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as described in claim 1 In, in step (a) to the pretreatment of image include to macula lutea center, disk center, the big visual field retina three-dimensional OCT figure As carrying out down-sampling and subtracting average value processing, Random Level overturning is carried out to data and carrys out EDS extended data set.
3. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as described in claim 1 In successively including 3D convolutional layer 1a, pond layer 1,3D convolutional layer 2a, Chi Hua in VinceptionC3D network structure in step (b) Layer 2,3D convolutional layer 3a, 3D convolutional layer 3b, pond layer 3, Vinception module 1, pond layer 4, Vinception module 2, pond Change layer 5, full articulamentum 6, full articulamentum 7 and softmax output layer.
4. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as claimed in claim 3 In being activated behind each convolutional layer with Relu function, Relu activation front is also all added to batch normalization layer, is used to The mean value and variance of fixed every layer of input signal.
5. the retina OCT image classification side based on Three dimensional convolution neural network as described in claims 1 to 4 any one Method, which is characterized in that Vinception module includes the convolutional layer c1, a convolutional layer c2 of two superpositions, a convolutional layer c3 And concat layers, convolutional layer c1, convolutional layer c2, convolutional layer c3 are set side by side, after convolutional layer c1, convolutional layer c2, convolutional layer c3 It is connect with concat layers, convolutional layer c4 is connected with after concat layers.
6. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as claimed in claim 5 In, two Vinception modules receive one layer of characteristic pattern transmitted, after the Multiscale Fusion of Vinception module, The characteristic pattern of semantic relative abundance is exported, then pond is followed by next layer.
7. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as claimed in claim 5 In the convolution kernel of convolutional layer c1 is 3 × 3 × 3, and the convolution kernel of convolutional layer c2 is 3 × 3 × 3, and the convolution kernel of convolutional layer c3 is 1 × 1 The convolution kernel of × 1, convolutional layer c4 are 1 × 1 × 1.
8. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as described in claim 1 In the training and test of, step (c) model method particularly includes: first load is instructed on video image Sports-1M data set The pre-training model practiced, uses Adam optimization algorithm on the basis of pre-training model, and setting characteristic extraction part is initially learned The image of training set is sequentially inputted in model by habit rate, the initial learning rate of output layer, minimum batch, highest the number of iterations It is trained, after every iteration n times, is verified with verifying collection, obtain final training pattern after the completion of training and use test set It is tested;Cross validation is rolled over to assess the performance of training pattern using K on entire data set.
9. the retina OCT image classification method based on Three dimensional convolution neural network, feature exist as claimed in claim 8 In in step (c), the initial learning rate of characteristic extraction part is 1e-4, and the initial learning rate of output layer is 1e-3, minimum batch It is set as 30, the number of iterations is 6000 times, every iteration 10 times, primary with verifying collection verifying, and when test adopts on entire data set The performance of training pattern is assessed with 3 folding cross validations.
CN201811516615.4A 2018-12-12 2018-12-12 Retina OCT image classification method based on three-dimensional convolutional neural network Active CN109726743B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811516615.4A CN109726743B (en) 2018-12-12 2018-12-12 Retina OCT image classification method based on three-dimensional convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811516615.4A CN109726743B (en) 2018-12-12 2018-12-12 Retina OCT image classification method based on three-dimensional convolutional neural network

Publications (2)

Publication Number Publication Date
CN109726743A true CN109726743A (en) 2019-05-07
CN109726743B CN109726743B (en) 2023-03-24

Family

ID=66295279

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811516615.4A Active CN109726743B (en) 2018-12-12 2018-12-12 Retina OCT image classification method based on three-dimensional convolutional neural network

Country Status (1)

Country Link
CN (1) CN109726743B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111583291A (en) * 2020-04-20 2020-08-25 中山大学 Layer segmentation method and system for retina layer and effusion region based on deep learning
CN111598894A (en) * 2020-04-17 2020-08-28 哈尔滨工业大学 Retina blood vessel image segmentation system based on global information convolution neural network
CN111680741A (en) * 2020-06-04 2020-09-18 南京理工大学 Automatic debugging method of computer-aided interferometer based on deep learning
CN111696109A (en) * 2020-05-25 2020-09-22 深圳大学 High-precision layer segmentation method for retina OCT three-dimensional image
CN111796180A (en) * 2020-06-23 2020-10-20 广西电网有限责任公司电力科学研究院 Automatic identification method and device for mechanical fault of high-voltage switch
CN111862009A (en) * 2020-07-02 2020-10-30 清华大学深圳国际研究生院 Classification method of fundus OCT images and computer-readable storage medium
CN112132782A (en) * 2020-08-21 2020-12-25 广东省人民医院 Method and terminal for processing DME typing based on deep neural network
CN112257709A (en) * 2020-10-23 2021-01-22 北京云杉世界信息技术有限公司 Signboard photo auditing method and device, electronic equipment and readable storage medium
CN112990315A (en) * 2021-03-17 2021-06-18 北京大学 3D shape image classification method of equal-variation 3D convolution network based on partial differential operator
CN117710373A (en) * 2024-02-05 2024-03-15 中国科学院宁波材料技术与工程研究所 Anti-interference OCTA retinal vessel extraction method and device

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437092A (en) * 2017-06-28 2017-12-05 苏州比格威医疗科技有限公司 The sorting algorithm of retina OCT image based on Three dimensional convolution neutral net

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107437092A (en) * 2017-06-28 2017-12-05 苏州比格威医疗科技有限公司 The sorting algorithm of retina OCT image based on Three dimensional convolution neutral net

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JO˜AO CARREIRA等: "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset", 《ARXIV》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111598894B (en) * 2020-04-17 2021-02-09 哈尔滨工业大学 Retina blood vessel image segmentation system based on global information convolution neural network
CN111598894A (en) * 2020-04-17 2020-08-28 哈尔滨工业大学 Retina blood vessel image segmentation system based on global information convolution neural network
CN111583291A (en) * 2020-04-20 2020-08-25 中山大学 Layer segmentation method and system for retina layer and effusion region based on deep learning
CN111583291B (en) * 2020-04-20 2023-04-18 中山大学 Layer segmentation method and system for retina layer and effusion region based on deep learning
CN111696109A (en) * 2020-05-25 2020-09-22 深圳大学 High-precision layer segmentation method for retina OCT three-dimensional image
CN111680741A (en) * 2020-06-04 2020-09-18 南京理工大学 Automatic debugging method of computer-aided interferometer based on deep learning
CN111680741B (en) * 2020-06-04 2022-09-20 南京理工大学 Automatic debugging method of computer-aided interferometer based on deep learning
CN111796180A (en) * 2020-06-23 2020-10-20 广西电网有限责任公司电力科学研究院 Automatic identification method and device for mechanical fault of high-voltage switch
CN111862009A (en) * 2020-07-02 2020-10-30 清华大学深圳国际研究生院 Classification method of fundus OCT images and computer-readable storage medium
CN111862009B (en) * 2020-07-02 2024-01-16 清华大学深圳国际研究生院 Classifying method of fundus OCT (optical coherence tomography) images and computer readable storage medium
CN112132782A (en) * 2020-08-21 2020-12-25 广东省人民医院 Method and terminal for processing DME typing based on deep neural network
CN112132782B (en) * 2020-08-21 2023-09-05 广东省人民医院 Method and terminal for processing DME (DME) typing based on deep neural network
CN112257709A (en) * 2020-10-23 2021-01-22 北京云杉世界信息技术有限公司 Signboard photo auditing method and device, electronic equipment and readable storage medium
CN112257709B (en) * 2020-10-23 2024-05-07 北京云杉世界信息技术有限公司 Signboard photo auditing method and device, electronic equipment and readable storage medium
CN112990315A (en) * 2021-03-17 2021-06-18 北京大学 3D shape image classification method of equal-variation 3D convolution network based on partial differential operator
CN112990315B (en) * 2021-03-17 2023-10-20 北京大学 3D shape image classification method of constant-variation 3D convolution network based on partial differential operator
CN117710373A (en) * 2024-02-05 2024-03-15 中国科学院宁波材料技术与工程研究所 Anti-interference OCTA retinal vessel extraction method and device

Also Published As

Publication number Publication date
CN109726743B (en) 2023-03-24

Similar Documents

Publication Publication Date Title
CN109726743A (en) A kind of retina OCT image classification method based on Three dimensional convolution neural network
Rasti et al. Macular OCT classification using a multi-scale convolutional neural network ensemble
CN107423571B (en) Diabetic retinopathy identifying system based on eye fundus image
CN107358605B (en) The deep neural network apparatus and system of diabetic retinopathy for identification
CN107437092B (en) The classification method of retina OCT image based on Three dimensional convolution neural network
CN106682389B (en) A kind of Eye disease for monitoring hypertension initiation is health management system arranged
CN110197493A (en) Eye fundus image blood vessel segmentation method
CN107680683A (en) A kind of AI eye healths appraisal procedure
CN108537801A (en) Based on the retinal angiomatous image partition method for generating confrontation network
CN108765422A (en) A kind of retinal images blood vessel automatic division method
CN109376767A (en) Retina OCT image classification method based on deep learning
Sertkaya et al. Diagnosis of eye retinal diseases based on convolutional neural networks using optical coherence images
CN109509552A (en) A kind of mental disease automatic distinguishing method of the multi-level features fusion based on function connects network
CN106780439B (en) A method of screening eye fundus image
CN114548158B (en) Data processing method for blood sugar prediction
Paul et al. Octx: Ensembled deep learning model to detect retinal disorders
CN110013216A (en) A kind of artificial intelligence cataract analysis system
Jain et al. Rider manta ray foraging optimization-based generative adversarial network and CNN feature for detecting glaucoma
CN115409764A (en) Multi-mode fundus blood vessel segmentation method and device based on domain self-adaptation
CN110503636A (en) Parameter regulation means, lesion prediction technique, parameter adjustment controls and electronic equipment
CN111951950A (en) Three-dimensional data medical classification system, method and device based on deep learning
CN109009074A (en) A kind of sudden cardiac death auxiliary prior-warning device based on deep learning
CN115036019A (en) Physical and mental health state evaluation method based on CNN-XGboost model and EEGECG
Kazi et al. Processing retinal images to discover diseases
Taş et al. Detection of retinal diseases from ophthalmological images based on convolutional neural network architecture.

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant