CN110084813B

CN110084813B - Lung nodule benign and malignant prediction method based on three-dimensional deep learning network

Info

Publication number: CN110084813B
Application number: CN201910392053.5A
Authority: CN
Inventors: 董恩清; 熊文硕; 纪惠中; 金叶; 倪天骄; 薛鹏; 韩贺
Original assignee: Shandong Zhiwei Zhicheng Electronic Technology Co ltd
Current assignee: Shandong Zhiwei Zhicheng Electronic Technology Co ltd
Priority date: 2019-05-13
Filing date: 2019-05-13
Publication date: 2022-05-13
Anticipated expiration: 2039-05-13
Also published as: CN110084813A

Abstract

A lung nodule benign and malignant prediction method based on a three-dimensional deep learning network belongs to the field of image processing. The method comprises the following steps: lung CT image preprocessing and generation of multi-resolution input data; constructing a multi-resolution 3D dual-channel compression excitation deep learning network model; and (5) iteratively solving the model parameters by using a gradient descent method. The invention adopts a deep learning network model combining the ideas of a dual-channel network and a compression excitation network, can not only repeatedly utilize the low-order characteristics of the lung nodule image and continuously generate new high-order combined characteristics, but also can recalibrate the weight of the characteristic channel and effectively describe the importance degree of different characteristic channels to network output. The 3D multi-resolution data processing method adopted by the invention can effectively solve the problems of boundary loss, noise and the like caused by the inconsistent diameters of lung nodules.

Description

Lung nodule benign and malignant prediction method based on three-dimensional deep learning network

Technical Field

The invention belongs to the field of medical image processing, and particularly relates to lung nodule benign and malignant prediction based on a multi-resolution 3D dual-channel compression excitation network.

Background

The accurate diagnosis and prediction of benign and malignant lung nodules is an important research direction for accurate medical treatment, and has very important significance in lung CT image analysis and practical clinical application. The traditional method for diagnosing the benign and malignant pulmonary nodules through manual interpretation not only excessively depends on the diagnosis level of doctors and has strong subjectivity, but also brings huge workload. The deep learning method can deeply mine complex implicit representation of image data, and can further analyze and process deeper features, so that classification or prediction is more intelligent. Deep learning has many representative algorithms: the Dual Path Network (DPN) is proposed by the pigment group, which merges core ideas of Deep Residual Networks (reset) and dense Connected Networks (densneet) of two symbolic Deep learning Networks, and the densneet can extract new features from previous levels, while the rent is essentially a reuse of extracted features in previous levels. The compression-and-Excitation Networks (features) is a new network structure proposed by the kingdom team of Momenta corporation, which starts from another point of view, considers explicit modeling of the relationship between feature channels in the network, analyzes the dependency relationship and interaction between feature channels emphatically, adopts a "feature recalibration" mode to automatically learn the importance degree of each feature channel in the current task, and appropriately strengthens or inhibits the influence of the current channel on the final result according to the degree value. However, these methods are not fully applied to the field of 3D medical image processing, which is limited to the problems of data set and complexity, and these problems are also the key problems to be faced and urgently solved in the current research.

Disclosure of Invention

The method aims at the problems that the traditional manual method for diagnosing benign and malignant pulmonary nodules excessively depends on the diagnosis level of doctors, has strong subjectivity, brings huge workload and the like, and the traditional algorithm cannot effectively process the pulmonary nodules with different morphological changes and the like.

The invention provides a lung nodule benign and malignant prediction method based on a three-dimensional deep learning network, which comprises the following steps of:

s1, carrying out gray level reconstruction and layer thickness reconstruction on the lung CT image;

s2, extracting and storing the 3D lung nodule data according to the size of the center O and the diameter D of the lung nodule area, and screening out images with boundaries exceeding the image range or marked with non-standard images;

s3, generating data of three corresponding resolutions through a B spline interpolation algorithm;

s4, standardizing and sample expanding the data in the step S3 to be used as training network input;

s5, constructing a multi-resolution 3D dual-channel compression excitation network;

and S6, training and iteratively solving the data in the step S4 corresponding to the input network by adopting a random gradient descent method.

And S7, performing classification accuracy verification.

The invention has the beneficial effects that: a doctor for diagnosing the benign and malignant lung nodules only needs to confirm the center and the size position of the lung nodules in the lung CT image and can obtain a prediction result of the network model after inputting the lung nodules into the network. Compared with the manual diagnosis, the method has higher real-time performance, accuracy and robustness, can be well suitable for CT images with different specifications and standards, and has good expandability.

Drawings

Fig. 1 is a schematic diagram of a multi-resolution 3D dual-channel compressed excitation network.

Fig. 2 is a schematic flow chart of lung CT image preprocessing.

Fig. 3 is a schematic diagram of multi-resolution 3D data processing.

Fig. 4 is a schematic structural diagram of a two-path compression excitation unit (DPSE unit).

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings and specific embodiments. The invention discloses a lung nodule benign and malignant prediction method based on a three-dimensional deep learning network, which mainly comprises the following steps:

read-in DICOM format lung CT image training sample T₁And test specimen T₂And converted into a uint16 type, and its gray scale is reconstructed to [ -1000,400 ]]HU (Hounsfield Unit), the layer thickness is reconstructed to 1mm using nearest neighbor interpolation, as shown in FIG. 2.

S2, extracting and storing the 3D lung nodule data according to the size of the center O and the diameter D of the lung nodule area, and screening out images with boundaries exceeding the image range or labeling irregular images.

Artificially determining training sample T₁And test specimen T₂And D is equal to or larger than 3mm and equal to or smaller than 14mm, and the corresponding pixel extraction window is 20 x 20. 14mm<D is less than or equal to 21mm, the resolution is medium, and the corresponding pixel extraction window is 30 × 30. 21mm<D is less than or equal to 30mm, the resolution is high, and the corresponding pixel extraction window is 45 x 45. Lung nodules with diameters outside the three grades were all screened out.

centering on O, training sample T₁And test specimen T₂And 3D image extraction is carried out on the center of the lung nodule according to the pixel extraction window, and 3D images with the other two resolutions are generated through B-spline interpolation, as shown in figure 3, and are correspondingly stored.

the stored data were normalized to 0-1. Will T₁The data in the method adopts X-axis, Y-axis and Z-axis rotation and 2-pixel random translation to perform data expansion so as to solve the problem of sample imbalance, and the rotation angle is [90 degrees, 180 degrees and 270 degrees °]Three special angles. The augmented 3D data is retained as training network input.

and constructing a multi-resolution 3D dual-channel compression excitation network according to the architecture of the whole network in FIG. 4 and the dual-channel compression excitation (DPSE) unit in FIG. 1. In fig. 1, GN is group normalization, leakyreu is leakrelu activation function, Conv denotes convolution operation, ReLU is ReLU activation function, and Sigmoid is Sigmoid activation function. X1_lAnd X2_lNetwork inputs of a residual connecting path and a dense connecting path of the l layer are respectively shown, wherein the basic channel of the dense connecting path is 32, the increment is 16, and the dimensionality reduction ratio in the compression excitation module is set to be 4. In fig. 4, 64-3 × 3 above the arrow indicates that the current layer is convolved with 64 convolution kernels having dimensions of 3 × 3, 10 × 10@ (256+64) below the feature cube indicates that the current layer output is 10 × 10, the number of feature channels of the residual connecting channels is 256, the number of channels of the dense connecting channels is 64, and the rest is similar. Maxpooling3D indicates maximum pooling of 3D data, and Softmax is the Softmax classifier. The whole network adopts three resolution networks, the network input pixel dimensions are respectively a low resolution network 20 × 20 × 20, a middle resolution network 30 × 30 × 30 and a high resolution network 45 × 45 × 45, the main body frame of each resolution network is formed by stacking dual-channel compression excitation (DPSE) units in the graph 1, and the whole network construction parameters are given in a table 1.

In FIG. 4, the tail-end classification scaling factor is λ₁＝0.5,λ₂＝0.3,λ₃0.2. Probability P of each tail end class₁、P₂、P₃Given by:

wherein N is given a correspondence of input sample and true tag value { (D)₁,y₁),K,(D_N,y_N)}，D_jFor the jth input sample, y_jIs D_jThe corresponding real label is marked with a real label,

then the result of the network prediction, k is the number of classes, η_iIs a parameter of the i-th resolution network. Has calculated P_iThen, at the tail end, by a scaling factor lambda₁、λ₂、λ₃Performing probability fusion to obtain fusion probability P_fAs follows:

TABLE 1 detailed structure of multi-resolution 3D dual-channel compression excitation network

In table 1, C represents a convolutional layer, DPSE-C represents a convolutional layer in a dual-channel compression excitation module, FC represents a full link layer, the former of the summation terms of the channel numbers is the channel number of residual link channels, and the latter is the channel number of dense link channels.

Step 3, using a Stochastic Gradient Descent (SGD) method to reduce T₁Is trained in response to the input networkAnd iteratively solving. The network parameter initialization is performed by using He, the learning rate is set to be 0.005, the learning rate is reduced by 30% after each 300 iteration, the size of the network batch (batch size) is 8, and the loss function is selected from cross entropy loss functions. And judging whether the algorithm is converged according to the accuracy rate convergence trend and the number of iteration rounds, and if not, storing the network parameters until the algorithm is converged.

And S7, performing classification accuracy verification.

After the network is converged, the stored parameters are loaded into the network and T is input₂And (5) carrying out classification accuracy verification on the data.

Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention and not for limiting, and although the present invention is described in detail with reference to the preferred embodiments, the protection scope of the present invention is not limited thereto, and any modifications or equivalent substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention disclosed herein are all covered within the protection scope of the present invention without departing from the spirit and scope of the technical solutions of the present invention.

Claims

1. A lung nodule benign and malignant prediction method based on a three-dimensional deep learning network is characterized by comprising the following steps:

s6, training and iteratively solving the data in the step S4 corresponding to the input network by adopting a random gradient descent method;

s7, verifying the classification accuracy;

multi-resolution 3D dual channel pressureThe reduced excitation network is constructed by stacking two-channel compressed excitation units, the structure and parameters of the reduced excitation network are shown in table 1, wherein the basic channels of the dense connecting channels are 32, the basic channels are increased by 16, the dimensionality reduction ratio in the compressed excitation module is set to be 4, and the tail end classification proportionality coefficient is lambda₁＝0.5,λ₂＝0.3,λ₃0.2, probability P of each tail end class₁、P₂、P₃Given by:

wherein N is given a correspondence of input sample and true tag value { (D)₁,y₁),...,(D_N,y_N)}，D_jFor the jth input sample, y_jIs D_jThe corresponding real label is marked with a real label,

then the result of the network prediction, k is the number of classes, η_iFor the parameter of the ith resolution network, the respective prediction probabilities P of the three resolution networks are calculated_iThen, at the tail end, by a scaling factor lambda₁、λ₂、λ₃Performing probability fusion to obtain fusion probability P_fAs follows:

2. The method for predicting benign and malignant lung nodules according to claim 1, wherein said step S1 comprises using a read-in DICOM-format lung CT image training sample T₁And test specimen T₂And converted into a uint16 type, and its gray scale is reconstructed to [ -1000,400 ]]HU, layer thickness was reconstructed to 1mm using nearest neighbor interpolation.

3. The method for predicting benign and malignant pulmonary nodules according to claim 1, wherein said step S2 comprises manually determining a training sample T₁And test specimen T₂The center O and the diameter D of the middle lung nodule are large, the lung nodule is divided into three resolutions according to the diameter D, if the D is more than or equal to 3mm and less than or equal to 14mm, the resolution is low, and the corresponding pixel extraction window is 20 × 20; d is more than 14mm and less than or equal to 21mm, the resolution is medium, and the corresponding pixel extraction window is 30 × 30; d is more than 21mm and less than or equal to 30mm, the high resolution is obtained, and the corresponding pixel extraction window is 45 × 45.

4. The method for predicting lung nodule malignancy and well, according to claim 1, wherein the step S3 is to train a sample T around O₁And test specimen T₂And 3D image extraction is carried out on the center of the lung nodule according to the pixel extraction window, and 3D images with the other two resolutions are generated through B spline interpolation and are correspondingly stored to form input data.

5. The method for predicting lung nodule malignancy and well, according to claim 1, wherein in step S4, the input data is normalized by 0-1 to obtain a training sample T₁The data in the system are subjected to data expansion by adopting X-axis, Y-axis and Z-axis rotation and 2-pixel random translation, and the rotation angle is [90 degrees, 180 degrees and 270 degrees °]Three special angles.

6. The method for predicting the benign and malignant pulmonary nodule of claim 1, wherein in S6, He is used for initializing network parameters, the learning rate is set to 0.005, 30% reduction is performed every 300 iterations, the size of the network batch batchsize is 8, the loss function is a cross entropy loss function, whether the algorithm converges or not is judged according to the convergence trend of the accuracy rate and the number of iterations, and if not, the network parameters are stored until the convergence.