CN115131280A - Improved YOLO v4 lung nodule detection method fused with attention mechanism - Google Patents
Improved YOLO v4 lung nodule detection method fused with attention mechanism Download PDFInfo
- Publication number
- CN115131280A CN115131280A CN202210321194.XA CN202210321194A CN115131280A CN 115131280 A CN115131280 A CN 115131280A CN 202210321194 A CN202210321194 A CN 202210321194A CN 115131280 A CN115131280 A CN 115131280A
- Authority
- CN
- China
- Prior art keywords
- improved
- attention mechanism
- lung
- lung nodule
- yolo
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06T7/0012—Biomedical image inspection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10072—Tomographic images
- G06T2207/10081—Computed x-ray tomography [CT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30061—Lung
- G06T2207/30064—Lung nodule
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Radiology & Medical Imaging (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Apparatus For Radiation Diagnosis (AREA)
- Image Analysis (AREA)
Abstract
The invention relates to an improved YOLO v4 lung nodule detection method fusing attention, which further improves the speed and the precision of lung nodule detection. The invention adopts YOLO v4 as a basic pulmonary nodule detection network model, and adds a mixed hole convolution module in a backbone network, so as to enlarge the receptive field without losing resolution and improve the positioning accuracy. The attention mechanism module is added at the tail end of the neck network and can be used for increasing the weight of useful features, paying more attention to a target area containing important information, simultaneously restraining the weight of invalid features and restraining irrelevant information, and therefore the overall accuracy of target detection is improved. And finally, training the data set, loading the trained optimal weight file into the model of the invention, and performing feature extraction to generate a lung nodule prediction frame and a lung nodule prediction confidence coefficient.
Description
Technical Field
The invention relates to the technical field of medical image processing, in particular to an improved YOLO v4 lung nodule detection method fusing attention.
Background
Physical examination has become an effective means for screening various diseases. The screening of lung cancer is realized by low-dose CT (computed Tomography) of lung. Lung cancer is one of the diseases with high mortality in the world, is not easy to detect in early stage, and is often in middle and late stage if the patient goes to hospital for diagnosis due to the occurrence of relevant symptoms. The current proposals for major hospitals or medical centers are that effective screening and prevention can be achieved through examination of low-dose pulmonary CT. Generally, one low-dose lung CT scan generally contains a two-dimensional transverse position lung window image, a soft tissue window image, and a coronal or sagittal position image reconstructed by later three-dimensional reconstruction, and the total image is added up to one or two hundred. Therefore, in such multiple images, the rate of missed diagnosis is high if the micro-nodules are simply searched by the naked eyes of the radiologist. With the rapid development of artificial intelligence in recent years, convolutional neural networks have been applied to a plurality of fields such as speech recognition, natural language processing, and object detection, and particularly, remarkable results have been achieved in image processing.
In clinical medicine, early lung cancer exists in the form of lung nodules, and a CAD (Computer-Aided Diagnosis) system is commonly used for detecting lung nodules as a common method for screening medical images of the lung. When a CAD system is used for lung nodule detection, firstly, data needs to be segmented and preprocessed, and the region of suspected malignant lung nodules is extracted; then classifying, and removing false positive nodules; and finally, the real positive nodules are reserved, and the influence of over-high false positive proportion on the detection result is prevented. The research on the lung nodules is mainly difficult to realize, the detection accuracy is low, the false positive is high, the traditional detection method cannot meet the requirement of accurate detection of the lung cancer in China with continuous development of scientific technology, and if the detection accuracy can be effectively improved and the false positive nodules are eliminated, the method is a great welfare for human beings.
Disclosure of Invention
1. An improved YOLO v4 lung nodule detection method fused with an attention mechanism is characterized by comprising the following specific steps of:
step 1: collecting images to establish a data set;
step 2: preprocessing a lung CT image;
step 3: a network structure for target detection is selected and improved, a mixed cavity convolution is added on the basis of a YOLO v4 network, and an attention mechanism is fused, so that the resolution ratio is not lost while the receptive field is expanded, a target area containing important information is focused more, irrelevant information is inhibited, and the positioning accuracy can be improved better;
step 4: and training the improved pulmonary nodule detection model by using the selected data set to detect pulmonary nodules.
2. The improved YOLO v4 lung nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step1 are as follows:
using the LUNA16 dataset, the LUNA16 dataset comprised 888 low dose pulmonary CT images in mhd format, each image containing a series of axial slices of the thorax.
3. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step2 are as follows:
the values obtained after CT acquisition are X-ray attenuation values in Hounsfield (HU). In pulmonary CT images, the lung HU values are typically around-500, and regions with HU values within [ -1000, +400] will remain, others are considered irrelevant for pulmonary disease detection and are discarded.
4. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step3 are as follows:
step3.1: the backbone network structure adopts CSP (chip scale package) linking, and a mixed cavity convolution module is added in the backbone network, so that the sensitivity field is enlarged, the resolution is not lost, the positioning accuracy is improved, the backbone characteristic receiving range is enlarged, and the extraction of context information is facilitated;
step3.2: because the lung nodule target is small, the invention integrates a attention mechanism module (CBAM) for increasing the weight of useful characteristics and paying more attention to a target region containing important information; meanwhile, invalid characteristic weight is restrained, and irrelevant information is restrained, so that the overall accuracy of target detection is improved.
5. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step4 are as follows:
the improved lung nodule detection model is trained by using the selected data set, the trained optimal weight file is loaded into the model of the invention for feature extraction, a series of candidate regions are generated, then the candidate frames are labeled according to the position relation between the candidate regions and the object real frames on the picture, and a lung nodule prediction frame and a lung nodule prediction confidence coefficient are generated.
The invention has the beneficial effects that: the invention can effectively improve the accuracy and speed of pulmonary nodule detection, provide a convenient and fast reading mode for clinic and reduce misdiagnosis caused by human factors in daily work of a radiologist.
Drawings
Fig. 1 is a flow chart of an improved YOLO v4 lung nodule detection based on an attention mechanism.
Fig. 2 is a schematic diagram of an improved YOLO v4 lung nodule detection network structure based on an attention mechanism.
FIG. 3 is a schematic diagram of a hybrid hole convolution module.
FIG. 4 is a schematic illustration of the attention mechanism (CBAM).
Fig. 5 is an exemplary illustration of pulmonary CT.
Fig. 6 is a diagram of lung nodule detection results in an exemplary lung CT diagram.
Detailed description of the preferred embodiments
The present invention will be described in further detail with reference to the accompanying drawings.
The invention provides an improved YOLO v4 lung nodule detection method fused with an attention mechanism, and a specific detection flow is shown in figure 1.
Step 1: raw 3D CT images are collected to build a data set.
The present invention uses the LUNA16 dataset, and the LUNA16 dataset includes 888 low-dose pulmonary CT images in mhd format, each image containing a series of axial slices of the thorax.
Step 2: and processing the lung CT image.
The data set was classified as 2D detection data in the form of VOCs, and the values obtained after CT acquisition were X-ray attenuation values in units of Hounsfield (HU). In pulmonary CT images, the lung HU values are typically around-500, and regions with HU values within [ -1000, +400] will remain, others are considered irrelevant for pulmonary disease detection and are discarded.
The criterion for the determination of nodules in the LUNA16 dataset was that at least three of the four radiologists identified the nodule as having a radius greater than 3 mm. Thus in the annotation of the data set, non-nodules, nodules with a radius of less than 3mm, and nodules deemed by 1 or two radiologists to be greater than 3mm are considered irrelevant and are discarded.
And step 3: and selecting a network structure for target detection.
According to the invention, YOLO v4 is selected as a basic target detection model, mixed cavity convolution is added, and an attention mechanism is fused, so that an improved YOLO v4 network structure based on the attention mechanism is formed, as shown in FIG. 2.
The YOLO v4 network architecture includes a backbone (capdarknet53), a neck (neck), and a head (head).
The specific improvement steps are as follows:
3.1 the backbone network structure is linked by CSP, the invention adds a mixed cavity convolution module in the backbone network, as shown in FIG. 3, the difference between the cavity convolution and the ordinary convolution lies in that the convolution kernel is filled with 0, the number of the filled 0 is different according to the difference of the cavity rate, and the receptive field is different, thereby obtaining more characteristic information with different scales.
The calculation method of the convolution kernel of the cavity convolution comprises the following steps:
x n =x k +(x k -1)×(D r -1)
wherein x is n Is the convolution kernel size, x, of the hole convolution k Is the original convolution kernel size, D r Is the void fraction.
The calculation method of the receptive field of the cavity convolution comprises the following steps:
wherein, y m The receptive field, y, of each point of the mth layer m-1 For each of the m-1 th layerReceptive field of points, x m Is the convolution kernel size, s, of the mth layer of convolution i Is the step size of the i-th layer convolution.
The mixed hole convolution module is added to enlarge the receptive field without losing the resolution and improve the positioning accuracy. The neck network adopts SSP module to increase the receiving range of main character, to extract context information.
3.2 because the lung nodule target is small, the present invention incorporates a attention-machine (CBAM) module, as shown in FIG. 4.
The attention mechanism module comprises two aspects of a Channel Attention Mechanism (CAM) and a Space Attention Mechanism (SAM).
The Channel Attention Mechanism (CAM) adopts numerical information of an adaptive average pooling and an adaptive maximum pooling compression characteristic diagram on a space dimension (channel), the adaptive average pooling has feedback on each pixel point on the characteristic diagram, and when the adaptive maximum pooling is used for gradient back propagation calculation, gradient feedback exists only at the position with the maximum response in the characteristic diagram. Two successive fully connected layers are followed by a Sigmoid activation function.
Weight of channel attention mechanism:
wherein the content of the first and second substances,andfor global average pooling and maximum pooling of features on a channel, respectively, W 1 And W 0 Sigma is a sigmoid activation function for two layers of weight parameters on the neural network.
The Space Attention Mechanism (SAM) firstly performs maximum pooling and average pooling on input features in space dimension, connects two pooled results, then performs convolution on the spliced result through a convolution check, ensures that the finally obtained features are consistent with an input feature diagram in space dimension, and then performs normalization processing by adopting a Sigmoid activation function.
Weight of spatial attention mechanism:
wherein the content of the first and second substances,andglobal average pooling and maximum pooling are respectively carried out on the spatial features, f is convolution kernel learning of 1 x 1 is used for two spliced and fused features, and sigma is a sigmoid activation function.
The attention mechanism module can be used for increasing the weight of useful features and paying more attention to a target area containing important information; meanwhile, the invalid characteristic weight is restrained, and irrelevant information is restrained, so that the overall precision of target detection is improved.
And 4, step 4: and (5) training and storing a weight file by data.
Training the lung nodule detection model of the improved YOLO v4 based on the fusion attention mechanism by using the data set, and selecting DIOU as a loss function in order to obtain ideal weight parameters
Wherein A is a prediction box, B is a label box, B gt Each represents anThe center points of the color box and the target box, p represents the Euclidean distance between the two center points, and c represents the diagonal distance of the smallest rectangle which can cover the color box and the target box simultaneously.
And (4) initializing the network to be 0.001 according to the total loss function set in the step (4), setting the batch size to be 8, and training the network on a Pytroch 1.4.0 platform by using an ADAM optimizer until the value of the total loss function is converged within a set error range, and then stopping training. And when the convergence of the loss function is close to 0, loading the trained optimal weight file into the model of the invention, and performing feature extraction to generate a series of candidate regions.
And 5: and outputting the result.
And loading the weight for prediction, and labeling the candidate frame according to the position relation between the candidate region and the real frame of the object on the picture to generate a lung nodule prediction frame and a lung nodule prediction confidence coefficient.
Fig. 5 is input as an example lung CT diagram, fig. 6 shows a lung nodule detection result in the example lung CT diagram in the test set, and it can be seen from the result diagram that the improved YOLO v4 lung nodule detection method based on the attention mechanism provided by the invention can accurately and quickly detect lung nodules from clinical real CT cases, save a large amount of radiograph reading time of radiologists, and improve the work efficiency.
Claims (5)
1. An improved YOLO v4 lung nodule detection method fused with an attention mechanism is characterized by comprising the following specific steps of:
step 1: collecting images to establish a data set;
step 2: preprocessing a lung CT image;
step 3: the method comprises the steps of selecting a network structure for target detection to be improved, adding mixed hole convolution on the basis of a YOLO v4 network and fusing an attention mechanism, so that the resolution ratio is not lost while the receptive field is expanded, a target area containing important information is focused more, irrelevant information is restrained, and the positioning accuracy can be improved better;
step 4: and training the improved pulmonary nodule detection model by using the selected data set to detect pulmonary nodules.
2. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step1 are as follows:
using the LUNA16 dataset, the LUNA16 dataset comprised 888 low dose pulmonary CT images in mhd format, each image containing a series of axial slices of the thorax.
3. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step2 are as follows:
the values obtained after CT acquisition are X-ray attenuation values in Hounsfield (HU). In pulmonary CT images, the lung HU values are typically around-500, and regions with HU values within [ -1000, +400] will remain, others are considered irrelevant for pulmonary disease detection and are discarded.
4. The improved YOLO v4 lung nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step3 are as follows:
step3.1: the backbone network structure adopts CSP (chip scale package) linking, and a mixed cavity convolution module is added in the backbone network, so that the sensitivity field is enlarged, the resolution is not lost, the positioning accuracy is improved, the backbone characteristic receiving range is enlarged, and the extraction of context information is facilitated;
step3.2: because the lung nodule target is small, the invention integrates a attention mechanism module (CBAM) for increasing the weight of useful characteristics and paying more attention to a target region containing important information; meanwhile, the invalid characteristic weight is restrained, and irrelevant information is restrained, so that the overall precision of target detection is improved.
5. The improved YOLO v4 pulmonary nodule detection method based on attention mechanism as claimed in claim 1, wherein the specific steps of Step4 are as follows:
the improved lung nodule detection model is trained by using the selected data set, the trained optimal weight file is loaded into the model of the invention for feature extraction, a series of candidate regions are generated, then the candidate frames are labeled according to the position relation between the candidate regions and the object real frames on the picture, and a lung nodule prediction frame and a lung nodule prediction confidence coefficient are generated.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210321194.XA CN115131280A (en) | 2022-03-29 | 2022-03-29 | Improved YOLO v4 lung nodule detection method fused with attention mechanism |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210321194.XA CN115131280A (en) | 2022-03-29 | 2022-03-29 | Improved YOLO v4 lung nodule detection method fused with attention mechanism |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115131280A true CN115131280A (en) | 2022-09-30 |
Family
ID=83376673
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210321194.XA Pending CN115131280A (en) | 2022-03-29 | 2022-03-29 | Improved YOLO v4 lung nodule detection method fused with attention mechanism |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115131280A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116228685A (en) * | 2023-02-07 | 2023-06-06 | 重庆大学 | Deep learning-based lung nodule detection and rejection method |
CN116402747A (en) * | 2023-02-24 | 2023-07-07 | 上海白春学人工智能科技工作室 | Multi-receptive-field attention lung nodule benign and malignant classification and identification system and method |
WO2024097623A1 (en) * | 2022-10-31 | 2024-05-10 | Drexel University | Sparse coding and extraction of ultrasound knowledge for explainable point-of-care ultrasound artificial intelligence |
-
2022
- 2022-03-29 CN CN202210321194.XA patent/CN115131280A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024097623A1 (en) * | 2022-10-31 | 2024-05-10 | Drexel University | Sparse coding and extraction of ultrasound knowledge for explainable point-of-care ultrasound artificial intelligence |
CN116228685A (en) * | 2023-02-07 | 2023-06-06 | 重庆大学 | Deep learning-based lung nodule detection and rejection method |
CN116228685B (en) * | 2023-02-07 | 2023-08-22 | 重庆大学 | Deep learning-based lung nodule detection and rejection method |
CN116402747A (en) * | 2023-02-24 | 2023-07-07 | 上海白春学人工智能科技工作室 | Multi-receptive-field attention lung nodule benign and malignant classification and identification system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109584252B (en) | Lung lobe segment segmentation method and device of CT image based on deep learning | |
CN115131280A (en) | Improved YOLO v4 lung nodule detection method fused with attention mechanism | |
US20220301714A1 (en) | Method for predicting lung cancer development based on artificial intelligence model, and analysis device therefor | |
US20230038364A1 (en) | Method and system for automatically detecting anatomical structures in a medical image | |
Oghli et al. | Automatic fetal biometry prediction using a novel deep convolutional network architecture | |
CN112258530A (en) | Neural network-based computer-aided lung nodule automatic segmentation method | |
CN113947609B (en) | Deep learning network structure and multi-label aortic dissection CT image segmentation method | |
Xie et al. | Computer‐Aided System for the Detection of Multicategory Pulmonary Tuberculosis in Radiographs | |
CN111968120A (en) | Tooth CT image segmentation method for 3D multi-feature fusion | |
CN113344951A (en) | Liver segment segmentation method based on boundary perception and dual attention guidance | |
CN111553892A (en) | Lung nodule segmentation calculation method, device and system based on deep learning | |
CN113947681A (en) | Method, apparatus and medium for segmenting medical images | |
CN114693933A (en) | Medical image segmentation device based on generation of confrontation network and multi-scale feature fusion | |
WO2021209887A1 (en) | Rapid, accurate and machine-agnostic segmentation and quantification method and device for coronavirus ct-based diagnosis | |
CN116206164B (en) | Multi-phase CT classification system based on semi-supervised contrast learning and construction method | |
CN114119516A (en) | Virus focus segmentation method based on transfer learning and cascade adaptive hole convolution | |
CN115908449A (en) | 2.5D medical CT image segmentation method and device based on improved UNet model | |
Arega et al. | Leveraging uncertainty estimates to improve segmentation performance in cardiac MR | |
CN114445429A (en) | Whole-heart ct segmentation method and device based on multiple labels and multiple decoders | |
Pal et al. | A fully connected reproducible SE-UResNet for multiorgan chest radiographs segmentation | |
Zhang et al. | Femoral image segmentation based on two-stage convolutional network using 3D-DMFNet and 3D-ResUnet | |
CN116883341A (en) | Liver tumor CT image automatic segmentation method based on deep learning | |
CN116433654A (en) | Improved U-Net network spine integral segmentation method | |
CN115294023A (en) | Liver tumor automatic segmentation method and device | |
CN113450306B (en) | Method for providing fracture detection tool |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |