CN113570573A

CN113570573A - Pulmonary nodule false positive eliminating method, system and equipment based on mixed attention mechanism

Info

Publication number: CN113570573A
Application number: CN202110855736.7A
Authority: CN
Inventors: 李良才; 王骏; 王艳芳; 马力; 陈庆武
Original assignee: Zhongshan Yangshi Technology Co ltd
Current assignee: Zhongshan Yangshi Technology Co ltd
Priority date: 2021-07-28
Filing date: 2021-07-28
Publication date: 2021-10-29

Abstract

The invention discloses a pulmonary nodule false positive elimination method, a system, equipment and a medium based on a mixed attention mechanism. The invention relates to a lung nodule false positive eliminating method based on a mixed attention mechanism, which comprises the following steps: obtaining positive pulmonary nodule three-dimensional training data and false positive pulmonary nodule three-dimensional training data; building a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism; inputting training data into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism for training to obtain a pulmonary nodule false positive exclusion model; obtaining a three-dimensional lung CT image to be excluded, and cutting the three-dimensional lung CT image to obtain candidate lung nodule slice data; and inputting the candidate lung nodule slice data into a lung nodule false positive elimination model, and obtaining a lung nodule false positive elimination result. The lung nodule false positive elimination method based on the mixed attention mechanism has the advantages of appropriate parameter quantity, light and simple calculation amount, high model convergence speed and high lung nodule false positive elimination result precision.

Description

Pulmonary nodule false positive eliminating method, system and equipment based on mixed attention mechanism

Technical Field

The invention relates to the technical field of image processing, in particular to a pulmonary nodule false positive elimination method, a system, equipment and a computer readable storage medium based on a mixed attention mechanism.

Background

Lung cancer is the cancer with the highest morbidity and mortality in the world. Early lung cancer exists in the form of lung nodules, and early diagnosis and treatment of lung nodules can improve patient survival. Computed Tomography (CT) is currently the most commonly used image for lung nodule detection. The existing pulmonary nodule computer-aided detection system mainly comprises two stages of nodule candidate detection and false positive screening. The target of the candidate nodule detection stage is to detect all suspected nodules in the CT image as much as possible so as to improve the lung nodule detection sensitivity; the false positive screening stage aims to classify the detected candidate nodules into true positive nodules and false positive nodules to eliminate the false positive nodules in the candidate nodules, so that the accuracy of lung nodule detection is improved.

In the conventional pulmonary nodule computer-aided detection system, in order to obtain high pulmonary nodule detection sensitivity in a nodule candidate detection stage, a large number of false positive pulmonary nodules exist in detected nodule candidates, so that the false positive pulmonary nodules need to be screened. In conventional false positive lung nodule screening, a 3D CNN model is generally used to perform feature extraction on candidate lung nodules, and then two classifications are used to classify the features.

However, the existing false positive pulmonary nodule screening technology has the following defects: the 3D CNN model has the defects of extremely large parameter quantity, huge and complicated calculated quantity, extremely low model convergence speed and easiness in overfitting.

Disclosure of Invention

Accordingly, an object of the present invention is to provide a method, a system, a device and a computer readable storage medium for eliminating false positive of pulmonary nodule based on a hybrid attention mechanism, which have suitable parameter amount, light and simple calculation amount, fast model convergence rate and high accuracy of the result of eliminating false positive of pulmonary nodule.

The purpose of the invention is realized by the following technical scheme:

in a first aspect, a method for eliminating false positive of pulmonary nodule based on mixed attention mechanism includes the following steps:

obtaining positive pulmonary nodule three-dimensional training data and false positive pulmonary nodule three-dimensional training data;

building a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism;

inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism for training to obtain a pulmonary nodule false positive exclusion model;

obtaining a three-dimensional lung CT image to be excluded, and cutting the three-dimensional lung nodule three-dimensional training data and the three-dimensional lung nodule three-dimensional training data to obtain candidate lung nodule slice data by taking the coordinate positions of the positive lung nodule three-dimensional training data and the false positive lung nodule three-dimensional training data as centers;

and inputting the candidate lung nodule slice data into a lung nodule false positive elimination model, and obtaining a lung nodule false positive elimination result.

The invention discloses a lung nodule false positive elimination method based on a mixed attention mechanism, which comprises the following steps:

(1) regarding the three-dimensional slice data as a two-dimensional slice sequence form in the slice dimension, modeling the sequence by using a TSN network model, and performing overall time sequence learning on the slice sequence.

(2) And improving a Residual block (Residual block) of a backbone network 2D Resnet-18 network used for feature extraction: the hybrid Attention Module (Attention Module) combining Motion Attention (ME), Coordinate Attention (CA), compression-and-Excitation (SE) learns the temporal variation characteristics, spatial position characteristics, and channel importance of the candidate nodules.

(3) The lung nodule false positive elimination method based on the mixed attention mechanism is light-weight in network and convenient to deploy. Compared with a 3D CNN structure model, the method can effectively reduce the parameter quantity and the calculated quantity, still improve the overall accuracy while saving the operation resources, and effectively solve the problems of extremely large parameter quantity, huge and complicated calculated quantity, extremely low model convergence speed and easiness in overfitting in the 3D CNN model.

Further preferably, the constructing of the TSN candidate pulmonary nodule classification network based on the hybrid attention mechanism specifically includes: and a 2D Resnet 18 backbone network is adopted as a TSN feature extraction network, a residual module in the TSN feature extraction network is improved, a mixed attention module combining motion excitation, coordinate attention, compression and excitation is added, and the TSN candidate pulmonary nodule classification network based on the mixed attention mechanism is obtained.

Further preferably, the inputting of the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism for training specifically includes: inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism, and training the TSN candidate pulmonary nodules by using a binary cross entropy loss function to obtain a pulmonary nodule false positive exclusion model; wherein the formula of the binary cross entropy loss function is as follows:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

Further preferably, the method for eliminating false positive of pulmonary nodule based on mixed attention mechanism further comprises the following steps: and preprocessing the three-dimensional lung CT image to be excluded.

In a second aspect, a lung nodule false positive exclusion system based on a mixed attention mechanism includes:

the training data acquisition module is used for acquiring positive pulmonary nodule three-dimensional training data and false positive pulmonary nodule three-dimensional training data;

the classification network building module is used for building a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism;

the model training module is used for inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism for training to obtain a pulmonary nodule false positive exclusion model;

the data to be excluded acquisition module is used for acquiring a three-dimensional lung CT image to be excluded and cutting the three-dimensional lung CT image to obtain candidate lung nodule slice data by taking the coordinate positions of the positive lung nodule three-dimensional training data and the false positive lung nodule three-dimensional training data as centers;

and the result output module is used for inputting the lung nodule candidate slice data into the lung nodule false positive elimination model and obtaining a lung nodule false positive elimination result.

Further preferably, the building of the classification network building module based on the TSN candidate pulmonary nodule classification network of the hybrid attention mechanism specifically includes: and a 2D Resnet 18 backbone network is adopted as a TSN feature extraction network, a residual module in the TSN feature extraction network is improved, a mixed attention module combining motion excitation, coordinate attention, compression and excitation is added, and the TSN candidate pulmonary nodule classification network based on the mixed attention mechanism is obtained.

Further preferably, the model training module inputs the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism for training, and specifically includes: inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism, and training the TSN candidate pulmonary nodules by using a binary cross entropy loss function to obtain a pulmonary nodule false positive exclusion model; wherein the formula of the binary cross entropy loss function is as follows:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

Further preferably, the system for eliminating false positive of pulmonary nodule based on mixed attention mechanism further comprises a preprocessing module for preprocessing the three-dimensional pulmonary CT image to be eliminated.

In a third aspect, a lung nodule false positive exclusion apparatus based on a mixed attention mechanism includes a storage device for storing one or more programs and a processor;

when the one or more programs are executed by the processor, the processor implements a mixed attention mechanism-based lung nodule false positive exclusion method as any of the above.

In a fourth aspect, a computer readable storage medium stores at least one program which, when executed by a processor, implements a mixed attention mechanism-based lung nodule false positive exclusion method as in any one of the above.

Compared with the prior art, the lung nodule false positive elimination method, the system, the equipment and the computer readable storage medium based on the mixed attention mechanism have the advantages of appropriate parameter quantity, light and simple calculation amount, high model convergence speed and high lung nodule false positive elimination result precision.

For a better understanding and practice, the invention is described in detail below with reference to the accompanying drawings.

Drawings

FIG. 1 is a flow chart of a method of lung nodule false positive exclusion based on a mixed attention mechanism of the present invention.

FIG. 2 is a modeling schematic of a compression and excitation module in a hybrid attention machine.

FIG. 3 is a schematic diagram of modeling a motion excitation module in a hybrid attention mechanism.

FIG. 4 is a modeling diagram of a coordinate attention module in a hybrid attention mechanism.

Fig. 5 shows the overall structure of the network of the lung nodule false positive exclusion model in the present invention.

Figure 6 shows the sequence of true positive lung nodule slices.

Fig. 7 shows a sequence of false positive lung nodule slices.

FIG. 8 is a block diagram of a lung nodule false positive exclusion system based on a mixed attention mechanism of the present invention.

Detailed Description

The terms of orientation of up, down, left, right, front, back, top, bottom, and the like, referred to or may be referred to in this specification, are defined relative to their configuration, and are relative concepts. Therefore, it may be changed according to different positions and different use states. Therefore, these and other directional terms should not be construed as limiting terms.

The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.

The terminology used in the present disclosure is for the purpose of describing particular embodiments only and is not intended to be limiting of the disclosure. As used in this disclosure and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.

A lung nodule false positive elimination method based on a mixed attention mechanism is shown in figure 1 and comprises the following steps:

and S1, obtaining positive lung nodule three-dimensional training data and false positive lung nodule three-dimensional training data.

Specifically, the three-dimensional training data of the positive pulmonary nodule and the three-dimensional training data of the false positive pulmonary nodule may be obtained from a Lung CT public data set LUNA2016(Lung nodeleanalysis 16), which is obtained by combining the positive pulmonary nodule and the false positive pulmonary nodule detected by a plurality of pulmonary nodule detection systems, respectively, and corresponding labeling information is stored in a candidates _ v2.csv file, where the labeling information includes seriesuid of a CT to which the pulmonary nodule belongs, world coordinates (x, y, z) of the pulmonary nodule, and a category of the pulmonary nodule (the true positive pulmonary nodule is 1, and the false positive pulmonary nodule is 0), the world coordinates of the pulmonary nodule are converted into corresponding preprocessed pixel coordinates, and a positive sample (the true positive pulmonary nodule) and a negative sample (the false positive pulmonary nodule) with a size of 42 × 42 pixels are cut out by using the pixel coordinates as a center, so as to construct a training set, a verification set, and a test set.

In this embodiment, the three-dimensional training data of the positive lung nodule and the three-dimensional training data of the false positive lung nodule may be training data including several pieces of slice data obtained from a three-dimensional CT image of the lung nodule.

And S2, constructing a TSN candidate lung nodule classification network based on the mixed attention mechanism.

Specifically, a 2D Resnet 18 backbone network is adopted as a TSN feature extraction network, a residual module in the TSN feature extraction network is improved, a mixed Attention module combining Motion Excitation (ME), Coordinate Attention (CA), compression and Excitation (SE) is added, and the TSN candidate pulmonary nodule classification network based on the mixed Attention mechanism is obtained.

Time series segmentation network (TSN), the first framework to use 2D CNN for video behavior recognition. The TSN extracts short-term motion characteristics of images at different time sequence positions, and finally, a plurality of extracted characteristics are fused to realize long-term motion characteristic learning. In particular, given a video V, it is divided into K parts of the same duration S₁,S₂,…,S_KFrom the corresponding fragment S_kIn randomly extracting a frame T_kTSN frame sequence (T)₁,T₂,…,T_K) Modeling is performed as follows:

TSN(T₁,T₂,…,T_K)＝H(G(F(T₁,W),F(T₂,W),…,F(T_K,W))) (1)

wherein F (T)_KW) is a function representing a convolutional network with parameter W, video frame T_KAnd obtaining a C-dimensional vector after the convolution network, wherein C represents the number of the categories. The function G (-) is a segment consensus function, which can beAnd fusing the prediction vectors of the K segments, wherein the function H (-) is a Softmax function and is used for predicting the category of the video.

The model algorithm is based on a TSN model thought, a lung nodule three-dimensional CT image is regarded as a combination in the slicing direction (the cross section and the Z-axis direction), TSN is used for segmenting the slices, feature vectors of all segments are respectively provided through a convolution network, finally the feature vectors of all segments are fused, and Softmax is used for outputting class probability (true positive lung nodule probability or false positive lung nodule probability) to the fused feature vectors.

The feature extraction network uses a Deep Residual network (Deep Residual Networks), and a 2D ResNet-18 backbone network is selected as a TSN feature extraction network G (-) in consideration of the problems of large volume and overfitting of pulmonary nodule training data.

The 2D ResNet-18 backbone network is used as a TSN feature extraction network G (-) and the specific feature extraction is described as follows:

converting the shapes [ L, H and W ] of the lung nodules in the positive lung nodule three-dimensional training data and the false positive lung nodule three-dimensional training data into the forms of [ T, C, H and W ], wherein L is the lung nodule slice number of the positive lung nodule three-dimensional training data and the false positive lung nodule three-dimensional training data, H, W corresponds to the width and the height of a section, T represents the number of segments, and C represents the number of channels. Considering that the number L of pulmonary nodule slices is small, after a nodule is divided into T segments, each segment is not randomly selected, but the slice allocated to each segment is treated as a channel, that is, L is T × C. The T segment data are each input into 2D ResNet-18 to extract segmented features F ═ F1, F2, …, FT ], where F is the representation of the features in the hidden layer with dimensions [ T, C ', H', W '], considering the case of Batch Size, F has dimensions [ N, T, C', H ', W' ], where N is the Size of Batch Size.

In other embodiments, the feature extraction network may also employ other backbone networks, including but not limited to ResNet, ResNext, densnet, MobileNet, inclepionnet, and other feature extraction networks.

The mixed Attention module comprises three parts, namely a compression-and-Excitation (SE) module, a Motion Excitation (ME) module and a Coordinate Attention (CA) module, and is used for respectively learning the channel importance degree, the adjacent segment Motion information and the spatial position information of the lung nodule characteristics.

It should be noted that, the tensor outside the hybrid attention module is 4D, i.e., [ N × T, C, H, W ], where N represents the back Size, T represents the number of segments, C represents the number of color channels, H represents the height, and W represents the width, when the hybrid attention module is input in the TSN model, the input 4D tensor should be firstly reshaped into 5D tensor [ N, T, C, H, W ], so that the operation can be performed on a specific dimension inside the hybrid attention module. The 5D tensor output by the hybrid attention module is then reshaped into a 4D tensor, which is provided for subsequent use by 2D convolution.

The compression and excitation module is a channel attention mechanism, mainly from the aspect of characteristic channel correlation. The directivity of the convolutional layer extracted features is enhanced by adaptively recalibrating the feature response of the channels by explicitly modeling the interdependencies between the channels. The core of the compression and excitation module used in the present invention is divided into two parts, compression and excitation, as shown in fig. 2.

Compression given input characteristics

X represents the characteristics of the three-dimensional training data of the positive pulmonary nodules or the characteristics of the three-dimensional training data of the false positive pulmonary nodules, and global average pooling is performed and can be represented as:

wherein

Excitation of feature F after extrusion_sBy giving a scaling r, the features are obtained by 1 x 1 two-dimensional convolution

To F_rAfter ReLU nonlinear transformation, the features are restored by 1 × 1 two-dimensional convolution

Will be characterized by F_tempInputting the signal into a Sigmoid activation function to obtain an inverted channel mask,

the output characteristics of the final compression and excitation module can be expressed as:

F_o＝X+X⊙F_m(3) wherein, in the step (A),

the purpose of the motion excitation module is to model the motion information of neighboring segments, as shown in figure 3,

to reduce the amount of computation, the input feature X is first squeezed using a 1 × 1 two-dimensional convolution, the feature after squeezing being

The modeling operation on the motion characteristics can then be expressed as:

F_me＝K*F_r[:,t+1,:,:,:]－F_r[:,t,:,:,:] (4)

wherein K is a 3X 3 2D convolutional layer,

the motion features are then concatenated according to the time dimension, with 0 filling the last element, i.e., F_ME＝[F_me(1),...,F_me(t-1),0]Wherein

Then F is paired with the above formula (2)_MEPerforming spatial average pooling, and restoring the processed characteristic channel by 1 × 1 two-dimensional convolution to obtain characteristics

Will be characterized by F_tempInputting the information into a Sigmoid activation function to obtain a corresponding mask,

finally, the output of the ME module is obtained according to the formula (3)

The compression and excitation modules (channel attention) usually ignore the position information, which is very important for generating a spatially selective attention map. The coordinate attention module embeds position information into channel attention, unlike channel attention which converts feature tensors into a single feature vector through two-dimensional global pooling, the coordinate attention module is an encoding process that decomposes channel attention into two one-dimensional features, aggregating the features along two spatial directions, respectively. Therefore, the remote dependency relationship in one spatial direction can be obtained, and the accurate position information in the other spatial direction can be reserved. The generated feature maps are then encoded into a pair of direction-aware and position-sensitive attention maps, respectively, which can be applied complementarily to the input feature maps to increase the representation of the object of interest. The details of the coordinate attention module used in the present invention are shown in fig. 4, and are specifically divided into two parts of coordinate information embedding and coordinate attention generation.

Coordinate information embedding: the input feature X is decomposed into a pair of one-dimensional feature encoding operations using global pooling equation (2). Specifically, for input feature X, each channel is first encoded along the horizontal and vertical coordinate directions using pooling kernels of sizes (H, 1) and (1, W). For the C channel, the output in the H direction can be expressed by equation (5):

i represents a value between 0 (inclusive of 0) and W (exclusive of W).

Similarly, for the C-th channel, the output in the W direction can be expressed by equation (6):

j represents a value between 0 (inclusive of 0) and H (exclusive of H).

The above two transformations perform feature aggregation along two spatial directions, returning a pair of direction-aware attention maps.

Coordinate attention generation: for the aggregate feature maps generated by equations (5) and (6), they are first stitched together and then used with a shared 1 × 1 convolution transform function F₁To obtain

f＝S(F₁([z^h,z^w])) (7)

Wherein [ Z ] is^h，Z^W]Representing the stitching operation along the spatial dimension, delta is a non-linear activation function,

intermediate feature maps obtained by encoding spatial information from the horizontal direction and the vertical direction. F is then decomposed into two separate tensors along the spatial dimension

And

reusing two 1 x 1 convolution transform functions F_hAnd F_wRespectively converting the tension back to the same number of channels as the input characteristic X to obtain g^hAnd g^wRespectively expressed as:

g^h＝δ(F_h(f^h)) (8)

g^w＝S(F_w(f^w)) (9)

g is prepared from^h、g^wAs attention weights, the final attention module output is:

F_o＝X+X×g^h×g^w(10) wherein

The feature fusion strategy in the invention is as follows: and directly fusing the extracted T feature vectors by using an average pooling (avgPooling) mode. And outputting the classification probability (the probability of the true positive pulmonary nodule or the probability of the false positive pulmonary nodule) by using Softmax for the fused feature vector.

S3, inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism, and training the TSN candidate pulmonary nodule classification network by utilizing a binary cross entropy loss function to obtain a pulmonary nodule false positive exclusion model; wherein the formula of the binary cross entropy loss function is as follows:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

Specifically, 42 × 42 pixels of positive lung nodule three-dimensional training data and false positive lung nodule three-dimensional training data are subjected to data enhancement (x-axis, y-axis and z-axis are randomly inverted, 36 × 36 size regions are randomly cut, and x-axis and y-axis are randomly rotated by 90, 180 and 270 degrees). The enhanced data reshape is input into a TSN classification model in a form of 12 × 3 × 36, 12 time sequence feature vectors are extracted through a backbone network, the 12 time sequence feature vectors are subjected to average pooling to obtain final fusion features, and the final fusion features are subjected to binary classification. The overall structure of the network of the lung nodule false positive exclusion model is shown in fig. 5.

And S4, obtaining a three-dimensional lung CT image to be excluded, and cutting the three-dimensional lung nodule three-dimensional training data and the three-dimensional training data of the false positive lung nodule by taking the coordinate position of the three-dimensional training data of the positive lung nodule and the three-dimensional training data of the false positive lung nodule as a center to obtain candidate lung nodule slice data. Referring to fig. 6-7, fig. 6 is a sequence of slices of a true positive lung nodule, fig. 7 is a sequence of slices of a false positive lung nodule, and one row lists the partial slice sequences of one lung nodule.

This step further includes step S41: and preprocessing the three-dimensional lung CT image to be excluded. Specifically, the spacing uniform sampling of the three-dimensional lung CT image data is adjusted to be 1mm × 1mm × 1mm pixel interval, and the HU value of the CT image is intercepted to the interval [ -1200,600], and is converted into the gray value between [0, 255 ].

And S5, inputting the lung nodule candidate slice data into a lung nodule false positive elimination model, and obtaining a lung nodule false positive elimination result.

When the lung nodule false positive exclusion model is used for obtaining the false positive lung nodule predicted value of the lung nodule candidate slice data, the probability that the lung nodule candidate slice data is a false positive lung nodule can be obtained, so that whether the lung nodule candidate represented by the lung nodule candidate slice data is excluded or not is judged, and when the predicted value is larger than a preset threshold value, the lung nodule candidate represented by the lung nodule candidate slice data is considered as a false positive lung nodule, namely, the lung nodule candidate represented by the lung nodule candidate slice data is excluded.

(3) The lung nodule false positive elimination method based on the mixed attention mechanism is light-weight in network and convenient to deploy. Compared with a 3D CNN structure model, the method can effectively reduce the parameter quantity and the calculated quantity, and the overall accuracy is still improved while the operation resources are saved. See table below:

compared with a model without a mixed attention module, the TSN candidate lung nodule classification network model based on the mixed attention mechanism and constructed by the lung nodule false positive elimination method based on the mixed attention mechanism increases about 22 thousands of parameters under the condition of using the mixed attention module, and the accuracy of predicting true positive lung nodules and false positive lung nodules is improved by 0.64%.

Compared with the 3D Resnet-18, the TSN candidate pulmonary nodule classification network model based on the mixed attention mechanism and built by the pulmonary nodule false positive elimination method based on the mixed attention mechanism has the advantages that the parameter quantity is about one third, and meanwhile, the accuracy rate of predicting the true positive pulmonary nodules and the false positive pulmonary nodules is improved by 0.28%.

Therefore, the lung nodule false positive elimination method based on the mixed attention mechanism can effectively solve the problems of extremely large parameter quantity, huge and complicated calculated quantity, extremely low model convergence speed and easiness in overfitting in a 3D CNN model.

In addition, it should be noted that the invention is applicable to the second-stage false positive lung nodule screening task of the lung nodule computer-aided detection rule system, and does not relate to the design of the detection network of the first-stage candidate lung nodule. Any nodule candidate detection network capable of predicting the coordinate position of a nodule candidate can be applied to the method provided by the invention to carry out false positive screening processing.

Furthermore, the classification algorithm used by the invention can classify three-dimensional targets of other CT classes, and is also applicable.

The invention also discloses a lung nodule false positive elimination system based on a mixed attention mechanism, as shown in fig. 8, comprising:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

The invention also discloses a pulmonary nodule false positive elimination device based on the mixed attention mechanism, which comprises a storage device and a processor, wherein the storage device is used for storing one or more programs;

when the one or more programs are executed by the processor, the processor implements a mixed attention mechanism-based lung nodule false positive exclusion method as described above.

The device may also preferably include a communication interface for communicating with external devices and for interactive transmission of data.

It should be noted that the memory may include a high-speed RAM memory, and may also include a nonvolatile memory (nonvolatile memory), such as at least one disk memory.

In a specific implementation, if the memory, the processor and the communication interface are integrated on a chip, the memory, the processor and the communication interface can complete mutual communication through the internal interface. If the memory, the processor and the communication interface are implemented independently, the memory, the processor and the communication interface may be connected to each other through a bus and perform communication with each other.

The present invention also discloses a computer-readable storage medium storing at least one program which, when executed by a processor, implements a method for mixed attention mechanism-based lung nodule false positive exclusion as described above.

It should be appreciated that the computer-readable storage medium is any data storage device that can store data or programs which can thereafter be read by a computer system. Examples of computer-readable storage media include: read-only memory, random access memory, CD-ROM, HDD, DVD, magnetic tape, optical data storage devices, and the like.

The computer readable storage medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.

Program code embodied on a computer readable storage medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, Radio Frequency (RF), etc., or any suitable combination of the foregoing.

In some embodiments, the computer-readable storage medium may also be non-transitory.

Compared with the prior art, the lung nodule false positive elimination method, the system, the equipment and the computer readable storage medium based on the mixed attention mechanism have the advantages of appropriate parameter quantity, light and simple calculation amount, high model convergence speed and high lung nodule false positive elimination structure precision.

The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention.

Claims

1. A pulmonary nodule false positive elimination method based on a mixed attention mechanism is characterized by comprising the following steps:

2. The method for eliminating the false positive of the pulmonary nodule based on the hybrid attention mechanism according to claim 1, wherein the constructing of the TSN candidate pulmonary nodule classification network based on the hybrid attention mechanism is specifically as follows: and a 2D Resnet 18 backbone network is adopted as a TSN feature extraction network, a residual module in the TSN feature extraction network is improved, a mixed attention module combining motion excitation, coordinate attention, compression and excitation is added, and the TSN candidate pulmonary nodule classification network based on the mixed attention mechanism is obtained.

3. The method for eliminating the false positive of the lung nodule based on the hybrid attention mechanism according to claim 1, wherein the three-dimensional training data of the positive lung nodule and the three-dimensional training data of the false positive lung nodule are input into a TSN candidate lung nodule classification network based on the hybrid attention mechanism for training, specifically: inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism, and training the TSN candidate pulmonary nodules by using a binary cross entropy loss function to obtain a pulmonary nodule false positive exclusion model; wherein the formula of the binary cross entropy loss function is as follows:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

4. The method for mixed attention mechanism-based lung nodule false positive exclusion as claimed in claim 1, wherein the method for mixed attention mechanism-based lung nodule false positive exclusion further comprises the steps of: and preprocessing the three-dimensional lung CT image to be excluded.

5. A pulmonary nodule false positive exclusion system based on a mixed attention mechanism, comprising:

6. The system for eliminating the false positive of the pulmonary nodule based on the hybrid attention mechanism according to claim 5, wherein the building of the classification network building module is specifically as follows: and a 2D Resnet 18 backbone network is adopted as a TSN feature extraction network, a residual module in the TSN feature extraction network is improved, a mixed attention module combining motion excitation, coordinate attention, compression and excitation is added, and the TSN candidate pulmonary nodule classification network based on the mixed attention mechanism is obtained.

7. The system according to claim 5, wherein the model training module inputs the three-dimensional training data of positive lung nodules and the three-dimensional training data of false positive lung nodules into a TSN candidate lung nodule classification network based on the mixed attention mechanism for training, specifically: inputting the three-dimensional training data of the positive pulmonary nodules and the three-dimensional training data of the false positive pulmonary nodules into a TSN candidate pulmonary nodule classification network based on a mixed attention mechanism, and training the TSN candidate pulmonary nodules by using a binary cross entropy loss function to obtain a pulmonary nodule false positive exclusion model; wherein the formula of the binary cross entropy loss function is as follows:

wherein y is_iAs a true tag, p_iFalse positive lung nodule prediction.

8. The mixed attention mechanism-based lung nodule false positive exclusion system of claim 5, further comprising a pre-processing module for pre-processing the three-dimensional lung CT image to be excluded.

9. A mixed attention mechanism-based lung nodule false positive exclusion apparatus, comprising a storage device for storing one or more programs and a processor;

the one or more programs, when executed by the processor, implement the mixed attention mechanism-based lung nodule false positive exclusion method of any of claims 1-4.

10. A computer readable storage medium storing at least one program which, when executed by a processor, implements a method for lung nodule false positive exclusion based on a mixed attention mechanism according to any one of claims 1-4.