CN113469954B - Method and device for evaluating benign and malignant lung nodule - Google Patents
Method and device for evaluating benign and malignant lung nodule Download PDFInfo
- Publication number
- CN113469954B CN113469954B CN202110660081.8A CN202110660081A CN113469954B CN 113469954 B CN113469954 B CN 113469954B CN 202110660081 A CN202110660081 A CN 202110660081A CN 113469954 B CN113469954 B CN 113469954B
- Authority
- CN
- China
- Prior art keywords
- lung
- estimator
- benign
- malignant
- nodule
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 206010056342 Pulmonary mass Diseases 0.000 title claims abstract description 177
- 230000003211 malignant effect Effects 0.000 title claims abstract description 81
- 238000000034 method Methods 0.000 title claims abstract description 21
- 238000012549 training Methods 0.000 claims abstract description 35
- 238000013210 evaluation model Methods 0.000 claims abstract description 33
- 238000000605 extraction Methods 0.000 claims abstract description 32
- 230000004927 fusion Effects 0.000 claims abstract description 18
- 206010028980 Neoplasm Diseases 0.000 claims description 24
- 201000011510 cancer Diseases 0.000 claims description 24
- 230000036210 malignancy Effects 0.000 claims description 24
- 238000011156 evaluation Methods 0.000 claims description 10
- 238000002372 labelling Methods 0.000 claims description 6
- 230000002685 pulmonary effect Effects 0.000 claims description 6
- 238000003384 imaging method Methods 0.000 description 8
- 238000013528 artificial neural network Methods 0.000 description 6
- 102100029469 WD repeat and HMG-box DNA-binding protein 1 Human genes 0.000 description 4
- 101710097421 WD repeat and HMG-box DNA-binding protein 1 Proteins 0.000 description 4
- 210000004072 lung Anatomy 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000010827 pathological analysis Methods 0.000 description 3
- 238000012935 Averaging Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000013145 classification model Methods 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000011158 quantitative evaluation Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
- G06T7/0012—Biomedical image inspection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/25—Fusion techniques
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H30/00—ICT specially adapted for the handling or processing of medical images
- G16H30/20—ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10072—Tomographic images
- G06T2207/10081—Computed x-ray tomography [CT]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20081—Training; Learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
- G06T2207/30061—Lung
- G06T2207/30064—Lung nodule
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Radiology & Medical Imaging (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Medical Informatics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Quality & Reliability (AREA)
- Public Health (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Apparatus For Radiation Diagnosis (AREA)
Abstract
The invention provides a method and a device for evaluating benign and malignant lung nodule. The method comprises the following steps: constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module; training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator; and inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain the benign and malignant scoring of the lung nodule. Because the data set for training the evaluation model adopts the labels with relatively high and low benign and malignant degrees of the two lung nodules, and the consistency of the labels marked by different doctors is good, the accuracy of evaluating the benign and malignant degrees of the lung nodules is improved; the training method of the invention also reduces the time complexity and improves the speed of model training.
Description
Technical Field
The invention relates to the technical field of image processing by utilizing a neural network, in particular to a method and a device for evaluating benign and malignant lung nodule.
Background
In the prior art, the identification of benign and malignant lung nodules refers to the automatic identification of benign and malignant lung nodules in a given lung CT image through a deep neural network. The accuracy of neural networks in identifying benign and malignant lung nodules is very dependent on the accuracy of benign and malignant labels of lung CT images in training data. Labels for CT images of lung nodules have two sources: the first is that the imaging doctor judges the benign and malignant lung nodules; the second is to judge the benign and malignant lung nodules by microscopic pathological analysis. The cost of marking by the imaging doctor is low, and the marking accuracy is about 75%. The marking cost of pathological analysis is high, the accuracy is close to 100%, and the pathological analysis marking is generally regarded as a true benign and malignant marking of lung nodules. A specific method for imaging physician marking uses several imaging physicians to score lung CT images from 1 to 5, 1 representing the lowest malignancy and 5 representing the highest malignancy. If the average score of the lung CT image is higher than 3.5 points, the label of the image is malignant, if the average score is lower than 2.5 points, the label of the image is benign, and the image with the average score of 2.5 to 3.5 points is discarded. Most of the basic lung nodule CT image datasets currently used are marked by imaging doctors. However, the accuracy of the imaging physician marking is greatly affected by human factors, the standards of the lung nodule benign and malignant scoring by different imaging physicians are inconsistent, some diagnoses are more conservative, and some diagnoses are more aggressive. There is a great difference between the scores given by different imaging physicians to the same CT image of a lung nodule. Thereby resulting in a lower quality neural network data set that affects neural network training.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a method and a device for evaluating benign and malignant lung nodule.
In order to achieve the above object, the present invention adopts the following technical scheme.
In a first aspect, the present invention provides a method for evaluating benign and malignant lung nodule, comprising the steps of:
constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted average of the scoring;
training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator;
and inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain the benign and malignant scoring of the lung nodule.
Further, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
Further, the estimator output score has a minimum value of 1 and a maximum value of 5.
Further, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
Further, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
in a second aspect, the present invention provides a pulmonary nodule benign and malignant evaluation apparatus comprising:
the modeling module is used for constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
the training module is used for training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels of the estimators;
and the evaluation module is used for inputting the CT image of the lung nodule to be evaluated into the trained evaluation model to obtain the benign and malignant scoring of the lung nodule.
Further, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting CT images of N lung nodules as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
Further, the estimator output score has a minimum value of 1 and a maximum value of 5.
Further, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
Further, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained benign and malignant phasesA quantized value of the height; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
compared with the prior art, the invention has the following beneficial effects.
According to the invention, an evaluation model comprising a feature extraction module, an estimation module and a data fusion module is constructed, a training data set consisting of a lung nodule pair and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of two lung nodules in the lung nodule pair is utilized to train the evaluation model, parameters of the feature extraction module and each estimator are trained based on the output of each estimator and a loss function obtained by the labels of the estimator, the lung nodule to be evaluated is input into the trained evaluation model, the benign and malignant scoring of the lung nodule is obtained, and quantitative evaluation of the benign and malignant of the lung nodule is realized. Because the data set for training the evaluation model adopts the labels with relatively high and low benign and malignant degrees of the two lung nodules, and the consistency of the labels marked by different doctors is good, namely the label precision is high, the accuracy of the benign and malignant evaluation of the lung nodules is improved; meanwhile, the parameters of the feature extraction module and each estimator are trained by the loss function obtained based on the output of each estimator and the label thereof, so that the time complexity is reduced, and the model training speed is improved.
Drawings
Fig. 1 is a flowchart of a method for evaluating benign and malignant lung nodules according to an embodiment of the present invention.
Fig. 2 is a schematic structural diagram of the evaluation model.
Fig. 3 is a block diagram of a pulmonary nodule benign and malignant evaluation apparatus according to an embodiment of the present invention.
Detailed Description
The present invention will be further described with reference to the drawings and the detailed description below, in order to make the objects, technical solutions and advantages of the present invention more apparent. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Fig. 1 is a flowchart of a method for evaluating benign and malignant lung nodule according to an embodiment of the present invention, including the steps of:
step 101, constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
step 102, training parameters of a feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels, which are marked by different doctors, about the relative high and low degrees of benign and malignant of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels thereof;
and step 103, inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain a benign and malignant score of the lung nodule.
In this embodiment, an artificial neural network (such as convolutional neural network CNN) is used to construct an evaluation model of benign and malignant lung nodules. The input of the evaluation model is a lung nodule CT image, and the output is a quantitative evaluation of the benign and malignant properties of the input lung nodule, namely a comprehensive score. The benign and malignant scoring can be automatically output only by inputting CT images of lung nodules to be evaluated into a trained evaluation model.
In this embodiment, step 101 is mainly used for constructing a benign and malignant evaluation model of lung nodules. The evaluation model mainly comprises a feature extraction module, an estimation module and a data fusion module, as shown in fig. 2. The feature extraction module is used for extracting features of the input lung nodule CT image. The estimation module is composed of a plurality of estimators, all estimators share the output of the feature extraction module, and scoring is performed according to the lung nodule feature output by the feature extraction module, for example, the larger the lung nodule is, the more branches are, the higher the possibility of malignancy is, and the scoring value is higher. The data fusion module is used for fusing the scores output by all the estimators to obtain a comprehensive score. The fusion method adopted by the embodiment is to calculate the average value of the scoring weights of the estimators, and the calculation formula is as follows:
wherein S is the final integrated score, S r Scoring the r-th estimator, k r Weights scoring the R-th estimator, R being the number of estimators. The simplest weighted value strategies are to take 1, and the data fusion is equivalent to averaging. However, in order to improve the effect of data fusion, different weights are generally set according to different reliability of the estimator, and the better the reliability, the larger the estimator weight. Will be laterA specific embodiment for determining the estimator weights is presented.
In this embodiment, step 102 is mainly used for training the evaluation model. The model training of this embodiment is mainly to optimize the parameters of each estimator and the parameters of the feature extraction module. The training data set in the prior art generally consists of a single lung nodule CT image sample and benign and malignant labels marked by doctors, and the label accuracy is not high due to inconsistent scoring standards of different doctors, so that the prediction accuracy of a trained model is not high. But the consistency of the evaluation made by different doctors of the relative high and low (i.e. which is more serious) of the benign and malignant degrees of two different lung nodules is quite good, as compared with the difficulty in accurately speaking the specific height of a person, but the accurate judgment of who is higher than the two persons is easy to make. For this reason, in this embodiment, the lung nodule CT image samples are "paired two by two", that is, two are paired, and then different doctors evaluate the relative height of the same lung nodule on the benign and malignant degree, and use this as the label of the lung nodule pair, so as to obtain the training data set composed of the lung nodule pair and its label. The loss function adopted in the training in this embodiment is also different from the prior art, in which the model is generally trained by using a total loss function, and in this embodiment, a loss function determined by the output and the label difference is provided for each estimator, so that the parameters of each estimator and the feature extraction module are optimized and trained. The processing method of the embodiment not only can improve the prediction precision of the model, but also can obviously reduce the time complexity of training, reduce the training time and improve the training speed.
In this embodiment, step 103 is mainly used to obtain a score of the benign or malignant lung nodule to be evaluated. And inputting the CT image of the lung nodule to be evaluated into a trained evaluation model, so that the benign and malignant scoring of the lung nodule can be conveniently obtained.
As an alternative embodiment, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
The present embodiment provides a technical solution for determining the output scoring weight of the estimator. The present embodiment uses the ROC (Receiver Operating Characteristic ) curve to determine the estimator output scoring weights. ROC curves were first applied in the field of radar signal detection to distinguish between signals and noise. Later on, one uses it to evaluate the predictive ability of the model, and the ROC curve is derived based on the confusion matrix. The thresholds of a classification model may be set to be high or low, each threshold setting may result in different false positive rate FPR and true positive rate TPR, and the (FPR, TPR) coordinates of each threshold of the same model are plotted in ROC space to form the ROC curve of the specific model. ROC curve is FPR on the abscissa and TPR on the ordinate. AUC (Area under the Curve) is the area under the ROC curve, and when different classification models are compared, the ROC curve of each model can be drawn, and the area under the curve is used as an index of the model quality. AUC has the following characteristics: the value range is [0,1]; assuming positive above the threshold and negative below the threshold, the AUC is equal to the probability that the classifier correctly judges that the positive sample has a higher value than the negative sample. Thus, the higher the AUC value of the classifier, the higher the accuracy. The larger the AUC value, the higher the estimator feasibility, and the larger the scoring weight. Therefore, only the corresponding ROC curve is drawn for each estimator, and the area under the curve is calculated to obtain the scoring weight.
As an alternative embodiment, the minimum value of the estimator output score is 1 and the maximum value is 5.
The present embodiment gives a range of values for the output score of the estimator, with a minimum value of 1 and a maximum value of 5. The greater the scoring score, the more severe the malignancy of the lung nodules. It should be noted that this example is given as a preferred embodiment, and is not intended to negate or exclude other embodiments that are possible, for example, whether ten or a percentage may be used.
As an alternative embodiment, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
The specific values of the labels representing the relative high and low degrees of benign and malignant of two lung nodules in the lung nodule pair are respectively 1,0 and-1. Labels 1 and-1 indicate that one lung nodule is more severe than the other, respectively, and 0 indicates that the benign and malignant extent of both lung nodules is the same or similar. Likewise, the embodiment only provides a preferred embodiment, and does not negate or exclude other possible embodiments, and any three different integers may be used as the label, so that the solution of the embodiment is more intuitive and concise.
As an alternative embodiment, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
this embodimentA technical solution for the loss function is given. As described above, this embodiment sets a loss function for each estimator for training optimization of the corresponding estimator parameters, and therefore uses the serial number r of the estimator as the index of the loss function. Since the tag has three values of 1,0 and-1, the probability equal to the tag is equal to the sum of the probabilities when the tag is 1,0 and-1 respectively, and the loss function comprises the variable of the tagAnd (5) summing operation. Further, since the average value is calculated for all lung nodules in the training data set, the loss function also includes calculation of summing up (averaging) the lung nodules with the variable i. Quantized score of relatively high or low degree of benign malignancy of two lung nodules estimated based on score actually output by estimator>Is a piecewise function, takes three different values 1,0, -1, -lambda, lambda are the demarcation points of the three intervals, also called boundary threshold, see in particular +.>Is an expression of (2). Labels are also provided aboveWhen 1,0, -1, respectively, +.>Probability equal to tag->The probability obeys a sigmoid distribution.
Fig. 3 is a schematic diagram of the composition of a device for evaluating benign and malignant lung nodules according to an embodiment of the present invention, the device comprising:
the modeling module 11 is configured to construct an evaluation model including a feature extraction module, an estimation module, and a data fusion module, where each estimator of the estimation module is configured to score according to the lung nodule feature output by the feature extraction module, and the data fusion module is configured to calculate a weighted average of the scores;
a training module 12 for training parameters of the feature extraction module and each estimator by using a training data set composed of the lung nodule pairs and labels of different doctors, which are relatively high and low with respect to benign and malignant degrees of the two lung nodules, and using a loss function obtained based on the output of each estimator and the labels thereof;
and the evaluation module 13 is used for inputting the CT image of the lung nodule to be evaluated into the trained evaluation model to obtain the benign and malignant score of the lung nodule.
The device of this embodiment may be used to implement the technical solution of the method embodiment shown in fig. 1, and its implementation principle and technical effects are similar, and are not described here again. As well as the latter embodiments, will not be explained again.
As an alternative embodiment, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
As an alternative embodiment, the minimum value of the estimator output score is 1 and the maximum value is 5.
As an alternative embodiment, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
As an alternative embodiment, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
the foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any changes or substitutions easily contemplated by those skilled in the art within the scope of the present invention should be included in the present invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.
Claims (6)
1. A method for evaluating benign and malignant lung nodules, comprising the steps of:
constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted average of the scoring;
training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator;
inputting CT images of the lung nodules to be evaluated into a trained evaluation model to obtain benign and malignant scores of the lung nodules;
the label is as follows: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0;
the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
2. the method of assessing the malignancy of pulmonary nodules of claim 1 wherein the method of determining the output scoring weight for each estimator comprises the steps of, for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
3. The method of evaluating benign and malignant lung nodule according to claim 2, wherein the minimum value of the estimator output score is 1 and the maximum value is 5.
4. A pulmonary nodule benign and malignant evaluation apparatus, comprising:
the modeling module is used for constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
the training module is used for training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels of the estimators;
the evaluation module is used for inputting CT images of the lung nodules to be evaluated into the trained evaluation model to obtain benign and malignant scoring of the lung nodules;
the label is as follows: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0;
the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
5. the pulmonary nodule benign and malignant evaluation apparatus of claim 4, wherein the method of determining the output scoring weight for each estimator comprises the steps of, for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
6. The pulmonary nodule benign and malignant evaluation apparatus of claim 5, wherein the minimum value of the estimator output score is 1 and the maximum value is 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110660081.8A CN113469954B (en) | 2021-06-15 | 2021-06-15 | Method and device for evaluating benign and malignant lung nodule |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110660081.8A CN113469954B (en) | 2021-06-15 | 2021-06-15 | Method and device for evaluating benign and malignant lung nodule |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113469954A CN113469954A (en) | 2021-10-01 |
CN113469954B true CN113469954B (en) | 2024-04-09 |
Family
ID=77869923
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110660081.8A Active CN113469954B (en) | 2021-06-15 | 2021-06-15 | Method and device for evaluating benign and malignant lung nodule |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113469954B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115578307B (en) * | 2022-05-25 | 2023-09-15 | 广州市基准医疗有限责任公司 | Lung nodule benign and malignant classification method and related products |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110223281A (en) * | 2019-06-06 | 2019-09-10 | 东北大学 | A kind of Lung neoplasm image classification method when in data set containing uncertain data |
CN111861989A (en) * | 2020-06-10 | 2020-10-30 | 杭州深睿博联科技有限公司 | Method, system, terminal and storage medium for detecting midline of brain |
CN111915596A (en) * | 2020-08-07 | 2020-11-10 | 杭州深睿博联科技有限公司 | Method and device for predicting benign and malignant pulmonary nodules |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11049223B2 (en) * | 2019-06-19 | 2021-06-29 | Siemens Healthcare Gmbh | Class-aware adversarial pulmonary nodule synthesis |
-
2021
- 2021-06-15 CN CN202110660081.8A patent/CN113469954B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110223281A (en) * | 2019-06-06 | 2019-09-10 | 东北大学 | A kind of Lung neoplasm image classification method when in data set containing uncertain data |
CN111861989A (en) * | 2020-06-10 | 2020-10-30 | 杭州深睿博联科技有限公司 | Method, system, terminal and storage medium for detecting midline of brain |
CN111915596A (en) * | 2020-08-07 | 2020-11-10 | 杭州深睿博联科技有限公司 | Method and device for predicting benign and malignant pulmonary nodules |
Non-Patent Citations (3)
Title |
---|
Botong Wu et al..joint learning for pulmonary nodule segmentation,attributes and malignancy prediction.《arXiv.org》.2018,第1-5页. * |
Long-term follow-up of persistent pulmonary pure ground-glass nodules with deep learning–assisted nodule segmentation;Lin-Lin Qi et al.;《European Radiology》;第744-755页 * |
基于深度学习的人工智能胸部CT肺结节检测效能评估;李欣菱等;《中国癌症杂志》;第第22卷卷(第第6期期);第336-340页 * |
Also Published As
Publication number | Publication date |
---|---|
CN113469954A (en) | 2021-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108447080B (en) | Target tracking method, system and storage medium based on hierarchical data association and convolutional neural network | |
CN110969626B (en) | Method for extracting hippocampus of human brain nuclear magnetic resonance image based on 3D neural network | |
CN110766051A (en) | Lung nodule morphological classification method based on neural network | |
CN111582111B (en) | Cell component segmentation method based on semantic segmentation | |
CN116933046B (en) | Deep learning-based multi-mode health management scheme generation method and system | |
CN112991295B (en) | Lymph node metastasis image analysis system, method and equipment based on deep learning | |
CN113095382B (en) | Interpretable tuberculosis classification network identification method based on CT image | |
CN111597991A (en) | Rehabilitation detection method based on channel state information and BilSTM-Attention | |
CN108960142B (en) | Pedestrian re-identification method based on global feature loss function | |
CN110781953B (en) | Lung cancer pathological section classification method based on multi-scale pyramid convolution neural network | |
CN115994907B (en) | Intelligent processing system and method for comprehensive information of food detection mechanism | |
CN114037011B (en) | Automatic identification and cleaning method for tongue color noise labeling sample of traditional Chinese medicine | |
CN113469954B (en) | Method and device for evaluating benign and malignant lung nodule | |
CN113657449A (en) | Traditional Chinese medicine tongue picture greasy classification method containing noise labeling data | |
CN111833321A (en) | Window-adjusting optimization-enhanced intracranial hemorrhage detection model and construction method thereof | |
CN111144462A (en) | Unknown individual identification method and device for radar signals | |
CN117315379A (en) | Deep learning-oriented medical image classification model fairness evaluation method and device | |
CN116934754B (en) | Liver image identification method and device based on graph neural network | |
CN113035348A (en) | Diabetes diagnosis method based on GRU feature fusion | |
CN113052227A (en) | Pulmonary tuberculosis identification method based on SE-ResNet | |
CN111325282A (en) | Mammary gland X-ray image identification method and device suitable for multiple models | |
CN115881304A (en) | Risk assessment method, device, equipment and medium based on intelligent detection | |
CN113868597B (en) | Regression fairness measurement method for age estimation | |
CN114913585A (en) | Household old man falling detection method integrating facial expressions | |
CN112597842B (en) | Motion detection facial paralysis degree evaluation system based on artificial intelligence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |