CN113469954B - Method and device for evaluating benign and malignant lung nodule - Google Patents

Method and device for evaluating benign and malignant lung nodule Download PDF

Info

Publication number
CN113469954B
CN113469954B CN202110660081.8A CN202110660081A CN113469954B CN 113469954 B CN113469954 B CN 113469954B CN 202110660081 A CN202110660081 A CN 202110660081A CN 113469954 B CN113469954 B CN 113469954B
Authority
CN
China
Prior art keywords
lung
estimator
benign
malignant
nodule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110660081.8A
Other languages
Chinese (zh)
Other versions
CN113469954A (en
Inventor
周振
李一鸣
俞益洲
乔昕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shenrui Bolian Technology Co Ltd
Shenzhen Deepwise Bolian Technology Co Ltd
Original Assignee
Beijing Shenrui Bolian Technology Co Ltd
Shenzhen Deepwise Bolian Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shenrui Bolian Technology Co Ltd, Shenzhen Deepwise Bolian Technology Co Ltd filed Critical Beijing Shenrui Bolian Technology Co Ltd
Priority to CN202110660081.8A priority Critical patent/CN113469954B/en
Publication of CN113469954A publication Critical patent/CN113469954A/en
Application granted granted Critical
Publication of CN113469954B publication Critical patent/CN113469954B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H30/00ICT specially adapted for the handling or processing of medical images
    • G16H30/20ICT specially adapted for the handling or processing of medical images for handling medical images, e.g. DICOM, HL7 or PACS
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10072Tomographic images
    • G06T2207/10081Computed x-ray tomography [CT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30061Lung
    • G06T2207/30064Lung nodule

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Radiology & Medical Imaging (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Medical Informatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Quality & Reliability (AREA)
  • Public Health (AREA)
  • Primary Health Care (AREA)
  • Epidemiology (AREA)
  • Apparatus For Radiation Diagnosis (AREA)

Abstract

The invention provides a method and a device for evaluating benign and malignant lung nodule. The method comprises the following steps: constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module; training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator; and inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain the benign and malignant scoring of the lung nodule. Because the data set for training the evaluation model adopts the labels with relatively high and low benign and malignant degrees of the two lung nodules, and the consistency of the labels marked by different doctors is good, the accuracy of evaluating the benign and malignant degrees of the lung nodules is improved; the training method of the invention also reduces the time complexity and improves the speed of model training.

Description

Method and device for evaluating benign and malignant lung nodule
Technical Field
The invention relates to the technical field of image processing by utilizing a neural network, in particular to a method and a device for evaluating benign and malignant lung nodule.
Background
In the prior art, the identification of benign and malignant lung nodules refers to the automatic identification of benign and malignant lung nodules in a given lung CT image through a deep neural network. The accuracy of neural networks in identifying benign and malignant lung nodules is very dependent on the accuracy of benign and malignant labels of lung CT images in training data. Labels for CT images of lung nodules have two sources: the first is that the imaging doctor judges the benign and malignant lung nodules; the second is to judge the benign and malignant lung nodules by microscopic pathological analysis. The cost of marking by the imaging doctor is low, and the marking accuracy is about 75%. The marking cost of pathological analysis is high, the accuracy is close to 100%, and the pathological analysis marking is generally regarded as a true benign and malignant marking of lung nodules. A specific method for imaging physician marking uses several imaging physicians to score lung CT images from 1 to 5, 1 representing the lowest malignancy and 5 representing the highest malignancy. If the average score of the lung CT image is higher than 3.5 points, the label of the image is malignant, if the average score is lower than 2.5 points, the label of the image is benign, and the image with the average score of 2.5 to 3.5 points is discarded. Most of the basic lung nodule CT image datasets currently used are marked by imaging doctors. However, the accuracy of the imaging physician marking is greatly affected by human factors, the standards of the lung nodule benign and malignant scoring by different imaging physicians are inconsistent, some diagnoses are more conservative, and some diagnoses are more aggressive. There is a great difference between the scores given by different imaging physicians to the same CT image of a lung nodule. Thereby resulting in a lower quality neural network data set that affects neural network training.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a method and a device for evaluating benign and malignant lung nodule.
In order to achieve the above object, the present invention adopts the following technical scheme.
In a first aspect, the present invention provides a method for evaluating benign and malignant lung nodule, comprising the steps of:
constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted average of the scoring;
training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator;
and inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain the benign and malignant scoring of the lung nodule.
Further, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
Further, the estimator output score has a minimum value of 1 and a maximum value of 5.
Further, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
Further, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
in a second aspect, the present invention provides a pulmonary nodule benign and malignant evaluation apparatus comprising:
the modeling module is used for constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
the training module is used for training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels of the estimators;
and the evaluation module is used for inputting the CT image of the lung nodule to be evaluated into the trained evaluation model to obtain the benign and malignant scoring of the lung nodule.
Further, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting CT images of N lung nodules as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
Further, the estimator output score has a minimum value of 1 and a maximum value of 5.
Further, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
Further, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained benign and malignant phasesA quantized value of the height; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
compared with the prior art, the invention has the following beneficial effects.
According to the invention, an evaluation model comprising a feature extraction module, an estimation module and a data fusion module is constructed, a training data set consisting of a lung nodule pair and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of two lung nodules in the lung nodule pair is utilized to train the evaluation model, parameters of the feature extraction module and each estimator are trained based on the output of each estimator and a loss function obtained by the labels of the estimator, the lung nodule to be evaluated is input into the trained evaluation model, the benign and malignant scoring of the lung nodule is obtained, and quantitative evaluation of the benign and malignant of the lung nodule is realized. Because the data set for training the evaluation model adopts the labels with relatively high and low benign and malignant degrees of the two lung nodules, and the consistency of the labels marked by different doctors is good, namely the label precision is high, the accuracy of the benign and malignant evaluation of the lung nodules is improved; meanwhile, the parameters of the feature extraction module and each estimator are trained by the loss function obtained based on the output of each estimator and the label thereof, so that the time complexity is reduced, and the model training speed is improved.
Drawings
Fig. 1 is a flowchart of a method for evaluating benign and malignant lung nodules according to an embodiment of the present invention.
Fig. 2 is a schematic structural diagram of the evaluation model.
Fig. 3 is a block diagram of a pulmonary nodule benign and malignant evaluation apparatus according to an embodiment of the present invention.
Detailed Description
The present invention will be further described with reference to the drawings and the detailed description below, in order to make the objects, technical solutions and advantages of the present invention more apparent. It will be apparent that the described embodiments are only some, but not all, embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Fig. 1 is a flowchart of a method for evaluating benign and malignant lung nodule according to an embodiment of the present invention, including the steps of:
step 101, constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
step 102, training parameters of a feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels, which are marked by different doctors, about the relative high and low degrees of benign and malignant of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels thereof;
and step 103, inputting the CT image of the lung nodule to be evaluated into a trained evaluation model to obtain a benign and malignant score of the lung nodule.
In this embodiment, an artificial neural network (such as convolutional neural network CNN) is used to construct an evaluation model of benign and malignant lung nodules. The input of the evaluation model is a lung nodule CT image, and the output is a quantitative evaluation of the benign and malignant properties of the input lung nodule, namely a comprehensive score. The benign and malignant scoring can be automatically output only by inputting CT images of lung nodules to be evaluated into a trained evaluation model.
In this embodiment, step 101 is mainly used for constructing a benign and malignant evaluation model of lung nodules. The evaluation model mainly comprises a feature extraction module, an estimation module and a data fusion module, as shown in fig. 2. The feature extraction module is used for extracting features of the input lung nodule CT image. The estimation module is composed of a plurality of estimators, all estimators share the output of the feature extraction module, and scoring is performed according to the lung nodule feature output by the feature extraction module, for example, the larger the lung nodule is, the more branches are, the higher the possibility of malignancy is, and the scoring value is higher. The data fusion module is used for fusing the scores output by all the estimators to obtain a comprehensive score. The fusion method adopted by the embodiment is to calculate the average value of the scoring weights of the estimators, and the calculation formula is as follows:
wherein S is the final integrated score, S r Scoring the r-th estimator, k r Weights scoring the R-th estimator, R being the number of estimators. The simplest weighted value strategies are to take 1, and the data fusion is equivalent to averaging. However, in order to improve the effect of data fusion, different weights are generally set according to different reliability of the estimator, and the better the reliability, the larger the estimator weight. Will be laterA specific embodiment for determining the estimator weights is presented.
In this embodiment, step 102 is mainly used for training the evaluation model. The model training of this embodiment is mainly to optimize the parameters of each estimator and the parameters of the feature extraction module. The training data set in the prior art generally consists of a single lung nodule CT image sample and benign and malignant labels marked by doctors, and the label accuracy is not high due to inconsistent scoring standards of different doctors, so that the prediction accuracy of a trained model is not high. But the consistency of the evaluation made by different doctors of the relative high and low (i.e. which is more serious) of the benign and malignant degrees of two different lung nodules is quite good, as compared with the difficulty in accurately speaking the specific height of a person, but the accurate judgment of who is higher than the two persons is easy to make. For this reason, in this embodiment, the lung nodule CT image samples are "paired two by two", that is, two are paired, and then different doctors evaluate the relative height of the same lung nodule on the benign and malignant degree, and use this as the label of the lung nodule pair, so as to obtain the training data set composed of the lung nodule pair and its label. The loss function adopted in the training in this embodiment is also different from the prior art, in which the model is generally trained by using a total loss function, and in this embodiment, a loss function determined by the output and the label difference is provided for each estimator, so that the parameters of each estimator and the feature extraction module are optimized and trained. The processing method of the embodiment not only can improve the prediction precision of the model, but also can obviously reduce the time complexity of training, reduce the training time and improve the training speed.
In this embodiment, step 103 is mainly used to obtain a score of the benign or malignant lung nodule to be evaluated. And inputting the CT image of the lung nodule to be evaluated into a trained evaluation model, so that the benign and malignant scoring of the lung nodule can be conveniently obtained.
As an alternative embodiment, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
The present embodiment provides a technical solution for determining the output scoring weight of the estimator. The present embodiment uses the ROC (Receiver Operating Characteristic ) curve to determine the estimator output scoring weights. ROC curves were first applied in the field of radar signal detection to distinguish between signals and noise. Later on, one uses it to evaluate the predictive ability of the model, and the ROC curve is derived based on the confusion matrix. The thresholds of a classification model may be set to be high or low, each threshold setting may result in different false positive rate FPR and true positive rate TPR, and the (FPR, TPR) coordinates of each threshold of the same model are plotted in ROC space to form the ROC curve of the specific model. ROC curve is FPR on the abscissa and TPR on the ordinate. AUC (Area under the Curve) is the area under the ROC curve, and when different classification models are compared, the ROC curve of each model can be drawn, and the area under the curve is used as an index of the model quality. AUC has the following characteristics: the value range is [0,1]; assuming positive above the threshold and negative below the threshold, the AUC is equal to the probability that the classifier correctly judges that the positive sample has a higher value than the negative sample. Thus, the higher the AUC value of the classifier, the higher the accuracy. The larger the AUC value, the higher the estimator feasibility, and the larger the scoring weight. Therefore, only the corresponding ROC curve is drawn for each estimator, and the area under the curve is calculated to obtain the scoring weight.
As an alternative embodiment, the minimum value of the estimator output score is 1 and the maximum value is 5.
The present embodiment gives a range of values for the output score of the estimator, with a minimum value of 1 and a maximum value of 5. The greater the scoring score, the more severe the malignancy of the lung nodules. It should be noted that this example is given as a preferred embodiment, and is not intended to negate or exclude other embodiments that are possible, for example, whether ten or a percentage may be used.
As an alternative embodiment, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
The specific values of the labels representing the relative high and low degrees of benign and malignant of two lung nodules in the lung nodule pair are respectively 1,0 and-1. Labels 1 and-1 indicate that one lung nodule is more severe than the other, respectively, and 0 indicates that the benign and malignant extent of both lung nodules is the same or similar. Likewise, the embodiment only provides a preferred embodiment, and does not negate or exclude other possible embodiments, and any three different integers may be used as the label, so that the solution of the embodiment is more intuitive and concise.
As an alternative embodiment, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
this embodimentA technical solution for the loss function is given. As described above, this embodiment sets a loss function for each estimator for training optimization of the corresponding estimator parameters, and therefore uses the serial number r of the estimator as the index of the loss function. Since the tag has three values of 1,0 and-1, the probability equal to the tag is equal to the sum of the probabilities when the tag is 1,0 and-1 respectively, and the loss function comprises the variable of the tagAnd (5) summing operation. Further, since the average value is calculated for all lung nodules in the training data set, the loss function also includes calculation of summing up (averaging) the lung nodules with the variable i. Quantized score of relatively high or low degree of benign malignancy of two lung nodules estimated based on score actually output by estimator>Is a piecewise function, takes three different values 1,0, -1, -lambda, lambda are the demarcation points of the three intervals, also called boundary threshold, see in particular +.>Is an expression of (2). Labels are also provided aboveWhen 1,0, -1, respectively, +.>Probability equal to tag->The probability obeys a sigmoid distribution.
Fig. 3 is a schematic diagram of the composition of a device for evaluating benign and malignant lung nodules according to an embodiment of the present invention, the device comprising:
the modeling module 11 is configured to construct an evaluation model including a feature extraction module, an estimation module, and a data fusion module, where each estimator of the estimation module is configured to score according to the lung nodule feature output by the feature extraction module, and the data fusion module is configured to calculate a weighted average of the scores;
a training module 12 for training parameters of the feature extraction module and each estimator by using a training data set composed of the lung nodule pairs and labels of different doctors, which are relatively high and low with respect to benign and malignant degrees of the two lung nodules, and using a loss function obtained based on the output of each estimator and the labels thereof;
and the evaluation module 13 is used for inputting the CT image of the lung nodule to be evaluated into the trained evaluation model to obtain the benign and malignant score of the lung nodule.
The device of this embodiment may be used to implement the technical solution of the method embodiment shown in fig. 1, and its implementation principle and technical effects are similar, and are not described here again. As well as the latter embodiments, will not be explained again.
As an alternative embodiment, the method of determining the output scoring weight of each estimator comprises the following steps for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
As an alternative embodiment, the minimum value of the estimator output score is 1 and the maximum value is 5.
As an alternative embodiment, the tag is: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0.
As an alternative embodiment, the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
the foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any changes or substitutions easily contemplated by those skilled in the art within the scope of the present invention should be included in the present invention. Therefore, the protection scope of the invention is subject to the protection scope of the claims.

Claims (6)

1. A method for evaluating benign and malignant lung nodules, comprising the steps of:
constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted average of the scoring;
training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels of different doctors, wherein the labels are marked by the lung nodule pairs and relate to the benign and malignant degrees of the two lung nodules, and the parameters of the feature extraction module and each estimator are trained by using a loss function obtained based on the output of each estimator and the labels of each estimator;
inputting CT images of the lung nodules to be evaluated into a trained evaluation model to obtain benign and malignant scores of the lung nodules;
the label is as follows: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0;
the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
2. the method of assessing the malignancy of pulmonary nodules of claim 1 wherein the method of determining the output scoring weight for each estimator comprises the steps of, for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
3. The method of evaluating benign and malignant lung nodule according to claim 2, wherein the minimum value of the estimator output score is 1 and the maximum value is 5.
4. A pulmonary nodule benign and malignant evaluation apparatus, comprising:
the modeling module is used for constructing an evaluation model comprising a feature extraction module, an estimation module and a data fusion module, wherein each estimator of the estimation module is used for scoring according to the lung nodule features output by the feature extraction module, and the data fusion module is used for calculating the weighted mean value of the scoring;
the training module is used for training parameters of the feature extraction module and each estimator by using a training data set consisting of lung nodule pairs and labels which are marked by different doctors and are relatively high and low in benign and malignant degrees of the two lung nodules and adopting a loss function obtained based on the output of each estimator and the labels of the estimators;
the evaluation module is used for inputting CT images of the lung nodules to be evaluated into the trained evaluation model to obtain benign and malignant scoring of the lung nodules;
the label is as follows: if the malignancy of the first lung nodule is more severe, the label is 1; if the malignancy of the second lung nodule is more severe, the label is-1; if the malignancy of the two lung nodules is comparable, the label is 0;
the loss function is:
wherein is L r A loss function of the r-th estimator;labeling the ith lung nodule pair for the nth doctor with a relatively high or low label on the benign or malignant extent of both lung nodules, c= { -1,0,1}, i=1, 2, …, n, n being the number of lung nodule pairs, r=1, 2, …, R being the number of doctors or estimators; epsilon is a random error subject to sigmoid distribution; />Scoring the first lung nodule in the pair according to lung nodule +.>And a second lung nodule score +.>The obtained quantized value of the benign and malignant degree is relatively high and low; lambda is a set boundary threshold; />To indicate the function, when->The value is 1 when +.>When the value is 0;is->The probability values at the time are respectively:
wherein,
5. the pulmonary nodule benign and malignant evaluation apparatus of claim 4, wherein the method of determining the output scoring weight for each estimator comprises the steps of, for each estimator:
selecting N lung nodule CT images as input of an evaluation model, wherein the number of malignant lung nodules is A, the number of benign lung nodules is B, and N=A+B;
when the statistical threshold T is changed from the minimum value of scoring to the maximum value according to the step length, calculating Y=TP/A and X=FP/B by calculating the number TP of malignant lung nodules with the scoring value exceeding T and the number FP of benign lung nodules without exceeding T output by an estimator;
and drawing a curve by taking X as an abscissa and Y as an ordinate, and calculating the area under the curve to obtain the weight of the output scoring of the estimator.
6. The pulmonary nodule benign and malignant evaluation apparatus of claim 5, wherein the minimum value of the estimator output score is 1 and the maximum value is 5.
CN202110660081.8A 2021-06-15 2021-06-15 Method and device for evaluating benign and malignant lung nodule Active CN113469954B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110660081.8A CN113469954B (en) 2021-06-15 2021-06-15 Method and device for evaluating benign and malignant lung nodule

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110660081.8A CN113469954B (en) 2021-06-15 2021-06-15 Method and device for evaluating benign and malignant lung nodule

Publications (2)

Publication Number Publication Date
CN113469954A CN113469954A (en) 2021-10-01
CN113469954B true CN113469954B (en) 2024-04-09

Family

ID=77869923

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110660081.8A Active CN113469954B (en) 2021-06-15 2021-06-15 Method and device for evaluating benign and malignant lung nodule

Country Status (1)

Country Link
CN (1) CN113469954B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115578307B (en) * 2022-05-25 2023-09-15 广州市基准医疗有限责任公司 Lung nodule benign and malignant classification method and related products

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110223281A (en) * 2019-06-06 2019-09-10 东北大学 A kind of Lung neoplasm image classification method when in data set containing uncertain data
CN111861989A (en) * 2020-06-10 2020-10-30 杭州深睿博联科技有限公司 Method, system, terminal and storage medium for detecting midline of brain
CN111915596A (en) * 2020-08-07 2020-11-10 杭州深睿博联科技有限公司 Method and device for predicting benign and malignant pulmonary nodules

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11049223B2 (en) * 2019-06-19 2021-06-29 Siemens Healthcare Gmbh Class-aware adversarial pulmonary nodule synthesis

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110223281A (en) * 2019-06-06 2019-09-10 东北大学 A kind of Lung neoplasm image classification method when in data set containing uncertain data
CN111861989A (en) * 2020-06-10 2020-10-30 杭州深睿博联科技有限公司 Method, system, terminal and storage medium for detecting midline of brain
CN111915596A (en) * 2020-08-07 2020-11-10 杭州深睿博联科技有限公司 Method and device for predicting benign and malignant pulmonary nodules

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Botong Wu et al..joint learning for pulmonary nodule segmentation,attributes and malignancy prediction.《arXiv.org》.2018,第1-5页. *
Long-term follow-up of persistent pulmonary pure ground-glass nodules with deep learning–assisted nodule segmentation;Lin-Lin Qi et al.;《European Radiology》;第744-755页 *
基于深度学习的人工智能胸部CT肺结节检测效能评估;李欣菱等;《中国癌症杂志》;第第22卷卷(第第6期期);第336-340页 *

Also Published As

Publication number Publication date
CN113469954A (en) 2021-10-01

Similar Documents

Publication Publication Date Title
CN108447080B (en) Target tracking method, system and storage medium based on hierarchical data association and convolutional neural network
CN110969626B (en) Method for extracting hippocampus of human brain nuclear magnetic resonance image based on 3D neural network
CN110766051A (en) Lung nodule morphological classification method based on neural network
CN111582111B (en) Cell component segmentation method based on semantic segmentation
CN116933046B (en) Deep learning-based multi-mode health management scheme generation method and system
CN112991295B (en) Lymph node metastasis image analysis system, method and equipment based on deep learning
CN111597991A (en) Rehabilitation detection method based on channel state information and BilSTM-Attention
CN108960142B (en) Pedestrian re-identification method based on global feature loss function
CN110781953B (en) Lung cancer pathological section classification method based on multi-scale pyramid convolution neural network
CN114037011B (en) Automatic identification and cleaning method for tongue color noise labeling sample of traditional Chinese medicine
CN113095382B (en) Interpretable tuberculosis classification network identification method based on CT image
CN115994907B (en) Intelligent processing system and method for comprehensive information of food detection mechanism
CN111738997A (en) Method for calculating new coronary pneumonia lesion area ratio based on deep learning
CN113469954B (en) Method and device for evaluating benign and malignant lung nodule
CN111524093A (en) Intelligent screening method and system for abnormal tongue picture
CN113657449A (en) Traditional Chinese medicine tongue picture greasy classification method containing noise labeling data
CN111833321A (en) Window-adjusting optimization-enhanced intracranial hemorrhage detection model and construction method thereof
CN111144462A (en) Unknown individual identification method and device for radar signals
CN117315379A (en) Deep learning-oriented medical image classification model fairness evaluation method and device
CN117392484A (en) Model training method, device, equipment and storage medium
CN110992309B (en) Fundus image segmentation method based on deep information transfer network
CN111325282A (en) Mammary gland X-ray image identification method and device suitable for multiple models
CN113868597B (en) Regression fairness measurement method for age estimation
CN112597842B (en) Motion detection facial paralysis degree evaluation system based on artificial intelligence
CN111144488A (en) Pathological section visual field classification improving method based on adjacent joint prediction

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant