CN116092680A - Abdominal aortic aneurysm early prediction method and system based on random forest algorithm - Google Patents

Abdominal aortic aneurysm early prediction method and system based on random forest algorithm Download PDF

Info

Publication number
CN116092680A
CN116092680A CN202310212701.0A CN202310212701A CN116092680A CN 116092680 A CN116092680 A CN 116092680A CN 202310212701 A CN202310212701 A CN 202310212701A CN 116092680 A CN116092680 A CN 116092680A
Authority
CN
China
Prior art keywords
abdominal aortic
aortic aneurysm
prediction
coefficient
random
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310212701.0A
Other languages
Chinese (zh)
Other versions
CN116092680B (en
Inventor
陈松
廖海
梁艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Technological University CDTU
Original Assignee
Chengdu Technological University CDTU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Technological University CDTU filed Critical Chengdu Technological University CDTU
Priority to CN202310212701.0A priority Critical patent/CN116092680B/en
Publication of CN116092680A publication Critical patent/CN116092680A/en
Application granted granted Critical
Publication of CN116092680B publication Critical patent/CN116092680B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Abstract

The invention discloses an early abdominal aortic aneurysm prediction method and system based on a random forest algorithm, which relate to the field of data processing and have the technical scheme that: obtaining a first dataset of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoke is drawn, whether obesity, whether hypertension is present, and whether there is a family history of abdominal aortic aneurysm; training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model; and acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted. The abdominal aortic aneurysm prediction model based on random forest algorithm training has good generalization performance, and a doctor can know the whole process of abdominal aortic aneurysm prediction, so that a trusted relationship is established between the doctor and an artificial intelligence system.

Description

Abdominal aortic aneurysm early prediction method and system based on random forest algorithm
Technical Field
The invention relates to the field of data processing, in particular to an early abdominal aortic aneurysm prediction method and system based on a random forest algorithm.
Background
The rupture of an Abdominal Aortic Aneurysm (AAA) can lead to life threatening events, and it is important to predict the likelihood that an abdominal aortic aneurysm will take early intervention and control. Ultrasound screening and Computed Tomography (CT) images have been used for analysis of abdominal aortic aneurysms. Currently, many studies have applied artificial intelligence techniques to aid diagnosis and treatment of vascular surgery, such as prediction of risk of rupture of abdominal aortic aneurysms. These studies have focused on analysis after abdominal aortic aneurysm formation only. Prior art solutions can generally be divided into three categories: predicting risk and outcome of rupture after intra-luminal repair of an abdominal aortic aneurysm. The evolution of the abdominal aortic aneurysm is predicted based on a deep learning algorithm to obtain a risk of rupture. The causative factors of the abdominal aortic aneurysm are analyzed for intervention and control.
In the prior art, the prediction method mainly has the following defects: 1. lacking in interpretability, current research is focused mainly on post-aneurysmal analysis, but it is clear that it has caused tremendous harm to the patient's body. Secondly, the "black box" nature of artificial intelligence prevents its further application in the medical field, and it is difficult for a doctor to understand the whole process of predicting an abdominal aortic aneurysm by a prediction algorithm, and a mutual trust relationship cannot be established between the doctor and the artificial intelligence system. 2. Lacking effective prediction before an aneurysm formation, many studies have now applied artificial intelligence techniques to analysis and prediction of vascular surgery, such as ultrasound screening and Computed Tomography (CT) images, to analysis of abdominal aortic aneurysms, but these methods focus only on analysis after an abdominal aortic aneurysm formation, ignoring how to effectively predict an abdominal aortic aneurysm before an abdominal aortic aneurysm formation.
Disclosure of Invention
The invention aims at solving the defect of early prediction of abdominal aortic aneurysm in the prior art and provides an early prediction method and system of abdominal aortic aneurysm based on a random forest algorithm. When a new sample set is predicted, which classification the sample set belongs to depends on the judgment of each decision tree, the prediction model itself consists of a plurality of classifiers, and then a final prediction result is obtained through voting and averaging.
The technical aim of the invention is realized by the following technical scheme:
in a first aspect of the present application, there is provided an early prediction method for abdominal aortic aneurysm based on random forest algorithm, the method comprising:
obtaining a first dataset of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoke is drawn, whether obesity, whether hypertension is present, and whether there is a family history of abdominal aortic aneurysm;
training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model;
and acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted.
In one embodiment, the first dataset is trained based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model, in particular:
randomly extracting characteristic information of the first data set to generate a plurality of random sample sets;
generating a classification regression tree of a plurality of random sample sets based on a classification regression algorithm, and calculating the coefficient of the basis of the random sample sets;
for a plurality of random sample sets where the current node of the classification regression tree is located, returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the threshold value of the coefficient of the foundation; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
taking the feature information corresponding to the minimum coefficient of the coefficient of each feature information as optimal feature information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal feature information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
calculating the coefficient of the first information set and the second information set, returning the sub decision trees by the classification regression algorithm and stopping recursion when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, and generating a plurality of decision trees;
and learning the decision trees by a plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
In one embodiment, the second data set is input into the abdominal aortic aneurysm prediction model for prediction, and a prediction result of a patient to be predicted is output, specifically:
dividing the second data set into a training set and a testing set according to a preset proportion;
randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling;
generating a plurality of weight values for the sample set based on the plurality of decision trees;
and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted.
In one embodiment, the method further comprises:
presetting a matching rule of characteristic information of a first data set, correcting a weight value output by a decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value;
predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set;
and when the weight correction value is equal to the weight average value, outputting a prediction result of the patient to be predicted.
In one embodiment, a plurality of characteristic information is randomly extracted from the first data set to obtain the random sample set, wherein the data format of the first data set is a multi-row multi-column data matrix.
In one embodiment, the plurality of output results of the abdominal aortic aneurysm prediction model are selected using a receiving operator characteristic curve, such that the abdominal aortic aneurysm outputs a predicted result of a patient to be predicted.
In a second aspect of the present application, there is provided an early prediction system for abdominal aortic aneurysm based on random forest algorithm, comprising:
an acquisition module for acquiring a first data set of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoking, whether obesity, whether hypertension and whether there is a family medical history of abdominal aortic aneurysm;
the training module is used for training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model;
the prediction module is used for acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted.
In one embodiment, the training module comprises:
the random extraction module is used for randomly extracting the characteristic information of the first data set to generate a plurality of random sample sets;
the classification tree generation module is used for generating classification regression trees of a plurality of random sample sets based on a classification regression algorithm and calculating the coefficient of the basis of the random sample sets;
the judging module is used for returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the threshold value of the coefficient of the foundation for the plurality of random sample sets where the current node of the classification regression tree is located; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
the sample dividing module is used for taking the characteristic information corresponding to the minimum coefficient of the coefficient of each characteristic information as optimal characteristic information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal characteristic information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
the decision tree generation module is used for calculating the coefficient of the first information set and the second information set, and when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, the classification regression algorithm returns to the sub-decision tree and stops recursion to generate a plurality of decision trees;
and the learning module is used for learning the plurality of decision trees by the plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
In one embodiment, the prediction module is further configured to: dividing the second data set into a training set and a testing set according to a preset proportion; randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling; generating a plurality of weight values for the sample set based on the plurality of decision trees; and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted.
In one embodiment, the system further comprises:
the correction module is used for presetting a matching rule of the characteristic information of the first data set, correcting the weight value output by the decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value;
the weight calculation module is used for predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set;
and the interpretation module is used for outputting a prediction result of the patient to be predicted when the weight correction value is equal to the weight average value.
Compared with the prior art, the invention has the following beneficial effects:
1. the invention provides an early prediction method of abdominal aortic aneurysm based on a random forest algorithm, which takes characteristic information such as age, sex, smoking, obesity, hypertension, family history of abdominal aortic aneurysm and the like of a patient as a first data set, trains the first data set based on the random forest algorithm, so as to obtain a prediction model based on the random forest algorithm, wherein the random forest consists of a plurality of decision trees, and all the trees are independent of each other. When a sample set is predicted, which classification the sample set belongs to depends on the judgment of each decision tree, the prediction model itself consists of a plurality of classifiers, and then a final prediction result is obtained through voting and averaging.
2. The invention further considers how to verify the reliability degree of the prediction result of the abdominal aortic aneurysm prediction model, and explains the decision process of the algorithm based on the voting tree of the random forest algorithm, wherein the random forest mainly comprises a plurality of CART trees. In order to clearly explain the decision process of the random forest, each characteristic information is taken as a decision tree, and the voting process of the random forest algorithm is deduced. The prediction result of the traditional random forest algorithm is unstable, an interpretable rule is used for assisting in the derivation process of the random forest, firstly, the output result of a decision tree is corrected by using a matching rule, then the correction result is further verified according to the label in the test set, and under the condition that the data set is unchanged, if the correction result is equal to the weight average value obtained by the abdominal aortic aneurysm prediction model on the test set, the prediction result is indicated to have higher prediction precision.
In addition, the application also provides an early abdominal aortic aneurysm prediction system based on a random forest algorithm, which has the same technical effects as the prediction method, and is not repeated here.
Drawings
The accompanying drawings, which are included to provide a further understanding of embodiments of the invention and are incorporated in and constitute a part of this application, illustrate embodiments of the invention. In the drawings:
fig. 1 is a flow chart of an early abdominal aortic aneurysm prediction method based on a random forest algorithm according to an embodiment of the present application;
FIG. 2 is a schematic diagram of a training process of a prediction model according to an embodiment of the present application;
fig. 3 is a schematic block diagram of an early abdominal aortic aneurysm prediction system based on a random forest algorithm according to an embodiment of the present application.
Description of the embodiments
For the purpose of making apparent the objects, technical solutions and advantages of the present invention, the present invention will be further described in detail with reference to the following examples and the accompanying drawings, wherein the exemplary embodiments of the present invention and the descriptions thereof are for illustrating the present invention only and are not to be construed as limiting the present invention.
It should be appreciated that the terms "first," "second," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defining "a first" or "a second" may explicitly or implicitly include one or more such feature. In the description of the present invention, the meaning of "a plurality" is two or more, unless explicitly defined otherwise.
As described in the background art, the method for predicting abdominal aortic aneurysm in the prior art mainly has the following drawbacks: 1. lacking in interpretability, current research is focused mainly on post-aneurysmal analysis, but it is clear that it has caused tremendous harm to the patient's body. Secondly, the "black box" nature of artificial intelligence prevents its further application in the medical field, and it is difficult for a doctor to understand the whole process of predicting an abdominal aortic aneurysm by a prediction algorithm, and a mutual trust relationship cannot be established between the doctor and the artificial intelligence system. 2. Lacking effective prediction before abdominal aortic aneurysm formation, many studies have now applied artificial intelligence techniques to vascular surgical analysis and prediction, such as ultrasound screening and Computed Tomography (CT) images, to abdominal aortic aneurysm analysis, but these methods focus only on post-abdominal aortic aneurysm analysis, neglecting how to effectively predict abdominal aortic aneurysm before it forms.
Referring to fig. 1, fig. 1 is a flow chart of an early abdominal aortic aneurysm prediction method based on a random forest algorithm according to an embodiment of the present application, as shown in fig. 1, the method includes the following steps:
s110, acquiring a first data set of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoking, whether obesity, whether hypertension and whether there is a family medical history of the abdominal aortic aneurysm.
Specifically, datasets were created for training of artificial intelligence models based on several characteristics of patients with abdominal aortic aneurysms, including age, smoking, men, obesity, hypertension, family history of aneurysms, and the like. To facilitate subsequent random sampling or extraction of the characteristic information in the dataset, the definition will therefore generally express the dataset including the characteristic information of the patient with an abdominal aortic aneurysm in the form of a data matrix, e.g
Figure SMS_1
In the formula, m represents the column number of the matrix, n represents the row number of the matrix, x represents the characteristic information of the first data set, y represents the tag information of the first data set, and S represents the first data set, specifically, because in the actual calculation process, in order to obtain the final prediction probability, the characteristic information is represented in binary, for example, the preset age threshold is 60 years for the age characteristics of one patient, and the preset age threshold may be 55 years for the patients with the age of 60 years or moreAge 50, etc., the characteristic information of the age is represented by 1, for example, the smoking characteristic of the patient is represented by 0, 1, or 1, 0, 1, and average daily smoking amount is represented by age>400,0 represents no smoking or average daily smoking amount × age<400;1 represents drinking, and the average drinking amount per week>7 standard drinking units, 0 means no drinking or average weekly drinking volume<7 standard drinking units; 1 represents a history of abdominal aortic aneurysm in three generations, 0 represents a history of no abdominal aortic aneurysm in three generations; the remaining features are similar and will not be described in further detail here.
And S120, training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model.
In this embodiment, the random forest is an algorithm integrating multiple trees through the idea of ensemble learning, its basic unit is a decision tree, and its essence belongs to a big branch of machine learning, and the ensemble learning (Ensemble Learning) method, please refer to fig. 2, fig. 2 is a schematic training flow diagram of a prediction model provided in this embodiment, and as shown in fig. 2, the training of the prediction model includes the following steps:
s121, randomly extracting characteristic information of the first data set to generate a plurality of random sample sets;
s122, generating classification regression trees of a plurality of random sample sets based on a classification regression algorithm, and calculating the radix coefficients of the plurality of random sample sets;
s123, for a plurality of random sample sets where the current node of the classification regression tree is located, returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the coefficient threshold; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
s124, taking the characteristic information corresponding to the minimum coefficient of the coefficient of each characteristic information as optimal characteristic information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal characteristic information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
s125, calculating the coefficient of the first information set and the second information set, and returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, so as to generate a plurality of decision trees;
and S126, learning a plurality of decision trees by a plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
Specifically, in step S121, a plurality of feature information is randomly extracted from the first data set to form a new random sample set, and a plurality of classification regression trees are generated to facilitate the subsequent formation of random forests. And obtaining a prediction result by averaging values obtained by voting the classification regression tree. In this embodiment, each node is segmented by a random extraction method, and errors generated under different conditions, such as whether obesity affects abdominal aortic carcinoma, and whether smoking is performed, are compared to determine the influence of different feature information on the probability of illness. In a further embodiment, a plurality of characteristic information is randomly extracted from the first data set to obtain the random sample set, wherein the data format of the first data set is a data matrix of a plurality of rows and a plurality of columns.
In steps S122 and S123, the coefficient of kunning is an important criterion for measuring uncertainty of the classification regression algorithm. The smaller the coefficient of kunity, the lower the uncertainty and the better the performance. Since the random sample set is randomly sampled from the first data set, assuming that the probability of the abdominal aortic carcinoma patient in the random sample set is p, the coefficient of the kunity of the probability distribution is as follows
Figure SMS_2
And (5) calculating to obtain the product.
Further, in the random sample set, the number of samples is defined as |d|, the number of patients with abdominal aortic carcinoma is defined as |c|, and the coefficient of the foundation expression is:
Figure SMS_3
Gini(D) Representing the coefficient of kunity.
Further, if the random sample set is divided into D1 and D2 by the optimal feature information a, the expression of the coefficient of the random sample set is defined as:
Figure SMS_4
wherein D is 1 Represents a first information set, D 2 Representing a second set of information. Where Gini (D, a) represents the uncertainty of the random sample set and a represents the optimal feature information. D is derived from the optimal feature information packet, the smaller the Gini coefficient, the lower the uncertainty of the random sample set, so that the higher the accuracy of the final prediction result.
In step S124, for each random sample set, a classification regression tree is generated according to a classification regression algorithm starting from the root node. For the random sample set where the current node is located, if the coefficient of the random sample set is smaller than the coefficient threshold or no characteristic information to be selected is available, the classification regression algorithm returns to the sub-decision tree and stops recursion.
If the coefficient of the random sample sets is not smaller than the threshold value of the coefficient, the characteristic information corresponding to the minimum coefficient of the characteristic information is used as the optimal characteristic information, the corresponding random sample set is divided into a first information set and a second information set based on the optimal characteristic information, and the first information set and the second information set are respectively used as a left child node and a right child node of the current node.
Further, in step S125, the first information set and the second information set may be calculated by using the mathematical expression that the first information set and the second information set have the same coefficient, which is not described herein. On the basis, if the coefficient of the first information set and the second information set is smaller than the coefficient threshold, the classification regression algorithm returns to the sub-decision tree and stops recursion, so that a plurality of decision trees are generated.
In step S126, since the random forest algorithm is an integrated learning artificial intelligence algorithm, the algorithm is composed of a plurality of classifiers, and the classifiers vote or average the final output, so that the result of the random forest algorithm has higher accuracy and generalization performance, and thus learning and classifying a plurality of decision trees are completed through the plurality of classifiers, so as to obtain the abdominal aortic aneurysm prediction model.
S130, acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted.
Specifically, a second data set corresponding to characteristic information of a patient to be predicted and an abdominal aortic aneurysm patient is input to a trained abdominal aortic aneurysm prediction model for prediction, and a final prediction result is output, specifically as follows: dividing the second data set into a training set and a testing set according to a preset proportion; randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling; generating a plurality of weight values for the sample set based on the plurality of decision trees; and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted. The above process is a specific process of random forest algorithm calculation, and is common knowledge of a person skilled in the art, so this is not explained in detail, and it is to be understood that the prediction result is presented in a probability value, for example, a percentage. Further, the second data set is divided into a training set and a test set according to a preset ratio, which is common knowledge of a person skilled in the art, wherein the preset ratio is 8 to 2, and may be 7 to 3, which is not particularly limited in this embodiment.
In summary, as shown in the foregoing embodiments, the prediction method provided in the embodiment of the present application uses, as a first data set, characteristic information such as age, gender, smoking, obesity, hypertension, and family history of abdominal aortic aneurysm of a patient, and trains the first data set based on a random forest algorithm, thereby obtaining a prediction model based on the random forest algorithm, where the random forest is composed of a plurality of decision trees, and all the trees are independent of each other. When a sample set is predicted, which classification the sample set belongs to depends on the judgment of each decision tree, the prediction model itself consists of a plurality of classifiers, and then a final prediction result is obtained through voting and averaging.
In one embodiment, the method further comprises: presetting a matching rule of characteristic information of a first data set, correcting a weight value output by a decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value; predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set; and when the weight correction value is equal to the weight average value, outputting a prediction result of the patient to be predicted.
Specifically, the matching rule of the first data set of the characteristic information of the abdominal aortic aneurysm patient is specifically: using a set of
Figure SMS_5
The rules classify the characteristic information, such as whether smoking, drinking, etc. For example, the matching rule for the feature information of the first data set may be expressed as follows: />
Figure SMS_6
,/>
Figure SMS_7
Wherein, wherein->
Figure SMS_8
Regular expression representing the i-th characteristic information of the first dataset,/or->
Figure SMS_9
Rule calculation result representing the ith characteristic information, < +.>
Figure SMS_10
Indicating whether the ith characteristic information is contained, e.g. whether the patient smoked or not。
Performing a connection operation on the matching rules of the plurality of feature information to obtain an average value of the matching rules, for example, if
Figure SMS_11
Wherein i represents the i-th characteristic information, < +.>
Figure SMS_12
Representing a join operation. Further, the weight value output by the decision tree is corrected based on the average value of the obtained matching rules.
In this embodiment, considering the problem of reliability of the prediction result of the patient to be predicted for the abdominal aortic aneurysm, the decision process of the algorithm is explained using a voting tree based on a random forest method. Random forests are mainly composed of a number of decision trees. In order to clearly explain the decision process of the random forest, the embodiment takes each feature information as a decision tree, and deduces the voting process of the random forest algorithm. In the embodiment, how to verify the reliability degree of the prediction result of the abdominal aortic aneurysm prediction model is further considered, the decision process of the random forest algorithm is explained based on the voting tree of the random forest algorithm, and the random forest mainly comprises a plurality of decision trees. In order to clearly explain the decision process of the random forest, each characteristic information is taken as a decision tree, and the voting process of the random forest algorithm is deduced. In the embodiment, an interpretable rule is used to assist the derivation process of the random forest, firstly, the output result of a decision tree is corrected by using a matching rule, then the corrected result is further verified according to a label in a test set, under the condition that a data set is unchanged, if the corrected result is equal to a weight average value obtained by the test set through an abdominal aortic aneurysm prediction model, the prediction result has higher prediction precision, and the prediction process is transparent to a doctor, so that a trusted prediction model is provided for early prediction of the abdominal aortic aneurysm, a trusted relation is established between the doctor and an artificial intelligent system, and the reliability of the final prediction result is ensured.
In one embodiment, a plurality of output results of the abdominal aortic aneurysm prediction model are selected by adopting a receiving operator characteristic curve, so that the abdominal aortic aneurysm outputs a prediction result of a patient to be predicted.
Specifically, in the random sample set, the number of positive samples is the same as the number of negative samples, and the output of the samples is positive or negative, and the reliability of the plurality of classifiers is evaluated by using a receiver operator characteristic curve (ROC) in the embodiment. The multiple classifiers classify the samples as positive or negative by two classifier labels, which can be represented by a confusion matrix, of four types, which is prior art, and is not specifically explained in excess. True Positive Rate (TPR) refers to the percentage of correctly predicted positive samples among all samples for which the predicted result is positive; false Positive Rate (FPR) refers to the percentage of samples that are mispredicted as positive among all samples with negative truth;
the receive operator profile defines a False Positive Rate (FPR) as the X-axis and a True Positive Rate (TPR) as the Y-axis. The lower Area (AUC) of the receive operator characteristic curve refers to the area between the receive operator characteristic curve and the X-axis, the larger the AUC value, the closer the curve is to the upper left, which indicates that the higher the proportion of correct samples, the lower the proportion of erroneous samples. If the AUC has a value of 0.9-1: perfect precision; 0.7-0.9: high precision; less than 0.7 is the same as the random effect.
Based on the working principle of the receiving operator characteristic curve, since the abdominal aortic aneurysm prediction model can obtain a plurality of prediction results, the method realizes the screening of the accuracy of the plurality of prediction results based on the receiving operator characteristic curve in the machine learning algorithm, so that the accuracy of the finally output prediction results is ensured.
Based on the same inventive concept, the present embodiment also provides an early abdominal aortic aneurysm predicting system based on a random forest algorithm, which corresponds to the early abdominal aortic aneurysm predicting method based on a random forest algorithm, so that the early abdominal aortic aneurysm predicting system based on a random forest algorithm of the present embodiment can implement the early abdominal aortic aneurysm predicting method based on a random forest algorithm shown in the foregoing embodiment, and in order to avoid repetition, the system is not described here, as shown in fig. 3, and includes:
an acquisition module 310 for acquiring a first data set of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoking, obesity, hypertension and whether there is a family history of abdominal aortic aneurysm;
a training module 320, configured to train the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model;
the prediction module 330 is configured to obtain a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with abdominal aortic aneurysm, input the second data set into the abdominal aortic aneurysm prediction model for prediction, and output a prediction result of the patient to be predicted.
In one embodiment, the training module includes:
the random extraction module is used for randomly extracting the characteristic information of the first data set to generate a plurality of random sample sets;
the classification tree generation module is used for generating classification regression trees of a plurality of random sample sets based on a classification regression algorithm and calculating the coefficient of the basis of the random sample sets;
the judging module is used for returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the threshold value of the coefficient of the foundation for the plurality of random sample sets where the current node of the classification regression tree is located; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
the sample dividing module is used for taking the characteristic information corresponding to the minimum coefficient of the coefficient of each characteristic information as optimal characteristic information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal characteristic information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
the decision tree generation module is used for calculating the coefficient of the first information set and the second information set, and when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, the classification regression algorithm returns to the sub-decision tree and stops recursion to generate a plurality of decision trees;
and the learning module is used for learning the plurality of decision trees by the plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
In one embodiment, the prediction module is further configured to: dividing the second data set into a training set and a testing set according to a preset proportion; randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling; generating a plurality of weight values for the sample set based on the plurality of decision trees; and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted.
In one embodiment, the system further comprises:
the correction module is used for presetting a matching rule of the characteristic information of the first data set, correcting the weight value output by the decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value;
the weight calculation module is used for predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set;
and the interpretation module is used for outputting a prediction result of the patient to be predicted when the weight correction value is equal to the weight average value.
It can be seen that the early abdominal aortic aneurysm prediction system based on the random forest algorithm in the above embodiment has the following beneficial effects: 1. the method comprises the steps of taking characteristic information such as age, sex, smoking, obesity, hypertension and family history of abdominal aortic aneurysm of a patient as a first data set, training the first data set based on a random forest algorithm, so that a prediction model based on the random forest algorithm is obtained, wherein a random forest consists of a plurality of decision trees, and all trees are independent of each other. When a sample set is predicted, which classification the sample set belongs to depends on the judgment of each decision tree, the prediction model itself consists of a plurality of classifiers, and then a final prediction result is obtained through voting and averaging.
2. Further consider how to verify the reliability degree of the prediction result of the abdominal aortic aneurysm prediction model, the decision making process of the algorithm is explained based on a voting tree of a random forest algorithm, and the random forest mainly comprises a plurality of CART trees. In order to clearly explain the decision process of the random forest, each characteristic information is taken as a decision tree, and the voting process of the random forest algorithm is deduced. The prediction result of the traditional random forest algorithm is unstable, an interpretable rule is used for assisting in the derivation process of the random forest, firstly, the output result of a decision tree is corrected by using a matching rule, then the correction result is further verified according to the label in the test set, and under the condition that the data set is unchanged, if the correction result is equal to the weight average value obtained by the abdominal aortic aneurysm prediction model on the test set, the prediction result is indicated to have higher prediction precision.
The foregoing description of the embodiments has been provided for the purpose of illustrating the general principles of the invention, and is not meant to limit the scope of the invention, but to limit the invention to the particular embodiments, and any modifications, equivalents, improvements, etc. that fall within the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (10)

1. An early abdominal aortic aneurysm prediction method based on a random forest algorithm is characterized by comprising the following steps:
obtaining a first dataset of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoke is drawn, whether obesity, whether hypertension is present, and whether there is a family history of abdominal aortic aneurysm;
training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model;
and acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted.
2. The early abdominal aortic aneurysm prediction method based on the random forest algorithm according to claim 1, wherein the first data set is trained based on the random forest algorithm to obtain an abdominal aortic aneurysm prediction model, specifically:
randomly extracting characteristic information of the first data set to generate a plurality of random sample sets;
generating a classification regression tree of a plurality of random sample sets based on a classification regression algorithm, and calculating the coefficient of the basis of the random sample sets;
for a plurality of random sample sets where the current node of the classification regression tree is located, returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the threshold value of the coefficient of the foundation; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
taking the feature information corresponding to the minimum coefficient of the coefficient of each feature information as optimal feature information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal feature information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
calculating the coefficient of the first information set and the second information set, returning the sub decision trees by the classification regression algorithm and stopping recursion when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, and generating a plurality of decision trees;
and learning the decision trees by a plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
3. The early prediction method of abdominal aortic aneurysm based on random forest algorithm according to claim 2, wherein the second data set is input into the abdominal aortic aneurysm prediction model for prediction, and the prediction result of the patient to be predicted is output, specifically:
dividing the second data set into a training set and a testing set according to a preset proportion;
randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling;
generating a plurality of weight values for the sample set based on the plurality of decision trees;
and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted.
4. The method for early prediction of abdominal aortic aneurysm based on random forest algorithm according to claim 3, wherein the method further comprises:
presetting a matching rule of characteristic information of a first data set, correcting a weight value output by a decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value;
predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set;
and when the weight correction value is equal to the weight average value, outputting a prediction result of the patient to be predicted.
5. The method of claim 2, wherein a plurality of characteristic information is randomly extracted from the first dataset to obtain the random sample set, wherein the data format of the first dataset is a multi-row multi-column data matrix.
6. The early abdominal aortic aneurysm prediction method based on the random forest algorithm according to claim 1, wherein a plurality of output results of the abdominal aortic aneurysm prediction model are selected by adopting a receiving operator characteristic curve so that the abdominal aortic aneurysm outputs a prediction result of a patient to be predicted.
7. An early abdominal aortic aneurysm prediction system based on a random forest algorithm, comprising:
an acquisition module for acquiring a first data set of characteristic information of an abdominal aortic aneurysm patient, wherein the characteristic information comprises age, gender, whether smoking, whether obesity, whether hypertension and whether there is a family medical history of abdominal aortic aneurysm;
the training module is used for training the first data set based on a random forest algorithm to obtain an abdominal aortic aneurysm prediction model;
the prediction module is used for acquiring a second data set of the patient to be predicted, which corresponds to the characteristic information of the patient with the abdominal aortic aneurysm, inputting the second data set into the abdominal aortic aneurysm prediction model for prediction, and outputting a prediction result of the patient to be predicted.
8. The early abdominal aortic aneurysm prediction system based on random forest algorithm according to claim 7, wherein the training module comprises:
the random extraction module is used for randomly extracting the characteristic information of the first data set to generate a plurality of random sample sets;
the classification tree generation module is used for generating classification regression trees of a plurality of random sample sets based on a classification regression algorithm and calculating the coefficient of the basis of the random sample sets;
the judging module is used for returning the classification regression algorithm to the sub-decision tree and stopping recursion when the coefficient of the foundation of the plurality of random sample sets is smaller than the threshold value of the coefficient of the foundation for the plurality of random sample sets where the current node of the classification regression tree is located; or when the coefficient of the basis of the random sample sets is not smaller than the threshold value of the basis of the coefficient, calculating the coefficient of the basis of each piece of characteristic information in the random sample set where the current node is located;
the sample dividing module is used for taking the characteristic information corresponding to the minimum coefficient of the coefficient of each characteristic information as optimal characteristic information, dividing a corresponding random sample set into a first information set and a second information set based on the optimal characteristic information, and taking the first information set and the second information set as a left child node and a right child node of the current node respectively;
the decision tree generation module is used for calculating the coefficient of the first information set and the second information set, and when the coefficient of the first information set and the second information set is smaller than the threshold value of the coefficient of the first information set and the second information set, the classification regression algorithm returns to the sub-decision tree and stops recursion to generate a plurality of decision trees;
and the learning module is used for learning the plurality of decision trees by the plurality of classifiers to obtain an abdominal aortic aneurysm prediction model.
9. The early abdominal aortic aneurysm prediction system based on random forest algorithm according to claim 8, wherein the prediction module is further configured to: dividing the second data set into a training set and a testing set according to a preset proportion; randomly sampling the characteristic information of the training set for a plurality of times to obtain a sampling sample set, wherein the sampled characteristic information is put back into a second data set for next random sampling after each random sampling; generating a plurality of weight values for the sample set based on the plurality of decision trees; and calculating the sum value of the plurality of weight values, and dividing the sum value of the plurality of weight values by the number of random sampling to obtain a prediction result of the patient to be predicted.
10. The early abdominal aortic aneurysm prediction system based on random forest algorithm according to claim 9, wherein the system further comprises:
the correction module is used for presetting a matching rule of the characteristic information of the first data set, correcting the weight value output by the decision tree based on the matching rule, and summing the corrected weight value to obtain a weight correction value;
the weight calculation module is used for predicting the test set based on the abdominal aortic aneurysm prediction model to obtain a weight average value of the test set;
and the interpretation module is used for outputting a prediction result of the patient to be predicted when the weight correction value is equal to the weight average value.
CN202310212701.0A 2023-03-08 2023-03-08 Abdominal aortic aneurysm early prediction method and system based on random forest algorithm Active CN116092680B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310212701.0A CN116092680B (en) 2023-03-08 2023-03-08 Abdominal aortic aneurysm early prediction method and system based on random forest algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310212701.0A CN116092680B (en) 2023-03-08 2023-03-08 Abdominal aortic aneurysm early prediction method and system based on random forest algorithm

Publications (2)

Publication Number Publication Date
CN116092680A true CN116092680A (en) 2023-05-09
CN116092680B CN116092680B (en) 2023-06-09

Family

ID=86210324

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310212701.0A Active CN116092680B (en) 2023-03-08 2023-03-08 Abdominal aortic aneurysm early prediction method and system based on random forest algorithm

Country Status (1)

Country Link
CN (1) CN116092680B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117059214A (en) * 2023-07-21 2023-11-14 南京智慧云网络科技有限公司 Clinical scientific research data integration and intelligent analysis system and method based on artificial intelligence

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060184496A1 (en) * 2005-01-14 2006-08-17 Wang Lu-Yong System and method for molecular diagnosis of depression based on boosting classification
CN109816158A (en) * 2019-01-04 2019-05-28 平安科技(深圳)有限公司 Combined method, device, equipment and the readable storage medium storing program for executing of prediction model
CN110534193A (en) * 2019-09-02 2019-12-03 强联智创(北京)科技有限公司 A kind of aneurysm rupture methods of risk assessment and system
CN110603329A (en) * 2017-03-02 2019-12-20 优美佳肿瘤技术有限公司 Methylation markers for diagnosis of hepatocellular carcinoma and lung cancer
CN110786824A (en) * 2019-12-02 2020-02-14 中山大学 Coarse marking fundus oculi illumination bleeding lesion detection method and system based on bounding box correction network
US20210104321A1 (en) * 2018-11-15 2021-04-08 Ampel Biosolutions, Llc Machine learning disease prediction and treatment prioritization
CN112617770A (en) * 2020-12-28 2021-04-09 首都医科大学附属北京天坛医院 Intracranial aneurysm risk prediction method based on artificial intelligence and related device
US20210350930A1 (en) * 2020-05-11 2021-11-11 Roche Molecular Systems, Inc. Clinical predictor based on multiple machine learning models
CN113962278A (en) * 2021-01-12 2022-01-21 大连理工大学 Intelligent ensemble learning classification method based on clustering
CN114639441A (en) * 2022-05-18 2022-06-17 山东建筑大学 Transcription factor binding site prediction method based on weighted multi-granularity scanning
CN115553737A (en) * 2022-10-13 2023-01-03 昆明同心医联科技有限公司 Aneurysm rupture state recognition system and medical assistance system

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060184496A1 (en) * 2005-01-14 2006-08-17 Wang Lu-Yong System and method for molecular diagnosis of depression based on boosting classification
CN110603329A (en) * 2017-03-02 2019-12-20 优美佳肿瘤技术有限公司 Methylation markers for diagnosis of hepatocellular carcinoma and lung cancer
US20210104321A1 (en) * 2018-11-15 2021-04-08 Ampel Biosolutions, Llc Machine learning disease prediction and treatment prioritization
CN109816158A (en) * 2019-01-04 2019-05-28 平安科技(深圳)有限公司 Combined method, device, equipment and the readable storage medium storing program for executing of prediction model
CN110534193A (en) * 2019-09-02 2019-12-03 强联智创(北京)科技有限公司 A kind of aneurysm rupture methods of risk assessment and system
CN110786824A (en) * 2019-12-02 2020-02-14 中山大学 Coarse marking fundus oculi illumination bleeding lesion detection method and system based on bounding box correction network
US20210350930A1 (en) * 2020-05-11 2021-11-11 Roche Molecular Systems, Inc. Clinical predictor based on multiple machine learning models
CN112617770A (en) * 2020-12-28 2021-04-09 首都医科大学附属北京天坛医院 Intracranial aneurysm risk prediction method based on artificial intelligence and related device
CN113962278A (en) * 2021-01-12 2022-01-21 大连理工大学 Intelligent ensemble learning classification method based on clustering
CN114639441A (en) * 2022-05-18 2022-06-17 山东建筑大学 Transcription factor binding site prediction method based on weighted multi-granularity scanning
CN115553737A (en) * 2022-10-13 2023-01-03 昆明同心医联科技有限公司 Aneurysm rupture state recognition system and medical assistance system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
盛昌等: "机器学习模型预测开放手术修复腹主动脉瘤后急性肾损伤的发生", 《中南大学学报(医学版)》, pages 213 - 220 *
聂丽霞;刘辉;邹凌;: "基于异构网络特征与梯度提升决策树的协同药物预测", 计算机应用与软件, no. 04, pages 48 - 52 *
邓成龙等: "基于随机森林的宫颈鳞癌放化疗疗效预测", 《软件学报》, pages 3960 - 3976 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117059214A (en) * 2023-07-21 2023-11-14 南京智慧云网络科技有限公司 Clinical scientific research data integration and intelligent analysis system and method based on artificial intelligence

Also Published As

Publication number Publication date
CN116092680B (en) 2023-06-09

Similar Documents

Publication Publication Date Title
CN108682457B (en) Patient long-term prognosis quantitative prediction and intervention system and method
CN116092680B (en) Abdominal aortic aneurysm early prediction method and system based on random forest algorithm
JP2012512730A (en) Method for calculating brightness level in a target area of a digital X-ray image for medical use
Mualla et al. Dental Age Estimation Based on X-ray Images.
CN111000553A (en) Intelligent classification method for electrocardiogram data based on voting ensemble learning
CN111803059A (en) Electrocardiosignal classification method and device based on time domain convolution network
US20210304896A1 (en) Systems and methods for medical diagnosis
CN115938590B (en) Construction method and prediction system of colorectal cancer postoperative LARS prediction model
CN112052874B (en) Physiological data classification method and system based on generation countermeasure network
CN112183572A (en) Method and device for generating prediction model for predicting pneumonia severity
CN111833321A (en) Window-adjusting optimization-enhanced intracranial hemorrhage detection model and construction method thereof
CN112669319B (en) Multi-view multi-scale lymph node false positive inhibition modeling method
CN114494263B (en) Medical image lesion detection method, system and equipment integrating clinical information
JP7454679B2 (en) Binocular keratoconus evaluation system based on multimodal data
CN115346598A (en) Chronic kidney disease genetic gene risk screening system
CN114565617A (en) Pruning U-Net + + based breast tumor image segmentation method and system
Aranha et al. Deep transfer learning strategy to diagnose eye-related conditions and diseases: An approach based on low-quality fundus images
Mayya et al. A novel medical support deep learning fusion model for the diagnosis of COVID-19
CN114649092A (en) Auxiliary diagnosis method and device based on semi-supervised learning and multi-scale feature fusion
CN114093521A (en) Random forest based method and system for estimating blood sugar by reconstructing homogenized samples
TWI768951B (en) Electronic device and method of training classification model for age-related macular degeneration
CN117133459B (en) Machine learning-based postoperative intracranial infection prediction method and system
CN116153516B (en) Disease big data mining analysis system based on distributed computing
CN115240854B (en) Pancreatitis prognosis data processing method and system
Alibraheemi Classification Covid-19 disease based on CNN and Hybrid Models

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant