CN116705286B - Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium - Google Patents

Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium Download PDF

Info

Publication number
CN116705286B
CN116705286B CN202310489636.6A CN202310489636A CN116705286B CN 116705286 B CN116705286 B CN 116705286B CN 202310489636 A CN202310489636 A CN 202310489636A CN 116705286 B CN116705286 B CN 116705286B
Authority
CN
China
Prior art keywords
mafld
risk
fatty liver
ccr
hba1c
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310489636.6A
Other languages
Chinese (zh)
Other versions
CN116705286A (en
Inventor
刘莉
肖芦山
王家仁
曾琳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southern Hospital Southern Medical University
Original Assignee
Southern Hospital Southern Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southern Hospital Southern Medical University filed Critical Southern Hospital Southern Medical University
Priority to CN202310489636.6A priority Critical patent/CN116705286B/en
Publication of CN116705286A publication Critical patent/CN116705286A/en
Application granted granted Critical
Publication of CN116705286B publication Critical patent/CN116705286B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/50Mutagenesis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B35/00ICT specially adapted for in silico combinatorial libraries of nucleic acids, proteins or peptides
    • G16B35/20Screening of libraries
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Public Health (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Evolutionary Biology (AREA)
  • Biomedical Technology (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • Molecular Biology (AREA)
  • Epidemiology (AREA)
  • Chemical & Material Sciences (AREA)
  • Pathology (AREA)
  • Primary Health Care (AREA)
  • Analytical Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioethics (AREA)
  • Genetics & Genomics (AREA)
  • Biochemistry (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention provides a prediction method, electronic equipment and storage medium based on metabolic-related fatty liver disease, wherein indexes obviously related to MAFLD risks are obtained through single-factor and multi-factor Logistic regression analysis, and a prediction model with higher accuracy is established by using a nomogram method according to the indexes. The model is simple, quick and convenient to implement, can be used for large-scale MAFLD screening, improves the identification accuracy of people with higher MAFLD disease risk, can reduce the waste of medical resources, is favorable for improving the diagnosis efficiency of MAFLD, helps MAFLD patients to receive treatment in time, and prevents disease deterioration.

Description

Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium
Technical Field
The invention belongs to the field of bioinformatics analysis, and particularly relates to a prediction method based on metabolic-related fatty liver disease, electronic equipment and a storage medium.
Background
Nonalcoholic fatty liver disease (NAFLD) is a global public health problem with increasing prevalence, and the current definition of NAFLD requires the exclusion of heavy drinking and other secondary causes of chronic liver disease. However, fatty liver disease is not only focused on patients with or without excessive alcohol consumption, but may also be a disease driver in other forms of chronic liver disease patients. To better understand fatty liver disease, a new name and definition of adult NAFLD, a metabolic-related fatty liver disease (MAFLD), was proposed by an international panel of experts from 22 countries. MAFLD affects approximately one-fourth of adults worldwide, with prevalence of up to 50.7% in overweight and obese people.
The risk of cardiovascular events in MAFLD patients has increased significantly, as has the number of hepatocellular carcinoma (HCC) patients associated with MAFLD. Therefore, it is desirable to diagnose MAFLD patients as early as possible and to take appropriate intervention to prevent further disease progression. The diagnostic criteria for MAFLD were: histological (biopsy), imaging, or blood biomarkers confirm liver fat accumulation (i.e., liver steatosis) and meet at least one of three criteria: overweight or obese (caucasian body mass index [ BMI ] 25kg/m2 or Asian BMI 25kg/m 2), type 2 diabetes (T2 DM), metabolic disorders. Metabolic disorder refers to at least two of the following metabolic risk situations: increased waistline, arterial hypertension, hypertriglyceridemia, low high density lipoprotein cholesterol, pre-diabetes, subclinical inflammation and insulin resistance.
The diagnosis of MAFLD is complex, and the diagnosis of all people suspected to suffer from MAFLD by using the diagnosis standard can cost a lot of manpower and material resources, and can delay the real MAFLD patient from timely treatment. While some existing models or biomarkers for screening and diagnosing MAFLD are built in small sample populations, for example, MAFLD risk prediction models based on BMI and Waist Circumference (WC) are developed in 535 overweight or obese populations, are not suitable for general populations, and have an AUC of only 0.79. Therefore, there is a need to build MAFLD predictive models that can be used for large-scale crowd screening and are easy to implement.
Disclosure of Invention
Diagnosis of MAFLD as early as possible is helpful for patients to receive treatment in time, delay disease progression and prevent serious complications, but MAFLD prediction models which are suitable for large-scale people, convenient to implement and high in accuracy are still lacking at present. The invention aims to provide a prediction model of metabolic-related fatty liver disease based on creatinine and Cystatin C Ratio (CCR), a construction method and application thereof. The invention aims to provide a prediction model and a model construction method of metabolic-related fatty liver disease based on the ratio of creatinine to cystatin C, aiming at the defects existing at present.
The invention provides a method for predicting metabolic-related fatty liver disease based on the ratio of creatinine to cystatin C, which comprises the following steps:
(1) The following index data of the participants are obtained from the database: age, sex, race, BMI, T2DM, hbA1c, TG, CHOL, CCR, GGT, AST, ALT,
wherein, the whole names of the indexes are respectively: MAFLD: metabolic-related fatty liver disease, BMI: body mass index, GGT: glutamyl transferase, CCR: creatinine to cystatin C ratio, AST: aspartate aminotransferase, ALT: alanine transamination, T2DM: type 2 diabetes, hbA1c: glycosylated hemoglobin, TG: triglyceride, CHOL:
cholesterol;
(2) The correlation between each index and the MAFLD disease risk is analyzed by using single-factor Logistic regression, indexes which are obviously correlated with the MAFLD disease risk are screened out, and independent risk factors which are obviously correlated with the MAFLD disease risk are obtained by using multi-factor Logistic analysis, wherein the method comprises the following steps: age, gender, BMI, CCR, T DM, hbA1c, TG, CHOL, wherein age, gender, CCR, hbA1c, TG, CHOL are selected to build a predictive model;
(3) Scoring the independent risk factors by using a nomogram, wherein the scoring formula of each independent risk factor is as follows:
CCR score = -0.243736557 ccr+48.747311336;
age score = -0.09417962 age+7.0634471474;
gender scoring: male= 12.89266, female=0;
HbA1c score = 0.711006907 HbA1c-14.220138147;
TG score = 0.166666667 TG;
CHOL score = -1.302132565 x chol+12.370299771;
(4) Constructing a prediction model according to the scores, wherein the total risk assessment score (T) of the prediction model is the sum of scores of six indexes including age, gender and HbA1c, TG, CHOL, CCR;
(5) The calculation formula for obtaining the MAFLD disease risk (R) according to the risk assessment total score is as follows:
R=-5.191e -06 *T3+0.001311815*T2-0.089010605*T+1.792531915;
(6) And inputting six index values of the detection crowd into a calculation formula of the MAFLD disease risk (R) to obtain the corresponding MAFLD disease risk.
Preferably, before the step of obtaining the following indexes of the participants from the database, the step of screening the participants meeting the conditions is further included:
(1) Excluding participants who were unable to diagnose whether they had MAFLD because of the lack of information;
(2) Excluding participants with missing study index (age, sex, race, BMI, T2DM, hbA1c, TG, CHOL, CCR) information;
(3) Since the level of creatinine to cystatin C ratio is affected by renal function, the glomerular filtration rate is excluded from being lower than 60ml/min/1.73m 2 Is a participant in the process.
Preferably, the single-factor Logistic regression analysis and the multi-factor Logistic regression analysis use R language for data analysis and glm function for regression analysis.
Preferably, the correlation between each index and the risk of MAFLD disease is analyzed using single factor Logistic regression, and the variability between MAFLD groups and non-MAFLD group classification or continuous variables in baseline data is assessed by independent sample t-test, chi-square test, or Mann-Whitney U test.
Preferably, when P < 0.05, this indicator is shown to be significantly correlated with risk of MAFLD disease; wherein, P is a statistical index for judging whether the result has statistical significance, namely whether the correlation is significant.
Preferably, after obtaining the calculation formula of the risk of the MAFLD according to the risk assessment total score, the method further comprises: a subject operating characteristic curve (ROC) is plotted and the accuracy of the predictive model is verified by the area under ROC curve (AUC). AUC, also known as C statistic, is mainly used to evaluate the accuracy of diagnostic tests, or the accuracy of binary regression model predictions. The range of AUC is 0.5-1.0, AUC is 0.5 and corresponds to random classification, 1.0 shows complete accuracy, 0.51-0.59 shows poor accuracy of the model, 0.60-0.69 shows poor accuracy, 0.70-0.79 shows moderate accuracy, 0.80-0.89 shows good accuracy, and 0.90 or more shows good accuracy. The AUC of the prediction model established by the invention for predicting the MAFLD risk is 0.841, which shows that the accuracy is higher.
Preferably, after six index values of the crowd to be detected are input into the prediction model to obtain corresponding MAFLD disease risks, the method further comprises the step of screening out crowds with high MAFLD disease risks, including:
and marking the people with MAFLD disease risks exceeding the reference value in the detected people, and reminding the people to seek medical attention in time.
The optimal cut-off value of the ROC curve is 0.637, and the specificity and the sensitivity of the model are 75.8% and 76.1% respectively, namely 63.7% can be used as reference values for distinguishing the risk of MAFLD.
Preferably, the present application further includes an electronic device, including:
a processor;
a memory for storing processor-executable instructions;
wherein the processor is configured to implement the method of predicting metabolic-related fatty liver disease based on a creatinine to cystatin C ratio when executing the instructions.
Also included is a computer readable storage medium comprising instructions that instruct a device to perform the method of predicting a metabolic-related fatty liver disease based on a creatinine to cystatin C ratio. Compared with the prior art, the invention has the following positive effects:
1. compared with the traditional prediction model, the invention additionally introduces 6 indexes of age, sex and HbA1c, TG, CHOL, CCR, namely BMI and Waistline (WC), and a nonogram method is used for scoring and establishing the prediction model according to the indexes, so that the accuracy of the prediction model is higher.
2. The traditional prediction is mainly carried out in specific crowds such as overweight or obese crowds, and is not suitable for most common crowds, and the prediction model established by the invention can be suitable for the prediction of the common crowds, and the screening range and the audience are wider.
3. The prediction model is simple and convenient to operate, the MAFLD disease risk can be obtained by inputting index information, the implementation is convenient, the prediction model can be used for large-scale MAFLD screening, and the recognition rate of people with high MAFLD disease risk is improved.
Drawings
The invention will be further understood from the following description taken in conjunction with the accompanying drawings. The components in the figures are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the embodiments. Like reference numerals designate corresponding parts throughout the different views.
Fig. 1: flow chart for screening participants
Fig. 2: nomograms predicting MAFLD risk
Fig. 3: subject work characteristic curve of predictive model
Detailed Description
The following embodiments of the present invention are described in terms of specific examples, and those skilled in the art will appreciate the advantages and effects of the present invention from the disclosure herein. The invention is capable of other and different embodiments and its several details are capable of modification and variation in various respects, all without departing from the spirit of the present invention. The drawings of the present invention are merely schematic illustrations, and are not intended to be drawn to actual dimensions. The following embodiments will further illustrate the related art content of the present invention in detail, but the disclosure is not intended to limit the scope of the present invention.
The following describes in detail the embodiments of the present invention with reference to fig. 1-3.
Table 1 shows the index involved in the predictive model and its abbreviation control
Abbreviations (abbreviations) Terminology
MAFLD Metabolism related fatty liver disease
BMI Body mass index
WC Waistline
GGT Glutamyl transferase
CCR Creatinine to cystatin C ratio
AST Aspartate aminotransferase
ALT Alanine aminotransferase
T2DM Type 2 diabetes mellitus
HbA1c Glycosylated hemoglobin
TG Triglycerides (Triglycerides)
CHOL Cholesterol
The diagnostic basis for MAFLD is liver steatosis and the occurrence of any of the following 3 conditions: (1) overweight or obese, (2) T2DM, and (3) at least 2 metabolic abnormalities. Fatty Liver Index (FLI) is used to determine participants due to lack of tissue biopsy and ultrasound image dataIs a hepatic steatosis condition. The fatty liver index is calculated by the following formula: fli=100× (e 0.953 ×loge(TG)+0.139×BMI+0.718×loge(GGT)+0.053×WC-15.745)/(1+e 0.953 Xloge (TG) +0.139 XBMI+0.718 Xloge (GGT) +0.053 XWC-15.745), FLI.gtoreq.60 is defined as liver steatosis.
According to the diagnosis basis of MAFLD, the prediction model of the metabolic-related fatty liver disease based on the ratio of creatinine to cystatin C and the modeling method thereof comprise the following steps:
1. screening eligible participants
Based on the diagnostic criteria of MAFLD, portions of the participant data that do not meet diagnostic criteria are eliminated when screening the database for appropriate participant baseline data. The steps of screening participants are shown in fig. 1, and the screening for baseline data includes:
1) 35542 participants who were unable to diagnose whether they had MAFLD due to the lack of information were excluded;
2) The participants with 94344 study index (age, sex, race, BMI, T2DM, hbA1c, TG, CHOL, CCR) information missing were continuously excluded;
3) Since the levels of creatinine and cystatin C are affected by renal function, the filtration rate of 3861 glomeruli is lower than 60ml/min/1.73m 2 Is excluded from participants in the study.
Baseline data of 368634 participants were finally included for analysis, with a median age of 58 years and a female percentage of 54%. 139002 participants were diagnosed with MAFLD at baseline. Compared to the non-MAFLD population, MAFLD patients were mostly men, aged higher, higher BMI, GGT, hbA1c, TG, CHOL levels, lower CCR, ALB levels.
CCR is divided into three groups according to the triad of CCR levels: CCR tertile 1, CCR tertile 2, CCR tertile 3. Baseline data for participants are shown in the following table:
table 2: baseline characteristics of participants
Wherein n represents the number of people; and P is used for judging whether the statistical result has statistical significance.
The classifier variables in the baseline data of the present invention are expressed in numbers and percentages [ n (%) ], for example CCR tertile 1, as: 121649 (33); the continuous variable is expressed using a median and first and third quartiles [ medium (Q1, Q3) ] such as age expressed as: 58 (50,63). The independent sample t-test, chi-square test, or Mann-Whitney U test was used to evaluate the variability between the classification of the MAFLD group and the non-MAFLD group or continuous variable in the baseline data.
2. Independent risk factors related to MAFLD (magnetic resonance imaging) are obtained by adopting single-factor and multi-factor Logistic regression analysis
The data analysis was performed using the R language and the glm function was used for Logistic single and multi-factor analysis. Inclusion age, gender, race, BMI, CCR, T DM, hbA1c, TG, CHOL for single factor analysis; the analysis results are shown in the following table:
table 3: single-and multi-factor Logistic regression analysis
Wherein, OR: ratio (odds ratio), also known as odds ratio, is used to reflect the difference in exposure between cases and controls; or=1, indicating no correlation between exposure and disease; OR > 1, indicating that exposure is a risk factor for disease (positive correlation); OR < 1, indicating that the exposure factor is a protective factor (negative correlation) for the disease. The OR value is a point estimate, and the 95% confidence interval (95% CI) is an interval estimate, with the breadth of the confidence interval reflecting the accuracy of the parameter estimate, the narrower the confidence interval, the more accurate the estimate. The results are considered statistically significant, i.e., correlation is significant, when P < 0.05.
The present invention uses single and multi-factor Logistic regression analysis to determine variables that are significantly correlated (P < 0.05) to the risk of MAFLD. The variables P < 0.05 in the multifactor analysis were included in the predictive model. As shown in table 3, the single factor analysis results showed that age, gender, BMI, CCR, T DM, hbA1c, TG, CHOL were significantly correlated with risk of having MAFLD (P < 0.05), and these 8 indicators were included as covariates in the multi-factor analysis, and the results showed that age, gender, BMI, CCR, T DM, hbA1c, TG, CHOL were significantly correlated with risk of MAFLD (P < 0.05), thus determining that 8 indicators of age, gender, BMI, CCR, T DM, hbA1c, TG, CHO were independent risk factors of MAFLD.
3. Predictive model establishment by nomogram method scoring
Since BMI, T2DM are already included in the main diagnostic criteria of MAFLD, six indicators of age, gender, hbA1c, TG, CHOL, CCR were included, and R-package rms was used to build nomogram method scores to build a predictive model for predicting the risk of developing MAFLD in the population, and the model results are shown in FIG. 2.
Formula using R-package nomogram ex extraction model:
risk of MAFLD disease R= -5.191e -06 *T 3 +0.001311815*T 2 -0.089010605*T+
1.792531915, wherein the total risk assessment score T is the sum of the index scores.
The score for each index is:
CCR score = -0.243736557 ccr+48.747311336;
age score = -0.09417962 age+7.0634471474;
gender scoring: male= 12.89266, female=0;
HbA1c score = 0.711006907 HbA1c-14.220138147;
TG score = 0.166666667 TG;
CHOL score = -1.302132565 x chol+12.370299771.
For example, a 30 year old (4.238082874 score) male (12.89266 score) with a CCR of 64.6
(33.001929753 minutes), hbA1c of 37.9mmol/mol (12.7270236283 minutes), TG of 234.5mg/dL (39.0833334115 minutes), CHOL of 7.36mmol/L (2.7865636926 minutes), and calculating the sum of the scores of the indexes, namely, the total score T of 104.7295933602 minutes, to show that the MAFLD disease risk is 90%, and reminding the patient to take a definite diagnosis in time and take intervention measures. For the treatment of MAFLD, there is no approved drug specifically for MAFLD treatment, and the main treatment methods are to change life style, including adjustment of dietary structure and exercise enhancement.
A30 year old (4.238082874 score) female (0 score), CCR 86.7 (27.6153518441 score), hbA1c 29.7mmol/mol (7.6077738979 score), TG 165.5mg/dL (27.5833333885 score), CHOL 6.38mmol/L (4.0626536063 score), total T71.1071956108 score, and MAFLD risk 23%, indicating lower risk, no need for diagnosis, and regular monitoring.
4. Evaluating accuracy of predictive models
The efficacy of the predictive model was assessed by plotting the subject's working characteristics curve (Receiver Operating Characteristic Curve, ROC curve) and calculating the area under the ROC curve (AUC). ROC curves, which are a curve drawn by continuously measuring different observation tangent point values of a variable, can be used to evaluate the predictive performance of a statistical model to predict a classification result, the Y-axis of the curve being sensitivity and the X-axis being specificity. AUC, also known as C statistic, is mainly used to evaluate the accuracy of diagnostic tests, or the accuracy of binary regression model predictions. The range of AUC is 0.5-1.0, AUC is 0.5G to indicate corresponding random classification, 1.0 indicates complete accuracy, 0.51-0.59 indicates poor accuracy of the model, 0.60-0.69 indicates poor accuracy, 0.70-0.79 indicates moderate accuracy, 0.80-0.89 indicates good accuracy, and 0.90 or more indicates good accuracy.
According to the invention, an R packet pROC is adopted to draw an ROC curve, an AUC value is calculated, and an optimal cut-off value is found, as shown in figure 3, the AUC of the model prediction MAFLD risk is 0.841, which shows that the accuracy is higher; the cut-off value is 0.637, and the specificity and sensitivity of the model are 75.8% and 76.1% respectively, namely 63.7% can be used as reference values for distinguishing the risk of MAFLD.
5. Application of predictive model
The predictive model can be used for making a small program for a needed person, and the user can obtain the risk of the MAFLD by inputting the numerical values of six indexes including age, sex and HbA1c, TG, CHOL, CCR of the user or the family into the predictive model, and the small program can provide instruction in aspects of diet and exercise for the user with higher risk probability of the MAFLD. The predictive model may also be used to screen MAFLD patients during hospital physical examination or community screening.
Correspondingly, the electronic equipment is used for realizing a prediction method of metabolic-related fatty liver disease based on the ratio of creatinine to cystatin C, and the electronic equipment at least comprises a processor and a memory. The memory is for storing instructions executable by the processor 21, the processor being configured to implement a method of predicting metabolic-related fatty liver disease based on a creatinine to cystatin C ratio when the instructions are executed.
Also included is a computer readable storage medium comprising instructions that instruct a device to perform a method of predicting a metabolic-related fatty liver disease based on a creatinine to cystatin C ratio.
The program to be executed in the electronic device may be a program (a program for causing a computer to function) for controlling a central processing unit (Central Processing Unit, CPU) or the like to realize the functions of the above-described embodiment according to an aspect of the present invention. Information processed by these devices is temporarily stored in a random access Memory (Random Access Memory, RAM) when the processing is performed, and then stored in various ROMs such as a Read Only Memory (Flash ROM) and a Hard Disk Drive (HDD), and Read, corrected, and written by a CPU as necessary.
The electronic device according to the above embodiment may be partially implemented by a computer. In this case, the program for realizing the control function may be recorded on a computer-readable recording medium, and the program recorded on the recording medium may be read into a computer system and executed.
The term "computer system" as used herein refers to a computer system built in an electronic device, and uses hardware including an OS and peripheral devices. The term "computer-readable recording medium" refers to a removable medium such as a flexible disk, a magneto-optical disk, a ROM, or a CD-ROM, and a storage device such as a hard disk incorporated in a computer system.
Also, the "computer-readable recording medium" may include: a medium for dynamically storing a program in a short time, such as a communication line in the case of transmitting the program via a network such as the internet or a communication line such as a telephone line; a medium storing a program for a fixed time, such as a volatile memory in a computer system, which is a server or a client in this case. The program may be a program for realizing a part of the functions described above, or may be a program capable of realizing the functions described above by being combined with a program recorded in a computer system.
The electronic apparatus according to the above embodiment may be realized as an aggregate (device group) including a plurality of devices. Each device constituting the device group may include a part or all of each function or each functional block of the electronic apparatus according to the above embodiment. The device group may have all the functions or functional blocks of the electronic apparatus.
The above disclosure is only a preferred practical embodiment of the present invention and is not intended to limit the scope of the present invention, but various modifications and variations will be apparent to those skilled in the art. All equivalent technical changes, including various equivalent substitutions, improvements, etc. made in the description and the accompanying drawings of the present invention are included in the protection scope of the present invention.

Claims (10)

1. A method for predicting metabolic-related fatty liver disease based on creatinine to cystatin C ratio, comprising the steps of:
(1) The following index data of the participants are obtained from the database: age, sex, race, BMI, T2DM, hbA1c, TG, CHOL, CCR, GGT, AST, ALT,
wherein, the whole names of the indexes are respectively: MAFLD: metabolic-related fatty liver disease, BMI: body mass index, T2DM: type 2 diabetes, hbA1c: glycosylated hemoglobin, TG: triglyceride, CHOL: cholesterol, CCR: creatinine to cystatin C ratio, GGT: glutamyl transferase, AST: aspartate aminotransferase, ALT: converting alanine into ammonia;
(2) Analyzing the correlation between each index and the MAFLD disease risk by using single-factor Logistic regression, screening out indexes obviously related to the MAFLD disease risk, and obtaining independent risk factors obviously related to the MAFLD disease risk by using multi-factor Logistic regression, wherein the method comprises the following steps: age, gender, BMI, CCR, T DM, hbA1c, TG, CHOL, wherein age, gender, CCR, hbA1c, TG, CHOL are selected to build a predictive model;
(3) Scoring the independent risk factors by using a nomogram, wherein the scoring formula of each independent risk factor is as follows:
CCR score = -0.243736557 ccr+48.747311336;
age score = -0.09417962 age+7.063471474;
gender scoring: male= 12.89266, female=0;
HbA1c score = 0.711006907 HbA1c-14.220138147;
TG score = 0.166666667 TG;
CHOL score = -1.302132565 chol+12.370259371;
(4) Constructing a prediction model according to the scores, wherein the total risk assessment score of the prediction model is the sum of scores of six indexes including age, gender and HbA1c, TG, CHOL, CCR;
(5) The calculation formula for obtaining the MAFLD disease risk according to the risk assessment total score is as follows:
R=-5.191e -06 * T 3 + 0.001311815 * T 2 0.089010605 t+ 1.792531915, where T is the total score of risk assessment;
(6) And inputting six index values of the detected crowd into a calculation formula of the MAFLD disease risk to obtain a corresponding MAFLD disease risk.
2. The method for predicting metabolic-related fatty liver disease based on creatinine to cystatin C ratio of claim 1, further comprising the step of screening for eligible participants before the step of retrieving the following indicators of participants from the database in step (1), comprising:
(1) Excluding participants who were unable to diagnose whether they had MAFLD because of the lack of information;
(2) Excluding participants with missing study index information, wherein the study index includes age, gender, race, BMI, T2DM, hbA1c, TG, CHOL, CCR;
(3) Since the level of creatinine to cystatin C ratio is affected by renal function, the glomerular filtration rate is excluded from being lower than 60ml/min/1.73m 2 Is a participant in the process.
3. The method for predicting metabolic-related fatty liver disease based on creatinine to cystatin C ratio of claim 1, wherein the single factor Logistic regression analysis and the multi-factor Logistic regression analysis in step (2) use R language for data analysis and glm function for regression analysis.
4. The method according to claim 1, wherein the analyzing the correlation between each of the indicators and the risk of developing MAFLD using single factor Logistic regression in step (2) comprises: the variability between the MAFLD group and the non-MAFLD group classification variables or continuous variables in the baseline data was assessed by independent sample t-test, chi-square test, or Mann-Whitney U test.
5. The method according to claim 1, wherein for the screening of the index significantly associated with risk of developing MAFLD in step (2), the method comprises:
when (when)PWhen < 0.05, it is shown that the index is significantly correlated with risk of MAFLD disease, wherein,Pthe statistical index is used for judging whether the result has statistical significance, namely whether the correlation is obvious.
6. The method according to claim 1, wherein after obtaining the calculation formula of the risk of developing MAFLD according to the total score of risk assessment in step (5), further comprising: a subject work characteristic curve ROC is plotted and the accuracy of the predictive model is assessed by the area under ROC curve AUC.
7. The method for predicting metabolic-related fatty liver disease based on creatinine and cystatin C ratio according to claim 1, wherein in step (6), six index values of the detected population are input into the prediction model, and after obtaining the corresponding risk of suffering from MAFLD, the method further comprises a step of screening out a population at high risk of suffering from MAFLD, comprising:
and marking the people with MAFLD disease risks exceeding the reference value in the detected people, and reminding the people to seek medical attention in time.
8. The method for predicting metabolic-related fatty liver disease based on the ratio of creatinine to cystatin C according to claim 7, wherein the reference value is 63.7%.
9. An electronic device, comprising:
a processor;
a memory for storing processor-executable instructions;
wherein the processor is configured to implement the method for predicting metabolic-related fatty liver disease based on the creatinine to cystatin C ratio of any one of claims 1 to 8 when executing the instructions.
10. A computer-readable storage medium comprising instructions that instruct a device to perform the method of predicting a metabolic-related fatty liver disease based on a creatinine to cystatin C ratio of any one of claims 1-8.
CN202310489636.6A 2023-05-04 2023-05-04 Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium Active CN116705286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310489636.6A CN116705286B (en) 2023-05-04 2023-05-04 Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310489636.6A CN116705286B (en) 2023-05-04 2023-05-04 Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium

Publications (2)

Publication Number Publication Date
CN116705286A CN116705286A (en) 2023-09-05
CN116705286B true CN116705286B (en) 2024-03-08

Family

ID=87834757

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310489636.6A Active CN116705286B (en) 2023-05-04 2023-05-04 Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN116705286B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3296744A1 (en) * 2016-09-16 2018-03-21 Biopredictive Method of diagnosis of non-alcoholic fatty liver diseases
CN111063440A (en) * 2019-12-31 2020-04-24 苏州和锐生物科技有限公司 Early non-alcoholic fatty liver disease evaluation model, construction method and application thereof
CN113160983A (en) * 2021-04-09 2021-07-23 南京医科大学附属逸夫医院 Metabolism-related fatty liver disease clinical prediction model
CN115910334A (en) * 2022-10-31 2023-04-04 浙江大学 Dynamic prediction model for risk of early new-onset hypertriglyceridemia after liver transplantation of recipients
CN115910322A (en) * 2022-11-06 2023-04-04 集美大学 Non-alcoholic fatty liver disease diagnosis model and construction method and system thereof

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3599615A1 (en) * 2018-07-27 2020-01-29 Biopredictive Method of diagnosis of liver steatosis
US20210298638A1 (en) * 2020-03-27 2021-09-30 Hygieia Sciences LLC Method of Detection of Fatty Liver Diseases by Breath Analysis

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3296744A1 (en) * 2016-09-16 2018-03-21 Biopredictive Method of diagnosis of non-alcoholic fatty liver diseases
CN111063440A (en) * 2019-12-31 2020-04-24 苏州和锐生物科技有限公司 Early non-alcoholic fatty liver disease evaluation model, construction method and application thereof
CN113160983A (en) * 2021-04-09 2021-07-23 南京医科大学附属逸夫医院 Metabolism-related fatty liver disease clinical prediction model
CN115910334A (en) * 2022-10-31 2023-04-04 浙江大学 Dynamic prediction model for risk of early new-onset hypertriglyceridemia after liver transplantation of recipients
CN115910322A (en) * 2022-11-06 2023-04-04 集美大学 Non-alcoholic fatty liver disease diagnosis model and construction method and system thereof

Also Published As

Publication number Publication date
CN116705286A (en) 2023-09-05

Similar Documents

Publication Publication Date Title
Wang et al. ZJU index: a novel model for predicting nonalcoholic fatty liver disease in a Chinese population
JP6235583B2 (en) Method for detecting and staging liver fibrosis from image acquisition data
US20210208147A1 (en) Method of prognosis and follow up of primary liver cancer
Choi et al. Magnetic resonance-based assessments better capture pathophysiologic profiles and progression in nonalcoholic fatty liver disease
Atta et al. Sonography as a predictor of human immunodeficiency virus–associated nephropathy
Tang et al. Among simple non-invasive scores, Pro-C3 and ADAPT best exclude advanced fibrosis in Asian patients with MAFLD
Shen et al. Relationship between obesity, metabolic syndrome, and nonalcoholic fatty liver disease in the elderly agricultural and fishing population of Taiwan
US20190265241A1 (en) Method of diagnosis of non-alcoholic fatty liver diseases
Schulz et al. Identifying high-risk NASH patients: what we know so far
Procino et al. Reducing NAFLD‐screening time: A comparative study of eight diagnostic methods offering an alternative to ultrasound scans
Hong et al. Reader agreement and accuracy of ultrasound features for hepatic steatosis
Zhang et al. A newly noninvasive model for prediction of non-alcoholic fatty liver disease: utility of serum prolactin levels
Festi et al. Non-invasive diagnostic approach to non-alcoholic fatty liver disease: current evidence and future perspectives
Tang et al. N-terminal propeptide of type 3 collagen-based sequential algorithm can identify high-risk steatohepatitis and fibrosis in MAFLD
RU2523661C1 (en) Method of screening hepatic disorders, and system for implementing method
JP7170368B2 (en) Disease risk assessment method, disease risk assessment device, and disease risk assessment program
Lu et al. Adiponectin–leptin ratio for the early detection of lean non-alcoholic fatty liver disease independent of insulin resistance
Frankland et al. Diagnostic performance of ultrasound hepatorenal index for the diagnosis of hepatic steatosis in children
Sourianarayanane et al. Accuracy of ultrasonographic fatty liver index using point-of-care ultrasound in stratifying non-alcoholic fatty liver disease patients
CN116705286B (en) Prediction method based on metabolism-related fatty liver disease, electronic equipment and storage medium
CN115602319B (en) Noninvasive hepatic fibrosis assessment device
JP2018512208A (en) Computerized optical analysis method of MR (magnetic resonance) images for quantifying or determining liver lesions
LU504525B1 (en) Prediction method, electronic device and storage medium for metabolic associated fatty liver disease based on creatinine to cystatin c ratio
Qiu et al. The newly proposed alanine aminotransferase to high-density lipoprotein cholesterol ratio has shown effectiveness in identifying non-alcoholic fatty liver disease
De Almeida et al. Evaluation of an instrument for screening patients at risk for chronic kidney disease: testing SCORED (Screening for Occult Renal Disease) in a Portuguese population

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant