WO2022245342A1 - Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique - Google Patents
Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique Download PDFInfo
- Publication number
- WO2022245342A1 WO2022245342A1 PCT/US2021/033084 US2021033084W WO2022245342A1 WO 2022245342 A1 WO2022245342 A1 WO 2022245342A1 US 2021033084 W US2021033084 W US 2021033084W WO 2022245342 A1 WO2022245342 A1 WO 2022245342A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- kidney disease
- disorder
- subject
- sample
- bodily
- Prior art date
Links
- 208000017169 kidney disease Diseases 0.000 title claims abstract description 623
- 238000000034 method Methods 0.000 title claims abstract description 311
- 230000014509 gene expression Effects 0.000 title claims abstract description 114
- 238000001514 detection method Methods 0.000 title abstract description 3
- 238000010195 expression analysis Methods 0.000 title description 9
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 211
- 238000012545 processing Methods 0.000 claims abstract description 62
- 239000000523 sample Substances 0.000 claims description 332
- 208000007342 Diabetic Nephropathies Diseases 0.000 claims description 250
- 208000033679 diabetic kidney disease Diseases 0.000 claims description 250
- 210000002700 urine Anatomy 0.000 claims description 118
- 238000004422 calculation algorithm Methods 0.000 claims description 113
- 229920002477 rna polymer Polymers 0.000 claims description 99
- 239000000090 biomarker Substances 0.000 claims description 77
- 238000011282 treatment Methods 0.000 claims description 76
- 239000002299 complementary DNA Substances 0.000 claims description 72
- 208000020832 chronic kidney disease Diseases 0.000 claims description 63
- 239000003814 drug Substances 0.000 claims description 62
- 229940079593 drug Drugs 0.000 claims description 59
- 238000012163 sequencing technique Methods 0.000 claims description 56
- 238000010801 machine learning Methods 0.000 claims description 49
- 238000012549 training Methods 0.000 claims description 45
- 238000012360 testing method Methods 0.000 claims description 42
- 230000035945 sensitivity Effects 0.000 claims description 34
- 210000004027 cell Anatomy 0.000 claims description 23
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 21
- 230000008030 elimination Effects 0.000 claims description 21
- 238000003379 elimination reaction Methods 0.000 claims description 21
- 239000000041 non-steroidal anti-inflammatory agent Substances 0.000 claims description 21
- 229940021182 non-steroidal anti-inflammatory drug Drugs 0.000 claims description 21
- 201000005206 focal segmental glomerulosclerosis Diseases 0.000 claims description 20
- 231100000854 focal segmental glomerulosclerosis Toxicity 0.000 claims description 20
- 230000002441 reversible effect Effects 0.000 claims description 20
- 238000012706 support-vector machine Methods 0.000 claims description 20
- 230000008569 process Effects 0.000 claims description 19
- 206010037596 Pyelonephritis Diseases 0.000 claims description 18
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 claims description 18
- 206010020772 Hypertension Diseases 0.000 claims description 17
- 210000003734 kidney Anatomy 0.000 claims description 17
- 238000003745 diagnosis Methods 0.000 claims description 16
- 206010039073 rheumatoid arthritis Diseases 0.000 claims description 16
- 208000015181 infectious disease Diseases 0.000 claims description 15
- 208000000913 Kidney Calculi Diseases 0.000 claims description 14
- 206010029148 Nephrolithiasis Diseases 0.000 claims description 14
- 230000006378 damage Effects 0.000 claims description 14
- 210000004369 blood Anatomy 0.000 claims description 13
- 239000008280 blood Substances 0.000 claims description 13
- 230000001419 dependent effect Effects 0.000 claims description 13
- 210000001519 tissue Anatomy 0.000 claims description 13
- 206010018372 Glomerulonephritis membranous Diseases 0.000 claims description 12
- 206010018374 Glomerulonephritis minimal lesion Diseases 0.000 claims description 12
- 208000010159 IgA glomerulonephritis Diseases 0.000 claims description 12
- 206010021263 IgA nephropathy Diseases 0.000 claims description 12
- 208000004883 Lipoid Nephrosis Diseases 0.000 claims description 12
- 208000005777 Lupus Nephritis Diseases 0.000 claims description 12
- 241000208125 Nicotiana Species 0.000 claims description 12
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 12
- 235000005911 diet Nutrition 0.000 claims description 12
- 230000037213 diet Effects 0.000 claims description 12
- 238000007477 logistic regression Methods 0.000 claims description 12
- 201000008350 membranous glomerulonephritis Diseases 0.000 claims description 12
- 231100000855 membranous nephropathy Toxicity 0.000 claims description 12
- 238000013528 artificial neural network Methods 0.000 claims description 11
- 102000004169 proteins and genes Human genes 0.000 claims description 11
- 230000004580 weight loss Effects 0.000 claims description 11
- XGWFJBFNAQHLEF-UHFFFAOYSA-N 9-anthroic acid Chemical compound C1=CC=C2C(C(=O)O)=C(C=CC=C3)C3=CC2=C1 XGWFJBFNAQHLEF-UHFFFAOYSA-N 0.000 claims description 10
- 208000025760 Benign familial haematuria Diseases 0.000 claims description 10
- 208000004990 Cardiorenal syndrome Diseases 0.000 claims description 10
- 208000026292 Cystic Kidney disease Diseases 0.000 claims description 10
- WHXSMMKQMYFTQS-UHFFFAOYSA-N Lithium Chemical compound [Li] WHXSMMKQMYFTQS-UHFFFAOYSA-N 0.000 claims description 10
- 208000034578 Multiple myelomas Diseases 0.000 claims description 10
- 208000013901 Nephropathies and tubular disease Diseases 0.000 claims description 10
- 206010029155 Nephropathy toxic Diseases 0.000 claims description 10
- 206010035226 Plasma cell myeloma Diseases 0.000 claims description 10
- 208000004531 Renal Artery Obstruction Diseases 0.000 claims description 10
- 206010038378 Renal artery stenosis Diseases 0.000 claims description 10
- 206010038423 Renal cyst Diseases 0.000 claims description 10
- 206010047115 Vasculitis Diseases 0.000 claims description 10
- 206010002022 amyloidosis Diseases 0.000 claims description 10
- 206010014665 endocarditis Diseases 0.000 claims description 10
- 238000007636 ensemble learning method Methods 0.000 claims description 10
- 201000006334 interstitial nephritis Diseases 0.000 claims description 10
- 238000012417 linear regression Methods 0.000 claims description 10
- 229910052744 lithium Inorganic materials 0.000 claims description 10
- 231100000417 nephrotoxicity Toxicity 0.000 claims description 10
- 230000007694 nephrotoxicity Effects 0.000 claims description 10
- 208000030761 polycystic kidney disease Diseases 0.000 claims description 10
- 238000007637 random forest analysis Methods 0.000 claims description 10
- 208000037978 tubular injury Diseases 0.000 claims description 10
- 230000010024 tubular injury Effects 0.000 claims description 10
- 206010036790 Productive cough Diseases 0.000 claims description 9
- 230000037180 bone health Effects 0.000 claims description 9
- 210000001175 cerebrospinal fluid Anatomy 0.000 claims description 9
- 235000012000 cholesterol Nutrition 0.000 claims description 9
- 201000000523 end stage renal failure Diseases 0.000 claims description 9
- 239000012530 fluid Substances 0.000 claims description 9
- 230000002641 glycemic effect Effects 0.000 claims description 9
- 230000036454 renin-angiotensin system Effects 0.000 claims description 9
- 210000003296 saliva Anatomy 0.000 claims description 9
- 210000000582 semen Anatomy 0.000 claims description 9
- 210000002966 serum Anatomy 0.000 claims description 9
- 210000003802 sputum Anatomy 0.000 claims description 9
- 208000024794 sputum Diseases 0.000 claims description 9
- 210000004243 sweat Anatomy 0.000 claims description 9
- 206010055171 Hypertensive nephropathy Diseases 0.000 claims description 8
- 206010023424 Kidney infection Diseases 0.000 claims description 8
- 231100000268 induced nephrotoxicity Toxicity 0.000 claims description 8
- 208000028208 end stage renal disease Diseases 0.000 claims description 7
- 208000019206 urinary tract infection Diseases 0.000 claims description 7
- SNICXCGAKADSCV-JTQLQIEISA-N (-)-Nicotine Chemical compound CN1CCC[C@H]1C1=CC=CN=C1 SNICXCGAKADSCV-JTQLQIEISA-N 0.000 claims description 6
- 238000010208 microarray analysis Methods 0.000 claims description 6
- 229960002715 nicotine Drugs 0.000 claims description 6
- SNICXCGAKADSCV-UHFFFAOYSA-N nicotine Natural products CN1CCCC1C1=CC=CN=C1 SNICXCGAKADSCV-UHFFFAOYSA-N 0.000 claims description 6
- 208000005176 Hepatitis C Diseases 0.000 claims description 4
- 201000004681 Psoriasis Diseases 0.000 claims description 4
- 238000006243 chemical reaction Methods 0.000 claims description 4
- 208000019622 heart disease Diseases 0.000 claims description 4
- 208000002672 hepatitis B Diseases 0.000 claims description 4
- 238000003757 reverse transcription PCR Methods 0.000 claims 2
- 125000003729 nucleotide group Chemical group 0.000 description 71
- 239000002773 nucleotide Substances 0.000 description 68
- 239000012472 biological sample Substances 0.000 description 61
- 150000007523 nucleic acids Chemical class 0.000 description 47
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 44
- 102000039446 nucleic acids Human genes 0.000 description 43
- 108020004707 nucleic acids Proteins 0.000 description 43
- 108020004414 DNA Proteins 0.000 description 38
- 102000053602 DNA Human genes 0.000 description 38
- 201000010099 disease Diseases 0.000 description 28
- 230000001225 therapeutic effect Effects 0.000 description 25
- 238000003752 polymerase chain reaction Methods 0.000 description 23
- 206010012601 diabetes mellitus Diseases 0.000 description 22
- 238000003556 assay Methods 0.000 description 20
- 230000015654 memory Effects 0.000 description 20
- 238000003860 storage Methods 0.000 description 19
- 208000035475 disorder Diseases 0.000 description 16
- 230000008685 targeting Effects 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 239000013615 primer Substances 0.000 description 13
- 238000003753 real-time PCR Methods 0.000 description 13
- UUUHXMGGBIUAPW-UHFFFAOYSA-N 1-[1-[2-[[5-amino-2-[[1-[5-(diaminomethylideneamino)-2-[[1-[3-(1h-indol-3-yl)-2-[(5-oxopyrrolidine-2-carbonyl)amino]propanoyl]pyrrolidine-2-carbonyl]amino]pentanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-3-methylpentanoyl]pyrrolidine-2-carbon Chemical compound C1CCC(C(=O)N2C(CCC2)C(O)=O)N1C(=O)C(C(C)CC)NC(=O)C(CCC(N)=O)NC(=O)C1CCCN1C(=O)C(CCCN=C(N)N)NC(=O)C1CCCN1C(=O)C(CC=1C2=CC=CC=C2NC=1)NC(=O)C1CCC(=O)N1 UUUHXMGGBIUAPW-UHFFFAOYSA-N 0.000 description 12
- 102000004270 Peptidyl-Dipeptidase A Human genes 0.000 description 12
- 108090000882 Peptidyl-Dipeptidase A Proteins 0.000 description 12
- 238000012544 monitoring process Methods 0.000 description 12
- 230000009471 action Effects 0.000 description 11
- 230000003321 amplification Effects 0.000 description 11
- 239000012634 fragment Substances 0.000 description 11
- 230000004547 gene signature Effects 0.000 description 11
- 238000003199 nucleic acid amplification method Methods 0.000 description 11
- 206010001580 Albuminuria Diseases 0.000 description 10
- 230000009274 differential gene expression Effects 0.000 description 10
- 235000018102 proteins Nutrition 0.000 description 10
- 239000002333 angiotensin II receptor antagonist Substances 0.000 description 9
- 229940125364 angiotensin receptor blocker Drugs 0.000 description 9
- 238000004891 communication Methods 0.000 description 9
- 239000003112 inhibitor Substances 0.000 description 9
- 239000012520 frozen sample Substances 0.000 description 8
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 8
- 238000002560 therapeutic procedure Methods 0.000 description 8
- 238000004393 prognosis Methods 0.000 description 7
- 208000016261 weight loss Diseases 0.000 description 7
- 238000013459 approach Methods 0.000 description 6
- 230000015556 catabolic process Effects 0.000 description 6
- 238000002591 computed tomography Methods 0.000 description 6
- 238000006731 degradation reaction Methods 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 238000011862 kidney biopsy Methods 0.000 description 6
- 239000003755 preservative agent Substances 0.000 description 6
- 210000005166 vasculature Anatomy 0.000 description 6
- 102000008873 Angiotensin II receptor Human genes 0.000 description 5
- 108050000824 Angiotensin II receptor Proteins 0.000 description 5
- 206010061218 Inflammation Diseases 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 230000000004 hemodynamic effect Effects 0.000 description 5
- 230000004054 inflammatory process Effects 0.000 description 5
- 238000007726 management method Methods 0.000 description 5
- 238000002493 microarray Methods 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 230000036542 oxidative stress Effects 0.000 description 5
- 230000000391 smoking effect Effects 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- 206010004446 Benign prostatic hyperplasia Diseases 0.000 description 4
- 108060006698 EGF receptor Proteins 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- 102000004877 Insulin Human genes 0.000 description 4
- 108090001061 Insulin Proteins 0.000 description 4
- 206010028980 Neoplasm Diseases 0.000 description 4
- 208000008589 Obesity Diseases 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 238000003559 RNA-seq method Methods 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 238000009534 blood test Methods 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000002405 diagnostic procedure Methods 0.000 description 4
- 238000007847 digital PCR Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 229940125396 insulin Drugs 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 235000020824 obesity Nutrition 0.000 description 4
- 230000002250 progressing effect Effects 0.000 description 4
- 238000011002 quantification Methods 0.000 description 4
- 238000010839 reverse transcription Methods 0.000 description 4
- 239000004055 small Interfering RNA Substances 0.000 description 4
- 208000024891 symptom Diseases 0.000 description 4
- 238000002604 ultrasonography Methods 0.000 description 4
- 230000002485 urinary effect Effects 0.000 description 4
- 108091093088 Amplicon Proteins 0.000 description 3
- 206010061818 Disease progression Diseases 0.000 description 3
- 206010061481 Renal injury Diseases 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 3
- 238000013500 data storage Methods 0.000 description 3
- 230000005750 disease progression Effects 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 238000003384 imaging method Methods 0.000 description 3
- 208000037806 kidney injury Diseases 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 230000009467 reduction Effects 0.000 description 3
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 2
- 239000005541 ACE inhibitor Substances 0.000 description 2
- 102100027211 Albumin Human genes 0.000 description 2
- 108010088751 Albumins Proteins 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102000004506 Blood Proteins Human genes 0.000 description 2
- 108010017384 Blood Proteins Proteins 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 206010013654 Drug abuse Diseases 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- XJLXINKUBYWONI-NNYOXOHSSA-N NADP zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-N 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 206010061876 Obstruction Diseases 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 208000002193 Pain Diseases 0.000 description 2
- 108091027967 Small hairpin RNA Proteins 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- 229940044094 angiotensin-converting-enzyme inhibitor Drugs 0.000 description 2
- 230000003460 anti-nuclear Effects 0.000 description 2
- 230000036772 blood pressure Effects 0.000 description 2
- 210000001124 body fluid Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000011976 chest X-ray Methods 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 238000012631 diagnostic technique Methods 0.000 description 2
- 230000002526 effect on cardiovascular system Effects 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 208000026278 immune system disease Diseases 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 210000003292 kidney cell Anatomy 0.000 description 2
- 201000006370 kidney failure Diseases 0.000 description 2
- 230000003907 kidney function Effects 0.000 description 2
- 238000002595 magnetic resonance imaging Methods 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 108091070501 miRNA Proteins 0.000 description 2
- 239000002679 microRNA Substances 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 239000002547 new drug Substances 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 208000038000 non-diabetic chronic kidney disease Diseases 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 238000002600 positron emission tomography Methods 0.000 description 2
- KMUONIBRACKNSN-UHFFFAOYSA-N potassium dichromate Chemical compound [K+].[K+].[O-][Cr](=O)(=O)O[Cr]([O-])(=O)=O KMUONIBRACKNSN-UHFFFAOYSA-N 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 230000003449 preventive effect Effects 0.000 description 2
- 238000000513 principal component analysis Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 239000013049 sediment Substances 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000009168 stem cell therapy Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 201000009032 substance abuse Diseases 0.000 description 2
- 208000011117 substance-related disease Diseases 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 230000037221 weight management Effects 0.000 description 2
- 101150028074 2 gene Proteins 0.000 description 1
- 101150033839 4 gene Proteins 0.000 description 1
- 208000000044 Amnesia Diseases 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 229940123208 Biguanide Drugs 0.000 description 1
- 229940127291 Calcium channel antagonist Drugs 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- RYGMFSIKBFXOCR-UHFFFAOYSA-N Copper Chemical compound [Cu] RYGMFSIKBFXOCR-UHFFFAOYSA-N 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 101100481408 Danio rerio tie2 gene Proteins 0.000 description 1
- 208000002249 Diabetes Complications Diseases 0.000 description 1
- 206010012655 Diabetic complications Diseases 0.000 description 1
- 102000015781 Dietary Proteins Human genes 0.000 description 1
- 108010010256 Dietary Proteins Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 229940127552 Glucagon-like Peptide-1 (GLP-1) Agonists Drugs 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 208000026139 Memory disease Diseases 0.000 description 1
- 206010027525 Microalbuminuria Diseases 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101100481410 Mus musculus Tek gene Proteins 0.000 description 1
- 206010028813 Nausea Diseases 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- BYPFEZZEUUWMEJ-UHFFFAOYSA-N Pentoxifylline Chemical compound O=C1N(CCCCC(=O)C)C(=O)N(C)C2=C1N(C)C=N2 BYPFEZZEUUWMEJ-UHFFFAOYSA-N 0.000 description 1
- 108010004729 Phycoerythrin Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108091028733 RNTP Proteins 0.000 description 1
- 208000035977 Rare disease Diseases 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102000000070 Sodium-Glucose Transport Proteins Human genes 0.000 description 1
- 108010080361 Sodium-Glucose Transport Proteins Proteins 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 108020004566 Transfer RNA Proteins 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- OFCNXPDARWKPPY-UHFFFAOYSA-N allopurinol Chemical compound OC1=NC=NC2=C1C=NN2 OFCNXPDARWKPPY-UHFFFAOYSA-N 0.000 description 1
- 229960003459 allopurinol Drugs 0.000 description 1
- 230000003276 anti-hypertensive effect Effects 0.000 description 1
- 238000011861 anti-inflammatory therapy Methods 0.000 description 1
- 230000000417 anti-transforming effect Effects 0.000 description 1
- 230000003078 antioxidant effect Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 239000002876 beta blocker Substances 0.000 description 1
- 229940030611 beta-adrenergic blocking agent Drugs 0.000 description 1
- 150000004283 biguanides Chemical class 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 239000000480 calcium channel blocker Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 238000000423 cell based assay Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000004624 confocal microscopy Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 235000021245 dietary protein Nutrition 0.000 description 1
- 239000002934 diuretic Substances 0.000 description 1
- 229940030606 diuretics Drugs 0.000 description 1
- 230000035622 drinking Effects 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 206010016256 fatigue Diseases 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000003633 gene expression assay Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 230000006984 memory degeneration Effects 0.000 description 1
- 208000023060 memory loss Diseases 0.000 description 1
- WSFSSNUMVMOOMR-NJFSPNSNSA-N methanone Chemical compound O=[14CH2] WSFSSNUMVMOOMR-NJFSPNSNSA-N 0.000 description 1
- 230000008693 nausea Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 230000036407 pain Effects 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 229960001476 pentoxifylline Drugs 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000012175 pyrosequencing Methods 0.000 description 1
- 238000005057 refrigeration Methods 0.000 description 1
- 230000001172 regenerating effect Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 239000010454 slate Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000009580 stem-cell therapy Methods 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000005382 thermal cycling Methods 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
Definitions
- Diabetic nephropathy is a major cause of kidney disease and is the most common cause of end-stage renal disease (ESRD). In addition, DN is linked to higher cardiovascular and all-cause morbidity and mortality, so timely diagnosis and treatment are critical. Diabetic nephropathy is a diabetic kidney disease that generally refers to kidney damage that results from having high blood glucose levels due to diabetes. Diabetic nephropathy progresses slowly. With efficient early treatment, one can slow or even stop the progression of the disease.
- DN may be associated with measurable biomarkers, such as albuminuria and/or low eGFR, in bodily samples of subjects; however, these biomarkers may be non-specific to DN, and may be attributable to other disease or disorders, such as diabetes, hypertension, IgA nephropathy, membranous nephropathy, lupus nephritis, minimal change disease, rheumatoid arthritis, medicine such as NSAIDs use, smoking, excessive alcohol use, drugs of abuse, obesity, urinary tract infection, kidney stone, benign prostate hyperplasia (BPH), etc.
- diseases with controlled diabetes may also have diabetic nephropathy, whereas poorly controlled diabetic patients may have little kidney damage.
- patients with diabetic nephropathy may also have other type(s) of kidney disease or disorder.
- DN is an under-diagnosed and mis-diagnosed disease among patients.
- DN may be under-diagnosed because diagnostics methods such as renal biopsy can be risky (e.g., with a 1.8% mortality rate), expensive, and time-consuming; therefore, many patients opt out of such diagnostic testing.
- DN may be mis-diagnosed because diagnostics methods may lack adequate sensitivity and specificity.
- a urinary albumin assay may be insensitive and non-specific.
- imaging tests such as X-rays and ultrasound diagnostics to check the structure and size of a subject’s kidney may be time-consuming and indirect, and other panel tests such as urinary sediment test, urine protein electrophoresis (UPEP), serum protein electrophoresis (SPEP), urine blood test, antinuclear antibody (ANA) test, HBV test, HCV test, HIV test may provide limited information, and therefore are indirect and inefficient.
- In vitro diagnostic techniques such as those based on proteomics, genomics, and protein biomarkers, may face challenges in accurately detecting, assessing, and monitoring kidney disease or disorder at high sensitivity, specificity, and accuracy.
- albuminuria especially at low levels, may be confounded with many other factors such as hypertension, smoking, alcohol use, drug abuse, obesity, infection, obstruction, etc. Therefore, many patients may miss the best opportunity for early medical intervention or may receive treatment that is unrelated to the cause of the disease. Receiving such unnecessary and/or ineffective treatment may be expensive, time consuming, and cause delays in providing other effective treatments to patients.
- kidney disease or disorder e.g., DN
- the present disclosure provides methods, systems, and kits for detecting kidney disease or disorders by processing biological samples (e.g., tissue samples, cell samples, and/or bodily fluid samples) obtained from or derived from a subject.
- biological samples e.g., tissue samples, cell samples, and/or bodily fluid samples
- nucleic acids, proteins, or cells of biological samples may be analyzed.
- Biological samples obtained from subjects may be analyzed to measure a presence, absence, or relative assessment of the kidney disease or disorder.
- the analysis may be performed at a set of genomic regions, such as kidney disease-associated genes or genomic loci.
- the subjects may include subjects with kidney disease or disorder (e.g., kidney disease or disorder patients) and subjects without kidney disease or disorder (e.g., normal or healthy controls).
- Methods of the present disclosure may present numerous advantages over current methods, including: ease, safety, and non-invasiveness of sample collection from subjects, possibility of repeated assays, the use of urine samples which have kidney cells suitable for analysis, a direct method of analysis of kidney injury, suitability for monitoring disease progression and treatment efficacy, suitability for sample collection in a home setting, the ability to perform testing without detailed medical history of a subject, and the ability to detect early diabetic changes (e.g., for asymptomatic subjects).
- kidney disease or disorder can be accurately detected using an assay with high sensitivity and specificity in biological samples (e.g., urine samples).
- the urine-based assay can apply machine learning algorithm to analyze a set of biomarkers to accurately distinguish kidney disease or disorder samples from control samples across various stages (e.g., early-stage, mid-stage, or late-stage) of kidney disease or disorder.
- the urine-based assay may offer high specificity, thereby facilitating the non- invasive application of kidney disease or disorder associated biomarkers for treatment monitoring of kidney disease or disorder patients.
- the urine-based assay may offer higher sensitivity and specificity than those of renal biopsy, currently considered as the gold standard for definitive diagnosis of kidney diseases.
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) analyzing the bodily sample to yield a data set comprising one or more levels of gene expression products in the bodily sample, which one or more levels of gene expression products correspond to a set of genes associated with a kidney disease or disorder; (b) computer processing the data set from (a) to determine a presence or an elevated risk of the kidney disease or disorder in the subject at an accuracy of at least about 80%; and (c) electronically outputting a report that identifies the presence or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- the bodily sample is selected from the group consisting of: a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, and a tissue sample.
- the bodily sample is the urine sample.
- the urine sample is a fresh urine sample.
- the urine sample is a frozen sample (e.g., freshly frozen sample).
- the urine sample is a preserved urine sample (e.g., stored at room temperature in preservatives to prevent degradation).
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and sequencing at least a portion of the cDNA molecules to yield sequencing reads.
- sequencing reads are mapped to a reference sequence (e.g., a reference genome such as a human genome) to yield the data set, wherein the data set comprises counts of gene transcripts.
- each count of a transcript is indicative of a gene expression event.
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and assaying at least a portion of the cDNA molecules by Real-time polymerase chain reaction (RT-PCR, also called qPCR) to yield the data set.
- RNA ribonucleic acid
- cDNA complementary deoxyribonucleic acid
- RT-PCR Real-time polymerase chain reaction
- qPCR Real-time polymerase chain reaction
- the data set comprises cycle threshold (Ct) values, which are inversely proportional to gene expression levels.
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and assaying at least a portion of the cDNA molecules by microarray analysis (e.g. Affymetrix Microarray) to yield the data set.
- the data set comprises counts of gene transcripts. In some embodiments, each count is indicative of a gene expression event.
- (a) comprises hybridizing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample using specific probe sets, and assaying the hybridized RNA molecules using a Luminex platform to yield the data set.
- the data set comprises counts of gene transcripts.
- (a) comprises selectively enriching at least a portion of the cDNA molecules or the RNA molecules for a set of genomic loci associated with the kidney disease or disorder. In some embodiments, (a) comprises amplifying at least a portion of the cDNA molecules or the RNA molecules. In some embodiments, (a) comprises aligning at least a portion of the sequencing reads to a reference sequence. In some embodiments, (a) comprises generating counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome. In some embodiments, the counts of gene transcripts are normalized to generate normalized counts of gene transcripts for downstream differential gene expression analysis.
- the normalizing comprises determining CPM (counts per million), TPM (transcripts per kilobase million), or RPKM/FPKM (reads/fragments per kilobase of exon per million reads/fragments mapped).
- the kidney disease or disorder is selected from the group consisting of: early-stage kidney disease, mid-stage kidney disease, late-stage kidney disease, end-stage kidney disease, asymptomatic kidney disease, diabetic nephropathy, hypertensive nephropathy, IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSAIDs induced nephrotoxicity, thin basement membrane nephropathy, amyloidosis, ANCA vasculitis related to endocarditis and other infections, cardiorenal syndrome, IgG4 nephropathy, interstitial nephritis, lithium nephrotoxicity, lupus nephritis, multiple myeloma, polycystic kidney disease, pyelonephritis (kidney infection), renal artery stenosis, renal cyst, rheumatoid arthritis-associated renal
- the kidney disease or disorder is diabetic nephropathy.
- diabetic nephropathy is early-stage diabetic nephropathy.
- the subject is asymptomatic for the diabetic nephropathy.
- (b) comprises using a trained algorithm to process the data set.
- the trained algorithm comprises a trained machine learning algorithm.
- the trained machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a naive Bayes classification, a linear regression, a quantile regression, a logistic regression, a nonlinear regression, a random forest, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a recursive feature elimination algorithm (RFE), and any combination thereof.
- the trained machine learning algorithm comprises the recursive feature elimination (RFE) algorithm.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having no kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having other types of kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- (a) comprises comparing one or more levels of gene expression products to a reference.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having no kidney disease or disorder.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having other types of kidney disease or disorder.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a positive predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a negative predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at an Area Under the Curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about
- AUC Area Under the Curve
- the method further comprises identifying a clinical intervention for the subject based at least in part on the presence or the elevated risk of the kidney disease or disorder determined in (b).
- the clinical intervention is selected from the group consisting of: a drug treatment, intensive glycemic control, high blood pressure control, lower high cholesterol, foster bone health, diet control, lifestyle changes, weight loss, exercise, tobacco cessation, manage alcohol intake, reduce/quit drugs of abuse, and avoiding NSAIDs.
- the drug relies on the blockage of the renin- angiotensin aldosterone system (RAAS, e.g., angiotensin-converting enzyme (ACE) inhibitors and angiotensin II receptor blockers (ARBs)).
- RAAS renin- angiotensin aldosterone system
- ACE angiotensin-converting enzyme
- ARBs angiotensin II receptor blockers
- the drug is an experimental therapy, such as a drug targeting vasculature/hemodynamic effects, a drug-targeting inflammation, and a drug targeting oxidative stress.
- (b) comprises analyzing a first set of genes that differentially distinguishes between a first kidney disease or disorder and negative (NEG) subjects and a second set of genes that differentially distinguishes between the first kidney disease or disorder and a second kidney disease or disorder.
- (b) comprises analyzing a first set of genes that differentially distinguishes between diabetic nephropathy (DN) and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and other chronic kidney diseases (CKD).
- the first set of genes is selected from the group of genes listed in Table 3 and Table 5, and wherein the second set of genes is selected from the group of genes listed in Table 4 and Table 6.
- (b) comprises generating a first DN vs. NEG score based on the first set of genes and a second DN vs. CKD score based on the second set of genes.
- the first DN vs. NEG score is indicative of glomerular injury when greater than 0.5, or is indicative of tubular injury when less than 0.5.
- (b) comprises analyzing different male-specific or female- specific sets of genes based on a gender of the subject.
- the method further comprises removing at least a subset of negative (NEG) subjects who have a pre-determined characteristic (e.g., are obese, are morbidly obese, are nicotine dependent, are alcohol dependent, are drugs-of-abuse dependent, have kidney stone, have severe hypertension, have urinary tract infection, have heart diseases, have hepatitis B, have hepatitis C, have HIV, have psoriasis, have rheumatoid arthritis, use NSAIDs, etc.), and optionally replacing with additional NEG subjects who do not have the pre-determined characteristic, to generate a modified set of NEG-X subjects (e.g., where X is the pre-determined characteristic).
- a pre-determined characteristic e.g., are obese, are morbidly obese, are nicotine dependent, are alcohol dependent, are drugs-of-abuse dependent, have kidney stone, have severe hypertension, have urinary tract infection, have heart diseases, have hepatitis B, have hepatitis C, have HIV, have
- the method further comprises removing at least a subset of other chronic kidney disease (CKD) subjects who have a pre-determined characteristic (e.g., are obese, are morbidly obese, are nicotine dependent, are alcohol dependent, are drugs-of-abuse dependent, have kidney stone, have severe hypertension, have urinary tract infection, have heart diseases, have hepatitis B, have hepatitis C, have HIV, have psoriasis, have rheumatoid arthritis, use NSAIDs, have IgA nephropathy, have membranous nephropathy, have minimal change disease, have focal segmental glomerulosclerosis (FSGS), have thin basement membrane nephropathy, have amyloidosis, have ANCA vasculitis related to endocarditis and other infections, have cardiorenal syndrome, have IgG4 nephropathy, have interstitial nephritis, have lithium nephrotoxicity, have lupus
- CKD chronic kidney disease
- the DN vs. CKD-Y score is much higher than the DN vs. CKD score (e.g., > 0.1), that is indicative of the kidney damage being a result of the pre-determined characteristic Y.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject. In some embodiments, the method further comprises determining a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject, and electronically outputting a report that identifies the presence, the absence, or the elevated risk of the another type of kidney disease or disorder in the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject.
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) analyzing the bodily sample to yield a data set comprising one or more levels of gene expression products in the bodily sample, which one or more levels of gene expression products correspond to a set of genes associated with a kidney disease or disorder; (b) computer processing the data set from (a) to determine a presence, an absence, or an elevated risk of the kidney disease or disorder in the subject at a sensitivity of at least about 80%; and (c) electronically outputting a report that identifies the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- the bodily sample is selected from the group consisting of: a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, and a tissue sample.
- the bodily sample is the urine sample.
- the urine sample is a fresh urine sample.
- the urine sample is a frozen sample (e.g., freshly frozen sample).
- the urine sample is a preserved urine sample (e.g., stored at room temperature in preservatives to prevent degradation).
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and sequencing at least a portion of the cDNA molecules to yield sequencing reads.
- sequencing reads are mapped to a reference sequence (e.g., a reference genome such as a human genome) to yield the data set, wherein the data set comprises counts of gene transcripts.
- each count of a transcript is indicative of a gene expression event.
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and assaying at least a portion of the cDNA molecules by Real-time polymerase chain reaction (RT-PCR, also called qPCR) to yield the data set.
- RNA ribonucleic acid
- cDNA complementary deoxyribonucleic acid
- RT-PCR Real-time polymerase chain reaction
- qPCR Real-time polymerase chain reaction
- the data set comprises cycle threshold (Ct) values, which are inversely proportional to gene expression levels.
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and assaying at least a portion of the cDNA molecules by microarray analysis (e.g. Affymetrix Microarray) to yield the data set.
- the data set comprises counts of gene transcripts. In some embodiments, each count is indicative of a gene expression event.
- (a) comprises hybridizing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample using specific probe sets, and assaying the hybridized RNA molecules using a Luminex platform to yield the data set.
- the data set comprises counts of gene transcripts.
- (a) comprises selectively enriching at least a portion of the cDNA molecules or the RNA molecules for a set of genomic loci associated with the kidney disease or disorder. In some embodiments, (a) comprises amplifying at least a portion of the cDNA molecules or the RNA molecules. In some embodiments, (a) comprises aligning at least a portion of the sequencing reads to a reference sequence. In some embodiments, (a) comprises generating counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome. In some embodiments, the counts of gene transcripts are normalized to generate normalized counts of gene transcripts for downstream differential gene expression analysis.
- the normalizing comprises determining CPM (counts per million), TPM (transcripts per kilobase million), or RPKM/FPKM (reads/fragments per kilobase of exon per million reads/fragments mapped).
- the kidney disease or disorder is selected from the group consisting of: early-stage kidney disease, mid-stage kidney disease, late-stage kidney disease, end-stage kidney disease, asymptomatic kidney disease, diabetic nephropathy, hypertensive nephropathy, IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSAIDs induced nephrotoxicity, thin basement membrane nephropathy, amyloidosis, ANCA vasculitis related to endocarditis and other infections, cardiorenal syndrome, IgG4 nephropathy, interstitial nephritis, lithium nephrotoxicity, lupus nephritis, multiple myeloma, polycystic kidney disease, pyelonephritis (kidney infection), renal artery stenosis, renal cyst, rheumatoid arthritis-associated renal
- the kidney disease or disorder is diabetic nephropathy.
- diabetic nephropathy is early-stage diabetic nephropathy.
- the subject is asymptomatic for the diabetic nephropathy.
- (b) comprises using a trained algorithm to process the data set.
- the trained algorithm comprises a trained machine learning algorithm.
- the trained machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a naive Bayes classification, a linear regression, a quantile regression, a logistic regression, a nonlinear regression, a random forest, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a recursive feature elimination algorithm (RFE), and any combination thereof.
- the trained machine learning algorithm comprises the recursive feature elimination (RFE) algorithm.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having other types of kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- (a) comprises comparing one or more levels of gene expression products to a reference.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having no kidney disease or disorder.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having other types of kidney disease or disorder.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a positive predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a negative predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at an Area Under the Curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about
- AUC Area Under the Curve
- the method further comprises identifying a clinical intervention for the subject based at least in part on the presence or the elevated risk of the kidney disease or disorder determined in (b).
- the clinical intervention is selected from the group consisting of: a drug treatment, intensive glycemic control, high blood pressure control, lower high cholesterol, foster bone health, diet control, lifestyle changes, weight loss, exercise, tobacco cessation, manage alcohol intake, reduce/quit drugs of abuse, and avoiding NSAIDs.
- the drug relies on the blockage of the renin- angiotensin aldosterone system (RAAS, e.g., angiotensin-converting enzyme (ACE) inhibitors and angiotensin II receptor blockers (ARBs)).
- RAAS renin- angiotensin aldosterone system
- ACE angiotensin-converting enzyme
- ARBs angiotensin II receptor blockers
- the drug is an experimental therapy, such as a drug targeting vasculature/hemodynamic effects, a drugs targeting inflammation, and a drug targeting oxidative stress.
- (b) comprises analyzing a first set of genes that differentially distinguishes between a first kidney disease or disorder and negative (NEG) subjects and a second set of genes that differentially distinguishes between the first kidney disease or disorder and a second kidney disease or disorder.
- (b) comprises analyzing a first set of genes that differentially distinguishes between diabetic nephropathy (DN) and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and other chronic kidney diseases (CKD).
- the first set of genes is selected from the group of genes listed in Table 3 and Table 5, and wherein the second set of genes is selected from the group of genes listed in Table 4 and Table 6.
- (b) comprises generating a first DN vs. NEG score based on the first set of genes and a second DN vs. CKD score based on the second set of genes.
- the first DN vs. NEG score is indicative of glomerular injury when greater than 0.5, or is indicative of tubular injury when less than 0.5.
- (b) comprises analyzing different male-specific or female- specific sets of genes based on a gender of the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject. In some embodiments, the method further comprises determining a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject, and electronically outputting a report that identifies the presence, the absence, or the elevated risk of the another type of kidney disease or disorder in the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject.
- the sensitivity and specificity of determining the presence of the elevated risk of the kidney disease or disorder in the subject comprises a percentage of independent test samples associated with presence or elevated risk of the kidney disease or disorder that are correctly determined as having the presence or the elevated risk of the kidney disease or disorder.
- the method further comprises determining the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject at a specificity of at least about 80%. In some embodiments, the method further comprises (d) electronically outputting a report that identifies the presence or absence of another type of kidney disease or disorder in the subject.
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) sequencing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield sequencing reads indicative of counts of gene transcripts in the bodily sample, which counts of gene transcripts correspond to a set of genes associated with a kidney disease or disorder; (b) computer processing the counts of gene transcripts from (a) to determine a presence, an absence, or an elevated risk of the kidney disease or disorder in the subject; and (c) electronically outputting a report that identifies the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- RNA ribonucleic acid
- the bodily sample is selected from the group consisting of: a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, and a tissue sample.
- the bodily sample is the urine sample.
- the urine sample is a fresh urine sample.
- the urine sample is a frozen sample (e.g., freshly frozen sample).
- the urine sample is a preserved urine sample (e.g., stored at room temperature in preservatives to prevent degradation).
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and sequencing at least a portion of the cDNA molecules to yield sequencing reads.
- sequencing reads are mapped to a reference sequence (e.g., a reference genome such as a human genome) to yield the data set, wherein the data set comprises counts of gene transcripts.
- each count of a transcript is indicative of a gene expression event.
- (a) comprises selectively enriching at least a portion of the cDNA molecules for a set of genomic loci associated with the kidney disease or disorder.
- (a) comprises selectively enriching at least a portion of the cDNA molecules for a set of genomic loci associated with the kidney disease or disorder. In some embodiments, (a) comprises amplifying at least a portion of the cDNA molecules. In some embodiments, (a) comprises aligning at least a portion of the sequencing reads to a reference sequence to generate counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome. In some embodiments, (a) comprises generating counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome. In some embodiments, the counts of gene transcripts are normalized to generate normalized counts of gene transcripts for downstream differential gene expression analysis.
- the normalizing comprises determining CPM (counts per million), TPM (transcripts per kilobase million), or RPKM/FPKM (reads/fragments per kilobase of exon per million reads/fragments mapped).
- the kidney disease or disorder is selected from the group consisting of: early-stage kidney disease, mid-stage kidney disease, late-stage kidney disease, end-stage kidney disease, asymptomatic kidney disease, diabetic nephropathy, hypertensive nephropathy, IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSAIDs induced nephrotoxicity, thin basement membrane nephropathy, amyloidosis, ANCA vasculitis related to endocarditis and other infections, cardiorenal syndrome, IgG4 nephropathy, interstitial nephritis, lithium nephrotoxicity, lupus nephritis, multiple myeloma, polycystic kidney disease, pyelonephritis (kidney infection), renal artery stenosis, renal cyst, rheumatoid arthritis-associated renal
- the kidney disease or disorder is diabetic nephropathy.
- diabetic nephropathy is early-stage diabetic nephropathy.
- the subject is asymptomatic for the diabetic nephropathy.
- (b) comprises using a trained algorithm to process the data set.
- the trained algorithm comprises a trained machine learning algorithm.
- the trained machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a naive Bayes classification, a linear regression, a quantile regression, a logistic regression, a nonlinear regression, a random forest, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a recursive feature elimination algorithm (RFE), and any combination thereof.
- the trained machine learning algorithm comprises the recursive feature elimination (RFE) algorithm.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having no kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having other types of kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- (a) comprises comparing the counts of gene transcripts (e.g., one or more levels of gene expression products) to a reference.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having no kidney disease or disorder.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having other types of kidney disease or disorder.
- the method further comprises detecting the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject at a sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a positive predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a negative predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises identifying a clinical intervention for the subject based at least in part on the presence or the elevated risk of the kidney disease or disorder determined in (b).
- the clinical intervention is selected from the group consisting of: a drug treatment, intensive glycemic control, high blood pressure control, lower high cholesterol, foster bone health, diet control, lifestyle changes, weight loss, exercise, tobacco cessation, manage alcohol intake, reduce/quit drugs of abuse, and avoiding NSAIDs.
- the drug relies on the blockage of the renin- angiotensin aldosterone system (RAAS, e.g., angiotensin-converting enzyme (ACE) inhibitors and angiotensin II receptor blockers (ARBs)).
- RAAS renin- angiotensin aldosterone system
- ACE angiotensin-converting enzyme
- ARBs angiotensin II receptor blockers
- the drug is an experimental therapy, such as a drug targeting vasculature/hemodynamic effects, a drugs targeting inflammation, and a drug targeting oxidative stress.
- (b) comprises analyzing a first set of genes that differentially distinguishes between a first kidney disease or disorder and negative (NEG) subjects and a second set of genes that differentially distinguishes between the first kidney disease or disorder and a second kidney disease or disorder.
- (b) comprises analyzing a first set of genes that differentially distinguishes between diabetic nephropathy (DN) and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and other chronic kidney diseases (CKD).
- the first set of genes is selected from the group of genes listed in Table 3 and Table 5, and wherein the second set of genes is selected from the group of genes listed in Table 4 and Table 6.
- (b) comprises generating a first DN vs. NEG score based on the first set of genes and a second DN vs. CKD score based on the second set of genes.
- the first DN vs. NEG score is indicative of glomerular injury when greater than 0.5, or is indicative of tubular injury when less than 0.5.
- (b) comprises analyzing different male-specific or female- specific sets of genes based on a gender of the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject. In some embodiments, the method further comprises determining a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject, and electronically outputting a report that identifies the presence, the absence, or the elevated risk of the another type of kidney disease or disorder in the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject.
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) assaying a plurality of cells obtained or derived from the bodily sample to yield a data set comprising quantitative measures of a set of cell-based biomarkers comprising proteins associated with a kidney disease or disorder in the plurality of cells; (b) computer processing the data set from (a) to determine a presence, an absence, or an elevated risk of the kidney disease or disorder in the subject; and (c) electronically outputting a report that identifies the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- the method further comprises electronically outputting a report that identifies a presence, an absence, or an elevated risk of other types of kidney disease or disorder in the subject determined in (b).
- the bodily sample is selected from the group consisting of: a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, and a tissue sample.
- the bodily sample is the urine sample.
- the urine sample is a fresh urine sample, a frozen urine sample (e.g., freshly frozen urine sample), or a preserved urine sample (e.g., stored at room temperature in preservatives to prevent degradation).
- (a) comprises using a cellular assay selected from the group consisting of: ELISA, flow-cytometry, LC/MS, confocal microscopy.
- the set of cell-based biomarkers comprising proteins associated with the kidney disease or disorder comprises at least one protein selected from the group consisting of: proteins encoded by a genomic locus associated with the kidney disease or disorder.
- the kidney disease or disorder is selected from the group consisting of: early-stage kidney disease, mid stage kidney disease, late-stage kidney disease, end-stage kidney disease, asymptomatic kidney disease, diabetic nephropathy, hypertensive nephropathy, IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSAIDs induced nephrotoxicity, thin basement membrane nephropathy, amyloidosis, ANCA vasculitis related to endocarditis and other infections, cardiorenal syndrome, IgG4 nephropathy, interstitial nephritis, lithium nephrotoxicity, lupus nephritis, multiple myeloma, polycystic kidney disease, pyelonephritis (kidney infection), renal artery stenosis, renal cyst, rheumatoid arthritis-associated renal disease, and kidney stone
- the kidney disease or disorder is diabetic nephropathy.
- diabetic nephropathy is early-stage diabetic nephropathy.
- the subject is asymptomatic for the diabetic nephropathy.
- (b) comprises using a trained algorithm to process the data set.
- the trained algorithm comprises a trained machine learning algorithm.
- the trained machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a naive Bayes classification, a linear regression, a quantile regression, a logistic regression, a nonlinear regression, a random forest, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a recursive feature elimination algorithm (RFE), and any combination thereof.
- the trained machine learning algorithm comprises the recursive feature elimination algorithm.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having no kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of bodily samples from subjects having the type of kidney disease of interest (e.g. diabetic nephropathy) and a second set of bodily samples from subjects having other types of kidney disease or disorder, wherein the first set of bodily samples and the second set of bodily samples are different from the bodily sample of the subject.
- (a) comprises comparing the one or more levels of cell-based biomarkers to a reference.
- the reference corresponds to a set of cell-based biomarkers from a first set of bodily samples from subjects having the type of kidney disease or disorder (e.g. diabetic nephropathy) and/or a second set of bodily samples from subjects having no kidney disease or disorder.
- the reference corresponds to a set of gene expression products from a first set of bodily samples from subjects having the type of kidney disease or disorder of interest (e.g.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a positive predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at a negative predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the kidney disease or disorder in the subject at an Area Under the Curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about
- AUC Area Under the Curve
- the method further comprises identifying a clinical intervention for the subject based at least in part on the presence or the elevated risk of the kidney disease or disorder determined in (b).
- the clinical intervention is selected from the group consisting of: a drug treatment, intensive glycemic control, high blood pressure control, lower high cholesterol, foster bone health, diet control, lifestyle changes, weight loss, exercise, tobacco cessation, manage alcohol intake, reduce/quit drugs of abuse, and avoiding NSAIDs.
- the drug relies on the blockage of the renin- angiotensin aldosterone system (RAAS, e.g., angiotensin-converting enzyme (ACE) inhibitors and angiotensin II receptor blockers (ARBs)).
- RAAS renin- angiotensin aldosterone system
- ACE angiotensin-converting enzyme
- ARBs angiotensin II receptor blockers
- the drug is an experimental therapy, such as a drug targeting vasculature/hemodynamic effects, a drugs targeting inflammation, and a drug targeting oxidative stress.
- (b) comprises analyzing a first set of genes that differentially distinguishes between a first kidney disease or disorder and negative (NEG) subjects and a second set of genes that differentially distinguishes between the first kidney disease or disorder and a second kidney disease or disorder.
- (b) comprises analyzing a first set of genes that differentially distinguishes between diabetic nephropathy (DN) and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and other chronic kidney diseases (CKD).
- the first set of genes is selected from the group of genes listed in Table 3 and Table 5, and wherein the second set of genes is selected from the group of genes listed in Table 4 and Table 6.
- (b) comprises generating a first DN vs. NEG score based on the first set of genes and a second DN vs. CKD score based on the second set of genes.
- the first DN vs. NEG score is indicative of glomerular injury when greater than 0.5, or is indicative of tubular injury when less than 0.5.
- (b) comprises analyzing different male-specific or female- specific sets of genes based on a gender of the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject. In some embodiments, the method further comprises determining a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject, and electronically outputting a report that identifies the presence, the absence, or the elevated risk of the another type of kidney disease or disorder in the subject.
- the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject.
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) sequencing nucleic acid molecules obtained or derived from the bodily sample to yield data comprising one or more quantitative measures of a set of genes associated with a kidney disease or disorder, wherein the sequencing comprises using Illumina RNA prep with enrichment tagmentation kit, Illumina Truseq exome kit, Agilent sureselect target enrichment system, or KAPA RNA HyperPrep kit from Roche; (b) computer processing the data from (a) to determine a presence or an elevated risk of the kidney disease or disorder in the subject; and (c) electronically outputting a report that identifies the presence or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- the present disclosure provides a method for processing or analyzing a bodily sample of a subject, comprising: (a) assaying the bodily sample or a derivative thereof to yield data comprising one or more quantitative measures of a set of biomarkers associated with a kidney disease or disorder in the bodily sample or the derivative thereof, wherein the set of biomarkers comprises at least 5 biomarkers selected from the group consisting of: biomarkers listed in Table 3, biomarkers listed in Table 4, biomarkers listed in Table 5, and biomarkers listed in Table 6; (b) computer processing the data from (a) to determine a presence, an absence, or an elevated risk of the kidney disease or disorder in the subject; and (c) electronically outputting a report that identifies the presence, the absence, or the elevated risk of the kidney disease or disorder in the subject determined in (b).
- the present disclosure provides a kit for processing or analyzing a bodily sample of a subject, comprising a set of probes for identifying a presence, absence, or relative amount of a set of genomic regions associated with a kidney disease or disorder in the bodily sample or a derivative thereof, wherein the set of biomarkers comprises at least 5 biomarkers selected from the group consisting of: biomarkers listed in Table 3, biomarkers listed in Table 4, biomarkers listed in Table 5, and biomarkers listed in Table 6.
- the present disclosure provides a method of diagnosing a kidney disease or disorder of a subject, comprising: (a) assaying a bodily sample of the subject or a derivative thereof to yield data comprising one or more quantitative measures of a set of biomarkers associated with a kidney disease or disorder in the bodily sample or the derivative thereof, wherein the set of biomarkers comprises at least 5 biomarkers selected from the group consisting of: biomarkers listed in Table 3, biomarkers listed in Table 4, biomarkers listed in Table 5, and biomarkers listed in Table 6; and (b) providing a diagnosis of the kidney disease or disorder based on a comparison of the set of biomarkers to a set of reference values.
- the present disclosure provides a method for processing or analyzing a urine sample of a subject, comprising (a) sequencing ribonucleic acid (RNA) molecules obtained or derived from the urine sample to yield sequencing reads indicative of counts of gene transcripts in the bodily sample, which counts of gene transcripts correspond to a set of genes associated with diabetic nephropathy; (b) computer processing the counts of gene transcripts from (a) to determine a presence, an absence, or an elevated risk of the diabetic nephropathy in the subject; and (c) electronically outputting a report that identifies the presence, the absence, or the elevated risk of the diabetic nephropathy in the subject determined in (b).
- RNA ribonucleic acid
- the bodily sample is selected from the group consisting of: a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, and a tissue sample.
- the bodily sample is the urine sample.
- the urine sample is a fresh urine sample.
- the urine sample is a frozen sample (e.g., freshly frozen sample).
- the urine sample is a preserved urine sample (e.g., stored at room temperature in preservatives to prevent degradation).
- (a) comprises reverse transcribing ribonucleic acid (RNA) molecules obtained or derived from the bodily sample to yield complementary deoxyribonucleic acid (cDNA) molecules, and sequencing at least a portion of the cDNA molecules to yield sequencing reads.
- sequencing reads are mapped to a reference sequence (e.g., a reference genome such as a human genome) to yield the data set, wherein the data set comprises counts of gene transcripts.
- each count of a transcript is indicative of a gene expression event.
- (a) comprises selectively enriching at least a portion of the cDNA molecules for a set of genomic loci associated with the diabetic nephropathy.
- (a) comprises selectively enriching at least a portion of the cDNA molecules for a set of genomic loci associated with the diabetic nephropathy. In some embodiments, (a) comprises amplifying at least a portion of the cDNA molecules. In some embodiments, (a) comprises aligning at least a portion of the sequencing reads to a reference sequence to generate counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome. In some embodiments, (a) comprises generating counts of gene transcripts. In some embodiments, the reference sequence is at least a portion of a human reference genome.
- the counts of gene transcripts are normalized to generate normalized counts of gene transcripts for downstream differential gene expression analysis.
- the normalizing comprises determining CPM (counts per million), TPM (transcripts per kilobase million), or RPKM/FPKM (reads/fragments per kilobase of exon per million reads/fragments mapped).
- the set of genes associated with the diabetic nephropathy comprises at least one gene selected from the group consisting of: genes listed in Table 3, genes listed in Table 4, genes listed in Table 5, and genes listed in Table 6.
- the diabetic nephropathy comprises early-stage diabetic nephropathy, mid-stage diabetic nephropathy, late-stage diabetic nephropathy, end-stage diabetic nephropathy, or asymptomatic diabetic nephropathy.
- (b) comprises using a trained algorithm to process the data set.
- the trained algorithm comprises a trained machine learning algorithm.
- the trained machine learning algorithm is selected from the group consisting of: a support vector machine (SVM), a naive Bayes classification, a linear regression, a quantile regression, a logistic regression, a nonlinear regression, a random forest, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a recursive feature elimination algorithm (RFE), and any combination thereof.
- the trained machine learning algorithm comprises the recursive feature elimination (RFE) algorithm.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of urine samples from subjects having the diabetic nephropathy and a second set of urine samples from subjects having no kidney disease, wherein the first set of urine samples and the second set of urine samples are different from the urine sample of the subject.
- the trained machine learning algorithm is trained with a plurality of training samples comprising a first set of urine samples from subjects having the diabetic nephropathy and a second set of urine samples from subjects having other types of kidney diseases, wherein the first set of urine samples and the second set of urine samples are different from the urine sample of the subject.
- (a) comprises comparing the counts of gene transcripts (e.g., one or more levels of gene expression products) to a reference.
- the reference corresponds to a set of gene expression products from a first set of urine samples from subjects having the diabetic nephropathy and/or a second set of urine samples from subjects having no kidney disease.
- the reference corresponds to a set of gene expression products from a first set of urine samples from subjects having the diabetic nephropathy and/or a second set of urine samples from subjects having other types of kidney diseases.
- the method further comprises detecting the presence or the elevated risk of the diabetic nephropathy in the subject at a sensitivity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the diabetic nephropathy in the subject at a specificity of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about
- the method further comprises detecting the presence or the elevated risk of the diabetic nephropathy in the subject at a positive predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the diabetic nephropathy in the subject at a negative predictive value of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, or at least about 99%.
- the method further comprises detecting the presence or the elevated risk of the diabetic nephropathy in the subject at an Area Under the Curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about 0.82, at least about 0.83, at least about 0.84, at least about 0.85, at least about 0.86, at least about 0.87, at least about 0.88, at least about 0.89, at least about 0.90, at least about 0.91, at least about 0.92, at least about 0.93, at least about 0.94, at least about 0.95, at least about 0.96, at least about 0.97, at least about 0.98, or at least about 0.99.
- AUC Area Under the Curve
- the method further comprises identifying a clinical intervention for the subject based at least in part on the presence or the elevated risk of the diabetic nephropathy determined in (b).
- the clinical intervention is selected from the group consisting of: a drug treatment, intensive glycemic control, high blood pressure control, lower high cholesterol, foster bone health, diet control, lifestyle changes, weight loss, exercise, tobacco cessation, manage alcohol intake, reduce/quit drugs of abuse, and avoiding NSAIDs.
- the drug relies on the blockage of the renin- angiotensin aldosterone system (RAAS, e.g., angiotensin-converting enzyme (ACE) inhibitors and angiotensin II receptor blockers (ARBs)).
- RAAS renin- angiotensin aldosterone system
- ACE angiotensin-converting enzyme
- ARBs angiotensin II receptor blockers
- the drug is an experimental therapy, such as a drug targeting vasculature/hemodynamic effects, a drugs targeting inflammation, and a drug targeting oxidative stress.
- (b) comprises analyzing a first set of genes that differentially distinguishes between diabetic nephropathy (DN) and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and a second kidney disease or disorder.
- (b) comprises analyzing a first set of genes that differentially distinguishes between DN and negative (NEG) subjects and a second set of genes that differentially distinguishes between DN and other chronic kidney diseases (CKD).
- the first set of genes is selected from the group of genes listed in Table 3 and Table 5
- the second set of genes is selected from the group of genes listed in Table 4 and Table 6.
- (b) comprises generating a first DN vs.
- the first DN vs. NEG score is indicative of glomerular injury when greater than 0.5, or is indicative of tubular injury when less than 0.5.
- (b) comprises analyzing different male-specific or female-specific sets of genes based on a gender of the subject.
- the method further comprises analyzing urine samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence or the elevated risk of the diabetic nephropathy in the subject. In some embodiments, the method further comprises determining a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject, and electronically outputting a report that identifies the presence, the absence, or the elevated risk of the another type of kidney disease or disorder in the subject.
- the method further comprises analyzing urine samples of the subject at two or more different time points to yield two or more data sets, and computer processing the two or more data sets to determine the presence, the absence, or the elevated risk of another type of kidney disease or disorder in the subject. In some embodiments, the method further comprises analyzing bodily samples of the subject at two or more different time points to yield two or more data sets, and electronically outputting a report that identifies a presence, an absence, or an elevated risk of another type of kidney disease or disorder in the subject.
- the present disclosure provides a method of treating a kidney disease or disorder of a subject, comprising (a) diagnosing the kidney disease or disorder of the subject, according to a method of the present disclosure; and (b) treating the subject for the kidney disease or disorder.
- the present disclosure provides a method or a platform of evaluating new drug efficacy for treating kidney disease(s) or disorder of a subject, comprising (a) comparing score changes before and after drug treatment; and (b) computer processing genes expression levels that are targeted by the new drug.
- Another aspect of the present disclosure provides a non-transitory computer-readable medium comprising machine executable code that, upon execution by one or more computer processors, implements any of the methods above or elsewhere herein.
- Another aspect of the present disclosure provides a system comprising one or more computer processors and computer memory coupled thereto.
- the computer memory comprises machine executable code that, upon execution by the one or more computer processors, implements any of the methods above or elsewhere herein.
- FIG. 1 illustrates a flow-chart for a method 100 of kidney disease or disorder identification in a subject.
- FIGs. 2A-2D illustrate examples of dimensionality reduction analysis for diabetic nephropathy (DN) across different age groups (FIG. 2A), different gender groups (FIG. 2B), and different race/ethnicity groups (FIG. 2C), including an illustration of batch effects (FIG. 2D)
- DN diabetic nephropathy
- FIG. 3 illustrates a computer system that is programmed or otherwise configured to implement methods provided herein.
- nucleic acid includes a plurality of nucleic acids, including mixtures thereof.
- nucleic acid generally refers to a polymeric form of nucleotides of any length, either deoxyribonucleotides (dNTPs) or ribonucleotides (rNTPs), or analogs thereof. Nucleic acids may have any three-dimensional structure, and may perform any function, known or unknown.
- dNTPs deoxyribonucleotides
- rNTPs ribonucleotides
- Non-limiting examples of nucleic acids include DNA, RNA, coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant nucleic acids, branched nucleic acids, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers.
- loci locus defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, short interfering RNA (siRNA), short-hairpin RNA (shRNA), micro-RNA (miRNA), ribozymes, cDNA, recombinant nucleic acids, branched nucle
- a nucleic acid may comprise one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be made before or after assembly of the nucleic acid.
- the sequence of nucleotides of a nucleic acid may be interrupted by non-nucleotide components.
- a nucleic acid may be further modified after polymerization, such as by conjugation or binding with a reporter agent.
- target nucleic acid generally refers to a nucleic acid molecule in a starting population of nucleic acid molecules having a nucleotide sequence whose presence, amount, and/or sequence, or changes in one or more of these, are desired to be determined.
- a target nucleic acid may be any type of nucleic acid, including DNA, RNA, and analogs thereof.
- a “target ribonucleic acid (RNA)” generally refers to a target nucleic acid that is RNA.
- a “target deoxyribonucleic acid (DNA)” generally refers to a target nucleic acid that is DNA.
- the term “target” generally refers to a genomic region within a marker gene or marker region.
- the term “reference” generally refers to a sample obtained or derived from a subject who is diagnosed with kidney disease or disorder (kidney disease or disorder patient) or who has received a negative clinical indication of kidney disease or disorder (e.g., a healthy or control subject without kidney disease or disorder).
- the terms “locus” or “region” are generally interchangeable and refer to a specific genomic region on the genome represented by chromosome number, start position, and end position.
- the term “subject,” generally refers to an entity or a medium that has testable or detectable genetic information.
- a subject can be a person or individual, such as a patient.
- a subject can be a vertebrate, such as, for example, a mammal.
- Non-limiting examples of mammals include murines, simians, humans, farm animals, sport animals, and pets.
- sample generally refers to a bodily sample or part(s) of a subject, which is obtained and analyzed to measure or to determine the character of the whole, such as a specimen of tissue, cells, blood, urine, or derivatives thereof.
- biomarker generally refers to any substance, structure, or process that can be measured in a subject’s body or its products and be used to influence or predict a clinical outcome or disease with or without treatment, select an appropriate treatment (or predict whether treatment would be effective), or monitor a current treatment and potentially change the treatment.
- the terms “amplifying” and “amplification” are used interchangeably and generally refer to generating one or more copies or “amplified product” of a nucleic acid.
- the term “DNA amplification” generally refers to generating one or more copies of a DNA molecule or “amplified DNA product.”
- the term “reverse transcription amplification” generally refers to the generation of deoxyribonucleic acid (DNA) from a ribonucleic acid (RNA) template via the action of a reverse transcriptase. Amplification may be performed by polymerase chain reaction (PCR), which is based on using DNA polymerase to synthesize new strands of DNA complementary to the initial template strands.
- PCR polymerase chain reaction
- PCR polymerase chain reaction
- This process for amplifying the target sequence may comprise introducing a large excess of two oligonucleotide primers to the DNA mixture containing the desired target sequence, followed by a precise sequence of thermal cycling in the presence of a DNA polymerase.
- the two primers may be complementary to their respective strands of the double-stranded target sequence.
- the mixture may be denatured and the primers may be annealed to their complementary sequences within the target molecule.
- the primers may be extended with a polymerase so as to form a new pair of complementary strands.
- the steps of denaturation, primer annealing, and polymerase extension can be repeated many times (e.g., denaturation, annealing and extension constitute one “cycle”; there can be numerous “cycles”) to obtain a high concentration of an amplified segment of the desired target sequence.
- the length of the amplified segment of the desired target sequence may be determined by the relative positions of the primers with respect to each other, and therefore, this length is a controllable parameter.
- the method is referred to as “polymerase chain reaction” (PCR). Because the desired amplified segments of the target sequence become the predominant sequences (in terms of concentration) in the mixture, they are said to be “PCR amplified” and are “PCR products” or “amplicons.”
- DNA template generally refers to the sample DNA that contains the target sequence. At the beginning of the reaction, high temperature is applied to the original double-stranded DNA molecule to separate the strands from each other.
- primer generally refers to a short piece of single-stranded DNA that are complementary to the DNA template.
- the polymerase begins synthesizing new DNA from the end of the primer.
- the term “AUC” or “AUROC” generally refers to an abbreviation for the area under a Receiver Operating Characteristic (ROC) curve.
- the ROC curve may be a plot of the true positive rate (TPR) against the false positive rate (FPR) for a plurality of different possible thresholds or cut points of a diagnostic test, thereby illustrating the trade-off between sensitivity and specificity depending on the selected cut point (e.g., any increase in sensitivity is accompanied by a decrease in specificity).
- the area under an ROC curve (AUC) can be a measure for the accuracy of a diagnostic test (e.g., the larger the area, the more accurate the diagnosis), with an optimal value of 1.
- a random test may have an ROC curve lying on the diagonal with an AUC of 0.5 (e.g., representing a random or worthless test).
- Diabetic nephropathy is a major cause of kidney disease and is the most common cause of end-stage renal disease (ESRD). In addition, DN is linked to higher cardiovascular and all-cause morbidity and mortality, so timely diagnosis and treatment are critical. Diabetic nephropathy is a diabetic kidney disease that generally refers to kidney damage that results from having high blood glucose levels due to diabetes. Diabetic nephropathy progresses slowly. With efficient early treatment, one can slow or even stop the progression of the disease.
- DN may be associated with measurable biomarkers, such as albuminuria and/or low eGFR, in bodily samples of subjects; however, these biomarkers may be non-specific to DN, and may be attributable to other disease or disorders, such as diabetes, hypertension, IgA nephropathy, membranous nephropathy, lupus nephritis, minimal change disease, rheumatoid arthritis, medicine such as NSAIDs use, smoking, excessive alcohol use, drugs of abuse, obesity, urinary tract infection, kidney stone, benign prostate hyperplasia (BPH), etc.
- diseases with controlled diabetes may also have diabetic nephropathy, whereas poorly controlled diabetic patients may have little kidney damage.
- patients with diabetic nephropathy may also have other type(s) of kidney disease or disorder.
- DN is an under-diagnosed and mis-diagnosed disease among patients.
- DN may be under-diagnosed because diagnostics methods such as renal biopsy can be risky (e.g., with a 1.8% mortality rate), expensive, and time-consuming; therefore, many patients opt out of such diagnostic testing.
- DN may be mis-diagnosed because diagnostics methods may lack adequate sensitivity and specificity.
- a urinary albumin assay may be insensitive and non-specific.
- imaging tests such as X-rays and ultrasound diagnostics to check the structure and size of a subject’s kidney may be time-consuming and indirect, and other panel tests such as urinary sediment test, urine protein electrophoresis (UPEP), serum protein electrophoresis (SPEP), urine blood test, antinuclear antibody (ANA) test, HBV test, HCV test, HIV test may provide limited information, and therefore are indirect and inefficient.
- In vitro diagnostic techniques such as those based on proteomics, genomics, and protein biomarkers, may face challenges in accurately detecting, assessing, and monitoring kidney disease or disorder at high sensitivity, specificity, and accuracy.
- albuminuria especially at low levels, may be confounded with many other factors such as hypertension, smoking, alcohol use, drug abuse, obesity, infection, obstruction, etc. Therefore, many patients may miss the best opportunity for early medical intervention or may receive treatment that is unrelated to the cause of the disease. Receiving such unnecessary and/or ineffective treatment may be expensive, time consuming, and cause delays in providing other effective treatments to patients.
- kidney disease or disorder e.g., DN
- the present disclosure provides methods, systems, and kits for detecting kidney disease or disorders by processing biological samples (e.g., tissue samples, cell samples, and/or bodily fluid samples) obtained from or derived from a subject.
- biological samples e.g., tissue samples, cell samples, and/or bodily fluid samples
- nucleic acids, proteins, or cells of biological samples may be analyzed.
- Biological samples obtained from subjects may be analyzed to measure a presence, absence, or relative assessment of the kidney disease or disorder. The analysis may be performed at a set of genomic regions, such as kidney disease-associated genes or genomic loci.
- the subjects may include subjects with kidney disease or disorder (e.g., kidney disease or disorder patients) and subjects without kidney disease or disorder (e.g., normal or healthy controls).
- kidney disease or disorder e.g., kidney disease or disorder patients
- subjects without kidney disease or disorder e.g., normal or healthy controls.
- Methods of the present disclosure may present numerous advantages over current methods, including: ease, safety, and non-invasiveness of sample collection from subjects, possibility of repeated assays, the use of urine samples which have kidney cells suitable for analysis, a direct method of analysis of kidney injury, suitability for monitoring disease progression and treatment efficacy, suitability for sample collection in a home setting, the ability to perform testing without detailed medical history of a subject, and the ability to detect early diabetic changes (e.g., for asymptomatic subjects).
- kidney disease or disorder can be accurately detected using an assay with high sensitivity and specificity in biological samples (e.g., urine samples).
- the urine-based assay can apply machine learning algorithm to analyze a set of biomarkers to accurately distinguish kidney disease or disorder samples from control samples across various stages (e.g., early-stage, mid-stage, or late-stage) of kidney disease or disorder.
- the urine-based assay may offer high specificity, thereby facilitating the non- invasive application of kidney disease or disorder associated biomarkers for treatment monitoring of kidney disease or disorder patients.
- the urine-based assay may offer higher sensitivity and specificity than those of renal biopsy, currently considered as the gold standard for definitive diagnosis of kidney diseases.
- FIG. 1 illustrates a flow-chart for a method 100 of kidney disease or disorder identification in a subject.
- the method 100 may comprise analyzing a biological sample (e.g., a bodily sample comprising cells, tissue, blood, urine, or derivatives thereof) from a subject (e.g., a patient) to yield a data set comprising levels of gene expression products in the bodily sample (as in operation 102).
- the levels of gene expression products may correspond to a set of genes associated with a kidney disease or disorder.
- the method 100 may comprise computer processing the data set to determine a presence or an elevated risk of the kidney disease or disorder in the subject (as in operation 104).
- the method 100 may comprise electronically outputting a report that identifies the presence or the elevated risk of the kidney disease or disorder in the subject.
- the biological sample may be obtained or derived from a blood sample, a serum sample, a plasma sample, a saliva sample, a stool sample, a sputum sample, a urine sample, a semen sample, a transvaginal fluid sample, a cerebrospinal fluid sample, a sweat sample, a cell sample, or a tissue sample from a human subject.
- the biological sample may be stored in a variety of storage conditions before processing, such as different temperatures (e.g., at room temperature, under refrigeration or freezer conditions, at 4°C, at -18°C, -20°C, or at -80°C, or liquid nitrogen) or different preservatives (e.g., alcohol, formaldehyde, or potassium dichromate, or urine collection and preservation tube from Norgen Biotek Inc.).
- the biological sample may be a fresh sample (e.g., processed in a suitable time frame to avoid substantial RNA degradation) or a frozen or preserved sample.
- the biological sample may be obtained from a subject with a disease or disorder, from a subject that is suspected of having the disease or disorder, or from a subject that does not have or is not suspected of having the disease or disorder.
- the disease or disorder may be an infectious disease, an immune disorder or disease, a cancer, a genetic disease, a degenerative disease, a lifestyle disease, an injury, a rare disease or an age-related disease.
- the infectious disease may be caused by bacteria, viruses, fungi, and/or parasites.
- the disorder or disease may be a kidney disease or disorder.
- the sample may be taken before and/or after treatment of a subject with a disease or disorder. Samples may be taken before and/or after a treatment.
- Samples may be taken during a treatment or a treatment regime. Multiple samples may be taken from a subject to monitor the effects of the treatment over time. Multiple samples may be taken from a subject to monitor the disease progression over time. Multiple samples may be taken from a subject to evaluate the possibility of a coexisting kidney disease or disorder. The sample may be taken from a subject known or suspected of having a kidney disease or disorder for which a definitive positive or negative diagnosis is not available via clinical tests.
- the sample may be taken from a subject suspected of having a disease or a disorder (e.g., kidney disease or disorder).
- the sample may be taken from a subject experiencing unexplained symptoms, such as fatigue, nausea, weight loss, aches and pains, weakness, or memory loss.
- the sample may be taken from a subject having explained symptoms.
- the sample may be taken from a subject that is asymptomatic for a disease or a disorder (e.g., kidney disease or disorder).
- the sample may be taken from a subject at risk of developing a disease or disorder (e.g., kidney disease or disorder) due to factors such as familial history, age, environmental exposure, lifestyle risk factors, or presence of other known risk factors.
- the biological sample obtained from the subject may be assayed to generate gene expression data indicative of a presence, absence, or relative assessment of a kidney disease or disorder of a subject.
- a presence, absence, or relative assessment of nucleic acid molecules of the biological sample at a panel of kidney disease or disorder-associated genomic loci e.g., quantitative measures of gene expression at a plurality of kidney disease or disorder-associated genomic loci
- the biological samples obtained or derived from the subject may be processed by (i) subjecting the biological sample to conditions that are sufficient to isolate, enrich, or extract a plurality of nucleic acid molecules (e.g., RNA or
- DNA molecules DNA molecules
- a plurality of nucleic acid molecules may be extracted from the biological sample and subjected to further assaying (e.g., sequencing to generate a plurality of counts of gene transcripts).
- the nucleic acid molecules may comprise deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
- the nucleic acid molecules (e.g., RNA or DNA) may be extracted from the biological sample by a variety of methods, such as a FastDNA Kit protocol from MP Biomedicals or a Allprep DNA/RNAKit from QIAGEN.
- the extraction method may extract all RNA or DNA molecules from a sample.
- the extraction method may selectively extract a portion of RNA or DNA molecules from a sample. Extracted RNA molecules from a sample may be converted to cDNA molecules by reverse transcription (RT).
- the method may comprise a variety of assays suitable for assessing the presence of gene expression at the kidney disease or disorder-specific markers in a biological sample including Next Generation Sequencing (NGS), Real-time PCR, Microarray analysis and Luminex-based gene expression analysis.
- NGS Next Generation Sequencing
- the nucleic acid sequencing may be performed by any suitable sequencing methods, such as shotgun sequencing, single-molecule sequencing, nanopore sequencing, semiconductor sequencing, pyrosequencing, sequencing-by-synthesis (SBS), multiplexed PCR based methods, and exome targeted sequencing.
- total RNAs may be first reverse transcribed into cDNA.
- Multiplexed gene specific primers may be hybridized to the target loci (e.g., one or more of the panel of kidney disease or disorder biomarkers or kidney disease or disorder- associated genomic loci), followed by PCR amplification to create amplicons. Primer sequences may be then removed.
- a second PCR may be performed with sequencing primers to create sequencing-ready fragments.
- the sequencing may comprise use of commercially available kits and protocols such as Ampliseq by Thermo Fisher or illumina, CleanPlex DNA/RNA amplicon sequencing kit by Paragon Genomics.
- total RNAs may be first reverse transcribed into cDNA.
- a suitable number of rounds of PCR may be performed to sufficiently amplify an initial amount of cDNAs to a desired input quantity.
- These initially amplified cDNAs may be then hybridized with gene specific probes (e.g., one or more of the panel of kidney disease or disorder biomarkers or kidney disease or disorder-associated gnomic loci).
- Gene specific probes e.g., one or more of the panel of kidney disease or disorder biomarkers or kidney disease or disorder-associated gnomic loci.
- Targeted gene fragments may be selected and enriched.
- a second PCR may be performed to amplify the enriched products to reach a quantity sufficient for sequencing.
- RNA or DNA molecules may be tagged, e.g., with identifiable tags, to allow for multiplexing of a plurality of samples. Any number of RNA or DNA samples may be multiplexed. For example a multiplexed reaction may contain RNA or DNA from at least about
- RNA or DNA molecules may be traced back to the sample (and the subject) from which the RNA or DNA molecule originated.
- tags may be attached to RNA or DNA molecules by ligation or by PCR amplification with primers.
- RT-PCR real-time PCR
- RNA molecules within a population of nucleic acids may be amplified (e.g., one or more of the panel of kidney disease or disorder biomarkers or kidney disease or disorder- associated genomic loci).
- up to five gene specific primer/probe sets may be used to selectively amplify certain targets in each well.
- the fluorescent tags carried by the probes may emit fluorescence that can be captured by a camera detector.
- the intensity level of the fluorescence defined by the cycle threshold (Ct) value may be inversely proportional to the level of gene expression.
- RTPCR may be performed using any of a number of commercial kits, e.g., provided by Life Technologies, Bio-Rad, Promega, New England Biolabs, etc.
- microarray analysis may be used to assess the presence of gene expression at the kidney disease or disorder-specific markers in a biology sample.
- Probes for target genes e.g. one or more of the panel of kidney disease or disorder biomarkers or kidney disease or disorder-associated genomic loci
- Total RNAs may be first reverse transcribed into cDNA. Fluorescent dyes may be added during reverse transcription. Labeled cDNA products may be then hybridized with the probes in the microarray chip. After hybridization, the microarray may be dried and scanned by a machine that uses a laser to excite the dye and measures the emission levels with a detector. The amount of fluorescence may be proportional to the levels of gene expression.
- Luminex-based gene expression analysis may be used to assess the presence of gene expression at the kidney disease or disorder-specific markers in a biology sample.
- the assay may be based on direct quantification of the RNA targets for multiplexing of 3 to 80 RNA targets and branched DNA (bDNA) signal amplification technology.
- the urine sample may be lysed to release the RNAs and incubated overnight with target specific probe sets and Luminex capture beads.
- the signal amplification tree may be built via sequential hybridization of Pre amplifier, Amplifier, and Label Probe.
- Each amplification unit may provide a 400x signal amplification, and there may be six amplification units per target RNA copy resulting in a 2,400x signal amplification per copy RNA.
- the signal may be detected by using the fluorescent reporter molecule, phycoerythrin, on a Luminex instrument for readout and analysis.
- the Luminex-based gene expression analysis may comprise of using commercially available systems such as QuantiGene Plex Gene Expression Assay by Thermo Fisher.
- sequence reads may be aligned to one or more reference genomes (e.g., a genome of one or more species such as a human genome).
- the aligned sequence reads may be quantified at a panel of genomic loci to generate the data indicative of a distribution of the presence, absence, or relative assessment of the kidney disease or disorder.
- quantification of sequences corresponding to a panel of genomic loci associated with kidney disease or disorder may be performed to generate the gene expression data indicative of the presence, absence, or relative assessment of the kidney disease or disorder.
- the kidney disease or disorder may be identified or monitored in the subject by using probes configured to selectively enrich nucleic acid (e.g., RNA or DNA) molecules corresponding to the panel of genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- the probes may be oligonucleotides.
- the probes may have sequence complementarity with nucleic acid sequences (e.g., about 5 nucleotides, about 10 nucleotides, about 15 nucleotides, about 20 nucleotides, about 25 nucleotides, about 30 nucleotides, about 35 nucleotides, about 40 nucleotides, about 45 nucleotides, about 50 nucleotides, about 55 nucleotides, about 60 nucleotides, about 65 nucleotides, about 70 nucleotides, about 75 nucleotides, about 80 nucleotides, about 85 nucleotides, about 90 nucleotides, about 95 nucleotides, about 100 nucleotides, or more than about 100 nucleotides) from one or more of the individual genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- individual genomic loci e.g., kidney disease or disorder-associated genomic loci.
- the one or more genomic loci may comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, or more than about 100 distinct genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- the panel of genomic loci comprises one or more kidney disease or disorder-associated genomic loci listed in Table 3, Table 4, Table 5, and/or Table 6.
- the biological sample may be processed without any nucleic acid extraction.
- the processing may comprise assaying the biological sample using probes that are selected for the panel of genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- the panel of genomic loci e.g., kidney disease or disorder-associated genomic loci
- the panel of genomic loci may comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, or more than about 100 distinct genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- the panel of genomic loci comprises one or more kidney disease or disorder-associated genomic loci listed in Table 3, Table 4, Table 5, and/or Table 6.
- the processing may comprise assaying the biological sample using probes that are selective for the one or more genomic loci (e.g., kidney disease or disorder-associated genomic loci) among other genomic loci in the biological sample.
- the probes may be nucleic acid molecules (e.g., RNA or DNA) having sequence complementarity with nucleic acid sequences (e.g., about 5 nucleotides, about 10 nucleotides, about 15 nucleotides, about 20 nucleotides, about 25 nucleotides, about 30 nucleotides, about 35 nucleotides, about 40 nucleotides, about 45 nucleotides, about 50 nucleotides, about 55 nucleotides, about 60 nucleotides, about 65 nucleotides, about 70 nucleotides, about 75 nucleotides, about 80 nucleotides, about 85 nucleotides, about 90 nucleotides, about 95 nucleotides, about 100 nucleotides, or more
- nucleic acid molecules may be oligonucleotides or enrichment sequences.
- the assaying of the biological sample using probes that are selected for the one or more genomic loci may comprise use of array hybridization, polymerase chain reaction (PCR), or nucleic acid sequencing (e.g., DNA sequencing or RNA sequencing).
- the assay readouts may be quantified at one or more of the panel of genomic loci (e.g., kidney disease or disorder-associated genomic loci) to generate the gene expression data indicative of a presence, absence, or relative assessment of the kidney disease or disorder.
- quantification of array hybridization or polymerase chain reaction (PCR) corresponding to the panel of kidney disease or disorder-associated genomic loci may be performed to generate gene expression data at the panel of kidney disease or disorder-associated genomic loci in the biological sample.
- Assay readouts may comprise quantitative PCR (qPCR) values, digital PCR (dPCR) values, digital droplet PCR (ddPCR) values, fluorescence values, etc.
- kits for identifying or monitoring a kidney disease or disorder in a subject may comprise probes for identifying a presence, absence, or relative amount of sequences at the panel of kidney disease or disorder-associated genomic loci in a biological sample of the subject, which may be indicative of a kidney disease or disorder.
- the probes may be selective for the sequences at the panel of kidney disease or disorder-associated genomic loci in the biological sample.
- a kit may comprise instructions for using the probes to process the biological sample to generate gene expression data at the panel of kidney disease or disorder-associated genomic loci in a biological sample of the subject.
- the probes in the kit may be selective for the sequences at the plurality of kidney disease or disorder-associated genomic loci in the biological sample.
- the probes in the kit may be configured to selectively enrich nucleic acid (e.g., RNA or DNA) molecules corresponding to the panel of kidney disease or disorder-associated genomic loci.
- the probes in the kit may be - oligonucleotides.
- the probes in the kit may have sequence complementarity with nucleic acid sequences from one or more of the kidney disease or disorder-associated genomic loci.
- the one or more genomic loci may comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, about 15, about 20, about 25, about 30, about 35, about 40, about 45, about 50, about 55, about 60, about 65, about 70, about 75, about 80, about 85, about 90, about 95, about 100, or more than about 100 distinct genomic loci (e.g., kidney disease or disorder-associated genomic loci).
- the one or more genomic loci comprise one or more kidney disease or disorder-associated genomic loci listed in Table 3, Table 4, Table 5, and/or Table 6.
- the instructions in the kit may comprise instructions to assay the biological sample using the probes that are selective for the sequences at the panel of kidney disease or disorder- associated genomic loci in the biological sample.
- the probes may be nucleic acid molecules (e.g., RNA or DNA) having sequence complementarity with nucleic acid sequences (e.g., about 5 nucleotides, about 10 nucleotides, about 15 nucleotides, about 20 nucleotides, about 25 nucleotides, about 30 nucleotides, about 35 nucleotides, about 40 nucleotides, about 45 nucleotides, about 50 nucleotides, about 55 nucleotides, about 60 nucleotides, about 65 nucleotides, about 70 nucleotides, about 75 nucleotides, about 80 nucleotides, about 85 nucleotides, about 90 nucleotides, about 95 nucleotides, about 100 nucleotides, or more than about 100 nucleot
- the instructions to assay the biological sample may comprise introductions to perform array hybridization, polymerase chain reaction (PCR), or nucleic acid sequencing (e.g., RNA sequencing or DNA sequencing) to process the biological sample to generate gene expression data indicative of a presence, absence, or relative amount of sequences at the panel of kidney disease or disorder-associated genomic loci in the biological sample, which may be indicative of a kidney disease or disorder.
- the nucleic acid sequencing may be single-molecule (e.g., single-cell RNA-Seq or single-cell DNA-Seq).
- the instructions in the kit may comprise instructions to measure and interpret assay readouts, which may be quantified at one or more of the panel of kidney disease or disorder- associated genomic loci to generate the gene expression data indicative of a presence, absence, or relative amount of sequences at the panel of kidney disease or disorder-associated genomic loci in the biological sample.
- quantification of array hybridization or polymerase chain reaction (PCR) corresponding to the panel of kidney disease or disorder-associated genomic loci may generate gene expression data at the panel of kidney disease or disorder- associated genomic loci in the biological sample.
- Assay readouts may comprise quantitative PCR (qPCR) values, digital PCR (dPCR) values, digital droplet PCR (ddPCR) values, fluorescence values, etc., normalized values thereof, or ratio values thereof.
- a classifier may be used to process the gene expression data at the panel of kidney disease or disorder-associated genomic loci to classify the biological sample, thereby identifying or assessing a kidney disease or disorder of the subject.
- the classifier may be configured to identify the kidney disease or disorder with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than 99% for at least about 25, at least about 50, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, at least about 350, at least about 400, at least about 450, at least about 500, or more than about 500 independent samples.
- the classifier may comprise a supervised machine learning algorithm or an unsupervised machine learning algorithm.
- the classifier may comprise a classification and regression tree (CART) algorithm.
- the classifier may comprise, for example, a support vector machine (SVM), a linear regression, a logistic regression, a nonlinear regression, a neural network, an ensemble learning method, a boosting algorithm, an AdaBoost algorithm, a Random Forest, a deep learning algorithm, a naive Bayes classifier, a recursive feature elimination algorithm, or a combination thereof.
- SVM support vector machine
- AdaBoost algorithm AdaBoost algorithm
- Random Forest Random Forest
- a deep learning algorithm a naive Bayes classifier
- a recursive feature elimination algorithm or a combination thereof.
- the plurality of input variables may comprise data indicative of a presence, absence, or relative amount of sequences or transcripts corresponding to each of the plurality of kidney disease or disorder- associated genomic loci.
- an input variable may comprise a number of sequences or transcripts corresponding to or aligning to each of the plurality of kidney disease or disorder- associated genomic loci.
- the classifier may have one or more possible output values, each comprising one of a fixed number of possible values (e.g., a linear classifier, a logistic regression classifier, etc.) indicating a classification of the biological sample by the classifier.
- the classifier may comprise a binary classifier, such that each of the one or more output values comprises one of two values (e.g., (0, 1 ⁇ , (positive, negative ⁇ , or (cancerous, non-cancerous ⁇ ) indicating a classification of the biological sample by the classifier.
- the classifier may be another type of classifier, such that each of the one or more output values comprises one of more than two values (e.g., (0, 1, 2 ⁇ , (positive, negative, or indeterminate ⁇ , or (diseased, non-diseased, or indeterminate ⁇ ) indicating a classification of the biological sample by the classifier.
- the output values may comprise descriptive labels, numerical values, or a combination thereof. Some of the output values may comprise descriptive labels. Such descriptive labels may provide an identification or indication of the disease state of the subject, and may comprise, for example, positive, negative, diseased, non-diseased, or indeterminate.
- Such descriptive labels may provide an identification of a treatment for the subject’s disease state, and may comprise, for example, a therapeutic intervention, a duration of the therapeutic intervention, and/or a dosage of the therapeutic intervention.
- Such descriptive labels may provide an identification of secondary clinical tests that may be appropriate to perform on the subject, and may comprise, for example, a biopsy, a blood test, a computed tomography (CT) scan, a magnetic resonance imaging (MRI) scan, an ultrasound scan, a chest X-ray, a positron emission tomography (PET) scan, or a PET-CT scan.
- CT computed tomography
- MRI magnetic resonance imaging
- PET positron emission tomography
- PET-CT scan positron emission tomography
- Such descriptive labels may provide a prognosis of the disease state of the subject.
- Some descriptive labels may be mapped to numerical values, for example, by mapping “positive” to 1 and “negative” to 0.
- Some of the output values may comprise numerical values, such as binary, integer, or continuous values. Such binary output values may comprise, for example, (0, 1 ⁇ . Such integer output values may comprise, for example, (0, 1, 2 ⁇ . Such continuous output values may comprise, for example, a probability value of at least 0 and no more than 1. Such continuous output values may comprise, for example, an un-normalized probability value of at least 0. Such continuous output values may comprise, for example, an un-normalized probability value of at least 0. Such continuous output values may indicate a prognosis of the disease or disorder state of the subject and may comprise, for example, an indication of an expected or average risk or severity of kidney disease or disorder of the subject.
- Such continuous output values may indicate a prediction of the course of treatment to treat the disease or disorder state of the subject and may comprise, for example, an indication of an expected duration of efficacy of the course of treatment.
- Some numerical values may be mapped to descriptive labels, for example, by mapping 1 to “positive” and 0 to “negative.”
- Some of the output values may be assigned based on one or more cutoff values. For example, a binary classification of samples may assign an output value of “positive” or 1 if the sample indicates that the subject has at least a 50% probability of being diseased. For example, a binary classification of samples may assign an output value of “negative” or 0 if the sample indicates that the subject has less than a 50% probability of being diseased. In this case, a single cutoff value of 50% is used to classify samples into one of the two possible binary output values.
- Examples of single cutoff values may include about 1%, about 2%, about 5%, about 10%, about 15%, about 20%, about 25%, about 30%, about 35%, about 40%, about 45%, about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 98%, and about 99%.
- a classification of samples may assign an output value of “positive” or 1 if the sample indicates that the subject has a probability of being diseased of at least 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 98%, or at least about 99%.
- the classification of samples may assign an output value of “positive” or 1 if the sample indicates that the subject has a probability of being diseased of more than 50%, more than 55%, more than 60%, more than 65%, more than 70%, more than 75%, more than 80%, more than 85%, more than 90%, more than 95%, more than 98%, or more than 99%.
- the classification of samples may assign an output value of “negative” or 0 if the sample indicates that the subject has a probability of being diseased of less than 50%, less than 45%, less than 40%, less than 35%, less than 30%, less than 25%, less than 20%, less than 10%, less than 5%, less than 2%, or less than 1%.
- the classification of samples may assign an output value of “negative” or 0 if the sample indicates that the subject has a probability of being diseased of no more than 50%, no more than 45%, no more than 40%, no more than 35%, no more than 30%, no more than 25%, no more than 20%, no more than 10%, no more than 5%, no more than 2%, or no more than 1%.
- the classification of samples may assign an output value of “indeterminate” or 2 if the sample has not been classified as “positive,” “negative,” 1, or 0. In this case, a set of two cutoff values is used to classify samples into one of the three possible output values.
- Examples of sets of cutoff values may include ⁇ 1%, 99% ⁇ , (2%, 98% ⁇ , (5%, 95% ⁇ , (10%, 90% ⁇ , (15%, 85% ⁇ , (20%, 80% ⁇ , (25%, 75% ⁇ , (30%, 70% ⁇ , (35%, 65% ⁇ ,
- sets of n cutoff values may be used to classify samples into one of n+ 1 possible output values, where n is any positive integer.
- the classifier may be trained with a plurality of independent training samples.
- Each of the independent training samples may comprise a biological sample from a subject, associated data obtained by processing the biological sample (as described elsewhere herein), and one or more known output values corresponding to the biological sample (e.g., a clinical diagnosis, prognosis, treatment efficacy, or absence of a disease or disorder such as a kidney disease or disorder of the subject).
- Independent training samples may comprise biological samples and associated data and outputs obtained from a plurality of different subjects.
- Independent training samples may comprise biological samples and associated data and outputs obtained at a plurality of different time points from the same subject (e.g., before, after, and/or during a course of treatment to treat a disease or disorder of the subject).
- Independent training samples may be associated with presence of the kidney disease or disorder (e.g., training samples comprising biological samples and associated data and outputs obtained from a plurality of subjects known to have the kidney disease or disorder). Independent training samples may be associated with absence of the kidney disease or disorder (e.g., training samples comprising biological samples and associated data and outputs obtained from a plurality of subjects who are known to not have a previous diagnosis of the kidney disease or disorder, or otherwise who are asymptomatic for the kidney disease or disorder).
- presence of the kidney disease or disorder e.g., training samples comprising biological samples and associated data and outputs obtained from a plurality of subjects known to have the kidney disease or disorder.
- Independent training samples may be associated with absence of the kidney disease or disorder (e.g., training samples comprising biological samples and associated data and outputs obtained from a plurality of subjects who are known to not have a previous diagnosis of the kidney disease or disorder, or otherwise who are asymptomatic for the kidney disease or disorder).
- the classifier may be trained with at least about 50, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, at least about 350, at least about 400, at least about 450, or at least about 500 independent training samples.
- the independent training samples may comprise samples associated with presence of the kidney disease or disorder and/or samples associated with absence of the kidney disease or disorder.
- the classifier may be trained with no more than 500, no more than 450, no more than 400, no more than 350, no more than 300, no more than 250, no more than 200, no more than 150, no more than 100, or no more than 50 independent training samples associated with presence of the kidney disease or disorder.
- the biological sample is independent of samples used to train the classifier.
- the classifier may be trained with a first number of independent training samples associated with presence of the kidney disease or disorder and a second number of independent training samples associated with absence of the kidney disease or disorder.
- the first number of independent training samples associated with presence of the kidney disease or disorder may be no more than the second number of independent training samples associated with absence of the kidney disease or disorder.
- the first number of independent training samples associated with presence of the kidney disease or disorder may be equal to the second number of independent training samples associated with absence of the kidney disease or disorder.
- the first number of independent training samples associated with presence of the kidney disease or disorder may be greater than the second number of independent training samples associated with absence of the kidney disease or disorder.
- the classifier may be configured to identify the kidney disease or disorder with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%, for at least about 50, at least about 100, at least about 150, at least about 200, at least about 250, at least about 300, or more than about 300 independent samples.
- the accuracy of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples (e.g., subjects having the kidney disease or disorder, or apparently healthy subjects with negative clinical test results for the kidney disease or disorder) that are correctly identified or classified as having or not having the kidney disease or disorder, respectively.
- the classifier may be configured to identify the kidney disease or disorder with a positive predictive value (PPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the PPV of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of biological samples identified
- the classifier may be configured to identify the kidney disease or disorder with a negative predictive value (NPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the NPV of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of biological samples identified
- the classifier may be configured to identify the kidney disease or disorder with a clinical sensitivity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the clinical sensitivity of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples associated with presence of the kidney disease or disorder (e.g., subjects known to have the kidney disease or disorder) that are correctly identified or classified as having the kidney disease or disorder.
- a clinical sensitivity may also be referred to as a recall.
- the classifier may be configured to identify the kidney disease or disorder with a clinical specificity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the clinical specificity of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples associated with absence of the kidney disease or disorder (e.g., apparently healthy subjects with negative clinical test results for the kidney disease or disorder) that are correctly identified or classified as not having the kidney disease or disorder.
- the classifier may be configured to identify the kidney disease or disorder with an Area-Under-Curve (AUC) of at least about 0.50, at least about 0.55, at least about 0.60, at least about 0.65, at least about 0.70, at least about 0.75, at least about 0.80, at least about 0.81, at least about 0.82, at least about 0.83, at least about 0.84, at least about 0.85, at least about 0.86, at least about 0.87, at least about 0.88, at least about 0.89, at least about 0.90, at least about 0.91, at least about 0.92, at least about 0.93, at least about 0.94, at least about 0.95, at least about 0.96, at least about 0.97, at least about 0.98, at least about 0.99, or more than about 0.99.
- the AUC may be calculated as an integral of the Receiver Operator Characteristic (ROC) curve (e.g., the area under the ROC curve) associated with the classifier in classifying biological samples as having or not having the kidney disease or disorder.
- ROC Receiver
- the classifier may be adjusted or tuned to improve the performance, accuracy, PPV, NPV, clinical sensitivity, clinical specificity, or AUC of identifying the kidney disease or disorder.
- the classifier may be adjusted or tuned by adjusting parameters of the classifier (e.g., a set of cutoff values used to classify a sample as described elsewhere herein, or weights of a neural network).
- the classifier may be adjusted or tuned continuously during the training process or after the training process has completed.
- a subset of the inputs may be identified as most influential or most important to be included for making high-quality classifications.
- a subset of the plurality of kidney disease or disorder-associated genomic loci may be identified as most influential or most important to be included for making high-quality classifications or identifications of kidney disease or disorder.
- the plurality of kidney disease or disorder-associated genomic loci or a subset thereof may be ranked based on metrics indicative of each genomic locus’s influence or importance toward making high-quality classifications or identifications of kidney disease or disorder.
- Such metrics may be used to reduce, in some cases significantly, the number of input variables (e.g., predictor variables) that may be used to train the classifier to a desired performance level (e.g., based on a desired minimum accuracy, PPV, NPV, clinical sensitivity, clinical specificity, or AUC). For example, if training the training algorithm with a plurality comprising several dozen or hundreds of input variables in the classifier results in an accuracy of classification of more than 99%, then training the training algorithm instead with only a selected subset of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, no more than about 10, no more than about 15, no more than about 20, no more than about 25, no more than about 30, no more than about 35, no more than about 40, no more than about 45, no more than about
- the subset may be selected by rank-ordering the entire plurality of input variables and selecting a predetermined number (e.g., 1, 2, 3, 4, 5, 6, 7,
- the selected subset of the influential or most important input variables comprises one or more genomic loci listed in Table 2.
- Identifying or monitoring a kidney disease or disorder After using a classifier to process the gene expression data at the panel of kidney disease or disorder-associated genomic loci to classify the biological sample, a quantitative measure indicative of the presence, absence, or relative assessment of the kidney disease or disorder may be determined (e.g., likelihood or probability of kidney disease or disorder), and the kidney disease or disorder may be identified or a progression or regression of the kidney disease or disorder may be monitored in the subject based at least in part on the quantitative measure (e.g., likelihood or probability of kidney disease or disorder).
- a quantitative measure indicative of the presence, absence, or relative assessment of the kidney disease or disorder may be determined (e.g., likelihood or probability of kidney disease or disorder), and the kidney disease or disorder may be identified or a progression or regression of the kidney disease or disorder may be monitored in the subject based at least in part on the quantitative measure (e.g., likelihood or probability of kidney disease or disorder).
- the kidney disease or disorder may be identified in the subject with an accuracy of at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the accuracy of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples (e.g., subjects having the kidney disease or disorder, or apparently healthy subjects with negative clinical test results for the kidney disease or disorder) that are correctly identified or classified as having or not having the kidney disease or disorder, respectively.
- the kidney disease or disorder may be identified in the subject with a positive predictive value (PPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about
- PSV positive predictive value
- the kidney disease or disorder may be identified in the subject with a negative predictive value (NPV) of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the NPV of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of biological samples identified or classified
- the kidney disease or disorder may be identified in the subject with a clinical sensitivity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the clinical sensitivity of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples associated with presence of the kidney disease or disorder (e.g., subjects having the kidney disease or disorder) that are correctly identified or classified as having the kidney disease or disorder.
- a clinical sensitivity may also be referred to as a recall.
- the kidney disease or disorder may be identified in the subject with a clinical specificity of at least about 5%, at least about 10%, at least about 15%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 81%, at least about 82%, at least about 83%, at least about 84%, at least about 85%, at least about 86%, at least about 87%, at least about 88%, at least about 89%, at least about 90%, at least about 91%, at least about 92%, at least about 93%, at least about 94%, at least about 95%, at least about 96%, at least about 97%, at least about 98%, at least about 99%, or more than about 99%.
- the clinical specificity of identifying the kidney disease or disorder by the classifier may be calculated as the percentage of independent test samples associated with absence of the kidney disease or disorder (e.g., apparently healthy subjects with negative clinical test results for the kidney disease or disorder) that are correctly identified or classified as not having the kidney disease or disorder.
- a stage of the kidney disease or disorder (e.g., early stage, mid-stage, or late-stage) may further be identified.
- the stage of the kidney disease or disorder may be determined based at least in part on the gene expression data at a panel of kidney disease or disorder-associated genomic loci (e.g., quantitative measures of differential gene expressions at the kidney disease or disorder- associated genomic loci).
- an early-stage kidney disease or disorder may refer to a stage of kidney disease or disorder before clinical symptoms are manifested in the subject.
- a late-stage kidney disease or disorder may refer to a stage of kidney disease or disorder for which the subject has high severity of the kidney disease or disorder and/or is suffering severe impairment of kidney function (e.g., in need of dialysis, renal transplantation or tight diabetes management or tight blood pressure control).
- the subject may be provided with a therapeutic intervention (e.g., prescribing an appropriate course of treatment to treat the kidney disease or disorder of the subject).
- the therapeutic intervention may comprise an effective dose of medication such as ACE inhibitors or ARBs, drugs targeting the vasculature such as Tie-2 activators, sodium-glucose transport protein 2 (SGLT2) inhibitors and glucagon-like peptide 1 (GLP-1) agonists, anti-inflammatory therapies include inflammatory cytokines inhibitors, pentoxifylline, as well as anti-transforming growth factor a/-epiregulin therapies, anti-oxidative stress therapies include nicotinamide adenine dinucleotide phosphate (NADPH) oxidase inhibitors and allopurinol, an effective dose of insulin for diabetes management, a change in diet or exercise regimen, a surgery, tobacco cessation, avoid NSAIDs, or a combination thereof.
- NADPH nicotinamide adenine dinucleot
- the therapeutic intervention may comprise a subsequent different course of treatment (e.g., to increase treatment efficacy due to inefficacy or non response of the current course of treatment).
- the therapeutic intervention may comprise recommending the subject for a secondary clinical test to confirm a diagnosis of the kidney disease or disorder.
- This secondary clinical test may comprise a renal biopsy, an imaging test, a blood test, a computed tomography (CT) scan, a magnetic resonance imaging (MRI) scan, an ultrasound scan, a chest X-ray, a positron emission tomography (PET) scan, a PET-CT scan, or any combination thereof.
- the subject may be treated upon identifying the subject as having the kidney disease or disorder. Treating the subject may comprise administering an appropriate therapeutic intervention to treat the kidney disease or disorder of the subject.
- the therapeutic intervention may comprise an effective dose of medication, an effective dose of insulin for diabetes management, a change in diet or exercise regimen, a surgery, or a combination thereof. If the subject is currently being treated for the kidney disease or disorder with a course of treatment, the administered therapeutic intervention may comprise a subsequent different course of treatment (e.g., to increase treatment efficacy due to inefficacy or non-response of the current course of treatment).
- the medical intervention may comprise recommending the subject for a secondary clinical test to determine the cause of the kidney disease.
- the secondary clinical test may comprise a renal biopsy. If the subject is a heavy smoker, or a heavy alcohol user, or a drug of abuse user, the medical intervention may comprise recommending the subject to quit smoking, drinking or drug of abuse, and have a second test months after substance cessation. If the subject is morbidly obese, the medical intervention may comprise weight management. The subject may have a second test months after weight loss. If the subject has an ongoing kidney infection or urinary tract infection, the medical intervention may comprise recommending having the infection treated first. The subject may have a second test months after the treatment is complete.
- the subject may be provided with a therapeutic intervention (e.g., prescribing and/or administering an appropriate course of preventive treatment to protect the kidneys of the subject).
- the therapeutic intervention may comprise an effective dose of medication such as ACE inhibitors or ARBs, better glucose control, an effective dose of insulin for diabetes management, a change in diet or exercise regimen, better blood pressure control, avoid NSAIDs, weight management, or a combination thereof.
- the gene expression data at the panel of kidney disease or disorder-associated genomic loci may be assessed over a duration of time to monitor a patient (e.g., subject who has an elevated risk of developing the kidney disease, or subject who has the kidney disease or disorder or who is being treated for kidney disease or disorder).
- a patient e.g., subject who has an elevated risk of developing the kidney disease, or subject who has the kidney disease or disorder or who is being treated for kidney disease or disorder.
- the quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci of the patient may change during the course of intervention or treatment or care.
- the quantitative measures of gene expression at the kidney disease or disorder- associated genomic loci of a patient whose kidney disease or disorder is regressing due to an effective intervention or treatment may shift toward the gene expression profile or distribution of a healthy subject.
- the quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci of a patient whose kidney disease or disorder is progressing due to an ineffective intervention or treatment (or receiving no intervention or treatment) may shift toward the gene expression profile or distribution of a subject with more advanced stage kidney disease or disorder.
- the quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci of a patient who has an elevated risk of developing the kidney disease or disorder and is regressing due to an effective preventive treatment may shift toward the gene expression profile or distribution of a healthy subject.
- the quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci of a patient who has an elevated risk of developing the kidney disease and is progressing due to an ineffective intervention or treatment (or receiving no intervention or treatment) may shift toward the gene expression profile or distribution of a subject with overt kidney disease or disorder.
- the quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci of a patient give a lower score while lab results show no improvement in albuminuria or eGFR may suggest the coexistence or development of another type of chronic kidney disease.
- the progression or regression of the kidney disease or disorder in the subject may be monitored by monitoring a course of intervention or treatment for treating the kidney disease or disorder in the subject.
- the monitoring may comprise assessing the kidney disease or disorder in the subject at two or more time points.
- the assessing may be based at least on the gene expression data at a panel of kidney disease or disorder-associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined at each of the two or more time points.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci may be indicative of one or more clinical indications, such as (i) a diagnosis of the kidney disease or disorder in the subject, (ii) a prognosis of the kidney disease or disorder in the subject, (iii) a progression of the kidney disease or disorder in the subject, (iv) a regression of the kidney disease or disorder in the subject, (v) an efficacy of the intervention or course of treatment for treating the kidney disease or disorder in the subject, (vi) an inefficacy of the intervention or course of treatment for treating the kidney disease or disorder in the subject, (vii) a possibility of having another type of kidney disease or disorder, (viii) a possibility of another type of co existing kidney disease being regressing or progressing, and (ix) kidney damage related to tobacco, alcohol, or drug of abuse.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined between the two or more time points may be indicative of a diagnosis of the kidney disease or disorder in the subject. For example, if the kidney disease or disorder was not detected in the subject at an earlier time point but was detected in the subject at a later time point, then the difference may be indicative of a diagnosis of the kidney disease or disorder in the subject. A clinical action or decision may be made based on this indication of diagnosis of the kidney disease or disorder in the subject, e.g., prescribing a new therapeutic intervention for the subject.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined between the two or more time points may be indicative of a prognosis of the kidney disease or disorder in the subject.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined between the two or more time points may be indicative of a regression of the kidney disease or disorder in the subject.
- the difference may be indicative of a regression (e.g., decreased tumor load, tumor burden, or tumor size) of the kidney disease or disorder in the subject.
- a clinical action or decision may be made based on this indication of the regression, e.g., continuing or ending a current therapeutic intervention for the subject.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined between the two or more time points may be indicative of an efficacy of the course of treatment for treating the kidney disease or disorder in the subject. For example, if the kidney disease or disorder was detected in the subject at an earlier time point but was not detected in the subject at a later time point, then the difference may be indicative of an efficacy of the course of treatment for treating the kidney disease or disorder in the subject. A clinical action or decision may be made based on this indication of the efficacy of the course of treatment for treating the kidney disease or disorder in the subject, e.g., continuing or ending a current therapeutic intervention for the subject.
- a difference in gene expression data at a panel of kidney disease or disorder- associated genomic loci (e.g., quantitative measures of gene expression at the kidney disease or disorder-associated genomic loci) determined between the two or more time points may be indicative of an inefficacy of the course of treatment for treating the kidney disease or disorder in the subject.
- the difference may be indicative of an inefficacy of the course of treatment for treating the kidney disease or disorder in the subject.
- a clinical action or decision may be made based on this indication of the inefficacy of the course of treatment for treating the kidney disease or disorder in the subject, e.g., ending a current therapeutic intervention and/or switching to (e.g., prescribing) a different new therapeutic intervention for the subject.
- a report may be electronically outputted that identifies or provides an indication of the identification, prognosis, regression, elevated risk of the kidney disease or disorder, or the possibility of having another type of kidney disease or disorder in the subject.
- the subject may not display a kidney disease or disorder (e.g., is asymptomatic of the kidney disease or disorder).
- the report may be presented on a graphical user interface (GUI) of an electronic device of a user.
- GUI graphical user interface
- the user may be the subject, a caretaker, a physician, a nurse, or another health care worker.
- the report may include one or more clinical indications such as (i) a diagnosis of the kidney disease or disorder in the subject, (ii) a prognosis of the kidney disease or disorder in the subject, (iii) a progression of the kidney disease or disorder in the subject, (iv) a regression of the kidney disease or disorder in the subject, (v) an efficacy of the intervention or course of treatment for treating the kidney disease or disorder in the subject, (vi) an inefficacy of the intervention or course of treatment for treating the kidney disease or disorder in the subject, (vii) a possibility of having another type of kidney disease or disorder, (viii) a possibility of another type of co-existing kidney disease being regressing or progressing, and (ix) kidney damage related to tobacco, alcohol, or drug of abuse.
- the report may include one or more clinical actions or decisions made based on these one or more clinical indications.
- a clinical indication of a diagnosis of the kidney disease or disorder in the subject may be accompanied with a clinical action of prescribing a new therapeutic intervention for the subject.
- a clinical indication of a progression of the kidney disease or disorder in the subject may be accompanied with a clinical action of prescribing a new therapeutic intervention or switching therapeutic interventions (e.g., ending a current treatment and prescribing a new treatment) for the subject.
- a clinical indication of a regression of the kidney disease or disorder in the subject may be accompanied with a clinical action of continuing or ending a current therapeutic intervention for the subject.
- a clinical indication of an efficacy of the course of treatment for treating the kidney disease or disorder in the subject may be accompanied with a clinical action of continuing or ending a current therapeutic intervention for the subject.
- a clinical indication of an inefficacy of the course of treatment for treating the kidney disease or disorder in the subject may be accompanied with a clinical action of ending a current therapeutic intervention and/or switching to (e.g., prescribing) a different new therapeutic intervention for the subject.
- FIG. 3 shows a computer system 301 that is programmed or otherwise configured to, for example, determine quantitative measures of gene expression to generate gene expression profiles of RNA molecules at genomic regions; determine a quantitative measure indicative of a presence, absence, elevated risk, or relative assessment of a kidney disease or disorder of a subject; analyze gene expression data; identify or provide an indication of the kidney disease or disorder of the subject; and electronically output a report that identifies or provides an indication of the kidney disease or disorder of the subject.
- the computer system 301 can regulate various aspects of analysis, calculation, and generation of the present disclosure, such as, for example, determining quantitative measures of gene expression to generate gene expression profiles of RNA molecules at genomic regions; determining a quantitative measure indicative of a presence, absence, elevated risk, or relative assessment of a kidney disease or disorder of a subject; analyzing gene expression data; identifying or providing an indication of the kidney disease or disorder of the subject; and electronically outputting a report that identifies or provides an indication of the kidney disease or disorder of the subject.
- the computer system 301 can be an electronic device of a user or a computer system that is remotely located with respect to the electronic device.
- the electronic device can be a mobile electronic device.
- the computer system 301 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 305, which can be a single core or multi core processor, or a plurality of processors for parallel processing.
- the computer system 301 also includes memory or memory location 310 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 315 (e.g., hard disk), communication interface 320 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 325, such as cache, other memory, data storage and/or electronic display adapters.
- CPU central processing unit
- computer processor also “computer processor” and “computer processor” herein
- the computer system 301 also includes memory or memory location 310 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 315 (e.g., hard disk), communication interface 320 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 325, such as cache, other memory
- the memory 310, storage unit 315, interface 320, and peripheral devices 325 are in communication with the CPU 305 through a communication bus (solid lines), such as a motherboard.
- the storage unit 315 can be a data storage unit (or data repository) for storing data.
- the computer system 301 can be operatively coupled to a computer network (“network”) 330 with the aid of the communication interface 320.
- the network 330 can be the Internet, an internet and/or extranet, or an intranet and/or extranet that is in communication with the Internet.
- the network 330 in some cases is a telecommunication and/or data network.
- the network 330 can include one or more computer servers, which can enable distributed computing, such as cloud computing.
- one or more computer servers may enable cloud computing over the network 330 (“the cloud”) to perform various aspects of analysis, calculation, and generation of the present disclosure, such as, for example, determining quantitative measures of gene expression to generate gene expression profiles of RNA molecules at genomic regions; determining a quantitative measure indicative of a presence, absence, elevated risk, or relative assessment of a kidney disease or disorder of a subject; analyzing gene expression data; identifying or providing an indication of the kidney disease or disorder of the subject; and electronically outputting a report that identifies or provides an indication of the kidney disease or disorder of the subject.
- cloud computing may be provided by cloud computing platforms such as, for example, Amazon Web Services (AWS), Microsoft Azure, Google Cloud Platform, and IBM cloud.
- the network 330 in some cases with the aid of the computer system 301, can implement a peer-to-peer network, which may enable devices coupled to the computer system 301 to behave as a client or a server.
- the CPU 305 may comprise one or more computer processors and/or one or more graphics processing units (GPUs).
- the CPU 305 can execute a sequence of machine-readable instructions, which can be embodied in a program or software.
- the instructions may be stored in a memory location, such as the memory 310.
- the instructions can be directed to the CPU 305, which can subsequently program or otherwise configure the CPU 305 to implement methods of the present disclosure. Examples of operations performed by the CPU 305 can include fetch, decode, execute, and writeback.
- the CPU 305 can be part of a circuit, such as an integrated circuit.
- a circuit such as an integrated circuit.
- One or more other components of the system 301 can be included in the circuit.
- the circuit is an application specific integrated circuit (ASIC).
- ASIC application specific integrated circuit
- the storage unit 315 can store files, such as drivers, libraries and saved programs.
- the storage unit 315 can store user data, e.g., user preferences and user programs.
- the computer system 301 in some cases can include one or more additional data storage units that are external to the computer system 301, such as located on a remote server that is in communication with the computer system 301 through an intranet or the Internet.
- the computer system 301 can communicate with one or more remote computer systems through the network 330.
- the computer system 301 can communicate with a remote computer system of a user.
- remote computer systems include personal computers (e.g., portable PC), slate or tablet PC’s (e.g., Apple® iPad, Samsung® Galaxy Tab), telephones, Smart phones (e.g., Apple® iPhone, Android-enabled device, Blackberry®), or personal digital assistants.
- the user can access the computer system 301 via the network 330.
- Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 301, such as, for example, on the memory 310 or electronic storage unit 315.
- the machine executable or machine-readable code can be provided in the form of software. During use, the code can be executed by the processor 305. In some cases, the code can be retrieved from the storage unit 315 and stored on the memory 310 for ready access by the processor 305. In some situations, the electronic storage unit 315 can be precluded, and machine-executable instructions are stored on memory 310. [0177]
- the code can be pre-compiled and configured for use with a machine having a processer adapted to execute the code, or can be compiled during runtime. The code can be supplied in a programming language that can be selected to enable the code to execute in a pre compiled or as-compiled fashion.
- aspects of the systems and methods provided herein can be embodied in programming.
- Various aspects of the technology may be thought of as “products” or “articles of manufacture” typically in the form of machine (or processor) executable code and/or associated data that is carried on or embodied in a type of machine- readable medium.
- Machine-executable code can be stored on an electronic storage unit, such as memory (e.g., read-only memory, random-access memory, flash memory) or a hard disk.
- “Storage” type media can include any or all of the tangible memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide non-transitory storage at any time for the software programming. All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer into the computer platform of an application server.
- another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links.
- a machine-readable medium such as computer-executable code
- a tangible storage medium such as computer-executable code
- Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings.
- Volatile storage media include dynamic memory, such as main memory of such a computer platform.
- Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system.
- Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications.
- RF radio frequency
- IR infrared
- Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data.
- Many of these forms of computer-readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
- the computer system 301 can include or be in communication with an electronic display 335 that comprises a user interface (E ⁇ ) 340 for providing, for example, a visual display of data (e.g., gene expression data) indicative of a presence, absence, or relative assessment of kidney disease or disorder of a subject; a determined presence, absence, elevated risk, or relative assessment of kidney disease or disorder of a subject, an identification of a subject as having kidney disease or disorder; or an electronic report that identifies or provides an indication of the kidney disease or disorder of the subject.
- UFs include, without limitation, a graphical user interface (GET) and web-based user interface.
- Methods and systems of the present disclosure can be implemented by way of one or more algorithms.
- An algorithm can be implemented by way of software upon execution by the central processing unit 305.
- the algorithm can, for example, determine quantitative measures of gene expression to generate gene expression profiles of RNA molecules at genomic regions; determine a quantitative measure indicative of a presence, absence, elevated risk, or relative assessment of a kidney disease or disorder of a subject; analyze gene expression data; identify or provide an indication of the kidney disease or disorder of the subject; and electronically output a report that identifies or provides an indication of the kidney disease or disorder of the subject.
- urine samples were analyzed from subjects to non-invasively assess diabetic nephropathy (DN).
- DN diabetic nephropathy
- NAG no diabetes or diabetes without kidney manifestation
- CKD non-diabetic chronic kidney disease patients
- RNA samples were isolated from the urine samples and subjected to whole transcriptome RNA sequencing.
- the library prep kits may be, for example, chosen from Illumina’s Nextera RNA Enrichment Tagmentation kit; Illumina’s Truseq RNA Exome kit; Agilent’s SureSelect XT HS2 RNA prep kit; and KAPA RNA HyperPrep from Roche.
- FIGs. 2A-2C illustrate examples of dimensionality reduction analysis for diabetic nephropathy (DN) across different age groups (FIG. 2A), different gender groups (FIG. 2B), and different race/ethnicity groups (FIG. 2C). These figures show that no distinct clusters are formed across different age groups (FIG. 2A), and different race/ethnicity groups (FIG. 2C). Male and female subjects formed two distinct clusters (FIG. 2B), so the data were split into male and female groups for separate analysis.
- DN diabetic nephropathy
- the sample is classified as “yes” or “positive” for DN (e.g., the subject has DN); otherwise, the sample is classified as “no” or “negative” (e.g., the subject does not have DN).
- the threshold e.g., 0.5
- Diabetic nephropathy was typically found to be confidently called only when scores from both the DN vs. CKD model and the DN vs. NEG model were above a pre-determined threshold (e.g., about 0.70). For example, when considering only a single biomarker score, a score of DN vs. NEG that is greater than 0.5 may indicate glomerular injury, while a score of DN vs. NEG that is less than 0.5 may indicate tubular injury. Therefore, performing such a two- biomarker assessment approach advantageously yielded significantly increased specificity, as shown in Table 1.
- a pre-determined threshold e.g. 0.70
- Table 1 Clinical explanations of DN, NEG, and CKD scores in patients
- Table 2 Clinical explanations of DN, NEG, and CKD scores in patients
- a therapeutic intervention may be prescribed and/or administered to the patient, such as near-normal blood glucose control, antihypertensive treatment, and restriction of dietary proteins.
- Various drugs may be administered, such as hormones (e.g., insulin), sulfonylureas, biguanides, angiotensin-converting enzyme (ACE) inhibitors, angiotensin receptor blockers (ARBs), beta-adrenergic blocking agents, calcium channel blockers, and diuretics.
- hormones e.g., insulin
- ACE angiotensin-converting enzyme
- ARBs angiotensin receptor blockers
- beta-adrenergic blocking agents e.g., beta-adrenergic blocking agents, calcium channel blockers, and diuretics.
- the biopsy-confirmed samples were scored, and most of them matched. In each comparison, whether DN vs. CKD or DN vs. NEG, several thousand random split events were performed (e.g., 2,000 to 5,000 times). The most frequently occurring genes were identified and selected as the gene signature for diabetic nephropathy. In this case, four sets of gene signatures were identified: two for male and two for female (e.g., DN vs. CKD and DN vs. NEG for each), as shown below in Table 3, Table 4, Table 5, and Table 6. For each of these 4 tables, four or five sets (of various sizes) of strongly predictive differential gene expression markers are provided.
- Table 3 Differential gene expression markers for assessment of DN vs. NEG (female subjects)
- Table 4 Differential gene expression markers for assessment of DN vs. CKD (female subjects)
- Table 5 Differential gene expression markers for assessment of DN vs. NEG (male subjects)
- Table 6 Differential gene expression markers for assessment of DN vs. CKD (male subjects)
- DN diabetic nephropathy
- 2 gene signatures may be used for a male subject (a male-specific DN vs. NEG gene signature, and a male-specific DN vs. CKD gene signature), while 2 different gene signatures may be used for a female subject (a female-specific DN vs. NEG gene signature, and a female-specific DN vs. CKD gene signature).
- a multi-class classifier may be directly trained using a training dataset including positive cases and negative controls for two or more kidney diseases or disorders, such as DN and another kidney disease or disorder (e.g., DN and CKD), by using the gene markers listed in Table 3, Table 4, Table 5, and Table 6 as input features for the multi-class classifier.
- the multi-class classifier may be trained to distinguish between three or more cases: positive for a first kidney disease or disorder, positive for a second kidney disease or disorder, and negative (e.g., having no kidney disease or disorder).
- Patients with diabetic nephropathy may benefit from treatments being developed using regenerative medicine. These techniques may help reverse or slow kidney damage caused by the disease. For example, if a patient’s diabetes can be cured by a treatment such as pancreas islet cell transplant or stem cell therapy, their kidney function may improve. In addition, novel therapies such as stem cell therapies and new medications may be developed for use in treating diabetic nephropathy.
- kidney damage e.g., non-diabetic nephropathy
- kidney disease-vs. -negative score for a given kidney disease increases significantly (e.g., by more than 0.1) as a result of replacing nicotine-dependent subjects in the NO group with non-smoker subjects, then it may be likely that albuminuria may be a result of nicotine use.
- kidney diseases and disorders such as hypertensive nephropathy, IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSAIDs induced nephrotoxicity, thin basement membrane nephropathy, amyloidosis, ANCA vasculitis related to endocarditis and other infections, cardiorenal syndrome, IgG4 nephropathy, interstitial nephritis, lithium nephrotoxicity, lupus nephritis, multiple myeloma, polycystic kidney disease, pyelonephritis (kidney infection), renal artery stenosis, renal cyst, rheumatoid arthritis- associated renal disease, and kidney stones.
- hypertensive nephropathy IgA nephropathy, membranous nephropathy, minimal change disease, focal segmental glomerulosclerosis (FSGS), NSA
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Pathology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
La présente invention concerne des méthodes et des systèmes destinés à la détection d'une maladie ou d'un trouble rénal. La méthode de traitement ou d'analyse d'un échantillon corporel prélevé sur un sujet peut comprendre les étapes suivantes : (a) analyse de l'échantillon corporel pour produire un ensemble de données comprenant un ou plusieurs niveaux de produits d'expression génétique dans l'échantillon corporel, ce ou ces niveaux de produits d'expression génétique correspondant à un ensemble de gènes associés à une maladie ou un trouble rénal ; (b) traitement informatique de l'ensemble de données pour déterminer la présence ou un risque élevé de la maladie ou du trouble rénal chez le sujet ; et (c) sortie électronique d'un rapport indiquant la présence ou le risque élevé de la maladie ou du trouble rénal chez le sujet.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2021/033084 WO2022245342A1 (fr) | 2021-05-19 | 2021-05-19 | Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique |
CN202180099114.4A CN117580962A (zh) | 2021-05-19 | 2021-05-19 | 通过基因表达分析检测肾脏疾病或受损的方法和系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2021/033084 WO2022245342A1 (fr) | 2021-05-19 | 2021-05-19 | Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022245342A1 true WO2022245342A1 (fr) | 2022-11-24 |
Family
ID=84141579
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2021/033084 WO2022245342A1 (fr) | 2021-05-19 | 2021-05-19 | Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN117580962A (fr) |
WO (1) | WO2022245342A1 (fr) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070049616A1 (en) * | 2005-08-25 | 2007-03-01 | Ksander Gary M | Organic compounds |
US20130079244A1 (en) * | 2010-04-18 | 2013-03-28 | Giulio Genovese | Methods of predicting predisposition to or risk of kidney disease |
US20150167085A1 (en) * | 2013-09-09 | 2015-06-18 | Daniel R. Salomon | Methods and Systems for Analysis of Organ Transplantation |
US20160033508A1 (en) * | 2014-01-27 | 2016-02-04 | Epic Sciences, Inc. | Circulating tumor cell diagnostics for prostate cancer biomarkers |
US20160263195A1 (en) * | 2013-09-20 | 2016-09-15 | University Of Virginia Patent Foundation | Compositions and methods for protecting the kidney from ischemia reperfusion injury |
-
2021
- 2021-05-19 WO PCT/US2021/033084 patent/WO2022245342A1/fr active Application Filing
- 2021-05-19 CN CN202180099114.4A patent/CN117580962A/zh active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070049616A1 (en) * | 2005-08-25 | 2007-03-01 | Ksander Gary M | Organic compounds |
US20130079244A1 (en) * | 2010-04-18 | 2013-03-28 | Giulio Genovese | Methods of predicting predisposition to or risk of kidney disease |
US20150167085A1 (en) * | 2013-09-09 | 2015-06-18 | Daniel R. Salomon | Methods and Systems for Analysis of Organ Transplantation |
US20160263195A1 (en) * | 2013-09-20 | 2016-09-15 | University Of Virginia Patent Foundation | Compositions and methods for protecting the kidney from ischemia reperfusion injury |
US20160033508A1 (en) * | 2014-01-27 | 2016-02-04 | Epic Sciences, Inc. | Circulating tumor cell diagnostics for prostate cancer biomarkers |
Non-Patent Citations (1)
Title |
---|
PAPALE MASSIMO, DI PAOLO SALVATORE, MAGISTRONI RICCARDO, LAMACCHIA OLGA, DI PALMA ANNA MARIA, DE MATTIA ANGELA, TERESA ROCCHETTI M: "Urine Proteome Analysis May Allow Noninvasive Differential Diagnosis of Diabetic Nephropathy", DIABETES CARE, vol. 33, no. 11, 2010, pages 2409 - 2415, XP093011050 * |
Also Published As
Publication number | Publication date |
---|---|
CN117580962A (zh) | 2024-02-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020221278A1 (en) | Methods and systems for determining a pregnancy-related state of a subject | |
WO2019023517A2 (fr) | Classificateur de séquençage génomique | |
JP2022521791A (ja) | 病原体検出のための配列決定データを使用するためのシステムおよび方法 | |
US20230160019A1 (en) | Rna markers and methods for identifying colon cell proliferative disorders | |
US20220372573A1 (en) | Methods and systems for detection of kidney disease or disorder by gene expression analysis | |
US20230175058A1 (en) | Methods and systems for abnormality detection in the patterns of nucleic acids | |
WO2020088596A1 (fr) | Méthodes, systèmes et nécessaires de prédiction de condition de naissance prématurée | |
JP2017516501A (ja) | 肺がんのタイピング法 | |
EP4121553A1 (fr) | Systèmes et procédés de détection d'un risque de maladie d'alzheimer à l'aide d'un dosage de profilage d'arnm sans circulation | |
JP2022501033A (ja) | 膵臓病変の評価における無細胞dnaヒドロキシメチル化プロファイル | |
JP2022524382A (ja) | 前立腺がんを予測するための方法およびその使用 | |
CA3221494A1 (fr) | Methode, kit et systeme de detection de cancer | |
US20220213558A1 (en) | Methods and systems for urine-based detection of urologic conditions | |
US20220042106A1 (en) | Systems and methods of using cell-free nucleic acids to tailor cancer treatment | |
US20180371553A1 (en) | Methods and compositions for the analysis of cancer biomarkers | |
US20220084632A1 (en) | Clinical classfiers and genomic classifiers and uses thereof | |
WO2020194057A1 (fr) | Biomarqueurs pour la détection de maladies | |
EP3810807A1 (fr) | Méthodes et compositions pour l'analyse de biomarqueurs du cancer | |
WO2022245342A1 (fr) | Méthodes et systèmes pour la détection d'une maladie ou d'un trouble rénal par analyse d'expression génétique | |
WO2019018764A1 (fr) | Méthodes de détermination de réponse à des inhibiteurs de parp | |
WO2018210338A1 (fr) | Procédés de détection d'affections malignes du côlon | |
WO2021041968A1 (fr) | Systèmes et procédés pour prédire et surveiller une réponse de traitement à partir d'acides nucléiques acellulaires | |
US20240254566A1 (en) | Analysis method, kit, and detection device | |
EP4379064A1 (fr) | Procédés de diagnostic et dosage diagnostique comprenant la détection d'arnlnc | |
WO2024155681A1 (fr) | Procédés et systèmes de détection et d'évaluation des maladies hépatiques |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21940988 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202180099114.4 Country of ref document: CN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21940988 Country of ref document: EP Kind code of ref document: A1 |