CN116732164A - Biomarker combinations and their use in predicting ASD disease - Google Patents
Biomarker combinations and their use in predicting ASD disease Download PDFInfo
- Publication number
- CN116732164A CN116732164A CN202310725801.3A CN202310725801A CN116732164A CN 116732164 A CN116732164 A CN 116732164A CN 202310725801 A CN202310725801 A CN 202310725801A CN 116732164 A CN116732164 A CN 116732164A
- Authority
- CN
- China
- Prior art keywords
- autism spectrum
- spectrum disorder
- sample
- combination
- biomarker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000000090 biomarker Substances 0.000 title claims abstract description 54
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 title claims abstract description 52
- 208000035478 Interatrial communication Diseases 0.000 title description 9
- 206010003664 atrial septal defect Diseases 0.000 title description 9
- 208000029560 autism spectrum disease Diseases 0.000 claims abstract description 82
- 238000001514 detection method Methods 0.000 claims abstract description 12
- 230000035945 sensitivity Effects 0.000 claims abstract description 8
- 201000010099 disease Diseases 0.000 claims abstract description 6
- 102000004169 proteins and genes Human genes 0.000 claims description 58
- 108090000623 proteins and genes Proteins 0.000 claims description 58
- 239000000523 sample Substances 0.000 claims description 47
- 239000003153 chemical reaction reagent Substances 0.000 claims description 23
- 238000000034 method Methods 0.000 claims description 21
- 102100020760 Ferritin heavy chain Human genes 0.000 claims description 15
- 101001002987 Homo sapiens Ferritin heavy chain Proteins 0.000 claims description 15
- 102100033409 40S ribosomal protein S3 Human genes 0.000 claims description 14
- 102100023833 ADP-ribosylation factor 5 Human genes 0.000 claims description 14
- 102100024005 Acid ceramidase Human genes 0.000 claims description 14
- 102100024436 Caldesmon Human genes 0.000 claims description 14
- 108010091675 Cellular Apoptosis Susceptibility Protein Proteins 0.000 claims description 14
- 102100036213 Collagen alpha-2(I) chain Human genes 0.000 claims description 14
- 102100025621 Cytochrome b-245 heavy chain Human genes 0.000 claims description 14
- 102100032249 Dystonin Human genes 0.000 claims description 14
- 108010009900 Endothelial Protein C Receptor Proteins 0.000 claims description 14
- 102000009839 Endothelial Protein C Receptor Human genes 0.000 claims description 14
- 102100029091 Exportin-2 Human genes 0.000 claims description 14
- 102100036702 Glucosamine-6-phosphate isomerase 2 Human genes 0.000 claims description 14
- 102100040408 Heat shock 70 kDa protein 1-like Human genes 0.000 claims description 14
- 102100027421 Heat shock cognate 71 kDa protein Human genes 0.000 claims description 14
- 102100030826 Hemoglobin subunit epsilon Human genes 0.000 claims description 14
- 101000975753 Homo sapiens Acid ceramidase Proteins 0.000 claims description 14
- 101000910297 Homo sapiens Caldesmon Proteins 0.000 claims description 14
- 101000875067 Homo sapiens Collagen alpha-2(I) chain Proteins 0.000 claims description 14
- 101001037977 Homo sapiens Heat shock 70 kDa protein 1-like Proteins 0.000 claims description 14
- 101001080568 Homo sapiens Heat shock cognate 71 kDa protein Proteins 0.000 claims description 14
- 101001047912 Homo sapiens Hydroxymethylglutaryl-CoA lyase, mitochondrial Proteins 0.000 claims description 14
- 101000994365 Homo sapiens Integrin alpha-6 Proteins 0.000 claims description 14
- 101000935043 Homo sapiens Integrin beta-1 Proteins 0.000 claims description 14
- 101000572820 Homo sapiens MICOS complex subunit MIC60 Proteins 0.000 claims description 14
- 101001027796 Homo sapiens Male-specific lethal 1 homolog Proteins 0.000 claims description 14
- 101001028019 Homo sapiens Metastasis-associated protein MTA2 Proteins 0.000 claims description 14
- 101000946889 Homo sapiens Monocyte differentiation antigen CD14 Proteins 0.000 claims description 14
- 101001126487 Homo sapiens Platelet factor 4 variant Proteins 0.000 claims description 14
- 101001136888 Homo sapiens Proteasome subunit alpha type-3 Proteins 0.000 claims description 14
- 101000606548 Homo sapiens Receptor-type tyrosine-protein phosphatase gamma Proteins 0.000 claims description 14
- 101000807820 Homo sapiens V-type proton ATPase subunit S1 Proteins 0.000 claims description 14
- 102100024004 Hydroxymethylglutaryl-CoA lyase, mitochondrial Human genes 0.000 claims description 14
- 102100026211 Immunoglobulin heavy constant delta Human genes 0.000 claims description 14
- 102100032816 Integrin alpha-6 Human genes 0.000 claims description 14
- 102100025304 Integrin beta-1 Human genes 0.000 claims description 14
- 102100026639 MICOS complex subunit MIC60 Human genes 0.000 claims description 14
- 102100025354 Macrophage mannose receptor 1 Human genes 0.000 claims description 14
- 102100037511 Metastasis-associated protein MTA2 Human genes 0.000 claims description 14
- 102100035877 Monocyte differentiation antigen CD14 Human genes 0.000 claims description 14
- 108010082739 NADPH Oxidase 2 Proteins 0.000 claims description 14
- 102100038756 POTE ankyrin domain family member J Human genes 0.000 claims description 14
- 102100030582 Platelet factor 4 variant Human genes 0.000 claims description 14
- 102100037935 Polyubiquitin-C Human genes 0.000 claims description 14
- 102100035908 Proteasome subunit alpha type-3 Human genes 0.000 claims description 14
- 102100039661 Receptor-type tyrosine-protein phosphatase gamma Human genes 0.000 claims description 14
- 108010021188 Superoxide Dismutase-1 Proteins 0.000 claims description 14
- 102100033598 Triosephosphate isomerase Human genes 0.000 claims description 14
- 102100037090 V-type proton ATPase subunit S1 Human genes 0.000 claims description 14
- 102100021161 Vasorin Human genes 0.000 claims description 14
- 238000003745 diagnosis Methods 0.000 claims description 14
- 238000007477 logistic regression Methods 0.000 claims description 14
- PJOHVEQSYPOERL-SHEAVXILSA-N (e)-n-[(4r,4as,7ar,12br)-3-(cyclopropylmethyl)-9-hydroxy-7-oxo-2,4,5,6,7a,13-hexahydro-1h-4,12-methanobenzofuro[3,2-e]isoquinoline-4a-yl]-3-(4-methylphenyl)prop-2-enamide Chemical compound C1=CC(C)=CC=C1\C=C\C(=O)N[C@]1(CCC(=O)[C@@H]2O3)[C@H]4CC5=CC=C(O)C3=C5[C@]12CCN4CC1CC1 PJOHVEQSYPOERL-SHEAVXILSA-N 0.000 claims description 13
- 102100026936 2-oxoglutarate dehydrogenase, mitochondrial Human genes 0.000 claims description 13
- QHXIQBNUQFLDAB-UHFFFAOYSA-N 4-N,6-N-dimethyl-2-N-propan-2-yl-1,3,5-triazine-2,4,6-triamine Chemical compound CNC1=NC(NC)=NC(NC(C)C)=N1 QHXIQBNUQFLDAB-UHFFFAOYSA-N 0.000 claims description 13
- 102100034283 Annexin A5 Human genes 0.000 claims description 13
- 101000693933 Arabidopsis thaliana Fructose-bisphosphate aldolase 8, cytosolic Proteins 0.000 claims description 13
- 101710134389 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Proteins 0.000 claims description 13
- 102100023126 Cell surface glycoprotein MUC18 Human genes 0.000 claims description 13
- 102100030844 Exocyst complex component 1 Human genes 0.000 claims description 13
- 102100029115 Fumarylacetoacetase Human genes 0.000 claims description 13
- 102100041034 Glucosamine-6-phosphate isomerase 1 Human genes 0.000 claims description 13
- 102100025989 Glyoxalase domain-containing protein 4 Human genes 0.000 claims description 13
- 101000982656 Homo sapiens 2-oxoglutarate dehydrogenase, mitochondrial Proteins 0.000 claims description 13
- 101000656561 Homo sapiens 40S ribosomal protein S3 Proteins 0.000 claims description 13
- 101000684206 Homo sapiens ADP-ribosylation factor 5 Proteins 0.000 claims description 13
- 101000780122 Homo sapiens Annexin A5 Proteins 0.000 claims description 13
- 101000623903 Homo sapiens Cell surface glycoprotein MUC18 Proteins 0.000 claims description 13
- 101001016186 Homo sapiens Dystonin Proteins 0.000 claims description 13
- 101001072480 Homo sapiens Glucosamine-6-phosphate isomerase 2 Proteins 0.000 claims description 13
- 101000857136 Homo sapiens Glyoxalase domain-containing protein 4 Proteins 0.000 claims description 13
- 101001083591 Homo sapiens Hemoglobin subunit epsilon Proteins 0.000 claims description 13
- 101000777670 Homo sapiens Hsp90 co-chaperone Cdc37 Proteins 0.000 claims description 13
- 101001055307 Homo sapiens Immunoglobulin heavy constant delta Proteins 0.000 claims description 13
- 101001052493 Homo sapiens Mitogen-activated protein kinase 1 Proteins 0.000 claims description 13
- 101001008394 Homo sapiens Nucleobindin-1 Proteins 0.000 claims description 13
- 101000741898 Homo sapiens POTE ankyrin domain family member J Proteins 0.000 claims description 13
- 101000662049 Homo sapiens Polyubiquitin-C Proteins 0.000 claims description 13
- 101000577619 Homo sapiens Profilin-1 Proteins 0.000 claims description 13
- 101000801742 Homo sapiens Triosephosphate isomerase Proteins 0.000 claims description 13
- 101000639802 Homo sapiens U2 small nuclear ribonucleoprotein B'' Proteins 0.000 claims description 13
- 101000750267 Homo sapiens Vasorin Proteins 0.000 claims description 13
- 102100031568 Hsp90 co-chaperone Cdc37 Human genes 0.000 claims description 13
- 108010031099 Mannose Receptor Proteins 0.000 claims description 13
- 102100024193 Mitogen-activated protein kinase 1 Human genes 0.000 claims description 13
- 102100027439 Nucleobindin-1 Human genes 0.000 claims description 13
- 102100026071 Olfactomedin-4 Human genes 0.000 claims description 13
- 102100028857 Profilin-1 Human genes 0.000 claims description 13
- 102100023341 Ubiquitin-40S ribosomal protein S27a Human genes 0.000 claims description 13
- 101000938454 Homo sapiens Exocyst complex component 1 Proteins 0.000 claims description 12
- 101000914673 Homo sapiens Fanconi anemia group A protein Proteins 0.000 claims description 12
- 101000918487 Homo sapiens Fumarylacetoacetase Proteins 0.000 claims description 12
- 101001039324 Homo sapiens Glucosamine-6-phosphate isomerase 1 Proteins 0.000 claims description 12
- 101000990985 Homo sapiens Myosin regulatory light chain 12B Proteins 0.000 claims description 12
- 101001120760 Homo sapiens Olfactomedin-4 Proteins 0.000 claims description 12
- 101001115218 Homo sapiens Ubiquitin-40S ribosomal protein S27a Proteins 0.000 claims description 12
- 102100030330 Myosin regulatory light chain 12B Human genes 0.000 claims description 12
- 101000832669 Rattus norvegicus Probable alcohol sulfotransferase Proteins 0.000 claims description 12
- 102100033632 Tropomyosin alpha-1 chain Human genes 0.000 claims description 12
- 102100028313 Fibrinogen beta chain Human genes 0.000 claims description 11
- 101000917163 Homo sapiens Fibrinogen beta chain Proteins 0.000 claims description 11
- 101000801701 Homo sapiens Tropomyosin alpha-1 chain Proteins 0.000 claims description 11
- 108010009513 Mitochondrial Aldehyde Dehydrogenase Proteins 0.000 claims description 11
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 10
- 238000010801 machine learning Methods 0.000 claims description 9
- 210000004369 blood Anatomy 0.000 claims description 8
- 239000008280 blood Substances 0.000 claims description 8
- 238000003860 storage Methods 0.000 claims description 8
- -1 myll 12B Proteins 0.000 claims description 7
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 7
- 238000010200 validation analysis Methods 0.000 claims description 7
- 108010033276 Peptide Fragments Proteins 0.000 claims description 6
- 102000007079 Peptide Fragments Human genes 0.000 claims description 6
- 108090000144 Human Proteins Proteins 0.000 claims description 4
- 102000003839 Human Proteins Human genes 0.000 claims description 4
- 230000006870 function Effects 0.000 claims description 4
- 238000000692 Student's t-test Methods 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 238000012353 t test Methods 0.000 claims description 3
- 210000001175 cerebrospinal fluid Anatomy 0.000 claims description 2
- 229940074200 diamode Drugs 0.000 claims description 2
- PGYPOBZJRVSMDS-UHFFFAOYSA-N loperamide hydrochloride Chemical compound Cl.C=1C=CC=CC=1C(C=1C=CC=CC=1)(C(=O)N(C)C)CCN(CC1)CCC1(O)C1=CC=C(Cl)C=C1 PGYPOBZJRVSMDS-UHFFFAOYSA-N 0.000 claims description 2
- 108020004999 messenger RNA Proteins 0.000 claims description 2
- 108020004707 nucleic acids Proteins 0.000 claims description 2
- 102000039446 nucleic acids Human genes 0.000 claims description 2
- 150000007523 nucleic acids Chemical class 0.000 claims description 2
- 210000003296 saliva Anatomy 0.000 claims description 2
- 238000012163 sequencing technique Methods 0.000 claims description 2
- 238000013518 transcription Methods 0.000 claims description 2
- 230000035897 transcription Effects 0.000 claims description 2
- 210000002700 urine Anatomy 0.000 claims description 2
- 102100027667 Carboxy-terminal domain RNA polymerase II polypeptide A small phosphatase 2 Human genes 0.000 claims 4
- 102000009645 Mitochondrial Aldehyde Dehydrogenase Human genes 0.000 claims 4
- 102100038836 Superoxide dismutase [Cu-Zn] Human genes 0.000 claims 4
- 102100034461 U2 small nuclear ribonucleoprotein B'' Human genes 0.000 claims 4
- 238000012795 verification Methods 0.000 claims 2
- 239000003795 chemical substances by application Substances 0.000 claims 1
- 230000002349 favourable effect Effects 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 33
- 210000002381 plasma Anatomy 0.000 description 11
- 102100037714 Male-specific lethal 1 homolog Human genes 0.000 description 10
- 102100021588 Sterol carrier protein 2 Human genes 0.000 description 10
- 102000008221 Superoxide Dismutase-1 Human genes 0.000 description 10
- 102100033816 Aldehyde dehydrogenase, mitochondrial Human genes 0.000 description 8
- 238000011002 quantification Methods 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- 206010003805 Autism Diseases 0.000 description 6
- 208000020706 Autistic disease Diseases 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 238000012549 training Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 5
- 238000007405 data analysis Methods 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 241001412224 Firmiana Species 0.000 description 3
- 238000009534 blood test Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000004949 mass spectrometry Methods 0.000 description 3
- 238000001819 mass spectrum Methods 0.000 description 3
- 208000020016 psychiatric disease Diseases 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 2
- 101000788517 Homo sapiens Tubulin beta-2A chain Proteins 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- 102100025225 Tubulin beta-2A chain Human genes 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- 238000003759 clinical diagnosis Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 210000001808 exosome Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- 238000007710 freezing Methods 0.000 description 2
- 230000008014 freezing Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000003147 molecular marker Substances 0.000 description 2
- 230000001575 pathological effect Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- UOTMYNBWXDUBNX-UHFFFAOYSA-N 1-[(3,4-dimethoxyphenyl)methyl]-6,7-dimethoxyisoquinolin-2-ium;chloride Chemical compound Cl.C1=C(OC)C(OC)=CC=C1CC1=NC=CC2=CC(OC)=C(OC)C=C12 UOTMYNBWXDUBNX-UHFFFAOYSA-N 0.000 description 1
- OENIXTHWZWFYIV-UHFFFAOYSA-N 2-[4-[2-[5-(cyclopentylmethyl)-1h-imidazol-2-yl]ethyl]phenyl]benzoic acid Chemical compound OC(=O)C1=CC=CC=C1C(C=C1)=CC=C1CCC(N1)=NC=C1CC1CCCC1 OENIXTHWZWFYIV-UHFFFAOYSA-N 0.000 description 1
- 101710139725 ADP-ribosylation factor 5 Proteins 0.000 description 1
- 102100022974 AP-2 complex subunit alpha-2 Human genes 0.000 description 1
- 206010000117 Abnormal behaviour Diseases 0.000 description 1
- 102100039819 Actin, alpha cardiac muscle 1 Human genes 0.000 description 1
- 102100026656 Actin, alpha skeletal muscle Human genes 0.000 description 1
- 102100036732 Actin, aortic smooth muscle Human genes 0.000 description 1
- 101710106492 Acyl-CoA-binding protein Proteins 0.000 description 1
- 101710169323 Acyl-CoA-binding protein homolog Proteins 0.000 description 1
- 102100038920 Alpha-S1-casein Human genes 0.000 description 1
- 102000006589 Alpha-ketoglutarate dehydrogenase Human genes 0.000 description 1
- 108020004306 Alpha-ketoglutarate dehydrogenase Proteins 0.000 description 1
- ATRRKUHOCOJYRX-UHFFFAOYSA-N Ammonium bicarbonate Chemical compound [NH4+].OC([O-])=O ATRRKUHOCOJYRX-UHFFFAOYSA-N 0.000 description 1
- 229910000013 Ammonium bicarbonate Inorganic materials 0.000 description 1
- VHUUQVKOLVNVRT-UHFFFAOYSA-N Ammonium hydroxide Chemical compound [NH4+].[OH-] VHUUQVKOLVNVRT-UHFFFAOYSA-N 0.000 description 1
- 102000004121 Annexin A5 Human genes 0.000 description 1
- 108090000672 Annexin A5 Proteins 0.000 description 1
- 101001112203 Arabidopsis thaliana Neutral ceramidase 1 Proteins 0.000 description 1
- 102100027961 BAG family molecular chaperone regulator 2 Human genes 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 102100028737 CAP-Gly domain-containing linker protein 1 Human genes 0.000 description 1
- 108010025714 CD146 Antigen Proteins 0.000 description 1
- 102000049932 CD146 Antigen Human genes 0.000 description 1
- 102100029968 Calreticulin Human genes 0.000 description 1
- 102100023804 Coagulation factor VII Human genes 0.000 description 1
- 102100035321 Complement factor H-related protein 3 Human genes 0.000 description 1
- 102100035324 Complement factor H-related protein 4 Human genes 0.000 description 1
- 102100030960 DNA replication licensing factor MCM2 Human genes 0.000 description 1
- 108010013976 Dystonin Proteins 0.000 description 1
- 102100038912 E3 SUMO-protein ligase RanBP2 Human genes 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 102100036725 Epithelial discoidin domain-containing receptor 1 Human genes 0.000 description 1
- 101710131668 Epithelial discoidin domain-containing receptor 1 Proteins 0.000 description 1
- 102100039466 Eukaryotic translation initiation factor 5B Human genes 0.000 description 1
- 101710170857 Exocyst complex component 1 Proteins 0.000 description 1
- 102100025637 FACT complex subunit SPT16 Human genes 0.000 description 1
- 108010023321 Factor VII Proteins 0.000 description 1
- 102100031509 Fibrillin-1 Human genes 0.000 description 1
- 102100027944 Flavin reductase (NADPH) Human genes 0.000 description 1
- 101710205671 Glucosamine-6-phosphate deaminase 1 Proteins 0.000 description 1
- 101710087553 Glucosamine-6-phosphate isomerase 1 Proteins 0.000 description 1
- 101710087552 Glucosamine-6-phosphate isomerase 2 Proteins 0.000 description 1
- 102100024013 Golgi SNAP receptor complex member 2 Human genes 0.000 description 1
- 101710096764 Hemoglobin subunit epsilon-1 Proteins 0.000 description 1
- 102100021643 Histone H2A type 2-B Human genes 0.000 description 1
- 102100034533 Histone H2AX Human genes 0.000 description 1
- 101000757394 Homo sapiens AP-2 complex subunit alpha-2 Proteins 0.000 description 1
- 101000959247 Homo sapiens Actin, alpha cardiac muscle 1 Proteins 0.000 description 1
- 101000834207 Homo sapiens Actin, alpha skeletal muscle Proteins 0.000 description 1
- 101000929319 Homo sapiens Actin, aortic smooth muscle Proteins 0.000 description 1
- 101000779444 Homo sapiens Aldehyde dehydrogenase, mitochondrial Proteins 0.000 description 1
- 101000741048 Homo sapiens Alpha-S1-casein Proteins 0.000 description 1
- 101000697872 Homo sapiens BAG family molecular chaperone regulator 2 Proteins 0.000 description 1
- 101000767052 Homo sapiens CAP-Gly domain-containing linker protein 1 Proteins 0.000 description 1
- 101000793651 Homo sapiens Calreticulin Proteins 0.000 description 1
- 101000878136 Homo sapiens Complement factor H-related protein 3 Proteins 0.000 description 1
- 101000878133 Homo sapiens Complement factor H-related protein 4 Proteins 0.000 description 1
- 101000583807 Homo sapiens DNA replication licensing factor MCM2 Proteins 0.000 description 1
- 101001018431 Homo sapiens DNA replication licensing factor MCM7 Proteins 0.000 description 1
- 101001036496 Homo sapiens Eukaryotic translation initiation factor 5B Proteins 0.000 description 1
- 101000836111 Homo sapiens FACT complex subunit SPT16 Proteins 0.000 description 1
- 101000846893 Homo sapiens Fibrillin-1 Proteins 0.000 description 1
- 101000935587 Homo sapiens Flavin reductase (NADPH) Proteins 0.000 description 1
- 101000904234 Homo sapiens Golgi SNAP receptor complex member 2 Proteins 0.000 description 1
- 101000898908 Homo sapiens Histone H2A type 2-B Proteins 0.000 description 1
- 101001067891 Homo sapiens Histone H2AX Proteins 0.000 description 1
- 101000985261 Homo sapiens Hornerin Proteins 0.000 description 1
- 101001046980 Homo sapiens KN motif and ankyrin repeat domain-containing protein 2 Proteins 0.000 description 1
- 101000576894 Homo sapiens Macrophage mannose receptor 1 Proteins 0.000 description 1
- 101000583944 Homo sapiens Methionine adenosyltransferase 2 subunit beta Proteins 0.000 description 1
- 101001114675 Homo sapiens Multimerin-2 Proteins 0.000 description 1
- 101000611202 Homo sapiens Peptidyl-prolyl cis-trans isomerase B Proteins 0.000 description 1
- 101000705756 Homo sapiens Proteasome activator complex subunit 1 Proteins 0.000 description 1
- 101000927799 Homo sapiens Rho guanine nucleotide exchange factor 6 Proteins 0.000 description 1
- 101001010890 Homo sapiens S-formylglutathione hydrolase Proteins 0.000 description 1
- 101000642478 Homo sapiens Serpin B3 Proteins 0.000 description 1
- 101001131204 Homo sapiens Sulfhydryl oxidase 1 Proteins 0.000 description 1
- 101000640289 Homo sapiens Synemin Proteins 0.000 description 1
- 101000844686 Homo sapiens Thioredoxin reductase 1, cytoplasmic Proteins 0.000 description 1
- 101000625727 Homo sapiens Tubulin beta chain Proteins 0.000 description 1
- 101000713585 Homo sapiens Tubulin beta-4A chain Proteins 0.000 description 1
- 101000841477 Homo sapiens Ubiquitin carboxyl-terminal hydrolase 14 Proteins 0.000 description 1
- 101000760747 Homo sapiens Very long-chain specific acyl-CoA dehydrogenase, mitochondrial Proteins 0.000 description 1
- 102100028627 Hornerin Human genes 0.000 description 1
- 235000003332 Ilex aquifolium Nutrition 0.000 description 1
- 235000002296 Ilex sandwicensis Nutrition 0.000 description 1
- 235000002294 Ilex volkensiana Nutrition 0.000 description 1
- 101710141123 Immunoglobulin heavy constant delta Proteins 0.000 description 1
- 102100022888 KN motif and ankyrin repeat domain-containing protein 2 Human genes 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 102000043136 MAP kinase family Human genes 0.000 description 1
- 108091054455 MAP kinase family Proteins 0.000 description 1
- 208000036626 Mental retardation Diseases 0.000 description 1
- 102100030932 Methionine adenosyltransferase 2 subunit beta Human genes 0.000 description 1
- 102100023346 Multimerin-2 Human genes 0.000 description 1
- 102000004884 Nucleobindin Human genes 0.000 description 1
- 108090001016 Nucleobindin Proteins 0.000 description 1
- 101710109505 Olfactomedin-4 Proteins 0.000 description 1
- 101000935036 Ovis aries Integrin beta-1 Proteins 0.000 description 1
- 101710189320 POTE ankyrin domain family member J Proteins 0.000 description 1
- 102100040283 Peptidyl-prolyl cis-trans isomerase B Human genes 0.000 description 1
- 102000004211 Platelet factor 4 Human genes 0.000 description 1
- 108090000778 Platelet factor 4 Proteins 0.000 description 1
- 102100030304 Platelet factor 4 Human genes 0.000 description 1
- 102000011195 Profilin Human genes 0.000 description 1
- 108050001408 Profilin Proteins 0.000 description 1
- 102100031300 Proteasome activator complex subunit 1 Human genes 0.000 description 1
- 102100031269 Putative peripheral benzodiazepine receptor-related protein Human genes 0.000 description 1
- 102100033202 Rho guanine nucleotide exchange factor 6 Human genes 0.000 description 1
- 102100029991 S-formylglutathione hydrolase Human genes 0.000 description 1
- 102100036383 Serpin B3 Human genes 0.000 description 1
- 208000025890 Social Communication disease Diseases 0.000 description 1
- 102100034371 Sulfhydryl oxidase 1 Human genes 0.000 description 1
- 102000019197 Superoxide Dismutase Human genes 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 102100033920 Synemin Human genes 0.000 description 1
- 102100031208 Thioredoxin reductase 1, cytoplasmic Human genes 0.000 description 1
- 101710194411 Triosephosphate isomerase 1 Proteins 0.000 description 1
- 101710128188 Tropomyosin alpha-1 chain Proteins 0.000 description 1
- 101710186379 Tropomyosin-1 Proteins 0.000 description 1
- 102100036788 Tubulin beta-4A chain Human genes 0.000 description 1
- 108010056354 Ubiquitin C Proteins 0.000 description 1
- 102100029163 Ubiquitin carboxyl-terminal hydrolase 14 Human genes 0.000 description 1
- 108010075653 Utrophin Proteins 0.000 description 1
- 102100029092 Utrophin Human genes 0.000 description 1
- 101710090241 Vasorin Proteins 0.000 description 1
- 102100024591 Very long-chain specific acyl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 1
- 101001003006 Xenopus laevis Ferritin heavy chain B Proteins 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-N ammonia Natural products N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 1
- 235000012538 ammonium bicarbonate Nutrition 0.000 description 1
- 239000001099 ammonium carbonate Substances 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000003542 behavioural effect Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 210000004027 cell Anatomy 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 229940105772 coagulation factor vii Drugs 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000007071 enzymatic hydrolysis Effects 0.000 description 1
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 1
- 238000011049 filling Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 108010022687 fumarylacetoacetase Proteins 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000004896 high resolution mass spectrometry Methods 0.000 description 1
- 238000003018 immunoassay Methods 0.000 description 1
- 238000003364 immunohistochemistry Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 1
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000010339 medical test Methods 0.000 description 1
- 230000003340 mental effect Effects 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000010844 nanoflow liquid chromatography Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000005086 pumping Methods 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 108010062219 ran-binding protein 2 Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108010011179 ribosomal protein S27a Proteins 0.000 description 1
- 108010033804 ribosomal protein S3 Proteins 0.000 description 1
- 101150007503 rps1 gene Proteins 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 208000019116 sleep disease Diseases 0.000 description 1
- 208000020685 sleep-wake disease Diseases 0.000 description 1
- 108010058363 sterol carrier proteins Proteins 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/573—Immunoassay; Biospecific binding assay; Materials therefor for enzymes or isoenzymes
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6893—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids related to diseases not provided for elsewhere
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/30—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/30—Psychoses; Psychiatry
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Medical Informatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Pathology (AREA)
- Microbiology (AREA)
- Organic Chemistry (AREA)
- Biochemistry (AREA)
- Public Health (AREA)
- Cell Biology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Biophysics (AREA)
- Epidemiology (AREA)
- General Engineering & Computer Science (AREA)
- Primary Health Care (AREA)
- Artificial Intelligence (AREA)
- Bioethics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Bioinformatics & Computational Biology (AREA)
Abstract
The application discloses a biomarker combination and application thereof in prediction of ASD (autism spectrum disorder) diseases. The biomarker combination consists of 52 biomarkers, and particularly, the description in the specification of the application is shown, the 52 biomarkers in the application can be used for risk prediction and detection of autism spectrum disorder patients, have the advantages of high sensitivity and high specificity, and provide favorable technical support for predicting autism spectrum disorder diseases, intervention treatment and the like.
Description
Technical Field
The application belongs to the field of biomedical technology and diagnosis, and particularly relates to a biomarker combination and application thereof in prediction of ASD.
Background
Autism spectrum disorder (Autism Spectrum Disorder, ASD) is a mental disorder disease that is broadly developed in infant and young children, and is mainly manifested by social communication disorder, narrow interest or range of motion, repeated plate-carving behavior, and mental behavior disorder such as mental retardation, unstable emotion, sleep disorder, etc. with different degrees. Among this spectrum disorder, childhood autism is one of the most severe of childhood mental disorders. Over the last decade, diagnosis of ASD has become increasingly popular worldwide. China accounts for 22% of the world population, and by 2020, the prevalence of Chinese ASD is 0.7% which is significantly lower than that of U.S. 1.7%, and the reasons for this difference may be genetic factors, environmental factors, diagnostic means, etc.
Currently, the means of diagnosing ASD in common use include the following: manual for diagnosis and statistics of mental disorders (DMS-5), diagnosis and observation of Autism (ADOS), behavioral autism (Autism Behavior Checklist, ABC), rating scale for autism in children (ChildhoodAutism Rating Scale, CARS), table for language and intellectual screening, and other evaluation scales, electroencephalogram, mri, etc. Although diseases can be clinically screened from multiple angles, the main problems of the existing diagnosis means are that the diagnosis scale is high in subjectivity, misdiagnosis is easy to cause, the evaluation time is short, the actual conditions of some patients cannot be reflected, and in addition, most patients cannot find problems through electroencephalogram and magnetic resonance. Therefore, it is important to find a molecular marker for early screening diagnosis of ASD by means of medical tests (e.g. blood tests).
Molecular markers generally refer to a general physiological or pathological or biochemical index characteristic of a therapeutic process that can be objectively determined and evaluated. The blood test can provide the change condition of the expression level of protein molecules in blood, which not only can help us to screen and diagnose the early autism spectrum disorder patients, but also can provide a new solution for researching the generation of the autism spectrum disorder. Timely intervention and timely treatment of autism spectrum disorder can have good effect. Therefore, the application discloses a novel non-invasive early auxiliary diagnosis kit for autism spectrum disorder, which has great social and economic significance and clinical application prospect.
Disclosure of Invention
In order to solve the technical problems, the application provides a biomarker combination and application thereof in predicting autism spectrum disorder diseases.
The present application provides in a first aspect the use of a biomarker combination consisting of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1 and UBC for the manufacture of a product for predicting autism spectrum disorder.
Autism spectrum disorder according to the present application may include both mild to moderate autism spectrum disorder and severe autism spectrum disorder.
In a second aspect the application provides a reagent combination for detecting a biomarker combination consisting of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1 and UBC.
In a preferred embodiment, the combination of reagents is used to detect the expression level of the biomarker combination.
In a preferred embodiment, the combination of reagents comprises a reagent that specifically binds to the biomarker, or comprises a biomolecular reagent that specifically hybridizes to a nucleic acid encoding the biomarker.
In a certain preferred embodiment, the combination of reagents comprises reagents for genomic, transcriptome and/or proteomic sequencing.
In a preferred embodiment, the expression level is a protein expression level and/or an mRNA transcription level, and/or the biomolecular reagent is selected from one or more of a primer, a probe and an antibody.
Preferably, the protein expression level is detected by one or more of mass spectrometry, chips such as protein chips or microfluidic chips, digital single molecule immunoassays, ELISA, radioimmunoassays, immunonephelometry, immunohistochemistry, and Western blotting.
In a third aspect the application provides a biomarker combination consisting of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, RPS 1 and UBC.
In a fourth aspect the application provides a kit comprising a combination of reagents as described in the second aspect of the application and/or a combination of biomarkers as described in the third aspect of the application.
In a fifth aspect, the present application provides a method of constructing a predictive model of autism spectrum disorder disease, the method comprising: inputting protein expression data corresponding to biomarker combinations in the sample into an R language Caret packet containing a logistic regression model for machine learning to obtain an autism spectrum disorder disease prediction model;
the biomarker combinations consist of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1, and UBC.
In a preferred embodiment, the sample is a body fluid exosome.
In a preferred embodiment, the sample is from blood, urine, saliva or cerebrospinal fluid.
Preferably, the blood is serum or plasma.
In a preferred embodiment, the samples are subjected to DIA-mode collection of the protein expression data and peptide fragment matching by Firmiana software prior to machine learning.
In a certain preferred embodiment, the sample is from a patient suffering from a disease comprising autism spectrum disorder and a healthy person.
In a preferred embodiment, FOT of the proteins corresponding to the biomarker combinations are input as protein expression data into the R language Caret package of the logistic regression model for machine learning.
In a preferred embodiment, the protein expression data entered into the logistic regression model satisfy: the expression of the protein corresponding to the biomarker combination in the sample is 1.5 or more than 1.5 of the expression of the corresponding protein of the healthy human, and the t-test p value is less than 0.05.
In a preferred embodiment, the peptide fragment matches utilize the UniProt human protein database.
In a preferred embodiment, the samples are grouped prior to machine learning to obtain a modeling set of samples for autism spectrum disorder disease prediction model construction and a validation set of samples for validating the autism spectrum disorder disease prediction model.
In a preferred embodiment, the protein expression data entered into the logistic regression model is protein abundance greater than or equal to 30%.
In a preferred embodiment, the step of using the validation set sample for validation comprises: calculating the area under the line, the sensitivity and the specificity of a specificity curve of the protein expression data of the biomarker combinations in the sample; and judging the accuracy of the prediction model according to the offline area, sensitivity and specificity.
In a preferred embodiment, the method further comprises determining whether the sample has autism spectrum disorder, determining that the sample is ill when the probability of the sample being autism spectrum disorder is greater than or equal to 0.5, and determining that the sample is normal when the probability of the sample being autism spectrum disorder is greater than or equal to 0.5.
In a preferred embodiment of the application, the protein expression data is obtained by LC-MS technology, collected using DIA detection means.
Preferably, the peptide fragment matching is carried out on the data acquired in the DIA detection mode through Firmiana software. More preferably, the database of peptide segment matches is the UniProt human protein database.
In a preferred embodiment, the FOT of the protein corresponding to the biomarker combination is input as protein expression data into the R language caret package of the logistic regression model for machine learning.
Further preferably, the protein expression data after the Firmiana treatment is used: protein quantification was performed using the unlabeled intensity-based absolute quantification (iBAQ) method, FOT (Fraction of total) was calculated for each protein, defined as the iBAQ (intensity-based absorption-protein-quantification) of that protein divided by the total iBAQ of all identified proteins in the sample, and FOT for each protein was entered as protein expression data into a logistic regression model.
In a sixth aspect the present application provides a predictive model of autism spectrum disorder disease, the predictive model being constructed by a method as described in the fifth aspect of the application.
A seventh aspect of the present application provides a prediction system for autism spectrum disorder diseases, the prediction system comprising an analysis and judgment module, wherein the analysis and judgment module contains a prediction model according to the fifth aspect of the present application, and is used for judging whether the autism spectrum disorder diseases are probability;
wherein the biomarker combination consists of ACADVL, ACTA1, ACTA2, ACTC1, AP2A2, ARHGEF6, BLVRB, CALR, CFHR3, CFHR4, ESD, CLIP1, COPE, BAG2, CSN1S1, DBI, DDR1, EIF5B, FBN1, FTH1, glad 2, GOSR2, H2AC21, H2AX, HRNR, KANK2, MAT2B, MCM2, PPIB, PSME1, QSOX1, MMRN2, RANBP2, SERPINB3, SPARC, SOD1, SUPT16H, SYNM, TUBB2A, TUBB B, TUBB4A, USP14, TXNRD1, and UTRN.
In a preferred embodiment, the prediction system further comprises an output module and/or a detection module, the output module outputs the judgment result of the analysis judgment module, and the detection module detects the protein expression level corresponding to the biomarker combination in the sample to be detected and transmits the expression level data to the analysis judgment module.
In an eighth aspect, the application provides a method of predicting autism spectrum disorder in a sample using a combination of reagents as described in the second aspect of the application, a combination of biomarkers as described in the third aspect of the application, a kit as described in the fourth aspect of the application, a predictive model as described in the sixth aspect of the application or a predictive system as described in the seventh aspect of the application.
A ninth aspect of the application provides the use of a combination of reagents as described in the second aspect of the application, a combination of biomarkers as described in the third aspect of the application, a kit as described in the fourth aspect of the application, a predictive model as described in the sixth aspect of the application or a predictive system as described in the seventh aspect of the application in the prediction of autism spectrum disorder.
In a tenth aspect the application provides the use of a combination of reagents according to the second aspect of the application for the preparation of a kit for the prediction or diagnosis of autism spectrum disorder disease; wherein the biomarker combination consists of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, fm4, MAPK1, RPS27A, SOD1, TPM1, and UBC.
An eleventh aspect of the present application provides a computer readable storage medium storing a computer program which, when executed by a processor, performs the steps of the method according to the eighth aspect of the present application or performs the function of the predictive model according to the sixth aspect of the present application or the predictive system according to the seventh aspect of the present application.
A twelfth aspect of the application provides an electronic device comprising a memory storing a computer program for executing the computer program for carrying out the steps of the method according to the eighth aspect of the application or for carrying out the functions of the predictive model according to the sixth aspect of the application or the predictive system according to the seventh aspect of the application.
Molecular markers generally refer to a general physiological or pathological or biochemical index characteristic of a therapeutic process that can be objectively determined and evaluated. The blood test can provide the change condition of the expression level of protein molecules in blood, which not only can help us to screen and diagnose the early autism spectrum disorder patients, but also can provide a new solution for researching the generation of the autism spectrum disorder. Timely intervention and timely treatment of autism spectrum disorder can have good effect. Therefore, the application discloses a novel non-invasive early auxiliary diagnosis kit for autism spectrum disorder, which has great social and economic significance and clinical application prospect.
The inventor analyzes the plasma biomarker which can be applied to clinical diagnosis of the autism spectrum disorder by researching the variation trend of the protein molecule expression level in the plasma samples of the autism spectrum disorder patients and normal people similar to the conditions of the autism spectrum disorder patients, and provides possibility for early screening diagnosis and intervention treatment of the autism spectrum disorder patients.
On the basis of conforming to the common knowledge in the field, the above preferred conditions can be arbitrarily combined to obtain the preferred examples of the application.
The reagents and materials used in the present application are commercially available.
The application has the positive progress effects that:
the 52 protein biomarkers provided by the application are found through experiments that the expression level in a blood sample of an autism spectrum disorder patient has obvious change, so that the 52 protein biomarkers provided by the application can be used as risk prediction and detection of the autism spectrum disorder patient, have the advantages of high sensitivity and high specificity, and provide favorable technical support for early screening diagnosis, intervention treatment and the like of the autism spectrum disorder patient.
The corresponding auxiliary early diagnosis kit is developed based on the plasma protein molecular marker of the autism spectrum disorder patient, has wide scientific research value and provides great convenience for early clinical diagnosis, intervention treatment and the like.
Drawings
Figure 1 is a ROC curve under training set for 52 protein molecular biomarker combinations in autism spectrum disorder disease group and healthy control group.
Figure 2 is a ROC curve with 52 protein molecule biomarker combinations under internal validation in autism spectrum disorder disease group and healthy control group.
Fig. 3 is a schematic diagram of the architecture of a system for predicting the risk of autism spectrum disorder disease.
Fig. 4 is a schematic structural diagram of an electronic device.
Detailed Description
The application is further illustrated by means of the following examples, which are not intended to limit the scope of the application. The experimental methods, in which specific conditions are not noted in the following examples, were selected according to conventional methods and conditions, or according to the commercial specifications.
Example 1
The autism spectrum disorder disease group and healthy control group samples required in the examples were all from the third affiliated hospital of the university of Zhengzhou, of which 247 autism spectrum disorder disease group and 244 healthy control group. The design and implementation of this study was approved and supervised by the third affiliated hospital medical ethics committee of the university of zheng by ethical voting. Written informed consent was obtained for all patients.
1. Separation of plasma
Collecting whole blood sample, mixing in EDTA anticoagulant tube, centrifuging at 4deg.C for 10min with 1,600Xg, centrifuging, collecting supernatant (blood plasma) in new EP tube, centrifuging at 16,000Xg for 10min to remove cell debris, packaging blood plasma in centrifuge tube, and freezing at-80deg.C for use.
2. Plasma sample pretreatment
To 2. Mu.L of plasma sample was added 100. Mu.L of 50mM ammonium bicarbonate, vortexed and mixed for 1min, the sample was incubated at 95℃for 4min to thermally denature the protein, cooled to room temperature, 2. Mu.g of Trypsin (Trypsin) was added to the system, and the system was shaken for 18h at 37℃and then 10. Mu.L of aqueous ammonia was added to the system to stop the enzymatic hydrolysis. Desalting the peptide sample after enzymolysis, pumping, and freezing at-80 ℃ until mass spectrum detection.
3. Mass spectrometric detection of ASD plasma samples
The peptide sample was detected by a Orbitrap Fusion Lumos three-in-one high resolution mass spectrometry system (Thermo Fisher Scientific, rockford, USA) in tandem with a high performance liquid chromatography system (EASY-nLC 1200,Thermo Fisher) and mass spectrometry data of the whole protein corresponding to the peptide sample was obtained. The specific operation is as follows:
the nano-flow liquid chromatography is adopted, and the chromatographic column is a self-made C18 chromatographic column (150 μm ID×8cm,1.9 μm +.And (3) filling). The temperature of the column temperature box is 60 ℃. The dry powder peptide is re-dissolved by using a loading buffer (0.1% formic acid aqueous solution), separated by a chromatographic column after loading, eluted by 600nL/min of linear 6-30% mobile phase B (ACN and 0.1% formic acid), and a mass spectrum detection means of liquid phase combination Data Independent Acquisition (DIA) is utilized. The DIA mass spectrometry detection parameters were set as follows: the ion mode is positive ions; the resolution of the primary mass spectrum is 30K, the maximum injection time is 20ms, the AGC Target is 3e6, and the scanning range is 300-1400m/z; the secondary scanning resolution is 15K, 30 variable isolation windows are acquired, and the collision energy is 27%. The liquid chromatography tandem mass spectrometry system uses Xcalibur software control for data acquisition.
4. Data analysis
All data were using Firmiana. The Firmiana is a workflow based on Galaxy system, and consists of a plurality of functional modules such as a user login interface, raw data, identification and quantification, data analysis, knowledge mining and the like. DIA data were searched against the UniProt human protein database (updated at 2013.07.04, 32015 entries) using FragPipe (v 12.1) and MSFragger (2.2). The mass difference of the parent ion was 20ppm and the mass difference of the daughter ion was 50mmu. At most two leaky sites are allowed. The search engine sets cysteine carbamoyl methylation as the fixed modification and N-acetylation and oxidation of methionine as the variable modification. The parent ion charge range is set to +2, +3, and +4. The error discovery rate (FDR) was set to 1%.
The identified peptide fragment quantification results are recorded as the average of the peak areas of chromatographic fragment ions in all reference spectra libraries. Protein quantification was performed using the unlabeled intensity-based absolute quantification (iBAQ) method. We calculated the peak area values as part of the corresponding proteins. Total Fraction (FOT) is used to represent normalized abundance of a particular protein in a sample. FOT is defined as the iBAQ of the protein divided by the total iBAQ of all identified proteins in the sample. Proteins with at least one proprietary peptide fragment (unique peptide) and 1% fdr were selected for further analysis.
5. Establishing a predictive model
368 samples (75%) were randomly drawn from all samples as training sets (i.e., modeling sets), and the remaining 123 samples (25%) were used as internal test sets. First, 2631 more broadly most present proteins were screened by Frequency > 30%. And selecting molecules with obvious difference in expression (FOT difference multiple is more than 1.5 times and t-test p value is less than 0.05) between the sample of the autism spectrum disorder disease and the monitoring sample by comparison, wherein 355 proteins are selected as candidate markers.
Based on the regression classifier, and according to the logistic regression analysis, FOT values of the candidate markers are input into a Caret R package (namely, R language Caret package) to establish a prediction model.
The following biomarker combinations were analyzed and screened through the R-package:
ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1, and UBC.
6. Early screening protein biomarkers for autism spectrum disorder disease
Classifier set-up for early screening of protein molecules for autism spectrum disorder disease, including two stages of discovery, testing.
In the examples, 491 blood samples of autistic patients and healthy people are randomly divided into a training set and an internal test set, wherein the training set comprises 368 samples (75%) which are randomly extracted, and the rest 123 samples (25%) are used as the internal test set. And constructing a classifier by adopting a logistic regression (Logistic Regression) algorithm on 368 samples in the training set. Logistic regression employs a 10-fold cross-validation method in estimating error rates, first dividing 368 samples randomly into 10 aliquots. The model was constructed with 9 aliquots of the samples, and the test was performed with the remaining 10% of the samples, and repeated 10 times, and the average value of the ROC curve (Receiver Operating Curve) was calculated for 10 times.
Calculating AUC (Area Under the ROC Curve) for relative expression level ROC curves (Receiver Operating Curve) of 52 protein molecular markers (ALDH 2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1, UBC) in autism spectrum disorder plasma samples of a third affiliated hospital of university, analytical methods see Karimollah Hajian-tiki Receiver Operating Characteristic (ROC 20132, caspian J Intern Med; 4 (2):627-635. And models were built to analyze these markers.
The training set included 368 cases, auc=0.942, diagnostic sensitivity 84.46%, specificity 88.57% (see fig. 1), and the remaining 123 cases were internal validation sets, auc=0.867, diagnostic sensitivity 83.33%, and specificity 72.46% (see fig. 2).
From the above results, it can be seen that the use of 52 protein molecular markers (ALDH 2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, myll 12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1, UBC) in combination in the plasma of patients with autism spectrum disorders can be used for early screening and diagnosis of autism spectrum disorders.
And (3) inputting the FOT of the protein molecular biomarker, which is collected by the DIA, into the obtained prediction model for a sample to be tested, so as to obtain an output result of predicting the autism spectrum disorder disease. Namely, when the probability is greater than or equal to 0.5, judging that the autism spectrum disorder disease; when the probability is less than 0.5, the judgment is normal.
Example 2 System for predicting autism spectrum disorder disease
System 61 for predicting autism spectrum disorder disease: the data processing module 52 and the judging and outputting module 53 further include a data collecting module 51 (fig. 3).
The data collection module 51 is used to collect the expression level data of the biomarker combinations in the patient body fluid exosome sample and transmit them to the data processing module.
The data processing module 52 is configured to analyze the expression level data of the received or input biomarker combinations according to the data analysis method described in example 1 to obtain a calculation result. Wherein the expression level data of the biomarker combinations can be collected by the data collection module 51, and the expression level data of the biomarker combinations can also be obtained from other sources.
The judging and outputting module 53 is configured to judge whether the calculated result meets a preset judging condition, that is, the probability of suffering from the autism spectrum disorder disease is greater than or equal to the probability of not suffering from the autism spectrum disorder disease, so as to predict the risk of suffering from the autism spectrum disorder disease, and output a prediction result; wherein, in the judging and outputting module, when the expression level data satisfies that the probability of suffering from the autism spectrum disorder disease is greater than or equal to the probability of not suffering from the autism spectrum disorder disease, outputting a prediction result of "risk of suffering from the autism spectrum disorder disease"; when the expression level data does not meet the judgment condition, namely the probability of suffering from the autism spectrum disorder disease is smaller than the probability of not suffering from the autism spectrum disorder disease, outputting a prediction result as 'no risk of suffering from the autism spectrum disorder disease'.
Example 3 electronic device
The present embodiment provides an electronic device, which may be expressed in the form of a computing device (e.g., may be a server device), including a memory, a processor, and a computer program stored on the memory and executable on the processor, where the processor may implement the method for predicting autism spectrum disorder disease in embodiment 1 of the present application when executing the computer program.
Fig. 4 shows a schematic diagram of the hardware structure of the present embodiment, and the electronic device 4 specifically includes:
at least one processor 91, at least one memory 92, and a bus 93 for connecting the different system components (including the processor 91 and the memory 92), wherein:
the bus 93 includes a data bus, an address bus, and a control bus.
The memory 92 includes volatile memory such as Random Access Memory (RAM) 921 and/or cache memory 922, and may further include Read Only Memory (ROM) 923.
Memory 92 also includes a program/utility 925 having a set (at least one) of program modules 924, such program modules 924 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each or some combination of which may include an implementation of a network environment.
The processor 91 executes various functional applications and data processing, such as the data analysis method of embodiment 1 of the present application, by running a computer program stored in the memory 92.
The electronic device 9 may further communicate with one or more external devices 94 (e.g., keyboard, pointing device, etc.). Such communication may occur through an input/output (I/O) interface 95. Also, the electronic device 9 may communicate with one or more networks such as a Local Area Network (LAN), a Wide Area Network (WAN) and/or a public network, such as the Internet, through a network adapter 96. The network adapter 96 communicates with other modules of the electronic device 9 via the bus 93. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in connection with the electronic device 9, including but not limited to: microcode, device drivers, redundant processors, external disk drive arrays, RAID (disk array) systems, tape drives, data backup storage systems, and the like.
It should be noted that although several units/modules or sub-units/modules of an electronic device are mentioned in the above detailed description, such a division is merely exemplary and not mandatory. Indeed, the features and functionality of two or more units/modules described above may be embodied in one unit/module in accordance with embodiments of the present application. Conversely, the features and functions of one unit/module described above may be further divided into ones that are embodied by a plurality of units/modules.
Embodiment 4 computer-readable storage Medium
An embodiment of the present application provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the steps of the method of predicting autism spectrum disorder in embodiment 1 of the present application.
More specifically, among others, readable storage media may be employed including, but not limited to: portable disk, hard disk, random access memory, read only memory, erasable programmable read only memory, optical storage device, magnetic storage device, or any suitable combination of the foregoing.
In a possible embodiment, the application may also be implemented in the form of a program product comprising program code for causing a terminal device to carry out the steps of carrying out the method of predicting autism spectrum disorder in embodiment 1 of the application, when said program product is run on the terminal device.
Wherein the program code for carrying out the application may be written in any combination of one or more programming languages, which program code may execute entirely on the user device, partly on the user device, as a stand-alone software package, partly on the user device and partly on the remote device or entirely on the remote device.
Finally, the above embodiments are only for illustrating the technical solution of the present application, and are not limiting.
Biomarker holly (refer to genegards database)
ALDH2:Aldehyde Dehydrogenase 2 Family Member
ARF5:ADP Ribosylation Factor 5
ATP6AP1:ATPase H+Transporting Accessory Protein 1
CD14:CD14 Molecule
CDC37:Cell Division Cycle 37
COL1A2:Collagen Type I Alpha 2 Chain
CYBB:Cytochrome B-245 Beta Chain
DST:Dystonin
F7:Coagulation Factor VII
FAH:Fumarylacetoacetate Hydrolase
FGB:Fibrinogen Beta Chain
GLOD4:Glyoxalase Domain Containing 4
GNPDA2:Glucosamine-6-Phosphate Deaminase 2
HBE1:Hemoglobin Subunit Epsilon 1
HMGCL:3-Hydroxy-3-Methylglutaryl-CoA Lyase
HSPA1L:Heat Shock Protein Family A(Hsp70)Member 1 Like
IGHD:Immunoglobulin Heavy Constant Delta
IMMT:Inner Membrane Mitochondrial Protein
ITGA6:Integrin Subunit Alpha 6
MCAM:Melanoma Cell Adhesion Molecule
MRC1:Mannose Receptor C-Type 1
MSL1:MSL Complex Subunit 1
MYL12B:Myosin Light Chain 12B
NUCB1:Nucleobindin 1
OGDH:Oxoglutarate Dehydrogenase
PF4:Platelet Factor 4
PF4V1:Platelet Factor 4 Variant 1
PFN1:Profilin 1
POTEJ:POTE Ankyrin Domain Family Member J
PROCR:Protein C Receptor
PSMA3:Proteasome 20S Subunit Alpha 3
PTPRG:Protein Tyrosine Phosphatase Receptor Type G
RPS3:Ribosomal Protein S3
SCP2:Sterol Carrier Protein 2
TPI1:Triosephosphate Isomerase 1
VASN:Vasorin
ANXA5:Annexin A5
ASAH1:N-Acylsphingosine Amidohydrolase 1
CALD1:Caldesmon 1
CSE1L:Chromosome Segregation 1 Like
EXOC1:Exocyst Complex Component 1
FTH1:Ferritin Heavy Chain 1
GNPDA1:Glucosamine-6-Phosphate Deaminase 1
HSPA8:Heat Shock Protein Family A(Hsp70)Member 8
ITGB1:Integrin Subunit Beta 1
MTA2:Metastasis Associated 1 Family Member 2
OLFM4:Olfactomedin 4
MAPK1:Mitogen-Activated Protein Kinase 1RPS27A:Ribosomal Protein S27a SOD1:Superoxide Dismutase 1TPM1:Tropomyosin 1
UBC:Ubiquitin C。
Claims (16)
1. Use of a biomarker combination consisting of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1 and UBC for the preparation of a product for predicting autism spectrum disorder disease.
2. A reagent combination for detecting a biomarker combination, characterized in that the biomarker combination consists of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F7, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, myll 12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1 and UBC.
3. The combination of reagents according to claim 2, wherein the combination of reagents is used to detect the expression level of the combination of biomarkers,
and/or, the combination of reagents comprises reagents for genomic, transcriptome and/or proteomic sequencing,
and/or the combination of reagents comprises a biomolecular reagent that specifically binds to the biomarker, or specifically hybridizes to a nucleic acid encoding the biomarker.
4. The combination of reagents according to claim 3, wherein the expression level is protein expression level and/or mRNA transcription level and/or the biomolecular reagent is selected from one or more of a primer, a probe and an antibody.
5. A biomarker combination, characterized in that the biomarker combination consists of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC, FTH1, gn 1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1 and UBC.
6. A kit comprising a combination of reagents according to any one of claims 2 to 4 and/or a biomarker combination according to claim 5.
7. A method of constructing a predictive model of autism spectrum disorder disease, the method comprising: inputting protein expression data corresponding to biomarker combinations in the sample into an R language Caret packet containing a logistic regression model for machine learning to obtain an autism spectrum disorder disease prediction model;
the biomarker combinations consist of ALDH2, ARF5, ATP6AP1, CD14, CDC37, COL1A2, CYBB, DST, F, FAH, FGB, GLOD4, GNPDA2, HBE1, HMGCL, HSPA1L, IGHD, IMMT, ITGA6, MCAM, MRC1, MSL1, MYL12B, NUCB1, OGDH, PF4V1, PFN1, POTEJ, PROCR, PSMA3, PTPRG, RPS3, SCP2, TPI1, VASN, ANXA5, ASAH1, CALD1, CSE1L, EXOC1, FTH1, GNPDA1, HSPA8, ITGB1, MTA2, OLFM4, MAPK1, RPS27A, SOD1, TPM1, and UBC.
8. The method of claim 7, wherein the sample is from blood, urine, saliva, or cerebrospinal fluid;
and/or, before machine learning, the sample acquires the protein expression data in a DIA mode and carries out peptide segment matching through Firmiana software;
and/or, the sample is from a patient and healthy person comprising autism spectrum disorder disease;
and/or, inputting the FOT of the protein corresponding to the biomarker combination as protein expression data into an R language Caret package of a logistic regression model for machine learning;
and/or grouping the samples before machine learning to obtain a modeling group sample and a verification group sample, wherein the modeling group sample is used for constructing the autism spectrum disorder disease prediction model, and the verification group sample is used for verifying the autism spectrum disorder disease prediction model.
9. The method of claim 8, wherein the peptide fragment matching utilizes a UniProt human protein database;
and/or, the protein expression data input into the logistic regression model satisfies: the expression of the protein corresponding to the biomarker combination of the patient in the sample is 1.5 or more than 1.5 of the expression of the corresponding protein of the healthy human, and the t-test p value is less than 0.05;
and/or inputting protein expression data of the logistic regression model to the protein abundance of more than or equal to 30%;
and/or the step of employing the validation set sample for validation comprises: calculating the area under the line, the sensitivity and the specificity of a specificity curve of the protein expression data of the biomarker combinations in the sample; judging the accuracy of a prediction model according to the offline area, sensitivity and specificity;
and/or, the method further comprises determining whether the sample has autism spectrum disorder,
and judging that the disease is diseased when the probability of the predicted sample for the autism spectrum disorder disease is greater than or equal to 0.5, and judging that the disease is normal when the probability of the predicted sample for the autism spectrum disorder is not greater than or equal to 0.5.
10. A predictive model of an autism spectrum disorder disease, wherein the predictive model is constructed by a method according to any one of claims 7 to 9.
11. A prediction system for autism spectrum disorder diseases is characterized in that the prediction system comprises an analysis and judgment module,
the analysis and judgment module comprises the prediction model as claimed in claim 10, and outputs a prediction result of whether the sample suffers from autism spectrum disorder.
12. The prediction system of claim 11, wherein the prediction system further comprises an output module and/or a detection module; the output module outputs the judging result of the analysis judging module, and the detection module detects the protein expression level corresponding to the biomarker combination in the sample to be detected and transmits the expression level data to the analysis judging module.
13. Use of the predictive model according to claim 10 or the predictive system according to claim 11 or 12 for the prediction of autism spectrum disorder diseases.
14. Use of a combination of agents according to any one of claims 2 to 4 for the preparation of a kit for the prediction or diagnosis of diseases of autism spectrum disorder.
15. A computer readable storage medium storing a computer program, which, when executed by a processor, performs the function of the predictive model of claim 10 or the predictive system of claim 11 or 12.
16. An electronic device comprising a memory storing a computer program and a processor, wherein the processor is configured to execute the computer program to implement the functionality of the predictive model of claim 10 or the predictive system of claim 11 or 12.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310725801.3A CN116732164A (en) | 2023-06-16 | 2023-06-16 | Biomarker combinations and their use in predicting ASD disease |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310725801.3A CN116732164A (en) | 2023-06-16 | 2023-06-16 | Biomarker combinations and their use in predicting ASD disease |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116732164A true CN116732164A (en) | 2023-09-12 |
Family
ID=87907774
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310725801.3A Pending CN116732164A (en) | 2023-06-16 | 2023-06-16 | Biomarker combinations and their use in predicting ASD disease |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116732164A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117025758A (en) * | 2023-09-19 | 2023-11-10 | 华北理工大学 | Application of circular RNA in preparation of biomarker for diagnosing autism |
CN117070635A (en) * | 2023-10-12 | 2023-11-17 | 上海爱谱蒂康生物科技有限公司 | Application of biomarker combination in preparation of kit for predicting transparent renal cell carcinoma |
-
2023
- 2023-06-16 CN CN202310725801.3A patent/CN116732164A/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117025758A (en) * | 2023-09-19 | 2023-11-10 | 华北理工大学 | Application of circular RNA in preparation of biomarker for diagnosing autism |
CN117025758B (en) * | 2023-09-19 | 2024-04-05 | 华北理工大学 | Application of circular RNA in preparation of biomarker for diagnosing autism |
CN117070635A (en) * | 2023-10-12 | 2023-11-17 | 上海爱谱蒂康生物科技有限公司 | Application of biomarker combination in preparation of kit for predicting transparent renal cell carcinoma |
CN117070635B (en) * | 2023-10-12 | 2024-01-26 | 上海爱谱蒂康生物科技有限公司 | Application of biomarker combination in preparation of kit for predicting transparent renal cell carcinoma |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Hawkridge et al. | Mass spectrometry–based biomarker discovery: toward a global proteome index of individuality | |
Jain et al. | The handbook of biomarkers | |
Lopez et al. | High-resolution serum proteomic profiling of Alzheimer disease samples reveals disease-specific, carrier-protein–bound mass signatures | |
Swan et al. | Application of machine learning to proteomics data: classification and biomarker identification in postgenomics biology | |
CN116732164A (en) | Biomarker combinations and their use in predicting ASD disease | |
CN111479934A (en) | Identification and use of biological indicators for diagnostic and therapy monitoring | |
US9103837B2 (en) | Chronic obstructive pulmonary disease (COPD) biomarkers and uses thereof | |
WO2005017646A2 (en) | System, software and methods for biomarker identification | |
US20080086272A1 (en) | Identification and use of biomarkers for the diagnosis and the prognosis of inflammatory diseases | |
CN104969071A (en) | Method for evaluation of presence of or risk of colon tumors | |
CA2681010A1 (en) | Apolipoprotein fingerprinting technique | |
Yu et al. | Preliminary proteomic-based identification of a novel protein for Down's syndrome in maternal serum | |
Christians et al. | The role of proteomics in the study of kidney diseases and in the development of diagnostic tools | |
CN113791219B (en) | Biomarker for acute ischemic stroke recurrence risk analysis and application thereof | |
Chao et al. | Towards proteome standards: the use of absolute quantitation in high-throughput biomarker discovery | |
CN116769900A (en) | Biomarker combinations and their use in predicting ASD disease progression | |
US20160018413A1 (en) | Methods of Prognosing Preeclampsia | |
Richens et al. | Systems biology coupled with label-free high-throughput detection as a novel approach for diagnosis of chronic obstructive pulmonary disease | |
WO2016123058A1 (en) | Biomarkers for detection of tuberculosis risk | |
CN113358881B (en) | Biomarker for NMOSD prediction or recurrence monitoring and application thereof | |
US20180356419A1 (en) | Biomarkers for detection of tuberculosis risk | |
CN103488913A (en) | A computational method for mapping peptides to proteins using sequencing data | |
CN117051102B (en) | Application of biomarker combination in preparation of products for predicting parkinsonism | |
CN117070635B (en) | Application of biomarker combination in preparation of kit for predicting transparent renal cell carcinoma | |
Liu et al. | Using SELDI‐TOF MS to identify serum biomarkers of rheumatoid arthritis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |